Sample records for pooled-template sequencing implications

  1. Insights into mutagenesis using Escherichia coli chromosomal lacZ strains that enable detection of a wide spectrum of mutational events.

    PubMed

    Seier, Tracey; Padgett, Dana R; Zilberberg, Gal; Sutera, Vincent A; Toha, Noor; Lovett, Susan T

    2011-06-01

    Strand misalignments at DNA repeats during replication are implicated in mutational hotspots. To study these events, we have generated strains carrying mutations in the Escherichia coli chromosomal lacZ gene that revert via deletion of a short duplicated sequence or by template switching within imperfect inverted repeat (quasipalindrome, QP) sequences. Using these strains, we demonstrate that mutation of the distal repeat of a quasipalindrome, with respect to replication fork movement, is about 10-fold higher than the proximal repeat, consistent with more common template switching on the leading strand. The leading strand bias was lost in the absence of exonucleases I and VII, suggesting that it results from more efficient suppression of template switching by 3' exonucleases targeted to the lagging strand. The loss of 3' exonucleases has no effect on strand misalignment at direct repeats to produce deletion. To compare these events to other mutations, we have reengineered reporters (designed by Cupples and Miller 1989) that detect specific base substitutions or frameshifts in lacZ with the reverting lacZ locus on the chromosome rather than an F' element. This set allows rapid screening of potential mutagens, environmental conditions, or genetic loci for effects on a broad set of mutational events. We found that hydroxyurea (HU), which depletes dNTP pools, slightly elevated templated mutations at inverted repeats but had no effect on deletions, simple frameshifts, or base substitutions. Mutations in nucleotide diphosphate kinase, ndk, significantly elevated simple mutations but had little effect on the templated class. Zebularine, a cytosine analog, elevated all classes.

  2. Recombination of polynucleotide sequences using random or defined primers

    DOEpatents

    Arnold, Frances H.; Shao, Zhixin; Affholter, Joseph A.; Zhao, Huimin H; Giver, Lorraine J.

    2000-01-01

    A method for in vitro mutagenesis and recombination of polynucleotide sequences based on polymerase-catalyzed extension of primer oligonucleotides is disclosed. The method involves priming template polynucleotide(s) with random-sequences or defined-sequence primers to generate a pool of short DNA fragments with a low level of point mutations. The DNA fragments are subjected to denaturization followed by annealing and further enzyme-catalyzed DNA polymerization. This procedure is repeated a sufficient number of times to produce full-length genes which comprise mutants of the original template polynucleotides. These genes can be further amplified by the polymerase chain reaction and cloned into a vector for expression of the encoded proteins.

  3. Recombination of polynucleotide sequences using random or defined primers

    DOEpatents

    Arnold, Frances H.; Shao, Zhixin; Affholter, Joseph A.; Zhao, Huimin; Giver, Lorraine J.

    2001-01-01

    A method for in vitro mutagenesis and recombination of polynucleotide sequences based on polymerase-catalyzed extension of primer oligonucleotides is disclosed. The method involves priming template polynucleotide(s) with random-sequences or defined-sequence primers to generate a pool of short DNA fragments with a low level of point mutations. The DNA fragments are subjected to denaturization followed by annealing and further enzyme-catalyzed DNA polymerization. This procedure is repeated a sufficient number of times to produce full-length genes which comprise mutants of the original template polynucleotides. These genes can be further amplified by the polymerase chain reaction and cloned into a vector for expression of the encoded proteins.

  4. Deconstructing the Polymerase Chain Reaction: Understanding and Correcting Bias Associated with Primer Degeneracies and Primer-Template Mismatches

    PubMed Central

    Green, Stefan J.; Venkatramanan, Raghavee; Naqib, Ankur

    2015-01-01

    The polymerase chain reaction (PCR) is sensitive to mismatches between primer and template, and mismatches can lead to inefficient amplification of targeted regions of DNA template. In PCRs in which a degenerate primer pool is employed, each primer can behave differently. Therefore, inefficiencies due to different primer melting temperatures within a degenerate primer pool, in addition to mismatches between primer binding sites and primers, can lead to a distortion of the true relative abundance of targets in the original DNA pool. A theoretical analysis indicated that a combination of primer-template and primer-amplicon interactions during PCR cycles 3–12 is potentially responsible for this distortion. To test this hypothesis, we developed a novel amplification strategy, entitled “Polymerase-exonuclease (PEX) PCR”, in which primer-template interactions and primer-amplicon interactions are separated. The PEX PCR method substantially and significantly improved the evenness of recovery of sequences from a mock community of known composition, and allowed for amplification of templates with introduced mismatches near the 3’ end of the primer annealing sites. When the PEX PCR method was applied to genomic DNA extracted from complex environmental samples, a significant shift in the observed microbial community was detected. Furthermore, the PEX PCR method provides a mechanism to identify which primers in a primer pool are annealing to target gDNA. Primer utilization patterns revealed that at high annealing temperatures in the PEX PCR method, perfect match annealing predominates, while at lower annealing temperatures, primers with up to four mismatches with templates can contribute substantially to amplification. The PEX PCR method is simple to perform, is limited to PCR mixes and a single exonuclease step which can be performed without reaction cleanup, and is recommended for reactions in which degenerate primer pools are used or when mismatches between primers and template are possible. PMID:25996930

  5. Sources of PCR-induced distortions in high-throughput sequencing data sets

    PubMed Central

    Kebschull, Justus M.; Zador, Anthony M.

    2015-01-01

    PCR permits the exponential and sequence-specific amplification of DNA, even from minute starting quantities. PCR is a fundamental step in preparing DNA samples for high-throughput sequencing. However, there are errors associated with PCR-mediated amplification. Here we examine the effects of four important sources of error—bias, stochasticity, template switches and polymerase errors—on sequence representation in low-input next-generation sequencing libraries. We designed a pool of diverse PCR amplicons with a defined structure, and then used Illumina sequencing to search for signatures of each process. We further developed quantitative models for each process, and compared predictions of these models to our experimental data. We find that PCR stochasticity is the major force skewing sequence representation after amplification of a pool of unique DNA amplicons. Polymerase errors become very common in later cycles of PCR but have little impact on the overall sequence distribution as they are confined to small copy numbers. PCR template switches are rare and confined to low copy numbers. Our results provide a theoretical basis for removing distortions from high-throughput sequencing data. In addition, our findings on PCR stochasticity will have particular relevance to quantification of results from single cell sequencing, in which sequences are represented by only one or a few molecules. PMID:26187991

  6. ECB deacylase mutants

    DOEpatents

    Arnold, Frances H.; Shao, Zhixin; Zhao, Huimin; Giver, Lorraine J.

    2002-01-01

    A method for in vitro mutagenesis and recombination of polynucleotide sequences based on polymerase-catalyzed extension of primer oligonucleotides is disclosed. The method involves priming template polynucleotide(s) with random-sequences or defined-sequence primers to generate a pool of short DNA fragments with a low level of point mutations. The DNA fragments are subjected to denaturization followed by annealing and further enzyme-catalyzed DNA polymerization. This procedure is repeated a sufficient number of times to produce full-length genes which comprise mutants of the original template polynucleotides. These genes can be further amplified by the polymerase chain reaction and cloned into a vector for expression of the encoded proteins.

  7. Single-cell template strand sequencing by Strand-seq enables the characterization of individual homologs.

    PubMed

    Sanders, Ashley D; Falconer, Ester; Hills, Mark; Spierings, Diana C J; Lansdorp, Peter M

    2017-06-01

    The ability to distinguish between genome sequences of homologous chromosomes in single cells is important for studies of copy-neutral genomic rearrangements (such as inversions and translocations), building chromosome-length haplotypes, refining genome assemblies, mapping sister chromatid exchange events and exploring cellular heterogeneity. Strand-seq is a single-cell sequencing technology that resolves the individual homologs within a cell by restricting sequence analysis to the DNA template strands used during DNA replication. This protocol, which takes up to 4 d to complete, relies on the directionality of DNA, in which each single strand of a DNA molecule is distinguished based on its 5'-3' orientation. Culturing cells in a thymidine analog for one round of cell division labels nascent DNA strands, allowing for their selective removal during genomic library construction. To preserve directionality of template strands, genomic preamplification is bypassed and labeled nascent strands are nicked and not amplified during library preparation. Each single-cell library is multiplexed for pooling and sequencing, and the resulting sequence data are aligned, mapping to either the minus or plus strand of the reference genome, to assign template strand states for each chromosome in the cell. The major adaptations to conventional single-cell sequencing protocols include harvesting of daughter cells after a single round of BrdU incorporation, bypassing of whole-genome amplification, and removal of the BrdU + strand during Strand-seq library preparation. By sequencing just template strands, the structure and identity of each homolog are preserved.

  8. Emergence of Distinct Brome Mosaic Virus Recombinants Is Determined by the Polarity of the Inoculum RNA

    PubMed Central

    Kwon, Sun-Jung

    2012-01-01

    Despite overwhelming interest in the impact exerted by recombination during evolution of RNA viruses, the relative contribution of the polarity of inoculum templates remains poorly understood. Here, by agroinfiltrating Nicotiana benthamiana leaves, we show that brome mosaic virus (BMV) replicase is competent to initiate positive-strand [(+)-strand] synthesis on an ectopically expressed RNA3 negative strand [(−) strand] and faithfully complete the replication cycle. Consequently, we sought to examine the role of RNA polarity in BMV recombination by expressing a series of replication-defective mutants of BMV RNA3 in (+) or (−) polarity. Temporal analysis of progeny sequences revealed that the genetic makeup of the primary recombinant pool is determined by the polarity of the inoculum template. When the polarity of the inoculum template was (+), the recombinant pool that accumulated during early phases of replication was a mixture of nonhomologous recombinants. These are longer than the inoculum template length, and a nascent 3′ untranslated region (UTR) of wild-type (WT) RNA1 or RNA2 was added to the input mutant RNA3 3′ UTR due to end-to-end template switching by BMV replicase during (−)-strand synthesis. In contrast, when the polarity of the inoculum was (−), the progeny contained a pool of native-length homologous recombinants generated by template switching of BMV replicase with a nascent UTR from WT RNA1 or RNA2 during (+)-strand synthesis. Repair of a point mutation caused by polymerase error occurred only when the polarity of the inoculum template was (+). These results contribute to the explanation of the functional role of RNA polarity in recombination mediated by copy choice mechanisms. PMID:22357282

  9. Transcription blockage by homopurine DNA sequences: role of sequence composition and single-strand breaks

    PubMed Central

    Belotserkovskii, Boris P.; Neil, Alexander J.; Saleh, Syed Shayon; Shin, Jane Hae Soo; Mirkin, Sergei M.; Hanawalt, Philip C.

    2013-01-01

    The ability of DNA to adopt non-canonical structures can affect transcription and has broad implications for genome functioning. We have recently reported that guanine-rich (G-rich) homopurine-homopyrimidine sequences cause significant blockage of transcription in vitro in a strictly orientation-dependent manner: when the G-rich strand serves as the non-template strand [Belotserkovskii et al. (2010) Mechanisms and implications of transcription blockage by guanine-rich DNA sequences., Proc. Natl Acad. Sci. USA, 107, 12816–12821]. We have now systematically studied the effect of the sequence composition and single-stranded breaks on this blockage. Although substitution of guanine by any other base reduced the blockage, cytosine and thymine reduced the blockage more significantly than adenine substitutions, affirming the importance of both G-richness and the homopurine-homopyrimidine character of the sequence for this effect. A single-strand break in the non-template strand adjacent to the G-rich stretch dramatically increased the blockage. Breaks in the non-template strand result in much weaker blockage signals extending downstream from the break even in the absence of the G-rich stretch. Our combined data support the notion that transcription blockage at homopurine-homopyrimidine sequences is caused by R-loop formation. PMID:23275544

  10. Sequencing of a QTL-rich region of the Theobroma cacao genome using pooled BACs and the identification of trait specific candidate genes

    PubMed Central

    2011-01-01

    Background BAC-based physical maps provide for sequencing across an entire genome or a selected sub-genomic region of biological interest. Such a region can be approached with next-generation whole-genome sequencing and assembly as if it were an independent small genome. Using the minimum tiling path as a guide, specific BAC clones representing the prioritized genomic interval are selected, pooled, and used to prepare a sequencing library. Results This pooled BAC approach was taken to sequence and assemble a QTL-rich region, of ~3 Mbp and represented by twenty-seven BACs, on linkage group 5 of the Theobroma cacao cv. Matina 1-6 genome. Using various mixtures of read coverages from paired-end and linear 454 libraries, multiple assemblies of varied quality were generated. Quality was assessed by comparing the assembly of 454 reads with a subset of ten BACs individually sequenced and assembled using Sanger reads. A mixture of reads optimal for assembly was identified. We found, furthermore, that a quality assembly suitable for serving as a reference genome template could be obtained even with a reduced depth of sequencing coverage. Annotation of the resulting assembly revealed several genes potentially responsible for three T. cacao traits: black pod disease resistance, bean shape index, and pod weight. Conclusions Our results, as with other pooled BAC sequencing reports, suggest that pooling portions of a minimum tiling path derived from a BAC-based physical map is an effective method to target sub-genomic regions for sequencing. While we focused on a single QTL region, other QTL regions of importance could be similarly sequenced allowing for biological discovery to take place before a high quality whole-genome assembly is completed. PMID:21794110

  11. Sequencing of a QTL-rich region of the Theobroma cacao genome using pooled BACs and the identification of trait specific candidate genes.

    PubMed

    Feltus, Frank A; Saski, Christopher A; Mockaitis, Keithanne; Haiminen, Niina; Parida, Laxmi; Smith, Zachary; Ford, James; Staton, Margaret E; Ficklin, Stephen P; Blackmon, Barbara P; Cheng, Chun-Huai; Schnell, Raymond J; Kuhn, David N; Motamayor, Juan-Carlos

    2011-07-27

    BAC-based physical maps provide for sequencing across an entire genome or a selected sub-genomic region of biological interest. Such a region can be approached with next-generation whole-genome sequencing and assembly as if it were an independent small genome. Using the minimum tiling path as a guide, specific BAC clones representing the prioritized genomic interval are selected, pooled, and used to prepare a sequencing library. This pooled BAC approach was taken to sequence and assemble a QTL-rich region, of ~3 Mbp and represented by twenty-seven BACs, on linkage group 5 of the Theobroma cacao cv. Matina 1-6 genome. Using various mixtures of read coverages from paired-end and linear 454 libraries, multiple assemblies of varied quality were generated. Quality was assessed by comparing the assembly of 454 reads with a subset of ten BACs individually sequenced and assembled using Sanger reads. A mixture of reads optimal for assembly was identified. We found, furthermore, that a quality assembly suitable for serving as a reference genome template could be obtained even with a reduced depth of sequencing coverage. Annotation of the resulting assembly revealed several genes potentially responsible for three T. cacao traits: black pod disease resistance, bean shape index, and pod weight. Our results, as with other pooled BAC sequencing reports, suggest that pooling portions of a minimum tiling path derived from a BAC-based physical map is an effective method to target sub-genomic regions for sequencing. While we focused on a single QTL region, other QTL regions of importance could be similarly sequenced allowing for biological discovery to take place before a high quality whole-genome assembly is completed.

  12. Sequence selection by dynamical symmetry breaking in an autocatalytic binary polymer model

    NASA Astrophysics Data System (ADS)

    Fellermann, Harold; Tanaka, Shinpei; Rasmussen, Steen

    2017-12-01

    Template-directed replication of nucleic acids is at the essence of all living beings and a major milestone for any origin of life scenario. We present an idealized model of prebiotic sequence replication, where binary polymers act as templates for their autocatalytic replication, thereby serving as each others reactants and products in an intertwined molecular ecology. Our model demonstrates how autocatalysis alters the qualitative and quantitative system dynamics in counterintuitive ways. Most notably, numerical simulations reveal a very strong intrinsic selection mechanism that favors the appearance of a few population structures with highly ordered and repetitive sequence patterns when starting from a pool of monomers. We demonstrate both analytically and through simulation how this "selection of the dullest" is caused by continued symmetry breaking through random fluctuations in the transient dynamics that are amplified by autocatalysis and eventually propagate to the population level. The impact of these observations on related prebiotic mathematical models is discussed.

  13. Eighty routes to a ribonucleotide world; dispersion and stringency in the decisive selection.

    PubMed

    Yarus, Michael

    2018-05-21

    We examine the initial emergence of genetics; that is, of an inherited chemical capability. The crucial actors are ribonucleotides, occasionally meeting in a prebiotic landscape. Previous work identified six influential variables during such random ribonucleotide pooling. Geochemical pools can be in periodic danger (e.g., from tides) or constant danger (e.g., from unfavorable weather). Such pools receive Gaussian nucleotide amounts sporadically, at random times, or get varying substrates simultaneously. Pools use cross-templated RNA synthesis (5'-5' product from 5'-3' template) or para-templated (5'-5' product from 5'-5' template) synthesis. Pools can undergo mild or strong selection, and be recently initiated (early) or late in age. Considering > 80 combinations of these variables, selection calculations identify a superior route. Most likely, an early, sporadically fed, cross-templating pool in constant danger, receiving ≥ 1 mM nucleotides while under strong selection for a coenzyme-like product will host selection of the first encoded biochemical functions. Predominantly templated products emerge from a critical event, the starting bloc selection, which exploits inevitable differences among early pools. Favorable selection has a simple rationale; it is increased by product dispersion (sd/mean), by selection intensity (mild or strong), or by combining these factors as stringency, reciprocal fraction of pools selected (1/sfsel). To summarize: chance utility, acting via a preference for disperse, templated coenzyme-like dinucleotides, uses stringent starting bloc selection to quickly establish majority encoded/genetic expression. Despite its computational origin, starting bloc selection is largely independent of specialized assumptions. This ribodinucleotide route to inheritance may also have facilitated 5'-3' chemical RNA replication. Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  14. The steric gate of DNA polymerase ι regulates ribonucleotide incorporation and deoxyribonucleotide fidelity.

    PubMed

    Donigan, Katherine A; McLenigan, Mary P; Yang, Wei; Goodman, Myron F; Woodgate, Roger

    2014-03-28

    Accurate DNA synthesis in vivo depends on the ability of DNA polymerases to select dNTPs from a nucleotide pool dominated by NTPs. High fidelity replicative polymerases have evolved to efficiently exclude NTPs while copying long stretches of undamaged DNA. However, to bypass DNA damage, cells utilize specialized low fidelity polymerases to perform translesion DNA synthesis (TLS). Of interest is human DNA polymerase ι (pol ι), which has been implicated in TLS of oxidative and UV-induced lesions. Here, we evaluate the ability of pol ι to incorporate NTPs during DNA synthesis. pol ι incorporates and extends NTPs opposite damaged and undamaged template bases in a template-specific manner. The Y39A "steric gate" pol ι mutant is considerably more active in the presence of Mn(2+) compared with Mg(2+) and exhibits a marked increase in NTP incorporation and extension, and surprisingly, it also exhibits increased dNTP base selectivity. Our results indicate that a single residue in pol ι is able to discriminate between NTPs and dNTPs during DNA synthesis. Because wild-type pol ι incorporates NTPs in a template-specific manner, certain DNA sequences may be "at risk" for elevated mutagenesis during pol ι-dependent TLS. Molecular modeling indicates that the constricted active site of wild-type pol ι becomes more spacious in the Y39A variant. Therefore, the Y39A substitution not only permits incorporation of ribonucleotides but also causes the enzyme to favor faithful Watson-Crick base pairing over mutagenic configurations.

  15. Microbeads display of proteins using emulsion PCR and cell-free protein synthesis.

    PubMed

    Gan, Rui; Yamanaka, Yumiko; Kojima, Takaaki; Nakano, Hideo

    2008-01-01

    We developed a method for coupling protein to its coding DNA on magnetic microbeads using emulsion PCR and cell-free protein synthesis in emulsion. A PCR mixture containing streptavidin-coated microbeads was compartmentalized by water-in-oil (w/o) emulsion with estimated 0.5 template molecules per droplet. The template molecules were amplified and immobilized on beads via bead-linked reverse primers and biotinylated forward primers. After amplification, the templates were sequentially labeled with streptavidin and biotinylated anti-glutathione S-transferase (GST) antibody. The pool of beads was then subjected to cell-free protein synthesis compartmentalized in another w/o emulsion, in which templates were coupled to their coding proteins. We mixed two types of DNA templates of Histidine6 tag (His6)-fused and FLAG tag-fused GST in a ratio of 1:1,000 (His6: FLAG) for use as a model DNA library. After incubation with fluorescein isothiocyanate (FITC)-labeled anti-His6 (C-term) antibody, the beads with the His6 gene were enriched 917-fold in a single-round screening by using flow cytometry. A library with a theoretical diversity of 10(6) was constructed by randomizing the middle four residues of the His6 tag. After a two-round screening, the randomized sequences were substantially converged to peptide-encoding sequences recognized by the anti-His6 antibody.

  16. Rolling Circle Amplification of Complete Nematode Mitochondrial Genomes

    PubMed Central

    Tang, Sha; Hyman, Bradley C.

    2005-01-01

    To enable investigation of nematode mitochondrial DNA evolution, methodology has been developed to amplify intact nematode mitochondrial genomes in preparative yields using a rolling circle replication strategy. Successful reactions were generated from whole cell template DNA prepared by alkaline lysis of the rhabditid nematode Caenorhabditis elegans and a mermithid nematode, Thaumamermis cosgrovei. These taxa, representing the two major nematode classes Chromodorea and Enoplea, maintain mitochondrial genomes of 13.8 kb and 20.0 kb, respectively. Efficient amplifications were conducted on template DNA isolated from individual or pooled nematodes that were alive or stored at -80°C. Unexpectedly, these experiments revealed that multiple T. cosgrovei mitochondrial DNA haplotypes are maintained in our local population. Rolling circle amplification products can be used as templates for standard PCR reactions with specific primers that target mitochondrial genes or for direct DNA sequencing. PMID:19262866

  17. Uncoupling of sgRNAs from their associated barcodes during PCR amplification of combinatorial CRISPR screens

    PubMed Central

    2018-01-01

    Many implementations of pooled screens in mammalian cells rely on linking an element of interest to a barcode, with the latter subsequently quantitated by next generation sequencing. However, substantial uncoupling between these paired elements during lentiviral production has been reported, especially as the distance between elements increases. We detail that PCR amplification is another major source of uncoupling, and becomes more pronounced with increased amounts of DNA template molecules and PCR cycles. To lessen uncoupling in systems that use paired elements for detection, we recommend minimizing the distance between elements, using low and equal template DNA inputs for plasmid and genomic DNA during PCR, and minimizing the number of PCR cycles. We also present a vector design for conducting combinatorial CRISPR screens that enables accurate barcode-based detection with a single short sequencing read and minimal uncoupling. PMID:29799876

  18. DNA polymerase preference determines PCR priming efficiency.

    PubMed

    Pan, Wenjing; Byrne-Steele, Miranda; Wang, Chunlin; Lu, Stanley; Clemmons, Scott; Zahorchak, Robert J; Han, Jian

    2014-01-30

    Polymerase chain reaction (PCR) is one of the most important developments in modern biotechnology. However, PCR is known to introduce biases, especially during multiplex reactions. Recent studies have implicated the DNA polymerase as the primary source of bias, particularly initiation of polymerization on the template strand. In our study, amplification from a synthetic library containing a 12 nucleotide random portion was used to provide an in-depth characterization of DNA polymerase priming bias. The synthetic library was amplified with three commercially available DNA polymerases using an anchored primer with a random 3' hexamer end. After normalization, the next generation sequencing (NGS) results of the amplified libraries were directly compared to the unamplified synthetic library. Here, high throughput sequencing was used to systematically demonstrate and characterize DNA polymerase priming bias. We demonstrate that certain sequence motifs are preferred over others as primers where the six nucleotide sequences at the 3' end of the primer, as well as the sequences four base pairs downstream of the priming site, may influence priming efficiencies. DNA polymerases in the same family from two different commercial vendors prefer similar motifs, while another commercially available enzyme from a different DNA polymerase family prefers different motifs. Furthermore, the preferred priming motifs are GC-rich. The DNA polymerase preference for certain sequence motifs was verified by amplification from single-primer templates. We incorporated the observed DNA polymerase preference into a primer-design program that guides the placement of the primer to an optimal location on the template. DNA polymerase priming bias was characterized using a synthetic library amplification system and NGS. The characterization of DNA polymerase priming bias was then utilized to guide the primer-design process and demonstrate varying amplification efficiencies among three commercially available DNA polymerases. The results suggest that the interaction of the DNA polymerase with the primer:template junction during the initiation of DNA polymerization is very important in terms of overall amplification bias and has broader implications for both the primer design process and multiplex PCR.

  19. The Steric Gate of DNA Polymerase ι Regulates Ribonucleotide Incorporation and Deoxyribonucleotide Fidelity*

    PubMed Central

    Donigan, Katherine A.; McLenigan, Mary P.; Yang, Wei; Goodman, Myron F.; Woodgate, Roger

    2014-01-01

    Accurate DNA synthesis in vivo depends on the ability of DNA polymerases to select dNTPs from a nucleotide pool dominated by NTPs. High fidelity replicative polymerases have evolved to efficiently exclude NTPs while copying long stretches of undamaged DNA. However, to bypass DNA damage, cells utilize specialized low fidelity polymerases to perform translesion DNA synthesis (TLS). Of interest is human DNA polymerase ι (pol ι), which has been implicated in TLS of oxidative and UV-induced lesions. Here, we evaluate the ability of pol ι to incorporate NTPs during DNA synthesis. pol ι incorporates and extends NTPs opposite damaged and undamaged template bases in a template-specific manner. The Y39A “steric gate” pol ι mutant is considerably more active in the presence of Mn2+ compared with Mg2+ and exhibits a marked increase in NTP incorporation and extension, and surprisingly, it also exhibits increased dNTP base selectivity. Our results indicate that a single residue in pol ι is able to discriminate between NTPs and dNTPs during DNA synthesis. Because wild-type pol ι incorporates NTPs in a template-specific manner, certain DNA sequences may be “at risk” for elevated mutagenesis during pol ι-dependent TLS. Molecular modeling indicates that the constricted active site of wild-type pol ι becomes more spacious in the Y39A variant. Therefore, the Y39A substitution not only permits incorporation of ribonucleotides but also causes the enzyme to favor faithful Watson-Crick base pairing over mutagenic configurations. PMID:24532793

  20. Direct Detection and Sequencing of Damaged DNA Bases

    PubMed Central

    2011-01-01

    Products of various forms of DNA damage have been implicated in a variety of important biological processes, such as aging, neurodegenerative diseases, and cancer. Therefore, there exists great interest to develop methods for interrogating damaged DNA in the context of sequencing. Here, we demonstrate that single-molecule, real-time (SMRT®) DNA sequencing can directly detect damaged DNA bases in the DNA template - as a by-product of the sequencing method - through an analysis of the DNA polymerase kinetics that are altered by the presence of a modified base. We demonstrate the sequencing of several DNA templates containing products of DNA damage, including 8-oxoguanine, 8-oxoadenine, O6-methylguanine, 1-methyladenine, O4-methylthymine, 5-hydroxycytosine, 5-hydroxyuracil, 5-hydroxymethyluracil, or thymine dimers, and show that these base modifications can be readily detected with single-modification resolution and DNA strand specificity. We characterize the distinct kinetic signatures generated by these DNA base modifications. PMID:22185597

  1. Direct detection and sequencing of damaged DNA bases.

    PubMed

    Clark, Tyson A; Spittle, Kristi E; Turner, Stephen W; Korlach, Jonas

    2011-12-20

    Products of various forms of DNA damage have been implicated in a variety of important biological processes, such as aging, neurodegenerative diseases, and cancer. Therefore, there exists great interest to develop methods for interrogating damaged DNA in the context of sequencing. Here, we demonstrate that single-molecule, real-time (SMRT®) DNA sequencing can directly detect damaged DNA bases in the DNA template - as a by-product of the sequencing method - through an analysis of the DNA polymerase kinetics that are altered by the presence of a modified base. We demonstrate the sequencing of several DNA templates containing products of DNA damage, including 8-oxoguanine, 8-oxoadenine, O6-methylguanine, 1-methyladenine, O4-methylthymine, 5-hydroxycytosine, 5-hydroxyuracil, 5-hydroxymethyluracil, or thymine dimers, and show that these base modifications can be readily detected with single-modification resolution and DNA strand specificity. We characterize the distinct kinetic signatures generated by these DNA base modifications.

  2. Reading of the non-template DNA by transcription elongation factors.

    PubMed

    Svetlov, Vladimir; Nudler, Evgeny

    2018-05-14

    Unlike transcription initiation and termination, which have easily discernable signals such as promoters and terminators, elongation is regulated through a dynamic network involving RNA/DNA pause signals and states- rather than sequence-specific protein interactions. A report by Nedialkov et al. (in press) provides experimental evidence for sequence-specific recruitment of elongation factor RfaH to transcribing RNA polymerase (RNAP) and outlines the mechanism of gene expression regulation by restraint ("locking") of the DNA non-template strand. According to this model, the elongation complex pauses at the so called "operon polarity sequence" (found in some long bacterial operons coding for virulence genes), when the usually flexible non-template DNA strand adopts a distinct hairpin-loop conformation on the surface of transcribing RNAP. Sequence-specific binding of RfaH to this DNA segment facilitates conversion of RfaH from its inactive closed to its active open conformation. The interaction network formed between RfaH, non-template DNA, and RNAP locks DNA in a conformation that renders the elongation complex resistant to pausing and termination. The effects of such locking on transcript elongation can be mimicked by restraint of the non-template strand due to its shortening. This work advances our understanding of regulation of transcript elongation and has important implications for the action of general transcription factors, such as NusG, which lack apparent sequence-specificity, as well as for the mechanisms of other processes linked to transcription such as transcription-coupled DNA repair. This article is protected by copyright. All rights reserved. © 2018 John Wiley & Sons Ltd.

  3. Effort versus Reward: Preparing Samples for Fungal Community Characterization in High-Throughput Sequencing Surveys of Soils

    PubMed Central

    Song, Zewei; Schlatter, Dan; Kennedy, Peter; Kinkel, Linda L.; Kistler, H. Corby; Nguyen, Nhu; Bates, Scott T.

    2015-01-01

    Next generation fungal amplicon sequencing is being used with increasing frequency to study fungal diversity in various ecosystems; however, the influence of sample preparation on the characterization of fungal community is poorly understood. We investigated the effects of four procedural modifications to library preparation for high-throughput sequencing (HTS). The following treatments were considered: 1) the amount of soil used in DNA extraction, 2) the inclusion of additional steps (freeze/thaw cycles, sonication, or hot water bath incubation) in the extraction procedure, 3) the amount of DNA template used in PCR, and 4) the effect of sample pooling, either physically or computationally. Soils from two different ecosystems in Minnesota, USA, one prairie and one forest site, were used to assess the generality of our results. The first three treatments did not significantly influence observed fungal OTU richness or community structure at either site. Physical pooling captured more OTU richness compared to individual samples, but total OTU richness at each site was highest when individual samples were computationally combined. We conclude that standard extraction kit protocols are well optimized for fungal HTS surveys, but because sample pooling can significantly influence OTU richness estimates, it is important to carefully consider the study aims when planning sampling procedures. PMID:25974078

  4. Interplay of I-TASSER and QUARK for template-based and ab initio protein structure prediction in CASP10

    PubMed Central

    Zhang, Yang

    2014-01-01

    We develop and test a new pipeline in CASP10 to predict protein structures based on an interplay of I-TASSER and QUARK for both free-modeling (FM) and template-based modeling (TBM) targets. The most noteworthy observation is that sorting through the threading template pool using the QUARK-based ab initio models as probes allows the detection of distant-homology templates which might be ignored by the traditional sequence profile-based threading alignment algorithms. Further template assembly refinement by I-TASSER resulted in successful folding of two medium-sized FM targets with >150 residues. For TBM, the multiple threading alignments from LOMETS are, for the first time, incorporated into the ab initio QUARK simulations, which were further refined by I-TASSER assembly refinement. Compared with the traditional threading assembly refinement procedures, the inclusion of the threading-constrained ab initio folding models can consistently improve the quality of the full-length models as assessed by the GDT-HA and hydrogen-bonding scores. Despite the success, significant challenges still exist in domain boundary prediction and consistent folding of medium-size proteins (especially beta-proteins) for nonhomologous targets. Further developments of sensitive fold-recognition and ab initio folding methods are critical for solving these problems. PMID:23760925

  5. Interplay of I-TASSER and QUARK for template-based and ab initio protein structure prediction in CASP10.

    PubMed

    Zhang, Yang

    2014-02-01

    We develop and test a new pipeline in CASP10 to predict protein structures based on an interplay of I-TASSER and QUARK for both free-modeling (FM) and template-based modeling (TBM) targets. The most noteworthy observation is that sorting through the threading template pool using the QUARK-based ab initio models as probes allows the detection of distant-homology templates which might be ignored by the traditional sequence profile-based threading alignment algorithms. Further template assembly refinement by I-TASSER resulted in successful folding of two medium-sized FM targets with >150 residues. For TBM, the multiple threading alignments from LOMETS are, for the first time, incorporated into the ab initio QUARK simulations, which were further refined by I-TASSER assembly refinement. Compared with the traditional threading assembly refinement procedures, the inclusion of the threading-constrained ab initio folding models can consistently improve the quality of the full-length models as assessed by the GDT-HA and hydrogen-bonding scores. Despite the success, significant challenges still exist in domain boundary prediction and consistent folding of medium-size proteins (especially beta-proteins) for nonhomologous targets. Further developments of sensitive fold-recognition and ab initio folding methods are critical for solving these problems. Copyright © 2013 Wiley Periodicals, Inc.

  6. Differentiation of the glucocerebrosidase gene from pseudogene by long-template PCR: Implications for Gaucher disease

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tayebi, N.; Cushner, S.; Sidransky, E.

    1996-09-01

    We describe the use of long-template PCR to differentiate the glucocerebrosidase gene from its pseudogene, which will simplify molecular diagnostic testing and the detection of known and new mutations in patients with Gaucher disease. Gaucher disease results from the inherited deficiency of the lysosomal enzyme, glucocerebrosidase. Sixteen kilobases downstream of the glucocerebrosidase gene is a pseudogene, which is {approximately}2 kb shorter and has >96% identity to the coding regions of the functional gene. Many mutations encountered in Gaucher patients are identical to sequences ordinarily found only in the pseudogene, and some result from recombination between the gene and pseudogene. Thus,more » for diagnostic purposes it is essential to differentiate between sequences from the gene and pseudogene. 9 refs., 1 fig.« less

  7. Cloning-free template DNA preparation for cell-free protein synthesis via two-step PCR using versatile primer designs with short 3'-UTR.

    PubMed

    Nomoto, Mika; Tada, Yasuomi

    2018-01-01

    Cell-free protein synthesis (CFPS) systems largely retain the endogenous translation machinery of the host organism, making them highly applicable for proteomics analysis of diverse biological processes. However, laborious and time-consuming cloning procedures hinder progress with CFPS systems. Herein, we report the development of a rapid and efficient two-step polymerase chain reaction (PCR) method to prepare linear DNA templates for a wheat germ CFPS system. We developed a novel, effective short 3'-untranslated region (3'-UTR) sequence that facilitates translation. Application of the short 3'-UTR to two-step PCR enabled the generation of various transcription templates from the same plasmid, including fusion proteins with N- or C-terminal tags, and truncated proteins. Our method supports the cloning-free expression of target proteins using an mRNA pool from biological material. The established system is a highly versatile platform for in vitro protein synthesis using wheat germ CFPS. © 2017 Molecular Biology Society of Japan and John Wiley & Sons Australia, Ltd.

  8. Optimization of rotamers prior to template minimization improves stability predictions made by computational protein design.

    PubMed

    Davey, James A; Chica, Roberto A

    2015-04-01

    Computational protein design (CPD) predictions are highly dependent on the structure of the input template used. However, it is unclear how small differences in template geometry translate to large differences in stability prediction accuracy. Herein, we explored how structural changes to the input template affect the outcome of stability predictions by CPD. To do this, we prepared alternate templates by Rotamer Optimization followed by energy Minimization (ROM) and used them to recapitulate the stability of 84 protein G domain β1 mutant sequences. In the ROM process, side-chain rotamers for wild-type (WT) or mutant sequences are optimized on crystal or nuclear magnetic resonance (NMR) structures prior to template minimization, resulting in alternate structures termed ROM templates. We show that use of ROM templates prepared from sequences known to be stable results predominantly in improved prediction accuracy compared to using the minimized crystal or NMR structures. Conversely, ROM templates prepared from sequences that are less stable than the WT reduce prediction accuracy by increasing the number of false positives. These observed changes in prediction outcomes are attributed to differences in side-chain contacts made by rotamers in ROM templates. Finally, we show that ROM templates prepared from sequences that are unfolded or that adopt a nonnative fold result in the selective enrichment of sequences that are also unfolded or that adopt a nonnative fold, respectively. Our results demonstrate the existence of a rotamer bias caused by the input template that can be harnessed to skew predictions toward sequences displaying desired characteristics. © 2014 The Protein Society.

  9. Operating and Maintaining Energy Smart Schools Action Plan Template - All Action Plans

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    none,

    2009-07-01

    EnergySmart Schools action plan templates for benchmarking, lighting, HVAC, water heating, building envelope, transformer, plug loads, kitchen equipment, swimming pool, building automation system, other.

  10. Ion Torrent sequencing as a tool for mutation discovery in the flax (Linum usitatissimum L.) genome.

    PubMed

    Galindo-González, Leonardo; Pinzón-Latorre, David; Bergen, Erik A; Jensen, Dustin C; Deyholos, Michael K

    2015-01-01

    Detection of induced mutations is valuable for inferring gene function and for developing novel germplasm for crop improvement. Many reverse genetics approaches have been developed to identify mutations in genes of interest within a mutagenized population, including some approaches that rely on next-generation sequencing (e.g. exome capture, whole genome resequencing). As an alternative to these genome or exome-scale methods, we sought to develop a scalable and efficient method for detection of induced mutations that could be applied to a small number of target genes, using Ion Torrent technology. We developed this method in flax (Linum usitatissimum), to demonstrate its utility in a crop species. We used an amplicon-based approach in which DNA samples from an ethyl methanesulfonate (EMS)-mutagenized population were pooled and used as template in PCR reactions to amplify a region of each gene of interest. Barcodes were incorporated during PCR, and the pooled amplicons were sequenced using an Ion Torrent PGM. A pilot experiment with known SNPs showed that they could be detected at a frequency > 0.3% within the pools. We then selected eight genes for which we wanted to discover novel mutations, and applied our approach to screen 768 individuals from the EMS population, using either the Ion 314 or Ion 316 chips. Out of 29 potential mutations identified after processing the NGS reads, 16 mutations were confirmed using Sanger sequencing. The methodology presented here demonstrates the utility of Ion Torrent technology in detecting mutation variants in specific genome regions for large populations of a species such as flax. The methodology could be scaled-up to test >100 genes using the higher capacity chips now available from Ion Torrent.

  11. Ligation Bias in Illumina Next-Generation DNA Libraries: Implications for Sequencing Ancient Genomes

    PubMed Central

    Seguin-Orlando, Andaine; Schubert, Mikkel; Clary, Joel; Stagegaard, Julia; Alberdi, Maria T.; Prado, José Luis; Prieto, Alfredo; Willerslev, Eske; Orlando, Ludovic

    2013-01-01

    Ancient DNA extracts consist of a mixture of endogenous molecules and contaminant DNA templates, often originating from environmental microbes. These two populations of templates exhibit different chemical characteristics, with the former showing depurination and cytosine deamination by-products, resulting from post-mortem DNA damage. Such chemical modifications can interfere with the molecular tools used for building second-generation DNA libraries, and limit our ability to fully characterize the true complexity of ancient DNA extracts. In this study, we first use fresh DNA extracts to demonstrate that library preparation based on adapter ligation at AT-overhangs are biased against DNA templates starting with thymine residues, contrarily to blunt-end adapter ligation. We observe the same bias on fresh DNA extracts sheared on Bioruptor, Covaris and nebulizers. This contradicts previous reports suggesting that this bias could originate from the methods used for shearing DNA. This also suggests that AT-overhang adapter ligation efficiency is affected in a sequence-dependent manner and results in an uneven representation of different genomic contexts. We then show how this bias could affect the base composition of ancient DNA libraries prepared following AT-overhang ligation, mainly by limiting the ability to ligate DNA templates starting with thymines and therefore deaminated cytosines. This results in particular nucleotide misincorporation damage patterns, deviating from the signature generally expected for authenticating ancient sequence data. Consequently, we show that models adequate for estimating post-mortem DNA damage levels must be robust to the molecular tools used for building ancient DNA libraries. PMID:24205269

  12. TESS: a geometric hashing algorithm for deriving 3D coordinate templates for searching structural databases. Application to enzyme active sites.

    PubMed Central

    Wallace, A. C.; Borkakoti, N.; Thornton, J. M.

    1997-01-01

    It is well established that sequence templates such as those in the PROSITE and PRINTS databases are powerful tools for predicting the biological function and tertiary structure for newly derived protein sequences. The number of X-ray and NMR protein structures is increasing rapidly and it is apparent that a 3D equivalent of the sequence templates is needed. Here, we describe an algorithm called TESS that automatically derives 3D templates from structures deposited in the Brookhaven Protein Data Bank. While a new sequence can be searched for sequence patterns, a new structure can be scanned against these 3D templates to identify functional sites. As examples, 3D templates are derived for enzymes with an O-His-O "catalytic triad" and for the ribonucleases and lysozymes. When these 3D templates are applied to a large data set of nonidentical proteins, several interesting hits are located. This suggests that the development of a 3D template database may help to identify the function of new protein structures, if unknown, as well as to design proteins with specific functions. PMID:9385633

  13. A comparison of RNA with DNA in template-directed synthesis

    NASA Technical Reports Server (NTRS)

    Zielinski, M.; Kozlov, I. A.; Orgel, L. E.; Bada, J. L. (Principal Investigator)

    2000-01-01

    Nonenzymatic template-directed copying of RNA sequences rich in cytidylic acid using nucleoside 5'-(2-methylimidazol-1-yl phosphates) as substrates is substantially more efficient than the copying of corresponding DNA sequences. However, many sequences cannot be copied, and the prospect of replication in this system is remote, even for RNA. Surprisingly, wobble-pairing leads to much more efficient incorporation of G opposite U on RNA templates than of G opposite T on DNA templates.

  14. Strong transcription blockage mediated by R-loop formation within a G-rich homopurine–homopyrimidine sequence localized in the vicinity of the promoter

    PubMed Central

    Soo Shin, Jane Hae

    2017-01-01

    Abstract Guanine-rich (G-rich) homopurine–homopyrimidine nucleotide sequences can block transcription with an efficiency that depends upon their orientation, composition and length, as well as the presence of negative supercoiling or breaks in the non-template DNA strand. We report that a G-rich sequence in the non-template strand reduces the yield of T7 RNA polymerase transcription by more than an order of magnitude when positioned close (9 bp) to the promoter, in comparison to that for a distal (∼250 bp) location of the same sequence. This transcription blockage is much less pronounced for a C-rich sequence, and is not significant for an A-rich sequence. Remarkably, the blockage is not pronounced if transcription is performed in the presence of RNase H, which specifically digests the RNA strands within RNA–DNA hybrids. The blockage also becomes less pronounced upon reduced RNA polymerase concentration. Based upon these observations and those from control experiments, we conclude that the blockage is primarily due to the formation of stable RNA–DNA hybrids (R-loops), which inhibit successive rounds of transcription. Our results could be relevant to transcription dynamics in vivo (e.g. transcription ‘bursting’) and may also have practical implications for the design of expression vectors. PMID:28498974

  15. Strong transcription blockage mediated by R-loop formation within a G-rich homopurine-homopyrimidine sequence localized in the vicinity of the promoter.

    PubMed

    Belotserkovskii, Boris P; Soo Shin, Jane Hae; Hanawalt, Philip C

    2017-06-20

    Guanine-rich (G-rich) homopurine-homopyrimidine nucleotide sequences can block transcription with an efficiency that depends upon their orientation, composition and length, as well as the presence of negative supercoiling or breaks in the non-template DNA strand. We report that a G-rich sequence in the non-template strand reduces the yield of T7 RNA polymerase transcription by more than an order of magnitude when positioned close (9 bp) to the promoter, in comparison to that for a distal (∼250 bp) location of the same sequence. This transcription blockage is much less pronounced for a C-rich sequence, and is not significant for an A-rich sequence. Remarkably, the blockage is not pronounced if transcription is performed in the presence of RNase H, which specifically digests the RNA strands within RNA-DNA hybrids. The blockage also becomes less pronounced upon reduced RNA polymerase concentration. Based upon these observations and those from control experiments, we conclude that the blockage is primarily due to the formation of stable RNA-DNA hybrids (R-loops), which inhibit successive rounds of transcription. Our results could be relevant to transcription dynamics in vivo (e.g. transcription 'bursting') and may also have practical implications for the design of expression vectors. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. Nonenzymatic template-directed synthesis on hairpin oligonucleotides. 3. Incorporation of adenosine and uridine residues

    NASA Technical Reports Server (NTRS)

    Wu, T.; Orgel, L. E.

    1992-01-01

    We have used [32P]-labeled hairpin oligonucleotides to study template-directed synthesis on templates containing one or more A or T residues within a run of C residues. When nucleoside-5'-phosphoro(2-methyl)imidazolides are used as substrates, isolated A and T residues function efficiently in facilitating the incorporation of U and A, respectively. The reactions are regiospecific, producing mainly 3'-5'-phosphodiester bonds. Pairs of consecutive non-C residues are copied much less efficiently. Limited synthesis of CA and AC sequences on templates containing TG and GT sequences was observed along with some synthesis of the AA sequences on templates containing TT sequences. The other dimer sequences investigated, AA, AG, GA, TA, and AT, could not be copied. If A is absent from the reaction mixture, misincorporation of G residues is a significant reaction on templates containing an isolated T residue or two consecutive T residues. However, if both A and G are present, A is incorporated to a much greater extent than G. We believe that wobble-pairing between T and G is responsible for misincorporation when only G is present.

  17. Comparative modeling without implicit sequence alignments.

    PubMed

    Kolinski, Andrzej; Gront, Dominik

    2007-10-01

    The number of known protein sequences is about thousand times larger than the number of experimentally solved 3D structures. For more than half of the protein sequences a close or distant structural analog could be identified. The key starting point in a classical comparative modeling is to generate the best possible sequence alignment with a template or templates. With decreasing sequence similarity, the number of errors in the alignments increases and these errors are the main causes of the decreasing accuracy of the molecular models generated. Here we propose a new approach to comparative modeling, which does not require the implicit alignment - the model building phase explores geometric, evolutionary and physical properties of a template (or templates). The proposed method requires prior identification of a template, although the initial sequence alignment is ignored. The model is built using a very efficient reduced representation search engine CABS to find the best possible superposition of the query protein onto the template represented as a 3D multi-featured scaffold. The criteria used include: sequence similarity, predicted secondary structure consistency, local geometric features and hydrophobicity profile. For more difficult cases, the new method qualitatively outperforms existing schemes of comparative modeling. The algorithm unifies de novo modeling, 3D threading and sequence-based methods. The main idea is general and could be easily combined with other efficient modeling tools as Rosetta, UNRES and others.

  18. Whole genome sequencing of cytogenetically balanced chromosome translocations identifies potentially pathological gene disruptions and highlights the importance of microhomology in the mechanism of formation

    PubMed Central

    Gustavsson, Peter; Förster, Alisa; Hofmeister, Wolfgang; Wincent, Josephine; Zachariadis, Vasilios; Anderlid, Britt-Marie; Nordgren, Ann; Mäkitie, Outi; Wirta, Valtteri; Käller, Max; Vezzi, Francesco; Lupski, James R; Nordenskjöld, Magnus; Lundberg, Elisabeth Syk; Carvalho, Claudia M. B.; Lindstrand, Anna

    2016-01-01

    Most balanced translocations are thought to result mechanistically from non-homologous endjoining (NHEJ) or, in rare cases of recurrent events, by nonallelic homologous recombination (NAHR). Here, we use low coverage mate pair whole genome sequencing to fine map rearrangement breakpoint junctions in both phenotypically normal and affected translocation carriers. In total, 46 junctions from 22 carriers of balanced translocations were characterized. Genes were disrupted in 48% of the breakpoints; recessive genes in four normal carriers and known dominant intellectual disability genes in three affected carriers. Finally, seven candidate disease genes were disrupted in five carriers with neurocognitive disabilities (SVOPL, SUSD1, TOX, NCALD, SLC4A10) and one XX-male carrier with Tourette syndrome (LYPD6, GPC5). Breakpoint junction analyses revealed microhomology and small templated insertions in a substantive fraction of the analyzed translocations (17.4%; n=4); an observation that was substantiated by reanalysis of 37 previously published translocation junctions. Microhomology associated with templated-insertions is a characteristic seen in the breakpoint junctions of rearrangements mediated by the error prone replication-based repair mechanisms (RBMs). Our data implicate that a mechanism involving template switching might contribute to the formation of at least 15% of the interchromosomal translocation events. PMID:27862604

  19. Primer ID Validates Template Sampling Depth and Greatly Reduces the Error Rate of Next-Generation Sequencing of HIV-1 Genomic RNA Populations

    PubMed Central

    Zhou, Shuntai; Jones, Corbin; Mieczkowski, Piotr

    2015-01-01

    ABSTRACT Validating the sampling depth and reducing sequencing errors are critical for studies of viral populations using next-generation sequencing (NGS). We previously described the use of Primer ID to tag each viral RNA template with a block of degenerate nucleotides in the cDNA primer. We now show that low-abundance Primer IDs (offspring Primer IDs) are generated due to PCR/sequencing errors. These artifactual Primer IDs can be removed using a cutoff model for the number of reads required to make a template consensus sequence. We have modeled the fraction of sequences lost due to Primer ID resampling. For a typical sequencing run, less than 10% of the raw reads are lost to offspring Primer ID filtering and resampling. The remaining raw reads are used to correct for PCR resampling and sequencing errors. We also demonstrate that Primer ID reveals bias intrinsic to PCR, especially at low template input or utilization. cDNA synthesis and PCR convert ca. 20% of RNA templates into recoverable sequences, and 30-fold sequence coverage recovers most of these template sequences. We have directly measured the residual error rate to be around 1 in 10,000 nucleotides. We use this error rate and the Poisson distribution to define the cutoff to identify preexisting drug resistance mutations at low abundance in an HIV-infected subject. Collectively, these studies show that >90% of the raw sequence reads can be used to validate template sampling depth and to dramatically reduce the error rate in assessing a genetically diverse viral population using NGS. IMPORTANCE Although next-generation sequencing (NGS) has revolutionized sequencing strategies, it suffers from serious limitations in defining sequence heterogeneity in a genetically diverse population, such as HIV-1 due to PCR resampling and PCR/sequencing errors. The Primer ID approach reveals the true sampling depth and greatly reduces errors. Knowing the sampling depth allows the construction of a model of how to maximize the recovery of sequences from input templates and to reduce resampling of the Primer ID so that appropriate multiplexing can be included in the experimental design. With the defined sampling depth and measured error rate, we are able to assign cutoffs for the accurate detection of minority variants in viral populations. This approach allows the power of NGS to be realized without having to guess about sampling depth or to ignore the problem of PCR resampling, while also being able to correct most of the errors in the data set. PMID:26041299

  20. Chromosome rearrangements via template switching between diverged repeated sequences

    PubMed Central

    Anand, Ranjith P.; Tsaponina, Olga; Greenwell, Patricia W.; Lee, Cheng-Sheng; Du, Wei; Petes, Thomas D.

    2014-01-01

    Recent high-resolution genome analyses of cancer and other diseases have revealed the occurrence of microhomology-mediated chromosome rearrangements and copy number changes. Although some of these rearrangements appear to involve nonhomologous end-joining, many must have involved mechanisms requiring new DNA synthesis. Models such as microhomology-mediated break-induced replication (MM-BIR) have been invoked to explain these rearrangements. We examined BIR and template switching between highly diverged sequences in Saccharomyces cerevisiae, induced during repair of a site-specific double-strand break (DSB). Our data show that such template switches are robust mechanisms that give rise to complex rearrangements. Template switches between highly divergent sequences appear to be mechanistically distinct from the initial strand invasions that establish BIR. In particular, such jumps are less constrained by sequence divergence and exhibit a different pattern of microhomology junctions. BIR traversing repeated DNA sequences frequently results in complex translocations analogous to those seen in mammalian cells. These results suggest that template switching among repeated genes is a potent driver of genome instability and evolution. PMID:25367035

  1. Patterns of Spinal Sensory-Motor Connectivity Prescribed by a Dorsoventral Positional Template

    PubMed Central

    Sürmeli, Gülşen; Akay, Turgay; Ippolito, Gregory; Tucker, Philip W; Jessell, Thomas M

    2011-01-01

    Summary Sensory-motor circuits in the spinal cord are constructed with a fine specificity that coordinates motor behavior, but the mechanisms that direct sensory connections with their motor neuron partners remain unclear. The dorsoventral settling position of motor pools in the spinal cord is known to match the distal-to-proximal position of their muscle targets in the limb, but the significance of invariant motor neuron positioning is unknown. An analysis of sensory-motor connectivity patterns in FoxP1 mutant mice, where motor neuron position has been scrambled, shows that the final pattern of sensory-motor connections is initiated by the projection of sensory axons to discrete dorsoventral domains of the spinal cord without regard for motor neuron subtype, or indeed, the presence of motor neurons. By implication, the clustering and dorsoventral settling position of motor neuron pools serves as a determinant of the pattern of sensory input specificity, and thus motor coordination. PMID:22036571

  2. Information theory-based algorithm for in silico prediction of PCR products with whole genomic sequences as templates.

    PubMed

    Cao, Youfang; Wang, Lianjie; Xu, Kexue; Kou, Chunhai; Zhang, Yulei; Wei, Guifang; He, Junjian; Wang, Yunfang; Zhao, Liping

    2005-07-26

    A new algorithm for assessing similarity between primer and template has been developed based on the hypothesis that annealing of primer to template is an information transfer process. Primer sequence is converted to a vector of the full potential hydrogen numbers (3 for G or C, 2 for A or T), while template sequence is converted to a vector of the actual hydrogen bond numbers formed after primer annealing. The former is considered as source information and the latter destination information. An information coefficient is calculated as a measure for fidelity of this information transfer process and thus a measure of similarity between primer and potential annealing site on template. Successful prediction of PCR products from whole genomic sequences with a computer program based on the algorithm demonstrated the potential of this new algorithm in areas like in silico PCR and gene finding.

  3. Replicase activity of purified recombinant protein P2 of double-stranded RNA bacteriophage phi6.

    PubMed

    Makeyev, E V; Bamford, D H

    2000-01-04

    In nature, synthesis of both minus- and plus-sense RNA strands of all the known double-stranded RNA viruses occurs in the interior of a large protein assembly referred to as the polymerase complex. In addition to other proteins, the complex contains a putative polymerase possessing characteristic sequence motifs. However, none of the previous studies has shown template-dependent RNA synthesis directly with an isolated putative polymerase protein. In this report, recombinant protein P2 of double-stranded RNA bacteriophage phi6 was purified and demonstrated in an in vitro enzymatic assay to act as the replicase. The enzyme efficiently utilizes phage-specific, positive-sense RNA substrates to produce double-stranded RNA molecules, which are formed by newly synthesized, full-length minus-strands base paired with the plus-strand templates. P2-catalyzed replication is also shown to be very effective with a broad range of heterologous single-stranded RNA templates. The importance and implications of these results are discussed.

  4. Base pairing among three cis-acting sequences contributes to template switching during hepadnavirus reverse transcription.

    PubMed

    Liu, Ning; Tian, Ru; Loeb, Daniel D

    2003-02-18

    Synthesis of the relaxed-circular (RC) DNA genome of hepadnaviruses requires two template switches during plus-strand DNA synthesis: primer translocation and circularization. Although primer translocation and circularization use different donor and acceptor sequences, and are distinct temporally, they share the common theme of switching from one end of the minus-strand template to the other end. Studies of duck hepatitis B virus have indicated that, in addition to the donor and acceptor sequences, three other cis-acting sequences, named 3E, M, and 5E, are required for the synthesis of RC DNA by contributing to primer translocation and circularization. The mechanism by which 3E, M, and 5E act was not known. We present evidence that these sequences function by base pairing with each other within the minus-strand template. 3E base-pairs with one portion of M (M3) and 5E base-pairs with an adjacent portion of M (M5). We found that disrupting base pairing between 3E and M3 and between 5E and M5 inhibited primer translocation and circularization. More importantly, restoring base pairing with mutant sequences restored the production of RC DNA. These results are consistent with the model that, within duck hepatitis B virus capsids, the ends of the minus-strand template are juxtaposed via base pairing to facilitate the two template switches during plus-strand DNA synthesis.

  5. Effect of sustained elevated temperature prior to amplification on template copy number estimation using digital polymerase chain reaction.

    PubMed

    Bhat, Somanath; McLaughlin, Jacob L H; Emslie, Kerry R

    2011-02-21

    Digital polymerase chain reaction (dPCR) has the potential to enable accurate quantification of target DNA copy number provided that all target DNA molecules are successfully amplified. Following duplex dPCR analysis from a linear DNA target sequence that contains single copies of two independent template sequences, we have observed that amplification of both templates in a single partition does not always occur. To investigate this finding, we heated the target DNA solution to 95 °C for increasing time intervals and then immediately chilled on ice prior to preparing the dPCR mix. We observed an exponential decline in estimated copy number (R(2)≥ 0.98) of the two template sequences when amplified from either a linearized plasmid or a 388 base pair (bp) amplicon containing the same two template sequences. The distribution of amplifiable templates and the final concentration (copies per µL) were both affected by heat treatment of the samples at 95 °C from 0 s to 30 min. The proportion of target sequences from which only one of the two templates was amplified in a single partition (either 1507 or hmg only) increased over time, while the proportion of target sequences where both templates were amplified (1507 and hmg) in each individual partition declined rapidly from 94% to 52% (plasmid) and 88% to 31% (388 bp amplicon) suggesting an increase in number of targets from which both templates no longer amplify. A 10 min incubation at 95 °C reduced the initial amplifiable template concentration of the plasmid and the 388 bp amplicon by 59% and 91%, respectively. To determine if a similar decrease in amplifiable target occurs during the default pre-activation step of typical PCR amplification protocol, we used mastermixes with a 20 s or 10 min hot-start. The choice of mastermix and consequent pre-activation time did not affect the estimated plasmid concentration. Therefore, we conclude that prolonged exposure of this DNA template to elevated temperatures could lead to significant bias in dPCR measurements. However, care must be taken when designing PCR and non-PCR based experiments by reducing exposure of the DNA template to sustained elevated temperatures in order to improve accuracy in copy number estimation and concentration determination.

  6. Facilitated sequence counting and assembly by template mutagenesis

    PubMed Central

    Levy, Dan; Wigler, Michael

    2014-01-01

    Presently, inferring the long-range structure of the DNA templates is limited by short read lengths. Accurate template counts suffer from distortions occurring during PCR amplification. We explore the utility of introducing random mutations in identical or nearly identical templates to create distinguishable patterns that are inherited during subsequent copying. We simulate the applications of this process under assumptions of error-free sequencing and perfect mapping, using cytosine deamination as a model for mutation. The simulations demonstrate that within readily achievable conditions of nucleotide conversion and sequence coverage, we can accurately count the number of otherwise identical molecules as well as connect variants separated by long spans of identical sequence. We discuss many potential applications, such as transcript profiling, isoform assembly, haplotype phasing, and de novo genome assembly. PMID:25313059

  7. Cloning and characterization of an 11S legumin, Car i 4, a major allergen in pecan.

    PubMed

    Sharma, Girdhari M; Irsigler, Andre; Dhanarajan, Pushparani; Ayuso, Rosalia; Bardina, Luda; Sampson, Hugh A; Roux, Kenneth H; Sathe, Shridhar K

    2011-09-14

    Among tree nut allergens, pecan allergens remain to be identified and characterized. The objective was to demonstrate the IgE-binding ability of pecan 11S legumin and characterize its sequential IgE-binding epitopes. The 11S legumin gene was amplified from a pecan cDNA library and expressed as a fusion protein in Escherichia coli. The native 11S legumin in pecan extract was identified by mass spectrometry/mass spectrometry (MS/MS). Sequential epitopes were determined by probing the overlapping peptides with three serum pools prepared from different patients' sera. A three-dimensional model was generated using almond legumin as a template and compared with known sequential epitopes on other allergenic tree nut homologues. Of 28 patients tested by dot blot, 16 (57%) bound to 11S legumin, designated Car i 4. MS/MS sequencing of native 11S legumin identified 33 kDa acidic and 20-22 kDa basic subunits. Both pecan and walnut seed protein extracts inhibited IgE binding to recombinant Car i 4, suggesting cross-reactivity with Jug r 4. Sequential epitope mapping results of Car i 4 revealed weak, moderate, and strong reactivity of serum pools against 10, 5, and 4 peptides, respectively. Seven peptides were recognized by all three serum pools, of which two were strongly reactive. The strongly reactive peptides were located in three discrete regions of the Car i 4 acidic subunit sequence (residues 118-132, 208-219, and 238-249). Homology modeling of Car i 4 revealed significant overlapping regions shared in common with other tree nut legumins.

  8. Partial bisulfite conversion for unique template sequencing

    PubMed Central

    Kumar, Vijay; Rosenbaum, Julie; Wang, Zihua; Forcier, Talitha; Ronemus, Michael; Wigler, Michael

    2018-01-01

    Abstract We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135 000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone. PMID:29161423

  9. Continuous in vitro evolution of bacteriophage RNA polymerase promoters

    NASA Technical Reports Server (NTRS)

    Breaker, R. R.; Banerji, A.; Joyce, G. F.

    1994-01-01

    Rapid in vitro evolution of bacteriophage T7, T3, and SP6 RNA polymerase promoters was achieved by a method that allows continuous enrichment of DNAs that contain functional promoter elements. This method exploits the ability of a special class of nucleic acid molecules to replicate continuously in the presence of both a reverse transcriptase and a DNA-dependent RNA polymerase. Replication involves the synthesis of both RNA and cDNA intermediates. The cDNA strand contains an embedded promoter sequence, which becomes converted to a functional double-stranded promoter element, leading to the production of RNA transcripts. Synthetic cDNAs, including those that contain randomized promoter sequences, can be used to initiate the amplification cycle. However, only those cDNAs that contain functional promoter sequences are able to produce RNA transcripts. Furthermore, each RNA transcript encodes the RNA polymerase promoter sequence that was responsible for initiation of its own transcription. Thus, the population of amplifying molecules quickly becomes enriched for those templates that encode functional promoters. Optimal promoter sequences for phage T7, T3, and SP6 RNA polymerase were identified after a 2-h amplification reaction, initiated in each case with a pool of synthetic cDNAs encoding greater than 10(10) promoter sequence variants.

  10. A computational proposal for designing structured RNA pools for in vitro selection of RNAs.

    PubMed

    Kim, Namhee; Gan, Hin Hark; Schlick, Tamar

    2007-04-01

    Although in vitro selection technology is a versatile experimental tool for discovering novel synthetic RNA molecules, finding complex RNA molecules is difficult because most RNAs identified from random sequence pools are simple motifs, consistent with recent computational analysis of such sequence pools. Thus, enriching in vitro selection pools with complex structures could increase the probability of discovering novel RNAs. Here we develop an approach for engineering sequence pools that links RNA sequence space regions with corresponding structural distributions via a "mixing matrix" approach combined with a graph theory analysis. We define five classes of mixing matrices motivated by covariance mutations in RNA; these constructs define nucleotide transition rates and are applied to chosen starting sequences to yield specific nonrandom pools. We examine the coverage of sequence space as a function of the mixing matrix and starting sequence via clustering analysis. We show that, in contrast to random sequences, which are associated only with a local region of sequence space, our designed pools, including a structured pool for GTP aptamers, can target specific motifs. It follows that experimental synthesis of designed pools can benefit from using optimized starting sequences, mixing matrices, and pool fractions associated with each of our constructed pools as a guide. Automation of our approach could provide practical tools for pool design applications for in vitro selection of RNAs and related problems.

  11. Restricted N-glycan conformational space in the PDB and its implication in glycan structure modeling.

    PubMed

    Jo, Sunhwan; Lee, Hui Sun; Skolnick, Jeffrey; Im, Wonpil

    2013-01-01

    Understanding glycan structure and dynamics is central to understanding protein-carbohydrate recognition and its role in protein-protein interactions. Given the difficulties in obtaining the glycan's crystal structure in glycoconjugates due to its flexibility and heterogeneity, computational modeling could play an important role in providing glycosylated protein structure models. To address if glycan structures available in the PDB can be used as templates or fragments for glycan modeling, we present a survey of the N-glycan structures of 35 different sequences in the PDB. Our statistical analysis shows that the N-glycan structures found on homologous glycoproteins are significantly conserved compared to the random background, suggesting that N-glycan chains can be confidently modeled with template glycan structures whose parent glycoproteins share sequence similarity. On the other hand, N-glycan structures found on non-homologous glycoproteins do not show significant global structural similarity. Nonetheless, the internal substructures of these N-glycans, particularly, the substructures that are closer to the protein, show significantly similar structures, suggesting that such substructures can be used as fragments in glycan modeling. Increased interactions with protein might be responsible for the restricted conformational space of N-glycan chains. Our results suggest that structure prediction/modeling of N-glycans of glycoconjugates using structure database could be effective and different modeling approaches would be needed depending on the availability of template structures.

  12. Restricted N-glycan Conformational Space in the PDB and Its Implication in Glycan Structure Modeling

    PubMed Central

    Jo, Sunhwan; Lee, Hui Sun; Skolnick, Jeffrey; Im, Wonpil

    2013-01-01

    Understanding glycan structure and dynamics is central to understanding protein-carbohydrate recognition and its role in protein-protein interactions. Given the difficulties in obtaining the glycan's crystal structure in glycoconjugates due to its flexibility and heterogeneity, computational modeling could play an important role in providing glycosylated protein structure models. To address if glycan structures available in the PDB can be used as templates or fragments for glycan modeling, we present a survey of the N-glycan structures of 35 different sequences in the PDB. Our statistical analysis shows that the N-glycan structures found on homologous glycoproteins are significantly conserved compared to the random background, suggesting that N-glycan chains can be confidently modeled with template glycan structures whose parent glycoproteins share sequence similarity. On the other hand, N-glycan structures found on non-homologous glycoproteins do not show significant global structural similarity. Nonetheless, the internal substructures of these N-glycans, particularly, the substructures that are closer to the protein, show significantly similar structures, suggesting that such substructures can be used as fragments in glycan modeling. Increased interactions with protein might be responsible for the restricted conformational space of N-glycan chains. Our results suggest that structure prediction/modeling of N-glycans of glycoconjugates using structure database could be effective and different modeling approaches would be needed depending on the availability of template structures. PMID:23516343

  13. Molecular barcodes detect redundancy and contamination in hairpin-bisulfite PCR

    PubMed Central

    Miner, Brooks E.; Stöger, Reinhard J.; Burden, Alice F.; Laird, Charles D.; Hansen, R. Scott

    2004-01-01

    PCR amplification of limited amounts of DNA template carries an increased risk of product redundancy and contamination. We use molecular barcoding to label each genomic DNA template with an individual sequence tag prior to PCR amplification. In addition, we include molecular ‘batch-stamps’ that effectively label each genomic template with a sample ID and analysis date. This highly sensitive method identifies redundant and contaminant sequences and serves as a reliable method for positive identification of desired sequences; we can therefore capture accurately the genomic template diversity in the sample analyzed. Although our application described here involves the use of hairpin-bisulfite PCR for amplification of double-stranded DNA, the method can readily be adapted to single-strand PCR. Useful applications will include analyses of limited template DNA for biomedical, ancient DNA and forensic purposes. PMID:15459281

  14. Exome Pool-Seq in neurodevelopmental disorders.

    PubMed

    Popp, Bernt; Ekici, Arif B; Thiel, Christian T; Hoyer, Juliane; Wiesener, Antje; Kraus, Cornelia; Reis, André; Zweier, Christiane

    2017-12-01

    High throughput sequencing has greatly advanced disease gene identification, especially in heterogeneous entities. Despite falling costs this is still an expensive and laborious technique, particularly when studying large cohorts. To address this problem we applied Exome Pool-Seq as an economic and fast screening technology in neurodevelopmental disorders (NDDs). Sequencing of 96 individuals can be performed in eight pools of 12 samples on less than one Illumina sequencer lane. In a pilot study with 96 cases we identified 27 variants, likely or possibly affecting function. Twenty five of these were identified in 923 established NDD genes (based on SysID database, status November 2016) (ACTB, AHDC1, ANKRD11, ATP6V1B2, ATRX, CASK, CHD8, GNAS, IFIH1, KCNQ2, KMT2A, KRAS, MAOA, MED12, MED13L, RIT1, SETD5, SIN3A, TCF4, TRAPPC11, TUBA1A, WAC, ZBTB18, ZMYND11), two in 543 (SysID) candidate genes (ZNF292, BPTF), and additionally a de novo loss-of-function variant in LRRC7, not previously implicated in NDDs. Most of them were confirmed to be de novo, but we also identified X-linked or autosomal-dominantly or autosomal-recessively inherited variants. With a detection rate of 28%, Exome Pool-Seq achieves comparable results to individual exome analyses but reduces costs by >85%. Compared with other large scale approaches using Molecular Inversion Probes (MIP) or gene panels, it allows flexible re-analysis of data. Exome Pool-Seq is thus well suited for large-scale, cost-efficient and flexible screening in characterized but heterogeneous entities like NDDs.

  15. Base pairing among three cis-acting sequences contributes to template switching during hepadnavirus reverse transcription

    PubMed Central

    Liu, Ning; Tian, Ru; Loeb, Daniel D.

    2003-01-01

    Synthesis of the relaxed-circular (RC) DNA genome of hepadnaviruses requires two template switches during plus-strand DNA synthesis: primer translocation and circularization. Although primer translocation and circularization use different donor and acceptor sequences, and are distinct temporally, they share the common theme of switching from one end of the minus-strand template to the other end. Studies of duck hepatitis B virus have indicated that, in addition to the donor and acceptor sequences, three other cis-acting sequences, named 3E, M, and 5E, are required for the synthesis of RC DNA by contributing to primer translocation and circularization. The mechanism by which 3E, M, and 5E act was not known. We present evidence that these sequences function by base pairing with each other within the minus-strand template. 3E base-pairs with one portion of M (M3) and 5E base-pairs with an adjacent portion of M (M5). We found that disrupting base pairing between 3E and M3 and between 5E and M5 inhibited primer translocation and circularization. More importantly, restoring base pairing with mutant sequences restored the production of RC DNA. These results are consistent with the model that, within duck hepatitis B virus capsids, the ends of the minus-strand template are juxtaposed via base pairing to facilitate the two template switches during plus-strand DNA synthesis. PMID:12578983

  16. Evaluation of the Implications of Nanoscale Architectures on Contextual Knowledge Discovery and Memory: Self-Assembled Architectures and Memory

    DTIC Science & Technology

    2008-05-01

    patterns. Our strategy to nucleate Ag nanoparticles has been to use a templating protein (e.g., streptavidin) that has been chemically pre- charged with...assembly is used to direct the formation of switching devices and wires to create logic circuitry, memory, and I/O interfaces . We can control the reaction...determines the formation of structures (through complementarity ). Sequence design is important because it determines many aspects of the target DNA

  17. Primer-independent RNA sequencing with bacteriophage phi6 RNA polymerase and chain terminators.

    PubMed

    Makeyev, E V; Bamford, D H

    2001-05-01

    Here we propose a new general method for directly determining RNA sequence based on the use of the RNA-dependent RNA polymerase from bacteriophage phi6 and the chain terminators (RdRP sequencing). The following properties of the polymerase render it appropriate for this application: (1) the phi6 polymerase can replicate a number of single-stranded RNA templates in vitro. (2) In contrast to the primer-dependent DNA polymerases utilized in the sequencing procedure by Sanger et al. (Proc Natl Acad Sci USA, 1977, 74:5463-5467), it initiates nascent strand synthesis without a primer, starting the polymerization on the very 3'-terminus of the template. (3) The polymerase can incorporate chain-terminating nucleotide analogs into the nascent RNA chain to produce a set of base-specific termination products. Consequently, 3' proximal or even complete sequence of many target RNA molecules can be rapidly deduced without prior sequence information. The new technique proved useful for sequencing several synthetic ssRNA templates. Furthermore, using genomic segments of the bluetongue virus we show that RdRP sequencing can also be applied to naturally occurring dsRNA templates. This suggests possible uses of the method in the RNA virus research and diagnostics.

  18. A statistical method for the detection of variants from next-generation resequencing of DNA pools.

    PubMed

    Bansal, Vikas

    2010-06-15

    Next-generation sequencing technologies have enabled the sequencing of several human genomes in their entirety. However, the routine resequencing of complete genomes remains infeasible. The massive capacity of next-generation sequencers can be harnessed for sequencing specific genomic regions in hundreds to thousands of individuals. Sequencing-based association studies are currently limited by the low level of multiplexing offered by sequencing platforms. Pooled sequencing represents a cost-effective approach for studying rare variants in large populations. To utilize the power of DNA pooling, it is important to accurately identify sequence variants from pooled sequencing data. Detection of rare variants from pooled sequencing represents a different challenge than detection of variants from individual sequencing. We describe a novel statistical approach, CRISP [Comprehensive Read analysis for Identification of Single Nucleotide Polymorphisms (SNPs) from Pooled sequencing] that is able to identify both rare and common variants by using two approaches: (i) comparing the distribution of allele counts across multiple pools using contingency tables and (ii) evaluating the probability of observing multiple non-reference base calls due to sequencing errors alone. Information about the distribution of reads between the forward and reverse strands and the size of the pools is also incorporated within this framework to filter out false variants. Validation of CRISP on two separate pooled sequencing datasets generated using the Illumina Genome Analyzer demonstrates that it can detect 80-85% of SNPs identified using individual sequencing while achieving a low false discovery rate (3-5%). Comparison with previous methods for pooled SNP detection demonstrates the significantly lower false positive and false negative rates for CRISP. Implementation of this method is available at http://polymorphism.scripps.edu/~vbansal/software/CRISP/.

  19. Partial bisulfite conversion for unique template sequencing.

    PubMed

    Kumar, Vijay; Rosenbaum, Julie; Wang, Zihua; Forcier, Talitha; Ronemus, Michael; Wigler, Michael; Levy, Dan

    2018-01-25

    We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135 000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. A potential role for RNA interference in controlling the activity of the human LINE-1 retrotransposon.

    PubMed

    Soifer, Harris S; Zaragoza, Adriana; Peyvan, Maany; Behlke, Mark A; Rossi, John J

    2005-01-01

    Long interspersed nuclear elements (LINE-1 or L1) comprise 17% of the human genome, although only 80-100 L1s are considered retrotransposition-competent (RC-L1). Despite their small number, RC-L1s are still potential hazards to genome integrity through insertional mutagenesis, unequal recombination and chromosome rearrangements. In this study, we provide several lines of evidence that the LINE-1 retrotransposon is susceptible to RNA interference (RNAi). First, double-stranded RNA (dsRNA) generated in vitro from an L1 template is converted into functional short interfering RNA (siRNA) by DICER, the RNase III enzyme that initiates RNAi in human cells. Second, pooled siRNA from in vitro cleavage of L1 dsRNA, as well as synthetic L1 siRNA, targeting the 5'-UTR leads to sequence-specific mRNA degradation of an L1 fusion transcript. Finally, both synthetic and pooled siRNA suppressed retrotransposition from a highly active RC-L1 clone in cell culture assay. Our report is the first to demonstrate that a human transposable element is subjected to RNAi.

  1. Refining the Results of a Classical SELEX Experiment by Expanding the Sequence Data Set of an Aptamer Pool Selected for Protein A

    PubMed Central

    2018-01-01

    New, as yet undiscovered aptamers for Protein A were identified by applying next generation sequencing (NGS) to a previously selected aptamer pool. This pool was obtained in a classical SELEX (Systematic Evolution of Ligands by EXponential enrichment) experiment using the FluMag-SELEX procedure followed by cloning and Sanger sequencing. PA#2/8 was identified as the only Protein A-binding aptamer from the Sanger sequence pool, and was shown to be able to bind intact cells of Staphylococcus aureus. In this study, we show the extension of the SELEX results by re-sequencing of the same aptamer pool using a medium throughput NGS approach and data analysis. Both data pools were compared. They confirm the selection of a highly complex and heterogeneous oligonucleotide pool and show consistently a high content of orphans as well as a similar relative frequency of certain sequence groups. But in contrast to the Sanger data pool, the NGS pool was clearly dominated by one sequence group containing the known Protein A-binding aptamer PA#2/8 as the most frequent sequence in this group. In addition, we found two new sequence groups in the NGS pool represented by PA-C10 and PA-C8, respectively, which also have high specificity for Protein A. Comparative affinity studies reveal differences between the aptamers and confirm that PA#2/8 remains the most potent sequence within the selected aptamer pool reaching affinities in the low nanomolar range of KD = 20 ± 1 nM. PMID:29495282

  2. Refining the Results of a Classical SELEX Experiment by Expanding the Sequence Data Set of an Aptamer Pool Selected for Protein A.

    PubMed

    Stoltenburg, Regina; Strehlitz, Beate

    2018-02-24

    New, as yet undiscovered aptamers for Protein A were identified by applying next generation sequencing (NGS) to a previously selected aptamer pool. This pool was obtained in a classical SELEX (Systematic Evolution of Ligands by EXponential enrichment) experiment using the FluMag-SELEX procedure followed by cloning and Sanger sequencing. PA#2/8 was identified as the only Protein A-binding aptamer from the Sanger sequence pool, and was shown to be able to bind intact cells of Staphylococcus aureus . In this study, we show the extension of the SELEX results by re-sequencing of the same aptamer pool using a medium throughput NGS approach and data analysis. Both data pools were compared. They confirm the selection of a highly complex and heterogeneous oligonucleotide pool and show consistently a high content of orphans as well as a similar relative frequency of certain sequence groups. But in contrast to the Sanger data pool, the NGS pool was clearly dominated by one sequence group containing the known Protein A-binding aptamer PA#2/8 as the most frequent sequence in this group. In addition, we found two new sequence groups in the NGS pool represented by PA-C10 and PA-C8, respectively, which also have high specificity for Protein A. Comparative affinity studies reveal differences between the aptamers and confirm that PA#2/8 remains the most potent sequence within the selected aptamer pool reaching affinities in the low nanomolar range of K D = 20 ± 1 nM.

  3. Polymerase ribozyme efficiency increased by G/T-rich DNA oligonucleotides

    PubMed Central

    Yao, Chengguo; Müller, Ulrich F.

    2011-01-01

    The RNA world hypothesis states that the early evolution of life went through a stage where RNA served as genome and as catalyst. The replication of RNA world organisms would have been facilitated by ribozymes that catalyze RNA polymerization. To recapitulate an RNA world in the laboratory, a series of RNA polymerase ribozymes was developed previously. However, these ribozymes have a polymerization efficiency that is too low for self-replication, and the most efficient ribozymes prefer one specific template sequence. The limiting factor for polymerization efficiency is the weak sequence-independent binding to its primer/template substrate. Most of the known polymerase ribozymes bind an RNA heptanucleotide to form the P2 duplex on the ribozyme. By modifying this heptanucleotide, we were able to significantly increase polymerization efficiency. Truncations at the 3′-terminus of this heptanucleotide increased full-length primer extension by 10-fold, on a specific template sequence. In contrast, polymerization on several different template sequences was improved dramatically by replacing the RNA heptanucleotide with DNA oligomers containing randomized sequences of 15 nt. The presence of G and T in the random sequences was sufficient for this effect, with an optimal composition of 60% G and 40% T. Our results indicate that these DNA sequences function by establishing many weak and nonspecific base-pairing interactions to the single-stranded portion of the template. Such low-specificity interactions could have had important functions in an RNA world. PMID:21622900

  4. Gause's Principle and the Effect of Resource Partitioning on the Dynamical Coexistence of Replicating Templates

    PubMed Central

    Szilágyi, András; Zachar, István; Szathmáry, Eörs

    2013-01-01

    Models of competitive template replication, although basic for replicator dynamics and primordial evolution, have not yet taken different sequences explicitly into account, neither have they analyzed the effect of resource partitioning (feeding on different resources) on coexistence. Here we show by analytical and numerical calculations that Gause's principle of competitive exclusion holds for template replicators if resources (nucleotides) affect growth linearly and coexistence is at fixed point attractors. Cases of complementary or homologous pairing between building blocks with parallel or antiparallel strands show no deviation from the rule that the nucleotide compositions of stably coexisting species must be different and there cannot be more coexisting replicator species than nucleotide types. Besides this overlooked mechanism of template coexistence we show also that interesting sequence effects prevail as parts of sequences that are copied earlier affect coexistence more strongly due to the higher concentration of the corresponding replication intermediates. Template and copy always count as one species due their constraint of strict stoichiometric coupling. Stability of fixed-point coexistence tends to decrease with the length of sequences, although this effect is unlikely to be detrimental for sequences below 100 nucleotides. In sum, resource partitioning (niche differentiation) is the default form of competitive coexistence for replicating templates feeding on a cocktail of different nucleotides, as it may have been the case in the RNA world. Our analysis of different pairing and strand orientation schemes is relevant for artificial and potentially astrobiological genetics. PMID:23990769

  5. Non coding extremities of the seven influenza virus type C vRNA segments: effect on transcription and replication by the type C and type A polymerase complexes

    PubMed Central

    Crescenzo-Chaigne, Bernadette; Barbezange, Cyril; van der Werf, Sylvie

    2008-01-01

    Background The transcription/replication of the influenza viruses implicate the terminal nucleotide sequences of viral RNA, which comprise sequences at the extremities conserved among the genomic segments as well as variable 3' and 5' non-coding (NC) regions. The plasmid-based system for the in vivo reconstitution of functional ribonucleoproteins, upon expression of viral-like RNAs together with the nucleoprotein and polymerase proteins has been widely used to analyze transcription/replication of influenza viruses. It was thus shown that the type A polymerase could transcribe and replicate type A, B, or C vRNA templates whereas neither type B nor type C polymerases were able to transcribe and replicate type A templates efficiently. Here we studied the importance of the NC regions from the seven segments of type C influenza virus for efficient transcription/replication by the type A and C polymerases. Results The NC sequences of the seven genomic segments of the type C influenza virus C/Johannesburg/1/66 strain were found to be more variable in length than those of the type A and B viruses. The levels of transcription/replication of viral-like vRNAs harboring the NC sequences of the respective type C virus segments flanking the CAT reporter gene were comparable in the presence of either type C or type A polymerase complexes except for the NS and PB2-like vRNAs. For the NS-like vRNA, the transcription/replication level was higher after introduction of a U residue at position 6 in the 5' NC region as for all other segments. For the PB2-like vRNA the CAT expression level was particularly reduced with the type C polymerase. Analysis of mutants of the 5' NC sequence in the PB2-like vRNA, the shortest 5' NC sequence among the seven segments, showed that additional sequences within the PB2 ORF were essential for the efficiency of transcription but not replication by the type C polymerase complex. Conclusion In the context of a PB2-like reporter vRNA template, the sequence upstream the polyU stretch plays a role in the transcription/replication process by the type C polymerase complex. PMID:18973655

  6. Short Communication: Analysis of Minor Populations of Human Immunodeficiency Virus by Primer Identification and Insertion-Deletion and Carry Forward Correction Pipelines.

    PubMed

    Hughes, Paul; Deng, Wenjie; Olson, Scott C; Coombs, Robert W; Chung, Michael H; Frenkel, Lisa M

    2016-03-01

    Accurate analysis of minor populations of drug-resistant HIV requires analysis of a sufficient number of viral templates. We assessed the effect of experimental conditions on the analysis of HIV pol 454 pyrosequences generated from plasma using (1) the "Insertion-deletion (indel) and Carry Forward Correction" (ICC) pipeline, which clusters sequence reads using a nonsubstitution approach and can correct for indels and carry forward errors, and (2) the "Primer Identification (ID)" method, which facilitates construction of a consensus sequence to correct for sequencing errors and allelic skewing. The Primer ID and ICC methods produced similar estimates of viral diversity, but differed in the number of sequence variants generated. Sequence preparation for ICC was comparably simple, but was limited by an inability to assess the number of templates analyzed and allelic skewing. The more costly Primer ID method corrected for allelic skewing and provided the number of viral templates analyzed, which revealed that amplifiable HIV templates varied across specimens and did not correlate with clinical viral load. This latter observation highlights the value of the Primer ID method, which by determining the number of templates amplified, enables more accurate assessment of minority species in the virus population, which may be relevant to prescribing effective antiretroviral therapy.

  7. Storyboard method of end-user programming with natural language configuration

    DOEpatents

    Bouchard, Ann M; Osbourn, Gordon C

    2013-11-19

    A technique for end-user programming includes populating a template with graphically illustrated actions and then invoking a command to generate a screen element based on the template. The screen element is rendered within a computing environment and provides a mechanism for triggering execution of a sequence of user actions. The sequence of user actions is based at least in part on the graphically illustrated actions populated into the template.

  8. Inaccurate DNA synthesis in cell extracts of yeast producing active human DNA polymerase iota.

    PubMed

    Makarova, Alena V; Grabow, Corinn; Gening, Leonid V; Tarantul, Vyacheslav Z; Tahirov, Tahir H; Bessho, Tadayoshi; Pavlov, Youri I

    2011-01-31

    Mammalian Pol ι has an unusual combination of properties: it is stimulated by Mn(2+) ions, can bypass some DNA lesions and misincorporates "G" opposite template "T" more frequently than incorporates the correct "A." We recently proposed a method of detection of Pol ι activity in animal cell extracts, based on primer extension opposite the template T with a high concentration of only two nucleotides, dGTP and dATP (incorporation of "G" versus "A" method of Gening, abbreviated as "misGvA"). We provide unambiguous proof of the "misGvA" approach concept and extend the applicability of the method for the studies of variants of Pol ι in the yeast model system with different cation cofactors. We produced human Pol ι in baker's yeast, which do not have a POLI ortholog. The "misGvA" activity is absent in cell extracts containing an empty vector, or producing catalytically dead Pol ι, or Pol ι lacking exon 2, but is robust in the strain producing wild-type Pol ι or its catalytic core, or protein with the active center L62I mutant. The signature pattern of primer extension products resulting from inaccurate DNA synthesis by extracts of cells producing either Pol ι or human Pol η is different. The DNA sequence of the template is critical for the detection of the infidelity of DNA synthesis attributed to DNA Pol ι. The primer/template and composition of the exogenous DNA precursor pool can be adapted to monitor replication fidelity in cell extracts expressing various error-prone Pols or mutator variants of accurate Pols. Finally, we demonstrate that the mutation rates in yeast strains producing human DNA Pols ι and η are not elevated over the control strain, despite highly inaccurate DNA synthesis by their extracts.

  9. Optimizing Multi-Station Template Matching to Identify and Characterize Induced Seismicity in Ohio

    NASA Astrophysics Data System (ADS)

    Brudzinski, M. R.; Skoumal, R.; Currie, B. S.

    2014-12-01

    As oil and gas well completions utilizing multi-stage hydraulic fracturing have become more commonplace, the potential for seismicity induced by the deep disposal of frac-related flowback waters and the hydraulic fracturing process itself has become increasingly important. While it is rare for these processes to induce felt seismicity, the recent increase in the number of deep injection wells and volumes injected have been suspected to have contributed to a substantial increase of events = M 3 in the continental U.S. over the past decade. Earthquake template matching using multi-station waveform cross-correlation is an adept tool for investigating potentially induced sequences due to its proficiency at identifying similar/repeating seismic events. We have sought to refine this approach by investigating a variety of seismic sequences and determining the optimal parameters (station combinations, template lengths and offsets, filter frequencies, data access method, etc.) for identifying induced seismicity. When applied to a sequence near a wastewater injection well in Youngstown, Ohio, our optimized template matching routine yielded 566 events while other template matching studies found ~100-200 events. We also identified 77 events on 4-12 March 2014 that are temporally and spatially correlated with active hydraulic fracturing in Poland Township, Ohio. We find similar improvement in characterizing sequences in Washington and Harrison Counties, which appear to be related to wastewater injection and hydraulic fracturing, respectively. In the Youngstown and Poland Township cases, focal mechanisms and double difference relocation using the cross-correlation matrix finds left-lateral faults striking roughly east-west near the top of the basement. We have also used template matching to determine isolated earthquakes near several other wastewater injection wells are unlikely to be induced based on a lack of similar/repeating sequences. Optimized template matching utilizes high-quality reliable stations within pre-existing seismic networks and is therefore a cost-efficient monitoring strategy for identifying and characterizing potentially induced seismic sequences.

  10. Template-based protein structure modeling using the RaptorX web server.

    PubMed

    Källberg, Morten; Wang, Haipeng; Wang, Sheng; Peng, Jian; Wang, Zhiyong; Lu, Hui; Xu, Jinbo

    2012-07-19

    A key challenge of modern biology is to uncover the functional role of the protein entities that compose cellular proteomes. To this end, the availability of reliable three-dimensional atomic models of proteins is often crucial. This protocol presents a community-wide web-based method using RaptorX (http://raptorx.uchicago.edu/) for protein secondary structure prediction, template-based tertiary structure modeling, alignment quality assessment and sophisticated probabilistic alignment sampling. RaptorX distinguishes itself from other servers by the quality of the alignment between a target sequence and one or multiple distantly related template proteins (especially those with sparse sequence profiles) and by a novel nonlinear scoring function and a probabilistic-consistency algorithm. Consequently, RaptorX delivers high-quality structural models for many targets with only remote templates. At present, it takes RaptorX ~35 min to finish processing a sequence of 200 amino acids. Since its official release in August 2011, RaptorX has processed ~6,000 sequences submitted by ~1,600 users from around the world.

  11. Template-based protein structure modeling using the RaptorX web server

    PubMed Central

    Källberg, Morten; Wang, Haipeng; Wang, Sheng; Peng, Jian; Wang, Zhiyong; Lu, Hui; Xu, Jinbo

    2016-01-01

    A key challenge of modern biology is to uncover the functional role of the protein entities that compose cellular proteomes. To this end, the availability of reliable three-dimensional atomic models of proteins is often crucial. This protocol presents a community-wide web-based method using RaptorX (http://raptorx.uchicago.edu/) for protein secondary structure prediction, template-based tertiary structure modeling, alignment quality assessment and sophisticated probabilistic alignment sampling. RaptorX distinguishes itself from other servers by the quality of the alignment between a target sequence and one or multiple distantly related template proteins (especially those with sparse sequence profiles) and by a novel nonlinear scoring function and a probabilistic-consistency algorithm. Consequently, RaptorX delivers high-quality structural models for many targets with only remote templates. At present, it takes RaptorX ~35 min to finish processing a sequence of 200 amino acids. Since its official release in August 2011, RaptorX has processed ~6,000 sequences submitted by ~1,600 users from around the world. PMID:22814390

  12. Population-based rare variant detection via pooled exome or custom hybridization capture with or without individual indexing.

    PubMed

    Ramos, Enrique; Levinson, Benjamin T; Chasnoff, Sara; Hughes, Andrew; Young, Andrew L; Thornton, Katherine; Li, Allie; Vallania, Francesco L M; Province, Michael; Druley, Todd E

    2012-12-06

    Rare genetic variation in the human population is a major source of pathophysiological variability and has been implicated in a host of complex phenotypes and diseases. Finding disease-related genes harboring disparate functional rare variants requires sequencing of many individuals across many genomic regions and comparing against unaffected cohorts. However, despite persistent declines in sequencing costs, population-based rare variant detection across large genomic target regions remains cost prohibitive for most investigators. In addition, DNA samples are often precious and hybridization methods typically require large amounts of input DNA. Pooled sample DNA sequencing is a cost and time-efficient strategy for surveying populations of individuals for rare variants. We set out to 1) create a scalable, multiplexing method for custom capture with or without individual DNA indexing that was amenable to low amounts of input DNA and 2) expand the functionality of the SPLINTER algorithm for calling substitutions, insertions and deletions across either candidate genes or the entire exome by integrating the variant calling algorithm with the dynamic programming aligner, Novoalign. We report methodology for pooled hybridization capture with pre-enrichment, indexed multiplexing of up to 48 individuals or non-indexed pooled sequencing of up to 92 individuals with as little as 70 ng of DNA per person. Modified solid phase reversible immobilization bead purification strategies enable no sample transfers from sonication in 96-well plates through adapter ligation, resulting in 50% less library preparation reagent consumption. Custom Y-shaped adapters containing novel 7 base pair index sequences with a Hamming distance of ≥2 were directly ligated onto fragmented source DNA eliminating the need for PCR to incorporate indexes, and was followed by a custom blocking strategy using a single oligonucleotide regardless of index sequence. These results were obtained aligning raw reads against the entire genome using Novoalign followed by variant calling of non-indexed pools using SPLINTER or SAMtools for indexed samples. With these pipelines, we find sensitivity and specificity of 99.4% and 99.7% for pooled exome sequencing. Sensitivity, and to a lesser degree specificity, proved to be a function of coverage. For rare variants (≤2% minor allele frequency), we achieved sensitivity and specificity of ≥94.9% and ≥99.99% for custom capture of 2.5 Mb in multiplexed libraries of 22-48 individuals with only ≥5-fold coverage/chromosome, but these parameters improved to ≥98.7 and 100% with 20-fold coverage/chromosome. This highly scalable methodology enables accurate rare variant detection, with or without individual DNA sample indexing, while reducing the amount of required source DNA and total costs through less hybridization reagent consumption, multi-sample sonication in a standard PCR plate, multiplexed pre-enrichment pooling with a single hybridization and lesser sequencing coverage required to obtain high sensitivity.

  13. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Yazhen; Musser, Sarah K.; Saleh, Sam

    1,N{sup 2}-Propanodeoxyguanosine (PdG) is a stable structural analogue for the 3-(2'-deoxy-{beta}-d-erythro-pentofuranosyl)pyrimido[1,2-?]purin-10(3H)-one (M{sub 1}dG) adduct derived from exposure of DNA to base propenals and to malondialdehyde. The structures of ternary polymerase-DNA-dNTP complexes for three template-primer DNA sequences were determined, with the Y-family Sulfolobus solfataricus DNA polymerase IV (Dpo4), at resolutions between 2.4 and 2.7 {angstrom}. Three template 18-mer-primer 13-mer sequences, 5'-d(TCACXAAATCCTTCCCCC)-3'{center_dot}5'-d(GGGGGAAGGATTT)-3' (template I), 5'-d(TCACXGAATCCTTCCCCC)-3'{center_dot}5'-d(GGGGGAAGGATTC)-3' (template II), and 5'-d(TCATXGAATCCTTCCCCC)-3'{center_dot}5'-d(GGGGGAAGGATTC)-3' (template III), where X is PdG, were analyzed. With templates I and II, diffracting ternary complexes including dGTP were obtained. The dGTP did not pair with PdG, but instead with the 5'-neighboring templatemore » dC, utilizing Watson-Crick geometry. Replication bypass experiments with the template-primer 5?-TCACXAAATCCTTACGAGCATCGCCCCC-3'{center_dot}5'-GGGGGCGATGCTCGTAAGGATTT-3', where X is PdG, which includes PdG in the 5'-CXA-3' template sequence as in template I, showed that the Dpo4 polymerase inserted dGTP and dATP when challenged by the PdG adduct. For template III, in which the template sequence was 5'-TXG-3', a diffracting ternary complex including dATP was obtained. The dATP did not pair with PdG, but instead with the 5'-neighboring T, utilizing Watson-Crick geometry. Thus, all three ternary complexes were of the 'type II' structure described for ternary complexes with native DNA [Ling, H., Boudsocq, F., Woodgate, R., and Yang, W. (2001) Cell 107, 91--102]. The PdG adduct remained in the anti conformation about the glycosyl bond in each of these threee ternary complexes. These results provide insight into how -1 frameshift mutations might be generated for the PdG adduct, a structural model for the exocylic M{sub 1}dG adduct formed by malondialdehyde.« less

  14. Computational Redesign of Thioredoxin Is Hypersensitive toward Minor Conformational Changes in the Backbone Template

    PubMed Central

    Christensen, Signe; Horowitz, Scott; Bardwell, James C.A.; Olsen, Johan G.; Willemoës, Martin; Lindorff-Larsen, Kresten; Ferkinghoff-Borg, Jesper; Hamelryck, Thomas; Winther, Jakob R.

    2017-01-01

    Despite the development of powerful computational tools, the full-sequence design of proteins still remains a challenging task. To investigate the limits and capabilities of computational tools, we conducted a study of the ability of the program Rosetta to predict sequences that recreate the authentic fold of thioredoxin. Focusing on the influence of conformational details in the template structures, we based our study on 8 experimentally determined template structures and generated 120 designs from each. For experimental evaluation, we chose six sequences from each of the eight templates by objective criteria. The 48 selected sequences were evaluated based on their progressive ability to (1) produce soluble protein in Escherichia coli and (2) yield stable monomeric protein, and (3) on the ability of the stable, soluble proteins to adopt the target fold. Of the 48 designs, we were able to synthesize 32, 20 of which resulted in soluble protein. Of these, only two were sufficiently stable to be purified. An X-ray crystal structure was solved for one of the designs, revealing a close resemblance to the target structure. We found a significant difference among the eight template structures to realize the above three criteria despite their high structural similarity. Thus, in order to improve the success rate of computational full-sequence design methods, we recommend that multiple template structures are used. Furthermore, this study shows that special care should be taken when optimizing the geometry of a structure prior to computational design when using a method that is based on rigid conformations. PMID:27659562

  15. Computational Redesign of Thioredoxin Is Hypersensitive toward Minor Conformational Changes in the Backbone Template.

    PubMed

    Johansson, Kristoffer E; Tidemand Johansen, Nicolai; Christensen, Signe; Horowitz, Scott; Bardwell, James C A; Olsen, Johan G; Willemoës, Martin; Lindorff-Larsen, Kresten; Ferkinghoff-Borg, Jesper; Hamelryck, Thomas; Winther, Jakob R

    2016-10-23

    Despite the development of powerful computational tools, the full-sequence design of proteins still remains a challenging task. To investigate the limits and capabilities of computational tools, we conducted a study of the ability of the program Rosetta to predict sequences that recreate the authentic fold of thioredoxin. Focusing on the influence of conformational details in the template structures, we based our study on 8 experimentally determined template structures and generated 120 designs from each. For experimental evaluation, we chose six sequences from each of the eight templates by objective criteria. The 48 selected sequences were evaluated based on their progressive ability to (1) produce soluble protein in Escherichia coli and (2) yield stable monomeric protein, and (3) on the ability of the stable, soluble proteins to adopt the target fold. Of the 48 designs, we were able to synthesize 32, 20 of which resulted in soluble protein. Of these, only two were sufficiently stable to be purified. An X-ray crystal structure was solved for one of the designs, revealing a close resemblance to the target structure. We found a significant difference among the eight template structures to realize the above three criteria despite their high structural similarity. Thus, in order to improve the success rate of computational full-sequence design methods, we recommend that multiple template structures are used. Furthermore, this study shows that special care should be taken when optimizing the geometry of a structure prior to computational design when using a method that is based on rigid conformations. Copyright © 2016 Elsevier Ltd. All rights reserved.

  16. The de Bono LAMS Sequence Series: Template Designs as Knowledge-Mobilising Strategy for 21st Century Higher Education

    ERIC Educational Resources Information Center

    Dobozy, Eva

    2012-01-01

    In this paper, the five interlocking de Bono LAMS sequences are introduced as a new form of generic template designs. This transdisciplinary knowledge-mobilising strategy is based on Edward de Bono's attention-directing ideas and thinking skills, commonly known as the CoRT tools. The development of the de Bono LAMS sequence series is an important…

  17. Modulation of mutagenesis in eukaryotes by DNA replication fork dynamics and quality of nucleotide pools

    PubMed Central

    Waisertreiger, Irina S.-R.; Liston, Victoria G.; Menezes, Miriam R.; Kim, Hyun-Min; Lobachev, Kirill S.; Stepchenkova, Elena I.; Tahirov, Tahir H.; Rogozin, Igor B.; Pavlov, Youri. I.

    2014-01-01

    The rate of mutations in eukaryotes depends on a plethora of factors and is not immediately derived from the fidelity of DNA polymerases (Pols). Replication of chromosomes containing the anti-parallel strands of duplex DNA occurs through the copying of leading and lagging strand templates by a trio of Pols α, δ and ε, with the assistance of Pol ζ and Y-family Pols at difficult DNA template structures or sites of DNA damage. The parameters of the synthesis at a given location are dictated by the quality and quantity of nucleotides in the pools, replication fork architecture, transcription status, regulation of Pol switches, and structure of chromatin. The result of these transactions is a subject of survey and editing by DNA repair. PMID:23055184

  18. Validation of Pooled Whole-Genome Re-Sequencing in Arabidopsis lyrata.

    PubMed

    Fracassetti, Marco; Griffin, Philippa C; Willi, Yvonne

    2015-01-01

    Sequencing pooled DNA of multiple individuals from a population instead of sequencing individuals separately has become popular due to its cost-effectiveness and simple wet-lab protocol, although some criticism of this approach remains. Here we validated a protocol for pooled whole-genome re-sequencing (Pool-seq) of Arabidopsis lyrata libraries prepared with low amounts of DNA (1.6 ng per individual). The validation was based on comparing single nucleotide polymorphism (SNP) frequencies obtained by pooling with those obtained by individual-based Genotyping By Sequencing (GBS). Furthermore, we investigated the effect of sample number, sequencing depth per individual and variant caller on population SNP frequency estimates. For Pool-seq data, we compared frequency estimates from two SNP callers, VarScan and Snape; the former employs a frequentist SNP calling approach while the latter uses a Bayesian approach. Results revealed concordance correlation coefficients well above 0.8, confirming that Pool-seq is a valid method for acquiring population-level SNP frequency data. Higher accuracy was achieved by pooling more samples (25 compared to 14) and working with higher sequencing depth (4.1× per individual compared to 1.4× per individual), which increased the concordance correlation coefficient to 0.955. The Bayesian-based SNP caller produced somewhat higher concordance correlation coefficients, particularly at low sequencing depth. We recommend pooling at least 25 individuals combined with sequencing at a depth of 100× to produce satisfactory frequency estimates for common SNPs (minor allele frequency above 0.05).

  19. Isolation of novel ribozymes that ligate AMP-activated RNA substrates

    NASA Technical Reports Server (NTRS)

    Hager, A. J.; Szostak, J. W.

    1997-01-01

    BACKGROUND: The protein enzymes RNA ligase and DNA ligase catalyze the ligation of nucleic acids via an adenosine-5'-5'-pyrophosphate 'capped' RNA or DNA intermediate. The activation of nucleic acid substrates by adenosine 5'-monophosphate (AMP) may be a vestige of 'RNA world' catalysis. AMP-activated ligation seems ideally suited for catalysis by ribozymes (RNA enzymes), because an RNA motif capable of tightly and specifically binding AMP has previously been isolated. RESULTS: We used in vitro selection and directed evolution to explore the ability of ribozymes to catalyze the template-directed ligation of AMP-activated RNAs. We subjected a pool of 10(15) RNA molecules, each consisting of long random sequences flanking a mutagenized adenosine triphosphate (ATP) aptamer, to ten rounds of in vitro selection, including three rounds involving mutagenic polymerase chain reaction. Selection was for the ligation of an oligonucleotide to the 5'-capped active pool RNA species. Many different ligase ribozymes were isolated; these ribozymes had rates of reaction up to 0.4 ligations per hour, corresponding to rate accelerations of approximately 5 x10(5) over the templated, but otherwise uncatalyzed, background reaction rate. Three characterized ribozymes catalyzed the formation of 3'-5'-phosphodiester bonds and were highly specific for activation by AMP at the ligation site. CONCLUSIONS: The existence of a new class of ligase ribozymes is consistent with the hypothesis that the unusual mechanism of the biological ligases resulted from a conservation of mechanism during an evolutionary replacement of a primordial ribozyme ligase by a more modern protein enzyme. The newly isolated ligase ribozymes may also provide a starting point for the isolation of ribozymes that catalyze the polymerization of AMP-activated oligonucleotides or mononucleotides, which might have been the prebiotic analogs of nucleoside triphosphates.

  20. Formation of template-switching artifacts by linear amplification.

    PubMed

    Chakravarti, Dhrubajyoti; Mailander, Paula C

    2008-07-01

    Linear amplification is a method of synthesizing single-stranded DNA from either a single-stranded DNA or one strand of a double-stranded DNA. In this protocol, molecules of a single primer DNA are extended by multiple rounds of DNA synthesis at high temperature using thermostable DNA polymerases. Although linear amplification generates the intended full-length single-stranded product, it is more efficient over single-stranded templates than double-stranded templates. We analyzed linear amplification over single- or double-stranded mouse H-ras DNA (exon 1-2 region). The single-stranded H-ras template yielded only the intended product. However, when the double-stranded template was used, additional artifact products were observed. Increasing the concentration of the double-stranded template produced relatively higher amounts of these artifact products. One of the artifact DNA bands could be mapped and analyzed by sequencing. It contained three template-switching products. These DNAs were formed by incomplete DNA strand extension over the template strand, followed by switching to the complementary strand at a specific Ade nucleotide within a putative hairpin sequence, from which DNA synthesis continued over the complementary strand.

  1. Prediction of Protein Structure by Template-Based Modeling Combined with the UNRES Force Field.

    PubMed

    Krupa, Paweł; Mozolewska, Magdalena A; Joo, Keehyoung; Lee, Jooyoung; Czaplewski, Cezary; Liwo, Adam

    2015-06-22

    A new approach to the prediction of protein structures that uses distance and backbone virtual-bond dihedral angle restraints derived from template-based models and simulations with the united residue (UNRES) force field is proposed. The approach combines the accuracy and reliability of template-based methods for the segments of the target sequence with high similarity to those having known structures with the ability of UNRES to pack the domains correctly. Multiplexed replica-exchange molecular dynamics with restraints derived from template-based models of a given target, in which each restraint is weighted according to the accuracy of the prediction of the corresponding section of the molecule, is used to search the conformational space, and the weighted histogram analysis method and cluster analysis are applied to determine the families of the most probable conformations, from which candidate predictions are selected. To test the capability of the method to recover template-based models from restraints, five single-domain proteins with structures that have been well-predicted by template-based methods were used; it was found that the resulting structures were of the same quality as the best of the original models. To assess whether the new approach can improve template-based predictions with incorrectly predicted domain packing, four such targets were selected from the CASP10 targets; for three of them the new approach resulted in significantly better predictions compared with the original template-based models. The new approach can be used to predict the structures of proteins for which good templates can be found for sections of the sequence or an overall good template can be found for the entire sequence but the prediction quality is remarkably weaker in putative domain-linker regions.

  2. Synthesis of RNA oligomers on heterogeneous templates

    NASA Technical Reports Server (NTRS)

    Ertem, G.; Ferris, J. P.

    1996-01-01

    The concept of an RNA world in the chemical origin of life is appealing, as nucleic acids are capable of both information storage and acting as templates that catalyse the synthesis of complementary molecules. Template-directed synthesis has been demonstrated for homogeneous oligonucleotides that, like natural nucleic acids, have 3',5' linkages between the nucleotide monomers. But it seems likely that prebiotic routes to RNA-like molecules would have produced heterogeneous molecules with various kinds of phosphodiester linkages and both linear and cyclic nucleotide chains. Here we show that such heterogeneity need be no obstacle to the templating of complementary molecules. Specifically, we show that heterogeneous oligocytidylates, formed by the montmorillonite clay-catalysed condensation of actuated monomers, can serve as templates for the synthesis of oligoguanylates. Furthermore, we show that oligocytidylates that are exclusively 2',5'-linked can also direct synthesis of oligoguanylates. Such heterogeneous templating reactions could have increased the diversity of the pool of protonucleic acids from which life ultimately emerged.

  3. A Versatile Platform for Nanotechnology Based on Circular Permutation of a Chaperonin Protein

    NASA Technical Reports Server (NTRS)

    Paavola, Chad; McMillan, Andrew; Trent, Jonathan; Chan, Suzanne; Mazzarella, Kellen; Li, Yi-Fen

    2004-01-01

    A number of protein complexes have been developed as nanoscale templates. These templates can be functionalized using the peptide sequences that bind inorganic materials. However, it is difficult to integrate peptides into a specific position within a protein template. Integrating intact proteins with desirable binding or catalytic activities is an even greater challenge. We present a general method for modifying protein templates using circular permutation so that additional peptide sequence can be added in a wide variety of specific locations. Circular permutation is a reordering of the polypeptide chain such that the original termini are joined and new termini are created elsewhere in the protein. New sequence can be joined to the protein termini without perturbing the protein structure and with minimal limitation on the size and conformation of the added sequence. We have used this approach to modify a chaperonin protein template, placing termini at five different locations distributed across the surface of the protein complex. These permutants are competent to form the double-ring structures typical of chaperonin proteins. The permuted double-rings also form the same assemblies as the unmodified protein. We fused a fluorescent protein to two representative permutants and demonstrated that it assumes its active structure and does not interfere with assembly of chaperonin double-rings.

  4. Fluorescent DNA-templated silver nanoclusters

    NASA Astrophysics Data System (ADS)

    Lin, Ruoqian

    Because of the ultra-small size and biocompatibility of silver nanoclusters, they have attracted much research interest for their applications in biolabeling. Among the many ways of synthesizing silver nanoclusters, DNA templated method is particularly attractive---the high tunability of DNA sequences provides another degree of freedom for controlling the chemical and photophysical properties. However, systematic studies about how DNA sequences and concentrations are controlling the photophysical properties are still lacking. The aim of this thesis is to investigate the binding mechanisms of silver clusters binding and single stranded DNAs. Here in this thesis, we report synthesis and characterization of DNA-templated silver nanoclusters and provide a systematic interrogation of the effects of DNA concentrations and sequences, including lengths and secondary structures. We performed a series of syntheses utilizing five different sequences to explore the optimal synthesis condition. By characterizing samples with UV-vis and fluorescence spectroscopy, we achieved the most proper reactants ratio and synthesis conditions. Two of them were chosen for further concentration dependence studies and sequence dependence studies. We found that cytosine-rich sequences are more likely to produce silver nanoclusters with stronger fluorescence signals; however, sequences with hairpin secondary structures are more capable in stabilizing silver nanoclusters. In addition, the fluorescence peak emission intensities and wavelengths of the DNA templated silver clusters have sequence dependent fingerprints. This potentially can be applied to sequence sensing in the future. However all the current conclusions are not warranted; there is still difficulty in formulating general rules in DNA strand design and silver nanocluster production. Further investigation of more sequences could solve these questions in the future.

  5. Triple helix purification and sequencing

    DOEpatents

    Wang, Renfeng; Smith, Lloyd M.; Tong, Xinchun E.

    1995-01-01

    Disclosed herein are methods, kits, and equipment for purifying single stranded circular DNA and then using the DNA for DNA sequencing purposes. Templates are provided with an insert having a hybridization region. An elongated oligonucleotide has two regions that are complementary to the insert and the oligo is bound to a magnetic anchor. The oligo hybridizes to the insert on two sides to form a stable triple helix complex. The anchor can then be used to drag the template out of solution using a magnet. The system can purify sequencing templates, and if desired the triple helix complex can be opened up to a double helix so that the oligonucleotide will act as a primer for further DNA synthesis.

  6. Triple helix purification and sequencing

    DOEpatents

    Wang, R.; Smith, L.M.; Tong, X.E.

    1995-03-28

    Disclosed herein are methods, kits, and equipment for purifying single stranded circular DNA and then using the DNA for DNA sequencing purposes. Templates are provided with an insert having a hybridization region. An elongated oligonucleotide has two regions that are complementary to the insert and the oligo is bound to a magnetic anchor. The oligo hybridizes to the insert on two sides to form a stable triple helix complex. The anchor can then be used to drag the template out of solution using a magnet. The system can purify sequencing templates, and if desired the triple helix complex can be opened up to a double helix so that the oligonucleotide will act as a primer for further DNA synthesis. 4 figures.

  7. DNA-Templated Polymerization of Side-Chain-Functionalized Peptide Nucleic Acid Aldehydes

    PubMed Central

    Kleiner, Ralph E.; Brudno, Yevgeny; Birnbaum, Michael E.; Liu, David R.

    2009-01-01

    The DNA-templated polymerization of synthetic building blocks provides a potential route to the laboratory evolution of sequence-defined polymers with structures and properties not necessarily limited to those of natural biopolymers. We previously reported the efficient and sequence-specific DNA-templated polymerization of peptide nucleic acid (PNA) aldehydes. Here, we report the enzyme-free, DNA-templated polymerization of side-chain-functionalized PNA tetramer and pentamer aldehydes. We observed that the polymerization of tetramer and pentamer PNA building blocks with a single lysine-based side chain at various positions in the building block could proceed efficiently and sequence-specifically. In addition, DNA-templated polymerization also proceeded efficiently and in a sequence-specific manner with pentamer PNA aldehydes containing two or three lysine side chains in a single building block to generate more densely functionalized polymers. To further our understanding of side-chain compatibility and expand the capabilities of this system, we also examined the polymerization efficiencies of 20 pentamer building blocks each containing one of five different side-chain groups and four different side-chain regio- and stereochemistries. Polymerization reactions were efficient for all five different side-chain groups and for three of the four combinations of side-chain regio- and stereochemistries. Differences in the efficiency and initial rate of polymerization correlate with the apparent melting temperature of each building block, which is dependent on side-chain regio- and stereochemistry, but relatively insensitive to side-chain structure among the substrates tested. Our findings represent a significant step towards the evolution of sequence-defined synthetic polymers and also demonstrate that enzyme-free nucleic acid-templated polymerization can occur efficiently using substrates with a wide range of side-chain structures, functionalization positions within each building block, and functionalization densities. PMID:18341334

  8. Impact of phlebotomine sand flies on U.S. military operations at Tallil Air Base, Iraq: 4. Detection and identification of leishmania parasites in sand flies.

    PubMed

    Coleman, Russell E; Hochberg, Lisa P; Swanson, Katherine I; Lee, John S; McAvin, James C; Moulton, John K; Eddington, David O; Groebner, Jennifer L; O'Guinn, Monica L; Putnam, John L

    2009-05-01

    Sand flies collected between April 2003 and November 2004 at Tallil Air Base, Iraq, were evaluated for the presence of Leishmania parasites using a combination of a real-time Leishmania-generic polymerase chain reaction (PCR) assay and sequencing of a 360-bp fragment of the glucose-6-phosphate-isomerase (GPI) gene. A total of 2,505 pools containing 26,574 sand flies were tested using the real-time PCR assay. Leishmania DNA was initially detected in 536 pools; however, after extensive retesting with the real-time PCR assay, a total of 456 pools were considered positive and 80 were considered indeterminate. A total of 532 samples were evaluated for Leishmania GPI by sequencing, to include 439 PCR-positive samples, 80 PCR-indeterminate samples, and 13 PCR-negative samples. Leishmania GPI was detected in 284 samples that were sequenced, to include 281 (64%) of the PCR-positive samples and 3 (4%) of the PCR-indeterminate samples. Of the 284 sequences identified as Leishmania, 261 (91.9%) were L. tarentolae, 18 (6.3%) were L. donovani-complex parasites, 3 (1.1%) were L. tropica, and 2 were similar to both L. major and L. tropica. Minimum field infection rates were 0.09% for L. donovani-complex parasites, 0.02% for L. tropica, and 0.01% for the L. major/tropica-like parasite. Subsequent sequencing of a 600-bp region of the "Hyper" gene of 12 of the L. donovani-complex parasites showed that all 12 parasites were L. infantum. These data suggest that L. infantum was the primary leishmanial threat to U.S. military personnel deployed to Tallil Air Base. The implications of these findings are discussed.

  9. Generation of Synthetic Copolymer Libraries by Combinatorial Assembly on Nucleic Acid Templates.

    PubMed

    Kong, Dehui; Yeung, Wayland; Hili, Ryan

    2016-07-11

    Recent advances in nucleic acid-templated copolymerization have expanded the scope of sequence-controlled synthetic copolymers beyond the molecular architectures witnessed in nature. This has enabled the power of molecular evolution to be applied to synthetic copolymer libraries to evolve molecular function ranging from molecular recognition to catalysis. This Review seeks to summarize different approaches available to generate sequence-defined monodispersed synthetic copolymer libraries using nucleic acid-templated polymerization. Key concepts and principles governing nucleic acid-templated polymerization, as well as the fidelity of various copolymerization technologies, will be described. The Review will focus on methods that enable the combinatorial generation of copolymer libraries and their molecular evolution for desired function.

  10. Investigation of rare and low-frequency variants using high-throughput sequencing with pooled DNA samples

    PubMed Central

    Wang, Jingwen; Skoog, Tiina; Einarsdottir, Elisabet; Kaartokallio, Tea; Laivuori, Hannele; Grauers, Anna; Gerdhem, Paul; Hytönen, Marjo; Lohi, Hannes; Kere, Juha; Jiao, Hong

    2016-01-01

    High-throughput sequencing using pooled DNA samples can facilitate genome-wide studies on rare and low-frequency variants in a large population. Some major questions concerning the pooling sequencing strategy are whether rare and low-frequency variants can be detected reliably, and whether estimated minor allele frequencies (MAFs) can represent the actual values obtained from individually genotyped samples. In this study, we evaluated MAF estimates using three variant detection tools with two sets of pooled whole exome sequencing (WES) and one set of pooled whole genome sequencing (WGS) data. Both GATK and Freebayes displayed high sensitivity, specificity and accuracy when detecting rare or low-frequency variants. For the WGS study, 56% of the low-frequency variants in Illumina array have identical MAFs and 26% have one allele difference between sequencing and individual genotyping data. The MAF estimates from WGS correlated well (r = 0.94) with those from Illumina arrays. The MAFs from the pooled WES data also showed high concordance (r = 0.88) with those from the individual genotyping data. In conclusion, the MAFs estimated from pooled DNA sequencing data reflect the MAFs in individually genotyped samples well. The pooling strategy can thus be a rapid and cost-effective approach for the initial screening in large-scale association studies. PMID:27633116

  11. Universal Sequence Replication, Reversible Polymerization and Early Functional Biopolymers: A Model for the Initiation of Prebiotic Sequence Evolution

    PubMed Central

    Walker, Sara Imari; Grover, Martha A.; Hud, Nicholas V.

    2012-01-01

    Many models for the origin of life have focused on understanding how evolution can drive the refinement of a preexisting enzyme, such as the evolution of efficient replicase activity. Here we present a model for what was, arguably, an even earlier stage of chemical evolution, when polymer sequence diversity was generated and sustained before, and during, the onset of functional selection. The model includes regular environmental cycles (e.g. hydration-dehydration cycles) that drive polymers between times of replication and functional activity, which coincide with times of different monomer and polymer diffusivity. Template-directed replication of informational polymers, which takes place during the dehydration stage of each cycle, is considered to be sequence-independent. New sequences are generated by spontaneous polymer formation, and all sequences compete for a finite monomer resource that is recycled via reversible polymerization. Kinetic Monte Carlo simulations demonstrate that this proposed prebiotic scenario provides a robust mechanism for the exploration of sequence space. Introduction of a polymer sequence with monomer synthetase activity illustrates that functional sequences can become established in a preexisting pool of otherwise non-functional sequences. Functional selection does not dominate system dynamics and sequence diversity remains high, permitting the emergence and spread of more than one functional sequence. It is also observed that polymers spontaneously form clusters in simulations where polymers diffuse more slowly than monomers, a feature that is reminiscent of a previous proposal that the earliest stages of life could have been defined by the collective evolution of a system-wide cooperation of polymer aggregates. Overall, the results presented demonstrate the merits of considering plausible prebiotic polymer chemistries and environments that would have allowed for the rapid turnover of monomer resources and for regularly varying monomer/polymer diffusivities. PMID:22493682

  12. JANE: efficient mapping of prokaryotic ESTs and variable length sequence reads on related template genomes

    PubMed Central

    2009-01-01

    Background ESTs or variable sequence reads can be available in prokaryotic studies well before a complete genome is known. Use cases include (i) transcriptome studies or (ii) single cell sequencing of bacteria. Without suitable software their further analysis and mapping would have to await finalization of the corresponding genome. Results The tool JANE rapidly maps ESTs or variable sequence reads in prokaryotic sequencing and transcriptome efforts to related template genomes. It provides an easy-to-use graphics interface for information retrieval and a toolkit for EST or nucleotide sequence function prediction. Furthermore, we developed for rapid mapping an enhanced sequence alignment algorithm which reassembles and evaluates high scoring pairs provided from the BLAST algorithm. Rapid assembly on and replacement of the template genome by sequence reads or mapped ESTs is achieved. This is illustrated (i) by data from Staphylococci as well as from a Blattabacteria sequencing effort, (ii) mapping single cell sequencing reads is shown for poribacteria to sister phylum representative Rhodopirellula Baltica SH1. The algorithm has been implemented in a web-server accessible at http://jane.bioapps.biozentrum.uni-wuerzburg.de. Conclusion Rapid prokaryotic EST mapping or mapping of sequence reads is achieved applying JANE even without knowing the cognate genome sequence. PMID:19943962

  13. Sliding over the Blocks in Enzyme-Free RNA Copying – One-Pot Primer Extension in Ice

    PubMed Central

    Löffler, Philipp M. G.; Groen, Joost; Dörr, Mark; Monnard, Pierre-Alain

    2013-01-01

    Template-directed polymerization of RNA in the absence of enzymes is the basis for an information transfer in the ‘RNA-world’ hypothesis and in novel nucleic acid based technology. Previous investigations established that only cytidine rich strands are efficient templates in bulk aqueous solutions while a few specific sequences completely block the extension of hybridized primers. We show that a eutectic water/ice system can support Pb2+/Mg2+-ion catalyzed extension of a primer across such sequences, i.e. AA, AU and AG, in a one-pot synthesis. Using mixtures of imidazole activated nucleotide 5′-monophosphates, the two first “blocking” residues could be passed during template-directed polymerization, i.e., formation of triply extended products containing a high fraction of faithful copies was demonstrated. Across the AG sequence, a mismatch sequence was formed in similar amounts to the correct product due to U·G wobble pairing. Thus, the template-directed extension occurs both across pyrimidine and purine rich sequences and insertions of pyrimidines did not inhibit the subsequent insertions. Products were mainly formed with 2′-5′-phosphodiester linkages, however, the abundance of 3′–5′-linkages was higher than previously reported for pyrimidine insertions. When enzyme-free, template-directed RNA polymerization is performed in a eutectic water ice environment, various intrinsic reaction limitations observed in bulk solution can then be overcome. PMID:24058695

  14. An Accurate Scalable Template-based Alignment Algorithm

    PubMed Central

    Gardner, David P.; Xu, Weijia; Miranker, Daniel P.; Ozer, Stuart; Cannone, Jamie J.; Gutell, Robin R.

    2013-01-01

    The rapid determination of nucleic acid sequences is increasing the number of sequences that are available. Inherent in a template or seed alignment is the culmination of structural and functional constraints that are selecting those mutations that are viable during the evolution of the RNA. While we might not understand these structural and functional, template-based alignment programs utilize the patterns of sequence conservation to encapsulate the characteristics of viable RNA sequences that are aligned properly. We have developed a program that utilizes the different dimensions of information in rCAD, a large RNA informatics resource, to establish a profile for each position in an alignment. The most significant include sequence identity and column composition in different phylogenetic taxa. We have compared our methods with a maximum of eight alternative alignment methods on different sets of 16S and 23S rRNA sequences with sequence percent identities ranging from 50% to 100%. The results showed that CRWAlign outperformed the other alignment methods in both speed and accuracy. A web-based alignment server is available at http://www.rna.ccbb.utexas.edu/SAE/2F/CRWAlign. PMID:24772376

  15. A LabVIEW based template for user created experiment automation.

    PubMed

    Kim, D J; Fisk, Z

    2012-12-01

    We have developed an expandable software template to automate user created experiments. The LabVIEW based template is easily modifiable to add together user created measurements, controls, and data logging with virtually any type of laboratory equipment. We use reentrant sequential selection to implement sequence script making it possible to wrap a long series of the user created experiments and execute them in sequence. Details of software structure and application examples for scanning probe microscope and automated transport experiments using custom built laboratory electronics and a cryostat are described.

  16. Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling.

    PubMed

    Meier, Armin; Söding, Johannes

    2015-10-01

    Homology modeling predicts the 3D structure of a query protein based on the sequence alignment with one or more template proteins of known structure. Its great importance for biological research is owed to its speed, simplicity, reliability and wide applicability, covering more than half of the residues in protein sequence space. Although multiple templates have been shown to generally increase model quality over single templates, the information from multiple templates has so far been combined using empirically motivated, heuristic approaches. We present here a rigorous statistical framework for multi-template homology modeling. First, we find that the query proteins' atomic distance restraints can be accurately described by two-component Gaussian mixtures. This insight allowed us to apply the standard laws of probability theory to combine restraints from multiple templates. Second, we derive theoretically optimal weights to correct for the redundancy among related templates. Third, a heuristic template selection strategy is proposed. We improve the average GDT-ha model quality score by 11% over single template modeling and by 6.5% over a conventional multi-template approach on a set of 1000 query proteins. Robustness with respect to wrong constraints is likewise improved. We have integrated our multi-template modeling approach with the popular MODELLER homology modeling software in our free HHpred server http://toolkit.tuebingen.mpg.de/hhpred and also offer open source software for running MODELLER with the new restraints at https://bitbucket.org/soedinglab/hh-suite.

  17. Neo-Darwinism, the Modern Synthesis and selfish genes: are they of use in physiology?

    PubMed Central

    Noble, Denis

    2011-01-01

    This article argues that the gene-centric interpretations of evolution, and more particularly the selfish gene expression of those interpretations, form barriers to the integration of physiological science with evolutionary theory. A gene-centred approach analyses the relationships between genotypes and phenotypes in terms of differences (change the genotype and observe changes in phenotype). We now know that, most frequently, this does not correctly reveal the relationships because of extensive buffering by robust networks of interactions. By contrast, understanding biological function through physiological analysis requires an integrative approach in which the activity of the proteins and RNAs formed from each DNA template is analysed in networks of interactions. These networks also include components that are not specified by nuclear DNA. Inheritance is not through DNA sequences alone. The selfish gene idea is not useful in the physiological sciences, since selfishness cannot be defined as an intrinsic property of nucleotide sequences independently of gene frequency, i.e. the ‘success’ in the gene pool that is supposed to be attributable to the ‘selfish’ property. It is not a physiologically testable hypothesis. PMID:21135048

  18. Neo-Darwinism, the modern synthesis and selfish genes: are they of use in physiology?

    PubMed

    Noble, Denis

    2011-03-01

    This article argues that the gene-centric interpretations of evolution, and more particularly the selfish gene expression of those interpretations, form barriers to the integration of physiological science with evolutionary theory. A gene-centred approach analyses the relationships between genotypes and phenotypes in terms of differences (change the genotype and observe changes in phenotype). We now know that, most frequently, this does not correctly reveal the relationships because of extensive buffering by robust networks of interactions. By contrast, understanding biological function through physiological analysis requires an integrative approach in which the activity of the proteins and RNAs formed from each DNA template is analysed in networks of interactions. These networks also include components that are not specified by nuclear DNA. Inheritance is not through DNA sequences alone. The selfish gene idea is not useful in the physiological sciences, since selfishness cannot be defined as an intrinsic property of nucleotide sequences independently of gene frequency, i.e. the 'success' in the gene pool that is supposed to be attributable to the 'selfish' property. It is not a physiologically testable hypothesis.

  19. Design of association studies with pooled or un-pooled next-generation sequencing data.

    PubMed

    Kim, Su Yeon; Li, Yingrui; Guo, Yiran; Li, Ruiqiang; Holmkvist, Johan; Hansen, Torben; Pedersen, Oluf; Wang, Jun; Nielsen, Rasmus

    2010-07-01

    Most common hereditary diseases in humans are complex and multifactorial. Large-scale genome-wide association studies based on SNP genotyping have only identified a small fraction of the heritable variation of these diseases. One explanation may be that many rare variants (a minor allele frequency, MAF <5%), which are not included in the common genotyping platforms, may contribute substantially to the genetic variation of these diseases. Next-generation sequencing, which would allow the analysis of rare variants, is now becoming so cheap that it provides a viable alternative to SNP genotyping. In this paper, we present cost-effective protocols for using next-generation sequencing in association mapping studies based on pooled and un-pooled samples, and identify optimal designs with respect to total number of individuals, number of individuals per pool, and the sequencing coverage. We perform a small empirical study to evaluate the pooling variance in a realistic setting where pooling is combined with exon-capturing. To test for associations, we develop a likelihood ratio statistic that accounts for the high error rate of next-generation sequencing data. We also perform extensive simulations to determine the power and accuracy of this method. Overall, our findings suggest that with a fixed cost, sequencing many individuals at a more shallow depth with larger pool size achieves higher power than sequencing a small number of individuals in higher depth with smaller pool size, even in the presence of high error rates. Our results provide guidelines for researchers who are developing association mapping studies based on next-generation sequencing. (c) 2010 Wiley-Liss, Inc.

  20. An Evaluation of Different Target Enrichment Methods in Pooled Sequencing Designs for Complex Disease Association Studies

    PubMed Central

    Day-Williams, Aaron G.; McLay, Kirsten; Drury, Eleanor; Edkins, Sarah; Coffey, Alison J.; Palotie, Aarno; Zeggini, Eleftheria

    2011-01-01

    Pooled sequencing can be a cost-effective approach to disease variant discovery, but its applicability in association studies remains unclear. We compare sequence enrichment methods coupled to next-generation sequencing in non-indexed pools of 1, 2, 10, 20 and 50 individuals and assess their ability to discover variants and to estimate their allele frequencies. We find that pooled resequencing is most usefully applied as a variant discovery tool due to limitations in estimating allele frequency with high enough accuracy for association studies, and that in-solution hybrid-capture performs best among the enrichment methods examined regardless of pool size. PMID:22069447

  1. Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes.

    PubMed

    Haiminen, Niina; Feltus, F Alex; Parida, Laxmi

    2011-04-15

    We investigate if pooling BAC clones and sequencing the pools can provide for more accurate assembly of genome sequences than the "whole genome shotgun" (WGS) approach. Furthermore, we quantify this accuracy increase. We compare the pooled BAC and WGS approaches using in silico simulations. Standard measures of assembly quality focus on assembly size and fragmentation, which are desirable for large whole genome assemblies. We propose additional measures enabling easy and visual comparison of assembly quality, such as rearrangements and redundant sequence content, relative to the known target sequence. The best assembly quality scores were obtained using 454 coverage of 15× linear and 5× paired (3kb insert size) reads (15L-5P) on Arabidopsis. This regime gave similarly good results on four additional plant genomes of very different GC and repeat contents. BAC pooling improved assembly scores over WGS assembly, coverage and redundancy scores improving the most. BAC pooling works better than WGS, however, both require a physical map to order the scaffolds. Pool sizes up to 12Mbp work well, suggesting this pooling density to be effective in medium-scale re-sequencing applications such as targeted sequencing of QTL intervals for candidate gene discovery. Assuming the current Roche/454 Titanium sequencing limitations, a 12 Mbp region could be re-sequenced with a full plate of linear reads and a half plate of paired-end reads, yielding 15L-5P coverage after read pre-processing. Our simulation suggests that massively over-sequencing may not improve accuracy. Our scoring measures can be used generally to evaluate and compare results of simulated genome assemblies.

  2. Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster

    PubMed Central

    Zhu, Yuan; Bergland, Alan O.; González, Josefa; Petrov, Dmitri A.

    2012-01-01

    The sequencing of pooled non-barcoded individuals is an inexpensive and efficient means of assessing genome-wide population allele frequencies, yet its accuracy has not been thoroughly tested. We assessed the accuracy of this approach on whole, complex eukaryotic genomes by resequencing pools of largely isogenic, individually sequenced Drosophila melanogaster strains. We called SNPs in the pooled data and estimated false positive and false negative rates using the SNPs called in individual strain as a reference. We also estimated allele frequency of the SNPs using “pooled” data and compared them with “true” frequencies taken from the estimates in the individual strains. We demonstrate that pooled sequencing provides a faithful estimate of population allele frequency with the error well approximated by binomial sampling, and is a reliable means of novel SNP discovery with low false positive rates. However, a sufficient number of strains should be used in the pooling because variation in the amount of DNA derived from individual strains is a substantial source of noise when the number of pooled strains is low. Our results and analysis confirm that pooled sequencing is a very powerful and cost-effective technique for assessing of patterns of sequence variation in populations on genome-wide scales, and is applicable to any dataset where sequencing individuals or individual cells is impossible, difficult, time consuming, or expensive. PMID:22848651

  3. Analysis of Duck Hepatitis B Virus Reverse Transcription Indicates a Common Mechanism for the Two Template Switches during Plus-Strand DNA Synthesis

    PubMed Central

    Havert, Michael B.; Ji, Lin; Loeb, Daniel D.

    2002-01-01

    The synthesis of the hepadnavirus relaxed circular DNA genome requires two template switches, primer translocation and circularization, during plus-strand DNA synthesis. Repeated sequences serve as donor and acceptor templates for these template switches, with direct repeat 1 (DR1) and DR2 for primer translocation and 5′r and 3′r for circularization. These donor and acceptor sequences are at, or near, the ends of the minus-strand DNA. Analysis of plus-strand DNA synthesis of duck hepatitis B virus (DHBV) has indicated that there are at least three other cis-acting sequences that make contributions during the synthesis of relaxed circular DNA. These sequences, 5E, M, and 3E, are located near the 5′ end, the middle, and the 3′ end of minus-strand DNA, respectively. The mechanism by which these sequences contribute to the synthesis of plus-strand DNA was unclear. Our aim was to better understand the mechanism by which 5E and M act. We localized the DHBV 5E element to a short sequence of approximately 30 nucleotides that is 100 nucleotides 3′ of DR2 on minus-strand DNA. We found that the new 5E mutants were partially defective for primer translocation/utilization at DR2. They were also invariably defective for circularization. In addition, examination of several new DHBV M variants indicated that they too were defective for primer translocation/utilization and circularization. Thus, this analysis indicated that 5E and M play roles in both primer translocation/utilization and circularization. In conjunction with earlier findings that 3E functions in both template switches, our findings indicate that the processes of primer translocation and circularization share a common underlying mechanism. PMID:11861843

  4. An electrooculogram-based binary saccade sequence classification (BSSC) technique for augmentative communication and control.

    PubMed

    Keegan, Johnalan; Burke, Edward; Condron, James

    2009-01-01

    In the field of assistive technology, the electrooculogram (EOG) can be used as a channel of communication and the basis of a man-machine interface. For many people with severe motor disabilities, simple actions such as changing the TV channel require assistance. This paper describes a method of detecting saccadic eye movements and the use of a saccade sequence classification algorithm to facilitate communication and control. Saccades are fast eye movements that occurs when a person's gaze jumps from one fixation point to another. The classification is based on pre-defined sequences of saccades, guided by a static visual template (e.g. a page or poster). The template, consisting of a table of symbols each having a clearly identifiable fixation point, is situated within view of the user. To execute a particular command, the user moves his or her gaze through a pre-defined path of eye movements. This results in a well-formed sequence of saccades which are translated into a command if a match is found in a library of predefined sequences. A coordinate transformation algorithm is applied to each candidate sequence of recorded saccades to mitigate the effect of changes in the user's position and orientation relative to the visual template. Upon recognition of a saccade sequence from the library, its associated command is executed. A preliminary experiment in which two subjects were instructed to perform a series of command sequences consisting of 8 different commands are presented in the final sections. The system is also shown to be extensible to facilitate convenient text entry via an alphabetic visual template.

  5. Automated one-step DNA sequencing based on nanoliter reaction volumes and capillary electrophoresis.

    PubMed

    Pang, H M; Yeung, E S

    2000-08-01

    An integrated system with a nano-reactor for cycle-sequencing reaction coupled to on-line purification and capillary gel electrophoresis has been demonstrated. Fifty nanoliters of reagent solution, which includes dye-labeled terminators, polymerase, BSA and template, was aspirated and mixed with the template inside the nano-reactor followed by cycle-sequencing reaction. The reaction products were then purified by a size-exclusion chromatographic column operated at 50 degrees C followed by room temperature on-line injection of the DNA fragments into a capillary for gel electrophoresis. Over 450 bases of DNA can be separated and identified. As little as 25 nl reagent solution can be used for the cycle-sequencing reaction with a slightly shorter read length. Significant savings on reagent cost is achieved because the remaining stock solution can be reused without contamination. The steps of cycle sequencing, on-line purification, injection, DNA separation, capillary regeneration, gel-filling and fluidic manipulation were performed with complete automation. This system can be readily multiplexed for high-throughput DNA sequencing or PCR analysis directly from templates or even biological materials.

  6. Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes

    PubMed Central

    2011-01-01

    Background We investigate if pooling BAC clones and sequencing the pools can provide for more accurate assembly of genome sequences than the "whole genome shotgun" (WGS) approach. Furthermore, we quantify this accuracy increase. We compare the pooled BAC and WGS approaches using in silico simulations. Standard measures of assembly quality focus on assembly size and fragmentation, which are desirable for large whole genome assemblies. We propose additional measures enabling easy and visual comparison of assembly quality, such as rearrangements and redundant sequence content, relative to the known target sequence. Results The best assembly quality scores were obtained using 454 coverage of 15× linear and 5× paired (3kb insert size) reads (15L-5P) on Arabidopsis. This regime gave similarly good results on four additional plant genomes of very different GC and repeat contents. BAC pooling improved assembly scores over WGS assembly, coverage and redundancy scores improving the most. Conclusions BAC pooling works better than WGS, however, both require a physical map to order the scaffolds. Pool sizes up to 12Mbp work well, suggesting this pooling density to be effective in medium-scale re-sequencing applications such as targeted sequencing of QTL intervals for candidate gene discovery. Assuming the current Roche/454 Titanium sequencing limitations, a 12 Mbp region could be re-sequenced with a full plate of linear reads and a half plate of paired-end reads, yielding 15L-5P coverage after read pre-processing. Our simulation suggests that massively over-sequencing may not improve accuracy. Our scoring measures can be used generally to evaluate and compare results of simulated genome assemblies. PMID:21496274

  7. Template-Based 3D Reconstruction of Non-rigid Deformable Object from Monocular Video

    NASA Astrophysics Data System (ADS)

    Liu, Yang; Peng, Xiaodong; Zhou, Wugen; Liu, Bo; Gerndt, Andreas

    2018-06-01

    In this paper, we propose a template-based 3D surface reconstruction system of non-rigid deformable objects from monocular video sequence. Firstly, we generate a semi-dense template of the target object with structure from motion method using a subsequence video. This video can be captured by rigid moving camera orienting the static target object or by a static camera observing the rigid moving target object. Then, with the reference template mesh as input and based on the framework of classical template-based methods, we solve an energy minimization problem to get the correspondence between the template and every frame to get the time-varying mesh to present the deformation of objects. The energy terms combine photometric cost, temporal and spatial smoothness cost as well as as-rigid-as-possible cost which can enable elastic deformation. In this paper, an easy and controllable solution to generate the semi-dense template for complex objects is presented. Besides, we use an effective iterative Schur based linear solver for the energy minimization problem. The experimental evaluation presents qualitative deformation objects reconstruction results with real sequences. Compare against the results with other templates as input, the reconstructions based on our template have more accurate and detailed results for certain regions. The experimental results show that the linear solver we used performs better efficiency compared to traditional conjugate gradient based solver.

  8. Microbial contributions to coupled arsenic and sulfur cycling in the acid-sulfide hot spring Champagne Pool, New Zealand.

    PubMed

    Hug, Katrin; Maher, William A; Stott, Matthew B; Krikowa, Frank; Foster, Simon; Moreau, John W

    2014-01-01

    Acid-sulfide hot springs are analogs of early Earth geothermal systems where microbial metal(loid) resistance likely first evolved. Arsenic is a metalloid enriched in the acid-sulfide hot spring Champagne Pool (Waiotapu, New Zealand). Arsenic speciation in Champagne Pool follows reaction paths not yet fully understood with respect to biotic contributions and coupling to biogeochemical sulfur cycling. Here we present quantitative arsenic speciation from Champagne Pool, finding arsenite dominant in the pool, rim and outflow channel (55-75% total arsenic), and dithio- and trithioarsenates ubiquitously present as 18-25% total arsenic. In the outflow channel, dimethylmonothioarsenate comprised ≤9% total arsenic, while on the outflow terrace thioarsenates were present at 55% total arsenic. We also quantified sulfide, thiosulfate, sulfate and elemental sulfur, finding sulfide and sulfate as major species in the pool and outflow terrace, respectively. Elemental sulfur concentration reached a maximum at the terrace. Phylogenetic analysis of 16S rRNA genes from metagenomic sequencing revealed the dominance of Sulfurihydrogenibium at all sites and an increased archaeal population at the rim and outflow channel. Several phylotypes were found closely related to known sulfur- and sulfide-oxidizers, as well as sulfur- and sulfate-reducers. Bioinformatic analysis revealed genes underpinning sulfur redox transformations, consistent with sulfur speciation data, and illustrating a microbial role in sulfur-dependent transformation of arsenite to thioarsenate. Metagenomic analysis also revealed genes encoding for arsenate reductase at all sites, reflecting the ubiquity of thioarsenate and a need for microbial arsenate resistance despite anoxic conditions. Absence of the arsenite oxidase gene, aio, at all sites suggests prioritization of arsenite detoxification over coupling to energy conservation. Finally, detection of methyl arsenic in the outflow channel, in conjunction with increased sequences from Aquificaceae, supports a role for methyltransferase in thermophilic arsenic resistance. Our study highlights microbial contributions to coupled arsenic and sulfur cycling at Champagne Pool, with implications for understanding the evolution of microbial arsenic resistance in sulfidic geothermal systems.

  9. SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues.

    PubMed

    Yang, Xiaoxia; Wang, Jia; Sun, Jun; Liu, Rong

    2015-01-01

    Protein-nucleic acid interactions are central to various fundamental biological processes. Automated methods capable of reliably identifying DNA- and RNA-binding residues in protein sequence are assuming ever-increasing importance. The majority of current algorithms rely on feature-based prediction, but their accuracy remains to be further improved. Here we propose a sequence-based hybrid algorithm SNBRFinder (Sequence-based Nucleic acid-Binding Residue Finder) by merging a feature predictor SNBRFinderF and a template predictor SNBRFinderT. SNBRFinderF was established using the support vector machine whose inputs include sequence profile and other complementary sequence descriptors, while SNBRFinderT was implemented with the sequence alignment algorithm based on profile hidden Markov models to capture the weakly homologous template of query sequence. Experimental results show that SNBRFinderF was clearly superior to the commonly used sequence profile-based predictor and SNBRFinderT can achieve comparable performance to the structure-based template methods. Leveraging the complementary relationship between these two predictors, SNBRFinder reasonably improved the performance of both DNA- and RNA-binding residue predictions. More importantly, the sequence-based hybrid prediction reached competitive performance relative to our previous structure-based counterpart. Our extensive and stringent comparisons show that SNBRFinder has obvious advantages over the existing sequence-based prediction algorithms. The value of our algorithm is highlighted by establishing an easy-to-use web server that is freely accessible at http://ibi.hzau.edu.cn/SNBRFinder.

  10. Open source database of images DEIMOS: extension for large-scale subjective image quality assessment

    NASA Astrophysics Data System (ADS)

    Vítek, Stanislav

    2014-09-01

    DEIMOS (Database of Images: Open Source) is an open-source database of images and video sequences for testing, verification and comparison of various image and/or video processing techniques such as compression, reconstruction and enhancement. This paper deals with extension of the database allowing performing large-scale web-based subjective image quality assessment. Extension implements both administrative and client interface. The proposed system is aimed mainly at mobile communication devices, taking into account advantages of HTML5 technology; it means that participants don't need to install any application and assessment could be performed using web browser. The assessment campaign administrator can select images from the large database and then apply rules defined by various test procedure recommendations. The standard test procedures may be fully customized and saved as a template. Alternatively the administrator can define a custom test, using images from the pool and other components, such as evaluating forms and ongoing questionnaires. Image sequence is delivered to the online client, e.g. smartphone or tablet, as a fully automated assessment sequence or viewer can decide on timing of the assessment if required. Environmental data and viewing conditions (e.g. illumination, vibrations, GPS coordinates, etc.), may be collected and subsequently analyzed.

  11. SynTrack: DNA Assembly Workflow Management (SynTrack) v2.0.1

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    MENG, XIANWEI; SIMIRENKO, LISA

    2016-12-01

    SynTrack is a dynamic, workflow-driven data management system that tracks the DNA build process: Management of the hierarchical relationships of the DNA fragments; Monitoring of process tasks for the assembly of multiple DNA fragments into final constructs; Creations of vendor order forms with selectable building blocks. Organizing plate layouts barcodes for vendor/pcr/fusion/chewback/bioassay/glycerol/master plate maps (default/condensed); Creating or updating Pre-Assembly/Assembly process workflows with selected building blocks; Generating Echo pooling instructions based on plate maps; Tracking of building block orders, received and final assembled for delivering; Bulk updating of colony or PCR amplification information, fusion PCR and chewback results; Updating with QA/QCmore » outcome with .csv & .xlsx template files; Re-work assembly workflow enabled before and after sequencing validation; and Tracking of plate/well data changes and status updates and reporting of master plate status with QC outcomes.« less

  12. Enzymatic production of 'monoclonal stoichiometric' single-stranded DNA oligonucleotides.

    PubMed

    Ducani, Cosimo; Kaul, Corinna; Moche, Martin; Shih, William M; Högberg, Björn

    2013-07-01

    Single-stranded oligonucleotides are important as research tools, as diagnostic probes, in gene therapy and in DNA nanotechnology. Oligonucleotides are typically produced via solid-phase synthesis, using polymer chemistries that are limited relative to what biological systems produce. The number of errors in synthetic DNA increases with oligonucleotide length, and the resulting diversity of sequences can be a problem. Here we present the 'monoclonal stoichiometric' (MOSIC) method for enzyme-mediated production of DNA oligonucleotides. We amplified oligonucleotides from clonal templates derived from single bacterial colonies and then digested cutter hairpins in the products, which released pools of oligonucleotides with precisely controlled relative stoichiometric ratios. We prepared 14-378-nucleotide MOSIC oligonucleotides either by in vitro rolling-circle amplification or by amplification of phagemid DNA in Escherichia coli. Analyses of the formation of a DNA crystal and folding of DNA nanostructures confirmed the scalability, purity and stoichiometry of the produced oligonucleotides.

  13. Enzymatic Production of Monoclonal Stoichiometric Single-Stranded DNA Oligonucleotides

    PubMed Central

    Ducani, Cosimo; Kaul, Corinna; Moche, Martin; Shih, William M.; Högberg, Björn

    2013-01-01

    Single-stranded oligonucleotides are important as research tools as probes for diagnostics and gene therapy. Today, production of oligonucleotides is done via solid-phase synthesis. However, the capabilities of current polymer chemistry are limited in comparison to what can be produced in biological systems. The errors in synthetic DNA increases with oligonucleotide length, and sequence diversity can often be a problem. Here, we present the Monoclonal Stoichiometric (MOSIC) method for enzymatic DNA oligonucleotide production. Using this method, we amplify oligonucleotides from clonal templates followed by digestion of a cutter-hairpin, resulting in pools of monoclonal oligonucleotides with precisely controlled relative stoichiometric ratios. We present data where MOSIC oligonucleotides, 14–378 nt long, were prepared either by in vitro rolling-circle amplification, or by amplification in Escherichia coli in the form of phagemid DNA. The formation of a DNA crystal and folding of DNA nanostructures confirmed the scalability, purity and stoichiometry of the produced oligonucleotides. PMID:23727986

  14. A template-finding algorithm and a comprehensive benchmark for homology modeling of proteins

    PubMed Central

    Vallat, Brinda Kizhakke; Pillardy, Jaroslaw; Elber, Ron

    2010-01-01

    The first step in homology modeling is to identify a template protein for the target sequence. The template structure is used in later phases of the calculation to construct an atomically detailed model for the target. We have built from the Protein Data Bank a large-scale learning set that includes tens of millions of pair matches that can be either a true template or a false one. Discriminatory learning (learning from positive and negative examples) is employed to train a decision tree. Each branch of the tree is a mathematical programming model. The decision tree is tested on an independent set from PDB entries and on the sequences of CASP7. It provides significant enrichment of true templates (between 50-100 percent) when compared to PSI-BLAST. The model is further verified by building atomically detailed structures for each of the tentative true templates with modeller. The probability that a true match does not yield an acceptable structural model (within 6Å RMSD from the native structure), decays linearly as a function of the TM structural-alignment score. PMID:18300226

  15. Inaccurate DNA Synthesis in Cell Extracts of Yeast Producing Active Human DNA Polymerase Iota

    PubMed Central

    Makarova, Alena V.; Grabow, Corinn; Gening, Leonid V.; Tarantul, Vyacheslav Z.; Tahirov, Tahir H.; Bessho, Tadayoshi; Pavlov, Youri I.

    2011-01-01

    Mammalian Pol ι has an unusual combination of properties: it is stimulated by Mn2+ ions, can bypass some DNA lesions and misincorporates “G” opposite template “T” more frequently than incorporates the correct “A.” We recently proposed a method of detection of Pol ι activity in animal cell extracts, based on primer extension opposite the template T with a high concentration of only two nucleotides, dGTP and dATP (incorporation of “G” versus “A” method of Gening, abbreviated as “misGvA”). We provide unambiguous proof of the “misGvA” approach concept and extend the applicability of the method for the studies of variants of Pol ι in the yeast model system with different cation cofactors. We produced human Pol ι in baker's yeast, which do not have a POLI ortholog. The “misGvA” activity is absent in cell extracts containing an empty vector, or producing catalytically dead Pol ι, or Pol ι lacking exon 2, but is robust in the strain producing wild-type Pol ι or its catalytic core, or protein with the active center L62I mutant. The signature pattern of primer extension products resulting from inaccurate DNA synthesis by extracts of cells producing either Pol ι or human Pol η is different. The DNA sequence of the template is critical for the detection of the infidelity of DNA synthesis attributed to DNA Pol ι. The primer/template and composition of the exogenous DNA precursor pool can be adapted to monitor replication fidelity in cell extracts expressing various error-prone Pols or mutator variants of accurate Pols. Finally, we demonstrate that the mutation rates in yeast strains producing human DNA Pols ι and η are not elevated over the control strain, despite highly inaccurate DNA synthesis by their extracts. PMID:21304950

  16. Detection and Resolution of Cryptosporidium Species and Species Mixtures by Genus-Specific Nested PCR-Restriction Fragment Length Polymorphism Analysis, Direct Sequencing, and Cloning ▿

    PubMed Central

    Ruecker, Norma J.; Hoffman, Rebecca M.; Chalmers, Rachel M.; Neumann, Norman F.

    2011-01-01

    Molecular methods incorporating nested PCR-restriction fragment length polymorphism (RFLP) analysis of the 18S rRNA gene of Cryptosporidium species were validated to assess performance based on limit of detection (LoD) and for detecting and resolving mixtures of species and genotypes within a single sample. The 95% LoD was determined for seven species (Cryptosporidium hominis, C. parvum, C. felis, C. meleagridis, C. ubiquitum, C. muris, and C. andersoni) and ranged from 7 to 11 plasmid template copies with overlapping 95% confidence limits. The LoD values for genomic DNA from oocysts on microscope slides were 7 and 10 template copies for C. andersoni and C. parvum, respectively. The repetitive nested PCR-RFLP slide protocol had an LoD of 4 oocysts per slide. When templates of two species were mixed in equal ratios in the nested PCR-RFLP reaction mixture, there was no amplification bias toward one species over another. At high ratios of template mixtures (>1:10), there was a reduction or loss of detection of the less abundant species by RFLP analysis, most likely due to heteroduplex formation in the later cycles of the PCR. Replicate nested PCR was successful at resolving many mixtures of Cryptosporidium at template concentrations near or below the LoD. The cloning of nested PCR products resulted in 17% of the cloned sequences being recombinants of the two original templates. Limiting-dilution nested PCR followed by the sequencing of PCR products resulted in no sequence anomalies, suggesting that this method is an effective and accurate way to study the species diversity of Cryptosporidium, particularly for environmental water samples, in which mixtures of parasites are common. PMID:21498746

  17. New insights into transcription fidelity: thermal stability of non-canonical structures in template DNA regulates transcriptional arrest, pause, and slippage.

    PubMed

    Tateishi-Karimata, Hisae; Isono, Noburu; Sugimoto, Naoki

    2014-01-01

    The thermal stability and topology of non-canonical structures of G-quadruplexes and hairpins in template DNA were investigated, and the effect of non-canonical structures on transcription fidelity was evaluated quantitatively. We designed ten template DNAs: A linear sequence that does not have significant higher-order structure, three sequences that form hairpin structures, and six sequences that form G-quadruplex structures with different stabilities. Templates with non-canonical structures induced the production of an arrested, a slipped, and a full-length transcript, whereas the linear sequence produced only a full-length transcript. The efficiency of production for run-off transcripts (full-length and slipped transcripts) from templates that formed the non-canonical structures was lower than that from the linear. G-quadruplex structures were more effective inhibitors of full-length product formation than were hairpin structure even when the stability of the G-quadruplex in an aqueous solution was the same as that of the hairpin. We considered that intra-polymerase conditions may differentially affect the stability of non-canonical structures. The values of transcription efficiencies of run-off or arrest transcripts were correlated with stabilities of non-canonical structures in the intra-polymerase condition mimicked by 20 wt% polyethylene glycol (PEG). Transcriptional arrest was induced when the stability of the G-quadruplex structure (-ΔG°37) in the presence of 20 wt% PEG was more than 8.2 kcal mol(-1). Thus, values of stability in the presence of 20 wt% PEG are an important indicator of transcription perturbation. Our results further our understanding of the impact of template structure on the transcription process and may guide logical design of transcription-regulating drugs.

  18. New Insights into Transcription Fidelity: Thermal Stability of Non-Canonical Structures in Template DNA Regulates Transcriptional Arrest, Pause, and Slippage

    PubMed Central

    Tateishi-Karimata, Hisae; Isono, Noburu; Sugimoto, Naoki

    2014-01-01

    The thermal stability and topology of non-canonical structures of G-quadruplexes and hairpins in template DNA were investigated, and the effect of non-canonical structures on transcription fidelity was evaluated quantitatively. We designed ten template DNAs: A linear sequence that does not have significant higher-order structure, three sequences that form hairpin structures, and six sequences that form G-quadruplex structures with different stabilities. Templates with non-canonical structures induced the production of an arrested, a slipped, and a full-length transcript, whereas the linear sequence produced only a full-length transcript. The efficiency of production for run-off transcripts (full-length and slipped transcripts) from templates that formed the non-canonical structures was lower than that from the linear. G-quadruplex structures were more effective inhibitors of full-length product formation than were hairpin structure even when the stability of the G-quadruplex in an aqueous solution was the same as that of the hairpin. We considered that intra-polymerase conditions may differentially affect the stability of non-canonical structures. The values of transcription efficiencies of run-off or arrest transcripts were correlated with stabilities of non-canonical structures in the intra-polymerase condition mimicked by 20 wt% polyethylene glycol (PEG). Transcriptional arrest was induced when the stability of the G-quadruplex structure (−ΔGo 37) in the presence of 20 wt% PEG was more than 8.2 kcal mol−1. Thus, values of stability in the presence of 20 wt% PEG are an important indicator of transcription perturbation. Our results further our understanding of the impact of template structure on the transcription process and may guide logical design of transcription-regulating drugs. PMID:24594642

  19. Sequence-Controlled Polymerization on Facially Amphiphilic Templates at Interfaces

    DTIC Science & Technology

    2016-06-14

    controlled chain growth polymerization. We will synthesize a ?- conjugated “parent” polymer by iterative exponential growth (IEG), attach cyclic olefin...template that is programmed to direct sequence- controlled chain growth polymerization. We will synthesize a ?- conjugated “parent” polymer by iterative...polymerization. We will synthesize a π- conjugated “parent” polymer by organometallic iterative exponential growth (IEG),2 attach cyclic olefin “daughter

  20. Strategies for Achieving High Sequencing Accuracy for Low Diversity Samples and Avoiding Sample Bleeding Using Illumina Platform

    PubMed Central

    Mitra, Abhishek; Skrzypczak, Magdalena; Ginalski, Krzysztof; Rowicka, Maga

    2015-01-01

    Sequencing microRNA, reduced representation sequencing, Hi-C technology and any method requiring the use of in-house barcodes result in sequencing libraries with low initial sequence diversity. Sequencing such data on the Illumina platform typically produces low quality data due to the limitations of the Illumina cluster calling algorithm. Moreover, even in the case of diverse samples, these limitations are causing substantial inaccuracies in multiplexed sample assignment (sample bleeding). Such inaccuracies are unacceptable in clinical applications, and in some other fields (e.g. detection of rare variants). Here, we discuss how both problems with quality of low-diversity samples and sample bleeding are caused by incorrect detection of clusters on the flowcell during initial sequencing cycles. We propose simple software modifications (Long Template Protocol) that overcome this problem. We present experimental results showing that our Long Template Protocol remarkably increases data quality for low diversity samples, as compared with the standard analysis protocol; it also substantially reduces sample bleeding for all samples. For comprehensiveness, we also discuss and compare experimental results from alternative approaches to sequencing low diversity samples. First, we discuss how the low diversity problem, if caused by barcodes, can be avoided altogether at the barcode design stage. Second and third, we present modified guidelines, which are more stringent than the manufacturer’s, for mixing low diversity samples with diverse samples and lowering cluster density, which in our experience consistently produces high quality data from low diversity samples. Fourth and fifth, we present rescue strategies that can be applied when sequencing results in low quality data and when there is no more biological material available. In such cases, we propose that the flowcell be re-hybridized and sequenced again using our Long Template Protocol. Alternatively, we discuss how analysis can be repeated from saved sequencing images using the Long Template Protocol to increase accuracy. PMID:25860802

  1. Plastome Sequence Determination and Comparative Analysis for Members of the Lolium-Festuca Grass Species Complex

    PubMed Central

    Hand, Melanie L.; Spangenberg, German C.; Forster, John W.; Cogan, Noel O. I.

    2013-01-01

    Chloroplast genome sequences are of broad significance in plant biology, due to frequent use in molecular phylogenetics, comparative genomics, population genetics, and genetic modification studies. The present study used a second-generation sequencing approach to determine and assemble the plastid genomes (plastomes) of four representatives from the agriculturally important Lolium-Festuca species complex of pasture grasses (Lolium multiflorum, Festuca pratensis, Festuca altissima, and Festuca ovina). Total cellular DNA was extracted from either roots or leaves, was sequenced, and the output was filtered for plastome-related reads. A comparison between sources revealed fewer plastome-related reads from root-derived template but an increase in incidental bacterium-derived sequences. Plastome assembly and annotation indicated high levels of sequence identity and a conserved organization and gene content between species. However, frequent deletions within the F. ovina plastome appeared to contribute to a smaller plastid genome size. Comparative analysis with complete plastome sequences from other members of the Poaceae confirmed conservation of most grass-specific features. Detailed analysis of the rbcL–psaI intergenic region, however, revealed a “hot-spot” of variation characterized by independent deletion events. The evolutionary implications of this observation are discussed. The complete plastome sequences are anticipated to provide the basis for potential organelle-specific genetic modification of pasture grasses. PMID:23550121

  2. External and semi-internal controls for PCR amplification of homologous sequences in mixed templates.

    PubMed

    Kalle, Elena; Gulevich, Alexander; Rensing, Christopher

    2013-11-01

    In a mixed template, the presence of homologous target DNA sequences creates environments that almost inevitably give rise to artifacts and biases during PCR. Heteroduplexes, chimeras, and skewed template-to-product ratios are the exclusive attributes of mixed template PCR and never occur in a single template assay. Yet, multi-template PCR has been used without appropriate attention to quality control and assay validation, in spite of the fact that such practice diminishes the reliability of results. External and internal amplification controls became obligatory elements of good laboratory practice in different PCR assays. We propose the inclusion of an analogous approach as a quality control system for multi-template PCR applications. The amplification controls must take into account the characteristics of multi-template PCR and be able to effectively monitor particular assay performance. This study demonstrated the efficiency of a model mixed template as an adequate external amplification control for a particular PCR application. The conditions of multi-template PCR do not allow implementation of a classic internal control; therefore we developed a convenient semi-internal control as an acceptable alternative. In order to evaluate the effects of inhibitors, a model multi-template mix was amplified in a mixture with DNAse-treated sample. Semi-internal control allowed establishment of intervals for robust PCR performance for different samples, thus enabling correct comparison of the samples. The complexity of the external and semi-internal amplification controls must be comparable with the assumed complexity of the samples. We also emphasize that amplification controls should be applied in multi-template PCR regardless of the post-assay method used to analyze products. © 2013 Elsevier B.V. All rights reserved.

  3. Long-range barcode labeling-sequencing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, Feng; Zhang, Tao; Singh, Kanwar K.

    Methods for sequencing single large DNA molecules by clonal multiple displacement amplification using barcoded primers. Sequences are binned based on barcode sequences and sequenced using a microdroplet-based method for sequencing large polynucleotide templates to enable assembly of haplotype-resolved complex genomes and metagenomes.

  4. AMPLIFICATION OF RIBOSOMAL RNA SEQUENCES - Book Chapter

    EPA Science Inventory

    This book chapter contains the following headings and subheadings: Introduction; Experimental Approach - Precautions, Template, Primers, Reaction Conditions, Enhancers, Post Amplification; Procedures - Template DNA, Basic PCR, Thermal Cycle Parameters, Enzyme Addition, Agarose Ge...

  5. Structure and Temporal Dynamics of Populations within Wheat Streak Mosaic Virus Isolates

    PubMed Central

    Hall, Jeffrey S.; French, Roy; Morris, T. Jack; Stenger, Drake C.

    2001-01-01

    Variation within the Type and Sidney 81 strains of wheat streak mosaic virus was assessed by single-strand conformation polymorphism (SSCP) analysis and confirmed by nucleotide sequencing. Limiting-dilution subisolates (LDSIs) of each strain were evaluated for polymorphism in the P1, P3, NIa, and CP cistrons. Different SSCP patterns among LDSIs of a strain were associated with single-nucleotide substitutions. Sidney 81 LDSI-S10 was used as founding inoculum to establish three lineages each in wheat, corn, and barley. The P1, HC-Pro, P3, CI, NIa, NIb, and CP cistrons of LDSI-S10 and each lineage at passages 1, 3, 6, and 9 were evaluated for polymorphism. By passage 9, each lineage differed in consensus sequence from LDSI-S10. The majority of substitutions occurred within NIa and CP, although at least one change occurred in each cistron except HC-Pro and P3. Most consensus sequence changes among lineages were independent, with substitutions accumulating over time. However, LDSI-S10 bore a variant nucleotide (G6016) in NIa that was restored to A6016 in eight of nine lineages by passage 6. This near-global reversion is most easily explained by selection. Examination of nonconsensus variation revealed a pool of unique substitutions (singletons) that remained constant in frequency during passage, regardless of the host species examined. These results suggest that mutations arising by viral polymerase error are generated at a constant rate but that most newly generated mutants are sequestered in virions and do not serve as replication templates. Thus, a substantial fraction of variation generated is static and has yet to be tested for relative fitness. In contrast, nonsingleton variation increased upon passage, suggesting that some mutants do serve as replication templates and may become established in a population. Replicated mutants may or may not rise to prominence to become the consensus sequence in a lineage, with the fate of any particular mutant subject to selection and stochastic processes such as genetic drift and population growth factors. PMID:11581391

  6. Templated sequence insertion polymorphisms in the human genome

    NASA Astrophysics Data System (ADS)

    Onozawa, Masahiro; Aplan, Peter

    2016-11-01

    Templated Sequence Insertion Polymorphism (TSIP) is a recently described form of polymorphism recognized in the human genome, in which a sequence that is templated from a distant genomic region is inserted into the genome, seemingly at random. TSIPs can be grouped into two classes based on nucleotide sequence features at the insertion junctions; Class 1 TSIPs show features of insertions that are mediated via the LINE-1 ORF2 protein, including 1) target-site duplication (TSD), 2) polyadenylation 10-30 nucleotides downstream of a “cryptic” polyadenylation signal, and 3) preference for insertion at a 5’-TTTT/A-3’ sequence. In contrast, class 2 TSIPs show features consistent with repair of a DNA double-strand break via insertion of a DNA “patch” that is derived from a distant genomic region. Survey of a large number of normal human volunteers demonstrates that most individuals have 25-30 TSIPs, and that these TSIPs track with specific geographic regions. Similar to other forms of human polymorphism, we suspect that these TSIPs may be important for the generation of human diversity and genetic diseases.

  7. Mechanism of transcription termination by RNA polymerase III utilizes a nontemplate-strand sequence-specific signal element

    PubMed Central

    Arimbasseri, Aneeshkumar G.; Maraia, Richard J.

    2015-01-01

    SUMMARY Understanding the mechanism of transcription termination by a eukaryotic RNA polymerase (RNAP) has been limited by lack of a characterizable intermediate that reflects transition from an elongation complex to a true termination event. While other multisubunit RNAPs require multipartite cis-signals and/or ancillary factors to mediate pausing and release of the nascent transcript from the clutches of these enzymes, RNAP III does so with precision and efficiency on a simple oligo(dT) tract, independent of other cis-elements or trans-factors. We report a RNAP III pre-termination complex that reveals termination mechanisms controlled by sequence-specific elements in the non-template strand. Furthermore, the TFIIF-like, RNAP III subunit, C37 is required for this function of the non-template strand signal. The results reveal the RNAP III terminator as an information-rich control element. While the template strand promotes destabilization via a weak oligo(rU:dA) hybrid, the non-template strand provides distinct sequence-specific destabilizing information through interactions with the C37 subunit. PMID:25959395

  8. Differential mitochondrial DNA and gene expression in inherited retinal dysplasia in miniature Schnauzer dogs.

    PubMed

    Appleyard, Greg D; Forsyth, George W; Kiehlbauch, Laura M; Sigfrid, Kristen N; Hanik, Heather L J; Quon, Anita; Loewen, Matthew E; Grahn, Bruce H

    2006-05-01

    To investigate the molecular basis of inherited retinal dysplasia in miniature Schnauzers. Retina and retinal pigment epithelial tissues were collected from canine subjects at the age of 3 weeks. Total RNA isolated from these tissues was reverse transcribed to make representative cDNA pools that were compared for differences in gene expression by using a subtractive hybridization technique referred to as representational difference analysis (RDA). Expression differences identified by RDA were confirmed and quantified by real-time reverse-transcription PCR. Mitochondrial morphology from leukocytes and skeletal muscle of normal and affected miniature Schnauzers was examined by transmission electron microscopy. RDA screening of retinal pigment epithelial cDNA identified differences in mRNA transcript coding for two mitochondrial (mt) proteins--cytochrome oxidase subunit 1 and NADH dehydrogenase subunit 6--in affected dogs. Contrary to expectations, these identified sequences did not contain mutations. Based on the implication of mt-DNA-encoded proteins by the RDA experiments we used real-time PCR to compare the relative amounts of mt-DNA template in white blood cells from normal and affected dogs. White blood cells of affected dogs contained less than 30% of the normal amount of two specific mtDNA sequences, compared with the content of the nuclear-encoded glyceraldehyde-3-phosphate dehydrogenase (GA-3-PDH) reference gene. Retina and RPE tissue from affected dogs had reduced mRNA transcript levels for the two mitochondrial genes detected in the RDA experiment. Transcript levels for another mtDNA-encoded gene as well as the nuclear-encoded mitochondrial Tfam transcription factor were reduced in these tissues in affected dogs. Mitochondria from affected dogs were reduced in number and size and were unusually electron dense. Reduced levels of nuclear and mitochondrial transcripts in the retina and RPE of miniature Schnauzers affected with retinal dysplasia suggest that the pathogenesis of the disorder may arise from a lowered energy supply to the retina and RPE.

  9. Design of multi-phase dynamic chemical networks

    NASA Astrophysics Data System (ADS)

    Chen, Chenrui; Tan, Junjun; Hsieh, Ming-Chien; Pan, Ting; Goodwin, Jay T.; Mehta, Anil K.; Grover, Martha A.; Lynn, David G.

    2017-08-01

    Template-directed polymerization reactions enable the accurate storage and processing of nature's biopolymer information. This mutualistic relationship of nucleic acids and proteins, a network known as life's central dogma, is now marvellously complex, and the progressive steps necessary for creating the initial sequence and chain-length-specific polymer templates are lost to time. Here we design and construct dynamic polymerization networks that exploit metastable prion cross-β phases. Mixed-phase environments have been used for constructing synthetic polymers, but these dynamic phases emerge naturally from the growing peptide oligomers and create environments suitable both to nucleate assembly and select for ordered templates. The resulting templates direct the amplification of a phase containing only chain-length-specific peptide-like oligomers. Such multi-phase biopolymer dynamics reveal pathways for the emergence, self-selection and amplification of chain-length- and possibly sequence-specific biopolymers.

  10. RaptorX server: a resource for template-based protein structure modeling.

    PubMed

    Källberg, Morten; Margaryan, Gohar; Wang, Sheng; Ma, Jianzhu; Xu, Jinbo

    2014-01-01

    Assigning functional properties to a newly discovered protein is a key challenge in modern biology. To this end, computational modeling of the three-dimensional atomic arrangement of the amino acid chain is often crucial in determining the role of the protein in biological processes. We present a community-wide web-based protocol, RaptorX server ( http://raptorx.uchicago.edu ), for automated protein secondary structure prediction, template-based tertiary structure modeling, and probabilistic alignment sampling.Given a target sequence, RaptorX server is able to detect even remotely related template sequences by means of a novel nonlinear context-specific alignment potential and probabilistic consistency algorithm. Using the protocol presented here it is thus possible to obtain high-quality structural models for many target protein sequences when only distantly related protein domains have experimentally solved structures. At present, RaptorX server can perform secondary and tertiary structure prediction of a 200 amino acid target sequence in approximately 30 min.

  11. Determination of the promoter region of mouse ribosomal RNA gene by an in vitro transcription system.

    PubMed Central

    Yamamoto, O; Takakusa, N; Mishima, Y; Kominami, R; Muramatsu, M

    1984-01-01

    Sequences required for a faithful and efficient transcription of a cloned mouse ribosomal RNA gene (rDNA) are determined by testing a series of deletion mutants in an in vitro transcription system utilizing two kinds of mouse cellular extract. Deletion of sequences upstream of -40 or downstream of +52 causes only slight reduction in promoter activity as compared with the "wild-type" template. For upstream deletion mutants, the removal of a sequence between -40 and -35 causes a significant decrease in the capacity to direct efficient initiation. This decrease becomes more pronounced when the deletion reaches -32 and the sequence A-T-C-T-T-T, conserved among mouse, rat, and human rDNAs, is lost. Residual template activity is further reduced as more upstream sequence is deleted and finally becomes undetectable when the deletion is extended from -22 down to -17, corresponding to the loss of the conserved sequence T-A-T-T-G. As for downstream deletion mutants, the removal of the sequence downstream of +23 causes some (and further deletions up to +11 cause a more) serious decrease in template activity in vitro. These deletions involve other conserved sequences downstream of the transcription start site. However, the removal of the original transcription start site does not abolish the transcription initiation completely, provided that the whole upstream sequence is intact. Images PMID:6320178

  12. Novel encoding methods for DNA-templated chemical libraries.

    PubMed

    Li, Gang; Zheng, Wenlu; Liu, Ying; Li, Xiaoyu

    2015-06-01

    Among various types of DNA-encoded chemical libraries, DNA-templated library takes advantage of the sequence-specificity of DNA hybridization, enabling not only highly effective DNA-templated chemical reactions, but also high fidelity in library encoding. This brief review summarizes recent advances that have been made on the encoding strategies for DNA-templated libraries, and it also highlights their respective advantages and limitations for the preparation of DNA-encoded libraries. Copyright © 2015 Elsevier Ltd. All rights reserved.

  13. A Stochastic Point Cloud Sampling Method for Multi-Template Protein Comparative Modeling.

    PubMed

    Li, Jilong; Cheng, Jianlin

    2016-05-10

    Generating tertiary structural models for a target protein from the known structure of its homologous template proteins and their pairwise sequence alignment is a key step in protein comparative modeling. Here, we developed a new stochastic point cloud sampling method, called MTMG, for multi-template protein model generation. The method first superposes the backbones of template structures, and the Cα atoms of the superposed templates form a point cloud for each position of a target protein, which are represented by a three-dimensional multivariate normal distribution. MTMG stochastically resamples the positions for Cα atoms of the residues whose positions are uncertain from the distribution, and accepts or rejects new position according to a simulated annealing protocol, which effectively removes atomic clashes commonly encountered in multi-template comparative modeling. We benchmarked MTMG on 1,033 sequence alignments generated for CASP9, CASP10 and CASP11 targets, respectively. Using multiple templates with MTMG improves the GDT-TS score and TM-score of structural models by 2.96-6.37% and 2.42-5.19% on the three datasets over using single templates. MTMG's performance was comparable to Modeller in terms of GDT-TS score, TM-score, and GDT-HA score, while the average RMSD was improved by a new sampling approach. The MTMG software is freely available at: http://sysbio.rnet.missouri.edu/multicom_toolbox/mtmg.html.

  14. A Stochastic Point Cloud Sampling Method for Multi-Template Protein Comparative Modeling

    PubMed Central

    Li, Jilong; Cheng, Jianlin

    2016-01-01

    Generating tertiary structural models for a target protein from the known structure of its homologous template proteins and their pairwise sequence alignment is a key step in protein comparative modeling. Here, we developed a new stochastic point cloud sampling method, called MTMG, for multi-template protein model generation. The method first superposes the backbones of template structures, and the Cα atoms of the superposed templates form a point cloud for each position of a target protein, which are represented by a three-dimensional multivariate normal distribution. MTMG stochastically resamples the positions for Cα atoms of the residues whose positions are uncertain from the distribution, and accepts or rejects new position according to a simulated annealing protocol, which effectively removes atomic clashes commonly encountered in multi-template comparative modeling. We benchmarked MTMG on 1,033 sequence alignments generated for CASP9, CASP10 and CASP11 targets, respectively. Using multiple templates with MTMG improves the GDT-TS score and TM-score of structural models by 2.96–6.37% and 2.42–5.19% on the three datasets over using single templates. MTMG’s performance was comparable to Modeller in terms of GDT-TS score, TM-score, and GDT-HA score, while the average RMSD was improved by a new sampling approach. The MTMG software is freely available at: http://sysbio.rnet.missouri.edu/multicom_toolbox/mtmg.html. PMID:27161489

  15. Genome characterization of Long Island tick rhabdovirus, a new virus identified in Amblyomma americanum ticks.

    PubMed

    Tokarz, Rafal; Sameroff, Stephen; Leon, Maria Sanchez; Jain, Komal; Lipkin, W Ian

    2014-02-11

    Ticks are implicated as hosts to a wide range of animal and human pathogens. The full range of microbes harbored by ticks has not yet been fully explored. As part of a viral surveillance and discovery project in arthropods, we used unbiased high-throughput sequencing to examine viromes of ticks collected on Long Island, New York in 2013. We detected and sequenced the complete genome of a novel rhabdovirus originating from a pool of Amblyomma americanum ticks. This virus, which we provisionally name Long Island tick rhabdovirus, is distantly related to Moussa virus from Africa. The Long Island tick rhabdovirus may represent a novel species within family Rhabdoviridae.

  16. Base Preferences in Non-Templated Nucleotide Incorporation by MMLV-Derived Reverse Transcriptases

    PubMed Central

    Zajac, Pawel; Islam, Saiful; Hochgerner, Hannah; Lönnerberg, Peter; Linnarsson, Sten

    2013-01-01

    Reverse transcriptases derived from Moloney Murine Leukemia Virus (MMLV) have an intrinsic terminal transferase activity, which causes the addition of a few non-templated nucleotides at the 3´ end of cDNA, with a preference for cytosine. This mechanism can be exploited to make the reverse transcriptase switch template from the RNA molecule to a secondary oligonucleotide during first-strand cDNA synthesis, and thereby to introduce arbitrary barcode or adaptor sequences in the cDNA. Because the mechanism is relatively efficient and occurs in a single reaction, it has recently found use in several protocols for single-cell RNA sequencing. However, the base preference of the terminal transferase activity is not known in detail, which may lead to inefficiencies in template switching when starting from tiny amounts of mRNA. Here, we used fully degenerate oligos to determine the exact base preference at the template switching site up to a distance of ten nucleotides. We found a strong preference for guanosine at the first non-templated nucleotide, with a greatly reduced bias at progressively more distant positions. Based on this result, and a number of careful optimizations, we report conditions for efficient template switching for cDNA amplification from single cells. PMID:24392002

  17. Multifunctionality of a picornavirus polymerase domain: nuclear localization signal and nucleotide recognition.

    PubMed

    Ferrer-Orta, Cristina; de la Higuera, Ignacio; Caridi, Flavia; Sánchez-Aparicio, María Teresa; Moreno, Elena; Perales, Celia; Singh, Kamalendra; Sarafianos, Stefan G; Sobrino, Francisco; Domingo, Esteban; Verdaguer, Nuria

    2015-07-01

    The N-terminal region of the foot-and-mouth disease virus (FMDV) 3D polymerase contains the sequence MRKTKLAPT (residues 16 to 24) that acts as a nuclear localization signal. A previous study showed that substitutions K18E and K20E diminished the transport to the nucleus of 3D and 3CD and severely impaired virus infectivity. These residues have also been implicated in template binding, as seen in the crystal structures of different 3D-RNA elongation complexes. Here, we report the biochemical and structural characterization of different mutant polymerases harboring substitutions at residues 18 and 20, in particular, K18E, K18A, K20E, K20A, and the double mutant K18A K20A (KAKA). All mutant enzymes exhibit low RNA binding activity, low processivity, and alterations in nucleotide recognition, including increased incorporation of ribavirin monophosphate (RMP) relative to the incorporation of cognate nucleotides compared with the wild-type enzyme. The structural analysis shows an unprecedented flexibility of the 3D mutant polymerases, including both global rearrangements of the closed-hand architecture and local conformational changes at loop β9-α11 (within the polymerase motif B) and at the template-binding channel. Specifically, in 3D bound to RNA, both K18E and K20E induced the opening of new pockets in the template channel where the downstream templating nucleotide at position +2 binds. The comparisons of free and RNA-bound enzymes suggest that the structural rearrangements may occur in a concerted mode to regulate RNA replication, processivity, and fidelity. Thus, the N-terminal region of FMDV 3D that acts as a nuclear localization signal (NLS) and in template binding is also involved in nucleotide recognition and can affect the incorporation of nucleotide analogues. The study documents multifunctionality of a nuclear localization signal (NLS) located at the N-terminal region of the foot-and-mouth disease viral polymerase (3D). Amino acid substitutions at this polymerase region can impair the transport of 3D to the nucleus, reduce 3D binding to RNA, and alter the relative incorporation of standard nucleoside monophosphate versus ribavirin monophosphate. Structural data reveal that the conformational changes in this region, forming part of the template channel entry, would be involved in nucleotide discrimination. The results have implications for the understanding of viral polymerase function and for lethal mutagenesis mechanisms. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  18. Multifunctionality of a Picornavirus Polymerase Domain: Nuclear Localization Signal and Nucleotide Recognition

    PubMed Central

    Ferrer-Orta, Cristina; de la Higuera, Ignacio; Caridi, Flavia; Sánchez-Aparicio, María Teresa; Moreno, Elena; Perales, Celia; Singh, Kamalendra; Sarafianos, Stefan G.; Sobrino, Francisco; Domingo, Esteban

    2015-01-01

    ABSTRACT The N-terminal region of the foot-and-mouth disease virus (FMDV) 3D polymerase contains the sequence MRKTKLAPT (residues 16 to 24) that acts as a nuclear localization signal. A previous study showed that substitutions K18E and K20E diminished the transport to the nucleus of 3D and 3CD and severely impaired virus infectivity. These residues have also been implicated in template binding, as seen in the crystal structures of different 3D-RNA elongation complexes. Here, we report the biochemical and structural characterization of different mutant polymerases harboring substitutions at residues 18 and 20, in particular, K18E, K18A, K20E, K20A, and the double mutant K18A K20A (KAKA). All mutant enzymes exhibit low RNA binding activity, low processivity, and alterations in nucleotide recognition, including increased incorporation of ribavirin monophosphate (RMP) relative to the incorporation of cognate nucleotides compared with the wild-type enzyme. The structural analysis shows an unprecedented flexibility of the 3D mutant polymerases, including both global rearrangements of the closed-hand architecture and local conformational changes at loop β9-α11 (within the polymerase motif B) and at the template-binding channel. Specifically, in 3D bound to RNA, both K18E and K20E induced the opening of new pockets in the template channel where the downstream templating nucleotide at position +2 binds. The comparisons of free and RNA-bound enzymes suggest that the structural rearrangements may occur in a concerted mode to regulate RNA replication, processivity, and fidelity. Thus, the N-terminal region of FMDV 3D that acts as a nuclear localization signal (NLS) and in template binding is also involved in nucleotide recognition and can affect the incorporation of nucleotide analogues. IMPORTANCE The study documents multifunctionality of a nuclear localization signal (NLS) located at the N-terminal region of the foot-and-mouth disease viral polymerase (3D). Amino acid substitutions at this polymerase region can impair the transport of 3D to the nucleus, reduce 3D binding to RNA, and alter the relative incorporation of standard nucleoside monophosphate versus ribavirin monophosphate. Structural data reveal that the conformational changes in this region, forming part of the template channel entry, would be involved in nucleotide discrimination. The results have implications for the understanding of viral polymerase function and for lethal mutagenesis mechanisms. PMID:25903341

  19. Template-Based Modeling of Protein-RNA Interactions.

    PubMed

    Zheng, Jinfang; Kundrotas, Petras J; Vakser, Ilya A; Liu, Shiyong

    2016-09-01

    Protein-RNA complexes formed by specific recognition between RNA and RNA-binding proteins play an important role in biological processes. More than a thousand of such proteins in human are curated and many novel RNA-binding proteins are to be discovered. Due to limitations of experimental approaches, computational techniques are needed for characterization of protein-RNA interactions. Although much progress has been made, adequate methodologies reliably providing atomic resolution structural details are still lacking. Although protein-RNA free docking approaches proved to be useful, in general, the template-based approaches provide higher quality of predictions. Templates are key to building a high quality model. Sequence/structure relationships were studied based on a representative set of binary protein-RNA complexes from PDB. Several approaches were tested for pairwise target/template alignment. The analysis revealed a transition point between random and correct binding modes. The results showed that structural alignment is better than sequence alignment in identifying good templates, suitable for generating protein-RNA complexes close to the native structure, and outperforms free docking, successfully predicting complexes where the free docking fails, including cases of significant conformational change upon binding. A template-based protein-RNA interaction modeling protocol PRIME was developed and benchmarked on a representative set of complexes.

  20. How to Choose the Suitable Template for Homology Modelling of GPCRs: 5-HT7 Receptor as a Test Case.

    PubMed

    Shahaf, Nir; Pappalardo, Matteo; Basile, Livia; Guccione, Salvatore; Rayan, Anwar

    2016-09-01

    G protein-coupled receptors (GPCRs) are a super-family of membrane proteins that attract great pharmaceutical interest due to their involvement in almost every physiological activity, including extracellular stimuli, neurotransmission, and hormone regulation. Currently, structural information on many GPCRs is mainly obtained by the techniques of computer modelling in general and by homology modelling in particular. Based on a quantitative analysis of eighteen antagonist-bound, resolved structures of rhodopsin family "A" receptors - also used as templates to build 153 homology models - it was concluded that a higher sequence identity between two receptors does not guarantee a lower RMSD between their structures, especially when their pair-wise sequence identity (within trans-membrane domain and/or in binding pocket) lies between 25 % and 40 %. This study suggests that we should consider all template receptors having a sequence identity ≤50 % with the query receptor. In fact, most of the GPCRs, compared to the currently available resolved structures of GPCRs, fall within this range and lack a correlation between structure and sequence. When testing suitability for structure-based drug design, it was found that choosing as a template the most similar resolved protein, based on sequence resemblance only, led to unsound results in many cases. Molecular docking analyses were carried out, and enrichment factors as well as attrition rates were utilized as criteria for assessing suitability for structure-based drug design. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  1. R-loops: targets for nuclease cleavage and repeat instability.

    PubMed

    Freudenreich, Catherine H

    2018-01-11

    R-loops form when transcribed RNA remains bound to its DNA template to form a stable RNA:DNA hybrid. Stable R-loops form when the RNA is purine-rich, and are further stabilized by DNA secondary structures on the non-template strand. Interestingly, many expandable and disease-causing repeat sequences form stable R-loops, and R-loops can contribute to repeat instability. Repeat expansions are responsible for multiple neurodegenerative diseases, including Huntington's disease, myotonic dystrophy, and several types of ataxias. Recently, it was found that R-loops at an expanded CAG/CTG repeat tract cause DNA breaks as well as repeat instability (Su and Freudenreich, Proc Natl Acad Sci USA 114, E8392-E8401, 2017). Two factors were identified as causing R-loop-dependent breaks at CAG/CTG tracts: deamination of cytosines and the MutLγ (Mlh1-Mlh3) endonuclease, defining two new mechanisms for how R-loops can generate DNA breaks (Su and Freudenreich, Proc Natl Acad Sci USA 114, E8392-E8401, 2017). Following R-loop-dependent nicking, base excision repair resulted in repeat instability. These results have implications for human repeat expansion diseases and provide a paradigm for how RNA:DNA hybrids can cause genome instability at structure-forming DNA sequences. This perspective summarizes mechanisms of R-loop-induced fragility at G-rich repeats and new links between DNA breaks and repeat instability.

  2. A Robust and Engineerable Self-Assembling Protein Template for the Synthesis and Patterning of Ordered Nanoparticle Arrays

    NASA Technical Reports Server (NTRS)

    McMillan, R. Andrew; Howard, Jeanie; Zaluzec, Nestor J.; Kagawa, Hiromi K.; Li, Yi-Fen; Paavola, Chad D.; Trent, Jonathan D.

    2004-01-01

    Self-assembling biomolecules that form highly ordered structures have attracted interest as potential alternatives to conventional lithographic processes for patterning materials. Here we introduce a general technique for patterning materials on the nanoscale using genetically modified protein cage structures called chaperonins that self-assemble into crystalline templates. Constrained chemical synthesis of transition metal nanoparticles is specific to templates genetically functionalized with poly-Histidine sequences. These arrays of materials are ordered by the nanoscale structure of the crystallized protein. This system may be easily adapted to pattern a variety of materials given the rapidly growing list of peptide sequences selected by screening for specificity for inorganic materials.

  3. RNA-Catalyzed RNA Ligation on an External RNA Template

    NASA Technical Reports Server (NTRS)

    McGinness, Kathleen E.; Joyce, Gerald F.

    2002-01-01

    Variants of the hc ligase ribozyme, which catalyzes ligation of the 3' end of an RNA substrate to the 5' end of the ribozyme, were utilized to evolve a ribozyme that catalyzes ligation reactions on an external RNA template. The evolved ribozyme catalyzes the joining of an oligonucleotide 3'-hydroxyl to the 5'-triphosphate of an RNA hairpin molecule. The ribozyme can also utilize various substrate sequences, demonstrating a largely sequence-independent mechanism for substrate recognition. The ribozyme also carries out the ligation of two oligonucleotides that are bound at adjacent positions on a complementary template. Finally, it catalyzes addition of mononucleoside '5-triphosphates onto the '3 end of an oligonucleotide primer in a template-dependent manner. The development of ribozymes that catalyze polymerase-type reactions contributes to the notion that an RNA world could have existed during the early history of life on Earth.

  4. An accurate algorithm for the detection of DNA fragments from dilution pool sequencing experiments.

    PubMed

    Bansal, Vikas

    2018-01-01

    The short read lengths of current high-throughput sequencing technologies limit the ability to recover long-range haplotype information. Dilution pool methods for preparing DNA sequencing libraries from high molecular weight DNA fragments enable the recovery of long DNA fragments from short sequence reads. These approaches require computational methods for identifying the DNA fragments using aligned sequence reads and assembling the fragments into long haplotypes. Although a number of computational methods have been developed for haplotype assembly, the problem of identifying DNA fragments from dilution pool sequence data has not received much attention. We formulate the problem of detecting DNA fragments from dilution pool sequencing experiments as a genome segmentation problem and develop an algorithm that uses dynamic programming to optimize a likelihood function derived from a generative model for the sequence reads. This algorithm uses an iterative approach to automatically infer the mean background read depth and the number of fragments in each pool. Using simulated data, we demonstrate that our method, FragmentCut, has 25-30% greater sensitivity compared with an HMM based method for fragment detection and can also detect overlapping fragments. On a whole-genome human fosmid pool dataset, the haplotypes assembled using the fragments identified by FragmentCut had greater N50 length, 16.2% lower switch error rate and 35.8% lower mismatch error rate compared with two existing methods. We further demonstrate the greater accuracy of our method using two additional dilution pool datasets. FragmentCut is available from https://bansal-lab.github.io/software/FragmentCut. vibansal@ucsd.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  5. GDF5 PROGENITORS GIVE RISE TO FIBROCARTILAGE CELLS THAT MINERALIZE VIA HEDGEHOG SIGNALING TO FORM THE ZONAL ENTHESIS

    PubMed Central

    Dyment, Nathaniel A.; Breidenbach, Andrew P.; Schwartz, Andrea G.; Russell, Ryan P.; Aschbacher-Smith, Lindsey; Liu, Han; Hagiwara, Yusuke; Jiang, Rulang; Thomopoulos, Stavros; Butler, David L.; Rowe, David W.

    2015-01-01

    The sequence of events that leads to the formation of a functionally graded enthesis is not clearly defined. The current study demonstrates that clonal expansion of Gdf5 progenitors contributes to linear growth of the enthesis. Prior to mineralization, Col1+ cells in the enthesis appose Col2+ cells of the underlying primary cartilage. At the onset of enthesis mineralization, cells at the base of the enthesis express alkaline phosphatase, Indian hedgehog, and ColX as they mineralize. The mineralization front then extends towards the tendon midsubstance as cells above the front become encapsulated in mineralized fibrocartilage over time. The hedgehog (Hh) pathway regulates this process, as Hh-responsive Gli1+ cells within the developing enthesis mature from unmineralized to mineralized fibrochondrocytes in response to activated signaling. Hh signaling is required for mineralization, as tissue-specific deletion of its obligate transducer Smoothened in the developing tendon and enthesis cells leads to significant reductions in the apposition of mineralized fibrocartilage. Together, these findings provide a spatiotemporal map of events – from expansion of the embryonic progenitor pool to synthesis of the collagen template and finally mineralization of this template – that leads to the formation of the mature zonal enthesis. These results can inform future tendon-to-bone repair strategies to create a mechanically functional enthesis in which tendon collagen fibers are anchored to bone through mineralized fibrocartilage. PMID:26141957

  6. Template-based structure modeling of protein-protein interactions

    PubMed Central

    Szilagyi, Andras; Zhang, Yang

    2014-01-01

    The structure of protein-protein complexes can be constructed by using the known structure of other protein complexes as a template. The complex structure templates are generally detected either by homology-based sequence alignments or, given the structure of monomer components, by structure-based comparisons. Critical improvements have been made in recent years by utilizing interface recognition and by recombining monomer and complex template libraries. Encouraging progress has also been witnessed in genome-wide applications of template-based modeling, with modeling accuracy comparable to high-throughput experimental data. Nevertheless, bottlenecks exist due to the incompleteness of the proteinprotein complex structure library and the lack of methods for distant homologous template identification and full-length complex structure refinement. PMID:24721449

  7. Hydroxyapatite-binding peptides for bone growth and inhibition

    DOEpatents

    Bertozzi, Carolyn R [Berkeley, CA; Song, Jie [Shrewsbury, MA; Lee, Seung-Wuk [Walnut Creek, CA

    2011-09-20

    Hydroxyapatite (HA)-binding peptides are selected using combinatorial phage library display. Pseudo-repetitive consensus amino acid sequences possessing periodic hydroxyl side chains in every two or three amino acid sequences are obtained. These sequences resemble the (Gly-Pro-Hyp).sub.x repeat of human type I collagen, a major component of extracellular matrices of natural bone. A consistent presence of basic amino acid residues is also observed. The peptides are synthesized by the solid-phase synthetic method and then used for template-driven HA-mineralization. Microscopy reveal that the peptides template the growth of polycrystalline HA crystals .about.40 nm in size.

  8. Iterated function systems for DNA replication

    NASA Astrophysics Data System (ADS)

    Gaspard, Pierre

    2017-10-01

    The kinetic equations of DNA replication are shown to be exactly solved in terms of iterated function systems, running along the template sequence and giving the statistical properties of the copy sequences, as well as the kinetic and thermodynamic properties of the replication process. With this method, different effects due to sequence heterogeneity can be studied, in particular, a transition between linear and sublinear growths in time of the copies, and a transition between continuous and fractal distributions of the local velocities of the DNA polymerase along the template. The method is applied to the human mitochondrial DNA polymerase γ without and with exonuclease proofreading.

  9. In vivo insertion pool sequencing identifies virulence factors in a complex fungal–host interaction

    PubMed Central

    Uhse, Simon; Pflug, Florian G.; Stirnberg, Alexandra; Ehrlinger, Klaus; von Haeseler, Arndt

    2018-01-01

    Large-scale insertional mutagenesis screens can be powerful genome-wide tools if they are streamlined with efficient downstream analysis, which is a serious bottleneck in complex biological systems. A major impediment to the success of next-generation sequencing (NGS)-based screens for virulence factors is that the genetic material of pathogens is often underrepresented within the eukaryotic host, making detection extremely challenging. We therefore established insertion Pool-Sequencing (iPool-Seq) on maize infected with the biotrophic fungus U. maydis. iPool-Seq features tagmentation, unique molecular barcodes, and affinity purification of pathogen insertion mutant DNA from in vivo-infected tissues. In a proof of concept using iPool-Seq, we identified 28 virulence factors, including 23 that were previously uncharacterized, from an initial pool of 195 candidate effector mutants. Because of its sensitivity and quantitative nature, iPool-Seq can be applied to any insertional mutagenesis library and is especially suitable for genetically complex setups like pooled infections of eukaryotic hosts. PMID:29684023

  10. Autonomous replication of nucleic acids by polymerization/nicking enzyme/DNAzyme cascades for the amplified detection of DNA and the aptamer-cocaine complex.

    PubMed

    Wang, Fuan; Freage, Lina; Orbach, Ron; Willner, Itamar

    2013-09-03

    The progressive development of amplified DNA sensors and aptasensors using replication/nicking enzymes/DNAzyme machineries is described. The sensing platforms are based on the tailoring of a DNA template on which the recognition of the target DNA or the formation of the aptamer-substrate complex trigger on the autonomous isothermal replication/nicking processes and the displacement of a Mg(2+)-dependent DNAzyme that catalyzes the generation of a fluorophore-labeled nucleic acid acting as readout signal for the analyses. Three different DNA sensing configurations are described, where in the ultimate configuration the target sequence is incorporated into a nucleic acid blocker structure associated with the sensing template. The target-triggered isothermal autonomous replication/nicking process on the modified template results in the formation of the Mg(2+)-dependent DNAzyme tethered to a free strand consisting of the target sequence. This activates additional template units for the nucleic acid self-replication process, resulting in the ultrasensitive detection of the target DNA (detection limit 1 aM). Similarly, amplified aptamer-based sensing platforms for cocaine are developed along these concepts. The modification of the cocaine-detection template by the addition of a nucleic acid sequence that enables the autonomous secondary coupled activation of a polymerization/nicking machinery and DNAzyme generation path leads to an improved analysis of cocaine (detection limit 10 nM).

  11. Rapid gene identification in sugar beet using deep sequencing of DNA from phenotypic pools selected from breeding panels.

    PubMed

    Ries, David; Holtgräwe, Daniela; Viehöver, Prisca; Weisshaar, Bernd

    2016-03-15

    The combination of bulk segregant analysis (BSA) and next generation sequencing (NGS), also known as mapping by sequencing (MBS), has been shown to significantly accelerate the identification of causal mutations for species with a reference genome sequence. The usual approach is to cross homozygous parents that differ for the monogenic trait to address, to perform deep sequencing of DNA from F2 plants pooled according to their phenotype, and subsequently to analyze the allele frequency distribution based on a marker table for the parents studied. The method has been successfully applied for EMS induced mutations as well as natural variation. Here, we show that pooling genetically diverse breeding lines according to a contrasting phenotype also allows high resolution mapping of the causal gene in a crop species. The test case was the monogenic locus causing red vs. green hypocotyl color in Beta vulgaris (R locus). We determined the allele frequencies of polymorphic sequences using sequence data from two diverging phenotypic pools of 180 B. vulgaris accessions each. A single interval of about 31 kbp among the nine chromosomes was identified which indeed contained the causative mutation. By applying a variation of the mapping by sequencing approach, we demonstrated that phenotype-based pooling of diverse accessions from breeding panels and subsequent direct determination of the allele frequency distribution can be successfully applied for gene identification in a crop species. Our approach made it possible to identify a small interval around the causative gene. Sequencing of parents or individual lines was not necessary. Whenever the appropriate plant material is available, the approach described saves time compared to the generation of an F2 population. In addition, we provide clues for planning similar experiments with regard to pool size and the sequencing depth required.

  12. Whole exome sequencing in recurrent early pregnancy loss

    PubMed Central

    Qiao, Ying; Wen, Jiadi; Tang, Flamingo; Martell, Sally; Shomer, Naomi; Leung, Peter C.K.; Stephenson, Mary D.; Rajcan-Separovic, Evica

    2016-01-01

    STUDY HYPOTHESIS Exome sequencing can identify genetic causes of idiopathic recurrent pregnancy loss (RPL). STUDY FINDING We identified compound heterozygous deleterious mutations affecting DYNC2H1 and ALOX15 in two out of four families with RPL. Both genes have a role in early development. Bioinformatics analysis of all genes with rare and putatively pathogenic mutations in miscarriages and couples showed enrichment in pathways relevant to pregnancy loss, including the complement and coagulation cascades pathways. WHAT IS KNOWN ALREADY Next generation sequencing (NGS) is increasingly being used to identify known and novel gene mutations in children with developmental delay and in fetuses with ultrasound-detected anomalies. In contrast, NGS is rarely used to study pregnancy loss. Chromosome microarray analysis detects putatively causative DNA copy number variants (CNVs) in ∼2% of miscarriages and CNVs of unknown significance (predominantly parental in origin) in up to 40% of miscarriages. Therefore, a large number of miscarriages still have an unknown cause. STUDY DESIGN, SAMPLES/MATERIALS, METHODS Whole exome sequencing (WES) was performed using Illumina HiSeq 2000 platform on seven euploid miscarriages from four families with RPL. Golden Helix SVS v8.1.5 was used for data assessment and inheritance analysis for deleterious DNA variants predicted to severely disrupt protein-coding genes by introducing a frameshift, loss of the stop codon, gain of the stop codon, changes in splicing or the initial codon. Webgestalt (http://bioinfo.vanderbilt.edu/webgestalt/) was used for pathway and disease association enrichment analysis of a gene pool containing putatively pathogenic variants in miscarriages and couples in comparison to control gene pools. MAIN RESULTS AND THE ROLE OF CHANCE Compound heterozygous mutations in DYNC2H1 and ALOX15 were identified in miscarriages from two families with RPL. DYNC2H1 is involved in cilia biogenesis and has been associated with fetal lethality in humans. ALOX15 is expressed in placenta and its dysregulation has been associated with inflammation, placental, dysfunction, abnormal oxidative stress response and angiogenesis. The pool of putatively pathogenic single nucleotide variants (SNVs) and small insertions and deletions (indels) detected in the miscarriages showed enrichment in ‘complement and coagulation cascades pathway’, and ‘ciliary motility disorders’. We conclude that CNVs, individual SNVs and pool of deleterious gene mutations identified by exome sequencing could contribute to RPL. LIMITATIONS, REASONS FOR CAUTION The size of our sample cohort is small. The functional effect of candidate mutations should be evaluated to determine whether the mutations are causative. WIDER IMPLICATIONS OF THE FINDINGS This is the first study to assess whether SNVs may contribute to the pathogenesis of miscarriage. Furthermore, our findings suggest that collective effect of mutations in relevant biological pathways could be implicated in RPL. STUDY FUNDING AND COMPETING INTEREST(S) The study was funded by Canadian Institutes of Health Research (grant MOP 106467) and Michael Smith Foundation of Health Research Career Scholar salary award to ERS. PMID:26826164

  13. I-TASSER: fully automated protein structure prediction in CASP8.

    PubMed

    Zhang, Yang

    2009-01-01

    The I-TASSER algorithm for 3D protein structure prediction was tested in CASP8, with the procedure fully automated in both the Server and Human sections. The quality of the server models is close to that of human ones but the human predictions incorporate more diverse templates from other servers which improve the human predictions in some of the distant homology targets. For the first time, the sequence-based contact predictions from machine learning techniques are found helpful for both template-based modeling (TBM) and template-free modeling (FM). In TBM, although the accuracy of the sequence based contact predictions is on average lower than that from template-based ones, the novel contacts in the sequence-based predictions, which are complementary to the threading templates in the weakly or unaligned regions, are important to improve the global and local packing in these regions. Moreover, the newly developed atomic structural refinement algorithm was tested in CASP8 and found to improve the hydrogen-bonding networks and the overall TM-score, which is mainly due to its ability of removing steric clashes so that the models can be generated from cluster centroids. Nevertheless, one of the major issues of the I-TASSER pipeline is the model selection where the best models could not be appropriately recognized when the correct templates are detected only by the minority of the threading algorithms. There are also problems related with domain-splitting and mirror image recognition which mainly influences the performance of I-TASSER modeling in the FM-based structure predictions. Copyright 2009 Wiley-Liss, Inc.

  14. Template Dimerization Promotes an Acceptor Invasion-Induced Transfer Mechanism during Human Immunodeficiency Virus Type 1 Minus-Strand Synthesis

    PubMed Central

    Balakrishnan, Mini; Roques, Bernard P.; Fay, Philip J.; Bambara, Robert A.

    2003-01-01

    The biochemical mechanism of template switching by human immunodeficiency virus type 1 (HIV-1) reverse transcriptase and the role of template dimerization were examined. Homologous donor-acceptor template pairs derived from the HIV-1 untranslated leader region and containing the wild-type and mutant dimerization initiation sequences (DIS) were used to examine the efficiency and distribution of transfers. Inhibiting donor-acceptor interaction was sufficient to reduce transfers in DIS-containing template pairs, indicating that template dimerization, and not the mere presence of the DIS, promotes efficient transfers. Additionally, we show evidence that the overall transfer process spans an extended region of the template and proceeds through a two-step mechanism. Transfer is initiated through an RNase H-facilitated acceptor invasion step, while synthesis continues on the donor template. The invasion then propagates towards the primer terminus by branch migration. Transfer is completed with the translocation of the primer terminus at a site distant from the invasion point. In our system, most invasions initiated before synthesis reached the DIS. However, transfer of the primer terminus predominantly occurred after synthesis through the DIS. The two steps were separated by 60 to 80 nucleotides. Sequence markers revealed the position of primer terminus switch, whereas DNA oligomers designed to block acceptor-cDNA interactions defined sites of invasion. Within the region of homology, certain positions on the template were inherently more favorable for invasion than others. In templates with DIS, the proximity of the acceptor facilitates invasion, thereby enhancing transfer efficiency. Nucleocapsid protein enhanced the overall efficiency of transfers but did not alter the mechanism. PMID:12663778

  15. Gene expression profiling of pre-eclamptic placentae by RNA sequencing.

    PubMed

    Kaartokallio, Tea; Cervera, Alejandra; Kyllönen, Anjuska; Laivuori, Krista; Kere, Juha; Laivuori, Hannele

    2015-09-21

    Pre-eclampsia is a common and complex pregnancy disorder that often involves impaired placental development. In order to identify altered gene expression in pre-eclamptic placenta, we sequenced placental transcriptomes of nine pre-eclamptic and nine healthy pregnant women in pools of three. The differential gene expression was tested both by including all the pools in the analysis and by excluding some of the pools based on phenotypic characteristics. From these analyses, we identified altogether 53 differently expressed genes, a subset of which was validated by qPCR in 20 cases and 19 controls. Furthermore, we conducted pathway and functional analyses which revealed disturbed vascular function and immunological balance in pre-eclamptic placenta. Some of the genes identified in our study have been reported by numerous microarray studies (BHLHE40, FSTL3, HK2, HTRA4, LEP, PVRL4, SASH1, SIGLEC6), but many have been implicated in only few studies or have not previously been linked to pre-eclampsia (ARMS2, BTNL9, CCSAP, DIO2, FER1L4, HPSE, LOC100129345, LYN, MYO7B, NCMAP, NDRG1, NRIP1, PLIN2, SBSPON, SERPINB9, SH3BP5, TET3, TPBG, ZNF175). Several of the molecules produced by these genes may have a role in the pathogenesis of pre-eclampsia, and some could qualify as biomarkers for prediction or detection of this pregnancy complication.

  16. Gene expression profiling of pre-eclamptic placentae by RNA sequencing

    PubMed Central

    Kaartokallio, Tea; Cervera, Alejandra; Kyllönen, Anjuska; Laivuori, Krista; Laivuori, Hannele; Heinonen, Seppo; Kajantie, Eero; Kere, Juha; Kivinen, Katja; Pouta, Anneli

    2015-01-01

    Pre-eclampsia is a common and complex pregnancy disorder that often involves impaired placental development. In order to identify altered gene expression in pre-eclamptic placenta, we sequenced placental transcriptomes of nine pre-eclamptic and nine healthy pregnant women in pools of three. The differential gene expression was tested both by including all the pools in the analysis and by excluding some of the pools based on phenotypic characteristics. From these analyses, we identified altogether 53 differently expressed genes, a subset of which was validated by qPCR in 20 cases and 19 controls. Furthermore, we conducted pathway and functional analyses which revealed disturbed vascular function and immunological balance in pre-eclamptic placenta. Some of the genes identified in our study have been reported by numerous microarray studies (BHLHE40, FSTL3, HK2, HTRA4, LEP, PVRL4, SASH1, SIGLEC6), but many have been implicated in only few studies or have not previously been linked to pre-eclampsia (ARMS2, BTNL9, CCSAP, DIO2, FER1L4, HPSE, LOC100129345, LYN, MYO7B, NCMAP, NDRG1, NRIP1, PLIN2, SBSPON, SERPINB9, SH3BP5, TET3, TPBG, ZNF175). Several of the molecules produced by these genes may have a role in the pathogenesis of pre-eclampsia, and some could qualify as biomarkers for prediction or detection of this pregnancy complication. PMID:26388242

  17. Multiplexed resequencing analysis to identify rare variants in pooled DNA with barcode indexing using next-generation sequencer.

    PubMed

    Mitsui, Jun; Fukuda, Yoko; Azuma, Kyo; Tozaki, Hirokazu; Ishiura, Hiroyuki; Takahashi, Yuji; Goto, Jun; Tsuji, Shoji

    2010-07-01

    We have recently found that multiple rare variants of the glucocerebrosidase gene (GBA) confer a robust risk for Parkinson disease, supporting the 'common disease-multiple rare variants' hypothesis. To develop an efficient method of identifying rare variants in a large number of samples, we applied multiplexed resequencing using a next-generation sequencer to identification of rare variants of GBA. Sixteen sets of pooled DNAs from six pooled DNA samples were prepared. Each set of pooled DNAs was subjected to polymerase chain reaction to amplify the target gene (GBA) covering 6.5 kb, pooled into one tube with barcode indexing, and then subjected to extensive sequence analysis using the SOLiD System. Individual samples were also subjected to direct nucleotide sequence analysis. With the optimization of data processing, we were able to extract all the variants from 96 samples with acceptable rates of false-positive single-nucleotide variants.

  18. Homopolymer tail-mediated ligation PCR: a streamlined and highly efficient method for DNA cloning and library construction.

    PubMed

    Lazinski, David W; Camilli, Andrew

    2013-01-01

    The amplification of DNA fragments, cloned between user-defined 5' and 3' end sequences, is a prerequisite step in the use of many current applications including massively parallel sequencing (MPS). Here we describe an improved method, called homopolymer tail-mediated ligation PCR (HTML-PCR), that requires very little starting template, minimal hands-on effort, is cost-effective, and is suited for use in high-throughput and robotic methodologies. HTML-PCR starts with the addition of homopolymer tails of controlled lengths to the 3' termini of a double-stranded genomic template. The homopolymer tails enable the annealing-assisted ligation of a hybrid oligonucleotide to the template's recessed 5' ends. The hybrid oligonucleotide has a user-defined sequence at its 5' end. This primer, together with a second primer composed of a longer region complementary to the homopolymer tail and fused to a second 5' user-defined sequence, are used in a PCR reaction to generate the final product. The user-defined sequences can be varied to enable compatibility with a wide variety of downstream applications. We demonstrate our new method by constructing MPS libraries starting from nanogram and sub-nanogram quantities of Vibrio cholerae and Streptococcus pneumoniae genomic DNA.

  19. Droplet-based pyrosequencing using digital microfluidics.

    PubMed

    Boles, Deborah J; Benton, Jonathan L; Siew, Germaine J; Levy, Miriam H; Thwar, Prasanna K; Sandahl, Melissa A; Rouse, Jeremy L; Perkins, Lisa C; Sudarsan, Arjun P; Jalili, Roxana; Pamula, Vamsee K; Srinivasan, Vijay; Fair, Richard B; Griffin, Peter B; Eckhardt, Allen E; Pollack, Michael G

    2011-11-15

    The feasibility of implementing pyrosequencing chemistry within droplets using electrowetting-based digital microfluidics is reported. An array of electrodes patterned on a printed-circuit board was used to control the formation, transportation, merging, mixing, and splitting of submicroliter-sized droplets contained within an oil-filled chamber. A three-enzyme pyrosequencing protocol was implemented in which individual droplets contained enzymes, deoxyribonucleotide triphosphates (dNTPs), and DNA templates. The DNA templates were anchored to magnetic beads which enabled them to be thoroughly washed between nucleotide additions. Reagents and protocols were optimized to maximize signal over background, linearity of response, cycle efficiency, and wash efficiency. As an initial demonstration of feasibility, a portion of a 229 bp Candida parapsilosis template was sequenced using both a de novo protocol and a resequencing protocol. The resequencing protocol generated over 60 bp of sequence with 100% sequence accuracy based on raw pyrogram levels. Excellent linearity was observed for all of the homopolymers (two, three, or four nucleotides) contained in the C. parapsilosis sequence. With improvements in microfluidic design it is expected that longer reads, higher throughput, and improved process integration (i.e., "sample-to-sequence" capability) could eventually be achieved using this low-cost platform.

  20. Multiview human activity recognition system based on spatiotemporal template for video surveillance system

    NASA Astrophysics Data System (ADS)

    Kushwaha, Alok Kumar Singh; Srivastava, Rajeev

    2015-09-01

    An efficient view invariant framework for the recognition of human activities from an input video sequence is presented. The proposed framework is composed of three consecutive modules: (i) detect and locate people by background subtraction, (ii) view invariant spatiotemporal template creation for different activities, (iii) and finally, template matching is performed for view invariant activity recognition. The foreground objects present in a scene are extracted using change detection and background modeling. The view invariant templates are constructed using the motion history images and object shape information for different human activities in a video sequence. For matching the spatiotemporal templates for various activities, the moment invariants and Mahalanobis distance are used. The proposed approach is tested successfully on our own viewpoint dataset, KTH action recognition dataset, i3DPost multiview dataset, MSR viewpoint action dataset, VideoWeb multiview dataset, and WVU multiview human action recognition dataset. From the experimental results and analysis over the chosen datasets, it is observed that the proposed framework is robust, flexible, and efficient with respect to multiple views activity recognition, scale, and phase variations.

  1. The role of sequence context, nucleotide pool balance and stress in 2′-deoxynucleotide misincorporation in viral, bacterial and mammalian RNA

    PubMed Central

    Wang, Jin; Dong, Hongping; Chionh, Yok Hian; McBee, Megan E.; Sirirungruang, Sasilada; Cunningham, Richard P.; Shi, Pei-Yong; Dedon, Peter C.

    2016-01-01

    The misincorporation of 2′-deoxyribonucleotides (dNs) into RNA has important implications for the function of non-coding RNAs, the translational fidelity of coding RNAs and the mutagenic evolution of viral RNA genomes. However, quantitative appreciation for the degree to which dN misincorporation occurs is limited by the lack of analytical tools. Here, we report a method to hydrolyze RNA to release 2′-deoxyribonucleotide-ribonucleotide pairs (dNrN) that are then quantified by chromatography-coupled mass spectrometry (LC-MS). Using this platform, we found misincorporated dNs occurring at 1 per 103 to 105 ribonucleotide (nt) in mRNA, rRNAs and tRNA in human cells, Escherichia coli, Saccharomyces cerevisiae and, most abundantly, in the RNA genome of dengue virus. The frequency of dNs varied widely among organisms and sequence contexts, and partly reflected the in vitro discrimination efficiencies of different RNA polymerases against 2′-deoxyribonucleoside 5′-triphosphates (dNTPs). Further, we demonstrate a strong link between dN frequencies in RNA and the balance of dNTPs and ribonucleoside 5′-triphosphates (rNTPs) in the cellular pool, with significant stress-induced variation of dN incorporation. Potential implications of dNs in RNA are discussed, including the possibilities of dN incorporation in RNA as a contributing factor in viral evolution and human disease, and as a host immune defense mechanism against viral infections. PMID:27365049

  2. A label-free fluorescent direct detection of live Salmonella typhimurium using cascade triple trigger sequences-regenerated strand displacement amplification and hairpin template-generated-scaffolded silver nanoclusters.

    PubMed

    Zhang, Peng; Liu, Hui; Li, Xiaocheng; Ma, Suzhen; Men, Shuai; Wei, Heng; Cui, Jingjing; Wang, Hongning

    2017-01-15

    The harm of Salmonella typhimurium (S. typhimurium) to public health mainly by the consumption of contaminated agricultural products or water stresses an urgent need for rapid detection methods to help control the spread of S. typhimurium. In this work, an intelligently designed sensor system took creative advantage of triple trigger sequences-regenerated strand displacement amplification and self-protective hairpin template-generated-scaffolded silver nanoclusters (AgNCs) for the first time. In the presence of live S. typhimurium, single-stranded trigger sequences were released from aptamer-trigger sequences complex, initiating a branch migration to open the hairpin template I containing complementary scaffolds of AgNCs. Then the first strand displacement amplification was induced to produce numerous scaffolds of AgNCs and reporter strands which initiated a branch migration to open the hairpin template II containing complementary scaffolds of AgNCs. Then the second strand displacement amplification was induced to generate numerous scaffolds of AgNCs and trigger sequences which initiated the third branch migration and strand displacement amplification to produce numerous scaffolds of AgNCs and reporter strands in succession. Cyclically, the reproduction of the trigger sequences and cascade successive production of scaffolds were achieved successfully, forming highly fluorescent AgNCs, thus providing significantly enhanced fluorescent signals to achieve ultrasensitive detection of live S. typhimurium down to 50 CFU/mL with a linear range from 10 2 to 10 7 CFU/mL. It is the first report on a fluorescent biosensor for detecting viable S. typhimurium directly, which can distinguish from heat denatured S. typhimurium. And it develops a new strategy to generate the DNA-scaffolds for forming AgNCs. Copyright © 2016 Elsevier B.V. All rights reserved.

  3. Mechanism of chimera formation during the Multiple Displacement Amplification reaction.

    PubMed

    Lasken, Roger S; Stockwell, Timothy B

    2007-04-12

    Multiple Displacement Amplification (MDA) is a method used for amplifying limiting DNA sources. The high molecular weight amplified DNA is ideal for DNA library construction. While this has enabled genomic sequencing from one or a few cells of unculturable microorganisms, the process is complicated by the tendency of MDA to generate chimeric DNA rearrangements in the amplified DNA. Determining the source of the DNA rearrangements would be an important step towards reducing or eliminating them. Here, we characterize the major types of chimeras formed by carrying out an MDA whole genome amplification from a single E. coli cell and sequencing by the 454 Life Sciences method. Analysis of 475 chimeras revealed the predominant reaction mechanisms that create the DNA rearrangements. The highly branched DNA synthesized in MDA can assume many alternative secondary structures. DNA strands extended on an initial template can be displaced becoming available to prime on a second template creating the chimeras. Evidence supports a model in which branch migration can displace 3'-ends freeing them to prime on the new templates. More than 85% of the resulting DNA rearrangements were inverted sequences with intervening deletions that the model predicts. Intramolecular rearrangements were favored, with displaced 3'-ends reannealing to single stranded 5'-strands contained within the same branched DNA molecule. In over 70% of the chimeric junctions, the 3' termini had initiated priming at complimentary sequences of 2-21 nucleotides (nts) in the new templates. Formation of chimeras is an important limitation to the MDA method, particularly for whole genome sequencing. Identification of the mechanism for chimera formation provides new insight into the MDA reaction and suggests methods to reduce chimeras. The 454 sequencing approach used here will provide a rapid method to assess the utility of reaction modifications.

  4. Mechanism of chimera formation during the Multiple Displacement Amplification reaction

    PubMed Central

    Lasken, Roger S; Stockwell, Timothy B

    2007-01-01

    Background Multiple Displacement Amplification (MDA) is a method used for amplifying limiting DNA sources. The high molecular weight amplified DNA is ideal for DNA library construction. While this has enabled genomic sequencing from one or a few cells of unculturable microorganisms, the process is complicated by the tendency of MDA to generate chimeric DNA rearrangements in the amplified DNA. Determining the source of the DNA rearrangements would be an important step towards reducing or eliminating them. Results Here, we characterize the major types of chimeras formed by carrying out an MDA whole genome amplification from a single E. coli cell and sequencing by the 454 Life Sciences method. Analysis of 475 chimeras revealed the predominant reaction mechanisms that create the DNA rearrangements. The highly branched DNA synthesized in MDA can assume many alternative secondary structures. DNA strands extended on an initial template can be displaced becoming available to prime on a second template creating the chimeras. Evidence supports a model in which branch migration can displace 3'-ends freeing them to prime on the new templates. More than 85% of the resulting DNA rearrangements were inverted sequences with intervening deletions that the model predicts. Intramolecular rearrangements were favored, with displaced 3'-ends reannealing to single stranded 5'-strands contained within the same branched DNA molecule. In over 70% of the chimeric junctions, the 3' termini had initiated priming at complimentary sequences of 2–21 nucleotides (nts) in the new templates. Conclusion Formation of chimeras is an important limitation to the MDA method, particularly for whole genome sequencing. Identification of the mechanism for chimera formation provides new insight into the MDA reaction and suggests methods to reduce chimeras. The 454 sequencing approach used here will provide a rapid method to assess the utility of reaction modifications. PMID:17430586

  5. West Nile virus, Anopheles flavivirus, a novel flavivirus as well as Merida-like rhabdovirus Turkey in field-collected mosquitoes from Thrace and Anatolia.

    PubMed

    Öncü, Ceren; Brinkmann, Annika; Günay, Filiz; Kar, Sırrı; Öter, Kerem; Sarıkaya, Yasemen; Nitsche, Andreas; Linton, Yvonne-Marie; Alten, Bülent; Ergünay, Koray

    2018-01-01

    Mosquitoes are involved in the transmission and maintenance of several viral diseases with significant health impact. Biosurveillance efforts have also revealed insect-specific viruses, observed to cocirculate with pathogenic strains. This report describes the findings of flavivirus and rhabdovirus screening, performed in eastern Thrace and Aegean region of Anatolia during 2016, including and expanding on locations with previously-documented virus activity. A mosquito cohort of 1545 individuals comprising 14 species were collected and screened in 108 pools via generic and specific amplification and direct metagenomics by next generation sequencing. Seven mosquito pools (6.4%) were positive in the flavivirus screening. West Nile virus lineage 1 clade 1a sequences were characterized in a pool Culex pipiens sensu lato specimens, providing the initial virus detection in Aegean region following 2010 outbreak. In an Anopheles maculipennis sensu lato pool, sequences closely-related to Anopheles flaviviruses were obtained, with similarities to several African and Australian strains of this new insect-specific flavivirus clade. In pools comprising Uranotaenia unguiculata (n=3), Cx. pipiens s.l. (n=1) and Aedes caspius (n=1) mosquitoes, sequences of a novel flavivirus, distantly-related to Flavivirus AV2011, identified previously in Spain and Turkey, were characterized. Moreover, DNA forms of the novel flavivirus were detected in two Ur. unguiculata pools. These sequences were highly-similar to the sequences amplified from viral RNA, with undisrupted reading frames, suggest the occurrence of viral DNA forms in natural conditions within mosquito hosts. Rhabdovirus screening revealed sequences of a recently-described novel virus, named the Merida-like virus Turkey (MERDLVT) in 5 Cx. pipiens s.l. pools (4.6%). Partial L and N gene sequences of MERDLVT were well-conserved among strains, with evidence for geographical clustering in phylogenetic analyses. Metagenomics provided the near-full genomic sequence in a specimen, revealing an identical genome organization and limited divergence from the prototype MERDLVT isolate. Copyright © 2017 Elsevier B.V. All rights reserved.

  6. Repair of DNA double-strand breaks by templated nucleotide sequence insertions derived from distant regions of the genome.

    PubMed

    Onozawa, Masahiro; Zhang, Zhenhua; Kim, Yoo Jung; Goldberg, Liat; Varga, Tamas; Bergsagel, P Leif; Kuehl, W Michael; Aplan, Peter D

    2014-05-27

    We used the I-SceI endonuclease to produce DNA double-strand breaks (DSBs) and observed that a fraction of these DSBs were repaired by insertion of sequences, which we termed "templated sequence insertions" (TSIs), derived from distant regions of the genome. These TSIs were derived from genic, retrotransposon, or telomere sequences and were not deleted from the donor site in the genome, leading to the hypothesis that they were derived from reverse-transcribed RNA. Cotransfection of RNA and an I-SceI expression vector demonstrated insertion of RNA-derived sequences at the DNA-DSB site, and TSIs were suppressed by reverse-transcriptase inhibitors. Both observations support the hypothesis that TSIs were derived from RNA templates. In addition, similar insertions were detected at sites of DNA DSBs induced by transcription activator-like effector nuclease proteins. Whole-genome sequencing of myeloma cell lines revealed additional TSIs, demonstrating that repair of DNA DSBs via insertion was not restricted to experimentally produced DNA DSBs. Analysis of publicly available databases revealed that many of these TSIs are polymorphic in the human genome. Taken together, these results indicate that insertional events should be considered as alternatives to gross chromosomal rearrangements in the interpretation of whole-genome sequence data and that this mutagenic form of DNA repair may play a role in genetic disease, exon shuffling, and mammalian evolution.

  7. Simultaneous identification and DNA barcoding of six Eimeria species infecting turkeys using PCR primers targeting the mitochondrial cytochrome c oxidase subunit I (mtCOI) locus.

    PubMed

    Hafeez, Mian A; Shivaramaiah, Srichaitanya; Dorsey, Kristi Moore; Ogedengbe, Mosun E; El-Sherry, Shiem; Whale, Julia; Cobean, Julie; Barta, John R

    2015-05-01

    Species-specific PCR primers targeting the mitochondrial cytochrome c oxidase subunit I (mtCOI) locus were generated that allow for the specific identification of the most common Eimeria species infecting turkeys (i.e., Eimeria adenoeides, Eimeria meleagrimitis, Eimeria gallopavonis, Eimeria meleagridis, Eimeria dispersa, and Eimeria innocua). PCR reaction chemistries were optimized with respect to divalent cation (MgCl2) and dNTP concentrations, as well as PCR cycling conditions (particularly anneal temperature for primers). Genomic DNA samples from single oocyst-derived lines of six Eimeria species were tested to establish specificity and sensitivity of these newly designed primer pairs. A mixed 60-ng total DNA sample containing 10 ng of each of the six Eimeria species was used as DNA template to demonstrate specific amplification of the correct product using each of the species-specific primer pairs. Ten nanograms of each of the five non-target Eimeria species was pooled to provide a non-target, control DNA sample suitable to test the specificity of each primer pair. The amplifications of the COI region with species-specific primer pairs from pooled samples yielded products of expected sizes (209 to 1,012 bp) and no amplification of non-target Eimeria sp. DNA was detected using the non-target, control DNA samples. These primer pairs specific for Eimeria spp. of turkeys did not amplify any of the seven Eimeria species infecting chickens. The newly developed PCR primers can be used as a diagnostic tool capable of specifically identifying six turkey Eimeria species; additionally, sequencing of the PCR amplification products yields sequence-based genotyping data suitable for identification and molecular phylogenetics.

  8. Whole Genome Sequence Analysis of Mutations Accumulated in rad27Δ Yeast Strains with Defects in the Processing of Okazaki Fragments Indicates Template-Switching Events

    PubMed Central

    Omer, Sumita; Lavi, Bar; Mieczkowski, Piotr A.; Covo, Shay; Hazkani-Covo, Einat

    2017-01-01

    Okazaki fragments that are formed during lagging strand DNA synthesis include an initiating primer consisting of both RNA and DNA. The RNA fragment must be removed before the fragments are joined. In Saccharomyces cerevisiae, a key player in this process is the structure-specific flap endonuclease, Rad27p (human homolog FEN1). To obtain a genomic view of the mutational consequence of loss of RAD27, a S. cerevisiae rad27Δ strain was subcultured for 25 generations and sequenced using Illumina paired-end sequencing. Out of the 455 changes observed in 10 colonies isolated the two most common types of events were insertions or deletions (INDELs) in simple sequence repeats (SSRs) and INDELs mediated by short direct repeats. Surprisingly, we also detected a previously neglected class of 21 template-switching events. These events were presumably generated by quasi-palindrome to palindrome correction, as well as palindrome elongation. The formation of these events is best explained by folding back of the stalled nascent strand and resumption of DNA synthesis using the same nascent strand as a template. Evidence of quasi-palindrome to palindrome correction that could be generated by template switching appears also in yeast genome evolution. Out of the 455 events, 55 events appeared in multiple isolates; further analysis indicates that these loci are mutational hotspots. Since Rad27 acts on the lagging strand when the leading strand should not contain any gaps, we propose a mechanism favoring intramolecular strand switching over an intermolecular mechanism. We note that our results open new ways of understanding template switching that occurs during genome instability and evolution. PMID:28974572

  9. ModeRNA server: an online tool for modeling RNA 3D structures.

    PubMed

    Rother, Magdalena; Milanowska, Kaja; Puton, Tomasz; Jeleniewicz, Jaroslaw; Rother, Kristian; Bujnicki, Janusz M

    2011-09-01

    The diverse functional roles of non-coding RNA molecules are determined by their underlying structure. ModeRNA server is an online tool for RNA 3D structure modeling by the comparative approach, based on a template RNA structure and a user-defined target-template sequence alignment. It offers an option to search for potential templates, given the target sequence. The server also provides tools for analyzing, editing and formatting of RNA structure files. It facilitates the use of the ModeRNA software and offers new options in comparison to the standalone program. ModeRNA server was implemented using the Python language and the Django web framework. It is freely available at http://iimcb.genesilico.pl/modernaserver. iamb@genesilico.pl.

  10. Genome characterization of Long Island tick rhabdovirus, a new virus identified in Amblyomma americanum ticks

    PubMed Central

    2014-01-01

    Background Ticks are implicated as hosts to a wide range of animal and human pathogens. The full range of microbes harbored by ticks has not yet been fully explored. Methods As part of a viral surveillance and discovery project in arthropods, we used unbiased high-throughput sequencing to examine viromes of ticks collected on Long Island, New York in 2013. Results We detected and sequenced the complete genome of a novel rhabdovirus originating from a pool of Amblyomma americanum ticks. This virus, which we provisionally name Long Island tick rhabdovirus, is distantly related to Moussa virus from Africa. Conclusions The Long Island tick rhabdovirus may represent a novel species within family Rhabdoviridae. PMID:24517260

  11. Substrate recognition by ribonucleoprotein ribonuclease MRP

    PubMed Central

    Esakova, Olga; Perederina, Anna; Quan, Chao; Berezin, Igor; Krasilnikov, Andrey S.

    2011-01-01

    The ribonucleoprotein complex ribonuclease (RNase) MRP is a site-specific endoribonuclease essential for the survival of the eukaryotic cell. RNase MRP closely resembles RNase P (a universal endoribonuclease responsible for the maturation of the 5′ ends of tRNA) but recognizes distinct substrates including pre-rRNA and mRNA. Here we report the results of an in vitro selection of Saccharomyces cerevisiae RNase MRP substrates starting from a pool of random sequences. The results indicate that RNase MRP cleaves single-stranded RNA and is sensitive to sequences in the immediate vicinity of the cleavage site requiring a cytosine at the position +4 relative to the cleavage site. Structural implications of the differences in substrate recognition by RNases P and MRP are discussed. PMID:21173200

  12. Substrate recognition by ribonucleoprotein ribonuclease MRP.

    PubMed

    Esakova, Olga; Perederina, Anna; Quan, Chao; Berezin, Igor; Krasilnikov, Andrey S

    2011-02-01

    The ribonucleoprotein complex ribonuclease (RNase) MRP is a site-specific endoribonuclease essential for the survival of the eukaryotic cell. RNase MRP closely resembles RNase P (a universal endoribonuclease responsible for the maturation of the 5' ends of tRNA) but recognizes distinct substrates including pre-rRNA and mRNA. Here we report the results of an in vitro selection of Saccharomyces cerevisiae RNase MRP substrates starting from a pool of random sequences. The results indicate that RNase MRP cleaves single-stranded RNA and is sensitive to sequences in the immediate vicinity of the cleavage site requiring a cytosine at the position +4 relative to the cleavage site. Structural implications of the differences in substrate recognition by RNases P and MRP are discussed.

  13. Modularity of Protein Folds as a Tool for Template-Free Modeling of Structures.

    PubMed

    Vallat, Brinda; Madrid-Aliste, Carlos; Fiser, Andras

    2015-08-01

    Predicting the three-dimensional structure of proteins from their amino acid sequences remains a challenging problem in molecular biology. While the current structural coverage of proteins is almost exclusively provided by template-based techniques, the modeling of the rest of the protein sequences increasingly require template-free methods. However, template-free modeling methods are much less reliable and are usually applicable for smaller proteins, leaving much space for improvement. We present here a novel computational method that uses a library of supersecondary structure fragments, known as Smotifs, to model protein structures. The library of Smotifs has saturated over time, providing a theoretical foundation for efficient modeling. The method relies on weak sequence signals from remotely related protein structures to create a library of Smotif fragments specific to the target protein sequence. This Smotif library is exploited in a fragment assembly protocol to sample decoys, which are assessed by a composite scoring function. Since the Smotif fragments are larger in size compared to the ones used in other fragment-based methods, the proposed modeling algorithm, SmotifTF, can employ an exhaustive sampling during decoy assembly. SmotifTF successfully predicts the overall fold of the target proteins in about 50% of the test cases and performs competitively when compared to other state of the art prediction methods, especially when sequence signal to remote homologs is diminishing. Smotif-based modeling is complementary to current prediction methods and provides a promising direction in addressing the structure prediction problem, especially when targeting larger proteins for modeling.

  14. Template-based modeling and ab initio refinement of protein oligomer structures using GALAXY in CAPRI round 30.

    PubMed

    Lee, Hasup; Baek, Minkyung; Lee, Gyu Rie; Park, Sangwoo; Seok, Chaok

    2017-03-01

    Many proteins function as homo- or hetero-oligomers; therefore, attempts to understand and regulate protein functions require knowledge of protein oligomer structures. The number of available experimental protein structures is increasing, and oligomer structures can be predicted using the experimental structures of related proteins as templates. However, template-based models may have errors due to sequence differences between the target and template proteins, which can lead to functional differences. Such structural differences may be predicted by loop modeling of local regions or refinement of the overall structure. In CAPRI (Critical Assessment of PRotein Interactions) round 30, we used recently developed features of the GALAXY protein modeling package, including template-based structure prediction, loop modeling, model refinement, and protein-protein docking to predict protein complex structures from amino acid sequences. Out of the 25 CAPRI targets, medium and acceptable quality models were obtained for 14 and 1 target(s), respectively, for which proper oligomer or monomer templates could be detected. Symmetric interface loop modeling on oligomer model structures successfully improved model quality, while loop modeling on monomer model structures failed. Overall refinement of the predicted oligomer structures consistently improved the model quality, in particular in interface contacts. Proteins 2017; 85:399-407. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  15. Template-Based Modeling of Protein-RNA Interactions

    PubMed Central

    Zheng, Jinfang; Kundrotas, Petras J.; Vakser, Ilya A.

    2016-01-01

    Protein-RNA complexes formed by specific recognition between RNA and RNA-binding proteins play an important role in biological processes. More than a thousand of such proteins in human are curated and many novel RNA-binding proteins are to be discovered. Due to limitations of experimental approaches, computational techniques are needed for characterization of protein-RNA interactions. Although much progress has been made, adequate methodologies reliably providing atomic resolution structural details are still lacking. Although protein-RNA free docking approaches proved to be useful, in general, the template-based approaches provide higher quality of predictions. Templates are key to building a high quality model. Sequence/structure relationships were studied based on a representative set of binary protein-RNA complexes from PDB. Several approaches were tested for pairwise target/template alignment. The analysis revealed a transition point between random and correct binding modes. The results showed that structural alignment is better than sequence alignment in identifying good templates, suitable for generating protein-RNA complexes close to the native structure, and outperforms free docking, successfully predicting complexes where the free docking fails, including cases of significant conformational change upon binding. A template-based protein-RNA interaction modeling protocol PRIME was developed and benchmarked on a representative set of complexes. PMID:27662342

  16. Application of multi-objective optimization to pooled experiments of next generation sequencing for detection of rare mutations.

    PubMed

    Zilinskas, Julius; Lančinskas, Algirdas; Guarracino, Mario Rosario

    2014-01-01

    In this paper we propose some mathematical models to plan a Next Generation Sequencing experiment to detect rare mutations in pools of patients. A mathematical optimization problem is formulated for optimal pooling, with respect to minimization of the experiment cost. Then, two different strategies to replicate patients in pools are proposed, which have the advantage to decrease the overall costs. Finally, a multi-objective optimization formulation is proposed, where the trade-off between the probability to detect a mutation and overall costs is taken into account. The proposed solutions are devised in pursuance of the following advantages: (i) the solution guarantees mutations are detectable in the experimental setting, and (ii) the cost of the NGS experiment and its biological validation using Sanger sequencing is minimized. Simulations show replicating pools can decrease overall experimental cost, thus making pooling an interesting option.

  17. Rational design of new materials using recombinant structural proteins: Current state and future challenges.

    PubMed

    Sutherland, Tara D; Huson, Mickey G; Rapson, Trevor D

    2018-01-01

    Sequence-definable polymers are seen as a prerequisite for design of future materials, with many polymer scientists regarding such polymers as the holy grail of polymer science. Recombinant proteins are sequence-defined polymers. Proteins are dictated by DNA templates and therefore the sequence of amino acids in a protein is defined, and molecular biology provides tools that allow redesign of the DNA as required. Despite this advantage, proteins are underrepresented in materials science. In this publication we investigate the advantages and limitations of using proteins as templates for rational design of new materials. Crown Copyright © 2017. Published by Elsevier Inc. All rights reserved.

  18. High-throughput sequencing: a failure mode analysis.

    PubMed

    Yang, George S; Stott, Jeffery M; Smailus, Duane; Barber, Sarah A; Balasundaram, Miruna; Marra, Marco A; Holt, Robert A

    2005-01-04

    Basic manufacturing principles are becoming increasingly important in high-throughput sequencing facilities where there is a constant drive to increase quality, increase efficiency, and decrease operating costs. While high-throughput centres report failure rates typically on the order of 10%, the causes of sporadic sequencing failures are seldom analyzed in detail and have not, in the past, been formally reported. Here we report the results of a failure mode analysis of our production sequencing facility based on detailed evaluation of 9,216 ESTs generated from two cDNA libraries. Two categories of failures are described; process-related failures (failures due to equipment or sample handling) and template-related failures (failures that are revealed by close inspection of electropherograms and are likely due to properties of the template DNA sequence itself). Preventative action based on a detailed understanding of failure modes is likely to improve the performance of other production sequencing pipelines.

  19. Technical adequacy of bisulfite sequencing and pyrosequencing for detection of mitochondrial DNA methylation: Sources and avoidance of false-positive detection.

    PubMed

    Owa, Chie; Poulin, Matthew; Yan, Liying; Shioda, Toshi

    2018-01-01

    The existence of cytosine methylation in mammalian mitochondrial DNA (mtDNA) is a controversial subject. Because detection of DNA methylation depends on resistance of 5'-modified cytosines to bisulfite-catalyzed conversion to uracil, examined parameters that affect technical adequacy of mtDNA methylation analysis. Negative control amplicons (NCAs) devoid of cytosine methylation were amplified to cover the entire human or mouse mtDNA by long-range PCR. When the pyrosequencing template amplicons were gel-purified after bisulfite conversion, bisulfite pyrosequencing of NCAs did not detect significant levels of bisulfite-resistant cytosines (brCs) at ND1 (7 CpG sites) or CYTB (8 CpG sites) genes (CI95 = 0%-0.94%); without gel-purification, significant false-positive brCs were detected from NCAs (CI95 = 4.2%-6.8%). Bisulfite pyrosequencing of highly purified, linearized mtDNA isolated from human iPS cells or mouse liver detected significant brCs (~30%) in human ND1 gene when the sequencing primer was not selective in bisulfite-converted and unconverted templates. However, repeated experiments using a sequencing primer selective in bisulfite-converted templates almost completely (< 0.8%) suppressed brC detection, supporting the false-positive nature of brCs detected using the non-selective primer. Bisulfite-seq deep sequencing of linearized, gel-purified human mtDNA detected 9.4%-14.8% brCs for 9 CpG sites in ND1 gene. However, because all these brCs were associated with adjacent non-CpG brCs showing the same degrees of bisulfite resistance, DNA methylation in this mtDNA-encoded gene was not confirmed. Without linearization, data generated by bisulfite pyrosequencing or deep sequencing of purified mtDNA templates did not pass the quality control criteria. Shotgun bisulfite sequencing of human mtDNA detected extremely low levels of CpG methylation (<0.65%) over non-CpG methylation (<0.55%). Taken together, our study demonstrates that adequacy of mtDNA methylation analysis using methods dependent on bisulfite conversion needs to be established for each experiment, taking effects of incomplete bisulfite conversion and template impurity or topology into consideration.

  20. In vitro synthesis of minus-strand RNA by an isolated cereal yellow dwarf virus RNA-dependent RNA polymerase requires VPg and a stem-loop structure at the 3' end of the virus RNA.

    PubMed

    Osman, Toba A M; Coutts, Robert H A; Buck, Kenneth W

    2006-11-01

    Cereal yellow dwarf virus (CYDV) RNA has a 5'-terminal genome-linked protein (VPg). We have expressed the VPg region of the CYDV genome in bacteria and used the purified protein (bVPg) to raise an antiserum which was able to detect free VPg in extracts of CYDV-infected oat plants. A template-dependent RNA-dependent RNA polymerase (RdRp) has been produced from a CYDV membrane-bound RNA polymerase by treatment with BAL 31 nuclease. The RdRp was template specific, being able to utilize templates from CYDV plus- and minus-strand RNAs but not those of three unrelated viruses, Red clover necrotic mosaic virus, Cucumber mosaic virus, and Tobacco mosaic virus. RNA synthesis catalyzed by the RdRp required a 3'-terminal GU sequence and the presence of bVPg. Additionally, synthesis of minus-strand RNA on a plus-strand RNA template required the presence of a putative stem-loop structure near the 3' terminus of CYDV RNA. The base-paired stem, a single-nucleotide (A) bulge in the stem, and the sequence of a tetraloop were all required for the template activity. Evidence was produced showing that minus-strand synthesis in vitro was initiated by priming by bVPg at the 3' end of the template. The data are consistent with a model in which the RdRp binds to the stem-loop structure which positions the active site to recognize the 3'-terminal GU sequence for initiation of RNA synthesis by the addition of an A residue to VPg.

  1. In Vitro Synthesis of Minus-Strand RNA by an Isolated Cereal Yellow Dwarf Virus RNA-Dependent RNA Polymerase Requires VPg and a Stem-Loop Structure at the 3′ End of the Virus RNA▿

    PubMed Central

    Osman, Toba A. M.; Coutts, Robert H. A.; Buck, Kenneth W.

    2006-01-01

    Cereal yellow dwarf virus (CYDV) RNA has a 5′-terminal genome-linked protein (VPg). We have expressed the VPg region of the CYDV genome in bacteria and used the purified protein (bVPg) to raise an antiserum which was able to detect free VPg in extracts of CYDV-infected oat plants. A template-dependent RNA-dependent RNA polymerase (RdRp) has been produced from a CYDV membrane-bound RNA polymerase by treatment with BAL 31 nuclease. The RdRp was template specific, being able to utilize templates from CYDV plus- and minus-strand RNAs but not those of three unrelated viruses, Red clover necrotic mosaic virus, Cucumber mosaic virus, and Tobacco mosaic virus. RNA synthesis catalyzed by the RdRp required a 3′-terminal GU sequence and the presence of bVPg. Additionally, synthesis of minus-strand RNA on a plus-strand RNA template required the presence of a putative stem-loop structure near the 3′ terminus of CYDV RNA. The base-paired stem, a single-nucleotide (A) bulge in the stem, and the sequence of a tetraloop were all required for the template activity. Evidence was produced showing that minus-strand synthesis in vitro was initiated by priming by bVPg at the 3′ end of the template. The data are consistent with a model in which the RdRp binds to the stem-loop structure which positions the active site to recognize the 3′-terminal GU sequence for initiation of RNA synthesis by the addition of an A residue to VPg. PMID:16928757

  2. Infrared dim moving target tracking via sparsity-based discriminative classifier and convolutional network

    NASA Astrophysics Data System (ADS)

    Qian, Kun; Zhou, Huixin; Wang, Bingjian; Song, Shangzhen; Zhao, Dong

    2017-11-01

    Infrared dim and small target tracking is a great challenging task. The main challenge for target tracking is to account for appearance change of an object, which submerges in the cluttered background. An efficient appearance model that exploits both the global template and local representation over infrared image sequences is constructed for dim moving target tracking. A Sparsity-based Discriminative Classifier (SDC) and a Convolutional Network-based Generative Model (CNGM) are combined with a prior model. In the SDC model, a sparse representation-based algorithm is adopted to calculate the confidence value that assigns more weights to target templates than negative background templates. In the CNGM model, simple cell feature maps are obtained by calculating the convolution between target templates and fixed filters, which are extracted from the target region at the first frame. These maps measure similarities between each filter and local intensity patterns across the target template, therefore encoding its local structural information. Then, all the maps form a representation, preserving the inner geometric layout of a candidate template. Furthermore, the fixed target template set is processed via an efficient prior model. The same operation is applied to candidate templates in the CNGM model. The online update scheme not only accounts for appearance variations but also alleviates the migration problem. At last, collaborative confidence values of particles are utilized to generate particles' importance weights. Experiments on various infrared sequences have validated the tracking capability of the presented algorithm. Experimental results show that this algorithm runs in real-time and provides a higher accuracy than state of the art algorithms.

  3. mrtailor: a tool for PDB-file preparation for the generation of external restraints.

    PubMed

    Gruene, Tim

    2013-09-01

    Model building starting from, for example, a molecular-replacement solution with low sequence similarity introduces model bias, which can be difficult to detect, especially at low resolution. The program mrtailor removes low-similarity regions from a template PDB file according to sequence similarity between the target sequence and the template sequence and maps the target sequence onto the PDB file. The modified PDB file can be used to generate external restraints for low-resolution refinement with reduced model bias and can be used as a starting point for model building and refinement. The program can call ProSMART [Nicholls et al. (2012), Acta Cryst. D68, 404-417] directly in order to create external restraints suitable for REFMAC5 [Murshudov et al. (2011), Acta Cryst. D67, 355-367]. Both a command-line version and a GUI exist.

  4. A rapid and cost-effective method for sequencing pooled cDNA clones by using a combination of transposon insertion and Gateway technology.

    PubMed

    Morozumi, Takeya; Toki, Daisuke; Eguchi-Ogawa, Tomoko; Uenishi, Hirohide

    2011-09-01

    Large-scale cDNA-sequencing projects require an efficient strategy for mass sequencing. Here we describe a method for sequencing pooled cDNA clones using a combination of transposon insertion and Gateway technology. Our method reduces the number of shotgun clones that are unsuitable for reconstruction of cDNA sequences, and has the advantage of reducing the total costs of the sequencing project.

  5. Template-Directed Copolymerization, Random Walks along Disordered Tracks, and Fractals

    NASA Astrophysics Data System (ADS)

    Gaspard, Pierre

    2016-12-01

    In biology, template-directed copolymerization is the fundamental mechanism responsible for the synthesis of DNA, RNA, and proteins. More than 50 years have passed since the discovery of DNA structure and its role in coding genetic information. Yet, the kinetics and thermodynamics of information processing in DNA replication, transcription, and translation remain poorly understood. Challenging issues are the facts that DNA or RNA sequences constitute disordered media for the motion of polymerases or ribosomes while errors occur in copying the template. Here, it is shown that these issues can be addressed and sequence heterogeneity effects can be quantitatively understood within a framework revealing universal aspects of information processing at the molecular scale. In steady growth regimes, the local velocities of polymerases or ribosomes along the template are distributed as the continuous or fractal invariant set of a so-called iterated function system, which determines the copying error probabilities. The growth may become sublinear in time with a scaling exponent that can also be deduced from the iterated function system.

  6. An In Vitro Translation, Selection, and Amplification System for Peptide Nucleic Acids

    PubMed Central

    Brudno, Yevgeny; Birnbaum, Michael E.; Kleiner, Ralph E.; Liu, David R.

    2009-01-01

    Methods to evolve synthetic, rather than biological, polymers could significantly expand the functional potential of polymers that emerge from in vitro evolution. Requirements for synthetic polymer evolution include: (i) sequence-specific polymerization of synthetic building blocks on an amplifiable template; (ii) display of the newly translated polymer strand in a manner that allows it to adopt folded structures; (iii) selection of synthetic polymer libraries for desired binding or catalytic properties; and (iv) amplification of template sequences surviving selection in a manner that allows subsequent translation. Here we report the development of such a system for peptide nucleic acids (PNAs) using a set of twelve PNA pentamer building blocks. We validated the system by performing six iterated cycles of translation, selection, and amplification on a library of 4.3 × 108 PNA-encoding DNA templates and observed >1,000,000-fold overall enrichment of a template encoding a biotinylated (streptavidin-binding) PNA. These results collectively provide an experimental foundation for PNA evolution in the laboratory. PMID:20081830

  7. Pooled-DNA Sequencing for Elucidating New Genomic Risk Factors, Rare Variants Underlying Alzheimer's Disease.

    PubMed

    Jin, Sheng Chih; Benitez, Bruno A; Deming, Yuetiva; Cruchaga, Carlos

    2016-01-01

    Analyses of genome-wide association studies (GWAS) for complex disorders usually identify common variants with a relatively small effect size that only explain a small proportion of phenotypic heritability. Several studies have suggested that a significant fraction of heritability may be explained by low-frequency (minor allele frequency (MAF) of 1-5 %) and rare-variants that are not contained in the commercial GWAS genotyping arrays (Schork et al., Curr Opin Genet Dev 19:212, 2009). Rare variants can also have relatively large effects on risk for developing human diseases or disease phenotype (Cruchaga et al., PLoS One 7:e31039, 2012). However, it is necessary to perform next-generation sequencing (NGS) studies in a large population (>4,000 samples) to detect a significant rare-variant association. Several NGS methods, such as custom capture sequencing and amplicon-based sequencing, are designed to screen a small proportion of the genome, but most of these methods are limited in the number of samples that can be multiplexed (i.e. most sequencing kits only provide 96 distinct index). Additionally, the sequencing library preparation for 4,000 samples remains expensive and thus conducting NGS studies with the aforementioned methods are not feasible for most research laboratories.The need for low-cost large scale rare-variant detection makes pooled-DNA sequencing an ideally efficient and cost-effective technique to identify rare variants in target regions by sequencing hundreds to thousands of samples. Our recent work has demonstrated that pooled-DNA sequencing can accurately detect rare variants in targeted regions in multiple DNA samples with high sensitivity and specificity (Jin et al., Alzheimers Res Ther 4:34, 2012). In these studies we used a well-established pooled-DNA sequencing approach and a computational package, SPLINTER (short indel prediction by large deviation inference and nonlinear true frequency estimation by recursion) (Vallania et al., Genome Res 20:1711, 2010), for accurate identification of rare variants in large DNA pools. Given an average sequencing coverage of 30× per haploid genome, SPLINTER can detect rare variants and short indels up to 4 base pairs (bp) with high sensitivity and specificity (up to 1 haploid allele in a pool as large as 500 individuals). Step-by-step instructions on how to conduct pooled-DNA sequencing experiments and data analyses are described in this chapter.

  8. Non-radioactive detection of trinucleotide repeat size variability.

    PubMed

    Tomé, Stéphanie; Nicole, Annie; Gomes-Pereira, Mario; Gourdon, Genevieve

    2014-03-06

    Many human diseases are associated with the abnormal expansion of unstable trinucleotide repeat sequences. The mechanisms of trinucleotide repeat size mutation have not been fully dissected, and their understanding must be grounded on the detailed analysis of repeat size distributions in human tissues and animal models. Small-pool PCR (SP-PCR) is a robust, highly sensitive and efficient PCR-based approach to assess the levels of repeat size variation, providing both quantitative and qualitative data. The method relies on the amplification of a very low number of DNA molecules, through sucessive dilution of a stock genomic DNA solution. Radioactive Southern blot hybridization is sensitive enough to detect SP-PCR products derived from single template molecules, separated by agarose gel electrophoresis and transferred onto DNA membranes. We describe a variation of the detection method that uses digoxigenin-labelled locked nucleic acid probes. This protocol keeps the sensitivity of the original method, while eliminating the health risks associated with the manipulation of radiolabelled probes, and the burden associated with their regulation, manipulation and waste disposal.

  9. Properties of targeted preamplification in DNA and cDNA quantification.

    PubMed

    Andersson, Daniel; Akrap, Nina; Svec, David; Godfrey, Tony E; Kubista, Mikael; Landberg, Göran; Ståhlberg, Anders

    2015-01-01

    Quantification of small molecule numbers often requires preamplification to generate enough copies for accurate downstream enumerations. Here, we studied experimental parameters in targeted preamplification and their effects on downstream quantitative real-time PCR (qPCR). To evaluate different strategies, we monitored the preamplification reaction in real-time using SYBR Green detection chemistry followed by melting curve analysis. Furthermore, individual targets were evaluated by qPCR. The preamplification reaction performed best when a large number of primer pairs was included in the primer pool. In addition, preamplification efficiency, reproducibility and specificity were found to depend on the number of template molecules present, primer concentration, annealing time and annealing temperature. The amount of nonspecific PCR products could also be reduced about 1000-fold using bovine serum albumin, glycerol and formamide in the preamplification. On the basis of our findings, we provide recommendations how to perform robust and highly accurate targeted preamplification in combination with qPCR or next-generation sequencing.

  10. Homopolymer tail-mediated ligation PCR: a streamlined and highly efficient method for DNA cloning and library construction

    PubMed Central

    Lazinski, David W.; Camilli, Andrew

    2013-01-01

    The amplification of DNA fragments, cloned between user-defined 5′ and 3′ end sequences, is a prerequisite step in the use of many current applications including massively parallel sequencing (MPS). Here we describe an improved method, called homopolymer tail-mediated ligation PCR (HTML-PCR), that requires very little starting template, minimal hands-on effort, is cost-effective, and is suited for use in high-throughput and robotic methodologies. HTML-PCR starts with the addition of homopolymer tails of controlled lengths to the 3′ termini of a double-stranded genomic template. The homopolymer tails enable the annealing-assisted ligation of a hybrid oligonucleotide to the template's recessed 5′ ends. The hybrid oligonucleotide has a user-defined sequence at its 5′ end. This primer, together with a second primer composed of a longer region complementary to the homopolymer tail and fused to a second 5′ user-defined sequence, are used in a PCR reaction to generate the final product. The user-defined sequences can be varied to enable compatibility with a wide variety of downstream applications. We demonstrate our new method by constructing MPS libraries starting from nanogram and sub-nanogram quantities of Vibrio cholerae and Streptococcus pneumoniae genomic DNA. PMID:23311318

  11. Zinc-binding Domain of the Bacteriophage T7 DNA Primase Modulates Binding to the DNA Template*

    PubMed Central

    Lee, Seung-Joo; Zhu, Bin; Akabayov, Barak; Richardson, Charles C.

    2012-01-01

    The zinc-binding domain (ZBD) of prokaryotic DNA primases has been postulated to be crucial for recognition of specific sequences in the single-stranded DNA template. To determine the molecular basis for this role in recognition, we carried out homolog-scanning mutagenesis of the zinc-binding domain of DNA primase of bacteriophage T7 using a bacterial homolog from Geobacillus stearothermophilus. The ability of T7 DNA primase to catalyze template-directed oligoribonucleotide synthesis is eliminated by substitution of any five-amino acid residue-long segment within the ZBD. The most significant defect occurs upon substitution of a region (Pro-16 to Cys-20) spanning two cysteines that coordinate the zinc ion. The role of this region in primase function was further investigated by generating a protein library composed of multiple amino acid substitutions for Pro-16, Asp-18, and Asn-19 followed by genetic screening for functional proteins. Examination of proteins selected from the screening reveals no change in sequence-specific recognition. However, the more positively charged residues in the region facilitate DNA binding, leading to more efficient oligoribonucleotide synthesis on short templates. The results suggest that the zinc-binding mode alone is not responsible for sequence recognition, but rather its interaction with the RNA polymerase domain is critical for DNA binding and for sequence recognition. Consequently, any alteration in the ZBD that disturbs its conformation leads to loss of DNA-dependent oligoribonucleotide synthesis. PMID:23024359

  12. Co-circulation of West Nile virus and distinct insect-specific flaviviruses in Turkey.

    PubMed

    Ergünay, Koray; Litzba, Nadine; Brinkmann, Annika; Günay, Filiz; Sarıkaya, Yasemen; Kar, Sırrı; Örsten, Serra; Öter, Kerem; Domingo, Cristina; Erisoz Kasap, Özge; Özkul, Aykut; Mitchell, Luke; Nitsche, Andreas; Alten, Bülent; Linton, Yvonne-Marie

    2017-03-20

    Active vector surveillance provides an efficient tool for monitoring the presence or spread of emerging or re-emerging vector-borne viruses. This study was undertaken to investigate the circulation of flaviviruses. Mosquitoes were collected from 58 locations in 10 provinces across the Aegean, Thrace and Mediterranean Anatolian regions of Turkey in 2014 and 2015. Following morphological identification, mosquitoes were pooled and screened by nested and real-time PCR assays. Detected viruses were further characterised by sequencing. Positive pools were inoculated onto cell lines for virus isolation. Next generation sequencing was employed for genomic characterisation of the isolates. A total of 12,711 mosquito specimens representing 15 species were screened in 594 pools. Eleven pools (2%) were reactive in the virus screening assays. Sequencing revealed West Nile virus (WNV) in one Culex pipiens (s.l.) pool from Thrace. WNV sequence corresponded to lineage one clade 1a but clustered distinctly from the Turkish prototype isolate. In 10 pools, insect-specific flaviviruses were characterised as Culex theileri flavivirus in 5 pools of Culex theileri and one pool of Cx. pipiens (s.l.), Ochlerotatus caspius flavivirus in two pools of Aedes (Ochlerotatus) caspius, Flavivirus AV-2011 in one pool of Culiseta annulata, and an undetermined flavivirus in one pool of Uranotaenia unguiculata from the Aegean and Thrace regions. DNA forms or integration of the detected insect-specific flaviviruses were not observed. A virus strain, tentatively named as "Ochlerotatus caspius flavivirus Turkey", was isolated from an Ae. caspius pool in C6/36 cells. The viral genome comprised 10,370 nucleotides with a putative polyprotein of 3,385 amino acids that follows the canonical flavivirus polyprotein organisation. Sequence comparisons and phylogenetic analyses revealed the close relationship of this strain with Ochlerotatus caspius flavivirus from Portugal and Hanko virus from Finland. Several conserved structural and amino acid motifs were identified. We identified WNV and several distinct insect-specific flaviviruses during an extensive biosurveillance study of mosquitoes in various regions of Turkey in 2014 and 2015. Ongoing circulation of WNV is revealed, with an unprecedented genetic diversity. A probable replicating form of an insect flavivirus identified only in DNA form was detected.

  13. Microbial community profiles and microbial carbon cycling in Orca Basin

    NASA Astrophysics Data System (ADS)

    Hyde, A.; Teske, A.; Joye, S. B.; Montoya, J. P.; Nigro, L.

    2016-12-01

    Orca Basin is the largest seafloor brine pools in the world, covering over 400 km2 and reaching brine layer depths of 200 m. The brine pool contains water 8 times denser than the overlying seawater and is separated from the overlying water column by a sharp pycnocline that prevents vertical mixing. The transition from ambient seawater to brine occurs over 100 m [2150 to 2250 m] and is characterized by distinct changes in temperature, salinity, chemical conditions, oxygen, and organic matter concentration. The sharp brine-seawater interface results in a sharp pycnocline, which serves as a particle trap for sinking marine organic matter. Previous studies have used lipids to show that this organic-rich interface is host to an active microbial community which is potentially involved in deep-sea carbon remineralization and metal-cycling. Additionally, previous work on methane, ethane, and propane concentrations and 13C-isotopic signatures has also implicated the brine pool, as well as the interface, as sources for biogenic low-molecular weight hydrocarbons, resulting from the high concentration of suspended organic matter above and within the brine pool. Here we investigate the profiles of microbial community composition and metabolic potential in Orca Basin, ranging from seawater through the Orca Basin chemocline and into the deep Orca Basin brine. To characterize the microbial community and stratification, we used high-throughput bacterial and archaeal 16S rRNA gene sequencing of filtered water above, within, and below the Orca Basin chemocline. Our sequence data shows that three distinct and unique communities exist in the Orca Basin water column. We also use thermodynamic modeling of hydrocarbon degradation to investigate the favorability of C1-C3 hydrocarbon oxidation at the brine-seawater interface and the potential for Orca Basin to serve as a deep-sea hydrocarbon sink.

  14. Merida virus, a putative novel rhabdovirus discovered in Culex and Ochlerotatus spp. mosquitoes in the Yucatan Peninsula of Mexico.

    PubMed

    Charles, Jermilia; Firth, Andrew E; Loroño-Pino, Maria A; Garcia-Rejon, Julian E; Farfan-Ale, Jose A; Lipkin, W Ian; Blitvich, Bradley J; Briese, Thomas

    2016-04-01

    Sequences corresponding to a putative, novel rhabdovirus [designated Merida virus (MERDV)] were initially detected in a pool of Culex quinquefasciatus collected in the Yucatan Peninsula of Mexico. The entire genome was sequenced, revealing 11 798 nt and five major ORFs, which encode the nucleoprotein (N), phosphoprotein (P), matrix protein (M), glycoprotein (G) and RNA-dependent RNA polymerase (L). The deduced amino acid sequences of the N, G and L proteins have no more than 24, 38 and 43 % identity, respectively, to the corresponding sequences of all other known rhabdoviruses, whereas those of the P and M proteins have no significant identity with any sequences in GenBank and their identity is only suggested based on their genome position. Using specific reverse transcription-PCR assays established from the genome sequence, 27 571 C. quinquefasciatus which had been sorted in 728 pools were screened to assess the prevalence of MERDV in nature and 25 pools were found positive. The minimal infection rate (calculated as the number of positive mosquito pools per 1000 mosquitoes tested) was 0.9, and similar for both females and males. Screening another 140 pools of 5484 mosquitoes belonging to four other genera identified positive pools of Ochlerotatus spp. mosquitoes, indicating that the host range is not restricted to C. quinquefasciatus. Attempts to isolate MERDV in C6/36 and Vero cells were unsuccessful. In summary, we provide evidence that a previously undescribed rhabdovirus occurs in mosquitoes in Mexico.

  15. Evaluation of Different Oligonucleotide Base Substitutions at CpG Binding sites in Multiplex Bisulfite-PCR sequencing.

    PubMed

    Lu, Jennifer; Ru, Kelin; Candiloro, Ida; Dobrovic, Alexander; Korbie, Darren; Trau, Matt

    2017-03-22

    Multiplex bisulfite-PCR sequencing is a convenient and scalable method for the quantitative determination of the methylation state of target DNA regions. A challenge of this application is the presence of CpGs in the same region where primers are being placed. A common solution to the presence of CpGs within a primer-binding region is to substitute a base degeneracy at the cytosine position. However, the efficacy of different substitutions and the extent to which bias towards methylated or unmethylated templates may occur has never been evaluated in bisulfite multiplex sequencing applications. In response, we examined the performance of four different primer substitutions at the cytosine position of CpG's contained within the PCR primers. In this study, deoxyinosine-, 5-nitroindole-, mixed-base primers and primers with an abasic site were evaluated across a series of methylated controls. Primers that contained mixed- or deoxyinosine- base modifications performed most robustly. Mixed-base primers were further selected to determine the conditions that induce bias towards methylated templates. This identified an optimized set of conditions where the methylated state of bisulfite DNA templates can be accurately assessed using mixed-base primers, and expands the scope of bisulfite resequencing assays when working with challenging templates.

  16. Homology modeling of a Class A GPCR in the inactive conformation: A quantitative analysis of the correlation between model/template sequence identity and model accuracy.

    PubMed

    Costanzi, Stefano; Skorski, Matthew; Deplano, Alessandro; Habermehl, Brett; Mendoza, Mary; Wang, Keyun; Biederman, Michelle; Dawson, Jessica; Gao, Jia

    2016-11-01

    With the present work we quantitatively studied the modellability of the inactive state of Class A G protein-coupled receptors (GPCRs). Specifically, we constructed models of one of the Class A GPCRs for which structures solved in the inactive state are available, namely the β 2 AR, using as templates each of the other class members for which structures solved in the inactive state are also available. Our results showed a detectable linear correlation between model accuracy and model/template sequence identity. This suggests that the likely accuracy of the homology models that can be built for a given receptor can be generally forecasted on the basis of the available templates. We also probed whether sequence alignments that allow for the presence of gaps within the transmembrane domains to account for structural irregularities afford better models than the classical alignment procedures that do not allow for the presence of gaps within such domains. As our results indicated, although the overall differences are very subtle, the inclusion of internal gaps within the transmembrane domains has a noticeable a beneficial effect on the local structural accuracy of the domain in question. Copyright © 2016 Elsevier Inc. All rights reserved.

  17. Enabling multiplexed testing of pooled donor cells through whole-genome sequencing.

    PubMed

    Chan, Yingleong; Chan, Ying Kai; Goodman, Daniel B; Guo, Xiaoge; Chavez, Alejandro; Lim, Elaine T; Church, George M

    2018-04-19

    We describe a method that enables the multiplex screening of a pool of many different donor cell lines. Our method accurately predicts each donor proportion from the pool without requiring the use of unique DNA barcodes as markers of donor identity. Instead, we take advantage of common single nucleotide polymorphisms, whole-genome sequencing, and an algorithm to calculate the proportions from the sequencing data. By testing using simulated and real data, we showed that our method robustly predicts the individual proportions from a mixed-pool of numerous donors, thus enabling the multiplexed testing of diverse donor cells en masse.More information is available at https://pgpresearch.med.harvard.edu/poolseq/.

  18. The Limits of Template-Directed Synthesis with Nucleoside-5'-Phosphoro(2-Methyl) Imidazolides

    NASA Technical Reports Server (NTRS)

    Hill, Aubrey R., Jr.; Orgel, Leslie E.; Wu, Taifeng

    1993-01-01

    In earlier work we have shown that C-rich templates containing isolated A, T or G residues and short oligo(G) sequences can be copied effectively using nucleoside-5'-phosphoro(2-methyl)imidazolides as substrates. We now show that isolated A or T residues within an oligo(G) sequence are a complete block to copying and that an isolated C residue is copied inefficiently. Replication is possible only if there are two complementary oligonucleotides each of which acts as a template to facilitate the synthesis of the other. We emphasize the severity of the problems that need to be overcome to make possible non-enzymatic replication in homogeneous aqueous solution. We conclude that an efficient catalyst was involved in the origin of polynucleotide replication.

  19. Noise-robust speech recognition through auditory feature detection and spike sequence decoding.

    PubMed

    Schafer, Phillip B; Jin, Dezhe Z

    2014-03-01

    Speech recognition in noisy conditions is a major challenge for computer systems, but the human brain performs it routinely and accurately. Automatic speech recognition (ASR) systems that are inspired by neuroscience can potentially bridge the performance gap between humans and machines. We present a system for noise-robust isolated word recognition that works by decoding sequences of spikes from a population of simulated auditory feature-detecting neurons. Each neuron is trained to respond selectively to a brief spectrotemporal pattern, or feature, drawn from the simulated auditory nerve response to speech. The neural population conveys the time-dependent structure of a sound by its sequence of spikes. We compare two methods for decoding the spike sequences--one using a hidden Markov model-based recognizer, the other using a novel template-based recognition scheme. In the latter case, words are recognized by comparing their spike sequences to template sequences obtained from clean training data, using a similarity measure based on the length of the longest common sub-sequence. Using isolated spoken digits from the AURORA-2 database, we show that our combined system outperforms a state-of-the-art robust speech recognizer at low signal-to-noise ratios. Both the spike-based encoding scheme and the template-based decoding offer gains in noise robustness over traditional speech recognition methods. Our system highlights potential advantages of spike-based acoustic coding and provides a biologically motivated framework for robust ASR development.

  20. Rate in template-directed polymer synthesis.

    PubMed

    Saito, Takuya

    2014-06-01

    We discuss the temporal efficiency of template-directed polymer synthesis, such as DNA replication and transcription, under a given template string. To weigh the synthesis speed and accuracy on the same scale, we propose a template-directed synthesis (TDS) rate, which contains an expression analogous to that for the Shannon entropy. Increasing the synthesis speed accelerates the TDS rate, but the TDS rate is lowered if the produced sequences are diversified. We apply the TDS rate to some production system models and investigate how the balance between the speed and the accuracy is affected by changes in the system conditions.

  1. Use of polymerase chain reaction technique to confirm VecTest screening results in Plasmodium falciparum and Plasmodium vivax VK 210 laboratory-infected Anopheles stephensi mosquitoes.

    PubMed

    Santos-Ciminera, Patricia D; Acheé, Nicole L; Quinnan, Gerald V; Roberts, Donald R

    2004-09-01

    We evaluated polymerase chain reaction (PCR) to confirm immunoassays for malaria parasites in mosquito pools after a failure to detect malaria with PCR during an outbreak in which pools tested positive using VecTest and enzyme-linked immunosorbent assay (ELISA). We combined VecTest, ELISA, and PCR to detect Plasmodium falciparum and Plasmodium vivax VK 210. Each mosquito pool, prepared in triplicate, consisted of 1 exposed Anopheles stephensi and up to 9 unfed mosquitoes. The results of VecTest and ELISA were concordant. DNA from a subset of the pools, 1 representative of each ratio of infected to uninfected mosquitoes, was extracted and used as template in PCR. All P. vivax pools were PCR positive but some needed additional processing for removal of apparent inhibitors before positive results were obtained. One of the pools selected for P. falciparum was negative by PCR, probably because of losses or contamination during DNA extraction; 2 remaining pools at this ratio were PCR positive. Testing pools by VecTest, ELISA, and PCR is feasible, and PCR is useful for confirmation of immunoassays. An additional step might be needed to remove potential inhibitors from pools prior to PCR.

  2. Computational protein design: validation and possible relevance as a tool for homology searching and fold recognition.

    PubMed

    Schmidt Am Busch, Marcel; Sedano, Audrey; Simonson, Thomas

    2010-05-05

    Protein fold recognition usually relies on a statistical model of each fold; each model is constructed from an ensemble of natural sequences belonging to that fold. A complementary strategy may be to employ sequence ensembles produced by computational protein design. Designed sequences can be more diverse than natural sequences, possibly avoiding some limitations of experimental databases. WE EXPLORE THIS STRATEGY FOR FOUR SCOP FAMILIES: Small Kunitz-type inhibitors (SKIs), Interleukin-8 chemokines, PDZ domains, and large Caspase catalytic subunits, represented by 43 structures. An automated procedure is used to redesign the 43 proteins. We use the experimental backbones as fixed templates in the folded state and a molecular mechanics model to compute the interaction energies between sidechain and backbone groups. Calculations are done with the Proteins@Home volunteer computing platform. A heuristic algorithm is used to scan the sequence and conformational space, yielding 200,000-300,000 sequences per backbone template. The results confirm and generalize our earlier study of SH2 and SH3 domains. The designed sequences ressemble moderately-distant, natural homologues of the initial templates; e.g., the SUPERFAMILY, profile Hidden-Markov Model library recognizes 85% of the low-energy sequences as native-like. Conversely, Position Specific Scoring Matrices derived from the sequences can be used to detect natural homologues within the SwissProt database: 60% of known PDZ domains are detected and around 90% of known SKIs and chemokines. Energy components and inter-residue correlations are analyzed and ways to improve the method are discussed. For some families, designed sequences can be a useful complement to experimental ones for homologue searching. However, improved tools are needed to extract more information from the designed profiles before the method can be of general use.

  3. Genotyping of 25 leukemia-associated genes in a single work flow by next-generation sequencing technology with low amounts of input template DNA.

    PubMed

    Rinke, Jenny; Schäfer, Vivien; Schmidt, Mathias; Ziermann, Janine; Kohlmann, Alexander; Hochhaus, Andreas; Ernst, Thomas

    2013-08-01

    We sought to establish a convenient, sensitive next-generation sequencing (NGS) method for genotyping the 26 most commonly mutated leukemia-associated genes in a single work flow and to optimize this method for low amounts of input template DNA. We designed 184 PCR amplicons that cover all of the candidate genes. NGS was performed with genomic DNA (gDNA) from a cohort of 10 individuals with chronic myelomonocytic leukemia. The results were compared with NGS data obtained from sequencing of DNA generated by whole-genome amplification (WGA) of 20 ng template gDNA. Differences between gDNA and WGA samples in variant frequencies were determined for 2 different WGA kits. For gDNA samples, 25 of 26 genes were successfully sequenced with a sensitivity of 5%, which was achieved by a median coverage of 492 reads (range, 308-636 reads) per amplicon. We identified 24 distinct mutations in 11 genes. With WGA samples, we reliably detected all mutations above 5% sensitivity with a median coverage of 506 reads (range, 256-653 reads) per amplicon. With all variants included in the analysis, WGA amplification by the 2 kits tested yielded differences in variant frequencies that ranged from -28.19% to +9.94% [mean (SD) difference, -0.2% (4.08%)] and from -35.03% to +18.67% [mean difference, -0.75% (5.12%)]. Our method permits simultaneous analysis of a wide range of leukemia-associated target genes in a single sequencing run. NGS can be performed after WGA of template DNA for reliable detection of variants without introducing appreciable bias.

  4. Droplet-Based Pyrosequencing Using Digital Microfluidics

    PubMed Central

    Boles, Deborah J.; Benton, Jonathan L.; Siew, Germaine J.; Levy, Miriam H.; Thwar, Prasanna K.; Sandahl, Melissa A.; Rouse, Jeremy L.; Perkins, Lisa C.; Sudarsan, Arjun P.; Jalili, Roxana; Pamula, Vamsee K.; Srinivasan, Vijay; Fair, Richard B.; Griffin, Peter B.; Eckhardt, Allen E.; Pollack, Michael G.

    2013-01-01

    The feasibility of implementing pyrosequencing chemistry within droplets using electrowetting-based digital microfluidics is reported. An array of electrodes patterned on a printed-circuit board was used to control the formation, transportation, merging, mixing, and splitting of submicroliter-sized droplets contained within an oil-filled chamber. A three-enzyme pyrosequencing protocol was implemented in which individual droplets contained enzymes, deoxyribonucleotide triphosphates (dNTPs), and DNA templates. The DNA templates were anchored to magnetic beads which enabled them to be thoroughly washed between nucleotide additions. Reagents and protocols were optimized to maximize signal over background, linearity of response, cycle efficiency, and wash efficiency. As an initial demonstration of feasibility, a portion of a 229 bp Candida parapsilosis template was sequenced using both a de novo protocol and a resequencing protocol. The resequencing protocol generated over 60 bp of sequence with 100% sequence accuracy based on raw pyrogram levels. Excellent linearity was observed for all of the homopolymers (two, three, or four nucleotides) contained in the C. parapsilosis sequence. With improvements in microfluidic design it is expected that longer reads, higher throughput, and improved process integration (i.e., “sample-to-sequence” capability) could eventually be achieved using this low-cost platform. PMID:21932784

  5. Induction of Strain-Transcending Immunity against Plasmodium chabaudi adami Malaria with a Multiepitope DNA Vaccine

    PubMed Central

    Scorza, T.; Grubb, K.; Smooker, P.; Rainczuk, A.; Proll, D.; Spithill, T. W.

    2005-01-01

    A major goal of current malaria vaccine programs is to develop multivalent vaccines that will protect humans against the many heterologous malaria strains that circulate in endemic areas. We describe a multiepitope DNA vaccine, derived from a genomic Plasmodium chabaudi adami DS DNA expression library of 30,000 plasmids, which induces strain-transcending immunity in mice against challenge with P. c. adami DK. Segregation of this library and DNA sequence analysis identified vaccine subpools encoding open reading frames (ORFs)/peptides of >9 amino acids [aa] (the V9+ pool, 303 plasmids) and >50 aa (V50+ pool, 56 plasmids), respectively. The V9+ and V50+ plasmid vaccine subpools significantly cross-protected mice against heterologous P. c. adami DK challenge, and protection correlated with the induction of both specific gamma interferon production by splenic cells and opsonizing antibodies. Bioinformatic analysis showed that 22 of the V50+ ORFs were polypeptides conserved among three or more Plasmodium spp., 13 of which are predicted hypothetical proteins. Twenty-nine of these ORFs are orthologues of predicted Plasmodium falciparum sequences known to be expressed in the blood stage, suggesting that this vaccine pool encodes multiple blood-stage antigens. The results have implications for malaria vaccine design by providing proof-of-principle that significant strain-transcending immunity can be induced using multiepitope blood-stage DNA vaccines and suggest that both cellular responses and opsonizing antibodies are necessary for optimal protection against P. c. adami. PMID:15845504

  6. Template-directed synthesis on the pentanucleotide CpCpGpCpC

    NASA Technical Reports Server (NTRS)

    Inoue, T.; Joyce, G. F.; Grzeskowiak, K.; Orgel, L. E.; Brown, J. M.; Reese, C. B.

    1984-01-01

    Experiments in which CpCpGpCpC is used as a template to facilitate the co-oligomerization of 2-MeImpG and 2-MeImpC are described. It is shown that 3' to 5' prime-linked pGpGpCpGpG, whose sequence is complementary to that of the template, is substantially the most adundant pentameric product of the template-directed reaction. The yield of pGpGpCpGpG is never large (less than 20 percent), presumably becauase off-template reactions consume template-directed products. Thus pGpGpCpGpG is converted to the various isomers of G5C and G4C2 by off-template terminal addition of G or C. The 3' to 5' isomer of GpG is elongated on the template to give GpGpC, GpGpCpG, and GpGpCpGpG, while the 2' to 5' isomer does not initiate the synthesis of detectable amounts of longer oligomers.

  7. HDOCK: a web server for protein–protein and protein–DNA/RNA docking based on a hybrid strategy

    PubMed Central

    Yan, Yumeng; Zhang, Di; Zhou, Pei; Li, Botong

    2017-01-01

    Abstract Protein–protein and protein–DNA/RNA interactions play a fundamental role in a variety of biological processes. Determining the complex structures of these interactions is valuable, in which molecular docking has played an important role. To automatically make use of the binding information from the PDB in docking, here we have presented HDOCK, a novel web server of our hybrid docking algorithm of template-based modeling and free docking, in which cases with misleading templates can be rescued by the free docking protocol. The server supports protein–protein and protein–DNA/RNA docking and accepts both sequence and structure inputs for proteins. The docking process is fast and consumes about 10–20 min for a docking run. Tested on the cases with weakly homologous complexes of <30% sequence identity from five docking benchmarks, the HDOCK pipeline tied with template-based modeling on the protein–protein and protein–DNA benchmarks and performed better than template-based modeling on the three protein–RNA benchmarks when the top 10 predictions were considered. The performance of HDOCK became better when more predictions were considered. Combining the results of HDOCK and template-based modeling by ranking first of the template-based model further improved the predictive power of the server. The HDOCK web server is available at http://hdock.phys.hust.edu.cn/. PMID:28521030

  8. Phylogenetic tree of 16s rRNA sequences from sulfate-reducing bacteria in a sandy marine sediment

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Devereux, R.; Mundfrom, G.W.

    1994-01-01

    Phylogenetic divergence among sulfate-reducing bateria in an estuarine sediment sample was investigated by PCR amplification and comparison of partial 16S rDNA sequences. Twenty unique 16S rDNA sequences were found, 12 from delta subclass bacteria based on overall sequence similarity (82-91%). Two successive PCR amplifications were used to obtain and clone the 16S rDNA. The first reaction used templates derived from phosphate-buffered saline washed sediment with primers designed to amplify nearly full-length bacterial domain 16S rDNA. A produce from a first reaction was used as template in a second reaction with primers designed to selectivity amplify a region of 16S rDNAmore » genes of sulfate-reducing bacteria. A phylogenetic tree incorporating the cloned sequences suggests the presence of yet to be cultivated lines of sulfate-reducing bacteria within the sediment sample.« less

  9. Gdf5 progenitors give rise to fibrocartilage cells that mineralize via hedgehog signaling to form the zonal enthesis.

    PubMed

    Dyment, Nathaniel A; Breidenbach, Andrew P; Schwartz, Andrea G; Russell, Ryan P; Aschbacher-Smith, Lindsey; Liu, Han; Hagiwara, Yusuke; Jiang, Rulang; Thomopoulos, Stavros; Butler, David L; Rowe, David W

    2015-09-01

    The sequence of events that leads to the formation of a functionally graded enthesis is not clearly defined. The current study demonstrates that clonal expansion of Gdf5 progenitors contributes to linear growth of the enthesis. Prior to mineralization, Col1+ cells in the enthesis appose Col2+ cells of the underlying primary cartilage. At the onset of enthesis mineralization, cells at the base of the enthesis express alkaline phosphatase, Indian hedgehog, and ColX as they mineralize. The mineralization front then extends towards the tendon midsubstance as cells above the front become encapsulated in mineralized fibrocartilage over time. The hedgehog (Hh) pathway regulates this process, as Hh-responsive Gli1+ cells within the developing enthesis mature from unmineralized to mineralized fibrochondrocytes in response to activated signaling. Hh signaling is required for mineralization, as tissue-specific deletion of its obligate transducer Smoothened in the developing tendon and enthesis cells leads to significant reductions in the apposition of mineralized fibrocartilage. Together, these findings provide a spatiotemporal map of events - from expansion of the embryonic progenitor pool to synthesis of the collagen template and finally mineralization of this template - that leads to the formation of the mature zonal enthesis. These results can inform future tendon-to-bone repair strategies to create a mechanically functional enthesis in which tendon collagen fibers are anchored to bone through mineralized fibrocartilage. Copyright © 2015 Elsevier Inc. All rights reserved.

  10. Targeting the Atypical Chemokine Receptor ACKR3/CXCR7: Phase 1 - Phage Display Peptide Identification and Characterization.

    PubMed

    Vestal, R D; LaJeunesse, D R; Taylor, E W

    2016-01-01

    One of the greatest challenges in fighting cancer is cell targeting and biomarker selection. The Atypical Chemokine Receptor ACKR3/CXCR7 is expressed on many cancer cell types, including breast cancer and glioblastoma, and binds the endogenous ligands SDF1/CXCL12 and ITAC/CXCL11. A 20 amino acid region of the ACKR3/CXCR7 N-terminus was synthesized and targeted with the NEB PhD-7 Phage Display Peptide Library. Twenty-nine phages were isolated and heptapeptide inserts sequenced; of these, 23 sequences were unique. A 3D molecular model was created for the ACKR3/CXCR7 N-terminus by mutating the corresponding region of the crystal structure of CXCR4 with bound SDF1/CXCL12. A ClustalW alignment was performed on each peptide sequence using the entire SDF1/CXCL12 sequence as the template. The 23-peptide sequences showed similarity to three distinct regions of the SDF1/CXCL12 molecule. A 3D molecular model was made for each of the phage peptide inserts to visually identify potential areas of steric interference of peptides that simulated CXCL12 regions not in contact with the receptor's Nterminus. An ELISA analysis of the relative binding affinity between the peptides identified 9 peptides with statistically significant results. The candidate pool of 9 peptides was further reduced to 3 peptides based on their affinity for the targeted N-terminus region peptide versus no target peptide present or a scrambled negative control peptide. The results clearly show the Phage Display protocol can be used to target a synthesized region of the ACKR3/CXCR7 N-terminus. The 3 peptides chosen, P20, P3, and P9, will be the basis for further targeting studies.

  11. Models for mirror symmetry breaking via β-sheet-controlled copolymerization: (i) mass balance and (ii) probabilistic treatment.

    PubMed

    Blanco, Celia; Hochberg, David

    2012-12-06

    Experimental mechanisms that yield the growth of homochiral copolymers over their heterochiral counterparts have been advocated by Lahav and co-workers. These chiral amplification mechanisms proceed through racemic β-sheet-controlled polymerization operative in both surface crystallites as well as in solution. We develop two complementary theoretical models for these template-induced desymmetrization processes leading to multicomponent homochiral copolymers. First, assuming reversible β-sheet formation, the equilibrium between the free monomer pool and the polymer strand within the template is assumed. This yields coupled nonlinear mass balance equations whose solutions are used to calculate enantiomeric excesses and average lengths of the homochiral chains formed. The second approach is a probabilistic treatment based on random polymerization. The occlusion probabilities depend on the polymerization activation energies for each monomer species and are proportional to the concentrations of the monomers in solution in the constant pool approximation. The monomer occlusion probabilities are represented geometrically in terms of unit simplexes from which conditions for maximizing or minimizing the likelihood for mirror symmetry breaking can be determined.

  12. An automated method for modeling proteins on known templates using distance geometry.

    PubMed

    Srinivasan, S; March, C J; Sudarsanam, S

    1993-02-01

    We present an automated method incorporated into a software package, FOLDER, to fold a protein sequence on a given three-dimensional (3D) template. Starting with the sequence alignment of a family of homologous proteins, tertiary structures are modeled using the known 3D structure of one member of the family as a template. Homologous interatomic distances from the template are used as constraints. For nonhomologous regions in the model protein, the lower and the upper bounds for the interatomic distances are imposed by steric constraints and the globular dimensions of the template, respectively. Distance geometry is used to embed an ensemble of structures consistent with these distance bounds. Structures are selected from this ensemble based on minimal distance error criteria, after a penalty function optimization step. These structures are then refined using energy optimization methods. The method is tested by simulating the alpha-chain of horse hemoglobin using the alpha-chain of human hemoglobin as the template and by comparing the generated models with the crystal structure of the alpha-chain of horse hemoglobin. We also test the packing efficiency of this method by reconstructing the atomic positions of the interior side chains beyond C beta atoms of a protein domain from a known 3D structure. In both test cases, models retain the template constraints and any additionally imposed constraints while the packing of the interior residues is optimized with no short contacts or bond deformations. To demonstrate the use of this method in simulating structures of proteins with nonhomologous disulfides, we construct a model of murine interleukin (IL)-4 using the NMR structure of human IL-4 as the template. The resulting geometry of the nonhomologous disulfide in the model structure for murine IL-4 is consistent with standard disulfide geometry.

  13. Merida virus, a putative novel rhabdovirus discovered in Culex and Ochlerotatus spp. mosquitoes in the Yucatan Peninsula of Mexico

    PubMed Central

    Charles, Jermilia; Firth, Andrew E.; Loroño-Pino, Maria A.; Garcia-Rejon, Julian E.; Farfan-Ale, Jose A.; Lipkin, W. Ian; Briese, Thomas

    2016-01-01

    Sequences corresponding to a putative, novel rhabdovirus [designated Merida virus (MERDV)] were initially detected in a pool of Culex quinquefasciatus collected in the Yucatan Peninsula of Mexico. The entire genome was sequenced, revealing 11 798 nt and five major ORFs, which encode the nucleoprotein (N), phosphoprotein (P), matrix protein (M), glycoprotein (G) and RNA-dependent RNA polymerase (L). The deduced amino acid sequences of the N, G and L proteins have no more than 24, 38 and 43 % identity, respectively, to the corresponding sequences of all other known rhabdoviruses, whereas those of the P and M proteins have no significant identity with any sequences in GenBank and their identity is only suggested based on their genome position. Using specific reverse transcription-PCR assays established from the genome sequence, 27 571 C. quinquefasciatus which had been sorted in 728 pools were screened to assess the prevalence of MERDV in nature and 25 pools were found positive. The minimal infection rate (calculated as the number of positive mosquito pools per 1000 mosquitoes tested) was 0.9, and similar for both females and males. Screening another 140 pools of 5484 mosquitoes belonging to four other genera identified positive pools of Ochlerotatus spp. mosquitoes, indicating that the host range is not restricted to C. quinquefasciatus. Attempts to isolate MERDV in C6/36 and Vero cells were unsuccessful. In summary, we provide evidence that a previously undescribed rhabdovirus occurs in mosquitoes in Mexico. PMID:26868915

  14. A general strategy for cloning viroids and other small circular RNAs that uses minimal amounts of template and does not require prior knowledge of its sequence.

    PubMed

    Navarro, B; Daròs, J A; Flores, R

    1996-01-01

    Two PCR-based methods are described for obtaining clones of small circular RNAs of unknown sequence and for which only minute amounts are available. To avoid introducing any assumption about the RNA sequence, synthesis of the cDNAs is initiated with random primers. The cDNA population is then PCR-amplified using a primer whose sequence is present at both sides of the cDNAs, since they have been obtained with random hexamers and then a linker with the sequence of the PCR primer has been ligated to their termini, or because the cDNAs have been synthesized with an oligonucleotide that contains the sequence of the PCR primer at its 5' end and six randomized positions at its 3' end. The procedures need only approximately 50 ng of purified RNA template. The reasons for the emergence of cloning artifacts and precautions to avoid them are discussed.

  15. Short template switch events explain mutation clusters in the human genome.

    PubMed

    Löytynoja, Ari; Goldman, Nick

    2017-06-01

    Resequencing efforts are uncovering the extent of genetic variation in humans and provide data to study the evolutionary processes shaping our genome. One recurring puzzle in both intra- and inter-species studies is the high frequency of complex mutations comprising multiple nearby base substitutions or insertion-deletions. We devised a generalized mutation model of template switching during replication that extends existing models of genome rearrangement and used this to study the role of template switch events in the origin of short mutation clusters. Applied to the human genome, our model detects thousands of template switch events during the evolution of human and chimp from their common ancestor and hundreds of events between two independently sequenced human genomes. Although many of these are consistent with a template switch mechanism previously proposed for bacteria, our model also identifies new types of mutations that create short inversions, some flanked by paired inverted repeats. The local template switch process can create numerous complex mutation patterns, including hairpin loop structures, and explains multinucleotide mutations and compensatory substitutions without invoking positive selection, speculative mechanisms, or implausible coincidence. Clustered sequence differences are challenging for current mapping and variant calling methods, and we show that many erroneous variant annotations exist in human reference data. Local template switch events may have been neglected as an explanation for complex mutations because of biases in commonly used analyses. Incorporation of our model into reference-based analysis pipelines and comparisons of de novo assembled genomes will lead to improved understanding of genome variation and evolution. © 2017 Löytynoja and Goldman; Published by Cold Spring Harbor Laboratory Press.

  16. Asymmetric segregation of template DNA strands in basal-like human breast cancer cell lines

    PubMed Central

    2013-01-01

    Background and methods Stem or progenitor cells from healthy tissues have the capacity to co-segregate their template DNA strands during mitosis. Here, we set out to test whether breast cancer cell lines also possess the ability to asymmetrically segregate their template DNA strands via non-random chromosome co-segregation, and whether this ability correlates with certain properties attributed to breast cancer stem cells (CSCs). We quantified the frequency of asymmetric segregation of template DNA strands in 12 human breast cancer cell lines, and correlated the frequency to molecular subtype, CD44+/CD24-/lo phenotype, and invasion/migration ability. We tested if co-culture with human mesenchymal stem cells, which are known to increase self-renewal, can alter the frequency of asymmetric segregation of template DNA in breast cancer. Results We found a positive correlation between asymmetric segregation of template DNA and the breast cancer basal-like and claudin-low subtypes. There was an inverse correlation between asymmetric segregation of template DNA and Her2 expression. Breast cancer samples with evidence of asymmetric segregation of template DNA had significantly increased invasion and borderline significantly increased migration abilities. Samples with high CD44+/CD24-/lo surface expression were more likely to harbor a consistent population of cells that asymmetrically segregated its template DNA; however, symmetric self-renewal was enriched in the CD44+/CD24-/lo population. Co-culturing breast cancer cells with human mesenchymal stem cells expanded the breast CSC pool and decreased the frequency of asymmetric segregation of template DNA. Conclusions Breast cancer cells within the basal-like subtype can asymmetrically segregate their template DNA strands through non-random chromosome segregation. The frequency of asymmetric segregation of template DNA can be modulated by external factors that influence expansion or self-renewal of CSC populations. Future studies to uncover the underlying mechanisms driving asymmetric segregation of template DNA and dictating cell fate at the time of cell division may explain how CSCs are maintained in tumors. PMID:24238140

  17. A Problem-Solving Template for Integrating Qualitative and Quantitative Physics Instruction

    ERIC Educational Resources Information Center

    Fink, Janice M.; Mankey, Gary J.

    2010-01-01

    A problem-solving template enables a methodology of instruction that integrates aspects of both sequencing and conceptual learning. It is designed to enhance critical-thinking skills when used within the framework of a learner-centered approach to teaching, where regular, thorough assessments of student learning are key components of the…

  18. Automated side-chain model building and sequence assignment by template matching.

    PubMed

    Terwilliger, Thomas C

    2003-01-01

    An algorithm is described for automated building of side chains in an electron-density map once a main-chain model is built and for alignment of the protein sequence to the map. The procedure is based on a comparison of electron density at the expected side-chain positions with electron-density templates. The templates are constructed from average amino-acid side-chain densities in 574 refined protein structures. For each contiguous segment of main chain, a matrix with entries corresponding to an estimate of the probability that each of the 20 amino acids is located at each position of the main-chain model is obtained. The probability that this segment corresponds to each possible alignment with the sequence of the protein is estimated using a Bayesian approach and high-confidence matches are kept. Once side-chain identities are determined, the most probable rotamer for each side chain is built into the model. The automated procedure has been implemented in the RESOLVE software. Combined with automated main-chain model building, the procedure produces a preliminary model suitable for refinement and extension by an experienced crystallographer.

  19. Stem-Loop RNA Hairpins in Giant Viruses: Invading rRNA-Like Repeats and a Template Free RNA

    PubMed Central

    Seligmann, Hervé; Raoult, Didier

    2018-01-01

    We examine the hypothesis that de novo template-free RNAs still form spontaneously, as they did at the origins of life, invade modern genomes, contribute new genetic material. Previously, analyses of RNA secondary structures suggested that some RNAs resembling ancestral (t)RNAs formed recently de novo, other parasitic sequences cluster with rRNAs. Here positive control analyses of additional RNA secondary structures confirm ancestral and de novo statuses of RNA grouped according to secondary structure. Viroids with branched stems resemble de novo RNAs, rod-shaped viroids resemble rRNA secondary structures, independently of GC contents. 5′ UTR leading regions of West Nile and Dengue flavivirid viruses resemble de novo and rRNA structures, respectively. An RNA homologous with Megavirus, Dengue and West Nile genomes, copperhead snake microsatellites and levant cotton repeats, not templated by Mimivirus' genome, persists throughout Mimivirus' infection. Its secondary structure clusters with candidate de novo RNAs. The saltatory phyletic distribution and secondary structure of Mimivirus' peculiar RNA suggest occasional template-free polymerization of this sequence, rather than noncanonical transcriptions (swinger polymerization, posttranscriptional editing). PMID:29449833

  20. A comparison of different functions for predicted protein model quality assessment.

    PubMed

    Li, Juan; Fang, Huisheng

    2016-07-01

    In protein structure prediction, a considerable number of models are usually produced by either the Template-Based Method (TBM) or the ab initio prediction. The purpose of this study is to find the critical parameter in assessing the quality of the predicted models. A non-redundant template library was developed and 138 target sequences were modeled. The target sequences were all distant from the proteins in the template library and were aligned with template library proteins on the basis of the transformation matrix. The quality of each model was first assessed with QMEAN and its six parameters, which are C_β interaction energy (C_beta), all-atom pairwise energy (PE), solvation energy (SE), torsion angle energy (TAE), secondary structure agreement (SSA), and solvent accessibility agreement (SAE). Finally, the alignment score (score) was also used to assess the quality of model. Hence, a total of eight parameters (i.e., QMEAN, C_beta, PE, SE, TAE, SSA, SAE, score) were independently used to assess the quality of each model. The results indicate that SSA is the best parameter to estimate the quality of the model.

  1. Ferrate oxidation of murine leukemia virus reverse transcriptase: identification of the template-primer binding domain.

    PubMed

    Reddy, G; Nanduri, V B; Basu, A; Modak, M J

    1991-08-20

    Treatment of murine leukemia virus reverse transcriptase (MuLV RT) with potassium ferrate, an oxidizing agent known to oxidize amino acids involved in phosphate binding domains of proteins, results in the irreversible inactivation of both the DNA polymerase and the RNase H activities. Significant protection from ferrate-mediated inactivation is observed in the presence of template-primer but not in the presence of substrate deoxynucleoside triphosphates. Furthermore, ferrate-treated enzyme loses template-primer binding activity as judged by UV-mediated cross-linking of radiolabeled DNA. Comparative tryptic peptide mapping by reverse-phase HPLC of native and ferrate-oxidized enzyme indicated the presence of two new peptides eluting at 38 and 57 min and a significant loss of a peptide eluting at 74 min. Purification, amino acid composition, and sequencing of these affected peptides revealed that they correspond to amino acid residues 285-295, 630-640, and 586-599, respectively, in the primary amino acid sequence of MuLV RT. These results indicate that the domains constituted by the above peptides are important for the template-primer binding function in MuLV RT. Peptide I is located in the polymerase domain whereas peptides II and III are located in the RNase H domain. Amino acid sequence analysis of peptides I and II suggested Lys-285 and Cys-635 as the probable sites of ferrate action.

  2. The quest for rare variants: pooled multiplexed next generation sequencing in plants.

    PubMed

    Marroni, Fabio; Pinosio, Sara; Morgante, Michele

    2012-01-01

    Next generation sequencing (NGS) instruments produce an unprecedented amount of sequence data at contained costs. This gives researchers the possibility of designing studies with adequate power to identify rare variants at a fraction of the economic and labor resources required by individual Sanger sequencing. As of today, few research groups working in plant sciences have exploited this potentiality, showing that pooled NGS provides results in excellent agreement with those obtained by individual Sanger sequencing. The aim of this review is to convey to the reader the general ideas underlying the use of pooled NGS for the identification of rare variants. To facilitate a thorough understanding of the possibilities of the method, we will explain in detail the possible experimental and analytical approaches and discuss their advantages and disadvantages. We will show that information on allele frequency obtained by pooled NGS can be used to accurately compute basic population genetics indexes such as allele frequency, nucleotide diversity, and Tajima's D. Finally, we will discuss applications and future perspectives of the multiplexed NGS approach.

  3. Detecting SNPs and estimating allele frequencies in clonal bacterial populations by sequencing pooled DNA.

    PubMed

    Holt, Kathryn E; Teo, Yik Y; Li, Heng; Nair, Satheesh; Dougan, Gordon; Wain, John; Parkhill, Julian

    2009-08-15

    Here, we present a method for estimating the frequencies of SNP alleles present within pooled samples of DNA using high-throughput short-read sequencing. The method was tested on real data from six strains of the highly monomorphic pathogen Salmonella Paratyphi A, sequenced individually and in a pool. A variety of read mapping and quality-weighting procedures were tested to determine the optimal parameters, which afforded > or =80% sensitivity of SNP detection and strong correlation with true SNP frequency at poolwide read depth of 40x, declining only slightly at read depths 20-40x. The method was implemented in Perl and relies on the opensource software Maq for read mapping and SNP calling. The Perl script is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/pools/.

  4. HDOCK: a web server for protein-protein and protein-DNA/RNA docking based on a hybrid strategy.

    PubMed

    Yan, Yumeng; Zhang, Di; Zhou, Pei; Li, Botong; Huang, Sheng-You

    2017-07-03

    Protein-protein and protein-DNA/RNA interactions play a fundamental role in a variety of biological processes. Determining the complex structures of these interactions is valuable, in which molecular docking has played an important role. To automatically make use of the binding information from the PDB in docking, here we have presented HDOCK, a novel web server of our hybrid docking algorithm of template-based modeling and free docking, in which cases with misleading templates can be rescued by the free docking protocol. The server supports protein-protein and protein-DNA/RNA docking and accepts both sequence and structure inputs for proteins. The docking process is fast and consumes about 10-20 min for a docking run. Tested on the cases with weakly homologous complexes of <30% sequence identity from five docking benchmarks, the HDOCK pipeline tied with template-based modeling on the protein-protein and protein-DNA benchmarks and performed better than template-based modeling on the three protein-RNA benchmarks when the top 10 predictions were considered. The performance of HDOCK became better when more predictions were considered. Combining the results of HDOCK and template-based modeling by ranking first of the template-based model further improved the predictive power of the server. The HDOCK web server is available at http://hdock.phys.hust.edu.cn/. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  5. Pyrosequencing for Microbial Identification and Characterization

    PubMed Central

    Cummings, Patrick J.; Ahmed, Ray; Durocher, Jeffrey A.; Jessen, Adam; Vardi, Tamar; Obom, Kristina M.

    2013-01-01

    Pyrosequencing is a versatile technique that facilitates microbial genome sequencing that can be used to identify bacterial species, discriminate bacterial strains and detect genetic mutations that confer resistance to anti-microbial agents. The advantages of pyrosequencing for microbiology applications include rapid and reliable high-throughput screening and accurate identification of microbes and microbial genome mutations. Pyrosequencing involves sequencing of DNA by synthesizing the complementary strand a single base at a time, while determining the specific nucleotide being incorporated during the synthesis reaction. The reaction occurs on immobilized single stranded template DNA where the four deoxyribonucleotides (dNTP) are added sequentially and the unincorporated dNTPs are enzymatically degraded before addition of the next dNTP to the synthesis reaction. Detection of the specific base incorporated into the template is monitored by generation of chemiluminescent signals. The order of dNTPs that produce the chemiluminescent signals determines the DNA sequence of the template. The real-time sequencing capability of pyrosequencing technology enables rapid microbial identification in a single assay. In addition, the pyrosequencing instrument, can analyze the full genetic diversity of anti-microbial drug resistance, including typing of SNPs, point mutations, insertions, and deletions, as well as quantification of multiple gene copies that may occur in some anti-microbial resistance patterns. PMID:23995536

  6. Pyrosequencing for microbial identification and characterization.

    PubMed

    Cummings, Patrick J; Ahmed, Ray; Durocher, Jeffrey A; Jessen, Adam; Vardi, Tamar; Obom, Kristina M

    2013-08-22

    Pyrosequencing is a versatile technique that facilitates microbial genome sequencing that can be used to identify bacterial species, discriminate bacterial strains and detect genetic mutations that confer resistance to anti-microbial agents. The advantages of pyrosequencing for microbiology applications include rapid and reliable high-throughput screening and accurate identification of microbes and microbial genome mutations. Pyrosequencing involves sequencing of DNA by synthesizing the complementary strand a single base at a time, while determining the specific nucleotide being incorporated during the synthesis reaction. The reaction occurs on immobilized single stranded template DNA where the four deoxyribonucleotides (dNTP) are added sequentially and the unincorporated dNTPs are enzymatically degraded before addition of the next dNTP to the synthesis reaction. Detection of the specific base incorporated into the template is monitored by generation of chemiluminescent signals. The order of dNTPs that produce the chemiluminescent signals determines the DNA sequence of the template. The real-time sequencing capability of pyrosequencing technology enables rapid microbial identification in a single assay. In addition, the pyrosequencing instrument, can analyze the full genetic diversity of anti-microbial drug resistance, including typing of SNPs, point mutations, insertions, and deletions, as well as quantification of multiple gene copies that may occur in some anti-microbial resistance patterns.

  7. Methods for sequencing GC-rich and CCT repeat DNA templates

    DOEpatents

    Robinson, Donna L.

    2007-02-20

    The present invention is directed to a PCR-based method of cycle sequencing DNA and other polynucleotide sequences having high CG content and regions of high GC content, and includes for example DNA strands with a high Cytosine and/or Guanosine content and repeated motifs such as CCT repeats.

  8. Why barcode? High-throughput multiplex sequencing of mitochondrial genomes for molecular systematics.

    PubMed

    Timmermans, M J T N; Dodsworth, S; Culverwell, C L; Bocak, L; Ahrens, D; Littlewood, D T J; Pons, J; Vogler, A P

    2010-11-01

    Mitochondrial genome sequences are important markers for phylogenetics but taxon sampling remains sporadic because of the great effort and cost required to acquire full-length sequences. Here, we demonstrate a simple, cost-effective way to sequence the full complement of protein coding mitochondrial genes from pooled samples using the 454/Roche platform. Multiplexing was achieved without the need for expensive indexing tags ('barcodes'). The method was trialled with a set of long-range polymerase chain reaction (PCR) fragments from 30 species of Coleoptera (beetles) sequenced in a 1/16th sector of a sequencing plate. Long contigs were produced from the pooled sequences with sequencing depths ranging from ∼10 to 100× per contig. Species identity of individual contigs was established via three 'bait' sequences matching disparate parts of the mitochondrial genome obtained by conventional PCR and Sanger sequencing. This proved that assembly of contigs from the sequencing pool was correct. Our study produced sequences for 21 nearly complete and seven partial sets of protein coding mitochondrial genes. Combined with existing sequences for 25 taxa, an improved estimate of basal relationships in Coleoptera was obtained. The procedure could be employed routinely for mitochondrial genome sequencing at the species level, to provide improved species 'barcodes' that currently use the cox1 gene only.

  9. Problem-Solving Test: Pyrosequencing

    ERIC Educational Resources Information Center

    Szeberenyi, Jozsef

    2013-01-01

    Terms to be familiar with before you start to solve the test: Maxam-Gilbert sequencing, Sanger sequencing, gel electrophoresis, DNA synthesis reaction, polymerase chain reaction, template, primer, DNA polymerase, deoxyribonucleoside triphosphates, orthophosphate, pyrophosphate, nucleoside monophosphates, luminescence, acid anhydride bond,…

  10. Improving homology modeling of G-protein coupled receptors through multiple-template derived conserved inter-residue interactions

    NASA Astrophysics Data System (ADS)

    Chaudhari, Rajan; Heim, Andrew J.; Li, Zhijun

    2015-05-01

    Evidenced by the three-rounds of G-protein coupled receptors (GPCR) Dock competitions, improving homology modeling methods of helical transmembrane proteins including the GPCRs, based on templates of low sequence identity, remains an eminent challenge. Current approaches addressing this challenge adopt the philosophy of "modeling first, refinement next". In the present work, we developed an alternative modeling approach through the novel application of available multiple templates. First, conserved inter-residue interactions are derived from each additional template through conservation analysis of each template-target pairwise alignment. Then, these interactions are converted into distance restraints and incorporated in the homology modeling process. This approach was applied to modeling of the human β2 adrenergic receptor using the bovin rhodopsin and the human protease-activated receptor 1 as templates and improved model quality was demonstrated compared to the homology model generated by standard single-template and multiple-template methods. This method of "refined restraints first, modeling next", provides a fast and complementary way to the current modeling approaches. It allows rational identification and implementation of additional conserved distance restraints extracted from multiple templates and/or experimental data, and has the potential to be applicable to modeling of all helical transmembrane proteins.

  11. FANCJ promotes DNA synthesis through G-quadruplex structures

    PubMed Central

    Castillo Bosch, Pau; Segura-Bayona, Sandra; Koole, Wouter; van Heteren, Jane T; Dewar, James M; Tijsterman, Marcel; Knipscheer, Puck

    2014-01-01

    Our genome contains many G-rich sequences, which have the propensity to fold into stable secondary DNA structures called G4 or G-quadruplex structures. These structures have been implicated in cellular processes such as gene regulation and telomere maintenance. However, G4 sequences are prone to mutations particularly upon replication stress or in the absence of specific helicases. To investigate how G-quadruplex structures are resolved during DNA replication, we developed a model system using ssDNA templates and Xenopus egg extracts that recapitulates eukaryotic G4 replication. Here, we show that G-quadruplex structures form a barrier for DNA replication. Nascent strand synthesis is blocked at one or two nucleotides from the G4. After transient stalling, G-quadruplexes are efficiently unwound and replicated. In contrast, depletion of the FANCJ/BRIP1 helicase causes persistent replication stalling at G-quadruplex structures, demonstrating a vital role for this helicase in resolving these structures. FANCJ performs this function independently of the classical Fanconi anemia pathway. These data provide evidence that the G4 sequence instability in FANCJ−/− cells and Fancj/dog1 deficient C. elegans is caused by replication stalling at G-quadruplexes. PMID:25193968

  12. Characterization of NIST human mitochondrial DNA SRM-2392 and SRM-2392-I standard reference materials by next generation sequencing.

    PubMed

    Riman, Sarah; Kiesler, Kevin M; Borsuk, Lisa A; Vallone, Peter M

    2017-07-01

    Standard Reference Materials SRM 2392 and 2392-I are intended to provide quality control when amplifying and sequencing human mitochondrial genome sequences. The National Institute of Standards and Technology (NIST) offers these SRMs to laboratories performing DNA-based forensic human identification, molecular diagnosis of mitochondrial diseases, mutation detection, evolutionary anthropology, and genetic genealogy. The entire mtGenome (∼16569bp) of SRM 2392 and 2392-I have previously been characterized at NIST by Sanger sequencing. Herein, we used the sensitivity, specificity, and accuracy offered by next generation sequencing (NGS) to: (1) re-sequence the certified values of the SRM 2392 and 2392-I; (2) confirm Sanger data with a high coverage new sequencing technology; (3) detect lower level heteroplasmies (<20%); and thus (4) support mitochondrial sequencing communities in the adoption of NGS methods. To obtain a consensus sequence for the SRMs as well as identify and control any bias, sequencing was performed using two NGS platforms and data was analyzed using different bioinformatics pipelines. Our results confirm five low level heteroplasmy sites that were not previously observed with Sanger sequencing: three sites in the GM09947A template in SRM 2392 and two sites in the HL-60 template in SRM 2392-I. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, H.U.G.; Gray, J.W.

    1995-06-27

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity. 18 figs.

  14. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, Heinz-Ulrich G.; Gray, Joe W.

    1995-01-01

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers ard probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity.

  15. Life cycle environmental implications of residential swimming pools.

    PubMed

    Forrest, Nigel; Williams, Eric

    2010-07-15

    Ownership of private swimming pools in the U.S. grew 2 to 4% per annum from 1997 to 2007. The environmental implications of pool ownership are analyzed by hybrid life cycle assessment (LCA) for nine U.S. cities. An operational model is constructed estimating consumption of chemicals, water, and energy for a typical residential pool. The model incorporates geographical climatic variations and upstream water and energy use from electricity and water supply networks. Results vary considerably by city: a factor of 5-6 for both water and energy use. Water use is driven by aridness and length of the swimming season, while energy use is mainly driven by length of the swimming season. Water and energy impacts of pools are significant, particularly in arid climates. In Phoenix for example pools account for 22% and 13% of a household's electricity and water use, respectively. Measures to reduce water and energy use in pools such as optimizing the pump schedule and covering the pool in winter can realize greater savings than many common household efficiency improvements. Private versus community pools are also compared. Community pools in Phoenix use 60% less swimming pool water and energy per household than subdivisions without community pools.

  16. Hierarchy and extremes in selections from pools of randomized proteins

    PubMed Central

    Boyer, Sébastien; Biswas, Dipanwita; Kumar Soshee, Ananda; Scaramozzino, Natale; Nizak, Clément; Rivoire, Olivier

    2016-01-01

    Variation and selection are the core principles of Darwinian evolution, but quantitatively relating the diversity of a population to its capacity to respond to selection is challenging. Here, we examine this problem at a molecular level in the context of populations of partially randomized proteins selected for binding to well-defined targets. We built several minimal protein libraries, screened them in vitro by phage display, and analyzed their response to selection by high-throughput sequencing. A statistical analysis of the results reveals two main findings. First, libraries with the same sequence diversity but built around different “frameworks” typically have vastly different responses; second, the distribution of responses of the best binders in a library follows a simple scaling law. We show how an elementary probabilistic model based on extreme value theory rationalizes the latter finding. Our results have implications for designing synthetic protein libraries, estimating the density of functional biomolecules in sequence space, characterizing diversity in natural populations, and experimentally investigating evolvability (i.e., the potential for future evolution). PMID:26969726

  17. Hierarchy and extremes in selections from pools of randomized proteins.

    PubMed

    Boyer, Sébastien; Biswas, Dipanwita; Kumar Soshee, Ananda; Scaramozzino, Natale; Nizak, Clément; Rivoire, Olivier

    2016-03-29

    Variation and selection are the core principles of Darwinian evolution, but quantitatively relating the diversity of a population to its capacity to respond to selection is challenging. Here, we examine this problem at a molecular level in the context of populations of partially randomized proteins selected for binding to well-defined targets. We built several minimal protein libraries, screened them in vitro by phage display, and analyzed their response to selection by high-throughput sequencing. A statistical analysis of the results reveals two main findings. First, libraries with the same sequence diversity but built around different "frameworks" typically have vastly different responses; second, the distribution of responses of the best binders in a library follows a simple scaling law. We show how an elementary probabilistic model based on extreme value theory rationalizes the latter finding. Our results have implications for designing synthetic protein libraries, estimating the density of functional biomolecules in sequence space, characterizing diversity in natural populations, and experimentally investigating evolvability (i.e., the potential for future evolution).

  18. Recombinant viral RdRps can initiate RNA synthesis from circular templates

    PubMed Central

    RANJITH-KUMAR, C.T.; KAO, C.C.

    2006-01-01

    The crystal structure of the recombinant hepatitis C virus (HCV) RNA-dependent RNA polymerase (RdRp) revealed extensive interactions between the fingers and the thumb subdomains, resulting in a closed conformation with an established template channel that should specifically accept single-stranded templates. We made circularized RNA templates and found that they were efficiently used by the HCV RdRp to synthesize product RNAs that are significantly longer than the template, suggesting that RdRp could exist in an open conformation prior to template binding. RNA synthesis using circular RNA templates had properties similar to those previously documented for linear RNA, including a need for higher GTP concentration for initiation, usage of GTP analogs, sensitivity to salt, and involvement of active-site residues for product formation. Some products were resistant to challenge with the template competitor heparin, indicating that the elongation complexes remain bound to template and are competent for RNA synthesis. Other products were not elongated in the presence of heparin, indicating that the elongation complex was terminated. Lastly, recombinant RdRps from two other flaviviruses and from the Pseudomonas phage φ6 also could use circular RNA templates for RNA-dependent RNA synthesis, although the φ6 RdRp could only use circular RNAs made from the 3′-terminal sequence of the φ6 genome. PMID:16373481

  19. Landscape of Insertion Polymorphisms in the Human Genome

    PubMed Central

    Onozawa, Masahiro; Goldberg, Liat; Aplan, Peter D.

    2015-01-01

    Nucleotide substitutions, small (<50 bp) insertions or deletions (indels), and large (>50 bp) deletions are well-known causes of genetic variation within the human genome. We recently reported a previously unrecognized form of polymorphic insertions, termed templated sequence insertion polymorphism (TSIP), in which the inserted sequence was templated from a distant genomic region, and was inserted in the genome through reverse transcription of an RNA intermediate. TSIPs can be grouped into two classes based on nucleotide sequence features at the insertion junctions; class 1 TSIPs show target site duplication, polyadenylation, and preference for insertion at a 5′-TTTT/A-3′ sequence, suggesting a LINE-1 based insertion mechanism, whereas class 2 TSIPs show features consistent with repair of a DNA double strand break by nonhomologous end joining. To gain a more complete picture of TSIPs throughout the human population, we evaluated whole-genome sequence from 52 individuals, and identified 171 TSIPs. Most individuals had 25–30 TSIPs, and common (present in >20% of individuals) TSIPs were found in individuals throughout the world, whereas rare TSIPs tended to cluster in specific geographic regions. The number of rare TSIPs was greater than the number of common TSIPs, suggesting that TSIP generation is an ongoing process. Intriguingly, mitochondrial sequences were a frequent template for class 2 insertions, used more commonly than any nuclear chromosome. Similar to single nucleotide polymorphisms and indels, we suspect that these TSIPs may be important for the generation of human diversity and genetic diseases, and can be useful in tracking historical migration of populations. PMID:25745018

  20. Caught in the middle with multiple displacement amplification: the myth of pooling for avoiding multiple displacement amplification bias in a metagenome.

    PubMed

    Marine, Rachel; McCarren, Coleen; Vorrasane, Vansay; Nasko, Dan; Crowgey, Erin; Polson, Shawn W; Wommack, K Eric

    2014-01-30

    Shotgun metagenomics has become an important tool for investigating the ecology of microorganisms. Underlying these investigations is the assumption that metagenome sequence data accurately estimates the census of microbial populations. Multiple displacement amplification (MDA) of microbial community DNA is often used in cases where it is difficult to obtain enough DNA for sequencing; however, MDA can result in amplification biases that may impact subsequent estimates of population census from metagenome data. Some have posited that pooling replicate MDA reactions negates these biases and restores the accuracy of population analyses. This assumption has not been empirically tested. Using mock viral communities, we examined the influence of pooling on population-scale analyses. In pooled and single reaction MDA treatments, sequence coverage of viral populations was highly variable and coverage patterns across viral genomes were nearly identical, indicating that initial priming biases were reproducible and that pooling did not alleviate biases. In contrast, control unamplified sequence libraries showed relatively even coverage across phage genomes. MDA should be avoided for metagenomic investigations that require quantitative estimates of microbial taxa and gene functional groups. While MDA is an indispensable technique in applications such as single-cell genomics, amplification biases cannot be overcome by combining replicate MDA reactions. Alternative library preparation techniques should be utilized for quantitative microbial ecology studies utilizing metagenomic sequencing approaches.

  1. DNA Sequences from Formalin-Fixed Nematodes: Integrating Molecular and Morphological Approaches to Taxonomy

    PubMed Central

    Thomas, W. Kelley; Vida, J. T.; Frisse, Linda M.; Mundo, Manuel; Baldwin, James G.

    1997-01-01

    To effectively integrate DNA sequence analysis and classical nematode taxonomy, we must be able to obtain DNA sequences from formalin-fixed specimens. Microdissected sections of nematodes were removed from specimens fixed in formalin, using standard protocols and without destroying morphological features. The fixed sections provided sufficient template for multiple polymerase chain reaction-based DNA sequence analyses. PMID:19274156

  2. Using structure to explore the sequence alignment space of remote homologs.

    PubMed

    Kuziemko, Andrew; Honig, Barry; Petrey, Donald

    2011-10-01

    Protein structure modeling by homology requires an accurate sequence alignment between the query protein and its structural template. However, sequence alignment methods based on dynamic programming (DP) are typically unable to generate accurate alignments for remote sequence homologs, thus limiting the applicability of modeling methods. A central problem is that the alignment that is "optimal" in terms of the DP score does not necessarily correspond to the alignment that produces the most accurate structural model. That is, the correct alignment based on structural superposition will generally have a lower score than the optimal alignment obtained from sequence. Variations of the DP algorithm have been developed that generate alternative alignments that are "suboptimal" in terms of the DP score, but these still encounter difficulties in detecting the correct structural alignment. We present here a new alternative sequence alignment method that relies heavily on the structure of the template. By initially aligning the query sequence to individual fragments in secondary structure elements and combining high-scoring fragments that pass basic tests for "modelability", we can generate accurate alignments within a small ensemble. Our results suggest that the set of sequences that can currently be modeled by homology can be greatly extended.

  3. Foreshocks and delayed triggering of the 2016 MW7.1 Te Araroa earthquake and dynamic reinvigoration of its aftershock sequence by the MW7.8 Kaikōura earthquake, New Zealand

    NASA Astrophysics Data System (ADS)

    Warren-Smith, Emily; Fry, Bill; Kaneko, Yoshihiro; Chamberlain, Calum J.

    2018-01-01

    We analyze the preparatory period of the September 2016 MW7.1 Te Araroa foreshock-mainshock sequence in the Northern Hikurangi margin, New Zealand, and subsequent reinvigoration of Te Araroa aftershocks driven by a large distant earthquake (the November 2016 MW7.8 Kaikōura earthquake). By adopting a matched-filter detection workflow using 582 well-defined template events, we generate an improved foreshock and aftershock catalog for the Te Araroa sequence (>8,000 earthquakes over 66 d). Templates characteristic of the MW7.1 sequence (including the mainshock template) detect several highly correlating events (ML2.5-3.5) starting 12 min after a MW5.7 foreshock. These pre-cursory events occurred within ∼1 km of the mainshock and migrate bilaterally, suggesting precursory slip was triggered by the foreshock on the MW7.1 fault patch prior to mainshock failure. We extend our matched-filter routine to examine the interactions between high dynamic stresses resulting from passing surface waves of the November 2016 MW7.8 Kaikōura earthquake, and the evolution of the Te Araroa aftershock sequence. We observe a sudden spike in moment release of the aftershock sequence immediately following peak dynamic Coulomb stresses of 50-150 kPa on the MW7.1 fault plane. The triggered increase in moment release culminated in a MW5.1 event, immediately followed by a ∼3 h temporal stress shadow. Our observations document the preparatory period of a major subduction margin earthquake following a significant foreshock, and quantify dynamic reinvigoration of a distant on-going major aftershock sequence amid a period of temporal clustering of seismic activity in New Zealand.

  4. BAsE-Seq: a method for obtaining long viral haplotypes from short sequence reads.

    PubMed

    Hong, Lewis Z; Hong, Shuzhen; Wong, Han Teng; Aw, Pauline P K; Cheng, Yan; Wilm, Andreas; de Sessions, Paola F; Lim, Seng Gee; Nagarajan, Niranjan; Hibberd, Martin L; Quake, Stephen R; Burkholder, William F

    2014-01-01

    We present a method for obtaining long haplotypes, of over 3 kb in length, using a short-read sequencer, Barcode-directed Assembly for Extra-long Sequences (BAsE-Seq). BAsE-Seq relies on transposing a template-specific barcode onto random segments of the template molecule and assembling the barcoded short reads into complete haplotypes. We applied BAsE-Seq on mixed clones of hepatitis B virus and accurately identified haplotypes occurring at frequencies greater than or equal to 0.4%, with >99.9% specificity. Applying BAsE-Seq to a clinical sample, we obtained over 9,000 viral haplotypes, which provided an unprecedented view of hepatitis B virus population structure during chronic infection. BAsE-Seq is readily applicable for monitoring quasispecies evolution in viral diseases.

  5. SFESA: a web server for pairwise alignment refinement by secondary structure shifts.

    PubMed

    Tong, Jing; Pei, Jimin; Grishin, Nick V

    2015-09-03

    Protein sequence alignment is essential for a variety of tasks such as homology modeling and active site prediction. Alignment errors remain the main cause of low-quality structure models. A bioinformatics tool to refine alignments is needed to make protein alignments more accurate. We developed the SFESA web server to refine pairwise protein sequence alignments. Compared to the previous version of SFESA, which required a set of 3D coordinates for a protein, the new server will search a sequence database for the closest homolog with an available 3D structure to be used as a template. For each alignment block defined by secondary structure elements in the template, SFESA evaluates alignment variants generated by local shifts and selects the best-scoring alignment variant. A scoring function that combines the sequence score of profile-profile comparison and the structure score of template-derived contact energy is used for evaluation of alignments. PROMALS pairwise alignments refined by SFESA are more accurate than those produced by current advanced alignment methods such as HHpred and CNFpred. In addition, SFESA also improves alignments generated by other software. SFESA is a web-based tool for alignment refinement, designed for researchers to compute, refine, and evaluate pairwise alignments with a combined sequence and structure scoring of alignment blocks. To our knowledge, the SFESA web server is the only tool that refines alignments by evaluating local shifts of secondary structure elements. The SFESA web server is available at http://prodata.swmed.edu/sfesa.

  6. Registry in a tube: multiplexed pools of retrievable parts for genetic design space exploration

    PubMed Central

    Woodruff, Lauren B. A.; Gorochowski, Thomas E.; Roehner, Nicholas; Densmore, Douglas; Gordon, D. Benjamin; Nicol, Robert

    2017-01-01

    Abstract Genetic designs can consist of dozens of genes and hundreds of genetic parts. After evaluating a design, it is desirable to implement changes without the cost and burden of starting the construction process from scratch. Here, we report a two-step process where a large design space is divided into deep pools of composite parts, from which individuals are retrieved and assembled to build a final construct. The pools are built via multiplexed assembly and sequenced using next-generation sequencing. Each pool consists of ∼20 Mb of up to 5000 unique and sequence-verified composite parts that are barcoded for retrieval by PCR. This approach is applied to a 16-gene nitrogen fixation pathway, which is broken into pools containing a total of 55 848 composite parts (71.0 Mb). The pools encompass an enormous design space (1043 possible 23 kb constructs), from which an algorithm-guided 192-member 4.5 Mb library is built. Next, all 1030 possible genetic circuits based on 10 repressors (NOR/NOT gates) are encoded in pools where each repressor is fused to all permutations of input promoters. These demonstrate that multiplexing can be applied to encompass entire design spaces from which individuals can be accessed and evaluated. PMID:28007941

  7. A Neo-Darwinian View of Learning and Its Value for Science and Science Education.

    ERIC Educational Resources Information Center

    Schaverien, Lynette; Cosgrove, Mark

    The modern history of biology shows how Darwin's selectionist theory has replaced instructionist theories in explaining the operations of living things: first with inheritance through the gene pool of the 1850s, and second with the replacement of a template theory of immune system function in the 1960s. Today scholars in several disciplines…

  8. Nucleic acid arrays and methods of synthesis

    DOEpatents

    Sabanayagam, Chandran R.; Sano, Takeshi; Misasi, John; Hatch, Anson; Cantor, Charles

    2001-01-01

    The present invention generally relates to high density nucleic acid arrays and methods of synthesizing nucleic acid sequences on a solid surface. Specifically, the present invention contemplates the use of stabilized nucleic acid primer sequences immobilized on solid surfaces, and circular nucleic acid sequence templates combined with the use of isothermal rolling circle amplification to thereby increase nucleic acid sequence concentrations in a sample or on an array of nucleic acid sequences.

  9. Qualitative and quantitative assessment of Illumina's forensic STR and SNP kits on MiSeq FGx™.

    PubMed

    Sharma, Vishakha; Chow, Hoi Yan; Siegel, Donald; Wurmbach, Elisa

    2017-01-01

    Massively parallel sequencing (MPS) is a powerful tool transforming DNA analysis in multiple fields ranging from medicine, to environmental science, to evolutionary biology. In forensic applications, MPS offers the ability to significantly increase the discriminatory power of human identification as well as aid in mixture deconvolution. However, before the benefits of any new technology can be employed, a thorough evaluation of its quality, consistency, sensitivity, and specificity must be rigorously evaluated in order to gain a detailed understanding of the technique including sources of error, error rates, and other restrictions/limitations. This extensive study assessed the performance of Illumina's MiSeq FGx MPS system and ForenSeq™ kit in nine experimental runs including 314 reaction samples. In-depth data analysis evaluated the consequences of different assay conditions on test results. Variables included: sample numbers per run, targets per run, DNA input per sample, and replications. Results are presented as heat maps revealing patterns for each locus. Data analysis focused on read numbers (allele coverage), drop-outs, drop-ins, and sequence analysis. The study revealed that loci with high read numbers performed better and resulted in fewer drop-outs and well balanced heterozygous alleles. Several loci were prone to drop-outs which led to falsely typed homozygotes and therefore to genotype errors. Sequence analysis of allele drop-in typically revealed a single nucleotide change (deletion, insertion, or substitution). Analyses of sequences, no template controls, and spurious alleles suggest no contamination during library preparation, pooling, and sequencing, but indicate that sequencing or PCR errors may have occurred due to DNA polymerase infidelities. Finally, we found utilizing Illumina's FGx System at recommended conditions does not guarantee 100% outcomes for all samples tested, including the positive control, and required manual editing due to low read numbers and/or allele drop-in. These findings are important for progressing towards implementation of MPS in forensic DNA testing.

  10. An Evolution-Based Approach to De Novo Protein Design and Case Study on Mycobacterium tuberculosis

    PubMed Central

    Brender, Jeffrey R.; Czajka, Jeff; Marsh, David; Gray, Felicia; Cierpicki, Tomasz; Zhang, Yang

    2013-01-01

    Computational protein design is a reverse procedure of protein folding and structure prediction, where constructing structures from evolutionarily related proteins has been demonstrated to be the most reliable method for protein 3-dimensional structure prediction. Following this spirit, we developed a novel method to design new protein sequences based on evolutionarily related protein families. For a given target structure, a set of proteins having similar fold are identified from the PDB library by structural alignments. A structural profile is then constructed from the protein templates and used to guide the conformational search of amino acid sequence space, where physicochemical packing is accommodated by single-sequence based solvation, torsion angle, and secondary structure predictions. The method was tested on a computational folding experiment based on a large set of 87 protein structures covering different fold classes, which showed that the evolution-based design significantly enhances the foldability and biological functionality of the designed sequences compared to the traditional physics-based force field methods. Without using homologous proteins, the designed sequences can be folded with an average root-mean-square-deviation of 2.1 Å to the target. As a case study, the method is extended to redesign all 243 structurally resolved proteins in the pathogenic bacteria Mycobacterium tuberculosis, which is the second leading cause of death from infectious disease. On a smaller scale, five sequences were randomly selected from the design pool and subjected to experimental validation. The results showed that all the designed proteins are soluble with distinct secondary structure and three have well ordered tertiary structure, as demonstrated by circular dichroism and NMR spectroscopy. Together, these results demonstrate a new avenue in computational protein design that uses knowledge of evolutionary conservation from protein structural families to engineer new protein molecules of improved fold stability and biological functionality. PMID:24204234

  11. Detection of Helicobacter and Campylobacter spp. from the aquatic environment of marine mammals.

    PubMed

    Goldman, C G; Matteo, M J; Loureiro, J D; Degrossi, J; Teves, S; Heredia, S Rodriguez; Alvarez, K; González, A Beltrán; Catalano, M; Boccio, J; Cremaschi, G; Solnick, J V; Zubillaga, M B

    2009-01-13

    The mechanism by which Helicobacter species are transmitted remains unclear. To examine the possible role of environmental transmission in marine mammals, we sought the presence of Helicobacter spp. and non-Helicobacter bacteria within the order Campylobacterales in water from the aquatic environment of marine mammals, and in fish otoliths regurgitated by dolphins. Water was collected from six pools, two inhabited by dolphins and four inhabited by seals. Regurgitated otoliths were collected from the bottom of dolphins' pools. Samples were evaluated by culture, PCR and DNA sequence analysis. Sequences from dolphins' water and from regurgitated otoliths clustered with 99.8-100% homology with sequences from gastric fluids, dental plaque and saliva from dolphins living in those pools, and with 99.5% homology with H. cetorum. Sequences from seals' water clustered with 99.5% homology with a sequence amplified from a Northern sea lion (AY203900). Control PCR on source water for the pools and from otoliths dissected from feeder fish were negative. The findings of Helicobacter spp. DNA in the aquatic environment suggests that contaminated water from regurgitated fish otoliths and perhaps other tissues may play a role in Helicobacter transmission among marine mammals.

  12. EGO-1, a C. elegans RdRP, Modulates Gene Expression via Production of mRNA-Templated Short Antisense RNAs

    PubMed Central

    Maniar, Jay M.; Fire, Andrew Z.

    2011-01-01

    SUMMARY Background The development of the germline in Caenorhabditis elegans is a complex process involving the regulation of thousands of genes in a coordinated manner. Several genes required for small RNA biogenesis and function are among those required for the proper organization of the germline. EGO-1 is a putative RNA-directed RNA polymerase (RdRP) that is required for multiple aspects of C. elegans germline development and efficient RNAi of germline-expressed genes. RdRPs have been proposed to act through a variety of mechanisms including the post-transcriptional targeting of specific mRNAs as well as through a direct interaction with chromatin. Despite extensive investigation, the molecular role of EGO-1 has remained enigmatic. Results Here we use high-throughput small RNA and messenger RNA sequencing to investigate EGO-1 function. We found that EGO-1 is required to produce a distinct pool of small RNAs antisense to a number of germline-expressed mRNAs through several developmental stages. These potential mRNA targets fall into distinct classes, including genes required for kinetochore and nuclear pore assembly, histone-modifying activities and centromeric proteins. We also found several RNAi-related genes to be targets of EGO-1. Finally, we show a strong association between the loss of small RNAs and the rise of mRNA levels in ego-1(−) animals. Conclusions Our data support the conclusion that EGO-1 produces triphosphorylated small RNAs derived from mRNA templates and that these small RNAs modulate gene expression through the targeting of their cognate mRNAs. PMID:21396820

  13. Ethical considerations of research policy for personal genome analysis: the approach of the Genome Science Project in Japan.

    PubMed

    Minari, Jusaku; Shirai, Tetsuya; Kato, Kazuto

    2014-12-01

    As evidenced by high-throughput sequencers, genomic technologies have recently undergone radical advances. These technologies enable comprehensive sequencing of personal genomes considerably more efficiently and less expensively than heretofore. These developments present a challenge to the conventional framework of biomedical ethics; under these changing circumstances, each research project has to develop a pragmatic research policy. Based on the experience with a new large-scale project-the Genome Science Project-this article presents a novel approach to conducting a specific policy for personal genome research in the Japanese context. In creating an original informed-consent form template for the project, we present a two-tiered process: making the draft of the template following an analysis of national and international policies; refining the draft template in conjunction with genome project researchers for practical application. Through practical use of the template, we have gained valuable experience in addressing challenges in the ethical review process, such as the importance of sharing details of the latest developments in genomics with members of research ethics committees. We discuss certain limitations of the conventional concept of informed consent and its governance system and suggest the potential of an alternative process using information technology.

  14. Kinetics and thermodynamics of exonuclease-deficient DNA polymerases

    NASA Astrophysics Data System (ADS)

    Gaspard, Pierre

    2016-04-01

    A kinetic theory is developed for exonuclease-deficient DNA polymerases, based on the experimental observation that the rates depend not only on the newly incorporated nucleotide, but also on the previous one, leading to the growth of Markovian DNA sequences from a Bernoullian template. The dependencies on nucleotide concentrations and template sequence are explicitly taken into account. In this framework, the kinetic and thermodynamic properties of DNA replication, in particular, the mean growth velocity, the error probability, and the entropy production are calculated analytically in terms of the rate constants and the concentrations. Theory is compared with numerical simulations for the DNA polymerases of T7 viruses and human mitochondria.

  15. Decoding DNA labels by melting curve analysis using real-time PCR.

    PubMed

    Balog, József A; Fehér, Liliána Z; Puskás, László G

    2017-12-01

    Synthetic DNA has been used as an authentication code for a diverse number of applications. However, existing decoding approaches are based on either DNA sequencing or the determination of DNA length variations. Here, we present a simple alternative protocol for labeling different objects using a small number of short DNA sequences that differ in their melting points. Code amplification and decoding can be done in two steps using quantitative PCR (qPCR). To obtain a DNA barcode with high complexity, we defined 8 template groups, each having 4 different DNA templates, yielding 158 (>2.5 billion) combinations of different individual melting temperature (Tm) values and corresponding ID codes. The reproducibility and specificity of the decoding was confirmed by using the most complex template mixture, which had 32 different products in 8 groups with different Tm values. The industrial applicability of our protocol was also demonstrated by labeling a drone with an oil-based paint containing a predefined DNA code, which was then successfully decoded. The method presented here consists of a simple code system based on a small number of synthetic DNA sequences and a cost-effective, rapid decoding protocol using a few qPCR reactions, enabling a wide range of authentication applications.

  16. Precision oncology using a limited number of cells: optimization of whole genome amplification products for sequencing applications.

    PubMed

    Sho, Shonan; Court, Colin M; Winograd, Paul; Lee, Sangjun; Hou, Shuang; Graeber, Thomas G; Tseng, Hsian-Rong; Tomlinson, James S

    2017-07-01

    Sequencing analysis of circulating tumor cells (CTCs) enables "liquid biopsy" to guide precision oncology strategies. However, this requires low-template whole genome amplification (WGA) that is prone to errors and biases from uneven amplifications. Currently, quality control (QC) methods for WGA products, as well as the number of CTCs needed for reliable downstream sequencing, remain poorly defined. We sought to define strategies for selecting and generating optimal WGA products from low-template input as it relates to their potential applications in precision oncology strategies. Single pancreatic cancer cells (HPAF-II) were isolated using laser microdissection. WGA was performed using multiple displacement amplification (MDA), multiple annealing and looping based amplification (MALBAC) and PicoPLEX. Quality of amplified DNA products were assessed using a multiplex/RT-qPCR based method that evaluates for 8-cancer related genes and QC-scores were assigned. We utilized this scoring system to assess the impact of de novo modifications to the WGA protocol. WGA products were subjected to Sanger sequencing, array comparative genomic hybridization (aCGH) and next generation sequencing (NGS) to evaluate their performances in respective downstream analyses providing validation of the QC-score. Single-cell WGA products exhibited a significant sample-to-sample variability in amplified DNA quality as assessed by our 8-gene QC assay. Single-cell WGA products that passed the pre-analysis QC had lower amplification bias and improved aCGH/NGS performance metrics when compared to single-cell WGA products that failed the QC. Increasing the number of cellular input resulted in improved QC-scores overall, but a resultant WGA product that consistently passed the QC step required a starting cellular input of at least 20-cells. Our modified-WGA protocol effectively reduced this number, achieving reproducible high-quality WGA products from ≥5-cells as a starting template. A starting cellular input of 5 to 10-cells amplified using the modified-WGA achieved aCGH and NGS results that closely matched that of unamplified, batch genomic DNA. The modified-WGA protocol coupled with the 8-gene QC serve as an effective strategy to enhance the quality of low-template WGA reactions. Furthermore, a threshold number of 5-10 cells are likely needed for a reliable WGA reaction and product with high fidelity to the original starting template.

  17. Using structural knowledge in the protein data bank to inform the search for potential host-microbe protein interactions in sequence space: application to Mycobacterium tuberculosis.

    PubMed

    Mahajan, Gaurang; Mande, Shekhar C

    2017-04-04

    A comprehensive map of the human-M. tuberculosis (MTB) protein interactome would help fill the gaps in our understanding of the disease, and computational prediction can aid and complement experimental studies towards this end. Several sequence-based in silico approaches tap the existing data on experimentally validated protein-protein interactions (PPIs); these PPIs serve as templates from which novel interactions between pathogen and host are inferred. Such comparative approaches typically make use of local sequence alignment, which, in the absence of structural details about the interfaces mediating the template interactions, could lead to incorrect inferences, particularly when multi-domain proteins are involved. We propose leveraging the domain-domain interaction (DDI) information in PDB complexes to score and prioritize candidate PPIs between host and pathogen proteomes based on targeted sequence-level comparisons. Our method picks out a small set of human-MTB protein pairs as candidates for physical interactions, and the use of functional meta-data suggests that some of them could contribute to the in vivo molecular cross-talk between pathogen and host that regulates the course of the infection. Further, we present numerical data for Pfam domain families that highlights interaction specificity on the domain level. Not every instance of a pair of domains, for which interaction evidence has been found in a few instances (i.e. structures), is likely to functionally interact. Our sorting approach scores candidates according to how "distant" they are in sequence space from known examples of DDIs (templates). Thus, it provides a natural way to deal with the heterogeneity in domain-level interactions. Our method represents a more informed application of local alignment to the sequence-based search for potential human-microbial interactions that uses available PPI data as a prior. Our approach is somewhat limited in its sensitivity by the restricted size and diversity of the template dataset, but, given the rapid accumulation of solved protein complex structures, its scope and utility are expected to keep steadily improving.

  18. Variation of mutational burden in healthy human tissues suggests non-random strand segregation and allows measuring somatic mutation rates.

    PubMed

    Werner, Benjamin; Sottoriva, Andrea

    2018-06-01

    The immortal strand hypothesis poses that stem cells could produce differentiated progeny while conserving the original template strand, thus avoiding accumulating somatic mutations. However, quantitating the extent of non-random DNA strand segregation in human stem cells remains difficult in vivo. Here we show that the change of the mean and variance of the mutational burden with age in healthy human tissues allows estimating strand segregation probabilities and somatic mutation rates. We analysed deep sequencing data from healthy human colon, small intestine, liver, skin and brain. We found highly effective non-random DNA strand segregation in all adult tissues (mean strand segregation probability: 0.98, standard error bounds (0.97,0.99)). In contrast, non-random strand segregation efficiency is reduced to 0.87 (0.78,0.88) in neural tissue during early development, suggesting stem cell pool expansions due to symmetric self-renewal. Healthy somatic mutation rates differed across tissue types, ranging from 3.5 × 10-9/bp/division in small intestine to 1.6 × 10-7/bp/division in skin.

  19. Insertion sequences enrichment in extreme Red sea brine pool vent.

    PubMed

    Elbehery, Ali H A; Aziz, Ramy K; Siam, Rania

    2017-03-01

    Mobile genetic elements are major agents of genome diversification and evolution. Limited studies addressed their characteristics, including abundance, and role in extreme habitats. One of the rare natural habitats exposed to multiple-extreme conditions, including high temperature, salinity and concentration of heavy metals, are the Red Sea brine pools. We assessed the abundance and distribution of different mobile genetic elements in four Red Sea brine pools including the world's largest known multiple-extreme deep-sea environment, the Red Sea Atlantis II Deep. We report a gradient in the abundance of mobile genetic elements, dramatically increasing in the harshest environment of the pool. Additionally, we identified a strong association between the abundance of insertion sequences and extreme conditions, being highest in the harshest and deepest layer of the Red Sea Atlantis II Deep. Our comparative analyses of mobile genetic elements in secluded, extreme and relatively non-extreme environments, suggest that insertion sequences predominantly contribute to polyextremophiles genome plasticity.

  20. Template for assessing climate change impacts and management options: TACCIMO user guide version 2.2

    Treesearch

    Emrys Treasure; Steven McNulty; Jennifer Moore Myers; Lisa Nicole Jennings

    2014-01-01

    The Template for Assessing Climate Change Impacts and Management Options (TACCIMO) is a Web-based tool developed by the Forest Service, U.S. Department of Agriculture to assist Federal, State, and private land managers and planners with evaluation of climate change science implications for sustainable natural resource management. TACCIMO is a dynamic information...

  1. Kilo-sequencing: an ordered strategy for rapid DNA sequence data acquisition.

    PubMed Central

    Barnes, W M; Bevan, M

    1983-01-01

    A strategy for rapid DNA sequence acquisition in an ordered, nonrandom manner, while retaining all of the conveniences of the dideoxy method with M13 transducing phage DNA template, is described. Target DNA 3 to 14 kb in size can be stably carried by our M13 vectors. Suitable targets are stretches of DNA which lack an enzyme recognition site which is unique on our cloning vectors and adjacent to the sequencing primer; current sites that are so useful when lacking are Pst, Xba, HindIII, BglII, EcoRI. By an in vitro procedure, we cut RF DNA once randomly and once specifically, to create thousands of deletions which start at the unique restriction site adjacent to the dideoxy sequencing primer and extend various distances across the target DNA. Phage carrying a desired size of deletions, whose DNA as template will give rise to DNA sequence data in a desired location along the target DNA, may be purified by electrophoresis alive on agarose gels. Phage running in the same location on the agarose gel thus conveniently give rise to nucleotide sequence data from the same kilobase of target DNA. Images PMID:6298723

  2. RNA 3D Structure Modeling by Combination of Template-Based Method ModeRNA, Template-Free Folding with SimRNA, and Refinement with QRNAS.

    PubMed

    Piatkowski, Pawel; Kasprzak, Joanna M; Kumar, Deepak; Magnus, Marcin; Chojnowski, Grzegorz; Bujnicki, Janusz M

    2016-01-01

    RNA encompasses an essential part of all known forms of life. The functions of many RNA molecules are dependent on their ability to form complex three-dimensional (3D) structures. However, experimental determination of RNA 3D structures is laborious and challenging, and therefore, the majority of known RNAs remain structurally uncharacterized. To address this problem, computational structure prediction methods were developed that either utilize information derived from known structures of other RNA molecules (by way of template-based modeling) or attempt to simulate the physical process of RNA structure formation (by way of template-free modeling). All computational methods suffer from various limitations that make theoretical models less reliable than high-resolution experimentally determined structures. This chapter provides a protocol for computational modeling of RNA 3D structure that overcomes major limitations by combining two complementary approaches: template-based modeling that is capable of predicting global architectures based on similarity to other molecules but often fails to predict local unique features, and template-free modeling that can predict the local folding, but is limited to modeling the structure of relatively small molecules. Here, we combine the use of a template-based method ModeRNA with a template-free method SimRNA. ModeRNA requires a sequence alignment of the target RNA sequence to be modeled with a template of the known structure; it generates a model that predicts the structure of a conserved core and provides a starting point for modeling of variable regions. SimRNA can be used to fold small RNAs (<80 nt) without any additional structural information, and to refold parts of models for larger RNAs that have a correctly modeled core. ModeRNA can be either downloaded, compiled and run locally or run through a web interface at http://genesilico.pl/modernaserver/ . SimRNA is currently available to download for local use as a precompiled software package at http://genesilico.pl/software/stand-alone/simrna and as a web server at http://genesilico.pl/SimRNAweb . For model optimization we use QRNAS, available at http://genesilico.pl/qrnas .

  3. Divergent allele advantage at MHC-DRB through direct and maternal genotypic effects and its consequences for allele pool composition and mating

    PubMed Central

    Lenz, Tobias L.; Mueller, Birte; Trillmich, Fritz; Wolf, Jochen B. W.

    2013-01-01

    It is still debated whether main individual fitness differences in natural populations can be attributed to genome-wide effects or to particular loci of outstanding functional importance such as the major histocompatibility complex (MHC). In a long-term monitoring project on Galápagos sea lions (Zalophus wollebaeki), we collected comprehensive fitness and mating data for a total of 506 individuals. Controlling for genome-wide inbreeding, we find strong associations between the MHC locus and nearly all fitness traits. The effect was mainly attributable to MHC sequence divergence and could be decomposed into contributions of own and maternal genotypes. In consequence, the population seems to have evolved a pool of highly divergent alleles conveying near-optimal MHC divergence even by random mating. Our results demonstrate that a single locus can significantly contribute to fitness in the wild and provide conclusive evidence for the ‘divergent allele advantage’ hypothesis, a special form of balancing selection with interesting evolutionary implications. PMID:23677346

  4. Evolution in a Test Tube: Exploring the Structure and Function of RNA Probes

    DTIC Science & Technology

    2008-05-02

    Bartel, D.P. and Szostak, J.W. (1993) Isolation of New Ribozymes from a Large Pool of Random Sequences. Science, New Series 261, 1141-1418. 24...Szostak, J.W. (1993) Isolation of New Ribozymes from a Large Pool of Random Sequences. Science, New Series 261, 1141-1418. Chen, Ying; Carlini

  5. Two Successive Reactions on a DNA Template: A Strategy for Improving Background and Specificity in Nucleic Acid Detection

    PubMed Central

    Franzini, Raphael M.

    2015-01-01

    We report a new strategy for template-mediated fluorogenic chemistry that results in enhanced performance for the fluorescence detection of nucleic acids. In this approach, two successive templated reactions are required to induce a fluorescence signal, rather than only one. These novel fluorescein-labeled oligonucleotide probes, termed 2-STAR probes, contain two quencher groups tethered by separate reductively cleavable linkers. When a 2-STAR quenched probe binds adjacent to either two successive mono triphenyl-phosphine (TPP)-DNAs or a dual TPP-DNA, the two quenchers are released, resulting in a fluorescence signal. Because of the requirement for two consecutive reactions, 2-STAR probes display an unprecedented level of sequence-specificity for template-mediated probe designs. At the same time, background emission generated by off-template reactions or incomplete quenching is among the lowest of any fluorogenic reactive probes for the detection of DNA or RNA. PMID:21294182

  6. Spatially orthogonal chemical functionalization of a hierarchical pore network for catalytic cascade reactions

    NASA Astrophysics Data System (ADS)

    Parlett, Christopher M. A.; Isaacs, Mark A.; Beaumont, Simon K.; Bingham, Laura M.; Hondow, Nicole S.; Wilson, Karen; Lee, Adam F.

    2016-02-01

    The chemical functionality within porous architectures dictates their performance as heterogeneous catalysts; however, synthetic routes to control the spatial distribution of individual functions within porous solids are limited. Here we report the fabrication of spatially orthogonal bifunctional porous catalysts, through the stepwise template removal and chemical functionalization of an interconnected silica framework. Selective removal of polystyrene nanosphere templates from a lyotropic liquid crystal-templated silica sol-gel matrix, followed by extraction of the liquid crystal template, affords a hierarchical macroporous-mesoporous architecture. Decoupling of the individual template extractions allows independent functionalization of macropore and mesopore networks on the basis of chemical and/or size specificity. Spatial compartmentalization of, and directed molecular transport between, chemical functionalities affords control over the reaction sequence in catalytic cascades; herein illustrated by the Pd/Pt-catalysed oxidation of cinnamyl alcohol to cinnamic acid. We anticipate that our methodology will prompt further design of multifunctional materials comprising spatially compartmentalized functions.

  7. Development of a High Angular Resolution Diffusion Imaging Human Brain Template

    PubMed Central

    Varentsova, Anna; Zhang, Shengwei; Arfanakis, Konstantinos

    2014-01-01

    Brain diffusion templates contain rich information about the microstructure of the brain, and are used as references in spatial normalization or in the development of brain atlases. The accuracy of diffusion templates constructed based on the diffusion tensor (DT) model is limited in regions with complex neuronal micro-architecture. High angular resolution diffusion imaging (HARDI) overcomes limitations of the DT model and is capable of resolving intravoxel heterogeneity. However, when HARDI is combined with multiple-shot sequences to minimize image artifacts, the scan time becomes inappropriate for human brain imaging. In this work, an artifact-free HARDI template of the human brain was developed from low angular resolution multiple-shot diffusion data. The resulting HARDI template was produced in ICBM-152 space based on Turboprop diffusion data, was shown to resolve complex neuronal micro-architecture in regions with intravoxel heterogeneity, and contained fiber orientation information consistent with known human brain anatomy. PMID:24440528

  8. Molecular replication

    NASA Technical Reports Server (NTRS)

    Orgel, L. E.

    1986-01-01

    The object of our research program is to understand how polynucleotide replication originated on the primitive Earth. This is a central issue in studies of the origins of life, since a process similar to modern DNA and RNA synthesis is likely to have formed the basis for the most primitive system of genetic information transfer. The major conclusion of studies so far is that a preformed polynucleotide template under many different experimental conditions will facilitate the synthesis of a new oligonucleotide with a sequence complementary to that of the template. It has been shown, for example, that poly(C) facilitates the synthesis of long oligo(G)s and that the short template CCGCC facilities the synthesis of its complement GGCGG. Very recently we have shown that template-directed synthesis is not limited to the standard oligonucleotide substrates. Nucleic acid-like molecules with a pyrophosphate group replacing the phosphate of the standard nucleic acid backbone are readily synthesized from deoxynucleotide 3'-5'-diphosphates on appropriate templates.

  9. Hierarchical Feature Extraction With Local Neural Response for Image Recognition.

    PubMed

    Li, Hong; Wei, Yantao; Li, Luoqing; Chen, C L P

    2013-04-01

    In this paper, a hierarchical feature extraction method is proposed for image recognition. The key idea of the proposed method is to extract an effective feature, called local neural response (LNR), of the input image with nontrivial discrimination and invariance properties by alternating between local coding and maximum pooling operation. The local coding, which is carried out on the locally linear manifold, can extract the salient feature of image patches and leads to a sparse measure matrix on which maximum pooling is carried out. The maximum pooling operation builds the translation invariance into the model. We also show that other invariant properties, such as rotation and scaling, can be induced by the proposed model. In addition, a template selection algorithm is presented to reduce computational complexity and to improve the discrimination ability of the LNR. Experimental results show that our method is robust to local distortion and clutter compared with state-of-the-art algorithms.

  10. Next generation sequencing technology: a powerful tool for the genome characterization of sugarcane mosaic virus from Sorghum almum

    USDA-ARS?s Scientific Manuscript database

    Next generation sequencing (NGS) technology was used to analyze the occurrence of viruses in Sorghum almum plants in Florida exhibiting mosaic symptoms. Total RNA was extracted from symptomatic leaves and used as a template for cDNA library preparation. The resulting library was sequenced on an Illu...

  11. Performance evaluation of a mitogenome capture and Illumina sequencing protocol using non-probative, case-type skeletal samples: Implications for the use of a positive control in a next-generation sequencing procedure.

    PubMed

    Marshall, Charla; Sturk-Andreaggi, Kimberly; Daniels-Higginbotham, Jennifer; Oliver, Robert Sean; Barritt-Ross, Suzanne; McMahon, Timothy P

    2017-11-01

    Next-generation ancient DNA technologies have the potential to assist in the analysis of degraded DNA extracted from forensic specimens. Mitochondrial genome (mitogenome) sequencing, specifically, may be of benefit to samples that fail to yield forensically relevant genetic information using conventional PCR-based techniques. This report summarizes the Armed Forces Medical Examiner System's Armed Forces DNA Identification Laboratory's (AFMES-AFDIL) performance evaluation of a Next-Generation Sequencing protocol for degraded and chemically treated past accounting samples. The procedure involves hybridization capture for targeted enrichment of mitochondrial DNA, massively parallel sequencing using Illumina chemistry, and an automated bioinformatic pipeline for forensic mtDNA profile generation. A total of 22 non-probative samples and associated controls were processed in the present study, spanning a range of DNA quantity and quality. Data were generated from over 100 DNA libraries by ten DNA analysts over the course of five months. The results show that the mitogenome sequencing procedure is reliable and robust, sensitive to low template (one ng control DNA) as well as degraded DNA, and specific to the analysis of the human mitogenome. Haplotypes were overall concordant between NGS replicates and with previously generated Sanger control region data. Due to the inherent risk for contamination when working with low-template, degraded DNA, a contamination assessment was performed. The consumables were shown to be void of human DNA contaminants and suitable for forensic use. Reagent blanks and negative controls were analyzed to determine the background signal of the procedure. This background signal was then used to set analytical and reporting thresholds, which were designated at 4.0X (limit of detection) and 10.0X (limit of quantiation) average coverage across the mitogenome, respectively. Nearly all human samples exceeded the reporting threshold, although coverage was reduced in chemically treated samples resulting in a ∼58% passing rate for these poor-quality samples. A concordance assessment demonstrated the reliability of the NGS data when compared to known Sanger profiles. One case sample was shown to be mixed with a co-processed sample and two reagent blanks indicated the presence of DNA above the analytical threshold. This contamination was attributed to sequencing crosstalk from simultaneously sequenced high-quality samples to include the positive control. Overall this study demonstrated that hybridization capture and Illumina sequencing provide a viable method for mitogenome sequencing of degraded and chemically treated skeletal DNA samples, yet may require alternative measures of quality control. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.

  12. Enhanced sensitivity for detection of low-level germline mosaic RB1 mutations in sporadic retinoblastoma cases using deep semiconductor sequencing.

    PubMed

    Chen, Zhao; Moran, Kimberly; Richards-Yutz, Jennifer; Toorens, Erik; Gerhart, Daniel; Ganguly, Tapan; Shields, Carol L; Ganguly, Arupa

    2014-03-01

    Sporadic retinoblastoma (RB) is caused by de novo mutations in the RB1 gene. Often, these mutations are present as mosaic mutations that cannot be detected by Sanger sequencing. Next-generation deep sequencing allows unambiguous detection of the mosaic mutations in lymphocyte DNA. Deep sequencing of the RB1 gene on lymphocyte DNA from 20 bilateral and 70 unilateral RB cases was performed, where Sanger sequencing excluded the presence of mutations. The individual exons of the RB1 gene from each sample were amplified, pooled, ligated to barcoded adapters, and sequenced using semiconductor sequencing on an Ion Torrent Personal Genome Machine. Six low-level mosaic mutations were identified in bilateral RB and four in unilateral RB cases. The incidence of low-level mosaic mutation was estimated to be 30% and 6%, respectively, in sporadic bilateral and unilateral RB cases, previously classified as mutation negative. The frequency of point mutations detectable in lymphocyte DNA increased from 96% to 97% for bilateral RB and from 13% to 18% for unilateral RB. The use of deep sequencing technology increased the sensitivity of the detection of low-level germline mosaic mutations in the RB1 gene. This finding has significant implications for improved clinical diagnosis, genetic counseling, surveillance, and management of RB. © 2013 WILEY PERIODICALS, INC.

  13. Spiking of contemporary human template DNA with ancient DNA extracts induces mutations under PCR and generates nonauthentic mitochondrial sequences.

    PubMed

    Pusch, Carsten M; Bachmann, Lutz

    2004-05-01

    Proof of authenticity is the greatest challenge in palaeogenetic research, and many safeguards have become standard routine in laboratories specialized on ancient DNA research. Here we describe an as-yet unknown source of artifacts that will require special attention in the future. We show that ancient DNA extracts on their own can have an inhibitory and mutagenic effect under PCR. We have spiked PCR reactions including known human test DNA with 14 selected ancient DNA extracts from human and nonhuman sources. We find that the ancient DNA extracts inhibit the amplification of large fragments to different degrees, suggesting that the usual control against contaminations, i.e., the absence of long amplifiable fragments, is not sufficient. But even more important, we find that the extracts induce mutations in a nonrandom fashion. We have amplified a 148-bp stretch of the mitochondrial HVRI from contemporary human template DNA in spiked PCR reactions. Subsequent analysis of 547 sequences from cloned amplicons revealed that the vast majority (76.97%) differed from the correct sequence by single nucleotide substitutions and/or indels. In total, 34 positions of a 103-bp alignment are affected, and most mutations occur repeatedly in independent PCR amplifications. Several of the induced mutations occur at positions that have previously been detected in studies of ancient hominid sequences, including the Neandertal sequences. Our data imply that PCR-induced mutations are likely to be an intrinsic and general problem of PCR amplifications of ancient templates. Therefore, ancient DNA sequences should be considered with caution, at least as long as the molecular basis for the extract-induced mutations is not understood.

  14. Registry in a tube: multiplexed pools of retrievable parts for genetic design space exploration.

    PubMed

    Woodruff, Lauren B A; Gorochowski, Thomas E; Roehner, Nicholas; Mikkelsen, Tarjei S; Densmore, Douglas; Gordon, D Benjamin; Nicol, Robert; Voigt, Christopher A

    2017-02-17

    Genetic designs can consist of dozens of genes and hundreds of genetic parts. After evaluating a design, it is desirable to implement changes without the cost and burden of starting the construction process from scratch. Here, we report a two-step process where a large design space is divided into deep pools of composite parts, from which individuals are retrieved and assembled to build a final construct. The pools are built via multiplexed assembly and sequenced using next-generation sequencing. Each pool consists of ∼20 Mb of up to 5000 unique and sequence-verified composite parts that are barcoded for retrieval by PCR. This approach is applied to a 16-gene nitrogen fixation pathway, which is broken into pools containing a total of 55 848 composite parts (71.0 Mb). The pools encompass an enormous design space (1043 possible 23 kb constructs), from which an algorithm-guided 192-member 4.5 Mb library is built. Next, all 1030 possible genetic circuits based on 10 repressors (NOR/NOT gates) are encoded in pools where each repressor is fused to all permutations of input promoters. These demonstrate that multiplexing can be applied to encompass entire design spaces from which individuals can be accessed and evaluated. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. Evaluation of Signature Erosion in Ebola Virus Due to Genomic Drift and Its Impact on the Performance of Diagnostic Assays

    PubMed Central

    Sozhamannan, Shanmuga; Holland, Mitchell Y.; Hall, Adrienne T.; Negrón, Daniel A.; Ivancich, Mychal; Koehler, Jeffrey W.; Minogue, Timothy D.; Campbell, Catherine E.; Berger, Walter J.; Christopher, George W.; Goodwin, Bruce G.; Smith, Michael A.

    2015-01-01

    Genome sequence analyses of the 2014 Ebola Virus (EBOV) isolates revealed a potential problem with the diagnostic assays currently in use; i.e., drifting genomic profiles of the virus may affect the sensitivity or even produce false-negative results. We evaluated signature erosion in ebolavirus molecular assays using an in silico approach and found frequent potential false-negative and false-positive results. We further empirically evaluated many EBOV assays, under real time PCR conditions using EBOV Kikwit (1995) and Makona (2014) RNA templates. These results revealed differences in performance between assays but were comparable between the old and new EBOV templates. Using a whole genome approach and a novel algorithm, termed BioVelocity, we identified new signatures that are unique to each of EBOV, Sudan virus (SUDV), and Reston virus (RESTV). Interestingly, many of the current assay signatures do not fall within these regions, indicating a potential drawback in the past assay design strategies. The new signatures identified in this study may be evaluated with real-time reverse transcription PCR (rRT-PCR) assay development and validation. In addition, we discuss regulatory implications and timely availability to impact a rapidly evolving outbreak using existing but perhaps less than optimal assays versus redesign these assays for addressing genomic changes. PMID:26090727

  16. Multiple template-based fluoroscopic tracking of lung tumor mass without implanted fiducial markers

    NASA Astrophysics Data System (ADS)

    Cui, Ying; Dy, Jennifer G.; Sharp, Gregory C.; Alexander, Brian; Jiang, Steve B.

    2007-10-01

    Precise lung tumor localization in real time is particularly important for some motion management techniques, such as respiratory gating or beam tracking with a dynamic multi-leaf collimator, due to the reduced clinical tumor volume (CTV) to planning target volume (PTV) margin and/or the escalated dose. There might be large uncertainties in deriving tumor position from external respiratory surrogates. While tracking implanted fiducial markers has sufficient accuracy, this procedure may not be widely accepted due to the risk of pneumothorax. Previously, we have developed a technique to generate gating signals from fluoroscopic images without implanted fiducial markers using a template matching method (Berbeco et al 2005 Phys. Med. Biol. 50 4481-90, Cui et al 2007 Phys. Med. Biol. 52 741-55). In this paper, we present an extension of this method to multiple-template matching for directly tracking the lung tumor mass in fluoroscopy video. The basic idea is as follows: (i) during the patient setup session, a pair of orthogonal fluoroscopic image sequences are taken and processed off-line to generate a set of reference templates that correspond to different breathing phases and tumor positions; (ii) during treatment delivery, fluoroscopic images are continuously acquired and processed; (iii) the similarity between each reference template and the processed incoming image is calculated; (iv) the tumor position in the incoming image is then estimated by combining the tumor centroid coordinates in reference templates with proper weights based on the measured similarities. With different handling of image processing and similarity calculation, two such multiple-template tracking techniques have been developed: one based on motion-enhanced templates and Pearson's correlation score while the other based on eigen templates and mean-squared error. The developed techniques have been tested on six sequences of fluoroscopic images from six lung cancer patients against the reference tumor positions manually determined by a radiation oncologist. The tumor centroid coordinates automatically detected using both methods agree well with the manually marked reference locations. The eigenspace tracking method performs slightly better than the motion-enhanced method, with average localization errors less than 2 pixels (1 mm) and the error at a 95% confidence level of about 2-4 pixels (1-2 mm). This work demonstrates the feasibility of direct tracking of a lung tumor mass in fluoroscopic images without implanted fiducial markers using multiple reference templates.

  17. Evolution of sequence-defined highly functionalized nucleic acid polymers

    NASA Astrophysics Data System (ADS)

    Chen, Zhen; Lichtor, Phillip A.; Berliner, Adrian P.; Chen, Jonathan C.; Liu, David R.

    2018-03-01

    The evolution of sequence-defined synthetic polymers made of building blocks beyond those compatible with polymerase enzymes or the ribosome has the potential to generate new classes of receptors, catalysts and materials. Here we describe a ligase-mediated DNA-templated polymerization and in vitro selection system to evolve highly functionalized nucleic acid polymers (HFNAPs) made from 32 building blocks that contain eight chemically diverse side chains on a DNA backbone. Through iterated cycles of polymer translation, selection and reverse translation, we discovered HFNAPs that bind proprotein convertase subtilisin/kexin type 9 (PCSK9) and interleukin-6, two protein targets implicated in human diseases. Mutation and reselection of an active PCSK9-binding polymer yielded evolved polymers with high affinity (KD = 3 nM). This evolved polymer potently inhibited the binding between PCSK9 and the low-density lipoprotein receptor. Structure-activity relationship studies revealed that specific side chains at defined positions in the polymers are required for binding to their respective targets. Our findings expand the chemical space of evolvable polymers to include densely functionalized nucleic acids with diverse, researcher-defined chemical repertoires.

  18. Integrated on-line system for DNA sequencing by capillary electrophoresis: From template to called bases

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ton, H.; Yeung, E.S.

    1997-02-15

    An integrated on-line prototype for coupling a microreactor to capillary electrophoresis for DNA sequencing has been demonstrated. A dye-labeled terminator cycle-sequencing reaction is performed in a fused-silica capillary. Subsequently, the sequencing ladder is directly injected into a size-exclusion chromatographic column operated at nearly 95{degree}C for purification. On-line injection to a capillary for electrophoresis is accomplished at a junction set at nearly 70{degree}C. High temperature at the purification column and injection junction prevents the renaturation of DNA fragments during on-line transfer without affecting the separation. The high solubility of DNA in and the relatively low ionic strength of 1 x TEmore » buffer permit both effective purification and electrokinetic injection of the DNA sample. The system is compatible with highly efficient separations by a replaceable poly(ethylene oxide) polymer solution in uncoated capillary tubes. Future automation and adaptation to a multiple-capillary array system should allow high-speed, high-throughput DNA sequencing from templates to called bases in one step. 32 refs., 5 figs.« less

  19. Selection of Optimal Polypurine Tract Region Sequences during Moloney Murine Leukemia Virus Replication

    PubMed Central

    Robson, Nicole D.; Telesnitsky, Alice

    2000-01-01

    Retrovirus plus-strand synthesis is primed by a cleavage remnant of the polypurine tract (PPT) region of viral RNA. In this study, we tested replication properties for Moloney murine leukemia viruses with targeted mutations in the PPT and in conserved sequences upstream, as well as for pools of mutants with randomized sequences in these regions. The importance of maintaining some purine residues within the PPT was indicated both by examining the evolution of random PPT pools and from the replication properties of targeted mutants. Although many different PPT sequences could support efficient replication and one mutant that contained two differences in the core PPT was found to replicate as well as the wild type, some sequences in the core PPT clearly conferred advantages over others. Contributions of sequences upstream of the core PPT were examined with deletion mutants. A conserved T-stretch within the upstream sequence was examined in detail and found to be unimportant to helper functions. Evolution of virus pools containing randomized T-stretch sequences demonstrated marked preference for the wild-type sequence in six of its eight positions. These findings demonstrate that maintenance of the T-rich element is more important to viral replication than is maintenance of the core PPT. PMID:11044073

  20. Habitat sequencing and the importance of discharge in inferences

    Treesearch

    Robert H. Hilderbrand; A. Dennis Lemly; C. Andrew Dolloff

    1999-01-01

    The authors constructed stream maps for a low-­gradient trout stream in southwestern Virginia during autumn (base flow) and spring (elevated flows) to compare spatial and temporal variation in stream habitats. Pool-riffle sequencing and total area occupied by pools and riffles changed substantially depending on the level of discharge: reduced discharge resulted in an...

  1. Functional annotation by sequence-weighted structure alignments: statistical analysis and case studies from the Protein 3000 structural genomics project in Japan.

    PubMed

    Standley, Daron M; Toh, Hiroyuki; Nakamura, Haruki

    2008-09-01

    A method to functionally annotate structural genomics targets, based on a novel structural alignment scoring function, is proposed. In the proposed score, position-specific scoring matrices are used to weight structurally aligned residue pairs to highlight evolutionarily conserved motifs. The functional form of the score is first optimized for discriminating domains belonging to the same Pfam family from domains belonging to different families but the same CATH or SCOP superfamily. In the optimization stage, we consider four standard weighting functions as well as our own, the "maximum substitution probability," and combinations of these functions. The optimized score achieves an area of 0.87 under the receiver-operating characteristic curve with respect to identifying Pfam families within a sequence-unique benchmark set of domain pairs. Confidence measures are then derived from the benchmark distribution of true-positive scores. The alignment method is next applied to the task of functionally annotating 230 query proteins released to the public as part of the Protein 3000 structural genomics project in Japan. Of these queries, 78 were found to align to templates with the same Pfam family as the query or had sequence identities > or = 30%. Another 49 queries were found to match more distantly related templates. Within this group, the template predicted by our method to be the closest functional relative was often not the most structurally similar. Several nontrivial cases are discussed in detail. Finally, 103 queries matched templates at the fold level, but not the family or superfamily level, and remain functionally uncharacterized. 2008 Wiley-Liss, Inc.

  2. Isolation and sequencing of Dashli virus, a novel Sicilian-like virus in sandflies from Iran; genetic and phylogenetic evidence for the creation of one novel species within the Phlebovirus genus in the Phenuiviridae family.

    PubMed

    Alkan, Cigdem; Moin Vaziri, Vahideh; Ayhan, Nazli; Badakhshan, Mehdi; Bichaud, Laurence; Rahbarian, Nourina; Javadian, Ezat-Aldin; Alten, Bulent; de Lamballerie, Xavier; Charrel, Remi N

    2017-12-01

    Phlebotomine sandflies are vectors of phleboviruses that cause sandfly fever or meningitis with significant implications for public health. Although several strains of these viruses had been isolated in Iran in the late 1970's, there was no recent data about the present situation at the outset of this study. Entomological investigations performed in 2009 and 2011 in Iran collected 4,770 sandflies from 10 different regions. Based on morphological identification, they were sorted into 315 pools according to species, sex, trapping station and date of capture. A phlebovirus, provisionally named Dashli virus (DASHV), was isolated from one pool of Sergentomyia spp, and subsequently DASHV RNA was detected in a second pool of Phlebotomus papatasi. Genetic and phylogenetic analyses based on complete coding genomic sequences indicated that (i) DASHV is most closely related to the Iranian isolates of Sandfly fever Sicilian virus [SFSV], (ii) there is a common ancestor to DASHV, Sandfly fever Sicilian- (SFS) and SFS-like viruses isolated in Italy, India, Turkey, and Cyprus (lineage I), (iii) DASHV is more distantly related with Corfou and Toros viruses (lineage II) although common ancestry is supported with 100% bootstrap, (iii) lineage I can be subdivided into sublineage Ia including all SFSV, SFCV and SFTV except those isolated in Iran which forms sublineage Ib (DASHV). Accordingly, we suggest to approve Sandfly fever Sicilian virus species consisting of the all aforementioned viruses. Owing that most of these viruses have been identified in human patients with febrile illness, DASHV should be considered as a potential human pathogen in Iran.

  3. Information transfer from peptide nucleic acids to RNA by template-directed syntheses

    NASA Technical Reports Server (NTRS)

    Schmidt, J. G.; Nielsen, P. E.; Orgel, L. E.; Bada, J. L. (Principal Investigator)

    1997-01-01

    Peptide nucleic acids (PNAs) are uncharged analogs of DNA and RNA in which the ribose-phosphate backbone is substituted by a backbone held together by amide bonds. PNAs are interesting as models of alternative genetic systems because they form potentially informational base paired helical structures. A PNA C10 oligomer has been shown to act as template for efficient formation of oligoguanylates from activated guanosine ribonucleotides. In a previous paper we used heterosequences of DNA as templates in sequence-dependent polymerization of PNA dimers. In this paper we show that information can be transferred from PNA to RNA. We describe the reactions of activated mononucleotides on heterosequences of PNA. Adenylic, cytidylic and guanylic acids were incorporated into the products opposite their complement on PNA, although less efficiently than on DNA templates.

  4. Co-evolving Physical and Biological Organization in Step-pool Channels: Experiments from a Restoration Reach on Wildcat Creek, California

    NASA Astrophysics Data System (ADS)

    Chin, A.; O'Dowd, A. P.; Mendez, P. K.; Velasco, K. Z.; Leventhal, R. D.; Storesund, R.; Laurencio, L. R.

    2014-12-01

    Step-pools are important features in fluvial systems. Through energy dissipation, step-pools provide stability in high-energy environments that otherwise may erode and degrade. Although research has focused on geomorphological aspects of step-pool channels, the ecological significance of step-pool streams is increasingly recognized. Step-pool streams often contain higher density and diversity of benthic macroinvertebrates and are critical habitats for organisms such as salmonids and tailed frogs. Step-pools are therefore increasingly used to restore eroding channels and improve ecological conditions. This paper addresses a restoration reach of Wildcat Creek in Berkeley, California that featured an installation of step-pools in 2012. The design framework recognized step-pool formation as a self-organizing process that produces a rhythmic morphology. After placing step particles at locations where step-pools are expected to form according to hydraulic theory, the self-organizing approach allowed fluvial processes to refine the rocks into adjusted sequences over time. In addition, a 30-meter "experimental" reach was created to explore the co-evolution of geomorphological and ecological characteristics. After constructing a plane bed channel, boulders and cobbles piled at the upstream end allowed natural flows to mobilize and sort them into step-pool sequences. Ground surveys and LiDAR recorded the development of step-pool sequences over several seasons. Concurrent sampling of benthic macroinvertebrates documented the formation of biological communities in conjunction with habitat. Biological sampling in an upstream reference reach provided a comparison with the restored reach over time. Results to date show an emergent step-pool channel with steps that segment the plane bed into initial step and pool habitats. Biological communities are beginning to form, showing more distinction among habitat types during some seasons, although they do not yet approach reference values at this stage of development. Research over longer timeframes is needed to reveal how biological and physical characteristics may co-organize toward an equilibrium landscape. Such integrated understanding will assist development of innovative restoration designs.

  5. Magic Pools: Parallel Assessment of Transposon Delivery Vectors in Bacteria

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Liu, Hualan; Price, Morgan N.; Waters, Robert Jordan

    Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach for discovering the functions of bacterial genes. However, the development of a suitable TnSeq strategy for a given bacterium can be costly and time-consuming. To meet this challenge, we describe a part-based strategy for constructing libraries of hundreds of transposon delivery vectors, which we term “magic pools.” Within a magic pool, each transposon vector has a different combination of upstream sequences (promoters and ribosome binding sites) and antibiotic resistance markers as well as a random DNA barcode sequence, which allows the tracking of each vector during mutagenesis experiments. Tomore » identify an efficient vector for a given bacterium, we mutagenize it with a magic pool and sequence the resulting insertions; we then use this efficient vector to generate a large mutant library. We used the magic pool strategy to construct transposon mutant libraries in five genera of bacteria, including three genera of the phylumBacteroidetes. IMPORTANCEMolecular genetics is indispensable for interrogating the physiology of bacteria. However, the development of a functional genetic system for any given bacterium can be time-consuming. Here, we present a streamlined approach for identifying an effective transposon mutagenesis system for a new bacterium. Our strategy first involves the construction of hundreds of different transposon vector variants, which we term a “magic pool.” The efficacy of each vector in a magic pool is monitored in parallel using a unique DNA barcode that is introduced into each vector design. Using archived DNA “parts,” we next reassemble an effective vector for making a whole-genome transposon mutant library that is suitable for large-scale interrogation of gene function using competitive growth assays. Here, we demonstrate the utility of the magic pool system to make mutant libraries in five genera of bacteria.« less

  6. Magic Pools: Parallel Assessment of Transposon Delivery Vectors in Bacteria

    DOE PAGES

    Liu, Hualan; Price, Morgan N.; Waters, Robert Jordan; ...

    2018-01-16

    Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach for discovering the functions of bacterial genes. However, the development of a suitable TnSeq strategy for a given bacterium can be costly and time-consuming. To meet this challenge, we describe a part-based strategy for constructing libraries of hundreds of transposon delivery vectors, which we term “magic pools.” Within a magic pool, each transposon vector has a different combination of upstream sequences (promoters and ribosome binding sites) and antibiotic resistance markers as well as a random DNA barcode sequence, which allows the tracking of each vector during mutagenesis experiments. Tomore » identify an efficient vector for a given bacterium, we mutagenize it with a magic pool and sequence the resulting insertions; we then use this efficient vector to generate a large mutant library. We used the magic pool strategy to construct transposon mutant libraries in five genera of bacteria, including three genera of the phylumBacteroidetes. IMPORTANCEMolecular genetics is indispensable for interrogating the physiology of bacteria. However, the development of a functional genetic system for any given bacterium can be time-consuming. Here, we present a streamlined approach for identifying an effective transposon mutagenesis system for a new bacterium. Our strategy first involves the construction of hundreds of different transposon vector variants, which we term a “magic pool.” The efficacy of each vector in a magic pool is monitored in parallel using a unique DNA barcode that is introduced into each vector design. Using archived DNA “parts,” we next reassemble an effective vector for making a whole-genome transposon mutant library that is suitable for large-scale interrogation of gene function using competitive growth assays. Here, we demonstrate the utility of the magic pool system to make mutant libraries in five genera of bacteria.« less

  7. Enrichment of individual KIR2DL4 sequences from genomic DNA using long-template PCR and allele-specific hybridization to magnetic bead-bound oligonucleotide probes.

    PubMed

    Roberts, C H; Turino, C; Madrigal, J A; Marsh, S G E

    2007-06-01

    DNA enrichment by allele-specific hybridization (DEASH) was used as a means to isolate individual alleles of the killer cell immunoglobulin-like receptor (KIR2DL4) gene from heterozygous genomic DNA. Using long-template polymerase chain reaction (LT-PCR), the complete KIR2DL4 gene was amplified from a cell line that had previously been characterized for its KIR gene content by PCR using sequence-specific primers (PCR-SSP). The whole gene amplicons were sequenced and we identified two heterozygous positions in accordance with the predictions of the PCR-SSP. The amplicons were then hybridized to allele-specific, biotinylated oligonucleotide probes and through binding to streptavidin-coated beads, the targeted alleles were enriched. A second PCR amplified only the exonic regions of the enriched allele, and these were then sequenced in full. We show DEASH to be capable of enriching single alleles from a heterozygous PCR product, and through sequencing the enriched DNA, we are able to produce complete coding sequences of the KIR2DL4 alleles in accordance with the typing predicted by PCR-SSP.

  8. Formation of oligonucleotide-PNA-chimeras by template-directed ligation

    NASA Technical Reports Server (NTRS)

    Koppitz, M.; Nielsen, P. E.; Orgel, L. E.; Bada, J. L. (Principal Investigator)

    1998-01-01

    DNA sequences have previously been reported to act as templates for the synthesis of PNA, and vice versa. A continuous evolutionary transition from an informational replicating system based on one polymer to a system based on the other would be facilitated if it were possible to form chimeras, that is molecules that contain monomers of both types. Here we show that ligation to form chimeras proceeds efficiently both on PNA and on DNA templates. The efficiency of ligation is primarily determined by the number of backbone bonds at the ligation site and the relative orientation of template and substrate strands. The most efficient reactions result in the formation of chimeras with ligation junctions resembling the structures of the backbones of PNA and DNA and with antiparallel alignment of both components of the chimera with the template, that is, ligations involving formation of 3'-phosphoramidate and 5'-ester bonds. However, double helices involving PNA are stable both with antiparallel and parallel orientation of the two strands. Ligation on PNA but not on DNA templates is, therefore, sometimes possible on templates with reversed orientation. The relevance of these findings to discussions of possible transitions between genetic systems is discussed.

  9. Fluorogenic DNA Sequencing in PDMS Microreactors

    PubMed Central

    Sims, Peter A.; Greenleaf, William J.; Duan, Haifeng; Xie, X. Sunney

    2012-01-01

    We have developed a multiplex sequencing-by-synthesis method combining terminal-phosphate labeled fluorogenic nucleotides (TPLFNs) and resealable microreactors. In the presence of phosphatase, the incorporation of a non-fluorescent TPLFN into a DNA primer by DNA polymerase results in a fluorophore. We immobilize DNA templates within polydimethylsiloxane (PDMS) microreactors, sequentially introduce one of the four identically labeled TPLFNs, seal the microreactors, allow template-directed TPLFN incorporation, and measure the signal from the fluorophores trapped in the microreactors. This workflow allows sequencing in a manner akin to pyrosequencing but without constant monitoring of each microreactor. With cycle times of <10 minutes, we demonstrate 30 base reads with ∼99% raw accuracy. “Fluorogenic pyrosequencing” combines benefits of pyrosequencing, such as rapid turn-around, native DNA generation, and single-color detection, with benefits of fluorescence-based approaches, such as highly sensitive detection and simple parallelization. PMID:21666670

  10. New insights into the promoterless transcription of DNA coligo templates by RNA polymerase III.

    PubMed

    Lama, Lodoe; Seidl, Christine I; Ryan, Kevin

    2014-01-01

    Chemically synthesized DNA can carry small RNA sequence information but converting that information into small RNA is generally thought to require large double-stranded promoters in the context of plasmids, viruses and genes. We previously found evidence that circularized oligodeoxynucleotides (coligos) containing certain sequences and secondary structures can template the synthesis of small RNA by RNA polymerase III in vitro and in human cells. By using immunoprecipitated RNA polymerase III we now report corroborating evidence that this enzyme is the sole polymerase responsible for coligo transcription. The immobilized polymerase enabled experiments showing that coligo transcripts can be formed through transcription termination without subsequent 3' end trimming. To better define the determinants of productive transcription, a structure-activity relationship study was performed using over 20 new coligos. The results show that unpaired nucleotides in the coligo stem facilitate circumtranscription, but also that internal loops and bulges should be kept small to avoid secondary transcription initiation sites. A polymerase termination sequence embedded in the double-stranded region of a hairpin-encoding coligo stem can antagonize transcription. Using lessons learned from new and old coligos, we demonstrate how to convert poorly transcribed coligos into productive templates. Our findings support the possibility that coligos may prove useful as chemically synthesized vectors for the ectopic expression of small RNA in human cells.

  11. ORION: a web server for protein fold recognition and structure prediction using evolutionary hybrid profiles

    PubMed Central

    Ghouzam, Yassine; Postic, Guillaume; Guerin, Pierre-Edouard; de Brevern, Alexandre G.; Gelly, Jean-Christophe

    2016-01-01

    Protein structure prediction based on comparative modeling is the most efficient way to produce structural models when it can be performed. ORION is a dedicated webserver based on a new strategy that performs this task. The identification by ORION of suitable templates is performed using an original profile-profile approach that combines sequence and structure evolution information. Structure evolution information is encoded into profiles using structural features, such as solvent accessibility and local conformation —with Protein Blocks—, which give an accurate description of the local protein structure. ORION has recently been improved, increasing by 5% the quality of its results. The ORION web server accepts a single protein sequence as input and searches homologous protein structures within minutes. Various databases such as PDB, SCOP and HOMSTRAD can be mined to find an appropriate structural template. For the modeling step, a protein 3D structure can be directly obtained from the selected template by MODELLER and displayed with global and local quality model estimation measures. The sequence and the predicted structure of 4 examples from the CAMEO server and a recent CASP11 target from the ‘Hard’ category (T0818-D1) are shown as pertinent examples. Our web server is accessible at http://www.dsimb.inserm.fr/ORION/. PMID:27319297

  12. ORION: a web server for protein fold recognition and structure prediction using evolutionary hybrid profiles.

    PubMed

    Ghouzam, Yassine; Postic, Guillaume; Guerin, Pierre-Edouard; de Brevern, Alexandre G; Gelly, Jean-Christophe

    2016-06-20

    Protein structure prediction based on comparative modeling is the most efficient way to produce structural models when it can be performed. ORION is a dedicated webserver based on a new strategy that performs this task. The identification by ORION of suitable templates is performed using an original profile-profile approach that combines sequence and structure evolution information. Structure evolution information is encoded into profiles using structural features, such as solvent accessibility and local conformation -with Protein Blocks-, which give an accurate description of the local protein structure. ORION has recently been improved, increasing by 5% the quality of its results. The ORION web server accepts a single protein sequence as input and searches homologous protein structures within minutes. Various databases such as PDB, SCOP and HOMSTRAD can be mined to find an appropriate structural template. For the modeling step, a protein 3D structure can be directly obtained from the selected template by MODELLER and displayed with global and local quality model estimation measures. The sequence and the predicted structure of 4 examples from the CAMEO server and a recent CASP11 target from the 'Hard' category (T0818-D1) are shown as pertinent examples. Our web server is accessible at http://www.dsimb.inserm.fr/ORION/.

  13. New insights into phosphorus management in agriculture--A crop rotation approach.

    PubMed

    Łukowiak, Remigiusz; Grzebisz, Witold; Sassenrath, Gretchen F

    2016-01-15

    This manuscript presents research results examining phosphorus (P) management in a soil–plant system for three variables: i) internal resources of soil available phosphorus, ii) cropping sequence, and iii) external input of phosphorus (manure, fertilizers). The research was conducted in long-term cropping sequences with oilseed rape (10 rotations) and maize (six rotations) over three consecutive growing seasons (2004/2005, 2005/2006, and 2006/2007) in a production farm on soils originated from Albic Luvisols in Poland. The soil available phosphorus pool, measured as calcium chloride extractable P (CCE-P), constituted 28% to 67% of the total phosphorus input (PTI) to the soil–plant system in the spring. Oilseed rape and maize dominant cropping sequences showed a significant potential to utilize the CCE-P pool within the soil profile. Cropping sequences containing oilseed rape significantly affected the CCE-P pool, and in turn contributed to the P(TI). The P(TI) uptake use efficiency was 50% on average. Therefore, the CCE-P pool should be taken into account as an important component of a sound and reliable phosphorus balance. The instability of the yield prediction, based on the P(TI), was mainly due to an imbalanced management of both farmyard manure and phosphorus fertilizer. Oilseed rape plants provide a significant positive impact on the CCE-P pool after harvest, improving the productive stability of the entire cropping sequence. This phenomenon was documented by the P(TI) increase during wheat cultivation following oilseed rape. The Unit Phosphorus Uptake index also showed a higher stability in oilseed rape cropping systems compared to rotations based on maize. Cropping sequences are a primary factor impacting phosphorus management. Judicious implementation of crop rotations can improve soil P resources, efficiency of crop P use, and crop yield and yield stability. Use of cropping sequences can reduce the need for external P sources such as farmyard manure and chemical fertilizers.

  14. A Template-Based Protein Structure Reconstruction Method Using Deep Autoencoder Learning.

    PubMed

    Li, Haiou; Lyu, Qiang; Cheng, Jianlin

    2016-12-01

    Protein structure prediction is an important problem in computational biology, and is widely applied to various biomedical problems such as protein function study, protein design, and drug design. In this work, we developed a novel deep learning approach based on a deeply stacked denoising autoencoder for protein structure reconstruction. We applied our approach to a template-based protein structure prediction using only the 3D structural coordinates of homologous template proteins as input. The templates were identified for a target protein by a PSI-BLAST search. 3DRobot (a program that automatically generates diverse and well-packed protein structure decoys) was used to generate initial decoy models for the target from the templates. A stacked denoising autoencoder was trained on the decoys to obtain a deep learning model for the target protein. The trained deep model was then used to reconstruct the final structural model for the target sequence. With target proteins that have highly similar template proteins as benchmarks, the GDT-TS score of the predicted structures is greater than 0.7, suggesting that the deep autoencoder is a promising method for protein structure reconstruction.

  15. MetaGO: Predicting Gene Ontology of Non-homologous Proteins Through Low-Resolution Protein Structure Prediction and Protein-Protein Network Mapping.

    PubMed

    Zhang, Chengxin; Zheng, Wei; Freddolino, Peter L; Zhang, Yang

    2018-03-10

    Homology-based transferal remains the major approach to computational protein function annotations, but it becomes increasingly unreliable when the sequence identity between query and template decreases below 30%. We propose a novel pipeline, MetaGO, to deduce Gene Ontology attributes of proteins by combining sequence homology-based annotation with low-resolution structure prediction and comparison, and partner's homology-based protein-protein network mapping. The pipeline was tested on a large-scale set of 1000 non-redundant proteins from the CAFA3 experiment. Under the stringent benchmark conditions where templates with >30% sequence identity to the query are excluded, MetaGO achieves average F-measures of 0.487, 0.408, and 0.598, for Molecular Function, Biological Process, and Cellular Component, respectively, which are significantly higher than those achieved by other state-of-the-art function annotations methods. Detailed data analysis shows that the major advantage of the MetaGO lies in the new functional homolog detections from partner's homology-based network mapping and structure-based local and global structure alignments, the confidence scores of which can be optimally combined through logistic regression. These data demonstrate the power of using a hybrid model incorporating protein structure and interaction networks to deduce new functional insights beyond traditional sequence homology-based referrals, especially for proteins that lack homologous function templates. The MetaGO pipeline is available at http://zhanglab.ccmb.med.umich.edu/MetaGO/. Copyright © 2018. Published by Elsevier Ltd.

  16. A general method to eliminate laboratory induced recombinants during massive, parallel sequencing of cDNA library.

    PubMed

    Waugh, Caryll; Cromer, Deborah; Grimm, Andrew; Chopra, Abha; Mallal, Simon; Davenport, Miles; Mak, Johnson

    2015-04-09

    Massive, parallel sequencing is a potent tool for dissecting the regulation of biological processes by revealing the dynamics of the cellular RNA profile under different conditions. Similarly, massive, parallel sequencing can be used to reveal the complexity of viral quasispecies that are often found in the RNA virus infected host. However, the production of cDNA libraries for next-generation sequencing (NGS) necessitates the reverse transcription of RNA into cDNA and the amplification of the cDNA template using PCR, which may introduce artefact in the form of phantom nucleic acids species that can bias the composition and interpretation of original RNA profiles. Using HIV as a model we have characterised the major sources of error during the conversion of viral RNA to cDNA, namely excess RNA template and the RNaseH activity of the polymerase enzyme, reverse transcriptase. In addition we have analysed the effect of PCR cycle on detection of recombinants and assessed the contribution of transfection of highly similar plasmid DNA to the formation of recombinant species during the production of our control viruses. We have identified RNA template concentrations, RNaseH activity of reverse transcriptase, and PCR conditions as key parameters that must be carefully optimised to minimise chimeric artefacts. Using our optimised RT-PCR conditions, in combination with our modified PCR amplification procedure, we have developed a reliable technique for accurate determination of RNA species using NGS technology.

  17. Functional Diversity of Haloacid Dehalogenase Superfamily Phosphatases from Saccharomyces cerevisiae: BIOCHEMICAL, STRUCTURAL, AND EVOLUTIONARY INSIGHTS.

    PubMed

    Kuznetsova, Ekaterina; Nocek, Boguslaw; Brown, Greg; Makarova, Kira S; Flick, Robert; Wolf, Yuri I; Khusnutdinova, Anna; Evdokimova, Elena; Jin, Ke; Tan, Kemin; Hanson, Andrew D; Hasnain, Ghulam; Zallot, Rémi; de Crécy-Lagard, Valérie; Babu, Mohan; Savchenko, Alexei; Joachimiak, Andrzej; Edwards, Aled M; Koonin, Eugene V; Yakunin, Alexander F

    2015-07-24

    The haloacid dehalogenase (HAD)-like enzymes comprise a large superfamily of phosphohydrolases present in all organisms. The Saccharomyces cerevisiae genome encodes at least 19 soluble HADs, including 10 uncharacterized proteins. Here, we biochemically characterized 13 yeast phosphatases from the HAD superfamily, which includes both specific and promiscuous enzymes active against various phosphorylated metabolites and peptides with several HADs implicated in detoxification of phosphorylated compounds and pseudouridine. The crystal structures of four yeast HADs provided insight into their active sites, whereas the structure of the YKR070W dimer in complex with substrate revealed a composite substrate-binding site. Although the S. cerevisiae and Escherichia coli HADs share low sequence similarities, the comparison of their substrate profiles revealed seven phosphatases with common preferred substrates. The cluster of secondary substrates supporting significant activity of both S. cerevisiae and E. coli HADs includes 28 common metabolites that appear to represent the pool of potential activities for the evolution of novel HAD phosphatases. Evolution of novel substrate specificities of HAD phosphatases shows no strict correlation with sequence divergence. Thus, evolution of the HAD superfamily combines the conservation of the overall substrate pool and the substrate profiles of some enzymes with remarkable biochemical and structural flexibility of other superfamily members. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.

  18. Ambient groundwater flow diminishes nitrogen cycling in streams

    NASA Astrophysics Data System (ADS)

    Azizian, M.; Grant, S. B.; Rippy, M.; Detwiler, R. L.; Boano, F.; Cook, P. L. M.

    2017-12-01

    Modeling and experimental studies demonstrate that ambient groundwater reduces hyporheic exchange, but the implications of this observation for stream N-cycling is not yet clear. We utilized a simple process-based model (the Pumping and Streamline Segregation or PASS model) to evaluate N- cycling over two scales of hyporheic exchange (fluvial ripples and riffle-pool sequences), ten ambient groundwater and stream flow scenarios (five gaining and losing conditions and two stream discharges), and three biogeochemical settings (identified based on a principal component analysis of previously published measurements in streams throughout the United States). Model-data comparisons indicate that our model provides realistic estimates for direct denitrification of stream nitrate, but overpredicts nitrification and coupled nitrification-denitrification. Riffle-pool sequences are responsible for most of the N-processing, despite the fact that fluvial ripples generate 3-11 times more hyporheic exchange flux. Across all scenarios, hyporheic exchange flux and the Damkohler Number emerge as primary controls on stream N-cycling; the former regulates trafficking of nutrients and oxygen across the sediment-water interface, while the latter quantifies the relative rates of organic carbon mineralization and advective transport in streambed sediments. Vertical groundwater flux modulates both of these master variables in ways that tend to diminish stream N-cycling. Thus, anthropogenic perturbations of ambient groundwater flows (e.g., by urbanization, agricultural activities, groundwater mining, and/or climate change) may compromise some of the key ecosystem services provided by streams.

  19. Novel microsatellite DNA markers indicate strict parthenogenesis and few genotypes in the invasive willow sawfly Nematus oligospilus.

    PubMed

    Caron, V; Norgate, M; Ede, F J; Nyman, T; Sunnucks, P

    2013-02-01

    Invasive organisms can have major impacts on the environment. Some invasive organisms are parthenogenetic in their invasive range and, therefore, exist as a number of asexual lineages (=clones). Determining the reproductive mode of invasive species has important implications for understanding the evolutionary genetics of such species, more especially, for management-relevant traits. The willow sawfly Nematus oligospilus Förster (Hymenoptera: Tenthredinidae) has been introduced unintentionally into several countries in the Southern Hemisphere where it has subsequently become invasive. To assess the population expansion, reproductive mode and host-plant relationships of this insect, microsatellite markers were developed and applied to natural populations sampled from the native and expanded range, along with sequencing of the cytochrome-oxidase I mitochondrial DNA (mtDNA) region. Other tenthredinids across a spectrum of taxonomic similarity to N. oligospilus and having a range of life strategies were also tested. Strict parthenogenesis was apparent within invasive N. oligospilus populations throughout the Southern Hemisphere, which comprised only a small number of genotypes. Sequences of mtDNA were identical for all individuals tested in the invasive range. The microsatellite markers were used successfully in several sawfly species, especially Nematus spp. and other genera of the Nematini tribe, with the degree of success inversely related to genetic divergence as estimated from COI sequences. The confirmation of parthenogenetic reproduction in N. oligospilus and the fact that it has a very limited pool of genotypes have important implications for understanding and managing this species and its biology, including in terms of phenotypic diversity, host relationships, implications for spread and future adaptive change. It would appear to be an excellent model study system for understanding evolution of invasive parthenogens that diverge without sexual reproduction and genetic recombination.

  20. Strain-specific and pooled genome sequences for populations of Drosophila melanogaster from three continents.

    PubMed Central

    Bergman, Casey M.; Haddrill, Penelope R.

    2015-01-01

    To contribute to our general understanding of the evolutionary forces that shape variation in genome sequences in nature, we have sequenced genomes from 50 isofemale lines and six pooled samples from populations of Drosophila melanogaster on three continents. Analysis of raw and reference-mapped reads indicates the quality of these genomic sequence data is very high. Comparison of the predicted and experimentally-determined Wolbachia infection status of these samples suggests that strain or sample swaps are unlikely to have occurred in the generation of these data. Genome sequences are freely available in the European Nucleotide Archive under accession ERP009059. Isofemale lines can be obtained from the Drosophila Species Stock Center. PMID:25717372

  1. Strain-specific and pooled genome sequences for populations of Drosophila melanogaster from three continents.

    PubMed

    Bergman, Casey M; Haddrill, Penelope R

    2015-01-01

    To contribute to our general understanding of the evolutionary forces that shape variation in genome sequences in nature, we have sequenced genomes from 50 isofemale lines and six pooled samples from populations of Drosophila melanogaster on three continents. Analysis of raw and reference-mapped reads indicates the quality of these genomic sequence data is very high. Comparison of the predicted and experimentally-determined Wolbachia infection status of these samples suggests that strain or sample swaps are unlikely to have occurred in the generation of these data. Genome sequences are freely available in the European Nucleotide Archive under accession ERP009059. Isofemale lines can be obtained from the Drosophila Species Stock Center.

  2. Costing nursing education programs. It's as easy as 1-2-3.

    PubMed

    Fisher, M L; Hume, R; Emerick, R

    1998-01-01

    Staff development departments are pressured to reveal the costs of their educational programs and to compete with outside vendors for programming. The process of implementing a spreadsheet template for costing out staff development programs is described. The template is easy to use and supports "what if" analysis. This model allows educators to evaluate cost implications of curricular decisions and to better negotiate with internal and external customers.

  3. Extension of base mispairs by Taq DNA polymerase: implications for single nucleotide discrimination in PCR.

    PubMed Central

    Huang, M M; Arnheim, N; Goodman, M F

    1992-01-01

    Thermus aquaticus (Taq) DNA polymerase was used to measure the extension efficiency for all configurations of matched and mismatched base pairs at template-primer 3'-termini. The transition mispairs, A(primer).C, C.A, G.T, and T.G were extended 10(-3) to 10(-4)-fold less efficiently than their correctly paired counterparts. Relative efficiencies for extending transversion mispairs were 10(-4) to 10(-5) for T.C and T.T, about 10(-6) for A.A, and less than 10(-6) for G.A, A.G, G.G and C.C. The transversion mispair C(primer).T was extended with high efficiency, about 10(-2) compared to a correct A.T basepair. The unexpected ease of extending the C.T mismatch was not likely to have been caused by primer-template misalignment. Taq polymerase was observed to bind with similar affinities to each of the correctly paired and mispaired primer-template 3'-ends. Thus, the failure of Taq polymerase to extend mismatches efficiently appears to be an intrinsic property of the enzyme and not due to an inability to bind to 3'-terminal mispairs. For almost all of the mispairs, C.T being the exception, Taq polymerase exhibits about 100 to 1000-fold greater discrimination against mismatch extension compared to avian myeloblastosis reverse transcriptase and HIV-1 reverse transcriptase which extend most mismatched basepairs permissively. Relative mismatch extension efficiencies for Taq polymerase were measured at 45 degrees C, 55 degrees C and 70 degrees C and found to be independent of temperature. The mispair extension data should be important in designing experiments using PCR to distinguish between sequences that vary by a single nucleotide. Images PMID:1408758

  4. Template DNA-strand co-segregation and asymmetric cell division in skeletal muscle stem cells.

    PubMed

    Shinin, Vasily; Gayraud-Morel, Barbara; Tajbakhsh, Shahragim

    2009-01-01

    Stem cells are present in all tissues and organs, and are crucial for normal regulated growth. How the pool size of stem cells and their progeny is regulated to establish the tissue prenatally, then maintain it throughout life, is a key question in biology and medicine. The ability to precisely locate stem and progenitors requires defining lineage progression from stem to differentiated cells, assessing the mode of cell expansion and self-renewal and identifying markers to assess the different cell states within the lineage. We have shown that during lineage progression from a quiescent adult muscle satellite cell to a differentiated myofibre, both symmetric and asymmetric divisions take place. Furthermore, we provide evidence that a sub-population of label retaining satellite cells co-segregate template DNA strands to one daughter cell. These findings provide a means of identifying presumed stem and progenitor cells within the lineage. In addition, asymmetric segregation of template DNA and the cytoplasmic protein Numb provides a landmark to define cell behaviour as self-renewal and differentiation decisions are being executed.

  5. Stroke Treatment Academic Industry Roundtable Recommendations for Individual Data Pooling Analyses in Stroke.

    PubMed

    Lees, Kennedy R; Khatri, Pooja

    2016-08-01

    Pooled analysis of individual patient data from stroke trials can deliver more precise estimates of treatment effect, enhance power to examine prespecified subgroups, and facilitate exploration of treatment-modifying influences. Analysis plans should be declared, and preferably published, before trial results are known. For pooling trials that used diverse analytic approaches, an ordinal analysis is favored, with justification for considering deaths and severe disability jointly. Because trial pooling is an incremental process, analyses should follow a sequential approach, with statistical adjustment for iterations. Updated analyses should be published when revised conclusions have a clinical implication. However, caution is recommended in declaring pooled findings that may prejudice ongoing trials, unless clinical implications are compelling. All contributing trial teams should contribute to leadership, data verification, and authorship of pooled analyses. Development work is needed to enable reliable inferences to be drawn about individual drug or device effects that contribute to a pooled analysis, versus a class effect, if the treatment strategy combines ≥2 such drugs or devices. Despite the practical challenges, pooled analyses are powerful and essential tools in interpreting clinical trial findings and advancing clinical care. © 2016 American Heart Association, Inc.

  6. Development of a high angular resolution diffusion imaging human brain template.

    PubMed

    Varentsova, Anna; Zhang, Shengwei; Arfanakis, Konstantinos

    2014-05-01

    Brain diffusion templates contain rich information about the microstructure of the brain, and are used as references in spatial normalization or in the development of brain atlases. The accuracy of diffusion templates constructed based on the diffusion tensor (DT) model is limited in regions with complex neuronal micro-architecture. High angular resolution diffusion imaging (HARDI) overcomes limitations of the DT model and is capable of resolving intravoxel heterogeneity. However, when HARDI is combined with multiple-shot sequences to minimize image artifacts, the scan time becomes inappropriate for human brain imaging. In this work, an artifact-free HARDI template of the human brain was developed from low angular resolution multiple-shot diffusion data. The resulting HARDI template was produced in ICBM-152 space based on Turboprop diffusion data, was shown to resolve complex neuronal micro-architecture in regions with intravoxel heterogeneity, and contained fiber orientation information consistent with known human brain anatomy. Copyright © 2014 Elsevier Inc. All rights reserved.

  7. A viscous solvent enables information transfer from gene-length nucleic acids in a model prebiotic replication cycle

    NASA Astrophysics Data System (ADS)

    He, Christine; Gállego, Isaac; Laughlin, Brandon; Grover, Martha A.; Hud, Nicholas V.

    2017-04-01

    Many hypotheses concerning the nature of early life assume that genetic information was once transferred through the template-directed synthesis of RNA, before the emergence of coded enzymes. However, attempts to demonstrate enzyme-free, template-directed synthesis of nucleic acids have been limited by 'strand inhibition', whereby transferring information from a template strand in the presence of its complementary strand is inhibited by the stability of the template duplex. Here, we use solvent viscosity to circumvent strand inhibition, demonstrating information transfer from a gene-length template (>300 nt) within a longer (545 bp or 3 kb) duplex. These results suggest that viscous environments on the prebiotic Earth, generated periodically by water evaporation, could have facilitated nucleic acid replication—particularly of long, structured sequences such as ribozymes. Our approach works with DNA and RNA, suggesting that viscosity-mediated replication is possible for a range of genetic polymers, perhaps even for informational polymers that may have preceded RNA.

  8. Development of 7TM receptor-ligand complex models using ligand-biased, semi-empirical helix-bundle repacking in torsion space: application to the agonist interaction of the human dopamine D2 receptor.

    PubMed

    Malo, Marcus; Persson, Ronnie; Svensson, Peder; Luthman, Kristina; Brive, Lars

    2013-03-01

    Prediction of 3D structures of membrane proteins, and of G-protein coupled receptors (GPCRs) in particular, is motivated by their importance in biological systems and the difficulties associated with experimental structure determination. In the present study, a novel method for the prediction of 3D structures of the membrane-embedded region of helical membrane proteins is presented. A large pool of candidate models are produced by repacking of the helices of a homology model using Monte Carlo sampling in torsion space, followed by ranking based on their geometric and ligand-binding properties. The trajectory is directed by weak initial restraints to orient helices towards the original model to improve computation efficiency, and by a ligand to guide the receptor towards a chosen conformational state. The method was validated by construction of the β1 adrenergic receptor model in complex with (S)-cyanopindolol using bovine rhodopsin as template. In addition, models of the dopamine D2 receptor were produced with the selective and rigid agonist (R)-N-propylapomorphine ((R)-NPA) present. A second quality assessment was implemented by evaluating the results from docking of a library of 29 ligands with known activity, which further discriminated between receptor models. Agonist binding and recognition by the dopamine D2 receptor is interpreted using the 3D structure model resulting from the approach. This method has a potential for modeling of all types of helical transmembrane proteins for which a structural template with sequence homology sufficient for homology modeling is not available or is in an incorrect conformational state, but for which sufficient empirical information is accessible.

  9. Combined Use of 16S Ribosomal DNA and 16S rRNA To Study the Bacterial Community of Polychlorinated Biphenyl-Polluted Soil

    PubMed Central

    Nogales, Balbina; Moore, Edward R. B.; Llobet-Brossa, Enrique; Rossello-Mora, Ramon; Amann, Rudolf; Timmis, Kenneth N.

    2001-01-01

    The bacterial diversity assessed from clone libraries prepared from rRNA (two libraries) and ribosomal DNA (rDNA) (one library) from polychlorinated biphenyl (PCB)-polluted soil has been analyzed. A good correspondence of the community composition found in the two types of library was observed. Nearly 29% of the cloned sequences in the rDNA library were identical to sequences in the rRNA libraries. More than 60% of the total cloned sequence types analyzed were grouped in phylogenetic groups (a clone group with sequence similarity higher than 97% [98% for Burkholderia and Pseudomonas-type clones]) represented in both types of libraries. Some of those phylogenetic groups, mostly represented by a single (or pair) of cloned sequence type(s), were observed in only one of the types of library. An important difference between the libraries was the lack of clones representative of the Actinobacteria in the rDNA library. The PCB-polluted soil exhibited a high bacterial diversity which included representatives of two novel lineages. The apparent abundance of bacteria affiliated to the beta-subclass of the Proteobacteria, and to the genus Burkholderia in particular, was confirmed by fluorescence in situ hybridization analysis. The possible influence on apparent diversity of low template concentrations was assessed by dilution of the RNA template prior to amplification by reverse transcription-PCR. Although differences in the composition of the two rRNA libraries obtained from high and low RNA concentrations were observed, the main components of the bacterial community were represented in both libraries, and therefore their detection was not compromised by the lower concentrations of template used in this study. PMID:11282645

  10. Parallel tagged next-generation sequencing on pooled samples - a new approach for population genetics in ecology and conservation.

    PubMed

    Zavodna, Monika; Grueber, Catherine E; Gemmell, Neil J

    2013-01-01

    Next-generation sequencing (NGS) on pooled samples has already been broadly applied in human medical diagnostics and plant and animal breeding. However, thus far it has been only sparingly employed in ecology and conservation, where it may serve as a useful diagnostic tool for rapid assessment of species genetic diversity and structure at the population level. Here we undertake a comprehensive evaluation of the accuracy, practicality and limitations of parallel tagged amplicon NGS on pooled population samples for estimating species population diversity and structure. We obtained 16S and Cyt b data from 20 populations of Leiopelma hochstetteri, a frog species of conservation concern in New Zealand, using two approaches - parallel tagged NGS on pooled population samples and individual Sanger sequenced samples. Data from each approach were then used to estimate two standard population genetic parameters, nucleotide diversity (π) and population differentiation (FST), that enable population genetic inference in a species conservation context. We found a positive correlation between our two approaches for population genetic estimates, showing that the pooled population NGS approach is a reliable, rapid and appropriate method for population genetic inference in an ecological and conservation context. Our experimental design also allowed us to identify both the strengths and weaknesses of the pooled population NGS approach and outline some guidelines and suggestions that might be considered when planning future projects.

  11. Flow structure through pool-riffle sequences and a conceptual model for their sustainability in gravel-bed rivers

    Treesearch

    D. Caamano; P. Goodwin; J. M. Buffington

    2010-01-01

    Detailed field measurements and simulations of three-dimensional flow structure were used to develop a conceptual model to explain the sustainability of self-formed pool-riffle sequences in gravel-bed rivers. The analysis was conducted at the Red River Wildlife Management Area in Idaho, USA, and enabled characterization of the flow structure through two consecutive...

  12. Introduced Scotch broom (Cytisus scoparius) invades the genome of native populations in vulnerable heathland habitats.

    PubMed

    Rostgaard Nielsen, Lene; Brandes, Ursula; Dahl Kjaer, Erik; Fjellheim, Siri

    2016-06-01

    Cytisus scoparius is a global invasive species that affects local flora and fauna at the intercontinental level. Its natural distribution spans across Europe, but seeds have also been moved among countries, mixing plants of native and non-native genetic origins. Hybridization between the introduced and native gene pool is likely to threaten both the native gene pool and the local flora. In this study, we address the potential threat of invasive C. scoparius to local gene pools in vulnerable heathlands. We used nuclear single nucleotide polymorphic (SNP) and simple sequence repeat (SSR) markers together with plastid SSR and indel markers to investigate the level and direction of gene flow between invasive and native heathland C. scoparius. Analyses of population structures confirmed the presence of two gene pools: one native and the other invasive. The nuclear genome of the native types was highly introgressed with the invasive genome, and we observed advanced-generation hybrids, suggesting that hybridization has been occurring for several generations. There is asymmetrical gene flow from the invasive to the native gene pool, which can be attributed to higher fecundity in the invasive individuals, measured by the number of flowers and seed pods. Strong spatial genetic structure in plastid markers and weaker structure in nuclear markers suggest that seeds spread over relatively short distances and that gene flow over longer distances is mainly facilitated by pollen dispersal. We further show that the growth habits of heathland plants become more vigorous with increased introgression from the invaders. Implications of the findings are discussed in relation to future management of invading C. scoparius. © 2016 John Wiley & Sons Ltd.

  13. Alteration of hairpin ribozyme specificity utilizing PCR.

    PubMed

    DeGrandis, P; Hampel, A; Galasinski, S; Borneman, J; Siwkowski, A; Altschuler, M

    1994-12-01

    We have developed a method by which a researcher can quickly alter the specificity of a trans hairpin ribozyme. Utilizing this PCR method, two oligonucleotides, and any target vector, new ribozyme template sequences can be generated without the synthesis of longer oligonucleotides. We have produced templates with altered specificity for both standard and modified (larger) ribozymes. After transcription, these ribozymes show specific cleavage activity with the new substrate beta-glucuronidase (GUS), and no activity against the original substrate (HIV-1, 5' leader sequence). Utilizing this technique, it is also possible to produce an inactive ribozyme that can be used as an antisense control. Applications of this procedure would provide a rapid and economical system for the assessment of trans ribozyme activity.

  14. Preparation of Small RNAs Using Rolling Circle Transcription and Site-Specific RNA Disconnection.

    PubMed

    Wang, Xingyu; Li, Can; Gao, Xiaomeng; Wang, Jing; Liang, Xingguo

    2015-01-13

    A facile and robust RNA preparation protocol was developed by combining rolling circle transcription (RCT) with RNA cleavage by RNase H. Circular DNA with a complementary sequence was used as the template for promoter-free transcription. With the aid of a 2'-O-methylated DNA, the RCT-generated tandem repeats of the desired RNA sequence were disconnected at the exact end-to-end position to harvest the desired RNA oligomers. Compared with the template DNA, more than 4 × 10(3) times the amount of small RNA products were obtained when modest cleavage was carried out during transcription. Large amounts of RNA oligomers could easily be obtained by simply increasing the reaction volume.

  15. The protein structure prediction problem could be solved using the current PDB library

    PubMed Central

    Zhang, Yang; Skolnick, Jeffrey

    2005-01-01

    For single-domain proteins, we examine the completeness of the structures in the current Protein Data Bank (PDB) library for use in full-length model construction of unknown sequences. To address this issue, we employ a comprehensive benchmark set of 1,489 medium-size proteins that cover the PDB at the level of 35% sequence identity and identify templates by structure alignment. With homologous proteins excluded, we can always find similar folds to native with an average rms deviation (RMSD) from native of 2.5 Å with ≈82% alignment coverage. These template structures often contain a significant number of insertions/deletions. The tasser algorithm was applied to build full-length models, where continuous fragments are excised from the top-scoring templates and reassembled under the guide of an optimized force field, which includes consensus restraints taken from the templates and knowledge-based statistical potentials. For almost all targets (except for 2/1,489), the resultant full-length models have an RMSD to native below 6 Å (97% of them below 4 Å). On average, the RMSD of full-length models is 2.25 Å, with aligned regions improved from 2.5 Å to 1.88 Å, comparable with the accuracy of low-resolution experimental structures. Furthermore, starting from state-of-the-art structural alignments, we demonstrate a methodology that can consistently bring template-based alignments closer to native. These results are highly suggestive that the protein-folding problem can in principle be solved based on the current PDB library by developing efficient fold recognition algorithms that can recover such initial alignments. PMID:15653774

  16. CRISPR/Cas9 cleavages in budding yeast reveal templated insertions and strand-specific insertion/deletion profiles.

    PubMed

    Lemos, Brenda R; Kaplan, Adam C; Bae, Ji Eun; Ferrazzoli, Alexander E; Kuo, James; Anand, Ranjith P; Waterman, David P; Haber, James E

    2018-02-27

    Harnessing CRISPR-Cas9 technology provides an unprecedented ability to modify genomic loci via DNA double-strand break (DSB) induction and repair. We analyzed nonhomologous end-joining (NHEJ) repair induced by Cas9 in budding yeast and found that the orientation of binding of Cas9 and its guide RNA (gRNA) profoundly influences the pattern of insertion/deletions (indels) at the site of cleavage. A common indel created by Cas9 is a 1-bp (+1) insertion that appears to result from Cas9 creating a 1-nt 5' overhang that is filled in by a DNA polymerase and ligated. The origin of +1 insertions was investigated by using two gRNAs with PAM sequences located on opposite DNA strands but designed to cleave the same sequence. These templated +1 insertions are dependent on the X-family DNA polymerase, Pol4. Deleting Pol4 also eliminated +2 and +3 insertions, which are biased toward homonucleotide insertions. Using inverted PAM sequences, we also found significant differences in overall NHEJ efficiency and repair profiles, suggesting that the binding of the Cas9:gRNA complex influences subsequent NHEJ processing. As with events induced by the site-specific HO endonuclease, CRISPR-Cas9-mediated NHEJ repair depends on the Ku heterodimer and DNA ligase 4. Cas9 events are highly dependent on the Mre11-Rad50-Xrs2 complex, independent of Mre11's nuclease activity. Inspection of the outcomes of a large number of Cas9 cleavage events in mammalian cells reveals a similar templated origin of +1 insertions in human cells, but also a significant frequency of similarly templated +2 insertions.

  17. Experiments on Pool-riffle Sequences with Multi-fractional Sediment Bed During Floods

    NASA Astrophysics Data System (ADS)

    Rodriguez, J. F.; Vahidi, E.; Bayat, E.; de Almeida, G. A. M.; Saco, P. M.

    2017-12-01

    The morphodynamics of pools and riffles has been the subject of research for over a century and has more recently attracted intense attention for their central role in providing habitat diversity conditions, both in terms of flow and substrate. Initial efforts to explain the long-term stability of the pool-riffle (PR) sequences (often referred to as self-maintenance) focused almost exclusively on cross sectional flow characteristics (either average or near bed velocity or shear stress), using episodic shifts in higher shear stress or velocities from riffles to pools during floods (i.e. reversal conditions) as an indication of the long-term self-maintenance of the structures.. However, less attention has been paid to the interactions of flow unsteadiness, sediment supply and sedimentological contrasts as the drivers for maintaining PR sequences. Here we investigate these effects through laboratory experiments on a scaled-down PR sequence of an existing gravel bed river. Froude similitude and equality of Shields' number were applied to scale one- to four-year recurrence flood events and sediment size distributions, respectively. We conducted experiments with different hydrographs and different sedimentological conditions. In each experiment we continuously measured velocities and shear stresses (using acoustic velocity profilers) bed levels (using a bed profiler) and bed grain size distribution (using an automatic digital technique on the painted bed sediments) during the hydrographs. Our results show that the most important factors for self-maintenance were the sediment bed composition, the level of infilling of the pool and the sediment supply grainsize distribution. These results highlight the need to consider the time varying sedimentological characteristics of a PR sequence to assess its capacity for self-maintenance.

  18. Loss of DHR sequences at Browns Ferry Unit One - accident-sequence analysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Cook, D.H.; Grene, S.R.; Harrington, R.M.

    1983-05-01

    This study describes the predicted response of Unit One at the Browns Ferry Nuclear Plant to a postulated loss of decay heat removal (DHR) capability following scram from full power with the power conversion system unavailable. In accident sequences without DHR capability, the residual heat removal (RHR) system functions of pressure suppression pool cooling and reactor vessel shutdown cooling are unavailable. Consequently, all decay heat energy is stored in the pressure suppression pool with a concomitant increase in pool temperature and primary containment pressure. With the assumption that DHR capability is not regained during the lengthy course of this accidentmore » sequence, the containment ultimately fails by overpressurization. Although unlikely, this catastrophic failure might lead to loss of the ability to inject cooling water into the reactor vessel, causing subsequent core uncovery and meltdown. The timing of these events and the effective mitigating actions that might be taken by the operator are discussed in this report.« less

  19. Isolation of candidate genes of Friedreich`s ataxia on chromosome 9q13

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Montermini, L.; Zara, F.; Pandolfo, M.

    1994-09-01

    Friedreich`s ataxia (FRDA) is an autosomal recessive degenerative disease involving the central and peripheral nervous system and the heart. The mutated gene in FRDA has recently been localized within a 450 Kb interval on chromosome 9q13 between the markers D9S202/FR1/FR8. We have been able to confirm such localization for the disease gene by analysis of extended haplotype in consanguineous families. Cases of loss of marker homozygosity, which are likely to be due to ancient recombinations, have been found to involve D9S110, D9S15, and D9S111 on the telomeric side, and FR5 on the centromeric side, while homozygosity was always found formore » a core haplotype including D9S5, FD1, and D9S202. We constructed a YAC contig spanning the region between the telomeric markers and FR5, and cosmids have been obtained from the YACs. In order to isolate transcribed sequences from the FRDA candidate region we are utilizing a combination of approaches, including hybridization of YACs and cosmids to an arrayed human heart cDNA library, cDNA direct selection, and exon amplification. A transcribed sequence near the telomeric end of the region has been isolated by cDNA direct selection using pooled cosmids as genomic template and primary human heart, muscle, brain, liver and placenta cDNAs as cDNA source. We have shown this sequence to be the human equivalent of ZO-2, a tight junction protein previously described in the dog. No mutations of this gene have been found in FRDA subjects. Additional cDNA have recently been isolated and they are currently being evaluated.« less

  20. Sequence-independent construction of ordered combinatorial libraries with predefined crossover points.

    PubMed

    Jézéquel, Laetitia; Loeper, Jacqueline; Pompon, Denis

    2008-11-01

    Combinatorial libraries coding for mosaic enzymes with predefined crossover points constitute useful tools to address and model structure-function relationships and for functional optimization of enzymes based on multivariate statistics. The presented method, called sequence-independent generation of a chimera-ordered library (SIGNAL), allows easy shuffling of any predefined amino acid segment between two or more proteins. This method is particularly well adapted to the exchange of protein structural modules. The procedure could also be well suited to generate ordered combinatorial libraries independent of sequence similarities in a robotized manner. Sequence segments to be recombined are first extracted by PCR from a single-stranded template coding for an enzyme of interest using a biotin-avidin-based method. This technique allows the reduction of parental template contamination in the final library. Specific PCR primers allow amplification of two complementary mosaic DNA fragments, overlapping in the region to be exchanged. Fragments are finally reassembled using a fusion PCR. The process is illustrated via the construction of a set of mosaic CYP2B enzymes using this highly modular approach.

  1. Assessment of primer/template mismatch effects on real-time PCR amplification of target taxa for GMO quantification.

    PubMed

    Ghedira, Rim; Papazova, Nina; Vuylsteke, Marnik; Ruttink, Tom; Taverniers, Isabel; De Loose, Marc

    2009-10-28

    GMO quantification, based on real-time PCR, relies on the amplification of an event-specific transgene assay and a species-specific reference assay. The uniformity of the nucleotide sequences targeted by both assays across various transgenic varieties is an important prerequisite for correct quantification. Single nucleotide polymorphisms (SNPs) frequently occur in the maize genome and might lead to nucleotide variation in regions used to design primers and probes for reference assays. Further, they may affect the annealing of the primer to the template and reduce the efficiency of DNA amplification. We assessed the effect of a minor DNA template modification, such as a single base pair mismatch in the primer attachment site, on real-time PCR quantification. A model system was used based on the introduction of artificial mismatches between the forward primer and the DNA template in the reference assay targeting the maize starch synthase (SSIIb) gene. The results show that the presence of a mismatch between the primer and the DNA template causes partial to complete failure of the amplification of the initial DNA template depending on the type and location of the nucleotide mismatch. With this study, we show that the presence of a primer/template mismatch affects the estimated total DNA quantity to a varying degree.

  2. PCR and magnetic bead-mediated target capture for the isolation of short interspersed nucleotide elements in fishes.

    PubMed

    Liu, Dong; Zhu, Guoli; Tang, Wenqiao; Yang, Jinquan; Guo, Hongyi

    2012-01-01

    Short interspersed nucleotide elements (SINEs), a type of retrotransposon, are widely distributed in various genomes with multiple copies arranged in different orientations, and cause changes to genes and genomes during evolutionary history. This can provide the basis for determining genome diversity, genetic variation and molecular phylogeny, etc. SINE DNA is transcribed into RNA by polymerase III from an internal promoter, which is composed of two conserved boxes, box A and box B. Here we present an approach to isolate novel SINEs based on these promoter elements. Box A of a SINE is obtained via PCR with only one primer identical to box B (B-PCR). Box B and its downstream sequence are acquired by PCR with one primer corresponding to box A (A-PCR). The SINE clone produced by A-PCR is selected as a template to label a probe with biotin. The full-length SINEs are isolated from the genomic pool through complex capture using the biotinylated probe bound to magnetic particles. Using this approach, a novel SINE family, Cn-SINE, from the genomes of Coilia nasus, was isolated. The members are 180-360 bp long. Sequence homology suggests that Cn-SINEs evolved from a leucine tRNA gene. This is the first report of a tRNA(Leu)-related SINE obtained without the use of a genomic library or inverse PCR. These results provide new insights into the origin of SINEs.

  3. PCR and Magnetic Bead-Mediated Target Capture for the Isolation of Short Interspersed Nucleotide Elements in Fishes

    PubMed Central

    Liu, Dong; Zhu, Guoli; Tang, Wenqiao; Yang, Jinquan; Guo, Hongyi

    2012-01-01

    Short interspersed nucleotide elements (SINEs), a type of retrotransposon, are widely distributed in various genomes with multiple copies arranged in different orientations, and cause changes to genes and genomes during evolutionary history. This can provide the basis for determining genome diversity, genetic variation and molecular phylogeny, etc. SINE DNA is transcribed into RNA by polymerase III from an internal promoter, which is composed of two conserved boxes, box A and box B. Here we present an approach to isolate novel SINEs based on these promoter elements. Box A of a SINE is obtained via PCR with only one primer identical to box B (B-PCR). Box B and its downstream sequence are acquired by PCR with one primer corresponding to box A (A-PCR). The SINE clone produced by A-PCR is selected as a template to label a probe with biotin. The full-length SINEs are isolated from the genomic pool through complex capture using the biotinylated probe bound to magnetic particles. Using this approach, a novel SINE family, Cn-SINE, from the genomes of Coilia nasus, was isolated. The members are 180–360 bp long. Sequence homology suggests that Cn-SINEs evolved from a leucine tRNA gene. This is the first report of a tRNALeu-related SINE obtained without the use of a genomic library or inverse PCR. These results provide new insights into the origin of SINEs. PMID:22408437

  4. Single nucleotide resolution RNA-seq uncovers new regulatory mechanisms in the opportunistic pathogen Streptococcus agalactiae.

    PubMed

    Rosinski-Chupin, Isabelle; Sauvage, Elisabeth; Sismeiro, Odile; Villain, Adrien; Da Cunha, Violette; Caliot, Marie-Elise; Dillies, Marie-Agnès; Trieu-Cuot, Patrick; Bouloc, Philippe; Lartigue, Marie-Frédérique; Glaser, Philippe

    2015-05-30

    Streptococcus agalactiae, or Group B Streptococcus, is a leading cause of neonatal infections and an increasing cause of infections in adults with underlying diseases. In an effort to reconstruct the transcriptional networks involved in S. agalactiae physiology and pathogenesis, we performed an extensive and robust characterization of its transcriptome through a combination of differential RNA-sequencing in eight different growth conditions or genetic backgrounds and strand-specific RNA-sequencing. Our study identified 1,210 transcription start sites (TSSs) and 655 transcript ends as well as 39 riboswitches and cis-regulatory regions, 39 cis-antisense non-coding RNAs and 47 small RNAs potentially acting in trans. Among these putative regulatory RNAs, ten were differentially expressed in response to an acid stress and two riboswitches sensed directly or indirectly the pH modification. Strikingly, 15% of the TSSs identified were associated with the incorporation of pseudo-templated nucleotides, showing that reiterative transcription is a pervasive process in S. agalactiae. In particular, 40% of the TSSs upstream genes involved in nucleotide metabolism show reiterative transcription potentially regulating gene expression, as exemplified for pyrG and thyA encoding the CTP synthase and the thymidylate synthase respectively. This comprehensive map of the transcriptome at the single nucleotide resolution led to the discovery of new regulatory mechanisms in S. agalactiae. It also provides the basis for in depth analyses of transcriptional networks in S. agalactiae and of the regulatory role of reiterative transcription following variations of intra-cellular nucleotide pools.

  5. Whole exome sequencing in recurrent early pregnancy loss.

    PubMed

    Qiao, Ying; Wen, Jiadi; Tang, Flamingo; Martell, Sally; Shomer, Naomi; Leung, Peter C K; Stephenson, Mary D; Rajcan-Separovic, Evica

    2016-05-01

    Exome sequencing can identify genetic causes of idiopathic recurrent pregnancy loss (RPL). We identified compound heterozygous deleterious mutations affecting DYNC2H1 and ALOX15 in two out of four families with RPL. Both genes have a role in early development. Bioinformatics analysis of all genes with rare and putatively pathogenic mutations in miscarriages and couples showed enrichment in pathways relevant to pregnancy loss, including the complement and coagulation cascades pathways. Next generation sequencing (NGS) is increasingly being used to identify known and novel gene mutations in children with developmental delay and in fetuses with ultrasound-detected anomalies. In contrast, NGS is rarely used to study pregnancy loss. Chromosome microarray analysis detects putatively causative DNA copy number variants (CNVs) in ∼2% of miscarriages and CNVs of unknown significance (predominantly parental in origin) in up to 40% of miscarriages. Therefore, a large number of miscarriages still have an unknown cause. Whole exome sequencing (WES) was performed using Illumina HiSeq 2000 platform on seven euploid miscarriages from four families with RPL. Golden Helix SVS v8.1.5 was used for data assessment and inheritance analysis for deleterious DNA variants predicted to severely disrupt protein-coding genes by introducing a frameshift, loss of the stop codon, gain of the stop codon, changes in splicing or the initial codon. Webgestalt (http://bioinfo.vanderbilt.edu/webgestalt/) was used for pathway and disease association enrichment analysis of a gene pool containing putatively pathogenic variants in miscarriages and couples in comparison to control gene pools. Compound heterozygous mutations in DYNC2H1 and ALOX15 were identified in miscarriages from two families with RPL. DYNC2H1 is involved in cilia biogenesis and has been associated with fetal lethality in humans. ALOX15 is expressed in placenta and its dysregulation has been associated with inflammation, placental, dysfunction, abnormal oxidative stress response and angiogenesis. The pool of putatively pathogenic single nucleotide variants (SNVs) and small insertions and deletions (indels) detected in the miscarriages showed enrichment in 'complement and coagulation cascades pathway', and 'ciliary motility disorders'. We conclude that CNVs, individual SNVs and pool of deleterious gene mutations identified by exome sequencing could contribute to RPL. The size of our sample cohort is small. The functional effect of candidate mutations should be evaluated to determine whether the mutations are causative. This is the first study to assess whether SNVs may contribute to the pathogenesis of miscarriage. Furthermore, our findings suggest that collective effect of mutations in relevant biological pathways could be implicated in RPL. The study was funded by Canadian Institutes of Health Research (grant MOP 106467) and Michael Smith Foundation of Health Research Career Scholar salary award to ERS. © The Author 2016. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  6. Integrated design, execution, and analysis of arrayed and pooled CRISPR genome-editing experiments.

    PubMed

    Canver, Matthew C; Haeussler, Maximilian; Bauer, Daniel E; Orkin, Stuart H; Sanjana, Neville E; Shalem, Ophir; Yuan, Guo-Cheng; Zhang, Feng; Concordet, Jean-Paul; Pinello, Luca

    2018-05-01

    CRISPR (clustered regularly interspaced short palindromic repeats) genome-editing experiments offer enormous potential for the evaluation of genomic loci using arrayed single guide RNAs (sgRNAs) or pooled sgRNA libraries. Numerous computational tools are available to help design sgRNAs with optimal on-target efficiency and minimal off-target potential. In addition, computational tools have been developed to analyze deep-sequencing data resulting from genome-editing experiments. However, these tools are typically developed in isolation and oftentimes are not readily translatable into laboratory-based experiments. Here, we present a protocol that describes in detail both the computational and benchtop implementation of an arrayed and/or pooled CRISPR genome-editing experiment. This protocol provides instructions for sgRNA design with CRISPOR (computational tool for the design, evaluation, and cloning of sgRNA sequences), experimental implementation, and analysis of the resulting high-throughput sequencing data with CRISPResso (computational tool for analysis of genome-editing outcomes from deep-sequencing data). This protocol allows for design and execution of arrayed and pooled CRISPR experiments in 4-5 weeks by non-experts, as well as computational data analysis that can be performed in 1-2 d by both computational and noncomputational biologists alike using web-based and/or command-line versions.

  7. GalaxyGPCRloop: Template-Based and Ab Initio Structure Sampling of the Extracellular Loops of G-Protein-Coupled Receptors.

    PubMed

    Won, Jonghun; Lee, Gyu Rie; Park, Hahnbeom; Seok, Chaok

    2018-06-07

    The second extracellular loops (ECL2s) of G-protein-coupled receptors (GPCRs) are often involved in GPCR functions, and their structures have important implications in drug discovery. However, structure prediction of ECL2 is difficult because of its long length and the structural diversity among different GPCRs. In this study, a new ECL2 conformational sampling method involving both template-based and ab initio sampling was developed. Inspired by the observation of similar ECL2 structures of closely related GPCRs, a template-based sampling method employing loop structure templates selected from the structure database was developed. A new metric for evaluating similarity of the target loop to templates was introduced for template selection. An ab initio loop sampling method was also developed to treat cases without highly similar templates. The ab initio method is based on the previously developed fragment assembly and loop closure method. A new sampling component that takes advantage of secondary structure prediction was added. In addition, a conserved disulfide bridge restraining ECL2 conformation was predicted and analytically incorporated into sampling, reducing the effective dimension of the conformational search space. The sampling method was combined with an existing energy function for comparison with previously reported loop structure prediction methods, and the benchmark test demonstrated outstanding performance.

  8. Pooling across cells to normalize single-cell RNA sequencing data with many zero counts.

    PubMed

    Lun, Aaron T L; Bach, Karsten; Marioni, John C

    2016-04-27

    Normalization of single-cell RNA sequencing data is necessary to eliminate cell-specific biases prior to downstream analyses. However, this is not straightforward for noisy single-cell data where many counts are zero. We present a novel approach where expression values are summed across pools of cells, and the summed values are used for normalization. Pool-based size factors are then deconvolved to yield cell-based factors. Our deconvolution approach outperforms existing methods for accurate normalization of cell-specific biases in simulated data. Similar behavior is observed in real data, where deconvolution improves the relevance of results of downstream analyses.

  9. Unimodular sequence design under frequency hopping communication compatibility requirements

    NASA Astrophysics Data System (ADS)

    Ge, Peng; Cui, Guolong; Kong, Lingjiang; Yang, Jianyu

    2016-12-01

    The integrated design for both radar and anonymous communication has drawn more attention recently since wireless communication system appeals to enhance security and reliability. Given the frequency hopping (FH) communication system, an effective way to realize integrated design is to meet the spectrum compatibility between these two systems. The paper deals with a unimodular sequence design technique which considers optimizing both the spectrum compatibility and peak sidelobes levels (PSL) of auto-correlation function (ACF). The spectrum compatibility requirement realizes anonymous communication for the FH system and provides this system lower probability of intercept (LPI) since the spectrum of the FH system is hidden in that of the radar system. The proposed algorithm, named generalized fitting template (GFT) technique, converts the sequence optimization design problem to a iterative fitting process. In this process, the power spectrum density (PSD) and PSL behaviors of the generated sequences fit both PSD and PSL templates progressively. Two templates are established based on the spectrum compatibility requirement and the expected PSL. As noted, in order to ensure the communication security and reliability, spectrum compatibility requirement is given a higher priority to achieve in the GFT algorithm. This algorithm realizes this point by adjusting the weight adaptively between these two terms during the iteration process. The simulation results are analyzed in terms of bit error rate (BER), PSD, PSL, and signal-interference rate (SIR) for both the radar and FH systems. The performance of GFT is compared with SCAN, CAN, FRE, CYC, and MAT algorithms in the above aspects, which shows its good effectiveness.

  10. Shifted Transversal Design smart-pooling for high coverage interactome mapping

    PubMed Central

    Xin, Xiaofeng; Rual, Jean-François; Hirozane-Kishikawa, Tomoko; Hill, David E.; Vidal, Marc; Boone, Charles; Thierry-Mieg, Nicolas

    2009-01-01

    “Smart-pooling,” in which test reagents are multiplexed in a highly redundant manner, is a promising strategy for achieving high efficiency, sensitivity, and specificity in systems-level projects. However, previous applications relied on low redundancy designs that do not leverage the full potential of smart-pooling, and more powerful theoretical constructions, such as the Shifted Transversal Design (STD), lack experimental validation. Here we evaluate STD smart-pooling in yeast two-hybrid (Y2H) interactome mapping. We employed two STD designs and two established methods to perform ORFeome-wide Y2H screens with 12 baits. We found that STD pooling achieves similar levels of sensitivity and specificity as one-on-one array-based Y2H, while the costs and workloads are divided by three. The screening-sequencing approach is the most cost- and labor-efficient, yet STD identifies about twofold more interactions. Screening-sequencing remains an appropriate method for quickly producing low-coverage interactomes, while STD pooling appears as the method of choice for obtaining maps with higher coverage. PMID:19447967

  11. New insights into the promoterless transcription of DNA coligo templates by RNA polymerase III

    PubMed Central

    Lama, Lodoe; Seidl, Christine I; Ryan, Kevin

    2014-01-01

    Chemically synthesized DNA can carry small RNA sequence information but converting that information into small RNA is generally thought to require large double-stranded promoters in the context of plasmids, viruses and genes. We previously found evidence that circularized oligodeoxynucleotides (coligos) containing certain sequences and secondary structures can template the synthesis of small RNA by RNA polymerase III in vitro and in human cells. By using immunoprecipitated RNA polymerase III we now report corroborating evidence that this enzyme is the sole polymerase responsible for coligo transcription. The immobilized polymerase enabled experiments showing that coligo transcripts can be formed through transcription termination without subsequent 3′ end trimming. To better define the determinants of productive transcription, a structure-activity relationship study was performed using over 20 new coligos. The results show that unpaired nucleotides in the coligo stem facilitate circumtranscription, but also that internal loops and bulges should be kept small to avoid secondary transcription initiation sites. A polymerase termination sequence embedded in the double-stranded region of a hairpin-encoding coligo stem can antagonize transcription. Using lessons learned from new and old coligos, we demonstrate how to convert poorly transcribed coligos into productive templates. Our findings support the possibility that coligos may prove useful as chemically synthesized vectors for the ectopic expression of small RNA in human cells. PMID:25764216

  12. A site-directed mutagenesis method particularly useful for creating otherwise difficult-to-make mutants and alanine scanning.

    PubMed

    Wan, Haisu; Li, Yongwen; Fan, Yu; Meng, Fanrong; Chen, Chen; Zhou, Qinghua

    2012-01-15

    Site-directed mutagenesis has become routine in molecular biology. However, many mutants can still be very difficult to create. Complicated chimerical mutations, tandem repeats, inverted sequences, GC-rich regions, and/or heavy secondary structures can cause inefficient or incorrect binding of the mutagenic primer to the target sequence and affect the subsequent amplification. In theory, these problems can be avoided by introducing the mutations into the target sequence using mutagenic fragments and so removing the need for primer-template annealing. The cassette mutagenesis uses the mutagenic fragment in its protocol; however, in most cases it needs to perform two rounds of mutagenic primer-based mutagenesis to introduce suitable restriction enzyme sites into templates and is not suitable for routine mutagenesis. Here we describe a highly efficient method in which the template except the region to be mutated is amplified by polymerase chain reaction (PCR) and the type IIs restriction enzyme-digested PCR product is directly ligated with the mutagenic fragment. Our method requires no assistance of mutagenic primers. We have used this method to create various types of difficult-to-make mutants with mutagenic frequencies of nearly 100%. Our protocol has many advantages over the prevalent QuikChange method and is a valuable tool for studies on gene structure and function. Copyright © 2011 Elsevier Inc. All rights reserved.

  13. ModeRNA: a tool for comparative modeling of RNA 3D structure

    PubMed Central

    Rother, Magdalena; Rother, Kristian; Puton, Tomasz; Bujnicki, Janusz M.

    2011-01-01

    RNA is a large group of functionally important biomacromolecules. In striking analogy to proteins, the function of RNA depends on its structure and dynamics, which in turn is encoded in the linear sequence. However, while there are numerous methods for computational prediction of protein three-dimensional (3D) structure from sequence, with comparative modeling being the most reliable approach, there are very few such methods for RNA. Here, we present ModeRNA, a software tool for comparative modeling of RNA 3D structures. As an input, ModeRNA requires a 3D structure of a template RNA molecule, and a sequence alignment between the target to be modeled and the template. It must be emphasized that a good alignment is required for successful modeling, and for large and complex RNA molecules the development of a good alignment usually requires manual adjustments of the input data based on previous expertise of the respective RNA family. ModeRNA can model post-transcriptional modifications, a functionally important feature analogous to post-translational modifications in proteins. ModeRNA can also model DNA structures or use them as templates. It is equipped with many functions for merging fragments of different nucleic acid structures into a single model and analyzing their geometry. Windows and UNIX implementations of ModeRNA with comprehensive documentation and a tutorial are freely available. PMID:21300639

  14. Self-defining memories, scripts, and the life story: narrative identity in personality and psychotherapy.

    PubMed

    Singer, Jefferson A; Blagov, Pavel; Berry, Meredith; Oost, Kathryn M

    2013-12-01

    An integrative model of narrative identity builds on a dual memory system that draws on episodic memory and a long-term self to generate autobiographical memories. Autobiographical memories related to critical goals in a lifetime period lead to life-story memories, which in turn become self-defining memories when linked to an individual's enduring concerns. Self-defining memories that share repetitive emotion-outcome sequences yield narrative scripts, abstracted templates that filter cognitive-affective processing. The life story is the individual's overarching narrative that provides unity and purpose over the life course. Healthy narrative identity combines memory specificity with adaptive meaning-making to achieve insight and well-being, as demonstrated through a literature review of personality and clinical research, as well as new findings from our own research program. A clinical case study drawing on this narrative identity model is also presented with implications for treatment and research. © 2012 Wiley Periodicals, Inc.

  15. RNA-primed complementary-sense DNA synthesis of the geminivirus African cassava mosaic virus.

    PubMed Central

    Saunders, K; Lucy, A; Stanley, J

    1992-01-01

    The plant DNA virus African cassava mosaic virus (ACMV) is believed to replicate by a rolling circle mechanism. To investigate complementary-sense DNA (lagging strand) synthesis, we have analysed the heterogenous form of complementary-sense DNA (H3 DNA) from infected Nicotiana benthamiana by two-dimensional agarose gel electrophoresis and blot hybridisation. The presence of an RNA moeity is demonstrated by comparison of results for nucleic acids resolved on neutral/alkaline and neutral/formamide gels, suggesting that complementary-sense DNA synthesis on the virus-sense single-stranded DNA template is preceded by the synthesis of an RNA primer. Hybridisation with probes to specific parts of ACMV DNA A genome indicates that synthesis of the putative RNA primer initiates between nucleotides 2581-221, a region that includes intergenic sequences that have been implicated in geminivirus DNA replication and the control of gene expression. Images PMID:1475192

  16. Mode-dependent templates and scan order for H.264/AVC-based intra lossless coding.

    PubMed

    Gu, Zhouye; Lin, Weisi; Lee, Bu-Sung; Lau, Chiew Tong; Sun, Ming-Ting

    2012-09-01

    In H.264/advanced video coding (AVC), lossless coding and lossy coding share the same entropy coding module. However, the entropy coders in the H.264/AVC standard were original designed for lossy video coding and do not yield adequate performance for lossless video coding. In this paper, we analyze the problem with the current lossless coding scheme and propose a mode-dependent template (MD-template) based method for intra lossless coding. By exploring the statistical redundancy of the prediction residual in the H.264/AVC intra prediction modes, more zero coefficients are generated. By designing a new scan order for each MD-template, the scanned coefficients sequence fits the H.264/AVC entropy coders better. A fast implementation algorithm is also designed. With little computation increase, experimental results confirm that the proposed fast algorithm achieves about 7.2% bit saving compared with the current H.264/AVC fidelity range extensions high profile.

  17. Nucleic acid and nucleotide-mediated synthesis of inorganic nanoparticles

    NASA Astrophysics Data System (ADS)

    Berti, Lorenzo; Burley, Glenn A.

    2008-02-01

    Since the advent of practical methods for achieving DNA metallization, the use of nucleic acids as templates for the synthesis of inorganic nanoparticles (NPs) has become an active area of study. It is now widely recognized that nucleic acids have the ability to control the growth and morphology of inorganic NPs. These biopolymers are particularly appealing as templating agents as their ease of synthesis in conjunction with the possibility of screening nucleotide composition, sequence and length, provides the means to modulate the physico-chemical properties of the resulting NPs. Several synthetic procedures leading to NPs with interesting photophysical properties as well as studies aimed at rationalizing the mechanism of nucleic acid-templated NP synthesis are now being reported. This progress article will outline the current understanding of the nucleic acid-templated process and provides an up to date reference in this nascent field.

  18. Generation of sequence signatures from DNA amplification fingerprints with mini-hairpin and microsatellite primers.

    PubMed

    Caetano-Anollés, G; Gresshoff, P M

    1996-06-01

    DNA amplification fingerprinting (DAF) with mini-hairpins harboring arbitrary "core" sequences at their 3' termini were used to fingerprint a variety of templates, including PCR products and whole genomes, to establish genetic relationships between plant tax at the interspecific and intraspecific level, and to identify closely related fungal isolates and plant accessions. No correlation was observed between the sequence of the arbitrary core, the stability of the mini-hairpin structure and DAF efficiency. Mini-hairpin primers with short arbitrary cores and primers complementary to simple sequence repeats present in microsatellites were also used to generate arbitrary signatures from amplification profiles (ASAP). The ASAP strategy is a dual-step amplification procedure that uses at least one primer in each fingerprinting stage. ASAP was able to reproducibly amplify DAF products (representing about 10-15 kb of sequence) following careful optimization of amplification parameters such as primer and template concentration. Avoidance of primer sequences partially complementary to DAF product termini was necessary in order to produce distinct fingerprints. This allowed the combinatorial use of oligomers in nucleic acid screening, with numerous ASAP fingerprinting reactions based on a limited number of primer sequences. Mini-hairpin primers and ASAP analysis significantly increased detection of polymorphic DNA, separating closely related bermudagrass (Cynodon) cultivars and detecting putatively linked markers in bulked segregant analysis of the soybean (Glycine max) supernodulation (nitrate-tolerant symbiosis) locus.

  19. Targeting the atypical chemokine receptor ACKR3/CXCR7 for the treatment of cancer and other diseases

    NASA Astrophysics Data System (ADS)

    Vestal, Richard D., Jr.

    One of the greatest challenges in fighting cancer is cell targeting and biomarker selection. The Atypical Chemokine Receptor ACKR3/CXCR7 is expressed on many cancer cell types, including breast cancer and glioblastoma, and binds the endogenous ligands SDF1/CXCL12 and ITAC/CXCL11. A 20 amino acid region of the ACKR3/CXCR7 N-terminus was synthesized and targeted with the NEB PhD-7 Phage Display Peptide Library. Twenty-nine phages were isolated and heptapeptide inserts sequenced; of these, 23 sequences were unique. A 3D molecular model was created for the ACKR3/CXCR7 N-terminus by mutating the corresponding region of the crystal structure of CXCR4 with bound SDF1/CXCL12. A ClustalW alignment was performed on each peptide sequence using the entire SDF1/CXCL12 sequence as the template. The 23-peptide sequences showed similarity to three distinct regions of the SDF1/CXCL12 molecule. A 3D molecular model was made for each of the phage peptide inserts to visually identify potential areas of steric interference of peptides that simulated CXCL12 regions not in contact with the receptor's N-terminus. An ELISA analysis of the relative binding affinity between the peptides identified 9 peptides with statistically significant results. The candidate pool of 9 peptides was further reduced to 3 peptides based on their affinity for the targeted N-terminus region peptide versus no target peptide present or a scrambled negative control peptide. The results clearly show the Phage Display protocol can be used to target a synthesized region of the ACKR3/CXCR7 N-terminus. The 3 peptides chosen, P20, P3, and P9, showed no effect on the viability or proliferation upon exposure to MCF-7 and U87-MG cells. Membrane binding, colocalization, and cellular uptake were confirmed by whole-cell ELISA and confocal microscopy. The recovered peptides did not activate the receptor as confirmed by a Beta-Arrestin recruitment assay. The data shows that the peptide sequences recovered from the phage display protocol are viable candidates for targeting cancer cells and delivering material to them.

  20. Pooled Sequencing of 531 Genes in Inflammatory Bowel Disease Identifies an Associated Rare Variant in BTNL2 and Implicates Other Immune Related Genes

    PubMed Central

    Prescott, Natalie J.; Lehne, Benjamin; Stone, Kristina; Lee, James C.; Taylor, Kirstin; Knight, Jo; Papouli, Efterpi; Mirza, Muddassar M.; Simpson, Michael A.; Spain, Sarah L.; Lu, Grace; Fraternali, Franca; Bumpstead, Suzannah J.; Gray, Emma; Amar, Ariella; Bye, Hannah; Green, Peter; Chung-Faye, Guy; Hayee, Bu’Hussain; Pollok, Richard; Satsangi, Jack; Parkes, Miles; Barrett, Jeffrey C.; Mansfield, John C.; Sanderson, Jeremy; Lewis, Cathryn M.; Weale, Michael E.; Schlitt, Thomas; Mathew, Christopher G.

    2015-01-01

    The contribution of rare coding sequence variants to genetic susceptibility in complex disorders is an important but unresolved question. Most studies thus far have investigated a limited number of genes from regions which contain common disease associated variants. Here we investigate this in inflammatory bowel disease by sequencing the exons and proximal promoters of 531 genes selected from both genome-wide association studies and pathway analysis in pooled DNA panels from 474 cases of Crohn’s disease and 480 controls. 80 variants with evidence of association in the sequencing experiment or with potential functional significance were selected for follow up genotyping in 6,507 IBD cases and 3,064 population controls. The top 5 disease associated variants were genotyped in an extension panel of 3,662 IBD cases and 3,639 controls, and tested for association in a combined analysis of 10,147 IBD cases and 7,008 controls. A rare coding variant p.G454C in the BTNL2 gene within the major histocompatibility complex was significantly associated with increased risk for IBD (p = 9.65x10−10, OR = 2.3[95% CI = 1.75–3.04]), but was independent of the known common associated CD and UC variants at this locus. Rare (<1%) and low frequency (1–5%) variants in 3 additional genes showed suggestive association (p<0.005) with either an increased risk (ARIH2 c.338-6C>T) or decreased risk (IL12B p.V298F, and NICN p.H191R) of IBD. These results provide additional insights into the involvement of the inhibition of T cell activation in the development of both sub-phenotypes of inflammatory bowel disease. We suggest that although rare coding variants may make a modest overall contribution to complex disease susceptibility, they can inform our understanding of the molecular pathways that contribute to pathogenesis. PMID:25671699

  1. An Efficient Strategy for Broad-Range Detection of Low Abundance Bacteria without DNA Decontamination of PCR Reagents

    PubMed Central

    Chang, Shy-Shin; Hsu, Hsung-Ling; Cheng, Ju-Chien; Tseng, Ching-Ping

    2011-01-01

    Background Bacterial DNA contamination in PCR reagents has been a long standing problem that hampers the adoption of broad-range PCR in clinical and applied microbiology, particularly in detection of low abundance bacteria. Although several DNA decontamination protocols have been reported, they all suffer from compromised PCR efficiency or detection limits. To date, no satisfactory solution has been found. Methodology/Principal Findings We herein describe a method that solves this long standing problem by employing a broad-range primer extension-PCR (PE-PCR) strategy that obviates the need for DNA decontamination. In this method, we first devise a fusion probe having a 3′-end complementary to the template bacterial sequence and a 5′-end non-bacterial tag sequence. We then hybridize the probes to template DNA, carry out primer extension and remove the excess probes using an optimized enzyme mix of Klenow DNA polymerase and exonuclease I. This strategy allows the templates to be distinguished from the PCR reagent contaminants and selectively amplified by PCR. To prove the concept, we spiked the PCR reagents with Staphylococcus aureus genomic DNA and applied PE-PCR to amplify template bacterial DNA. The spiking DNA neither interfered with template DNA amplification nor caused false positive of the reaction. Broad-range PE-PCR amplification of the 16S rRNA gene was also validated and minute quantities of template DNA (10–100 fg) were detectable without false positives. When adapting to real-time and high-resolution melting (HRM) analytical platforms, the unique melting profiles for the PE-PCR product can be used as the molecular fingerprints to further identify individual bacterial species. Conclusions/Significance Broad-range PE-PCR is simple, efficient, and completely obviates the need to decontaminate PCR reagents. When coupling with real-time and HRM analyses, it offers a new avenue for bacterial species identification with a limited source of bacterial DNA, making it suitable for use in clinical and applied microbiology laboratories. PMID:21637859

  2. Enzymatic Synthesis of Self-assembled Dicer Substrate RNA Nanostructures for Programmable Gene Silencing.

    PubMed

    Jang, Bora; Kim, Boyoung; Kim, Hyunsook; Kwon, Hyokyoung; Kim, Minjeong; Seo, Yunmi; Colas, Marion; Jeong, Hansaem; Jeong, Eun Hye; Lee, Kyuri; Lee, Hyukjin

    2018-06-08

    Enzymatic synthesis of RNA nanostructures is achieved by isothermal rolling circle transcription (RCT). Each arm of RNA nanostructures provides a functional role of Dicer substrate RNA inducing sequence specific RNA interference (RNAi). Three different RNAi sequences (GFP, RFP, and BFP) are incorporated within the three-arm junction RNA nanostructures (Y-RNA). The template and helper DNA strands are designed for the large-scale in vitro synthesis of RNA strands to prepare self-assembled Y-RNA. Interestingly, Dicer processing of Y-RNA is highly influenced by its physical structure and different gene silencing activity is achieved depending on its arm length and overhang. In addition, enzymatic synthesis allows the preparation of various Y-RNA structures using a single DNA template offering on demand regulation of multiple target genes.

  3. Primer3_masker: integrating masking of template sequence with primer design software.

    PubMed

    Kõressaar, Triinu; Lepamets, Maarja; Kaplinski, Lauris; Raime, Kairi; Andreson, Reidar; Remm, Maido

    2018-06-01

    Designing PCR primers for amplifying regions of eukaryotic genomes is a complicated task because the genomes contain a large number of repeat sequences and other regions unsuitable for amplification by PCR. We have developed a novel k-mer based masking method that uses a statistical model to detect and mask failure-prone regions on the DNA template prior to primer design. We implemented the software as a standalone software primer3_masker and integrated it into the primer design program Primer3. The standalone version of primer3_masker is implemented in C. The source code is freely available at https://github.com/bioinfo-ut/primer3_masker/ (standalone version for Linux and macOS) and at https://github.com/primer3-org/primer3/ (integrated version). Primer3 web application that allows masking sequences of 196 animal and plant genomes is available at http://primer3.ut.ee/. maido.remm@ut.ee. Supplementary data are available at Bioinformatics online.

  4. A thiamin-utilizing ribozyme decarboxylates a pyruvate-like substrate

    NASA Astrophysics Data System (ADS)

    Cernak, Paul; Sen, Dipankar

    2013-11-01

    Vitamins are hypothesized to be relics of an RNA world, and were probably participants in RNA-mediated primordial metabolism. If catalytic RNAs, or ribozymes, could harness vitamin cofactors to aid their function in a manner similar to protein enzymes, it would enable them to catalyse a much larger set of chemical reactions. The cofactor thiamin diphosphate, a derivative of vitamin B1 (thiamin), is used by enzymes to catalyse difficult metabolic reactions, including decarboxylation of stable α-keto acids such as pyruvate. Here, we report a ribozyme that uses free thiamin to decarboxylate a pyruvate-based suicide substrate (LnkPB). Thiamin conjugated to biotin was used to isolate catalytic individuals from a pool of random-sequence RNAs attached to LnkPB. Analysis of a stable guanosine adduct obtained via digestion of an RNA sequence (clone dc4) showed the expected decarboxylation product. The discovery of a prototypic thiamin-utilizing ribozyme has implications for the role of RNA in orchestrating early metabolic cycles.

  5. Digital PCR methods improve detection sensitivity and measurement precision of low abundance mtDNA deletions.

    PubMed

    Belmonte, Frances R; Martin, James L; Frescura, Kristin; Damas, Joana; Pereira, Filipe; Tarnopolsky, Mark A; Kaufman, Brett A

    2016-04-28

    Mitochondrial DNA (mtDNA) mutations are a common cause of primary mitochondrial disorders, and have also been implicated in a broad collection of conditions, including aging, neurodegeneration, and cancer. Prevalent among these pathogenic variants are mtDNA deletions, which show a strong bias for the loss of sequence in the major arc between, but not including, the heavy and light strand origins of replication. Because individual mtDNA deletions can accumulate focally, occur with multiple mixed breakpoints, and in the presence of normal mtDNA sequences, methods that detect broad-spectrum mutations with enhanced sensitivity and limited costs have both research and clinical applications. In this study, we evaluated semi-quantitative and digital PCR-based methods of mtDNA deletion detection using double-stranded reference templates or biological samples. Our aim was to describe key experimental assay parameters that will enable the analysis of low levels or small differences in mtDNA deletion load during disease progression, with limited false-positive detection. We determined that the digital PCR method significantly improved mtDNA deletion detection sensitivity through absolute quantitation, improved precision and reduced assay standard error.

  6. Digital PCR methods improve detection sensitivity and measurement precision of low abundance mtDNA deletions

    PubMed Central

    Belmonte, Frances R.; Martin, James L.; Frescura, Kristin; Damas, Joana; Pereira, Filipe; Tarnopolsky, Mark A.; Kaufman, Brett A.

    2016-01-01

    Mitochondrial DNA (mtDNA) mutations are a common cause of primary mitochondrial disorders, and have also been implicated in a broad collection of conditions, including aging, neurodegeneration, and cancer. Prevalent among these pathogenic variants are mtDNA deletions, which show a strong bias for the loss of sequence in the major arc between, but not including, the heavy and light strand origins of replication. Because individual mtDNA deletions can accumulate focally, occur with multiple mixed breakpoints, and in the presence of normal mtDNA sequences, methods that detect broad-spectrum mutations with enhanced sensitivity and limited costs have both research and clinical applications. In this study, we evaluated semi-quantitative and digital PCR-based methods of mtDNA deletion detection using double-stranded reference templates or biological samples. Our aim was to describe key experimental assay parameters that will enable the analysis of low levels or small differences in mtDNA deletion load during disease progression, with limited false-positive detection. We determined that the digital PCR method significantly improved mtDNA deletion detection sensitivity through absolute quantitation, improved precision and reduced assay standard error. PMID:27122135

  7. indCAPS: A tool for designing screening primers for CRISPR/Cas9 mutagenesis events.

    PubMed

    Hodgens, Charles; Nimchuk, Zachary L; Kieber, Joseph J

    2017-01-01

    Genetic manipulation of organisms using CRISPR/Cas9 technology generally produces small insertions/deletions (indels) that can be difficult to detect. Here, we describe a technique to easily and rapidly identify such indels. Sequence-identified mutations that alter a restriction enzyme recognition site can be readily distinguished from wild-type alleles using a cleaved amplified polymorphic sequence (CAPS) technique. If a restriction site is created or altered by the mutation such that only one allele contains the restriction site, a polymerase chain reaction (PCR) followed by a restriction digest can be used to distinguish the two alleles. However, in the case of most CRISPR-induced alleles, no such restriction sites are present in the target sequences. In this case, a derived CAPS (dCAPS) approach can be used in which mismatches are purposefully introduced in the oligonucleotide primers to create a restriction site in one, but not both, of the amplified templates. Web-based tools exist to aid dCAPS primer design, but when supplied sequences that include indels, the current tools often fail to suggest appropriate primers. Here, we report the development of a Python-based, species-agnostic web tool, called indCAPS, suitable for the design of PCR primers used in dCAPS assays that is compatible with indels. This tool should have wide utility for screening editing events following CRISPR/Cas9 mutagenesis as well as for identifying specific editing events in a pool of CRISPR-mediated mutagenesis events. This tool was field-tested in a CRISPR mutagenesis experiment targeting a cytokinin receptor (AHK3) in Arabidopsis thaliana. The tool suggested primers that successfully distinguished between wild-type and edited alleles of a target locus and facilitated the isolation of two novel ahk3 null alleles. Users can access indCAPS and design PCR primers to employ dCAPS to identify CRISPR/Cas9 alleles at http://indcaps.kieber.cloudapps.unc.edu/.

  8. GalaxyTBM: template-based modeling by building a reliable core and refining unreliable local regions.

    PubMed

    Ko, Junsu; Park, Hahnbeom; Seok, Chaok

    2012-08-10

    Protein structures can be reliably predicted by template-based modeling (TBM) when experimental structures of homologous proteins are available. However, it is challenging to obtain structures more accurate than the single best templates by either combining information from multiple templates or by modeling regions that vary among templates or are not covered by any templates. We introduce GalaxyTBM, a new TBM method in which the more reliable core region is modeled first from multiple templates and less reliable, variable local regions, such as loops or termini, are then detected and re-modeled by an ab initio method. This TBM method is based on "Seok-server," which was tested in CASP9 and assessed to be amongst the top TBM servers. The accuracy of the initial core modeling is enhanced by focusing on more conserved regions in the multiple-template selection and multiple sequence alignment stages. Additional improvement is achieved by ab initio modeling of up to 3 unreliable local regions in the fixed framework of the core structure. Overall, GalaxyTBM reproduced the performance of Seok-server, with GalaxyTBM and Seok-server resulting in average GDT-TS of 68.1 and 68.4, respectively, when tested on 68 single-domain CASP9 TBM targets. For application to multi-domain proteins, GalaxyTBM must be combined with domain-splitting methods. Application of GalaxyTBM to CASP9 targets demonstrates that accurate protein structure prediction is possible by use of a multiple-template-based approach, and ab initio modeling of variable regions can further enhance the model quality.

  9. Relative Packing Groups in Template-Based Structure Prediction: Cooperative Effects of True Positive Constraints

    PubMed Central

    Day, Ryan; Qu, Xiaotao; Swanson, Rosemarie; Bohannan, Zach; Bliss, Robert

    2011-01-01

    Abstract Most current template-based structure prediction methods concentrate on finding the correct backbone conformation and then packing sidechains within that backbone. Our packing-based method derives distance constraints from conserved relative packing groups (RPGs). In our refinement approach, the RPGs provide a level of resolution that restrains global topology while allowing conformational sampling. In this study, we test our template-based structure prediction method using 51 prediction units from CASP7 experiments. RPG-based constraints are able to substantially improve approximately two-thirds of starting templates. Upon deeper investigation, we find that true positive spatial constraints, especially those non-local in sequence, derived from the RPGs were important to building nearer native models. Surprisingly, the fraction of incorrect or false positive constraints does not strongly influence the quality of the final candidate. This result indicates that our RPG-based true positive constraints sample the self-consistent, cooperative interactions of the native structure. The lack of such reinforcing cooperativity explains the weaker effect of false positive constraints. Generally, these findings are encouraging indications that RPGs will improve template-based structure prediction. PMID:21210729

  10. An Empirical Template Library of Stellar Spectra for a Wide Range of Spectral Classes, Luminosity Classes, and Metallicities Using SDSS BOSS Spectra

    NASA Astrophysics Data System (ADS)

    Kesseli, Aurora Y.; West, Andrew A.; Veyette, Mark; Harrison, Brandon; Feldman, Dan; Bochanski, John J.

    2017-06-01

    We present a library of empirical stellar spectra created using spectra from the Sloan Digital Sky Survey’s Baryon Oscillation Spectroscopic Survey. The templates cover spectral types O5 through L3, are binned by metallicity from -2.0 dex through +1.0 dex, and are separated into main-sequence (dwarf) stars and giant stars. With recently developed M dwarf metallicity indicators, we are able to extend the metallicity bins down through the spectral subtype M8, making this the first empirical library with this degree of temperature and metallicity coverage. The wavelength coverage for the templates is from 3650 to 10200 Å at a resolution of better than R ˜ 2000. Using the templates, we identify trends in color space with metallicity and surface gravity, which will be useful for analyzing large data sets from upcoming missions like the Large Synoptic Survey Telescope. Along with the templates, we are releasing a code for automatically (and/or visually) identifying the spectral type and metallicity of a star.

  11. An Empirical Template Library of Stellar Spectra for a Wide Range of Spectral Classes, Luminosity Classes, and Metallicities Using SDSS BOSS Spectra

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kesseli, Aurora Y.; West, Andrew A.; Veyette, Mark

    We present a library of empirical stellar spectra created using spectra from the Sloan Digital Sky Survey’s Baryon Oscillation Spectroscopic Survey. The templates cover spectral types O5 through L3, are binned by metallicity from −2.0 dex through +1.0 dex, and are separated into main-sequence (dwarf) stars and giant stars. With recently developed M dwarf metallicity indicators, we are able to extend the metallicity bins down through the spectral subtype M8, making this the first empirical library with this degree of temperature and metallicity coverage. The wavelength coverage for the templates is from 3650 to 10200 Å at a resolution ofmore » better than R  ∼ 2000. Using the templates, we identify trends in color space with metallicity and surface gravity, which will be useful for analyzing large data sets from upcoming missions like the Large Synoptic Survey Telescope. Along with the templates, we are releasing a code for automatically (and/or visually) identifying the spectral type and metallicity of a star.« less

  12. AquaPathogen X--A template database for tracking field isolates of aquatic pathogens

    USGS Publications Warehouse

    Emmenegger, Evi; Kurath, Gael

    2012-01-01

    AquaPathogen X is a template database for recording information on individual isolates of aquatic pathogens and is available for download from the U.S. Geological Survey (USGS) Western Fisheries Research Center (WFRC) website (http://wfrc.usgs.gov). This template database can accommodate the nucleotide sequence data generated in molecular epidemiological studies along with the myriad of abiotic and biotic traits associated with isolates of various pathogens (for example, viruses, parasites, or bacteria) from multiple aquatic animal host species (for example, fish, shellfish, or shrimp). The simultaneous cataloging of isolates from different aquatic pathogens is a unique feature to the AquaPathogen X database, which can be used in surveillance of emerging aquatic animal diseases and clarification of main risk factors associated with pathogen incursions into new water systems. As a template database, the data fields are empty upon download and can be modified to user specifications. For example, an application of the template database that stores the epidemiological profiles of fish virus isolates, called Fish ViroTrak (fig. 1), was also developed (Emmenegger and others, 2011).

  13. Intermediate Templates Guided Groupwise Registration of Diffusion Tensor Images

    PubMed Central

    Jia, Hongjun; Yap, Pew-Thian; Wu, Guorong; Wang, Qian; Shen, Dinggang

    2010-01-01

    Registration of a population of diffusion tensor images (DTIs) is one of the key steps in medical image analysis, and it plays an important role in the statistical analysis of white matter related neurological diseases. However, pairwise registration with respect to a pre-selected template may not give precise results if the selected template deviates significantly from the distribution of images. To cater for more accurate and consistent registration, a novel framework is proposed for groupwise registration with the guidance from one or more intermediate templates determined from the population of images. Specifically, we first use a Euclidean distance, defined as a combinative measure based on the FA map and ADC map, for gauging the similarity of each pair of DTIs. A fully connected graph is then built with each node denoting an image and each edge denoting the distance between a pair of images. The root template image is determined automatically as the image with the overall shortest path length to all other images on the minimum spanning tree (MST) of the graph. Finally, a sequence of registration steps is applied to progressively warping each image towards the root template image with the help of intermediate templates distributed along its path to the root node on the MST. Extensive experimental results using diffusion tensor images of real subjects indicate that registration accuracy and fiber tract alignment are significantly improved, compared with the direct registration from each image to the root template image. PMID:20851197

  14. Ambient groundwater flow diminishes nitrate processing in the hyporheic zone of streams

    NASA Astrophysics Data System (ADS)

    Azizian, Morvarid; Boano, Fulvio; Cook, Perran L. M.; Detwiler, Russell L.; Rippy, Megan A.; Grant, Stanley B.

    2017-05-01

    Modeling and experimental studies demonstrate that ambient groundwater reduces hyporheic exchange, but the implications of this observation for stream N-cycling is not yet clear. Here we utilize a simple process-based model (the Pumping and Streamline Segregation or PASS model) to evaluate N-cycling over two scales of hyporheic exchange (fluvial ripples and riffle-pool sequences), ten ambient groundwater and stream flow scenarios (five gaining and losing conditions and two stream discharges), and three biogeochemical settings (identified based on a principal component analysis of previously published measurements in streams throughout the United States). Model-data comparisons indicate that our model provides realistic estimates for direct denitrification of stream nitrate, but overpredicts nitrification and coupled nitrification-denitrification. Riffle-pool sequences are responsible for most of the N-processing, despite the fact that fluvial ripples generate 3-11 times more hyporheic exchange flux. Across all scenarios, hyporheic exchange flux and the Damköhler Number emerge as primary controls on stream N-cycling; the former regulates trafficking of nutrients and oxygen across the sediment-water interface, while the latter quantifies the relative rates of organic carbon mineralization and advective transport in streambed sediments. Vertical groundwater flux modulates both of these master variables in ways that tend to diminish stream N-cycling. Thus, anthropogenic perturbations of ambient groundwater flows (e.g., by urbanization, agricultural activities, groundwater mining, and/or climate change) may compromise some of the key ecosystem services provided by streams.

  15. Effects of cerebellar nuclear inactivation on the learning of a complex forelimb movement in cats.

    PubMed

    Wang, J J; Shimansky, Y; Bracha, V; Bloedel, J R

    1998-05-01

    The purpose of this study was to determine the effects of inactivating concurrently the cerebellar interposed and dentate nuclei on the capacity of cats to acquire and retain a complex, goal-directed forelimb movement. To assess the effects on acquisition, cats were required to learn to move a vertical manipulandum bar through a two-segment template with a shape approximating an inverted "L" after the injection of muscimol (saline for the control group) in the interposed and dentate cerebellar nuclei. During training periods, they were exposed progressively to more difficult templates, which were created by decreasing the angle between the two segments of the template. After determining the most difficult template the injected animals could learn within the specified time and performance constraints, the retraining phase of the experiment was initiated in which the cats were required to execute the same sequence of templates in the absence of any injection. This stage of the experiment assessed retention and determined the extent of any relearning required to execute the task at criterion levels. Next, the animals were overtrained without any injection on the most difficult template they could perform. Finally, to determine the effects of nuclear inactivation on retention after extensive retraining, their capacity to perform the same template was determined after muscimol injection in the interposed and dentate nuclei. The findings show that during the inactivation of the dentate and interposed nuclei the animals could learn to execute the more difficult templates. However, when required to execute the most difficult template learned under muscimol on the day after injections were discontinued, the cats had to "relearn" (reacquire) the movement. Finally, when the cerebellar nuclei were inactivated after the animals learned the task in the absence of any injections during the retraining phase, retention was not blocked. The data indicate that the intermediate and lateral cerebellum are not required either for learning this type of complex voluntary movement or for retaining the capacity to perform the task once it is learned. Nevertheless, when the cerebellum becomes available for executing a task learned in the absence of this structure, reacquisition of the behavior usually is necessary. It is hypothesized that the relearning observed after acquisition during muscimol inactivation reflects the tendency of the system to incorporate the cerebellum into the interactions responsible for the learning and performance of a motor sequence that is optimal for executing the task.

  16. Non-Homologous End Joining and Homology Directed DNA Repair Frequency of Double-Stranded Breaks Introduced by Genome Editing Reagents.

    PubMed

    Zaboikin, Michail; Zaboikina, Tatiana; Freter, Carl; Srinivasakumar, Narasimhachar

    2017-01-01

    Genome editing using transcription-activator like effector nucleases or RNA guided nucleases allows one to precisely engineer desired changes within a given target sequence. The genome editing reagents introduce double stranded breaks (DSBs) at the target site which can then undergo DNA repair by non-homologous end joining (NHEJ) or homology directed recombination (HDR) when a template DNA molecule is available. NHEJ repair results in indel mutations at the target site. As PCR amplified products from mutant target regions are likely to exhibit different melting profiles than PCR products amplified from wild type target region, we designed a high resolution melting analysis (HRMA) for rapid identification of efficient genome editing reagents. We also designed TaqMan assays using probes situated across the cut site to discriminate wild type from mutant sequences present after genome editing. The experiments revealed that the sensitivity of the assays to detect NHEJ-mediated DNA repair could be enhanced by selection of transfected cells to reduce the contribution of unmodified genomic DNA from untransfected cells to the DNA melting profile. The presence of donor template DNA lacking the target sequence at the time of genome editing further enhanced the sensitivity of the assays for detection of mutant DNA molecules by excluding the wild-type sequences modified by HDR. A second TaqMan probe that bound to an adjacent site, outside of the primary target cut site, was used to directly determine the contribution of HDR to DNA repair in the presence of the donor template sequence. The TaqMan qPCR assay, designed to measure the contribution of NHEJ and HDR in DNA repair, corroborated the results from HRMA. The data indicated that genome editing reagents can produce DSBs at high efficiency in HEK293T cells but a significant proportion of these are likely masked by reversion to wild type as a result of HDR. Supplying a donor plasmid to provide a template for HDR (that eliminates a PCR amplifiable target) revealed these cryptic DSBs and facilitated the determination of the true efficacy of genome editing reagents. The results indicated that in HEK293T cells, approximately 40% of the DSBs introduced by genome editing, were available for participation in HDR.

  17. Detection of new HLA-DPB1 alleles generated by interallelic gene conversion using PCR amplification of DPB1 second exon sequences from sperm

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Erlich, H.; Zangenberg, G.; Bugawan, T.

    The rate at which allelic diversity at the HLA class I and class II loci evolves has been the subject of considerable controversy as have the mechanisms which generate new alleles. The patchwork pattern of polymorphism, particularly within the second exon of the HLA-DPB1 locus where the polymorphic sequence motifs are localized to 6 discrete regions, is consistent with the hypothesis that much of the allelic sequence variation may have been generated by segmental exchange (gene conversion). To measure the rate of new DPB1 variant generation, we have developed a strategy in which DPB1 second exon sequences are amplified frommore » pools of FACS-sorted sperm (n=50) from a heterozygous sperm donor. Pools of sperm from these heterozygous individuals are amplified with an allele-specific primer for one allele and analyzed with sequence-specific oligonucleotide probes (SSOP) complementary to the other allele. This screening procedure, which is capable of detecting a single variant molecule in a pool of parental alleles, allows the identification of new variants that have been generated by recombination and/or gene conversion between the two parental alleles. To control for potential PCR artifacts, the same screening procedure was carried out with mixtures of sperm from DPB1 *0301/*0301 and DPB1 *0401/ 0401 individuals. Pools containing putative new variants DPB1 alleles were analyzed further by cloning into M13 and sequencing the M13 clones. Our current estimate is that about 1/10,000 sperm from these heterozygous individuals represents a new DPB1 allele generated by micro-gene conversion within the second exon.« less

  18. Modeling IrisCode and its variants as convex polyhedral cones and its security implications.

    PubMed

    Kong, Adams Wai-Kin

    2013-03-01

    IrisCode, developed by Daugman, in 1993, is the most influential iris recognition algorithm. A thorough understanding of IrisCode is essential, because over 100 million persons have been enrolled by this algorithm and many biometric personal identification and template protection methods have been developed based on IrisCode. This paper indicates that a template produced by IrisCode or its variants is a convex polyhedral cone in a hyperspace. Its central ray, being a rough representation of the original biometric signal, can be computed by a simple algorithm, which can often be implemented in one Matlab command line. The central ray is an expected ray and also an optimal ray of an objective function on a group of distributions. This algorithm is derived from geometric properties of a convex polyhedral cone but does not rely on any prior knowledge (e.g., iris images). The experimental results show that biometric templates, including iris and palmprint templates, produced by different recognition methods can be matched through the central rays in their convex polyhedral cones and that templates protected by a method extended from IrisCode can be broken into. These experimental results indicate that, without a thorough security analysis, convex polyhedral cone templates cannot be assumed secure. Additionally, the simplicity of the algorithm implies that even junior hackers without knowledge of advanced image processing and biometric databases can still break into protected templates and reveal relationships among templates produced by different recognition methods.

  19. CABS-fold: Server for the de novo and consensus-based prediction of protein structure.

    PubMed

    Blaszczyk, Maciej; Jamroz, Michal; Kmiecik, Sebastian; Kolinski, Andrzej

    2013-07-01

    The CABS-fold web server provides tools for protein structure prediction from sequence only (de novo modeling) and also using alternative templates (consensus modeling). The web server is based on the CABS modeling procedures ranked in previous Critical Assessment of techniques for protein Structure Prediction competitions as one of the leading approaches for de novo and template-based modeling. Except for template data, fragmentary distance restraints can also be incorporated into the modeling process. The web server output is a coarse-grained trajectory of generated conformations, its Jmol representation and predicted models in all-atom resolution (together with accompanying analysis). CABS-fold can be freely accessed at http://biocomp.chem.uw.edu.pl/CABSfold.

  20. CABS-fold: server for the de novo and consensus-based prediction of protein structure

    PubMed Central

    Blaszczyk, Maciej; Jamroz, Michal; Kmiecik, Sebastian; Kolinski, Andrzej

    2013-01-01

    The CABS-fold web server provides tools for protein structure prediction from sequence only (de novo modeling) and also using alternative templates (consensus modeling). The web server is based on the CABS modeling procedures ranked in previous Critical Assessment of techniques for protein Structure Prediction competitions as one of the leading approaches for de novo and template-based modeling. Except for template data, fragmentary distance restraints can also be incorporated into the modeling process. The web server output is a coarse-grained trajectory of generated conformations, its Jmol representation and predicted models in all-atom resolution (together with accompanying analysis). CABS-fold can be freely accessed at http://biocomp.chem.uw.edu.pl/CABSfold. PMID:23748950

  1. Design of DNA pooling to allow incorporation of covariates in rare variants analysis.

    PubMed

    Guan, Weihua; Li, Chun

    2014-01-01

    Rapid advances in next-generation sequencing technologies facilitate genetic association studies of an increasingly wide array of rare variants. To capture the rare or less common variants, a large number of individuals will be needed. However, the cost of a large scale study using whole genome or exome sequencing is still high. DNA pooling can serve as a cost-effective approach, but with a potential limitation that the identity of individual genomes would be lost and therefore individual characteristics and environmental factors could not be adjusted in association analysis, which may result in power loss and a biased estimate of genetic effect. For case-control studies, we propose a design strategy for pool creation and an analysis strategy that allows covariate adjustment, using multiple imputation technique. Simulations show that our approach can obtain reasonable estimate for genotypic effect with only slight loss of power compared to the much more expensive approach of sequencing individual genomes. Our design and analysis strategies enable more powerful and cost-effective sequencing studies of complex diseases, while allowing incorporation of covariate adjustment.

  2. Increased Fos expression among midbrain dopaminergic cell groups during birdsong tutoring.

    PubMed

    Nordeen, E J; Holtzman, D A; Nordeen, K W

    2009-08-01

    During avian vocal learning, birds memorize conspecific song patterns and then use auditory feedback to match their vocal output to this acquired template. Some models of song learning posit that during tutoring, conspecific visual, social and/or auditory cues activate neuromodulatory systems that encourage acquisition of the tutor's song and attach incentive value to that specific acoustic pattern. This hypothesis predicts that stimuli experienced during social tutoring activate cell populations capable of signaling reward. Using immunocytochemistry for the protein product of the immediate early gene c-Fos, we found that brief exposure of juvenile male zebra finches to a live familiar male tutor increased the density of Fos+ cells within two brain regions implicated in reward processing: the ventral tegmental area (VTA) and substantia nigra pars compacta (SNc). This activation of Fos appears to involve both dopaminergic and non-dopaminergic VTA/SNc neurons. Intriguingly, a familiar tutor was more effective than a novel tutor in stimulating Fos expression within these regions. In the periaqueductal gray, a dopamine-enriched cell population that has been implicated in emotional processing, Fos labeling also was increased after tutoring, with a familiar tutor again being more effective than a novel conspecific. As several neural regions implicated in song acquisition receive strong dopaminergic projections from these midbrain nuclei, their activation in conjunction with hearing the tutor's song could help to establish sensory representations that later guide motor sequence learning.

  3. Single-cell genomic sequencing using Multiple Displacement Amplification.

    PubMed

    Lasken, Roger S

    2007-10-01

    Single microbial cells can now be sequenced using DNA amplified by the Multiple Displacement Amplification (MDA) reaction. The few femtograms of DNA in a bacterium are amplified into micrograms of high molecular weight DNA suitable for DNA library construction and Sanger sequencing. The MDA-generated DNA also performs well when used directly as template for pyrosequencing by the 454 Life Sciences method. While MDA from single cells loses some of the genomic sequence, this approach will greatly accelerate the pace of sequencing from uncultured microbes. The genetically linked sequences from single cells are also a powerful tool to be used in guiding genomic assembly of shotgun sequences of multiple organisms from environmental DNA extracts (metagenomic sequences).

  4. Automation and integration of multiplexed on-line sample preparation with capillary electrophoresis for DNA sequencing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tan, H.

    1999-03-31

    The purpose of this research is to develop a multiplexed sample processing system in conjunction with multiplexed capillary electrophoresis for high-throughput DNA sequencing. The concept from DNA template to called bases was first demonstrated with a manually operated single capillary system. Later, an automated microfluidic system with 8 channels based on the same principle was successfully constructed. The instrument automatically processes 8 templates through reaction, purification, denaturation, pre-concentration, injection, separation and detection in a parallel fashion. A multiplexed freeze/thaw switching principle and a distribution network were implemented to manage flow direction and sample transportation. Dye-labeled terminator cycle-sequencing reactions are performedmore » in an 8-capillary array in a hot air thermal cycler. Subsequently, the sequencing ladders are directly loaded into a corresponding size-exclusion chromatographic column operated at {approximately} 60 C for purification. On-line denaturation and stacking injection for capillary electrophoresis is simultaneously accomplished at a cross assembly set at {approximately} 70 C. Not only the separation capillary array but also the reaction capillary array and purification columns can be regenerated after every run. DNA sequencing data from this system allow base calling up to 460 bases with accuracy of 98%.« less

  5. Quantitative Assessment of RNA-Protein Interactions with High Throughput Sequencing - RNA Affinity Profiling (HiTS-RAP)

    PubMed Central

    Ozer, Abdullah; Tome, Jacob M.; Friedman, Robin C.; Gheba, Dan; Schroth, Gary P.; Lis, John T.

    2016-01-01

    Because RNA-protein interactions play a central role in a wide-array of biological processes, methods that enable a quantitative assessment of these interactions in a high-throughput manner are in great demand. Recently, we developed the High Throughput Sequencing-RNA Affinity Profiling (HiTS-RAP) assay, which couples sequencing on an Illumina GAIIx with the quantitative assessment of one or several proteins’ interactions with millions of different RNAs in a single experiment. We have successfully used HiTS-RAP to analyze interactions of EGFP and NELF-E proteins with their corresponding canonical and mutant RNA aptamers. Here, we provide a detailed protocol for HiTS-RAP, which can be completed in about a month (8 days hands-on time) including the preparation and testing of recombinant proteins and DNA templates, clustering DNA templates on a flowcell, high-throughput sequencing and protein binding with GAIIx, and finally data analysis. We also highlight aspects of HiTS-RAP that can be further improved and points of comparison between HiTS-RAP and two other recently developed methods, RNA-MaP and RBNS. A successful HiTS-RAP experiment provides the sequence and binding curves for approximately 200 million RNAs in a single experiment. PMID:26182240

  6. Telomere extension by telomerase and ALT generates variant repeats by mechanistically distinct processes

    PubMed Central

    Lee, Michael; Hills, Mark; Conomos, Dimitri; Stutz, Michael D.; Dagg, Rebecca A.; Lau, Loretta M.S.; Reddel, Roger R.; Pickett, Hilda A.

    2014-01-01

    Telomeres are terminal repetitive DNA sequences on chromosomes, and are considered to comprise almost exclusively hexameric TTAGGG repeats. We have evaluated telomere sequence content in human cells using whole-genome sequencing followed by telomere read extraction in a panel of mortal cell strains and immortal cell lines. We identified a wide range of telomere variant repeats in human cells, and found evidence that variant repeats are generated by mechanistically distinct processes during telomerase- and ALT-mediated telomere lengthening. Telomerase-mediated telomere extension resulted in biased repeat synthesis of variant repeats that differed from the canonical sequence at positions 1 and 3, but not at positions 2, 4, 5 or 6. This indicates that telomerase is most likely an error-prone reverse transcriptase that misincorporates nucleotides at specific positions on the telomerase RNA template. In contrast, cell lines that use the ALT pathway contained a large range of variant repeats that varied greatly between lines. This is consistent with variant repeats spreading from proximal telomeric regions throughout telomeres in a stochastic manner by recombination-mediated templating of DNA synthesis. The presence of unexpectedly large numbers of variant repeats in cells utilizing either telomere maintenance mechanism suggests a conserved role for variant sequences at human telomeres. PMID:24225324

  7. Mapping neurofibromatosis 1 homologous loci by fluorescence in situ hybridization

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Viskochil, D.; Breidenbach, H.H.; Cawthon, R.

    Neurofibromatosis 1 maps to chromosome band 17q11.2 and the NF1 gene is comprised of 59 exons that span approximately 335 kb of genomic DNA. In order to further analyze the structure of NF1 from exons 2 through 27b, we isolated a number of cosmid and bacteriophage P-1 genomic clones using NF1-exon probes under high-stringency hybridization conditions. Using tagged, intron-based primers and DNA from various clones as a template, we PCR-amplified and sequenced individual NF1 exons. The exon sequences in PCR products from several genomic clones differed from the exon sequence derived from cloned NF1 cDNAs. Clones with variant sequences weremore » mapped by fluorescence in situ hybridization under high-stringency conditions. Three clones mapped to chromosome band 15q11.2, one mapped to 14q11.2, one mapped to both 2q14.1-14.3 and 14q11.2, one mapped to 2q33-34, and one mapped to both 18q11.2 and 21q21. Even though some PCR-product sequences retained proper splice junctions and open reading frames, we have yet to identify cDNAs that correspond to the variant exon sequences. We are now sequencing clones that map to NF1-homologous loci in order to develop discriminating primer pairs for the exclusive amplification of NF1-specific sequences in our efforts to develop a comprehensive NF1 mutation screen using genomic DNA as template. The role of NF1-homologous sequences may play in neurofibromatosis 1 is not clear.« less

  8. Balancing gene expression without library construction via a reusable sRNA pool.

    PubMed

    Ghodasara, Amar; Voigt, Christopher A

    2017-07-27

    Balancing protein expression is critical when optimizing genetic systems. Typically, this requires library construction to vary the genetic parts controlling each gene, which can be expensive and time-consuming. Here, we develop sRNAs corresponding to 15nt 'target' sequences that can be inserted upstream of a gene. The targeted gene can be repressed from 1.6- to 87-fold by controlling sRNA expression using promoters of different strength. A pool is built where six sRNAs are placed under the control of 16 promoters that span a ∼103-fold range of strengths, yielding ∼107 combinations. This pool can simultaneously optimize up to six genes in a system. This requires building only a single system-specific construct by placing a target sequence upstream of each gene and transforming it with the pre-built sRNA pool. The resulting library is screened and the top clone is sequenced to determine the promoter controlling each sRNA, from which the fold-repression of the genes can be inferred. The system is then rebuilt by rationally selecting parts that implement the optimal expression of each gene. We demonstrate the versatility of this approach by using the same pool to optimize a metabolic pathway (β-carotene) and genetic circuit (XNOR logic gate). © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  9. Sequencing technologies - the next generation.

    PubMed

    Metzker, Michael L

    2010-01-01

    Demand has never been greater for revolutionary technologies that deliver fast, inexpensive and accurate genome information. This challenge has catalysed the development of next-generation sequencing (NGS) technologies. The inexpensive production of large volumes of sequence data is the primary advantage over conventional methods. Here, I present a technical review of template preparation, sequencing and imaging, genome alignment and assembly approaches, and recent advances in current and near-term commercially available NGS instruments. I also outline the broad range of applications for NGS technologies, in addition to providing guidelines for platform selection to address biological questions of interest.

  10. Impact of library preparation protocols and template quantity on the metagenomic reconstruction of a mock microbial community

    DOE PAGES

    Bowers, Robert M.; Clum, Alicia; Tice, Hope; ...

    2015-10-24

    Background: The rapid development of sequencing technologies has provided access to environments that were either once thought inhospitable to life altogether or that contain too few cells to be analyzed using genomics approaches. While 16S rRNA gene microbial community sequencing has revolutionized our understanding of community composi tion and diversity over time and space, it only provides a crude estimate of microbial functional and metabolic potential. Alternatively, shotgun metagenomics allows comprehensive sampling of all genetic material in an environment, without any underlying primer biases. Until recently, one of the major bottlenecks of shotgun metagenomics has been the requirement for largemore » initial DNA template quantities during library preparation. Results: Here, we investigate the effects of varying template concentrations across three low biomass library preparation protocols on their ability to accurately reconstruct a mock microbial community of known composition. We analyze the effects of input DNA quantity and library preparation method on library insert size, GC content, community composition, assembly quality and metagenomic binning. We found that library preparation method and the amount of starting material had significant impacts on the mock community metagenomes. In particular, GC content shifted towards more GC rich sequences at the lower input quantities regardless of library prep method, the number of low quality reads that could not be mapped to the reference genomes increased with decreasing input quantities, and the different library preparation methods had an impact on overall metagenomic community composition. Conclusions: This benchmark study provides recommendations for library creation of representative and minimally biased metagenome shotgun sequencing, enabling insights into functional attributes of low biomass ecosystem microbial communities.« less

  11. Impact of library preparation protocols and template quantity on the metagenomic reconstruction of a mock microbial community

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bowers, Robert M.; Clum, Alicia; Tice, Hope

    Background: The rapid development of sequencing technologies has provided access to environments that were either once thought inhospitable to life altogether or that contain too few cells to be analyzed using genomics approaches. While 16S rRNA gene microbial community sequencing has revolutionized our understanding of community composi tion and diversity over time and space, it only provides a crude estimate of microbial functional and metabolic potential. Alternatively, shotgun metagenomics allows comprehensive sampling of all genetic material in an environment, without any underlying primer biases. Until recently, one of the major bottlenecks of shotgun metagenomics has been the requirement for largemore » initial DNA template quantities during library preparation. Results: Here, we investigate the effects of varying template concentrations across three low biomass library preparation protocols on their ability to accurately reconstruct a mock microbial community of known composition. We analyze the effects of input DNA quantity and library preparation method on library insert size, GC content, community composition, assembly quality and metagenomic binning. We found that library preparation method and the amount of starting material had significant impacts on the mock community metagenomes. In particular, GC content shifted towards more GC rich sequences at the lower input quantities regardless of library prep method, the number of low quality reads that could not be mapped to the reference genomes increased with decreasing input quantities, and the different library preparation methods had an impact on overall metagenomic community composition. Conclusions: This benchmark study provides recommendations for library creation of representative and minimally biased metagenome shotgun sequencing, enabling insights into functional attributes of low biomass ecosystem microbial communities.« less

  12. Novel rare variations of the oxytocin receptor (OXTR) gene in autism spectrum disorder individuals.

    PubMed

    Liu, Xiaoxi; Kawashima, Minae; Miyagawa, Taku; Otowa, Takeshi; Latt, Khun Zaw; Thiri, Myo; Nishida, Hisami; Sugiyama, Toshiro; Tsurusaki, Yoshinori; Matsumoto, Naomichi; Mabuchi, Akihiko; Tokunaga, Katsushi; Sasaki, Tsukasa

    2015-01-01

    The oxytocin receptor (OXTR) gene has been implicated as a risk gene for autism spectrum disorder (ASD)-a neurodevelopmental disorder with essential features of impairments in social communication and reciprocal interaction. The genetic associations between common variations in OXTR and ASD have been reported in multiple ethnic populations. However, little is known about the distribution of rare variations within OXTR in ASD patients. In this study, we resequenced the full length of OXTR in 105 ASD individuals using an approach that combined the power of next-generation sequencing technology, long-range PCR and DNA pooling. We demonstrated that rare variants with minor allele frequency as low as 0.05% could be reliably detected by our method. We identified 28 novel variants including potential functional variants in the intron region and one rare missense variant (R150S). We subsequently performed Sanger sequencing and validated five novel variants located in previously suggested candidate regions in ASD individuals. Further sequencing of 312 healthy subjects showed that the burden of rare variants is significantly higher in ASDs compared with healthy individuals. Our results support that the rare variation in OXTR gene might be involved in ASD.

  13. Adsorption and condensation of amino acids and nucleotides with soluble mineral salts

    NASA Technical Reports Server (NTRS)

    Orenberg, J.; Lahav, N.

    1986-01-01

    The directed synthesis of biopolymers in an abiotic environment is presumably a cyclic sequence of steps which may be realized in a fluctuating environment such as a prebiotic pond undergoing wetting-drying cycles. Soluble mineral salts have been proposed as an essential component of this fluctuating environment. The following sequence may be considered as a most primitive mechanism of information transfer in a fluctuating environment: (1) adsorption of a biomolecule onto a soluable mineral salt surface to act as an adsorbed template; (2) specific adsorption of biomonomers onto the adsorbed template; (3) condensation of the adsorbed biomonomers; and (4) desorption of the elongated oligomer. In this investigation, the salts selected for study were CaSO4.2H2O(gypsum), SrSO4, and several other metal sulfates and chlorides. Adsorption of the monomeric species, gly, 5'AMP 5'GMP, and 5'CMP was investigated. The adsorbed template biopolymers used were Poly-A, Poly-G, Poly-C, and Poly-U. The results of studies involving these experimental participants, the first two steps of the proposed primitive information transfer mechanism, and condensation of amino acids to form oligomers in a fluctuating environment are to be reported.

  14. Terminations of DNA synthesis on 'proflavine and light'-treated phi X174 single-stranded DNA.

    PubMed

    Piette, J; Calberg-Bacq, C M; Lopez, M; van de Vorst, A

    1984-04-05

    Bacteriophage phi X174 single-stranded DNA molecules were primed with five different restriction fragments and irradiated with visible light in the presence of proflavine. This photodamaged DNA was used as template for the in vitro complementary chain synthesis by E. coli DNA polymerase I (Klenow fragment). Chain terminations were observed by polyacrylamide gel electrophoresis of the synthesized products and localized by comparison with standard sequencing performed simultaneously on the untreated template. 90% of the chain terminations occurred one nucleotide before a guanine residue in the template strand. More than 80% of the sequenced guanine residues were blocking lesions demonstrating the absence of 'hot-spots' for the photodamaging effect of proflavine. At a defined position, the chain termination frequency increased linearly with the irradiation time and was directly influenced by the proflavine concentration present. An important part of lesions resulted from the action of singlet oxygen produced by excited proflavine as shown by the effect that both NaN3 and 2H2O exerted on the reaction. The induced blocking lesions must be important in vivo since no complete replicative forms could be extracted from cell infected with bacteriophages inactivated by 'proflavine and light' treatment.

  15. Single-stranded DNA-binding Protein in Vitro Eliminates the Orientation-dependent Impediment to Polymerase Passage on CAG/CTG Repeats*

    PubMed Central

    Delagoutte, Emmanuelle; Goellner, Geoffrey M.; Guo, Jie; Baldacci, Giuseppe; McMurray, Cynthia T.

    2008-01-01

    Small insertions and deletions of trinucleotide repeats (TNRs) can occur by polymerase slippage and hairpin formation on either template or newly synthesized strands during replication. Although not predicted by a slippage model, deletions occur preferentially when 5′-CTG is in the lagging strand template and are highly favored over insertion events in rapidly replicating cells. The mechanism for the deletion bias and the orientation dependence of TNR instability is poorly understood. We report here that there is an orientation-dependent impediment to polymerase progression on 5′-CAG and 5′-CTG repeats that can be relieved by the binding of single-stranded DNA-binding protein. The block depends on the primary sequence of the TNR but does not correlate with the thermodynamic stability of hairpins. The orientation-dependent block of polymerase passage is the strongest when 5′-CAG is the template. We propose a “template-push” model in which the slow speed of DNA polymerase across the 5′-CAG leading strand template creates a threat to helicase-polymerase coupling. To prevent uncoupling, the TNR template is pushed out and by-passed. Hairpins do not cause the block, but appear to occur as a consequence of polymerase pass-over. PMID:18263578

  16. High-Throughput Parallel Sequencing to Measure Fitness of Leptospira interrogans Transposon Insertion Mutants during Acute Infection

    PubMed Central

    Matsunaga, James; Haake, David A.

    2016-01-01

    Pathogenic species of Leptospira are the causative agents of leptospirosis, a zoonotic disease that causes mortality and morbidity worldwide. The understanding of the virulence mechanisms of Leptospira spp is still at an early stage due to the limited number of genetic tools available for this microorganism. The development of random transposon mutagenesis in pathogenic strains a decade ago has contributed to the identification of several virulence factors. In this study, we used the transposon sequencing (Tn-Seq) technique, which combines transposon mutagenesis with massive parallel sequencing, to study the in vivo fitness of a pool of Leptospira interrogans mutants. We infected hamsters with a pool of 42 mutants (input pool), which included control mutants with insertions in four genes previously analyzed by virulence testing (loa22, ligB, flaA1, and lic20111) and 23 mutants with disrupted signal transduction genes. We quantified the mutants in different tissues (blood, kidney and liver) at 4 days post-challenge by high-throughput sequencing and compared the frequencies of mutants recovered from tissues to their frequencies in the input pool. Control mutants that were less fit in the Tn-Seq experiment were attenuated for virulence when tested separately in the hamster model of lethal leptospirosis. Control mutants with unaltered fitness were as virulent as the wild-type strain. We identified two mutants with the transposon inserted in the same putative adenylate/guanylate cyclase gene (lic12327) that had reduced in vivo fitness in blood, kidney and liver. Both lic12327 mutants were attenuated for virulence when tested individually in hamsters. Growth of the control mutants and lic12327 mutants in culture medium were similar to that of the wild-type strain. These results demonstrate the feasibility of screening large pools of L. interrogans transposon mutants for those with altered fitness, and potentially attenuated virulence, by transposon sequencing. PMID:27824878

  17. BioWord: A sequence manipulation suite for Microsoft Word

    PubMed Central

    2012-01-01

    Background The ability to manipulate, edit and process DNA and protein sequences has rapidly become a necessary skill for practicing biologists across a wide swath of disciplines. In spite of this, most everyday sequence manipulation tools are distributed across several programs and web servers, sometimes requiring installation and typically involving frequent switching between applications. To address this problem, here we have developed BioWord, a macro-enabled self-installing template for Microsoft Word documents that integrates an extensive suite of DNA and protein sequence manipulation tools. Results BioWord is distributed as a single macro-enabled template that self-installs with a single click. After installation, BioWord will open as a tab in the Office ribbon. Biologists can then easily manipulate DNA and protein sequences using a familiar interface and minimize the need to switch between applications. Beyond simple sequence manipulation, BioWord integrates functionality ranging from dyad search and consensus logos to motif discovery and pair-wise alignment. Written in Visual Basic for Applications (VBA) as an open source, object-oriented project, BioWord allows users with varying programming experience to expand and customize the program to better meet their own needs. Conclusions BioWord integrates a powerful set of tools for biological sequence manipulation within a handy, user-friendly tab in a widely used word processing software package. The use of a simple scripting language and an object-oriented scheme facilitates customization by users and provides a very accessible educational platform for introducing students to basic bioinformatics algorithms. PMID:22676326

  18. BioWord: a sequence manipulation suite for Microsoft Word.

    PubMed

    Anzaldi, Laura J; Muñoz-Fernández, Daniel; Erill, Ivan

    2012-06-07

    The ability to manipulate, edit and process DNA and protein sequences has rapidly become a necessary skill for practicing biologists across a wide swath of disciplines. In spite of this, most everyday sequence manipulation tools are distributed across several programs and web servers, sometimes requiring installation and typically involving frequent switching between applications. To address this problem, here we have developed BioWord, a macro-enabled self-installing template for Microsoft Word documents that integrates an extensive suite of DNA and protein sequence manipulation tools. BioWord is distributed as a single macro-enabled template that self-installs with a single click. After installation, BioWord will open as a tab in the Office ribbon. Biologists can then easily manipulate DNA and protein sequences using a familiar interface and minimize the need to switch between applications. Beyond simple sequence manipulation, BioWord integrates functionality ranging from dyad search and consensus logos to motif discovery and pair-wise alignment. Written in Visual Basic for Applications (VBA) as an open source, object-oriented project, BioWord allows users with varying programming experience to expand and customize the program to better meet their own needs. BioWord integrates a powerful set of tools for biological sequence manipulation within a handy, user-friendly tab in a widely used word processing software package. The use of a simple scripting language and an object-oriented scheme facilitates customization by users and provides a very accessible educational platform for introducing students to basic bioinformatics algorithms.

  19. Synthesis and Pharmacology of α/β(3)-Peptides Based on the Melanocortin Agonist Ac-His-dPhe-Arg-Trp-NH2 Sequence.

    PubMed

    Singh, Anamika; Tala, Srinivasa R; Flores, Viktor; Freeman, Katie; Haskell-Luevano, Carrie

    2015-05-14

    The melanocortin-3 and -4 receptors are expressed in the brain and play key roles in regulating feeding behavior, metabolism, and energy homeostasis. In the present study, incorporation of β(3)-amino acids into a melanocortin tetrapeptide template was investigated. Four linear α/β(3)-hybrid tetrapeptides were designed with the modifications at the Phe, Arg, and Trp residues in the agonist sequence Ac-His-dPhe-Arg-Trp-NH2. The most potent mouse melanocortin-4 receptor (mMC4R) agonist, Ac-His-dPhe-Arg-β(3)hTrp-NH2 (8) showed 35-fold selectivity versus the mMC3R. The study presented here has identified a new template with heterogeneous backbone for designing potent and selective melanocortin receptor ligands.

  20. Synthesis and Pharmacology of α/β3-Peptides Based on the Melanocortin Agonist Ac-His-dPhe-Arg-Trp-NH2 Sequence

    PubMed Central

    2015-01-01

    The melanocortin-3 and -4 receptors are expressed in the brain and play key roles in regulating feeding behavior, metabolism, and energy homeostasis. In the present study, incorporation of β3-amino acids into a melanocortin tetrapeptide template was investigated. Four linear α/β3-hybrid tetrapeptides were designed with the modifications at the Phe, Arg, and Trp residues in the agonist sequence Ac-His-dPhe-Arg-Trp-NH2. The most potent mouse melanocortin-4 receptor (mMC4R) agonist, Ac-His-dPhe-Arg-β3hTrp-NH2 (8) showed 35-fold selectivity versus the mMC3R. The study presented here has identified a new template with heterogeneous backbone for designing potent and selective melanocortin receptor ligands. PMID:26005535

  1. The Complete Genomic Sequence of Pepper Yellow Leaf Curl Virus (PYLCV) and Its Implications for Our Understanding of Evolution Dynamics in the Genus Polerovirus

    PubMed Central

    Dombrovsky, Aviv; Glanz, Eyal; Lachman, Oded; Sela, Noa; Doron-Faigenboim, Adi; Antignus, Yehezkel

    2013-01-01

    We determined the complete sequence and organization of the genome of a putative member of the genus Polerovirus tentatively named Pepper yellow leaf curl virus (PYLCV). PYLCV has a wider host range than Tobacco vein-distorting virus (TVDV) and has a close serological relationship with Cucurbit aphid-borne yellows virus (CABYV) (both poleroviruses). The extracted viral RNA was subjected to SOLiD next-generation sequence analysis and used as a template for reverse transcription synthesis, which was followed by PCR amplification. The ssRNA genome of PYLCV includes 6,028 nucleotides encoding six open reading frames (ORFs), which is typical of the genus Polerovirus. Comparisons of the deduced amino acid sequences of the PYLCV ORFs 2-4 and ORF5, indicate that there are high levels of similarity between these sequences to ORFs 2-4 of TVDV (84-93%) and to ORF5 of CABYV (87%). Both PYLCV and Pepper vein yellowing virus (PeVYV) contain sequences that point to a common ancestral polerovirus. The recombination breakpoint which is located at CABYV ORF3, which encodes the viral coat protein (CP), may explain the CABYV-like sequences found in the genomes of the pepper infecting viruses PYLCV and PeVYV. Two additional regions unique to PYLCV (PY1 and PY2) were identified between nucleotides 4,962 and 5,061 (ORF 5) and between positions 5,866 and 6,028 in the 3' NCR. Sequence analysis of the pepper-infecting PeVYV revealed three unique regions (Pe1-Pe3) with no similarity to other members of the genus Polerovirus. Genomic analyses of PYLCV and PeVYV suggest that the speciation of these viruses occurred through putative recombination event(s) between poleroviruses co-infecting a common host(s), resulting in the emergence of PYLCV, a novel pathogen with a wider host range. PMID:23936244

  2. The complete genomic sequence of pepper yellow leaf curl virus (PYLCV) and its implications for our understanding of evolution dynamics in the genus polerovirus.

    PubMed

    Dombrovsky, Aviv; Glanz, Eyal; Lachman, Oded; Sela, Noa; Doron-Faigenboim, Adi; Antignus, Yehezkel

    2013-01-01

    We determined the complete sequence and organization of the genome of a putative member of the genus Polerovirus tentatively named Pepper yellow leaf curl virus (PYLCV). PYLCV has a wider host range than Tobacco vein-distorting virus (TVDV) and has a close serological relationship with Cucurbit aphid-borne yellows virus (CABYV) (both poleroviruses). The extracted viral RNA was subjected to SOLiD next-generation sequence analysis and used as a template for reverse transcription synthesis, which was followed by PCR amplification. The ssRNA genome of PYLCV includes 6,028 nucleotides encoding six open reading frames (ORFs), which is typical of the genus Polerovirus. Comparisons of the deduced amino acid sequences of the PYLCV ORFs 2-4 and ORF5, indicate that there are high levels of similarity between these sequences to ORFs 2-4 of TVDV (84-93%) and to ORF5 of CABYV (87%). Both PYLCV and Pepper vein yellowing virus (PeVYV) contain sequences that point to a common ancestral polerovirus. The recombination breakpoint which is located at CABYV ORF3, which encodes the viral coat protein (CP), may explain the CABYV-like sequences found in the genomes of the pepper infecting viruses PYLCV and PeVYV. Two additional regions unique to PYLCV (PY1 and PY2) were identified between nucleotides 4,962 and 5,061 (ORF 5) and between positions 5,866 and 6,028 in the 3' NCR. Sequence analysis of the pepper-infecting PeVYV revealed three unique regions (Pe1-Pe3) with no similarity to other members of the genus Polerovirus. Genomic analyses of PYLCV and PeVYV suggest that the speciation of these viruses occurred through putative recombination event(s) between poleroviruses co-infecting a common host(s), resulting in the emergence of PYLCV, a novel pathogen with a wider host range.

  3. Co-operation between Polymerases and Nucleotide Synthetases in the RNA World.

    PubMed

    Kim, Ye Eun; Higgs, Paul G

    2016-11-01

    It is believed that life passed through an RNA World stage in which replication was sustained by catalytic RNAs (ribozymes). The two most obvious types of ribozymes are a polymerase, which uses a neighbouring strand as a template to make a complementary sequence to the template, and a nucleotide synthetase, which synthesizes monomers for use by the polymerase. When a chemical source of monomers is available, the polymerase can survive on its own. When the chemical supply of monomers is too low, nucleotide production by the synthetase is essential and the two ribozymes can only survive when they are together. Here we consider a computational model to investigate conditions under which coexistence and cooperation of these two types of ribozymes is possible. The model considers six types of strands: the two functional sequences, the complementary strands to these sequences (which are required as templates), and non-functional mutants of the two sequences (which act as parasites). Strands are distributed on a two-dimensional lattice. Polymerases replicate strands on neighbouring sites and synthetases produce monomers that diffuse in the local neighbourhood. We show that coexistence of unlinked polymerases and synthetases is possible in this spatial model under conditions in which neither sequence could survive alone; hence, there is a selective force for increasing complexity. Coexistence is dependent on the relative lengths of the two functional strands, the strand diffusion rate, the monomer diffusion rate, and the rate of deleterious mutations. The sensitivity of this two-ribozyme system suggests that evolution of a system of many types of ribozymes would be difficult in a purely spatial model with unlinked genes. We therefore speculate that linkage of genes onto mini-chromosomes and encapsulation of strands in protocells would have been important fairly early in the history of life as a means of enabling more complex systems to evolve.

  4. Limited copy number - high resolution melting (LCN-HRM) enables the detection and identification by sequencing of low level mutations in cancer biopsies

    PubMed Central

    Do, Hongdo; Dobrovic, Alexander

    2009-01-01

    Background Mutation detection in clinical tumour samples is challenging when the proportion of tumour cells, and thus mutant alleles, is low. The limited sensitivity of conventional sequencing necessitates the adoption of more sensitive approaches. High resolution melting (HRM) is more sensitive than sequencing but identification of the mutation is desirable, particularly when it is important to discriminate false positives due to PCR errors or template degradation from true mutations. We thus developed limited copy number - high resolution melting (LCN-HRM) which applies limiting dilution to HRM. Multiple replicate reactions with a limited number of target sequences per reaction allow low level mutations to be detected. The dilutions used (based on Ct values) are chosen such that mutations, if present, can be detected by the direct sequencing of amplicons with aberrant melting patterns. Results Using cell lines heterozygous for mutations, we found that the mutations were not readily detected when they comprised 10% of total alleles (20% tumour cells) by sequencing, whereas they were readily detectable at 5% total alleles by standard HRM. LCN-HRM allowed these mutations to be identified by direct sequencing of those positive reactions. LCN-HRM was then used to review formalin-fixed paraffin-embedded (FFPE) clinical samples showing discordant findings between sequencing and HRM for KRAS exon 2 and EGFR exons 19 and 21. Both true mutations present at low levels and sequence changes due to artefacts were detected by LCN-HRM. The use of high fidelity polymerases showed that the majority of the artefacts were derived from the damaged template rather than replication errors during amplification. Conclusion LCN-HRM bridges the sensitivity gap between HRM and sequencing and is effective in distinguishing between artefacts and true mutations. PMID:19811662

  5. Limited copy number-high resolution melting (LCN-HRM) enables the detection and identification by sequencing of low level mutations in cancer biopsies.

    PubMed

    Do, Hongdo; Dobrovic, Alexander

    2009-10-08

    Mutation detection in clinical tumour samples is challenging when the proportion of tumour cells, and thus mutant alleles, is low. The limited sensitivity of conventional sequencing necessitates the adoption of more sensitive approaches. High resolution melting (HRM) is more sensitive than sequencing but identification of the mutation is desirable, particularly when it is important to discriminate false positives due to PCR errors or template degradation from true mutations.We thus developed limited copy number - high resolution melting (LCN-HRM) which applies limiting dilution to HRM. Multiple replicate reactions with a limited number of target sequences per reaction allow low level mutations to be detected. The dilutions used (based on Ct values) are chosen such that mutations, if present, can be detected by the direct sequencing of amplicons with aberrant melting patterns. Using cell lines heterozygous for mutations, we found that the mutations were not readily detected when they comprised 10% of total alleles (20% tumour cells) by sequencing, whereas they were readily detectable at 5% total alleles by standard HRM. LCN-HRM allowed these mutations to be identified by direct sequencing of those positive reactions.LCN-HRM was then used to review formalin-fixed paraffin-embedded (FFPE) clinical samples showing discordant findings between sequencing and HRM for KRAS exon 2 and EGFR exons 19 and 21. Both true mutations present at low levels and sequence changes due to artefacts were detected by LCN-HRM. The use of high fidelity polymerases showed that the majority of the artefacts were derived from the damaged template rather than replication errors during amplification. LCN-HRM bridges the sensitivity gap between HRM and sequencing and is effective in distinguishing between artefacts and true mutations.

  6. Multiplexed droplet single-cell RNA-sequencing using natural genetic variation.

    PubMed

    Kang, Hyun Min; Subramaniam, Meena; Targ, Sasha; Nguyen, Michelle; Maliskova, Lenka; McCarthy, Elizabeth; Wan, Eunice; Wong, Simon; Byrnes, Lauren; Lanata, Cristina M; Gate, Rachel E; Mostafavi, Sara; Marson, Alexander; Zaitlen, Noah; Criswell, Lindsey A; Ye, Chun Jimmie

    2018-01-01

    Droplet single-cell RNA-sequencing (dscRNA-seq) has enabled rapid, massively parallel profiling of transcriptomes. However, assessing differential expression across multiple individuals has been hampered by inefficient sample processing and technical batch effects. Here we describe a computational tool, demuxlet, that harnesses natural genetic variation to determine the sample identity of each droplet containing a single cell (singlet) and detect droplets containing two cells (doublets). These capabilities enable multiplexed dscRNA-seq experiments in which cells from unrelated individuals are pooled and captured at higher throughput than in standard workflows. Using simulated data, we show that 50 single-nucleotide polymorphisms (SNPs) per cell are sufficient to assign 97% of singlets and identify 92% of doublets in pools of up to 64 individuals. Given genotyping data for each of eight pooled samples, demuxlet correctly recovers the sample identity of >99% of singlets and identifies doublets at rates consistent with previous estimates. We apply demuxlet to assess cell-type-specific changes in gene expression in 8 pooled lupus patient samples treated with interferon (IFN)-β and perform eQTL analysis on 23 pooled samples.

  7. Attempted nonenzymatic template-directed oligomerizations on a polyadenylic acid template: implications for the nature of the first genetic material

    NASA Technical Reports Server (NTRS)

    Stribling, R.; Miller, S. L.

    1991-01-01

    Previous attempts to produce nonenzymatic template-directed oligomerizations of activated pyrimidines on polypurine templates have been unsuccessful. The only efficient reactions are those where the template is composed primarily of pyrimidines, especially cytosine. Because molecular evolution requires that a synthesized daughter polynucleotide be capable of acting as a template for the synthesis of the original polynucleotide, the one-way replication achieved thus far is inadequate to initiate an evolving system. Several uracil analogs were used in this investigation in order to search for possible replacements for uracil. The monomers used in this investigation were the imidazolides of UMP, xanthosine 5'-monophosphate, the bis-monophosphates of the acyclic nucleosides of uracil, and 2,4-quinazolinedione. The concentrations of various salts, buffers, pH, and temperature were among the different variables investigated in attempts to find conditions that would permit template-directed oligomerizations. Although the different monomers in this study demonstrated varying abilities to form very short oligomers, we were unable to detect any enhancement of this oligomerization that could be attributed to the poly(A) template. Although special conditions might be found that would allow purine-rich templates to work, these reactions cannot be considered robust. The results of our experiments suggest that pyrimidines were not part of the original replicating system on the primitive Earth. It has already been shown that ribose is an unlikely component of the first replicating systems, and we now suggest that phosphate was absent as well. This is due to the low solubility of phosphate in the present ocean (3 x 10(-6) M), as well as the difficulty of prebiotic activation of phosphates.

  8. Bypass of a Nick by the Replisome of Bacteriophage T7*

    PubMed Central

    Zhu, Bin; Lee, Seung-Joo; Richardson, Charles C.

    2011-01-01

    DNA polymerase and DNA helicase are essential components of DNA replication. The helicase unwinds duplex DNA to provide single-stranded templates for DNA synthesis by the DNA polymerase. In bacteriophage T7, movement of either the DNA helicase or the DNA polymerase alone terminates upon encountering a nick in duplex DNA. Using a minicircular DNA, we show that the helicase·polymerase complex can bypass a nick, albeit at reduced efficiency of 7%, on the non-template strand to continue rolling circle DNA synthesis. A gap in the non-template strand cannot be bypassed. The efficiency of bypass synthesis depends on the DNA sequence downstream of the nick. A nick on the template strand cannot be bypassed. Addition of T7 single-stranded DNA-binding protein to the complex stimulates nick bypass 2-fold. We propose that the association of helicase with the polymerase prevents dissociation of the helicase upon encountering a nick, allowing the helicase to continue unwinding of the duplex downstream of the nick. PMID:21701044

  9. Bypass of a nick by the replisome of bacteriophage T7.

    PubMed

    Zhu, Bin; Lee, Seung-Joo; Richardson, Charles C

    2011-08-12

    DNA polymerase and DNA helicase are essential components of DNA replication. The helicase unwinds duplex DNA to provide single-stranded templates for DNA synthesis by the DNA polymerase. In bacteriophage T7, movement of either the DNA helicase or the DNA polymerase alone terminates upon encountering a nick in duplex DNA. Using a minicircular DNA, we show that the helicase · polymerase complex can bypass a nick, albeit at reduced efficiency of 7%, on the non-template strand to continue rolling circle DNA synthesis. A gap in the non-template strand cannot be bypassed. The efficiency of bypass synthesis depends on the DNA sequence downstream of the nick. A nick on the template strand cannot be bypassed. Addition of T7 single-stranded DNA-binding protein to the complex stimulates nick bypass 2-fold. We propose that the association of helicase with the polymerase prevents dissociation of the helicase upon encountering a nick, allowing the helicase to continue unwinding of the duplex downstream of the nick.

  10. Improved multiple displacement amplification (iMDA) and ultraclean reagents.

    PubMed

    Motley, S Timothy; Picuri, John M; Crowder, Chris D; Minich, Jeremiah J; Hofstadler, Steven A; Eshoo, Mark W

    2014-06-06

    Next-generation sequencing sample preparation requires nanogram to microgram quantities of DNA; however, many relevant samples are comprised of only a few cells. Genomic analysis of these samples requires a whole genome amplification method that is unbiased and free of exogenous DNA contamination. To address these challenges we have developed protocols for the production of DNA-free consumables including reagents and have improved upon multiple displacement amplification (iMDA). A specialized ethylene oxide treatment was developed that renders free DNA and DNA present within Gram positive bacterial cells undetectable by qPCR. To reduce DNA contamination in amplification reagents, a combination of ion exchange chromatography, filtration, and lot testing protocols were developed. Our multiple displacement amplification protocol employs a second strand-displacing DNA polymerase, improved buffers, improved reaction conditions and DNA free reagents. The iMDA protocol, when used in combination with DNA-free laboratory consumables and reagents, significantly improved efficiency and accuracy of amplification and sequencing of specimens with moderate to low levels of DNA. The sensitivity and specificity of sequencing of amplified DNA prepared using iMDA was compared to that of DNA obtained with two commercial whole genome amplification kits using 10 fg (~1-2 bacterial cells worth) of bacterial genomic DNA as a template. Analysis showed >99% of the iMDA reads mapped to the template organism whereas only 0.02% of the reads from the commercial kits mapped to the template. To assess the ability of iMDA to achieve balanced genomic coverage, a non-stochastic amount of bacterial genomic DNA (1 pg) was amplified and sequenced, and data obtained were compared to sequencing data obtained directly from genomic DNA. The iMDA DNA and genomic DNA sequencing had comparable coverage 99.98% of the reference genome at ≥1X coverage and 99.9% at ≥5X coverage while maintaining both balance and representation of the genome. The iMDA protocol in combination with DNA-free laboratory consumables, significantly improved the ability to sequence specimens with low levels of DNA. iMDA has broad utility in metagenomics, diagnostics, ancient DNA analysis, pre-implantation embryo screening, single-cell genomics, whole genome sequencing of unculturable organisms, and forensic applications for both human and microbial targets.

  11. Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases

    PubMed Central

    Llorens, Carlos; Futami, Ricardo; Renaud, Gabriel; Moya, Andrés

    2009-01-01

    Background Clan AA of aspartic peptidases relates the family of pepsin monomers evolutionarily with all dimeric peptidases encoded by eukaryotic LTR retroelements. Recent findings describing various pools of single-domain nonviral host peptidases, in prokaryotes and eukaryotes, indicate that the diversity of clan AA is larger than previously thought. The ensuing approach to investigate this enzyme group is by studying its phylogeny. However, clan AA is a difficult case to study due to the low similarity and different rates of evolution. This work is an ongoing attempt to investigate the different clan AA families to understand the cause of their diversity. Results In this paper, we describe in-progress database and bioinformatic flowchart designed to characterize the clan AA protein domain based on all possible protein families through ancestral reconstructions, sequence logos, and hidden markov models (HMMs). The flowchart includes the characterization of a major consensus sequence based on 6 amino acid patterns with correspondence with Andreeva's model, the structural template describing the clan AA peptidase fold. The set of tools is work in progress we have organized in a database within the GyDB project, referred to as Clan AA Reference Database . Conclusion The pre-existing classification combined with the evolutionary history of LTR retroelements permits a consistent taxonomical collection of sequence logos and HMMs. This set is useful for gene annotation but also a reference to evaluate the diversity of, and the relationships among, the different families. Comparisons among HMMs suggest a common ancestor for all dimeric clan AA peptidases that is halfway between single-domain nonviral peptidases and those coded by Ty3/Gypsy LTR retroelements. Sequence logos reveal how all clan AA families follow similar protein domain architecture related to the peptidase fold. In particular, each family nucleates a particular consensus motif in the sequence position related to the flap. The different motifs constitute a network where an alanine-asparagine-like variable motif predominates, instead of the canonical flap of the HIV-1 peptidase and closer relatives. Reviewers This article was reviewed by Daniel H. Haft, Vladimir Kapitonov (nominated by Jerry Jurka), and Ben M. Dunn (nominated by Claus Wilke). PMID:19173708

  12. Massive integration of diverse protein quality assessment methods to improve template based modeling in CASP11

    PubMed Central

    Cao, Renzhi; Bhattacharya, Debswapna; Adhikari, Badri; Li, Jilong; Cheng, Jianlin

    2015-01-01

    Model evaluation and selection is an important step and a big challenge in template-based protein structure prediction. Individual model quality assessment methods designed for recognizing some specific properties of protein structures often fail to consistently select good models from a model pool because of their limitations. Therefore, combining multiple complimentary quality assessment methods is useful for improving model ranking and consequently tertiary structure prediction. Here, we report the performance and analysis of our human tertiary structure predictor (MULTICOM) based on the massive integration of 14 diverse complementary quality assessment methods that was successfully benchmarked in the 11th Critical Assessment of Techniques of Protein Structure prediction (CASP11). The predictions of MULTICOM for 39 template-based domains were rigorously assessed by six scoring metrics covering global topology of Cα trace, local all-atom fitness, side chain quality, and physical reasonableness of the model. The results show that the massive integration of complementary, diverse single-model and multi-model quality assessment methods can effectively leverage the strength of single-model methods in distinguishing quality variation among similar good models and the advantage of multi-model quality assessment methods of identifying reasonable average-quality models. The overall excellent performance of the MULTICOM predictor demonstrates that integrating a large number of model quality assessment methods in conjunction with model clustering is a useful approach to improve the accuracy, diversity, and consequently robustness of template-based protein structure prediction. PMID:26369671

  13. AST: an automated sequence-sampling method for improving the taxonomic diversity of gene phylogenetic trees.

    PubMed

    Zhou, Chan; Mao, Fenglou; Yin, Yanbin; Huang, Jinling; Gogarten, Johann Peter; Xu, Ying

    2014-01-01

    A challenge in phylogenetic inference of gene trees is how to properly sample a large pool of homologous sequences to derive a good representative subset of sequences. Such a need arises in various applications, e.g. when (1) accuracy-oriented phylogenetic reconstruction methods may not be able to deal with a large pool of sequences due to their high demand in computing resources; (2) applications analyzing a collection of gene trees may prefer to use trees with fewer operational taxonomic units (OTUs), for instance for the detection of horizontal gene transfer events by identifying phylogenetic conflicts; and (3) the pool of available sequences is biased towards extensively studied species. In the past, the creation of subsamples often relied on manual selection. Here we present an Automated sequence-Sampling method for improving the Taxonomic diversity of gene phylogenetic trees, AST, to obtain representative sequences that maximize the taxonomic diversity of the sampled sequences. To demonstrate the effectiveness of AST, we have tested it to solve four problems, namely, inference of the evolutionary histories of the small ribosomal subunit protein S5 of E. coli, 16 S ribosomal RNAs and glycosyl-transferase gene family 8, and a study of ancient horizontal gene transfers from bacteria to plants. Our results show that the resolution of our computational results is almost as good as that of manual inference by domain experts, hence making the tool generally useful to phylogenetic studies by non-phylogeny specialists. The program is available at http://csbl.bmb.uga.edu/~zhouchan/AST.php.

  14. AST: An Automated Sequence-Sampling Method for Improving the Taxonomic Diversity of Gene Phylogenetic Trees

    PubMed Central

    Zhou, Chan; Mao, Fenglou; Yin, Yanbin; Huang, Jinling; Gogarten, Johann Peter; Xu, Ying

    2014-01-01

    A challenge in phylogenetic inference of gene trees is how to properly sample a large pool of homologous sequences to derive a good representative subset of sequences. Such a need arises in various applications, e.g. when (1) accuracy-oriented phylogenetic reconstruction methods may not be able to deal with a large pool of sequences due to their high demand in computing resources; (2) applications analyzing a collection of gene trees may prefer to use trees with fewer operational taxonomic units (OTUs), for instance for the detection of horizontal gene transfer events by identifying phylogenetic conflicts; and (3) the pool of available sequences is biased towards extensively studied species. In the past, the creation of subsamples often relied on manual selection. Here we present an Automated sequence-Sampling method for improving the Taxonomic diversity of gene phylogenetic trees, AST, to obtain representative sequences that maximize the taxonomic diversity of the sampled sequences. To demonstrate the effectiveness of AST, we have tested it to solve four problems, namely, inference of the evolutionary histories of the small ribosomal subunit protein S5 of E. coli, 16 S ribosomal RNAs and glycosyl-transferase gene family 8, and a study of ancient horizontal gene transfers from bacteria to plants. Our results show that the resolution of our computational results is almost as good as that of manual inference by domain experts, hence making the tool generally useful to phylogenetic studies by non-phylogeny specialists. The program is available at http://csbl.bmb.uga.edu/~zhouchan/AST.php. PMID:24892935

  15. Pool Formation in Boulder-Bed Streams: Implications From 1-D and 2-D Numerical Modeling

    NASA Astrophysics Data System (ADS)

    Harrison, L. R.; Keller, E. A.

    2003-12-01

    In mountain rivers of Southern California, boulder-large roughness elements strongly influence flow hydraulics and pool formation and maintenance. In these systems, boulders appear to control the stream morphology by converging flow and producing deep pools during channel forming discharges. Our research goal is to develop quantitative relationships between boulder roughness elements, temporal patterns of scour and fill, and geomorphic processes that are important in producing pool habitat. The longitudinal distribution of shear stress, unit stream power and velocity were estimated along a 48 m reach on Rattlesnake Creek, using the HEC-RAS v 3.0 and River 2-D numerical models. The reach has an average slope of 0.02 and consists of a pool-riffle sequence with a large boulder constriction directly above the pool. Model runs were performed for a range of stream discharges to test if scour and fill thresholds for pool and riffle environments could be identified. Results from the HEC-RAS simulations identified that thresholds in shear stress, unit stream power and mean velocity occur above a discharge of 5.0 cms. Results from the one-dimensional analysis suggest that the reversal in competency is likely due to changes in cross-sectional width at varying flows. River 2-D predictions indicated that strong transverse velocity gradients were present through the pool at higher modeled discharges. At a flow of 0.5 cms (roughly 1/10th bankfull discharge), velocities are estimated at 0.6 m/s and 1.3 m/s for the pool and riffle, respectively. During discharges of 5.15 cms (approximate bankfull discharge), the maximum velocity in the pool center increased to nearly 3.0 m/s, while the maximum velocity over the riffle is estimated at approximately 2.5 cms. These results are consistent with those predicted by HEC-RAS, though the reversal appears to be limited to a narrow jet that occurs through the pool head and pool center. Model predictions suggest that the velocity reversal is produced by a boulder-bedrock constriction that rapidly decreases the channel width above the pool by roughly 25 percent. The width constriction creates highly turbulent flow capable of scouring bed material through the pool. The high velocity core that is produced through the pool center appears to be enhanced by the formation of a large eddy directly below the boulder. Values of unit stream power and shear stress indicate that the pool exit is an area of deposition of bed material due to a decrease in tractive force. The presence of a strong transverse velocity gradient suggests that only a portion of the flow is responsible for scouring bed material. After we eliminate the dead water zone, the lowest five percent of the velocity range, patterns of effective width between pools and riffles begin to emerge. The ratio of flow width between adjacent pools and riffles is one measure of flow convergence. At a discharge of 0.5 cms, the ratio of effective width between pools and riffles is roughly 1:1, implying that there is uniform flow with little flow convergence. At a discharge of 5.15 cms the width ratio between the pool and riffle is about 1:3, demonstrating the strong convergent flow patterns at the pool head. The observed effective width relationship suggests that when considering restoration designs, boulders should be placed in areas that replicate natural convergence and divergence patterns in order to maximize pool area and depth.

  16. DNA sequencing with pyrophosphatase

    DOEpatents

    Tabor, S.; Richardson, C.C.

    1996-03-12

    A kit or solution is disclosed for use in extension of an oligonucleotide primer having a first single-stranded region on a template molecule and having a second single-stranded region homologous to the first single-stranded region. The first agent is able to cause extension of the first single-stranded region of the primer on the second single-stranded region of the template in a reaction mixture. The second agent is able to reduce the amount of pyrophosphate in the reaction mixture below the amount produced during the extension in the absence of the second agent.

  17. DNA sequencing with pyrophosphatase

    DOEpatents

    Tabor, Stanley; Richardson, Charles C.

    1996-03-12

    A kit or solution for use in extension of an oligonucleotide primer having a first single-stranded region on a template molecule having a second single-stranded region homologous to the first single-stranded region, comprising a first agent able to cause extension of the first single-stranded region of the primer on the second single-stranded region of the template in a reaction mixture, and a second agent able to reduce the amount of pyrophosphate in the reaction mixture below the amount produced during the extension in the absence of the second agent.

  18. PCR Amplification Strategies towards full-length HIV-1 Genome sequencing.

    PubMed

    Liu, Chao Chun; Ji, Hezhao

    2018-06-26

    The advent of next generation sequencing has enabled greater resolution of viral diversity and improved feasibility of full viral genome sequencing allowing routine HIV-1 full genome sequencing in both research and diagnostic settings. Regardless of the sequencing platform selected, successful PCR amplification of the HIV-1 genome is essential for sequencing template preparation. As such, full HIV-1 genome amplification is a crucial step in dictating the successful and reliable sequencing downstream. Here we reviewed existing PCR protocols leading to HIV-1 full genome sequencing. In addition to the discussion on basic considerations on relevant PCR design, the advantages as well as the pitfalls of published protocols were reviewed. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  19. Iron ion and iron hydroxide adsorption to charge-neutral phosphatidylcholine templates

    DOE PAGES

    Wang, Wenjie; Zhang, Honghu; Feng, Shuren; ...

    2016-07-13

    Surface-sensitive X-ray scattering and spectroscopy techniques reveal significant adsorption of iron ions and iron-hydroxide (Fe(III)) complexes to a charge-neutral zwitterionic template of phosphatidylcholine (PC). The PC template is formed by a Langmuir monolayer of dipalmitoyl-PC (DPPC) that is spread on the surface of 2 to 40 μM FeCl 3 solutions at physiological levels of KCl (100 mM). At 40 μM of Fe(III) as many as ~3 iron atoms are associated with each PC group. Grazing incidence X-ray diffraction measurements indicate a significant disruption in the in-plane ordering of DPPC molecules upon iron adsorption. The binding of iron-hydroxide complexes to amore » neutral PC surface is yet another example of nonelectrostatic, presumably covalent bonding to a charge-neutral organic template. Furthermore, the strong binding and the disruption of in-plane lipid structure has biological implications on the integrity of PC-derived lipid membranes, including those based on sphingomyelin.« less

  20. Sequence-similar, structure-dissimilar protein pairs in the PDB.

    PubMed

    Kosloff, Mickey; Kolodny, Rachel

    2008-05-01

    It is often assumed that in the Protein Data Bank (PDB), two proteins with similar sequences will also have similar structures. Accordingly, it has proved useful to develop subsets of the PDB from which "redundant" structures have been removed, based on a sequence-based criterion for similarity. Similarly, when predicting protein structure using homology modeling, if a template structure for modeling a target sequence is selected by sequence alone, this implicitly assumes that all sequence-similar templates are equivalent. Here, we show that this assumption is often not correct and that standard approaches to create subsets of the PDB can lead to the loss of structurally and functionally important information. We have carried out sequence-based structural superpositions and geometry-based structural alignments of a large number of protein pairs to determine the extent to which sequence similarity ensures structural similarity. We find many examples where two proteins that are similar in sequence have structures that differ significantly from one another. The source of the structural differences usually has a functional basis. The number of such proteins pairs that are identified and the magnitude of the dissimilarity depend on the approach that is used to calculate the differences; in particular sequence-based structure superpositioning will identify a larger number of structurally dissimilar pairs than geometry-based structural alignments. When two sequences can be aligned in a statistically meaningful way, sequence-based structural superpositioning provides a meaningful measure of structural differences. This approach and geometry-based structure alignments reveal somewhat different information and one or the other might be preferable in a given application. Our results suggest that in some cases, notably homology modeling, the common use of nonredundant datasets, culled from the PDB based on sequence, may mask important structural and functional information. We have established a data base of sequence-similar, structurally dissimilar protein pairs that will help address this problem (http://luna.bioc.columbia.edu/rachel/seqsimstrdiff.htm).

  1. BAC-pool 454-sequencing: A rapid and efficient approach to sequence complex tetraploid cotton genomes

    USDA-ARS?s Scientific Manuscript database

    New and emerging next generation sequencing technologies have been promising in reducing sequencing costs, but not significantly for complex polyploid plant genomes such as cotton. Large and highly repetitive genome of G. hirsutum (~2.5GB) is less amenable and cost-intensive with traditional BAC-by...

  2. A review of basin morphology and pool hydrology of isolated ponded wetlands: implications for seasonal forest pools of the northeastern United States

    Treesearch

    Robert T. Brooks; Robert T. Brooks

    2005-01-01

    Seasonal forest pools (SFPs) are geographically- and hydrologically- isolated ponded wetlands, in that they are topographically isolated from other surface waters. SFPs occur commonly throughout the temperate forests of the eastern United States and adjacent Canada. SFPs are ephemeral in occurrence, typically drying annually. The regular drying of SFPs excludes fish...

  3. Toward a General Approach for RNA-Templated Hierarchical Assembly of Split-Proteins

    PubMed Central

    Furman, Jennifer L.; Badran, Ahmed H.; Ajulo, Oluyomi; Porter, Jason R.; Stains, Cliff I.; Segal, David J.; Ghosh, Indraneel

    2010-01-01

    The ability to conditionally turn on a signal or induce a function in the presence of a user-defined RNA target has potential applications in medicine and synthetic biology. Although sequence-specific pumilio repeat proteins can target a limited set of ssRNA sequences, there are no general methods for targeting ssRNA with designed proteins. As a first step toward RNA recognition, we utilized the RNA binding domain of argonaute, implicated in RNA interference, for specifically targeting generic 2-nucleotide, 3' overhangs of any dsRNA. We tested the reassembly of a split-luciferase enzyme guided by argonaute-mediated recognition of newly generated nucleotide overhangs when ssRNA is targeted by a designed complementary guide sequence. This approach was successful when argonaute was utilized in conjunction with a pumilio repeat and expanded the scope of potential ssRNA targets. However, targeting any desired ssRNA remained elusive as two argonaute domains provided minimal reassembled split-luciferase. We next designed and tested a second hierarchical assembly, wherein ssDNA guides are appended to DNA hairpins that serve as a scaffold for high affinity zinc fingers attached to split-luciferase. In the presence of a ssRNA target containing adjacent sequences complementary to the guides, the hairpins are brought into proximity, allowing for zinc finger binding and concomitant reassembly of the fragmented luciferase. The scope of this new approach was validated by specifically targeting RNA encoding VEGF, hDM2, and HER2. These approaches provide potentially general design paradigms for the conditional reassembly of fragmented proteins in the presence of any desired ssRNA target. PMID:20681585

  4. Multiple templates-based homology modeling enhances structure quality of AT1 receptor: validation by molecular dynamics and antagonist docking.

    PubMed

    Sokkar, Pandian; Mohandass, Shylajanaciyar; Ramachandran, Murugesan

    2011-07-01

    We present a comparative account on 3D-structures of human type-1 receptor (AT1) for angiotensin II (AngII), modeled using three different methodologies. AngII activates a wide spectrum of signaling responses via the AT1 receptor that mediates physiological control of blood pressure and diverse pathological actions in cardiovascular, renal, and other cell types. Availability of 3D-model of AT1 receptor would significantly enhance the development of new drugs for cardiovascular diseases. However, templates of AT1 receptor with low sequence similarity increase the complexity in straightforward homology modeling, and hence there is a need to evaluate different modeling methodologies in order to use the models for sensitive applications such as rational drug design. Three models were generated for AT1 receptor by, (1) homology modeling with bovine rhodopsin as template, (2) homology modeling with multiple templates and (3) threading using I-TASSER web server. Molecular dynamics (MD) simulation (15 ns) of models in explicit membrane-water system, Ramachandran plot analysis and molecular docking with antagonists led to the conclusion that multiple template-based homology modeling outweighs other methodologies for AT1 modeling.

  5. A difference tracking algorithm based on discrete sine transform

    NASA Astrophysics Data System (ADS)

    Liu, HaoPeng; Yao, Yong; Lei, HeBing; Wu, HaoKun

    2018-04-01

    Target tracking is an important field of computer vision. The template matching tracking algorithm based on squared difference matching (SSD) and standard correlation coefficient (NCC) matching is very sensitive to the gray change of image. When the brightness or gray change, the tracking algorithm will be affected by high-frequency information. Tracking accuracy is reduced, resulting in loss of tracking target. In this paper, a differential tracking algorithm based on discrete sine transform is proposed to reduce the influence of image gray or brightness change. The algorithm that combines the discrete sine transform and the difference algorithm maps the target image into a image digital sequence. The Kalman filter predicts the target position. Using the Hamming distance determines the degree of similarity between the target and the template. The window closest to the template is determined the target to be tracked. The target to be tracked updates the template. Based on the above achieve target tracking. The algorithm is tested in this paper. Compared with SSD and NCC template matching algorithms, the algorithm tracks target stably when image gray or brightness change. And the tracking speed can meet the read-time requirement.

  6. A prebiotic template-directed peptide synthesis based on amyloids.

    PubMed

    Rout, Saroj K; Friedmann, Michael P; Riek, Roland; Greenwald, Jason

    2018-01-16

    The prebiotic replication of information-coding molecules is a central problem concerning life's origins. Here, we report that amyloids composed of short peptides can direct the sequence-selective, regioselective and stereoselective condensation of amino acids. The addition of activated DL-arginine and DL-phenylalanine to the peptide RFRFR-NH 2 in the presence of the complementary template peptide Ac-FEFEFEFE-NH 2 yields the isotactic product FRFRFRFR-NH 2 , 1 of 64 possible triple addition products, under conditions in which the absence of template yields only single and double additions of mixed stereochemistry. The templating mechanism appears to be general in that a different amyloid formed by (Orn)V(Orn)V(Orn)V(Orn)V-NH 2 and Ac-VDVDVDVDV-NH 2 is regioselective and stereoselective for N-terminal, L-amino-acid addition while the ornithine-valine peptide alone yields predominantly sidechain condensation products with little stereoselectivity. Furthermore, the templating reaction is stable over a wide range of pH (5.6-8.6), salt concentration (0-4 M NaCl), and temperature (25-90 °C), making the amyloid an attractive model for a prebiotic peptide replicating system.

  7. Single Molecule Bioelectronics and Their Application to Amplification-Free Measurement of DNA Lengths

    PubMed Central

    Gül, O. Tolga; Pugliese, Kaitlin M.; Choi, Yongki; Sims, Patrick C.; Pan, Deng; Rajapakse, Arith J.; Weiss, Gregory A.; Collins, Philip G.

    2016-01-01

    As biosensing devices shrink smaller and smaller, they approach a scale in which single molecule electronic sensing becomes possible. Here, we review the operation of single-enzyme transistors made using single-walled carbon nanotubes. These novel hybrid devices transduce the motions and catalytic activity of a single protein into an electronic signal for real-time monitoring of the protein’s activity. Analysis of these electronic signals reveals new insights into enzyme function and proves the electronic technique to be complementary to other single-molecule methods based on fluorescence. As one example of the nanocircuit technique, we have studied the Klenow Fragment (KF) of DNA polymerase I as it catalytically processes single-stranded DNA templates. The fidelity of DNA polymerases makes them a key component in many DNA sequencing techniques, and here we demonstrate that KF nanocircuits readily resolve DNA polymerization with single-base sensitivity. Consequently, template lengths can be directly counted from electronic recordings of KF’s base-by-base activity. After measuring as few as 20 copies, the template length can be determined with <1 base pair resolution, and different template lengths can be identified and enumerated in solutions containing template mixtures. PMID:27348011

  8. Single Molecule Bioelectronics and Their Application to Amplification-Free Measurement of DNA Lengths.

    PubMed

    Gül, O Tolga; Pugliese, Kaitlin M; Choi, Yongki; Sims, Patrick C; Pan, Deng; Rajapakse, Arith J; Weiss, Gregory A; Collins, Philip G

    2016-06-24

    As biosensing devices shrink smaller and smaller, they approach a scale in which single molecule electronic sensing becomes possible. Here, we review the operation of single-enzyme transistors made using single-walled carbon nanotubes. These novel hybrid devices transduce the motions and catalytic activity of a single protein into an electronic signal for real-time monitoring of the protein's activity. Analysis of these electronic signals reveals new insights into enzyme function and proves the electronic technique to be complementary to other single-molecule methods based on fluorescence. As one example of the nanocircuit technique, we have studied the Klenow Fragment (KF) of DNA polymerase I as it catalytically processes single-stranded DNA templates. The fidelity of DNA polymerases makes them a key component in many DNA sequencing techniques, and here we demonstrate that KF nanocircuits readily resolve DNA polymerization with single-base sensitivity. Consequently, template lengths can be directly counted from electronic recordings of KF's base-by-base activity. After measuring as few as 20 copies, the template length can be determined with <1 base pair resolution, and different template lengths can be identified and enumerated in solutions containing template mixtures.

  9. Support for HIV-1 Intervention Therapy

    DTIC Science & Technology

    1993-10-01

    I. Kiselev, and E. S. Severin. 1990. Amplification of DNA 46 sequences of Epstein - Barr and human immunodeficiency viruses using DNA-polymerase from... develop and validate assays that predict or demonstrate disease progression for use in interventional trials with an emphasis on molecular biologic...to stay on the leading edge of technology development . A potential problem in obtaining quality sequence information is the occurrence of template

  10. Construction of the BAC Library of Small Abalone (Haliotis diversicolor) for Gene Screening and Genome Characterization.

    PubMed

    Jiang, Likun; You, Weiwei; Zhang, Xiaojun; Xu, Jian; Jiang, Yanliang; Wang, Kai; Zhao, Zixia; Chen, Baohua; Zhao, Yunfeng; Mahboob, Shahid; Al-Ghanim, Khalid A; Ke, Caihuan; Xu, Peng

    2016-02-01

    The small abalone (Haliotis diversicolor) is one of the most important aquaculture species in East Asia. To facilitate gene cloning and characterization, genome analysis, and genetic breeding of it, we constructed a large-insert bacterial artificial chromosome (BAC) library, which is an important genetic tool for advanced genetics and genomics research. The small abalone BAC library includes 92,610 clones with an average insert size of 120 Kb, equivalent to approximately 7.6× of the small abalone genome. We set up three-dimensional pools and super pools of 18,432 BAC clones for target gene screening using PCR method. To assess the approach, we screened 12 target genes in these 18,432 BAC clones and identified 16 positive BAC clones. Eight positive BAC clones were then sequenced and assembled with the next generation sequencing platform. The assembled contigs representing these 8 BAC clones spanned 928 Kb of the small abalone genome, providing the first batch of genome sequences for genome evaluation and characterization. The average GC content of small abalone genome was estimated as 40.33%. A total of 21 protein-coding genes, including 7 target genes, were annotated into the 8 BACs, which proved the feasibility of PCR screening approach with three-dimensional pools in small abalone BAC library. One hundred fifty microsatellite loci were also identified from the sequences for marker development in the future. The BAC library and clone pools provided valuable resources and tools for genetic breeding and conservation of H. diversicolor.

  11. Full-genome dengue virus sequencing in mosquito saliva shows lack of convergent positive selection during transmission by Aedes aegypti

    PubMed Central

    Cao-Lormeau, Van-Mai; Lambrechts, Louis

    2017-01-01

    Abstract Like other pathogens with high mutation and replication rates, within-host dengue virus (DENV) populations evolve during infection of their main mosquito vector, Aedes aegypti. Within-host DENV evolution during transmission provides opportunities for adaptation and emergence of novel virus variants. Recent studies of DENV genetic diversity failed to detect convergent evolution of adaptive mutations in mosquito tissues such as midgut and salivary glands, suggesting that convergent positive selection is not a major driver of within-host DENV evolution in the vector. However, it is unknown whether this conclusion extends to the transmitted viral subpopulation because it is technically difficult to sequence DENV genomes in mosquito saliva. Here, we achieved DENV full-genome sequencing by pooling saliva samples collected non-sacrificially from 49 to 163 individual Ae. aegypti mosquitoes previously infected with one of two DENV-1 genotypes. We compared the transmitted viral subpopulations found in the pooled saliva samples collected in time series with the input viral population present in the infectious blood meal. In all pooled saliva samples examined, the full-genome consensus sequence of the input viral population was unchanged. Although the pooling strategy prevents analysis of individual saliva samples, our results demonstrate the lack of strong convergent positive selection during a single round of DENV transmission by Ae. aegypti. This finding reinforces the idea that genetic drift and purifying selection are the dominant evolutionary forces shaping within-host DENV genetic diversity during transmission by mosquitoes. PMID:29497564

  12. Generation of non-genomic oligonucleotide tag sequences for RNA template-specific PCR

    PubMed Central

    Pinto, Fernando Lopes; Svensson, Håkan; Lindblad, Peter

    2006-01-01

    Background In order to overcome genomic DNA contamination in transcriptional studies, reverse template-specific polymerase chain reaction, a modification of reverse transcriptase polymerase chain reaction, is used. The possibility of using tags whose sequences are not found in the genome further improves reverse specific polymerase chain reaction experiments. Given the absence of software available to produce genome suitable tags, a simple tool to fulfill such need was developed. Results The program was developed in Perl, with separate use of the basic local alignment search tool, making the tool platform independent (known to run on Windows XP and Linux). In order to test the performance of the generated tags, several molecular experiments were performed. The results show that Tagenerator is capable of generating tags with good priming properties, which will deliberately not result in PCR amplification of genomic DNA. Conclusion The program Tagenerator is capable of generating tag sequences that combine genome absence with good priming properties for RT-PCR based experiments, circumventing the effects of genomic DNA contamination in an RNA sample. PMID:16820068

  13. Template-guided vs. non-guided drilling in site preparation of dental implants.

    PubMed

    Scherer, Uta; Stoetzer, Marcus; Ruecker, Martin; Gellrich, Nils-Claudius; von See, Constantin

    2015-07-01

    Clinical success of oral implants is related to primary stability and osseointegration. These parameters are associated with delicate surgical techniques. We herein studied whether template-guided drilling has a significant influence on drillholes diameter and accuracy in an in vitro model. Fresh cadaveric porcine mandibles were used for drilling experiments of four experimental groups. Each group consisted of three operators, comparing guide templates for drilling with free-handed procedure. Operators without surgical knowledge were grouped together, contrasting highly experienced oral surgeons in other groups. A total of 180 drilling actions were performed, and diameters were recorded at multiple depth levels, with a precision measuring instrument. Template-guided drilling procedure improved accuracy on a very significant level in comparison with free-handed drilling operation (p ≤ 0.001). Inaccuracy of free-handed drilling became more significant in relation to measurement depth. High homogenic uniformity of template-guided drillholes was significantly stronger than unguided drilling operations by highly experienced oral surgeons (p ≤ 0.001). Template-guided drilling procedure leads to significantly enhanced accuracy. Significant results compared to free-handed drilling actions were achieved, irrespective of the clinical experience level of the operator. Template-guided drilling procedures lead to a more predictable clinical diameter. It shows that any set of instruments has to be carefully chosen to match the specific implant system. The current in vitro study is implicating an improvement of implant bed preparation but needs to be confirmed in clinical studies.

  14. A revised velocity-reversal and sediment-sorting model for a high-gradient, pool-riffle stream

    USGS Publications Warehouse

    Thompson, D.M.; Wohl, E.E.; Jarrett, R.D.

    1996-01-01

    Sediment-sorting processes related to varying channel-bed morphology were investigated from April to November 1993 along a 1-km pool-riffle and step-pool reach of North Saint Vrain Creek, a small mountain stream in the Rocky Mountains of northern Colorado. Measured cross-sectional areas of flow were used to suggest higher velocities in pools than in riffles at high flow. Three hundred and sixteen tracer particles, ranging in size from 16 mm to 256 mm, were placed in two separate pool-riffle-pool sequences and used to assess sediment-sorting patterns and sediment-transport competence variations. Tracer-particle depositional evidence indicated higher sediment-transport competence in pools than in riffles at high flow. Pool-riffle sediment sorting may be created by velocity reversals, and more localized sorting results from gravitational forces along the upstream sloping portion of the channel bed located at the downstream end of pools.

  15. Characterization of the stability and folding of H2A.Z chromatin particles: implications for transcriptional activation.

    PubMed

    Abbott, D W; Ivanova, V S; Wang, X; Bonner, W M; Ausió, J

    2001-11-09

    H2A.Z and H2A.1 nucleosome core particles and oligonucleosome arrays were obtained using recombinant versions of these histones and a native histone H2B/H3/H4 complement reconstituted onto appropriate DNA templates. Analysis of the reconstituted nucleosome core particles using native polyacrylamide gel electrophoresis and DNase I footprinting showed that H2A.Z nucleosome core particles were almost structurally indistinguishable from its H2A.1 or native chicken erythrocyte counterparts. While this result is in good agreement with the recently published crystallographic structure of the H2A.Z nucleosome core particle (Suto, R. K., Clarkson, M J., Tremethick, D. J., and Luger, K. (2000) Nat. Struct. Biol. 7, 1121-1124), the ionic strength dependence of the sedimentation coefficient of these particles exhibits a substantial destabilization, which is most likely the result of the histone H2A.Z-H2B dimer binding less tightly to the nucleosome. Analytical ultracentrifuge analysis of the H2A.Z 208-12, a DNA template consisting of 12 tandem repeats of a 208-base pair sequence derived from the sea urchin Lytechinus variegatus 5 S rRNA gene, reconstituted oligonucleosome complexes in the absence of histone H1 shows that their NaCl-dependent folding ability is significantly reduced. These results support the notion that the histone H2A.Z variant may play a chromatin-destabilizing role, which may be important for transcriptional activation.

  16. Intercontinental convergence of stream fish community traits along geomorphic and hydraulic gradients

    USGS Publications Warehouse

    Lamouroux, N.; Poff, N.L.; Angermeier, P.L.

    2002-01-01

    Community convergence across biogeographically distinct regions suggests the existence of key, repeated, evolutionary mechanisms relating community characteristics to the environment. However, convergence studies at the community level often involve only qualitative comparisons of the environment and may fail to identify which environmental variables drive community structure. We tested the hypothesis that the biological traits of fish communities on two continents (Europe and North America) are similarly related to environmental conditions. Specifically, from observations of individual fish made at the microhabitat scale (a few square meters) within French streams, we generated habitat preference models linking traits of fish species to local scale hydraulic conditions (Froude number), Using this information, we then predicted how hydraulics and geomorphology at the larger scale of stream reaches (several pool-riffle sequences) should quantitatively influence the trait composition of fish communities. Trait composition for fishes in stream reaches with low Froude number at low flow or high proportion of pools was predicted as nonbenthic, large, fecund, long-lived, nonstreamlined, and weak swimmers. We tested our predictions in contrasting stream reaches in France (n = 11) and Virginia, USA (n = 76), using analyses of covariance to quantify the relative influence of continent vs. physical habitat variables on fish traits. The reach-scale convergence analysis indicated that trait proportions in the communities differed between continents (up to 55% of the variance in each trait was explained by "continent"), partly due to distinct evolutionary histories. However, within continents, trait proportions were comparably related to the hydraulic and geomorphic variables (up to 54% of the variance within continents explained). In particular, a synthetic measure of fish traits in reaches was well explained (50% of its variance) by the Froude number independently of the continent. The effect of physical variables did not differ across continents for most traits, confirming our predictions qualitatively and quantitatively. Therefore, despite phylogenetic and historical differences between continents, fish communities of France and Virginia exhibit convergence in biological traits related to hydraulics and geomorphology. This convergence reflects morphological and behavioral adaptations to physical stress in streams. This study supports the existence of a habitat template for ecological strategies. Some key quantitative variables that define this habitat template can be identified by characterizing how individual organisms use their physical environment, and by using dimensionless physical variables that reveal common energetic properties in different systems. Overall, quantitative tests of community convergence are efficient tools to demonstrate that some community traits are predictable from environmental features.

  17. Break-induced replication and recombinational telomere elongation in yeast.

    PubMed

    McEachern, Michael J; Haber, James E

    2006-01-01

    When a telomere becomes unprotected or if only one end of a chromosomal double-strand break succeeds in recombining with a template sequence, DNA can be repaired by a recombination-dependent DNA replication process termed break-induced replication (BIR). In budding yeasts, there are two BIR pathways, one dependent on the Rad51 recombinase protein and one Rad51 independent; these two repair processes lead to different types of survivors in cells lacking the telomerase enzyme that is required for normal telomere maintenance. Recombination at telomeres is triggered by either excessive telomere shortening or disruptions in the function of telomere-binding proteins. Telomere elongation by BIR appears to often occur through a "roll and spread" mechanism. In this process, a telomeric circle produced by recombination at a dysfunctional telomere acts as a template for a rolling circle BIR event to form an elongated telomere. Additional BIR events can then copy the elongated sequence to all other telomeres.

  18. TALE proteins search DNA using a rotationally decoupled mechanism.

    PubMed

    Cuculis, Luke; Abil, Zhanar; Zhao, Huimin; Schroeder, Charles M

    2016-10-01

    Transcription activator-like effector (TALE) proteins are a class of programmable DNA-binding proteins used extensively for gene editing. Despite recent progress, however, little is known about their sequence search mechanism. Here, we use single-molecule experiments to study TALE search along DNA. Our results show that TALEs utilize a rotationally decoupled mechanism for nonspecific search, despite remaining associated with DNA templates during the search process. Our results suggest that the protein helical structure enables TALEs to adopt a loosely wrapped conformation around DNA templates during nonspecific search, facilitating rapid one-dimensional (1D) diffusion under a range of solution conditions. Furthermore, this model is consistent with a previously reported two-state mechanism for TALE search that allows these proteins to overcome the search speed-stability paradox. Taken together, our results suggest that TALE search is unique among the broad class of sequence-specific DNA-binding proteins and supports efficient 1D search along DNA.

  19. Detection of genetically modified organisms (GMOs) using isothermal amplification of target DNA sequences.

    PubMed

    Lee, David; La Mura, Maurizio; Allnutt, Theo R; Powell, Wayne

    2009-02-02

    The most common method of GMO detection is based upon the amplification of GMO-specific DNA amplicons using the polymerase chain reaction (PCR). Here we have applied the loop-mediated isothermal amplification (LAMP) method to amplify GMO-related DNA sequences, 'internal' commonly-used motifs for controlling transgene expression and event-specific (plant-transgene) junctions. We have tested the specificity and sensitivity of the technique for use in GMO studies. Results show that detection of 0.01% GMO in equivalent background DNA was possible and dilutions of template suggest that detection from single copies of the template may be possible using LAMP. This work shows that GMO detection can be carried out using LAMP for routine screening as well as for specific events detection. Moreover, the sensitivity and ability to amplify targets, even with a high background of DNA, here demonstrated, highlights the advantages of this isothermal amplification when applied for GMO detection.

  20. Challenges and opportunities for improving food quality and nutrition through plant biotechnology.

    PubMed

    Francis, David; Finer, John J; Grotewold, Erich

    2017-04-01

    Plant biotechnology has been around since the advent of humankind, resulting in tremendous improvements in plant cultivation through crop domestication, breeding and selection. The emergence of transgenic approaches involving the introduction of defined DNA sequences into plants by humans has rapidly changed the surface of our planet by further expanding the gene pool used by plant breeders for plant improvement. Transgenic approaches in food plants have raised concerns on the merits, social implications, ecological risks and true benefits of plant biotechnology. The recently acquired ability to precisely edit plant genomes by modifying native genes without introducing new genetic material offers new opportunities to rapidly exploit natural variation, create new variation and incorporate changes with the goal to generate more productive and nutritious plants. Copyright © 2016 Elsevier Ltd. All rights reserved.

  1. Identifying Genetic Signatures of Natural Selection Using Pooled Population Sequencing in Picea abies

    PubMed Central

    Chen, Jun; Källman, Thomas; Ma, Xiao-Fei; Zaina, Giusi; Morgante, Michele; Lascoux, Martin

    2016-01-01

    The joint inference of selection and past demography remain a costly and demanding task. We used next generation sequencing of two pools of 48 Norway spruce mother trees, one corresponding to the Fennoscandian domain, and the other to the Alpine domain, to assess nucleotide polymorphism at 88 nuclear genes. These genes are candidate genes for phenological traits, and most belong to the photoperiod pathway. Estimates of population genetic summary statistics from the pooled data are similar to previous estimates, suggesting that pooled sequencing is reliable. The nonsynonymous SNPs tended to have both lower frequency differences and lower FST values between the two domains than silent ones. These results suggest the presence of purifying selection. The divergence between the two domains based on synonymous changes was around 5 million yr, a time similar to a recent phylogenetic estimate of 6 million yr, but much larger than earlier estimates based on isozymes. Two approaches, one of them novel and that considers both FST and difference in allele frequencies between the two domains, were used to identify SNPs potentially under diversifying selection. SNPs from around 20 genes were detected, including genes previously identified as main target for selection, such as PaPRR3 and PaGI. PMID:27172202

  2. Identifying Genetic Signatures of Natural Selection Using Pooled Population Sequencing in Picea abies.

    PubMed

    Chen, Jun; Källman, Thomas; Ma, Xiao-Fei; Zaina, Giusi; Morgante, Michele; Lascoux, Martin

    2016-07-07

    The joint inference of selection and past demography remain a costly and demanding task. We used next generation sequencing of two pools of 48 Norway spruce mother trees, one corresponding to the Fennoscandian domain, and the other to the Alpine domain, to assess nucleotide polymorphism at 88 nuclear genes. These genes are candidate genes for phenological traits, and most belong to the photoperiod pathway. Estimates of population genetic summary statistics from the pooled data are similar to previous estimates, suggesting that pooled sequencing is reliable. The nonsynonymous SNPs tended to have both lower frequency differences and lower FST values between the two domains than silent ones. These results suggest the presence of purifying selection. The divergence between the two domains based on synonymous changes was around 5 million yr, a time similar to a recent phylogenetic estimate of 6 million yr, but much larger than earlier estimates based on isozymes. Two approaches, one of them novel and that considers both FST and difference in allele frequencies between the two domains, were used to identify SNPs potentially under diversifying selection. SNPs from around 20 genes were detected, including genes previously identified as main target for selection, such as PaPRR3 and PaGI. Copyright © 2016 Chen et al.

  3. Object recognition with hierarchical discriminant saliency networks.

    PubMed

    Han, Sunhyoung; Vasconcelos, Nuno

    2014-01-01

    The benefits of integrating attention and object recognition are investigated. While attention is frequently modeled as a pre-processor for recognition, we investigate the hypothesis that attention is an intrinsic component of recognition and vice-versa. This hypothesis is tested with a recognition model, the hierarchical discriminant saliency network (HDSN), whose layers are top-down saliency detectors, tuned for a visual class according to the principles of discriminant saliency. As a model of neural computation, the HDSN has two possible implementations. In a biologically plausible implementation, all layers comply with the standard neurophysiological model of visual cortex, with sub-layers of simple and complex units that implement a combination of filtering, divisive normalization, pooling, and non-linearities. In a convolutional neural network implementation, all layers are convolutional and implement a combination of filtering, rectification, and pooling. The rectification is performed with a parametric extension of the now popular rectified linear units (ReLUs), whose parameters can be tuned for the detection of target object classes. This enables a number of functional enhancements over neural network models that lack a connection to saliency, including optimal feature denoising mechanisms for recognition, modulation of saliency responses by the discriminant power of the underlying features, and the ability to detect both feature presence and absence. In either implementation, each layer has a precise statistical interpretation, and all parameters are tuned by statistical learning. Each saliency detection layer learns more discriminant saliency templates than its predecessors and higher layers have larger pooling fields. This enables the HDSN to simultaneously achieve high selectivity to target object classes and invariance. The performance of the network in saliency and object recognition tasks is compared to those of models from the biological and computer vision literatures. This demonstrates benefits for all the functional enhancements of the HDSN, the class tuning inherent to discriminant saliency, and saliency layers based on templates of increasing target selectivity and invariance. Altogether, these experiments suggest that there are non-trivial benefits in integrating attention and recognition.

  4. POOL server: machine learning application for functional site prediction in proteins.

    PubMed

    Somarowthu, Srinivas; Ondrechen, Mary Jo

    2012-08-01

    We present an automated web server for partial order optimum likelihood (POOL), a machine learning application that combines computed electrostatic and geometric information for high-performance prediction of catalytic residues from 3D structures. Input features consist of THEMATICS electrostatics data and pocket information from ConCavity. THEMATICS measures deviation from typical, sigmoidal titration behavior to identify functionally important residues and ConCavity identifies binding pockets by analyzing the surface geometry of protein structures. Both THEMATICS and ConCavity (structure only) do not require the query protein to have any sequence or structure similarity to other proteins. Hence, POOL is applicable to proteins with novel folds and engineered proteins. As an additional option for cases where sequence homologues are available, users can include evolutionary information from INTREPID for enhanced accuracy in site prediction. The web site is free and open to all users with no login requirements at http://www.pool.neu.edu. m.ondrechen@neu.edu Supplementary data are available at Bioinformatics online.

  5. Real-time single-molecule electronic DNA sequencing by synthesis using polymer-tagged nucleotides on a nanopore array

    PubMed Central

    Fuller, Carl W.; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Bibillo, Arek; Stranges, P. Benjamin; Dorwart, Michael; Tao, Chuanjuan; Li, Zengmin; Guo, Wenjing; Shi, Shundi; Korenblum, Daniel; Trans, Andrew; Aguirre, Anne; Liu, Edward; Harada, Eric T.; Pollard, James; Bhat, Ashwini; Cech, Cynthia; Yang, Alexander; Arnold, Cleoma; Palla, Mirkó; Hovis, Jennifer; Chen, Roger; Morozova, Irina; Kalachikov, Sergey; Russo, James J.; Kasianowicz, John J.; Davis, Randy; Roever, Stefan; Church, George M.; Ju, Jingyue

    2016-01-01

    DNA sequencing by synthesis (SBS) offers a robust platform to decipher nucleic acid sequences. Recently, we reported a single-molecule nanopore-based SBS strategy that accurately distinguishes four bases by electronically detecting and differentiating four different polymer tags attached to the 5′-phosphate of the nucleotides during their incorporation into a growing DNA strand catalyzed by DNA polymerase. Further developing this approach, we report here the use of nucleotides tagged at the terminal phosphate with oligonucleotide-based polymers to perform nanopore SBS on an α-hemolysin nanopore array platform. We designed and synthesized several polymer-tagged nucleotides using tags that produce different electrical current blockade levels and verified they are active substrates for DNA polymerase. A highly processive DNA polymerase was conjugated to the nanopore, and the conjugates were complexed with primer/template DNA and inserted into lipid bilayers over individually addressable electrodes of the nanopore chip. When an incoming complementary-tagged nucleotide forms a tight ternary complex with the primer/template and polymerase, the tag enters the pore, and the current blockade level is measured. The levels displayed by the four nucleotides tagged with four different polymers captured in the nanopore in such ternary complexes were clearly distinguishable and sequence-specific, enabling continuous sequence determination during the polymerase reaction. Thus, real-time single-molecule electronic DNA sequencing data with single-base resolution were obtained. The use of these polymer-tagged nucleotides, combined with polymerase tethering to nanopores and multiplexed nanopore sensors, should lead to new high-throughput sequencing methods. PMID:27091962

  6. Characterization of the mammalian miRNA turnover landscape

    PubMed Central

    Guo, Yanwen; Liu, Jun; Elfenbein, Sarah J.; Ma, Yinghong; Zhong, Mei; Qiu, Caihong; Ding, Ye; Lu, Jun

    2015-01-01

    Steady state cellular microRNA (miRNA) levels represent the balance between miRNA biogenesis and turnover. The kinetics and sequence determinants of mammalian miRNA turnover during and after miRNA maturation are not fully understood. Through a large-scale study on mammalian miRNA turnover, we report the co-existence of multiple cellular miRNA pools with distinct turnover kinetics and biogenesis properties and reveal previously unrecognized sequence features for fast turnover miRNAs. We measured miRNA turnover rates in eight mammalian cell types with a combination of expression profiling and deep sequencing. While most miRNAs are stable, a subset of miRNAs, mostly miRNA*s, turnovers quickly, many of which display a two-step turnover kinetics. Moreover, different sequence isoforms of the same miRNA can possess vastly different turnover rates. Fast turnover miRNA isoforms are enriched for 5′ nucleotide bias against Argonaute-(AGO)-loading, but also additional 3′ and central sequence features. Modeling based on two fast turnover miRNA*s miR-222-5p and miR-125b-1-3p, we unexpectedly found that while both miRNA*s are associated with AGO, they strongly differ in HSP90 association and sensitivity to HSP90 inhibition. Our data characterize the landscape of genome-wide miRNA turnover in cultured mammalian cells and reveal differential HSP90 requirements for different miRNA*s. Our findings also implicate rules for designing stable small RNAs, such as siRNAs. PMID:25653157

  7. High-throughput, pooled sequencing identifies mutations in NUBPL and FOXRED1 in human complex I deficiency

    PubMed Central

    Calvo, Sarah E; Tucker, Elena J; Compton, Alison G; Kirby, Denise M; Crawford, Gabriel; Burtt, Noel P; Rivas, Manuel A; Guiducci, Candace; Bruno, Damien L; Goldberger, Olga A; Redman, Michelle C; Wiltshire, Esko; Wilson, Callum J; Altshuler, David; Gabriel, Stacey B; Daly, Mark J; Thorburn, David R; Mootha, Vamsi K

    2010-01-01

    Discovering the molecular basis of mitochondrial respiratory chain disease is challenging given the large number of both mitochondrial and nuclear genes involved. We report a strategy of focused candidate gene prediction, high-throughput sequencing, and experimental validation to uncover the molecular basis of mitochondrial complex I (CI) disorders. We created five pools of DNA from a cohort of 103 patients and then performed deep sequencing of 103 candidate genes to spotlight 151 rare variants predicted to impact protein function. We used confirmatory experiments to establish genetic diagnoses in 22% of previously unsolved cases, and discovered that defects in NUBPL and FOXRED1 can cause CI deficiency. Our study illustrates how large-scale sequencing, coupled with functional prediction and experimental validation, can reveal novel disease-causing mutations in individual patients. PMID:20818383

  8. Novel genetic tools for studying food-borne Salmonella.

    PubMed

    Andrews-Polymenis, Helene L; Santiviago, Carlos A; McClelland, Michael

    2009-04-01

    Nontyphoidal Salmonellae are highly prevalent food-borne pathogens. High-throughput sequencing of Salmonella genomes is expanding our knowledge of the evolution of serovars and epidemic isolates. Genome sequences have also allowed the creation of complete microarrays. Microarrays have improved the throughput of in vivo expression technology (IVET) used to uncover promoters active during infection. In another method, signature tagged mutagenesis (STM), pools of mutants are subjected to selection. Changes in the population are monitored on a microarray, revealing genes under selection. Complete genome sequences permit the construction of pools of targeted in-frame deletions that have improved STM by minimizing the number of clones and the polarity of each mutant. Together, genome sequences and the continuing development of new tools for functional genomics will drive a revolution in the understanding of Salmonellae in many different niches that are critical for food safety.

  9. Cryptic Hepatitis B and E in Patients With Acute Hepatitis of Unknown Etiology.

    PubMed

    Ganova-Raeva, Lilia; Punkova, Lili; Campo, David S; Dimitrova, Zoya; Skums, Pavel; Vu, Nga H; Dat, Do T; Dalton, Harry R; Khudyakov, Yury

    2015-12-15

    Up to 30% of acute viral hepatitis has no known etiology. To determine the disease etiology in patients with acute hepatitis of unknown etiology (HUE), serum specimens were obtained from 38 patients residing in the United Kingdom and Vietnam and from 26 healthy US blood donors. All specimens tested negative for known viral infections causing hepatitis, using commercially available serological and nucleic acid assays. Specimens were processed by sequence-independent complementary DNA amplification and next-generation sequencing (NGS). Sufficient material for individual NGS libraries was obtained from 12 HUE cases and 26 blood donors; the remaining HUE cases were sequenced as a pool. Read mapping was done by targeted and de novo assembly. Sequences from hepatitis B virus (HBV) were detected in 7 individuals with HUE (58.3%) and the pooled library, and hepatitis E virus (HEV) was detected in 2 individuals with HUE (16.7%) and the pooled library. Both HEV-positive cases were coinfected with HBV. HBV sequences belonged to genotypes A, D, or G, and HEV sequences belonged to genotype 3. No known hepatotropic viruses were detected in the tested normal human sera. NGS-based detection of HBV and HEV infections is more sensitive than using commercially available assays. HBV and HEV may be cryptically associated with HUE. Published by Oxford University Press on behalf of the Infectious Diseases Society of America 2015. This work is written by (a) US Government employee(s) and is in the public domain in the US.

  10. Combinatorial Pooling Enables Selective Sequencing of the Barley Gene Space

    PubMed Central

    Lonardi, Stefano; Duma, Denisa; Alpert, Matthew; Cordero, Francesca; Beccuti, Marco; Bhat, Prasanna R.; Wu, Yonghui; Ciardo, Gianfranco; Alsaihati, Burair; Ma, Yaqin; Wanamaker, Steve; Resnik, Josh; Bozdag, Serdar; Luo, Ming-Cheng; Close, Timothy J.

    2013-01-01

    For the vast majority of species – including many economically or ecologically important organisms, progress in biological research is hampered due to the lack of a reference genome sequence. Despite recent advances in sequencing technologies, several factors still limit the availability of such a critical resource. At the same time, many research groups and international consortia have already produced BAC libraries and physical maps and now are in a position to proceed with the development of whole-genome sequences organized around a physical map anchored to a genetic map. We propose a BAC-by-BAC sequencing protocol that combines combinatorial pooling design and second-generation sequencing technology to efficiently approach denovo selective genome sequencing. We show that combinatorial pooling is a cost-effective and practical alternative to exhaustive DNA barcoding when preparing sequencing libraries for hundreds or thousands of DNA samples, such as in this case gene-bearing minimum-tiling-path BAC clones. The novelty of the protocol hinges on the computational ability to efficiently compare hundred millions of short reads and assign them to the correct BAC clones (deconvolution) so that the assembly can be carried out clone-by-clone. Experimental results on simulated data for the rice genome show that the deconvolution is very accurate, and the resulting BAC assemblies have high quality. Results on real data for a gene-rich subset of the barley genome confirm that the deconvolution is accurate and the BAC assemblies have good quality. While our method cannot provide the level of completeness that one would achieve with a comprehensive whole-genome sequencing project, we show that it is quite successful in reconstructing the gene sequences within BACs. In the case of plants such as barley, this level of sequence knowledge is sufficient to support critical end-point objectives such as map-based cloning and marker-assisted breeding. PMID:23592960

  11. Combinatorial pooling enables selective sequencing of the barley gene space.

    PubMed

    Lonardi, Stefano; Duma, Denisa; Alpert, Matthew; Cordero, Francesca; Beccuti, Marco; Bhat, Prasanna R; Wu, Yonghui; Ciardo, Gianfranco; Alsaihati, Burair; Ma, Yaqin; Wanamaker, Steve; Resnik, Josh; Bozdag, Serdar; Luo, Ming-Cheng; Close, Timothy J

    2013-04-01

    For the vast majority of species - including many economically or ecologically important organisms, progress in biological research is hampered due to the lack of a reference genome sequence. Despite recent advances in sequencing technologies, several factors still limit the availability of such a critical resource. At the same time, many research groups and international consortia have already produced BAC libraries and physical maps and now are in a position to proceed with the development of whole-genome sequences organized around a physical map anchored to a genetic map. We propose a BAC-by-BAC sequencing protocol that combines combinatorial pooling design and second-generation sequencing technology to efficiently approach denovo selective genome sequencing. We show that combinatorial pooling is a cost-effective and practical alternative to exhaustive DNA barcoding when preparing sequencing libraries for hundreds or thousands of DNA samples, such as in this case gene-bearing minimum-tiling-path BAC clones. The novelty of the protocol hinges on the computational ability to efficiently compare hundred millions of short reads and assign them to the correct BAC clones (deconvolution) so that the assembly can be carried out clone-by-clone. Experimental results on simulated data for the rice genome show that the deconvolution is very accurate, and the resulting BAC assemblies have high quality. Results on real data for a gene-rich subset of the barley genome confirm that the deconvolution is accurate and the BAC assemblies have good quality. While our method cannot provide the level of completeness that one would achieve with a comprehensive whole-genome sequencing project, we show that it is quite successful in reconstructing the gene sequences within BACs. In the case of plants such as barley, this level of sequence knowledge is sufficient to support critical end-point objectives such as map-based cloning and marker-assisted breeding.

  12. A graphene-based biosensing platform based on the release of DNA probes and rolling circle amplification.

    PubMed

    Liu, Meng; Song, Jinping; Shuang, Shaomin; Dong, Chuan; Brennan, John D; Li, Yingfu

    2014-06-24

    We report a versatile biosensing platform capable of achieving ultrasensitive detection of both small-molecule and macromolecular targets. The system features three components: reduced graphene oxide for its ability to adsorb single-stranded DNA molecules nonspecifically, DNA aptamers for their ability to bind reduced graphene oxide but undergo target-induced conformational changes that facilitate their release from the reduced graphene oxide surface, and rolling circle amplification (RCA) for its ability to amplify a primer-template recognition event into repetitive sequence units that can be easily detected. The key to the design is the tagging of a short primer to an aptamer sequence, which results in a small DNA probe that allows for both effective probe adsorption onto the reduced graphene oxide surface to mask the primer domain in the absence of the target, as well as efficient probe release in the presence of the target to make the primer available for template binding and RCA. We also made an observation that the circular template, which on its own does not cause a detectable level of probe release from the reduced graphene oxide, augments target-induced probe release. The synergistic release of DNA probes is interpreted to be a contributing factor for the high detection sensitivity. The broad utility of the platform is illustrated though engineering three different sensors that are capable of achieving ultrasensitive detection of a protein target, a DNA sequence and a small-molecule analyte. We envision that the approach described herein will find useful applications in the biological, medical, and environmental fields.

  13. Sequence-Specific DNA Photosplitting of Crosslinked DNAs Containing the 3-Cyanovinylcarbazole Nucleoside by Using DNA Strand Displacement.

    PubMed

    Nakamura, Shigetaka; Kawabata, Hayato; Fujimoto, Kenzo

    2016-08-17

    An oligodeoxynucleotide (ODN) containing the ultrafast reversible 3-cyanovinylcarbazole ((CNV) K) photo-crosslinker was photo-crosslinked to a complementary strand upon exposure to 366 nm irradiation and photosplit by use of 312 nm irradiation. In this paper we report that the photoreaction of (CNV) K on irradiation at 366 nm involves a photostationary state and that its reaction can be controlled by temperature. Guided by this new insight, we proposed and have now demonstrated previously unknown photosplitting of (CNV) K aided by DNA strand displacement as an alternative to heating. The photo-crosslinked double-stranded DNA (dsDNA) underwent >80 % photosplitting aided by DNA strand displacement on irradiation at 366 nm without heating. In this photosplitting based on DNA strand displacement, the relative thermal stability of the invader strand with respect to the template strands plays an important role, and an invader strand/template strand system that is more stable than the passenger strand/template strand system induces photosplitting without heating. This new strand-displacement-aided photosplitting occurred in a sequence-specific manner through irradiation at 366 nm in the presence of an invader strand. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  14. Vision-based measurement for rotational speed by improving Lucas-Kanade template tracking algorithm.

    PubMed

    Guo, Jie; Zhu, Chang'an; Lu, Siliang; Zhang, Dashan; Zhang, Chunyu

    2016-09-01

    Rotational angle and speed are important parameters for condition monitoring and fault diagnosis of rotating machineries, and their measurement is useful in precision machining and early warning of faults. In this study, a novel vision-based measurement algorithm is proposed to complete this task. A high-speed camera is first used to capture the video of the rotational object. To extract the rotational angle, the template-based Lucas-Kanade algorithm is introduced to complete motion tracking by aligning the template image in the video sequence. Given the special case of nonplanar surface of the cylinder object, a nonlinear transformation is designed for modeling the rotation tracking. In spite of the unconventional and complex form, the transformation can realize angle extraction concisely with only one parameter. A simulation is then conducted to verify the tracking effect, and a practical tracking strategy is further proposed to track consecutively the video sequence. Based on the proposed algorithm, instantaneous rotational speed (IRS) can be measured accurately and efficiently. Finally, the effectiveness of the proposed algorithm is verified on a brushless direct current motor test rig through the comparison with results obtained by the microphone. Experimental results demonstrate that the proposed algorithm can extract accurately rotational angles and can measure IRS with the advantage of noncontact and effectiveness.

  15. Silicifying Biofilm Exopolymers on a Hot-Spring Microstromatolite: Templating Nanometer-Thick Laminae

    NASA Astrophysics Data System (ADS)

    Handley, Kim M.; Turner, Sue J.; Campbell, Kathleen A.; Mountain, Bruce W.

    2008-08-01

    Exopolymeric substances (EPS) are an integral component of microbial biofilms; however, few studies have addressed their silicification and preservation in hot-spring deposits. Through comparative analyses with the use of a range of microscopy techniques, we identified abundant EPS significant to the textural development of spicular, microstromatolitic, siliceous sinter at Champagne Pool, Waiotapu, New Zealand. Examination of biofilms coating sinter surfaces by confocal laser scanning microscopy (CLSM), environmental scanning electron microscopy (ESEM), cryo-scanning electron microscopy (cryo-SEM), and transmission electron microscopy (TEM) revealed contraction of the gelatinous EPS matrix into films (approximately 10 nm thick) or fibrillar structures, which is common in conventional SEM analyses and analogous to products of naturally occurring desiccation. Silicification of fibrillar EPS contributed to the formation of filamentous sinter. Matrix surfaces or dehydrated films templated sinter laminae (nanometers to microns thick) that, in places, preserved fenestral voids beneath. Laminae of similar thickness are, in general, common to spicular geyserites. This is the first report to demonstrate EPS templation of siliceous stromatolite laminae. Considering the ubiquity of biofilms on surfaces in hot-spring environments, EPS silicification studies are likely to be important to a better understanding of the origins of laminae in other modern and ancient stromatolitic sinters, and EPS potentially may serve as biosignatures in extraterrestrial rocks.

  16. Computational analysis of stochastic heterogeneity in PCR amplification efficiency revealed by single molecule barcoding

    PubMed Central

    Best, Katharine; Oakes, Theres; Heather, James M.; Shawe-Taylor, John; Chain, Benny

    2015-01-01

    The polymerase chain reaction (PCR) is one of the most widely used techniques in molecular biology. In combination with High Throughput Sequencing (HTS), PCR is widely used to quantify transcript abundance for RNA-seq, and in the context of analysis of T and B cell receptor repertoires. In this study, we combine DNA barcoding with HTS to quantify PCR output from individual target molecules. We develop computational tools that simulate both the PCR branching process itself, and the subsequent subsampling which typically occurs during HTS sequencing. We explore the influence of different types of heterogeneity on sequencing output, and compare them to experimental results where the efficiency of amplification is measured by barcodes uniquely identifying each molecule of starting template. Our results demonstrate that the PCR process introduces substantial amplification heterogeneity, independent of primer sequence and bulk experimental conditions. This heterogeneity can be attributed both to inherited differences between different template DNA molecules, and the inherent stochasticity of the PCR process. The results demonstrate that PCR heterogeneity arises even when reaction and substrate conditions are kept as constant as possible, and therefore single molecule barcoding is essential in order to derive reproducible quantitative results from any protocol combining PCR with HTS. PMID:26459131

  17. Synthetic Molecular Evolution of Membrane-Active Peptides

    NASA Astrophysics Data System (ADS)

    Wimley, William

    The physical chemistry of membrane partitioning largely determines the function of membrane active peptides. Membrane-active peptides have potential utility in many areas, including in the cellular delivery of polar compounds, cancer therapy, biosensor design, and in antibacterial, antiviral and antifungal therapies. Yet, despite decades of research on thousands of known examples, useful sequence-structure-function relationships are essentially unknown. Because peptide-membrane interactions within the highly fluid bilayer are dynamic and heterogeneous, accounts of mechanism are necessarily vague and descriptive, and have little predictive power. This creates a significant roadblock to advances in the field. We are bypassing that roadblock with synthetic molecular evolution: iterative peptide library design and orthogonal high-throughput screening. We start with template sequences that have at least some useful activity, and create small, focused libraries using structural and biophysical principles to design the sequence space around the template. Orthogonal high-throughput screening is used to identify gain-of-function peptides by simultaneously selecting for several different properties (e.g. solubility, activity and toxicity). Multiple generations of iterative library design and screening have enabled the identification of membrane-active sequences with heretofore unknown properties, including clinically relevant, broad-spectrum activity against drug-resistant bacteria and enveloped viruses as well as pH-triggered macromolecular poration.

  18. Predicted cycloartenol synthase protein from Kandelia obovata and Rhizophora stylosa using online software of Phyre2 and Swiss-model

    NASA Astrophysics Data System (ADS)

    Basyuni, M.; Sulistiyono, N.; Wati, R.; Sumardi; Oku, H.; Baba, S.; Sagami, H.

    2018-03-01

    Cloning of Kandelia obovata KcCAS gene (previously known as Kandelia candel) and Rhizophora stylosa RsCAS have already have been reported and encoded cycloartenol synthases. In this study, the predicted KcCAS and RsCAS protein were analyzed using online software of Phyre2 and Swiss-model. The protein modelling for KcCAS and RsCAS cycloartenol synthases was determined using Pyre2 had similar results with slightly different in sequence identity. By contrast, the Swiss-model for KcCAS slightly had higher sequence identity (47.31%) and Qmean (0.70) compared to RsCAS. No difference of ligands binding site which is considered as modulators for both cycloartenol synthases. The range of predicted protein derived from 91-757 amino acid residues with coverage sequence similarities 0.86, respectively from template model of lanosterol synthase from the human. Homology modelling revealed that 706 residues (93% of the amino acid sequence) had been modelled with 100.0% confidence by the single highest scoring template for both KcCAS and RsCAS using Phyre2. This coverage was more elevated than swiss-model predicted (86%). The present study suggested that both genes are responsible for the genesis of cycloartenol in these mangrove plants.

  19. SvABA: genome-wide detection of structural variants and indels by local assembly.

    PubMed

    Wala, Jeremiah A; Bandopadhayay, Pratiti; Greenwald, Noah F; O'Rourke, Ryan; Sharpe, Ted; Stewart, Chip; Schumacher, Steve; Li, Yilong; Weischenfeldt, Joachim; Yao, Xiaotong; Nusbaum, Chad; Campbell, Peter; Getz, Gad; Meyerson, Matthew; Zhang, Cheng-Zhong; Imielinski, Marcin; Beroukhim, Rameen

    2018-04-01

    Structural variants (SVs), including small insertion and deletion variants (indels), are challenging to detect through standard alignment-based variant calling methods. Sequence assembly offers a powerful approach to identifying SVs, but is difficult to apply at scale genome-wide for SV detection due to its computational complexity and the difficulty of extracting SVs from assembly contigs. We describe SvABA, an efficient and accurate method for detecting SVs from short-read sequencing data using genome-wide local assembly with low memory and computing requirements. We evaluated SvABA's performance on the NA12878 human genome and in simulated and real cancer genomes. SvABA demonstrates superior sensitivity and specificity across a large spectrum of SVs and substantially improves detection performance for variants in the 20-300 bp range, compared with existing methods. SvABA also identifies complex somatic rearrangements with chains of short (<1000 bp) templated-sequence insertions copied from distant genomic regions. We applied SvABA to 344 cancer genomes from 11 cancer types and found that short templated-sequence insertions occur in ∼4% of all somatic rearrangements. Finally, we demonstrate that SvABA can identify sites of viral integration and cancer driver alterations containing medium-sized (50-300 bp) SVs. © 2018 Wala et al.; Published by Cold Spring Harbor Laboratory Press.

  20. Aplysia attractin: biophysical characterization and modeling of a water-borne pheromone.

    PubMed Central

    Schein, C H; Nagle, G T; Page, J S; Sweedler, J V; Xu, Y; Painter, S D; Braun, W

    2001-01-01

    Attractin, a 58-residue protein secreted by the mollusk Aplysia californica, stimulates sexually mature animals to approach egg cordons. Attractin from five different Aplysia species are approximately 40% identical in sequence. Recombinant attractin, expressed in insect cells and purified by reverse-phase high-performance liquid chromatography (RP-HPLC), is active in a bioassay using A. brasiliana; its circular dichroism (CD) spectrum indicates a predominantly alpha-helical structure. Matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS) characterization of proteolytic fragments identified disulfide bonds between the six conserved cysteines (I-VI, II-V, III-IV, where the Roman numeral indicates the order of occurrence in the primary sequence). Attractin has no significant similarity to any other sequence in the database. The protozoan Euplotes pheromones were selected by fold recognition as possible templates. These diverse proteins have three alpha-helices, with six cysteine residues disulfide-bonded in a different pattern from attractin. Model structures with good stereochemical parameters were prepared using the EXDIS/DIAMOD/FANTOM program suite and constraints based on sequence alignments with the Euplotes templates and the attractin disulfide bonds. A potential receptor-binding site is suggested based on these data. Future structural characterization of attractin will be needed to confirm these models. PMID:11423429

  1. Structural behavior and dynamics of an anomalous fluid between attractive and repulsive walls: templating, molding, and superdiffusion.

    PubMed

    Leoni, Fabio; Franzese, Giancarlo

    2014-11-07

    Confinement can modify the dynamics, the thermodynamics, and the structural properties of liquid water, the prototypical anomalous liquid. By considering a generic model for anomalous liquids, suitable for describing solutions of globular proteins, colloids, or liquid metals, we study by molecular dynamics simulations the effect that an attractive wall with structure and a repulsive wall without structure have on the phases, the crystal nucleation, and the dynamics of the fluid. We find that at low temperatures the large density of the attractive wall induces a high-density, high-energy structure in the first layer ("templating" effect). In turn, the first layer induces a "molding" effect on the second layer determining a structure with reduced energy and density, closer to the average density of the system. This low-density, low-energy structure propagates further through the layers by templating effect and can involve all the existing layers at the lowest temperatures investigated. Therefore, although the high-density, high-energy structure does not self-reproduce further than the first layer, the structured wall can have a long-range influence thanks to a sequence of templating, molding, and templating effects through the layers. We find that the walls also have an influence on the dynamics of the liquid, with a stronger effect near the attractive wall. In particular, we observe that the dynamics is largely heterogeneous (i) among the layers, as a consequence of the sequence of structures caused by the walls presence, and (ii) within the same layer, due to superdiffusive liquid veins within a frozen matrix of particles near the walls at low temperature and high density. Hence, the partial freezing of the first layer does not correspond necessarily to an effective reduction of the channel's section in terms of transport properties, as suggested by other authors.

  2. An ensemble approach to protein fold classification by integration of template-based assignment and support vector machine classifier.

    PubMed

    Xia, Jiaqi; Peng, Zhenling; Qi, Dawei; Mu, Hongbo; Yang, Jianyi

    2017-03-15

    Protein fold classification is a critical step in protein structure prediction. There are two possible ways to classify protein folds. One is through template-based fold assignment and the other is ab-initio prediction using machine learning algorithms. Combination of both solutions to improve the prediction accuracy was never explored before. We developed two algorithms, HH-fold and SVM-fold for protein fold classification. HH-fold is a template-based fold assignment algorithm using the HHsearch program. SVM-fold is a support vector machine-based ab-initio classification algorithm, in which a comprehensive set of features are extracted from three complementary sequence profiles. These two algorithms are then combined, resulting to the ensemble approach TA-fold. We performed a comprehensive assessment for the proposed methods by comparing with ab-initio methods and template-based threading methods on six benchmark datasets. An accuracy of 0.799 was achieved by TA-fold on the DD dataset that consists of proteins from 27 folds. This represents improvement of 5.4-11.7% over ab-initio methods. After updating this dataset to include more proteins in the same folds, the accuracy increased to 0.971. In addition, TA-fold achieved >0.9 accuracy on a large dataset consisting of 6451 proteins from 184 folds. Experiments on the LE dataset show that TA-fold consistently outperforms other threading methods at the family, superfamily and fold levels. The success of TA-fold is attributed to the combination of template-based fold assignment and ab-initio classification using features from complementary sequence profiles that contain rich evolution information. http://yanglab.nankai.edu.cn/TA-fold/. yangjy@nankai.edu.cn or mhb-506@163.com. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  3. Rapid earthquake detection through GPU-Based template matching

    NASA Astrophysics Data System (ADS)

    Mu, Dawei; Lee, En-Jui; Chen, Po

    2017-12-01

    The template-matching algorithm (TMA) has been widely adopted for improving the reliability of earthquake detection. The TMA is based on calculating the normalized cross-correlation coefficient (NCC) between a collection of selected template waveforms and the continuous waveform recordings of seismic instruments. In realistic applications, the computational cost of the TMA is much higher than that of traditional techniques. In this study, we provide an analysis of the TMA and show how the GPU architecture provides an almost ideal environment for accelerating the TMA and NCC-based pattern recognition algorithms in general. So far, our best-performing GPU code has achieved a speedup factor of more than 800 with respect to a common sequential CPU code. We demonstrate the performance of our GPU code using seismic waveform recordings from the ML 6.6 Meinong earthquake sequence in Taiwan.

  4. USING LINKED MICROMAP PLOTS TO CHARACTERIZE OMERNIK ECOREGIONS

    EPA Science Inventory

    The paper introduces linked micromap (LM plots for presenting environmental summaries. The LM template includes parallel sequences of micromap, able, and statistical summary graphics panels with attention paid to perceptual grouping, sorting and linking of the summary components...

  5. Pyrosequencing as a tool for the identification of common isolates of Mycobacterium sp.

    PubMed

    Tuohy, Marion J; Hall, Gerri S; Sholtis, Mary; Procop, Gary W

    2005-04-01

    Pyrosequencing technology, sequencing by addition, was evaluated for categorization of mycobacterial isolates. One hundred and eighty-nine isolates, including 18 ATCC and Trudeau Mycobacterial Culture Collection (TMC) strains, were studied. There were 38 Mycobacterium tuberculosis complex, 27 M. kansasii, 27 MAI complex, 21 M. marinum, 14 M. gordonae, 20 M. chelonae-abscessus group, 10 M. fortuitum, 5 M. xenopi, 3 M. celatum, 2 M. terrae complex, 20 M. mucogenicum, and 2 M. scrofulaceum. Nucleic acid extracts were prepared from solid media or MGIT broth. Traditional PCR was performed with one of the primers biotinylated; the assay targeted a portion of the 16S rRNA gene that contains a hypervariable region, which has been previously shown to be useful for the identification of mycobacteria. The PSQ Sample Preparation Kit was used, and the biotinylated PCR product was processed to a single-stranded DNA template. The sequencing primer was hybridized to the DNA template in a PSQ96 plate. Incorporation of the complementary nucleotides resulted in light generation peaks, forming a pyrogram, which was evaluated by the instrument software. Thirty basepairs were used for isolate categorization. Manual interpretation of the sequences was performed if the quality of the 30-bp sequence was in doubt or if more than 4 bp homopolymers were recognized. Sequences with more than 5 bp of bad quality were deemed unacceptable. When blasted against GenBank, 179 of 189 sequences (94.7%) assigned isolates to the correct molecular genus or group. Ten M. gordonae isolates had more than 5 bp of bad quality sequence and were not accepted. Pyrosequencing of this hypervariable region afforded rapid and acceptable characterization of common, routinely isolated clinical Mycobacterium sp. Algorithms are recommended for further differentiation with an additional sequencing primer or additional biochemicals.

  6. Rat L (long interspersed repeated DNA) elements contain guanine-rich homopurine sequences that induce unpairing of contiguous duplex DNA.

    PubMed Central

    Usdin, K; Furano, A V

    1988-01-01

    The L family (long interspersed repeated DNA) of mobile genetic elements is a persistent feature of the mammalian genome. In rats, this family contains approximately equal to 40,000 members and accounts for approximately equal to 10% of the haploid genome. We demonstrate here that the guanine-rich homopurine stretches located at the right end of L-DNA induce oligonucleotide uptake by contiguous duplex DNA. The uptake is dependent on negative supercoiling and the length of the homopurine stretch and occurs even when the L-DNA homopurine stretches are introduced into a different DNA environment. The bound oligomer primes DNA synthesis when DNA polymerase and deoxyribonucleoside triphosphates are added, resulting in a faithful copy of the template to which the oligonucleotide had bound. The implications of this property of the L-DNA guanine-rich homopurine stretches in the amplification, recombination, and dispersal of L elements is discussed. Images PMID:2837766

  7. FANCJ suppresses microsatellite instability and lymphomagenesis independent of the Fanconi anemia pathway.

    PubMed

    Matsuzaki, Kenichiro; Borel, Valerie; Adelman, Carrie A; Schindler, Detlev; Boulton, Simon J

    2015-12-15

    Microsatellites are short tandem repeat sequences that are highly prone to expansion/contraction due to their propensity to form non-B-form DNA structures, which hinder DNA polymerases and provoke template slippage. Although error correction by mismatch repair plays a key role in preventing microsatellite instability (MSI), which is a hallmark of Lynch syndrome, activities must also exist that unwind secondary structures to facilitate replication fidelity. Here, we report that Fancj helicase-deficient mice, while phenotypically resembling Fanconi anemia (FA), are also hypersensitive to replication inhibitors and predisposed to lymphoma. Whereas metabolism of G4-DNA structures is largely unaffected in Fancj(-/-) mice, high levels of spontaneous MSI occur, which is exacerbated by replication inhibition. In contrast, MSI is not observed in Fancd2(-/-) mice but is prevalent in human FA-J patients. Together, these data implicate FANCJ as a key factor required to counteract MSI, which is functionally distinct from its role in the FA pathway. © 2015 Matsuzaki et al.; Published by Cold Spring Harbor Laboratory Press.

  8. Mechanism of Microhomology-Mediated End-Joining Promoted by Human DNA Polymerase Theta

    PubMed Central

    Kent, Tatiana; Chandramouly, Gurushankar; McDevitt, Shane Michael; Ozdemir, Ahmet Y.; Pomerantz, Richard T.

    2014-01-01

    Microhomology-mediated end-joining (MMEJ) is an error-prone alternative double-strand break repair pathway that utilizes sequence microhomology to recombine broken DNA. Although MMEJ is implicated in cancer development, the mechanism of this pathway is unknown. We demonstrate that purified human DNA polymerase θ (Polθ) performs MMEJ of DNA containing 3’ single-strand DNA overhangs with two or more base-pairs of homology, including DNA modeled after telomeres, and show that MMEJ is dependent on Polθ in human cells. Our data support a mechanism whereby Polθ facilitates end-joining and microhomology annealing then utilizes the opposing overhang as a template in trans which stabilizes the DNA synapse. Polθ exhibits a preference for DNA containing a 5’-terminal phosphate, similar to polymerases involved in non-homologous end-joining. Lastly, we identify a conserved loop domain that is essential for MMEJ and higher-order structures of Polθ which likely promote DNA synapse formation. PMID:25643323

  9. Tax credits and purchasing pools: will this marriage work?

    PubMed

    Trude, S; Ginsburg, P B

    2001-04-01

    Bipartisan interest is growing in Congress for using federal tax credits to help low-income families buy health insurance. Regardless of the approach taken, tax credit policies must address risk selection issues to ensure coverage for the chronically ill. Proposals that link tax credits to purchasing pools would avoid risk selection by grouping risks similar to the way large employers do. Voluntary purchasing pools have had only limited success, however. This Issue Brief discusses linking tax credits to purchasing pools. It uses information from the Center for Studying Health System Change's (HSC) site visits to 12 communities as well as other research to assess the role of purchasing pools nationwide and the key issues and implications of linking tax credits and pools.

  10. Molecular Detection of Tick-Borne Pathogen Diversities in Ticks from Livestock and Reptiles along the Shores and Adjacent Islands of Lake Victoria and Lake Baringo, Kenya.

    PubMed

    Omondi, David; Masiga, Daniel K; Fielding, Burtram C; Kariuki, Edward; Ajamma, Yvonne Ukamaka; Mwamuye, Micky M; Ouso, Daniel O; Villinger, Jandouwe

    2017-01-01

    Although diverse tick-borne pathogens (TBPs) are endemic to East Africa, with recognized impact on human and livestock health, their diversity and specific interactions with tick and vertebrate host species remain poorly understood in the region. In particular, the role of reptiles in TBP epidemiology remains unknown, despite having been implicated with TBPs of livestock among exported tortoises and lizards. Understanding TBP ecologies, and the potential role of common reptiles, is critical for the development of targeted transmission control strategies for these neglected tropical disease agents. During the wet months (April-May; October-December) of 2012-2013, we surveyed TBP diversity among 4,126 ticks parasitizing livestock and reptiles at homesteads along the shores and islands of Lake Baringo and Lake Victoria in Kenya, regions endemic to diverse neglected tick-borne diseases. After morphological identification of 13 distinct Rhipicephalus, Amblyomma , and Hyalomma tick species, ticks were pooled (≤8 individuals) by species, host, sampling site, and collection date into 585 tick pools. By supplementing previously established molecular assays for TBP detection with high-resolution melting analysis of PCR products before sequencing, we identified high frequencies of potential disease agents of ehrlichiosis (12.48% Ehrlichia ruminantium , 9.06% Ehrlichia canis ), anaplasmosis (6.32% Anaplasma ovis , 14.36% Anaplasma platys , and 3.08% Anaplasma bovis ,), and rickettsiosis (6.15% Rickettsia africae , 2.22% Rickettsia aeschlimannii , 4.27% Rickettsia rhipicephali , and 4.95% Rickettsia spp.), as well as Paracoccus sp. and apicomplexan hemoparasites (0.51% Theileria sp., 2.56% Hepatozoon fitzsimonsi , and 1.37% Babesia caballi ) among tick pools. Notably, we identified E. ruminantium in both Amblyomma and Rhipicephalus pools of ticks sampled from livestock in both study areas as well as in Amblyomma falsomarmoreum (66.7%) and Amblyomma nuttalli (100%) sampled from tortoises and Amblyomma sparsum (63.6%) sampled in both cattle and tortoises at Lake Baringo. Similarly, we identified E. canis in rhipicephaline ticks sampled from livestock and dogs in both regions and Amblyomma latum (75%) sampled from monitor lizards at Lake Victoria. These novel tick-host-pathogen interactions have implications on the risk of disease transmission to humans and domestic animals and highlight the complexity of TBP ecologies, which may include reptiles as reservoir species, in sub-Saharan Africa.

  11. Molecular Detection of Tick-Borne Pathogen Diversities in Ticks from Livestock and Reptiles along the Shores and Adjacent Islands of Lake Victoria and Lake Baringo, Kenya

    PubMed Central

    Omondi, David; Masiga, Daniel K.; Fielding, Burtram C.; Kariuki, Edward; Ajamma, Yvonne Ukamaka; Mwamuye, Micky M.; Ouso, Daniel O.; Villinger, Jandouwe

    2017-01-01

    Although diverse tick-borne pathogens (TBPs) are endemic to East Africa, with recognized impact on human and livestock health, their diversity and specific interactions with tick and vertebrate host species remain poorly understood in the region. In particular, the role of reptiles in TBP epidemiology remains unknown, despite having been implicated with TBPs of livestock among exported tortoises and lizards. Understanding TBP ecologies, and the potential role of common reptiles, is critical for the development of targeted transmission control strategies for these neglected tropical disease agents. During the wet months (April–May; October–December) of 2012–2013, we surveyed TBP diversity among 4,126 ticks parasitizing livestock and reptiles at homesteads along the shores and islands of Lake Baringo and Lake Victoria in Kenya, regions endemic to diverse neglected tick-borne diseases. After morphological identification of 13 distinct Rhipicephalus, Amblyomma, and Hyalomma tick species, ticks were pooled (≤8 individuals) by species, host, sampling site, and collection date into 585 tick pools. By supplementing previously established molecular assays for TBP detection with high-resolution melting analysis of PCR products before sequencing, we identified high frequencies of potential disease agents of ehrlichiosis (12.48% Ehrlichia ruminantium, 9.06% Ehrlichia canis), anaplasmosis (6.32% Anaplasma ovis, 14.36% Anaplasma platys, and 3.08% Anaplasma bovis,), and rickettsiosis (6.15% Rickettsia africae, 2.22% Rickettsia aeschlimannii, 4.27% Rickettsia rhipicephali, and 4.95% Rickettsia spp.), as well as Paracoccus sp. and apicomplexan hemoparasites (0.51% Theileria sp., 2.56% Hepatozoon fitzsimonsi, and 1.37% Babesia caballi) among tick pools. Notably, we identified E. ruminantium in both Amblyomma and Rhipicephalus pools of ticks sampled from livestock in both study areas as well as in Amblyomma falsomarmoreum (66.7%) and Amblyomma nuttalli (100%) sampled from tortoises and Amblyomma sparsum (63.6%) sampled in both cattle and tortoises at Lake Baringo. Similarly, we identified E. canis in rhipicephaline ticks sampled from livestock and dogs in both regions and Amblyomma latum (75%) sampled from monitor lizards at Lake Victoria. These novel tick–host–pathogen interactions have implications on the risk of disease transmission to humans and domestic animals and highlight the complexity of TBP ecologies, which may include reptiles as reservoir species, in sub-Saharan Africa. PMID:28620610

  12. A new surveillance and response tool: risk map of infected Oncomelania hupensis detected by Loop-mediated isothermal amplification (LAMP) from pooled samples.

    PubMed

    Tong, Qun-Bo; Chen, Rui; Zhang, Yi; Yang, Guo-Jing; Kumagai, Takashi; Furushima-Shimogawara, Rieko; Lou, Di; Yang, Kun; Wen, Li-Yong; Lu, Shao-Hong; Ohta, Nobuo; Zhou, Xiao-Nong

    2015-01-01

    Although schistosomiasis remains a serious health problem worldwide, significant achievements in schistosomiasis control has been made in the People's Republic of China. The disease has been eliminated in five out of 12 endemic provinces, and the prevalence in remaining endemic areas is very low and is heading toward elimination. A rapid and sensitive method for monitoring the distribution of infected Oncomelania hupensis is urgently required. We applied a loop-mediated isothermal amplification (LAMP) assay targeting 28S rDNA for the rapid and effective detection of Schistosoma japonicum DNA in infected and prepatent infected O. hupensis snails. The detection limit of the LAMP method was 100 fg of S. japonicum genomic DNA. To promote the application of the approach in the field, the LAMP assay was used to detect infection in pooled samples of field-collected snails. In the pooled sample detection, snails were collected from 28 endemic areas, and 50 snails from each area were pooled based on the maximum pool size estimation, crushed together and DNA was extracted from each pooled sample as template for the LAMP assay. Based on the formula for detection from pooled samples, the proportion of positive pooled samples and the positive proportion of O. hupensis detected by LAMP of Xima village reached 66.67% and 1.33%, while those of Heini, Hongjia, Yangjiang and Huangshan villages were 33.33% and 0.67%, and those of Tuanzhou and Suliao villages were 16.67% and 0.33%, respectively. The remaining 21 monitoring field sites gave negative results. A risk map for the transmission of schistosomiasis was constructed using ArcMap, based on the positive proportion of O. hupensis infected with S. japonicum, as detected by the LAMP assay, which will form a guide for surveillance and response strategies in high risk areas. Copyright © 2014 Elsevier B.V. All rights reserved.

  13. Rolling-circle amplification under topological constraints

    PubMed Central

    Kuhn, Heiko; Demidov, Vadim V.; Frank-Kamenetskii, Maxim D.

    2002-01-01

    We have performed rolling-circle amplification (RCA) reactions on three DNA templates that differ distinctly in their topology: an unlinked DNA circle, a linked DNA circle within a pseudorotaxane-type structure and a linked DNA circle within a catenane. In the linked templates, the single-stranded circle (dubbed earring probe) is threaded, with the aid of two peptide nucleic acid openers, between the two strands of double-stranded DNA (dsDNA). We have found that the RCA efficiency of amplification was essentially unaffected when the linked templates were employed. By showing that the DNA catenane remains intact after RCA reactions, we prove that certain DNA polymerases can carry out the replicative synthesis under topological constraints allowing detection of several hundred copies of a dsDNA marker without DNA denaturation. Our finding may have practical implications in the area of DNA diagnostics. PMID:11788721

  14. Nucleotide Sequence Analysis of RNA Synthesized from Rabbit Globin Complementary DNA

    PubMed Central

    Poon, Raymond; Paddock, Gary V.; Heindell, Howard; Whitcome, Philip; Salser, Winston; Kacian, Dan; Bank, Arthur; Gambino, Roberto; Ramirez, Francesco

    1974-01-01

    Rabbit globin complementary DNA made with RNA-dependent DNA polymerase (reverse transcriptase) was used as template for in vitro synthesis of 32P-labeled RNA. The sequences of the nucleotides in most of the fragments resulting from combined ribonuclease T1 and alkaline phosphatase digestion have been determined. Several fragments were long enough to fit uniquely with the α or β globin amino-acid sequences. These data demonstrate that the cDNA was copied from globin mRNA and contained no detectable contaminants. Images PMID:4139714

  15. Evaluation of accuracy in implant site preparation performed in single- or multi-step drilling procedures.

    PubMed

    Marheineke, Nadine; Scherer, Uta; Rücker, Martin; von See, Constantin; Rahlf, Björn; Gellrich, Nils-Claudius; Stoetzer, Marcus

    2018-06-01

    Dental implant failure and insufficient osseointegration are proven results of mechanical and thermal damage during the surgery process. We herein performed a comparative study of a less invasive single-step drilling preparation protocol and a conventional multiple drilling sequence. Accuracy of drilling holes was precisely analyzed and the influence of different levels of expertise of the handlers and additional use of drill template guidance was evaluated. Six experimental groups, deployed in an osseous study model, were representing template-guided and freehanded drilling actions in a stepwise drilling procedure in comparison to a single-drill protocol. Each experimental condition was studied by the drilling actions of respectively three persons without surgical knowledge as well as three highly experienced oral surgeons. Drilling actions were performed and diameters were recorded with a precision measuring instrument. Less experienced operators were able to significantly increase the drilling accuracy using a guiding template, especially when multi-step preparations are performed. Improved accuracy without template guidance was observed when experienced operators were executing single-step versus multi-step technique. Single-step drilling protocols have shown to produce more accurate results than multi-step procedures. The outcome of any protocol can be further improved by use of guiding templates. Operator experience can be a contributing factor. Single-step preparations are less invasive and are promoting osseointegration. Even highly experienced surgeons are achieving higher levels of accuracy by combining this technique with template guidance. Hereby template guidance enables a reduction of hands-on time and side effects during surgery and lead to a more predictable clinical diameter.

  16. Systematic analysis of enzymatic DNA polymerization using oligo-DNA templates and triphosphate analogs involving 2',4'-bridged nucleosides.

    PubMed

    Kuwahara, Masayasu; Obika, Satoshi; Nagashima, Jun-ichi; Ohta, Yuki; Suto, Yoshiyuki; Ozaki, Hiroaki; Sawai, Hiroaki; Imanishi, Takeshi

    2008-08-01

    In order to systematically analyze the effects of nucleoside modification of sugar moieties in DNA polymerase reactions, we synthesized 16 modified templates containing 2',4'-bridged nucleotides and three types of 2',4'-bridged nucleoside-5'-triphospates with different bridging structures. Among the five types of thermostable DNA polymerases used, Taq, Phusion HF, Vent(exo-), KOD Dash and KOD(exo-), the KOD Dash and KOD(exo-) DNA polymerases could smoothly read through the modified templates containing 2'-O,4'-C-methylene-linked nucleotides at intervals of a few nucleotides, even at standard enzyme concentrations for 5 min. Although the Vent(exo-) DNA polymerase also read through these modified templates, kinetic study indicates that the KOD(exo-) DNA polymerase was found to be far superior to the Vent(exo-) DNA polymerase in accurate incorporation of nucleotides. When either of the DNA polymerase was used, the presence of 2',4'-bridged nucleotides on a template strand substantially decreased the reaction rates of nucleotide incorporations. The modified templates containing sequences of seven successive 2',4'-bridged nucleotides could not be completely transcribed by any of the DNA polymerases used; yields of longer elongated products decreased in the order of steric bulkiness of the modified sugars. Successive incorporation of 2',4'-bridged nucleotides into extending strands using 2',4'-bridged nucleoside-5'-triphospates was much more difficult. These data indicate that the sugar modification would have a greater effect on the polymerase reaction when it is adjacent to the elongation terminus than when it is on the template as well, as in base modification.

  17. A genetically anchored physical framework for Theobroma cacao cv. Matina 1-6

    PubMed Central

    2011-01-01

    Background The fermented dried seeds of Theobroma cacao (cacao tree) are the main ingredient in chocolate. World cocoa production was estimated to be 3 million tons in 2010 with an annual estimated average growth rate of 2.2%. The cacao bean production industry is currently under threat from a rise in fungal diseases including black pod, frosty pod, and witches' broom. In order to address these issues, genome-sequencing efforts have been initiated recently to facilitate identification of genetic markers and genes that could be utilized to accelerate the release of robust T. cacao cultivars. However, problems inherent with assembly and resolution of distal regions of complex eukaryotic genomes, such as gaps, chimeric joins, and unresolvable repeat-induced compressions, have been unavoidably encountered with the sequencing strategies selected. Results Here, we describe the construction of a BAC-based integrated genetic-physical map of the T. cacao cultivar Matina 1-6 which is designed to augment and enhance these sequencing efforts. Three BAC libraries, each comprised of 10× coverage, were constructed and fingerprinted. 230 genetic markers from a high-resolution genetic recombination map and 96 Arabidopsis-derived conserved ortholog set (COS) II markers were anchored using pooled overgo hybridization. A dense tile path consisting of 29,383 BACs was selected and end-sequenced. The physical map consists of 154 contigs and 4,268 singletons. Forty-nine contigs are genetically anchored and ordered to chromosomes for a total span of 307.2 Mbp. The unanchored contigs (105) span 67.4 Mbp and therefore the estimated genome size of T. cacao is 374.6 Mbp. A comparative analysis with A. thaliana, V. vinifera, and P. trichocarpa suggests that comparisons of the genome assemblies of these distantly related species could provide insights into genome structure, evolutionary history, conservation of functional sites, and improvements in physical map assembly. A comparison between the two T. cacao cultivars Matina 1-6 and Criollo indicates a high degree of collinearity in their genomes, yet rearrangements were also observed. Conclusions The results presented in this study are a stand-alone resource for functional exploitation and enhancement of Theobroma cacao but are also expected to complement and augment ongoing genome-sequencing efforts. This resource will serve as a template for refinement of the T. cacao genome through gap-filling, targeted re-sequencing, and resolution of repetitive DNA arrays. PMID:21846342

  18. A genetically anchored physical framework for Theobroma cacao cv. Matina 1-6.

    PubMed

    Saski, Christopher A; Feltus, Frank A; Staton, Margaret E; Blackmon, Barbara P; Ficklin, Stephen P; Kuhn, David N; Schnell, Raymond J; Shapiro, Howard; Motamayor, Juan Carlos

    2011-08-16

    The fermented dried seeds of Theobroma cacao (cacao tree) are the main ingredient in chocolate. World cocoa production was estimated to be 3 million tons in 2010 with an annual estimated average growth rate of 2.2%. The cacao bean production industry is currently under threat from a rise in fungal diseases including black pod, frosty pod, and witches' broom. In order to address these issues, genome-sequencing efforts have been initiated recently to facilitate identification of genetic markers and genes that could be utilized to accelerate the release of robust T. cacao cultivars. However, problems inherent with assembly and resolution of distal regions of complex eukaryotic genomes, such as gaps, chimeric joins, and unresolvable repeat-induced compressions, have been unavoidably encountered with the sequencing strategies selected. Here, we describe the construction of a BAC-based integrated genetic-physical map of the T. cacao cultivar Matina 1-6 which is designed to augment and enhance these sequencing efforts. Three BAC libraries, each comprised of 10× coverage, were constructed and fingerprinted. 230 genetic markers from a high-resolution genetic recombination map and 96 Arabidopsis-derived conserved ortholog set (COS) II markers were anchored using pooled overgo hybridization. A dense tile path consisting of 29,383 BACs was selected and end-sequenced. The physical map consists of 154 contigs and 4,268 singletons. Forty-nine contigs are genetically anchored and ordered to chromosomes for a total span of 307.2 Mbp. The unanchored contigs (105) span 67.4 Mbp and therefore the estimated genome size of T. cacao is 374.6 Mbp. A comparative analysis with A. thaliana, V. vinifera, and P. trichocarpa suggests that comparisons of the genome assemblies of these distantly related species could provide insights into genome structure, evolutionary history, conservation of functional sites, and improvements in physical map assembly. A comparison between the two T. cacao cultivars Matina 1-6 and Criollo indicates a high degree of collinearity in their genomes, yet rearrangements were also observed. The results presented in this study are a stand-alone resource for functional exploitation and enhancement of Theobroma cacao but are also expected to complement and augment ongoing genome-sequencing efforts. This resource will serve as a template for refinement of the T. cacao genome through gap-filling, targeted re-sequencing, and resolution of repetitive DNA arrays.

  19. Phylogenetic relationships among Lactuca (Asteraceae) species and related genera based on ITS-1 DNA sequences.

    PubMed

    Koopman, W J; Guetta, E; van de Wiel, C C; Vosman, B; van den Berg, R G

    1998-11-01

    Internal transcribed spacer (ITS-1) sequences from 97 accessions representing 23 species of Lactuca and related genera were determined and used to evaluate species relationships of Lactuca sensu lato (s.l.). The ITS-1 phylogenies, calculated using PAUP and PHYLIP, correspond better to the classification of Feráková than to other classifications evaluated, although the inclusion of sect. Lactuca subsect. Cyanicae is not supported. Therefore, exclusion of subsect. Cyanicae from Lactuca sensu Feráková is proposed. The amended genus contains the entire gene pool (sensu Harlan and De Wet) of cultivated lettuce (Lactuca sativa). The position of the species in the amended classification corresponds to their position in the lettuce gene pool. In the ITS-1 phylogenies, a clade with L. sativa, L. serriola, L. dregeana, L. altaica, and L. aculeata represents the primary gene pool. L. virosa and L. saligna, branching off closest to this clade, encompass the secondary gene pool. L. virosa is possibly of hybrid origin. The primary and secondary gene pool species are classified in sect. Lactuca subsect. Lactuca. The species L. quercina, L. viminea, L. sibirica, and L. tatarica, branching off next, represent the tertiary gene pool. They are classified in Lactuca sect. Lactucopsis, sect. Phaenixopus, and sect. Mulgedium, respectively. L. perennis and L. tenerrima, classified in sect. Lactuca subsect. Cyanicae, form clades with species from related genera and are not part of the lettuce gene pool.

  20. Exome copy number variation detection: Use of a pool of unrelated healthy tissue as reference sample.

    PubMed

    Wenric, Stephane; Sticca, Tiberio; Caberg, Jean-Hubert; Josse, Claire; Fasquelle, Corinne; Herens, Christian; Jamar, Mauricette; Max, Stéphanie; Gothot, André; Caers, Jo; Bours, Vincent

    2017-01-01

    An increasing number of bioinformatic tools designed to detect CNVs (copy number variants) in tumor samples based on paired exome data where a matched healthy tissue constitutes the reference have been published in the recent years. The idea of using a pool of unrelated healthy DNA as reference has previously been formulated but not thoroughly validated. As of today, the gold standard for CNV calling is still aCGH but there is an increasing interest in detecting CNVs by exome sequencing. We propose to design a metric allowing the comparison of two CNV profiles, independently of the technique used and assessed the validity of using a pool of unrelated healthy DNA instead of a matched healthy tissue as reference in exome-based CNV detection. We compared the CNV profiles obtained with three different approaches (aCGH, exome sequencing with a matched healthy tissue as reference, exome sequencing with a pool of eight unrelated healthy tissue as reference) on three multiple myeloma samples. We show that the usual analyses performed to compare CNV profiles (deletion/amplification ratios and CNV size distribution) lack in precision when confronted with low LRR values, as they only consider the binary status of each CNV. We show that the metric-based distance constitutes a more accurate comparison of two CNV profiles. Based on these analyses, we conclude that a reliable picture of CNV alterations in multiple myeloma samples can be obtained from whole-exome sequencing in the absence of a matched healthy sample. © 2016 WILEY PERIODICALS, INC.

  1. Nanometer-Scale Chemistry of a Calcite Biomineralization Template: Implications for Skeletal Composition and Nucleation.

    PubMed

    Branson, Oscar; Bonnin, Elisa A; Perea, Daniel E; Spero, Howard J; Zhu, Zihua; Winters, Maria; Hönisch, Bärbel; Russell, Ann D; Fehrenbacher, Jennifer S; Gagnon, Alexander C

    2016-11-15

    Plankton, corals, and other organisms produce calcium carbonate skeletons that are integral to their survival, form a key component of the global carbon cycle, and record an archive of past oceanographic conditions in their geochemistry. A key aspect of the formation of these biominerals is the interaction between organic templating structures and mineral precipitation processes. Laboratory-based studies have shown that these atomic-scale processes can profoundly influence the architecture and composition of minerals, but their importance in calcifying organisms is poorly understood because it is difficult to measure the chemistry of in vivo biomineral interfaces at spatially relevant scales. Understanding the role of templates in biomineral nucleation, and their importance in skeletal geochemistry requires an integrated, multiscale approach, which can place atom-scale observations of organic-mineral interfaces within a broader structural and geochemical context. Here we map the chemistry of an embedded organic template structure within a carbonate skeleton of the foraminifera Orbulina universa using both atom probe tomography (APT), a 3D chemical imaging technique with Ångström-level spatial resolution, and time-of-flight secondary ionization mass spectrometry (ToF-SIMS), a 2D chemical imaging technique with submicron resolution. We quantitatively link these observations, revealing that the organic template in O. universa is uniquely enriched in both Na and Mg, and contributes to intraskeletal chemical heterogeneity. Our APT analyses reveal the cation composition of the organic surface, offering evidence to suggest that cations other than Ca 2+ , previously considered passive spectator ions in biomineral templating, may be important in defining the energetics of carbonate nucleation on organic templates.

  2. Massive integration of diverse protein quality assessment methods to improve template based modeling in CASP11.

    PubMed

    Cao, Renzhi; Bhattacharya, Debswapna; Adhikari, Badri; Li, Jilong; Cheng, Jianlin

    2016-09-01

    Model evaluation and selection is an important step and a big challenge in template-based protein structure prediction. Individual model quality assessment methods designed for recognizing some specific properties of protein structures often fail to consistently select good models from a model pool because of their limitations. Therefore, combining multiple complimentary quality assessment methods is useful for improving model ranking and consequently tertiary structure prediction. Here, we report the performance and analysis of our human tertiary structure predictor (MULTICOM) based on the massive integration of 14 diverse complementary quality assessment methods that was successfully benchmarked in the 11th Critical Assessment of Techniques of Protein Structure prediction (CASP11). The predictions of MULTICOM for 39 template-based domains were rigorously assessed by six scoring metrics covering global topology of Cα trace, local all-atom fitness, side chain quality, and physical reasonableness of the model. The results show that the massive integration of complementary, diverse single-model and multi-model quality assessment methods can effectively leverage the strength of single-model methods in distinguishing quality variation among similar good models and the advantage of multi-model quality assessment methods of identifying reasonable average-quality models. The overall excellent performance of the MULTICOM predictor demonstrates that integrating a large number of model quality assessment methods in conjunction with model clustering is a useful approach to improve the accuracy, diversity, and consequently robustness of template-based protein structure prediction. Proteins 2016; 84(Suppl 1):247-259. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.

  3. Image-driven Population Analysis through Mixture Modeling

    PubMed Central

    Sabuncu, Mert R.; Balci, Serdar K.; Shenton, Martha E.; Golland, Polina

    2009-01-01

    We present iCluster, a fast and efficient algorithm that clusters a set of images while co-registering them using a parameterized, nonlinear transformation model. The output of the algorithm is a small number of template images that represent different modes in a population. This is in contrast with traditional, hypothesis-driven computational anatomy approaches that assume a single template to construct an atlas. We derive the algorithm based on a generative model of an image population as a mixture of deformable template images. We validate and explore our method in four experiments. In the first experiment, we use synthetic data to explore the behavior of the algorithm and inform a design choice on parameter settings. In the second experiment, we demonstrate the utility of having multiple atlases for the application of localizing temporal lobe brain structures in a pool of subjects that contains healthy controls and schizophrenia patients. Next, we employ iCluster to partition a data set of 415 whole brain MR volumes of subjects aged 18 through 96 years into three anatomical subgroups. Our analysis suggests that these subgroups mainly correspond to age groups. The templates reveal significant structural differences across these age groups that confirm previous findings in aging research. In the final experiment, we run iCluster on a group of 15 patients with dementia and 15 age-matched healthy controls. The algorithm produces two modes, one of which contains dementia patients only. These results suggest that the algorithm can be used to discover sub-populations that correspond to interesting structural or functional “modes.” PMID:19336293

  4. VizieR Online Data Catalog: 05 through L3 empirical stellar spectra from SDSS (Kesseli+, 2017)

    NASA Astrophysics Data System (ADS)

    Kesseli, A. Y.; West, A. A.; Veyette, M.; Harrison, B.; Feldman, D.; Bochanski, J. J.

    2017-08-01

    We present a library of empirical stellar spectra created using spectra from the Sloan Digital Sky Survey's Baryon Oscillation Spectroscopic Survey. The templates cover spectral types O5 through L3, are binned by metallicity from -2.0dex through +1.0dex, and are separated into main-sequence (dwarf) stars and giant stars. With recently developed M dwarf metallicity indicators, we are able to extend the metallicity bins down through the spectral subtype M8, making this the first empirical library with this degree of temperature and metallicity coverage. The wavelength coverage for the templates is from 3650 to 10200Å at a resolution of better than R~2000. Using the templates, we identify trends in color space with metallicity and surface gravity, which will be useful for analyzing large data sets from upcoming missions like the Large Synoptic Survey Telescope. Along with the templates, we are releasing a code for automatically (and/or visually) identifying the spectral type and metallicity of a star. (3 data files).

  5. Extensive Geographic Mosaicism in Avian Influenza Viruses from Gulls in the Northern Hemisphere

    PubMed Central

    Wille, Michelle; Robertson, Gregory J.; Whitney, Hugh; Bishop, Mary Anne; Runstadler, Jonathan A.; Lang, Andrew S.

    2011-01-01

    Due to limited interaction of migratory birds between Eurasia and America, two independent avian influenza virus (AIV) gene pools have evolved. There is evidence of low frequency reassortment between these regions, which has major implications in global AIV dynamics. Indeed, all currently circulating lineages of the PB1 and PA segments in North America are of Eurasian origin. Large-scale analyses of intercontinental reassortment have shown that viruses isolated from Charadriiformes (gulls, terns, and shorebirds) are the major contributor of these outsider events. To clarify the role of gulls in AIV dynamics, specifically in movement of genes between geographic regions, we have sequenced six gull AIV isolated in Alaska and analyzed these along with 142 other available gull virus sequences. Basic investigations of host species and the locations and times of isolation reveal biases in the available sequence information. Despite these biases, our analyses reveal a high frequency of geographic reassortment in gull viruses isolated in America. This intercontinental gene mixing is not found in the viruses isolated from gulls in Eurasia. This study demonstrates that gulls are important as vectors for geographically reassorted viruses, particularly in America, and that more surveillance effort should be placed on this group of birds. PMID:21697989

  6. Overview of recurrent chromosomal losses in retinoblastoma detected by low coverage next generation sequencing

    PubMed Central

    García-Chequer, A.J.; Méndez-Tenorio, A.; Olguín-Ruiz, G.; Sánchez-Vallejo, C.; Isa, P.; Arias, C.F.; Torres, J.; Hernández-Angeles, A.; Ramírez-Ortiz, M.A.; Lara, C.; Cabrera-Muñoz, M.L.; Sadowinski-Pine, S.; Bravo-Ortiz, J.C.; Ramón-García, G.; Diegopérez-Ramírez, J.; Ramírez-Reyes, G.; Casarrubias-Islas, R.; Ramírez, J.; Orjuela, M.A.; Ponce-Castañeda, M.V.

    2016-01-01

    Genes are frequently lost or gained in malignant tumors and the analysis of these changes can be informative about the underlying tumor biology. Retinoblastoma is a pediatric intraocular malignancy, and since deletions in chromosome 13 have been described in this tumor, we performed genome wide sequencing with the Illumina platform to test whether recurrent losses could be detected in low coverage data from DNA pools of Rb cases. An in silico reference profile for each pool was created from the human genome sequence GRCh37p5; a chromosome integrity score and a graphics 40 Kb window analysis approach, allowed us to identify with high resolution previously reported non random recurrent losses in all chromosomes of these tumors. We also found a pattern of gains and losses associated to clear and dark cytogenetic bands respectively. We further analyze a pool of medulloblastoma and found a more stable genomic profile and previously reported losses in this tumor. This approach facilitates identification of recurrent deletions from many patients that may be biological relevant for tumor development. PMID:26883451

  7. Tests of selection in pooled case-control data: an empirical study.

    PubMed

    Udpa, Nitin; Zhou, Dan; Haddad, Gabriel G; Bafna, Vineet

    2011-01-01

    For smaller organisms with faster breeding cycles, artificial selection can be used to create sub-populations with different phenotypic traits. Genetic tests can be employed to identify the causal markers for the phenotypes, as a precursor to engineering strains with a combination of traits. Traditional approaches involve analyzing crosses of inbred strains to test for co-segregation with genetic markers. Here we take advantage of cheaper next generation sequencing techniques to identify genetic signatures of adaptation to the selection constraints. Obtaining individual sequencing data is often unrealistic due to cost and sample issues, so we focus on pooled genomic data. We explore a series of statistical tests for selection using pooled case (under selection) and control populations. The tests generally capture skews in the scaled frequency spectrum of alleles in a region, which are indicative of a selective sweep. Extensive simulations are used to show that these approaches work well for a wide range of population divergence times and strong selective pressures. Control vs control simulations are used to determine an empirical False Positive Rate, and regions under selection are determined using a 1% FPR level. We show that pooling does not have a significant impact on statistical power. The tests are also robust to reasonable variations in several different parameters, including window size, base-calling error rate, and sequencing coverage. We then demonstrate the viability (and the challenges) of one of these methods in two independent Drosophila populations (Drosophila melanogaster) bred under selection for hypoxia and accelerated development, respectively. Testing for extreme hypoxia tolerance showed clear signals of selection, pointing to loci that are important for hypoxia adaptation. Overall, we outline a strategy for finding regions under selection using pooled sequences, then devise optimal tests for that strategy. The approaches show promise for detecting selection, even several generations after fixation of the beneficial allele has occurred.

  8. Mixing effects on nitrogen and oxygen concentrations and the relationship to mean residence time in a hyporheic zone of a riffle-pool sequence

    USGS Publications Warehouse

    Naranjo, Ramon C.; Niswonger, Richard G.; Clinton Davis,

    2015-01-01

    Flow paths and residence times in the hyporheic zone are known to influence biogeochemical processes such as nitrification and denitrification. The exchange across the sediment-water interface may involve mixing of surface water and groundwater through complex hyporheic flow paths that contribute to highly variable biogeochemically active zones. Despite the recognition of these patterns in the literature, conceptualization and analysis of flow paths and nitrogen transformations beneath riffle-pool sequences often neglect to consider bed form driven exchange along the entire reach. In this study, the spatial and temporal distribution of dissolved oxygen (DO), nitrate (NO3-) and ammonium (NH4+) were monitored in the hyporheic zone beneath a riffle-pool sequence on a losing section of the Truckee River, NV. Spatially-varying hyporheic exchange and the occurrence of multi-scale hyporheic mixing cells are shown to influence concentrations of DO and NO3- and the mean residence time (MRT) of riffle and pool areas. Distinct patterns observed in piezometers are shown to be influenced by the first large flow event following a steady 8 month period of low flow conditions. Increases in surface water discharge resulted in reversed hydraulic gradients and production of nitrate through nitrification at small vertical spatial scales (0.10 to 0.25 m) beneath the sediment-water interface. In areas with high downward flow rates and low MRT, denitrification may be limited. The use of a longitudinal two-dimensional flow model helped identify important mechanisms such as multi-scale hyporheic mixing cells and spatially varying MRT, an important driver for nitrogen transformation in the riverbed. Our observations of DO and NO3- concentrations and model simulations highlight the role of multi-scale hyporheic mixing cells on MRT and nitrogen transformations in the hyporheic zone of riffle-pool sequences. This article is protected by copyright. All rights reserved.

  9. Polydopamine-Coated Magnetic Molecularly Imprinted Polymers with Fragment Template for Identification of Pulsatilla Saponin Metabolites in Rat Feces with UPLC-Q-TOF-MS.

    PubMed

    Zhang, Yu-Zhen; Zhang, Jia-Wei; Wang, Chong-Zhi; Zhou, Lian-Di; Zhang, Qi-Hui; Yuan, Chun-Su

    2018-01-24

    In this work, a modified pretreatment method using magnetic molecularly imprinted polymers (MMIPs) was successfully applied to study the metabolites of an important botanical with ultraperformance liquid chromatography/quadrupole time-of-flight mass spectrometry (UPLC-Q-TOF-MS). The MMIPs for glucoside-specific adsorption was used to identify metabolites of Pulsatilla chinensis in rat feces. Polymers were prepared by using Fe 3 O 4 nanoparticles as the supporting matrix, d-glucose as fragment template, and dopamine as the functional monomer and cross-linker. Results showed that MMIPs exhibited excellent extraction performance, large adsorption capacity (5.65 mg/g), fast kinetics (60 min), and magnetic separation. Furthermore, the MMIPs coupled with UPLC-Q-TOF-MS were successfully utilized for the identification of 17 compounds including 15 metabolites from the Pulsatilla saponin metabolic pool. This study provides a reliable protocol for the separation and identification of saponin metabolites in a complex biological sample, including those from herbal medicines.

  10. Topological transformation of a trefoil knot into a [2]catenane.

    PubMed

    Prakasam, Thirumurugan; Bilbeisi, Rana A; El-Khoury, Roberto; Charbonnière, Loïc J; Elhabiri, Mourad; Esposito, Gennaro; Olsen, John-Carl; Trabolsi, Ali

    2017-12-21

    Topological transformation of a zinc-templated trefoil knot, Zn-TK, into a zinc-templated [2]catenane, Zn-[2]C, was studied. The net reaction 2 Zn-TK→3 Zn-[2]C was accomplished in 89% yield by heating a solution of Zn-TK in D 2 O. Kinetic investigation by 1 H NMR spectroscopy and high resolution mass spectrometry revealed that the mechanism is complex, involving a large pool of intermediates that form after imine bond cleavage. Bromide ions, which can occupy the central cavity of Zn-TK, inhibited the reaction. Two similar transformations were also studied, one of a cadmium-containing trefoil knot, Cd-TK, into a cadmium-containing catenane, Cd-[2]C, and the other of Cd-TK into Zn-[2]C. The latter transformation could be achieved in one step at high temperature or in two steps via transmetallation to form Zn-TK at room temperature followed by topological conversion of Zn-TK to Zn-[2]C at high temperature.

  11. Biometric template revocation

    NASA Astrophysics Data System (ADS)

    Arndt, Craig M.

    2004-08-01

    Biometric are a powerful technology for identifying humans both locally and at a distance. In order to perform identification or verification biometric systems capture an image of some biometric of a user or subject. The image is then converted mathematical to representation of the person call a template. Since we know that every human in the world is different each human will have different biometric images (different fingerprints, or faces, etc.). This is what makes biometrics useful for identification. However unlike a credit card number or a password to can be given to a person and later revoked if it is compromised and biometric is with the person for life. The problem then is to develop biometric templates witch can be easily revoked and reissued which are also unique to the user and can be easily used for identification and verification. In this paper we develop and present a method to generate a set of templates which are fully unique to the individual and also revocable. By using bases set compression algorithms in an n-dimensional orthogonal space we can represent a give biometric image in an infinite number of equally valued and unique ways. The verification and biometric matching system would be presented with a given template and revocation code. The code will then representing where in the sequence of n-dimensional vectors to start the recognition.

  12. Optimizing Pt/TiO2 templates for textured PZT growth and MEMS devices

    NASA Astrophysics Data System (ADS)

    Potrepka, Daniel; Fox, Glenn; Sanchez, Luz; Polcawich, Ronald

    2013-03-01

    Crystallographic texture of lead zirconate titanate (PZT) thin films strongly influences piezoelectric properties used in MEMS applications. Textured growth can be achieved by relying on crystal growth habit and can also be initiated by the use of a seed-layer heteroepitaxial template. Template choice and the process used to form it determine structural quality, ultimately influencing performance and reliability of MEMS PZT devices such as switches, filters, and actuators. This study focuses on how 111-textured PZT is generated by a combination of crystal habit and templating mechanisms that occur in the PZT/bottom-electrode stack. The sequence begins with 0001-textured Ti deposited on thermally grown SiO2 on a Si wafer. The Ti is converted to 100-textured TiO2 (rutile) through thermal oxidation. Then 111-textured Pt can be grown to act as a template for 111-textured PZT. Ti and Pt are deposited by DC magnetron sputtering. TiO2 and Pt film textures and structure were optimized by variation of sputtering deposition times, temperatures and power levels, and post-deposition anneal conditions. The relationship between Ti, TiO2, and Pt texture and their impact on PZT growth will be presented. Also affiliated with U.S. Army Research Lab, Adelphi, MD 20783, USA

  13. Multifunctional Dumbbell-Shaped DNA-Templated Selective Formation of Fluorescent Silver Nanoclusters or Copper Nanoparticles for Sensitive Detection of Biomolecules.

    PubMed

    Chen, Jinyang; Ji, Xinghu; Tinnefeld, Philip; He, Zhike

    2016-01-27

    In this work, a multifunctional template for selective formation of fluorescent silver nanoclusters (AgNCs) or copper nanoparticles (CuNPs) is put forward. This dumbbell-shaped (DS) DNA template is made up of two cytosine hairpin loops and an adenine-thymine-rich double-helical stem which is closed by the loops. The cytosine loops act as specific regions for the growth of AgNCs, and the double-helical stem serves as template for the CuNPs formation. By carefully investigating the sequence and length of DS DNA, we present the optimal design of the template. Benefiting from the smart design and facile synthesis, a simple, label-free, and ultrasensitive fluorescence strategy for adenosine triphosphate (ATP) detection is proposed. Through the systematic comparison, it is found that the strategy based on CuNPs formation is more sensitive for ATP assay than that based on AgNCs synthesis, and the detection limitation was found to be 81 pM. What's more, the CuNPs formation-based method is successfully applied in the detection of ATP in human serum as well as the determination of cellular ATP. In addition to small target molecule, the sensing strategy was also extended to the detection of biomacromolecule (DNA), which illustrates the generality of this biosensor.

  14. Mosaic PPM1D mutations are associated with predisposition to breast and ovarian cancer.

    PubMed

    Ruark, Elise; Snape, Katie; Humburg, Peter; Loveday, Chey; Bajrami, Ilirjana; Brough, Rachel; Rodrigues, Daniel Nava; Renwick, Anthony; Seal, Sheila; Ramsay, Emma; Duarte, Silvana Del Vecchio; Rivas, Manuel A; Warren-Perry, Margaret; Zachariou, Anna; Campion-Flora, Adriana; Hanks, Sandra; Murray, Anne; Ansari Pour, Naser; Douglas, Jenny; Gregory, Lorna; Rimmer, Andrew; Walker, Neil M; Yang, Tsun-Po; Adlard, Julian W; Barwell, Julian; Berg, Jonathan; Brady, Angela F; Brewer, Carole; Brice, Glen; Chapman, Cyril; Cook, Jackie; Davidson, Rosemarie; Donaldson, Alan; Douglas, Fiona; Eccles, Diana; Evans, D Gareth; Greenhalgh, Lynn; Henderson, Alex; Izatt, Louise; Kumar, Ajith; Lalloo, Fiona; Miedzybrodzka, Zosia; Morrison, Patrick J; Paterson, Joan; Porteous, Mary; Rogers, Mark T; Shanley, Susan; Walker, Lisa; Gore, Martin; Houlston, Richard; Brown, Matthew A; Caufield, Mark J; Deloukas, Panagiotis; McCarthy, Mark I; Todd, John A; Turnbull, Clare; Reis-Filho, Jorge S; Ashworth, Alan; Antoniou, Antonis C; Lord, Christopher J; Donnelly, Peter; Rahman, Nazneen

    2013-01-17

    Improved sequencing technologies offer unprecedented opportunities for investigating the role of rare genetic variation in common disease. However, there are considerable challenges with respect to study design, data analysis and replication. Using pooled next-generation sequencing of 507 genes implicated in the repair of DNA in 1,150 samples, an analytical strategy focused on protein-truncating variants (PTVs) and a large-scale sequencing case-control replication experiment in 13,642 individuals, here we show that rare PTVs in the p53-inducible protein phosphatase PPM1D are associated with predisposition to breast cancer and ovarian cancer. PPM1D PTV mutations were present in 25 out of 7,781 cases versus 1 out of 5,861 controls (P = 1.12 × 10(-5)), including 18 mutations in 6,912 individuals with breast cancer (P = 2.42 × 10(-4)) and 12 mutations in 1,121 individuals with ovarian cancer (P = 3.10 × 10(-9)). Notably, all of the identified PPM1D PTVs were mosaic in lymphocyte DNA and clustered within a 370-base-pair region in the final exon of the gene, carboxy-terminal to the phosphatase catalytic domain. Functional studies demonstrate that the mutations result in enhanced suppression of p53 in response to ionizing radiation exposure, suggesting that the mutant alleles encode hyperactive PPM1D isoforms. Thus, although the mutations cause premature protein truncation, they do not result in the simple loss-of-function effect typically associated with this class of variant, but instead probably have a gain-of-function effect. Our results have implications for the detection and management of breast and ovarian cancer risk. More generally, these data provide new insights into the role of rare and of mosaic genetic variants in common conditions, and the use of sequencing in their identification.

  15. Mosaic PPM1D mutations are associated with predisposition to breast and ovarian cancer

    PubMed Central

    Ruark, Elise; Snape, Katie; Humburg, Peter; Loveday, Chey; Bajrami, Ilirjana; Brough, Rachel; Rodrigues, Daniel Nava; Renwick, Anthony; Seal, Sheila; Ramsay, Emma; Duarte, Silvana Del Vecchio; Rivas, Manuel A.; Warren-Perry, Margaret; Zachariou, Anna; Campion-Flora, Adriana; Hanks, Sandra; Murray, Anne; Pour, Naser Ansari; Douglas, Jenny; Gregory, Lorna; Rimmer, Andrew; Walker, Neil M.; Yang, Tsun-Po; Adlard, Julian W.; Barwell, Julian; Berg, Jonathan; Brady, Angela F.; Brewer, Carole; Brice, Glen; Chapman, Cyril; Cook, Jackie; Davidson, Rosemarie; Donaldson, Alan; Douglas, Fiona; Eccles, Diana; Evans, D. Gareth; Greenhalgh, Lynn; Henderson, Alex; Izatt, Louise; Kumar, Ajith; Lalloo, Fiona; Miedzybrodzka, Zosia; Morrison, Patrick J.; Paterson, Joan; Porteous, Mary; Rogers, Mark T.; Shanley, Susan; Walker, Lisa; Gore, Martin; Houlston, Richard; Brown, Matthew A.; Caufield, Mark J.; Deloukas, Panagiotis; McCarthy, Mark I.; Todd, John A.; Turnbull, Clare; Reis-Filho, Jorge S.; Ashworth, Alan; Antoniou, Antonis C.; Lord, Christopher J.; Donnelly, Peter; Rahman, Nazneen

    2013-01-01

    Improved sequencing technologies offer unprecedented opportunities for investigating the role of rare genetic variation in common disease. However, there are considerable challenges with respect to study design, data analysis and replication1. Here, using pooled next-generation sequencing of 507 genes implicated in the repair of DNA in 1,150 samples, an analytical strategy focussed on protein truncating variants (PTVs) and a large-scale sequencing case-control replication experiment in 13,642 individuals, we show that rare PTVs in the p53 inducible protein phosphatase PPM1D are associated with predisposition to breast cancer and to ovarian cancer. PPM1D PTV mutations were present in 25/7781 cases vs 1/5861 controls; P=1.12×10−5, which included 18 mutations in 6,912 individuals with breast cancer; P = 2.42×10−4 and 12 mutations in 1,121 individuals with ovarian cancer; P = 3.10×10−9. Notably, all the identified PPM1D PTVs were mosaic in lymphocyte DNA and clustered within a 370 bp region in the final exon of the gene, C-terminal to the phosphatase catalytic domain. Functional studies demonstrated that the mutations result in enhanced suppression of p53 in response to ionising radiation exposure, suggesting the mutant alleles encode hyperactive PPM1D isoforms. Thus, although the mutations cause premature protein truncation, they do not result in the simple loss-of-function typically associated with this class of variant, but instead likely have a gain-of-function effect. Our results have implications for the detection and management of breast and ovarian cancer risk. More generally, these data provide new insights into the role of rare and of mosaic genetic variants in common conditions, and the utility of sequencing in their identification. PMID:23242139

  16. Characterizing convective cold pools: Characterizing Convective Cold Pools

    DOE PAGES

    Drager, Aryeh J.; van den Heever, Susan C.

    2017-05-09

    Cold pools produced by convective storms play an important role in Earth's climate system. However, a common framework does not exist for objectively identifying convective cold pools in observations and models. The present study investigates convective cold pools within a simulation of tropical continental convection that uses a cloud-resolving model with a coupled land-surface model. Multiple variables are assessed for their potential in identifying convective cold pool boundaries, and a novel technique is developed and tested for identifying and tracking cold pools in numerical model simulations. This algorithm is based on surface rainfall rates and radial gradients in the densitymore » potential temperature field. The algorithm successfully identifies near-surface cold pool boundaries and is able to distinguish between connected cold pools. Once cold pools have been identified and tracked, composites of cold pool evolution are then constructed, and average cold pool properties are investigated. Wet patches are found to develop within the centers of cold pools where the ground has been soaked with rainwater. These wet patches help to maintain cool surface temperatures and reduce cold pool dissipation, which has implications for the development of subsequent convection.« less

  17. Characterizing convective cold pools: Characterizing Convective Cold Pools

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Drager, Aryeh J.; van den Heever, Susan C.

    Cold pools produced by convective storms play an important role in Earth's climate system. However, a common framework does not exist for objectively identifying convective cold pools in observations and models. The present study investigates convective cold pools within a simulation of tropical continental convection that uses a cloud-resolving model with a coupled land-surface model. Multiple variables are assessed for their potential in identifying convective cold pool boundaries, and a novel technique is developed and tested for identifying and tracking cold pools in numerical model simulations. This algorithm is based on surface rainfall rates and radial gradients in the densitymore » potential temperature field. The algorithm successfully identifies near-surface cold pool boundaries and is able to distinguish between connected cold pools. Once cold pools have been identified and tracked, composites of cold pool evolution are then constructed, and average cold pool properties are investigated. Wet patches are found to develop within the centers of cold pools where the ground has been soaked with rainwater. These wet patches help to maintain cool surface temperatures and reduce cold pool dissipation, which has implications for the development of subsequent convection.« less

  18. Identification of presumed ancestral DNA sequences of phaseolin in Phaseolus vulgaris.

    PubMed Central

    Kami, J; Velásquez, V B; Debouck, D G; Gepts, P

    1995-01-01

    Common bean (Phaseolus vulgaris) consists of two major geographic gene pools, one distributed in Mexico, Central America, and Colombia and the other in the southern Andes (southern Peru, Bolivia, and Argentina). Amplification and sequencing of members of the multigene family coding for phaseolin, the major seed storage protein of the common bean, provide evidence for accumulation of tandem direct repeats in both introns and exons during evolution of the multigene family in this species. The presumed ancestral phaseolin sequences, without tandem repeats, were found in recently discovered but nearly extinct wild common bean populations of Ecuador and northern Peru that are intermediate between the two major gene pools of the species based on geographical and molecular arguments. Our results illustrate the usefulness of tandem direct repeats in establishing the polarity of DNA sequence divergence and therefore in proposing phylogenies. Images Fig. 1 Fig. 3 PMID:7862642

  19. Nanopore-CMOS Interfaces for DNA Sequencing

    PubMed Central

    Magierowski, Sebastian; Huang, Yiyun; Wang, Chengjie; Ghafar-Zadeh, Ebrahim

    2016-01-01

    DNA sequencers based on nanopore sensors present an opportunity for a significant break from the template-based incumbents of the last forty years. Key advantages ushered by nanopore technology include a simplified chemistry and the ability to interface to CMOS technology. The latter opportunity offers substantial promise for improvement in sequencing speed, size and cost. This paper reviews existing and emerging means of interfacing nanopores to CMOS technology with an emphasis on massively-arrayed structures. It presents this in the context of incumbent DNA sequencing techniques, reviews and quantifies nanopore characteristics and models and presents CMOS circuit methods for the amplification of low-current nanopore signals in such interfaces. PMID:27509529

  20. Nanopore-CMOS Interfaces for DNA Sequencing.

    PubMed

    Magierowski, Sebastian; Huang, Yiyun; Wang, Chengjie; Ghafar-Zadeh, Ebrahim

    2016-08-06

    DNA sequencers based on nanopore sensors present an opportunity for a significant break from the template-based incumbents of the last forty years. Key advantages ushered by nanopore technology include a simplified chemistry and the ability to interface to CMOS technology. The latter opportunity offers substantial promise for improvement in sequencing speed, size and cost. This paper reviews existing and emerging means of interfacing nanopores to CMOS technology with an emphasis on massively-arrayed structures. It presents this in the context of incumbent DNA sequencing techniques, reviews and quantifies nanopore characteristics and models and presents CMOS circuit methods for the amplification of low-current nanopore signals in such interfaces.

  1. Improving quality and patient satisfaction in a pediatric resident continuity clinic through advanced access scheduling.

    PubMed

    Tuli, Sanjeev Y; Thompson, Lindsay A; Ryan, Kathleen A; Srinivas, Ganga L; Fillipps, Donald J; Young, Christopher M; Tuli, Sonal S

    2010-06-01

    To evaluate the impact of advanced access scheduling in a pediatric residency clinic on resident and patient satisfaction, medical education, practice quality, and efficiency. Residents were assigned to either the advanced access template (10 appointments available to patients and 2 physician overbooks) or the prior template (5 available and 8 overbooks). Outcomes included resident and patient satisfaction, appointment availability, and continuity of care and clinic costs. Patient satisfaction improved in 7 areas (P < .001). Residents in either template did not report an impact on medical education experiences. Significant increases were realized with appointment availability and the number of patients seen. Continuity also increased as the overflow/acute visits decreased (P < .001). Overall costs per visit decreased 22%. Because of the significant improvements in access, continuity, and efficiency, all residents were switched to the advanced access template after completion of the study. Improvement in access to the primary physician has a significant impact on patient satisfaction with health care delivery. This model optimizes the limited time that residents have in continuity clinic, and it has implications for health care delivery quality improvement.

  2. Pooled-BAC sequencing of a black pod resistance region (cBPQTL12) in T. cacao

    USDA-ARS?s Scientific Manuscript database

    Whole genome sequencing (WGS) is an expensive and technically challenging endeavor. An alternative to WGS is to sequence specific chromosomal segments of biological interest (e.g. a QTL interval). This method is cheaper than WGS and reduces the risk of misassembly from distal parts of the genome. Us...

  3. Environmental genomics of "Haloquadratum walsbyi" in a saltern crystallizer indicates a large pool of accessory genes in an otherwise coherent species

    PubMed Central

    Legault, Boris A; Lopez-Lopez, Arantxa; Alba-Casado, Jose Carlos; Doolittle, W Ford; Bolhuis, Henk; Rodriguez-Valera, Francisco; Papke, R Thane

    2006-01-01

    Background Mature saturated brine (crystallizers) communities are largely dominated (>80% of cells) by the square halophilic archaeon "Haloquadratum walsbyi". The recent cultivation of the strain HBSQ001 and thesequencing of its genome allows comparison with the metagenome of this taxonomically simplified environment. Similar studies carried out in other extreme environments have revealed very little diversity in gene content among the cell lineages present. Results The metagenome of the microbial community of a crystallizer pond has been analyzed by end sequencing a 2000 clone fosmid library and comparing the sequences obtained with the genome sequence of "Haloquadratum walsbyi". The genome of the sequenced strain was retrieved nearly complete within this environmental DNA library. However, many ORF's that could be ascribed to the "Haloquadratum" metapopulation by common genome characteristics or scaffolding to the strain genome were not present in the specific sequenced isolate. Particularly, three regions of the sequenced genome were associated with multiple rearrangements and the presence of different genes from the metapopulation. Many transposition and phage related genes were found within this pool which, together with the associated atypical GC content in these areas, supports lateral gene transfer mediated by these elements as the most probable genetic cause of this variability. Additionally, these sequences were highly enriched in putative regulatory and signal transduction functions. Conclusion These results point to a large pan-genome (total gene repertoire of the genus/species) even in this highly specialized extremophile and at a single geographic location. The extensive gene repertoire is what might be expected of a population that exploits a diverse nutrient pool, resulting from the degradation of biomass produced at lower salinities. PMID:16820057

  4. Method and Apparatus for Evaluating the Visual Quality of Processed Digital Video Sequences

    NASA Technical Reports Server (NTRS)

    Watson, Andrew B. (Inventor)

    2002-01-01

    A Digital Video Quality (DVQ) apparatus and method that incorporate a model of human visual sensitivity to predict the visibility of artifacts. The DVQ method and apparatus are used for the evaluation of the visual quality of processed digital video sequences and for adaptively controlling the bit rate of the processed digital video sequences without compromising the visual quality. The DVQ apparatus minimizes the required amount of memory and computation. The input to the DVQ apparatus is a pair of color image sequences: an original (R) non-compressed sequence, and a processed (T) sequence. Both sequences (R) and (T) are sampled, cropped, and subjected to color transformations. The sequences are then subjected to blocking and discrete cosine transformation, and the results are transformed to local contrast. The next step is a time filtering operation which implements the human sensitivity to different time frequencies. The results are converted to threshold units by dividing each discrete cosine transform coefficient by its respective visual threshold. At the next stage the two sequences are subtracted to produce an error sequence. The error sequence is subjected to a contrast masking operation, which also depends upon the reference sequence (R). The masked errors can be pooled in various ways to illustrate the perceptual error over various dimensions, and the pooled error can be converted to a visual quality measure.

  5. Phylogeographic Differentiation of Mitochondrial DNA in Han Chinese

    PubMed Central

    Yao, Yong-Gang; Kong, Qing-Peng; Bandelt, Hans-Jürgen; Kivisild, Toomas; Zhang, Ya-Ping

    2002-01-01

    To characterize the mitochondrial DNA (mtDNA) variation in Han Chinese from several provinces of China, we have sequenced the two hypervariable segments of the control region and the segment spanning nucleotide positions 10171–10659 of the coding region, and we have identified a number of specific coding-region mutations by direct sequencing or restriction-fragment–length–polymorphism tests. This allows us to define new haplogroups (clades of the mtDNA phylogeny) and to dissect the Han mtDNA pool on a phylogenetic basis, which is a prerequisite for any fine-grained phylogeographic analysis, the interpretation of ancient mtDNA, or future complete mtDNA sequencing efforts. Some of the haplogroups under study differ considerably in frequencies across different provinces. The southernmost provinces show more pronounced contrasts in their regional Han mtDNA pools than the central and northern provinces. These and other features of the geographical distribution of the mtDNA haplogroups observed in the Han Chinese make an initial Paleolithic colonization from south to north plausible but would suggest subsequent migration events in China that mainly proceeded from north to south and east to west. Lumping together all regional Han mtDNA pools into one fictive general mtDNA pool or choosing one or two regional Han populations to represent all Han Chinese is inappropriate for prehistoric considerations as well as for forensic purposes or medical disease studies. PMID:11836649

  6. Role of Nuclear Pools of Aminoacyl-tRNA Synthetases in tRNA Nuclear Export

    PubMed Central

    Azad, Abul K.; Stanford, David R.; Sarkar, Srimonti; Hopper, Anita K.

    2001-01-01

    Reports of nuclear tRNA aminoacylation and its role in tRNA nuclear export (Lund and Dahlberg, 1998; Sarkar et al., 1999; Grosshans et al., 2000a) have led to the prediction that there should be nuclear pools of aminoacyl-tRNA synthetases. We report that in budding yeast there are nuclear pools of tyrosyl-tRNA synthetase, Tys1p. By sequence alignments we predicted a Tys1p nuclear localization sequence and showed it to be sufficient for nuclear location of a passenger protein. Mutations of this nuclear localization sequence in endogenous Tys1p reduce nuclear Tys1p pools, indicating that the motif is also important for nucleus location. The mutations do not significantly affect catalytic activity, but they do cause defects in export of tRNAs to the cytosol. Despite export defects, the cells are viable, indicating that nuclear tRNA aminoacylation is not required for all tRNA nuclear export paths. Because the tRNA nuclear exportin, Los1p, is also unessential, we tested whether tRNA aminoacylation and Los1p operate in alternative tRNA nuclear export paths. No genetic interactions between aminoacyl-tRNA synthetases and Los1p were detected, indicating that tRNA nuclear aminoacylation and Los1p operate in the same export pathway or there are more than two pathways for tRNA nuclear export. PMID:11359929

  7. Role of nuclear pools of aminoacyl-tRNA synthetases in tRNA nuclear export.

    PubMed

    Azad, A K; Stanford, D R; Sarkar, S; Hopper, A K

    2001-05-01

    Reports of nuclear tRNA aminoacylation and its role in tRNA nuclear export (Lund and Dahlberg, 1998; Sarkar et al., 1999; Grosshans et al., 20001) have led to the prediction that there should be nuclear pools of aminoacyl-tRNA synthetases. We report that in budding yeast there are nuclear pools of tyrosyl-tRNA synthetase, Tys1p. By sequence alignments we predicted a Tys1p nuclear localization sequence and showed it to be sufficient for nuclear location of a passenger protein. Mutations of this nuclear localization sequence in endogenous Tys1p reduce nuclear Tys1p pools, indicating that the motif is also important for nucleus location. The mutations do not significantly affect catalytic activity, but they do cause defects in export of tRNAs to the cytosol. Despite export defects, the cells are viable, indicating that nuclear tRNA aminoacylation is not required for all tRNA nuclear export paths. Because the tRNA nuclear exportin, Los1p, is also unessential, we tested whether tRNA aminoacylation and Los1p operate in alternative tRNA nuclear export paths. No genetic interactions between aminoacyl-tRNA synthetases and Los1p were detected, indicating that tRNA nuclear aminoacylation and Los1p operate in the same export pathway or there are more than two pathways for tRNA nuclear export.

  8. Aromatic claw: A new fold with high aromatic content that evades structural prediction: Aromatic Claw

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sachleben, Joseph R.; Adhikari, Aashish N.; Gawlak, Grzegorz

    2016-11-10

    We determined the NMR structure of a highly aromatic (13%) protein of unknown function, Aq1974 from Aquifex aeolicus (PDB ID: 5SYQ). The unusual sequence of this protein has a tryptophan content five times the normal (six tryptophan residues of 114 or 5.2% while the average tryptophan content is 1.0%) with the tryptophans occurring in a WXW motif. It has no detectable sequence homology with known protein structures. Although its NMR spectrum suggested that the protein was rich in β-sheet, upon resonance assignment and solution structure determination, the protein was found to be primarily α-helical with a small two-stranded β-sheet withmore » a novel fold that we have termed an Aromatic Claw. As this fold was previously unknown and the sequence unique, we submitted the sequence to CASP10 as a target for blind structural prediction. At the end of the competition, the sequence was classified a hard template based model; the structural relationship between the template and the experimental structure was small and the predictions all failed to predict the structure. CSRosetta was found to predict the secondary structure and its packing; however, it was found that there was little correlation between CSRosetta score and the RMSD between the CSRosetta structure and the NMR determined one. This work demonstrates that even in relatively small proteins, we do not yet have the capacity to accurately predict the fold for all primary sequences. The experimental discovery of new folds helps guide the improvement of structural prediction methods.« less

  9. DNA sequence requirements for the accurate transcription of a protein-coding plastid gene in a plastid in vitro system from mustard (Sinapis alba L.)

    PubMed Central

    Link, Gerhard

    1984-01-01

    A nuclease-treated plastid extract from mustard (Sinapis alba L.) allows efficient transcription of cloned plastid DNA templates. In this in vitro system, the major runoff transcript of the truncated gene for the 32 000 mol. wt. photosystem II protein was accurately initiated from a site close to or identical with the in vivo start site. By using plasmids with deletions in the 5'-flanking region of this gene as templates, a DNA region required for efficient and selective initiation was detected ˜28-35 nucleotides upstream of the transcription start site. This region contains the sequence element TTGACA, which matches the consensus sequence for prokaryotic `−35' promoter elements. In the absence of this region, a region ˜13-27 nucleotides upstream of the start site still enables a basic level of specific transcription. This second region contains the sequence element TATATAA, which matches the consensus sequence for the `TATA' box of genes transcribed by RNA polymerase II (or B). The region between the `TATA'-like element and the transcription start site is not sufficient but may be required for specific transcription of the plastid gene. This latter region contains the sequence element TATACT, which resembles the prokaryotic `−10' (Pribnow) box. Based on the structural and transcriptional features of the 5' upstream region, a `promoter switch' mechanism is proposed, which may account for the developmentally regulated expression of this plastid gene. ImagesFig. 1.Fig. 2.Fig. 3.Fig. 4.Figure 5. PMID:16453540

  10. Genomics of crop wild relatives: expanding the gene pool for crop improvement.

    PubMed

    Brozynska, Marta; Furtado, Agnelo; Henry, Robert J

    2016-04-01

    Plant breeders require access to new genetic diversity to satisfy the demands of a growing human population for more food that can be produced in a variable or changing climate and to deliver the high-quality food with nutritional and health benefits demanded by consumers. The close relatives of domesticated plants, crop wild relatives (CWRs), represent a practical gene pool for use by plant breeders. Genomics of CWR generates data that support the use of CWR to expand the genetic diversity of crop plants. Advances in DNA sequencing technology are enabling the efficient sequencing of CWR and their increased use in crop improvement. As the sequencing of genomes of major crop species is completed, attention has shifted to analysis of the wider gene pool of major crops including CWR. A combination of de novo sequencing and resequencing is required to efficiently explore useful genetic variation in CWR. Analysis of the nuclear genome, transcriptome and maternal (chloroplast and mitochondrial) genome of CWR is facilitating their use in crop improvement. Genome analysis results in discovery of useful alleles in CWR and identification of regions of the genome in which diversity has been lost in domestication bottlenecks. Targeting of high priority CWR for sequencing will maximize the contribution of genome sequencing of CWR. Coordination of global efforts to apply genomics has the potential to accelerate access to and conservation of the biodiversity essential to the sustainability of agriculture and food production. © 2015 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.

  11. Extended Minus-Strand DNA as Template for R-U5-Mediated Second-Strand Transfer in Recombinational Rescue of Primer Binding Site-Modified Retroviral Vectors

    PubMed Central

    Mikkelsen, Jacob Giehm; Lund, Anders H.; Dybkær, Karen; Duch, Mogens; Pedersen, Finn Skou

    1998-01-01

    We have previously demonstrated recombinational rescue of primer binding site (PBS)-impaired Akv murine leukemia virus-based vectors involving initial priming on endogenous viral sequences and template switching during cDNA synthesis to obtain PBS complementarity in second-strand transfer of reverse transcription (Mikkelsen et al., J. Virol. 70:1439–1447, 1996). By use of the same forced recombination system, we have now found recombinant proviruses of different structures, suggesting that PBS knockout vectors may be rescued through initial priming on endogenous virus RNA, read-through of the mutated PBS during minus-strand synthesis, and subsequent second-strand transfer mediated by the R-U5 complementarity of the plus strand and the extended minus-strand DNA acceptor template. Mechanisms for R-U5-mediated second-strand transfer and its possible role in retrovirus replication and evolution are discussed. PMID:9499117

  12. Comparison of Grouping Methods for Template Extraction from VA Medical Record Text.

    PubMed

    Redd, Andrew M; Gundlapalli, Adi V; Divita, Guy; Tran, Le-Thuy; Pettey, Warren B P; Samore, Matthew H

    2017-01-01

    We investigate options for grouping templates for the purpose of template identification and extraction from electronic medical records. We sampled a corpus of 1000 documents originating from Veterans Health Administration (VA) electronic medical record. We grouped documents through hashing and binning tokens (Hashed) as well as by the top 5% of tokens identified as important through the term frequency inverse document frequency metric (TF-IDF). We then compared the approaches on the number of groups with 3 or more and the resulting longest common subsequences (LCSs) common to all documents in the group. We found that the Hashed method had a higher success rate for finding LCSs, and longer LCSs than the TF-IDF method, however the TF-IDF approach found more groups than the Hashed and subsequently more long sequences, however the average length of LCSs were lower. In conclusion, each algorithm appears to have areas where it appears to be superior.

  13. Telomerase Mechanism of Telomere Synthesis

    PubMed Central

    Wu, R. Alex; Upton, Heather E.; Vogan, Jacob M.; Collins, Kathleen

    2017-01-01

    Telomerase is the essential reverse transcriptase required for linear chromosome maintenance in most eukaryotes. Telomerase supplements the tandem array of simple-sequence repeats at chromosome ends to compensate for the DNA erosion inherent in genome replication. The template for telomerase reverse transcriptase is within the RNA subunit of the ribonucleoprotein complex, which in cells contains additional telomerase holoenzyme proteins that assemble the active ribonucleoprotein and promote its function at telomeres. Telomerase is distinct among polymerases in its reiterative reuse of an internal template. The template is precisely defined, processively copied, and regenerated by release of single-stranded product DNA. New specificities of nucleic acid handling that underlie the catalytic cycle of repeat synthesis derive from both active site specialization and new motif elaborations in protein and RNA subunits. Studies of telomerase provide unique insights into cellular requirements for genome stability, tissue renewal, and tumorigenesis as well as new perspectives on dynamic ribonucleoprotein machines. PMID:28141967

  14. SPOT-ligand 2: improving structure-based virtual screening by binding-homology search on an expanded structural template library.

    PubMed

    Litfin, Thomas; Zhou, Yaoqi; Yang, Yuedong

    2017-04-15

    The high cost of drug discovery motivates the development of accurate virtual screening tools. Binding-homology, which takes advantage of known protein-ligand binding pairs, has emerged as a powerful discrimination technique. In order to exploit all available binding data, modelled structures of ligand-binding sequences may be used to create an expanded structural binding template library. SPOT-Ligand 2 has demonstrated significantly improved screening performance over its previous version by expanding the template library 15 times over the previous one. It also performed better than or similar to other binding-homology approaches on the DUD and DUD-E benchmarks. The server is available online at http://sparks-lab.org . yaoqi.zhou@griffith.edu.au or yuedong.yang@griffith.edu.au. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  15. Protein family clustering for structural genomics.

    PubMed

    Yan, Yongpan; Moult, John

    2005-10-28

    A major goal of structural genomics is the provision of a structural template for a large fraction of protein domains. The magnitude of this task depends on the number and nature of protein sequence families. With a large number of bacterial genomes now fully sequenced, it is possible to obtain improved estimates of the number and diversity of families in that kingdom. We have used an automated clustering procedure to group all sequences in a set of genomes into protein families. Bench-marking shows the clustering method is sensitive at detecting remote family members, and has a low level of false positives. This comprehensive protein family set has been used to address the following questions. (1) What is the structure coverage for currently known families? (2) How will the number of known apparent families grow as more genomes are sequenced? (3) What is a practical strategy for maximizing structure coverage in future? Our study indicates that approximately 20% of known families with three or more members currently have a representative structure. The study indicates also that the number of apparent protein families will be considerably larger than previously thought: We estimate that, by the criteria of this work, there will be about 250,000 protein families when 1000 microbial genomes have been sequenced. However, the vast majority of these families will be small, and it will be possible to obtain structural templates for 70-80% of protein domains with an achievable number of representative structures, by systematically sampling the larger families.

  16. A Segment of the Apospory-Specific Genomic Region Is Highly Microsyntenic Not Only between the Apomicts Pennisetum squamulatum and Buffelgrass, But Also with a Rice Chromosome 11 Centromeric-Proximal Genomic Region1[W

    PubMed Central

    Gualtieri, Gustavo; Conner, Joann A.; Morishige, Daryl T.; Moore, L. David; Mullet, John E.; Ozias-Akins, Peggy

    2006-01-01

    Bacterial artificial chromosome (BAC) clones from apomicts Pennisetum squamulatum and buffelgrass (Cenchrus ciliaris), isolated with the apospory-specific genomic region (ASGR) marker ugt197, were assembled into contigs that were extended by chromosome walking. Gene-like sequences from contigs were identified by shotgun sequencing and BLAST searches, and used to isolate orthologous rice contigs. Additional gene-like sequences in the apomicts' contigs were identified by bioinformatics using fully sequenced BACs from orthologous rice contigs as templates, as well as by interspecies, whole-contig cross-hybridizations. Hierarchical contig orthology was rapidly assessed by constructing detailed long-range contig molecular maps showing the distribution of gene-like sequences and markers, and searching for microsyntenic patterns of sequence identity and spatial distribution within and across species contigs. We found microsynteny between P. squamulatum and buffelgrass contigs. Importantly, this approach also enabled us to isolate from within the rice (Oryza sativa) genome contig Rice A, which shows the highest microsynteny and is most orthologous to the ugt197-containing C1C buffelgrass contig. Contig Rice A belongs to the rice genome database contig 77 (according to the current September 12, 2003, rice fingerprint contig build) that maps proximal to the chromosome 11 centromere, a feature that interestingly correlates with the mapping of ASGR-linked BACs proximal to the centromere or centromere-like sequences. Thus, relatedness between these two orthologous contigs is supported both by their molecular microstructure and by their centromeric-proximal location. Our discoveries promote the use of a microsynteny-based positional-cloning approach using the rice genome as a template to aid in constructing the ASGR toward the isolation of genes underlying apospory. PMID:16415213

  17. A segment of the apospory-specific genomic region is highly microsyntenic not only between the apomicts Pennisetum squamulatum and buffelgrass, but also with a rice chromosome 11 centromeric-proximal genomic region.

    PubMed

    Gualtieri, Gustavo; Conner, Joann A; Morishige, Daryl T; Moore, L David; Mullet, John E; Ozias-Akins, Peggy

    2006-03-01

    Bacterial artificial chromosome (BAC) clones from apomicts Pennisetum squamulatum and buffelgrass (Cenchrus ciliaris), isolated with the apospory-specific genomic region (ASGR) marker ugt197, were assembled into contigs that were extended by chromosome walking. Gene-like sequences from contigs were identified by shotgun sequencing and BLAST searches, and used to isolate orthologous rice contigs. Additional gene-like sequences in the apomicts' contigs were identified by bioinformatics using fully sequenced BACs from orthologous rice contigs as templates, as well as by interspecies, whole-contig cross-hybridizations. Hierarchical contig orthology was rapidly assessed by constructing detailed long-range contig molecular maps showing the distribution of gene-like sequences and markers, and searching for microsyntenic patterns of sequence identity and spatial distribution within and across species contigs. We found microsynteny between P. squamulatum and buffelgrass contigs. Importantly, this approach also enabled us to isolate from within the rice (Oryza sativa) genome contig Rice A, which shows the highest microsynteny and is most orthologous to the ugt197-containing C1C buffelgrass contig. Contig Rice A belongs to the rice genome database contig 77 (according to the current September 12, 2003, rice fingerprint contig build) that maps proximal to the chromosome 11 centromere, a feature that interestingly correlates with the mapping of ASGR-linked BACs proximal to the centromere or centromere-like sequences. Thus, relatedness between these two orthologous contigs is supported both by their molecular microstructure and by their centromeric-proximal location. Our discoveries promote the use of a microsynteny-based positional-cloning approach using the rice genome as a template to aid in constructing the ASGR toward the isolation of genes underlying apospory.

  18. Conserved Sequences at the Origin of Adenovirus DNA Replication

    PubMed Central

    Stillman, Bruce W.; Topp, William C.; Engler, Jeffrey A.

    1982-01-01

    The origin of adenovirus DNA replication lies within an inverted sequence repetition at either end of the linear, double-stranded viral DNA. Initiation of DNA replication is primed by a deoxynucleoside that is covalently linked to a protein, which remains bound to the newly synthesized DNA. We demonstrate that virion-derived DNA-protein complexes from five human adenovirus serological subgroups (A to E) can act as a template for both the initiation and the elongation of DNA replication in vitro, using nuclear extracts from adenovirus type 2 (Ad2)-infected HeLa cells. The heterologous template DNA-protein complexes were not as active as the homologous Ad2 DNA, most probably due to inefficient initiation by Ad2 replication factors. In an attempt to identify common features which may permit this replication, we have also sequenced the inverted terminal repeated DNA from human adenovirus serotypes Ad4 (group E), Ad9 and Ad10 (group D), and Ad31 (group A), and we have compared these to previously determined sequences from Ad2 and Ad5 (group C), Ad7 (group B), and Ad12 and Ad18 (group A) DNA. In all cases, the sequence around the origin of DNA replication can be divided into two structural domains: a proximal A · T-rich region which is partially conserved among these serotypes, and a distal G · C-rich region which is less well conserved. The G · C-rich region contains sequences similar to sequences present in papovavirus replication origins. The two domains may reflect a dual mechanism for initiation of DNA replication: adenovirus-specific protein priming of replication, and subsequent utilization of this primer by host replication factors for completion of DNA synthesis. Images PMID:7143575

  19. Preferential access to genetic information from endogenous hominin ancient DNA and accurate quantitative SNP-typing via SPEX

    PubMed Central

    Brotherton, Paul; Sanchez, Juan J.; Cooper, Alan; Endicott, Phillip

    2010-01-01

    The analysis of targeted genetic loci from ancient, forensic and clinical samples is usually built upon polymerase chain reaction (PCR)-generated sequence data. However, many studies have shown that PCR amplification from poor-quality DNA templates can create sequence artefacts at significant levels. With hominin (human and other hominid) samples, the pervasive presence of highly PCR-amplifiable human DNA contaminants in the vast majority of samples can lead to the creation of recombinant hybrids and other non-authentic artefacts. The resulting PCR-generated sequences can then be difficult, if not impossible, to authenticate. In contrast, single primer extension (SPEX)-based approaches can genotype single nucleotide polymorphisms from ancient fragments of DNA as accurately as modern DNA. A single SPEX-type assay can amplify just one of the duplex DNA strands at target loci and generate a multi-fold depth-of-coverage, with non-authentic recombinant hybrids reduced to undetectable levels. Crucially, SPEX-type approaches can preferentially access genetic information from damaged and degraded endogenous ancient DNA templates over modern human DNA contaminants. The development of SPEX-type assays offers the potential for highly accurate, quantitative genotyping from ancient hominin samples. PMID:19864251

  20. Molecular analysis of 16S rRNA genes identifies potentially periodontal pathogenic bacteria and archaea in the plaque of partially erupted third molars.

    PubMed

    Mansfield, J M; Campbell, J H; Bhandari, A R; Jesionowski, A M; Vickerman, M M

    2012-07-01

    Small subunit rRNA sequencing and phylogenetic analysis were used to identify cultivable and uncultivable microorganisms present in the dental plaque of symptomatic and asymptomatic partially erupted third molars to determine the prevalence of putative periodontal pathogens in pericoronal sites. Template DNA prepared from subgingival plaque collected from partially erupted symptomatic and asymptomatic mandibular third molars and healthy incisors was used in polymerase chain reaction with broad-range oligonucleotide primers to amplify 16S rRNA bacterial and archaeal genes. Amplicons were cloned, sequenced, and compared with known nucleotide sequences in online databases to identify the microorganisms present. Two thousand three hundred two clones from the plaque of 12 patients carried bacterial sequences from 63 genera belonging to 11 phyla, including members of the uncultivable TM7, SR1, and Chloroflexi, and difficult-to-cultivate Synergistetes and Spirochaetes. Dialister invisus, Filifactor alocis, Fusobacterium nucleatum, Porphyromonas endodontalis, Prevotella denticola, Tannerella forsythia, and Treponema denticola, which have been associated with periodontal disease, were found in significantly greater abundance in pericoronal compared with incisor sites. Dialister invisus and F nucleatum were found in greater abundance in sites exhibiting clinical symptoms. The archaeal species, Methanobrevibacter oralis, which has been associated with severe periodontitis, was found in 3 symptomatic patients. These findings have provided new insights into the complex microbiota of pericoronitis. Several bacterial and archaeal species implicated in periodontal disease were recovered in greater incidence and abundance from the plaque of partially erupted third molars compared with incisors, supporting the hypothesis that the pericoronal region may provide a favored niche for periodontal pathogens in otherwise healthy mouths. Copyright © 2012 American Association of Oral and Maxillofacial Surgeons. Published by Elsevier Inc. All rights reserved.

Top