Sample records for single molecule-based sequencing

  1. Single-Molecule Electrical Random Resequencing of DNA and RNA

    NASA Astrophysics Data System (ADS)

    Ohshiro, Takahito; Matsubara, Kazuki; Tsutsui, Makusu; Furuhashi, Masayuki; Taniguchi, Masateru; Kawai, Tomoji

    2012-07-01

    Two paradigm shifts in DNA sequencing technologies--from bulk to single molecules and from optical to electrical detection--are expected to realize label-free, low-cost DNA sequencing that does not require PCR amplification. It will lead to development of high-throughput third-generation sequencing technologies for personalized medicine. Although nanopore devices have been proposed as third-generation DNA-sequencing devices, a significant milestone in these technologies has been attained by demonstrating a novel technique for resequencing DNA using electrical signals. Here we report single-molecule electrical resequencing of DNA and RNA using a hybrid method of identifying single-base molecules via tunneling currents and random sequencing. Our method reads sequences of nine types of DNA oligomers. The complete sequence of 5'-UGAGGUA-3' from the let-7 microRNA family was also identified by creating a composite of overlapping fragment sequences, which was randomly determined using tunneling current conducted by single-base molecules as they passed between a pair of nanoelectrodes.

  2. Single molecule real-time (SMRT) sequencing comes of age: applications and utilities for medical diagnostics

    PubMed Central

    Ardui, Simon; Ameur, Adam; Vermeesch, Joris R; Hestand, Matthew S

    2018-01-01

    Abstract Short read massive parallel sequencing has emerged as a standard diagnostic tool in the medical setting. However, short read technologies have inherent limitations such as GC bias, difficulties mapping to repetitive elements, trouble discriminating paralogous sequences, and difficulties in phasing alleles. Long read single molecule sequencers resolve these obstacles. Moreover, they offer higher consensus accuracies and can detect epigenetic modifications from native DNA. The first commercially available long read single molecule platform was the RS system based on PacBio's single molecule real-time (SMRT) sequencing technology, which has since evolved into their RSII and Sequel systems. Here we capsulize how SMRT sequencing is revolutionizing constitutional, reproductive, cancer, microbial and viral genetic testing. PMID:29401301

  3. Assembly and diploid architecture of an individual human genome via single-molecule technologies

    PubMed Central

    Pendleton, Matthew; Sebra, Robert; Pang, Andy Wing Chun; Ummat, Ajay; Franzen, Oscar; Rausch, Tobias; Stütz, Adrian M; Stedman, William; Anantharaman, Thomas; Hastie, Alex; Dai, Heng; Fritz, Markus Hsi-Yang; Cao, Han; Cohain, Ariella; Deikus, Gintaras; Durrett, Russell E; Blanchard, Scott C; Altman, Roger; Chin, Chen-Shan; Guo, Yan; Paxinos, Ellen E; Korbel, Jan O; Darnell, Robert B; McCombie, W Richard; Kwok, Pui-Yan; Mason, Christopher E; Schadt, Eric E; Bashir, Ali

    2015-01-01

    We present the first comprehensive analysis of a diploid human genome that combines single-molecule sequencing with single-molecule genome maps. Our hybrid assembly markedly improves upon the contiguity observed from traditional shotgun sequencing approaches, with scaffold N50 values approaching 30 Mb, and we identified complex structural variants (SVs) missed by other high-throughput approaches. Furthermore, by combining Illumina short-read data with long reads, we phased both single-nucleotide variants and SVs, generating haplotypes with over 99% consistency with previous trio-based studies. Our work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality. PMID:26121404

  4. Assembly and diploid architecture of an individual human genome via single-molecule technologies.

    PubMed

    Pendleton, Matthew; Sebra, Robert; Pang, Andy Wing Chun; Ummat, Ajay; Franzen, Oscar; Rausch, Tobias; Stütz, Adrian M; Stedman, William; Anantharaman, Thomas; Hastie, Alex; Dai, Heng; Fritz, Markus Hsi-Yang; Cao, Han; Cohain, Ariella; Deikus, Gintaras; Durrett, Russell E; Blanchard, Scott C; Altman, Roger; Chin, Chen-Shan; Guo, Yan; Paxinos, Ellen E; Korbel, Jan O; Darnell, Robert B; McCombie, W Richard; Kwok, Pui-Yan; Mason, Christopher E; Schadt, Eric E; Bashir, Ali

    2015-08-01

    We present the first comprehensive analysis of a diploid human genome that combines single-molecule sequencing with single-molecule genome maps. Our hybrid assembly markedly improves upon the contiguity observed from traditional shotgun sequencing approaches, with scaffold N50 values approaching 30 Mb, and we identified complex structural variants (SVs) missed by other high-throughput approaches. Furthermore, by combining Illumina short-read data with long reads, we phased both single-nucleotide variants and SVs, generating haplotypes with over 99% consistency with previous trio-based studies. Our work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality.

  5. Quantum Point Contact Single-Nucleotide Conductance for DNA and RNA Sequence Identification.

    PubMed

    Afsari, Sepideh; Korshoj, Lee E; Abel, Gary R; Khan, Sajida; Chatterjee, Anushree; Nagpal, Prashant

    2017-11-28

    Several nanoscale electronic methods have been proposed for high-throughput single-molecule nucleic acid sequence identification. While many studies display a large ensemble of measurements as "electronic fingerprints" with some promise for distinguishing the DNA and RNA nucleobases (adenine, guanine, cytosine, thymine, and uracil), important metrics such as accuracy and confidence of base calling fall well below the current genomic methods. Issues such as unreliable metal-molecule junction formation, variation of nucleotide conformations, insufficient differences between the molecular orbitals responsible for single-nucleotide conduction, and lack of rigorous base calling algorithms lead to overlapping nanoelectronic measurements and poor nucleotide discrimination, especially at low coverage on single molecules. Here, we demonstrate a technique for reproducible conductance measurements on conformation-constrained single nucleotides and an advanced algorithmic approach for distinguishing the nucleobases. Our quantum point contact single-nucleotide conductance sequencing (QPICS) method uses combed and electrostatically bound single DNA and RNA nucleotides on a self-assembled monolayer of cysteamine molecules. We demonstrate that by varying the applied bias and pH conditions, molecular conductance can be switched ON and OFF, leading to reversible nucleotide perturbation for electronic recognition (NPER). We utilize NPER as a method to achieve >99.7% accuracy for DNA and RNA base calling at low molecular coverage (∼12×) using unbiased single measurements on DNA/RNA nucleotides, which represents a significant advance compared to existing sequencing methods. These results demonstrate the potential for utilizing simple surface modifications and existing biochemical moieties in individual nucleobases for a reliable, direct, single-molecule, nanoelectronic DNA and RNA nucleotide identification method for sequencing.

  6. Development of a reference material of a single DNA molecule for the quality control of PCR testing.

    PubMed

    Mano, Junichi; Hatano, Shuko; Futo, Satoshi; Yoshii, Junji; Nakae, Hiroki; Naito, Shigehiro; Takabatake, Reona; Kitta, Kazumi

    2014-09-02

    We developed a reference material of a single DNA molecule with a specific nucleotide sequence. The double-strand linear DNA which has PCR target sequences at the both ends was prepared as a reference DNA molecule, and we named the PCR targets on each side as confirmation sequence and standard sequence. The highly diluted solution of the reference molecule was dispensed into 96 wells of a plastic PCR plate to make the average number of molecules in a well below one. Subsequently, the presence or absence of the reference molecule in each well was checked by real-time PCR targeting for the confirmation sequence. After an enzymatic treatment of the reaction mixture in the positive wells for the digestion of PCR products, the resultant solution was used as the reference material of a single DNA molecule with the standard sequence. PCR analyses revealed that the prepared samples included only one reference molecule with high probability. The single-molecule reference material developed in this study will be useful for the absolute evaluation of a detection limit of PCR-based testing methods, the quality control of PCR analyses, performance evaluations of PCR reagents and instruments, and the preparation of an accurate calibration curve for real-time PCR quantitation.

  7. Nanopores and nucleic acids: prospects for ultrarapid sequencing

    NASA Technical Reports Server (NTRS)

    Deamer, D. W.; Akeson, M.

    2000-01-01

    DNA and RNA molecules can be detected as they are driven through a nanopore by an applied electric field at rates ranging from several hundred microseconds to a few milliseconds per molecule. The nanopore can rapidly discriminate between pyrimidine and purine segments along a single-stranded nucleic acid molecule. Nanopore detection and characterization of single molecules represents a new method for directly reading information encoded in linear polymers. If single-nucleotide resolution can be achieved, it is possible that nucleic acid sequences can be determined at rates exceeding a thousand bases per second.

  8. DNA and RNA sequencing by nanoscale reading through programmable electrophoresis and nanoelectrode-gated tunneling and dielectric detection

    DOEpatents

    Lee, James W.; Thundat, Thomas G.

    2005-06-14

    An apparatus and method for performing nucleic acid (DNA and/or RNA) sequencing on a single molecule. The genetic sequence information is obtained by probing through a DNA or RNA molecule base by base at nanometer scale as though looking through a strip of movie film. This DNA sequencing nanotechnology has the theoretical capability of performing DNA sequencing at a maximal rate of about 1,000,000 bases per second. This enhanced performance is made possible by a series of innovations including: novel applications of a fine-tuned nanometer gap for passage of a single DNA or RNA molecule; thin layer microfluidics for sample loading and delivery; and programmable electric fields for precise control of DNA or RNA movement. Detection methods include nanoelectrode-gated tunneling current measurements, dielectric molecular characterization, and atomic force microscopy/electrostatic force microscopy (AFM/EFM) probing for nanoscale reading of the nucleic acid sequences.

  9. Single-molecule study of thymidine glycol and i-motif through the alpha-hemolysin ion channel

    NASA Astrophysics Data System (ADS)

    He, Lidong

    Nanopore-based devices have emerged as a single-molecule detection and analysis tool for a wide range of applications. Through electrophoretically driving DNA molecules across a nanosized pore, a lot of information can be received, including unfolding kinetics and DNA-protein interactions. This single-molecule method has the potential to sequence kilobase length DNA polymers without amplification or labeling, approaching "the third generation" genome sequencing for around $1000 within 24 hours. alpha-Hemolysin biological nanopores have the advantages of excellent stability, low-noise level, and precise site-directed mutagenesis for engineering this protein nanopore. The first work presented in this thesis established the current signal of the thymidine glycol lesion in DNA oligomers through an immobilization experiment. The thymidine glycol enantiomers were differentiated from each other by different current blockage levels. Also, the effect of bulky hydrophobic adducts to the current blockage was investigated. Secondly, the alpha-hemolysin nanopore was used to study the human telomere i-motif and RET oncogene i-motif at a single-molecule level. In Chapter 3, it was demonstrated that the alpha-hemolysin nanopore can differentiate an i-motif form and single-strand DNA form at different pH values based on the same sequence. In addition, it shows potential to differentiate the folding topologies generated from the same DNA sequence.

  10. Long-range barcode labeling-sequencing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, Feng; Zhang, Tao; Singh, Kanwar K.

    Methods for sequencing single large DNA molecules by clonal multiple displacement amplification using barcoded primers. Sequences are binned based on barcode sequences and sequenced using a microdroplet-based method for sequencing large polynucleotide templates to enable assembly of haplotype-resolved complex genomes and metagenomes.

  11. Single molecule sequencing of the M13 virus genome without amplification

    PubMed Central

    Zhao, Luyang; Deng, Liwei; Li, Gailing; Jin, Huan; Cai, Jinsen; Shang, Huan; Li, Yan; Wu, Haomin; Xu, Weibin; Zeng, Lidong; Zhang, Renli; Zhao, Huan; Wu, Ping; Zhou, Zhiliang; Zheng, Jiao; Ezanno, Pierre; Yang, Andrew X.; Yan, Qin; Deem, Michael W.; He, Jiankui

    2017-01-01

    Next generation sequencing (NGS) has revolutionized life sciences research. However, GC bias and costly, time-intensive library preparation make NGS an ill fit for increasing sequencing demands in the clinic. A new class of third-generation sequencing platforms has arrived to meet this need, capable of directly measuring DNA and RNA sequences at the single-molecule level without amplification. Here, we use the new GenoCare single-molecule sequencing platform from Direct Genomics to sequence the genome of the M13 virus. Our platform detects single-molecule fluorescence by total internal reflection microscopy, with sequencing-by-synthesis chemistry. We sequenced the genome of M13 to a depth of 316x, with 100% coverage. We determined a consensus sequence accuracy of 100%. In contrast to GC bias inherent to NGS results, we demonstrated that our single-molecule sequencing method yields minimal GC bias. PMID:29253901

  12. Single molecule sequencing of the M13 virus genome without amplification.

    PubMed

    Zhao, Luyang; Deng, Liwei; Li, Gailing; Jin, Huan; Cai, Jinsen; Shang, Huan; Li, Yan; Wu, Haomin; Xu, Weibin; Zeng, Lidong; Zhang, Renli; Zhao, Huan; Wu, Ping; Zhou, Zhiliang; Zheng, Jiao; Ezanno, Pierre; Yang, Andrew X; Yan, Qin; Deem, Michael W; He, Jiankui

    2017-01-01

    Next generation sequencing (NGS) has revolutionized life sciences research. However, GC bias and costly, time-intensive library preparation make NGS an ill fit for increasing sequencing demands in the clinic. A new class of third-generation sequencing platforms has arrived to meet this need, capable of directly measuring DNA and RNA sequences at the single-molecule level without amplification. Here, we use the new GenoCare single-molecule sequencing platform from Direct Genomics to sequence the genome of the M13 virus. Our platform detects single-molecule fluorescence by total internal reflection microscopy, with sequencing-by-synthesis chemistry. We sequenced the genome of M13 to a depth of 316x, with 100% coverage. We determined a consensus sequence accuracy of 100%. In contrast to GC bias inherent to NGS results, we demonstrated that our single-molecule sequencing method yields minimal GC bias.

  13. Single-Molecule Denaturation Mapping of Genomic DNA in Nanofluidic Channels

    NASA Astrophysics Data System (ADS)

    Reisner, Walter; Larsen, Niels; Kristensen, Anders; Tegenfeldt, Jonas O.; Flyvbjerg, Henrik

    2009-03-01

    We have developed a new DNA barcoding technique based on the partial denaturation of extended fluorescently labeled DNA molecules. We partially melt DNA extended in nanofluidic channels via a combination of local heating and added chemical denaturants. The melted molecules, imaged via a standard fluorescence videomicroscopy setup, exhibit a nonuniform fluorescence profile corresponding to a series of local dips and peaks in the intensity trace along the stretched molecule. We show that this barcode is consistent with the presence of locally melted regions and can be explained by calculations of sequence-dependent melting probability. We believe this melting mapping technology is the first optically based single molecule technique sensitive to genome wide sequence variation that does not require an additional enzymatic labeling or restriction scheme.

  14. Single-Molecule Counting of Point Mutations by Transient DNA Binding

    NASA Astrophysics Data System (ADS)

    Su, Xin; Li, Lidan; Wang, Shanshan; Hao, Dandan; Wang, Lei; Yu, Changyuan

    2017-03-01

    High-confidence detection of point mutations is important for disease diagnosis and clinical practice. Hybridization probes are extensively used, but are hindered by their poor single-nucleotide selectivity. Shortening the length of DNA hybridization probes weakens the stability of the probe-target duplex, leading to transient binding between complementary sequences. The kinetics of probe-target binding events are highly dependent on the number of complementary base pairs. Here, we present a single-molecule assay for point mutation detection based on transient DNA binding and use of total internal reflection fluorescence microscopy. Statistical analysis of single-molecule kinetics enabled us to effectively discriminate between wild type DNA sequences and single-nucleotide variants at the single-molecule level. A higher single-nucleotide discrimination is achieved than in our previous work by optimizing the assay conditions, which is guided by statistical modeling of kinetics with a gamma distribution. The KRAS c.34 A mutation can be clearly differentiated from the wild type sequence (KRAS c.34 G) at a relative abundance as low as 0.01% mutant to WT. To demonstrate the feasibility of this method for analysis of clinically relevant biological samples, we used this technology to detect mutations in single-stranded DNA generated from asymmetric RT-PCR of mRNA from two cancer cell lines.

  15. Single molecule sequencing-guided scaffolding and correction of draft assemblies.

    PubMed

    Zhu, Shenglong; Chen, Danny Z; Emrich, Scott J

    2017-12-06

    Although single molecule sequencing is still improving, the lengths of the generated sequences are inevitably an advantage in genome assembly. Prior work that utilizes long reads to conduct genome assembly has mostly focused on correcting sequencing errors and improving contiguity of de novo assemblies. We propose a disassembling-reassembling approach for both correcting structural errors in the draft assembly and scaffolding a target assembly based on error-corrected single molecule sequences. To achieve this goal, we formulate a maximum alternating path cover problem. We prove that this problem is NP-hard, and solve it by a 2-approximation algorithm. Our experimental results show that our approach can improve the structural correctness of target assemblies in the cost of some contiguity, even with smaller amounts of long reads. In addition, our reassembling process can also serve as a competitive scaffolder relative to well-established assembly benchmarks.

  16. Real-time single-molecule electronic DNA sequencing by synthesis using polymer-tagged nucleotides on a nanopore array

    PubMed Central

    Fuller, Carl W.; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Bibillo, Arek; Stranges, P. Benjamin; Dorwart, Michael; Tao, Chuanjuan; Li, Zengmin; Guo, Wenjing; Shi, Shundi; Korenblum, Daniel; Trans, Andrew; Aguirre, Anne; Liu, Edward; Harada, Eric T.; Pollard, James; Bhat, Ashwini; Cech, Cynthia; Yang, Alexander; Arnold, Cleoma; Palla, Mirkó; Hovis, Jennifer; Chen, Roger; Morozova, Irina; Kalachikov, Sergey; Russo, James J.; Kasianowicz, John J.; Davis, Randy; Roever, Stefan; Church, George M.; Ju, Jingyue

    2016-01-01

    DNA sequencing by synthesis (SBS) offers a robust platform to decipher nucleic acid sequences. Recently, we reported a single-molecule nanopore-based SBS strategy that accurately distinguishes four bases by electronically detecting and differentiating four different polymer tags attached to the 5′-phosphate of the nucleotides during their incorporation into a growing DNA strand catalyzed by DNA polymerase. Further developing this approach, we report here the use of nucleotides tagged at the terminal phosphate with oligonucleotide-based polymers to perform nanopore SBS on an α-hemolysin nanopore array platform. We designed and synthesized several polymer-tagged nucleotides using tags that produce different electrical current blockade levels and verified they are active substrates for DNA polymerase. A highly processive DNA polymerase was conjugated to the nanopore, and the conjugates were complexed with primer/template DNA and inserted into lipid bilayers over individually addressable electrodes of the nanopore chip. When an incoming complementary-tagged nucleotide forms a tight ternary complex with the primer/template and polymerase, the tag enters the pore, and the current blockade level is measured. The levels displayed by the four nucleotides tagged with four different polymers captured in the nanopore in such ternary complexes were clearly distinguishable and sequence-specific, enabling continuous sequence determination during the polymerase reaction. Thus, real-time single-molecule electronic DNA sequencing data with single-base resolution were obtained. The use of these polymer-tagged nucleotides, combined with polymerase tethering to nanopores and multiplexed nanopore sensors, should lead to new high-throughput sequencing methods. PMID:27091962

  17. Method for rapid base sequencing in DNA and RNA with two base labeling

    DOEpatents

    Jett, J.H.; Keller, R.A.; Martin, J.C.; Posner, R.G.; Marrone, B.L.; Hammond, M.L.; Simpson, D.J.

    1995-04-11

    A method is described for rapid-base sequencing in DNA and RNA with two-base labeling and employing fluorescent detection of single molecules at two wavelengths. Bases modified to accept fluorescent labels are used to replicate a single DNA or RNA strand to be sequenced. The bases are then sequentially cleaved from the replicated strand, excited with a chosen spectrum of electromagnetic radiation, and the fluorescence from individual, tagged bases detected in the order of cleavage from the strand. 4 figures.

  18. Method for rapid base sequencing in DNA and RNA with two base labeling

    DOEpatents

    Jett, James H.; Keller, Richard A.; Martin, John C.; Posner, Richard G.; Marrone, Babetta L.; Hammond, Mark L.; Simpson, Daniel J.

    1995-01-01

    Method for rapid-base sequencing in DNA and RNA with two-base labeling and employing fluorescent detection of single molecules at two wavelengths. Bases modified to accept fluorescent labels are used to replicate a single DNA or RNA strand to be sequenced. The bases are then sequentially cleaved from the replicated strand, excited with a chosen spectrum of electromagnetic radiation, and the fluorescence from individual, tagged bases detected in the order of cleavage from the strand.

  19. Nanopore-based fourth-generation DNA sequencing technology.

    PubMed

    Feng, Yanxiao; Zhang, Yuechuan; Ying, Cuifeng; Wang, Deqiang; Du, Chunlei

    2015-02-01

    Nanopore-based sequencers, as the fourth-generation DNA sequencing technology, have the potential to quickly and reliably sequence the entire human genome for less than $1000, and possibly for even less than $100. The single-molecule techniques used by this technology allow us to further study the interaction between DNA and protein, as well as between protein and protein. Nanopore analysis opens a new door to molecular biology investigation at the single-molecule scale. In this article, we have reviewed academic achievements in nanopore technology from the past as well as the latest advances, including both biological and solid-state nanopores, and discussed their recent and potential applications. Copyright © 2015 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.

  20. Theoretical electrical conductivity of hydrogen-bonded benzamide-derived molecules and single DNA bases.

    PubMed

    Chen, Xiang

    2013-09-01

    A benzamide molecule is used as a "reader" molecule to form hydrogen bonds with five single DNA bases, i.e., four normal single DNA bases A,T,C,G and one for 5methylC. The whole molecule is then attached to the gold surface so that a meta-molecule junction is formed. We calculate the transmission function and conductance for the five metal-molecule systems, with the implementation of density functional theory-based non-equilibrium Green function method. Our results show that each DNA base exhibits a unique conductance and most of them are on the pS level. The distinguishable conductance of each DNA base provides a way for the fast sequencing of DNA. We also investigate the dependence of conductivity of such a metal-molecule system on the hydrogen bond length between the "reader" molecule and DNA base, which shows that conductance follows an exponential decay as the hydrogen bond length increases, i.e., the conductivity is highly sensitive to the change in hydrogen bond length.

  1. Toward the 1,000 dollars human genome.

    PubMed

    Bennett, Simon T; Barnes, Colin; Cox, Anthony; Davies, Lisa; Brown, Clive

    2005-06-01

    Revolutionary new technologies, capable of transforming the economics of sequencing, are providing an unparalleled opportunity to analyze human genetic variation comprehensively at the whole-genome level within a realistic timeframe and at affordable costs. Current estimates suggest that it would cost somewhere in the region of 30 million US dollars to sequence an entire human genome using Sanger-based sequencing, and on one machine it would take about 60 years. Solexa is widely regarded as a company with the necessary disruptive technology to be the first to achieve the ultimate goal of the so-called 1,000 dollars human genome - the conceptual cost-point needed for routine analysis of individual genomes. Solexa's technology is based on completely novel sequencing chemistry capable of sequencing billions of individual DNA molecules simultaneously, a base at a time, to enable highly accurate, low cost analysis of an entire human genome in a single experiment. When applied over a large enough genomic region, these new approaches to resequencing will enable the simultaneous detection and typing of known, as well as unknown, polymorphisms, and will also offer information about patterns of linkage disequilibrium in the population being studied. Technological progress, leading to the advent of single-molecule-based approaches, is beginning to dramatically drive down costs and increase throughput to unprecedented levels, each being several orders of magnitude better than that which is currently available. A new sequencing paradigm based on single molecules will be faster, cheaper and more sensitive, and will permit routine analysis at the whole-genome level.

  2. Recent patents of nanopore DNA sequencing technology: progress and challenges.

    PubMed

    Zhou, Jianfeng; Xu, Bingqian

    2010-11-01

    DNA sequencing techniques witnessed fast development in the last decades, primarily driven by the Human Genome Project. Among the proposed new techniques, Nanopore was considered as a suitable candidate for the single DNA sequencing with ultrahigh speed and very low cost. Several fabrication and modification techniques have been developed to produce robust and well-defined nanopore devices. Many efforts have also been done to apply nanopore to analyze the properties of DNA molecules. By comparing with traditional sequencing techniques, nanopore has demonstrated its distinctive superiorities in main practical issues, such as sample preparation, sequencing speed, cost-effective and read-length. Although challenges still remain, recent researches in improving the capabilities of nanopore have shed a light to achieve its ultimate goal: Sequence individual DNA strand at single nucleotide level. This patent review briefly highlights recent developments and technological achievements for DNA analysis and sequencing at single molecule level, focusing on nanopore based methods.

  3. Single Molecule Nano-Metronome

    PubMed Central

    Buranachai, Chittanon; McKinney, Sean A.; Ha, Taekjip

    2008-01-01

    We constructed a DNA-based nano-mechanical device called the nano-metronome. Our device is made by introducing complementary single stranded overhangs at the two arms of the DNA four-way junction. The ticking rates of this stochastic metronome depend on ion concentrations and can be changed by a set of DNA-based switches to deactivate/reactivate the sticky end. Since the device displays clearly distinguishable responses even with a single basepair difference, it may lead to a single molecule sensor of minute sequence differences of a target DNA. PMID:16522050

  4. [The principle and application of the single-molecule real-time sequencing technology].

    PubMed

    Yanhu, Liu; Lu, Wang; Li, Yu

    2015-03-01

    Last decade witnessed the explosive development of the third-generation sequencing strategy, including single-molecule real-time sequencing (SMRT), true single-molecule sequencing (tSMSTM) and the single-molecule nanopore DNA sequencing. In this review, we summarize the principle, performance and application of the SMRT sequencing technology. Compared with the traditional Sanger method and the next-generation sequencing (NGS) technologies, the SMRT approach has several advantages, including long read length, high speed, PCR-free and the capability of direct detection of epigenetic modifications. However, the disadvantage of its low accuracy, most of which resulted from insertions and deletions, is also notable. So, the raw sequence data need to be corrected before assembly. Up to now, the SMRT is a good fit for applications in the de novo genomic sequencing and the high-quality assemblies of small genomes. In the future, it is expected to play an important role in epigenetics, transcriptomic sequencing, and assemblies of large genomes.

  5. Correlation dynamics and enhanced signals for the identification of serial biomolecules and DNA bases.

    PubMed

    Ahmed, Towfiq; Haraldsen, Jason T; Rehr, John J; Di Ventra, Massimiliano; Schuller, Ivan; Balatsky, Alexander V

    2014-03-28

    Nanopore-based sequencing has demonstrated a significant potential for the development of fast, accurate, and cost-efficient fingerprinting techniques for next generation molecular detection and sequencing. We propose a specific multilayered graphene-based nanopore device architecture for the recognition of single biomolecules. Molecular detection and analysis can be accomplished through the detection of transverse currents as the molecule or DNA base translocates through the nanopore. To increase the overall signal-to-noise ratio and the accuracy, we implement a new 'multi-point cross-correlation' technique for identification of DNA bases or other molecules on the single molecular level. We demonstrate that the cross-correlations between each nanopore will greatly enhance the transverse current signal for each molecule. We implement first-principles transport calculations for DNA bases surveyed across a multilayered graphene nanopore system to illustrate the advantages of the proposed geometry. A time-series analysis of the cross-correlation functions illustrates the potential of this method for enhancing the signal-to-noise ratio. This work constitutes a significant step forward in facilitating fingerprinting of single biomolecules using solid state technology.

  6. Reducing assembly complexity of microbial genomes with single-molecule sequencing.

    PubMed

    Koren, Sergey; Harhay, Gregory P; Smith, Timothy P L; Bono, James L; Harhay, Dayna M; Mcvey, Scott D; Radune, Diana; Bergman, Nicholas H; Phillippy, Adam M

    2013-01-01

    The short reads output by first- and second-generation DNA sequencing instruments cannot completely reconstruct microbial chromosomes. Therefore, most genomes have been left unfinished due to the significant resources required to manually close gaps in draft assemblies. Third-generation, single-molecule sequencing addresses this problem by greatly increasing sequencing read length, which simplifies the assembly problem. To measure the benefit of single-molecule sequencing on microbial genome assembly, we sequenced and assembled the genomes of six bacteria and analyzed the repeat complexity of 2,267 complete bacteria and archaea. Our results indicate that the majority of known bacterial and archaeal genomes can be assembled without gaps, at finished-grade quality, using a single PacBio RS sequencing library. These single-library assemblies are also more accurate than typical short-read assemblies and hybrid assemblies of short and long reads. Automated assembly of long, single-molecule sequencing data reduces the cost of microbial finishing to $1,000 for most genomes, and future advances in this technology are expected to drive the cost lower. This is expected to increase the number of completed genomes, improve the quality of microbial genome databases, and enable high-fidelity, population-scale studies of pan-genomes and chromosomal organization.

  7. Hybrid error correction and de novo assembly of single-molecule sequencing reads

    PubMed Central

    Koren, Sergey; Schatz, Michael C.; Walenz, Brian P.; Martin, Jeffrey; Howard, Jason; Ganapathy, Ganeshkumar; Wang, Zhong; Rasko, David A.; McCombie, W. Richard; Jarvis, Erich D.; Phillippy, Adam M.

    2012-01-01

    Emerging single-molecule sequencing instruments can generate multi-kilobase sequences with the potential to dramatically improve genome and transcriptome assembly. However, the high error rate of single-molecule reads is challenging, and has limited their use to resequencing bacteria. To address this limitation, we introduce a novel correction algorithm and assembly strategy that utilizes shorter, high-identity sequences to correct the error in single-molecule sequences. We demonstrate the utility of this approach on Pacbio RS reads of phage, prokaryotic, and eukaryotic whole genomes, including the novel genome of the parrot Melopsittacus undulatus, as well as for RNA-seq reads of the corn (Zea mays) transcriptome. Our approach achieves over 99.9% read correction accuracy and produces substantially better assemblies than current sequencing strategies: in the best example, quintupling the median contig size relative to high-coverage, second-generation assemblies. Greater gains are predicted if read lengths continue to increase, including the prospect of single-contig bacterial chromosome assembly. PMID:22750884

  8. Highly sensitive detection of mutations in CHO cell recombinant DNA using multi-parallel single molecule real-time DNA sequencing.

    PubMed

    Cartwright, Joseph F; Anderson, Karin; Longworth, Joseph; Lobb, Philip; James, David C

    2018-06-01

    High-fidelity replication of biologic-encoding recombinant DNA sequences by engineered mammalian cell cultures is an essential pre-requisite for the development of stable cell lines for the production of biotherapeutics. However, immortalized mammalian cells characteristically exhibit an increased point mutation frequency compared to mammalian cells in vivo, both across their genomes and at specific loci (hotspots). Thus unforeseen mutations in recombinant DNA sequences can arise and be maintained within producer cell populations. These may affect both the stability of recombinant gene expression and give rise to protein sequence variants with variable bioactivity and immunogenicity. Rigorous quantitative assessment of recombinant DNA integrity should therefore form part of the cell line development process and be an essential quality assurance metric for instances where synthetic/multi-component assemblies are utilized to engineer mammalian cells, such as the assessment of recombinant DNA fidelity or the mutability of single-site integration target loci. Based on Pacific Biosciences (Menlo Park, CA) single molecule real-time (SMRT™) circular consensus sequencing (CCS) technology we developed a rDNA sequence analysis tool to process the multi-parallel sequencing of ∼40,000 single recombinant DNA molecules. After statistical filtering of raw sequencing data, we show that this analytical method is capable of detecting single point mutations in rDNA to a minimum single mutation frequency of 0.0042% (<1/24,000 bases). Using a stable CHO transfectant pool harboring a randomly integrated 5 kB plasmid construct encoding GFP we found that 28% of recombinant plasmid copies contained at least one low frequency (<0.3%) point mutation. These mutations were predominantly found in GC base pairs (85%) and that there was no positional bias in mutation across the plasmid sequence. There was no discernable difference between the mutation frequencies of coding and non-coding DNA. The putative ratio of non-synonymous and synonymous changes within the open reading frames (ORFs) in the plasmid sequence indicates that natural selection does not impact upon the prevalence of these mutations. Here we have demonstrated the abundance of mutations that fall outside of the reported range of detection of next generation sequencing (NGS) and second generation sequencing (SGS) platforms, providing a methodology capable of being utilized in cell line development platforms to identify the fidelity of recombinant genes throughout the production process. © 2018 Wiley Periodicals, Inc.

  9. Electrostatic melting in a single-molecule field-effect transistor with applications in genomic identification

    PubMed Central

    Vernick, Sefi; Trocchia, Scott M.; Warren, Steven B.; Young, Erik F.; Bouilly, Delphine; Gonzalez, Ruben L.; Nuckolls, Colin; Shepard, Kenneth L.

    2017-01-01

    The study of biomolecular interactions at the single-molecule level holds great potential for both basic science and biotechnology applications. Single-molecule studies often rely on fluorescence-based reporting, with signal levels limited by photon emission from single optical reporters. The point-functionalized carbon nanotube transistor, known as the single-molecule field-effect transistor, is a bioelectronics alternative based on intrinsic molecular charge that offers significantly higher signal levels for detection. Such devices are effective for characterizing DNA hybridization kinetics and thermodynamics and enabling emerging applications in genomic identification. In this work, we show that hybridization kinetics can be directly controlled by electrostatic bias applied between the device and the surrounding electrolyte. We perform the first single-molecule experiments demonstrating the use of electrostatics to control molecular binding. Using bias as a proxy for temperature, we demonstrate the feasibility of detecting various concentrations of 20-nt target sequences from the Ebolavirus nucleoprotein gene in a constant-temperature environment. PMID:28516911

  10. Quantum-Sequencing: Fast electronic single DNA molecule sequencing

    NASA Astrophysics Data System (ADS)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free, high-throughput and cost-effective, single-molecule sequencing method. Here, we present the first demonstration of unique ``electronic fingerprint'' of all nucleotides (A, G, T, C), with single-molecule DNA sequencing, using Quantum-tunneling Sequencing (Q-Seq) at room temperature. We show that the electronic state of the nucleobases shift depending on the pH, with most distinct states identified at acidic pH. We also demonstrate identification of single nucleotide modifications (methylation here). Using these unique electronic fingerprints (or tunneling data), we report a partial sequence of beta lactamase (bla) gene, which encodes resistance to beta-lactam antibiotics, with over 95% success rate. These results highlight the potential of Q-Seq as a robust technique for next-generation sequencing.

  11. UMI-tools: modeling sequencing errors in Unique Molecular Identifiers to improve quantification accuracy

    PubMed Central

    2017-01-01

    Unique Molecular Identifiers (UMIs) are random oligonucleotide barcodes that are increasingly used in high-throughput sequencing experiments. Through a UMI, identical copies arising from distinct molecules can be distinguished from those arising through PCR amplification of the same molecule. However, bioinformatic methods to leverage the information from UMIs have yet to be formalized. In particular, sequencing errors in the UMI sequence are often ignored or else resolved in an ad hoc manner. We show that errors in the UMI sequence are common and introduce network-based methods to account for these errors when identifying PCR duplicates. Using these methods, we demonstrate improved quantification accuracy both under simulated conditions and real iCLIP and single-cell RNA-seq data sets. Reproducibility between iCLIP replicates and single-cell RNA-seq clustering are both improved using our proposed network-based method, demonstrating the value of properly accounting for errors in UMIs. These methods are implemented in the open source UMI-tools software package. PMID:28100584

  12. ampliMethProfiler: a pipeline for the analysis of CpG methylation profiles of targeted deep bisulfite sequenced amplicons.

    PubMed

    Scala, Giovanni; Affinito, Ornella; Palumbo, Domenico; Florio, Ermanno; Monticelli, Antonella; Miele, Gennaro; Chiariotti, Lorenzo; Cocozza, Sergio

    2016-11-25

    CpG sites in an individual molecule may exist in a binary state (methylated or unmethylated) and each individual DNA molecule, containing a certain number of CpGs, is a combination of these states defining an epihaplotype. Classic quantification based approaches to study DNA methylation are intrinsically unable to fully represent the complexity of the underlying methylation substrate. Epihaplotype based approaches, on the other hand, allow methylation profiles of cell populations to be studied at the single molecule level. For such investigations, next-generation sequencing techniques can be used, both for quantitative and for epihaplotype analysis. Currently available tools for methylation analysis lack output formats that explicitly report CpG methylation profiles at the single molecule level and that have suited statistical tools for their interpretation. Here we present ampliMethProfiler, a python-based pipeline for the extraction and statistical epihaplotype analysis of amplicons from targeted deep bisulfite sequencing of multiple DNA regions. ampliMethProfiler tool provides an easy and user friendly way to extract and analyze the epihaplotype composition of reads from targeted bisulfite sequencing experiments. ampliMethProfiler is written in python language and requires a local installation of BLAST and (optionally) QIIME tools. It can be run on Linux and OS X platforms. The software is open source and freely available at http://amplimethprofiler.sourceforge.net .

  13. Quantum-Sequencing: Biophysics of quantum tunneling through nucleic acids

    NASA Astrophysics Data System (ADS)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    Tunneling microscopy and spectroscopy has extensively been used in physical surface sciences to study quantum tunneling to measure electronic local density of states of nanomaterials and to characterize adsorbed species. Quantum-Sequencing (Q-Seq) is a new method based on tunneling microscopy for electronic sequencing of single molecule of nucleic acids. A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free single-molecule sequencing method. Here, we present the unique ``electronic fingerprints'' for all nucleotides on DNA and RNA using Q-Seq along their intrinsic biophysical parameters. We have analyzed tunneling spectra for the nucleotides at different pH conditions and analyzed the HOMO, LUMO and energy gap for all of them. In addition we show a number of biophysical parameters to further characterize all nucleobases (electron and hole transition voltage and energy barriers). These results highlight the robustness of Q-Seq as a technique for next-generation sequencing.

  14. Universal digital high-resolution melt: a novel approach to broad-based profiling of heterogeneous biological samples.

    PubMed

    Fraley, Stephanie I; Hardick, Justin; Masek, Billie J; Jo Masek, Billie; Athamanolap, Pornpat; Rothman, Richard E; Gaydos, Charlotte A; Carroll, Karen C; Wakefield, Teresa; Wang, Tza-Huei; Yang, Samuel

    2013-10-01

    Comprehensive profiling of nucleic acids in genetically heterogeneous samples is important for clinical and basic research applications. Universal digital high-resolution melt (U-dHRM) is a new approach to broad-based PCR diagnostics and profiling technologies that can overcome issues of poor sensitivity due to contaminating nucleic acids and poor specificity due to primer or probe hybridization inaccuracies for single nucleotide variations. The U-dHRM approach uses broad-based primers or ligated adapter sequences to universally amplify all nucleic acid molecules in a heterogeneous sample, which have been partitioned, as in digital PCR. Extensive assay optimization enables direct sequence identification by algorithm-based matching of melt curve shape and Tm to a database of known sequence-specific melt curves. We show that single-molecule detection and single nucleotide sensitivity is possible. The feasibility and utility of U-dHRM is demonstrated through detection of bacteria associated with polymicrobial blood infection and microRNAs (miRNAs) associated with host response to infection. U-dHRM using broad-based 16S rRNA gene primers demonstrates universal single cell detection of bacterial pathogens, even in the presence of larger amounts of contaminating bacteria; U-dHRM using universally adapted Lethal-7 miRNAs in a heterogeneous mixture showcases the single copy sensitivity and single nucleotide specificity of this approach.

  15. Digital RNA sequencing minimizes sequence-dependent bias and amplification noise with optimized single-molecule barcodes

    PubMed Central

    Shiroguchi, Katsuyuki; Jia, Tony Z.; Sims, Peter A.; Xie, X. Sunney

    2012-01-01

    RNA sequencing (RNA-Seq) is a powerful tool for transcriptome profiling, but is hampered by sequence-dependent bias and inaccuracy at low copy numbers intrinsic to exponential PCR amplification. We developed a simple strategy for mitigating these complications, allowing truly digital RNA-Seq. Following reverse transcription, a large set of barcode sequences is added in excess, and nearly every cDNA molecule is uniquely labeled by random attachment of barcode sequences to both ends. After PCR, we applied paired-end deep sequencing to read the two barcodes and cDNA sequences. Rather than counting the number of reads, RNA abundance is measured based on the number of unique barcode sequences observed for a given cDNA sequence. We optimized the barcodes to be unambiguously identifiable, even in the presence of multiple sequencing errors. This method allows counting with single-copy resolution despite sequence-dependent bias and PCR-amplification noise, and is analogous to digital PCR but amendable to quantifying a whole transcriptome. We demonstrated transcriptome profiling of Escherichia coli with more accurate and reproducible quantification than conventional RNA-Seq. PMID:22232676

  16. Characterization of individual polynucleotide molecules using a membrane channel

    NASA Technical Reports Server (NTRS)

    Kasianowicz, J. J.; Brandin, E.; Branton, D.; Deamer, D. W.

    1996-01-01

    We show that an electric field can drive single-stranded RNA and DNA molecules through a 2.6-nm diameter ion channel in a lipid bilayer membrane. Because the channel diameter can accommodate only a single strand of RNA or DNA, each polymer traverses the membrane as an extended chain that partially blocks the channel. The passage of each molecule is detected as a transient decrease of ionic current whose duration is proportional to polymer length. Channel blockades can therefore be used to measure polynucleotide length. With further improvements, the method could in principle provide direct, high-speed detection of the sequence of bases in single molecules of DNA or RNA.

  17. Nanochannel Device with Embedded Nanopore: a New Approach for Single-Molecule DNA Analysis and Manipulation

    NASA Astrophysics Data System (ADS)

    Zhang, Yuning; Reisner, Walter

    2012-02-01

    Nanopore and nanochannel based devices are robust methods for biomolecular sensing and single DNA manipulation. Nanopore-based DNA sensing has attractive features that make it a leading candidate as a single-molecule DNA sequencing technology. Nanochannel based extension of DNA, combined with enzymatic or denaturation-based barcoding schemes, is already a powerful approach for genome analysis. We believe that there is revolutionary potential in devices that combine nanochannels with nanpore detectors. In particular, due to the fast translocation of a DNA molecule through a standard nanopore configuration, there is an unfavorable trade-off between signal and sequence resolution. With a combined nanochannel-nanopore device, based on embedding a nanopore inside a nanochannel, we can in principle gain independent control over both DNA translocation speed and sensing signal, solving the key draw-back of the standard nanopore configuration. We will discuss our recent progress on device fabrication and characterization. In particular, we demonstrate that we can detect - using fluorescent microscopy - successful translocation of DNA from the nanochannel out through the nanopore, a possible method to 'select' a given barcode for further analysis. In particular, we show that in equilibrium DNA will not escape through an embedded sub-persistence length nanopore, suggesting that the embedded pore could be used as a nanoscale window through which to interrogate a nanochannel extended DNA molecule.

  18. Nanochannel Device with Embedded Nanopore: a New Approach for Single-Molecule DNA Analysis and Manipulation

    NASA Astrophysics Data System (ADS)

    Zhang, Yuning; Reisner, Walter

    2013-03-01

    Nanopore and nanochannel based devices are robust methods for biomolecular sensing and single DNA manipulation. Nanopore-based DNA sensing has attractive features that make it a leading candidate as a single-molecule DNA sequencing technology. Nanochannel based extension of DNA, combined with enzymatic or denaturation-based barcoding schemes, is already a powerful approach for genome analysis. We believe that there is revolutionary potential in devices that combine nanochannels with embedded pore detectors. In particular, due to the fast translocation of a DNA molecule through a standard nanopore configuration, there is an unfavorable trade-off between signal and sequence resolution. With a combined nanochannel-nanopore device, based on embedding a pore inside a nanochannel, we can in principle gain independent control over both DNA translocation speed and sensing signal, solving the key draw-back of the standard nanopore configuration. We demonstrate that we can optically detect successful translocation of DNA from the nanochannel out through the nanopore, a possible method to 'select' a given barcode for further analysis. In particular, we show that in equilibrium DNA will not escape through an embedded sub-persistence length nanopore, suggesting that the pore could be used as a nanoscale window through which to interrogate a nanochannel extended DNA molecule. Furthermore, electrical measurements through the nanopore are performed, indicating that DNA sensing is feasible using the nanochannel-nanopore device.

  19. Design and characterization of a nanopore-coupled polymerase for single-molecule DNA sequencing by synthesis on an electrode array

    PubMed Central

    Stranges, P. Benjamin; Palla, Mirkó; Kalachikov, Sergey; Nivala, Jeff; Dorwart, Michael; Trans, Andrew; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Tao, Chuanjuan; Morozova, Irina; Li, Zengmin; Shi, Shundi; Aberra, Aman; Arnold, Cleoma; Yang, Alexander; Aguirre, Anne; Harada, Eric T.; Korenblum, Daniel; Pollard, James; Bhat, Ashwini; Gremyachinskiy, Dmitriy; Bibillo, Arek; Chen, Roger; Davis, Randy; Russo, James J.; Fuller, Carl W.; Roever, Stefan; Ju, Jingyue; Church, George M.

    2016-01-01

    Scalable, high-throughput DNA sequencing is a prerequisite for precision medicine and biomedical research. Recently, we presented a nanopore-based sequencing-by-synthesis (Nanopore-SBS) approach, which used a set of nucleotides with polymer tags that allow discrimination of the nucleotides in a biological nanopore. Here, we designed and covalently coupled a DNA polymerase to an α-hemolysin (αHL) heptamer using the SpyCatcher/SpyTag conjugation approach. These porin–polymerase conjugates were inserted into lipid bilayers on a complementary metal oxide semiconductor (CMOS)-based electrode array for high-throughput electrical recording of DNA synthesis. The designed nanopore construct successfully detected the capture of tagged nucleotides complementary to a DNA base on a provided template. We measured over 200 tagged-nucleotide signals for each of the four bases and developed a classification method to uniquely distinguish them from each other and background signals. The probability of falsely identifying a background event as a true capture event was less than 1.2%. In the presence of all four tagged nucleotides, we observed sequential additions in real time during polymerase-catalyzed DNA synthesis. Single-polymerase coupling to a nanopore, in combination with the Nanopore-SBS approach, can provide the foundation for a low-cost, single-molecule, electronic DNA-sequencing platform. PMID:27729524

  20. Distinguishing Individual DNA Bases in a Network by Non-Resonant Tip-Enhanced Raman Scattering.

    PubMed

    Zhang, Rui; Zhang, Xianbiao; Wang, Huifang; Zhang, Yao; Jiang, Song; Hu, Chunrui; Zhang, Yang; Luo, Yi; Dong, Zhenchao

    2017-05-08

    The importance of identifying DNA bases at the single-molecule level is well recognized for many biological applications. Although such identification can be achieved by electrical measurements using special setups, it is still not possible to identify single bases in real space by optical means owing to the diffraction limit. Herein, we demonstrate the outstanding ability of scanning tunneling microscope (STM)-controlled non-resonant tip-enhanced Raman scattering (TERS) to unambiguously distinguish two individual complementary DNA bases (adenine and thymine) with a spatial resolution down to 0.9 nm. The distinct Raman fingerprints identified for the two molecules allow to differentiate in real space individual DNA bases in coupled base pairs. The demonstrated ability of non-resonant Raman scattering with super-high spatial resolution will significantly extend the applicability of TERS, opening up new routes for single-molecule DNA sequencing. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  1. Efficient use of single molecule time traces to resolve kinetic rates, models and uncertainties

    NASA Astrophysics Data System (ADS)

    Schmid, Sonja; Hugel, Thorsten

    2018-03-01

    Single molecule time traces reveal the time evolution of unsynchronized kinetic systems. Especially single molecule Förster resonance energy transfer (smFRET) provides access to enzymatically important time scales, combined with molecular distance resolution and minimal interference with the sample. Yet the kinetic analysis of smFRET time traces is complicated by experimental shortcomings—such as photo-bleaching and noise. Here we recapitulate the fundamental limits of single molecule fluorescence that render the classic, dwell-time based kinetic analysis unsuitable. In contrast, our Single Molecule Analysis of Complex Kinetic Sequences (SMACKS) considers every data point and combines the information of many short traces in one global kinetic rate model. We demonstrate the potential of SMACKS by resolving the small kinetic effects caused by different ionic strengths in the chaperone protein Hsp90. These results show an unexpected interrelation between conformational dynamics and ATPase activity in Hsp90.

  2. Mapping DNA polymerase errors by single-molecule sequencing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lee, David F.; Lu, Jenny; Chang, Seungwoo

    Genomic integrity is compromised by DNA polymerase replication errors, which occur in a sequence-dependent manner across the genome. Accurate and complete quantification of a DNA polymerase's error spectrum is challenging because errors are rare and difficult to detect. We report a high-throughput sequencing assay to map in vitro DNA replication errors at the single-molecule level. Unlike previous methods, our assay is able to rapidly detect a large number of polymerase errors at base resolution over any template substrate without quantification bias. To overcome the high error rate of high-throughput sequencing, our assay uses a barcoding strategy in which each replicationmore » product is tagged with a unique nucleotide sequence before amplification. Here, this allows multiple sequencing reads of the same product to be compared so that sequencing errors can be found and removed. We demonstrate the ability of our assay to characterize the average error rate, error hotspots and lesion bypass fidelity of several DNA polymerases.« less

  3. Mapping DNA polymerase errors by single-molecule sequencing

    DOE PAGES

    Lee, David F.; Lu, Jenny; Chang, Seungwoo; ...

    2016-05-16

    Genomic integrity is compromised by DNA polymerase replication errors, which occur in a sequence-dependent manner across the genome. Accurate and complete quantification of a DNA polymerase's error spectrum is challenging because errors are rare and difficult to detect. We report a high-throughput sequencing assay to map in vitro DNA replication errors at the single-molecule level. Unlike previous methods, our assay is able to rapidly detect a large number of polymerase errors at base resolution over any template substrate without quantification bias. To overcome the high error rate of high-throughput sequencing, our assay uses a barcoding strategy in which each replicationmore » product is tagged with a unique nucleotide sequence before amplification. Here, this allows multiple sequencing reads of the same product to be compared so that sequencing errors can be found and removed. We demonstrate the ability of our assay to characterize the average error rate, error hotspots and lesion bypass fidelity of several DNA polymerases.« less

  4. Rapid method to detect duplex formation in sequencing by hybridization methods

    DOEpatents

    Mirzabekov, A.D.; Timofeev, E.N.; Florentiev, V.L.; Kirillov, E.V.

    1999-01-19

    A method for determining the existence of duplexes of oligonucleotide complementary molecules is provided. A plurality of immobilized oligonucleotide molecules, each of a specific length and each having a specific base sequence, is contacted with complementary, single stranded oligonucleotide molecules to form a duplex. Each duplex facilitates intercalation of a fluorescent dye between the base planes of the duplex. The invention also provides for a method for constructing oligonucleotide matrices comprising confining light sensitive fluid to a surface and exposing the light-sensitive fluid to a light pattern. This causes the fluid exposed to the light to coalesce into discrete units and adhere to the surface. This places each of the units in contact with a set of different oligonucleotide molecules so as to allow the molecules to disperse into the units. 13 figs.

  5. Rapid method to detect duplex formation in sequencing by hybridization methods

    DOEpatents

    Mirzabekov, Andrei Darievich; Timofeev, Edward Nikolaevich; Florentiev, Vladimer Leonidovich; Kirillov, Eugene Vladislavovich

    1999-01-01

    A method for determining the existence of duplexes of oligonucleotide complementary molecules is provided whereby a plurality of immobilized oligonucleotide molecules, each of a specific length and each having a specific base sequence, is contacted with complementary, single stranded oligonucleotide molecules to form a duplex so as to facilitate intercalation of a fluorescent dye between the base planes of the duplex. The invention also provides for a method for constructing oligonucleotide matrices comprising confining light sensitive fluid to a surface, exposing said light-sensitive fluid to a light pattern so as to cause the fluid exposed to the light to coalesce into discrete units and adhere to the surface; and contacting each of the units with a set of different oligonucleotide molecules so as to allow the molecules to disperse into the units.

  6. DNA origami-based shape IDs for single-molecule nanomechanical genotyping

    NASA Astrophysics Data System (ADS)

    Zhang, Honglu; Chao, Jie; Pan, Dun; Liu, Huajie; Qiang, Yu; Liu, Ke; Cui, Chengjun; Chen, Jianhua; Huang, Qing; Hu, Jun; Wang, Lianhui; Huang, Wei; Shi, Yongyong; Fan, Chunhai

    2017-04-01

    Variations on DNA sequences profoundly affect how we develop diseases and respond to pathogens and drugs. Atomic force microscopy (AFM) provides a nanomechanical imaging approach for genetic analysis with nanometre resolution. However, unlike fluorescence imaging that has wavelength-specific fluorophores, the lack of shape-specific labels largely hampers widespread applications of AFM imaging. Here we report the development of a set of differentially shaped, highly hybridizable self-assembled DNA origami nanostructures serving as shape IDs for magnified nanomechanical imaging of single-nucleotide polymorphisms. Using these origami shape IDs, we directly genotype single molecules of human genomic DNA with an ultrahigh resolution of ~10 nm and the multiplexing ability. Further, we determine three types of disease-associated, long-range haplotypes in samples from the Han Chinese population. Single-molecule analysis allows robust haplotyping even for samples with low labelling efficiency. We expect this generic shape ID-based nanomechanical approach to hold great potential in genetic analysis at the single-molecule level.

  7. DNA origami-based shape IDs for single-molecule nanomechanical genotyping

    PubMed Central

    Zhang, Honglu; Chao, Jie; Pan, Dun; Liu, Huajie; Qiang, Yu; Liu, Ke; Cui, Chengjun; Chen, Jianhua; Huang, Qing; Hu, Jun; Wang, Lianhui; Huang, Wei; Shi, Yongyong; Fan, Chunhai

    2017-01-01

    Variations on DNA sequences profoundly affect how we develop diseases and respond to pathogens and drugs. Atomic force microscopy (AFM) provides a nanomechanical imaging approach for genetic analysis with nanometre resolution. However, unlike fluorescence imaging that has wavelength-specific fluorophores, the lack of shape-specific labels largely hampers widespread applications of AFM imaging. Here we report the development of a set of differentially shaped, highly hybridizable self-assembled DNA origami nanostructures serving as shape IDs for magnified nanomechanical imaging of single-nucleotide polymorphisms. Using these origami shape IDs, we directly genotype single molecules of human genomic DNA with an ultrahigh resolution of ∼10 nm and the multiplexing ability. Further, we determine three types of disease-associated, long-range haplotypes in samples from the Han Chinese population. Single-molecule analysis allows robust haplotyping even for samples with low labelling efficiency. We expect this generic shape ID-based nanomechanical approach to hold great potential in genetic analysis at the single-molecule level. PMID:28382928

  8. Single Nucleobase Identification Using Biophysical Signatures from Nanoelectronic Quantum Tunneling.

    PubMed

    Korshoj, Lee E; Afsari, Sepideh; Khan, Sajida; Chatterjee, Anushree; Nagpal, Prashant

    2017-03-01

    Nanoelectronic DNA sequencing can provide an important alternative to sequencing-by-synthesis by reducing sample preparation time, cost, and complexity as a high-throughput next-generation technique with accurate single-molecule identification. However, sample noise and signature overlap continue to prevent high-resolution and accurate sequencing results. Probing the molecular orbitals of chemically distinct DNA nucleobases offers a path for facile sequence identification, but molecular entropy (from nucleotide conformations) makes such identification difficult when relying only on the energies of lowest-unoccupied and highest-occupied molecular orbitals (LUMO and HOMO). Here, nine biophysical parameters are developed to better characterize molecular orbitals of individual nucleobases, intended for single-molecule DNA sequencing using quantum tunneling of charges. For this analysis, theoretical models for quantum tunneling are combined with transition voltage spectroscopy to obtain measurable parameters unique to the molecule within an electronic junction. Scanning tunneling spectroscopy is then used to measure these nine biophysical parameters for DNA nucleotides, and a modified machine learning algorithm identified nucleobases. The new parameters significantly improve base calling over merely using LUMO and HOMO frontier orbital energies. Furthermore, high accuracies for identifying DNA nucleobases were observed at different pH conditions. These results have significant implications for developing a robust and accurate high-throughput nanoelectronic DNA sequencing technique. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  9. Secondary structure prediction and structure-specific sequence analysis of single-stranded DNA.

    PubMed

    Dong, F; Allawi, H T; Anderson, T; Neri, B P; Lyamichev, V I

    2001-08-01

    DNA sequence analysis by oligonucleotide binding is often affected by interference with the secondary structure of the target DNA. Here we describe an approach that improves DNA secondary structure prediction by combining enzymatic probing of DNA by structure-specific 5'-nucleases with an energy minimization algorithm that utilizes the 5'-nuclease cleavage sites as constraints. The method can identify structural differences between two DNA molecules caused by minor sequence variations such as a single nucleotide mutation. It also demonstrates the existence of long-range interactions between DNA regions separated by >300 nt and the formation of multiple alternative structures by a 244 nt DNA molecule. The differences in the secondary structure of DNA molecules revealed by 5'-nuclease probing were used to design structure-specific probes for mutation discrimination that target the regions of structural, rather than sequence, differences. We also demonstrate the performance of structure-specific 'bridge' probes complementary to non-contiguous regions of the target molecule. The structure-specific probes do not require the high stringency binding conditions necessary for methods based on mismatch formation and permit mutation detection at temperatures from 4 to 37 degrees C. Structure-specific sequence analysis is applied for mutation detection in the Mycobacterium tuberculosis katG gene and for genotyping of the hepatitis C virus.

  10. A dynamic bead-based microarray for parallel DNA detection

    NASA Astrophysics Data System (ADS)

    Sochol, R. D.; Casavant, B. P.; Dueck, M. E.; Lee, L. P.; Lin, L.

    2011-05-01

    A microfluidic system has been designed and constructed by means of micromachining processes to integrate both microfluidic mixing of mobile microbeads and hydrodynamic microbead arraying capabilities on a single chip to simultaneously detect multiple bio-molecules. The prototype system has four parallel reaction chambers, which include microchannels of 18 × 50 µm2 cross-sectional area and a microfluidic mixing section of 22 cm length. Parallel detection of multiple DNA oligonucleotide sequences was achieved via molecular beacon probes immobilized on polystyrene microbeads of 16 µm diameter. Experimental results show quantitative detection of three distinct DNA oligonucleotide sequences from the Hepatitis C viral (HCV) genome with single base-pair mismatch specificity. Our dynamic bead-based microarray offers an effective microfluidic platform to increase parallelization of reactions and improve microbead handling for various biological applications, including bio-molecule detection, medical diagnostics and drug screening.

  11. Single Molecule Bioelectronics and Their Application to Amplification-Free Measurement of DNA Lengths

    PubMed Central

    Gül, O. Tolga; Pugliese, Kaitlin M.; Choi, Yongki; Sims, Patrick C.; Pan, Deng; Rajapakse, Arith J.; Weiss, Gregory A.; Collins, Philip G.

    2016-01-01

    As biosensing devices shrink smaller and smaller, they approach a scale in which single molecule electronic sensing becomes possible. Here, we review the operation of single-enzyme transistors made using single-walled carbon nanotubes. These novel hybrid devices transduce the motions and catalytic activity of a single protein into an electronic signal for real-time monitoring of the protein’s activity. Analysis of these electronic signals reveals new insights into enzyme function and proves the electronic technique to be complementary to other single-molecule methods based on fluorescence. As one example of the nanocircuit technique, we have studied the Klenow Fragment (KF) of DNA polymerase I as it catalytically processes single-stranded DNA templates. The fidelity of DNA polymerases makes them a key component in many DNA sequencing techniques, and here we demonstrate that KF nanocircuits readily resolve DNA polymerization with single-base sensitivity. Consequently, template lengths can be directly counted from electronic recordings of KF’s base-by-base activity. After measuring as few as 20 copies, the template length can be determined with <1 base pair resolution, and different template lengths can be identified and enumerated in solutions containing template mixtures. PMID:27348011

  12. Single Molecule Bioelectronics and Their Application to Amplification-Free Measurement of DNA Lengths.

    PubMed

    Gül, O Tolga; Pugliese, Kaitlin M; Choi, Yongki; Sims, Patrick C; Pan, Deng; Rajapakse, Arith J; Weiss, Gregory A; Collins, Philip G

    2016-06-24

    As biosensing devices shrink smaller and smaller, they approach a scale in which single molecule electronic sensing becomes possible. Here, we review the operation of single-enzyme transistors made using single-walled carbon nanotubes. These novel hybrid devices transduce the motions and catalytic activity of a single protein into an electronic signal for real-time monitoring of the protein's activity. Analysis of these electronic signals reveals new insights into enzyme function and proves the electronic technique to be complementary to other single-molecule methods based on fluorescence. As one example of the nanocircuit technique, we have studied the Klenow Fragment (KF) of DNA polymerase I as it catalytically processes single-stranded DNA templates. The fidelity of DNA polymerases makes them a key component in many DNA sequencing techniques, and here we demonstrate that KF nanocircuits readily resolve DNA polymerization with single-base sensitivity. Consequently, template lengths can be directly counted from electronic recordings of KF's base-by-base activity. After measuring as few as 20 copies, the template length can be determined with <1 base pair resolution, and different template lengths can be identified and enumerated in solutions containing template mixtures.

  13. Ultrafast DNA sequencing on a microchip by a hybrid separation mechanism that gives 600 bases in 6.5 minutes.

    PubMed

    Fredlake, Christopher P; Hert, Daniel G; Kan, Cheuk-Wai; Chiesl, Thomas N; Root, Brian E; Forster, Ryan E; Barron, Annelise E

    2008-01-15

    To realize the immense potential of large-scale genomic sequencing after the completion of the second human genome (Venter's), the costs for the complete sequencing of additional genomes must be dramatically reduced. Among the technologies being developed to reduce sequencing costs, microchip electrophoresis is the only new technology ready to produce the long reads most suitable for the de novo sequencing and assembly of large and complex genomes. Compared with the current paradigm of capillary electrophoresis, microchip systems promise to reduce sequencing costs dramatically by increasing throughput, reducing reagent consumption, and integrating the many steps of the sequencing pipeline onto a single platform. Although capillary-based systems require approximately 70 min to deliver approximately 650 bases of contiguous sequence, we report sequencing up to 600 bases in just 6.5 min by microchip electrophoresis with a unique polymer matrix/adsorbed polymer wall coating combination. This represents a two-thirds reduction in sequencing time over any previously published chip sequencing result, with comparable read length and sequence quality. We hypothesize that these ultrafast long reads on chips can be achieved because the combined polymer system engenders a recently discovered "hybrid" mechanism of DNA electromigration, in which DNA molecules alternate rapidly between repeating through the intact polymer network and disrupting network entanglements to drag polymers through the solution, similar to dsDNA dynamics we observe in single-molecule DNA imaging studies. Most importantly, these results reveal the surprisingly powerful ability of microchip electrophoresis to provide ultrafast Sanger sequencing, which will translate to increased system throughput and reduced costs.

  14. Ultrafast DNA sequencing on a microchip by a hybrid separation mechanism that gives 600 bases in 6.5 minutes

    PubMed Central

    Fredlake, Christopher P.; Hert, Daniel G.; Kan, Cheuk-Wai; Chiesl, Thomas N.; Root, Brian E.; Forster, Ryan E.; Barron, Annelise E.

    2008-01-01

    To realize the immense potential of large-scale genomic sequencing after the completion of the second human genome (Venter's), the costs for the complete sequencing of additional genomes must be dramatically reduced. Among the technologies being developed to reduce sequencing costs, microchip electrophoresis is the only new technology ready to produce the long reads most suitable for the de novo sequencing and assembly of large and complex genomes. Compared with the current paradigm of capillary electrophoresis, microchip systems promise to reduce sequencing costs dramatically by increasing throughput, reducing reagent consumption, and integrating the many steps of the sequencing pipeline onto a single platform. Although capillary-based systems require ≈70 min to deliver ≈650 bases of contiguous sequence, we report sequencing up to 600 bases in just 6.5 min by microchip electrophoresis with a unique polymer matrix/adsorbed polymer wall coating combination. This represents a two-thirds reduction in sequencing time over any previously published chip sequencing result, with comparable read length and sequence quality. We hypothesize that these ultrafast long reads on chips can be achieved because the combined polymer system engenders a recently discovered “hybrid” mechanism of DNA electromigration, in which DNA molecules alternate rapidly between reptating through the intact polymer network and disrupting network entanglements to drag polymers through the solution, similar to dsDNA dynamics we observe in single-molecule DNA imaging studies. Most importantly, these results reveal the surprisingly powerful ability of microchip electrophoresis to provide ultrafast Sanger sequencing, which will translate to increased system throughput and reduced costs. PMID:18184818

  15. Identification of Biomolecular Building Blocks by Recognition Tunneling: Stride towards Nanopore Sequencing of Biomolecules

    NASA Astrophysics Data System (ADS)

    Sen, Suman

    DNA, RNA and Protein are three pivotal biomolecules in human and other organisms, playing decisive roles in functionality, appearance, diseases development and other physiological phenomena. Hence, sequencing of these biomolecules acquires the prime interest in the scientific community. Single molecular identification of their building blocks can be done by a technique called Recognition Tunneling (RT) based on Scanning Tunneling Microscope (STM). A single layer of specially designed recognition molecule is attached to the STM electrodes, which trap the targeted molecules (DNA nucleoside monophosphates, RNA nucleoside monophosphates or amino acids) inside the STM nanogap. Depending on their different binding interactions with the recognition molecules, the analyte molecules generate stochastic signal trains accommodating their "electronic fingerprints". Signal features are used to detect the molecules using a machine learning algorithm and different molecules can be identified with significantly high accuracy. This, in turn, paves the way for rapid, economical nanopore sequencing platform, overcoming the drawbacks of Next Generation Sequencing (NGS) techniques. To read DNA nucleotides with high accuracy in an STM tunnel junction a series of nitrogen-based heterocycles were designed and examined to check their capabilities to interact with naturally occurring DNA nucleotides by hydrogen bonding in the tunnel junction. These recognition molecules are Benzimidazole, Imidazole, Triazole and Pyrrole. Benzimidazole proved to be best among them showing DNA nucleotide classification accuracy close to 99%. Also, Imidazole reader can read an abasic monophosphate (AP), a product from depurination or depyrimidination that occurs 10,000 times per human cell per day. In another study, I have investigated a new universal reader, 1-(2-mercaptoethyl)pyrene (Pyrene reader) based on stacking interactions, which should be more specific to the canonical DNA nucleosides. In addition, Pyrene reader showed higher DNA base-calling accuracy compare to Imidazole reader, the workhorse in our previous projects. In my other projects, various amino acids and RNA nucleoside monophosphates were also classified with significantly high accuracy using RT. Twenty naturally occurring amino acids and various RNA nucleosides (four canonical and two modified) were successfully identified. Thus, we envision nanopore sequencing biomolecules using Recognition Tunneling (RT) that should provide comprehensive betterment over current technologies in terms of time, chemical and instrumental cost and capability of de novo sequencing.

  16. Direct Detection and Sequencing of Damaged DNA Bases

    PubMed Central

    2011-01-01

    Products of various forms of DNA damage have been implicated in a variety of important biological processes, such as aging, neurodegenerative diseases, and cancer. Therefore, there exists great interest to develop methods for interrogating damaged DNA in the context of sequencing. Here, we demonstrate that single-molecule, real-time (SMRT®) DNA sequencing can directly detect damaged DNA bases in the DNA template - as a by-product of the sequencing method - through an analysis of the DNA polymerase kinetics that are altered by the presence of a modified base. We demonstrate the sequencing of several DNA templates containing products of DNA damage, including 8-oxoguanine, 8-oxoadenine, O6-methylguanine, 1-methyladenine, O4-methylthymine, 5-hydroxycytosine, 5-hydroxyuracil, 5-hydroxymethyluracil, or thymine dimers, and show that these base modifications can be readily detected with single-modification resolution and DNA strand specificity. We characterize the distinct kinetic signatures generated by these DNA base modifications. PMID:22185597

  17. Direct detection and sequencing of damaged DNA bases.

    PubMed

    Clark, Tyson A; Spittle, Kristi E; Turner, Stephen W; Korlach, Jonas

    2011-12-20

    Products of various forms of DNA damage have been implicated in a variety of important biological processes, such as aging, neurodegenerative diseases, and cancer. Therefore, there exists great interest to develop methods for interrogating damaged DNA in the context of sequencing. Here, we demonstrate that single-molecule, real-time (SMRT®) DNA sequencing can directly detect damaged DNA bases in the DNA template - as a by-product of the sequencing method - through an analysis of the DNA polymerase kinetics that are altered by the presence of a modified base. We demonstrate the sequencing of several DNA templates containing products of DNA damage, including 8-oxoguanine, 8-oxoadenine, O6-methylguanine, 1-methyladenine, O4-methylthymine, 5-hydroxycytosine, 5-hydroxyuracil, 5-hydroxymethyluracil, or thymine dimers, and show that these base modifications can be readily detected with single-modification resolution and DNA strand specificity. We characterize the distinct kinetic signatures generated by these DNA base modifications.

  18. Rapid method to detect duplex formation in sequencing by hybridization methods, a method for constructing containment structures for reagent interaction

    DOEpatents

    Mirzabekov, Andrei Darievich; Yershov, Gennadiy Moiseyevich; Guschin, Dmitry Yuryevich; Gemmell, Margaret Anne; Shick, Valentine V.; Proudnikov, Dmitri Y.; Timofeev, Edward N.

    2002-01-01

    A method for determining the existence of duplexes of oligonucleotide complementary molecules is provided whereby a plurality of immobilized oligonucleotide molecules, each of a specific length and each having a specific base sequence, is contacted with complementary, single stranded oligonucleotide molecules to form a duplex so as to facilitate intercalation of a fluorescent dye between the base planes of the duplex. The invention also provides for a method for constructing oligonucleotide matrices comprising confining light sensitive fluid to a surface, exposing said light-sensitive fluid to a light pattern so as to cause the fluid exposed to the light to polymerize into discrete units and adhere to the surface; and contacting each of the units with a set of different oligonucleotide molecules so as to allow the molecules to disperse into the units.

  19. A reference bacterial genome dataset generated on the MinION™ portable single-molecule nanopore sequencer.

    PubMed

    Quick, Joshua; Quinlan, Aaron R; Loman, Nicholas J

    2014-01-01

    The MinION™ is a new, portable single-molecule sequencer developed by Oxford Nanopore Technologies. It measures four inches in length and is powered from the USB 3.0 port of a laptop computer. The MinION™ measures the change in current resulting from DNA strands interacting with a charged protein nanopore. These measurements can then be used to deduce the underlying nucleotide sequence. We present a read dataset from whole-genome shotgun sequencing of the model organism Escherichia coli K-12 substr. MG1655 generated on a MinION™ device during the early-access MinION™ Access Program (MAP). Sequencing runs of the MinION™ are presented, one generated using R7 chemistry (released in July 2014) and one using R7.3 (released in September 2014). Base-called sequence data are provided to demonstrate the nature of data produced by the MinION™ platform and to encourage the development of customised methods for alignment, consensus and variant calling, de novo assembly and scaffolding. FAST5 files containing event data within the HDF5 container format are provided to assist with the development of improved base-calling methods.

  20. Assessing the performance of the Oxford Nanopore Technologies MinION

    PubMed Central

    Laver, T.; Harrison, J.; O’Neill, P.A.; Moore, K.; Farbos, A.; Paszkiewicz, K.; Studholme, D.J.

    2015-01-01

    The Oxford Nanopore Technologies (ONT) MinION is a new sequencing technology that potentially offers read lengths of tens of kilobases (kb) limited only by the length of DNA molecules presented to it. The device has a low capital cost, is by far the most portable DNA sequencer available, and can produce data in real-time. It has numerous prospective applications including improving genome sequence assemblies and resolution of repeat-rich regions. Before such a technology is widely adopted, it is important to assess its performance and limitations in respect of throughput and accuracy. In this study we assessed the performance of the MinION by re-sequencing three bacterial genomes, with very different nucleotide compositions ranging from 28.6% to 70.7%; the high G + C strain was underrepresented in the sequencing reads. We estimate the error rate of the MinION (after base calling) to be 38.2%. Mean and median read lengths were 2 kb and 1 kb respectively, while the longest single read was 98 kb. The whole length of a 5 kb rRNA operon was covered by a single read. As the first nanopore-based single molecule sequencer available to researchers, the MinION is an exciting prospect; however, the current error rate limits its ability to compete with existing sequencing technologies, though we do show that MinION sequence reads can enhance contiguity of de novo assembly when used in conjunction with Illumina MiSeq data. PMID:26753127

  1. Development of Single-Molecule DNA Sequencing Platform Based on Single-Molecule Electrical Conductance

    DTIC Science & Technology

    2015-05-25

    nanoparticles , Nature Nanotechnology 7, 197-203. 11. Dreaden, E. C., Alkilany, A. M., Huang, X. H., Murphy, C. J., and El-Sayed, M. A. (2012) The...13840-13851. 14. Llevot, A., and Astruc, D. (2012) Applications of vectorized gold nanoparticles to the diagnosis and therapy of cancer , Chem. Soc. Rev...caused by the injection of gold nanoparticles , Nanotechnology 21, 485102. 25. Dykman, L. A., Matora, L. Y., and Bogatyrev, V. A. (1996) Use of

  2. Diff-seq: A high throughput sequencing-based mismatch detection assay for DNA variant enrichment and discovery

    PubMed Central

    Karas, Vlad O; Sinnott-Armstrong, Nicholas A; Varghese, Vici; Shafer, Robert W; Greenleaf, William J; Sherlock, Gavin

    2018-01-01

    Abstract Much of the within species genetic variation is in the form of single nucleotide polymorphisms (SNPs), typically detected by whole genome sequencing (WGS) or microarray-based technologies. However, WGS produces mostly uninformative reads that perfectly match the reference, while microarrays require genome-specific reagents. We have developed Diff-seq, a sequencing-based mismatch detection assay for SNP discovery without the requirement for specialized nucleic-acid reagents. Diff-seq leverages the Surveyor endonuclease to cleave mismatched DNA molecules that are generated after cross-annealing of a complex pool of DNA fragments. Sequencing libraries enriched for Surveyor-cleaved molecules result in increased coverage at the variant sites. Diff-seq detected all mismatches present in an initial test substrate, with specific enrichment dependent on the identity and context of the variation. Application to viral sequences resulted in increased observation of variant alleles in a biologically relevant context. Diff-Seq has the potential to increase the sensitivity and efficiency of high-throughput sequencing in the detection of variation. PMID:29361139

  3. Noninvasive prenatal testing for Wilson disease by use of circulating single-molecule amplification and resequencing technology (cSMART).

    PubMed

    Lv, Weigang; Wei, Xianda; Guo, Ruolan; Liu, Qin; Zheng, Yu; Chang, Jiazhen; Bai, Ting; Li, Haoxian; Zhang, Jianguang; Song, Zhuo; Cram, David S; Liang, Desheng; Wu, Lingqian

    2015-01-01

    Noninvasive prenatal testing (NIPT) for monogenic diseases by use of PCR-based strategies requires precise quantification of mutant fetal alleles circulating in the maternal plasma. The study describes the development and validation of a novel assay termed circulating single-molecule amplification and resequencing technology (cSMART) for counting single allelic molecules in plasma. Here we demonstrate the suitability of cSMART for NIPT, with Wilson Disease (WD) as proof of concept. We used Sanger and whole-exome sequencing to identify familial ATP7B (ATPase, Cu(++) transporting, β polypeptide) gene mutations. For cSMART, single molecules were tagged with unique barcodes and circularized, and alleles were targeted and replicated by inverse PCR. The unique single allelic molecules were identified by sequencing and counted, and the percentage of mutant alleles in the original maternal plasma sample was used to determine fetal genotypes. Four families with WD pedigrees consented to the study. Using Sanger and whole-exome sequencing, we mapped the pathogenic ATP7B mutations in each pedigree and confirmed the proband's original diagnosis of WD. After validation of cSMART with defined plasma models mimicking fetal inheritance of paternal, maternal, or both parental mutant alleles, we retrospectively showed in second pregnancies that the fetal genotypes assigned by invasive testing and NIPT were concordant. We developed a reliable and accurate NIPT assay that correctly diagnosed the fetal genotypes in 4 pregnancies at risk for WD. This novel technology has potential as a universal strategy for NIPT of other monogenic disorders, since it requires only knowledge of the parental pathogenic mutations. © 2014 American Association for Clinical Chemistry.

  4. Homogeneous assay of target molecules based on chemiluminescence resonance energy transfer (CRET) using DNAzyme-linked aptamers.

    PubMed

    Mun, Hyoyoung; Jo, Eun-Jung; Li, Taihua; Joung, Hyou-Arm; Hong, Dong-Gu; Shim, Won-Bo; Jung, Cheulhee; Kim, Min-Gon

    2014-08-15

    We have designed a single-stranded DNAzyme-aptamer sensor for homogeneous target molecular detection based on chemiluminescence resonance energy transfer (CRET). The structure of the engineered single-stranded DNA (ssDNA) includes the horseradish peroxidase (HRP)-like DNAzyme, optimum-length linker (10-mer-length DNA), and target-specific aptamer sequences. A quencher dye was modified at the 3' end of the aptamer sequence. The incorporation of hemin into the G-quadruplex structure of DNAzyme yields an active HRP-like activity that catalyzes luminol to generate a chemiluminescence (CL) signal. In the presence of target molecules, such as ochratoxin A (OTA), adenosine triphosphate (ATP), or thrombin, the aptamer sequence was folded due to the formation of the aptamer/analyte complex, which induced the quencher dye close to the DNAzyme structure. Consequently, the CRET occurred between a DNAzyme-catalyzed chemiluminescence reaction and the quencher dye. Our results showed that CRET-based DNAzyme-aptamer biosensing enabled specific OTA analysis with a limit of detection of 0.27ng/mL. The CRET platform needs no external light source and avoids autofluorescence and photobleaching, and target molecules can be detected specifically and sensitively in a homogeneous manner. Copyright © 2014 Elsevier B.V. All rights reserved.

  5. Sequencing of adenine in DNA by scanning tunneling microscopy

    NASA Astrophysics Data System (ADS)

    Tanaka, Hiroyuki; Taniguchi, Masateru

    2017-08-01

    The development of DNA sequencing technology utilizing the detection of a tunnel current is important for next-generation sequencer technologies based on single-molecule analysis technology. Using a scanning tunneling microscope, we previously reported that dI/dV measurements and dI/dV mapping revealed that the guanine base (purine base) of DNA adsorbed onto the Cu(111) surface has a characteristic peak at V s = -1.6 V. If, in addition to guanine, the other purine base of DNA, namely, adenine, can be distinguished, then by reading all the purine bases of each single strand of a DNA double helix, the entire base sequence of the original double helix can be determined due to the complementarity of the DNA base pair. Therefore, the ability to read adenine is important from the viewpoint of sequencing. Here, we report on the identification of adenine by STM topographic and spectroscopic measurements using a synthetic DNA oligomer and viral DNA.

  6. Antibody-Mediated Small Molecule Detection Using Programmable DNA-Switches.

    PubMed

    Rossetti, Marianna; Ippodrino, Rudy; Marini, Bruna; Palleschi, Giuseppe; Porchetta, Alessandro

    2018-06-13

    The development of rapid, cost-effective, and single-step methods for the detection of small molecules is crucial for improving the quality and efficiency of many applications ranging from life science to environmental analysis. Unfortunately, current methodologies still require multiple complex, time-consuming washing and incubation steps, which limit their applicability. In this work we present a competitive DNA-based platform that makes use of both programmable DNA-switches and antibodies to detect small target molecules. The strategy exploits both the advantages of proximity-based methods and structure-switching DNA-probes. The platform is modular and versatile and it can potentially be applied for the detection of any small target molecule that can be conjugated to a nucleic acid sequence. Here the rational design of programmable DNA-switches is discussed, and the sensitive, rapid, and single-step detection of different environmentally relevant small target molecules is demonstrated.

  7. Incorporation of unique molecular identifiers in TruSeq adapters improves the accuracy of quantitative sequencing.

    PubMed

    Hong, Jungeui; Gresham, David

    2017-11-01

    Quantitative analysis of next-generation sequencing (NGS) data requires discriminating duplicate reads generated by PCR from identical molecules that are of unique origin. Typically, PCR duplicates are identified as sequence reads that align to the same genomic coordinates using reference-based alignment. However, identical molecules can be independently generated during library preparation. Misidentification of these molecules as PCR duplicates can introduce unforeseen biases during analyses. Here, we developed a cost-effective sequencing adapter design by modifying Illumina TruSeq adapters to incorporate a unique molecular identifier (UMI) while maintaining the capacity to undertake multiplexed, single-index sequencing. Incorporation of UMIs into TruSeq adapters (TrUMIseq adapters) enables identification of bona fide PCR duplicates as identically mapped reads with identical UMIs. Using TrUMIseq adapters, we show that accurate removal of PCR duplicates results in improved accuracy of both allele frequency (AF) estimation in heterogeneous populations using DNA sequencing and gene expression quantification using RNA-Seq.

  8. DNA sequence-dependent mechanics and protein-assisted bending in repressor-mediated loop formation

    PubMed Central

    Boedicker, James Q.; Garcia, Hernan G.; Johnson, Stephanie; Phillips, Rob

    2014-01-01

    As the chief informational molecule of life, DNA is subject to extensive physical manipulations. The energy required to deform double-helical DNA depends on sequence, and this mechanical code of DNA influences gene regulation, such as through nucleosome positioning. Here we examine the sequence-dependent flexibility of DNA in bacterial transcription factor-mediated looping, a context for which the role of sequence remains poorly understood. Using a suite of synthetic constructs repressed by the Lac repressor and two well-known sequences that show large flexibility differences in vitro, we make precise statistical mechanical predictions as to how DNA sequence influences loop formation and test these predictions using in vivo transcription and in vitro single-molecule assays. Surprisingly, sequence-dependent flexibility does not affect in vivo gene regulation. By theoretically and experimentally quantifying the relative contributions of sequence and the DNA-bending protein HU to DNA mechanical properties, we reveal that bending by HU dominates DNA mechanics and masks intrinsic sequence-dependent flexibility. Such a quantitative understanding of how mechanical regulatory information is encoded in the genome will be a key step towards a predictive understanding of gene regulation at single-base pair resolution. PMID:24231252

  9. Attomole-level Genomics with Single-molecule Direct DNA, cDNA and RNA Sequencing Technologies.

    PubMed

    Ozsolak, Fatih

    2016-01-01

    With the introduction of next-generation sequencing (NGS) technologies in 2005, the domination of microarrays in genomics quickly came to an end due to NGS's superior technical performance and cost advantages. By enabling genetic analysis capabilities that were not possible previously, NGS technologies have started to play an integral role in all areas of biomedical research. This chapter outlines the low-quantity DNA and cDNA sequencing capabilities and applications developed with the Helicos single molecule DNA sequencing technology.

  10. A single-molecule sequencing assay for the comprehensive profiling of T4 DNA ligase fidelity and bias during DNA end-joining.

    PubMed

    Potapov, Vladimir; Ong, Jennifer L; Langhorst, Bradley W; Bilotti, Katharina; Cahoon, Dan; Canton, Barry; Knight, Thomas F; Evans, Thomas C; Lohman, Gregory Js

    2018-05-08

    DNA ligases are key enzymes in molecular and synthetic biology that catalyze the joining of breaks in duplex DNA and the end-joining of DNA fragments. Ligation fidelity (discrimination against the ligation of substrates containing mismatched base pairs) and bias (preferential ligation of particular sequences over others) have been well-studied in the context of nick ligation. However, almost no data exist for fidelity and bias in end-joining ligation contexts. In this study, we applied Pacific Biosciences Single-Molecule Real-Time sequencing technology to directly sequence the products of a highly multiplexed ligation reaction. This method has been used to profile the ligation of all three-base 5'-overhangs by T4 DNA ligase under typical ligation conditions in a single experiment. We report the relative frequency of all ligation products with or without mismatches, the position-dependent frequency of each mismatch, and the surprising observation that 5'-TNA overhangs ligate extremely inefficiently compared to all other Watson-Crick pairings. The method can easily be extended to profile other ligases, end-types (e.g. blunt ends and overhangs of different lengths), and the effect of adjacent sequence on the ligation results. Further, the method has the potential to provide new insights into the thermodynamics of annealing and the kinetics of end-joining reactions.

  11. Genome Sequence of Bacillus cereus Strain TG1-6, a Plant-Beneficial Rhizobacterium That Is Highly Salt Tolerant

    PubMed Central

    2018-01-01

    ABSTRACT The complete genome sequence of Bacillus cereus strain TG1-6, which is a highly salt-tolerant rhizobacterium that enhances plant tolerance to drought stress, is reported here. The sequencing process was performed based on a combination of pyrosequencing and single-molecule sequencing. The complete genome is estimated to be approximately 5.42 Mb, containing a total of 5,610 predicted protein-coding DNA sequences (CDSs). PMID:29748401

  12. Deep learning for single-molecule science

    NASA Astrophysics Data System (ADS)

    Albrecht, Tim; Slabaugh, Gregory; Alonso, Eduardo; Al-Arif, SM Masudur R.

    2017-10-01

    Exploring and making predictions based on single-molecule data can be challenging, not only due to the sheer size of the datasets, but also because a priori knowledge about the signal characteristics is typically limited and poor signal-to-noise ratio. For example, hypothesis-driven data exploration, informed by an expectation of the signal characteristics, can lead to interpretation bias or loss of information. Equally, even when the different data categories are known, e.g., the four bases in DNA sequencing, it is often difficult to know how to make best use of the available information content. The latest developments in machine learning (ML), so-called deep learning (DL) offer interesting, new avenues to address such challenges. In some applications, such as speech and image recognition, DL has been able to outperform conventional ML strategies and even human performance. However, to date DL has not been applied much in single-molecule science, presumably in part because relatively little is known about the ‘internal workings’ of such DL tools within single-molecule science as a field. In this Tutorial, we make an attempt to illustrate in a step-by-step guide how one of those, a convolutional neural network (CNN), may be used for base calling in DNA sequencing applications. We compare it with a SVM as a more conventional ML method, and discuss some of the strengths and weaknesses of the approach. In particular, a ‘deep’ neural network has many features of a ‘black box’, which has important implications on how we look at and interpret data.

  13. Serogroup-level resolution of the “Super-7” Shiga toxin-producing Escherichia coli using nanopore single-molecule DNA sequencing

    USDA-ARS?s Scientific Manuscript database

    DNA sequencing and other DNA-based methods, such as PCR, are now broadly used for detection and identification of bacterial foodborne pathogens. For the identification of foodborne bacterial pathogens, it is important to make taxonomic assignments to the species, or even subspecies level. Long-read ...

  14. Sequence analysis of cultivated strawberry (Fragaria × ananassa Duch.) using microdissected single somatic chromosomes.

    PubMed

    Yanagi, Tomohiro; Shirasawa, Kenta; Terachi, Mayuko; Isobe, Sachiko

    2017-01-01

    Cultivated strawberry ( Fragaria  ×  ananassa Duch.) has homoeologous chromosomes because of allo-octoploidy. For example, two homoeologous chromosomes that belong to different sub-genome of allopolyploids have similar base sequences. Thus, when conducting de novo assembly of DNA sequences, it is difficult to determine whether these sequences are derived from the same chromosome. To avoid the difficulties associated with homoeologous chromosomes and demonstrate the possibility of sequencing allopolyploids using single chromosomes, we conducted sequence analysis using microdissected single somatic chromosomes of cultivated strawberry. Three hundred and ten somatic chromosomes of the Japanese octoploid strawberry 'Reiko' were individually selected under a light microscope using a microdissection system. DNA from 288 of the dissected chromosomes was successfully amplified using a DNA amplification kit. Using next-generation sequencing, we decoded the base sequences of the amplified DNA segments, and on the basis of mapping, we identified DNA sequences from 144 samples that were best matched to the reference genomes of the octoploid strawberry, F.  ×  ananassa , and the diploid strawberry, F. vesca . The 144 samples were classified into seven pseudo-molecules of F. vesca . The coverage rates of the DNA sequences from the single chromosome onto all pseudo-molecular sequences varied from 3 to 29.9%. We demonstrated an efficient method for sequence analysis of allopolyploid plants using microdissected single chromosomes. On the basis of our results, we believe that whole-genome analysis of allopolyploid plants can be enhanced using methodology that employs microdissected single chromosomes.

  15. Solid-State and Biological Nanopore for Real-Time Sensing of Single Chemical and Sequencing of DNA.

    PubMed

    Haque, Farzin; Li, Jinghong; Wu, Hai-Chen; Liang, Xing-Jie; Guo, Peixuan

    2013-02-01

    Sensitivity and specificity are two most important factors to take into account for molecule sensing, chemical detection and disease diagnosis. A perfect sensitivity is to reach the level where a single molecule can be detected. An ideal specificity is to reach the level where the substance can be detected in the presence of many contaminants. The rapidly progressing nanopore technology is approaching this threshold. A wide assortment of biomotors and cellular pores in living organisms perform diverse biological functions. The elegant design of these transportation machineries has inspired the development of single molecule detection based on modulations of the individual current blockage events. The dynamic growth of nanotechnology and nanobiotechnology has stimulated rapid advances in the study of nanopore based instrumentation over the last decade, and inspired great interest in sensing of single molecules including ions, nucleotides, enantiomers, drugs, and polymers such as PEG, RNA, DNA, and polypeptides. This sensing technology has been extended to medical diagnostics and third generation high throughput DNA sequencing. This review covers current nanopore detection platforms including both biological pores and solid state counterparts. Several biological nanopores have been studied over the years, but this review will focus on the three best characterized systems including α-hemolysin and MspA, both containing a smaller channel for the detection of single-strand DNA, as well as bacteriophage phi29 DNA packaging motor connector that contains a larger channel for the passing of double stranded DNA. The advantage and disadvantage of each system are compared; their current and potential applications in nanomedicine, biotechnology, and nanotechnology are discussed.

  16. Solid-State and Biological Nanopore for Real-Time Sensing of Single Chemical and Sequencing of DNA

    PubMed Central

    Haque, Farzin; Li, Jinghong; Wu, Hai-Chen; Liang, Xing-Jie; Guo, Peixuan

    2013-01-01

    Sensitivity and specificity are two most important factors to take into account for molecule sensing, chemical detection and disease diagnosis. A perfect sensitivity is to reach the level where a single molecule can be detected. An ideal specificity is to reach the level where the substance can be detected in the presence of many contaminants. The rapidly progressing nanopore technology is approaching this threshold. A wide assortment of biomotors and cellular pores in living organisms perform diverse biological functions. The elegant design of these transportation machineries has inspired the development of single molecule detection based on modulations of the individual current blockage events. The dynamic growth of nanotechnology and nanobiotechnology has stimulated rapid advances in the study of nanopore based instrumentation over the last decade, and inspired great interest in sensing of single molecules including ions, nucleotides, enantiomers, drugs, and polymers such as PEG, RNA, DNA, and polypeptides. This sensing technology has been extended to medical diagnostics and third generation high throughput DNA sequencing. This review covers current nanopore detection platforms including both biological pores and solid state counterparts. Several biological nanopores have been studied over the years, but this review will focus on the three best characterized systems including α-hemolysin and MspA, both containing a smaller channel for the detection of single-strand DNA, as well as bacteriophage phi29 DNA packaging motor connector that contains a larger channel for the passing of double stranded DNA. The advantage and disadvantage of each system are compared; their current and potential applications in nanomedicine, biotechnology, and nanotechnology are discussed. PMID:23504223

  17. Nanopore analysis of polymers in solution.

    NASA Astrophysics Data System (ADS)

    Deamer, David

    2002-03-01

    Nanopores represent a novel approach for investigating macromolecules in solution. Polymers that have been analyzed by this technique include polyethylene glycol (PEG), certain proteins and nucleic acids. The a-hemolysin pore inserted into lipid bilayers provides continuous non-gated ion current through a pore diameter of approximately 1.5 - 2 nm. Nucleic acid molecules can be driven through the pore by imposing a voltage across the supporting membrane. Single stranded, but not double stranded nucleic acids pass through in strict linear sequence from one end of the molecule to the other. While in the pore, the molecule reduces ionic current, and properties of the ionic current blockade such as duration, mean amplitude and modulations of amplitude provide information about structure and composition of the nucleic acid. For a given molecular species, the duration of the blockade is a function of chain length, and the rate of blockades is linearly related to concentration. More recent studies have shown that the a-hemolysin nanopore can discriminate between synthetic DNA molecules differing by a single base pair or even a single nucleotide. These results indicate that a nanopore may have the resolution required for nucleic acid sequencing applications.

  18. [Biophysics of single molecules].

    PubMed

    Serdiuk, I N; Deriusheva, E I

    2011-01-01

    The modern methods of research of biological molecules whose application led to the development of a new field of science, biophysics of single molecules, are reviewed. The measurement of the characteristics of single molecules enables one to reveal their individual features, and it is just for this reason that much more information can be obtained from one molecule than from the entire ensample of molecules. The high sensitivity of the methods considered in detail makes it possible to come close to the solution of the basic problem of practical importance, namely, the determination of the nucleotide sequence of a single DNA molecule.

  19. Single-molecule Protein Unfolding in Solid State Nanopores

    PubMed Central

    Talaga, David S.; Li, Jiali

    2009-01-01

    We use single silicon nitride nanopores to study folded, partially folded and unfolded single proteins by measuring their excluded volumes. The DNA-calibrated translocation signals of β-lactoglobulin and histidine-containing phosphocarrier protein match quantitatively with that predicted by a simple sum of the partial volumes of the amino acids in the polypeptide segment inside the pore when translocation stalls due to the primary charge sequence. Our analysis suggests that the majority of the protein molecules were linear or looped during translocation and that the electrical forces present under physiologically relevant potentials can unfold proteins. Our results show that the nanopore translocation signals are sensitive enough to distinguish the folding state of a protein and distinguish between proteins based on the excluded volume of a local segment of the polypeptide chain that transiently stalls in the nanopore due to the primary sequence of charges. PMID:19530678

  20. Sequence-Dependent Elasticity and Electrostatics of Single-Stranded DNA: Signatures of Base-Stacking

    PubMed Central

    McIntosh, Dustin B.; Duggan, Gina; Gouil, Quentin; Saleh, Omar A.

    2014-01-01

    Base-stacking is a key factor in the energetics that determines nucleic acid structure. We measure the tensile response of single-stranded DNA as a function of sequence and monovalent salt concentration to examine the effects of base-stacking on the mechanical and thermodynamic properties of single-stranded DNA. By comparing the elastic response of highly stacked poly(dA) and that of a polypyrimidine sequence with minimal stacking, we find that base-stacking in poly(dA) significantly enhances the polymer’s rigidity. The unstacking transition of poly(dA) at high force reveals that the intrinsic electrostatic tension on the molecule varies significantly more weakly on salt concentration than mean-field predictions. Further, we provide a model-independent estimate of the free energy difference between stacked poly(dA) and unstacked polypyrimidine, finding it to be ∼−0.25 kBT/base and nearly constant over three orders of magnitude in salt concentration. PMID:24507606

  1. Single molecule targeted sequencing for cancer gene mutation detection.

    PubMed

    Gao, Yan; Deng, Liwei; Yan, Qin; Gao, Yongqian; Wu, Zengding; Cai, Jinsen; Ji, Daorui; Li, Gailing; Wu, Ping; Jin, Huan; Zhao, Luyang; Liu, Song; Ge, Liangjin; Deem, Michael W; He, Jiankui

    2016-05-19

    With the rapid decline in cost of sequencing, it is now affordable to examine multiple genes in a single disease-targeted clinical test using next generation sequencing. Current targeted sequencing methods require a separate step of targeted capture enrichment during sample preparation before sequencing. Although there are fast sample preparation methods available in market, the library preparation process is still relatively complicated for physicians to use routinely. Here, we introduced an amplification-free Single Molecule Targeted Sequencing (SMTS) technology, which combined targeted capture and sequencing in one step. We demonstrated that this technology can detect low-frequency mutations using artificially synthesized DNA sample. SMTS has several potential advantages, including simple sample preparation thus no biases and errors are introduced by PCR reaction. SMTS has the potential to be an easy and quick sequencing technology for clinical diagnosis such as cancer gene mutation detection, infectious disease detection, inherited condition screening and noninvasive prenatal diagnosis.

  2. Single molecule quantitation and sequencing of rare translocations using microfluidic nested digital PCR.

    PubMed

    Shuga, Joe; Zeng, Yong; Novak, Richard; Lan, Qing; Tang, Xiaojiang; Rothman, Nathaniel; Vermeulen, Roel; Li, Laiyu; Hubbard, Alan; Zhang, Luoping; Mathies, Richard A; Smith, Martyn T

    2013-09-01

    Cancers are heterogeneous and genetically unstable. New methods are needed that provide the sensitivity and specificity to query single cells at the genetic loci that drive cancer progression, thereby enabling researchers to study the progression of individual tumors. Here, we report the development and application of a bead-based hemi-nested microfluidic droplet digital PCR (dPCR) technology to achieve 'quantitative' measurement and single-molecule sequencing of somatically acquired carcinogenic translocations at extremely low levels (<10(-6)) in healthy subjects. We use this technique in our healthy study population to determine the overall concentration of the t(14;18) translocation, which is strongly associated with follicular lymphoma. The nested dPCR approach improves the detection limit to 1×10(-7) or lower while maintaining the analysis efficiency and specificity. Further, the bead-based dPCR enabled us to isolate and quantify the relative amounts of the various clonal forms of t(14;18) translocation in these subjects, and the single-molecule sensitivity and resolution of dPCR led to the discovery of new clonal forms of t(14;18) that were otherwise masked by the conventional quantitative PCR measurements. In this manner, we created a quantitative map for this carcinogenic mutation in this healthy population and identified the positions on chromosomes 14 and 18 where the vast majority of these t(14;18) events occur.

  3. The Shine-Dalgarno sequence of riboswitch-regulated single mRNAs shows ligand-dependent accessibility bursts

    NASA Astrophysics Data System (ADS)

    Rinaldi, Arlie J.; Lund, Paul E.; Blanco, Mario R.; Walter, Nils G.

    2016-01-01

    In response to intracellular signals in Gram-negative bacteria, translational riboswitches--commonly embedded in messenger RNAs (mRNAs)--regulate gene expression through inhibition of translation initiation. It is generally thought that this regulation originates from occlusion of the Shine-Dalgarno (SD) sequence upon ligand binding; however, little direct evidence exists. Here we develop Single Molecule Kinetic Analysis of RNA Transient Structure (SiM-KARTS) to investigate the ligand-dependent accessibility of the SD sequence of an mRNA hosting the 7-aminomethyl-7-deazaguanine (preQ1)-sensing riboswitch. Spike train analysis reveals that individual mRNA molecules alternate between two conformational states, distinguished by `bursts' of probe binding associated with increased SD sequence accessibility. Addition of preQ1 decreases the lifetime of the SD's high-accessibility (bursting) state and prolongs the time between bursts. In addition, ligand-jump experiments reveal imperfect riboswitching of single mRNA molecules. Such complex ligand sensing by individual mRNA molecules rationalizes the nuanced ligand response observed during bulk mRNA translation.

  4. Nanofluidic Device with Embedded Nanopore

    NASA Astrophysics Data System (ADS)

    Zhang, Yuning; Reisner, Walter

    2014-03-01

    Nanofluidic based devices are robust methods for biomolecular sensing and single DNA manipulation. Nanopore-based DNA sensing has attractive features that make it a leading candidate as a single-molecule DNA sequencing technology. Nanochannel based extension of DNA, combined with enzymatic or denaturation-based barcoding schemes, is already a powerful approach for genome analysis. We believe that there is revolutionary potential in devices that combine nanochannels with nanpore detectors. In particular, due to the fast translocation of a DNA molecule through a standard nanopore configuration, there is an unfavorable trade-off between signal and sequence resolution. With a combined nanochannel-nanopore device, based on embedding a nanopore inside a nanochannel, we can in principle gain independent control over both DNA translocation speed and sensing signal, solving the key draw-back of the standard nanopore configuration. We demonstrate that we can detect - using fluorescent microscopy - successful translocation of DNA from the nanochannel out through the nanopore, a possible method to 'select' a given barcode for further analysis. We also show that in equilibrium DNA will not escape through an embedded sub-persistence length nanopore until a certain voltage bias is added.

  5. DNA nanomapping using CRISPR-Cas9 as a programmable nanoparticle.

    PubMed

    Mikheikin, Andrey; Olsen, Anita; Leslie, Kevin; Russell-Pavier, Freddie; Yacoot, Andrew; Picco, Loren; Payton, Oliver; Toor, Amir; Chesney, Alden; Gimzewski, James K; Mishra, Bud; Reed, Jason

    2017-11-21

    Progress in whole-genome sequencing using short-read (e.g., <150 bp), next-generation sequencing technologies has reinvigorated interest in high-resolution physical mapping to fill technical gaps that are not well addressed by sequencing. Here, we report two technical advances in DNA nanotechnology and single-molecule genomics: (1) we describe a labeling technique (CRISPR-Cas9 nanoparticles) for high-speed AFM-based physical mapping of DNA and (2) the first successful demonstration of using DVD optics to image DNA molecules with high-speed AFM. As a proof of principle, we used this new "nanomapping" method to detect and map precisely BCL2-IGH translocations present in lymph node biopsies of follicular lymphoma patents. This HS-AFM "nanomapping" technique can be complementary to both sequencing and other physical mapping approaches.

  6. Nanosecond to submillisecond dynamics in dye-labeled single-stranded DNA, as revealed by ensemble measurements and photon statistics at single-molecule level.

    PubMed

    Kaji, Takahiro; Ito, Syoji; Iwai, Shigenori; Miyasaka, Hiroshi

    2009-10-22

    Single-molecule and ensemble time-resolved fluorescence measurements were applied for the investigation of the conformational dynamics of single-stranded DNA, ssDNA, connected with a fluorescein dye by a C6 linker, where the motions both of DNA and the C6 linker affect the geometry of the system. From the ensemble measurement of the fluorescence quenching via photoinduced electron transfer with a guanine base in the DNA sequence, three main conformations were found in aqueous solution: a conformation unaffected by the guanine base in the excited state lifetime of fluorescein, a conformation in which the fluorescence is dynamically quenched in the excited-state lifetime, and a conformation leading to rapid quenching via nonfluorescent complex. The analysis by using the parameters acquired from the ensemble measurements for interphoton time distribution histograms and FCS autocorrelations by the single-molecule measurement revealed that interconversion in these three conformations took place with two characteristic time constants of several hundreds of nanoseconds and tens of microseconds. The advantage of the combination use of the ensemble measurements with the single-molecule detections for rather complex dynamic motions is discussed by integrating the experimental results with those obtained by molecular dynamics simulation.

  7. Single-Cell-Based Platform for Copy Number Variation Profiling through Digital Counting of Amplified Genomic DNA Fragments.

    PubMed

    Li, Chunmei; Yu, Zhilong; Fu, Yusi; Pang, Yuhong; Huang, Yanyi

    2017-04-26

    We develop a novel single-cell-based platform through digital counting of amplified genomic DNA fragments, named multifraction amplification (mfA), to detect the copy number variations (CNVs) in a single cell. Amplification is required to acquire genomic information from a single cell, while introducing unavoidable bias. Unlike prevalent methods that directly infer CNV profiles from the pattern of sequencing depth, our mfA platform denatures and separates the DNA molecules from a single cell into multiple fractions of a reaction mix before amplification. By examining the sequencing result of each fraction for a specific fragment and applying a segment-merge maximum likelihood algorithm to the calculation of copy number, we digitize the sequencing-depth-based CNV identification and thus provide a method that is less sensitive to the amplification bias. In this paper, we demonstrate a mfA platform through multiple displacement amplification (MDA) chemistry. When performing the mfA platform, the noise of MDA is reduced; therefore, the resolution of single-cell CNV identification can be improved to 100 kb. We can also determine the genomic region free of allelic drop-out with mfA platform, which is impossible for conventional single-cell amplification methods.

  8. Electronic Transport in Single-Stranded DNA Molecule Related to Huntington's Disease

    NASA Astrophysics Data System (ADS)

    Sarmento, R. G.; Silva, R. N. O.; Madeira, M. P.; Frazão, N. F.; Sousa, J. O.; Macedo-Filho, A.

    2018-04-01

    We report a numerical analysis of the electronic transport in single chain DNA molecule consisting of 182 nucleotides. The DNA chains studied were extracted from a segment of the human chromosome 4p16.3, which were modified by expansion of CAG (cytosine-adenine-guanine) triplet repeats to mimics Huntington's disease. The mutated DNA chains were connected between two platinum electrodes to analyze the relationship between charge propagation in the molecule and Huntington's disease. The computations were performed within a tight-binding model, together with a transfer matrix technique, to investigate the current-voltage (I-V) of 23 types of DNA sequence and compare them with the distributions of the related CAG repeat numbers with the disease. All DNA sequences studied have a characteristic behavior of a semiconductor. In addition, the results showed a direct correlation between the current-voltage curves and the distributions of the CAG repeat numbers, suggesting possible applications in the development of DNA-based biosensors for molecular diagnostics.

  9. Biophysics of protein-DNA interactions and chromosome organization

    PubMed Central

    Marko, John F.

    2014-01-01

    The function of DNA in cells depends on its interactions with protein molecules, which recognize and act on base sequence patterns along the double helix. These notes aim to introduce basic polymer physics of DNA molecules, biophysics of protein-DNA interactions and their study in single-DNA experiments, and some aspects of large-scale chromosome structure. Mechanisms for control of chromosome topology will also be discussed. PMID:25419039

  10. Multiplex single-molecule interaction profiling of DNA-barcoded proteins.

    PubMed

    Gu, Liangcai; Li, Chao; Aach, John; Hill, David E; Vidal, Marc; Church, George M

    2014-11-27

    In contrast with advances in massively parallel DNA sequencing, high-throughput protein analyses are often limited by ensemble measurements, individual analyte purification and hence compromised quality and cost-effectiveness. Single-molecule protein detection using optical methods is limited by the number of spectrally non-overlapping chromophores. Here we introduce a single-molecular-interaction sequencing (SMI-seq) technology for parallel protein interaction profiling leveraging single-molecule advantages. DNA barcodes are attached to proteins collectively via ribosome display or individually via enzymatic conjugation. Barcoded proteins are assayed en masse in aqueous solution and subsequently immobilized in a polyacrylamide thin film to construct a random single-molecule array, where barcoding DNAs are amplified into in situ polymerase colonies (polonies) and analysed by DNA sequencing. This method allows precise quantification of various proteins with a theoretical maximum array density of over one million polonies per square millimetre. Furthermore, protein interactions can be measured on the basis of the statistics of colocalized polonies arising from barcoding DNAs of interacting proteins. Two demanding applications, G-protein coupled receptor and antibody-binding profiling, are demonstrated. SMI-seq enables 'library versus library' screening in a one-pot assay, simultaneously interrogating molecular binding affinity and specificity.

  11. Identifying single bases in a DNA oligomer with electron tunnelling.

    PubMed

    Huang, Shuo; He, Jin; Chang, Shuai; Zhang, Peiming; Liang, Feng; Li, Shengqin; Tuchband, Michael; Fuhrmann, Alexander; Ros, Robert; Lindsay, Stuart

    2010-12-01

    It has been proposed that single molecules of DNA could be sequenced by measuring the physical properties of the bases as they pass through a nanopore. Theoretical calculations suggest that electron tunnelling can identify bases in single-stranded DNA without enzymatic processing, and it was recently experimentally shown that tunnelling can sense individual nucleotides and nucleosides. Here, we report that tunnelling electrodes functionalized with recognition reagents can identify a single base flanked by other bases in short DNA oligomers. The residence time of a single base in a recognition junction is on the order of a second, but pulling the DNA through the junction with a force of tens of piconewtons would yield reading speeds of tens of bases per second.

  12. Highly parallel single-molecule amplification approach based on agarose droplet polymerase chain reaction for efficient and cost-effective aptamer selection.

    PubMed

    Zhang, Wei Yun; Zhang, Wenhua; Liu, Zhiyuan; Li, Cong; Zhu, Zhi; Yang, Chaoyong James

    2012-01-03

    We have developed a novel method for efficiently screening affinity ligands (aptamers) from a complex single-stranded DNA (ssDNA) library by employing single-molecule emulsion polymerase chain reaction (PCR) based on the agarose droplet microfluidic technology. In a typical systematic evolution of ligands by exponential enrichment (SELEX) process, the enriched library is sequenced first, and tens to hundreds of aptamer candidates are analyzed via a bioinformatic approach. Possible candidates are then chemically synthesized, and their binding affinities are measured individually. Such a process is time-consuming, labor-intensive, inefficient, and expensive. To address these problems, we have developed a highly efficient single-molecule approach for aptamer screening using our agarose droplet microfluidic technology. Statistically diluted ssDNA of the pre-enriched library evolved through conventional SELEX against cancer biomarker Shp2 protein was encapsulated into individual uniform agarose droplets for droplet PCR to generate clonal agarose beads. The binding capacity of amplified ssDNA from each clonal bead was then screened via high-throughput fluorescence cytometry. DNA clones with high binding capacity and low K(d) were chosen as the aptamer and can be directly used for downstream biomedical applications. We have identified an ssDNA aptamer that selectively recognizes Shp2 with a K(d) of 24.9 nM. Compared to a conventional sequencing-chemical synthesis-screening work flow, our approach avoids large-scale DNA sequencing and expensive, time-consuming DNA synthesis of large populations of DNA candidates. The agarose droplet microfluidic approach is thus highly efficient and cost-effective for molecular evolution approaches and will find wide application in molecular evolution technologies, including mRNA display, phage display, and so on. © 2011 American Chemical Society

  13. Single Molecule Visualization of Protein-DNA Complexes: Watching Machines at Work

    NASA Astrophysics Data System (ADS)

    Kowalczykowski, Stephen

    2013-03-01

    We can now watch individual proteins acting on single molecules of DNA. Such imaging provides unprecedented interrogation of fundamental biophysical processes. Visualization is achieved through the application of two complementary procedures. In one, single DNA molecules are attached to a polystyrene bead and are then captured by an optical trap. The DNA, a worm-like coil, is extended either by the force of solution flow in a micro-fabricated channel, or by capturing the opposite DNA end in a second optical trap. In the second procedure, DNA is attached by one end to a glass surface. The coiled DNA is elongated either by continuous solution flow or by subsequently tethering the opposite end to the surface. Protein action is visualized by fluorescent reporters: fluorescent dyes that bind double-stranded DNA (dsDNA), fluorescent biosensors for single-stranded DNA (ssDNA), or fluorescently-tagged proteins. Individual molecules are imaged using either epifluorescence microscopy or total internal reflection fluorescence (TIRF) microscopy. Using these approaches, we imaged the search for DNA sequence homology conducted by the RecA-ssDNA filament. The manner by which RecA protein finds a single homologous sequence in the genome had remained undefined for almost 30 years. Single-molecule imaging revealed that the search occurs through a mechanism termed ``intersegmental contact sampling,'' in which the randomly coiled structure of DNA is essential for reiterative sampling of DNA sequence identity: an example of parallel processing. In addition, the assembly of RecA filaments on single molecules of single-stranded DNA was visualized. Filament assembly requires nucleation of a protein dimer on DNA, and subsequent growth occurs via monomer addition. Furthermore, we discovered a class of proteins that catalyzed both nucleation and growth of filaments, revealing how the cell controls assembly of this protein-DNA complex.

  14. Exploring Connectivity in Sequence Space of Functional RNA

    NASA Technical Reports Server (NTRS)

    Wei, Chenyu; Pohorille, Andrzej; Popovic, Milena; Ditzler, Mark

    2017-01-01

    Emergence of replicable genetic molecules was one of the marking points in the origin of life, evolution of which can be conceptualized as a walk through the space of all possible sequences. A theoretical concept of fitness landscape helps to understand evolutionary processes through assigning a value of fitness to each genotype. Then, evolution of a phenotype is viewed as a series of consecutive, single-point mutations. Natural selection biases evolution toward peaks of high fitness and away from valleys of low fitness. whereas neutral drift occurs in the sequence space without direction as mutations are introduced at random. Large networks of neutral or near-neutral mutations on a fitness landscape, especially for sufficiently long genomes, are possible or even inevitable. Their detection in experiments, however, has been elusive. Although a few near-neutral evolutionary pathways have been found, recent experimental evidence indicates landscapes consist of largely isolated islands. The generality of these results, however, is not clear, as the genome length or the fraction of functional molecules in the genotypic space might have been insufficient for the emergence of large, neutral networks. Thorough investigation on the structure of the fitness landscape is essential to understand the mechanisms of evolution of early genomes. RNA molecules are commonly assumed to play the pivotal role in the origin of genetic systems. They are widely believed to be early, if not the earliest, genetic and catalytic molecules, with abundant biochemical activities as aptamers and ribozymes, i.e. RNA molecules capable, respectively, to bind small molecules or catalyze chemical reactions. Here, we present results of our recent studies on the structure of the sequence space of RNA ligase ribozymes selected through in vitro evolution. Several hundred thousands of sequences active to a different degree were obtained by way of deep sequencing. Analysis of these sequences revealed several large clusters defined such that every sequence in a cluster can be reached from any other sequence in the same cluster through a series of single point mutations. Sequences in a single cluster appear to adopt more than one secondary structure. The mechanism of refolding within a single cluster was examined. To shed light on possible evolutionary paths in the space of ribozymes, the connectivity between clusters was investigated. The effect of length of RNA molecules on the structure of the fitness landscape and possible evolutionary paths was examined by way of comparing functional sequences of 20 and 80 nucleobases in length. It was found that sequences of different lengths shared secondary structure motifs that were presumed responsible for catalytic activity, with increasing complexity and global structural rearrangements emerging in longer molecules.

  15. Accurate multiplex polony sequencing of an evolved bacterial genome.

    PubMed

    Shendure, Jay; Porreca, Gregory J; Reppas, Nikos B; Lin, Xiaoxia; McCutcheon, John P; Rosenbaum, Abraham M; Wang, Michael D; Zhang, Kun; Mitra, Robi D; Church, George M

    2005-09-09

    We describe a DNA sequencing technology in which a commonly available, inexpensive epifluorescence microscope is converted to rapid nonelectrophoretic DNA sequencing automation. We apply this technology to resequence an evolved strain of Escherichia coli at less than one error per million consensus bases. A cell-free, mate-paired library provided single DNA molecules that were amplified in parallel to 1-micrometer beads by emulsion polymerase chain reaction. Millions of beads were immobilized in a polyacrylamide gel and subjected to automated cycles of sequencing by ligation and four-color imaging. Cost per base was roughly one-ninth as much as that of conventional sequencing. Our protocols were implemented with off-the-shelf instrumentation and reagents.

  16. A transcriptome atlas of rabbit revealed by PacBio single-molecule long-read sequencing.

    PubMed

    Chen, Shi-Yi; Deng, Feilong; Jia, Xianbo; Li, Cao; Lai, Song-Jia

    2017-08-09

    It is widely acknowledged that transcriptional diversity largely contributes to biological regulation in eukaryotes. Since the advent of second-generation sequencing technologies, a large number of RNA sequencing studies have considerably improved our understanding of transcriptome complexity. However, it still remains a huge challenge for obtaining full-length transcripts because of difficulties in the short read-based assembly. In the present study we employ PacBio single-molecule long-read sequencing technology for whole-transcriptome profiling in rabbit (Oryctolagus cuniculus). We totally obtain 36,186 high-confidence transcripts from 14,474 genic loci, among which more than 23% of genic loci and 66% of isoforms have not been annotated yet within the current reference genome. Furthermore, about 17% of transcripts are computationally revealed to be non-coding RNAs. Up to 24,797 alternative splicing (AS) and 11,184 alternative polyadenylation (APA) events are detected within this de novo constructed transcriptome, respectively. The results provide a comprehensive set of reference transcripts and hence contribute to the improved annotation of rabbit genome.

  17. De novo assembly and phasing of a Korean human genome.

    PubMed

    Seo, Jeong-Sun; Rhie, Arang; Kim, Junsoo; Lee, Sangjin; Sohn, Min-Hwan; Kim, Chang-Uk; Hastie, Alex; Cao, Han; Yun, Ji-Young; Kim, Jihye; Kuk, Junho; Park, Gun Hwa; Kim, Juhyeok; Ryu, Hanna; Kim, Jongbum; Roh, Mira; Baek, Jeonghun; Hunkapiller, Michael W; Korlach, Jonas; Shin, Jong-Yeon; Kim, Changhoon

    2016-10-13

    Advances in genome assembly and phasing provide an opportunity to investigate the diploid architecture of the human genome and reveal the full range of structural variation across population groups. Here we report the de novo assembly and haplotype phasing of the Korean individual AK1 (ref. 1) using single-molecule real-time sequencing, next-generation mapping, microfluidics-based linked reads, and bacterial artificial chromosome (BAC) sequencing approaches. Single-molecule sequencing coupled with next-generation mapping generated a highly contiguous assembly, with a contig N50 size of 17.9 Mb and a scaffold N50 size of 44.8 Mb, resolving 8 chromosomal arms into single scaffolds. The de novo assembly, along with local assemblies and spanning long reads, closes 105 and extends into 72 out of 190 euchromatic gaps in the reference genome, adding 1.03 Mb of previously intractable sequence. High concordance between the assembly and paired-end sequences from 62,758 BAC clones provides strong support for the robustness of the assembly. We identify 18,210 structural variants by direct comparison of the assembly with the human reference, identifying thousands of breakpoints that, to our knowledge, have not been reported before. Many of the insertions are reflected in the transcriptome and are shared across the Asian population. We performed haplotype phasing of the assembly with short reads, long reads and linked reads from whole-genome sequencing and with short reads from 31,719 BAC clones, thereby achieving phased blocks with an N50 size of 11.6 Mb. Haplotigs assembled from single-molecule real-time reads assigned to haplotypes on phased blocks covered 89% of genes. The haplotigs accurately characterized the hypervariable major histocompatability complex region as well as demonstrating allele configuration in clinically relevant genes such as CYP2D6. This work presents the most contiguous diploid human genome assembly so far, with extensive investigation of unreported and Asian-specific structural variants, and high-quality haplotyping of clinically relevant alleles for precision medicine.

  18. Highly multiplexed subcellular RNA sequencing in situ

    PubMed Central

    Lee, Je Hyuk; Daugharthy, Evan R.; Scheiman, Jonathan; Kalhor, Reza; Ferrante, Thomas C.; Yang, Joyce L.; Terry, Richard; Jeanty, Sauveur S. F.; Li, Chao; Amamoto, Ryoji; Peters, Derek T.; Turczyk, Brian M.; Marblestone, Adam H.; Inverso, Samuel A.; Bernard, Amy; Mali, Prashant; Rios, Xavier; Aach, John; Church, George M.

    2014-01-01

    Understanding the spatial organization of gene expression with single nucleotide resolution requires localizing the sequences of expressed RNA transcripts within a cell in situ. Here we describe fluorescent in situ RNA sequencing (FISSEQ), in which stably cross-linked cDNA amplicons are sequenced within a biological sample. Using 30-base reads from 8,742 genes in situ, we examined RNA expression and localization in human primary fibroblasts using a simulated wound healing assay. FISSEQ is compatible with tissue sections and whole mount embryos, and reduces the limitations of optical resolution and noisy signals on single molecule detection. Our platform enables massively parallel detection of genetic elements, including gene transcripts and molecular barcodes, and can be used to investigate cellular phenotype, gene regulation, and environment in situ. PMID:24578530

  19. An improved assembly of the loblolly pine mega-genome using long-read single-molecule sequencing.

    PubMed

    Zimin, Aleksey V; Stevens, Kristian A; Crepeau, Marc W; Puiu, Daniela; Wegrzyn, Jill L; Yorke, James A; Langley, Charles H; Neale, David B; Salzberg, Steven L

    2017-01-01

    The 22-gigabase genome of loblolly pine (Pinus taeda) is one of the largest ever sequenced. The draft assembly published in 2014 was built entirely from short Illumina reads, with lengths ranging from 100 to 250 base pairs (bp). The assembly was quite fragmented, containing over 11 million contigs whose weighted average (N50) size was 8206 bp. To improve this result, we generated approximately 12-fold coverage in long reads using the Single Molecule Real Time sequencing technology developed at Pacific Biosciences. We assembled the long and short reads together using the MaSuRCA mega-reads assembly algorithm, which produced a substantially better assembly, P. taeda version 2.0. The new assembly has an N50 contig size of 25 361, more than three times as large as achieved in the original assembly, and an N50 scaffold size of 107 821, 61% larger than the previous assembly. © The Author 2017. Published by Oxford University Press.

  20. Erratum to: An improved assembly of the loblolly pine mega-genome using long-read single-molecule sequencing.

    PubMed

    Zimin, Aleksey V; Stevens, Kristian A; Crepeau, Marc W; Puiu, Daniela; Wegrzyn, Jill L; Yorke, James A; Langley, Charles H; Neale, David B; Salzberg, Steven L

    2017-10-01

    The 22-gigabase genome of loblolly pine (Pinus taeda) is one of the largest ever sequenced. The draft assembly published in 2014 was built entirely from short Illumina reads, with lengths ranging from 100 to 250 base pairs (bp). The assembly was quite fragmented, containing over 11 million contigs whose weighted average (N50) size was 8206 bp. To improve this result, we generated approximately 12-fold coverage in long reads using the Single Molecule Real Time sequencing technology developed at Pacific Biosciences. We assembled the long and short reads together using the MaSuRCA mega-reads assembly algorithm, which produced a substantially better assembly, P. taeda version 2.0. The new assembly has an N50 contig size of 25 361, more than three times as large as achieved in the original assembly, and an N50 scaffold size of 107 821, 61% larger than the previous assembly. © The Authors 2017. Published by Oxford University Press.

  1. Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum.

    PubMed

    VanBuren, Robert; Bryant, Doug; Edger, Patrick P; Tang, Haibao; Burgess, Diane; Challabathula, Dinakar; Spittle, Kristi; Hall, Richard; Gu, Jenny; Lyons, Eric; Freeling, Michael; Bartels, Dorothea; Ten Hallers, Boudewijn; Hastie, Alex; Michael, Todd P; Mockler, Todd C

    2015-11-26

    Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a 'near-complete' draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. The Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.

  2. Protein mechanics: from single molecules to functional biomaterials.

    PubMed

    Li, Hongbin; Cao, Yi

    2010-10-19

    Elastomeric proteins act as the essential functional units in a wide variety of biomechanical machinery and serve as the basic building blocks for biological materials that exhibit superb mechanical properties. These proteins provide the desired elasticity, mechanical strength, resilience, and toughness within these materials. Understanding the mechanical properties of elastomeric protein-based biomaterials is a multiscale problem spanning from the atomistic/molecular level to the macroscopic level. Uncovering the design principles of individual elastomeric building blocks is critical both for the scientific understanding of multiscale mechanics of biomaterials and for the rational engineering of novel biomaterials with desirable mechanical properties. The development of single-molecule force spectroscopy techniques has provided methods for characterizing mechanical properties of elastomeric proteins one molecule at a time. Single-molecule atomic force microscopy (AFM) is uniquely suited to this purpose. Molecular dynamic simulations, protein engineering techniques, and single-molecule AFM study have collectively revealed tremendous insights into the molecular design of single elastomeric proteins, which can guide the design and engineering of elastomeric proteins with tailored mechanical properties. Researchers are focusing experimental efforts toward engineering artificial elastomeric proteins with mechanical properties that mimic or even surpass those of natural elastomeric proteins. In this Account, we summarize our recent experimental efforts to engineer novel artificial elastomeric proteins and develop general and rational methodologies to tune the nanomechanical properties of elastomeric proteins at the single-molecule level. We focus on general design principles used for enhancing the mechanical stability of proteins. These principles include the development of metal-chelation-based general methodology, strategies to control the unfolding hierarchy of multidomain elastomeric proteins, and the design of novel elastomeric proteins that exhibit stimuli-responsive mechanical properties. Moving forward, we are now exploring the use of these artificial elastomeric proteins as building blocks of protein-based biomaterials. Ultimately, we would like to rationally tailor mechanical properties of elastomeric protein-based materials by programming the molecular sequence, and thus nanomechanical properties, of elastomeric proteins at the single-molecule level. This step would help bridge the gap between single protein mechanics and material biomechanics, revealing how the mechanical properties of individual elastomeric proteins are translated into the properties of macroscopic materials.

  3. Using Synthetic Nanopores for Single-Molecule Analyses: Detecting SNPs, Trapping DNA Molecules, and the Prospects for Sequencing DNA

    ERIC Educational Resources Information Center

    Dimitrov, Valentin V.

    2009-01-01

    This work focuses on studying properties of DNA molecules and DNA-protein interactions using synthetic nanopores, and it examines the prospects of sequencing DNA using synthetic nanopores. We have developed a method for discriminating between alleles that uses a synthetic nanopore to measure the binding of a restriction enzyme to DNA. There exists…

  4. The sequence of sequencers: The history of sequencing DNA

    PubMed Central

    Heather, James M.; Chain, Benjamin

    2016-01-01

    Determining the order of nucleic acid residues in biological samples is an integral component of a wide variety of research applications. Over the last fifty years large numbers of researchers have applied themselves to the production of techniques and technologies to facilitate this feat, sequencing DNA and RNA molecules. This time-scale has witnessed tremendous changes, moving from sequencing short oligonucleotides to millions of bases, from struggling towards the deduction of the coding sequence of a single gene to rapid and widely available whole genome sequencing. This article traverses those years, iterating through the different generations of sequencing technology, highlighting some of the key discoveries, researchers, and sequences along the way. PMID:26554401

  5. Thermoelectric effect and its dependence on molecular length and sequence in single DNA molecules.

    PubMed

    Li, Yueqi; Xiang, Limin; Palma, Julio L; Asai, Yoshihiro; Tao, Nongjian

    2016-04-15

    Studying the thermoelectric effect in DNA is important for unravelling charge transport mechanisms and for developing relevant applications of DNA molecules. Here we report a study of the thermoelectric effect in single DNA molecules. By varying the molecular length and sequence, we tune the charge transport in DNA to either a hopping- or tunnelling-dominated regimes. The thermoelectric effect is small and insensitive to the molecular length in the hopping regime. In contrast, the thermoelectric effect is large and sensitive to the length in the tunnelling regime. These findings indicate that one may control the thermoelectric effect in DNA by varying its sequence and length. We describe the experimental results in terms of hopping and tunnelling charge transport models.

  6. Thermoelectric effect and its dependence on molecular length and sequence in single DNA molecules

    PubMed Central

    Li, Yueqi; Xiang, Limin; Palma, Julio L.; Asai, Yoshihiro; Tao, Nongjian

    2016-01-01

    Studying the thermoelectric effect in DNA is important for unravelling charge transport mechanisms and for developing relevant applications of DNA molecules. Here we report a study of the thermoelectric effect in single DNA molecules. By varying the molecular length and sequence, we tune the charge transport in DNA to either a hopping- or tunnelling-dominated regimes. The thermoelectric effect is small and insensitive to the molecular length in the hopping regime. In contrast, the thermoelectric effect is large and sensitive to the length in the tunnelling regime. These findings indicate that one may control the thermoelectric effect in DNA by varying its sequence and length. We describe the experimental results in terms of hopping and tunnelling charge transport models. PMID:27079152

  7. FANTOM5 CAGE profiles of human and mouse samples.

    PubMed

    Noguchi, Shuhei; Arakawa, Takahiro; Fukuda, Shiro; Furuno, Masaaki; Hasegawa, Akira; Hori, Fumi; Ishikawa-Kato, Sachi; Kaida, Kaoru; Kaiho, Ai; Kanamori-Katayama, Mutsumi; Kawashima, Tsugumi; Kojima, Miki; Kubosaki, Atsutaka; Manabe, Ri-Ichiroh; Murata, Mitsuyoshi; Nagao-Sato, Sayaka; Nakazato, Kenichi; Ninomiya, Noriko; Nishiyori-Sueki, Hiromi; Noma, Shohei; Saijyo, Eri; Saka, Akiko; Sakai, Mizuho; Simon, Christophe; Suzuki, Naoko; Tagami, Michihira; Watanabe, Shoko; Yoshida, Shigehiro; Arner, Peter; Axton, Richard A; Babina, Magda; Baillie, J Kenneth; Barnett, Timothy C; Beckhouse, Anthony G; Blumenthal, Antje; Bodega, Beatrice; Bonetti, Alessandro; Briggs, James; Brombacher, Frank; Carlisle, Ailsa J; Clevers, Hans C; Davis, Carrie A; Detmar, Michael; Dohi, Taeko; Edge, Albert S B; Edinger, Matthias; Ehrlund, Anna; Ekwall, Karl; Endoh, Mitsuhiro; Enomoto, Hideki; Eslami, Afsaneh; Fagiolini, Michela; Fairbairn, Lynsey; Farach-Carson, Mary C; Faulkner, Geoffrey J; Ferrai, Carmelo; Fisher, Malcolm E; Forrester, Lesley M; Fujita, Rie; Furusawa, Jun-Ichi; Geijtenbeek, Teunis B; Gingeras, Thomas; Goldowitz, Daniel; Guhl, Sven; Guler, Reto; Gustincich, Stefano; Ha, Thomas J; Hamaguchi, Masahide; Hara, Mitsuko; Hasegawa, Yuki; Herlyn, Meenhard; Heutink, Peter; Hitchens, Kelly J; Hume, David A; Ikawa, Tomokatsu; Ishizu, Yuri; Kai, Chieko; Kawamoto, Hiroshi; Kawamura, Yuki I; Kempfle, Judith S; Kenna, Tony J; Kere, Juha; Khachigian, Levon M; Kitamura, Toshio; Klein, Sarah; Klinken, S Peter; Knox, Alan J; Kojima, Soichi; Koseki, Haruhiko; Koyasu, Shigeo; Lee, Weonju; Lennartsson, Andreas; Mackay-Sim, Alan; Mejhert, Niklas; Mizuno, Yosuke; Morikawa, Hiromasa; Morimoto, Mitsuru; Moro, Kazuyo; Morris, Kelly J; Motohashi, Hozumi; Mummery, Christine L; Nakachi, Yutaka; Nakahara, Fumio; Nakamura, Toshiyuki; Nakamura, Yukio; Nozaki, Tadasuke; Ogishima, Soichi; Ohkura, Naganari; Ohno, Hiroshi; Ohshima, Mitsuhiro; Okada-Hatakeyama, Mariko; Okazaki, Yasushi; Orlando, Valerio; Ovchinnikov, Dmitry A; Passier, Robert; Patrikakis, Margaret; Pombo, Ana; Pradhan-Bhatt, Swati; Qin, Xian-Yang; Rehli, Michael; Rizzu, Patrizia; Roy, Sugata; Sajantila, Antti; Sakaguchi, Shimon; Sato, Hiroki; Satoh, Hironori; Savvi, Suzana; Saxena, Alka; Schmidl, Christian; Schneider, Claudio; Schulze-Tanzil, Gundula G; Schwegmann, Anita; Sheng, Guojun; Shin, Jay W; Sugiyama, Daisuke; Sugiyama, Takaaki; Summers, Kim M; Takahashi, Naoko; Takai, Jun; Tanaka, Hiroshi; Tatsukawa, Hideki; Tomoiu, Andru; Toyoda, Hiroo; van de Wetering, Marc; van den Berg, Linda M; Verardo, Roberto; Vijayan, Dipti; Wells, Christine A; Winteringham, Louise N; Wolvetang, Ernst; Yamaguchi, Yoko; Yamamoto, Masayuki; Yanagi-Mizuochi, Chiyo; Yoneda, Misako; Yonekura, Yohei; Zhang, Peter G; Zucchelli, Silvia; Abugessaisa, Imad; Arner, Erik; Harshbarger, Jayson; Kondo, Atsushi; Lassmann, Timo; Lizio, Marina; Sahin, Serkan; Sengstag, Thierry; Severin, Jessica; Shimoji, Hisashi; Suzuki, Masanori; Suzuki, Harukazu; Kawai, Jun; Kondo, Naoto; Itoh, Masayoshi; Daub, Carsten O; Kasukawa, Takeya; Kawaji, Hideya; Carninci, Piero; Forrest, Alistair R R; Hayashizaki, Yoshihide

    2017-08-29

    In the FANTOM5 project, transcription initiation events across the human and mouse genomes were mapped at a single base-pair resolution and their frequencies were monitored by CAGE (Cap Analysis of Gene Expression) coupled with single-molecule sequencing. Approximately three thousands of samples, consisting of a variety of primary cells, tissues, cell lines, and time series samples during cell activation and development, were subjected to a uniform pipeline of CAGE data production. The analysis pipeline started by measuring RNA extracts to assess their quality, and continued to CAGE library production by using a robotic or a manual workflow, single molecule sequencing, and computational processing to generate frequencies of transcription initiation. Resulting data represents the consequence of transcriptional regulation in each analyzed state of mammalian cells. Non-overlapping peaks over the CAGE profiles, approximately 200,000 and 150,000 peaks for the human and mouse genomes, were identified and annotated to provide precise location of known promoters as well as novel ones, and to quantify their activities.

  8. FANTOM5 CAGE profiles of human and mouse samples

    PubMed Central

    Noguchi, Shuhei; Arakawa, Takahiro; Fukuda, Shiro; Furuno, Masaaki; Hasegawa, Akira; Hori, Fumi; Ishikawa-Kato, Sachi; Kaida, Kaoru; Kaiho, Ai; Kanamori-Katayama, Mutsumi; Kawashima, Tsugumi; Kojima, Miki; Kubosaki, Atsutaka; Manabe, Ri-ichiroh; Murata, Mitsuyoshi; Nagao-Sato, Sayaka; Nakazato, Kenichi; Ninomiya, Noriko; Nishiyori-Sueki, Hiromi; Noma, Shohei; Saijyo, Eri; Saka, Akiko; Sakai, Mizuho; Simon, Christophe; Suzuki, Naoko; Tagami, Michihira; Watanabe, Shoko; Yoshida, Shigehiro; Arner, Peter; Axton, Richard A.; Babina, Magda; Baillie, J. Kenneth; Barnett, Timothy C.; Beckhouse, Anthony G.; Blumenthal, Antje; Bodega, Beatrice; Bonetti, Alessandro; Briggs, James; Brombacher, Frank; Carlisle, Ailsa J.; Clevers, Hans C.; Davis, Carrie A.; Detmar, Michael; Dohi, Taeko; Edge, Albert S.B.; Edinger, Matthias; Ehrlund, Anna; Ekwall, Karl; Endoh, Mitsuhiro; Enomoto, Hideki; Eslami, Afsaneh; Fagiolini, Michela; Fairbairn, Lynsey; Farach-Carson, Mary C.; Faulkner, Geoffrey J.; Ferrai, Carmelo; Fisher, Malcolm E.; Forrester, Lesley M.; Fujita, Rie; Furusawa, Jun-ichi; Geijtenbeek, Teunis B.; Gingeras, Thomas; Goldowitz, Daniel; Guhl, Sven; Guler, Reto; Gustincich, Stefano; Ha, Thomas J.; Hamaguchi, Masahide; Hara, Mitsuko; Hasegawa, Yuki; Herlyn, Meenhard; Heutink, Peter; Hitchens, Kelly J.; Hume, David A.; Ikawa, Tomokatsu; Ishizu, Yuri; Kai, Chieko; Kawamoto, Hiroshi; Kawamura, Yuki I.; Kempfle, Judith S.; Kenna, Tony J.; Kere, Juha; Khachigian, Levon M.; Kitamura, Toshio; Klein, Sarah; Klinken, S. Peter; Knox, Alan J.; Kojima, Soichi; Koseki, Haruhiko; Koyasu, Shigeo; Lee, Weonju; Lennartsson, Andreas; Mackay-sim, Alan; Mejhert, Niklas; Mizuno, Yosuke; Morikawa, Hiromasa; Morimoto, Mitsuru; Moro, Kazuyo; Morris, Kelly J.; Motohashi, Hozumi; Mummery, Christine L.; Nakachi, Yutaka; Nakahara, Fumio; Nakamura, Toshiyuki; Nakamura, Yukio; Nozaki, Tadasuke; Ogishima, Soichi; Ohkura, Naganari; Ohno, Hiroshi; Ohshima, Mitsuhiro; Okada-Hatakeyama, Mariko; Okazaki, Yasushi; Orlando, Valerio; Ovchinnikov, Dmitry A.; Passier, Robert; Patrikakis, Margaret; Pombo, Ana; Pradhan-Bhatt, Swati; Qin, Xian-Yang; Rehli, Michael; Rizzu, Patrizia; Roy, Sugata; Sajantila, Antti; Sakaguchi, Shimon; Sato, Hiroki; Satoh, Hironori; Savvi, Suzana; Saxena, Alka; Schmidl, Christian; Schneider, Claudio; Schulze-Tanzil, Gundula G.; Schwegmann, Anita; Sheng, Guojun; Shin, Jay W.; Sugiyama, Daisuke; Sugiyama, Takaaki; Summers, Kim M.; Takahashi, Naoko; Takai, Jun; Tanaka, Hiroshi; Tatsukawa, Hideki; Tomoiu, Andru; Toyoda, Hiroo; van de Wetering, Marc; van den Berg, Linda M.; Verardo, Roberto; Vijayan, Dipti; Wells, Christine A.; Winteringham, Louise N.; Wolvetang, Ernst; Yamaguchi, Yoko; Yamamoto, Masayuki; Yanagi-Mizuochi, Chiyo; Yoneda, Misako; Yonekura, Yohei; Zhang, Peter G.; Zucchelli, Silvia; Abugessaisa, Imad; Arner, Erik; Harshbarger, Jayson; Kondo, Atsushi; Lassmann, Timo; Lizio, Marina; Sahin, Serkan; Sengstag, Thierry; Severin, Jessica; Shimoji, Hisashi; Suzuki, Masanori; Suzuki, Harukazu; Kawai, Jun; Kondo, Naoto; Itoh, Masayoshi; Daub, Carsten O.; Kasukawa, Takeya; Kawaji, Hideya; Carninci, Piero; Forrest, Alistair R.R.; Hayashizaki, Yoshihide

    2017-01-01

    In the FANTOM5 project, transcription initiation events across the human and mouse genomes were mapped at a single base-pair resolution and their frequencies were monitored by CAGE (Cap Analysis of Gene Expression) coupled with single-molecule sequencing. Approximately three thousands of samples, consisting of a variety of primary cells, tissues, cell lines, and time series samples during cell activation and development, were subjected to a uniform pipeline of CAGE data production. The analysis pipeline started by measuring RNA extracts to assess their quality, and continued to CAGE library production by using a robotic or a manual workflow, single molecule sequencing, and computational processing to generate frequencies of transcription initiation. Resulting data represents the consequence of transcriptional regulation in each analyzed state of mammalian cells. Non-overlapping peaks over the CAGE profiles, approximately 200,000 and 150,000 peaks for the human and mouse genomes, were identified and annotated to provide precise location of known promoters as well as novel ones, and to quantify their activities. PMID:28850106

  9. Using Multiorder Time-Correlation Functions (TCFs) To Elucidate Biomolecular Reaction Pathways from Microsecond Single-Molecule Fluorescence Experiments.

    PubMed

    Phelps, Carey; Israels, Brett; Marsh, Morgan C; von Hippel, Peter H; Marcus, Andrew H

    2016-12-29

    Recent advances in single-molecule fluorescence imaging have made it possible to perform measurements on microsecond time scales. Such experiments have the potential to reveal detailed information about the conformational changes in biological macromolecules, including the reaction pathways and dynamics of the rearrangements involved in processes, such as sequence-specific DNA "breathing" and the assembly of protein-nucleic acid complexes. Because microsecond-resolved single-molecule trajectories often involve "sparse" data, that is, they contain relatively few data points per unit time, they cannot be easily analyzed using the standard protocols that were developed for single-molecule experiments carried out with tens-of-millisecond time resolution and high "data density." Here, we describe a generalized approach, based on time-correlation functions, to obtain kinetic information from microsecond-resolved single-molecule fluorescence measurements. This approach can be used to identify short-lived intermediates that lie on reaction pathways connecting relatively long-lived reactant and product states. As a concrete illustration of the potential of this methodology for analyzing specific macromolecular systems, we accompany the theoretical presentation with the description of a specific biologically relevant example drawn from studies of reaction mechanisms of the assembly of the single-stranded DNA binding protein of the T4 bacteriophage replication complex onto a model DNA replication fork.

  10. Complete plastid genome sequence of goosegrass (Eleusine indica) and comparison with other Poaceae.

    PubMed

    Zhang, Hui; Hall, Nathan; McElroy, J Scott; Lowe, Elijah K; Goertzen, Leslie R

    2017-02-05

    Eleusine indica, also known as goosegrass, is a serious weed in at least 42 countries. In this paper we report the complete plastid genome sequence of goosegrass obtained by de novo assembly of paired-end and mate-paired reads generated by Illumina sequencing of total genomic DNA. The goosegrass plastome is a circular molecule of 135,151bp in length, consisting of two single-copy regions separated by a pair of inverted repeats (IRs) of 20,919 bases. The large (LSC) and the small (SSC) single-copy regions span 80,667 bases and 12,646 bases, respectively. The plastome of goosegrass has 38.19% GC content and includes 108 unique genes, of which 76 are protein-coding, 28 are transfer RNA, and 4 are ribosomal RNA. The goosegrass plastome sequence was compared to eight other species of Poaceae. Although generally conserved with respect to Poaceae, this genomic resource will be useful for evolutionary studies within this weed species and the genus Eleusine. Copyright © 2016. Published by Elsevier B.V.

  11. Improving the representation of peptide-like inhibitor and antibiotic molecules in the Protein Data Bank

    PubMed Central

    Dutta, Shuchismita; Dimitropoulos, Dimitris; Feng, Zukang; Persikova, Irina; Sen, Sanchayita; Shao, Chenghua; Westbrook, John; Young, Jasmine; Zhuravleva, Marina A; Kleywegt, Gerard J; Berman, Helen M

    2014-01-01

    With the accumulation of a large number and variety of molecules in the Protein Data Bank (PDB) comes the need on occasion to review and improve their representation. The Worldwide PDB (wwPDB) partners have periodically updated various aspects of structural data representation to improve the integrity and consistency of the archive. The remediation effort described here was focused on improving the representation of peptide-like inhibitor and antibiotic molecules so that they can be easily identified and analyzed. Peptide-like inhibitors or antibiotics were identified in over 1000 PDB entries, systematically reviewed and represented either as peptides with polymer sequence or as single components. For the majority of the single-component molecules, their peptide-like composition was captured in a new representation, called the subcomponent sequence. A novel concept called “group” was developed for representing complex peptide-like antibiotics and inhibitors that are composed of multiple polymer and nonpolymer components. In addition, a reference dictionary was developed with detailed information about these peptide-like molecules to aid in their annotation, identification and analysis. Based on the experience gained in this remediation, guidelines, procedures, and tools were developed to annotate new depositions containing peptide-like inhibitors and antibiotics accurately and consistently. © 2013 Wiley Periodicals, Inc. Biopolymers 101: 659–668, 2014. PMID:24173824

  12. Complete Genome Sequence of ER2796, a DNA Methyltransferase-Deficient Strain of Escherichia coli K-12.

    PubMed

    Anton, Brian P; Mongodin, Emmanuel F; Agrawal, Sonia; Fomenkov, Alexey; Byrd, Devon R; Roberts, Richard J; Raleigh, Elisabeth A

    2015-01-01

    We report the complete sequence of ER2796, a laboratory strain of Escherichia coli K-12 that is completely defective in DNA methylation. Because of its lack of any native methylation, it is extremely useful as a host into which heterologous DNA methyltransferase genes can be cloned and the recognition sequences of their products deduced by Pacific Biosciences Single-Molecule Real Time (SMRT) sequencing. The genome was itself sequenced from a long-insert library using the SMRT platform, resulting in a single closed contig devoid of methylated bases. Comparison with K-12 MG1655, the first E. coli K-12 strain to be sequenced, shows an essentially co-linear relationship with no major rearrangements despite many generations of laboratory manipulation. The comparison revealed a total of 41 insertions and deletions, and 228 single base pair substitutions. In addition, the long-read approach facilitated the surprising discovery of four gene conversion events, three involving rRNA operons and one between two cryptic prophages. Such events thus contribute both to genomic homogenization and to bacteriophage diversification. As one of relatively few laboratory strains of E. coli to be sequenced, the genome also reveals the sequence changes underlying a number of classical mutant alleles including those affecting the various native DNA methylation systems.

  13. Complete Genome Sequence of ER2796, a DNA Methyltransferase-Deficient Strain of Escherichia coli K-12

    PubMed Central

    Anton, Brian P.; Mongodin, Emmanuel F.; Agrawal, Sonia; Fomenkov, Alexey; Byrd, Devon R.; Roberts, Richard J.; Raleigh, Elisabeth A.

    2015-01-01

    We report the complete sequence of ER2796, a laboratory strain of Escherichia coli K-12 that is completely defective in DNA methylation. Because of its lack of any native methylation, it is extremely useful as a host into which heterologous DNA methyltransferase genes can be cloned and the recognition sequences of their products deduced by Pacific Biosciences Single-Molecule Real Time (SMRT) sequencing. The genome was itself sequenced from a long-insert library using the SMRT platform, resulting in a single closed contig devoid of methylated bases. Comparison with K-12 MG1655, the first E. coli K-12 strain to be sequenced, shows an essentially co-linear relationship with no major rearrangements despite many generations of laboratory manipulation. The comparison revealed a total of 41 insertions and deletions, and 228 single base pair substitutions. In addition, the long-read approach facilitated the surprising discovery of four gene conversion events, three involving rRNA operons and one between two cryptic prophages. Such events thus contribute both to genomic homogenization and to bacteriophage diversification. As one of relatively few laboratory strains of E. coli to be sequenced, the genome also reveals the sequence changes underlying a number of classical mutant alleles including those affecting the various native DNA methylation systems. PMID:26010885

  14. MethylViewer: computational analysis and editing for bisulfite sequencing and methyltransferase accessibility protocol for individual templates (MAPit) projects.

    PubMed

    Pardo, Carolina E; Carr, Ian M; Hoffman, Christopher J; Darst, Russell P; Markham, Alexander F; Bonthron, David T; Kladde, Michael P

    2011-01-01

    Bisulfite sequencing is a widely-used technique for examining cytosine DNA methylation at nucleotide resolution along single DNA strands. Probing with cytosine DNA methyltransferases followed by bisulfite sequencing (MAPit) is an effective technique for mapping protein-DNA interactions. Here, MAPit methylation footprinting with M.CviPI, a GC methyltransferase we previously cloned and characterized, was used to probe hMLH1 chromatin in HCT116 and RKO colorectal cancer cells. Because M.CviPI-probed samples contain both CG and GC methylation, we developed a versatile, visually-intuitive program, called MethylViewer, for evaluating the bisulfite sequencing results. Uniquely, MethylViewer can simultaneously query cytosine methylation status in bisulfite-converted sequences at as many as four different user-defined motifs, e.g. CG, GC, etc., including motifs with degenerate bases. Data can also be exported for statistical analysis and as publication-quality images. Analysis of hMLH1 MAPit data with MethylViewer showed that endogenous CG methylation and accessible GC sites were both mapped on single molecules at high resolution. Disruption of positioned nucleosomes on single molecules of the PHO5 promoter was detected in budding yeast using M.CviPII, increasing the number of enzymes available for probing protein-DNA interactions. MethylViewer provides an integrated solution for primer design and rapid, accurate and detailed analysis of bisulfite sequencing or MAPit datasets from virtually any biological or biochemical system.

  15. Multiplex single-molecule interaction profiling of DNA barcoded proteins

    PubMed Central

    Gu, Liangcai; Li, Chao; Aach, John; Hill, David E.; Vidal, Marc; Church, George M.

    2014-01-01

    In contrast with advances in massively parallel DNA sequencing1, high-throughput protein analyses2-4 are often limited by ensemble measurements, individual analyte purification and hence compromised quality and cost-effectiveness. Single-molecule (SM) protein detection achieved using optical methods5 is limited by the number of spectrally nonoverlapping chromophores. Here, we introduce a single molecular interaction-sequencing (SMI-Seq) technology for parallel protein interaction profiling leveraging SM advantages. DNA barcodes are attached to proteins collectively via ribosome display6 or individually via enzymatic conjugation. Barcoded proteins are assayed en masse in aqueous solution and subsequently immobilized in a polyacrylamide (PAA) thin film to construct a random SM array, where barcoding DNAs are amplified into in situ polymerase colonies (polonies)7 and analyzed by DNA sequencing. This method allows precise quantification of various proteins with a theoretical maximum array density of over one million polonies per square millimeter. Furthermore, protein interactions can be measured based on the statistics of colocalized polonies arising from barcoding DNAs of interacting proteins. Two demanding applications, G-protein coupled receptor (GPCR) and antibody binding profiling, were demonstrated. SMI-Seq enables “library vs. library” screening in a one-pot assay, simultaneously interrogating molecular binding affinity and specificity. PMID:25252978

  16. Detecting and Analyzing Genetic Recombination Using RDP4.

    PubMed

    Martin, Darren P; Murrell, Ben; Khoosal, Arjun; Muhire, Brejnev

    2017-01-01

    Recombination between nucleotide sequences is a major process influencing the evolution of most species on Earth. The evolutionary value of recombination has been widely debated and so too has its influence on evolutionary analysis methods that assume nucleotide sequences replicate without recombining. When nucleic acids recombine, the evolution of the daughter or recombinant molecule cannot be accurately described by a single phylogeny. This simple fact can seriously undermine the accuracy of any phylogenetics-based analytical approach which assumes that the evolutionary history of a set of recombining sequences can be adequately described by a single phylogenetic tree. There are presently a large number of available methods and associated computer programs for analyzing and characterizing recombination in various classes of nucleotide sequence datasets. Here we examine the use of some of these methods to derive and test recombination hypotheses using multiple sequence alignments.

  17. Subangstrom Measurements of Enzyme Function Using a Biological Nanopore, SPRNT.

    PubMed

    Laszlo, A H; Derrrington, I M; Gundlach, J H

    2017-01-01

    Nanopores are emerging as new single-molecule tools in the study of enzymes. Based on the progress in nanopore sequencing of DNA, a tool called Single-molecule Picometer Resolution Nanopore Tweezers (SPRNT) was developed to measure the movement of enzymes along DNA in real time. In this new method, an enzyme is loaded onto a DNA (or RNA) molecule. A single-stranded DNA end of this complex is drawn into a nanopore by an electrostatic potential that is applied across the pore. The single-stranded DNA passes through the pore's constriction until the enzyme comes into contact with the pore. Further progression of the DNA through the pore is then controlled by the enzyme. An ion current that flows through the pore's constriction is modulated by the DNA in the constriction. Analysis of ion current changes reveals the advance of the DNA with high spatiotemporal precision, thereby providing a real-time record of the enzyme's activity. Using an engineered version of the protein nanopore MspA, SPRNT has spatial resolution as small as 40pm at millisecond timescales, while simultaneously providing the DNA's sequence within the enzyme. In this chapter, SPRNT is introduced and its extraordinary potential is exemplified using the helicase Hel308. Two distinct substates are observed for each one-nucleotide advance; one of these about half-nucleotide long steps is ATP dependent and the other is ATP independent. The spatiotemporal resolution of this low-cost single-molecule technique lifts the study of enzymes to a new level of precision, enabling exploration of hitherto unobservable enzyme dynamics in real time. © 2017 Elsevier Inc. All rights reserved.

  18. Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    VanBuren, Robert; Bryant, Doug; Edger, Patrick P.

    Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly1. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetiummore » genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.« less

  19. Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum

    DOE PAGES

    VanBuren, Robert; Bryant, Doug; Edger, Patrick P.; ...

    2015-11-11

    Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly1. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetiummore » genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.« less

  20. DNA confinement in nanochannels: physics and biological applications

    NASA Astrophysics Data System (ADS)

    Reisner, Walter; Pedersen, Jonas N.; Austin, Robert H.

    2012-10-01

    DNA is the central storage molecule of genetic information in the cell, and reading that information is a central problem in biology. While sequencing technology has made enormous advances over the past decade, there is growing interest in platforms that can readout genetic information directly from long single DNA molecules, with the ultimate goal of single-cell, single-genome analysis. Such a capability would obviate the need for ensemble averaging over heterogeneous cellular populations and eliminate uncertainties introduced by cloning and molecular amplification steps (thus enabling direct assessment of the genome in its native state). In this review, we will discuss how the information contained in genomic-length single DNA molecules can be accessed via physical confinement in nanochannels. Due to self-avoidance interactions, DNA molecules will stretch out when confined in nanochannels, creating a linear unscrolling of the genome along the channel for analysis. We will first review the fundamental physics of DNA nanochannel confinement—including the effect of varying ionic strength—and then discuss recent applications of these systems to genomic mapping. Apart from the intense biological interest in extracting linear sequence information from elongated DNA molecules, from a physics view these systems are fascinating as they enable probing of single-molecule conformation in environments with dimensions that intersect key physical length-scales in the 1 nm to 100 µm range.

  1. DNA confinement in nanochannels: physics and biological applications.

    PubMed

    Reisner, Walter; Pedersen, Jonas N; Austin, Robert H

    2012-10-01

    DNA is the central storage molecule of genetic information in the cell, and reading that information is a central problem in biology. While sequencing technology has made enormous advances over the past decade, there is growing interest in platforms that can readout genetic information directly from long single DNA molecules, with the ultimate goal of single-cell, single-genome analysis. Such a capability would obviate the need for ensemble averaging over heterogeneous cellular populations and eliminate uncertainties introduced by cloning and molecular amplification steps (thus enabling direct assessment of the genome in its native state). In this review, we will discuss how the information contained in genomic-length single DNA molecules can be accessed via physical confinement in nanochannels. Due to self-avoidance interactions, DNA molecules will stretch out when confined in nanochannels, creating a linear unscrolling of the genome along the channel for analysis. We will first review the fundamental physics of DNA nanochannel confinement--including the effect of varying ionic strength--and then discuss recent applications of these systems to genomic mapping. Apart from the intense biological interest in extracting linear sequence information from elongated DNA molecules, from a physics view these systems are fascinating as they enable probing of single-molecule conformation in environments with dimensions that intersect key physical length-scales in the 1 nm to 100 µm range.

  2. Schemes of detecting nuclear spin correlations by dynamical decoupling based quantum sensing

    NASA Astrophysics Data System (ADS)

    Ma, Wen-Long Ma; Liu, Ren-Bao

    Single-molecule sensitivity of nuclear magnetic resonance (NMR) and angstrom resolution of magnetic resonance imaging (MRI) are the highest challenges in magnetic microscopy. Recent development in dynamical decoupling (DD) enhanced diamond quantum sensing has enabled NMR of single nuclear spins and nanoscale NMR. Similar to conventional NMR and MRI, current DD-based quantum sensing utilizes the frequency fingerprints of target nuclear spins. Such schemes, however, cannot resolve different nuclear spins that have the same noise frequency or differentiate different types of correlations in nuclear spin clusters. Here we show that the first limitation can be overcome by using wavefunction fingerprints of target nuclear spins, which is much more sensitive than the ''frequency fingerprints'' to weak hyperfine interaction between the targets and a sensor, while the second one can be overcome by a new design of two-dimensional DD sequences composed of two sets of periodic DD sequences with different periods, which can be independently set to match two different transition frequencies. Our schemes not only offer an approach to breaking the resolution limit set by ''frequency gradients'' in conventional MRI, but also provide a standard approach to correlation spectroscopy for single-molecule NMR.

  3. Single-molecule DNA detection with an engineered MspA protein nanopore

    PubMed Central

    Butler, Tom Z.; Pavlenok, Mikhail; Derrington, Ian M.; Niederweis, Michael; Gundlach, Jens H.

    2008-01-01

    Nanopores hold great promise as single-molecule analytical devices and biophysical model systems because the ionic current blockades they produce contain information about the identity, concentration, structure, and dynamics of target molecules. The porin MspA of Mycobacterium smegmatis has remarkable stability against environmental stresses and can be rationally modified based on its crystal structure. Further, MspA has a short and narrow channel constriction that is promising for DNA sequencing because it may enable improved characterization of short segments of a ssDNA molecule that is threaded through the pore. By eliminating the negative charge in the channel constriction, we designed and constructed an MspA mutant capable of electronically detecting and characterizing single molecules of ssDNA as they are electrophoretically driven through the pore. A second mutant with additional exchanges of negatively-charged residues for positively-charged residues in the vestibule region exhibited a factor of ≈20 higher interaction rates, required only half as much voltage to observe interaction, and allowed ssDNA to reside in the vestibule ≈100 times longer than the first mutant. Our results introduce MspA as a nanopore for nucleic acid analysis and highlight its potential as an engineerable platform for single-molecule detection and characterization applications. PMID:19098105

  4. The sequence of sequencers: The history of sequencing DNA.

    PubMed

    Heather, James M; Chain, Benjamin

    2016-01-01

    Determining the order of nucleic acid residues in biological samples is an integral component of a wide variety of research applications. Over the last fifty years large numbers of researchers have applied themselves to the production of techniques and technologies to facilitate this feat, sequencing DNA and RNA molecules. This time-scale has witnessed tremendous changes, moving from sequencing short oligonucleotides to millions of bases, from struggling towards the deduction of the coding sequence of a single gene to rapid and widely available whole genome sequencing. This article traverses those years, iterating through the different generations of sequencing technology, highlighting some of the key discoveries, researchers, and sequences along the way. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  5. Rare Cell Detection by Single-Cell RNA Sequencing as Guided by Single-Molecule RNA FISH.

    PubMed

    Torre, Eduardo; Dueck, Hannah; Shaffer, Sydney; Gospocic, Janko; Gupte, Rohit; Bonasio, Roberto; Kim, Junhyong; Murray, John; Raj, Arjun

    2018-02-28

    Although single-cell RNA sequencing can reliably detect large-scale transcriptional programs, it is unclear whether it accurately captures the behavior of individual genes, especially those that express only in rare cells. Here, we use single-molecule RNA fluorescence in situ hybridization as a gold standard to assess trade-offs in single-cell RNA-sequencing data for detecting rare cell expression variability. We quantified the gene expression distribution for 26 genes that range from ubiquitous to rarely expressed and found that the correspondence between estimates across platforms improved with both transcriptome coverage and increased number of cells analyzed. Further, by characterizing the trade-off between transcriptome coverage and number of cells analyzed, we show that when the number of genes required to answer a given biological question is small, then greater transcriptome coverage is more important than analyzing large numbers of cells. More generally, our report provides guidelines for selecting quality thresholds for single-cell RNA-sequencing experiments aimed at rare cell analyses. Copyright © 2018 Elsevier Inc. All rights reserved.

  6. SINGLE MOLECULE APPROACHES TO BIOLOGY, 2010 GORDON RESEARCH CONFERENCE, JUNE 27-JULY 2, 2010, ITALY

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Professor William Moerner

    2010-07-09

    The 2010 Gordon Conference on Single-Molecule Approaches to Biology focuses on cutting-edge research in single-molecule science. Tremendous technical developments have made it possible to detect, identify, track, and manipulate single biomolecules in an ambient environment or even in a live cell. Single-molecule approaches have changed the way many biological problems are addressed, and new knowledge derived from these approaches continues to emerge. The ability of single-molecule approaches to avoid ensemble averaging and to capture transient intermediates and heterogeneous behavior renders them particularly powerful in elucidating mechanisms of biomolecular machines: what they do, how they work individually, how they work together,more » and finally, how they work inside live cells. The burgeoning use of single-molecule methods to elucidate biological problems is a highly multidisciplinary pursuit, involving both force- and fluorescence-based methods, the most up-to-date advances in microscopy, innovative biological and chemical approaches, and nanotechnology tools. This conference seeks to bring together top experts in molecular and cell biology with innovators in the measurement and manipulation of single molecules, and will provide opportunities for junior scientists and graduate students to present their work in poster format and to exchange ideas with leaders in the field. A number of excellent poster presenters will be selected for short oral talks. Topics as diverse as single-molecule sequencing, DNA/RNA/protein interactions, folding machines, cellular biophysics, synthetic biology and bioengineering, force spectroscopy, new method developments, superresolution imaging in cells, and novel probes for single-molecule imaging will be on the program. Additionally, the collegial atmosphere of this Conference, with programmed discussion sessions as well as opportunities for informal gatherings in the afternoons and evenings in the beauty of the Il Ciocco site in Tuscany, provides an avenue for scientists from different disciplines to interact and brainstorm and promotes cross-disciplinary collaborations directed toward compelling biological problems.« less

  7. Diagnostic Applications of Next Generation Sequencing in Immunogenetics and Molecular Oncology

    PubMed Central

    Grumbt, Barbara; Eck, Sebastian H.; Hinrichsen, Tanja; Hirv, Kaimo

    2013-01-01

    Summary With the introduction of the next generation sequencing (NGS) technologies, remarkable new diagnostic applications have been established in daily routine. Implementation of NGS is challenging in clinical diagnostics, but definite advantages and new diagnostic possibilities make the switch to the technology inevitable. In addition to the higher sequencing capacity, clonal sequencing of single molecules, multiplexing of samples, higher diagnostic sensitivity, workflow miniaturization, and cost benefits are some of the valuable features of the technology. After the recent advances, NGS emerged as a proven alternative for classical Sanger sequencing in the typing of human leukocyte antigens (HLA). By virtue of the clonal amplification of single DNA molecules ambiguous typing results can be avoided. Simultaneously, a higher sample throughput can be achieved by tagging of DNA molecules with multiplex identifiers and pooling of PCR products before sequencing. In our experience, up to 380 samples can be typed for HLA-A, -B, and -DRB1 in high-resolution during every sequencing run. In molecular oncology, NGS shows a markedly increased sensitivity in comparison to the conventional Sanger sequencing and is developing to the standard diagnostic tool in detection of somatic mutations in cancer cells with great impact on personalized treatment of patients. PMID:23922545

  8. Single-cell template strand sequencing by Strand-seq enables the characterization of individual homologs.

    PubMed

    Sanders, Ashley D; Falconer, Ester; Hills, Mark; Spierings, Diana C J; Lansdorp, Peter M

    2017-06-01

    The ability to distinguish between genome sequences of homologous chromosomes in single cells is important for studies of copy-neutral genomic rearrangements (such as inversions and translocations), building chromosome-length haplotypes, refining genome assemblies, mapping sister chromatid exchange events and exploring cellular heterogeneity. Strand-seq is a single-cell sequencing technology that resolves the individual homologs within a cell by restricting sequence analysis to the DNA template strands used during DNA replication. This protocol, which takes up to 4 d to complete, relies on the directionality of DNA, in which each single strand of a DNA molecule is distinguished based on its 5'-3' orientation. Culturing cells in a thymidine analog for one round of cell division labels nascent DNA strands, allowing for their selective removal during genomic library construction. To preserve directionality of template strands, genomic preamplification is bypassed and labeled nascent strands are nicked and not amplified during library preparation. Each single-cell library is multiplexed for pooling and sequencing, and the resulting sequence data are aligned, mapping to either the minus or plus strand of the reference genome, to assign template strand states for each chromosome in the cell. The major adaptations to conventional single-cell sequencing protocols include harvesting of daughter cells after a single round of BrdU incorporation, bypassing of whole-genome amplification, and removal of the BrdU + strand during Strand-seq library preparation. By sequencing just template strands, the structure and identity of each homolog are preserved.

  9. Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments.

    PubMed

    Dabney, Jesse; Knapp, Michael; Glocke, Isabelle; Gansauge, Marie-Theres; Weihmann, Antje; Nickel, Birgit; Valdiosera, Cristina; García, Nuria; Pääbo, Svante; Arsuaga, Juan-Luis; Meyer, Matthias

    2013-09-24

    Although an inverse relationship is expected in ancient DNA samples between the number of surviving DNA fragments and their length, ancient DNA sequencing libraries are strikingly deficient in molecules shorter than 40 bp. We find that a loss of short molecules can occur during DNA extraction and present an improved silica-based extraction protocol that enables their efficient retrieval. In combination with single-stranded DNA library preparation, this method enabled us to reconstruct the mitochondrial genome sequence from a Middle Pleistocene cave bear (Ursus deningeri) bone excavated at Sima de los Huesos in the Sierra de Atapuerca, Spain. Phylogenetic reconstructions indicate that the U. deningeri sequence forms an early diverging sister lineage to all Western European Late Pleistocene cave bears. Our results prove that authentic ancient DNA can be preserved for hundreds of thousand years outside of permafrost. Moreover, the techniques presented enable the retrieval of phylogenetically informative sequences from samples in which virtually all DNA is diminished to fragments shorter than 50 bp.

  10. Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments

    PubMed Central

    Dabney, Jesse; Knapp, Michael; Glocke, Isabelle; Gansauge, Marie-Theres; Weihmann, Antje; Nickel, Birgit; Valdiosera, Cristina; García, Nuria; Pääbo, Svante; Arsuaga, Juan-Luis; Meyer, Matthias

    2013-01-01

    Although an inverse relationship is expected in ancient DNA samples between the number of surviving DNA fragments and their length, ancient DNA sequencing libraries are strikingly deficient in molecules shorter than 40 bp. We find that a loss of short molecules can occur during DNA extraction and present an improved silica-based extraction protocol that enables their efficient retrieval. In combination with single-stranded DNA library preparation, this method enabled us to reconstruct the mitochondrial genome sequence from a Middle Pleistocene cave bear (Ursus deningeri) bone excavated at Sima de los Huesos in the Sierra de Atapuerca, Spain. Phylogenetic reconstructions indicate that the U. deningeri sequence forms an early diverging sister lineage to all Western European Late Pleistocene cave bears. Our results prove that authentic ancient DNA can be preserved for hundreds of thousand years outside of permafrost. Moreover, the techniques presented enable the retrieval of phylogenetically informative sequences from samples in which virtually all DNA is diminished to fragments shorter than 50 bp. PMID:24019490

  11. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.

    PubMed

    Chin, Chen-Shan; Alexander, David H; Marks, Patrick; Klammer, Aaron A; Drake, James; Heiner, Cheryl; Clum, Alicia; Copeland, Alex; Huddleston, John; Eichler, Evan E; Turner, Stephen W; Korlach, Jonas

    2013-06-01

    We present a hierarchical genome-assembly process (HGAP) for high-quality de novo microbial genome assemblies using only a single, long-insert shotgun DNA library in conjunction with Single Molecule, Real-Time (SMRT) DNA sequencing. Our method uses the longest reads as seeds to recruit all other reads for construction of highly accurate preassembled reads through a directed acyclic graph-based consensus procedure, which we follow with assembly using off-the-shelf long-read assemblers. In contrast to hybrid approaches, HGAP does not require highly accurate raw reads for error correction. We demonstrate efficient genome assembly for several microorganisms using as few as three SMRT Cell zero-mode waveguide arrays of sequencing and for BACs using just one SMRT Cell. Long repeat regions can be successfully resolved with this workflow. We also describe a consensus algorithm that incorporates SMRT sequencing primary quality values to produce de novo genome sequence exceeding 99.999% accuracy.

  12. Reducing assembly complexity of microbial genomes with single-molecule sequencing

    USDA-ARS?s Scientific Manuscript database

    Genome assembly algorithms cannot fully reconstruct microbial chromosomes from the DNA reads output by first or second-generation sequencing instruments. Therefore, most genomes are left unfinished due to the significant resources required to manually close gaps left in the draft assemblies. Single-...

  13. Conformational Smear Characterization and Binning of Single-Molecule Conductance Measurements for Enhanced Molecular Recognition.

    PubMed

    Korshoj, Lee E; Afsari, Sepideh; Chatterjee, Anushree; Nagpal, Prashant

    2017-11-01

    Electronic conduction or charge transport through single molecules depends primarily on molecular structure and anchoring groups and forms the basis for a wide range of studies from molecular electronics to DNA sequencing. Several high-throughput nanoelectronic methods such as mechanical break junctions, nanopores, conductive atomic force microscopy, scanning tunneling break junctions, and static nanoscale electrodes are often used for measuring single-molecule conductance. In these measurements, "smearing" due to conformational changes and other entropic factors leads to large variances in the observed molecular conductance, especially in individual measurements. Here, we show a method for characterizing smear in single-molecule conductance measurements and demonstrate how binning measurements according to smear can significantly enhance the use of individual conductance measurements for molecular recognition. Using quantum point contact measurements on single nucleotides within DNA macromolecules, we demonstrate that the distance over which molecular junctions are maintained is a measure of smear, and the resulting variance in unbiased single measurements depends on this smear parameter. Our ability to identify individual DNA nucleotides at 20× coverage increases from 81.3% accuracy without smear analysis to 93.9% with smear characterization and binning (SCRIB). Furthermore, merely 7 conductance measurements (7× coverage) are needed to achieve 97.8% accuracy for DNA nucleotide recognition when only low molecular smear measurements are used, which represents a significant improvement over contemporary sequencing methods. These results have important implications in a broad range of molecular electronics applications from designing robust molecular switches to nanoelectronic DNA sequencing.

  14. Single-molecule DNA unzipping reveals asymmetric modulation of a transcription factor by its binding site sequence and context

    PubMed Central

    Rudnizky, Sergei; Khamis, Hadeel; Malik, Omri; Squires, Allison H; Meller, Amit; Melamed, Philippa

    2018-01-01

    Abstract Most functional transcription factor (TF) binding sites deviate from their ‘consensus’ recognition motif, although their sites and flanking sequences are often conserved across species. Here, we used single-molecule DNA unzipping with optical tweezers to study how Egr-1, a TF harboring three zinc fingers (ZF1, ZF2 and ZF3), is modulated by the sequence and context of its functional sites in the Lhb gene promoter. We find that both the core 9 bp bound to Egr-1 in each of the sites, and the base pairs flanking them, modulate the affinity and structure of the protein–DNA complex. The effect of the flanking sequences is asymmetric, with a stronger effect for the sequence flanking ZF3. Characterization of the dissociation time of Egr-1 revealed that a local, mechanical perturbation of the interactions of ZF3 destabilizes the complex more effectively than a perturbation of the ZF1 interactions. Our results reveal a novel role for ZF3 in the interaction of Egr-1 with other proteins and the DNA, providing insight on the regulation of Lhb and other genes by Egr-1. Moreover, our findings reveal the potential of small changes in DNA sequence to alter transcriptional regulation, and may shed light on the organization of regulatory elements at promoters. PMID:29253225

  15. Single-Molecule Denaturation Mapping of DNA in Nanofluidic Channels

    NASA Astrophysics Data System (ADS)

    Reisner, Walter; Larsen, Niels; Silahtaroglu, Asli; Kristensen, Anders; Tommerup, Niels; Tegenfeldt, Jonas O.; Flyvbjerg, Henrik

    2010-03-01

    Nanochannel based DNA stretching can serve as a platform for a new optical mapping technique based on measuring the pattern of partial melting along the extended molecules. We partially melt DNA extended in nanofluidic channels via a combination of local heating and added chemical denaturants. The melted molecules, imaged via a standard fluorescence videomicroscopy setup, exhibit a nonuniform fluorescence profile corresponding to a series of local dips and peaks in the intensity trace along the stretched molecule. We show that this barcode is consistent with the presence of locally melted regions along the molecule and can be explained by calculations of sequence-dependent melting probability. Specifically, we obtain experimental melting profiles for T4, T7, lambda-phage and bacterial artificial chromosome DNA (from human chromosome 12) and compare these profiles to theory. In addition, we demonstrate that the BAC melting profile can be used to align the BAC to its correct position on chromosome 12.

  16. Methods And Devices For Characterizing Duplex Nucleic Acid Molecules

    DOEpatents

    Akeson, Mark; Vercoutere, Wenonah; Haussler, David; Winters-Hilt, Stephen

    2005-08-30

    Methods and devices are provided for characterizing a duplex nucleic acid, e.g., a duplex DNA molecule. In the subject methods, a fluid conducting medium that includes a duplex nucleic acid molecule is contacted with a nanopore under the influence of an applied electric field and the resulting changes in current through the nanopore caused by the duplex nucleic acid molecule are monitored. The observed changes in current through the nanopore are then employed as a set of data values to characterize the duplex nucleic acid, where the set of data values may be employed in raw form or manipulated, e.g., into a current blockade profile. Also provided are nanopore devices for practicing the subject methods, where the subject nanopore devices are characterized by the presence of an algorithm which directs a processing means to employ monitored changes in current through a nanopore to characterize a duplex nucleic acid molecule responsible for the current changes. The subject methods and devices find use in a variety of applications, including, among other applications, the identification of an analyte duplex DNA molecule in a sample, the specific base sequence at a single nulceotide polymorphism (SNP), and the sequencing of duplex DNA molecules.

  17. Slowing down single-molecule trafficking through a protein nanopore reveals intermediates for peptide translocation

    NASA Astrophysics Data System (ADS)

    Mereuta, Loredana; Roy, Mahua; Asandei, Alina; Lee, Jong Kook; Park, Yoonkyung; Andricioaei, Ioan; Luchian, Tudor

    2014-01-01

    The microscopic details of how peptides translocate one at a time through nanopores are crucial determinants for transport through membrane pores and important in developing nano-technologies. To date, the translocation process has been too fast relative to the resolution of the single molecule techniques that sought to detect its milestones. Using pH-tuned single-molecule electrophysiology and molecular dynamics simulations, we demonstrate how peptide passage through the α-hemolysin protein can be sufficiently slowed down to observe intermediate single-peptide sub-states associated to distinct structural milestones along the pore, and how to control residence time, direction and the sequence of spatio-temporal state-to-state dynamics of a single peptide. Molecular dynamics simulations of peptide translocation reveal the time- dependent ordering of intermediate structures of the translocating peptide inside the pore at atomic resolution. Calculations of the expected current ratios of the different pore-blocking microstates and their time sequencing are in accord with the recorded current traces.

  18. Single Molecule Spectroscopy of Amino Acids and Peptides by Recognition Tunneling

    PubMed Central

    Zhao, Yanan; Ashcroft, Brian; Zhang, Peiming; Liu, Hao; Sen, Suman; Song, Weisi; Im, JongOne; Gyarfas, Brett; Manna, Saikat; Biswas, Sovan; Borges, Chad; Lindsay, Stuart

    2014-01-01

    The human proteome has millions of protein variants due to alternative RNA splicing and post-translational modifications, and variants that are related to diseases are frequently present in minute concentrations. For DNA and RNA, low concentrations can be amplified using the polymerase chain reaction, but there is no such reaction for proteins. Therefore, the development of single molecule protein sequencing is a critical step in the search for protein biomarkers. Here we show that single amino acids can be identified by trapping the molecules between two electrodes that are coated with a layer of recognition molecules and measuring the electron tunneling current across the junction. A given molecule can bind in more than one way in the junction, and we therefore use a machine-learning algorithm to distinguish between the sets of electronic ‘fingerprints’ associated with each binding motif. With this recognition tunneling technique, we are able to identify D, L enantiomers, a methylated amino acid, isobaric isomers, and short peptides. The results suggest that direct electronic sequencing of single proteins could be possible by sequentially measuring the products of processive exopeptidase digestion, or by using a molecular motor to pull proteins through a tunnel junction integrated with a nanopore. PMID:24705512

  19. Extending the spectrum of DNA sequences retrieved from ancient bones and teeth

    PubMed Central

    Glocke, Isabelle; Meyer, Matthias

    2017-01-01

    The number of DNA fragments surviving in ancient bones and teeth is known to decrease with fragment length. Recent genetic analyses of Middle Pleistocene remains have shown that the recovery of extremely short fragments can prove critical for successful retrieval of sequence information from particularly degraded ancient biological material. Current sample preparation techniques, however, are not optimized to recover DNA sequences from fragments shorter than ∼35 base pairs (bp). Here, we show that much shorter DNA fragments are present in ancient skeletal remains but lost during DNA extraction. We present a refined silica-based DNA extraction method that not only enables efficient recovery of molecules as short as 25 bp but also doubles the yield of sequences from longer fragments due to improved recovery of molecules with single-strand breaks. Furthermore, we present strategies for monitoring inefficiencies in library preparation that may result from co-extraction of inhibitory substances during DNA extraction. The combination of DNA extraction and library preparation techniques described here substantially increases the yield of DNA sequences from ancient remains and provides access to a yet unexploited source of highly degraded DNA fragments. Our work may thus open the door for genetic analyses on even older material. PMID:28408382

  20. Protein sequencing via nanopore based devices: a nanofluidics perspective

    NASA Astrophysics Data System (ADS)

    Chinappi, Mauro; Cecconi, Fabio

    2018-05-01

    Proteins perform a huge number of central functions in living organisms, thus all the new techniques allowing their precise, fast and accurate characterization at single-molecule level certainly represent a burst in proteomics with important biomedical impact. In this review, we describe the recent progresses in the developing of nanopore based devices for protein sequencing. We start with a critical analysis of the main technical requirements for nanopore protein sequencing, summarizing some ideas and methodologies that have recently appeared in the literature. In the last sections, we focus on the physical modelling of the transport phenomena occurring in nanopore based devices. The multiscale nature of the problem is discussed and, in this respect, some of the main possible computational approaches are illustrated.

  1. Fluorescence In situ Hybridization: Cell-Based Genetic Diagnostic and Research Applications.

    PubMed

    Cui, Chenghua; Shu, Wei; Li, Peining

    2016-01-01

    Fluorescence in situ hybridization (FISH) is a macromolecule recognition technology based on the complementary nature of DNA or DNA/RNA double strands. Selected DNA strands incorporated with fluorophore-coupled nucleotides can be used as probes to hybridize onto the complementary sequences in tested cells and tissues and then visualized through a fluorescence microscope or an imaging system. This technology was initially developed as a physical mapping tool to delineate genes within chromosomes. Its high analytical resolution to a single gene level and high sensitivity and specificity enabled an immediate application for genetic diagnosis of constitutional common aneuploidies, microdeletion/microduplication syndromes, and subtelomeric rearrangements. FISH tests using panels of gene-specific probes for somatic recurrent losses, gains, and translocations have been routinely applied for hematologic and solid tumors and are one of the fastest-growing areas in cancer diagnosis. FISH has also been used to detect infectious microbias and parasites like malaria in human blood cells. Recent advances in FISH technology involve various methods for improving probe labeling efficiency and the use of super resolution imaging systems for direct visualization of intra-nuclear chromosomal organization and profiling of RNA transcription in single cells. Cas9-mediated FISH (CASFISH) allowed in situ labeling of repetitive sequences and single-copy sequences without the disruption of nuclear genomic organization in fixed or living cells. Using oligopaint-FISH and super-resolution imaging enabled in situ visualization of chromosome haplotypes from differentially specified single-nucleotide polymorphism loci. Single molecule RNA FISH (smRNA-FISH) using combinatorial labeling or sequential barcoding by multiple round of hybridization were applied to measure mRNA expression of multiple genes within single cells. Research applications of these single molecule single cells DNA and RNA FISH techniques have visualized intra-nuclear genomic structure and sub-cellular transcriptional dynamics of many genes and revealed their functions in various biological processes.

  2. High processivity polymerases

    DOEpatents

    Shamoo, Yousif; Sun, Siyang

    2014-06-10

    Chimeric proteins comprising a sequence nonspecific single-stranded nucleic-acid-binding domain joined to a catalytic nucleic-acid-modifying domain are provided. Methods comprising contacting a nucleic acid molecule with a chimeric protein, as well as systems comprising a nucleic acid molecule, a chimeric protein, and an aqueous solution are also provided. The joining of sequence nonspecific single-stranded nucleic-acid-binding domain and a catalytic nucleic-acid-modifying domain in chimeric proteins, among other things, may prevent the separation of the two domains due to their weak association and thereby enhances processivity while maintaining fidelity.

  3. Molecular dynamics simulations and docking enable to explore the biophysical factors controlling the yields of engineered nanobodies.

    PubMed

    Soler, Miguel A; de Marco, Ario; Fortuna, Sara

    2016-10-10

    Nanobodies (VHHs) have proved to be valuable substitutes of conventional antibodies for molecular recognition. Their small size represents a precious advantage for rational mutagenesis based on modelling. Here we address the problem of predicting how Camelidae nanobody sequences can tolerate mutations by developing a simulation protocol based on all-atom molecular dynamics and whole-molecule docking. The method was tested on two sets of nanobodies characterized experimentally for their biophysical features. One set contained point mutations introduced to humanize a wild type sequence, in the second the CDRs were swapped between single-domain frameworks with Camelidae and human hallmarks. The method resulted in accurate scoring approaches to predict experimental yields and enabled to identify the structural modifications induced by mutations. This work is a promising tool for the in silico development of single-domain antibodies and opens the opportunity to customize single functional domains of larger macromolecules.

  4. Molecular dynamics simulations and docking enable to explore the biophysical factors controlling the yields of engineered nanobodies

    NASA Astrophysics Data System (ADS)

    Soler, Miguel A.; De Marco, Ario; Fortuna, Sara

    2016-10-01

    Nanobodies (VHHs) have proved to be valuable substitutes of conventional antibodies for molecular recognition. Their small size represents a precious advantage for rational mutagenesis based on modelling. Here we address the problem of predicting how Camelidae nanobody sequences can tolerate mutations by developing a simulation protocol based on all-atom molecular dynamics and whole-molecule docking. The method was tested on two sets of nanobodies characterized experimentally for their biophysical features. One set contained point mutations introduced to humanize a wild type sequence, in the second the CDRs were swapped between single-domain frameworks with Camelidae and human hallmarks. The method resulted in accurate scoring approaches to predict experimental yields and enabled to identify the structural modifications induced by mutations. This work is a promising tool for the in silico development of single-domain antibodies and opens the opportunity to customize single functional domains of larger macromolecules.

  5. Computational analysis of stochastic heterogeneity in PCR amplification efficiency revealed by single molecule barcoding

    PubMed Central

    Best, Katharine; Oakes, Theres; Heather, James M.; Shawe-Taylor, John; Chain, Benny

    2015-01-01

    The polymerase chain reaction (PCR) is one of the most widely used techniques in molecular biology. In combination with High Throughput Sequencing (HTS), PCR is widely used to quantify transcript abundance for RNA-seq, and in the context of analysis of T and B cell receptor repertoires. In this study, we combine DNA barcoding with HTS to quantify PCR output from individual target molecules. We develop computational tools that simulate both the PCR branching process itself, and the subsequent subsampling which typically occurs during HTS sequencing. We explore the influence of different types of heterogeneity on sequencing output, and compare them to experimental results where the efficiency of amplification is measured by barcodes uniquely identifying each molecule of starting template. Our results demonstrate that the PCR process introduces substantial amplification heterogeneity, independent of primer sequence and bulk experimental conditions. This heterogeneity can be attributed both to inherited differences between different template DNA molecules, and the inherent stochasticity of the PCR process. The results demonstrate that PCR heterogeneity arises even when reaction and substrate conditions are kept as constant as possible, and therefore single molecule barcoding is essential in order to derive reproducible quantitative results from any protocol combining PCR with HTS. PMID:26459131

  6. Single-molecule dilution and multiple displacement amplification for molecular haplotyping.

    PubMed

    Paul, Philip; Apgar, Josh

    2005-04-01

    Separate haploid analysis is frequently required for heterozygous genotyping to resolve phase ambiguity or confirm allelic sequence. We demonstrate a technique of single-molecule dilution followed by multiple strand displacement amplification to haplotype polymorphic alleles. Dilution of DNA to haploid equivalency, or a single molecule, is a simple method for separating di-allelic DNA. Strand displacement amplification is a robust method for non-specific DNA expansion that employs random hexamers and phage polymerase Phi29 for double-stranded DNA displacement and primer extension, resulting in high processivity and exceptional product length. Single-molecule dilution was followed by strand displacement amplification to expand separated alleles to microgram quantities of DNA for more efficient haplotype analysis of heterozygous genes.

  7. Single-Molecule Imaging of an in Vitro-Evolved RNA Aptamer Reveals Homogeneous Ligand Binding Kinetics

    PubMed Central

    2009-01-01

    Many studies of RNA folding and catalysis have revealed conformational heterogeneity, metastable folding intermediates, and long-lived states with distinct catalytic activities. We have developed a single-molecule imaging approach for investigating the functional heterogeneity of in vitro-evolved RNA aptamers. Monitoring the association of fluorescently labeled ligands with individual RNA aptamer molecules has allowed us to record binding events over the course of multiple days, thus providing sufficient statistics to quantitatively define the kinetic properties at the single-molecule level. The ligand binding kinetics of the highly optimized RNA aptamer studied here displays a remarkable degree of uniformity and lack of memory. Such homogeneous behavior is quite different from the heterogeneity seen in previous single-molecule studies of naturally derived RNA and protein enzymes. The single-molecule methods we describe may be of use in analyzing the distribution of functional molecules in heterogeneous evolving populations or even in unselected samples of random sequences. PMID:19572753

  8. Optimization of conditions to sequence long cDNAs from viruses

    USDA-ARS?s Scientific Manuscript database

    Fourth generation sequencing with the Minion nanopore sequencer provides opportunity to obtain deep coverage and long read for single molecules. This will benefit studies on RNA viruses. In the past, Sanger, Illumina, and Ion Torrent sequencing have been utilized to study RNA viruses. Both technique...

  9. Unraveling secrets of telomeres: one molecule at a time

    PubMed Central

    Lin, Jiangguo; Kaur, Parminder; Countryman, Preston; Opresko, Patricia L.; Wang, Hong

    2016-01-01

    Telomeres play important roles in maintaining the stability of linear chromosomes. Telomere maintenance involves dynamic actions of multiple proteins interacting with long repetitive sequences and complex dynamic DNA structures, such as G-quadruplexes, T-loops and t-circles. Given the heterogeneity and complexity of telomeres, single-molecule approaches are essential to fully understand the structure-function relationships that govern telomere maintenance. In this review, we present a brief overview of the principles of single-molecule imaging and manipulation techniques. We then highlight results obtained from applying these single-molecule techniques for studying structure, dynamics and functions of G-quadruplexes, telomerase, and shelterin proteins. PMID:24569170

  10. Fluorescence-based strategies to investigate the structure and dynamics of aptamer-ligand complexes

    NASA Astrophysics Data System (ADS)

    Perez-Gonzalez, Cibran; Lafontaine, Daniel; Penedo, J.

    2016-08-01

    In addition to the helical nature of double-stranded DNA and RNA, single-stranded oligonucleotides can arrange themselves into tridimensional structures containing loops, bulges, internal hairpins and many other motifs. This ability has been used for more than two decades to generate oligonucleotide sequences, so-called aptamers, that can recognize certain metabolites with high affinity and specificity. More recently, this library of artificially-generated nucleic acid aptamers has been expanded by the discovery that naturally occurring RNA sequences control bacterial gene expression in response to cellular concentration of a given metabolite. The application of fluorescence methods has been pivotal to characterize in detail the structure and dynamics of these aptamer-ligand complexes in solution. This is mostly due to the intrinsic high sensitivity of fluorescence methods and also to significant improvements in solid-phase synthesis, post-synthetic labelling strategies and optical instrumentation that took place during the last decade. In this work, we provide an overview of the most widely employed fluorescence methods to investigate aptamer structure and function by describing the use of aptamers labelled with a single dye in fluorescence quenching and anisotropy assays. The use of 2-aminopurine as a fluorescent analog of adenine to monitor local changes in structure and fluorescence resonance energy transfer (FRET) to follow long-range conformational changes is also covered in detail. The last part of the review is dedicated to the application of fluorescence techniques based on single-molecule microscopy, a technique that has revolutionized our understanding of nucleic acid structure and dynamics. We finally describe the advantages of monitoring ligand-binding and conformational changes, one molecule at a time, to decipher the complexity of regulatory aptamers and summarize the emerging folding and ligand-binding models arising from the application of these single-molecule FRET microscopy techniques.

  11. Fluorescence-Based Strategies to Investigate the Structure and Dynamics of Aptamer-Ligand Complexes

    PubMed Central

    Perez-Gonzalez, Cibran; Lafontaine, Daniel A.; Penedo, J. Carlos

    2016-01-01

    In addition to the helical nature of double-stranded DNA and RNA, single-stranded oligonucleotides can arrange themselves into tridimensional structures containing loops, bulges, internal hairpins and many other motifs. This ability has been used for more than two decades to generate oligonucleotide sequences, so-called aptamers, that can recognize certain metabolites with high affinity and specificity. More recently, this library of artificially-generated nucleic acid aptamers has been expanded by the discovery that naturally occurring RNA sequences control bacterial gene expression in response to cellular concentration of a given metabolite. The application of fluorescence methods has been pivotal to characterize in detail the structure and dynamics of these aptamer-ligand complexes in solution. This is mostly due to the intrinsic high sensitivity of fluorescence methods and also to significant improvements in solid-phase synthesis, post-synthetic labeling strategies and optical instrumentation that took place during the last decade. In this work, we provide an overview of the most widely employed fluorescence methods to investigate aptamer structure and function by describing the use of aptamers labeled with a single dye in fluorescence quenching and anisotropy assays. The use of 2-aminopurine as a fluorescent analog of adenine to monitor local changes in structure and fluorescence resonance energy transfer (FRET) to follow long-range conformational changes is also covered in detail. The last part of the review is dedicated to the application of fluorescence techniques based on single-molecule microscopy, a technique that has revolutionized our understanding of nucleic acid structure and dynamics. We finally describe the advantages of monitoring ligand-binding and conformational changes, one molecule at a time, to decipher the complexity of regulatory aptamers and summarize the emerging folding and ligand-binding models arising from the application of these single-molecule FRET microscopy techniques. PMID:27536656

  12. Rapid Sequencing of Complete env Genes from Primary HIV-1 Samples.

    PubMed

    Laird Smith, Melissa; Murrell, Ben; Eren, Kemal; Ignacio, Caroline; Landais, Elise; Weaver, Steven; Phung, Pham; Ludka, Colleen; Hepler, Lance; Caballero, Gemma; Pollner, Tristan; Guo, Yan; Richman, Douglas; Poignard, Pascal; Paxinos, Ellen E; Kosakovsky Pond, Sergei L; Smith, Davey M

    2016-07-01

    The ability to study rapidly evolving viral populations has been constrained by the read length of next-generation sequencing approaches and the sampling depth of single-genome amplification methods. Here, we develop and characterize a method using Pacific Biosciences' Single Molecule, Real-Time (SMRT®) sequencing technology to sequence multiple, intact full-length human immunodeficiency virus-1 env genes amplified from viral RNA populations circulating in blood, and provide computational tools for analyzing and visualizing these data.

  13. Long-read sequencing and de novo assembly of a Chinese genome

    USDA-ARS?s Scientific Manuscript database

    Short-read sequencing has enabled the de novo assembly of several individual human genomes, but with inherent limitations in characterizing repeat elements. Here we sequence a Chinese individual HX1 by single-molecule real-time (SMRT) long-read sequencing, construct a physical map by NanoChannel arr...

  14. Noise reduction in single time frame optical DNA maps

    PubMed Central

    Müller, Vilhelm; Westerlund, Fredrik

    2017-01-01

    In optical DNA mapping technologies sequence-specific intensity variations (DNA barcodes) along stretched and stained DNA molecules are produced. These “fingerprints” of the underlying DNA sequence have a resolution of the order one kilobasepairs and the stretching of the DNA molecules are performed by surface adsorption or nano-channel setups. A post-processing challenge for nano-channel based methods, due to local and global random movement of the DNA molecule during imaging, is how to align different time frames in order to produce reproducible time-averaged DNA barcodes. The current solutions to this challenge are computationally rather slow. With high-throughput applications in mind, we here introduce a parameter-free method for filtering a single time frame noisy barcode (snap-shot optical map), measured in a fraction of a second. By using only a single time frame barcode we circumvent the need for post-processing alignment. We demonstrate that our method is successful at providing filtered barcodes which are less noisy and more similar to time averaged barcodes. The method is based on the application of a low-pass filter on a single noisy barcode using the width of the Point Spread Function of the system as a unique, and known, filtering parameter. We find that after applying our method, the Pearson correlation coefficient (a real number in the range from -1 to 1) between the single time-frame barcode and the time average of the aligned kymograph increases significantly, roughly by 0.2 on average. By comparing to a database of more than 3000 theoretical plasmid barcodes we show that the capabilities to identify plasmids is improved by filtering single time-frame barcodes compared to the unfiltered analogues. Since snap-shot experiments and computational time using our method both are less than a second, this study opens up for high throughput optical DNA mapping with improved reproducibility. PMID:28640821

  15. Designing robust watermark barcodes for multiplex long-read sequencing.

    PubMed

    Ezpeleta, Joaquín; Krsticevic, Flavia J; Bulacio, Pilar; Tapia, Elizabeth

    2017-03-15

    To attain acceptable sample misassignment rates, current approaches to multiplex single-molecule real-time sequencing require upstream quality improvement, which is obtained from multiple passes over the sequenced insert and significantly reduces the effective read length. In order to fully exploit the raw read length on multiplex applications, robust barcodes capable of dealing with the full single-pass error rates are needed. We present a method for designing sequencing barcodes that can withstand a large number of insertion, deletion and substitution errors and are suitable for use in multiplex single-molecule real-time sequencing. The manuscript focuses on the design of barcodes for full-length single-pass reads, impaired by challenging error rates in the order of 11%. The proposed barcodes can multiplex hundreds or thousands of samples while achieving sample misassignment probabilities as low as 10-7 under the above conditions, and are designed to be compatible with chemical constraints imposed by the sequencing process. Software tools for constructing watermark barcode sets and demultiplexing barcoded reads, together with example sets of barcodes and synthetic barcoded reads, are freely available at www.cifasis-conicet.gov.ar/ezpeleta/NS-watermark . ezpeleta@cifasis-conicet.gov.ar. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  16. Enhanced sequencing coverage with digital droplet multiple displacement amplification

    PubMed Central

    Sidore, Angus M.; Lan, Freeman; Lim, Shaun W.; Abate, Adam R.

    2016-01-01

    Sequencing small quantities of DNA is important for applications ranging from the assembly of uncultivable microbial genomes to the identification of cancer-associated mutations. To obtain sufficient quantities of DNA for sequencing, the small amount of starting material must be amplified significantly. However, existing methods often yield errors or non-uniform coverage, reducing sequencing data quality. Here, we describe digital droplet multiple displacement amplification, a method that enables massive amplification of low-input material while maintaining sequence accuracy and uniformity. The low-input material is compartmentalized as single molecules in millions of picoliter droplets. Because the molecules are isolated in compartments, they amplify to saturation without competing for resources; this yields uniform representation of all sequences in the final product and, in turn, enhances the quality of the sequence data. We demonstrate the ability to uniformly amplify the genomes of single Escherichia coli cells, comprising just 4.7 fg of starting DNA, and obtain sequencing coverage distributions that rival that of unamplified material. Digital droplet multiple displacement amplification provides a simple and effective method for amplifying minute amounts of DNA for accurate and uniform sequencing. PMID:26704978

  17. Resolving the Complexity of Human Skin Metagenomes Using Single-Molecule Sequencing

    PubMed Central

    Tsai, Yu-Chih; Deming, Clayton; Segre, Julia A.; Kong, Heidi H.; Korlach, Jonas

    2016-01-01

    ABSTRACT Deep metagenomic shotgun sequencing has emerged as a powerful tool to interrogate composition and function of complex microbial communities. Computational approaches to assemble genome fragments have been demonstrated to be an effective tool for de novo reconstruction of genomes from these communities. However, the resultant “genomes” are typically fragmented and incomplete due to the limited ability of short-read sequence data to assemble complex or low-coverage regions. Here, we use single-molecule, real-time (SMRT) sequencing to reconstruct a high-quality, closed genome of a previously uncharacterized Corynebacterium simulans and its companion bacteriophage from a skin metagenomic sample. Considerable improvement in assembly quality occurs in hybrid approaches incorporating short-read data, with even relatively small amounts of long-read data being sufficient to improve metagenome reconstruction. Using short-read data to evaluate strain variation of this C. simulans in its skin community at single-nucleotide resolution, we observed a dominant C. simulans strain with moderate allelic heterozygosity throughout the population. We demonstrate the utility of SMRT sequencing and hybrid approaches in metagenome quantitation, reconstruction, and annotation. PMID:26861018

  18. Molecular Bases of cyclodextrin Adapter Interactions with Engineered Protein Nanopores

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Banerjee, A.; Mikhailova, E; Cheley, S

    2010-01-01

    Engineered protein pores have several potential applications in biotechnology: as sensor elements in stochastic detection and ultrarapid DNA sequencing, as nanoreactors to observe single-molecule chemistry, and in the construction of nano- and micro-devices. One important class of pores contains molecular adapters, which provide internal binding sites for small molecules. Mutants of the {alpha}-hemolysin ({alpha}HL) pore that bind the adapter {beta}-cyclodextrin ({beta}CD) {approx}10{sup 4} times more tightly than the wild type have been obtained. We now use single-channel electrical recording, protein engineering including unnatural amino acid mutagenesis, and high-resolution x-ray crystallography to provide definitive structural information on these engineered protein nanoporesmore » in unparalleled detail.« less

  19. Rapid Sequencing of Complete env Genes from Primary HIV-1 Samples

    PubMed Central

    Eren, Kemal; Ignacio, Caroline; Landais, Elise; Weaver, Steven; Phung, Pham; Ludka, Colleen; Hepler, Lance; Caballero, Gemma; Pollner, Tristan; Guo, Yan; Richman, Douglas; Poignard, Pascal; Paxinos, Ellen E.; Kosakovsky Pond, Sergei L.

    2016-01-01

    Abstract The ability to study rapidly evolving viral populations has been constrained by the read length of next-generation sequencing approaches and the sampling depth of single-genome amplification methods. Here, we develop and characterize a method using Pacific Biosciences’ Single Molecule, Real-Time (SMRT®) sequencing technology to sequence multiple, intact full-length human immunodeficiency virus-1 env genes amplified from viral RNA populations circulating in blood, and provide computational tools for analyzing and visualizing these data. PMID:29492273

  20. Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity.

    PubMed

    Edger, Patrick P; VanBuren, Robert; Colle, Marivi; Poorten, Thomas J; Wai, Ching Man; Niederhuth, Chad E; Alger, Elizabeth I; Ou, Shujun; Acharya, Charlotte B; Wang, Jie; Callow, Pete; McKain, Michael R; Shi, Jinghua; Collier, Chad; Xiong, Zhiyong; Mower, Jeffrey P; Slovin, Janet P; Hytönen, Timo; Jiang, Ning; Childs, Kevin L; Knapp, Steven J

    2018-02-01

    Although draft genomes are available for most agronomically important plant species, the majority are incomplete, highly fragmented, and often riddled with assembly and scaffolding errors. These assembly issues hinder advances in tool development for functional genomics and systems biology. Here we utilized a robust, cost-effective approach to produce high-quality reference genomes. We report a near-complete genome of diploid woodland strawberry (Fragaria vesca) using single-molecule real-time sequencing from Pacific Biosciences (PacBio). This assembly has a contig N50 length of ∼7.9 million base pairs (Mb), representing a ∼300-fold improvement of the previous version. The vast majority (>99.8%) of the assembly was anchored to 7 pseudomolecules using 2 sets of optical maps from Bionano Genomics. We obtained ∼24.96 Mb of sequence not present in the previous version of the F. vesca genome and produced an improved annotation that includes 1496 new genes. Comparative syntenic analyses uncovered numerous, large-scale scaffolding errors present in each chromosome in the previously published version of the F. vesca genome. Our results highlight the need to improve existing short-read based reference genomes. Furthermore, we demonstrate how genome quality impacts commonly used analyses for addressing both fundamental and applied biological questions. © The Authors 2017. Published by Oxford University Press.

  1. SSPACE-LongRead: scaffolding bacterial draft genomes using long read sequence information

    PubMed Central

    2014-01-01

    Background The recent introduction of the Pacific Biosciences RS single molecule sequencing technology has opened new doors to scaffolding genome assemblies in a cost-effective manner. The long read sequence information is promised to enhance the quality of incomplete and inaccurate draft assemblies constructed from Next Generation Sequencing (NGS) data. Results Here we propose a novel hybrid assembly methodology that aims to scaffold pre-assembled contigs in an iterative manner using PacBio RS long read information as a backbone. On a test set comprising six bacterial draft genomes, assembled using either a single Illumina MiSeq or Roche 454 library, we show that even a 50× coverage of uncorrected PacBio RS long reads is sufficient to drastically reduce the number of contigs. Comparisons to the AHA scaffolder indicate our strategy is better capable of producing (nearly) complete bacterial genomes. Conclusions The current work describes our SSPACE-LongRead software which is designed to upgrade incomplete draft genomes using single molecule sequences. We conclude that the recent advances of the PacBio sequencing technology and chemistry, in combination with the limited computational resources required to run our program, allow to scaffold genomes in a fast and reliable manner. PMID:24950923

  2. Complete genome sequence of Clavibacter michiganensis subsp. insidiosus R1-1 using PacBio single-molecule real-time technology

    USDA-ARS?s Scientific Manuscript database

    We report the complete genome sequence of Clavibacter michiganensis subsp. insidiosus R1-1 isolated in Minnesota, USA. The R1-1 genome, generated by de novo assembly of PacBio sequencing data, is the first complete genome sequence available for this subspecies....

  3. Sequencing Technologies Panel at SFAF

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Turner, Steve; Fiske, Haley; Knight, Jim

    2010-06-02

    From left to right: Steve Turner of Pacific Biosciences, Haley Fiske of Illumina, Jim Knight of Roche, Michael Rhodes of Life Technologies and Peter Vander Horn of Life Technologies' Single Molecule Sequencing group discuss new sequencing technologies and applications on June 2, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM

  4. Single molecule detection of nitric oxide enabled by d(AT)15 DNA adsorbed to near infrared fluorescent single-walled carbon nanotubes.

    PubMed

    Zhang, Jingqing; Boghossian, Ardemis A; Barone, Paul W; Rwei, Alina; Kim, Jong-Ho; Lin, Dahua; Heller, Daniel A; Hilmer, Andrew J; Nair, Nitish; Reuel, Nigel F; Strano, Michael S

    2011-01-26

    We report the selective detection of single nitric oxide (NO) molecules using a specific DNA sequence of d(AT)(15) oligonucleotides, adsorbed to an array of near-infrared fluorescent semiconducting single-walled carbon nanotubes (AT(15)-SWNT). While SWNT suspended with eight other variant DNA sequences show fluorescence quenching or enhancement from analytes such as dopamine, NADH, L-ascorbic acid, and riboflavin, d(AT)(15) imparts SWNT with a distinct selectivity toward NO. In contrast, the electrostatically neutral polyvinyl alcohol enables no response to nitric oxide, but exhibits fluorescent enhancement to other molecules in the tested library. For AT(15)-SWNT, a stepwise fluorescence decrease is observed when the nanotubes are exposed to NO, reporting the dynamics of single-molecule NO adsorption via SWNT exciton quenching. We describe these quenching traces using a birth-and-death Markov model, and the maximum likelihood estimator of adsorption and desorption rates of NO is derived. Applying the method to simulated traces indicates that the resulting error in the estimated rate constants is less than 5% under our experimental conditions, allowing for calibration using a series of NO concentrations. As expected, the adsorption rate is found to be linearly proportional to NO concentration, and the intrinsic single-site NO adsorption rate constant is 0.001 s(-1) μM NO(-1). The ability to detect nitric oxide quantitatively at the single-molecule level may find applications in new cellular assays for the study of nitric oxide carcinogenesis and chemical signaling, as well as medical diagnostics for inflammation.

  5. Quantification of differential gene expression by multiplexed targeted resequencing of cDNA

    PubMed Central

    Arts, Peer; van der Raadt, Jori; van Gestel, Sebastianus H.C.; Steehouwer, Marloes; Shendure, Jay; Hoischen, Alexander; Albers, Cornelis A.

    2017-01-01

    Whole-transcriptome or RNA sequencing (RNA-Seq) is a powerful and versatile tool for functional analysis of different types of RNA molecules, but sample reagent and sequencing cost can be prohibitive for hypothesis-driven studies where the aim is to quantify differential expression of a limited number of genes. Here we present an approach for quantification of differential mRNA expression by targeted resequencing of complementary DNA using single-molecule molecular inversion probes (cDNA-smMIPs) that enable highly multiplexed resequencing of cDNA target regions of ∼100 nucleotides and counting of individual molecules. We show that accurate estimates of differential expression can be obtained from molecule counts for hundreds of smMIPs per reaction and that smMIPs are also suitable for quantification of relative gene expression and allele-specific expression. Compared with low-coverage RNA-Seq and a hybridization-based targeted RNA-Seq method, cDNA-smMIPs are a cost-effective high-throughput tool for hypothesis-driven expression analysis in large numbers of genes (10 to 500) and samples (hundreds to thousands). PMID:28474677

  6. Reading Out Single-Molecule Digital RNA and DNA Isothermal Amplification in Nanoliter Volumes with Unmodified Camera Phones

    PubMed Central

    2016-01-01

    Digital single-molecule technologies are expanding diagnostic capabilities, enabling the ultrasensitive quantification of targets, such as viral load in HIV and hepatitis C infections, by directly counting single molecules. Replacing fluorescent readout with a robust visual readout that can be captured by any unmodified cell phone camera will facilitate the global distribution of diagnostic tests, including in limited-resource settings where the need is greatest. This paper describes a methodology for developing a visual readout system for digital single-molecule amplification of RNA and DNA by (i) selecting colorimetric amplification-indicator dyes that are compatible with the spectral sensitivity of standard mobile phones, and (ii) identifying an optimal ratiometric image-process for a selected dye to achieve a readout that is robust to lighting conditions and camera hardware and provides unambiguous quantitative results, even for colorblind users. We also include an analysis of the limitations of this methodology, and provide a microfluidic approach that can be applied to expand dynamic range and improve reaction performance, allowing ultrasensitive, quantitative measurements at volumes as low as 5 nL. We validate this methodology using SlipChip-based digital single-molecule isothermal amplification with λDNA as a model and hepatitis C viral RNA as a clinically relevant target. The innovative combination of isothermal amplification chemistry in the presence of a judiciously chosen indicator dye and ratiometric image processing with SlipChip technology allowed the sequence-specific visual readout of single nucleic acid molecules in nanoliter volumes with an unmodified cell phone camera. When paired with devices that integrate sample preparation and nucleic acid amplification, this hardware-agnostic approach will increase the affordability and the distribution of quantitative diagnostic and environmental tests. PMID:26900709

  7. Transforming single DNA molecules into fluorescent magnetic particles for detection and enumeration of genetic variations

    PubMed Central

    Dressman, Devin; Yan, Hai; Traverso, Giovanni; Kinzler, Kenneth W.; Vogelstein, Bert

    2003-01-01

    Many areas of biomedical research depend on the analysis of uncommon variations in individual genes or transcripts. Here we describe a method that can quantify such variation at a scale and ease heretofore unattainable. Each DNA molecule in a collection of such molecules is converted into a single magnetic particle to which thousands of copies of DNA identical in sequence to the original are bound. This population of beads then corresponds to a one-to-one representation of the starting DNA molecules. Variation within the original population of DNA molecules can then be simply assessed by counting fluorescently labeled particles via flow cytometry. This approach is called BEAMing on the basis of four of its principal components (beads, emulsion, amplification, and magnetics). Millions of individual DNA molecules can be assessed in this fashion with standard laboratory equipment. Moreover, specific variants can be isolated by flow sorting and used for further experimentation. BEAMing can be used for the identification and quantification of rare mutations as well as to study variations in gene sequences or transcripts in specific populations or tissues. PMID:12857956

  8. Single Cell Total RNA Sequencing through Isothermal Amplification in Picoliter-Droplet Emulsion.

    PubMed

    Fu, Yusi; Chen, He; Liu, Lu; Huang, Yanyi

    2016-11-15

    Prevalent single cell RNA amplification and sequencing chemistries mainly focus on polyadenylated RNAs in eukaryotic cells by using oligo(dT) primers for reverse transcription. We develop a new RNA amplification method, "easier-seq", to reverse transcribe and amplify the total RNAs, both with and without polyadenylate tails, from a single cell for transcriptome sequencing with high efficiency, reproducibility, and accuracy. By distributing the reverse transcribed cDNA molecules into 1.5 × 10 5 aqueous droplets in oil, the cDNAs are isothermally amplified using random primers in each of these 65-pL reactors separately. This new method greatly improves the ease of single-cell RNA sequencing by reducing the experimental steps. Meanwhile, with less chance to induce errors, this method can easily maintain the quality of single-cell sequencing. In addition, this polyadenylate-tail-independent method can be seamlessly applied to prokaryotic cell RNA sequencing.

  9. Helicos BioSciences.

    PubMed

    Milos, Patrice

    2008-04-01

    Helicos BioSciences Corporation is a life sciences company developing revolutionary new single molecule sequencing technology to provide the path to the US$1000 genome. True Single Molecule Sequencing (tSMS) will drive advancements in pharmacogenomics that can enable a better understanding of an individual's susceptibility to disease, develop more effective disease diagnoses and differentiate response to disease therapies. During 2007, genome-wide disease-association studies, the encylopedia of DNA elements (ENCODE) and the published genome sequence of two individuals have revealed human genome variation far more extensive than originally believed. These also demonstrated that common variations explain only a fraction of the genetic basis of disease. Therefore, the capability to understand an individual genome is critical in setting the foundation for the next great revolution in healthcare. Helicos is committed to this vision and will provide cost-effective genome sequencing and comprehensive analysis of the transcribed genome that can unlock the era of personalized healthcare.

  10. Comparison of single-molecule sequencing and hybrid approaches for finishing the genome of Clostridium autoethanogenum and analysis of CRISPR systems in industrial relevant Clostridia

    PubMed Central

    2014-01-01

    Background Clostridium autoethanogenum strain JA1-1 (DSM 10061) is an acetogen capable of fermenting CO, CO2 and H2 (e.g. from syngas or waste gases) into biofuel ethanol and commodity chemicals such as 2,3-butanediol. A draft genome sequence consisting of 100 contigs has been published. Results A closed, high-quality genome sequence for C. autoethanogenum DSM10061 was generated using only the latest single-molecule DNA sequencing technology and without the need for manual finishing. It is assigned to the most complex genome classification based upon genome features such as repeats, prophage, nine copies of the rRNA gene operons. It has a low G + C content of 31.1%. Illumina, 454, Illumina/454 hybrid assemblies were generated and then compared to the draft and PacBio assemblies using summary statistics, CGAL, QUAST and REAPR bioinformatics tools and comparative genomic approaches. Assemblies based upon shorter read DNA technologies were confounded by the large number repeats and their size, which in the case of the rRNA gene operons were ~5 kb. CRISPR (Clustered Regularly Interspaced Short Paloindromic Repeats) systems among biotechnologically relevant Clostridia were classified and related to plasmid content and prophages. Potential associations between plasmid content and CRISPR systems may have implications for historical industrial scale Acetone-Butanol-Ethanol (ABE) fermentation failures and future large scale bacterial fermentations. While C. autoethanogenum contains an active CRISPR system, no such system is present in the closely related Clostridium ljungdahlii DSM 13528. A common prophage inserted into the Arg-tRNA shared between the strains suggests a common ancestor. However, C. ljungdahlii contains several additional putative prophages and it has more than double the amount of prophage DNA compared to C. autoethanogenum. Other differences include important metabolic genes for central metabolism (as an additional hydrogenase and the absence of a phophoenolpyruvate synthase) and substrate utilization pathway (mannose and aromatics utilization) that might explain phenotypic differences between C. autoethanogenum and C. ljungdahlii. Conclusions Single molecule sequencing will be increasingly used to produce finished microbial genomes. The complete genome will facilitate comparative genomics and functional genomics and support future comparisons between Clostridia and studies that examine the evolution of plasmids, bacteriophage and CRISPR systems. PMID:24655715

  11. Comparative genomic analysis of single-molecule sequencing and hybrid approaches for finishing the Clostridium autoethanogenum JA1-1 strain DSM 10061 genome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Brown, Steven D; Nagaraju, Shilpa; Utturkar, Sagar M

    Background Clostridium autoethanogenum strain JA1-1 (DSM 10061) is an acetogen capable of fermenting CO, CO2 and H2 (e.g. from syngas or waste gases) into biofuel ethanol and commodity chemicals such as 2,3-butanediol. A draft genome sequence consisting of 100 contigs has been published. Results A closed, high-quality genome sequence for C. autoethanogenum DSM10061 was generated using only the latest single-molecule DNA sequencing technology and without the need for manual finishing. It is assigned to the most complex genome classification based upon genome features such as repeats, prophage, nine copies of the rRNA gene operons. It has a low G +more » C content of 31.1%. Illumina, 454, Illumina/454 hybrid assemblies were generated and then compared to the draft and PacBio assemblies using summary statistics, CGAL, QUAST and REAPR bioinformatics tools and comparative genomic approaches. Assemblies based upon shorter read DNA technologies were confounded by the large number repeats and their size, which in the case of the rRNA gene operons were ~5 kb. CRISPR (Clustered Regularly Interspaced Short Paloindromic Repeats) systems among biotechnologically relevant Clostridia were classified and related to plasmid content and prophages. Potential associations between plasmid content and CRISPR systems may have implications for historical industrial scale Acetone-Butanol-Ethanol (ABE) fermentation failures and future large scale bacterial fermentations. While C. autoethanogenum contains an active CRISPR system, no such system is present in the closely related Clostridium ljungdahlii DSM 13528. A common prophage inserted into the Arg-tRNA shared between the strains suggests a common ancestor. However, C. ljungdahlii contains several additional putative prophages and it has more than double the amount of prophage DNA compared to C. autoethanogenum. Other differences include important metabolic genes for central metabolism (as an additional hydrogenase and the absence of a phophoenolpyruvate synthase) and substrate utilization pathway (mannose and aromatics utilization) that might explain phenotypic differences between C. autoethanogenum and C. ljungdahlii. Conclusions Single molecule sequencing will be increasingly used to produce finished microbial genomes. The complete genome will facilitate comparative genomics and functional genomics and support future comparisons between Clostridia and studies that examine the evolution of plasmids, bacteriophage and CRISPR systems.« less

  12. Single-molecule protein sequencing through fingerprinting: computational assessment

    NASA Astrophysics Data System (ADS)

    Yao, Yao; Docter, Margreet; van Ginkel, Jetty; de Ridder, Dick; Joo, Chirlmin

    2015-10-01

    Proteins are vital in all biological systems as they constitute the main structural and functional components of cells. Recent advances in mass spectrometry have brought the promise of complete proteomics by helping draft the human proteome. Yet, this commonly used protein sequencing technique has fundamental limitations in sensitivity. Here we propose a method for single-molecule (SM) protein sequencing. A major challenge lies in the fact that proteins are composed of 20 different amino acids, which demands 20 molecular reporters. We computationally demonstrate that it suffices to measure only two types of amino acids to identify proteins and suggest an experimental scheme using SM fluorescence. When achieved, this highly sensitive approach will result in a paradigm shift in proteomics, with major impact in the biological and medical sciences.

  13. Complete Genome Sequence of Clavibacter michiganensis subsp. insidiosus R1-1 Using PacBio Single-Molecule Real-Time Technology

    PubMed Central

    Lu, You; Samac, Deborah A.; Glazebrook, Jane

    2015-01-01

    We report here the complete genome sequence of Clavibacter michiganensis subsp. insidiosus R1-1, isolated in Minnesota, USA. The R1-1 genome, generated by a de novo assembly of PacBio sequencing data, is the first complete genome sequence available for this subspecies. PMID:25953184

  14. Deep sequencing is an appropriate tool for the selection of unique Hepatitis C virus (HCV) variants after single genomic amplification.

    PubMed

    Guinoiseau, Thibault; Moreau, Alain; Hohnadel, Guillaume; Ngo-Giang-Huong, Nicole; Brulard, Celine; Vourc'h, Patrick; Goudeau, Alain; Gaudy-Graffin, Catherine

    2017-01-01

    Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus's but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies.

  15. Deep sequencing is an appropriate tool for the selection of unique Hepatitis C virus (HCV) variants after single genomic amplification

    PubMed Central

    Guinoiseau, Thibault; Moreau, Alain; Hohnadel, Guillaume; Ngo-Giang-Huong, Nicole; Brulard, Celine; Vourc’h, Patrick; Goudeau, Alain; Gaudy-Graffin, Catherine

    2017-01-01

    Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus’s but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies. PMID:28362878

  16. Tracking single mRNA molecules in live cells

    NASA Astrophysics Data System (ADS)

    Moon, Hyungseok C.; Lee, Byung Hun; Lim, Kiseong; Son, Jae Seok; Song, Minho S.; Park, Hye Yoon

    2016-06-01

    mRNAs inside cells interact with numerous RNA-binding proteins, microRNAs, and ribosomes that together compose a highly heterogeneous population of messenger ribonucleoprotein (mRNP) particles. Perhaps one of the best ways to investigate the complex regulation of mRNA is to observe individual molecules. Single molecule imaging allows the collection of quantitative and statistical data on subpopulations and transient states that are otherwise obscured by ensemble averaging. In addition, single particle tracking reveals the sequence of events that occur in the formation and remodeling of mRNPs in real time. Here, we review the current state-of-the-art techniques in tagging, delivery, and imaging to track single mRNAs in live cells. We also discuss how these techniques are applied to extract dynamic information on the transcription, transport, localization, and translation of mRNAs. These studies demonstrate how single molecule tracking is transforming the understanding of mRNA regulation in live cells.

  17. Detection and interrogation of biomolecules via nanoscale probes: From fundamental physics to DNA sequencing

    NASA Astrophysics Data System (ADS)

    Zwolak, Michael

    2013-03-01

    A rapid and low-cost method to sequence DNA would revolutionize personalized medicine, where genetic information is used to diagnose, treat, and prevent diseases. There is a longstanding interest in nanopores as a platform for rapid interrogation of single DNA molecules. I will discuss a sequencing protocol based on the measurement of transverse electronic currents during the translocation of single-stranded DNA through nanopores. Using molecular dynamics simulations coupled to quantum mechanical calculations of the tunneling current, I will show that the DNA nucleotides are predicted to have distinguishable electronic signatures in experimentally realizable systems. Several recent experiments support our theoretical predictions. In addition to their possible impact in medicine and biology, the above methods offer ideal test beds to study open scientific issues in the relatively unexplored area at the interface between solids, liquids, and biomolecules at the nanometer length scale. http://mike.zwolak.org

  18. Nanopore arrays in a silicon membrane for parallel single-molecule detection: DNA translocation

    NASA Astrophysics Data System (ADS)

    Zhang, Miao; Schmidt, Torsten; Jemt, Anders; Sahlén, Pelin; Sychugov, Ilya; Lundeberg, Joakim; Linnros, Jan

    2015-08-01

    Optical nanopore sensing offers great potential in single-molecule detection, genotyping, or DNA sequencing for high-throughput applications. However, one of the bottle-necks for fluorophore-based biomolecule sensing is the lack of an optically optimized membrane with a large array of nanopores, which has large pore-to-pore distance, small variation in pore size and low background photoluminescence (PL). Here, we demonstrate parallel detection of single-fluorophore-labeled DNA strands (450 bps) translocating through an array of silicon nanopores that fulfills the above-mentioned requirements for optical sensing. The nanopore array was fabricated using electron beam lithography and anisotropic etching followed by electrochemical etching resulting in pore diameters down to ∼7 nm. The DNA translocation measurements were performed in a conventional wide-field microscope tailored for effective background PL control. The individual nanopore diameter was found to have a substantial effect on the translocation velocity, where smaller openings slow the translocation enough for the event to be clearly detectable in the fluorescence. Our results demonstrate that a uniform silicon nanopore array combined with wide-field optical detection is a promising alternative with which to realize massively-parallel single-molecule detection.

  19. Sequence Dependent Interactions Between DNA and Single-Walled Carbon Nanotubes

    NASA Astrophysics Data System (ADS)

    Roxbury, Daniel

    It is known that single-stranded DNA adopts a helical wrap around a single-walled carbon nanotube (SWCNT), forming a water-dispersible hybrid molecule. The ability to sort mixtures of SWCNTs based on chirality (electronic species) has recently been demonstrated using special short DNA sequences that recognize certain matching SWCNTs of specific chirality. This thesis investigates the intricacies of DNA-SWCNT sequence-specific interactions through both experimental and molecular simulation studies. The DNA-SWCNT binding strengths were experimentally quantified by studying the kinetics of DNA replacement by a surfactant on the surface of particular SWCNTs. Recognition ability was found to correlate strongly with measured binding strength, e.g. DNA sequence (TAT)4 was found to bind 20 times stronger to the (6,5)-SWCNT than sequence (TAT)4T. Next, using replica exchange molecular dynamics (REMD) simulations, equilibrium structures formed by (a) single-strands and (b) multiple-strands of 12-mer oligonucleotides adsorbed on various SWCNTs were explored. A number of structural motifs were discovered in which the DNA strand wraps around the SWCNT and 'stitches' to itself via hydrogen bonding. Great variability among equilibrium structures was observed and shown to be directly influenced by DNA sequence and SWCNT type. For example, the (6,5)-SWCNT DNA recognition sequence, (TAT)4, was found to wrap in a tight single-stranded right-handed helical conformation. In contrast, DNA sequence T12 forms a beta-barrel left-handed structure on the same SWCNT. These are the first theoretical indications that DNA-based SWCNT selectivity can arise on a molecular level. In a biomedical collaboration with the Mayo Clinic, pathways for DNA-SWCNT internalization into healthy human endothelial cells were explored. Through absorbance spectroscopy, TEM imaging, and confocal fluorescence microscopy, we showed that intracellular concentrations of SWCNTs far exceeded those of the incubation solution, which suggested an energy-dependent pathway. Additionally, by means of pharmacological inhibition and vector-induced gene knockout studies, the DNA-SWCNTs were shown to enter the cells via Rac1-mediated macropinocytosis.

  20. Nanomanipulation of Single RNA Molecules by Optical Tweezers

    PubMed Central

    Stephenson, William; Wan, Gorby; Tenenbaum, Scott A.; Li, Pan T. X.

    2014-01-01

    A large portion of the human genome is transcribed but not translated. In this post genomic era, regulatory functions of RNA have been shown to be increasingly important. As RNA function often depends on its ability to adopt alternative structures, it is difficult to predict RNA three-dimensional structures directly from sequence. Single-molecule approaches show potentials to solve the problem of RNA structural polymorphism by monitoring molecular structures one molecule at a time. This work presents a method to precisely manipulate the folding and structure of single RNA molecules using optical tweezers. First, methods to synthesize molecules suitable for single-molecule mechanical work are described. Next, various calibration procedures to ensure the proper operations of the optical tweezers are discussed. Next, various experiments are explained. To demonstrate the utility of the technique, results of mechanically unfolding RNA hairpins and a single RNA kissing complex are used as evidence. In these examples, the nanomanipulation technique was used to study folding of each structural domain, including secondary and tertiary, independently. Lastly, the limitations and future applications of the method are discussed. PMID:25177917

  1. DNA Sequencing by Capillary Electrophoresis

    PubMed Central

    Karger, Barry L.; Guttman, Andras

    2009-01-01

    Sequencing of human and other genomes has been at the center of interest in the biomedical field over the past several decades and is now leading toward an era of personalized medicine. During this time, DNA sequencing methods have evolved from the labor intensive slab gel electrophoresis, through automated multicapillary electrophoresis systems using fluorophore labeling with multispectral imaging, to the “next generation” technologies of cyclic array, hybridization based, nanopore and single molecule sequencing. Deciphering the genetic blueprint and follow-up confirmatory sequencing of Homo sapiens and other genomes was only possible by the advent of modern sequencing technologies that was a result of step by step advances with a contribution of academics, medical personnel and instrument companies. While next generation sequencing is moving ahead at break-neck speed, the multicapillary electrophoretic systems played an essential role in the sequencing of the Human Genome, the foundation of the field of genomics. In this prospective, we wish to overview the role of capillary electrophoresis in DNA sequencing based in part of several of our articles in this journal. PMID:19517496

  2. Single molecule molecular inversion probes for targeted, high-accuracy detection of low-frequency variation.

    PubMed

    Hiatt, Joseph B; Pritchard, Colin C; Salipante, Stephen J; O'Roak, Brian J; Shendure, Jay

    2013-05-01

    The detection and quantification of genetic heterogeneity in populations of cells is fundamentally important to diverse fields, ranging from microbial evolution to human cancer genetics. However, despite the cost and throughput advances associated with massively parallel sequencing, it remains challenging to reliably detect mutations that are present at a low relative abundance in a given DNA sample. Here we describe smMIP, an assay that combines single molecule tagging with multiplex targeted capture to enable practical and highly sensitive detection of low-frequency or subclonal variation. To demonstrate the potential of the method, we simultaneously resequenced 33 clinically informative cancer genes in eight cell line and 45 clinical cancer samples. Single molecule tagging facilitated extremely accurate consensus calling, with an estimated per-base error rate of 8.4 × 10(-6) in cell lines and 2.6 × 10(-5) in clinical specimens. False-positive mutations in the single molecule consensus base-calls exhibited patterns predominantly consistent with DNA damage, including 8-oxo-guanine and spontaneous deamination of cytosine. Based on mixing experiments with cell line samples, sensitivity for mutations above 1% frequency was 83% with no false positives. At clinically informative sites, we identified seven low-frequency point mutations (0.2%-4.7%), including BRAF p.V600E (melanoma, 0.2% alternate allele frequency), KRAS p.G12V (lung, 0.6%), JAK2 p.V617F (melanoma, colon, two lung, 0.3%-1.4%), and NRAS p.Q61R (colon, 4.7%). We anticipate that smMIP will be broadly adoptable as a practical and effective method for accurately detecting low-frequency mutations in both research and clinical settings.

  3. Single molecule molecular inversion probes for targeted, high-accuracy detection of low-frequency variation

    PubMed Central

    Hiatt, Joseph B.; Pritchard, Colin C.; Salipante, Stephen J.; O'Roak, Brian J.; Shendure, Jay

    2013-01-01

    The detection and quantification of genetic heterogeneity in populations of cells is fundamentally important to diverse fields, ranging from microbial evolution to human cancer genetics. However, despite the cost and throughput advances associated with massively parallel sequencing, it remains challenging to reliably detect mutations that are present at a low relative abundance in a given DNA sample. Here we describe smMIP, an assay that combines single molecule tagging with multiplex targeted capture to enable practical and highly sensitive detection of low-frequency or subclonal variation. To demonstrate the potential of the method, we simultaneously resequenced 33 clinically informative cancer genes in eight cell line and 45 clinical cancer samples. Single molecule tagging facilitated extremely accurate consensus calling, with an estimated per-base error rate of 8.4 × 10−6 in cell lines and 2.6 × 10−5 in clinical specimens. False-positive mutations in the single molecule consensus base-calls exhibited patterns predominantly consistent with DNA damage, including 8-oxo-guanine and spontaneous deamination of cytosine. Based on mixing experiments with cell line samples, sensitivity for mutations above 1% frequency was 83% with no false positives. At clinically informative sites, we identified seven low-frequency point mutations (0.2%–4.7%), including BRAF p.V600E (melanoma, 0.2% alternate allele frequency), KRAS p.G12V (lung, 0.6%), JAK2 p.V617F (melanoma, colon, two lung, 0.3%–1.4%), and NRAS p.Q61R (colon, 4.7%). We anticipate that smMIP will be broadly adoptable as a practical and effective method for accurately detecting low-frequency mutations in both research and clinical settings. PMID:23382536

  4. An evolution based biosensor receptor DNA sequence generation algorithm.

    PubMed

    Kim, Eungyeong; Lee, Malrey; Gatton, Thomas M; Lee, Jaewan; Zang, Yupeng

    2010-01-01

    A biosensor is composed of a bioreceptor, an associated recognition molecule, and a signal transducer that can selectively detect target substances for analysis. DNA based biosensors utilize receptor molecules that allow hybridization with the target analyte. However, most DNA biosensor research uses oligonucleotides as the target analytes and does not address the potential problems of real samples. The identification of recognition molecules suitable for real target analyte samples is an important step towards further development of DNA biosensors. This study examines the characteristics of DNA used as bioreceptors and proposes a hybrid evolution-based DNA sequence generating algorithm, based on DNA computing, to identify suitable DNA bioreceptor recognition molecules for stable hybridization with real target substances. The Traveling Salesman Problem (TSP) approach is applied in the proposed algorithm to evaluate the safety and fitness of the generated DNA sequences. This approach improves efficiency and stability for enhanced and variable-length DNA sequence generation and allows extension to generation of variable-length DNA sequences with diverse receptor recognition requirements.

  5. Assessing quality of Medicago sativa silage by monitoring bacterial composition with single molecule, real-time sequencing technology and various physiological parameters

    PubMed Central

    Bao, Weichen; Mi, Zhihui; Xu, Haiyan; Zheng, Yi; Kwok, Lai Yu; Zhang, Heping; Zhang, Wenyi

    2016-01-01

    The present study applied the PacBio single molecule, real-time sequencing technology (SMRT) in evaluating the quality of silage production. Specifically, we produced four types of Medicago sativa silages by using four different lactic acid bacteria-based additives (AD-I, AD-II, AD-III and AD-IV). We monitored the changes in pH, organic acids (including butyric acid, the ratio of acetic acid/lactic acid, γ-aminobutyric acid, 4-hyroxy benzoic acid and phenyl lactic acid), mycotoxins, and bacterial microbiota during silage fermentation. Our results showed that the use of the additives was beneficial to the silage fermentation by enhancing a general pH and mycotoxin reduction, while increasing the organic acids content. By SMRT analysis of the microbial composition in eight silage samples, we found that the bacterial species number and relative abundances shifted apparently after fermentation. Such changes were specific to the LAB species in the additives. Particularly, Bacillus megaterium was the initial dominant species in the raw materials; and after the fermentation process, Pediococcus acidilactici and Lactobacillus plantarum became the most prevalent species, both of which were intrinsically present in the LAB additives. Our data have demonstrated that the SMRT sequencing platform is applicable in assessing the quality of silage. PMID:27340760

  6. Assessing quality of Medicago sativa silage by monitoring bacterial composition with single molecule, real-time sequencing technology and various physiological parameters.

    PubMed

    Bao, Weichen; Mi, Zhihui; Xu, Haiyan; Zheng, Yi; Kwok, Lai Yu; Zhang, Heping; Zhang, Wenyi

    2016-06-24

    The present study applied the PacBio single molecule, real-time sequencing technology (SMRT) in evaluating the quality of silage production. Specifically, we produced four types of Medicago sativa silages by using four different lactic acid bacteria-based additives (AD-I, AD-II, AD-III and AD-IV). We monitored the changes in pH, organic acids (including butyric acid, the ratio of acetic acid/lactic acid, γ-aminobutyric acid, 4-hyroxy benzoic acid and phenyl lactic acid), mycotoxins, and bacterial microbiota during silage fermentation. Our results showed that the use of the additives was beneficial to the silage fermentation by enhancing a general pH and mycotoxin reduction, while increasing the organic acids content. By SMRT analysis of the microbial composition in eight silage samples, we found that the bacterial species number and relative abundances shifted apparently after fermentation. Such changes were specific to the LAB species in the additives. Particularly, Bacillus megaterium was the initial dominant species in the raw materials; and after the fermentation process, Pediococcus acidilactici and Lactobacillus plantarum became the most prevalent species, both of which were intrinsically present in the LAB additives. Our data have demonstrated that the SMRT sequencing platform is applicable in assessing the quality of silage.

  7. Recombination-dependent replication and gene conversion homogenize repeat sequences and diversify plastid genome structure.

    PubMed

    Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K

    2017-04-01

    There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra-and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.

  8. Free energy minimization to predict RNA secondary structures and computational RNA design.

    PubMed

    Churkin, Alexander; Weinbrand, Lina; Barash, Danny

    2015-01-01

    Determining the RNA secondary structure from sequence data by computational predictions is a long-standing problem. Its solution has been approached in two distinctive ways. If a multiple sequence alignment of a collection of homologous sequences is available, the comparative method uses phylogeny to determine conserved base pairs that are more likely to form as a result of billions of years of evolution than by chance. In the case of single sequences, recursive algorithms that compute free energy structures by using empirically derived energy parameters have been developed. This latter approach of RNA folding prediction by energy minimization is widely used to predict RNA secondary structure from sequence. For a significant number of RNA molecules, the secondary structure of the RNA molecule is indicative of its function and its computational prediction by minimizing its free energy is important for its functional analysis. A general method for free energy minimization to predict RNA secondary structures is dynamic programming, although other optimization methods have been developed as well along with empirically derived energy parameters. In this chapter, we introduce and illustrate by examples the approach of free energy minimization to predict RNA secondary structures.

  9. RNase H-assisted RNA-primed rolling circle amplification for targeted RNA sequence detection.

    PubMed

    Takahashi, Hirokazu; Ohkawachi, Masahiko; Horio, Kyohei; Kobori, Toshiro; Aki, Tsunehiro; Matsumura, Yukihiko; Nakashimada, Yutaka; Okamura, Yoshiko

    2018-05-17

    RNA-primed rolling circle amplification (RPRCA) is a useful laboratory method for RNA detection; however, the detection of RNA is limited by the lack of information on 3'-terminal sequences. We uncovered that conventional RPRCA using pre-circularized probes could potentially detect the internal sequence of target RNA molecules in combination with RNase H. However, the specificity for mRNA detection was low, presumably due to non-specific hybridization of non-target RNA with the circular probe. To overcome this technical problem, we developed a method for detecting a sequence of interest in target RNA molecules via RNase H-assisted RPRCA using padlocked probes. When padlock probes are hybridized to the target RNA molecule, they are converted to the circular form by SplintR ligase. Subsequently, RNase H creates nick sites only in the hybridized RNA sequence, and single-stranded DNA is finally synthesized from the nick site by phi29 DNA polymerase. This method could specifically detect at least 10 fmol of the target RNA molecule without reverse transcription. Moreover, this method detected GFP mRNA present in 10 ng of total RNA isolated from Escherichia coli without background DNA amplification. Therefore, this method can potentially detect almost all types of RNA molecules without reverse transcription and reveal full-length sequence information.

  10. Complete genome sequences of two strains of the meat spoilage bacterium Brochothrix thermosphacta isolated from ground chicken

    USDA-ARS?s Scientific Manuscript database

    Brochothrix thermosphacta is an important meat spoilage bacterium. Here we report the genome sequences of two strains of B. thermosphacta isolated from ground chicken. The genome sequences were determined using long-read PacBio single-molecule real-time (SMRT©) technology and are the first complete ...

  11. Complete Genome Sequence of the Probiotic Strain Lactobacillus salivarius LPM01

    PubMed Central

    Codoñer, Francisco M.; Martinez-Blanch, Juan F.; Acevedo-Piérart, Marcelo; Ormeño, M. Loreto; Ramón, Daniel

    2016-01-01

    Lactobacillus salivarius LPM01 (DSM 22150) is a probiotic strain able to improve health status in immunocompromised people. Here, we report its complete genome sequence deciphered by PacBio single-molecule real-time (SMRT) technology. Analysis of the sequence may provide insights into its functional activity and safety assessment. PMID:27881545

  12. A state space based approach to localizing single molecules from multi-emitter images.

    PubMed

    Vahid, Milad R; Chao, Jerry; Ward, E Sally; Ober, Raimund J

    2017-01-28

    Single molecule super-resolution microscopy is a powerful tool that enables imaging at sub-diffraction-limit resolution. In this technique, subsets of stochastically photoactivated fluorophores are imaged over a sequence of frames and accurately localized, and the estimated locations are used to construct a high-resolution image of the cellular structures labeled by the fluorophores. Available localization methods typically first determine the regions of the image that contain emitting fluorophores through a process referred to as detection. Then, the locations of the fluorophores are estimated accurately in an estimation step. We propose a novel localization method which combines the detection and estimation steps. The method models the given image as the frequency response of a multi-order system obtained with a balanced state space realization algorithm based on the singular value decomposition of a Hankel matrix, and determines the locations of intensity peaks in the image as the pole locations of the resulting system. The locations of the most significant peaks correspond to the locations of single molecules in the original image. Although the accuracy of the location estimates is reasonably good, we demonstrate that, by using the estimates as the initial conditions for a maximum likelihood estimator, refined estimates can be obtained that have a standard deviation close to the Cramér-Rao lower bound-based limit of accuracy. We validate our method using both simulated and experimental multi-emitter images.

  13. Competition between B-Z and B-L transitions in a single DNA molecule: Computational studies

    NASA Astrophysics Data System (ADS)

    Kwon, Ah-Young; Nam, Gi-Moon; Johner, Albert; Kim, Seyong; Hong, Seok-Cheol; Lee, Nam-Kyung

    2016-02-01

    Under negative torsion, DNA adopts left-handed helical forms, such as Z-DNA and L-DNA. Using the random copolymer model developed for a wormlike chain, we represent a single DNA molecule with structural heterogeneity as a helical chain consisting of monomers which can be characterized by different helical senses and pitches. By Monte Carlo simulation, where we take into account bending and twist fluctuations explicitly, we study sequence dependence of B-Z transitions under torsional stress and tension focusing on the interaction with B-L transitions. We consider core sequences, (GC) n repeats or (TG) n repeats, which can interconvert between the right-handed B form and the left-handed Z form, imbedded in a random sequence, which can convert to left-handed L form with different (tension dependent) helical pitch. We show that Z-DNA formation from the (GC) n sequence is always supported by unwinding torsional stress but Z-DNA formation from the (TG) n sequence, which are more costly to convert but numerous, can be strongly influenced by the quenched disorder in the surrounding random sequence.

  14. Biosensors for DNA sequence detection

    NASA Technical Reports Server (NTRS)

    Vercoutere, Wenonah; Akeson, Mark

    2002-01-01

    DNA biosensors are being developed as alternatives to conventional DNA microarrays. These devices couple signal transduction directly to sequence recognition. Some of the most sensitive and functional technologies use fibre optics or electrochemical sensors in combination with DNA hybridization. In a shift from sequence recognition by hybridization, two emerging single-molecule techniques read sequence composition using zero-mode waveguides or electrical impedance in nanoscale pores.

  15. Complete Genome Sequence of Clavibacter michiganensis subsp. insidiosus R1-1 Using PacBio Single-Molecule Real-Time Technology.

    PubMed

    Lu, You; Samac, Deborah A; Glazebrook, Jane; Ishimaru, Carol A

    2015-05-07

    We report here the complete genome sequence of Clavibacter michiganensis subsp. insidiosus R1-1, isolated in Minnesota, USA. The R1-1 genome, generated by a de novo assembly of PacBio sequencing data, is the first complete genome sequence available for this subspecies. Copyright © 2015 Lu et al.

  16. bpRNA: large-scale automated annotation and analysis of RNA secondary structure.

    PubMed

    Danaee, Padideh; Rouches, Mason; Wiley, Michelle; Deng, Dezhong; Huang, Liang; Hendrix, David

    2018-05-09

    While RNA secondary structure prediction from sequence data has made remarkable progress, there is a need for improved strategies for annotating the features of RNA secondary structures. Here, we present bpRNA, a novel annotation tool capable of parsing RNA structures, including complex pseudoknot-containing RNAs, to yield an objective, precise, compact, unambiguous, easily-interpretable description of all loops, stems, and pseudoknots, along with the positions, sequence, and flanking base pairs of each such structural feature. We also introduce several new informative representations of RNA structure types to improve structure visualization and interpretation. We have further used bpRNA to generate a web-accessible meta-database, 'bpRNA-1m', of over 100 000 single-molecule, known secondary structures; this is both more fully and accurately annotated and over 20-times larger than existing databases. We use a subset of the database with highly similar (≥90% identical) sequences filtered out to report on statistical trends in sequence, flanking base pairs, and length. Both the bpRNA method and the bpRNA-1m database will be valuable resources both for specific analysis of individual RNA molecules and large-scale analyses such as are useful for updating RNA energy parameters for computational thermodynamic predictions, improving machine learning models for structure prediction, and for benchmarking structure-prediction algorithms.

  17. SAbPred: a structure-based antibody prediction server

    PubMed Central

    Dunbar, James; Krawczyk, Konrad; Leem, Jinwoo; Marks, Claire; Nowak, Jaroslaw; Regep, Cristian; Georges, Guy; Kelm, Sebastian; Popovic, Bojana; Deane, Charlotte M.

    2016-01-01

    SAbPred is a server that makes predictions of the properties of antibodies focusing on their structures. Antibody informatics tools can help improve our understanding of immune responses to disease and aid in the design and engineering of therapeutic molecules. SAbPred is a single platform containing multiple applications which can: number and align sequences; automatically generate antibody variable fragment homology models; annotate such models with estimated accuracy alongside sequence and structural properties including potential developability issues; predict paratope residues; and predict epitope patches on protein antigens. The server is available at http://opig.stats.ox.ac.uk/webapps/sabpred. PMID:27131379

  18. Unraveling Hydrophobic Interactions at the Molecular Scale Using Force Spectroscopy and Molecular Dynamics Simulations.

    PubMed

    Stock, Philipp; Monroe, Jacob I; Utzig, Thomas; Smith, David J; Shell, M Scott; Valtiner, Markus

    2017-03-28

    Interactions between hydrophobic moieties steer ubiquitous processes in aqueous media, including the self-organization of biologic matter. Recent decades have seen tremendous progress in understanding these for macroscopic hydrophobic interfaces. Yet, it is still a challenge to experimentally measure hydrophobic interactions (HIs) at the single-molecule scale and thus to compare with theory. Here, we present a combined experimental-simulation approach to directly measure and quantify the sequence dependence and additivity of HIs in peptide systems at the single-molecule scale. We combine dynamic single-molecule force spectroscopy on model peptides with fully atomistic, both equilibrium and nonequilibrium, molecular dynamics (MD) simulations of the same systems. Specifically, we mutate a flexible (GS) 5 peptide scaffold with increasing numbers of hydrophobic leucine monomers and measure the peptides' desorption from hydrophobic self-assembled monolayer surfaces. Based on the analysis of nonequilibrium work-trajectories, we measure an interaction free energy that scales linearly with 3.0-3.4 k B T per leucine. In good agreement, simulations indicate a similar trend with 2.1 k B T per leucine, while also providing a detailed molecular view into HIs. This approach potentially provides a roadmap for directly extracting qualitative and quantitative single-molecule interactions at solid/liquid interfaces in a wide range of fields, including interactions at biointerfaces and adhesive interactions in industrial applications.

  19. DNA Sequence-Dependent Ionic Currents in Ultra-Small Solid-State Nanopores†

    PubMed Central

    Comer, Jeffrey

    2016-01-01

    Measurements of ionic currents through nanopores partially blocked by DNA have emerged as a powerful method for characterization of the DNA nucleotide sequence. Although the effect of the nucleotide sequence on the nanopore blockade current has been experimentally demonstrated, prediction and interpretation of such measurements remain a formidable challenge. Using atomic resolution computational approaches, here we show how the sequence, molecular conformation, and pore geometry affect the blockade ionic current in model solid-state nanopores. We demonstrate that the blockade current from a DNA molecule is determined by the chemical identities and conformations of at least three consecutive nucleotides. We find the blockade currents produced by the nucleotide triplets to vary considerably with their nucleotide sequence despite having nearly identical molecular conformations. Encouragingly, we find blockade current differences as large as 25% for single-base substitutions in ultra small (1.6 nm × 1.1 nm cross section; 2 nm length) solid-state nanopores. Despite the complex dependence of the blockade current on the sequence and conformation of the DNA triplets, we find that, under many conditions, the number of thymine bases is positively correlated with the current, whereas the number of purine bases and the presence of both purine and pyrimidines in the triplet are negatively correlated with the current. Based on these observations, we construct a simple theoretical model that relates the ion current to the base content of a solid-state nanopore. Furthermore, we show that compact conformations of DNA in narrow pores provide the greatest signal-to-noise ratio for single base detection, whereas reduction of the nanopore length increases the ionic current noise. Thus, the sequence dependence of nanopore blockade current can be theoretically rationalized, although the predictions will likely need to be customized for each nanopore type. PMID:27103233

  20. DNA unzipping phase diagram calculated via replica theory.

    PubMed

    Roland, C Brian; Hatch, Kristi Adamson; Prentiss, Mara; Shakhnovich, Eugene I

    2009-05-01

    We show how single-molecule unzipping experiments can provide strong evidence that the zero-force melting transition of long molecules of natural dsDNA should be classified as a phase transition of the higher-order type (continuous). Toward this end, we study a statistical-mechanics model for the fluctuating structure of a long molecule of dsDNA, and compute the equilibrium phase diagram for the experiment in which the molecule is unzipped under applied force. We consider a perfect-matching dsDNA model, in which the loops are volume-excluding chains with arbitrary loop exponent c . We include stacking interactions, hydrogen bonds, and main-chain entropy. We include sequence heterogeneity at the level of random sequences; in particular, there is no correlation in the base-pairing (bp) energy from one sequence position to the next. We present heuristic arguments to demonstrate that the low-temperature macrostate does not exhibit degenerate ergodicity breaking. We use this claim to understand the results of our replica-theoretic calculation of the equilibrium properties of the system. As a function of temperature, we obtain the minimal force at which the molecule separates completely. This critical-force curve is a line in the temperature-force phase diagram that marks the regions where the molecule exists primarily as a double helix versus the region where the molecule exists as two separate strands. We compare our random-sequence model to magnetic tweezer experiments performed on the 48 502 bp genome of bacteriophage lambda . We find good agreement with the experimental data, which is restricted to temperatures between 24 and 50 degrees C . At higher temperatures, the critical-force curve of our random-sequence model is very different for that of the homogeneous-sequence version of our model. For both sequence models, the critical force falls to zero at the melting temperature T_{c} like |T-T_{c}|;{alpha} . For the homogeneous-sequence model, alpha=1/2 almost exactly, while for the random-sequence model, alpha approximately 0.9 . Importantly, the shape of the critical-force curve is connected, via our theory, to the manner in which the helix fraction falls to zero at T_{c} . The helix fraction is the property that is used to classify the melting transition as a type of phase transition. In our calculation, the shape of the critical-force curve holds strong evidence that the zero-force melting transition of long natural dsDNA should be classified as a higher-order (continuous) phase transition. Specifically, the order is 3rd or greater.

  1. Optical mapping and its potential for large-scale sequencing projects.

    PubMed

    Aston, C; Mishra, B; Schwartz, D C

    1999-07-01

    Physical mapping has been rediscovered as an important component of large-scale sequencing projects. Restriction maps provide landmark sequences at defined intervals, and high-resolution restriction maps can be assembled from ensembles of single molecules by optical means. Such optical maps can be constructed from both large-insert clones and genomic DNA, and are used as a scaffold for accurately aligning sequence contigs generated by shotgun sequencing.

  2. Single-molecule analysis of DNA cross-links using nanopore technology

    NASA Astrophysics Data System (ADS)

    Wolna, Anna H.

    The alpha-hemolysin (alpha-HL) protein ion channel is a potential next-generation sequencing platform that has been extensively used to study nucleic acids at a single-molecule level. After applying a potential across a lipid bilayer, the imbedded alpha-HL allows monitoring of the duration and current levels of DNA translocation and immobilization. Because this method does not require DNA amplification prior to sequencing, all the DNA damage present in the cell at any given time will be present during the sequencing experiment. The goal of this research is to determine if these damage sites give distinguishable current levels beyond those observed for the canonical nucleobases. Because DNA cross-links are one of the most prevalent types of DNA damage occurring in vivo, the blockage current levels were determined for thymine-dimers, guanine(C8)-thymine(N3) cross-links and platinum adducts. All of these cross-links give a different blockage current level compared to the undamaged strands when immobilized in the ion channel, and they all can easily translocate across the alpha-HL channel. Additionally, the alpha-HL nanopore technique presents a unique opportunity to study the effects of DNA cross-links, such as thymine-dimers, on the secondary structure of DNA G-quadruplexes folded from the human telomere sequence. Using this single-molecule nanopore technique we can detect subtle structural differences that cannot be easily addressed using conventional methods. The human telomere plays crucial roles in maintaining genome stability. In the presence of suitable cations, the repetitive 5'-TTAGGG human telomere sequence can fold into G-quadruplexes that adopt the hybrid fold in vivo. The telomere sequence is hypersensitive to UV-induced thymine-dimer (T=T) formation, and yet the presence of thymine dimers does not cause telomere shortening. The potential structural disruption and thermodynamic stability of the T=T-containing natural telomere sequences were studied to understand how this damage is tolerated in telomeric DNA. The alpha-HL experiments determined that T=Ts disrupt double-chain reversal loop formation but are well tolerated in edgewise and diagonal loops of the hybrid G-quadruplexes. These studies demonstrated the power of the alpha-HL ion channel to analyze DNA modifications and secondary structures at a single-molecule level.

  3. Detection and quantitation of single nucleotide polymorphisms, DNA sequence variations, DNA mutations, DNA damage and DNA mismatches

    DOEpatents

    McCutchen-Maloney, Sandra L.

    2002-01-01

    DNA mutation binding proteins alone and as chimeric proteins with nucleases are used with solid supports to detect DNA sequence variations, DNA mutations and single nucleotide polymorphisms. The solid supports may be flow cytometry beads, DNA chips, glass slides or DNA dips sticks. DNA molecules are coupled to solid supports to form DNA-support complexes. Labeled DNA is used with unlabeled DNA mutation binding proteins such at TthMutS to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by binding which gives an increase in signal. Unlabeled DNA is utilized with labeled chimeras to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by nuclease activity of the chimera which gives a decrease in signal.

  4. Identifying active foraminifera in the Sea of Japan using metatranscriptomic approach

    NASA Astrophysics Data System (ADS)

    Lejzerowicz, Franck; Voltsky, Ivan; Pawlowski, Jan

    2013-02-01

    Metagenetics represents an efficient and rapid tool to describe environmental diversity patterns of microbial eukaryotes based on ribosomal DNA sequences. However, the results of metagenetic studies are often biased by the presence of extracellular DNA molecules that are persistent in the environment, especially in deep-sea sediment. As an alternative, short-lived RNA molecules constitute a good proxy for the detection of active species. Here, we used a metatranscriptomic approach based on RNA-derived (cDNA) sequences to study the diversity of the deep-sea benthic foraminifera and compared it to the metagenetic approach. We analyzed 257 ribosomal DNA and cDNA sequences obtained from seven sediments samples collected in the Sea of Japan at depths ranging from 486 to 3665 m. The DNA and RNA-based approaches gave a similar view of the taxonomic composition of foraminiferal assemblage, but differed in some important points. First, the cDNA dataset was dominated by sequences of rotaliids and robertiniids, suggesting that these calcareous species, some of which have been observed in Rose Bengal stained samples, are the most active component of foraminiferal community. Second, the richness of monothalamous (single-chambered) foraminifera was particularly high in DNA extracts from the deepest samples, confirming that this group of foraminifera is abundant but not necessarily very active in the deep-sea sediments. Finally, the high divergence of undetermined sequences in cDNA dataset indicate the limits of our database and lack of knowledge about some active but possibly rare species. Our study demonstrates the capability of the metatranscriptomic approach to detect active foraminiferal species and prompt its use in future high-throughput sequencing-based environmental surveys.

  5. Single helically folded aromatic oligoamides that mimic the charge surface of double-stranded B-DNA

    NASA Astrophysics Data System (ADS)

    Ziach, Krzysztof; Chollet, Céline; Parissi, Vincent; Prabhakaran, Panchami; Marchivie, Mathieu; Corvaglia, Valentina; Bose, Partha Pratim; Laxmi-Reddy, Katta; Godde, Frédéric; Schmitter, Jean-Marie; Chaignepain, Stéphane; Pourquier, Philippe; Huc, Ivan

    2018-05-01

    Numerous essential biomolecular processes require the recognition of DNA surface features by proteins. Molecules mimicking these features could potentially act as decoys and interfere with pharmacologically or therapeutically relevant protein-DNA interactions. Although naturally occurring DNA-mimicking proteins have been described, synthetic tunable molecules that mimic the charge surface of double-stranded DNA are not known. Here, we report the design, synthesis and structural characterization of aromatic oligoamides that fold into single helical conformations and display a double helical array of negatively charged residues in positions that match the phosphate moieties in B-DNA. These molecules were able to inhibit several enzymes possessing non-sequence-selective DNA-binding properties, including topoisomerase 1 and HIV-1 integrase, presumably through specific foldamer-protein interactions, whereas sequence-selective enzymes were not inhibited. Such modular and synthetically accessible DNA mimics provide a versatile platform to design novel inhibitors of protein-DNA interactions.

  6. Single-Stranded Condensation Stochastically Blocks G-Quadruplex Assembly in Human Telomeric RNA.

    PubMed

    Gutiérrez, Irene; Garavís, Miguel; de Lorenzo, Sara; Villasante, Alfredo; González, Carlos; Arias-Gonzalez, J Ricardo

    2018-05-17

    TERRA is an RNA molecule transcribed from human subtelomeric regions toward chromosome ends potentially involved in regulation of heterochromatin stability, semiconservative replication, and telomerase inhibition, among others. TERRA contains tandem repeats of the sequence GGGUUA, with a strong tendency to fold into a four-stranded arrangement known as a parallel G-quadruplex. Here, we demonstrate by using single-molecule force spectroscopy that this potential is limited by the inherent capacity of RNA to self-associate randomly and further condense into entropically more favorable structures. We stretched RNA constructions with more than four and less than eight hexanucleotide repeats, thus unable to form several G-quadruplexes in tandem, flanked by non-G-rich overhangs of random sequence by optical tweezers on a one by one basis. We found that condensed RNA stochastically blocks G-quadruplex folding pathways with a near 20% probability, a behavior that is not found in DNA analogous molecules.

  7. Single-molecule detection: applications to ultrasensitive biochemical analysis

    NASA Astrophysics Data System (ADS)

    Castro, Alonso; Shera, E. Brooks

    1995-06-01

    Recent developments in laser-based detection of fluorescent molecules have made possible the implementation of very sensitive techniques for biochemical analysis. We present and discuss our experiments on the applications of our recently developed technique of single-molecule detection to the analysis of molecules of biological interest. These newly developed methods are capable of detecting and identifying biomolecules at the single-molecule level of sensitivity. In one case, identification is based on measuring fluorescence brightness from single molecules. In another, molecules are classified by determining their electrophoretic velocities.

  8. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-06-06

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  9. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-05-30

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  10. Complete genome sequence of Pelosinus sp. strain UFO1 assembled using single-molecule real-time DNA sequencing technology

    DOE PAGES

    Brown, Steven D.; Utturkar, Sagar M.; Magnuson, Timothy S.; ...

    2014-09-04

    Pelosinus fermentans strain R7 was isolated from Russian kaolin clays as the type strain and it can reduce Fe(III) during fermentative growth (1). Draft genome sequences for P. fermentans R7 and four strains from Hanford, Washington, USA, have been published (2–4). The P. fermentans 16S rRNA sequence dominated the lactate-based enrichment cultures from three geochemically contrasting soils from the Melton Branch Watershed, Oak Ridge, Tennessee, USA (5) and also at another stimulated, uraniumcontaminated field site near Oak Ridge (6). For the current work, strain UFO1 was isolated from pristine sediments at a background field site in Oak Ridge and characterizedmore » as facilitating U(VI) reduction and precipitation with phosphate (7).« less

  11. Single nucleotide primer extension to detect genetic diseases: Experimental application to hemophilia B (factor IX) and cystic fibrosis genes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kuppuswamy, M.N.; Hoffmann, J.W.; Spitzer, S.G.

    1991-02-15

    In this report, the authors describe an approach to detect the presence of abnormal alleles in those genetic diseases in which frequency of occurrence of the same mutation is high (e.g., hemophilia B). Initially, from each subject, the DNA fragment containing the putative mutation site is amplified by the polymerase chain reaction. For each fragment two reaction mixtures are then prepared. Each contains the amplified fragment, a primer (18-mer or longer) whose sequence is identical to the coding sequence of the normal gene immediately flanking the 5{prime} end of the mutation site, and either an {alpha}-{sup 32}P-labeled nucleotide corresponding tomore » the normal coding sequence at the mutation site or an {alpha}-{sup 32}P-labeled nucleotide corresponding to the mutant sequence. An essential feature of the present methodology is that the base immediately 3{prime} to the template-bound primer is one of those altered in the mutant, since in this way an extension of the primer by a single base will give an extended molecule characteristic of either the mutant or the wild type. The method is rapid and should be useful in carrier detection and prenatal diagnosis of every genetic disease with a known sequence variation.« less

  12. Recognition Tunneling

    PubMed Central

    Lindsay, Stuart; He, Jin; Sankey, Otto; Hapala, Prokop; Jelinek, Pavel; Zhang, Peiming; Chang, Shuai; Huang, Shuo

    2010-01-01

    Single molecules in a tunnel junction can now be interrogated reliably using chemically-functionalized electrodes. Monitoring stochastic bonding fluctuations between a ligand bound to one electrode and its target bound to a second electrode (“tethered molecule-pair” configuration) gives insight into the nature of the intermolecular bonding at a single molecule-pair level, and defines the requirements for reproducible tunneling data. Simulations show that there is an instability in the tunnel gap at large currents, and this results in a multiplicity of contacts with a corresponding spread in the measured currents. At small currents (i.e. large gaps) the gap is stable, and functionalizing a pair of electrodes with recognition reagents (the “free analyte” configuration) can generate a distinct tunneling signal when an analyte molecule is trapped in the gap. This opens up a new interface between chemistry and electronics with immediate implications for rapid sequencing of single DNA molecules. PMID:20522930

  13. Two-Way Gold Nanoparticle Label-Free Sensing of Specific Sequence and Small Molecule Targets Using Switchable Concatemers.

    PubMed

    Zhu, Longjiao; Shao, Xiangli; Luo, Yunbo; Huang, Kunlung; Xu, Wentao

    2017-05-19

    A two-way colorimetric biosensor based on unmodified gold nanoparticles (GNPs) and a switchable double-stranded DNA (dsDNA) concatemer have been demonstrated. Two hairpin probes (H1 and H2) were first designed that provided the fuels to assemble the dsDNA concatemers via hybridization chain reaction (HCR). A functional hairpin (FH) was rationally designed to recognize the target sequences. All the hairpins contained a single-stranded DNA (ssDNA) loop and sticky end to prevent GNPs from salt-induced aggregation. In the presence of target sequence, the capture probe blocked in the FH recognizes the target to form a duplex DNA, which causes the release of the initiator probe by FH conformational change. This process then starts the alternate-opening of H1 and H2 through HCR, and dsDNA concatemers grow from the target sequence. As a result, unmodified GNPs undergo salt-induced aggregation because the formed dsDNA concatemers are stiffer and provide less stabilization. A light purple-to-blue color variation was observed in the bulk solution, termed the light-off sensing way. Furthermore, H1 ingeniously inserted an aptamer sequence to generate dsDNA concatemers with multiple small molecule binding sites. In the presence of small molecule targets, concatemers can be disassembled into mixtures with ssDNA sticky ends. A blue-to-purple reverse color variation was observed due to the regeneration of the ssDNA, termed the light-on way. The two-way biosensor can detect both nucleic acids and small molecule targets with one sensing device. This switchable sensing element is label-free, enzyme-free, and sophisticated-instrumentation-free. The detection limits of both targets were below nanomolar.

  14. Observing Holliday junction branch migration one step at a time

    NASA Astrophysics Data System (ADS)

    Ha, Taekjip

    2004-03-01

    During genetic recombination, two homologous DNA molecules undergo strand exchange to form a four-way DNA (Holliday) junction and the recognition and processing of this species by branch migration and junction resolving enzymes determine the outcome. We have used single molecule fluorescence techniques to study two intrinsic structural dynamics of the Holliday junction, stacking conformer transitions and spontaneous branch migration. Our studies show that the dynamics of branch migration, resolved with one base pair resolution, is determined by the stability of conformers which in turn depends on the local DNA sequences. Therefore, the energy landscape of Holliday junction branch migation is not uniform, but is rugged.

  15. SMRT sequencing of the Vitis vinifera cv. ‘Flame seedless’ genome using a SMRTbell-free library preparation from Swift Biosciences

    USDA-ARS?s Scientific Manuscript database

    Single Molecule Real-Time (SMRT) sequencing provides advantages to the sequencing of complex genomes. The long reads generated are superior for resolving complex genomic regions and provide highly contiguous de novo assemblies. Current SMRTbell libraries generate average read lengths of 10-15kb. How...

  16. Complete Genome Sequence of the Probiotic Strain Lactobacillus salivarius LPM01.

    PubMed

    Chenoll, Empar; Codoñer, Francisco M; Martinez-Blanch, Juan F; Acevedo-Piérart, Marcelo; Ormeño, M Loreto; Ramón, Daniel; Genovés, Salvador

    2016-11-23

    Lactobacillus salivarius LPM01 (DSM 22150) is a probiotic strain able to improve health status in immunocompromised people. Here, we report its complete genome sequence deciphered by PacBio single-molecule real-time (SMRT) technology. Analysis of the sequence may provide insights into its functional activity and safety assessment. Copyright © 2016 Chenoll et al.

  17. The structure of cell adhesion molecule uvomorulin. Insights into the molecular mechanism of Ca2+-dependent cell adhesion.

    PubMed Central

    Ringwald, M; Schuh, R; Vestweber, D; Eistetter, H; Lottspeich, F; Engel, J; Dölz, R; Jähnig, F; Epplen, J; Mayer, S

    1987-01-01

    We have determined the amino acid sequence of the Ca2+-dependent cell adhesion molecule uvomorulin as it appears on the cell surface. The extracellular part of the molecule exhibits three internally repeated domains of 112 residues which are most likely generated by gene duplication. Each of the repeated domains contains two highly conserved units which could represent putative Ca2+-binding sites. Secondary structure predictions suggest that the putative Ca2+-binding units are located in external loops at the surface of the protein. The protein sequence exhibits a single membrane-spanning region and a cytoplasmic domain. Sequence comparison reveals extensive homology to the chicken L-CAM. Both uvomorulin and L-CAM are identical in 65% of their entire amino acid sequence suggesting a common origin for both CAMs. Images Fig. 1. Fig. 4. Fig. 7. PMID:3501370

  18. A survey of the sorghum transcriptome using single-molecule long reads

    DOE PAGES

    Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; ...

    2016-06-24

    Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novelmore » splice isoforms. Additionally, we uncover APA ofB11,000 expressed genes and more than 2,100 novel genes. Lastly, these results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism.« less

  19. A survey of the sorghum transcriptome using single-molecule long reads

    PubMed Central

    Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; Ngam, Peter; Devitt, Nicholas; Schilkey, Faye; Ben-Hur, Asa; Reddy, Anireddy S. N.

    2016-01-01

    Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novel splice isoforms. Additionally, we uncover APA of ∼11,000 expressed genes and more than 2,100 novel genes. These results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism. PMID:27339290

  20. Single-molecule nanopore enzymology

    PubMed Central

    Wloka, Carsten; Maglia, Giovanni

    2017-01-01

    Biological nanopores are a class of membrane proteins that open nanoscale water-conduits in biological membranes. When they are reconstituted in artificial membranes and a bias voltage is applied across the membrane, the ionic current passing through individual nanopores can be used to monitor chemical reactions, to recognize individual molecules and, of most interest, to sequence DNA. More recently, proteins and enzymes have started being analysed with nanopores. Monitoring enzymatic reactions with nanopores, i.e. nanopore enzymology, has the unique advantage that it allows long-timescale observations of native proteins at the single-molecule level. Here we describe the approaches and challenges in nanopore enzymology. PMID:28630164

  1. A Benchmark Study on Error Assessment and Quality Control of CCS Reads Derived from the PacBio RS

    PubMed Central

    Jiao, Xiaoli; Zheng, Xin; Ma, Liang; Kutty, Geetha; Gogineni, Emile; Sun, Qiang; Sherman, Brad T.; Hu, Xiaojun; Jones, Kristine; Raley, Castle; Tran, Bao; Munroe, David J.; Stephens, Robert; Liang, Dun; Imamichi, Tomozumi; Kovacs, Joseph A.; Lempicki, Richard A.; Huang, Da Wei

    2013-01-01

    PacBio RS, a newly emerging third-generation DNA sequencing platform, is based on a real-time, single-molecule, nano-nitch sequencing technology that can generate very long reads (up to 20-kb) in contrast to the shorter reads produced by the first and second generation sequencing technologies. As a new platform, it is important to assess the sequencing error rate, as well as the quality control (QC) parameters associated with the PacBio sequence data. In this study, a mixture of 10 prior known, closely related DNA amplicons were sequenced using the PacBio RS sequencing platform. After aligning Circular Consensus Sequence (CCS) reads derived from the above sequencing experiment to the known reference sequences, we found that the median error rate was 2.5% without read QC, and improved to 1.3% with an SVM based multi-parameter QC method. In addition, a De Novo assembly was used as a downstream application to evaluate the effects of different QC approaches. This benchmark study indicates that even though CCS reads are post error-corrected it is still necessary to perform appropriate QC on CCS reads in order to produce successful downstream bioinformatics analytical results. PMID:24179701

  2. A Benchmark Study on Error Assessment and Quality Control of CCS Reads Derived from the PacBio RS.

    PubMed

    Jiao, Xiaoli; Zheng, Xin; Ma, Liang; Kutty, Geetha; Gogineni, Emile; Sun, Qiang; Sherman, Brad T; Hu, Xiaojun; Jones, Kristine; Raley, Castle; Tran, Bao; Munroe, David J; Stephens, Robert; Liang, Dun; Imamichi, Tomozumi; Kovacs, Joseph A; Lempicki, Richard A; Huang, Da Wei

    2013-07-31

    PacBio RS, a newly emerging third-generation DNA sequencing platform, is based on a real-time, single-molecule, nano-nitch sequencing technology that can generate very long reads (up to 20-kb) in contrast to the shorter reads produced by the first and second generation sequencing technologies. As a new platform, it is important to assess the sequencing error rate, as well as the quality control (QC) parameters associated with the PacBio sequence data. In this study, a mixture of 10 prior known, closely related DNA amplicons were sequenced using the PacBio RS sequencing platform. After aligning Circular Consensus Sequence (CCS) reads derived from the above sequencing experiment to the known reference sequences, we found that the median error rate was 2.5% without read QC, and improved to 1.3% with an SVM based multi-parameter QC method. In addition, a De Novo assembly was used as a downstream application to evaluate the effects of different QC approaches. This benchmark study indicates that even though CCS reads are post error-corrected it is still necessary to perform appropriate QC on CCS reads in order to produce successful downstream bioinformatics analytical results.

  3. Changes in solvation during DNA binding and cleavage are critical to altered specificity of the EcoRI endonuclease

    PubMed Central

    Robinson, Clifford R.; Sligar, Stephen G.

    1998-01-01

    Restriction endonucleases such as EcoRI bind and cleave DNA with great specificity and represent a paradigm for protein–DNA interactions and molecular recognition. Using osmotic pressure to induce water release, we demonstrate the participation of bound waters in the sequence discrimination of substrate DNA by EcoRI. Changes in solvation can play a critical role in directing sequence-specific DNA binding by EcoRI and are also crucial in assisting site discrimination during catalysis. By measuring the volume change for complex formation, we show that at the cognate sequence (GAATTC) EcoRI binding releases about 70 fewer water molecules than binding at an alternate DNA sequence (TAATTC), which differs by a single base pair. EcoRI complexation with nonspecific DNA releases substantially less water than either of these specific complexes. In cognate substrates (GAATTC) kcat decreases as osmotic pressure is increased, indicating the binding of about 30 water molecules accompanies the cleavage reaction. For the alternate substrate (TAATTC), release of about 40 water molecules accompanies the reaction, indicated by a dramatic acceleration of the rate when osmotic pressure is raised. These large differences in solvation effects demonstrate that water molecules can be key players in the molecular recognition process during both association and catalytic phases of the EcoRI reaction, acting to change the specificity of the enzyme. For both the protein–DNA complex and the transition state, there may be substantial conformational differences between cognate and alternate sites, accompanied by significant alterations in hydration and solvent accessibility. PMID:9482860

  4. Identification of Microbial Profile of Koji Using Single Molecule, Real-Time Sequencing Technology.

    PubMed

    Hui, Wenyan; Hou, Qiangchuan; Cao, Chenxia; Xu, Haiyan; Zhen, Yi; Kwok, Lai-Yu; Sun, Tiansong; Zhang, Heping; Zhang, Wenyi

    2017-05-01

    Koji is a kind of Japanese traditional fermented starter that has been used for centuries. Many fermented foods are made from koji, such as sake, miso, and soy sauce. This study used the single molecule real-time sequencing technology (SMRT) to investigate the bacterial and fungal microbiota of 3 Japanese koji samples. After SMRT analysis, a total of 39121 high-quality sequences were generated, including 14354 bacterial and 24767 fungal sequence reads. The high-quality gene sequences were assigned to 5 bacterial and 2 fungal plyla, dominated by Proteobacteria and Ascomycota, respectively. At the genus level, Ochrobactrum and Wickerhamomyces were the most abundant bacterial and fungal genera, respectively. The predominant bacterial and fungal species were Ochrobactrum lupini and Wickerhamomyces anomalus, respectively. Our study profiled the microbiota composition of 3 Japanese koji samples to the species level precision. The results may be useful for further development of traditional fermented products, especially optimization of koji preparation. Meanwhile, this study has demonstrated that SMRT is a robust tool for analyzing the microbial composition in food samples. © 2017 Institute of Food Technologists®.

  5. DNA-Encoded Solid-Phase Synthesis: Encoding Language Design and Complex Oligomer Library Synthesis.

    PubMed

    MacConnell, Andrew B; McEnaney, Patrick J; Cavett, Valerie J; Paegel, Brian M

    2015-09-14

    The promise of exploiting combinatorial synthesis for small molecule discovery remains unfulfilled due primarily to the "structure elucidation problem": the back-end mass spectrometric analysis that significantly restricts one-bead-one-compound (OBOC) library complexity. The very molecular features that confer binding potency and specificity, such as stereochemistry, regiochemistry, and scaffold rigidity, are conspicuously absent from most libraries because isomerism introduces mass redundancy and diverse scaffolds yield uninterpretable MS fragmentation. Here we present DNA-encoded solid-phase synthesis (DESPS), comprising parallel compound synthesis in organic solvent and aqueous enzymatic ligation of unprotected encoding dsDNA oligonucleotides. Computational encoding language design yielded 148 thermodynamically optimized sequences with Hamming string distance ≥ 3 and total read length <100 bases for facile sequencing. Ligation is efficient (70% yield), specific, and directional over 6 encoding positions. A series of isomers served as a testbed for DESPS's utility in split-and-pool diversification. Single-bead quantitative PCR detected 9 × 10(4) molecules/bead and sequencing allowed for elucidation of each compound's synthetic history. We applied DESPS to the combinatorial synthesis of a 75,645-member OBOC library containing scaffold, stereochemical and regiochemical diversity using mixed-scale resin (160-μm quality control beads and 10-μm screening beads). Tandem DNA sequencing/MALDI-TOF MS analysis of 19 quality control beads showed excellent agreement (<1 ppt) between DNA sequence-predicted mass and the observed mass. DESPS synergistically unites the advantages of solid-phase synthesis and DNA encoding, enabling single-bead structural elucidation of complex compounds and synthesis using reactions normally considered incompatible with unprotected DNA. The widespread availability of inexpensive oligonucleotide synthesis, enzymes, DNA sequencing, and PCR make implementation of DESPS straightforward, and may prompt the chemistry community to revisit the synthesis of more complex and diverse libraries.

  6. Improved maize reference genome with single-molecule technologies.

    PubMed

    Jiao, Yinping; Peluso, Paul; Shi, Jinghua; Liang, Tiffany; Stitzer, Michelle C; Wang, Bo; Campbell, Michael S; Stein, Joshua C; Wei, Xuehong; Chin, Chen-Shan; Guill, Katherine; Regulski, Michael; Kumari, Sunita; Olson, Andrew; Gent, Jonathan; Schneider, Kevin L; Wolfgruber, Thomas K; May, Michael R; Springer, Nathan M; Antoniou, Eric; McCombie, W Richard; Presting, Gernot G; McMullen, Michael; Ross-Ibarra, Jeffrey; Dawe, R Kelly; Hastie, Alex; Rank, David R; Ware, Doreen

    2017-06-22

    Complete and accurate reference genomes and annotations provide fundamental tools for characterization of genetic and functional variation. These resources facilitate the determination of biological processes and support translation of research findings into improved and sustainable agricultural technologies. Many reference genomes for crop plants have been generated over the past decade, but these genomes are often fragmented and missing complex repeat regions. Here we report the assembly and annotation of a reference genome of maize, a genetic and agricultural model species, using single-molecule real-time sequencing and high-resolution optical mapping. Relative to the previous reference genome, our assembly features a 52-fold increase in contig length and notable improvements in the assembly of intergenic spaces and centromeres. Characterization of the repetitive portion of the genome revealed more than 130,000 intact transposable elements, allowing us to identify transposable element lineage expansions that are unique to maize. Gene annotations were updated using 111,000 full-length transcripts obtained by single-molecule real-time sequencing. In addition, comparative optical mapping of two other inbred maize lines revealed a prevalence of deletions in regions of low gene density and maize lineage-specific genes.

  7. Switchable DNA interfaces for the highly sensitive detection of label-free DNA targets.

    PubMed

    Rant, Ulrich; Arinaga, Kenji; Scherer, Simon; Pringsheim, Erika; Fujita, Shozo; Yokoyama, Naoki; Tornow, Marc; Abstreiter, Gerhard

    2007-10-30

    We report a method to detect label-free oligonucleotide targets. The conformation of surface-tethered probe nucleic acids is modulated by alternating electric fields, which cause the molecules to extend away from or fold onto the biased surface. Binding (hybridization) of targets to the single-stranded probes results in a pronounced enhancement of the layer-height modulation amplitude, monitored optically in real time. The method features an exceptional detection limit of <3 x 10(8) bound targets per cm(2) sensor area. Single base-pair mismatches in the sequences of DNA complements may readily be identified; moreover, binding kinetics and binding affinities can be determined with high accuracy. When driving the DNA to oscillate at frequencies in the kHz regime, distinct switching kinetics are revealed for single- and double-stranded DNA. Molecular dynamics are used to identify the binding state of molecules according to their characteristic kinetic fingerprints by using a chip-compatible detection format.

  8. Switchable DNA interfaces for the highly sensitive detection of label-free DNA targets

    PubMed Central

    Rant, Ulrich; Arinaga, Kenji; Scherer, Simon; Pringsheim, Erika; Fujita, Shozo; Yokoyama, Naoki; Tornow, Marc; Abstreiter, Gerhard

    2007-01-01

    We report a method to detect label-free oligonucleotide targets. The conformation of surface-tethered probe nucleic acids is modulated by alternating electric fields, which cause the molecules to extend away from or fold onto the biased surface. Binding (hybridization) of targets to the single-stranded probes results in a pronounced enhancement of the layer-height modulation amplitude, monitored optically in real time. The method features an exceptional detection limit of <3 × 108 bound targets per cm2 sensor area. Single base-pair mismatches in the sequences of DNA complements may readily be identified; moreover, binding kinetics and binding affinities can be determined with high accuracy. When driving the DNA to oscillate at frequencies in the kHz regime, distinct switching kinetics are revealed for single- and double-stranded DNA. Molecular dynamics are used to identify the binding state of molecules according to their characteristic kinetic fingerprints by using a chip-compatible detection format. PMID:17951434

  9. Replacement of RNA hairpins by in vitro selected tetranucleotides.

    PubMed Central

    Dichtl, B; Pan, T; DiRenzo, A B; Uhlenbeck, O C

    1993-01-01

    An in vitro selection method based on the autolytic cleavage of yeast tRNA(Phe) by Pb2+ was applied to obtain tRNA derivatives with the anticodon hairpin replaced by four single-stranded nucleotides. Based on the rates of the site-specific cleavage by Pb2+ and the presence of a specific UV-induced crosslink, certain tetranucleotide sequences allow proper folding of the rest of the tRNA molecule, whereas others do not. One such successful tetramer sequence was also used to replace the acceptor stem of yeast tRNA(Phe) and the anticodon hairpin of E.coli tRNA(Phe) without disrupting folding. These experiments suggest that certain tetramers may be able to replace structurally nonessential hairpins in any RNA. Images PMID:7680121

  10. SNSMIL, a real-time single molecule identification and localization algorithm for super-resolution fluorescence microscopy

    PubMed Central

    Tang, Yunqing; Dai, Luru; Zhang, Xiaoming; Li, Junbai; Hendriks, Johnny; Fan, Xiaoming; Gruteser, Nadine; Meisenberg, Annika; Baumann, Arnd; Katranidis, Alexandros; Gensch, Thomas

    2015-01-01

    Single molecule localization based super-resolution fluorescence microscopy offers significantly higher spatial resolution than predicted by Abbe’s resolution limit for far field optical microscopy. Such super-resolution images are reconstructed from wide-field or total internal reflection single molecule fluorescence recordings. Discrimination between emission of single fluorescent molecules and background noise fluctuations remains a great challenge in current data analysis. Here we present a real-time, and robust single molecule identification and localization algorithm, SNSMIL (Shot Noise based Single Molecule Identification and Localization). This algorithm is based on the intrinsic nature of noise, i.e., its Poisson or shot noise characteristics and a new identification criterion, QSNSMIL, is defined. SNSMIL improves the identification accuracy of single fluorescent molecules in experimental or simulated datasets with high and inhomogeneous background. The implementation of SNSMIL relies on a graphics processing unit (GPU), making real-time analysis feasible as shown for real experimental and simulated datasets. PMID:26098742

  11. Complete Genome Sequence of Lactobacillus rhamnosus Strain BPL5 (CECT 8800), a Probiotic for Treatment of Bacterial Vaginosis.

    PubMed

    Chenoll, Empar; Codoñer, Francisco M; Martinez-Blanch, Juan F; Ramón, Daniel; Genovés, Salvador; Menabrito, Marco

    2016-04-21

    ITALIC! Lactobacillus rhamnosusBPL5 (CECT 8800), is a probiotic strain suitable for the treatment of bacterial vaginosis. Here, we report its complete genome sequence deciphered by PacBio single-molecule real-time (SMRT) technology. Analysis of the sequence may provide insight into its functional activity. Copyright © 2016 Chenoll et al.

  12. PHYSICAL MODEL FOR RECOGNITION TUNNELING

    PubMed Central

    Krstić, Predrag; Ashcroft, Brian; Lindsay, Stuart

    2015-01-01

    Recognition tunneling (RT) identifies target molecules trapped between tunneling electrodes functionalized with recognition molecules that serve as specific chemical linkages between the metal electrodes and the trapped target molecule. Possible applications include single molecule DNA and protein sequencing. This paper addresses several fundamental aspects of RT by multiscale theory, applying both all-atom and coarse-grained DNA models: (1) We show that the magnitude of the observed currents are consistent with the results of non-equilibrium Green's function calculations carried out on a solvated all-atom model. (2) Brownian fluctuations in hydrogen bond-lengths lead to current spikes that are similar to what is observed experimentally. (3) The frequency characteristics of these fluctuations can be used to identify the trapped molecules with a machine-learning algorithm, giving a theoretical underpinning to this new method of identifying single molecule signals. PMID:25650375

  13. Hydrogel Droplet Microfluidics for High-Throughput Single Molecule/Cell Analysis.

    PubMed

    Zhu, Zhi; Yang, Chaoyong James

    2017-01-17

    Heterogeneity among individual molecules and cells has posed significant challenges to traditional bulk assays, due to the assumption of average behavior, which would lose important biological information in heterogeneity and result in a misleading interpretation. Single molecule/cell analysis has become an important and emerging field in biological and biomedical research for insights into heterogeneity between large populations at high resolution. Compared with the ensemble bulk method, single molecule/cell analysis explores the information on time trajectories, conformational states, and interactions of individual molecules/cells, all key factors in the study of chemical and biological reaction pathways. Various powerful techniques have been developed for single molecule/cell analysis, including flow cytometry, atomic force microscopy, optical and magnetic tweezers, single-molecule fluorescence spectroscopy, and so forth. However, some of them have the low-throughput issue that has to analyze single molecules/cells one by one. Flow cytometry is a widely used high-throughput technique for single cell analysis but lacks the ability for intercellular interaction study and local environment control. Droplet microfluidics becomes attractive for single molecule/cell manipulation because single molecules/cells can be individually encased in monodisperse microdroplets, allowing high-throughput analysis and manipulation with precise control of the local environment. Moreover, hydrogels, cross-linked polymer networks that swell in the presence of water, have been introduced into droplet microfluidic systems as hydrogel droplet microfluidics. By replacing an aqueous phase with a monomer or polymer solution, hydrogel droplets can be generated on microfluidic chips for encapsulation of single molecules/cells according to the Poisson distribution. The sol-gel transition property endows the hydrogel droplets with new functionalities and diversified applications in single molecule/cell analysis. The hydrogel can act as a 3D cell culture matrix to mimic the extracellular environment for long-term single cell culture, which allows further heterogeneity study in proliferation, drug screening, and metastasis at the single-cell level. The sol-gel transition allows reactions in solution to be performed rapidly and efficiently with product storage in the gel for flexible downstream manipulation and analysis. More importantly, controllable sol-gel regulation provides a new way to maintain phenotype-genotype linkages in the hydrogel matrix for high throughput molecular evolution. In this Account, we will review the hydrogel droplet generation on microfluidics, single molecule/cell encapsulation in hydrogel droplets, as well as the progress made by our group and others in the application of hydrogel droplet microfluidics for single molecule/cell analysis, including single cell culture, single molecule/cell detection, single cell sequencing, and molecular evolution.

  14. Single molecule and single cell epigenomics.

    PubMed

    Hyun, Byung-Ryool; McElwee, John L; Soloway, Paul D

    2015-01-15

    Dynamically regulated changes in chromatin states are vital for normal development and can produce disease when they go awry. Accordingly, much effort has been devoted to characterizing these states under normal and pathological conditions. Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is the most widely used method to characterize where in the genome transcription factors, modified histones, modified nucleotides and chromatin binding proteins are found; bisulfite sequencing (BS-seq) and its variants are commonly used to characterize the locations of DNA modifications. Though very powerful, these methods are not without limitations. Notably, they are best at characterizing one chromatin feature at a time, yet chromatin features arise and function in combination. Investigators commonly superimpose separate ChIP-seq or BS-seq datasets, and then infer where chromatin features are found together. While these inferences might be correct, they can be misleading when the chromatin source has distinct cell types, or when a given cell type exhibits any cell to cell variation in chromatin state. These ambiguities can be eliminated by robust methods that directly characterize the existence and genomic locations of combinations of chromatin features in very small inputs of cells or ideally, single cells. Here we review single molecule epigenomic methods under development to overcome these limitations, the technical challenges associated with single molecule methods and their potential application to single cells. Copyright © 2014 Elsevier Inc. All rights reserved.

  15. Single Molecule and Single Cell Epigenomics

    PubMed Central

    Hyun, Byung-Ryool; McElwee, John L.; Soloway, Paul D.

    2014-01-01

    Dynamically regulated changes in chromatin states are vital for normal development and can produce disease when they go awry. Accordingly, much effort has been devoted to characterizing these states under normal and pathological conditions. Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is the most widely used method to characterize where in the genome transcription factors, modified histones, modified nucleotides and chromatin binding proteins are found; bisulfite sequencing (BS-seq) and its variants are commonly used to characterize the locations of DNA modifications. Though very powerful, these methods are not without limitations. Notably, they are best at characterizing one chromatin feature at a time, yet chromatin features arise and function in combination. Investigators commonly superimpose separate ChIP-seq or BS-seq datasets, and then infer where chromatin features are found together. While these inferences might be correct, they can be misleading when the chromatin source has distinct cell types, or when a given cell type exhibits any cell to cell variation in chromatin state. These ambiguities can be eliminated by robust methods that directly characterize the existence and genomic locations of combinations of chromatin features in very small inputs of cells or ideally, single cells. Here we review single molecule epigenomic methods under development to overcome these limitations, the technical challenges associated with single molecule methods and their potential application to single cells. PMID:25204781

  16. Quantitative analysis and prediction of G-quadruplex forming sequences in double-stranded DNA

    PubMed Central

    Kim, Minji; Kreig, Alex; Lee, Chun-Ying; Rube, H. Tomas; Calvert, Jacob; Song, Jun S.; Myong, Sua

    2016-01-01

    Abstract G-quadruplex (GQ) is a four-stranded DNA structure that can be formed in guanine-rich sequences. GQ structures have been proposed to regulate diverse biological processes including transcription, replication, translation and telomere maintenance. Recent studies have demonstrated the existence of GQ DNA in live mammalian cells and a significant number of potential GQ forming sequences in the human genome. We present a systematic and quantitative analysis of GQ folding propensity on a large set of 438 GQ forming sequences in double-stranded DNA by integrating fluorescence measurement, single-molecule imaging and computational modeling. We find that short minimum loop length and the thymine base are two main factors that lead to high GQ folding propensity. Linear and Gaussian process regression models further validate that the GQ folding potential can be predicted with high accuracy based on the loop length distribution and the nucleotide content of the loop sequences. Our study provides important new parameters that can inform the evaluation and classification of putative GQ sequences in the human genome. PMID:27095201

  17. Digital encoding of cellular mRNAs enabling precise and absolute gene expression measurement by single-molecule counting.

    PubMed

    Fu, Glenn K; Wilhelmy, Julie; Stern, David; Fan, H Christina; Fodor, Stephen P A

    2014-03-18

    We present a new approach for the sensitive detection and accurate quantitation of messenger ribonucleic acid (mRNA) gene transcripts in single cells. First, the entire population of mRNAs is encoded with molecular barcodes during reverse transcription. After amplification of the gene targets of interest, molecular barcodes are counted by sequencing or scored on a simple hybridization detector to reveal the number of molecules in the starting sample. Since absolute quantities are measured, calibration to standards is unnecessary, and many of the relative quantitation challenges such as polymerase chain reaction (PCR) bias are avoided. We apply the method to gene expression analysis of minute sample quantities and demonstrate precise measurements with sensitivity down to sub single-cell levels. The method is an easy, single-tube, end point assay utilizing standard thermal cyclers and PCR reagents. Accurate and precise measurements are obtained without any need for cycle-to-cycle intensity-based real-time monitoring or physical partitioning into multiple reactions (e.g., digital PCR). Further, since all mRNA molecules are encoded with molecular barcodes, amplification can be used to generate more material for multiple measurements and technical replicates can be carried out on limited samples. The method is particularly useful for small sample quantities, such as single-cell experiments. Digital encoding of cellular content preserves true abundance levels and overcomes distortions introduced by amplification.

  18. A new chicken genome assembly provides insight into avian genome structure

    USDA-ARS?s Scientific Manuscript database

    The importance of the Gallus gallus (chicken) as a model organism and agricultural animal merits a continuation of sequence assembly improvement efforts. We present a new version of the chicken genome assembly (Gallus_gallus-5.0; GCA_000002315.3) built from combined long single molecule sequencing t...

  19. Additional annotation of the pig transcriptome using integrated Iso-seq and Illumina RNA-seq analysis

    USDA-ARS?s Scientific Manuscript database

    Alternative splicing is a well-known phenomenon that dramatically increases eukaryotic transcriptome diversity. The extent of mRNA isoform diversity among porcine tissues was assessed using Pacific Biosciences single-molecule long-read isoform sequencing (Iso-Seq) and Illumina short read sequencing ...

  20. Joint analysis of bacterial DNA methylation, predicted promoter and regulation motifs for biological significance

    USDA-ARS?s Scientific Manuscript database

    Advances in long-read, single molecule real-time sequencing technology and analysis software over the last two years has enabled the efficient production of closed bacterial genome sequences. However, consistent annotation of these genomes has lagged behind the ability to create them, while the avai...

  1. Sequence of a second gene encoding bovine submaxillary mucin: implication for mucin heterogeneity and cloning.

    PubMed

    Jiang, W; Woitach, J T; Gupta, D; Bhavanandan, V P

    1998-10-20

    Secreted epithelial mucins are extremely large and heterogeneous glycoproteins. We report the 5 kilobase DNA sequence of a second gene, BSM2, which encodes bovine submaxillary mucin. The determined nucleotide and deduced amino acid sequences of BSM2 are 95.2% and 92. 2% identical, respectively, to those of the previously described BSM1 gene isolated from the same cow. Further, the five predicted protein domains of the two genes are 100%, 94%, 93%, 77%, and 88% identical. Based on the above results, we propose that expression of multiple homologous core proteins from a single animal is a factor in generating diversity of saccharides in mucins and in providing resistance of the molecules to proteolysis. In addition, this work raises several important issues in mucin cloning such as assembling sequences from seemingly overlapping clones and deducing consensus sequences for nearly identical tandem repeats. Copyright 1998 Academic Press.

  2. Genome scaffolding and annotation for the pathogen vector Ixodes ricinus by ultra-long single molecule sequencing.

    PubMed

    Cramaro, Wibke J; Hunewald, Oliver E; Bell-Sakyi, Lesley; Muller, Claude P

    2017-02-08

    Global warming and other ecological changes have facilitated the expansion of Ixodes ricinus tick populations. Ixodes ricinus is the most important carrier of vector-borne pathogens in Europe, transmitting viruses, protozoa and bacteria, in particular Borrelia burgdorferi (sensu lato), the causative agent of Lyme borreliosis, the most prevalent vector-borne disease in humans in the Northern hemisphere. To faster control this disease vector, a better understanding of the I. ricinus tick is necessary. To facilitate such studies, we recently published the first reference genome of this highly prevalent pathogen vector. Here, we further extend these studies by scaffolding and annotating the first reference genome by using ultra-long sequencing reads from third generation single molecule sequencing. In addition, we present the first genome size estimation for I. ricinus ticks and the embryo-derived cell line IRE/CTVM19. 235,953 contigs were integrated into 204,904 scaffolds, extending the currently known genome lengths by more than 30% from 393 to 516 Mb and the N50 contig value by 87% from 1643 bp to a N50 scaffold value of 3067 bp. In addition, 25,263 sequences were annotated by comparison to the tick's North American relative Ixodes scapularis. After (conserved) hypothetical proteins, zinc finger proteins, secreted proteins and P450 coding proteins were the most prevalent protein categories annotated. Interestingly, more than 50% of the amino acid sequences matching the homology threshold had 95-100% identity to the corresponding I. scapularis gene models. The sequence information was complemented by the first genome size estimation for this species. Flow cytometry-based genome size analysis revealed a haploid genome size of 2.65Gb for I. ricinus ticks and 3.80 Gb for the cell line. We present a first draft sequence map of the I. ricinus genome based on a PacBio-Illumina assembly. The I. ricinus genome was shown to be 26% (500 Mb) larger than the genome of its American relative I. scapularis. Based on the genome size of 2.65 Gb we estimated that we covered about 67% of the non-repetitive sequences. Genome annotation will facilitate screening for specific molecular pathways in I. ricinus cells and provides an overview of characteristics and functions.

  3. Nanopores: A journey towards DNA sequencing

    PubMed Central

    Wanunu, Meni

    2013-01-01

    Much more than ever, nucleic acids are recognized as key building blocks in many of life's processes, and the science of studying these molecular wonders at the single-molecule level is thriving. A new method of doing so has been introduced in the mid 1990's. This method is exceedingly simple: a nanoscale pore that spans across an impermeable thin membrane is placed between two chambers that contain an electrolyte, and voltage is applied across the membrane using two electrodes. These conditions lead to a steady stream of ion flow across the pore. Nucleic acid molecules in solution can be driven through the pore, and structural features of the biomolecules are observed as measurable changes in the trans-membrane ion current. In essence, a nanopore is a high-throughput ion microscope and a single-molecule force apparatus. Nanopores are taking center stage as a tool that promises to read a DNA sequence, and this promise has resulted in overwhelming academic, industrial, and national interest. Regardless of the fate of future nanopore applications, in the process of this 16-year-long exploration, many studies have validated the indispensability of nanopores in the toolkit of single-molecule biophysics. This review surveys past and current studies related to nucleic acid biophysics, and will hopefully provoke a discussion of immediate and future prospects for the field. PMID:22658507

  4. Spontaneous Transport of Single-Stranded DNA through Graphene-MoS2 Heterostructure Nanopores.

    PubMed

    Luan, Binquan; Zhou, Ruhong

    2018-04-24

    The effective transport of a single-stranded DNA (ssDNA) molecule through a solid-state nanopore is essential to the future success of high-throughput and low-cost DNA sequencing. Compatible with current electric sensing technologies, here, we propose and demonstrate by molecular dynamics simulations the ssDNA transport through a quasi-two-dimensional nanopore in a heterostructure stacked together with different 2D materials, such as graphene and molybdenum disulfide (MoS 2 ). Due to different chemical potentials, U, of DNA bases on different 2D materials, it is energetically favorable for a ssDNA molecule to move from the low- U MoS 2 surface to the high- U graphene surface through a nanopore. With the proper attraction between the negatively charged phosphate group in each nucleotide and the positively charged Mo atoms exposed on the pore surface, the ssDNA molecule can be temporarily seized and released thereafter through a thermal activation, that is, a slow and possible nucleotide-by-nucleotide transport. A theoretical formulation is then developed for the free energy of the ssDNA transiting a heterostructure nanopore to properly characterize the non-equilibrium stick-slip-like motion of a ssDNA molecule.

  5. Bio-recognitive photonics of a DNA-guided organic semiconductor

    PubMed Central

    Back, Seung Hyuk; Park, Jin Hyuk; Cui, Chunzhi; Ahn, Dong June

    2016-01-01

    Incorporation of duplex DNA with higher molecular weights has attracted attention for a new opportunity towards a better organic light-emitting diode (OLED) capability. However, biological recognition by OLED materials is yet to be addressed. In this study, specific oligomeric DNA–DNA recognition is successfully achieved by tri (8-hydroxyquinoline) aluminium (Alq3), an organic semiconductor. Alq3 rods crystallized with guidance from single-strand DNA molecules show, strikingly, a unique distribution of the DNA molecules with a shape of an ‘inverted' hourglass. The crystal's luminescent intensity is enhanced by 1.6-fold upon recognition of the perfect-matched target DNA sequence, but not in the case of a single-base mismatched one. The DNA–DNA recognition forming double-helix structure is identified to occur only in the rod's outer periphery. This study opens up new opportunities of Alq3, one of the most widely used OLED materials, enabling biological recognition. PMID:26725969

  6. Single-molecule paleoenzymology probes the chemistry of resurrected enzymes

    PubMed Central

    Perez-Jimenez, Raul; Inglés-Prieto, Alvaro; Zhao, Zi-Ming; Sanchez-Romero, Inmaculada; Alegre-Cebollada, Jorge; Kosuri, Pallav; Garcia-Manyes, Sergi; Kappock, T. Joseph; Tanokura, Masaru; Holmgren, Arne; Sanchez-Ruiz, Jose M.; Gaucher, Eric A.; Fernandez, Julio M.

    2011-01-01

    A journey back in time is possible at the molecular level by reconstructing proteins from extinct organisms. Here we report the reconstruction, based on sequence predicted by phylogenetic analysis, of seven Precambrian thioredoxin enzymes (Trx), dating back between ~1.4 and ~4 billion years (Gyr). The reconstructed enzymes are up to 32° C more stable than modern enzymes and the oldest show significantly higher activity than extant ones at pH 5. We probed their mechanisms of reduction using single-molecule force spectroscopy. From the force-dependency of the rate of reduction of an engineered substrate, we conclude that ancient Trxs utilize chemical mechanisms of reduction similar to those of modern enzymes. While Trx enzymes have maintained their reductase chemistry unchanged, they have adapted over a 4 Gyr time span to the changes in temperature and ocean acidity that characterize the evolution of the global environment from ancient to modern Earth. PMID:21460845

  7. Bio-recognitive photonics of a DNA-guided organic semiconductor.

    PubMed

    Back, Seung Hyuk; Park, Jin Hyuk; Cui, Chunzhi; Ahn, Dong June

    2016-01-04

    Incorporation of duplex DNA with higher molecular weights has attracted attention for a new opportunity towards a better organic light-emitting diode (OLED) capability. However, biological recognition by OLED materials is yet to be addressed. In this study, specific oligomeric DNA-DNA recognition is successfully achieved by tri (8-hydroxyquinoline) aluminium (Alq3), an organic semiconductor. Alq3 rods crystallized with guidance from single-strand DNA molecules show, strikingly, a unique distribution of the DNA molecules with a shape of an 'inverted' hourglass. The crystal's luminescent intensity is enhanced by 1.6-fold upon recognition of the perfect-matched target DNA sequence, but not in the case of a single-base mismatched one. The DNA-DNA recognition forming double-helix structure is identified to occur only in the rod's outer periphery. This study opens up new opportunities of Alq3, one of the most widely used OLED materials, enabling biological recognition.

  8. Bio-recognitive photonics of a DNA-guided organic semiconductor

    NASA Astrophysics Data System (ADS)

    Back, Seung Hyuk; Park, Jin Hyuk; Cui, Chunzhi; Ahn, Dong June

    2016-01-01

    Incorporation of duplex DNA with higher molecular weights has attracted attention for a new opportunity towards a better organic light-emitting diode (OLED) capability. However, biological recognition by OLED materials is yet to be addressed. In this study, specific oligomeric DNA-DNA recognition is successfully achieved by tri (8-hydroxyquinoline) aluminium (Alq3), an organic semiconductor. Alq3 rods crystallized with guidance from single-strand DNA molecules show, strikingly, a unique distribution of the DNA molecules with a shape of an `inverted' hourglass. The crystal's luminescent intensity is enhanced by 1.6-fold upon recognition of the perfect-matched target DNA sequence, but not in the case of a single-base mismatched one. The DNA-DNA recognition forming double-helix structure is identified to occur only in the rod's outer periphery. This study opens up new opportunities of Alq3, one of the most widely used OLED materials, enabling biological recognition.

  9. Error-based Extraction of States and Energy Landscapes from Experimental Single-Molecule Time-Series

    NASA Astrophysics Data System (ADS)

    Taylor, J. Nicholas; Li, Chun-Biu; Cooper, David R.; Landes, Christy F.; Komatsuzaki, Tamiki

    2015-03-01

    Characterization of states, the essential components of the underlying energy landscapes, is one of the most intriguing subjects in single-molecule (SM) experiments due to the existence of noise inherent to the measurements. Here we present a method to extract the underlying state sequences from experimental SM time-series. Taking into account empirical error and the finite sampling of the time-series, the method extracts a steady-state network which provides an approximation of the underlying effective free energy landscape. The core of the method is the application of rate-distortion theory from information theory, allowing the individual data points to be assigned to multiple states simultaneously. We demonstrate the method's proficiency in its application to simulated trajectories as well as to experimental SM fluorescence resonance energy transfer (FRET) trajectories obtained from isolated agonist binding domains of the AMPA receptor, an ionotropic glutamate receptor that is prevalent in the central nervous system.

  10. HLA genotyping by next-generation sequencing of complementary DNA.

    PubMed

    Segawa, Hidenobu; Kukita, Yoji; Kato, Kikuya

    2017-11-28

    Genotyping of the human leucocyte antigen (HLA) is indispensable for various medical treatments. However, unambiguous genotyping is technically challenging due to high polymorphism of the corresponding genomic region. Next-generation sequencing is changing the landscape of genotyping. In addition to high throughput of data, its additional advantage is that DNA templates are derived from single molecules, which is a strong merit for the phasing problem. Although most currently developed technologies use genomic DNA, use of cDNA could enable genotyping with reduced costs in data production and analysis. We thus developed an HLA genotyping system based on next-generation sequencing of cDNA. Each HLA gene was divided into 3 or 4 target regions subjected to PCR amplification and subsequent sequencing with Ion Torrent PGM. The sequence data were then subjected to an automated analysis. The principle of the analysis was to construct candidate sequences generated from all possible combinations of variable bases and arrange them in decreasing order of the number of reads. Upon collecting candidate sequences from all target regions, 2 haplotypes were usually assigned. Cases not assigned 2 haplotypes were forwarded to 4 additional processes: selection of candidate sequences applying more stringent criteria, removal of artificial haplotypes, selection of candidate sequences with a relaxed threshold for sequence matching, and countermeasure for incomplete sequences in the HLA database. The genotyping system was evaluated using 30 samples; the overall accuracy was 97.0% at the field 3 level and 98.3% at the G group level. With one sample, genotyping of DPB1 was not completed due to short read size. We then developed a method for complete sequencing of individual molecules of the DPB1 gene, using the molecular barcode technology. The performance of the automatic genotyping system was comparable to that of systems developed in previous studies. Thus, next-generation sequencing of cDNA is a viable option for HLA genotyping.

  11. Primer-independent RNA sequencing with bacteriophage phi6 RNA polymerase and chain terminators.

    PubMed

    Makeyev, E V; Bamford, D H

    2001-05-01

    Here we propose a new general method for directly determining RNA sequence based on the use of the RNA-dependent RNA polymerase from bacteriophage phi6 and the chain terminators (RdRP sequencing). The following properties of the polymerase render it appropriate for this application: (1) the phi6 polymerase can replicate a number of single-stranded RNA templates in vitro. (2) In contrast to the primer-dependent DNA polymerases utilized in the sequencing procedure by Sanger et al. (Proc Natl Acad Sci USA, 1977, 74:5463-5467), it initiates nascent strand synthesis without a primer, starting the polymerization on the very 3'-terminus of the template. (3) The polymerase can incorporate chain-terminating nucleotide analogs into the nascent RNA chain to produce a set of base-specific termination products. Consequently, 3' proximal or even complete sequence of many target RNA molecules can be rapidly deduced without prior sequence information. The new technique proved useful for sequencing several synthetic ssRNA templates. Furthermore, using genomic segments of the bluetongue virus we show that RdRP sequencing can also be applied to naturally occurring dsRNA templates. This suggests possible uses of the method in the RNA virus research and diagnostics.

  12. Challenges for single molecule electronic devices with nanographene and organic molecules. Do single molecules offer potential as elements of electronic devices in the next generation?

    NASA Astrophysics Data System (ADS)

    Enoki, Toshiaki; Kiguchi, Manabu

    2018-03-01

    Interest in utilizing organic molecules to fabricate electronic materials has existed ever since organic (molecular) semiconductors were first discovered in the 1950s. Since then, scientists have devoted serious effort to the creation of various molecule-based electronic systems, such as molecular metals and molecular superconductors. Single-molecule electronics and the associated basic science have emerged over the past two decades and provided hope for the development of highly integrated molecule-based electronic devices in the future (after the Si-based technology era has ended). Here, nanographenes (nano-sized graphene) with atomically precise structures are among the most promising molecules that can be utilized for electronic/spintronic devices. To manipulate single small molecules for an electronic device, a single molecular junction has been developed. It is a powerful tool that allows even small molecules to be utilized. External electric, magnetic, chemical, and mechanical perturbations can change the physical and chemical properties of molecules in a way that is different from bulk materials. Therefore, the various functionalities of molecules, along with changes induced by external perturbations, allows us to create electronic devices that we cannot create using current top-down Si-based technology. Future challenges that involve the incorporation of condensed matter physics, quantum chemistry calculations, organic synthetic chemistry, and electronic device engineering are expected to open a new era in single-molecule device electronic technology.

  13. Assessing the utility of the Oxford Nanopore MinION for snake venom gland cDNA sequencing.

    PubMed

    Hargreaves, Adam D; Mulley, John F

    2015-01-01

    Portable DNA sequencers such as the Oxford Nanopore MinION device have the potential to be truly disruptive technologies, facilitating new approaches and analyses and, in some cases, taking sequencing out of the lab and into the field. However, the capabilities of these technologies are still being revealed. Here we show that single-molecule cDNA sequencing using the MinION accurately characterises venom toxin-encoding genes in the painted saw-scaled viper, Echis coloratus. We find the raw sequencing error rate to be around 12%, improved to 0-2% with hybrid error correction and 3% with de novo error correction. Our corrected data provides full coding sequences and 5' and 3' UTRs for 29 of 33 candidate venom toxins detected, far superior to Illumina data (13/40 complete) and Sanger-based ESTs (15/29). We suggest that, should the current pace of improvement continue, the MinION will become the default approach for cDNA sequencing in a variety of species.

  14. Assessing the utility of the Oxford Nanopore MinION for snake venom gland cDNA sequencing

    PubMed Central

    Hargreaves, Adam D.

    2015-01-01

    Portable DNA sequencers such as the Oxford Nanopore MinION device have the potential to be truly disruptive technologies, facilitating new approaches and analyses and, in some cases, taking sequencing out of the lab and into the field. However, the capabilities of these technologies are still being revealed. Here we show that single-molecule cDNA sequencing using the MinION accurately characterises venom toxin-encoding genes in the painted saw-scaled viper, Echis coloratus. We find the raw sequencing error rate to be around 12%, improved to 0–2% with hybrid error correction and 3% with de novo error correction. Our corrected data provides full coding sequences and 5′ and 3′ UTRs for 29 of 33 candidate venom toxins detected, far superior to Illumina data (13/40 complete) and Sanger-based ESTs (15/29). We suggest that, should the current pace of improvement continue, the MinION will become the default approach for cDNA sequencing in a variety of species. PMID:26623194

  15. A force-based, parallel assay for the quantification of protein-DNA interactions.

    PubMed

    Limmer, Katja; Pippig, Diana A; Aschenbrenner, Daniela; Gaub, Hermann E

    2014-01-01

    Analysis of transcription factor binding to DNA sequences is of utmost importance to understand the intricate regulatory mechanisms that underlie gene expression. Several techniques exist that quantify DNA-protein affinity, but they are either very time-consuming or suffer from possible misinterpretation due to complicated algorithms or approximations like many high-throughput techniques. We present a more direct method to quantify DNA-protein interaction in a force-based assay. In contrast to single-molecule force spectroscopy, our technique, the Molecular Force Assay (MFA), parallelizes force measurements so that it can test one or multiple proteins against several DNA sequences in a single experiment. The interaction strength is quantified by comparison to the well-defined rupture stability of different DNA duplexes. As a proof-of-principle, we measured the interaction of the zinc finger construct Zif268/NRE against six different DNA constructs. We could show the specificity of our approach and quantify the strength of the protein-DNA interaction.

  16. Computational Prediction of the Immunomodulatory Potential of RNA Sequences.

    PubMed

    Nagpal, Gandharva; Chaudhary, Kumardeep; Dhanda, Sandeep Kumar; Raghava, Gajendra Pal Singh

    2017-01-01

    Advances in the knowledge of various roles played by non-coding RNAs have stimulated the application of RNA molecules as therapeutics. Among these molecules, miRNA, siRNA, and CRISPR-Cas9 associated gRNA have been identified as the most potent RNA molecule classes with diverse therapeutic applications. One of the major limitations of RNA-based therapeutics is immunotoxicity of RNA molecules as it may induce the innate immune system. In contrast, RNA molecules that are potent immunostimulators are strong candidates for use in vaccine adjuvants. Thus, it is important to understand the immunotoxic or immunostimulatory potential of these RNA molecules. The experimental techniques for determining immunostimulatory potential of siRNAs are time- and resource-consuming. To overcome this limitation, recently our group has developed a web-based server "imRNA" for predicting the immunomodulatory potential of RNA sequences. This server integrates a number of modules that allow users to perform various tasks including (1) generation of RNA analogs with reduced immunotoxicity, (2) identification of highly immunostimulatory regions in RNA sequence, and (3) virtual screening. This server may also assist users in the identification of minimum mutations required in a given RNA sequence to minimize its immunomodulatory potential that is required for designing RNA-based therapeutics. Besides, the server can be used for designing RNA-based vaccine adjuvants as it may assist users in the identification of mutations required for increasing immunomodulatory potential of a given RNA sequence. In summary, this chapter describes major applications of the "imRNA" server in designing RNA-based therapeutics and vaccine adjuvants (http://www.imtech.res.in/raghava/imrna/).

  17. Composition for nucleic acid sequencing

    DOEpatents

    Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

    2008-08-26

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  18. Single cell and single molecule techniques for the analysis of the epigenome

    NASA Astrophysics Data System (ADS)

    Wallin, Christopher Benjamin

    Epigenetic regulation is a critical biological process for the health and development of a cell. Epigenetic regulation is facilitated by covalent modifications to the underlying DNA and chromatin proteins. A fundamental understanding of these epigenetic modifications and their associated interactions at the molecular scale is necessary to explain phenomena including cellular identity, stem cell plasticity, and neoplastic transformation. It is widely known that abnormal epigenetic profiles have been linked to many diseases, most notably cancer. While the field of epigenetics has progressed rapidly with conventional techniques, significant advances remain to be made with respect to combinatoric analysis of epigenetic marks and single cell epigenetics. Therefore, in this dissertation, I will discuss our development of devices and methodologies to address these pertinent issues. First, we designed a preparatory polydimethylsiloxane (PDMS) microdevice for the extraction, purification, and stretching of human chromosomal DNA and chromatin from small cell populations down to a single cell. The valveless device captures cells by size exclusion within the micropillars, entraps the DNA or chromatin in the micropillars after cell lysis, purifies away the cellular debris, and fluorescently labels the DNA and/or chromatin all within a single reaction chamber. With the device, we achieve nearly 100% extraction efficiency of the DNA. The device is also used for in-channel immunostaining of chromatin followed by downstream single molecule chromatin analysis in nanochannels (SCAN). Second, using multi-color, time-correlated single molecule measurements in nanochannels, simultaneous coincidence detection of 2 epigenetic marks is demonstrated. Coincidence detection of 3 epigenetic marks is also established using a pulsed interleaved excitation scheme. With these two promising results, genome-wide quantification of epigenetic marks was pursued. Unfortunately, quantitative SCAN never materialized. Reasons for this, including poor signal to background, are explained in detail. Third, development of mobility-SCAN, an analytical technique for measuring and analyzing single molecules based on their fluorescent signature and their electrophoretic mobility in nanochannels is described. We use the technique to differentiate biomolecules from complex mixtures and derive parameters such as diffusion coefficients and effective charges. Finally, the device is used to detect binding interactions of various complexes similar to affinity capillary electrophoresis, but on a single molecule level. Fourth, we conclude by briefly discussing SCAN-sort, a technique to sort individual chromatin molecules based on their fluorescent emissions for further downstream analysis such as DNA sequencing. We demonstrate a 2-fold enrichment of chromatin from sorting and discuss possible system modifications for better performance in the future.

  19. Coherent (photon) vs incoherent (current) detection of multidimensional optical signals from single molecules in open junctions

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Agarwalla, Bijay Kumar; Hua, Weijie; Zhang, Yu

    2015-06-07

    The nonlinear optical response of a current-carrying single molecule coupled to two metal leads and driven by a sequence of impulsive optical pulses with controllable phases and time delays is calculated. Coherent (stimulated, heterodyne) detection of photons and incoherent detection of the optically induced current are compared. Using a diagrammatic Liouville space superoperator formalism, the signals are recast in terms of molecular correlation functions which are then expanded in the many-body molecular states. Two dimensional signals in benzene-1,4-dithiol molecule show cross peaks involving charged states. The correlation between optical and charge current signal is also observed.

  20. A rule of seven in Watson-Crick base-pairing of mismatched sequences.

    PubMed

    Cisse, Ibrahim I; Kim, Hajin; Ha, Taekjip

    2012-05-13

    Sequence recognition through base-pairing is essential for DNA repair and gene regulation, but the basic rules governing this process remain elusive. In particular, the kinetics of annealing between two imperfectly matched strands is not well characterized, despite its potential importance in nucleic acid-based biotechnologies and gene silencing. Here we use single-molecule fluorescence to visualize the multiple annealing and melting reactions of two untethered strands inside a porous vesicle, allowing us to precisely quantify the annealing and melting rates. The data as a function of mismatch position suggest that seven contiguous base pairs are needed for rapid annealing of DNA and RNA. This phenomenological rule of seven may underlie the requirement for seven nucleotides of complementarity to seed gene silencing by small noncoding RNA and may help guide performance improvement in DNA- and RNA-based bio- and nanotechnologies, in which off-target effects can be detrimental.

  1. A Single Amino Acid Substitution in the v-Eyk Intracellular Domain Results in Activation of Stat3 and Enhances Cellular Transformation

    PubMed Central

    Besser, Daniel; Bromberg, Jacqueline F.; Darnell, James E.; Hanafusa, Hidesaburo

    1999-01-01

    The receptor tyrosine kinase Eyk, a member of the Axl/Tyro3 subfamily, activates the STAT pathway and transforms cells when constitutively activated. Here, we compared the potentials of the intracellular domains of Eyk molecules derived from c-Eyk and v-Eyk to transform rat 3Y1 fibroblasts. The v-Eyk molecule induced higher numbers of transformants in soft agar and stronger activation of Stat3; levels of Stat1 activation by the two Eyk molecules were similar. A mutation in the sequence Y933VPL, present in c-Eyk, to the v-Eyk sequence Y933VPQ led to increased activation of Stat3 and increased transformation efficiency. However, altering another sequence, Y862VNT, present in both Eyk molecules to F862VNT markedly decreased transformation without impairing Stat3 activation. These results indicate that activation of Stat3 enhances transformation efficiency and cooperates with another pathway to induce transformation. PMID:9891073

  2. Comprehensive profiling and quantitation of oncogenic mutations in non small-cell lung carcinoma using single molecule amplification and re-sequencing technology

    PubMed Central

    Jiang, Hong; Wang, Limin; Xu, Rujun; Shi, Yanbin; Zhang, Jianguang; Xu, Mengnan; Cram, David S.; Ma, Shenglin

    2016-01-01

    Activating and resistance mutations in the tyrosine kinase domain of several oncogenes are frequently associated with non-small cell lung carcinoma (NSCLC). In this study we assessed the frequency, type and abundance of EGFR, KRAS, BRAF, TP53 and ALK mutations in tumour specimens from 184 patients with early and late stage disease using single molecule amplification and re-sequencing technology (SMART). Based on modelling of EGFR mutations, the detection sensitivity of the SMART assay was at least 0.1%. Benchmarking EGFR mutation detection against the gold standard ARMS-PCR assay, SMART assay had a sensitivity and specificity of 98.7% and 99.0%. Amongst the 184 samples, EGFR mutations were the most prevalent (59.9%), followed by KRAS (16.9%), TP53 (12.7%), EML4-ALK fusions (6.3%) and BRAF (4.2%) mutations. The abundance and types of mutations in tumour specimens were extremely heterogeneous, involving either monoclonal (51.6%) or polyclonal (12.6%) mutation events. At the clinical level, although the spectrum of tumour mutation(s) was unique to each patient, the overall patterns in early or advanced stage disease were relatively similar. Based on these findings, we propose that personalized profiling and quantitation of clinically significant oncogenic mutations will allow better classification of patients according to tumour characteristics and provide clinicians with important ancillary information for treatment decision-making. PMID:27409166

  3. Comprehensive profiling and quantitation of oncogenic mutations in non small-cell lung carcinoma using single molecule amplification and re-sequencing technology.

    PubMed

    Zhang, Shirong; Xia, Bing; Jiang, Hong; Wang, Limin; Xu, Rujun; Shi, Yanbin; Zhang, Jianguang; Xu, Mengnan; Cram, David S; Ma, Shenglin

    2016-08-02

    Activating and resistance mutations in the tyrosine kinase domain of several oncogenes are frequently associated with non-small cell lung carcinoma (NSCLC). In this study we assessed the frequency, type and abundance of EGFR, KRAS, BRAF, TP53 and ALK mutations in tumour specimens from 184 patients with early and late stage disease using single molecule amplification and re-sequencing technology (SMART). Based on modelling of EGFR mutations, the detection sensitivity of the SMART assay was at least 0.1%. Benchmarking EGFR mutation detection against the gold standard ARMS-PCR assay, SMART assay had a sensitivity and specificity of 98.7% and 99.0%. Amongst the 184 samples, EGFR mutations were the most prevalent (59.9%), followed by KRAS (16.9%), TP53 (12.7%), EML4-ALK fusions (6.3%) and BRAF (4.2%) mutations. The abundance and types of mutations in tumour specimens were extremely heterogeneous, involving either monoclonal (51.6%) or polyclonal (12.6%) mutation events. At the clinical level, although the spectrum of tumour mutation(s) was unique to each patient, the overall patterns in early or advanced stage disease were relatively similar. Based on these findings, we propose that personalized profiling and quantitation of clinically significant oncogenic mutations will allow better classification of patients according to tumour characteristics and provide clinicians with important ancillary information for treatment decision-making.

  4. Development of Solid-State Nanopore Technology for Life Detection

    NASA Technical Reports Server (NTRS)

    Bywaters, K. B.; Schmidt, H.; Vercoutere, W.; Deamer, D.; Hawkins, A. R.; Quinn, R. C.; Burton, A. S.; Mckay, C. P.

    2017-01-01

    Biomarkers for life on Earth are an important starting point to guide the search for life elsewhere. However, the search for life beyond Earth should incorporate technologies capable of recognizing an array of potential biomarkers beyond what we see on Earth, in order to minimize the risk of false negatives from life detection missions. With this in mind, charged linear polymers may be a universal signature for life, due to their ability to store information while also inherently reducing the tendency of complex tertiary structure formation that significantly inhibit replication. Thus, these molecules are attractive targets for biosignature detection as potential "self-sustaining chemical signatures." Examples of charged linear polymers, or polyelectrolytes, include deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) as well as synthetic polyelectrolytes that could potentially support life, including threose nucleic acid (TNA) and other xenonucleic acids (XNAs). Nanopore analysis is a novel technology that has been developed for singlemolecule sequencing with exquisite single nucleotide resolution which is also well-suited for analysis of polyelectrolyte molecules. Nanopore analysis has the ability to detect repeating sequences of electrical charges in organic linear polymers, and it is not molecule- specific (i.e. it is not restricted to only DNA or RNA). In this sense, it is a better life detection technique than approaches that are based on specific molecules, such as the polymerase chain reaction (PCR), which requires that the molecule being detected be composed of DNA.

  5. Genome Sequence of Bacillus megaterium Strain YC4-R4, a Plant Growth-Promoting Rhizobacterium Isolated from a High-Salinity Environment.

    PubMed

    Vílchez, Juan Ignacio; Tang, Qiming; Kaushal, Richa; Wang, Wei; Lv, Suhui; He, Danxia; Chu, Zhaoqing; Zhang, Heng; Liu, Renyi; Zhang, Huiming

    2018-06-21

    Here, we report the complete genome sequence for Bacillus megaterium strain YC4-R4, a highly salt-tolerant rhizobacterium that promotes growth in plants. The sequencing process was performed by combining pyrosequencing and single-molecule sequencing techniques. The complete genome is estimated to be approximately 5.44 Mb, containing a total of 5,673 predicted protein-coding DNA sequences (CDSs). Copyright © 2018 Vílchez et al.

  6. Potentials of single-cell biology in identification and validation of disease biomarkers.

    PubMed

    Niu, Furong; Wang, Diane C; Lu, Jiapei; Wu, Wei; Wang, Xiangdong

    2016-09-01

    Single-cell biology is considered a new approach to identify and validate disease-specific biomarkers. However, the concern raised by clinicians is how to apply single-cell measurements for clinical practice, translate the message of single-cell systems biology into clinical phenotype or explain alterations of single-cell gene sequencing and function in patient response to therapies. This study is to address the importance and necessity of single-cell gene sequencing in the identification and development of disease-specific biomarkers, the definition and significance of single-cell biology and single-cell systems biology in the understanding of single-cell full picture, the development and establishment of whole-cell models in the validation of targeted biological function and the figure and meaning of single-molecule imaging in single cell to trace intra-single-cell molecule expression, signal, interaction and location. We headline the important role of single-cell biology in the discovery and development of disease-specific biomarkers with a special emphasis on understanding single-cell biological functions, e.g. mechanical phenotypes, single-cell biology, heterogeneity and organization of genome function. We have reason to believe that such multi-dimensional, multi-layer, multi-crossing and stereoscopic single-cell biology definitely benefits the discovery and development of disease-specific biomarkers. © 2016 The Authors. Journal of Cellular and Molecular Medicine published by John Wiley & Sons Ltd and Foundation for Cellular and Molecular Medicine.

  7. Circular replication-associated protein encoding DNA viruses identified in the faecal matter of various animals in New Zealand.

    PubMed

    Steel, Olivia; Kraberger, Simona; Sikorski, Alyssa; Young, Laura M; Catchpole, Ryan J; Stevens, Aaron J; Ladley, Jenny J; Coray, Dorien S; Stainton, Daisy; Dayaram, Anisha; Julian, Laurel; van Bysterveldt, Katherine; Varsani, Arvind

    2016-09-01

    In recent years, innovations in molecular techniques and sequencing technologies have resulted in a rapid expansion in the number of known viral sequences, in particular those with circular replication-associated protein (Rep)-encoding single-stranded (CRESS) DNA genomes. CRESS DNA viruses are present in the virome of many ecosystems and are known to infect a wide range of organisms. A large number of the recently identified CRESS DNA viruses cannot be classified into any known viral families, indicating that the current view of CRESS DNA viral sequence space is greatly underestimated. Animal faecal matter has proven to be a particularly useful source for sampling CRESS DNA viruses in an ecosystem, as it is cost-effective and non-invasive. In this study a viral metagenomic approach was used to explore the diversity of CRESS DNA viruses present in the faeces of domesticated and wild animals in New Zealand. Thirty-eight complete CRESS DNA viral genomes and two circular molecules (that may be defective molecules or single components of multicomponent genomes) were identified from forty-nine individual animal faecal samples. Based on shared genome organisations and sequence similarities, eighteen of the isolates were classified as gemycircularviruses and twelve isolates were classified as smacoviruses. The remaining eight isolates lack significant sequence similarity with any members of known CRESS DNA virus groups. This research adds significantly to our knowledge of CRESS DNA viral diversity in New Zealand, emphasising the prevalence of CRESS DNA viruses in nature, and reinforcing the suggestion that a large proportion of CRESS DNA viruses are yet to be identified. Copyright © 2016 Elsevier B.V. All rights reserved.

  8. Analysis of Multiallelic CNVs by Emulsion Haplotype Fusion PCR.

    PubMed

    Tyson, Jess; Armour, John A L

    2017-01-01

    Emulsion-fusion PCR recovers long-range sequence information by combining products in cis from individual genomic DNA molecules. Emulsion droplets act as very numerous small reaction chambers in which different PCR products from a single genomic DNA molecule are condensed into short joint products, to unite sequences in cis from widely separated genomic sites. These products can therefore provide information about the arrangement of sequences and variants at a larger scale than established long-read sequencing methods. The method has been useful in defining the phase of variants in haplotypes, the typing of inversions, and determining the configuration of sequence variants in multiallelic CNVs. In this description we outline the rationale for the application of emulsion-fusion PCR methods to the analysis of multiallelic CNVs, and give practical details for our own implementation of the method in that context.

  9. Base Preferences in Non-Templated Nucleotide Incorporation by MMLV-Derived Reverse Transcriptases

    PubMed Central

    Zajac, Pawel; Islam, Saiful; Hochgerner, Hannah; Lönnerberg, Peter; Linnarsson, Sten

    2013-01-01

    Reverse transcriptases derived from Moloney Murine Leukemia Virus (MMLV) have an intrinsic terminal transferase activity, which causes the addition of a few non-templated nucleotides at the 3´ end of cDNA, with a preference for cytosine. This mechanism can be exploited to make the reverse transcriptase switch template from the RNA molecule to a secondary oligonucleotide during first-strand cDNA synthesis, and thereby to introduce arbitrary barcode or adaptor sequences in the cDNA. Because the mechanism is relatively efficient and occurs in a single reaction, it has recently found use in several protocols for single-cell RNA sequencing. However, the base preference of the terminal transferase activity is not known in detail, which may lead to inefficiencies in template switching when starting from tiny amounts of mRNA. Here, we used fully degenerate oligos to determine the exact base preference at the template switching site up to a distance of ten nucleotides. We found a strong preference for guanosine at the first non-templated nucleotide, with a greatly reduced bias at progressively more distant positions. Based on this result, and a number of careful optimizations, we report conditions for efficient template switching for cDNA amplification from single cells. PMID:24392002

  10. Single-molecule sequencing and conformational capture enable de novo mammalian reference genomes

    USDA-ARS?s Scientific Manuscript database

    Genome assemblies have been produced for numerous species as a result of advances in sequencing technologies. However, many of the assemblies are fragmented, with many gaps, ambiguities, and errors. We use the genome of the domestic goat (Capra hircus) to demonstrate current state of the art for ef...

  11. Comprehensive analysis of single molecule sequencing-derived complete genome and whole transcriptome of Hyposidra talaca nuclear polyhedrosis virus.

    PubMed

    Nguyen, Thong T; Suryamohan, Kushal; Kuriakose, Boney; Janakiraman, Vasantharajan; Reichelt, Mike; Chaudhuri, Subhra; Guillory, Joseph; Divakaran, Neethu; Rabins, P E; Goel, Ridhi; Deka, Bhabesh; Sarkar, Suman; Ekka, Preety; Tsai, Yu-Chih; Vargas, Derek; Santhosh, Sam; Mohan, Sangeetha; Chin, Chen-Shan; Korlach, Jonas; Thomas, George; Babu, Azariah; Seshagiri, Somasekar

    2018-06-12

    We sequenced the Hyposidra talaca NPV (HytaNPV) double stranded circular DNA genome using PacBio single molecule sequencing technology. We found that the HytaNPV genome is 139,089 bp long with a GC content of 39.6%. It encodes 141 open reading frames (ORFs) including the 37 baculovirus core genes, 25 genes conserved among lepidopteran baculoviruses, 72 genes known in baculovirus, and 7 genes unique to the HytaNPV genome. It is a group II alphabaculovirus that codes for the F protein and lacks the gp64 gene found in group I alphabaculovirus viruses. Using RNA-seq, we confirmed the expression of the ORFs identified in the HytaNPV genome. Phylogenetic analysis showed HytaNPV to be closest to BusuNPV, SujuNPV and EcobNPV that infect other tea pests, Buzura suppressaria, Sucra jujuba, and Ectropis oblique, respectively. We identified repeat elements and a conserved non-coding baculovirus element in the genome. Analysis of the putative promoter sequences identified motif consistent with the temporal expression of the genes observed in the RNA-seq data.

  12. Chloroplast genome of Aconitum barbatum var. puberulum (Ranunculaceae) derived from CCS reads using the PacBio RS platform.

    PubMed

    Chen, Xiaochen; Li, Qiushi; Li, Ying; Qian, Jun; Han, Jianping

    2015-01-01

    The chloroplast genome (cp genome) of Aconitum barbatum var. puberulum was sequenced using the third-generation sequencing platform based on the single-molecule real-time (SMRT) sequencing approach. To our knowledge, this is the first reported complete cp genome of Aconitum, and we anticipate that it will have great value for phylogenetic studies of the Ranunculaceae family. In total, 23,498 CCS reads and 20,685,462 base pairs were generated, the mean read length was 880 bp, and the longest read was 2,261 bp. Genome coverage of 100% was achieved with a mean coverage of 132× and no gaps. The accuracy of the assembled genome is 99.973%; the assembly was validated using Sanger sequencing of six selected genes from the cp genome. The complete cp genome of A. barbatum var. puberulum is 156,749 bp in length, including a large single-copy region of 87,630 bp and a small single-copy region of 16,941 bp separated by two inverted repeats of 26,089 bp. The cp genome contains 130 genes, including 84 protein-coding genes, 34 tRNA genes and eight rRNA genes. Four forward, five inverted and eight tandem repeats were identified. According to the SSR analysis, the longest poly structure is a 20-T repeat. Our results presented in this paper will facilitate the phylogenetic studies and molecular authentication on Aconitum.

  13. Chloroplast genome of Aconitum barbatum var. puberulum (Ranunculaceae) derived from CCS reads using the PacBio RS platform

    PubMed Central

    Chen, Xiaochen; Li, Qiushi; Li, Ying; Qian, Jun; Han, Jianping

    2015-01-01

    The chloroplast genome (cp genome) of Aconitum barbatum var. puberulum was sequenced using the third-generation sequencing platform based on the single-molecule real-time (SMRT) sequencing approach. To our knowledge, this is the first reported complete cp genome of Aconitum, and we anticipate that it will have great value for phylogenetic studies of the Ranunculaceae family. In total, 23,498 CCS reads and 20,685,462 base pairs were generated, the mean read length was 880 bp, and the longest read was 2,261 bp. Genome coverage of 100% was achieved with a mean coverage of 132× and no gaps. The accuracy of the assembled genome is 99.973%; the assembly was validated using Sanger sequencing of six selected genes from the cp genome. The complete cp genome of A. barbatum var. puberulum is 156,749 bp in length, including a large single-copy region of 87,630 bp and a small single-copy region of 16,941 bp separated by two inverted repeats of 26,089 bp. The cp genome contains 130 genes, including 84 protein-coding genes, 34 tRNA genes and eight rRNA genes. Four forward, five inverted and eight tandem repeats were identified. According to the SSR analysis, the longest poly structure is a 20-T repeat. Our results presented in this paper will facilitate the phylogenetic studies and molecular authentication on Aconitum. PMID:25705213

  14. Single-molecule FRET studies of the cooperative and non-cooperative binding kinetics of the bacteriophage T4 single-stranded DNA binding protein (gp32) to ssDNA lattices at replication fork junctions

    PubMed Central

    Lee, Wonbae; Gillies, John P.; Jose, Davis; Israels, Brett A.; von Hippel, Peter H.; Marcus, Andrew H.

    2016-01-01

    Gene 32 protein (gp32) is the single-stranded (ss) DNA binding protein of the bacteriophage T4. It binds transiently and cooperatively to ssDNA sequences exposed during the DNA replication process and regulates the interactions of the other sub-assemblies of the replication complex during the replication cycle. We here use single-molecule FRET techniques to build on previous thermodynamic studies of gp32 binding to initiate studies of the dynamics of the isolated and cooperative binding of gp32 molecules within the replication complex. DNA primer/template (p/t) constructs are used as models to determine the effects of ssDNA lattice length, gp32 concentration, salt concentration, binding cooperativity and binding polarity at p/t junctions. Hidden Markov models (HMMs) and transition density plots (TDPs) are used to characterize the dynamics of the multi-step assembly pathway of gp32 at p/t junctions of differing polarity, and show that isolated gp32 molecules bind to their ssDNA targets weakly and dissociate quickly, while cooperatively bound dimeric or trimeric clusters of gp32 bind much more tightly, can ‘slide’ on ssDNA sequences, and exhibit binding dynamics that depend on p/t junction polarities. The potential relationships of these binding dynamics to interactions with other components of the T4 DNA replication complex are discussed. PMID:27694621

  15. Profiling of Oral Microbiota in Early Childhood Caries Using Single-Molecule Real-Time Sequencing

    PubMed Central

    Wang, Yuan; Zhang, Jie; Chen, Xi; Jiang, Wen; Wang, Sa; Xu, Lei; Tu, Yan; Zheng, Pei; Wang, Ying; Lin, Xiaolong; Chen, Hui

    2017-01-01

    Background: Alterations of oral microbiota are the main cause of the progression of caries. The goal of this study was to characterize the oral microbiota in childhood caries based on single-molecule real-time sequencing. Methods: A total of 21 preschoolers, aged 3–5 years old with severe early childhood caries, and 20 age-matched, caries-free children as controls were recruited. Saliva samples were collected, followed by DNA extraction, Pacbio sequencing, and phylogenetic analyses of the oral microbial communities. Results: Eight hundred and seventy six species derived from 13 known bacterial phyla and 110 genera were detected from 41 children using Pacbio sequencing. At the species level, 38 species, including Veillonella spp., Streptococcus spp., Prevotella spp., and Lactobacillus spp., showed higher abundance in the caries group compared to the caries-free group (p < 0.05). The core microbiota at the genus and species levels was more stable in the caries-free micro-ecological niche. At follow-up, oral examinations 6 months after sample collection, development of new dental caries was observed in 5 children (the transitional group) among the 21 caries free children. Compared with the caries-free children, in the transitional and caries groups, 6 species, which were more abundant in the caries-free group, exhibited a relatively low abundance in both the caries group and the transitional group (p < 0.05). We conclude that Abiotrophia spp., Neisseria spp., and Veillonella spp., might be associated with healthy oral microbial ecosystem. Prevotella spp., Lactobacillus spp., Dialister spp., and Filifactor spp. may be related to the pathogenesis and progression of dental caries. PMID:29187843

  16. Profiling of Oral Microbiota in Early Childhood Caries Using Single-Molecule Real-Time Sequencing.

    PubMed

    Wang, Yuan; Zhang, Jie; Chen, Xi; Jiang, Wen; Wang, Sa; Xu, Lei; Tu, Yan; Zheng, Pei; Wang, Ying; Lin, Xiaolong; Chen, Hui

    2017-01-01

    Background: Alterations of oral microbiota are the main cause of the progression of caries. The goal of this study was to characterize the oral microbiota in childhood caries based on single-molecule real-time sequencing. Methods: A total of 21 preschoolers, aged 3-5 years old with severe early childhood caries, and 20 age-matched, caries-free children as controls were recruited. Saliva samples were collected, followed by DNA extraction, Pacbio sequencing, and phylogenetic analyses of the oral microbial communities. Results: Eight hundred and seventy six species derived from 13 known bacterial phyla and 110 genera were detected from 41 children using Pacbio sequencing. At the species level, 38 species, including Veillonella spp., Streptococcus spp., Prevotella spp., and Lactobacillus spp., showed higher abundance in the caries group compared to the caries-free group ( p < 0.05). The core microbiota at the genus and species levels was more stable in the caries-free micro-ecological niche. At follow-up, oral examinations 6 months after sample collection, development of new dental caries was observed in 5 children (the transitional group) among the 21 caries free children. Compared with the caries-free children, in the transitional and caries groups, 6 species, which were more abundant in the caries-free group, exhibited a relatively low abundance in both the caries group and the transitional group ( p < 0.05). We conclude that Abiotrophia spp., Neisseria spp., and Veillonella spp., might be associated with healthy oral microbial ecosystem. Prevotella spp., Lactobacillus spp., Dialister spp., and Filifactor spp. may be related to the pathogenesis and progression of dental caries.

  17. A rapid, ratiometric, enzyme-free, and sensitive single-step miRNA detection using three-way junction based FRET probes

    NASA Astrophysics Data System (ADS)

    Luo, Qingying; Liu, Lin; Yang, Cai; Yuan, Jing; Feng, Hongtao; Chen, Yan; Zhao, Peng; Yu, Zhiqiang; Jin, Zongwen

    2018-03-01

    MicroRNAs (miRNAs) are single stranded endogenous molecules composed of only 18-24 nucleotides which are critical for gene expression regulating the translation of messenger RNAs. Conventional methods based on enzyme-assisted nucleic acid amplification techniques have many problems, such as easy contamination, high cost, susceptibility to false amplification, and tendency to have sequence mismatches. Here we report a rapid, ratiometric, enzyme-free, sensitive, and highly selective single-step miRNA detection using three-way junction assembled (or self-assembled) FRET probes. The developed strategy can be operated within the linear range from subnanomolar to hundred nanomolar concentrations of miRNAs. In comparison with the traditional approaches, our method showed high sensitivity for the miRNA detection and extreme selectivity for the efficient discrimination of single-base mismatches. The results reveal that the strategy paved a new avenue for the design of novel highly specific probes applicable in diagnostics and potentially in microscopic imaging of miRNAs in real biological environments.

  18. A single base substitution in the coding region for neurophysin II associated with familial central diabetes insipidus.

    PubMed Central

    Ito, M; Mori, Y; Oiso, Y; Saito, H

    1991-01-01

    To elucidate the molecular mechanism of familial central diabetes insipidus (FDI), we sequenced the arginine vasopressin-neurophysin II (AVP-NPII) gene in 2 patients belonging to a pedigree that is consistent with an autosomal dominant mode of inheritance. 10 patients with idiopathic central diabetes insipidus (IDI) and 5 normals were also studied. The AVP-NPII gene, locating on chromosome 20, consists of three exons that encode putative signal peptide, AVP, NPII, and glycoprotein. Using polymerase chain reaction, fragments including the promoter region and all coding regions were amplified from genomic DNA and subjected to direct sequencing. Sequences of 10 patients with IDI were identical with those of normals, while in 2 patients with FDI, a single base substitution was detected in one of two alleles of the AVP-NPII gene, indicating they were heterozygotes for this mutation. It was a G----A transition at nucleotide position 1859 in the second exon, resulting in a substitution of Gly for Ser at amino acid position 57 in the NPII moiety. It was speculated that the mutated AVP-NPII precursor or the mutated NPII molecule, through their conformational changes, might be responsible for AVP deficiency. Images PMID:1840604

  19. Direct observation of single flexible polymers using single stranded DNA†

    PubMed Central

    Brockman, Christopher; Kim, Sun Ju

    2012-01-01

    Over the last 15 years, double stranded DNA (dsDNA) has been used as a model polymeric system for nearly all single polymer dynamics studies. However, dsDNA is a semiflexible polymer with markedly different molecular properties compared to flexible chains, including synthetic organic polymers. In this work, we report a new system for single polymer studies of flexible chains based on single stranded DNA (ssDNA). We developed a method to synthesize ssDNA for fluorescence microscopy based on rolling circle replication, which generates long strands (>65 kb) of ssDNA containing “designer” sequences, thereby preventing intramolecular base pair interactions. Polymers are synthesized to contain amine-modified bases randomly distributed along the backbone, which enables uniform labelling of polymer chains with a fluorescent dye to facilitate fluorescence microscopy and imaging. Using this approach, we synthesized ssDNA chains with long contour lengths (>30 μm) and relatively low dye loading ratios (~1 dye per 100 bases). In addition, we used epifluorescence microscopy to image single ssDNA polymer molecules stretching in flow in a microfluidic device. Overall, we anticipate that ssDNA will serve as a useful model system to probe the dynamics of polymeric materials at the molecular level. PMID:22956981

  20. Reconstitution of wild type viral DNA in simian cells transfected with early and late SV40 defective genomes.

    PubMed

    O'Neill, F J; Gao, Y; Xu, X

    1993-11-01

    The DNAs of polyomaviruses ordinarily exist as a single circular molecule of approximately 5000 base pairs. Variants of SV40, BKV and JCV have been described which contain two complementing defective DNA molecules. These defectives, which form a bipartite genome structure, contain either the viral early region or the late region. The defectives have the unique property of being able to tolerate variable sized reiterations of regulatory and terminus region sequences, and portions of the coding region. They can also exchange coding region sequences with other polyomaviruses. It has been suggested that the bipartite genome structure might be a stage in the evolution of polyomaviruses which can uniquely sustain genome and sequence diversity. However, it is not known if the regulatory and terminus region sequences are highly mutable. Also, it is not known if the bipartite genome structure is reversible and what the conditions might be which would favor restoration of the monomolecular genome structure. We addressed the first question by sequencing the reiterated regulatory and terminus regions of E- and L-SV40 DNAs. This revealed a large number of mutations in the regulatory regions of the defective genomes, including deletions, insertions, rearrangements and base substitutions. We also detected insertions and base substitutions in the T-antigen gene. We addressed the second question by introducing into permissive simian cells, E- and L-SV40 genomes which had been engineered to contain only a single regulatory region. Analysis of viral DNA from transfected cells demonstrated recombined genomes containing a wild type monomolecular DNA structure. However, the complete defectives, containing reiterated regulatory regions, could often compete away the wild type genomes. The recombinant monomolecular genomes were isolated, cloned and found to be infectious. All of the DNA alterations identified in one of the regulatory regions of E-SV40 DNA were present in the recombinant monomolecular genomes. These and other findings indicate that the bipartite genome state can sustain many mutations which wtSV40 cannot directly sustain. However, the mutations can later be introduced into the wild type genomes when the E- and L-SV40 DNAs recombine to generate a new monomolecular genome structure.

  1. Torque measurements reveal sequence-specific cooperative transitions in supercoiled DNA

    PubMed Central

    Oberstrass, Florian C.; Fernandes, Louis E.; Bryant, Zev

    2012-01-01

    B-DNA becomes unstable under superhelical stress and is able to adopt a wide range of alternative conformations including strand-separated DNA and Z-DNA. Localized sequence-dependent structural transitions are important for the regulation of biological processes such as DNA replication and transcription. To directly probe the effect of sequence on structural transitions driven by torque, we have measured the torsional response of a panel of DNA sequences using single molecule assays that employ nanosphere rotational probes to achieve high torque resolution. The responses of Z-forming d(pGpC)n sequences match our predictions based on a theoretical treatment of cooperative transitions in helical polymers. “Bubble” templates containing 50–100 bp mismatch regions show cooperative structural transitions similar to B-DNA, although less torque is required to disrupt strand–strand interactions. Our mechanical measurements, including direct characterization of the torsional rigidity of strand-separated DNA, establish a framework for quantitative predictions of the complex torsional response of arbitrary sequences in their biological context. PMID:22474350

  2. Single molecule detection with graphene and other two-dimensional materials: nanopores and beyond

    PubMed Central

    Arjmandi-Tash, Hadi; Belyaeva, Liubov A.

    2016-01-01

    Graphene and other two dimensional (2D) materials are currently integrated into nanoscaled devices that may – one day – sequence genomes. The challenge to solve is conceptually straightforward: cut a sheet out of a 2D material and use the edge of the sheet to scan an unfolded biomolecule from head to tail. As the scan proceeds – and because 2D materials are atomically thin – the information provided by the edge might be used to identify different segments – ideally single nucleotides – in the biomolecular strand. So far, the most efficient approach was to drill a nano-sized pore in the sheet and use this pore as a channel to guide and detect individual molecules by measuring the electrochemical ionic current. Nanoscaled gaps between two electrodes in 2D materials recently emerged as powerful alternatives to nanopores. This article reviews the current status and prospects of integrating 2D materials in nanopores, nanogaps and similar devices for single molecule biosensing applications. We discuss the pros and cons, the challenges, and the latest achievements in the field. To achieve high-throughput sequencing with 2D materials, interdisciplinary research is essential. PMID:26612268

  3. Characterization of Hepatitis C Virus (HCV) Envelope Diversification from Acute to Chronic Infection within a Sexually Transmitted HCV Cluster by Using Single-Molecule, Real-Time Sequencing

    PubMed Central

    Ho, Cynthia K. Y.; Raghwani, Jayna; Koekkoek, Sylvie; Liang, Richard H.; Van der Meer, Jan T. M.; Van Der Valk, Marc; De Jong, Menno; Pybus, Oliver G.

    2016-01-01

    ABSTRACT In contrast to other available next-generation sequencing platforms, PacBio single-molecule, real-time (SMRT) sequencing has the advantage of generating long reads albeit with a relatively higher error rate in unprocessed data. Using this platform, we longitudinally sampled and sequenced the hepatitis C virus (HCV) envelope genome region (1,680 nucleotides [nt]) from individuals belonging to a cluster of sexually transmitted cases. All five subjects were coinfected with HIV-1 and a closely related strain of HCV genotype 4d. In total, 50 samples were analyzed by using SMRT sequencing. By using 7 passes of circular consensus sequencing, the error rate was reduced to 0.37%, and the median number of sequences was 612 per sample. A further reduction of insertions was achieved by alignment against a sample-specific reference sequence. However, in vitro recombination during PCR amplification could not be excluded. Phylogenetic analysis supported close relationships among HCV sequences from the four male subjects and subsequent transmission from one subject to his female partner. Transmission was characterized by a strong genetic bottleneck. Viral genetic diversity was low during acute infection and increased upon progression to chronicity but subsequently fluctuated during chronic infection, caused by the alternate detection of distinct coexisting lineages. SMRT sequencing combines long reads with sufficient depth for many phylogenetic analyses and can therefore provide insights into within-host HCV evolutionary dynamics without the need for haplotype reconstruction using statistical algorithms. IMPORTANCE Next-generation sequencing has revolutionized the study of genetically variable RNA virus populations, but for phylogenetic and evolutionary analyses, longer sequences than those generated by most available platforms, while minimizing the intrinsic error rate, are desired. Here, we demonstrate for the first time that PacBio SMRT sequencing technology can be used to generate full-length HCV envelope sequences at the single-molecule level, providing a data set with large sequencing depth for the characterization of intrahost viral dynamics. The selection of consensus reads derived from at least 7 full circular consensus sequencing rounds significantly reduced the intrinsic high error rate of this method. We used this method to genetically characterize a unique transmission cluster of sexually transmitted HCV infections, providing insight into the distinct evolutionary pathways in each patient over time and identifying the transmission-associated genetic bottleneck as well as fluctuations in viral genetic diversity over time, accompanied by dynamic shifts in viral subpopulations. PMID:28077634

  4. Clonal evolution in breast cancer revealed by single nucleus genome sequencing.

    PubMed

    Wang, Yong; Waters, Jill; Leung, Marco L; Unruh, Anna; Roh, Whijae; Shi, Xiuqing; Chen, Ken; Scheet, Paul; Vattathil, Selina; Liang, Han; Multani, Asha; Zhang, Hong; Zhao, Rui; Michor, Franziska; Meric-Bernstam, Funda; Navin, Nicholas E

    2014-08-14

    Sequencing studies of breast tumour cohorts have identified many prevalent mutations, but provide limited insight into the genomic diversity within tumours. Here we developed a whole-genome and exome single cell sequencing approach called nuc-seq that uses G2/M nuclei to achieve 91% mean coverage breadth. We applied this method to sequence single normal and tumour nuclei from an oestrogen-receptor-positive (ER(+)) breast cancer and a triple-negative ductal carcinoma. In parallel, we performed single nuclei copy number profiling. Our data show that aneuploid rearrangements occurred early in tumour evolution and remained highly stable as the tumour masses clonally expanded. In contrast, point mutations evolved gradually, generating extensive clonal diversity. Using targeted single-molecule sequencing, many of the diverse mutations were shown to occur at low frequencies (<10%) in the tumour mass. Using mathematical modelling we found that the triple-negative tumour cells had an increased mutation rate (13.3×), whereas the ER(+) tumour cells did not. These findings have important implications for the diagnosis, therapeutic treatment and evolution of chemoresistance in breast cancer.

  5. Mapping DNA methylation by transverse current sequencing: Reduction of noise from neighboring nucleotides

    NASA Astrophysics Data System (ADS)

    Alvarez, Jose; Massey, Steven; Kalitsov, Alan; Velev, Julian

    Nanopore sequencing via transverse current has emerged as a competitive candidate for mapping DNA methylation without needed bisulfite-treatment, fluorescent tag, or PCR amplification. By eliminating the error producing amplification step, long read lengths become feasible, which greatly simplifies the assembly process and reduces the time and the cost inherent in current technologies. However, due to the large error rates of nanopore sequencing, single base resolution has not been reached. A very important source of noise is the intrinsic structural noise in the electric signature of the nucleotide arising from the influence of neighboring nucleotides. In this work we perform calculations of the tunneling current through DNA molecules in nanopores using the non-equilibrium electron transport method within an effective multi-orbital tight-binding model derived from first-principles calculations. We develop a base-calling algorithm accounting for the correlations of the current through neighboring bases, which in principle can reduce the error rate below any desired precision. Using this method we show that we can clearly distinguish DNA methylation and other base modifications based on the reading of the tunneling current.

  6. Apigenin Impacts the Growth of the Gut Microbiota and Alters the Gene Expression of Enterococcus.

    PubMed

    Wang, Minqian; Firrman, Jenni; Zhang, Liqing; Arango-Argoty, Gustavo; Tomasula, Peggy; Liu, LinShu; Xiao, Weidong; Yam, Kit

    2017-08-03

    Apigenin is a major dietary flavonoid with many bioactivities, widely distributed in plants. Apigenin reaches the colon region intact and interacts there with the human gut microbiota, however there is little research on how apigenin affects the gut bacteria. This study investigated the effect of pure apigenin on human gut bacteria, at both the single strain and community levels. The effect of apigenin on the single gut bacteria strains Bacteroides galacturonicus , Bifidobacterium catenulatum , Lactobacillus rhamnosus GG, and Enterococcus caccae , was examined by measuring their anaerobic growth profiles. The effect of apigenin on a gut microbiota community was studied by culturing a fecal inoculum under in vitro conditions simulating the human ascending colon. 16S rRNA gene sequencing and GC-MS analysis quantified changes in the community structure. Single molecule RNA sequencing was used to reveal the response of Enterococcus caccae to apigenin. Enterococcus caccae was effectively inhibited by apigenin when cultured alone, however, the genus Enterococcus was enhanced when tested in a community setting. Single molecule RNA sequencing found that Enterococcus caccae responded to apigenin by up-regulating genes involved in DNA repair, stress response, cell wall synthesis, and protein folding. Taken together, these results demonstrate that apigenin affects both the growth and gene expression of Enterococcus caccae .

  7. Labeled nucleotide phosphate (NP) probes

    DOEpatents

    Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

    2009-02-03

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  8. Dock 'n roll: folding of a silk-inspired polypeptide into an amyloid-like beta solenoid.

    PubMed

    Zhao, Binwu; Cohen Stuart, Martien A; Hall, Carol K

    2016-04-20

    Polypeptides containing the motif ((GA)mGX)n occur in silk and have a strong tendency to self-assemble. For example, polypeptides containing (GAGAGAGX)n, where X = G or H have been observed to form filaments; similar sequences but with X = Q have been used in the design of coat proteins (capsids) for artificial viruses. The structure of the (GAGAGAGX)m filaments has been proposed to be a stack of peptides in a β roll structure with the hydrophobic side chains pointing outwards (hydrophobic shell). Another possible configuration, a β roll or β solenoid structure which has its hydrophobic side chains buried inside (hydrophobic core) was, however, overlooked. We perform ground state analysis as well as atomic-level molecular dynamics simulations, both on single molecules and on two-molecule stacks of the silk-inspired sequence (GAGAGAGQ)10, to decide whether the hydrophobic core or the hydrophobic shell configuration is the most stable one. We find that a stack of two hydrophobic core molecules is energetically more favorable than a stack of two hydrophobic shell molecules. A shell molecule initially placed in a perfect β roll structure tends to rotate its strands, breaking in-plane hydrogen bonds and forming out-of-plane hydrogen bonds, while a core molecule stays in the β roll structure. The hydrophobic shell structure has type II' β turns whereas the core configuration has type II β turns; only the latter secondary structure agrees well with solid-state NMR experiments on a similar sequence (GA)15. We also observe that the core stack has a higher number of intra-molecular hydrogen bonds and a higher number of hydrogen bonds between stack and water than the shell stack. Hence, we conclude that the hydrophobic core configuration is the most likely structure. In the stacked state, each peptide has more intra-molecular hydrogen bonds than a single folded molecule, which suggests that stacking provides the extra stability needed for molecules to reach the folded state.

  9. A graphene-based biosensing platform based on the release of DNA probes and rolling circle amplification.

    PubMed

    Liu, Meng; Song, Jinping; Shuang, Shaomin; Dong, Chuan; Brennan, John D; Li, Yingfu

    2014-06-24

    We report a versatile biosensing platform capable of achieving ultrasensitive detection of both small-molecule and macromolecular targets. The system features three components: reduced graphene oxide for its ability to adsorb single-stranded DNA molecules nonspecifically, DNA aptamers for their ability to bind reduced graphene oxide but undergo target-induced conformational changes that facilitate their release from the reduced graphene oxide surface, and rolling circle amplification (RCA) for its ability to amplify a primer-template recognition event into repetitive sequence units that can be easily detected. The key to the design is the tagging of a short primer to an aptamer sequence, which results in a small DNA probe that allows for both effective probe adsorption onto the reduced graphene oxide surface to mask the primer domain in the absence of the target, as well as efficient probe release in the presence of the target to make the primer available for template binding and RCA. We also made an observation that the circular template, which on its own does not cause a detectable level of probe release from the reduced graphene oxide, augments target-induced probe release. The synergistic release of DNA probes is interpreted to be a contributing factor for the high detection sensitivity. The broad utility of the platform is illustrated though engineering three different sensors that are capable of achieving ultrasensitive detection of a protein target, a DNA sequence and a small-molecule analyte. We envision that the approach described herein will find useful applications in the biological, medical, and environmental fields.

  10. DNA sequence analysis with droplet-based microfluidics

    PubMed Central

    Abate, Adam R.; Hung, Tony; Sperling, Ralph A.; Mary, Pascaline; Rotem, Assaf; Agresti, Jeremy J.; Weiner, Michael A.; Weitz, David A.

    2014-01-01

    Droplet-based microfluidic techniques can form and process micrometer scale droplets at thousands per second. Each droplet can house an individual biochemical reaction, allowing millions of reactions to be performed in minutes with small amounts of total reagent. This versatile approach has been used for engineering enzymes, quantifying concentrations of DNA in solution, and screening protein crystallization conditions. Here, we use it to read the sequences of DNA molecules with a FRET-based assay. Using probes of different sequences, we interrogate a target DNA molecule for polymorphisms. With a larger probe set, additional polymorphisms can be interrogated as well as targets of arbitrary sequence. PMID:24185402

  11. Digital DNA detection based on a compact optofluidic laser with ultra-low sample consumption.

    PubMed

    Lee, Wonsuk; Chen, Qiushu; Fan, Xudong; Yoon, Dong Ki

    2016-11-29

    DNA lasers self-amplify optical signals from a DNA analyte as well as thermodynamic differences between sequences, allowing quasi-digital DNA detection. However, these systems have drawbacks, such as relatively large sample consumption and complicated dye labelling. Moreover, although the lasing signal can detect the target DNA, it is superimposed on an unintended fluorescence background, which persists for non-target DNA samples as well. From an optical point of view, it is thus not truly digital detection and requires spectral analysis to identify the target. In this work, we propose and demonstrate an optofluidic laser that has a single layer of DNA molecules as the gain material. A target DNA produces intensive laser emission comparable to existing DNA lasers, while any unnecessary fluorescence background is successfully suppressed. As a result, the target DNA can be detected with a single laser pulse, in a truly digital manner. Since the DNA molecules cover only a single layer on the surface of the laser microcavity, the DNA sample consumption is a few orders of magnitude lower than that of existing DNA lasers. Furthermore, the DNA molecules are stained by simply immersing the microcavity in the intercalating dye solution, and thus the proposed DNA laser is free of any complex dye-labelling process prior to analysis.

  12. Topological events in single molecules of E. coli DNA confined in nanochannels

    PubMed Central

    Reifenberger, Jeffrey G.; Dorfman, Kevin D.; Cao, Han

    2015-01-01

    We present experimental data concerning potential topological events such as folds, internal backfolds, and/or knots within long molecules of double-stranded DNA when they are stretched by confinement in a nanochannel. Genomic DNA from E. coli was labeled near the ‘GCTCTTC’ sequence with a fluorescently labeled dUTP analog and stained with the DNA intercalator YOYO. Individual long molecules of DNA were then linearized and imaged using methods based on the NanoChannel Array technology (Irys® System) available from BioNano Genomics. Data were collected on 189,153 molecules of length greater than 50 kilobases. A custom code was developed to search for abnormal intensity spikes in the YOYO backbone profile along the length of individual molecules. By correlating the YOYO intensity spikes with the aligned barcode pattern to the reference, we were able to correlate the bright intensity regions of YOYO with abnormal stretching in the molecule, which suggests these events were either a knot or a region of internal backfolding within the DNA. We interpret the results of our experiments involving molecules exceeding 50 kilobases in the context of existing simulation data for relatively short DNA, typically several kilobases. The frequency of these events is lower than the predictions from simulations, while the size of the events is larger than simulation predictions and often exceeds the molecular weight of the simulated molecules. We also identified DNA molecules that exhibit large, single folds as they enter the nanochannels. Overall, topological events occur at a low frequency (~7% of all molecules) and pose an easily surmountable obstacle for the practice of genome mapping in nanochannels. PMID:25991508

  13. Identification of sequence-structure RNA binding motifs for SELEX-derived aptamers.

    PubMed

    Hoinka, Jan; Zotenko, Elena; Friedman, Adam; Sauna, Zuben E; Przytycka, Teresa M

    2012-06-15

    Systematic Evolution of Ligands by EXponential Enrichment (SELEX) represents a state-of-the-art technology to isolate single-stranded (ribo)nucleic acid fragments, named aptamers, which bind to a molecule (or molecules) of interest via specific structural regions induced by their sequence-dependent fold. This powerful method has applications in designing protein inhibitors, molecular detection systems, therapeutic drugs and antibody replacement among others. However, full understanding and consequently optimal utilization of the process has lagged behind its wide application due to the lack of dedicated computational approaches. At the same time, the combination of SELEX with novel sequencing technologies is beginning to provide the data that will allow the examination of a variety of properties of the selection process. To close this gap we developed, Aptamotif, a computational method for the identification of sequence-structure motifs in SELEX-derived aptamers. To increase the chances of identifying functional motifs, Aptamotif uses an ensemble-based approach. We validated the method using two published aptamer datasets containing experimentally determined motifs of increasing complexity. We were able to recreate the author's findings to a high degree, thus proving the capability of our approach to identify binding motifs in SELEX data. Additionally, using our new experimental dataset, we illustrate the application of Aptamotif to elucidate several properties of the selection process.

  14. Shotgun Optical Maps of the Whole Escherichia coli O157:H7 Genome

    PubMed Central

    Lim, Alex; Dimalanta, Eileen T.; Potamousis, Konstantinos D.; Yen, Galex; Apodoca, Jennifer; Tao, Chunhong; Lin, Jieyi; Qi, Rong; Skiadas, John; Ramanathan, Arvind; Perna, Nicole T.; Plunkett, Guy; Burland, Valerie; Mau, Bob; Hackett, Jeremiah; Blattner, Frederick R.; Anantharaman, Thomas S.; Mishra, Bhubaneswar; Schwartz, David C.

    2001-01-01

    We have constructed NheI and XhoI optical maps of Escherichia coli O157:H7 solely from genomic DNA molecules to provide a uniquely valuable scaffold for contig closure and sequence validation. E. coli O157:H7 is a common pathogen found in contaminated food and water. Our approach obviated the need for the analysis of clones, PCR products, and hybridizations, because maps were constructed from ensembles of single DNA molecules. Shotgun sequencing of bacterial genomes remains labor-intensive, despite advances in sequencing technology. This is partly due to manual intervention required during the last stages of finishing. The applicability of optical mapping to this problem was enhanced by advances in machine vision techniques that improved mapping throughput and created a path to full automation of mapping. Comparisons were made between maps and sequence data that characterized sequence gaps and guided nascent assemblies. PMID:11544203

  15. Molecular vibrations in metal-single-molecule-metal junctions

    NASA Astrophysics Data System (ADS)

    Yokota, Kazumichi; Taniguchi, Masateru; Kawai, Tomoji

    2010-03-01

    Molecular vibrations in a metal-single-molecule-metal junction were studied based on density functional theory using a single benzenedithiolate molecule connected between gold clusters. We found that the difference in vibrational energy between an isolated benzenedithiol and the single-molecule junction is less than 3% in the energy range above 540 cm -1, where sulfur atoms contribute little to molecular vibrations. The finding implies that we can predict the peak energy in the inelastic electron tunneling spectrum of the single-molecule junction in the high energy range by vibrational analyses of isolated molecules.

  16. A cost effective 5΄ selective single cell transcriptome profiling approach with improved UMI design

    PubMed Central

    Arguel, Marie-Jeanne; LeBrigand, Kevin; Paquet, Agnès; Ruiz García, Sandra; Zaragosi, Laure-Emmanuelle; Waldmann, Rainer

    2017-01-01

    Abstract Single cell RNA sequencing approaches are instrumental in studies of cell-to-cell variability. 5΄ selective transcriptome profiling approaches allow simultaneous definition of the transcription start size and have advantages over 3΄ selective approaches which just provide internal sequences close to the 3΄ end. The only currently existing 5΄ selective approach requires costly and labor intensive fragmentation and cell barcoding after cDNA amplification. We developed an optimized 5΄ selective workflow where all the cell indexing is done prior to fragmentation. With our protocol, cell indexing can be performed in the Fluidigm C1 microfluidic device, resulting in a significant reduction of cost and labor. We also designed optimized unique molecular identifiers that show less sequence bias and vulnerability towards sequencing errors resulting in an improved accuracy of molecule counting. We provide comprehensive experimental workflows for Illumina and Ion Proton sequencers that allow single cell sequencing in a cost range comparable to qPCR assays. PMID:27940562

  17. Ranalexin. A novel antimicrobial peptide from bullfrog (Rana catesbeiana) skin, structurally related to the bacterial antibiotic, polymyxin.

    PubMed

    Clark, D P; Durell, S; Maloy, W L; Zasloff, M

    1994-04-08

    Antimicrobial peptides comprise a diverse class of molecules used in host defense by plants, insects, and animals. In this study we have isolated a novel antimicrobial peptide from the skin of the bullfrog, Rana catesbeiana. This 20 amino acid peptide, which we have termed Ranalexin, has the amino acid sequence: NH2-Phe-Leu-Gly-Gly-Leu-Ile-Lys-Ile-Val-Pro-Ala-Met-Ile-Cys-Ala-Val-Thr- Lys-Lys - Cys-COOH, and it contains a single intramolecular disulfide bond which forms a heptapeptide ring within the molecule. Structurally, Ranalexin resembles the bacterial antibiotic, polymyxin, which contains a similar heptapeptide ring. We have also cloned the cDNA for Ranalexin from a metamorphic R. catesbeiana tadpole cDNA library. Based on the cDNA sequence, it appears that Ranalexin is initially synthesized as a propeptide with a putative signal sequence and an acidic amino acid-rich region at its amino-terminal end. Interestingly, the putative signal sequence of the Ranalexin cDNA is strikingly similar to the signal sequence of opioid peptide precursors isolated from the skin of the South American frogs Phyllomedusa sauvagei and Phyllomedusa bicolor. Northern blot analysis and in situ hybridization experiments demonstrated that Ranalexin mRNA is first expressed in R. catesbeiana skin at metamorphosis and continues to be expressed into adulthood.

  18. Quantitation of fetal DNA fraction in maternal plasma using circulating single molecule amplification and re-sequencing technology (cSMART).

    PubMed

    Song, Yijun; Zhou, Xiya; Huang, Saiqiong; Li, Xiaohong; Qi, Qingwei; Jiang, Yulin; Liu, Yiqian; Ma, Chengcheng; Li, Zhifeng; Xu, Mengnan; Cram, David S; Liu, Juntao

    2016-05-01

    Calculation of the fetal DNA fraction (FF) is important for reliable and accurate noninvasive prenatal testing (NIPT) for fetal genetic abnormalities. The aim of the study was to develop and validate a novel method for FF determination. FF was calculated using the chromosome Y (ChrY) sequence read assay and by circulating single molecule amplification and re-sequencing technology of 76 autosomal SNPs. By Pearson correlation for FF (4.73-22.11%) in 33 male pregnancy samples, the R(2) co-efficient for the 76-SNP versus the ChrY assay was 0.9572 (p<0.001). In addition, the co-efficient of variation (CV) of FF measurement by the 76-SNP assay was low (0.15-0.35). As a control, the FF measurement for four non-pregnant plasma samples was virtually zero. In prospective longitudinal studies of 14 women with normal pregnancies, FF generally increased with gestational age. However, in eight women (71%) there was a significant decrease in FF between the first trimester (11-13 weeks) and the second trimester (15-19 weeks), and this was attributable to significant maternal weight gain. The novel 76-SNP cSMART assay has the precision to accurately measure FF in all pregnancies at a detection threshold of 5%. Based on FF trends in individual pregnancies, our results suggest that the end of the first trimester may be a more optimal window for performing NIPT. Copyright © 2016 Elsevier B.V. All rights reserved.

  19. Controlled chain polymerisation and chemical soldering for single-molecule electronics.

    PubMed

    Okawa, Yuji; Akai-Kasaya, Megumi; Kuwahara, Yuji; Mandal, Swapan K; Aono, Masakazu

    2012-05-21

    Single functional molecules offer great potential for the development of novel nanoelectronic devices with capabilities beyond today's silicon-based devices. To realise single-molecule electronics, the development of a viable method for connecting functional molecules to each other using single conductive polymer chains is required. The method of initiating chain polymerisation using the tip of a scanning tunnelling microscope (STM) is very useful for fabricating single conductive polymer chains at designated positions and thereby wiring single molecules. In this feature article, developments in the controlled chain polymerisation of diacetylene compounds and the properties of polydiacetylene chains are summarised. Recent studies of "chemical soldering", a technique enabling the covalent connection of single polydiacetylene chains to single functional molecules, are also introduced. This represents a key step in advancing the development of single-molecule electronics.

  20. Reaching the Ionic Current Detection Limit in Silicon-Based Nanopores

    NASA Astrophysics Data System (ADS)

    Puster, Matthew; Rodriguez-Manzo, Julio Alejandro; Nicolai, Adrien; Meunier, Vincent; Drndic, Marija

    2015-03-01

    Solid-state nanopores act as single-molecule sensors whereby passage of an individual molecule in aqueous electrolyte through a nanopore is registered as a change in ionic conductance (ΔG). Future nanopore applications such as DNA sequencing at high bandwidth require high ΔG for optimal signal-to-noise ratio. Reducing the nanopore diameter and thickness increase ΔG. Molecule size limits the diameter, thus efforts concentrate on minimizing the thickness by thinning oxide/nitride films or using 2D materials. Weighted by electrolyte conductivity the highest ΔG reported to date for DNA translocations were obtained with nanopores made in oxide/nitride films. We present a controlled electron irradiation technique to thin such films to the limit of their stability, producing nanopores tailored to molecule size in amorphous Si with thicknesses less than 2 nm. We compare ΔG values with results found in the literature for DNA translocation through these nanopores, where access resistance becomes comparable to the resistance through the nanopore itself.

  1. Comprehensive profiling and quantitation of oncogenic mutations in non-small cell lung carcinoma using single-molecule amplification and re-sequencing technology.

    PubMed

    Shi, Jian; Yuan, Meng; Wang, Zhan-Dong; Xu, Xiao-Li; Hong, Lei; Sun, Shenglin

    2017-02-01

    The carcinogenesis of non-small cell lung carcinoma has been found to associate with activating and resistant mutations in the tyrosine kinase domain of specific oncogenes. Here, we assessed the type, frequency, and abundance of epithelial growth factor receptor, KRAS, BRAF, and ALK mutations in 154 non-small cell lung carcinoma specimens using single-molecule amplification and re-sequencing technology. We found that epithelial growth factor receptor mutations were the most prevalent (44.2%), followed by KRAS (18.8%), ALK (7.8%), and BRAF (5.8%) mutations. The type and abundance of the mutations in tumor specimens appeared to be heterogeneous. Thus, we conclude that identification of clinically significant oncogenic mutations may improve the classification of patients and provide valuable information for determination of the therapeutic strategies.

  2. Amino acids 16-275 of minute virus of mice NS1 include a domain that specifically binds (ACCA)2-3-containing DNA.

    PubMed

    Mouw, M; Pintel, D J

    1998-11-10

    GST-NS1 purified from Escherichia coli and insect cells binds double-strand DNA in an (ACCA)2-3-dependent fashion under similar ionic conditions, independent of the presence of anti-NS1 antisera or exogenously supplied ATP and interacts with single-strand DNA and RNA in a sequence-independent manner. An amino-terminal domain (amino acids 1-275) of NS1 [GST-NS1(1-275)], representing 41% of the full-length NS1 molecule, includes a domain that binds double-strand DNA in a sequence-specific manner at levels comparable to full-length GST-NS1, as well as single-strand DNA and RNA in a sequence-independent manner. The deletion of 15 additional amino-terminal amino acids yielded a molecule [GST-NS1(1-275)] that maintained (ACCA)2-3-specific double-strand DNA binding; however, this molecule was more sensitive to increasing ionic conditions than full-length GST-NS1 and GST-NS1(1-275) and could not be demonstrated to bind single-strand nucleic acids. A quantitative filter binding assay showed that E. coli- and baculovirus-expressed GST-NS1 and E. coli GST-NS1(1-275) specifically bound double-strand DNA with similar equilibrium kinetics [as measured by their apparent equilibrium DNA binding constants (KD)], whereas GST-NS1(16-275) bound 4- to 8-fold less well. Copyright 1998 Academic Press.

  3. Massively Parallel, Molecular Analysis Platform Developed Using a CMOS Integrated Circuit With Biological Nanopores

    PubMed Central

    Roever, Stefan

    2012-01-01

    A massively parallel, low cost molecular analysis platform will dramatically change the nature of protein, molecular and genomics research, DNA sequencing, and ultimately, molecular diagnostics. An integrated circuit (IC) with 264 sensors was fabricated using standard CMOS semiconductor processing technology. Each of these sensors is individually controlled with precision analog circuitry and is capable of single molecule measurements. Under electronic and software control, the IC was used to demonstrate the feasibility of creating and detecting lipid bilayers and biological nanopores using wild type α-hemolysin. The ability to dynamically create bilayers over each of the sensors will greatly accelerate pore development and pore mutation analysis. In addition, the noise performance of the IC was measured to be 30fA(rms). With this noise performance, single base detection of DNA was demonstrated using α-hemolysin. The data shows that a single molecule, electrical detection platform using biological nanopores can be operationalized and can ultimately scale to millions of sensors. Such a massively parallel platform will revolutionize molecular analysis and will completely change the field of molecular diagnostics in the future.

  4. DNA-templated synthesis of Pt nanoparticles on single-walled carbon nanotubes.

    PubMed

    Dong, Lifeng

    2009-11-18

    A series of electron microscopy characterizations demonstrate that single-stranded deoxyribonucleic acid (ssDNA) can bind to nanotube surfaces and disperse bundled single-walled carbon nanotubes (SWCNTs) into individual tubes. The ssDNA molecules on the nanotube surfaces demonstrate various morphologies, such as aggregated clusters and spiral wrapping around a nanotube with different pitches and spaces, indicating that the morphology of the SWCNT/DNA hybrids is not related solely to the base sequence of the ssDNA or the chirality or the diameter of the nanotubes. In addition to serving as a non-covalent dispersion agent, the ssDNA molecules bonded to the nanotube surface can provide addresses for localizing Pt(II) complexes along the nanotubes. The Pt nanoparticles obtained by a reduction of the Pt2+-DNA adducts are crystals with a size of < or =1-2 nm. These results expand our understanding of the interactions between ssDNA and SWCNTs and provide an efficient approach for positioning Pt and other metal particles, with uniform sizes and without aggregations, along the nanotube surfaces for applications in direct ethanol/methanol fuel cells and nanoscale electronics.

  5. Developing Single-Molecule TPM Experiments for Direct Observation of Successful RecA-Mediated Strand Exchange Reaction

    PubMed Central

    Fan, Hsiu-Fang; Cox, Michael M.; Li, Hung-Wen

    2011-01-01

    RecA recombinases play a central role in homologous recombination. Once assembled on single-stranded (ss) DNA, RecA nucleoprotein filaments mediate the pairing of homologous DNA sequences and strand exchange processes. We have designed two experiments based on tethered particle motion (TPM) to investigate the fates of the invading and the outgoing strands during E. coli RecA-mediated pairing and strand exchange at the single-molecule level in the absence of force. TPM experiments measure the tethered bead Brownian motion indicative of the DNA tether length change resulting from RecA binding and dissociation. Experiments with beads labeled on either the invading strand or the outgoing strand showed that DNA pairing and strand exchange occurs successfully in the presence of either ATP or its non-hydrolyzable analog, ATPγS. The strand exchange rates and efficiencies are similar under both ATP and ATPγS conditions. In addition, the Brownian motion time-courses suggest that the strand exchange process progresses uni-directionally in the 5′-to-3′ fashion, using a synapse segment with a wide and continuous size distribution. PMID:21765895

  6. Development of Scoring Functions for Antibody Sequence Assessment and Optimization

    PubMed Central

    Seeliger, Daniel

    2013-01-01

    Antibody development is still associated with substantial risks and difficulties as single mutations can radically change molecule properties like thermodynamic stability, solubility or viscosity. Since antibody generation methodologies cannot select and optimize for molecule properties which are important for biotechnological applications, careful sequence analysis and optimization is necessary to develop antibodies that fulfil the ambitious requirements of future drugs. While efforts to grab the physical principles of undesired molecule properties from the very bottom are becoming increasingly powerful, the wealth of publically available antibody sequences provides an alternative way to develop early assessment strategies for antibodies using a statistical approach which is the objective of this paper. Here, publically available sequences were used to develop heuristic potentials for the framework regions of heavy and light chains of antibodies of human and murine origin. The potentials take into account position dependent probabilities of individual amino acids but also conditional probabilities which are inevitable for sequence assessment and optimization. It is shown that the potentials derived from human sequences clearly distinguish between human sequences and sequences from mice and, hence, can be used as a measure of humaness which compares a given sequence with the phenotypic pool of human sequences instead of comparing sequence identities to germline genes. Following this line, it is demonstrated that, using the developed potentials, humanization of an antibody can be described as a simple mathematical optimization problem and that the in-silico generated framework variants closely resemble native sequences in terms of predicted immunogenicity. PMID:24204701

  7. SMRT sequencing data for Garcinia mangostana L. variety Mesta.

    PubMed

    Midin, Mohd Razik; Loke, Kok-Keong; Madon, Maria; Nordin, Mohd Shukor; Goh, Hoe-Han; Mohd Noor, Normah

    2017-06-01

    The "Queen of Fruits" mangosteen ( Garcinia mangostana L.) produces commercially important fruits with desirable taste of flesh and pericarp rich in xanthones with medicinal properties. To date, only limited knowledge is available on the cytogenetics and genome sequences of a common variety of mangosteen (Abu Bakar et al., 2016 [1]). Here, we report the first single-molecule real-time (SMRT) sequencing data from whole genome sequencing of mangosteen of Mesta variety. Raw reads of the SMRT sequencing project can be obtained from SRA database with the accession numbers SRX2718652 until SRX2718659.

  8. Versatile microfluidic total internal reflection (TIR)-based devices: application to microbeads velocity measurement and single molecule detection with upright and inverted microscope.

    PubMed

    Le, Nam Cao Hoai; Yokokawa, Ryuji; Dao, Dzung Viet; Nguyen, Thien Duy; Wells, John C; Sugiyama, Susumu

    2009-01-21

    A poly(dimethylsiloxane) (PDMS) chip for Total Internal Reflection (TIR)-based imaging and detection has been developed using Si bulk micromachining and PDMS casting. In this paper, we report the applications of the chip on both inverted and upright fluorescent microscopes and confirm that two types of sample delivery platforms, PDMS microchannel and glass microchannel, can be easily integrated depending on the magnification of an objective lens needed to visualize a sample. Although any device configuration can be achievable, here we performed two experiments to demonstrate the versatility of the microfluidic TIR-based devices. The first experiment was velocity measurement of Nile red microbeads with nominal diameter of 500 nm in a pressure-driven flow. The time-sequenced fluorescent images of microbeads, illuminated by an evanescent field, were cross-correlated by a Particle Image Velocimetry (PIV) program to obtain near-wall velocity field of the microbeads at various flow rates from 500 nl/min to 3000 nl/min. We then evaluated the capabilities of the device for Single Molecule Detection (SMD) of fluorescently labeled DNA molecules from 30 bp to 48.5 kbp and confirm that DNA molecules as short as 1105 bp were detectable. Our versatile, integrated device could provide low-cost and fast accessibility to Total Internal Reflection Fluorescent Microscopy (TIRFM) on both conventional upright and inverted microscopes. It could also be a useful component in a Micro-Total Analysis System (micro-TAS) to analyze nanoparticles or biomolecules near-wall transport or motion.

  9. Classification of DNA nucleotides with transverse tunneling currents

    NASA Astrophysics Data System (ADS)

    Nyvold Pedersen, Jonas; Boynton, Paul; Di Ventra, Massimiliano; Jauho, Antti-Pekka; Flyvbjerg, Henrik

    2017-01-01

    It has been theoretically suggested and experimentally demonstrated that fast and low-cost sequencing of DNA, RNA, and peptide molecules might be achieved by passing such molecules between electrodes embedded in a nanochannel. The experimental realization of this scheme faces major challenges, however. In realistic liquid environments, typical currents in tunneling devices are of the order of picoamps. This corresponds to only six electrons per microsecond, and this number affects the integration time required to do current measurements in real experiments. This limits the speed of sequencing, though current fluctuations due to Brownian motion of the molecule average out during the required integration time. Moreover, data acquisition equipment introduces noise, and electronic filters create correlations in time-series data. We discuss how these effects must be included in the analysis of, e.g., the assignment of specific nucleobases to current signals. As the signals from different molecules overlap, unambiguous classification is impossible with a single measurement. We argue that the assignment of molecules to a signal is a standard pattern classification problem and calculation of the error rates is straightforward. The ideas presented here can be extended to other sequencing approaches of current interest.

  10. Ligase Detection Reaction Generation of Reverse Molecular Beacons for Near Real-Time Analysis of Bacterial Pathogens Using Single-Pair Fluorescence Resonance Energy Transfer and a Cyclic Olefin Copolymer Microfluidic Chip

    PubMed Central

    Peng, Zhiyong; Soper, Steven A.; Pingle, Maneesh R.; Barany, Francis; Davis, Lloyd M.

    2015-01-01

    Detection of pathogenic bacteria and viruses require strategies that can signal the presence of these targets in near real-time due to the potential threats created by rapid dissemination into water and/or food supplies. In this paper, we report an innovative strategy that can rapidly detect bacterial pathogens using reporter sequences found in their genome without requiring polymerase chain reaction (PCR). A pair of strain-specific primers was designed based on the 16S rRNA gene and were end-labeled with a donor (Cy5) or acceptor (Cy5.5) dye. In the presence of the target bacterium, the primers were joined using a ligase detection reaction (LDR) only when the primers were completely complementary to the target sequence to form a reverse molecular beacon (rMB), thus bringing Cy5 (donor) and Cy5.5 (acceptor) into close proximity to allow fluorescence resonance energy transfer (FRET) to occur. These rMBs were subsequently analyzed using single-molecule detection of the FRET pairs (single-pair FRET; spFRET). The LDR was performed using a continuous flow thermal cycling process configured in a cyclic olefin copolymer (COC) microfluidic device using either 2 or 20 thermal cycles. Single-molecule photon bursts from the resulting rMBs were detected on-chip and registered using a simple laser-induced fluorescence (LIF) instrument. The spFRET signatures from the target pathogens were reported in as little as 2.6 min using spFRET. PMID:21047095

  11. Re-polarization of nuclear spins using selective SABRE-INEPT.

    PubMed

    Knecht, Stephan; Kiryutin, Alexey S; Yurkovskaya, Alexandra V; Ivanov, Konstantin L

    2018-02-01

    A method is proposed for significant improvement of NMR pulse sequences used in high-field SABRE (Signal Amplification By Reversible Exchange) experiments. SABRE makes use of spin order transfer from parahydrogen (pH 2 , the H 2 molecule in its singlet spin state) to a substrate in a transient organometallic Ir-based complex. The technique proposed here utilizes "re-polarization", i.e., multiple application of an NMR pulse sequence used for spin order transfer. During re-polarization only the form of the substrate, which is bound to the complex, is excited by selective NMR pulses and the resulting polarization is transferred to the free substrate via chemical exchange. Owing to the fact that (i) only a small fraction of the substrate molecules is in the bound form and (ii) spin relaxation of the free substrate is slow, the re-polarization scheme provides greatly improved NMR signal enhancement, ε. For instance, when pyridine is used as a substrate, single use of the SABRE-INEPT sequence provides ε≈260 for 15 N nuclei, whereas SABRE-INEPT with re-polarization yields ε>2000. We anticipate that the proposed method is useful for achieving maximal NMR enhancement with spin hyperpolarization techniques. Copyright © 2017 Elsevier Inc. All rights reserved.

  12. Re-polarization of nuclear spins using selective SABRE-INEPT

    NASA Astrophysics Data System (ADS)

    Knecht, Stephan; Kiryutin, Alexey S.; Yurkovskaya, Alexandra V.; Ivanov, Konstantin L.

    2018-02-01

    A method is proposed for significant improvement of NMR pulse sequences used in high-field SABRE (Signal Amplification By Reversible Exchange) experiments. SABRE makes use of spin order transfer from parahydrogen (pH2, the H2 molecule in its singlet spin state) to a substrate in a transient organometallic Ir-based complex. The technique proposed here utilizes "re-polarization", i.e., multiple application of an NMR pulse sequence used for spin order transfer. During re-polarization only the form of the substrate, which is bound to the complex, is excited by selective NMR pulses and the resulting polarization is transferred to the free substrate via chemical exchange. Owing to the fact that (i) only a small fraction of the substrate molecules is in the bound form and (ii) spin relaxation of the free substrate is slow, the re-polarization scheme provides greatly improved NMR signal enhancement, ε . For instance, when pyridine is used as a substrate, single use of the SABRE-INEPT sequence provides ε ≈ 260 for 15N nuclei, whereas SABRE-INEPT with re-polarization yields ε > 2000 . We anticipate that the proposed method is useful for achieving maximal NMR enhancement with spin hyperpolarization techniques.

  13. Accurate RNA consensus sequencing for high-fidelity detection of transcriptional mutagenesis-induced epimutations.

    PubMed

    Reid-Bayliss, Kate S; Loeb, Lawrence A

    2017-08-29

    Transcriptional mutagenesis (TM) due to misincorporation during RNA transcription can result in mutant RNAs, or epimutations, that generate proteins with altered properties. TM has long been hypothesized to play a role in aging, cancer, and viral and bacterial evolution. However, inadequate methodologies have limited progress in elucidating a causal association. We present a high-throughput, highly accurate RNA sequencing method to measure epimutations with single-molecule sensitivity. Accurate RNA consensus sequencing (ARC-seq) uniquely combines RNA barcoding and generation of multiple cDNA copies per RNA molecule to eliminate errors introduced during cDNA synthesis, PCR, and sequencing. The stringency of ARC-seq can be scaled to accommodate the quality of input RNAs. We apply ARC-seq to directly assess transcriptome-wide epimutations resulting from RNA polymerase mutants and oxidative stress.

  14. Comparative Genomic Analysis of Two Clonally Related Multidrug Resistant Mycobacterium tuberculosis by Single Molecule Real Time Sequencing.

    PubMed

    Leung, Kenneth Siu-Sing; Siu, Gilman Kit-Hang; Tam, Kingsley King-Gee; To, Sabrina Wai-Chi; Rajwani, Rahim; Ho, Pak-Leung; Wong, Samson Sai-Yin; Zhao, Wei W; Ma, Oliver Chiu-Kit; Yam, Wing-Cheong

    2017-01-01

    Background: Multidrug-resistant tuberculosis (MDR-TB) is posing a major threat to global TB control. In this study, we focused on two consecutive MDR-TB isolated from the same patient before and after the initiation of anti-TB treatment. To better understand the genomic characteristics of MDR-TB, Single Molecule Real-Time (SMRT) Sequencing and comparative genomic analyses was performed to identify mutations that contributed to the stepwise development of drug resistance and growth fitness in MDR-TB under in vivo challenge of anti-TB drugs. Result: Both pre-treatment and post-treatment strain demonstrated concordant phenotypic and genotypic susceptibility profiles toward rifampicin, pyrazinamide, streptomycin, fluoroquinolones, aminoglycosides, cycloserine, ethionamide, and para-aminosalicylic acid. However, although both strains carried identical missense mutations at rpoB S531L, inhA C-15T, and embB M306V, MYCOTB Sensititre assay showed that the post-treatment strain had 16-, 8-, and 4-fold elevation in the minimum inhibitory concentrations (MICs) toward rifabutin, isoniazid, and ethambutol respectively. The results have indicated the presence of additional resistant-related mutations governing the stepwise development of MDR-TB. Further comparative genomic analyses have identified three additional polymorphisms between the clinical isolates. These include a single nucleotide deletion at nucleotide position 360 of rv0888 in pre-treatment strain, and a missense mutation at rv3303c ( lpdA) V44I and a 6-bp inframe deletion at codon 67-68 in rv2071c ( cobM) in the post-treatment strain. Multiple sequence alignment showed that these mutations were occurring at highly conserved regions among pathogenic mycobacteria. Using structural-based and sequence-based algorithms, we further predicted that the mutations potentially have deleterious effect on protein function. Conclusion: This is the first study that compared the full genomes of two clonally-related MDR-TB clinical isolates during the course of anti-TB treatment. Our work has demonstrated the robustness of SMRT Sequencing in identifying mutations among MDR-TB clinical isolates. Comparative genome analysis also suggested novel mutations at rv0888, lpdA , and cobM that might explain the difference in antibiotic resistance and growth pattern between the two MDR-TB strains.

  15. Genome sequence determination and metagenomic characterization of a Dehalococcoides mixed culture grown on cis-1,2-dichloroethene.

    PubMed

    Yohda, Masafumi; Yagi, Osami; Takechi, Ayane; Kitajima, Mizuki; Matsuda, Hisashi; Miyamura, Naoaki; Aizawa, Tomoko; Nakajima, Mutsuyasu; Sunairi, Michio; Daiba, Akito; Miyajima, Takashi; Teruya, Morimi; Teruya, Kuniko; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Juan, Ayaka; Nakano, Kazuma; Aoyama, Misako; Terabayashi, Yasunobu; Satou, Kazuhito; Hirano, Takashi

    2015-07-01

    A Dehalococcoides-containing bacterial consortium that performed dechlorination of 0.20 mM cis-1,2-dichloroethene to ethene in 14 days was obtained from the sediment mud of the lotus field. To obtain detailed information of the consortium, the metagenome was analyzed using the short-read next-generation sequencer SOLiD 3. Matching the obtained sequence tags with the reference genome sequences indicated that the Dehalococcoides sp. in the consortium was highly homologous to Dehalococcoides mccartyi CBDB1 and BAV1. Sequence comparison with the reference sequence constructed from 16S rRNA gene sequences in a public database showed the presence of Sedimentibacter, Sulfurospirillum, Clostridium, Desulfovibrio, Parabacteroides, Alistipes, Eubacterium, Peptostreptococcus and Proteocatella in addition to Dehalococcoides sp. After further enrichment, the members of the consortium were narrowed down to almost three species. Finally, the full-length circular genome sequence of the Dehalococcoides sp. in the consortium, D. mccartyi IBARAKI, was determined by analyzing the metagenome with the single-molecule DNA sequencer PacBio RS. The accuracy of the sequence was confirmed by matching it to the tag sequences obtained by SOLiD 3. The genome is 1,451,062 nt and the number of CDS is 1566, which includes 3 rRNA genes and 47 tRNA genes. There exist twenty-eight RDase genes that are accompanied by the genes for anchor proteins. The genome exhibits significant sequence identity with other Dehalococcoides spp. throughout the genome, but there exists significant difference in the distribution RDase genes. The combination of a short-read next-generation DNA sequencer and a long-read single-molecule DNA sequencer gives detailed information of a bacterial consortium. Copyright © 2014 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.

  16. End-to-end distance and contour length distribution functions of DNA helices

    NASA Astrophysics Data System (ADS)

    Zoli, Marco

    2018-06-01

    I present a computational method to evaluate the end-to-end and the contour length distribution functions of short DNA molecules described by a mesoscopic Hamiltonian. The method generates a large statistical ensemble of possible configurations for each dimer in the sequence, selects the global equilibrium twist conformation for the molecule, and determines the average base pair distances along the molecule backbone. Integrating over the base pair radial and angular fluctuations, I derive the room temperature distribution functions as a function of the sequence length. The obtained values for the most probable end-to-end distance and contour length distance, providing a measure of the global molecule size, are used to examine the DNA flexibility at short length scales. It is found that, also in molecules with less than ˜60 base pairs, coiled configurations maintain a large statistical weight and, consistently, the persistence lengths may be much smaller than in kilo-base DNA.

  17. Method for performing site-specific affinity fractionation for use in DNA sequencing

    DOEpatents

    Mirzabekov, Andrei Darievich; Lysov, Yuri Petrovich; Dubley, Svetlana A.

    1999-01-01

    A method for fractionating and sequencing DNA via affinity interaction is provided comprising contacting cleaved DNA to a first array of oligonucleotide molecules to facilitate hybridization between said cleaved DNA and the molecules; extracting the hybridized DNA from the molecules; contacting said extracted hybridized DNA with a second array of oligonucleotide molecules, wherein the oligonucleotide molecules in the second array have specified base sequences that are complementary to said extracted hybridized DNA; and attaching labeled DNA to the second array of oligonucleotide molecules, wherein the labeled re-hybridized DNA have sequences that are complementary to the oligomers. The invention further provides a method for performing multi-step conversions of the chemical structure of compounds comprising supplying an array of polyacrylamide vessels separated by hydrophobic surfaces; immobilizing a plurality of reactants, such as enzymes, in the vessels so that each vessel contains one reactant; contacting the compounds to each of the vessels in a predetermined sequence and for a sufficient time to convert the compounds to a desired state; and isolating the converted compounds from said array.

  18. Miniaturized reaction vessel system, method for performing site-specific biochemical reactions and affinity fractionation for use in DNA sequencing

    DOEpatents

    Mirzabekov, Andrei Darievich; Lysov, Yuri Petrovich; Dubley, Svetlana A.

    2000-01-01

    A method for fractionating and sequencing DNA via affinity interaction is provided comprising contacting cleaved DNA to a first array of oligonucleotide molecules to facilitate hybridization between said cleaved DNA and the molecules; extracting the hybridized DNA from the molecules; contacting said extracted hybridized DNA with a second array of oligonucleotide molecules, wherein the oligonucleotide molecules in the second array have specified base sequences that are complementary to said extracted hybridized DNA; and attaching labeled DNA to the second array of oligonucleotide molecules, wherein the labeled re-hybridized DNA have sequences that are complementary to the oligomers. The invention further provides a method for performing multi-step conversions of the chemical structure of compounds comprising supplying an array of polyacrylamide vessels separated by hydrophobic surfaces; immobilizing a plurality of reactants, such as enzymes, in the vessels so that each vessel contains one reactant; contacting the compounds to each of the vessels in a predetermined sequence and for a sufficient time to convert the compounds to a desired state; and isolating the converted compounds from said array.

  19. Method for performing site-specific affinity fractionation for use in DNA sequencing

    DOEpatents

    Mirzabekov, A.D.; Lysov, Y.P.; Dubley, S.A.

    1999-05-18

    A method for fractionating and sequencing DNA via affinity interaction is provided comprising contacting cleaved DNA to a first array of oligonucleotide molecules to facilitate hybridization between the cleaved DNA and the molecules; extracting the hybridized DNA from the molecules; contacting the extracted hybridized DNA with a second array of oligonucleotide molecules, wherein the oligonucleotide molecules in the second array have specified base sequences that are complementary to the extracted hybridized DNA; and attaching labeled DNA to the second array of oligonucleotide molecules, wherein the labeled re-hybridized DNA have sequences that are complementary to the oligomers. The invention further provides a method for performing multi-step conversions of the chemical structure of compounds comprising supplying an array of polyacrylamide vessels separated by hydrophobic surfaces; immobilizing a plurality of reactants, such as enzymes, in the vessels so that each vessel contains one reactant; contacting the compounds to each of the vessels in a predetermined sequence and for a sufficient time to convert the compounds to a desired state; and isolating the converted compounds from the array. 14 figs.

  20. Nucleic acid analysis using terminal-phosphate-labeled nucleotides

    DOEpatents

    Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

    2008-04-22

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  1. Screening the sequence selectivity of DNA-binding molecules using a gold nanoparticle-based colorimetric approach.

    PubMed

    Hurst, Sarah J; Han, Min Su; Lytton-Jean, Abigail K R; Mirkin, Chad A

    2007-09-15

    We have developed a novel competition assay that uses a gold nanoparticle (Au NP)-based, high-throughput colorimetric approach to screen the sequence selectivity of DNA-binding molecules. This assay hinges on the observation that the melting behavior of DNA-functionalized Au NP aggregates is sensitive to the concentration of the DNA-binding molecule in solution. When short, oligomeric hairpin DNA sequences were added to a reaction solution consisting of DNA-functionalized Au NP aggregates and DNA-binding molecules, these molecules may either bind to the Au NP aggregate interconnects or the hairpin stems based on their relative affinity for each. This relative affinity can be measured as a change in the melting temperature (Tm) of the DNA-modified Au NP aggregates in solution. As a proof of concept, we evaluated the selectivity of 4',6-diamidino-2-phenylindone (an AT-specific binder), ethidium bromide (a nonspecific binder), and chromomycin A (a GC-specific binder) for six sequences of hairpin DNA having different numbers of AT pairs in a five-base pair variable stem region. Our assay accurately and easily confirmed the known trends in selectivity for the DNA binders in question without the use of complicated instrumentation. This novel assay will be useful in assessing large libraries of potential drug candidates that work by binding DNA to form a drug/DNA complex.

  2. Instability of plasmid DNA sequences: macro and micro evolution of the antibiotic resistance plasmid R6-5.

    PubMed

    Timmis, K N; Cabello, F; Andrés, I; Nordheim, A; Burkhardt, H J; Cohen, S N

    1978-11-16

    Detailed examination of the structure of cloned DNA fragments of the R6-5 antibiotic resistance plasmid has revealed a substantial degree of polynucleotide sequence heterogeneity and indicates that sequence rearrangements in plasmids and possible other replicons occur more frequently than has hitherto been appreciated. The sequences changes in cloned R6-5 fragments were shown in some instances to have occurred prior to cloning, i.e. existing in the original population of R6-5 molecules that was obtained from a single bacterial clone and by several different criteria judged to be homogeneous, and in others to have occurred either during the cloning procedure or during subsequent propagation of hybrid molecules. The molecular changes that are described involved insertion/deletion of the previously characterized IS2 insertion element, formation of a new inverted repeat structure probably by duplication of a preexisting R6-5 DNA sequence, sequence inversion, and loss and gain of restriction endonuclease cleavage sites.

  3. Crossovers are associated with mutation and biased gene conversion at recombination hotspots.

    PubMed

    Arbeithuber, Barbara; Betancourt, Andrea J; Ebner, Thomas; Tiemann-Boege, Irene

    2015-02-17

    Meiosis is a potentially important source of germline mutations, as sites of meiotic recombination experience recurrent double-strand breaks (DSBs). However, evidence for a local mutagenic effect of recombination from population sequence data has been equivocal, likely because mutation is only one of several forces shaping sequence variation. By sequencing large numbers of single crossover molecules obtained from human sperm for two recombination hotspots, we find direct evidence that recombination is mutagenic: Crossovers carry more de novo mutations than nonrecombinant DNA molecules analyzed for the same donors and hotspots. The observed mutations were primarily CG to TA transitions, with a higher frequency of transitions at CpG than non-CpGs sites. This enrichment of mutations at CpG sites at hotspots could predominate in methylated regions involving frequent single-stranded DNA processing as part of DSB repair. In addition, our data set provides evidence that GC alleles are preferentially transmitted during crossing over, opposing mutation, and shows that GC-biased gene conversion (gBGC) predominates over mutation in the sequence evolution of hotspots. These findings are consistent with the idea that gBGC could be an adaptation to counteract the mutational load of recombination.

  4. Crossovers are associated with mutation and biased gene conversion at recombination hotspots

    PubMed Central

    Arbeithuber, Barbara; Betancourt, Andrea J.; Ebner, Thomas; Tiemann-Boege, Irene

    2015-01-01

    Meiosis is a potentially important source of germline mutations, as sites of meiotic recombination experience recurrent double-strand breaks (DSBs). However, evidence for a local mutagenic effect of recombination from population sequence data has been equivocal, likely because mutation is only one of several forces shaping sequence variation. By sequencing large numbers of single crossover molecules obtained from human sperm for two recombination hotspots, we find direct evidence that recombination is mutagenic: Crossovers carry more de novo mutations than nonrecombinant DNA molecules analyzed for the same donors and hotspots. The observed mutations were primarily CG to TA transitions, with a higher frequency of transitions at CpG than non-CpGs sites. This enrichment of mutations at CpG sites at hotspots could predominate in methylated regions involving frequent single-stranded DNA processing as part of DSB repair. In addition, our data set provides evidence that GC alleles are preferentially transmitted during crossing over, opposing mutation, and shows that GC-biased gene conversion (gBGC) predominates over mutation in the sequence evolution of hotspots. These findings are consistent with the idea that gBGC could be an adaptation to counteract the mutational load of recombination. PMID:25646453

  5. Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

    DOEpatents

    Gardner, Shea N [San Leandro, CA; Mariella, Jr., Raymond P.; Christian, Allen T [Tracy, CA; Young, Jennifer A [Berkeley, CA; Clague, David S [Livermore, CA

    2011-01-18

    A method of fabricating a DNA molecule of user-defined sequence. The method comprises the steps of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an even or odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths. In one embodiment starting sequence fragments are of different lengths, n, n+1, n+2, etc.

  6. Determination of a mutational spectrum

    DOEpatents

    Thilly, William G.; Keohavong, Phouthone

    1991-01-01

    A method of resolving (physically separating) mutant DNA from nonmutant DNA and a method of defining or establishing a mutational spectrum or profile of alterations present in nucleic acid sequences from a sample to be analyzed, such as a tissue or body fluid. The present method is based on the fact that it is possible, through the use of DGGE, to separate nucleic acid sequences which differ by only a single base change and on the ability to detect the separate mutant molecules. The present invention, in another aspect, relates to a method for determining a mutational spectrum in a DNA sequence of interest present in a population of cells. The method of the present invention is useful as a diagnostic or analytical tool in forensic science in assessing environmental and/or occupational exposures to potentially genetically toxic materials (also referred to as potential mutagens); in biotechnology, particularly in the study of the relationship between the amino acid sequence of enzymes and other biologically-active proteins or protein-containing substances and their respective functions; and in determining the effects of drugs, cosmetics and other chemicals for which toxicity data must be obtained.

  7. DNA sequence alignment by microhomology sampling during homologous recombination

    PubMed Central

    Qi, Zhi; Redding, Sy; Lee, Ja Yil; Gibb, Bryan; Kwon, YoungHo; Niu, Hengyao; Gaines, William A.; Sung, Patrick

    2015-01-01

    Summary Homologous recombination (HR) mediates the exchange of genetic information between sister or homologous chromatids. During HR, members of the RecA/Rad51 family of recombinases must somehow search through vast quantities of DNA sequence to align and pair ssDNA with a homologous dsDNA template. Here we use single-molecule imaging to visualize Rad51 as it aligns and pairs homologous DNA sequences in real-time. We show that Rad51 uses a length-based recognition mechanism while interrogating dsDNA, enabling robust kinetic selection of 8-nucleotide (nt) tracts of microhomology, which kinetically confines the search to sites with a high probability of being a homologous target. Successful pairing with a 9th nucleotide coincides with an additional reduction in binding free energy and subsequent strand exchange occurs in precise 3-nt steps, reflecting the base triplet organization of the presynaptic complex. These findings provide crucial new insights into the physical and evolutionary underpinnings of DNA recombination. PMID:25684365

  8. Influence of amine and thiol modifications at the 3' ends of single stranded DNA molecules on their adsorption on gold surface and the efficiency of their hybridization.

    PubMed

    Jaworska, Aleksandra; Jablonska, Anna; Wilanowski, Tomasz; Palys, Barbara; Sek, Slawomir; Kudelski, Andrzej

    2018-05-24

    Adsorption of molecules of DNA (deoxyribonucleic acid) or modified DNA on gold surfaces is often the first step in construction of many various biosensors, including biosensors for detection of DNA with a particular sequence. In this work we study the influence of amine and thiol modifications at the 3' ends of single stranded DNA (ssDNA) molecules on their adsorption on the surface of gold substrates and on the efficiency of hybridization of immobilized DNA with the complementary single stranded DNA. The characterization of formed layers has been carried out using infrared spectroscopy and atomic force microscopy. As model single stranded DNA we used DNA containing 20 adenine bases, whereas the complementary DNA contained 20 thymine bases. We found that the bands in polarization modulation-infrared reflection-adsorption spectroscopy (PM-IRRAS) spectra of layers formed from thiol-modified DNA are significantly narrower and sharper, indicating their higher regularity in the orientation of DNA on gold surface when using thiol linker. Also, hybridization of the layer of thiol-modified DNA containing 20 adenine bases with the respective DNA containing thymine bases leads to formation of much more organized structures than in the case of unmodified DNA or DNA with the amine linker. We conclude that the thiol-modified ssDNA is more promising for the preparation of biosensors, in comparison with the amine-modified or unmodified ssDNA. We have also found that the above-mentioned modifications at the 3' end of ssDNA significantly influence the IR spectrum (and hence the structure) of polycrystalline films formed from such compounds, even though adsorbed fragments contain less than 5% of the DNA chain. This effect should be taken into account when comparing IR spectra of various polycrystalline films formed from modified and unmodified DNA. Copyright © 2018. Published by Elsevier B.V.

  9. Drug-DNA interactions at single molecule level: A view with optical tweezers

    NASA Astrophysics Data System (ADS)

    Paramanathan, Thayaparan

    Studies of small molecule--DNA interactions are essential for developing new drugs for challenging diseases like cancer and HIV. The main idea behind developing these molecules is to target and inhibit the reproduction of the tumor cells and infected cells. We mechanically manipulate single DNA molecule using optical tweezers to investigate two molecules that have complex and multiple binding modes. Mononuclear ruthenium complexes have been extensively studied as a test for rational drug design. Potential drug candidates should have high affinity to DNA and slow dissociation kinetics. To achieve this, motifs of the ruthenium complexes are altered. Our collaborators designed a dumb-bell shaped binuclear ruthenium complex that can only intercalate DNA by threading through its bases. Studying the binding properties of this complex in bulk studies took hours. By mechanically manipulating a single DNA molecule held with optical tweezers, we lower the barrier to thread and make it fast compared to the bulk experiments. Stretching single DNA molecules with different concentration of drug molecules and holding it at a constant force allows the binding to reach equilibrium. By this we can obtain the equilibrium fractional ligand binding and length of DNA at saturated binding. Fitting these results yields quantitative measurements of the binding thermodynamics and kinetics of this complex process. The second complex discussed in this study is Actinomycin D (ActD), a well studied anti-cancer agent that is used as a prototype for developing new generations of drugs. However, the biophysical basis of its activity is still unclear. Because ActD is known to intercalate double stranded DNA (dsDNA), it was assumed to block replication by stabilizing dsDNA in front of the replication fork. However, recent studies have shown that ActD binds with even higher affinity to imperfect duplexes and some sequences of single stranded DNA (ssDNA). We directly measure the on and off rates by stretching the DNA molecule to a certain force and holding it at constant force while adding the drug and then while washing off the drug. Our finding resolves the long lasting controversy of ActD binding modes, clearly showing that both the dsDNA binding and ssDNA binding converge to the same single mode. The result supports the hypothesis that the primary characteristic of ActD that contributes to its biological activity is its ability to inhibit cellular replication by binding to transcription bubbles and causing cell death.

  10. Microwell Array Method for Rapid Generation of Uniform Agarose Droplets and Beads for Single Molecule Analysis.

    PubMed

    Li, Xingrui; Zhang, Dongfeng; Zhang, Huimin; Guan, Zhichao; Song, Yanling; Liu, Ruochen; Zhu, Zhi; Yang, Chaoyong

    2018-02-20

    Compartmentalization of aqueous samples in uniform emulsion droplets has proven to be a useful tool for many chemical, biological, and biomedical applications. Herein, we introduce an array-based emulsification method for rapid and easy generation of monodisperse agarose-in-oil droplets in a PDMS microwell array. The microwells are filled with agarose solution, and subsequent addition of hot oil results in immediate formation of agarose droplets due to the surface-tension of the liquid solution. Because droplet size is determined solely by the array unit dimensions, uniform droplets with preselectable diameters ranging from 20 to 100 μm can be produced with relative standard deviations less than 3.5%. The array-based droplet generation method was used to perform digital PCR for absolute DNA quantitation. The array-based droplet isolation and sol-gel switching property of agarose enable formation of stable beads by chilling the droplet array at -20 °C, thus, maintaining the monoclonality of each droplet and facilitating the selective retrieval of desired droplets. The monoclonality of droplets was demonstrated by DNA sequencing and FACS analysis, suggesting the robustness and flexibility of the approach for single molecule amplification and analysis. We believe our approach will lead to new possibilities for a great variety of applications, such as single-cell gene expression studies, aptamer selection, and oligonucleotide analysis.

  11. Capillary array scanner for time-resolved detection and identification of fluorescently labelled DNA fragments.

    PubMed

    Neumann, M; Herten, D P; Dietrich, A; Wolfrum, J; Sauer, M

    2000-02-25

    The first capillary array scanner for time-resolved fluorescence detection in parallel capillary electrophoresis based on semiconductor technology is described. The system consists essentially of a confocal fluorescence microscope and a x,y-microscope scanning stage. Fluorescence of the labelled probe molecules was excited using a short-pulse diode laser emitting at 640 nm with a repetition rate of 50 MHz. Using a single filter system the fluorescence decays of different labels were detected by an avalanche photodiode in combination with a PC plug-in card for time-correlated single-photon counting (TCSPC). The time-resolved fluorescence signals were analyzed and identified by a maximum likelihood estimator (MLE). The x,y-microscope scanning stage allows for discontinuous, bidirectional scanning of up to 16 capillaries in an array, resulting in longer fluorescence collection times per capillary compared to scanners working in a continuous mode. Synchronization of the alignment and measurement process were developed to allow for data acquisition without overhead. Detection limits in the subzeptomol range for different dye molecules separated in parallel capillaries have been achieved. In addition, we report on parallel time-resolved detection and separation of more than 400 bases of single base extension DNA fragments in capillary array electrophoresis. Using only semiconductor technology the presented technique represents a low-cost alternative for high throughput DNA sequencing in parallel capillaries.

  12. Direct observation of processive exoribonuclease motion using optical tweezers.

    PubMed

    Fazal, Furqan M; Koslover, Daniel J; Luisi, Ben F; Block, Steven M

    2015-12-08

    Bacterial RNases catalyze the turnover of RNA and are essential for gene expression and quality surveillance of transcripts. In Escherichia coli, the exoribonucleases RNase R and polynucleotide phosphorylase (PNPase) play critical roles in degrading RNA. Here, we developed an optical-trapping assay to monitor the translocation of individual enzymes along RNA-based substrates. Single-molecule records of motion reveal RNase R to be highly processive: one molecule can unwind over 500 bp of a structured substrate. However, enzyme progress is interrupted by pausing and stalling events that can slow degradation in a sequence-dependent fashion. We found that the distance traveled by PNPase through structured RNA is dependent on the A+U content of the substrate and that removal of its KH and S1 RNA-binding domains can reduce enzyme processivity without affecting the velocity. By a periodogram analysis of single-molecule records, we establish that PNPase takes discrete steps of six or seven nucleotides. These findings, in combination with previous structural and biochemical data, support an asymmetric inchworm mechanism for PNPase motion. The assay developed here for RNase R and PNPase is well suited to studies of other exonucleases and helicases.

  13. Characterizing protein domain associations by Small-molecule ligand binding

    PubMed Central

    Li, Qingliang; Cheng, Tiejun; Wang, Yanli; Bryant, Stephen H.

    2012-01-01

    Background Protein domains are evolutionarily conserved building blocks for protein structure and function, which are conventionally identified based on protein sequence or structure similarity. Small molecule binding domains are of great importance for the recognition of small molecules in biological systems and drug development. Many small molecules, including drugs, have been increasingly identified to bind to multiple targets, leading to promiscuous interactions with protein domains. Thus, a large scale characterization of the protein domains and their associations with respect to small-molecule binding is of particular interest to system biology research, drug target identification, as well as drug repurposing. Methods We compiled a collection of 13,822 physical interactions of small molecules and protein domains derived from the Protein Data Bank (PDB) structures. Based on the chemical similarity of these small molecules, we characterized pairwise associations of the protein domains and further investigated their global associations from a network point of view. Results We found that protein domains, despite lack of similarity in sequence and structure, were comprehensively associated through binding the same or similar small-molecule ligands. Moreover, we identified modules in the domain network that consisted of closely related protein domains by sharing similar biochemical mechanisms, being involved in relevant biological pathways, or being regulated by the same cognate cofactors. Conclusions A novel protein domain relationship was identified in the context of small-molecule binding, which is complementary to those identified by traditional sequence-based or structure-based approaches. The protein domain network constructed in the present study provides a novel perspective for chemogenomic study and network pharmacology, as well as target identification for drug repurposing. PMID:23745168

  14. PhAST: pharmacophore alignment search tool.

    PubMed

    Hähnke, Volker; Hofmann, Bettina; Grgat, Tomislav; Proschak, Ewgenij; Steinhilber, Dieter; Schneider, Gisbert

    2009-04-15

    We present a ligand-based virtual screening technique (PhAST) for rapid hit and lead structure searching in large compound databases. Molecules are represented as strings encoding the distribution of pharmacophoric features on the molecular graph. In contrast to other text-based methods using SMILES strings, we introduce a new form of text representation that describes the pharmacophore of molecules. This string representation opens the opportunity for revealing functional similarity between molecules by sequence alignment techniques in analogy to homology searching in protein or nucleic acid sequence databases. We favorably compared PhAST with other current ligand-based virtual screening methods in a retrospective analysis using the BEDROC metric. In a prospective application, PhAST identified two novel inhibitors of 5-lipoxygenase product formation with minimal experimental effort. This outcome demonstrates the applicability of PhAST to drug discovery projects and provides an innovative concept of sequence-based compound screening with substantial scaffold hopping potential. 2008 Wiley Periodicals, Inc.

  15. A septal chromosome segregator protein evolved into a conjugative DNA-translocator protein

    PubMed Central

    Sepulveda, Edgardo; Vogelmann, Jutta

    2011-01-01

    Streptomycetes, Gram-positive soil bacteria well known for the production of antibiotics feature a unique conjugative DNA transfer system. In contrast to classical conjugation which is characterized by the secretion of a pilot protein covalently linked to a single-stranded DNA molecule, in Streptomyces a double-stranded DNA molecule is translocated during conjugative transfer. This transfer involves a single plasmid encoded protein, TraB. A detailed biochemical and biophysical characterization of TraB, revealed a close relationship to FtsK, mediating chromosome segregation during bacterial cell division. TraB translocates plasmid DNA by recognizing 8-bp direct repeats located in a specific plasmid region clt. Similar sequences accidentally also occur on chromosomes and have been shown to be bound by TraB. We suggest that TraB mobilizes chromosomal genes by the interaction with these chromosomal clt-like sequences not relying on the integration of the conjugative plasmid into the chromosome. PMID:22479692

  16. Periodic Assembly of Nanospecies on Repetitive DNA Sequences Generated on Gold Nanoparticles by Rolling Circle Amplification

    NASA Astrophysics Data System (ADS)

    Zhao, Weian; Brook, Michael A.; Li, Yingfu

    Periodical assembly of nanospecies is desirable for the construction of nanodevices. We provide a protocol for the preparation of a gold nanoparticle (AuNP)/DNA scaffold on which nanospecies can be assembled in a periodical manner. AuNP/DNA scaffold is prepared by growing long single-stranded DNA (ssDNA) molecules (typically hundreds of nanometers to a few microns in length) on AuNPs via rolling circle amplification (RCA). Since these long ssDNA molecules contain many repetitive sequence units, complementary DNA-attached nanospecies can be assembled through specific hybridization in a controllable and periodical manner.

  17. Computer-Aided Design of RNA Origami Structures.

    PubMed

    Sparvath, Steffen L; Geary, Cody W; Andersen, Ebbe S

    2017-01-01

    RNA nanostructures can be used as scaffolds to organize, combine, and control molecular functionalities, with great potential for applications in nanomedicine and synthetic biology. The single-stranded RNA origami method allows RNA nanostructures to be folded as they are transcribed by the RNA polymerase. RNA origami structures provide a stable framework that can be decorated with functional RNA elements such as riboswitches, ribozymes, interaction sites, and aptamers for binding small molecules or protein targets. The rich library of RNA structural and functional elements combined with the possibility to attach proteins through aptamer-based binding creates virtually limitless possibilities for constructing advanced RNA-based nanodevices.In this chapter we provide a detailed protocol for the single-stranded RNA origami design method using a simple 2-helix tall structure as an example. The first step involves 3D modeling of a double-crossover between two RNA double helices, followed by decoration with tertiary motifs. The second step deals with the construction of a 2D blueprint describing the secondary structure and sequence constraints that serves as the input for computer programs. In the third step, computer programs are used to design RNA sequences that are compatible with the structure, and the resulting outputs are evaluated and converted into DNA sequences to order.

  18. DNA microdevice for electrochemical detection of Escherichia coli 0157:H7 molecular markers.

    PubMed

    Berganza, J; Olabarria, G; García, R; Verdoy, D; Rebollo, A; Arana, S

    2007-04-15

    An electrochemical DNA sensor based on the hybridization recognition of a single-stranded DNA (ssDNA) probe immobilized onto a gold electrode to its complementary ssDNA is presented. The DNA probe is bound on gold surface electrode by using self-assembled monolayer (SAM) technology. An optimized mixed SAM with a blocking molecule preventing the nonspecific adsorption on the electrode surface has been prepared. In this paper, a DNA biosensor is designed by means of the immobilization of a single stranded DNA probe on an electrochemical transducer surface to recognize specifically Escherichia coli (E. coli) 0157:H7 complementary target DNA sequence via cyclic voltammetry experiments. The 21 mer DNA probe including a C6 alkanethiol group at the 5' phosphate end has been synthesized to form the SAM onto the gold surface through the gold sulfur bond. The goal of this paper has been to design, characterise and optimise an electrochemical DNA sensor. In order to investigate the oligonucleotide probe immobilization and the hybridization detection, experiments with different concentration of DNA and mismatch sequences have been performed. This microdevice has demonstrated the suitability of oligonucleotide Self-assembled monolayers (SAMs) on gold as immobilization method. The DNA probes deposited on gold surface have been functional and able to detect changes in bases sequence in a 21-mer oligonucleotide.

  19. Biological nanopore MspA for DNA sequencing

    NASA Astrophysics Data System (ADS)

    Manrao, Elizabeth A.

    Unlocking the information hidden in the human genome provides insight into the inner workings of complex biological systems and can be used to greatly improve health-care. In order to allow for widespread sequencing, new technologies are required that provide fast and inexpensive readings of DNA. Nanopore sequencing is a third generation DNA sequencing technology that is currently being developed to fulfill this need. In nanopore sequencing, a voltage is applied across a small pore in an electrolyte solution and the resulting ionic current is recorded. When DNA passes through the channel, the ionic current is partially blocked. If the DNA bases uniquely modulate the ionic current flowing through the channel, the time trace of the current can be related to the sequence of DNA passing through the pore. There are two main challenges to realizing nanopore sequencing: identifying a pore with sensitivity to single nucleotides and controlling the translocation of DNA through the pore so that the small single nucleotide current signatures are distinguishable from background noise. In this dissertation, I explore the use of Mycobacterium smegmatis porin A (MspA) for nanopore sequencing. In order to determine MspA's sensitivity to single nucleotides, DNA strands of various compositions are held in the pore as the resulting ionic current is measured. DNA is immobilized in MspA by attaching it to a large molecule which acts as an anchor. This technique confirms the single nucleotide resolution of the pore and additionally shows that MspA is sensitive to epigenetic modifications and single nucleotide polymorphisms. The forces from the electric field within MspA, the effective charge of nucleotides, and elasticity of DNA are estimated using a Freely Jointed Chain model of single stranded DNA. These results offer insight into the interactions of DNA within the pore. With the nucleotide sensitivity of MspA confirmed, a method is introduced to controllably pass DNA through the pore. Using a DNA polymerase, DNA strands are stepped through MspA one nucleotide at a time. The steps are observable as distinct levels on the ionic-current time-trace and are related to the DNA sequence. These experiments overcome the two fundamental challenges to realizing MspA nanopore sequencing and pave the way to the development of a commercial technology.

  20. Single molecule photobleaching (SMPB) technology for counting of RNA, DNA, protein and other molecules in nanoparticles and biological complexes by TIRF instrumentation.

    PubMed

    Zhang, Hui; Guo, Peixuan

    2014-05-15

    Direct counting of biomolecules within biological complexes or nanomachines is demanding. Single molecule counting using optical microscopy is challenging due to the diffraction limit. The single molecule photobleaching (SMPB) technology for direct counting developed by our team (Shu et al., 2007 [18]; Zhang et al., 2007 [19]) offers a simple and straightforward method to determine the stoichiometry of molecules or subunits within biocomplexes or nanomachines at nanometer scales. Stoichiometry is determined by real-time observation of the number of descending steps resulted from the photobleaching of individual fluorophore. This technology has now been used extensively for single molecule counting of protein, RNA, and other macromolecules in a variety of complexes or nanostructures. Here, we elucidate the SMPB technology, using the counting of RNA molecules within a bacteriophage phi29 DNA-packaging biomotor as an example. The method described here can be applied to the single molecule counting of other molecules in other systems. The construction of a concise, simple and economical single molecule total internal reflection fluorescence (TIRF) microscope combining prism-type and objective-type TIRF is described. The imaging system contains a deep-cooled sensitive EMCCD camera with single fluorophore detection sensitivity, a laser combiner for simultaneous dual-color excitation, and a Dual-View™ imager to split the multiple outcome signals to different detector channels based on their wavelengths. Methodology of the single molecule photobleaching assay used to elucidate the stoichiometry of RNA on phi29 DNA packaging motor and the mechanism of protein/RNA interaction are described. Different methods for single fluorophore labeling of RNA molecules are reviewed. The process of statistical modeling to reveal the true copy number of the biomolecules based on binomial distribution is also described. Copyright © 2014 Elsevier Inc. All rights reserved.

  1. Single molecule real-time sequencing of Xanthomonas oryzae genomes reveals a dynamic structure and complex TAL (transcription activator-like) effector gene relationships

    PubMed Central

    Booher, Nicholas J.; Carpenter, Sara C. D.; Sebra, Robert P.; Wang, Li; Salzberg, Steven L.; Leach, Jan E.

    2015-01-01

    Pathogen-injected, direct transcriptional activators of host genes, TAL (transcription activator-like) effectors play determinative roles in plant diseases caused by Xanthomonas spp. A large domain of nearly identical, 33–35 aa repeats in each protein mediates DNA recognition. This modularity makes TAL effectors customizable and thus important also in biotechnology. However, the repeats render TAL effector (tal) genes nearly impossible to assemble using next-generation, short reads. Here, we demonstrate that long-read, single molecule real-time (SMRT) sequencing solves this problem. Taking an ensemble approach to first generate local, tal gene contigs, we correctly assembled de novo the genomes of two strains of the rice pathogen X. oryzae completed previously using the Sanger method and even identified errors in those references. Sequencing two more strains revealed a dynamic genome structure and a striking plasticity in tal gene content. Our results pave the way for population-level studies to inform resistance breeding, improve biotechnology and probe TAL effector evolution. PMID:27148456

  2. Dwell-Time Distribution, Long Pausing and Arrest of Single-Ribosome Translation through the mRNA Duplex.

    PubMed

    Xie, Ping

    2015-10-09

    Proteins in the cell are synthesized by a ribosome translating the genetic information encoded on the single-stranded messenger RNA (mRNA). It has been shown that the ribosome can also translate through the duplex region of the mRNA by unwinding the duplex. Here, based on our proposed model of the ribosome translation through the mRNA duplex we study theoretically the distribution of dwell times of the ribosome translation through the mRNA duplex under the effect of a pulling force externally applied to the ends of the mRNA to unzip the duplex. We provide quantitative explanations of the available single molecule experimental data on the distribution of dwell times with both short and long durations, on rescuing of the long paused ribosomes by raising the pulling force to unzip the duplex, on translational arrests induced by the mRNA duplex and Shine-Dalgarno(SD)-like sequence in the mRNA. The functional consequences of the pauses or arrests caused by the mRNA duplex and the SD sequence are discussed and compared with those obtained from other types of pausing, such as those induced by "hungry" codons or interactions of specific sequences in the nascent chain with the ribosomal exit tunnel.

  3. Dwell-Time Distribution, Long Pausing and Arrest of Single-Ribosome Translation through the mRNA Duplex

    PubMed Central

    Xie, Ping

    2015-01-01

    Proteins in the cell are synthesized by a ribosome translating the genetic information encoded on the single-stranded messenger RNA (mRNA). It has been shown that the ribosome can also translate through the duplex region of the mRNA by unwinding the duplex. Here, based on our proposed model of the ribosome translation through the mRNA duplex we study theoretically the distribution of dwell times of the ribosome translation through the mRNA duplex under the effect of a pulling force externally applied to the ends of the mRNA to unzip the duplex. We provide quantitative explanations of the available single molecule experimental data on the distribution of dwell times with both short and long durations, on rescuing of the long paused ribosomes by raising the pulling force to unzip the duplex, on translational arrests induced by the mRNA duplex and Shine-Dalgarno(SD)-like sequence in the mRNA. The functional consequences of the pauses or arrests caused by the mRNA duplex and the SD sequence are discussed and compared with those obtained from other types of pausing, such as those induced by “hungry” codons or interactions of specific sequences in the nascent chain with the ribosomal exit tunnel. PMID:26473825

  4. Phylogenomic relationship of feijoa (Acca sellowiana (O.Berg) Burret) with other Myrtaceae based on complete chloroplast genome sequences.

    PubMed

    Machado, Lilian de Oliveira; Vieira, Leila do Nascimento; Stefenon, Valdir Marcos; Oliveira Pedrosa, Fábio de; Souza, Emanuel Maltempi de; Guerra, Miguel Pedro; Nodari, Rubens Onofre

    2017-04-01

    Given their distribution, importance, and richness, Myrtaceae species comprise a model system for studying the evolution of tropical plant diversity. In addition, chloroplast (cp) genome sequencing is an efficient tool for phylogenetic relationship studies. Feijoa [Acca sellowiana (O. Berg) Burret; CN: pineapple-guava] is a Myrtaceae species that occurs naturally in southern Brazil and northern Uruguay. Feijoa is known for its exquisite perfume and flavorful fruits, pharmacological properties, ornamental value and increasing economic relevance. In the present work, we reported the complete cp genome of feijoa. The feijoa cp genome is a circular molecule of 159,370 bp with a quadripartite structure containing two single copy regions, a Large Single Copy region (LSC 88,028 bp) and a Small Single Copy region (SSC 18,598 bp) separated by Inverted Repeat regions (IRs 26,372 bp). The genome structure, gene order, GC content and codon usage are similar to those of typical angiosperm cp genomes. When compared to other cp genome sequences of Myrtaceae, feijoa showed closest relationship with pitanga (Eugenia uniflora L.). Furthermore, a comparison of pitanga synonymous (Ks) and nonsynonymous (Ka) substitution rates revealed extremely low values. Maximum Likelihood and Bayesian Inference analyses produced phylogenomic trees identical in topology. These trees supported monophyly of three Myrtoideae clades.

  5. A Single-Level Tunnel Model to Account for Electrical Transport through Single Molecule- and Self-Assembled Monolayer-based Junctions

    PubMed Central

    Garrigues, Alvar R.; Yuan, Li; Wang, Lejia; Mucciolo, Eduardo R.; Thompon, Damien; del Barco, Enrique; Nijhuis, Christian A.

    2016-01-01

    We present a theoretical analysis aimed at understanding electrical conduction in molecular tunnel junctions. We focus on discussing the validity of coherent versus incoherent theoretical formulations for single-level tunneling to explain experimental results obtained under a wide range of experimental conditions, including measurements in individual molecules connecting the leads of electromigrated single-electron transistors and junctions of self-assembled monolayers (SAM) of molecules sandwiched between two macroscopic contacts. We show that the restriction of transport through a single level in solid state junctions (no solvent) makes coherent and incoherent tunneling formalisms indistinguishable when only one level participates in transport. Similar to Marcus relaxation processes in wet electrochemistry, the thermal broadening of the Fermi distribution describing the electronic occupation energies in the electrodes accounts for the exponential dependence of the tunneling current on temperature. We demonstrate that a single-level tunnel model satisfactorily explains experimental results obtained in three different molecular junctions (both single-molecule and SAM-based) formed by ferrocene-based molecules. Among other things, we use the model to map the electrostatic potential profile in EGaIn-based SAM junctions in which the ferrocene unit is placed at different positions within the molecule, and we find that electrical screening gives rise to a strongly non-linear profile across the junction. PMID:27216489

  6. One-by-one single-molecule detection of mutated nucleobases by monitoring tunneling current using a DNA tip.

    PubMed

    Bui, Phuc Tan; Nishino, Tomoaki; Shiigi, Hiroshi; Nagaoka, Tsutomu

    2015-01-31

    A DNA molecule was utilized as a probe tip to achieve single-molecule genetic diagnoses. Hybridization of the probe and target DNAs resulted in electron tunneling along the emergent double-stranded DNA. Simple stationary monitoring of the tunneling current leads to single-molecule DNA detection and discovery of base mismatches and methylation.

  7. Droplet Microfluidics for Compartmentalized Cell Lysis and Extension of DNA from Single-Cells

    NASA Astrophysics Data System (ADS)

    Zimny, Philip; Juncker, David; Reisner, Walter

    Current single cell DNA analysis methods suffer from (i) bias introduced by the need for molecular amplification and (ii) limited ability to sequence repetitive elements, resulting in (iii) an inability to obtain information regarding long range genomic features. Recent efforts to circumvent these limitations rely on techniques for sensing single molecules of DNA extracted from single-cells. Here we demonstrate a droplet microfluidic approach for encapsulation and biochemical processing of single-cells inside alginate microparticles. In our approach, single-cells are first packaged inside the alginate microparticles followed by cell lysis, DNA purification, and labeling steps performed off-chip inside this microparticle system. The alginate microparticles are then introduced inside a micro/nanofluidic system where the alginate is broken down via a chelating buffer, releasing long DNA molecules which are then extended inside nanofluidic channels for analysis via standard mapping protocols.

  8. Sequence Dependencies of DNA Deformability and Hydration in the Minor Groove

    PubMed Central

    Yonetani, Yoshiteru; Kono, Hidetoshi

    2009-01-01

    Abstract DNA deformability and hydration are both sequence-dependent and are essential in specific DNA sequence recognition by proteins. However, the relationship between the two is not well understood. Here, systematic molecular dynamics simulations of 136 DNA sequences that differ from each other in their central tetramer revealed that sequence dependence of hydration is clearly correlated with that of deformability. We show that this correlation can be illustrated by four typical cases. Most rigid basepair steps are highly likely to form an ordered hydration pattern composed of one water molecule forming a bridge between the bases of distinct strands, but a few exceptions favor another ordered hydration composed of two water molecules forming such a bridge. Steps with medium deformability can display both of these hydration patterns with frequent transition. Highly flexible steps do not have any stable hydration pattern. A detailed picture of this correlation demonstrates that motions of hydration water molecules and DNA bases are tightly coupled with each other at the atomic level. These results contribute to our understanding of the entropic contribution from water molecules in protein or drug binding and could be applied for the purpose of predicting binding sites. PMID:19686662

  9. WS2 nanopores for molecule analysis

    NASA Astrophysics Data System (ADS)

    Danda, Gopinath; Masih Das, Paul; Chou, Yung-Chien; Mlack, Jerome; Naylor, Carl; Perea-Lopez, Nestor; Lin, Zhong; Fulton, Laura Beth; Terrones, Mauricio; Johnson, A. T. Charlie; Drndic, Marija

    Atomically thin 2D materials like graphene and transition metal dichalcogenides (TMDs) are interesting as membranes in solid state nanopore sensors for DNA analysis as they may facilitate single base resolution sequencing. These materials also exhibit unique optical and electronic properties which may be exploited to enhance the functionality of nanopore sensors. Here, we report WS2 nanopores, fabricated using a focused TEM beam. We also report their controlled laser-induced expansion in ionic solution. This study demonstrates the possibility of dynamic control of nanopore characteristics optically. NIH Grant R21HG007856, NSF EFRI-1542707.

  10. Extracting physics of life at the molecular level: A review of single-molecule data analyses.

    PubMed

    Colomb, Warren; Sarkar, Susanta K

    2015-06-01

    Studying individual biomolecules at the single-molecule level has proved very insightful recently. Single-molecule experiments allow us to probe both the equilibrium and nonequilibrium properties as well as make quantitative connections with ensemble experiments and equilibrium thermodynamics. However, it is important to be careful about the analysis of single-molecule data because of the noise present and the lack of theoretical framework for processes far away from equilibrium. Biomolecular motion, whether it is free in solution, on a substrate, or under force, involves thermal fluctuations in varying degrees, which makes the motion noisy. In addition, the noise from the experimental setup makes it even more complex. The details of biologically relevant interactions, conformational dynamics, and activities are hidden in the noisy single-molecule data. As such, extracting biological insights from noisy data is still an active area of research. In this review, we will focus on analyzing both fluorescence-based and force-based single-molecule experiments and gaining biological insights at the single-molecule level. Inherently nonequilibrium nature of biological processes will be highlighted. Simulated trajectories of biomolecular diffusion will be used to compare and validate various analysis techniques. Copyright © 2015 Elsevier B.V. All rights reserved.

  11. Tunable graphene quantum point contact transistor for DNA detection and characterization

    PubMed Central

    Girdhar, Anuj; Sathe, Chaitanya; Schulten, Klaus; Leburton, Jean-Pierre

    2015-01-01

    A graphene membrane conductor containing a nanopore in a quantum point contact (QPC) geometry is a promising candidate to sense, and potentially sequence, DNA molecules translocating through the nanopore. Within this geometry, the shape, size, and position of the nanopore as well as the edge configuration influences the membrane conductance caused by the electrostatic interaction between the DNA nucleotides and the nanopore edge. It is shown that the graphene conductance variations resulting from DNA translocation can be enhanced by choosing a particular geometry as well as by modulating the graphene Fermi energy, which demonstrates the ability to detect conformational transformations of a double-stranded DNA, as well as the passage of individual base pairs of a single-stranded DNA molecule through the nanopore. PMID:25765702

  12. An atypical CNG channel activated by a single cGMP molecule controls sperm chemotaxis.

    PubMed

    Bönigk, Wolfgang; Loogen, Astrid; Seifert, Reinhard; Kashikar, Nachiket; Klemm, Clementine; Krause, Eberhard; Hagen, Volker; Kremmer, Elisabeth; Strünker, Timo; Kaupp, U Benjamin

    2009-10-27

    Sperm of the sea urchin Arbacia punctulata can respond to a single molecule of chemoattractant released by an egg. The mechanism underlying this extreme sensitivity is unknown. Crucial signaling events in the response of A. punctulata sperm to chemoattractant include the rapid synthesis of the intracellular messenger guanosine 3',5'-monophosphate (cGMP) and the ensuing membrane hyperpolarization that results from the opening of potassium-selective cyclic nucleotide-gated (CNGK) channels. Here, we use calibrated photolysis of caged cGMP to show that approximately 45 cGMP molecules are generated during the response to a single molecule of chemoattractant. The CNGK channel can respond to such small cGMP changes because it is exquisitely sensitive to cGMP and activated in a noncooperative fashion. Like voltage-activated Ca(v) and Na(v) channels, the CNGK polypeptide consists of four homologous repeat sequences. Disabling each of the four cyclic nucleotide-binding sites through mutagenesis revealed that binding of a single cGMP molecule to repeat 3 is necessary and sufficient to activate the CNGK channel. Thus, CNGK has developed a mechanism of activation that is different from the activation of other CNG channels, which requires the cooperative binding of several ligands and operates in the micromolar rather than the nanomolar range.

  13. High speed nucleic acid sequencing

    DOEpatents

    Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid. Each type of labeled nucleotide comprises an acceptor fluorophore attached to a phosphate portion of the nucleotide such that the fluorophore is removed upon incorporation into a growing strand. Fluorescent signal is emitted via fluorescent resonance energy transfer between the donor fluorophore and the acceptor fluorophore as each nucleotide is incorporated into the growing strand. The sequence is deduced by identifying which base is being incorporated into the growing strand.

  14. Superresolution Imaging using Single-Molecule Localization

    PubMed Central

    Patterson, George; Davidson, Michael; Manley, Suliana; Lippincott-Schwartz, Jennifer

    2013-01-01

    Superresolution imaging is a rapidly emerging new field of microscopy that dramatically improves the spatial resolution of light microscopy by over an order of magnitude (∼10–20-nm resolution), allowing biological processes to be described at the molecular scale. Here, we discuss a form of superresolution microscopy based on the controlled activation and sampling of sparse subsets of photoconvertible fluorescent molecules. In this single-molecule based imaging approach, a wide variety of probes have proved valuable, ranging from genetically encodable photoactivatable fluorescent proteins to photoswitchable cyanine dyes. These have been used in diverse applications of superresolution imaging: from three-dimensional, multicolor molecule localization to tracking of nanometric structures and molecules in living cells. Single-molecule-based superresolution imaging thus offers exciting possibilities for obtaining molecular-scale information on biological events occurring at variable timescales. PMID:20055680

  15. All gene-sized DNA molecules in four species of hypotrichs have the same terminal sequence and an unusual 3' terminus.

    PubMed Central

    Klobutcher, L A; Swanton, M T; Donini, P; Prescott, D M

    1981-01-01

    In hypotrichous ciliates, all of the macronuclear DNA is in the form of low molecular weight molecules with an average size of approximately 2200 base pairs. Total macronuclear DNA from four hypotrichs has been shown to have inverted terminal repeats by direct sequence analysis. In Oxytricha nova, Oxytricha sp., and Stylonychia pustulata, this terminal sequence may be written as 5'-C4A4C4A4C4 ... 3'-G4T4G4T4G4T4G4T4G4 ... In Euplotes aediculatus, the sequences is similar but differs in the lengths of the duplex region (28 base pairs) and of the putative 3' extension (14 base pairs). Also in Euplotes, a second common sequence of 5 base pairs (A-A-C-T-T-T-T-G-A-A) occurs internal to the terminal repeat and a 17-base-pair heterogeneous region: 5'-C4A4C4A4C4A4C4(X)17T-T-G-A-A ... 3'-G2T4G4T4G4T4G4T4G4T4G4(X)17A-A-C-T-T ... The length of the terminal repeat sequence for O. nova was confirmed in cloned macronuclear DNA molecules. Images PMID:6265931

  16. Molecular sled sequences are common in mammalian proteins.

    PubMed

    Xiong, Kan; Blainey, Paul C

    2016-03-18

    Recent work revealed a new class of molecular machines called molecular sleds, which are small basic molecules that bind and slide along DNA with the ability to carry cargo along DNA. Here, we performed biochemical and single-molecule flow stretching assays to investigate the basis of sliding activity in molecular sleds. In particular, we identified the functional core of pVIc, the first molecular sled characterized; peptide functional groups that control sliding activity; and propose a model for the sliding activity of molecular sleds. We also observed widespread DNA binding and sliding activity among basic polypeptide sequences that implicate mammalian nuclear localization sequences and many cell penetrating peptides as molecular sleds. These basic protein motifs exhibit weak but physiologically relevant sequence-nonspecific DNA affinity. Our findings indicate that many mammalian proteins contain molecular sled sequences and suggest the possibility that substantial undiscovered sliding activity exists among nuclear mammalian proteins. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  17. RNAiFold 2.0: a web server and software to design custom and Rfam-based RNA molecules.

    PubMed

    Garcia-Martin, Juan Antonio; Dotu, Ivan; Clote, Peter

    2015-07-01

    Several algorithms for RNA inverse folding have been used to design synthetic riboswitches, ribozymes and thermoswitches, whose activity has been experimentally validated. The RNAiFold software is unique among approaches for inverse folding in that (exhaustive) constraint programming is used instead of heuristic methods. For that reason, RNAiFold can generate all sequences that fold into the target structure or determine that there is no solution. RNAiFold 2.0 is a complete overhaul of RNAiFold 1.0, rewritten from the now defunct COMET language to C++. The new code properly extends the capabilities of its predecessor by providing a user-friendly pipeline to design synthetic constructs having the functionality of given Rfam families. In addition, the new software supports amino acid constraints, even for proteins translated in different reading frames from overlapping coding sequences; moreover, structure compatibility/incompatibility constraints have been expanded. With these features, RNAiFold 2.0 allows the user to design single RNA molecules as well as hybridization complexes of two RNA molecules. the web server, source code and linux binaries are publicly accessible at http://bioinformatics.bc.edu/clotelab/RNAiFold2.0. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  18. Nothing in Evolution Makes Sense Except in the Light of Genomics: Read-Write Genome Evolution as an Active Biological Process.

    PubMed

    Shapiro, James A

    2016-06-08

    The 21st century genomics-based analysis of evolutionary variation reveals a number of novel features impossible to predict when Dobzhansky and other evolutionary biologists formulated the neo-Darwinian Modern Synthesis in the middle of the last century. These include three distinct realms of cell evolution; symbiogenetic fusions forming eukaryotic cells with multiple genome compartments; horizontal organelle, virus and DNA transfers; functional organization of proteins as systems of interacting domains subject to rapid evolution by exon shuffling and exonization; distributed genome networks integrated by mobile repetitive regulatory signals; and regulation of multicellular development by non-coding lncRNAs containing repetitive sequence components. Rather than single gene traits, all phenotypes involve coordinated activity by multiple interacting cell molecules. Genomes contain abundant and functional repetitive components in addition to the unique coding sequences envisaged in the early days of molecular biology. Combinatorial coding, plus the biochemical abilities cells possess to rearrange DNA molecules, constitute a powerful toolbox for adaptive genome rewriting. That is, cells possess "Read-Write Genomes" they alter by numerous biochemical processes capable of rapidly restructuring cellular DNA molecules. Rather than viewing genome evolution as a series of accidental modifications, we can now study it as a complex biological process of active self-modification.

  19. Nothing in Evolution Makes Sense Except in the Light of Genomics: Read–Write Genome Evolution as an Active Biological Process

    PubMed Central

    Shapiro, James A.

    2016-01-01

    The 21st century genomics-based analysis of evolutionary variation reveals a number of novel features impossible to predict when Dobzhansky and other evolutionary biologists formulated the neo-Darwinian Modern Synthesis in the middle of the last century. These include three distinct realms of cell evolution; symbiogenetic fusions forming eukaryotic cells with multiple genome compartments; horizontal organelle, virus and DNA transfers; functional organization of proteins as systems of interacting domains subject to rapid evolution by exon shuffling and exonization; distributed genome networks integrated by mobile repetitive regulatory signals; and regulation of multicellular development by non-coding lncRNAs containing repetitive sequence components. Rather than single gene traits, all phenotypes involve coordinated activity by multiple interacting cell molecules. Genomes contain abundant and functional repetitive components in addition to the unique coding sequences envisaged in the early days of molecular biology. Combinatorial coding, plus the biochemical abilities cells possess to rearrange DNA molecules, constitute a powerful toolbox for adaptive genome rewriting. That is, cells possess “Read–Write Genomes” they alter by numerous biochemical processes capable of rapidly restructuring cellular DNA molecules. Rather than viewing genome evolution as a series of accidental modifications, we can now study it as a complex biological process of active self-modification. PMID:27338490

  20. Single-molecule optical genome mapping of a human HapMap and a colorectal cancer cell line.

    PubMed

    Teo, Audrey S M; Verzotto, Davide; Yao, Fei; Nagarajan, Niranjan; Hillmer, Axel M

    2015-01-01

    Next-generation sequencing (NGS) technologies have changed our understanding of the variability of the human genome. However, the identification of genome structural variations based on NGS approaches with read lengths of 35-300 bases remains a challenge. Single-molecule optical mapping technologies allow the analysis of DNA molecules of up to 2 Mb and as such are suitable for the identification of large-scale genome structural variations, and for de novo genome assemblies when combined with short-read NGS data. Here we present optical mapping data for two human genomes: the HapMap cell line GM12878 and the colorectal cancer cell line HCT116. High molecular weight DNA was obtained by embedding GM12878 and HCT116 cells, respectively, in agarose plugs, followed by DNA extraction under mild conditions. Genomic DNA was digested with KpnI and 310,000 and 296,000 DNA molecules (≥ 150 kb and 10 restriction fragments), respectively, were analyzed per cell line using the Argus optical mapping system. Maps were aligned to the human reference by OPTIMA, a new glocal alignment method. Genome coverage of 6.8× and 5.7× was obtained, respectively; 2.9× and 1.7× more than the coverage obtained with previously available software. Optical mapping allows the resolution of large-scale structural variations of the genome, and the scaffold extension of NGS-based de novo assemblies. OPTIMA is an efficient new alignment method; our optical mapping data provide a resource for genome structure analyses of the human HapMap reference cell line GM12878, and the colorectal cancer cell line HCT116.

  1. Optical-nanofiber-based interface for single molecules

    NASA Astrophysics Data System (ADS)

    Skoff, Sarah M.; Papencordt, David; Schauffert, Hardy; Bayer, Bernhard C.; Rauschenbeutel, Arno

    2018-04-01

    Optical interfaces for quantum emitters are a prerequisite for implementing quantum networks. Here, we couple single molecules to the guided modes of an optical nanofiber. The molecules are embedded within a crystal that provides photostability and, due to the inhomogeneous broadening, a means to spectrally address single molecules. Single molecules are excited and detected solely via the nanofiber interface without the requirement of additional optical access. In this way, we realize a fully fiber-integrated system that is scalable and may become a versatile constituent for quantum hybrid systems.

  2. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F. William

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.

  3. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F.W.

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.

  4. Giant light-harvesting nanoantenna for single-molecule detection in ambient light

    PubMed Central

    Trofymchuk, Kateryna; Reisch, Andreas; Didier, Pascal; Fras, François; Gilliot, Pierre; Mely, Yves; Klymchenko, Andrey S.

    2017-01-01

    Here, we explore the enhancement of single molecule emission by polymeric nano-antenna that can harvest energy from thousands of donor dyes to a single acceptor. In this nano-antenna, the cationic dyes are brought together in very close proximity using bulky counterions, thus enabling ultrafast diffusion of excitation energy (≤30 fs) with minimal losses. Our 60-nm nanoparticles containing >10,000 rhodamine-based donor dyes can efficiently transfer energy to 1-2 acceptors resulting in an antenna effect of ~1,000. Therefore, single Cy5-based acceptors become 25-fold brighter than quantum dots QD655. This unprecedented amplification of the acceptor dye emission enables observation of single molecules at illumination powers (1-10 mW cm-2) that are >10,000-fold lower than typically required in single-molecule measurements. Finally, using a basic setup, which includes a 20X air objective and a sCMOS camera, we could detect single Cy5 molecules by simply shining divergent light on the sample at powers equivalent to sunlight. PMID:28983324

  5. Dock ’n Roll: Folding of a Silk-Inspired Polypeptide into an Amyloid-like Beta Solenoid

    PubMed Central

    Zhao, Binwu; Cohen Stuart, Martien A.; Hall, Carol K.

    2016-01-01

    Polypeptides containing the motif ((GA)mGX)n occur in silk (we refer to them as ‘silk-like’) and have a strong tendency to self-assemble. For example, polypeptides containing (GAGAGAGX)n, where X = G or H have been observed to form filaments; similar sequences but with X = Q have been used in the design of coat proteins (capsids) for artificial viruses. The structure of the (GAGAGAGX)m filaments has been proposed to be a stack of peptides in a β roll structure with the hydrophobic side chains pointing outwards (hydrophobic shell). Another possible configuration, a β roll or β solenoid structure which has its hydrophobic side chains buried inside (hydrophobic core) was, however, overlooked. We perform ground state analysis as well as atomic-level molecular dynamics simulations, both on single molecules and on two-molecule stacks of the silk-inspired sequence (GAGAGAGQ)10, to decide whether the hydrophobic core or the hydrophobic shell configuration is the most stable one. We find that a stack of two hydrophobic core molecules is energetically more favorable than a stack of two shell molecules. A shell molecule initially placed in a perfect β roll structure tends to rotate its strands, breaking in-plane hydrogen bonds and forming out-of-plane hydrogen bonds, while a core molecule stays in the β roll structure. The hydrophobic shell structure has type II’ β turns whereas the core configuration has type II β turns; only the latter secondary structure agrees well with solid-state NMR experiments on a similar sequence (GA)15. We also observe that the core stack has a higher number of intra-molecular hydrogen bonds and a higher number of hydrogen bonds between stack and water than the shell stack. Hence, we conclude that the hydrophobic core configuration is the most likely structure. In the stacked state, each peptide has more intra-molecular hydrogen bonds than a single folded molecule, which suggests that stacking provides the extra stability needed for molecules to reach the folded state. PMID:26947809

  6. Virology: The Next Generation from Digital PCR to Single Virion Genomics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    White, Richard A.; Brazelton De Cardenas, Jessica N.; Hayden, Randall T.

    In the past 25 years, virology has had major technology breakthroughs stemming first from the introduction of nucleic acid amplification testing, but more recently from the use of next-generation sequencing, digital PCR, and the possibility of single virion genomics. These technologies have and will improve diagnosis and disease state monitoring in clinical settings, aid in environmental monitoring, and reveal the vast genetic potential of viruses. Using the principle of limiting dilution, digital PCR amplifies single molecules of DNA in highly partitioned endpoint reactions and reads each of those reactions as either positive or negative based on the presence or absencemore » of target fluorophore. In this review, digital PCR will be highlighted along with current studies, advantages/disadvantages, and future perspectives with regard to digital PCR, viral load testing, and the possibility of single virion genomics.« less

  7. Rtools: a web server for various secondary structural analyses on single RNA sequences.

    PubMed

    Hamada, Michiaki; Ono, Yukiteru; Kiryu, Hisanori; Sato, Kengo; Kato, Yuki; Fukunaga, Tsukasa; Mori, Ryota; Asai, Kiyoshi

    2016-07-08

    The secondary structures, as well as the nucleotide sequences, are the important features of RNA molecules to characterize their functions. According to the thermodynamic model, however, the probability of any secondary structure is very small. As a consequence, any tool to predict the secondary structures of RNAs has limited accuracy. On the other hand, there are a few tools to compensate the imperfect predictions by calculating and visualizing the secondary structural information from RNA sequences. It is desirable to obtain the rich information from those tools through a friendly interface. We implemented a web server of the tools to predict secondary structures and to calculate various structural features based on the energy models of secondary structures. By just giving an RNA sequence to the web server, the user can get the different types of solutions of the secondary structures, the marginal probabilities such as base-paring probabilities, loop probabilities and accessibilities of the local bases, the energy changes by arbitrary base mutations as well as the measures for validations of the predicted secondary structures. The web server is available at http://rtools.cbrc.jp, which integrates software tools, CentroidFold, CentroidHomfold, IPKnot, CapR, Raccess, Rchange and RintD. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  8. DNA interrogation by the CRISPR RNA-guided endonuclease Cas9.

    PubMed

    Sternberg, Samuel H; Redding, Sy; Jinek, Martin; Greene, Eric C; Doudna, Jennifer A

    2014-03-06

    The clustered regularly interspaced short palindromic repeats (CRISPR)-associated enzyme Cas9 is an RNA-guided endonuclease that uses RNA-DNA base-pairing to target foreign DNA in bacteria. Cas9-guide RNA complexes are also effective genome engineering agents in animals and plants. Here we use single-molecule and bulk biochemical experiments to determine how Cas9-RNA interrogates DNA to find specific cleavage sites. We show that both binding and cleavage of DNA by Cas9-RNA require recognition of a short trinucleotide protospacer adjacent motif (PAM). Non-target DNA binding affinity scales with PAM density, and sequences fully complementary to the guide RNA but lacking a nearby PAM are ignored by Cas9-RNA. Competition assays provide evidence that DNA strand separation and RNA-DNA heteroduplex formation initiate at the PAM and proceed directionally towards the distal end of the target sequence. Furthermore, PAM interactions trigger Cas9 catalytic activity. These results reveal how Cas9 uses PAM recognition to quickly identify potential target sites while scanning large DNA molecules, and to regulate scission of double-stranded DNA.

  9. DNA interrogation by the CRISPR RNA-guided endonuclease Cas9

    NASA Astrophysics Data System (ADS)

    Sternberg, Samuel H.; Redding, Sy; Jinek, Martin; Greene, Eric C.; Doudna, Jennifer A.

    2014-03-01

    The clustered regularly interspaced short palindromic repeats (CRISPR)-associated enzyme Cas9 is an RNA-guided endonuclease that uses RNA-DNA base-pairing to target foreign DNA in bacteria. Cas9-guide RNA complexes are also effective genome engineering agents in animals and plants. Here we use single-molecule and bulk biochemical experiments to determine how Cas9-RNA interrogates DNA to find specific cleavage sites. We show that both binding and cleavage of DNA by Cas9-RNA require recognition of a short trinucleotide protospacer adjacent motif (PAM). Non-target DNA binding affinity scales with PAM density, and sequences fully complementary to the guide RNA but lacking a nearby PAM are ignored by Cas9-RNA. Competition assays provide evidence that DNA strand separation and RNA-DNA heteroduplex formation initiate at the PAM and proceed directionally towards the distal end of the target sequence. Furthermore, PAM interactions trigger Cas9 catalytic activity. These results reveal how Cas9 uses PAM recognition to quickly identify potential target sites while scanning large DNA molecules, and to regulate scission of double-stranded DNA.

  10. Manipulation of oligonucleotides immobilized on solid supports - DNA computations on surfaces

    NASA Astrophysics Data System (ADS)

    Liu, Qinghua

    The manipulation of DNA oligonucleotides immobilized on various solid supports has been studied intensively, especially in the area of surface hybridization. Recently, surface-based biotechnology has been applied to the area of molecular computing. These surface-based methods have advantages with regard to ease of handling, facile purification, and less interference when compared to solution methodologies. This dissertation describes the investigation of molecular approaches to DNA computing. The feasibility of encoding a bit (0 or 1) of information for DNA-based computations at the single nucleotide level was studied, particularly with regard to the efficiency and specificity of hybridization discrimination. Both gold and glass surfaces, with addressed arrays of 32 oligonucleotides, were employed with similar hybridization results. Although single-base discrimination may be achieved in the system, it is at the cost of a severe decrease in the efficiency of hybridization to perfectly matched sequences. This compromises the utility of single nucleotide encoding for DNA computing applications in the absence of some additional mechanism for increasing specificity. Several methods are suggested including a multiple-base encoding strategy. The multiple-base encoding strategy was employed to develop a prototype DNA computer. The approach was demonstrated by solving a small example of the Satisfiability (SAT) problem, an NP-complete problem in Boolean logic. 16 distinct DNA oligonucleotides, encoding all candidate solutions to the 4-variable-4-clause-3-SAT problem, were immobilized on a gold surface in the non-addressed format. Four cycles of MARK (hybridization), DESTROY (enzymatic destruction) and UNMARK (denaturation) were performed, which identified and eliminated members of the set which were not solutions to the problem. Determination of the answer was accomplished in the READOUT (sequence identification) operation by PCR amplification of the remaining molecules and hybridization to an addressed array. Four answers were determined and the S/N ratio between correct and incorrect solutions ranged from 10 to 777, making discrimination between correct and incorrect solutions to the problem straightforward. Additionally, studies of enzymatic manipulations of DNA molecules on surfaces suggested the use of E. coli Exonuclease I (Exo I) and perhaps EarI in the DESTROY operation.

  11. Organization of 'nanocrystal molecules' using DNA

    NASA Astrophysics Data System (ADS)

    Alivisatos, A. Paul; Johnsson, Kai P.; Peng, Xiaogang; Wilson, Troy E.; Loweth, Colin J.; Bruchez, Marcel P.; Schultz, Peter G.

    1996-08-01

    PATTERNING matter on the nanometre scale is an important objective of current materials chemistry and physics. It is driven by both the need to further miniaturize electronic components and the fact that at the nanometre scale, materials properties are strongly size-dependent and thus can be tuned sensitively1. In nanoscale crystals, quantum size effects and the large number of surface atoms influence the, chemical, electronic, magnetic and optical behaviour2-4. 'Top-down' (for example, lithographic) methods for nanoscale manipulation reach only to the upper end of the nanometre regime5; but whereas 'bottom-up' wet chemical techniques allow for the preparation of mono-disperse, defect-free crystallites just 1-10 nm in size6-10, ways to control the structure of nanocrystal assemblies are scarce. Here we describe a strategy for the synthesis of'nanocrystal molecules', in which discrete numbers of gold nanocrystals are organized into spatially defined structures based on Watson-Crick base-pairing interactions. We attach single-stranded DNA oligonucleotides of defined length and sequence to individual nanocrystals, and these assemble into dimers and trimers on addition of a complementary single-stranded DNA template. We anticipate that this approach should allow the construction of more complex two-and three-dimensional assemblies.

  12. Computer simulation of gene detection without PCR by single molecule detection

    NASA Astrophysics Data System (ADS)

    Davis, Lloyd M.; Williams, John G.; Lamb, Don T.

    1999-01-01

    Pioneer Hi-Bred is developing a low-cost method for rapid screening of DNA, for use in research on elite crop seed genetics. Unamplified genomic DNA with the requisite base sequence is simultaneously labeled by two different colored fluorescent probes, which hybridize near the selected gene. Dual-channel single molecule detection (SMD) within a flow cell, then provides a sensitive and specific assay for the gene. The technique has been demonstrated using frequency- doubled Nd:YAG laser excitation of two visible-wavelength dyes. A prototype instrument employing infrared fluorophores and laser diodes for excitation has been developed. Here, we report results from a Monte Carlo simulation of the new instrument, in which experimentally determined photophysical parameters for candidate infrared dyes are used for parametric studies of experimental operating conditions. Fluorophore photostability is found to be a key factor in determining the instrument sensitivity. Most infrared dyes have poor photostability, resulting in inefficient SMD. However, the normalized cross-correlation function of the photon signals from each of the two channels can still yield a discernable peak, provided that the concentration of dual- labeled molecules is sufficiently high. Further, for low concentrations, processing of the two photon streams with Gaussian -weighted sliding sum digital filters and selection of simultaneously occurring peaks can also provide a sensitive indicator of the presence of dual-labeled molecules, although accidental coincidences must be considered in the interpretation of results.

  13. Atomic force microscope observation of branching in single transcript molecules derived from human cardiac muscle

    NASA Astrophysics Data System (ADS)

    Reed, Jason; Hsueh, Carlin; Mishra, Bud; Gimzewski, James K.

    2008-09-01

    We have used an atomic force microscope to examine a clinically derived sample of single-molecule gene transcripts, in the form of double-stranded cDNA, (c: complementary) obtained from human cardiac muscle without the use of polymerase chain reaction (PCR) amplification. We observed a log-normal distribution of transcript sizes, with most molecules being in the range of 0.4-7.0 kilobase pairs (kb) or 130-2300 nm in contour length, in accordance with the expected distribution of mRNA (m: messenger) sizes in mammalian cells. We observed novel branching structures not previously known to exist in cDNA, and which could have profound negative effects on traditional analysis of cDNA samples through cloning, PCR and DNA sequencing.

  14. Yeast prion architecture explains how proteins can be genes

    NASA Astrophysics Data System (ADS)

    Wickner, Reed

    2013-03-01

    Prions (infectious proteins) transmit information without an accompanying DNA or RNA. Most yeast prions are self-propagating amyloids that inactivate a normally functional protein. A single protein can become any of several prion variants, with different manifestations due to different amyloid structures. We showed that the yeast prion amyloids of Ure2p, Sup35p and Rnq1p are folded in-register parallel beta sheets using solid state NMR dipolar recoupling experiments, mass-per-filament-length measurements, and filament diameter measurements. The extent of beta sheet structure, measured by chemical shifts in solid-state NMR and acquired protease-resistance on amyloid formation, combined with the measured filament diameters, imply that the beta sheets must be folded along the long axis of the filament. We speculate that prion variants of a single protein sequence differ in the location of these folds. Favorable interactions between identical side chains must hold these structures in-register. The same interactions must guide an unstructured monomer joining the end of a filament to assume the same conformation as molecules already in the filament, with the turns at the same locations. In this way, a protein can template its own conformation, in analogy to the ability of a DNA molecule to template its sequence by specific base-pairing. Bldg. 8, Room 225, NIH, 8 Center Drive MSC 0830, Bethesda, MD 20892-0830, wickner@helix.nih.gov, 301-496-3452

  15. Single molecule characterization of DNA binding and strand displacement reactions on lithographic DNA origami microarrays.

    PubMed

    Scheible, Max B; Pardatscher, Günther; Kuzyk, Anton; Simmel, Friedrich C

    2014-03-12

    The combination of molecular self-assembly based on the DNA origami technique with lithographic patterning enables the creation of hierarchically ordered nanosystems, in which single molecules are positioned at precise locations on multiple length scales. Based on a hybrid assembly protocol utilizing DNA self-assembly and electron-beam lithography on transparent glass substrates, we here demonstrate a DNA origami microarray, which is compatible with the requirements of single molecule fluorescence and super-resolution microscopy. The spatial arrangement allows for a simple and reliable identification of single molecule events and facilitates automated read-out and data analysis. As a specific application, we utilize the microarray to characterize the performance of DNA strand displacement reactions localized on the DNA origami structures. We find considerable variability within the array, which results both from structural variations and stochastic reaction dynamics prevalent at the single molecule level.

  16. Deciphering hierarchical features in the energy landscape of adenylate kinase folding/unfolding

    NASA Astrophysics Data System (ADS)

    Taylor, J. Nicholas; Pirchi, Menahem; Haran, Gilad; Komatsuzaki, Tamiki

    2018-03-01

    Hierarchical features of the energy landscape of the folding/unfolding behavior of adenylate kinase, including its dependence on denaturant concentration, are elucidated in terms of single-molecule fluorescence resonance energy transfer (smFRET) measurements in which the proteins are encapsulated in a lipid vesicle. The core in constructing the energy landscape from single-molecule time-series across different denaturant concentrations is the application of rate-distortion theory (RDT), which naturally considers the effects of measurement noise and sampling error, in combination with change-point detection and the quantification of the FRET efficiency-dependent photobleaching behavior. Energy landscapes are constructed as a function of observation time scale, revealing multiple partially folded conformations at small time scales that are situated in a superbasin. As the time scale increases, these denatured states merge into a single basin, demonstrating the coarse-graining of the energy landscape as observation time increases. Because the photobleaching time scale is dependent on the conformational state of the protein, possible nonequilibrium features are discussed, and a statistical test for violation of the detailed balance condition is developed based on the state sequences arising from the RDT framework.

  17. Complete telomere-to-telomere de novo assembly of the Plasmodium falciparum genome through long-read (>11 kb), single molecule, real-time sequencing

    PubMed Central

    Vembar, Shruthi Sridhar; Seetin, Matthew; Lambert, Christine; Nattestad, Maria; Schatz, Michael C.; Baybayan, Primo; Scherf, Artur; Smith, Melissa Laird

    2016-01-01

    The application of next-generation sequencing to estimate genetic diversity of Plasmodium falciparum, the most lethal malaria parasite, has proved challenging due to the skewed AT-richness [∼80.6% (A + T)] of its genome and the lack of technology to assemble highly polymorphic subtelomeric regions that contain clonally variant, multigene virulence families (Ex: var and rifin). To address this, we performed amplification-free, single molecule, real-time sequencing of P. falciparum genomic DNA and generated reads of average length 12 kb, with 50% of the reads between 15.5 and 50 kb in length. Next, using the Hierarchical Genome Assembly Process, we assembled the P. falciparum genome de novo and successfully compiled all 14 nuclear chromosomes telomere-to-telomere. We also accurately resolved centromeres [∼90–99% (A + T)] and subtelomeric regions and identified large insertions and duplications that add extra var and rifin genes to the genome, along with smaller structural variants such as homopolymer tract expansions. Overall, we show that amplification-free, long-read sequencing combined with de novo assembly overcomes major challenges inherent to studying the P. falciparum genome. Indeed, this technology may not only identify the polymorphic and repetitive subtelomeric sequences of parasite populations from endemic areas but may also evaluate structural variation linked to virulence, drug resistance and disease transmission. PMID:27345719

  18. A reversible single-molecule switch based on activated antiaromaticity

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yin, Xiaodong; Zang, Yaping; Zhu, Liangliang

    Single-molecule electronic devices provide researchers with an unprecedented ability to relate novel physical phenomena to molecular chemical structures. Typically, conjugated aromatic molecular backbones are relied upon to create electronic devices, where the aromaticity of the building blocks is used to enhance conductivity. We capitalize on the classical physical organic chemistry concept of Hückel antiaromaticity by demonstrating a single-molecule switch that exhibits low conductance in the neutral state and, upon electrochemical oxidation, reversibly switches to an antiaromatic high-conducting structure. We form single-molecule devices using the scanning tunneling microscope–based break-junction technique and observe an on/off ratio of ~70 for a thiophenylidene derivativemore » that switches to an antiaromatic state with 6-4-6-p electrons. Through supporting nuclear magnetic resonance measurements, we show that the doubly oxidized core has antiaromatic character and we use density functional theory calculations to rationalize the origin of the high-conductance state for the oxidized single-molecule junction. Together, our work demonstrates how the concept of antiaromaticity can be exploited to create single-molecule devices that are highly conducting.« less

  19. A reversible single-molecule switch based on activated antiaromaticity

    DOE PAGES

    Yin, Xiaodong; Zang, Yaping; Zhu, Liangliang; ...

    2017-10-27

    Single-molecule electronic devices provide researchers with an unprecedented ability to relate novel physical phenomena to molecular chemical structures. Typically, conjugated aromatic molecular backbones are relied upon to create electronic devices, where the aromaticity of the building blocks is used to enhance conductivity. We capitalize on the classical physical organic chemistry concept of Hückel antiaromaticity by demonstrating a single-molecule switch that exhibits low conductance in the neutral state and, upon electrochemical oxidation, reversibly switches to an antiaromatic high-conducting structure. We form single-molecule devices using the scanning tunneling microscope–based break-junction technique and observe an on/off ratio of ~70 for a thiophenylidene derivativemore » that switches to an antiaromatic state with 6-4-6-p electrons. Through supporting nuclear magnetic resonance measurements, we show that the doubly oxidized core has antiaromatic character and we use density functional theory calculations to rationalize the origin of the high-conductance state for the oxidized single-molecule junction. Together, our work demonstrates how the concept of antiaromaticity can be exploited to create single-molecule devices that are highly conducting.« less

  20. Single-Stranded DNA Aptamers against Pathogens and Toxins: Identification and Biosensing Applications

    PubMed Central

    Hong, Ka Lok

    2015-01-01

    Molecular recognition elements (MREs) can be short sequences of single-stranded DNA, RNA, small peptides, or antibody fragments. They can bind to user-defined targets with high affinity and specificity. There has been an increasing interest in the identification and application of nucleic acid molecular recognition elements, commonly known as aptamers, since they were first described in 1990 by the Gold and Szostak laboratories. A large number of target specific nucleic acids MREs and their applications are currently in the literature. This review first describes the general methodologies used in identifying single-stranded DNA (ssDNA) aptamers. It then summarizes advancements in the identification and biosensing application of ssDNA aptamers specific for bacteria, viruses, their associated molecules, and selected chemical toxins. Lastly, an overview of the basic principles of ssDNA aptamer-based biosensors is discussed. PMID:26199940

  1. Inhibition of Oncogenic functionality of STAT3 Protein by Membrane Anchoring

    NASA Astrophysics Data System (ADS)

    Liu, Baoxu; Fletcher, Steven; Gunning, Patrick; Gradinaru, Claudiu

    2009-03-01

    Signal Transducer and Activator of Transcription 3 (STAT3) protein plays an important role in oncogenic processes. A novel molecular therapeutic approach to inhibit the oncogenic functionality of STAT3 is to design a prenylated small peptide sequence which could sequester STAT3 to the plasma membrane. We have also developed a novel fluorescein derivative label (F-NAc), which is much more photostable compared to the popular fluorescein label FITC. Remarkably, the new dye shows fluorescent properties that are invariant over a wide pH range, which is advantageous for our application. We have shown that F-NAc is suitable for single-molecule measurements and its properties are not affected by ligation to biomolecules. The membrane localization via high-affinity prenylated small-molecule binding agents is studied by encapsulating FNAc-labeled STAT3 and inhibitors within a liposome model cell system. The dynamics of the interaction between the protein and the prenylated ligands is investigated at single molecule level. The efficiency and stability of the STAT3 anchoring in lipid membranes are addressed via quantitative confocal imaging and single-molecule spectroscopy using a custom-built multiparameter fluorescence microscope.

  2. DNA sequencing with pyrophosphatase

    DOEpatents

    Tabor, S.; Richardson, C.C.

    1996-03-12

    A kit or solution is disclosed for use in extension of an oligonucleotide primer having a first single-stranded region on a template molecule and having a second single-stranded region homologous to the first single-stranded region. The first agent is able to cause extension of the first single-stranded region of the primer on the second single-stranded region of the template in a reaction mixture. The second agent is able to reduce the amount of pyrophosphate in the reaction mixture below the amount produced during the extension in the absence of the second agent.

  3. DNA sequencing with pyrophosphatase

    DOEpatents

    Tabor, Stanley; Richardson, Charles C.

    1996-03-12

    A kit or solution for use in extension of an oligonucleotide primer having a first single-stranded region on a template molecule having a second single-stranded region homologous to the first single-stranded region, comprising a first agent able to cause extension of the first single-stranded region of the primer on the second single-stranded region of the template in a reaction mixture, and a second agent able to reduce the amount of pyrophosphate in the reaction mixture below the amount produced during the extension in the absence of the second agent.

  4. Use of continuous/contiguous stacking hybridization as a diagnostic tool

    DOEpatents

    Mirzabekov, Andrei Darievich; Kirillov, Eugene Vladislavovich; Parinov, Sergei Valeryevich; Barski, Victor Evgenievich; Dubiley, Svetlana Alekseevna

    2002-01-01

    A method for detecting disease-associated alleles in patient genetic material is provided whereby a first group of oligonucleotide molecules, synthesized to compliment base sequences of the disease associated alleles is immobilized on a predetermined position on a substrate, and then contacted with patient genetic material to form duplexes. The duplexes are then contacted with a second group of oligonucleotide molecules which are synthesized to extend the predetermined length of the oligonucleotide molecules of the first group, and where each of the oligonucleotide molecules of the second group are tagged and either incorporate universal bases or a mixture of guanine, cytosine, thymine, and adenine, or complementary nucleotide strands that are tagged with a different fluorochrome which radiates light at a predetermined wavelength. The treated substrate is then washed and the light patterns radiating therefrom are compared with predetermined light patterns of various diseases that were prepared on identical substrates. A method is also provided for determining the length of a repeat sequence in DNA or RNA, and also for determining the base sequence of unknown DNA or RNA.

  5. Use of continuous/contiguous stacking hybridization as a diagnostic tool

    DOEpatents

    Mirzabekov, Andrei Darievich; Kirillov, Eugene Vladislavovich; Parinov, Sergei Valeryevich; Barski, Victor Evgenievich; Dubiley, Svetlana Alekseevna

    2000-01-01

    A method for detecting disease-associated alleles in patient genetic material is provided whereby a first group of oligonucleotide molecules, synthesized to compliment base sequences of the disease associated alleles is immobilized on a predetermined position on a substrate, and then contacted with patient genetic material to form duplexes. The duplexes are then contacted with a second group of oligonucleotide molecules which are synthesized to extend the predetermined length of the oligonucleotide molecules of the first group, and where each of the oligonucleotide molecules of the second group are tagged and either incorporate universal bases or a mixture of guanine, cytosine, thymine, and adenine, or complementary nucleotide strands that are tagged with a different fluorochrome which radiates light at a predetermined wavelength. The treated substrate is then washed and the light patterns radiating therefrom are compared with predetermined light patterns of various diseases that were prepared on identical substrates. A method is also provided for determining the length of a repeat sequence in DNA or RNA, and also for determining the base sequence of unknown DNA or RNA.

  6. Prediction of RNA secondary structures: from theory to models and real molecules

    NASA Astrophysics Data System (ADS)

    Schuster, Peter

    2006-05-01

    RNA secondary structures are derived from RNA sequences, which are strings built form the natural four letter nucleotide alphabet, {AUGC}. These coarse-grained structures, in turn, are tantamount to constrained strings over a three letter alphabet. Hence, the secondary structures are discrete objects and the number of sequences always exceeds the number of structures. The sequences built from two letter alphabets form perfect structures when the nucleotides can form a base pair, as is the case with {GC} or {AU}, but the relation between the sequences and structures differs strongly from the four letter alphabet. A comprehensive theory of RNA structure is presented, which is based on the concepts of sequence space and shape space, being a space of structures. It sets the stage for modelling processes in ensembles of RNA molecules like evolutionary optimization or kinetic folding as dynamical phenomena guided by mappings between the two spaces. The number of minimum free energy (mfe) structures is always smaller than the number of sequences, even for two letter alphabets. Folding of RNA molecules into mfe energy structures constitutes a non-invertible mapping from sequence space onto shape space. The preimage of a structure in sequence space is defined as its neutral network. Similarly the set of suboptimal structures is the preimage of a sequence in shape space. This set represents the conformation space of a given sequence. The evolutionary optimization of structures in populations is a process taking place in sequence space, whereas kinetic folding occurs in molecular ensembles that optimize free energy in conformation space. Efficient folding algorithms based on dynamic programming are available for the prediction of secondary structures for given sequences. The inverse problem, the computation of sequences for predefined structures, is an important tool for the design of RNA molecules with tailored properties. Simultaneous folding or cofolding of two or more RNA molecules can be modelled readily at the secondary structure level and allows prediction of the most stable (mfe) conformations of complexes together with suboptimal states. Cofolding algorithms are important tools for efficient and highly specific primer design in the polymerase chain reaction (PCR) and help to explain the mechanisms of small interference RNA (si-RNA) molecules in gene regulation. The evolutionary optimization of RNA structures is illustrated by the search for a target structure and mimics aptamer selection in evolutionary biotechnology. It occurs typically in steps consisting of short adaptive phases interrupted by long epochs of little or no obvious progress in optimization. During these quasi-stationary epochs the populations are essentially confined to neutral networks where they search for sequences that allow a continuation of the adaptive process. Modelling RNA evolution as a simultaneous process in sequence and shape space provides answers to questions of the optimal population size and mutation rates. Kinetic folding is a stochastic process in conformation space. Exact solutions are derived by direct simulation in the form of trajectory sampling or by solving the master equation. The exact solutions can be approximated straightforwardly by Arrhenius kinetics on barrier trees, which represent simplified versions of conformational energy landscapes. The existence of at least one sequence forming any arbitrarily chosen pair of structures is granted by the intersection theorem. Folding kinetics is the key to understanding and designing multistable RNA molecules or RNA switches. These RNAs form two or more long lived conformations, and conformational changes occur either spontaneously or are induced through binding of small molecules or other biopolymers. RNA switches are found in nature where they act as elements in genetic and metabolic regulation. The reliability of RNA secondary structure prediction is limited by the accuracy with which the empirical parameters can be determined and by principal deficiencies, for example by the lack of energy contributions resulting from tertiary interactions. In addition, native structures may be determined by folding kinetics rather than by thermodynamics. We address the first problem by considering base pair probabilities or base pairing entropies, which are derived from the partition function of conformations. A high base pair probability corresponding to a low pairing entropy is taken as an indicator of a high reliability of prediction. Pseudoknots are discussed as an example of a tertiary interaction that is highly important for RNA function. Moreover, pseudoknot formation is readily incorporated into structure prediction algorithms. Some examples of experimental data on RNA secondary structures that are readily explained using the landscape concept are presented. They deal with (i) properties of RNA molecules with random sequences, (ii) RNA molecules from restricted alphabets, (iii) existence of neutral networks, (iv) shape space covering, (v) riboswitches and (vi) evolution of non-coding RNAs as an example of evolution restricted to neutral networks.

  7. Base modifications affecting RNA polymerase and reverse transcriptase fidelity.

    PubMed

    Potapov, Vladimir; Fu, Xiaoqing; Dai, Nan; Corrêa, Ivan R; Tanner, Nathan A; Ong, Jennifer L

    2018-06-20

    Ribonucleic acid (RNA) is capable of hosting a variety of chemically diverse modifications, in both naturally-occurring post-transcriptional modifications and artificial chemical modifications used to expand the functionality of RNA. However, few studies have addressed how base modifications affect RNA polymerase and reverse transcriptase activity and fidelity. Here, we describe the fidelity of RNA synthesis and reverse transcription of modified ribonucleotides using an assay based on Pacific Biosciences Single Molecule Real-Time sequencing. Several modified bases, including methylated (m6A, m5C and m5U), hydroxymethylated (hm5U) and isomeric bases (pseudouridine), were examined. By comparing each modified base to the equivalent unmodified RNA base, we can determine how the modification affected cumulative RNA polymerase and reverse transcriptase fidelity. 5-hydroxymethyluridine and N6-methyladenosine both increased the combined error rate of T7 RNA polymerase and reverse transcriptases, while pseudouridine specifically increased the error rate of RNA synthesis by T7 RNA polymerase. In addition, we examined the frequency, mutational spectrum and sequence context of reverse transcription errors on DNA templates from an analysis of second strand DNA synthesis.

  8. Quantitative analysis of single-molecule superresolution images

    PubMed Central

    Coltharp, Carla; Yang, Xinxing; Xiao, Jie

    2014-01-01

    This review highlights the quantitative capabilities of single-molecule localization-based superresolution imaging methods. In addition to revealing fine structural details, the molecule coordinate lists generated by these methods provide the critical ability to quantify the number, clustering, and colocalization of molecules with 10 – 50 nm resolution. Here we describe typical workflows and precautions for quantitative analysis of single-molecule superresolution images. These guidelines include potential pitfalls and essential control experiments, allowing critical assessment and interpretation of superresolution images. PMID:25179006

  9. 1,8-Naphthyridine-2,7-diamine: a potential universal reader of Watson-Crick base pairs for DNA sequencing by electron tunneling.

    PubMed

    Liang, Feng; Lindsay, Stuart; Zhang, Peiming

    2012-11-21

    With the aid of Density Functional Theory (DFT), we designed 1,8-naphthyridine-2,7-diamine as a recognition molecule to read DNA base pairs for genomic sequencing by electron tunneling. NMR studies show that it can form stable triplets with both A : T and G : C base pairs through hydrogen bonding. Our results suggest that the naphthyridine molecule should be able to function as a universal base pair reader in a tunneling gap, generating distinguishable signatures under electrical bias for each of DNA base pairs.

  10. 1,8-Naphthyridine-2,7-diamine: A Potential Universal Reader of the Watson-Crick Base Pairs for DNA Sequencing by Electron Tunneling

    PubMed Central

    Liang, Feng; Lindsay, Stuart; Zhang, Peiming

    2013-01-01

    With the aid of Density Functional Theory (DFT), we designed 1,8-naphthyridine-2,7-diamine as a recognition molecule to read the DNA base pairs for genomic sequencing by electron tunneling. NMR studies show that it can form stable triplets with both A:T and G:C base pairs through hydrogen bonding. Our results suggest that the naphthyridine molecule should be able to function as a universal base pair reader in a tunneling gap, generating distinguishable signatures under electrical bias for each of DNA base pairs. PMID:23038027

  11. A disruptive sequencer meets disruptive publishing.

    PubMed

    Loman, Nick; Goodwin, Sarah; Jansen, Hans; Loose, Matt

    2015-01-01

    Nanopore sequencing was recently made available to users in the form of the Oxford Nanopore MinION. Released to users through an early access programme, the MinION is made unique by its tiny form factor and ability to generate very long sequences from single DNA molecules. The platform is undergoing rapid evolution with three distinct nanopore types and five updates to library preparation chemistry in the last 18 months. To keep pace with the rapid evolution of this sequencing platform, and to provide a space where new analysis methods can be openly discussed, we present a new F1000Research channel devoted to updates to and analysis of nanopore sequence data.

  12. Conformation and Aggregation of LKα14 Peptide in Bulk Water and at the Air/Water Interface.

    PubMed

    Dalgicdir, Cahit; Sayar, Mehmet

    2015-12-10

    Historically, the protein folding problem has mainly been associated with understanding the relationship between amino acid sequence and structure. However, it is known that both the conformation of individual molecules and their aggregation strongly depend on the environmental conditions. Here, we study the aggregation behavior of the model peptide LKα14 (with amino acid sequence LKKLLKLLKKLLKL) in bulk water and at the air/water interface. We start by a quantitative analysis of the conformational space of a single LKα14 in bulk water. Next, in order to analyze the aggregation tendency of LKα14, by using the umbrella sampling technique we calculate the potential of mean force for pulling a single peptide from an n-molecule aggregate. In agreement with the experimental results, our calculations yield the optimal aggregate size as four. This equilibrium state is achieved by two opposing forces: Coulomb repulsion between the lysine side chains and the reduction of solvent accessible hydrophobic surface area upon aggregation. At the vacuum/water interface, however, even dimers of LKα14 become marginally stable, and any larger aggregate falls apart instantaneously. Our results indicate that even though the interface is highly influential in stabilizing the α-helix conformation for a single molecule, it significantly reduces the attraction between two LKα14 peptides, along with their aggregation tendency.

  13. Superconducting molybdenum-rhenium electrodes for single-molecule transport studies

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gaudenzi, R.; Island, J. O.; Bruijckere, J. de

    2015-06-01

    We demonstrate that electronic transport through single molecules or molecular ensembles, commonly based on gold (Au) electrodes, can be extended to superconducting electrodes by combining gold with molybdenum-rhenium (MoRe). This combination induces proximity-effect superconductivity in the gold to temperatures of at least 4.6 K and magnetic fields of 6 T, improving on previously reported aluminum based superconducting nanojunctions. As a proof of concept, we show three-terminal superconductive transport measurements through an individual Fe{sub 4} single-molecule magnet.

  14. Sequence-based design of bioactive small molecules that target precursor microRNAs.

    PubMed

    Velagapudi, Sai Pradeep; Gallo, Steven M; Disney, Matthew D

    2014-04-01

    Oligonucleotides are designed to target RNA using base pairing rules, but they can be hampered by poor cellular delivery and nonspecific stimulation of the immune system. Small molecules are preferred as lead drugs or probes but cannot be designed from sequence. Herein, we describe an approach termed Inforna that designs lead small molecules for RNA from solely sequence. Inforna was applied to all human microRNA hairpin precursors, and it identified bioactive small molecules that inhibit biogenesis by binding nuclease-processing sites (44% hit rate). Among 27 lead interactions, the most avid interaction is between a benzimidazole (1) and precursor microRNA-96. Compound 1 selectively inhibits biogenesis of microRNA-96, upregulating a protein target (FOXO1) and inducing apoptosis in cancer cells. Apoptosis is ablated when FOXO1 mRNA expression is knocked down by an siRNA, validating compound selectivity. Markedly, microRNA profiling shows that 1 only affects microRNA-96 biogenesis and is at least as selective as an oligonucleotide.

  15. Sequence-based design of bioactive small molecules that target precursor microRNAs

    PubMed Central

    Velagapudi, Sai Pradeep; Gallo, Steven M.; Disney, Matthew D.

    2014-01-01

    Oligonucleotides are designed to target RNA using base pairing rules, however, they are hampered by poor cellular delivery and non-specific stimulation of the immune system. Small molecules are preferred as lead drugs or probes, but cannot be designed from sequence. Herein, we describe an approach termed Inforna that designs lead small molecules for RNA from solely sequence. Inforna was applied to all human microRNA precursors and identified bioactive small molecules that inhibit biogenesis by binding to nuclease processing sites (41% hit rate). Amongst 29 lead interactions, the most avid interaction is between a benzimidazole (1) and precursor microRNA-96. Compound 1 selectively inhibits biogenesis of microRNA-96, upregulating a protein target (FOXO1) and inducing apoptosis in cancer cells. Apoptosis is ablated when FOXO1 mRNA expression is knocked down by an siRNA, validating compound selectivity. Importantly, microRNA profiling shows that 1 only significantly effects microRNA-96 biogenesis and is more selective than an oligonucleotide. PMID:24509821

  16. Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

    DOEpatents

    Gardner, Shea N; Mariella, Jr., Raymond P; Christian, Allen T; Young, Jennifer A; Clague, David S

    2013-06-25

    A method of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths.

  17. A Single-Molecule Barcoding System using Nanoslits for DNA Analysis

    NASA Astrophysics Data System (ADS)

    Jo, Kyubong; Schramm, Timothy M.; Schwartz, David C.

    Single DNA molecule approaches are playing an increasingly central role in the analytical genomic sciences because single molecule techniques intrinsically provide individualized measurements of selected molecules, free from the constraints of bulk techniques, which blindly average noise and mask the presence of minor analyte components. Accordingly, a principal challenge that must be addressed by all single molecule approaches aimed at genome analysis is how to immobilize and manipulate DNA molecules for measurements that foster construction of large, biologically relevant data sets. For meeting this challenge, this chapter discusses an integrated approach for microfabricated and nanofabricated devices for the manipulation of elongated DNA molecules within nanoscale geometries. Ideally, large DNA coils stretch via nanoconfinement when channel dimensions are within tens of nanometers. Importantly, stretched, often immobilized, DNA molecules spanning hundreds of kilobase pairs are required by all analytical platforms working with large genomic substrates because imaging techniques acquire sequence information from molecules that normally exist in free solution as unrevealing random coils resembling floppy balls of yarn. However, nanoscale devices fabricated with sufficiently small dimensions fostering molecular stretching make these devices impractical because of the requirement of exotic fabrication technologies, costly materials, and poor operational efficiencies. In this chapter, such problems are addressed by discussion of a new approach to DNA presentation and analysis that establishes scaleable nanoconfinement conditions through reduction of ionic strength; stiffening DNA molecules thus enabling their arraying for analysis using easily fabricated devices that can also be mass produced. This new approach to DNA nanoconfinement is complemented by the development of a novel labeling scheme for reliable marking of individual molecules with fluorochrome labels, creating molecular barcodes, which are efficiently read using fluorescence resonance energy transfer techniques for minimizing noise from unincorporated labels. As such, our integrative approach for the realization of genomic analysis through nanoconfinement, named nanocoding, was demonstrated through the barcoding and mapping of bacterial artificial chromosomal molecules, thereby providing the basis for a high-throughput platform competent for whole genome investigations.

  18. Label-free and high-sensitive detection for genetic point mutation based on hyperspectral interferometry

    NASA Astrophysics Data System (ADS)

    Fu, Rongxin; Li, Qi; Zhang, Junqi; Wang, Ruliang; Lin, Xue; Xue, Ning; Su, Ya; Jiang, Kai; Huang, Guoliang

    2016-10-01

    Label free point mutation detection is particularly momentous in the area of biomedical research and clinical diagnosis since gene mutations naturally occur and bring about highly fatal diseases. In this paper, a label free and high sensitive approach is proposed for point mutation detection based on hyperspectral interferometry. A hybridization strategy is designed to discriminate a single-base substitution with sequence-specific DNA ligase. Double-strand structures will take place only if added oligonucleotides are perfectly paired to the probe sequence. The proposed approach takes full use of the inherent conformation of double-strand DNA molecules on the substrate and a spectrum analysis method is established to point out the sub-nanoscale thickness variation, which benefits to high sensitive mutation detection. The limit of detection reach 4pg/mm2 according to the experimental result. A lung cancer gene point mutation was demonstrated, proving the high selectivity and multiplex analysis capability of the proposed biosensor.

  19. Single Molecule Enzymology via Nanoelectronic Circuits

    NASA Astrophysics Data System (ADS)

    Collins, Philip

    Traditional single-molecule techniques rely on fluorescence or force transduction to monitor conformational changes and biochemical activity. Recent demonstrations of single-molecule monitoring with electronic transistors are poised to add to the single-molecule research toolkit. The transistor-based technique is sensitive to the motion of single charged side chain residues and can transduce those motions with microsecond resolution, opening the doors to single-molecule enzymology with unprecedented resolution. Furthermore, the solid-state platform provides opportunities for parallelization in arrays and long-duration monitoring of one molecule's activity or processivity, all without the limitations caused by photo-oxidation or mutagenic fluorophore incorporation. This presentation will review some of these advantages and their particular application to DNA polymerase I processing single-stranded DNA templates. This research was supported financially by the NIH NCI (R01 CA133592-01), the NIH NIGMS (1R01GM106957-01) and the NSF (DMR-1104629 and ECCS-1231910).

  20. Denoising DNA deep sequencing data—high-throughput sequencing errors and their correction

    PubMed Central

    Laehnemann, David; Borkhardt, Arndt

    2016-01-01

    Characterizing the errors generated by common high-throughput sequencing platforms and telling true genetic variation from technical artefacts are two interdependent steps, essential to many analyses such as single nucleotide variant calling, haplotype inference, sequence assembly and evolutionary studies. Both random and systematic errors can show a specific occurrence profile for each of the six prominent sequencing platforms surveyed here: 454 pyrosequencing, Complete Genomics DNA nanoball sequencing, Illumina sequencing by synthesis, Ion Torrent semiconductor sequencing, Pacific Biosciences single-molecule real-time sequencing and Oxford Nanopore sequencing. There is a large variety of programs available for error removal in sequencing read data, which differ in the error models and statistical techniques they use, the features of the data they analyse, the parameters they determine from them and the data structures and algorithms they use. We highlight the assumptions they make and for which data types these hold, providing guidance which tools to consider for benchmarking with regard to the data properties. While no benchmarking results are included here, such specific benchmarks would greatly inform tool choices and future software development. The development of stand-alone error correctors, as well as single nucleotide variant and haplotype callers, could also benefit from using more of the knowledge about error profiles and from (re)combining ideas from the existing approaches presented here. PMID:26026159

  1. Dramatic effect of single-base mutation on the conformational dynamics of human telomeric G-quadruplex

    PubMed Central

    Lee, Ja Yil; Kim, D. S.

    2009-01-01

    Guanine-rich DNA sequences can form G-quadruplexes. These four-stranded structures are known to form in several genomic regions and to influence certain biological activities. Sometimes, the instability of G-quadruplexes causes the abnormal biological processes. Mutation is a culprit for the destabilization of G-quadruplexes, but the details of mutated G-quadruplexes are poorly understood. In this article, we investigated the conformational dynamics of single-base mutated human telomeric G-quadruplexes in the presence of K+ with single-molecule FRET spectroscopy. We observed that the replacement of single guanine by thymine in a G-track induces various folded structures, i.e. structural polymorphism. Moreover, direct observation of their dynamics revealed that a single-base mutation causes fast unfolding of folded states under physiological conditions. Furthermore, we found that the degree of destabilization varies according to mutation positions. When the central guanine of a G-track is replaced, the G-quadruplexes unfold quickly at any K+ concentrations and temperature. Meanwhile, outer-quartet mutated G-quadruplexes have heterogeneous dynamics at intermediate K+ concentrations and longstanding folded states at high K+ concentrations. Several factors such as base-stacking interaction and K+ coordination are responsible for the different dynamics according to the mutation position. PMID:19359361

  2. SISGR: Room Temperature Single-Molecule Detection and Imaging by Stimulated Emission Microscopy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Xie, Xiaoliang Sunney

    Single-molecule spectroscopy has made considerable impact on many disciplines including chemistry, physics, and biology. To date, most single-molecule spectroscopy work is accomplished by detecting fluorescence. On the other hand, many naturally occurring chromophores, such as retinal, hemoglobin and cytochromes, do not have detectable fluorescence. There is an emerging need for single-molecule spectroscopy techniques that do not require fluorescence. In the last proposal period, we have successfully demonstrated stimulated emission microscopy, single molecule absorption, and stimulated Raman microscopy based on a high-frequency modulation transfer technique. These first-of-a- kind new spectroscopy/microscopy methods tremendously improved our ability to observe molecules that fluorescence weakly,more » even to the limit of single molecule detection for absorption measurement. All of these methods employ two laser beams: one (pump beam) excites a single molecule to a real or virtual excited state, and the other (probe beam) monitors the absorption/emission property of the single. We extract the intensity change of the probe beam with high sensitivity by implementing a high-frequency phase-sensitive detection scheme, which offers orders of magnitude improvement in detection sensitivity over direct absorption/emission measurement. However, single molecule detection based on fluorescence or absorption is fundamentally limited due to their broad spectral response. It is important to explore other avenues in single molecule detection and imaging which provides higher molecular specificity for studying a wide variety of heterogeneous chemical and biological systems. This proposal aimed to achieve single-molecule detection sensitivity with near resonance stimulated Raman scattering (SRS) microscopy. SRS microscopy was developed in our lab as a powerful technique for imaging heterogeneous samples based on their intrinsic vibrational contrasts, which provides much higher molecular specificity than absorption and fluorescence. Current sensitivity limit of SRS microscopy has not yet reached single molecule detection. We proposed to capitalize on our state-of-the-art SRS microscopy and develop near-resonance enhanced SRS for single molecule detection of carotenoids and heme proteins. The specific aims we pursued are: (1) building the next SRS generation microscope that utilizes near resonance enhancement to allow detection and imaging of single molecules with undetectable fluorescence, such as -carotene. (2) using near-resonance SRS as a contrast mechanism to study dye-sensitize semiconductor interface, elucidating the heterogeneous electron ejection kinetics with high spatial and temporal resolution. (3) studying the binding and unbinding of oxygen in single hemoglobin molecules in order to gain molecular level understanding of the long-standing issue of cooperativity. The new methods developed in the fund period of this grant have advanced the detection sensitivity in many aspects. Near-resonance SRS improved the signal by using shorter wavelengths for SRS microscopy. Frequency modulation and multi-color SRS target the reduction of background to improve the chemical specificity of SRS while maintaining the high imaging speed. Time-domain coherent Raman scattering microscopy targets to reduce the noise floor of coherent Raman microscopy. These methods have already demonstrated first-of-a-kind new applications in biology and medical research. However, we are still one order of magnitude away from single molecule limit. It is important to continue to improve the laser specification and develop new imaging methods to finally achieve label-free single molecule microscopy.« less

  3. Combining single-molecule manipulation and single-molecule detection.

    PubMed

    Cordova, Juan Carlos; Das, Dibyendu Kumar; Manning, Harris W; Lang, Matthew J

    2014-10-01

    Single molecule force manipulation combined with fluorescence techniques offers much promise in revealing mechanistic details of biomolecular machinery. Here, we review force-fluorescence microscopy, which combines the best features of manipulation and detection techniques. Three of the mainstay manipulation methods (optical traps, magnetic traps and atomic force microscopy) are discussed with respect to milestones in combination developments, in addition to highlight recent contributions to the field. An overview of additional strategies is discussed, including fluorescence based force sensors for force measurement in vivo. Armed with recent exciting demonstrations of this technology, the field of combined single-molecule manipulation and single-molecule detection is poised to provide unprecedented views of molecular machinery. Copyright © 2014 Elsevier Ltd. All rights reserved.

  4. Nano-fabrication of molecular electronic junctions by targeted modification of metal-molecule bonds

    NASA Astrophysics Data System (ADS)

    Jafri, S. Hassan M.; Löfås, Henrik; Blom, Tobias; Wallner, Andreas; Grigoriev, Anton; Ahuja, Rajeev; Ottosson, Henrik; Leifer, Klaus

    2015-09-01

    Reproducibility, stability and the coupling between electrical and molecular properties are central challenges in the field of molecular electronics. The field not only needs devices that fulfill these criteria but they also need to be up-scalable to application size. In this work, few-molecule based electronics devices with reproducible electrical characteristics are demonstrated. Our previously reported 5 nm gold nanoparticles (AuNP) coated with ω-triphenylmethyl (trityl) protected 1,8-octanedithiol molecules are trapped in between sub-20 nm gap spacing gold nanoelectrodes forming AuNP-molecule network. When the trityl groups are removed, reproducible devices and stable Au-thiol junctions are established on both ends of the alkane segment. The resistance of more than 50 devices is reduced by orders of magnitude as well as a reduction of the spread in the resistance histogram is observed. By density functional theory calculations the orders of magnitude decrease in resistance can be explained and supported by TEM observations thus indicating that the resistance changes and strongly improved resistance spread are related to the establishment of reproducible and stable metal-molecule bonds. The same experimental sequence is carried out using 1,6-hexanedithiol functionalized AuNPs. The average resistances as a function of molecular length, demonstrated herein, are comparable to the one found in single molecule devices.

  5. Detection of Single Molecules Illuminated by a Light-Emitting Diode

    PubMed Central

    Gerhardt, Ilja; Mai, Lijian; Lamas-Linares, Antía; Kurtsiefer, Christian

    2011-01-01

    Optical detection and spectroscopy of single molecules has become an indispensable tool in biological imaging and sensing. Its success is based on fluorescence of organic dye molecules under carefully engineered laser illumination. In this paper we demonstrate optical detection of single molecules on a wide-field microscope with an illumination based on a commercially available, green light-emitting diode. The results are directly compared with laser illumination in the same experimental configuration. The setup and the limiting factors, such as light transfer to the sample, spectral filtering and the resulting signal-to-noise ratio are discussed. A theoretical and an experimental approach to estimate these parameters are presented. The results can be adapted to other single emitter and illumination schemes. PMID:22346610

  6. Polymethylated [4.1.1] Octanes Leading to Zeolite SSZ-50

    NASA Astrophysics Data System (ADS)

    Lee, Greg S.; Zones, Stacey I.

    2002-09-01

    In this communication, we report on the discovery of novel zeolite compositions, SSZ-50. The zeolite has the RTH topology but can be made over a large silica-to-alumina range including no aluminum at all. The surprising capability to produce a broad compositional range comes from the use of a single organo-cation guest molecule in the zeolite synthesis. The molecule is a specific derivative from within a family of 2-aza [4.1.1] bicyclo octanes that were prepared employing a sequence of organic synthesis steps from a starting ketone. Other cage-based zeolites like SSZ-35,-36,-39 and MTN arose from the use of the other derivatives in this series. We also comment on the tendency of a variety of polymethylated organo-cations to produce RTH, the closely related ITE, or the intergrowth structure, SSZ-36.

  7. Single-Molecule Sequencing Reveals Complex Genome Variation of Hepatitis B Virus during 15 Years of Chronic Infection following Liver Transplantation

    PubMed Central

    Betz-Stablein, B. D.; Töpfer, A.; Littlejohn, M.; Yuen, L.; Colledge, D.; Sozzi, V.; Angus, P.; Thompson, A.; Revill, P.; Beerenwinkel, N.; Warner, N.

    2016-01-01

    ABSTRACT Chronic hepatitis B (CHB) is prevalent worldwide. The infectious agent, hepatitis B virus (HBV), replicates via an RNA intermediate and is error prone, leading to the rapid generation of closely related but not identical viral variants, including those that can escape host immune responses and antiviral treatments. The complexity of CHB can be further enhanced by the presence of HBV variants with large deletions in the genome generated via splicing (spHBV variants). Although spHBV variants are incapable of autonomous replication, their replication is rescued by wild-type HBV. spHBV variants have been shown to enhance wild-type virus replication, and their prevalence increases with liver disease progression. Single-molecule deep sequencing was performed on whole HBV genomes extracted from samples, including the liver explant, longitudinally collected from a subject with CHB over a 15-year period after liver transplantation. By employing novel bioinformatics methods, this analysis showed that the dynamics of the viral population across a period of changing treatment regimens was complex. The spHBV variants detected in the liver explant remained present posttransplantation, and a highly diverse novel spHBV population as well as variants with multiple deletions in the pre-S genes emerged. The identification of novel mutations outside the HBV reverse transcriptase gene that co-occurred with known drug resistance-associated mutations highlights the relevance of using full-genome deep sequencing and supports the hypothesis that drug resistance involves interactions across the full length of the HBV genome. IMPORTANCE Single-molecule sequencing allowed the characterization, in unprecedented detail, of the evolution of HBV populations and offered unique insights into the dynamics of defective and spHBV variants following liver transplantation and complex treatment regimens. This analysis also showed the rapid adaptation of HBV populations to treatment regimens with evolving drug resistance phenotypes and evidence of purifying selection across the whole genome. Finally, the new open-source bioinformatics tools with the capacity to easily identify potential spliced variants from deep sequencing data are freely available. PMID:27252524

  8. Sequence-structure mapping errors in the PDB: OB-fold domains

    PubMed Central

    Venclovas, Česlovas; Ginalski, Krzysztof; Kang, Chulhee

    2004-01-01

    The Protein Data Bank (PDB) is the single most important repository of structural data for proteins and other biologically relevant molecules. Therefore, it is critically important to keep the PDB data, as much as possible, error-free. In this study, we have analyzed PDB crystal structures possessing oligonucleotide/oligosaccharide binding (OB)-fold, one of the highly populated folds, for the presence of sequence-structure mapping errors. Using energy-based structure quality assessment coupled with sequence analyses, we have found that there are at least five OB-structures in the PDB that have regions where sequences have been incorrectly mapped onto the structure. We have demonstrated that the combination of these computation techniques is effective not only in detecting sequence-structure mapping errors, but also in providing guidance to correct them. Namely, we have used results of computational analysis to direct a revision of X-ray data for one of the PDB entries containing a fairly inconspicuous sequence-structure mapping error. The revised structure has been deposited with the PDB. We suggest use of computational energy assessment and sequence analysis techniques to facilitate structure determination when homologs having known structure are available to use as a reference. Such computational analysis may be useful in either guiding the sequence-structure assignment process or verifying the sequence mapping within poorly defined regions. PMID:15133161

  9. Direct, concurrent measurements of the forces and currents affecting DNA in a nanopore with comparable topography.

    PubMed

    Nelson, Edward M; Li, Hui; Timp, Gregory

    2014-06-24

    We report direct, concurrent measurements of the forces and currents associated with the translocation of a single-stranded DNA molecule tethered to the tip of an atomic force microscope (AFM) cantilever through synthetic pores with topagraphies comparable to the DNA. These measurements were performed to gauge the signal available for sequencing and the electric force required to impel a single molecule through synthetic nanopores ranging from 1.0 to 3.5 nm in diameter in silicon nitride membranes 6-10 nm thick. The measurements revealed that a molecule can slide relatively frictionlessly through a pore, but regular fluctuations are observed intermittently in the force (and the current) every 0.35-0.72 nm, which are attributed to individual nucleotides translating through the nanopore in a turnstile-like motion.

  10. Nanopore arrays in a silicon membrane for parallel single-molecule detection: fabrication

    NASA Astrophysics Data System (ADS)

    Schmidt, Torsten; Zhang, Miao; Sychugov, Ilya; Roxhed, Niclas; Linnros, Jan

    2015-08-01

    Solid state nanopores enable translocation and detection of single bio-molecules such as DNA in buffer solutions. Here, sub-10 nm nanopore arrays in silicon membranes were fabricated by using electron-beam lithography to define etch pits and by using a subsequent electrochemical etching step. This approach effectively decouples positioning of the pores and the control of their size, where the pore size essentially results from the anodizing current and time in the etching cell. Nanopores with diameters as small as 7 nm, fully penetrating 300 nm thick membranes, were obtained. The presented fabrication scheme to form large arrays of nanopores is attractive for parallel bio-molecule sensing and DNA sequencing using optical techniques. In particular the signal-to-noise ratio is improved compared to other alternatives such as nitride membranes suffering from a high-luminescence background.

  11. Nanopore arrays in a silicon membrane for parallel single-molecule detection: fabrication.

    PubMed

    Schmidt, Torsten; Zhang, Miao; Sychugov, Ilya; Roxhed, Niclas; Linnros, Jan

    2015-08-07

    Solid state nanopores enable translocation and detection of single bio-molecules such as DNA in buffer solutions. Here, sub-10 nm nanopore arrays in silicon membranes were fabricated by using electron-beam lithography to define etch pits and by using a subsequent electrochemical etching step. This approach effectively decouples positioning of the pores and the control of their size, where the pore size essentially results from the anodizing current and time in the etching cell. Nanopores with diameters as small as 7 nm, fully penetrating 300 nm thick membranes, were obtained. The presented fabrication scheme to form large arrays of nanopores is attractive for parallel bio-molecule sensing and DNA sequencing using optical techniques. In particular the signal-to-noise ratio is improved compared to other alternatives such as nitride membranes suffering from a high-luminescence background.

  12. Improved Analysis of Nanopore Sequence Data and Scanning Nanopore Techniques

    NASA Astrophysics Data System (ADS)

    Szalay, Tamas

    The field of nanopore research has been driven by the need to inexpensively and rapidly sequence DNA. In order to help realize this goal, this thesis describes the PoreSeq algorithm that identifies and corrects errors in real-world nanopore sequencing data and improves the accuracy of de novo genome assembly with increasing coverage depth. The approach relies on modeling the possible sources of uncertainty that occur as DNA advances through the nanopore and then using this model to find the sequence that best explains multiple reads of the same region of DNA. PoreSeq increases nanopore sequencing read accuracy of M13 bacteriophage DNA from 85% to 99% at 100X coverage. We also use the algorithm to assemble E. coli with 30X coverage and the lambda genome at a range of coverages from 3X to 50X. Additionally, we classify sequence variants at an order of magnitude lower coverage than is possible with existing methods. This thesis also reports preliminary progress towards controlling the motion of DNA using two nanopores instead of one. The speed at which the DNA travels through the nanopore needs to be carefully controlled to facilitate the detection of individual bases. A second nanopore in close proximity to the first could be used to slow or stop the motion of the DNA in order to enable a more accurate readout. The fabrication process for a new pyramidal nanopore geometry was developed in order to facilitate the positioning of the nanopores. This thesis demonstrates that two of them can be placed close enough to interact with a single molecule of DNA, which is a prerequisite for being able to use the driving force of the pores to exert fine control over the motion of the DNA. Another strategy for reading the DNA is to trap it completely with one pore and to move the second nanopore instead. To that end, this thesis also shows that a single strand of immobilized DNA can be captured in a scanning nanopore and examined for a full hour, with data from many scans at many different voltages obtained in order to detect a bound protein placed partway along the molecule.

  13. Synthesis and Properties of Size-expanded DNAs: Toward Designed, Functional Genetic Systems

    PubMed Central

    Krueger, Andrew T.; Lu, Haige; Lee, Alex H. F.; Kool, Eric T.

    2008-01-01

    We describe the design, synthesis, and properties of DNA-like molecules in which the base pairs are expanded by benzo homologation. The resulting size-expanded genetic helices are called xDNA (“expanded DNA”) and yDNA (“wide DNA”). The large component bases are fluorescent, and they display high stacking affinity. When singly substituted into natural DNA, they are destabilizing because the benzo-expanded base pair size is too large for the natural helix. However, when all base pairs are expanded, xDNA and yDNA form highly stable, sequence-selective double helices. The size-expanded DNAs are candidates for components of new, functioning genetic systems. In addition, the fluorescence of expanded DNA bases makes them potentially useful in probing nucleic acids. PMID:17309194

  14. Molecular electronics with single molecules in solid-state devices.

    PubMed

    Moth-Poulsen, Kasper; Bjørnholm, Thomas

    2009-09-01

    The ultimate aim of molecular electronics is to understand and master single-molecule devices. Based on the latest results on electron transport in single molecules in solid-state devices, we focus here on new insights into the influence of metal electrodes on the energy spectrum of the molecule, and on how the electron transport properties of the molecule depend on the strength of the electronic coupling between it and the electrodes. A variety of phenomena are observed depending on whether this coupling is weak, intermediate or strong.

  15. The fluorescently responsive 3-(naphthalen-1-ylethynyl)-3-deaza-2'-deoxyguanosine discriminates cytidine via the DNA minor groove.

    PubMed

    Suzuki, Azusa; Yanagi, Masaki; Takeda, Takuya; Hudson, Robert H E; Saito, Yoshio

    2017-09-26

    A new environmentally responsive fluorescent nucleoside, 3-(naphthalen-1-ylethynyl)-3-deaza-2'-deoxyguanosine ( 3nz G), has been synthesized. The nucleoside, 3nz G, exhibited solvatochromic properties and when introduced into ODN probes it was able to recognize 2'-deoxycytidine in target strands by a distinct change in its emission wavelength through probing microenvironmental changes in the DNA minor groove. Thus, 3nz G has the potential for use as a fluorescent probe molecule for micro-structural studies of nucleic acids including the detection of single-base alterations in target DNA sequences.

  16. PLMItRNA, a database on the heterogeneous genetic origin of mitochondrial tRNA genes and tRNAs in photosynthetic eukaryotes.

    PubMed

    Rainaldi, Guglielmo; Volpicella, Mariateresa; Licciulli, Flavio; Liuni, Sabino; Gallerani, Raffaele; Ceci, Luigi R

    2003-01-01

    The updated version of PLMItRNA reports information and multialignments on 609 genes and 34 tRNA molecules active in the mitochondria of Viridiplantae (27 Embryophyta and 10 Chlorophyta), and photosynthetic algae (one Cryptophyta, four Rhodophyta and two Stramenopiles). Colour-code based tables reporting the different genetic origin of identified genes allow hyper-textual link to single entries. Promoter sequences identified for tRNA genes in the mitochondrial genomes of Angiospermae are also reported. The PLMItRNA database is accessible at http://bighost.area.ba.cnr.it/PLMItRNA/.

  17. Use of synthetic peptide libraries for the H-2Kd binding motif identification.

    PubMed

    Quesnel, A; Casrouge, A; Kourilsky, P; Abastado, J P; Trudelle, Y

    1995-01-01

    To identify Kd-binding peptides, an approach based on small peptide libraries has been developed. These peptide libraries correspond to all possible single-amino acid variants of a particular Kd-binding peptide, SYIPSAEYI, an analog of the Plasmodium berghei 252-260 antigenic peptide SYIPSAEKI. In the parent sequence, each position is replaced by all the genetically encoded amino acids (except cysteine). The multiple analog syntheses are performed either by the Divide Couple and Recombine method or by the Single Resin method and generate mixtures containing 19 peptides. The present report deals with the synthesis, the purification, the chemical characterization by amino acid analysis and electrospray mass spectrometry (ES-MS), and the application of such mixtures in binding tests with a soluble, functionally empty, single-chain H-2Kd molecule denoted SC-Kd. For each mixture, bound peptides were eluted and analyzed by sequencing. Since the binding tests were realized in noncompetitive conditions, our results show that a much broader set of peptides bind to Kd than expected from previous studies. This may be of practical importance when looking for low affinity peptides such as tumor peptides capable of eliciting protective immune response.

  18. Electrochemical detection of single molecules using abiotic nanopores having electrically tunable dimensions

    DOEpatents

    Sansinena, Jose-Maria [Los Alamos, NM; Redondo, Antonio [Los Alamos, NM; Olazabal, Virginia [Los Alamos, NM; Hoffbauer, Mark A [Los Alamos, NM; Akhadov, Elshan A [Los Alamos, NM

    2009-12-29

    A barrier structure for use in an electrochemical stochastic membrane sensor for single molecule detection. The sensor is based upon inorganic nanopores having electrically tunable dimensions. The inorganic nanopores are formed from inorganic materials and an electrically conductive polymer. Methods of making the barrier structure and sensing single molecules using the barrier structure are also described.

  19. Electrochemical detection of single molecules using abiotic nanopores having electrically tunable dimensions

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sansinena, Jose-Maria; Redondo, Antonio; Olazabal, Virginia

    2017-09-12

    A barrier structure for use in an electrochemical stochastic membrane sensor for single molecule detection. The sensor is based upon inorganic nanopores having electrically tunable dimensions. The inorganic nanopores are formed from inorganic materials and an electrically conductive polymer. Methods of making the barrier structure and sensing single molecules using the barrier structure are also described.

  20. Electrochemical detection of single molecules using abiotic nanopores having electrically tunable dimensions

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sansinena, Jose-Maria; Redondo, Antonio; Olazabal, Virginia

    2017-07-18

    A barrier structure for use in an electrochemical stochastic membrane sensor for single molecule detection. The sensor is based upon inorganic nanopores having electrically tunable dimensions. The inorganic nanopores are formed from inorganic materials and an electrically conductive polymer. Methods of making the barrier structure and sensing single molecules using the barrier structure are also described.

  1. Electrochemical detection of single molecules using abiotic nanopores having electrically tunable dimensions

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sansinena, Jose-Maria; Redondo, Antonio; Olazabal, Virginia

    A barrier structure for use in an electrochemical stochastic membrane sensor for single molecule detection. The sensor is based upon inorganic nanopores having electrically tunable dimensions. The inorganic nanopores are formed from inorganic materials and an electrically conductive polymer. Methods of making the barrier structure and sensing single molecules using the barrier structure are also described.

  2. Research Update: Molecular electronics: The single-molecule switch and transistor

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sotthewes, Kai; Heimbuch, René, E-mail: r.heimbuch@utwente.nl; Kumar, Avijit

    2014-01-01

    In order to design and realize single-molecule devices it is essential to have a good understanding of the properties of an individual molecule. For electronic applications, the most important property of a molecule is its conductance. Here we show how a single octanethiol molecule can be connected to macroscopic leads and how the transport properties of the molecule can be measured. Based on this knowledge we have realized two single-molecule devices: a molecular switch and a molecular transistor. The switch can be opened and closed at will by carefully adjusting the separation between the electrical contacts and the voltage dropmore » across the contacts. This single-molecular switch operates in a broad temperature range from cryogenic temperatures all the way up to room temperature. Via mechanical gating, i.e., compressing or stretching of the octanethiol molecule, by varying the contact's interspace, we are able to systematically adjust the conductance of the electrode-octanethiol-electrode junction. This two-terminal single-molecule transistor is very robust, but the amplification factor is rather limited.« less

  3. Detection of gas molecules on single Mn adatom adsorbed graphyne: a DFT-D study

    NASA Astrophysics Data System (ADS)

    Lu, Zhansheng; Lv, Peng; Ma, Dongwei; Yang, Xinwei; Li, Shuo; Yang, Zongxian

    2018-02-01

    As one of the prominent applications in intelligent systems, gas sensing technology has attracted great interest in both industry and academia. In the current study, the pristine graphyne (GY) without and with a single Mn atom is investigated to detect the gas molecules (CO, CH4, CO2, NH3, NO and O2). The pristine GY is promising to detect O2 molecules because of its chemical adsorption on GY with large electron transfer. The great stability of the Mn/GY is found, and the Mn atom prefers to anchor at the alkyne ring as a single atom. Upon single Mn atom anchoring, the sensitivity and selectivity of GY based gas sensors is significantly improved for various molecules, except CH4. The recovery time of the Mn/GY after detecting the gas molecules may help to appraise the detection efficiency for the Mn/GY. The current study will help to understand the mechanism of detecting the gas molecules, and extend the potentially fascinating applications of GY-based materials.

  4. Second generation noninvasive fetal genome analysis reveals de novo mutations, single-base parental inheritance, and preferred DNA ends

    PubMed Central

    Chan, K. C. Allen; Jiang, Peiyong; Sun, Kun; Cheng, Yvonne K. Y.; Tong, Yu K.; Cheng, Suk Hang; Wong, Ada I. C.; Hudecova, Irena; Leung, Tak Y.; Chiu, Rossa W. K.; Lo, Yuk Ming Dennis

    2016-01-01

    Plasma DNA obtained from a pregnant woman was sequenced to a depth of 270× haploid genome coverage. Comparing the maternal plasma DNA sequencing data with the parental genomic DNA data and using a series of bioinformatics filters, fetal de novo mutations were detected at a sensitivity of 85% and a positive predictive value of 74%. These results represent a 169-fold improvement in the positive predictive value over previous attempts. Improvements in the interpretation of the sequence information of every base position in the genome allowed us to interrogate the maternal inheritance of the fetus for 618,271 of 656,676 (94.2%) heterozygous SNPs within the maternal genome. The fetal genotype at each of these sites was deduced individually, unlike previously, where the inheritance was determined for a collection of sites within a haplotype. These results represent a 90-fold enhancement in the resolution in determining the fetus’s maternal inheritance. Selected genomic locations were more likely to be found at the ends of plasma DNA molecules. We found that a subset of such preferred ends exhibited selectivity for fetal- or maternal-derived DNA in maternal plasma. The ratio of the number of maternal plasma DNA molecules with fetal preferred ends to those with maternal preferred ends showed a correlation with the fetal DNA fraction. Finally, this second generation approach for noninvasive fetal whole-genome analysis was validated in a pregnancy diagnosed with cardiofaciocutaneous syndrome with maternal plasma DNA sequenced to 195× coverage. The causative de novo BRAF mutation was successfully detected through the maternal plasma DNA analysis. PMID:27799561

  5. Single-molecule detection of epidermal growth factor receptor mutations in plasma by microfluidics digital PCR in non-small cell lung cancer patients.

    PubMed

    Yung, Tony K F; Chan, K C Allen; Mok, Tony S K; Tong, Joanna; To, Ka-Fai; Lo, Y M Dennis

    2009-03-15

    We aim to develop a digital PCR-based method for the quantitative detection of the two common epidermal growth factor receptor (EGFR) mutations (in-frame deletion at exon 19 and L858R at exon 21) in the plasma and tumor tissues of patients suffering from non-small cell lung cancers. These two mutations account for >85% of clinically important EGFR mutations associated with responsiveness to tyrosine kinase inhibitors. DNA samples were analyzed using a microfluidics system that simultaneously performed 9,180 PCRs at nanoliter scale. A single-mutant DNA molecule in a clinical specimen could be detected and the quantities of mutant and wild-type sequences were precisely determined. Exon 19 deletion and L858R mutation were detectable in 6 (17%) and 9 (26%) of 35 pretreatment plasma samples, respectively. When compared with the sequencing results of the tumor samples, the sensitivity and specificity of plasma EGFR mutation analysis were 92% and 100%, respectively. The plasma concentration of the mutant sequences correlated well with the clinical response. Decreased concentration was observed in all patients with partial or complete clinical remission, whereas persistence of mutation was observed in a patient with cancer progression. In one patient, tyrosine kinase inhibitor was stopped after an initial response and the tumor-associated EGFR mutation reemerged 4 weeks after stopping treatment. The sensitive detection and accurate quantification of low abundance EGFR mutations in tumor tissues and plasma by microfluidics digital PCR would be useful for predicting treatment response, monitoring disease progression and early detection of treatment failure associated with acquired drug resistance.

  6. High-resolution community profiling of arbuscular mycorrhizal fungi.

    PubMed

    Schlaeppi, Klaus; Bender, S Franz; Mascher, Fabio; Russo, Giancarlo; Patrignani, Andrea; Camenzind, Tessa; Hempel, Stefan; Rillig, Matthias C; van der Heijden, Marcel G A

    2016-11-01

    Community analyses of arbuscular mycorrhizal fungi (AMF) using ribosomal small subunit (SSU) or internal transcribed spacer (ITS) DNA sequences often suffer from low resolution or coverage. We developed a novel sequencing based approach for a highly resolving and specific profiling of AMF communities. We took advantage of previously established AMF-specific PCR primers that amplify a c. 1.5-kb long fragment covering parts of SSU, ITS and parts of the large ribosomal subunit (LSU), and we sequenced the resulting amplicons with single molecule real-time (SMRT) sequencing. The method was applicable to soil and root samples, detected all major AMF families and successfully discriminated closely related AMF species, which would not be discernible using SSU sequences. In inoculation tests we could trace the introduced AMF inoculum at the molecular level. One of the introduced strains almost replaced the local strain(s), revealing that AMF inoculation can have a profound impact on the native community. The methodology presented offers researchers a powerful new tool for AMF community analysis because it unifies improved specificity and enhanced resolution, whereas the drawback of medium sequencing throughput appears of lesser importance for low-diversity groups such as AMF. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.

  7. A simple procedure for parallel sequence analysis of both strands of 5'-labeled DNA.

    PubMed

    Razvi, F; Gargiulo, G; Worcel, A

    1983-08-01

    Ligation of a 5'-labeled DNA restriction fragment results in a circular DNA molecule carrying the two 32Ps at the reformed restriction site. Double digestions of the circular DNA with the original enzyme and a second restriction enzyme cleavage near the labeled site allows direct chemical sequencing of one 5'-labeled DNA strand. Similar double digestions, using an isoschizomer that cleaves differently at the 32P-labeled site, allows direct sequencing of the now 3'-labeled complementary DNA strand. It is possible to directly sequence both strands of cloned DNA inserts by using the above protocol and a multiple cloning site vector that provides the necessary restriction sites. The simultaneous and parallel visualization of both DNA strands eliminates sequence ambiguities. In addition, the labeled circular molecules are particularly useful for single-hit DNA cleavage studies and DNA footprint analysis. As an example, we show here an analysis of the micrococcal nuclease-induced breaks on the two strands of the somatic 5S RNA gene of Xenopus borealis, which suggests that the enzyme may recognize and cleave small AT-containing palindromes along the DNA helix.

  8. Meeting report: SMART timing--principles of single molecule techniques course at the University of Michigan 2014.

    PubMed

    Bartke, Rebecca M; Cameron, Elizabeth L; Cristie-David, Ajitha S; Custer, Thomas C; Denies, Maxwell S; Daher, May; Dhakal, Soma; Ghosh, Soumi; Heinicke, Laurie A; Hoff, J Damon; Hou, Qian; Kahlscheuer, Matthew L; Karslake, Joshua; Krieger, Adam G; Li, Jieming; Li, Xiang; Lund, Paul E; Vo, Nguyen N; Park, Jun; Pitchiaya, Sethuramasundaram; Rai, Victoria; Smith, David J; Suddala, Krishna C; Wang, Jiarui; Widom, Julia R; Walter, Nils G

    2015-05-01

    Four days after the announcement of the 2014 Nobel Prize in Chemistry for "the development of super-resolved fluorescence microscopy" based on single molecule detection, the Single Molecule Analysis in Real-Time (SMART) Center at the University of Michigan hosted a "Principles of Single Molecule Techniques 2014" course. Through a combination of plenary lectures and an Open House at the SMART Center, the course took a snapshot of a technology with an especially broad and rapidly expanding range of applications in the biomedical and materials sciences. Highlighting the continued rapid emergence of technical and scientific advances, the course underscored just how brightly the future of the single molecule field shines. © 2014 Wiley Periodicals, Inc.

  9. Single-Molecule Electronics: Chemical and Analytical Perspectives.

    PubMed

    Nichols, Richard J; Higgins, Simon J

    2015-01-01

    It is now possible to measure the electrical properties of single molecules using a variety of techniques including scanning probe microcopies and mechanically controlled break junctions. Such measurements can be made across a wide range of environments including ambient conditions, organic liquids, ionic liquids, aqueous solutions, electrolytes, and ultra high vacuum. This has given new insights into charge transport across molecule electrical junctions, and these experimental methods have been complemented with increasingly sophisticated theory. This article reviews progress in single-molecule electronics from a chemical perspective and discusses topics such as the molecule-surface coupling in electrical junctions, chemical control, and supramolecular interactions in junctions and gating charge transport. The article concludes with an outlook regarding chemical analysis based on single-molecule conductance.

  10. Basic quantitative polymerase chain reaction using real-time fluorescence measurements.

    PubMed

    Ares, Manuel

    2014-10-01

    This protocol uses quantitative polymerase chain reaction (qPCR) to measure the number of DNA molecules containing a specific contiguous sequence in a sample of interest (e.g., genomic DNA or cDNA generated by reverse transcription). The sample is subjected to fluorescence-based PCR amplification and, theoretically, during each cycle, two new duplex DNA molecules are produced for each duplex DNA molecule present in the sample. The progress of the reaction during PCR is evaluated by measuring the fluorescence of dsDNA-dye complexes in real time. In the early cycles, DNA duplication is not detected because inadequate amounts of DNA are made. At a certain threshold cycle, DNA-dye complexes double each cycle for 8-10 cycles, until the DNA concentration becomes so high and the primer concentration so low that the reassociation of the product strands blocks efficient synthesis of new DNA and the reaction plateaus. There are two types of measurements: (1) the relative change of the target sequence compared to a reference sequence and (2) the determination of molecule number in the starting sample. The first requires a reference sequence, and the second requires a sample of the target sequence with known numbers of the molecules of sequence to generate a standard curve. By identifying the threshold cycle at which a sample first begins to accumulate DNA-dye complexes exponentially, an estimation of the numbers of starting molecules in the sample can be extrapolated. © 2014 Cold Spring Harbor Laboratory Press.

  11. The methylome and virulence of bovine respiratory disease bacterial pathogens

    USDA-ARS?s Scientific Manuscript database

    With the advent of single molecule, real-time (SMRT®) sequencing, it is now possible to study complete microbial epigenomes. It has been known for decades that methylation and other types of epigenetic modifications in bacteria are responsible for much more than restriction-modification mechanics, b...

  12. Electrochemical sensing and biosensing platform based on chemically reduced graphene oxide.

    PubMed

    Zhou, Ming; Zhai, Yueming; Dong, Shaojun

    2009-07-15

    In this paper, the characterization and application of a chemically reduced graphene oxide modified glassy carbon (CR-GO/GC) electrode, a novel electrode system, for the preparation of electrochemical sensing and biosensing platform are proposed. Different kinds of important inorganic and organic electroactive compounds (i.e., probe molecule (potassium ferricyanide), free bases of DNA (guanine (G), adenine (A), thymine (T), and cytosine (C)), oxidase/dehydrogenase-related molecules (hydrogen peroxide (H2O2)/beta-nicotinamide adenine dinucleotide (NADH)), neurotransmitters (dopamine (DA)), and other biological molecules (ascorbic acid (AA), uric acid (UA), and acetaminophen (APAP)) were employed to study their electrochemical responses at the CR-GO/GC electrode, which shows more favorable electron transfer kinetics than graphite modified glassy carbon (graphite/GC) and glassy carbon (GC) electrodes. The greatly enhanced electrochemical reactivity of the four free bases of DNA at the CR-GO/GC electrode compared with that at graphite/GC and GC electrodes makes the CR-GO/GC electrode a better choice for the electrochemical biosensing of four DNA bases in both the single-stranded DNA (ssDNA) and double-stranded DNA (dsDNA) at physiological pH without a prehydrolysis step. This allows us to detect a single-nucleotide polymorphism (SNP) site for short oligomers with a particular sequence at the CR-GO/GC electrode without any hybridization or labeling processes in this work, suggesting the potential applications of CR-GO in the label-free electrochemical detection of DNA hybridization or DNA damage for further research. Based on the greatly enhanced electrochemical reactivity of H2O2 and NADH at the CR-GO/GC electrode, CR-GO/GC electrode-based bioelectrodes (in connection with glucose oxidase (GOD) and alcohol dehydrogenase (ADH)) show a better analytical performance for the detection of glucose and ethanol compared with graphite/GC- or GC-based bioelectrodes. By comparing the electrochemical performance of CR-GO with that of the conventional graphite and GC, we reveal that CR-GO with the nature of a single sheet showing favorable electrochemical activity should be a kind of more robust and advanced carbon electrode material which may hold great promise for electrochemical sensors and biosensors design.

  13. Probes labelled with energy transfer coupled dyes

    DOEpatents

    Mathies, R.A.; Glazer, A.; Ju, J.

    1997-11-18

    Compositions are provided comprising sets of fluorescent labels carrying pairs of donor and acceptor dye molecules, designed for efficient excitation of the donors at a single wavelength and emission from the acceptor in each of the pairs at different wavelengths. The different molecules having different donor-acceptor pairs can be modified to have substantially the same mobility under separation conditions, by varying the distance between the donor and acceptor in a given pair. Particularly, the fluorescent compositions find use as labels in sequencing nucleic acids. 7 figs.

  14. Fluorescent labels and their use in separations

    DOEpatents

    Mathies, Richard A.; Glazer, Alexander; Ju, Jingyue

    1997-01-01

    Compositions are provided comprising sets of fluorescent labels carrying pairs of donor and acceptor dye molecules, designed for efficient excitation of the donors at a single wavelength and emission from the acceptor in each of the pairs at different wavelengths. The different molecules having different donor-acceptor pairs can be modified to have substantially the same mobility under separation conditions, by varying the distance between the donor and acceptor in a given pair. Particularly, the fluorescent compositions find use as labels in sequencing nucleic acids.

  15. Probes labelled with energy transfer coupled dyes

    DOEpatents

    Mathies, Richard A.; Glazer, Alexander; Ju, Jingyue

    1997-01-01

    Compositions are provided comprising sets of fluorescent labels carrying pairs of donor and acceptor dye molecules, designed for efficient excitation of the donors at a single wavelength and emission from the acceptor in each of the pairs at different wavelengths. The different molecules having different donor-acceptor pairs can be modified to have substantially the same mobility under separation conditions, by varying the distance between the donor and acceptor in a given pair. Particularly, the fluorescent compositions find use as labels in sequencing nucleic acids.

  16. The Complete Chloroplast Genome of Banana (Musa acuminata, Zingiberales): Insight into Plastid Monocotyledon Evolution

    PubMed Central

    Martin, Guillaume; Baurens, Franc-Christophe; Cardi, Céline; Aury, Jean-Marc; D’Hont, Angélique

    2013-01-01

    Background Banana (genus Musa) is a crop of major economic importance worldwide. It is a monocotyledonous member of the Zingiberales, a sister group of the widely studied Poales. Most cultivated bananas are natural Musa inter-(sub-)specific triploid hybrids. A Musa acuminata reference nuclear genome sequence was recently produced based on sequencing of genomic DNA enriched in nucleus. Methodology/Principal Findings The Musa acuminata chloroplast genome was assembled with chloroplast reads extracted from whole-genome-shotgun sequence data. The Musa chloroplast genome is a circular molecule of 169,972 bp with a quadripartite structure containing two single copy regions, a Large Single Copy region (LSC, 88,338 bp) and a Small Single Copy region (SSC, 10,768 bp) separated by Inverted Repeat regions (IRs, 35,433 bp). Two forms of the chloroplast genome relative to the orientation of SSC versus LSC were found. The Musa chloroplast genome shows an extreme IR expansion at the IR/SSC boundary relative to the most common structures found in angiosperms. This expansion consists of the integration of three additional complete genes (rps15, ndhH and ycf1) and part of the ndhA gene. No such expansion has been observed in monocots so far. Simple Sequence Repeats were identified in the Musa chloroplast genome and a new set of Musa chloroplastic markers was designed. Conclusion The complete sequence of M. acuminata ssp malaccensis chloroplast we reported here is the first one for the Zingiberales order. As such it provides new insight in the evolution of the chloroplast of monocotyledons. In particular, it reinforces that IR/SSC expansion has occurred independently several times within monocotyledons. The discovery of new polymorphic markers within Musa chloroplast opens new perspectives to better understand the origin of cultivated triploid bananas. PMID:23840670

  17. The complete chloroplast genome of banana (Musa acuminata, Zingiberales): insight into plastid monocotyledon evolution.

    PubMed

    Martin, Guillaume; Baurens, Franc-Christophe; Cardi, Céline; Aury, Jean-Marc; D'Hont, Angélique

    2013-01-01

    Banana (genus Musa) is a crop of major economic importance worldwide. It is a monocotyledonous member of the Zingiberales, a sister group of the widely studied Poales. Most cultivated bananas are natural Musa inter-(sub-)specific triploid hybrids. A Musa acuminata reference nuclear genome sequence was recently produced based on sequencing of genomic DNA enriched in nucleus. The Musa acuminata chloroplast genome was assembled with chloroplast reads extracted from whole-genome-shotgun sequence data. The Musa chloroplast genome is a circular molecule of 169,972 bp with a quadripartite structure containing two single copy regions, a Large Single Copy region (LSC, 88,338 bp) and a Small Single Copy region (SSC, 10,768 bp) separated by Inverted Repeat regions (IRs, 35,433 bp). Two forms of the chloroplast genome relative to the orientation of SSC versus LSC were found. The Musa chloroplast genome shows an extreme IR expansion at the IR/SSC boundary relative to the most common structures found in angiosperms. This expansion consists of the integration of three additional complete genes (rps15, ndhH and ycf1) and part of the ndhA gene. No such expansion has been observed in monocots so far. Simple Sequence Repeats were identified in the Musa chloroplast genome and a new set of Musa chloroplastic markers was designed. The complete sequence of M. acuminata ssp malaccensis chloroplast we reported here is the first one for the Zingiberales order. As such it provides new insight in the evolution of the chloroplast of monocotyledons. In particular, it reinforces that IR/SSC expansion has occurred independently several times within monocotyledons. The discovery of new polymorphic markers within Musa chloroplast opens new perspectives to better understand the origin of cultivated triploid bananas.

  18. Overview of Single-Molecule Speckle (SiMS) Microscopy and Its Electroporation-Based Version with Efficient Labeling and Improved Spatiotemporal Resolution.

    PubMed

    Yamashiro, Sawako; Watanabe, Naoki

    2017-07-06

    Live-cell single-molecule imaging was introduced more than a decade ago, and has provided critical information on remodeling of the actin cytoskeleton, the motion of plasma membrane proteins, and dynamics of molecular motor proteins. Actin remodeling has been the best target for this approach because actin and its associated proteins stop diffusing when assembled, allowing visualization of single-molecules of fluorescently-labeled proteins in a state specific manner. The approach based on this simple principle is called Single-Molecule Speckle (SiMS) microscopy. For instance, spatiotemporal regulation of actin polymerization and lifetime distribution of actin filaments can be monitored directly by tracking actin SiMS. In combination with fluorescently labeled probes of various actin regulators, SiMS microscopy has contributed to clarifying the processes underlying recycling, motion and remodeling of the live-cell actin network. Recently, we introduced an electroporation-based method called eSiMS microscopy, with high efficiency, easiness and improved spatiotemporal precision. In this review, we describe the application of live-cell single-molecule imaging to cellular actin dynamics and discuss the advantages of eSiMS microscopy over previous SiMS microscopy.

  19. LongISLND: in silico sequencing of lengthy and noisy datatypes

    PubMed Central

    Lau, Bayo; Mohiyuddin, Marghoob; Mu, John C.; Fang, Li Tai; Bani Asadi, Narges; Dallett, Carolina; Lam, Hugo Y. K.

    2016-01-01

    Summary: LongISLND is a software package designed to simulate sequencing data according to the characteristics of third generation, single-molecule sequencing technologies. The general software architecture is easily extendable, as demonstrated by the emulation of Pacific Biosciences (PacBio) multi-pass sequencing with P5 and P6 chemistries, producing data in FASTQ, H5, and the latest PacBio BAM format. We demonstrate its utility by downstream processing with consensus building and variant calling. Availability and Implementation: LongISLND is implemented in Java and available at http://bioinform.github.io/longislnd Contact: hugo.lam@roche.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27667791

  20. Sequence-specific sepsis-related DNA capture and fluorescent labeling in monoliths prepared by single-step photopolymerization in microfluidic devices.

    PubMed

    Knob, Radim; Hanson, Robert L; Tateoka, Olivia B; Wood, Ryan L; Guerrero-Arguero, Israel; Robison, Richard A; Pitt, William G; Woolley, Adam T

    2018-05-21

    Fast determination of antibiotic resistance is crucial in selecting appropriate treatment for sepsis patients, but current methods based on culture are time consuming. We are developing a microfluidic platform with a monolithic column modified with oligonucleotides designed for sequence-specific capture of target DNA related to the Klebsiella pneumoniae carbapenemase (KPC) gene. We developed a novel single-step monolith fabrication method with an acrydite-modified capture oligonucleotide in the polymerization mixture, enabling fast monolith preparation in a microfluidic channel using UV photopolymerization. These prepared columns had a threefold higher capacity compared to monoliths prepared in a multistep process involving Schiff-base DNA attachment. Conditions for denaturing, capture and fluorescence labeling using hybridization probes were optimized with synthetic 90-mer oligonucleotides. These procedures were applied for extraction of a PCR amplicon from the KPC antibiotic resistance gene in bacterial lysate obtained from a blood sample spiked with E. coli. The results showed similar eluted peak areas for KPC amplicon extracted from either hybridization buffer or bacterial lysate. Selective extraction of the KPC DNA was verified by real time PCR on eluted fractions. These results show great promise for application in an integrated microfluidic diagnostic system that combines upstream blood sample preparation and downstream single-molecule counting detection. Copyright © 2018 Elsevier B.V. All rights reserved.

  1. DNA-Based Single-Molecule Electronics: From Concept to Function.

    PubMed

    Wang, Kun

    2018-01-17

    Beyond being the repository of genetic information, DNA is playing an increasingly important role as a building block for molecular electronics. Its inherent structural and molecular recognition properties render it a leading candidate for molecular electronics applications. The structural stability, diversity and programmability of DNA provide overwhelming freedom for the design and fabrication of molecular-scale devices. In the past two decades DNA has therefore attracted inordinate amounts of attention in molecular electronics. This review gives a brief survey of recent experimental progress in DNA-based single-molecule electronics with special focus on single-molecule conductance and I-V characteristics of individual DNA molecules. Existing challenges and exciting future opportunities are also discussed.

  2. DNA-Based Single-Molecule Electronics: From Concept to Function

    PubMed Central

    2018-01-01

    Beyond being the repository of genetic information, DNA is playing an increasingly important role as a building block for molecular electronics. Its inherent structural and molecular recognition properties render it a leading candidate for molecular electronics applications. The structural stability, diversity and programmability of DNA provide overwhelming freedom for the design and fabrication of molecular-scale devices. In the past two decades DNA has therefore attracted inordinate amounts of attention in molecular electronics. This review gives a brief survey of recent experimental progress in DNA-based single-molecule electronics with special focus on single-molecule conductance and I–V characteristics of individual DNA molecules. Existing challenges and exciting future opportunities are also discussed. PMID:29342091

  3. Structator: fast index-based search for RNA sequence-structure patterns

    PubMed Central

    2011-01-01

    Background The secondary structure of RNA molecules is intimately related to their function and often more conserved than the sequence. Hence, the important task of searching databases for RNAs requires to match sequence-structure patterns. Unfortunately, current tools for this task have, in the best case, a running time that is only linear in the size of sequence databases. Furthermore, established index data structures for fast sequence matching, like suffix trees or arrays, cannot benefit from the complementarity constraints introduced by the secondary structure of RNAs. Results We present a novel method and readily applicable software for time efficient matching of RNA sequence-structure patterns in sequence databases. Our approach is based on affix arrays, a recently introduced index data structure, preprocessed from the target database. Affix arrays support bidirectional pattern search, which is required for efficiently handling the structural constraints of the pattern. Structural patterns like stem-loops can be matched inside out, such that the loop region is matched first and then the pairing bases on the boundaries are matched consecutively. This allows to exploit base pairing information for search space reduction and leads to an expected running time that is sublinear in the size of the sequence database. The incorporation of a new chaining approach in the search of RNA sequence-structure patterns enables the description of molecules folding into complex secondary structures with multiple ordered patterns. The chaining approach removes spurious matches from the set of intermediate results, in particular of patterns with little specificity. In benchmark experiments on the Rfam database, our method runs up to two orders of magnitude faster than previous methods. Conclusions The presented method's sublinear expected running time makes it well suited for RNA sequence-structure pattern matching in large sequence databases. RNA molecules containing several stem-loop substructures can be described by multiple sequence-structure patterns and their matches are efficiently handled by a novel chaining method. Beyond our algorithmic contributions, we provide with Structator a complete and robust open-source software solution for index-based search of RNA sequence-structure patterns. The Structator software is available at http://www.zbh.uni-hamburg.de/Structator. PMID:21619640

  4. Axial Colocalization of Single Molecules with Nanometer Accuracy Using Metal-Induced Energy Transfer.

    PubMed

    Isbaner, Sebastian; Karedla, Narain; Kaminska, Izabela; Ruhlandt, Daja; Raab, Mario; Bohlen, Johann; Chizhik, Alexey; Gregor, Ingo; Tinnefeld, Philip; Enderlein, Jörg; Tsukanov, Roman

    2018-04-11

    Single-molecule localization based super-resolution microscopy has revolutionized optical microscopy and routinely allows for resolving structural details down to a few nanometers. However, there exists a rather large discrepancy between lateral and axial localization accuracy, the latter typically three to five times worse than the former. Here, we use single-molecule metal-induced energy transfer (smMIET) to localize single molecules along the optical axis, and to measure their axial distance with an accuracy of 5 nm. smMIET relies only on fluorescence lifetime measurements and does not require additional complex optical setups.

  5. Discrimination among individual Watson–Crick base pairs at the termini of single DNA hairpin molecules

    PubMed Central

    Vercoutere, Wenonah A.; Winters-Hilt, Stephen; DeGuzman, Veronica S.; Deamer, David; Ridino, Sam E.; Rodgers, Joseph T.; Olsen, Hugh E.; Marziali, Andre; Akeson, Mark

    2003-01-01

    Nanoscale α-hemolysin pores can be used to analyze individual DNA or RNA molecules. Serial examination of hundreds to thousands of molecules per minute is possible using ionic current impedance as the measured property. In a recent report, we showed that a nanopore device coupled with machine learning algorithms could automatically discriminate among the four combinations of Watson–Crick base pairs and their orientations at the ends of individual DNA hairpin molecules. Here we use kinetic analysis to demonstrate that ionic current signatures caused by these hairpin molecules depend on the number of hydrogen bonds within the terminal base pair, stacking between the terminal base pair and its nearest neighbor, and 5′ versus 3′ orientation of the terminal bases independent of their nearest neighbors. This report constitutes evidence that single Watson–Crick base pairs can be identified within individual unmodified DNA hairpin molecules based on their dynamic behavior in a nanoscale pore. PMID:12582251

  6. In silico Derivation of HLA-Specific Alloreactivity Potential from Whole Exome Sequencing of Stem-Cell Transplant Donors and Recipients: Understanding the Quantitative Immunobiology of Allogeneic Transplantation

    PubMed Central

    Jameson-Lee, Max; Koparde, Vishal; Griffith, Phil; Scalora, Allison F.; Sampson, Juliana K.; Khalid, Haniya; Sheth, Nihar U.; Batalo, Michael; Serrano, Myrna G.; Roberts, Catherine H.; Hess, Michael L.; Buck, Gregory A.; Neale, Michael C.; Manjili, Masoud H.; Toor, Amir Ahmed

    2014-01-01

    Donor T-cell mediated graft versus host (GVH) effects may result from the aggregate alloreactivity to minor histocompatibility antigens (mHA) presented by the human leukocyte antigen (HLA) molecules in each donor–recipient pair undergoing stem-cell transplantation (SCT). Whole exome sequencing has previously demonstrated a large number of non-synonymous single nucleotide polymorphisms (SNP) present in HLA-matched recipients of SCT donors (GVH direction). The nucleotide sequence flanking each of these SNPs was obtained and the amino acid sequence determined. All the possible nonameric peptides incorporating the variant amino acid resulting from these SNPs were interrogated in silico for their likelihood to be presented by the HLA class I molecules using the Immune Epitope Database stabilized matrix method (SMM) and NetMHCpan algorithms. The SMM algorithm predicted that a median of 18,396 peptides weakly bound HLA class I molecules in individual SCT recipients, and 2,254 peptides displayed strong binding. A similar library of presented peptides was identified when the data were interrogated using the NetMHCpan algorithm. The bioinformatic algorithm presented here demonstrates that there may be a high level of mHA variation in HLA-matched individuals, constituting a HLA-specific alloreactivity potential. PMID:25414699

  7. In silico Derivation of HLA-Specific Alloreactivity Potential from Whole Exome Sequencing of Stem-Cell Transplant Donors and Recipients: Understanding the Quantitative Immunobiology of Allogeneic Transplantation.

    PubMed

    Jameson-Lee, Max; Koparde, Vishal; Griffith, Phil; Scalora, Allison F; Sampson, Juliana K; Khalid, Haniya; Sheth, Nihar U; Batalo, Michael; Serrano, Myrna G; Roberts, Catherine H; Hess, Michael L; Buck, Gregory A; Neale, Michael C; Manjili, Masoud H; Toor, Amir Ahmed

    2014-01-01

    Donor T-cell mediated graft versus host (GVH) effects may result from the aggregate alloreactivity to minor histocompatibility antigens (mHA) presented by the human leukocyte antigen (HLA) molecules in each donor-recipient pair undergoing stem-cell transplantation (SCT). Whole exome sequencing has previously demonstrated a large number of non-synonymous single nucleotide polymorphisms (SNP) present in HLA-matched recipients of SCT donors (GVH direction). The nucleotide sequence flanking each of these SNPs was obtained and the amino acid sequence determined. All the possible nonameric peptides incorporating the variant amino acid resulting from these SNPs were interrogated in silico for their likelihood to be presented by the HLA class I molecules using the Immune Epitope Database stabilized matrix method (SMM) and NetMHCpan algorithms. The SMM algorithm predicted that a median of 18,396 peptides weakly bound HLA class I molecules in individual SCT recipients, and 2,254 peptides displayed strong binding. A similar library of presented peptides was identified when the data were interrogated using the NetMHCpan algorithm. The bioinformatic algorithm presented here demonstrates that there may be a high level of mHA variation in HLA-matched individuals, constituting a HLA-specific alloreactivity potential.

  8. Computational Approaches for Decoding Select Odorant-Olfactory Receptor Interactions Using Mini-Virtual Screening

    PubMed Central

    Harini, K.; Sowdhamini, Ramanathan

    2015-01-01

    Olfactory receptors (ORs) belong to the class A G-Protein Coupled Receptor superfamily of proteins. Unlike G-Protein Coupled Receptors, ORs exhibit a combinatorial response to odors/ligands. ORs display an affinity towards a range of odor molecules rather than binding to a specific set of ligands and conversely a single odorant molecule may bind to a number of olfactory receptors with varying affinities. The diversity in odor recognition is linked to the highly variable transmembrane domains of these receptors. The purpose of this study is to decode the odor-olfactory receptor interactions using in silico docking studies. In this study, a ligand (odor molecules) dataset of 125 molecules was used to carry out in silico docking using the GLIDE docking tool (SCHRODINGER Inc Pvt LTD). Previous studies, with smaller datasets of ligands, have shown that orthologous olfactory receptors respond to similarly-tuned ligands, but are dramatically different in their efficacy and potency. Ligand docking results were applied on homologous pairs (with varying sequence identity) of ORs from human and mouse genomes and ligand binding residues and the ligand profile differed among such related olfactory receptor sequences. This study revealed that homologous sequences with high sequence identity need not bind to the same/ similar ligand with a given affinity. A ligand profile has been obtained for each of the 20 receptors in this analysis which will be useful for expression and mutation studies on these receptors. PMID:26221959

  9. Mitochondrial genome of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa): A linear DNA molecule encoding a putative DNA-dependent DNA polymerase.

    PubMed

    Shao, Zhiyong; Graf, Shannon; Chaga, Oleg Y; Lavrov, Dennis V

    2006-10-15

    The 16,937-nuceotide sequence of the linear mitochondrial DNA (mt-DNA) molecule of the moon jelly Aurelia aurita (Cnidaria, Scyphozoa) - the first mtDNA sequence from the class Scypozoa and the first sequence of a linear mtDNA from Metazoa - has been determined. This sequence contains genes for 13 energy pathway proteins, small and large subunit rRNAs, and methionine and tryptophan tRNAs. In addition, two open reading frames of 324 and 969 base pairs in length have been found. The deduced amino-acid sequence of one of them, ORF969, displays extensive sequence similarity with the polymerase [but not the exonuclease] domain of family B DNA polymerases, and this ORF has been tentatively identified as dnab. This is the first report of dnab in animal mtDNA. The genes in A. aurita mtDNA are arranged in two clusters with opposite transcriptional polarities; transcription proceeding toward the ends of the molecule. The determined sequences at the ends of the molecule are nearly identical but inverted and lack any obvious potential secondary structures or telomere-like repeat elements. The acquisition of mitochondrial genomic data for the second class of Cnidaria allows us to reconstruct characteristic features of mitochondrial evolution in this animal phylum.

  10. Probe-based measurement of lateral single-electron transfer between individual molecules

    PubMed Central

    Steurer, Wolfram; Fatayer, Shadi; Gross, Leo; Meyer, Gerhard

    2015-01-01

    The field of molecular electronics aims at using single molecules as functional building blocks for electronics components, such as switches, rectifiers or transistors. A key challenge is to perform measurements with atomistic control over the alignment of the molecule and its contacting electrodes. Here we use atomic force microscopy to examine charge transfer between weakly coupled pentacene molecules on insulating films with single-electron sensitivity and control over the atomistic details. We show that, in addition to the imaging capability, the probe tip can be used to control the charge state of individual molecules and to detect charge transfers to/from the tip, as well as between individual molecules. Our approach represents a novel route for molecular charge transfer studies with a host of opportunities, especially in combination with single atom/molecule manipulation and nanopatterning techniques. PMID:26387533

  11. Hardware solution for continuous time-resolved burst detection of single molecules in flow

    NASA Astrophysics Data System (ADS)

    Wahl, Michael; Erdmann, Rainer; Lauritsen, Kristian; Rahn, Hans-Juergen

    1998-04-01

    Time Correlated Single Photon Counting (TCSPC) is a valuable tool for Single Molecule Detection (SMD). However, existing TCSPC systems did not support continuous data collection and processing as is desirable for applications such as SMD for e.g. DNA-sequencing in a liquid flow. First attempts at using existing instrumentation in this kind of operation mode required additional routing hardware to switch between several memory banks and were not truly continuous. We have designed a hard- and software system to perform continuous real-time TCSPC based upon a modern solid state Time to Digital Converter (TDC). Short dead times of the fully digital TDC design combined with fast Field Programmable Gay Array logic permit a continuous data throughput as high as 3 Mcounts/sec. The histogramming time may be set as short as 100 microsecond(s) . Every histogram or every single fluorescence photon can be real-time tagged at 200 ns resolution in addition to recording its arrival time relative to the excitation pulse. Continuous switching between memory banks permits concurrent histogramming and data read-out. The instrument provides a time resolution of 60 ps and up to 4096 histogram channels. The overall instrument response function in combination with a low cost picosecond diode laser and an inexpensive photomultiplier tube was found to be 180 ps and well sufficient to measure sub-nanosecond fluorescence lifetimes.

  12. Single-molecule sequencing and Hi-C-based proximity-guided assembly of amaranth (Amaranthus hypochondriacus) chromosomes provide insights into genome evolution.

    PubMed

    Lightfoot, D J; Jarvis, D E; Ramaraj, T; Lee, R; Jellen, E N; Maughan, P J

    2017-08-31

    Amaranth (Amaranthus hypochondriacus) was a food staple among the ancient civilizations of Central and South America that has recently received increased attention due to the high nutritional value of the seeds, with the potential to help alleviate malnutrition and food security concerns, particularly in arid and semiarid regions of the developing world. Here, we present a reference-quality assembly of the amaranth genome which will assist the agronomic development of the species. Utilizing single-molecule, real-time sequencing (Pacific Biosciences) and chromatin interaction mapping (Hi-C) to close assembly gaps and scaffold contigs, respectively, we improved our previously reported Illumina-based assembly to produce a chromosome-scale assembly with a scaffold N50 of 24.4 Mb. The 16 largest scaffolds contain 98% of the assembly and likely represent the haploid chromosomes (n = 16). To demonstrate the accuracy and utility of this approach, we produced physical and genetic maps and identified candidate genes for the betalain pigmentation pathway. The chromosome-scale assembly facilitated a genome-wide syntenic comparison of amaranth with other Amaranthaceae species, revealing chromosome loss and fusion events in amaranth that explain the reduction from the ancestral haploid chromosome number (n = 18) for a tetraploid member of the Amaranthaceae. The assembly method reported here minimizes cost by relying primarily on short-read technology and is one of the first reported uses of in vivo Hi-C for assembly of a plant genome. Our analyses implicate chromosome loss and fusion as major evolutionary events in the 2n = 32 amaranths and clearly establish the homoeologous relationship among most of the subgenome chromosomes, which will facilitate future investigations of intragenomic changes that occurred post polyploidization.

  13. Genealogical analyses of multiple loci of litostomatean ciliates (Protista, Ciliophora, Litostomatea)

    PubMed Central

    Vd’ačný, Peter; Bourland, William A.; Orsi, William; Epstein, Slava S.; Foissner, Wilhelm

    2012-01-01

    The class Litostomatea is a highly diverse ciliate taxon comprising hundreds of free-living and endocommensal species. However, their traditional morphology-based classification conflicts with 18S rRNA gene phylogenies indicating (1) a deep bifurcation of the Litostomatea into Rhynchostomatia and Haptoria + Trichostomatia, and (2) body polarization and simplification of the oral apparatus as main evolutionary trends in the Litostomatea. To test whether 18S rRNA molecules provide a suitable proxy for litostomatean evolutionary history, we used eighteen new ITS1-5.8S rRNA-ITS2 region sequences from various free-living litostomatean orders. These single- and multiple-locus analyses are in agreement with previous 18S rRNA gene phylogenies, supporting that both 18S rRNA gene and ITS region sequences are effective tools for resolving phylogenetic relationships among the litostomateans. Despite insertions, deletions and mutational saturations in the ITS region, the present study shows that ITS1 and ITS2 molecules can be used to infer phylogenetic relationships not only at species level but also at higher taxonomic ranks when their secondary structure information is utilized to aid alignment. PMID:22789763

  14. Genealogical analyses of multiple loci of litostomatean ciliates (Protista, Ciliophora, Litostomatea).

    PubMed

    Vd'ačný, Peter; Bourland, William A; Orsi, William; Epstein, Slava S; Foissner, Wilhelm

    2012-11-01

    The class Litostomatea is a highly diverse ciliate taxon comprising hundreds of free-living and endocommensal species. However, their traditional morphology-based classification conflicts with 18S rRNA gene phylogenies indicating (1) a deep bifurcation of the Litostomatea into Rhynchostomatia and Haptoria+Trichostomatia, and (2) body polarization and simplification of the oral apparatus as main evolutionary trends in the Litostomatea. To test whether 18S rRNA molecules provide a suitable proxy for litostomatean evolutionary history, we used eighteen new ITS1-5.8S rRNA-ITS2 region sequences from various free-living litostomatean orders. These single- and multiple-locus analyses are in agreement with previous 18S rRNA gene phylogenies, supporting that both 18S rRNA gene and ITS region sequences are effective tools for resolving phylogenetic relationships among the litostomateans. Despite insertions, deletions and mutational saturations in the ITS region, the present study shows that ITS1 and ITS2 molecules can be used to infer phylogenetic relationships not only at species level but also at higher taxonomic ranks when their secondary structure information is utilized to aid alignment. Copyright © 2012 Elsevier Inc. All rights reserved.

  15. Unveiling the complexity of the maize transcriptome by single-molecule long-read sequencing

    USDA-ARS?s Scientific Manuscript database

    Zea mays is an important crop species and genetic model for elucidating transcriptional networks in plants. Uncertainties about the complete structure of mRNA transcripts, particularly with respect to alternatively spliced isoforms, limit the progress of research in this system. In this study, we us...

  16. CoSMoS Unravels Mysteries of Transcription Initiation

    PubMed Central

    Gourse, Richard L.; Landick, Robert

    2013-01-01

    Using a fluorescence method called colocalization single-molecule spectroscopy (CoSMoS), Friedman and Gelles dissect the kinetics of transcription initiation at a bacterial promoter. Ultimately, CoSMoS could greatly aid the study of the effects of DNA sequence and transcription factors on both prokaryotic and eukaryotic promoters. PMID:22341438

  17. Mapping the yeast genome by melting in nanofluidic devices

    NASA Astrophysics Data System (ADS)

    Welch, Robert L.; Czolkos, Ilja; Sladek, Rob; Reisner, Walter

    2012-02-01

    Optical mapping of DNA provides large-scale genomic information that can be used to assemble contigs from next-generation sequencing, and to detect re-arrangements between single cells. A recent optical mapping technique called denaturation mapping has the unique advantage of using physical principles rather than the action of enzymes to probe genomic structure. The absence of reagents or reaction steps makes denaturation mapping simpler than other protocols. Denaturation mapping uses fluorescence microscopy to image the pattern of partial melting along a DNA molecule extended in a channel of cross-section ˜100nm at the heart of a nanofluidic device. We successfully aligned melting maps from single DNA molecules to a theoretical map of the yeast genome (11.6Mbp) to identify their location. By aligning hundreds of molecules we assembled a consensus melting map of the yeast genome with 95% coverage.

  18. DNA Photo Lithography with Cinnamate-based Photo-Bio-Nano-Glue

    NASA Astrophysics Data System (ADS)

    Feng, Lang; Li, Minfeng; Romulus, Joy; Sha, Ruojie; Royer, John; Wu, Kun-Ta; Xu, Qin; Seeman, Nadrian; Weck, Marcus; Chaikin, Paul

    2013-03-01

    We present a technique to make patterned functional surfaces, using a cinnamate photo cross-linker and photolithography. We have designed and modified a complementary set of single DNA strands to incorporate a pair of opposing cinnamate molecules. On exposure to 360nm UV, the cinnamate makes a highly specific covalent bond permanently linking only the complementary strands containing the cinnamates. We have studied this specific and efficient crosslinking with cinnamate-containing DNA in solution and on particles. UV addressability allows us to pattern surfaces functionally. The entire surface is coated with a DNA sequence A incorporating cinnamate. DNA strands A'B with one end containing a complementary cinnamated sequence A' attached to another sequence B, are then hybridized to the surface. UV photolithography is used to bind the A'B strand in a specific pattern. The system is heated and the unbound DNA is washed away. The pattern is then observed by thermo-reversibly hybridizing either fluorescently dyed B' strands complementary to B, or colloids coated with B' strands. Our techniques can be used to reversibly and/or permanently bind, via DNA linkers, an assortment of molecules, proteins and nanostructures. Potential applications range from advanced self-assembly, such as templated self-replication schemes recently reported, to designed physical and chemical patterns, to high-resolution multi-functional DNA surfaces for genetic detection or DNA computing.

  19. ModeRNA: a tool for comparative modeling of RNA 3D structure

    PubMed Central

    Rother, Magdalena; Rother, Kristian; Puton, Tomasz; Bujnicki, Janusz M.

    2011-01-01

    RNA is a large group of functionally important biomacromolecules. In striking analogy to proteins, the function of RNA depends on its structure and dynamics, which in turn is encoded in the linear sequence. However, while there are numerous methods for computational prediction of protein three-dimensional (3D) structure from sequence, with comparative modeling being the most reliable approach, there are very few such methods for RNA. Here, we present ModeRNA, a software tool for comparative modeling of RNA 3D structures. As an input, ModeRNA requires a 3D structure of a template RNA molecule, and a sequence alignment between the target to be modeled and the template. It must be emphasized that a good alignment is required for successful modeling, and for large and complex RNA molecules the development of a good alignment usually requires manual adjustments of the input data based on previous expertise of the respective RNA family. ModeRNA can model post-transcriptional modifications, a functionally important feature analogous to post-translational modifications in proteins. ModeRNA can also model DNA structures or use them as templates. It is equipped with many functions for merging fragments of different nucleic acid structures into a single model and analyzing their geometry. Windows and UNIX implementations of ModeRNA with comprehensive documentation and a tutorial are freely available. PMID:21300639

  20. Electrochemical label-free and sensitive nanobiosensing of DNA hybridization by graphene oxide modified pencil graphite electrode.

    PubMed

    Ahour, F; Shamsi, A

    2017-09-01

    Based on the strong interaction between single-stranded DNA (ss-DNA) and graphene material, we have constructed a novel label-free electrochemical biosensor for rapid and facile detection of short sequences ss-DNA molecules related to hepatitis C virus 1a using graphene oxide modified pencil graphite electrode. The sensing mechanism is based on the superior adsorption of single-stranded DNA to GO over double stranded DNA (ds-DNA). The intrinsic guanine oxidation signal measured by differential pulse voltammetry (DPV) has been used for duplex DNA formation detection. The probe ss-DNA adsorbs onto the surface of GO via the π- π* stacking interactions leading to a strong background guanine oxidation signal. In the presence of complementary target, formation of helix which has weak binding ability to GO induced ds-DNA to release from the electrode surface and significant variation in differential pulse voltammetric response of guanine bases. The results indicated that the oxidation peak current was proportional to the concentration of complementary strand in the range of 0.1 nM-0.5 μM with a detection limit of 4.3 × 10 -11  M. The simple fabricated electrochemical biosensor has high sensitivity, good selectivity, and could be applied as a new platform for a range of target molecules in future. Copyright © 2017 Elsevier Inc. All rights reserved.

  1. Programmable and highly resolved in vitro detection of 5-methylcytosine by TALEs.

    PubMed

    Kubik, Grzegorz; Schmidt, Moritz J; Penner, Johanna E; Summerer, Daniel

    2014-06-02

    Gene expression is extensively regulated by specific patterns of genomic 5-methylcytosine (mC), but the ability to directly detect this modification at user-defined genomic loci is limited. One reason is the lack of molecules that discriminate between mC and cytosine (C) and at the same time provide inherent, programmable sequence-selectivity. Programmable transcription-activator-like effectors (TALEs) have been observed to exhibit mC-sensitivity in vivo, but to only a limited extent in vitro. We report an mC-detection assay based on TALE control of DNA replication that displays unexpectedly strong mC-discrimination ability in vitro. The status and level of mC modification at single positions in oligonucleotides can be determined unambiguously by this assay, independently of the overall target sequence. Moreover, discrimination is reliably observed for positions bound by N-terminal and central regions of TALEs. This indicates the wide scope and robustness of the approach for highly resolved mC detection and enabled the detection of a single mC in a large, eukaryotic genome. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  2. Thrombin-Binding Aptamer Quadruplex Formation: AFM and Voltammetric Characterization

    PubMed Central

    Diculescu, Victor Constantin; Chiorcea-Paquim, Ana-Maria; Eritja, Ramon; Oliveira-Brett, Ana Maria

    2010-01-01

    The adsorption and the redox behaviour of thrombin-binding aptamer (TBA) and extended TBA (eTBA) were studied using atomic force microscopy and voltammetry at highly oriented pyrolytic graphite and glassy carbon. The different adsorption patterns and degree of surface coverage were correlated with the sequence base composition, presence/absence of K+, and voltammetric behaviour of TBA and eTBA. In the presence of K+, only a few single-stranded sequences present adsorption, while the majority of the molecules forms stable and rigid quadruplexes with no adsorption. Both TBA and eTBA are oxidized and the only anodic peak corresponds to guanine oxidation. Upon addition of K+ ions, TBA and eTBA fold into a quadruplex, causing the decrease of guanine oxidation peak and occurrence of a new peak at a higher potential due to the oxidation of G-quartets. The higher oxidation potential of G-quartets is due to the greater difficulty of electron transfer from the inside of the quadruplex to the electrode surface than electron transfer from the more flexible single strands. PMID:20798847

  3. Single-Molecule Spectroscopy and Imaging Over the Decades

    PubMed Central

    Moerner, W. E.; Shechtman, Yoav; Wang, Quan

    2016-01-01

    As of 2015, it has been 26 years since the first optical detection and spectroscopy of single molecules in condensed matter. This area of science has expanded far beyond the early low temperature studies in crystals to include single molecules in cells, polymers, and in solution. The early steps relied upon high-resolution spectroscopy of inhomogeneously broadened optical absorption profiles of molecular impurities in solids at low temperatures. Spectral fine structure arising directly from the position-dependent fluctuations of the number of molecules in resonance led to the attainment of the single-molecule limit in 1989 using frequency-modulation laser spectroscopy. In the early 1990's, a variety of fascinating physical effects were observed for individual molecules, including imaging of the light from single molecules as well as observations of spectral diffusion, optical switching and the ability to select different single molecules in the same focal volume simply by tuning the pumping laser frequency. In the room temperature regime, researchers showed that bursts of light from single molecules could be detected in solution, leading to imaging and microscopy by a variety of methods. Studies of single copies of the green fluorescent protein also uncovered surprises, especially the blinking and photoinduced recovery of emitters, which stimulated further development of photoswitchable fluorescent protein labels. All of these early steps provided important fundamentals underpinning the development of super-resolution microscopy based on single-molecule localization and active control of emitting concentration. Current thrust areas include extensions to three-dimensional imaging with high precision, orientational analysis of single molecules, and direct measurements of photodynamics and transport properties for single molecules trapped in solution by suppression of Brownian motion. Without question, a huge variety of studies of single molecules performed by many talented scientists all over the world have extended our knowledge of the nanoscale and microscopic mechanisms previously hidden by ensemble averaging. PMID:26616210

  4. Simulation studies of DNA at the nanoscale: Interactions with proteins, polycations, and surfaces

    NASA Astrophysics Data System (ADS)

    Elder, Robert M.

    Understanding the nanoscale interactions of DNA, a multifunctional biopolymer with sequence-dependent properties, with other biological and synthetic substrates and molecules is essential to advancing these technologies. This doctoral thesis research is aimed at understanding the thermodynamics and molecular-level structure when DNA interacts with proteins, polycations, and functionalized surfaces. First, we investigate the ability of a DNA damage recognition protein (HMGB1a) to bind to anti-cancer drug-induced DNA damage, seeking to explain how HMGB1a differentiates between the drugs in vivo. Using atomistic molecular dynamics simulations, we show that the structure of the drug-DNA molecule exhibits drug- and base sequence-dependence that explains some of the experimentally observed differential recognition of the drugs in various sequence contexts. Then, we show how steric hindrance from the drug decreases the deformability of the drug-DNA molecule, which decreases recognition by the protein, a concept that can be applied to rational drug design. Second, we study how polycation architecture and chemistry affect polycation-DNA binding so as to design optimal polycations for high efficiency gene (DNA) delivery. Using a multiscale computational approach involving atomistic and coarse-grained simulations, we examine how rearranging polylysine from a linear to a grafted architecture, and several aspects of the grafted architecture, affect polycation-DNA binding and the structure of polycation-DNA complexes. Next, going beyond lysine we examine how oligopeptide chemistry and sequence in the grafted architecture affects polycation-DNA binding and find that strategic placement of hydrophobic peptides might be used to tailor binding strength. Third, we study the adsorption and conformations of single-stranded DNA (an amphiphilic biopolymer) on model hydrophilic and hydrophobic surfaces. Short ssDNA oligomers adsorb to both surfaces with similar strength, with the strength of adsorption to the hydrophobic surface depending on the composition of the DNA strands, i.e. purine or pyrimidine bases. Additionally, DNA-surface and DNA-water interactions near the surfaces govern the adsorption. For longer ssDNA oligomers, the effects of surface chemistry and temperature on ssDNA conformations are rather small, but either the hydrophilic surface or increased temperature favor slightly more compact conformations due to energetic and entropic effects, respectively.

  5. DNABIT Compress - Genome compression algorithm.

    PubMed

    Rajarajeswari, Pothuraju; Apparao, Allam

    2011-01-22

    Data compression is concerned with how information is organized in data. Efficient storage means removal of redundancy from the data being stored in the DNA molecule. Data compression algorithms remove redundancy and are used to understand biologically important molecules. We present a compression algorithm, "DNABIT Compress" for DNA sequences based on a novel algorithm of assigning binary bits for smaller segments of DNA bases to compress both repetitive and non repetitive DNA sequence. Our proposed algorithm achieves the best compression ratio for DNA sequences for larger genome. Significantly better compression results show that "DNABIT Compress" algorithm is the best among the remaining compression algorithms. While achieving the best compression ratios for DNA sequences (Genomes),our new DNABIT Compress algorithm significantly improves the running time of all previous DNA compression programs. Assigning binary bits (Unique BIT CODE) for (Exact Repeats, Reverse Repeats) fragments of DNA sequence is also a unique concept introduced in this algorithm for the first time in DNA compression. This proposed new algorithm could achieve the best compression ratio as much as 1.58 bits/bases where the existing best methods could not achieve a ratio less than 1.72 bits/bases.

  6. Rapid PCR-mediated synthesis of competitor molecules for accurate quantification of beta(2) GABA(A) receptor subunit mRNA.

    PubMed

    Vela, J; Vitorica, J; Ruano, D

    2001-12-01

    We describe a fast and easy method for the synthesis of competitor molecules based on non-specific conditions of PCR. RT-competitive PCR is a sensitive technique that allows quantification of very low quantities of mRNA molecules in small tissue samples. This technique is based on the competition established between the native and standard templates for nucleotides, primers or other factors during PCR. Thus, the most critical parameter is the use of good internal standards to generate a standard curve from which the amount of native sequences can be properly estimated. At the present time different types of internal standards and methods for their synthesis have been described. Normally, most of these methods are time-consuming and require the use of different sets of primers, different rounds of PCR or specific modifications, such as site-directed mutagenesis, that need subsequent analysis of the PCR products. Using our method, we obtained in a single round of PCR and with the same primer pair, competitor molecules that were successfully used in RT-competitive PCR experiments. The principal advantage of this method is high versatility and economy. Theoretically it is possible to synthesize a specific competitor molecule for each primer pair used. Finally, using this method we have been able to quantify the increase in the expression of the beta(2) GABA(A) receptor subunit mRNA that occurs during rat hippocampus development.

  7. A clone-free, single molecule map of the domestic cow (Bos taurus) genome.

    PubMed

    Zhou, Shiguo; Goldstein, Steve; Place, Michael; Bechner, Michael; Patino, Diego; Potamousis, Konstantinos; Ravindran, Prabu; Pape, Louise; Rincon, Gonzalo; Hernandez-Ortiz, Juan; Medrano, Juan F; Schwartz, David C

    2015-08-28

    The cattle (Bos taurus) genome was originally selected for sequencing due to its economic importance and unique biology as a model organism for understanding other ruminants, or mammals. Currently, there are two cattle genome sequence assemblies (UMD3.1 and Btau4.6) from groups using dissimilar assembly algorithms, which were complemented by genetic and physical map resources. However, past comparisons between these assemblies revealed substantial differences. Consequently, such discordances have engendered ambiguities when using reference sequence data, impacting genomic studies in cattle and motivating construction of a new optical map resource--BtOM1.0--to guide comparisons and improvements to the current sequence builds. Accordingly, our comprehensive comparisons of BtOM1.0 against the UMD3.1 and Btau4.6 sequence builds tabulate large-to-immediate scale discordances requiring mediation. The optical map, BtOM1.0, spanning the B. taurus genome (Hereford breed, L1 Dominette 01449) was assembled from an optical map dataset consisting of 2,973,315 (439 X; raw dataset size before assembly) single molecule optical maps (Rmaps; 1 Rmap = 1 restriction mapped DNA molecule) generated by the Optical Mapping System. The BamHI map spans 2,575.30 Mb and comprises 78 optical contigs assembled by a combination of iterative (using the reference sequence: UMD3.1) and de novo assembly techniques. BtOM1.0 is a high-resolution physical map featuring an average restriction fragment size of 8.91 Kb. Comparisons of BtOM1.0 vs. UMD3.1, or Btau4.6, revealed that Btau4.6 presented far more discordances (7,463) vs. UMD3.1 (4,754). Overall, we found that Btau4.6 presented almost double the number of discordances than UMD3.1 across most of the 6 categories of sequence vs. map discrepancies, which are: COMPLEX (misassembly), DELs (extraneous sequences), INSs (missing sequences), ITs (Inverted/Translocated sequences), ECs (extra restriction cuts) and MCs (missing restriction cuts). Alignments of UMD3.1 and Btau4.6 to BtOM1.0 reveal discordances commensurate with previous reports, and affirm the NCBI's current designation of UMD3.1 sequence assembly as the "reference assembly" and the Btau4.6 as the "alternate assembly." The cattle genome optical map, BtOM1.0, when used as a comprehensive and largely independent guide, will greatly assist improvements to existing sequence builds, and later serve as an accurate physical scaffold for studies concerning the comparative genomics of cattle breeds.

  8. Identification of conformational epitopes for human IgG on Chemotaxis inhibitory protein of Staphylococcus aureus

    PubMed Central

    Gustafsson, Erika; Haas, Pieter-Jan; Walse, Björn; Hijnen, Marcel; Furebring, Christina; Ohlin, Mats; van Strijp, Jos AG; van Kessel, Kok PM

    2009-01-01

    Background The Chemotaxis inhibitory protein of Staphylococcus aureus (CHIPS) blocks the Complement fragment C5a receptor (C5aR) and formylated peptide receptor (FPR) and is thereby a potent inhibitor of neutrophil chemotaxis and activation of inflammatory responses. The majority of the healthy human population has antibodies against CHIPS that have been shown to interfere with its function in vitro. The aim of this study was to define potential epitopes for human antibodies on the CHIPS surface. We also initiate the process to identify a mutated CHIPS molecule that is not efficiently recognized by preformed anti-CHIPS antibodies and retains anti-inflammatory activity. Results In this paper, we panned peptide displaying phage libraries against a pool of CHIPS specific affinity-purified polyclonal human IgG. The selected peptides could be divided into two groups of sequences. The first group was the most dominant with 36 of the 48 sequenced clones represented. Binding to human affinity-purified IgG was verified by ELISA for a selection of peptide sequences in phage format. For further analysis, one peptide was chemically synthesized and antibodies affinity-purified on this peptide were found to bind the CHIPS molecule as studied by ELISA and Surface Plasmon Resonance. Furthermore, seven potential conformational epitopes responsible for antibody recognition were identified by mapping phage selected peptide sequences on the CHIPS surface as defined in the NMR structure of the recombinant CHIPS31–121 protein. Mapped epitopes were verified by in vitro mutational analysis of the CHIPS molecule. Single mutations introduced in the proposed antibody epitopes were shown to decrease antibody binding to CHIPS. The biological function in terms of C5aR signaling was studied by flow cytometry. A few mutations were shown to affect this biological function as well as the antibody binding. Conclusion Conformational epitopes recognized by human antibodies have been mapped on the CHIPS surface and amino acid residues involved in both antibody and C5aR interaction could be defined. This information has implications for the development of an effective anti-inflammatory agent based on a functional CHIPS molecule with low interaction with human IgG. PMID:19284584

  9. Single Molecule Sensing by Nanopores and Nanopore Devices

    PubMed Central

    Gu, Li-Qun; Shim, Ji Wook

    2010-01-01

    Molecular-scale pore structures, called nanopores, can be assembled by protein ion channels through genetic engineering or be artificially fabricated on solid substrates using fashion nanotechnology. When target molecules interact with the functionalized lumen of a nanopore, they characteristically block the ion pathway. The resulting conductance changes allow for identification of single molecules and quantification of target species in the mixture. In this review, we first overview nanopore-based sensory techniques that have been created for the detection of myriad biomedical targets, from metal ions, drug compounds, and cellular second messengers to proteins and DNA. Then we introduce our recent discoveries in nanopore single molecule detection: (1) using the protein nanopore to study folding/unfolding of the G-quadruplex aptamer; (2) creating a portable and durable biochip that is integrated with a single-protein pore sensor (this chip is compared with recently developed protein pore sensors based on stabilized bilayers on glass nanopore membranes and droplet interface bilayer); and (3) creating a glass nanopore-terminated probe for single-molecule DNA detection, chiral enantiomer discrimination, and identification of the bioterrorist agent ricin with an aptamer-encoded nanopore. PMID:20174694

  10. Electrospray-assisted laser desorption/ionization and tandem mass spectrometry of peptides and proteins.

    PubMed

    Peng, Ivory X; Shiea, Jentaie; Ogorzalek Loo, Rachel R; Loo, Joseph A

    2007-01-01

    We have constructed an electrospray-assisted laser desorption/ionization (ELDI) source which utilizes a nitrogen laser pulse to desorb intact molecules from matrix-containing sample solution droplets, followed by electrospray ionization (ESI) post-ionization. The ELDI source is coupled to a quadrupole ion trap mass spectrometer and allows sampling under ambient conditions. Preliminary data showed that ELDI produces ESI-like multiply charged peptides and proteins up to 29 kDa carbonic anhydrase and 66 kDa bovine albumin from single-protein solutions, as well as from complex digest mixtures. The generated multiply charged polypeptides enable efficient tandem mass spectrometric (MS/MS)-based peptide sequencing. ELDI-MS/MS of protein digests and small intact proteins was performed both by collisionally activated dissociation (CAD) and by nozzle-skimmer dissociation (NSD). ELDI-MS/MS may be a useful tool for protein sequencing analysis and top-down proteomics study, and may complement matrix-assisted laser desorption/ionization (MALDI)-based measurements. Copyright (c) 2007 John Wiley & Sons, Ltd.

  11. Separation and counting of single molecules through nanofluidics, programmable electrophoresis, and nanoelectrode-gated tunneling and dielectric detection

    DOEpatents

    Lee, James W.; Thundat, Thomas G.

    2006-04-25

    An apparatus for carrying out the separation, detection, and/or counting of single molecules at nanometer scale. Molecular separation is achieved by driving single molecules through a microfluidic or nanofluidic medium using programmable and coordinated electric fields. In various embodiments, the fluidic medium is a strip of hydrophilic material on nonconductive hydrophobic surface, a trough produced by parallel strips of hydrophobic nonconductive material on a hydrophilic base, or a covered passageway produced by parallel strips of hydrophobic nonconductive material on a hydrophilic base together with a nonconductive cover on the parallel strips of hydrophobic nonconductive material. The molecules are detected and counted using nanoelectrode-gated electron tunneling methods, dielectric monitoring, and other methods.

  12. Single-molecule detection of proteins with antigen-antibody interaction using resistive-pulse sensing of submicron latex particles

    NASA Astrophysics Data System (ADS)

    Takakura, T.; Yanagi, I.; Goto, Y.; Ishige, Y.; Kohara, Y.

    2016-03-01

    We developed a resistive-pulse sensor with a solid-state pore and measured the latex agglutination of submicron particles induced by antigen-antibody interaction for single-molecule detection of proteins. We fabricated the pore based on numerical simulation to clearly distinguish between monomer and dimer latex particles. By measuring single dimers agglutinated in the single-molecule regime, we detected single human alpha-fetoprotein molecules. Adjusting the initial particle concentration improves the limit of detection (LOD) to 95 fmol/l. We established a theoretical model of the LOD by combining the reaction kinetics and the counting statistics to explain the effect of initial particle concentration on the LOD. The theoretical model shows how to improve the LOD quantitatively. The single-molecule detection studied here indicates the feasibility of implementing a highly sensitive immunoassay by a simple measurement method using resistive-pulse sensing.

  13. Nano-fabrication of molecular electronic junctions by targeted modification of metal-molecule bonds

    PubMed Central

    Jafri, S. Hassan M.; Löfås, Henrik; Blom, Tobias; Wallner, Andreas; Grigoriev, Anton; Ahuja, Rajeev; Ottosson, Henrik; Leifer, Klaus

    2015-01-01

    Reproducibility, stability and the coupling between electrical and molecular properties are central challenges in the field of molecular electronics. The field not only needs devices that fulfill these criteria but they also need to be up-scalable to application size. In this work, few-molecule based electronics devices with reproducible electrical characteristics are demonstrated. Our previously reported 5 nm gold nanoparticles (AuNP) coated with ω-triphenylmethyl (trityl) protected 1,8-octanedithiol molecules are trapped in between sub-20 nm gap spacing gold nanoelectrodes forming AuNP-molecule network. When the trityl groups are removed, reproducible devices and stable Au-thiol junctions are established on both ends of the alkane segment. The resistance of more than 50 devices is reduced by orders of magnitude as well as a reduction of the spread in the resistance histogram is observed. By density functional theory calculations the orders of magnitude decrease in resistance can be explained and supported by TEM observations thus indicating that the resistance changes and strongly improved resistance spread are related to the establishment of reproducible and stable metal-molecule bonds. The same experimental sequence is carried out using 1,6-hexanedithiol functionalized AuNPs. The average resistances as a function of molecular length, demonstrated herein, are comparable to the one found in single molecule devices. PMID:26395225

  14. Independent assessment and improvement of wheat genome sequence assemblies using Fosill jumping libraries.

    PubMed

    Lu, Fu-Hao; McKenzie, Neil; Kettleborough, George; Heavens, Darren; Clark, Matthew D; Bevan, Michael W

    2018-05-01

    The accurate sequencing and assembly of very large, often polyploid, genomes remains a challenging task, limiting long-range sequence information and phased sequence variation for applications such as plant breeding. The 15-Gb hexaploid bread wheat (Triticum aestivum) genome has been particularly challenging to sequence, and several different approaches have recently generated long-range assemblies. Mapping and understanding the types of assembly errors are important for optimising future sequencing and assembly approaches and for comparative genomics. Here we use a Fosill 38-kb jumping library to assess medium and longer-range order of different publicly available wheat genome assemblies. Modifications to the Fosill protocol generated longer Illumina sequences and enabled comprehensive genome coverage. Analyses of two independent Bacterial Artificial Chromosome (BAC)-based chromosome-scale assemblies, two independent Illumina whole genome shotgun assemblies, and a hybrid Single Molecule Real Time (SMRT-PacBio) and short read (Illumina) assembly were carried out. We revealed a surprising scale and variety of discrepancies using Fosill mate-pair mapping and validated several of each class. In addition, Fosill mate-pairs were used to scaffold a whole genome Illumina assembly, leading to a 3-fold increase in N50 values. Our analyses, using an independent means to validate different wheat genome assemblies, show that whole genome shotgun assemblies based solely on Illumina sequences are significantly more accurate by all measures compared to BAC-based chromosome-scale assemblies and hybrid SMRT-Illumina approaches. Although current whole genome assemblies are reasonably accurate and useful, additional improvements will be needed to generate complete assemblies of wheat genomes using open-source, computationally efficient, and cost-effective methods.

  15. Use of amplicon sequencing to improve sensitivity in PCR-based detection of microbial pathogen in environmental samples.

    PubMed

    Saingam, Prakit; Li, Bo; Yan, Tao

    2018-06-01

    DNA-based molecular detection of microbial pathogens in complex environments is still plagued by sensitivity, specificity and robustness issues. We propose to address these issues by viewing them as inadvertent consequences of requiring specific and adequate amplification (SAA) of target DNA molecules by current PCR methods. Using the invA gene of Salmonella as the model system, we investigated if next generation sequencing (NGS) can be used to directly detect target sequences in false-negative PCR reaction (PCR-NGS) in order to remove the SAA requirement from PCR. False-negative PCR and qPCR reactions were first created using serial dilutions of laboratory-prepared Salmonella genomic DNA and then analyzed directly by NGS. Target invA sequences were detected in all false-negative PCR and qPCR reactions, which lowered the method detection limits near the theoretical minimum of single gene copy detection. The capability of the PCR-NGS approach in correcting false negativity was further tested and confirmed under more environmentally relevant conditions using Salmonella-spiked stream water and sediment samples. Finally, the PCR-NGS approach was applied to ten urban stream water samples and detected invA sequences in eight samples that would be otherwise deemed Salmonella negative. Analysis of the non-target sequences in the false-negative reactions helped to identify primer dime-like short sequences as the main cause of the false negativity. Together, the results demonstrated that the PCR-NGS approach can significantly improve method sensitivity, correct false-negative detections, and enable sequence-based analysis for failure diagnostics in complex environmental samples. Copyright © 2018 Elsevier B.V. All rights reserved.

  16. Self-sequencing of amino acids and origins of polyfunctional protocells

    NASA Technical Reports Server (NTRS)

    Fox, S. W.

    1984-01-01

    The role of proteins in the origin of living things is discussed. It has been experimentally established that amino acids can sequence themselves under simulated geological conditions with highly nonrandom products which accordingly contain diverse information. Multiple copies of each type of macromolecule are formed, resulting in greater power for any protoenzymic molecule than would accrue from a single copy of each type. Thermal proteins are readily incorporated into laboratory protocells. The experimental evidence for original polyfunctional protocells is discussed.

  17. Direct Single-Molecule Observation of Mode and Geometry of RecA-Mediated Homology Search.

    PubMed

    Lee, Andrew J; Endo, Masayuki; Hobbs, Jamie K; Wälti, Christoph

    2018-01-23

    Genomic integrity, when compromised by accrued DNA lesions, is maintained through efficient repair via homologous recombination. For this process the ubiquitous recombinase A (RecA), and its homologues such as the human Rad51, are of central importance, able to align and exchange homologous sequences within single-stranded and double-stranded DNA in order to swap out defective regions. Here, we directly observe the widely debated mechanism of RecA homology searching at a single-molecule level using high-speed atomic force microscopy (HS-AFM) in combination with tailored DNA origami frames to present the reaction targets in a way suitable for AFM-imaging. We show that RecA nucleoprotein filaments move along DNA substrates via short-distance facilitated diffusions, or slides, interspersed with longer-distance random moves, or hops. Importantly, from the specific interaction geometry, we find that the double-stranded substrate DNA resides in the secondary DNA binding-site within the RecA nucleoprotein filament helical groove during the homology search. This work demonstrates that tailored DNA origami, in conjunction with HS-AFM, can be employed to reveal directly conformational and geometrical information on dynamic protein-DNA interactions which was previously inaccessible at an individual single-molecule level.

  18. Structure-Based Phylogenetic Analysis of the Lipocalin Superfamily.

    PubMed

    Lakshmi, Balasubramanian; Mishra, Madhulika; Srinivasan, Narayanaswamy; Archunan, Govindaraju

    2015-01-01

    Lipocalins constitute a superfamily of extracellular proteins that are found in all three kingdoms of life. Although very divergent in their sequences and functions, they show remarkable similarity in 3-D structures. Lipocalins bind and transport small hydrophobic molecules. Earlier sequence-based phylogenetic studies of lipocalins highlighted that they have a long evolutionary history. However the molecular and structural basis of their functional diversity is not completely understood. The main objective of the present study is to understand functional diversity of the lipocalins using a structure-based phylogenetic approach. The present study with 39 protein domains from the lipocalin superfamily suggests that the clusters of lipocalins obtained by structure-based phylogeny correspond well with the functional diversity. The detailed analysis on each of the clusters and sub-clusters reveals that the 39 lipocalin domains cluster based on their mode of ligand binding though the clustering was performed on the basis of gross domain structure. The outliers in the phylogenetic tree are often from single member families. Also structure-based phylogenetic approach has provided pointers to assign putative function for the domains of unknown function in lipocalin family. The approach employed in the present study can be used in the future for the functional identification of new lipocalin proteins and may be extended to other protein families where members show poor sequence similarity but high structural similarity.

  19. Universal Readers Based on Hydrogen Bonding or π-π Stacking for Identification of DNA Nucleotides in Electron Tunnel Junctions.

    PubMed

    Biswas, Sovan; Sen, Suman; Im, JongOne; Biswas, Sudipta; Krstic, Predrag; Ashcroft, Brian; Borges, Chad; Zhao, Yanan; Lindsay, Stuart; Zhang, Peiming

    2016-12-27

    A reader molecule, which recognizes all the naturally occurring nucleobases in an electron tunnel junction, is required for sequencing DNA by a recognition tunneling (RT) technique, referred to as a universal reader. In the present study, we have designed a series of heterocyclic carboxamides based on hydrogen bonding and a large-sized pyrene ring based on a π-π stacking interaction as universal reader candidates. Each of these compounds was synthesized to bear a thiolated linker for attachment to metal electrodes and examined for their interactions with naturally occurring DNA nucleosides and nucleotides by 1 H NMR, ESI-MS, computational calculations, and surface plasmon resonance. RT measurements were carried out in a scanning tunnel microscope. All of these molecules generated electrical signals with DNA nucleotides in tunneling junctions under physiological conditions (phosphate buffered aqueous solution, pH 7.4). Using a support vector machine as a tool for data analysis, we found that these candidates distinguished among naturally occurring DNA nucleotides with the accuracy of pyrene (by π-π stacking interactions) > azole carboxamides (by hydrogen-bonding interactions). In addition, the pyrene reader operated efficiently in a larger tunnel junction. However, the azole carboxamide could read abasic (AP) monophosphate, a product from spontaneous base hydrolysis or an intermediate of base excision repair. Thus, we envision that sequencing DNA using both π-π stacking and hydrogen-bonding-based universal readers in parallel should generate more comprehensive genome sequences than sequencing based on either reader molecule alone.

  20. Hybridization chain reaction-based colorimetric aptasensor of adenosine 5'-triphosphate on unmodified gold nanoparticles and two label-free hairpin probes.

    PubMed

    Gao, Zhuangqiang; Qiu, Zhenli; Lu, Minghua; Shu, Jian; Tang, Dianping

    2017-03-15

    This work designs a new label-free aptasensor for the colorimetric determination of small molecules (adenosine 5'-triphosphate, ATP) by using visible gold nanoparticles as the signal-generation tags, based on target-triggered hybridization chain reaction (HCR) between two hairpin DNA probes. The assay is carried out referring to the change in the color/absorbance by salt-induced aggregation of gold nanoparticles after the interaction with hairpins, gold nanoparticles and ATP. To construct such an assay system, two hairpin DNA probes with a short single-stranded DNA at the sticky end are utilized for interaction with gold nanoparticles. In the absence of target ATP, the hairpin DNA probes can prevent gold nanoparticles from the salt-induced aggregation through the interaction of the single-stranded DNA at the sticky end with gold nanoparticles. Upon target ATP introduction, the aptamer-based hairpin probe is opened to expose a new sticky end for the strand-displacement reaction with another complementary hairpin, thus resulting in the decreasing single-stranded DNA because of the consumption of hairpins. In this case, gold nanoparticles are uncovered owing to the formation of double-stranded DNA, which causes their aggregation upon addition of the salt, thereby leading to the change in the red-to-blue color. Under the optimal conditions, the HCR-based colorimetric assay presents good visible color or absorbance responses for the determination of target ATP at a concentration as low as 1.0nM. Importantly, the methodology can be further extended to quantitatively or qualitatively monitor other small molecules or biotoxins by changing the sequence of the corresponding aptamer. Copyright © 2016 Elsevier B.V. All rights reserved.

  1. In Vitro Selection for Small-Molecule-Triggered Strand Displacement and Riboswitch Activity.

    PubMed

    Martini, Laura; Meyer, Adam J; Ellefson, Jared W; Milligan, John N; Forlin, Michele; Ellington, Andrew D; Mansy, Sheref S

    2015-10-16

    An in vitro selection method for ligand-responsive RNA sensors was developed that exploited strand displacement reactions. The RNA library was based on the thiamine pyrophosphate (TPP) riboswitch, and RNA sequences capable of hybridizing to a target duplex DNA in a TPP regulated manner were identified. After three rounds of selection, RNA molecules that mediated a strand exchange reaction upon TPP binding were enriched. The enriched sequences also showed riboswitch activity. Our results demonstrated that small-molecule-responsive nucleic acid sensors can be selected to control the activity of target nucleic acid circuitry.

  2. Bleaching/blinking assisted localization microscopy for superresolution imaging using standard fluorescent molecules.

    PubMed

    Burnette, Dylan T; Sengupta, Prabuddha; Dai, Yuhai; Lippincott-Schwartz, Jennifer; Kachar, Bechara

    2011-12-27

    Superresolution imaging techniques based on the precise localization of single molecules, such as photoactivated localization microscopy (PALM) and stochastic optical reconstruction microscopy (STORM), achieve high resolution by fitting images of single fluorescent molecules with a theoretical Gaussian to localize them with a precision on the order of tens of nanometers. PALM/STORM rely on photoactivated proteins or photoswitching dyes, respectively, which makes them technically challenging. We present a simple and practical way of producing point localization-based superresolution images that does not require photoactivatable or photoswitching probes. Called bleaching/blinking assisted localization microscopy (BaLM), the technique relies on the intrinsic bleaching and blinking behaviors characteristic of all commonly used fluorescent probes. To detect single fluorophores, we simply acquire a stream of fluorescence images. Fluorophore bleach or blink-off events are detected by subtracting from each image of the series the subsequent image. Similarly, blink-on events are detected by subtracting from each frame the previous one. After image subtractions, fluorescence emission signals from single fluorophores are identified and the localizations are determined by fitting the fluorescence intensity distribution with a theoretical Gaussian. We also show that BaLM works with a spectrum of fluorescent molecules in the same sample. Thus, BaLM extends single molecule-based superresolution localization to samples labeled with multiple conventional fluorescent probes.

  3. Detecting a single molecule using a micropore-nanopore hybrid chip

    PubMed Central

    2013-01-01

    Nanopore-based DNA sequencing and biomolecule sensing have attracted more and more attention. In this work, novel sensing devices were built on the basis of the chips containing nanopore arrays in polycarbonate (PC) membranes and micropores in Si3N4 films. Using the integrated chips, the transmembrane ionic current induced by biomolecule's translocation was recorded and analyzed, which suggested that the detected current did not change linearly as commonly expected with increasing biomolecule concentration. On the other hand, detailed translocation information (such as translocation gesture) was also extracted from the discrete current blockages in basic current curves. These results indicated that the nanofluidic device based on the chips integrated by micropores and nanopores possessed comparative potentials in biomolecule sensing. PMID:24261484

  4. Detecting a single molecule using a micropore-nanopore hybrid chip.

    PubMed

    Liu, Lei; Zhu, Lizhong; Ni, Zhonghua; Chen, Yunfei

    2013-11-21

    Nanopore-based DNA sequencing and biomolecule sensing have attracted more and more attention. In this work, novel sensing devices were built on the basis of the chips containing nanopore arrays in polycarbonate (PC) membranes and micropores in Si3N4 films. Using the integrated chips, the transmembrane ionic current induced by biomolecule's translocation was recorded and analyzed, which suggested that the detected current did not change linearly as commonly expected with increasing biomolecule concentration. On the other hand, detailed translocation information (such as translocation gesture) was also extracted from the discrete current blockages in basic current curves. These results indicated that the nanofluidic device based on the chips integrated by micropores and nanopores possessed comparative potentials in biomolecule sensing.

  5. High-resolution phylogenetic microbial community profiling

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Singer, Esther; Coleman-Derr, Devin; Bowman, Brett

    2014-03-17

    The representation of bacterial and archaeal genome sequences is strongly biased towards cultivated organisms, which belong to merely four phylogenetic groups. Functional information and inter-phylum level relationships are still largely underexplored for candidate phyla, which are often referred to as microbial dark matter. Furthermore, a large portion of the 16S rRNA gene records in the GenBank database are labeled as environmental samples and unclassified, which is in part due to low read accuracy, potential chimeric sequences produced during PCR amplifications and the low resolution of short amplicons. In order to improve the phylogenetic classification of novel species and advance ourmore » knowledge of the ecosystem function of uncultivated microorganisms, high-throughput full length 16S rRNA gene sequencing methodologies with reduced biases are needed. We evaluated the performance of PacBio single-molecule real-time (SMRT) sequencing in high-resolution phylogenetic microbial community profiling. For this purpose, we compared PacBio and Illumina metagenomic shotgun and 16S rRNA gene sequencing of a mock community as well as of an environmental sample from Sakinaw Lake, British Columbia. Sakinaw Lake is known to contain a large age of microbial species from candidate phyla. Sequencing results show that community structure based on PacBio shotgun and 16S rRNA gene sequences is highly similar in both the mock and the environmental communities. Resolution power and community representation accuracy from SMRT sequencing data appeared to be independent of GC content of microbial genomes and was higher when compared to Illumina-based metagenome shotgun and 16S rRNA gene (iTag) sequences, e.g. full-length sequencing resolved all 23 OTUs in the mock community, while iTags did not resolve closely related species. SMRT sequencing hence offers various potential benefits when characterizing uncharted microbial communities.« less

  6. Two dimensional molecular electronics spectroscopy for molecular fingerprinting, DNA sequencing, and cancerous DNA recognition.

    PubMed

    Rajan, Arunkumar Chitteth; Rezapour, Mohammad Reza; Yun, Jeonghun; Cho, Yeonchoo; Cho, Woo Jong; Min, Seung Kyu; Lee, Geunsik; Kim, Kwang S

    2014-02-25

    Laser-driven molecular spectroscopy of low spatial resolution is widely used, while electronic current-driven molecular spectroscopy of atomic scale resolution has been limited because currents provide only minimal information. However, electron transmission of a graphene nanoribbon on which a molecule is adsorbed shows molecular fingerprints of Fano resonances, i.e., characteristic features of frontier orbitals and conformations of physisorbed molecules. Utilizing these resonance profiles, here we demonstrate two-dimensional molecular electronics spectroscopy (2D MES). The differential conductance with respect to bias and gate voltages not only distinguishes different types of nucleobases for DNA sequencing but also recognizes methylated nucleobases which could be related to cancerous cell growth. This 2D MES could open an exciting field to recognize single molecule signatures at atomic resolution. The advantages of the 2D MES over the one-dimensional (1D) current analysis can be comparable to those of 2D NMR over 1D NMR analysis.

  7. LongISLND: in silico sequencing of lengthy and noisy datatypes.

    PubMed

    Lau, Bayo; Mohiyuddin, Marghoob; Mu, John C; Fang, Li Tai; Bani Asadi, Narges; Dallett, Carolina; Lam, Hugo Y K

    2016-12-15

    LongISLND is a software package designed to simulate sequencing data according to the characteristics of third generation, single-molecule sequencing technologies. The general software architecture is easily extendable, as demonstrated by the emulation of Pacific Biosciences (PacBio) multi-pass sequencing with P5 and P6 chemistries, producing data in FASTQ, H5, and the latest PacBio BAM format. We demonstrate its utility by downstream processing with consensus building and variant calling. LongISLND is implemented in Java and available at http://bioinform.github.io/longislnd CONTACT: hugo.lam@roche.comSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  8. Robust nonparametric quantification of clustering density of molecules in single-molecule localization microscopy

    PubMed Central

    Jiang, Shenghang; Park, Seongjin; Challapalli, Sai Divya; Fei, Jingyi; Wang, Yong

    2017-01-01

    We report a robust nonparametric descriptor, J′(r), for quantifying the density of clustering molecules in single-molecule localization microscopy. J′(r), based on nearest neighbor distribution functions, does not require any parameter as an input for analyzing point patterns. We show that J′(r) displays a valley shape in the presence of clusters of molecules, and the characteristics of the valley reliably report the clustering features in the data. Most importantly, the position of the J′(r) valley (rJm′) depends exclusively on the density of clustering molecules (ρc). Therefore, it is ideal for direct estimation of the clustering density of molecules in single-molecule localization microscopy. As an example, this descriptor was applied to estimate the clustering density of ptsG mRNA in E. coli bacteria. PMID:28636661

  9. Precise Quantitation of MicroRNA in a Single Cell with Droplet Digital PCR Based on Ligation Reaction.

    PubMed

    Tian, Hui; Sun, Yuanyuan; Liu, Chenghui; Duan, Xinrui; Tang, Wei; Li, Zhengping

    2016-12-06

    MicroRNA (miRNA) analysis in a single cell is extremely important because it allows deep understanding of the exact correlation between the miRNAs and cell functions. Herein, we wish to report a highly sensitive and precisely quantitative assay for miRNA detection based on ligation-based droplet digital polymerase chain reaction (ddPCR), which permits the quantitation of miRNA in a single cell. In this ligation-based ddPCR assay, two target-specific oligonucleotide probes can be simply designed to be complementary to the half-sequence of the target miRNA, respectively, which avoids the sophisticated design of reverse transcription and provides high specificity to discriminate a single-base difference among miRNAs with simple operations. After the miRNA-templated ligation, the ddPCR partitions individual ligated products into a water-in-oil droplet and digitally counts the fluorescence-positive and negative droplets after PCR amplification for quantification of the target molecules, which possesses the power of precise quantitation and robustness to variation in PCR efficiency. By integrating the advantages of the precise quantification of ddPCR and the simplicity of the ligation-based PCR, the proposed method can sensitively measure let-7a miRNA with a detection limit of 20 aM (12 copies per microliter), and even a single-base difference can be discriminated in let-7 family members. More importantly, due to its high selectivity and sensitivity, the proposed method can achieve precise quantitation of miRNAs in single-cell lysate. Therefore, the ligation-based ddPCR assay may serve as a useful tool to exactly reveal the miRNAs' actions in a single cell, which is of great importance for the study of miRNAs' biofunction as well as for the related biomedical studies.

  10. Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity

    USDA-ARS?s Scientific Manuscript database

    Although draft genomes are available for most agronomically important plant species, the majority are incomplete, highly fragmented, and often riddled with assembly and scaffolding errors. These assembly issues hinder advances in tool development for functional genomics and systems biology. Here we ...

  11. Genes and Vocal Learning

    ERIC Educational Resources Information Center

    White, Stephanie A.

    2010-01-01

    Could a mutation in a single gene be the evolutionary lynchpin supporting the development of human language? A rare mutation in the molecule known as FOXP2 discovered in a human family seemed to suggest so, and its sequence phylogeny reinforced a Chomskian view that language emerged wholesale in humans. Spurred by this discovery, research in…

  12. n-CoDeR concept: unique types of antibodies for diagnostic use and therapy.

    PubMed

    Carlsson, R; Söderlind, E

    2001-05-01

    The n-CoDeR recombinant antibody gene libraries are built on a single master framework, into which diverse in vivo-formed complementarity determining regions (CDRs) are allowed to recombine. These CDRs are sampled from in vivo-processed and proof-read gene sequences, thus ensuring an optimal level of correctly folded and functional molecules. By the modularized assembly process, up to six CDRs can be varied at the same time, providing a possibility for the creation of a hitherto undescribed genetic and functional variation. The n-CoDeR antibody gene libraries can be used to select highly specific, human antibody fragments with specificities to virtually any antigen, including carbohydrates and human self-proteins and with affinities down into the subnanomolar range. Furthermore, combining CDRs sampled from in vivo-processed sequences into a single framework result in molecules exhibiting a lower immunogenicity compared to normal human immunoglobulins, as determined by computer analyses. The distinguished features of the n-CoDeR libraries in the therapeutic and diagnostic areas are discussed.

  13. DNA - peptide polyelectrolyte complexes: Phase control by hybridization

    NASA Astrophysics Data System (ADS)

    Vieregg, Jeffrey; Lueckheide, Michael; Marciel, Amanda; Leon, Lorraine; Tirrell, Matthew

    DNA is one of the most highly-charged molecules known, and interacts strongly with charged molecules in the cell. Condensation of long double-stranded DNA is one of the classic problems of biophysics, but the polyelectrolyte behavior of short and/or single-stranded nucleic acids has attracted far less study despite its importance for both biological and engineered systems. We report here studies of DNA oligonucleotides complexed with cationic peptides and polyamines. As seen previously for longer sequences, double-stranded oligonucleotides form solid precipitates, but single-stranded oligonucleotides instead undergo liquid-liquid phase separation to form coacervate droplets. Complexed oligonucleotides remain competent for hybridization, and display sequence-dependent environmental response. We observe similar behavior for RNA oligonucleotides, and methylphosphonate substitution of the DNA backbone indicates that nucleic acid charge density controls whether liquid or solid complexes are formed. Liquid-liquid phase separations of this type have been implicated in formation of membraneless organelles in vivo, and have been suggested as protocells in early life scenarios; oligonucleotides offer an excellent method to probe the physics controlling these phenomena.

  14. Discrimination of Single Base Pair Differences Among Individual DNA Molecules Using a Nanopore

    NASA Technical Reports Server (NTRS)

    Vercoutere, Wenonah; DeGuzman, Veronica

    2003-01-01

    The protein toxin alpha-hemolysin form nanometer scale channels across lipid membranes. Our lab uses a single channel in an artificial lipid bilayer in a patch clamp device to capture and examine individual DNA molecules. This nanopore detector used with a support vector machine (SVM) can analyze DNA hairpin molecules on the millisecond time scale. We distinguish duplex stem length, base pair mismatches, loop length, and single base pair differences. The residual current fluxes also reveal structural molecular dynamics elements. DNA end-fraying (terminal base pair dissociation) can be observed as near full blockades, or spikes, in current. This technique can be used to investigate other biological processes dependent on DNA end-fraying, such as the processing of HIV DNA by HIV integrase.

  15. Next-Generation Sequencing Platforms

    NASA Astrophysics Data System (ADS)

    Mardis, Elaine R.

    2013-06-01

    Automated DNA sequencing instruments embody an elegant interplay among chemistry, engineering, software, and molecular biology and have built upon Sanger's founding discovery of dideoxynucleotide sequencing to perform once-unfathomable tasks. Combined with innovative physical mapping approaches that helped to establish long-range relationships between cloned stretches of genomic DNA, fluorescent DNA sequencers produced reference genome sequences for model organisms and for the reference human genome. New types of sequencing instruments that permit amazing acceleration of data-collection rates for DNA sequencing have been developed. The ability to generate genome-scale data sets is now transforming the nature of biological inquiry. Here, I provide an historical perspective of the field, focusing on the fundamental developments that predated the advent of next-generation sequencing instruments and providing information about how these instruments work, their application to biological research, and the newest types of sequencers that can extract data from single DNA molecules.

  16. Structural Insights into the Quadruplex-Duplex 3' Interface Formed from a Telomeric Repeat: A Potential Molecular Target.

    PubMed

    Russo Krauss, Irene; Ramaswamy, Sneha; Neidle, Stephen; Haider, Shozeb; Parkinson, Gary N

    2016-02-03

    We report here on an X-ray crystallographic and molecular modeling investigation into the complex 3' interface formed between putative parallel stranded G-quadruplexes and a duplex DNA sequence constructed from the human telomeric repeat sequence TTAGGG. Our crystallographic approach provides a detailed snapshot of a telomeric 3' quadruplex-duplex junction: a junction that appears to have the potential to form a unique molecular target for small molecule binding and interference with telomere-related functions. This unique target is particularly relevant as current high-affinity compounds that bind putative G-quadruplex forming sequences only rarely have a high degree of selectivity for a particular quadruplex. Here DNA junctions were assembled using different putative quadruplex-forming scaffolds linked at the 3' end to a telomeric duplex sequence and annealed to a complementary strand. We successfully generated a series of G-quadruplex-duplex containing crystals, both alone and in the presence of ligands. The structures demonstrate the formation of a parallel folded G-quadruplex and a B-form duplex DNA stacked coaxially. Most strikingly, structural data reveals the consistent formation of a TAT triad platform between the two motifs. This triad allows for a continuous stack of bases to link the quadruplex motif with the duplex region. For these crystal structures formed in the absence of ligands, the TAT triad interface occludes ligand binding at the 3' quadruplex-duplex interface, in agreement with in silico docking predictions. However, with the rearrangement of a single nucleotide, a stable pocket can be produced, thus providing an opportunity for the binding of selective molecules at the interface.

  17. Biorecognition by DNA oligonucleotides after Exposure to Photoresists and Resist Removers

    PubMed Central

    Dean, Stacey L.; Morrow, Thomas J.; Patrick, Sue; Li, Mingwei; Clawson, Gary; Mayer, Theresa S.; Keating, Christine D.

    2013-01-01

    Combining biological molecules with integrated circuit technology is of considerable interest for next generation sensors and biomedical devices. Current lithographic microfabrication methods, however, were developed for compatibility with silicon technology rather than bioorganic molecules and consequently it cannot be assumed that biomolecules will remain attached and intact during on-chip processing. Here, we evaluate the effects of three common photoresists (Microposit S1800 series, PMGI SF6, and Megaposit SPR 3012) and two photoresist removers (acetone and 1165 remover) on the ability of surface-immobilized DNA oligonucleotides to selectively recognize their reverse-complementary sequence. Two common DNA immobilization methods were compared: adsorption of 5′-thiolated sequences directly to gold nanowires and covalent attachment of 5′-thiolated sequences to surface amines on silica coated nanowires. We found that acetone had deleterious effects on selective hybridization as compared to 1165 remover, presumably due to incomplete resist removal. Use of the PMGI photoresist, which involves a high temperature bake step, was detrimental to the later performance of nanowire-bound DNA in hybridization assays, especially for DNA attached via thiol adsorption. The other three photoresists did not substantially degrade DNA binding capacity or selectivity for complementary DNA sequences. To determine if the lithographic steps caused more subtle damage, we also tested oligonucleotides containing a single base mismatch. Finally, a two-step photolithographic process was developed and used in combination with dielectrophoretic nanowire assembly to produce an array of doubly-contacted, electrically isolated individual nanowire components on a chip. Post-fabrication fluorescence imaging indicated that nanowire-bound DNA was present and able to selectively bind complementary strands. PMID:23952639

  18. Rotation-Induced Macromolecular Spooling of DNA

    NASA Astrophysics Data System (ADS)

    Shendruk, Tyler N.; Sean, David; Berard, Daniel J.; Wolf, Julian; Dragoman, Justin; Battat, Sophie; Slater, Gary W.; Leslie, Sabrina R.

    2017-07-01

    Genetic information is stored in a linear sequence of base pairs; however, thermal fluctuations and complex DNA conformations such as folds and loops make it challenging to order genomic material for in vitro analysis. In this work, we discover that rotation-induced macromolecular spooling of DNA around a rotating microwire can monotonically order genomic bases, overcoming this challenge. We use single-molecule fluorescence microscopy to directly visualize long DNA strands deforming and elongating in shear flow near a rotating microwire, in agreement with numerical simulations. While untethered DNA is observed to elongate substantially, in agreement with our theory and numerical simulations, strong extension of DNA becomes possible by introducing tethering. For the case of tethered polymers, we show that increasing the rotation rate can deterministically spool a substantial portion of the chain into a fully stretched, single-file conformation. When applied to DNA, the fraction of genetic information sequentially ordered on the microwire surface will increase with the contour length, despite the increased entropy. This ability to handle long strands of DNA is in contrast to modern DNA sample preparation technologies for sequencing and mapping, which are typically restricted to comparatively short strands, resulting in challenges in reconstructing the genome. Thus, in addition to discovering new rotation-induced macromolecular dynamics, this work inspires new approaches to handling genomic-length DNA strands.

  19. Current rectification in a single molecule diode: the role of electrode coupling.

    PubMed

    Sherif, Siya; Rubio-Bollinger, Gabino; Pinilla-Cienfuegos, Elena; Coronado, Eugenio; Cuevas, Juan Carlos; Agraït, Nicolás

    2015-07-24

    We demonstrate large rectification ratios (> 100) in single-molecule junctions based on a metal-oxide cluster (polyoxometalate), using a scanning tunneling microscope (STM) both at ambient conditions and at low temperature. These rectification ratios are the largest ever observed in a single-molecule junction, and in addition these junctions sustain current densities larger than 10(5) A cm(-2). By following the variation of the I-V characteristics with tip-molecule separation we demonstrate unambiguously that rectification is due to asymmetric coupling to the electrodes of a molecule with an asymmetric level structure. This mechanism can be implemented in other type of molecular junctions using both organic and inorganic molecules and provides a simple strategy for the rational design of molecular diodes.

  20. Current rectification in a single molecule diode: the role of electrode coupling

    NASA Astrophysics Data System (ADS)

    Sherif, Siya; Rubio-Bollinger, Gabino; Pinilla-Cienfuegos, Elena; Coronado, Eugenio; Cuevas, Juan Carlos; Agraït, Nicolás

    2015-07-01

    We demonstrate large rectification ratios (\\gt 100) in single-molecule junctions based on a metal-oxide cluster (polyoxometalate), using a scanning tunneling microscope (STM) both at ambient conditions and at low temperature. These rectification ratios are the largest ever observed in a single-molecule junction, and in addition these junctions sustain current densities larger than 105 A cm-2. By following the variation of the I-V characteristics with tip-molecule separation we demonstrate unambiguously that rectification is due to asymmetric coupling to the electrodes of a molecule with an asymmetric level structure. This mechanism can be implemented in other type of molecular junctions using both organic and inorganic molecules and provides a simple strategy for the rational design of molecular diodes.

Top