single molecule sequencing: Topics by Science.gov

Sample records for single molecule sequencing

Single molecule sequencing of the M13 virus genome without amplification

PubMed Central

Zhao, Luyang; Deng, Liwei; Li, Gailing; Jin, Huan; Cai, Jinsen; Shang, Huan; Li, Yan; Wu, Haomin; Xu, Weibin; Zeng, Lidong; Zhang, Renli; Zhao, Huan; Wu, Ping; Zhou, Zhiliang; Zheng, Jiao; Ezanno, Pierre; Yang, Andrew X.; Yan, Qin; Deem, Michael W.; He, Jiankui

2017-01-01

Next generation sequencing (NGS) has revolutionized life sciences research. However, GC bias and costly, time-intensive library preparation make NGS an ill fit for increasing sequencing demands in the clinic. A new class of third-generation sequencing platforms has arrived to meet this need, capable of directly measuring DNA and RNA sequences at the single-molecule level without amplification. Here, we use the new GenoCare single-molecule sequencing platform from Direct Genomics to sequence the genome of the M13 virus. Our platform detects single-molecule fluorescence by total internal reflection microscopy, with sequencing-by-synthesis chemistry. We sequenced the genome of M13 to a depth of 316x, with 100% coverage. We determined a consensus sequence accuracy of 100%. In contrast to GC bias inherent to NGS results, we demonstrated that our single-molecule sequencing method yields minimal GC bias. PMID:29253901
Single molecule sequencing of the M13 virus genome without amplification.

PubMed

Zhao, Luyang; Deng, Liwei; Li, Gailing; Jin, Huan; Cai, Jinsen; Shang, Huan; Li, Yan; Wu, Haomin; Xu, Weibin; Zeng, Lidong; Zhang, Renli; Zhao, Huan; Wu, Ping; Zhou, Zhiliang; Zheng, Jiao; Ezanno, Pierre; Yang, Andrew X; Yan, Qin; Deem, Michael W; He, Jiankui

2017-01-01

Next generation sequencing (NGS) has revolutionized life sciences research. However, GC bias and costly, time-intensive library preparation make NGS an ill fit for increasing sequencing demands in the clinic. A new class of third-generation sequencing platforms has arrived to meet this need, capable of directly measuring DNA and RNA sequences at the single-molecule level without amplification. Here, we use the new GenoCare single-molecule sequencing platform from Direct Genomics to sequence the genome of the M13 virus. Our platform detects single-molecule fluorescence by total internal reflection microscopy, with sequencing-by-synthesis chemistry. We sequenced the genome of M13 to a depth of 316x, with 100% coverage. We determined a consensus sequence accuracy of 100%. In contrast to GC bias inherent to NGS results, we demonstrated that our single-molecule sequencing method yields minimal GC bias.
Single-Molecule Electrical Random Resequencing of DNA and RNA

NASA Astrophysics Data System (ADS)

Ohshiro, Takahito; Matsubara, Kazuki; Tsutsui, Makusu; Furuhashi, Masayuki; Taniguchi, Masateru; Kawai, Tomoji

2012-07-01

Two paradigm shifts in DNA sequencing technologies--from bulk to single molecules and from optical to electrical detection--are expected to realize label-free, low-cost DNA sequencing that does not require PCR amplification. It will lead to development of high-throughput third-generation sequencing technologies for personalized medicine. Although nanopore devices have been proposed as third-generation DNA-sequencing devices, a significant milestone in these technologies has been attained by demonstrating a novel technique for resequencing DNA using electrical signals. Here we report single-molecule electrical resequencing of DNA and RNA using a hybrid method of identifying single-base molecules via tunneling currents and random sequencing. Our method reads sequences of nine types of DNA oligomers. The complete sequence of 5'-UGAGGUA-3' from the let-7 microRNA family was also identified by creating a composite of overlapping fragment sequences, which was randomly determined using tunneling current conducted by single-base molecules as they passed between a pair of nanoelectrodes.
Single molecule real-time (SMRT) sequencing comes of age: applications and utilities for medical diagnostics

PubMed Central

Ardui, Simon; Ameur, Adam; Vermeesch, Joris R; Hestand, Matthew S

2018-01-01

Abstract Short read massive parallel sequencing has emerged as a standard diagnostic tool in the medical setting. However, short read technologies have inherent limitations such as GC bias, difficulties mapping to repetitive elements, trouble discriminating paralogous sequences, and difficulties in phasing alleles. Long read single molecule sequencers resolve these obstacles. Moreover, they offer higher consensus accuracies and can detect epigenetic modifications from native DNA. The first commercially available long read single molecule platform was the RS system based on PacBio's single molecule real-time (SMRT) sequencing technology, which has since evolved into their RSII and Sequel systems. Here we capsulize how SMRT sequencing is revolutionizing constitutional, reproductive, cancer, microbial and viral genetic testing. PMID:29401301
[The principle and application of the single-molecule real-time sequencing technology].

PubMed

Yanhu, Liu; Lu, Wang; Li, Yu

2015-03-01

Last decade witnessed the explosive development of the third-generation sequencing strategy, including single-molecule real-time sequencing (SMRT), true single-molecule sequencing (tSMSTM) and the single-molecule nanopore DNA sequencing. In this review, we summarize the principle, performance and application of the SMRT sequencing technology. Compared with the traditional Sanger method and the next-generation sequencing (NGS) technologies, the SMRT approach has several advantages, including long read length, high speed, PCR-free and the capability of direct detection of epigenetic modiﬁcations. However, the disadvantage of its low accuracy, most of which resulted from insertions and deletions, is also notable. So, the raw sequence data need to be corrected before assembly. Up to now, the SMRT is a good fit for applications in the de novo genomic sequencing and the high-quality assemblies of small genomes. In the future, it is expected to play an important role in epigenetics, transcriptomic sequencing, and assemblies of large genomes.
Development of a reference material of a single DNA molecule for the quality control of PCR testing.

PubMed

Mano, Junichi; Hatano, Shuko; Futo, Satoshi; Yoshii, Junji; Nakae, Hiroki; Naito, Shigehiro; Takabatake, Reona; Kitta, Kazumi

2014-09-02

We developed a reference material of a single DNA molecule with a specific nucleotide sequence. The double-strand linear DNA which has PCR target sequences at the both ends was prepared as a reference DNA molecule, and we named the PCR targets on each side as confirmation sequence and standard sequence. The highly diluted solution of the reference molecule was dispensed into 96 wells of a plastic PCR plate to make the average number of molecules in a well below one. Subsequently, the presence or absence of the reference molecule in each well was checked by real-time PCR targeting for the confirmation sequence. After an enzymatic treatment of the reaction mixture in the positive wells for the digestion of PCR products, the resultant solution was used as the reference material of a single DNA molecule with the standard sequence. PCR analyses revealed that the prepared samples included only one reference molecule with high probability. The single-molecule reference material developed in this study will be useful for the absolute evaluation of a detection limit of PCR-based testing methods, the quality control of PCR analyses, performance evaluations of PCR reagents and instruments, and the preparation of an accurate calibration curve for real-time PCR quantitation.
Reducing assembly complexity of microbial genomes with single-molecule sequencing.

PubMed

Koren, Sergey; Harhay, Gregory P; Smith, Timothy P L; Bono, James L; Harhay, Dayna M; Mcvey, Scott D; Radune, Diana; Bergman, Nicholas H; Phillippy, Adam M

2013-01-01

The short reads output by first- and second-generation DNA sequencing instruments cannot completely reconstruct microbial chromosomes. Therefore, most genomes have been left unfinished due to the significant resources required to manually close gaps in draft assemblies. Third-generation, single-molecule sequencing addresses this problem by greatly increasing sequencing read length, which simplifies the assembly problem. To measure the benefit of single-molecule sequencing on microbial genome assembly, we sequenced and assembled the genomes of six bacteria and analyzed the repeat complexity of 2,267 complete bacteria and archaea. Our results indicate that the majority of known bacterial and archaeal genomes can be assembled without gaps, at finished-grade quality, using a single PacBio RS sequencing library. These single-library assemblies are also more accurate than typical short-read assemblies and hybrid assemblies of short and long reads. Automated assembly of long, single-molecule sequencing data reduces the cost of microbial finishing to $1,000 for most genomes, and future advances in this technology are expected to drive the cost lower. This is expected to increase the number of completed genomes, improve the quality of microbial genome databases, and enable high-fidelity, population-scale studies of pan-genomes and chromosomal organization.
Hybrid error correction and de novo assembly of single-molecule sequencing reads

PubMed Central

Koren, Sergey; Schatz, Michael C.; Walenz, Brian P.; Martin, Jeffrey; Howard, Jason; Ganapathy, Ganeshkumar; Wang, Zhong; Rasko, David A.; McCombie, W. Richard; Jarvis, Erich D.; Phillippy, Adam M.

2012-01-01

Emerging single-molecule sequencing instruments can generate multi-kilobase sequences with the potential to dramatically improve genome and transcriptome assembly. However, the high error rate of single-molecule reads is challenging, and has limited their use to resequencing bacteria. To address this limitation, we introduce a novel correction algorithm and assembly strategy that utilizes shorter, high-identity sequences to correct the error in single-molecule sequences. We demonstrate the utility of this approach on Pacbio RS reads of phage, prokaryotic, and eukaryotic whole genomes, including the novel genome of the parrot Melopsittacus undulatus, as well as for RNA-seq reads of the corn (Zea mays) transcriptome. Our approach achieves over 99.9% read correction accuracy and produces substantially better assemblies than current sequencing strategies: in the best example, quintupling the median contig size relative to high-coverage, second-generation assemblies. Greater gains are predicted if read lengths continue to increase, including the prospect of single-contig bacterial chromosome assembly. PMID:22750884
Assembly and diploid architecture of an individual human genome via single-molecule technologies

PubMed Central

Pendleton, Matthew; Sebra, Robert; Pang, Andy Wing Chun; Ummat, Ajay; Franzen, Oscar; Rausch, Tobias; Stütz, Adrian M; Stedman, William; Anantharaman, Thomas; Hastie, Alex; Dai, Heng; Fritz, Markus Hsi-Yang; Cao, Han; Cohain, Ariella; Deikus, Gintaras; Durrett, Russell E; Blanchard, Scott C; Altman, Roger; Chin, Chen-Shan; Guo, Yan; Paxinos, Ellen E; Korbel, Jan O; Darnell, Robert B; McCombie, W Richard; Kwok, Pui-Yan; Mason, Christopher E; Schadt, Eric E; Bashir, Ali

2015-01-01

We present the first comprehensive analysis of a diploid human genome that combines single-molecule sequencing with single-molecule genome maps. Our hybrid assembly markedly improves upon the contiguity observed from traditional shotgun sequencing approaches, with scaffold N50 values approaching 30 Mb, and we identified complex structural variants (SVs) missed by other high-throughput approaches. Furthermore, by combining Illumina short-read data with long reads, we phased both single-nucleotide variants and SVs, generating haplotypes with over 99% consistency with previous trio-based studies. Our work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality. PMID:26121404
Assembly and diploid architecture of an individual human genome via single-molecule technologies.

PubMed

Pendleton, Matthew; Sebra, Robert; Pang, Andy Wing Chun; Ummat, Ajay; Franzen, Oscar; Rausch, Tobias; Stütz, Adrian M; Stedman, William; Anantharaman, Thomas; Hastie, Alex; Dai, Heng; Fritz, Markus Hsi-Yang; Cao, Han; Cohain, Ariella; Deikus, Gintaras; Durrett, Russell E; Blanchard, Scott C; Altman, Roger; Chin, Chen-Shan; Guo, Yan; Paxinos, Ellen E; Korbel, Jan O; Darnell, Robert B; McCombie, W Richard; Kwok, Pui-Yan; Mason, Christopher E; Schadt, Eric E; Bashir, Ali

2015-08-01

We present the first comprehensive analysis of a diploid human genome that combines single-molecule sequencing with single-molecule genome maps. Our hybrid assembly markedly improves upon the contiguity observed from traditional shotgun sequencing approaches, with scaffold N50 values approaching 30 Mb, and we identified complex structural variants (SVs) missed by other high-throughput approaches. Furthermore, by combining Illumina short-read data with long reads, we phased both single-nucleotide variants and SVs, generating haplotypes with over 99% consistency with previous trio-based studies. Our work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality.
Quantum-Sequencing: Fast electronic single DNA molecule sequencing

NASA Astrophysics Data System (ADS)

Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

2014-03-01

A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free, high-throughput and cost-effective, single-molecule sequencing method. Here, we present the first demonstration of unique ``electronic fingerprint'' of all nucleotides (A, G, T, C), with single-molecule DNA sequencing, using Quantum-tunneling Sequencing (Q-Seq) at room temperature. We show that the electronic state of the nucleobases shift depending on the pH, with most distinct states identified at acidic pH. We also demonstrate identification of single nucleotide modifications (methylation here). Using these unique electronic fingerprints (or tunneling data), we report a partial sequence of beta lactamase (bla) gene, which encodes resistance to beta-lactam antibiotics, with over 95% success rate. These results highlight the potential of Q-Seq as a robust technique for next-generation sequencing.
Nanopores and nucleic acids: prospects for ultrarapid sequencing

NASA Technical Reports Server (NTRS)

Deamer, D. W.; Akeson, M.

2000-01-01

DNA and RNA molecules can be detected as they are driven through a nanopore by an applied electric field at rates ranging from several hundred microseconds to a few milliseconds per molecule. The nanopore can rapidly discriminate between pyrimidine and purine segments along a single-stranded nucleic acid molecule. Nanopore detection and characterization of single molecules represents a new method for directly reading information encoded in linear polymers. If single-nucleotide resolution can be achieved, it is possible that nucleic acid sequences can be determined at rates exceeding a thousand bases per second.
Quantum Point Contact Single-Nucleotide Conductance for DNA and RNA Sequence Identification.

PubMed

Afsari, Sepideh; Korshoj, Lee E; Abel, Gary R; Khan, Sajida; Chatterjee, Anushree; Nagpal, Prashant

2017-11-28

Several nanoscale electronic methods have been proposed for high-throughput single-molecule nucleic acid sequence identification. While many studies display a large ensemble of measurements as "electronic fingerprints" with some promise for distinguishing the DNA and RNA nucleobases (adenine, guanine, cytosine, thymine, and uracil), important metrics such as accuracy and confidence of base calling fall well below the current genomic methods. Issues such as unreliable metal-molecule junction formation, variation of nucleotide conformations, insufficient differences between the molecular orbitals responsible for single-nucleotide conduction, and lack of rigorous base calling algorithms lead to overlapping nanoelectronic measurements and poor nucleotide discrimination, especially at low coverage on single molecules. Here, we demonstrate a technique for reproducible conductance measurements on conformation-constrained single nucleotides and an advanced algorithmic approach for distinguishing the nucleobases. Our quantum point contact single-nucleotide conductance sequencing (QPICS) method uses combed and electrostatically bound single DNA and RNA nucleotides on a self-assembled monolayer of cysteamine molecules. We demonstrate that by varying the applied bias and pH conditions, molecular conductance can be switched ON and OFF, leading to reversible nucleotide perturbation for electronic recognition (NPER). We utilize NPER as a method to achieve >99.7% accuracy for DNA and RNA base calling at low molecular coverage (∼12×) using unbiased single measurements on DNA/RNA nucleotides, which represents a significant advance compared to existing sequencing methods. These results demonstrate the potential for utilizing simple surface modifications and existing biochemical moieties in individual nucleobases for a reliable, direct, single-molecule, nanoelectronic DNA and RNA nucleotide identification method for sequencing.
Single-molecule study of thymidine glycol and i-motif through the alpha-hemolysin ion channel

NASA Astrophysics Data System (ADS)

He, Lidong

Nanopore-based devices have emerged as a single-molecule detection and analysis tool for a wide range of applications. Through electrophoretically driving DNA molecules across a nanosized pore, a lot of information can be received, including unfolding kinetics and DNA-protein interactions. This single-molecule method has the potential to sequence kilobase length DNA polymers without amplification or labeling, approaching "the third generation" genome sequencing for around $1000 within 24 hours. alpha-Hemolysin biological nanopores have the advantages of excellent stability, low-noise level, and precise site-directed mutagenesis for engineering this protein nanopore. The first work presented in this thesis established the current signal of the thymidine glycol lesion in DNA oligomers through an immobilization experiment. The thymidine glycol enantiomers were differentiated from each other by different current blockage levels. Also, the effect of bulky hydrophobic adducts to the current blockage was investigated. Secondly, the alpha-hemolysin nanopore was used to study the human telomere i-motif and RET oncogene i-motif at a single-molecule level. In Chapter 3, it was demonstrated that the alpha-hemolysin nanopore can differentiate an i-motif form and single-strand DNA form at different pH values based on the same sequence. In addition, it shows potential to differentiate the folding topologies generated from the same DNA sequence.
Single molecule sequencing-guided scaffolding and correction of draft assemblies.

PubMed

Zhu, Shenglong; Chen, Danny Z; Emrich, Scott J

2017-12-06

Although single molecule sequencing is still improving, the lengths of the generated sequences are inevitably an advantage in genome assembly. Prior work that utilizes long reads to conduct genome assembly has mostly focused on correcting sequencing errors and improving contiguity of de novo assemblies. We propose a disassembling-reassembling approach for both correcting structural errors in the draft assembly and scaffolding a target assembly based on error-corrected single molecule sequences. To achieve this goal, we formulate a maximum alternating path cover problem. We prove that this problem is NP-hard, and solve it by a 2-approximation algorithm. Our experimental results show that our approach can improve the structural correctness of target assemblies in the cost of some contiguity, even with smaller amounts of long reads. In addition, our reassembling process can also serve as a competitive scaffolder relative to well-established assembly benchmarks.
Attomole-level Genomics with Single-molecule Direct DNA, cDNA and RNA Sequencing Technologies.

PubMed

Ozsolak, Fatih

2016-01-01

With the introduction of next-generation sequencing (NGS) technologies in 2005, the domination of microarrays in genomics quickly came to an end due to NGS's superior technical performance and cost advantages. By enabling genetic analysis capabilities that were not possible previously, NGS technologies have started to play an integral role in all areas of biomedical research. This chapter outlines the low-quantity DNA and cDNA sequencing capabilities and applications developed with the Helicos single molecule DNA sequencing technology.
[Biophysics of single molecules].

PubMed

Serdiuk, I N; Deriusheva, E I

2011-01-01

The modern methods of research of biological molecules whose application led to the development of a new field of science, biophysics of single molecules, are reviewed. The measurement of the characteristics of single molecules enables one to reveal their individual features, and it is just for this reason that much more information can be obtained from one molecule than from the entire ensample of molecules. The high sensitivity of the methods considered in detail makes it possible to come close to the solution of the basic problem of practical importance, namely, the determination of the nucleotide sequence of a single DNA molecule.
Single-Molecule Counting of Point Mutations by Transient DNA Binding

NASA Astrophysics Data System (ADS)

Su, Xin; Li, Lidan; Wang, Shanshan; Hao, Dandan; Wang, Lei; Yu, Changyuan

2017-03-01

High-confidence detection of point mutations is important for disease diagnosis and clinical practice. Hybridization probes are extensively used, but are hindered by their poor single-nucleotide selectivity. Shortening the length of DNA hybridization probes weakens the stability of the probe-target duplex, leading to transient binding between complementary sequences. The kinetics of probe-target binding events are highly dependent on the number of complementary base pairs. Here, we present a single-molecule assay for point mutation detection based on transient DNA binding and use of total internal reflection fluorescence microscopy. Statistical analysis of single-molecule kinetics enabled us to effectively discriminate between wild type DNA sequences and single-nucleotide variants at the single-molecule level. A higher single-nucleotide discrimination is achieved than in our previous work by optimizing the assay conditions, which is guided by statistical modeling of kinetics with a gamma distribution. The KRAS c.34 A mutation can be clearly differentiated from the wild type sequence (KRAS c.34 G) at a relative abundance as low as 0.01% mutant to WT. To demonstrate the feasibility of this method for analysis of clinically relevant biological samples, we used this technology to detect mutations in single-stranded DNA generated from asymmetric RT-PCR of mRNA from two cancer cell lines.
Single molecule targeted sequencing for cancer gene mutation detection.

PubMed

Gao, Yan; Deng, Liwei; Yan, Qin; Gao, Yongqian; Wu, Zengding; Cai, Jinsen; Ji, Daorui; Li, Gailing; Wu, Ping; Jin, Huan; Zhao, Luyang; Liu, Song; Ge, Liangjin; Deem, Michael W; He, Jiankui

2016-05-19

With the rapid decline in cost of sequencing, it is now affordable to examine multiple genes in a single disease-targeted clinical test using next generation sequencing. Current targeted sequencing methods require a separate step of targeted capture enrichment during sample preparation before sequencing. Although there are fast sample preparation methods available in market, the library preparation process is still relatively complicated for physicians to use routinely. Here, we introduced an amplification-free Single Molecule Targeted Sequencing (SMTS) technology, which combined targeted capture and sequencing in one step. We demonstrated that this technology can detect low-frequency mutations using artificially synthesized DNA sample. SMTS has several potential advantages, including simple sample preparation thus no biases and errors are introduced by PCR reaction. SMTS has the potential to be an easy and quick sequencing technology for clinical diagnosis such as cancer gene mutation detection, infectious disease detection, inherited condition screening and noninvasive prenatal diagnosis.
DNA and RNA sequencing by nanoscale reading through programmable electrophoresis and nanoelectrode-gated tunneling and dielectric detection

DOEpatents

Lee, James W.; Thundat, Thomas G.

2005-06-14

An apparatus and method for performing nucleic acid (DNA and/or RNA) sequencing on a single molecule. The genetic sequence information is obtained by probing through a DNA or RNA molecule base by base at nanometer scale as though looking through a strip of movie film. This DNA sequencing nanotechnology has the theoretical capability of performing DNA sequencing at a maximal rate of about 1,000,000 bases per second. This enhanced performance is made possible by a series of innovations including: novel applications of a fine-tuned nanometer gap for passage of a single DNA or RNA molecule; thin layer microfluidics for sample loading and delivery; and programmable electric fields for precise control of DNA or RNA movement. Detection methods include nanoelectrode-gated tunneling current measurements, dielectric molecular characterization, and atomic force microscopy/electrostatic force microscopy (AFM/EFM) probing for nanoscale reading of the nucleic acid sequences.

The Shine-Dalgarno sequence of riboswitch-regulated single mRNAs shows ligand-dependent accessibility bursts

NASA Astrophysics Data System (ADS)

Rinaldi, Arlie J.; Lund, Paul E.; Blanco, Mario R.; Walter, Nils G.

2016-01-01

In response to intracellular signals in Gram-negative bacteria, translational riboswitches--commonly embedded in messenger RNAs (mRNAs)--regulate gene expression through inhibition of translation initiation. It is generally thought that this regulation originates from occlusion of the Shine-Dalgarno (SD) sequence upon ligand binding; however, little direct evidence exists. Here we develop Single Molecule Kinetic Analysis of RNA Transient Structure (SiM-KARTS) to investigate the ligand-dependent accessibility of the SD sequence of an mRNA hosting the 7-aminomethyl-7-deazaguanine (preQ1)-sensing riboswitch. Spike train analysis reveals that individual mRNA molecules alternate between two conformational states, distinguished by `bursts' of probe binding associated with increased SD sequence accessibility. Addition of preQ1 decreases the lifetime of the SD's high-accessibility (bursting) state and prolongs the time between bursts. In addition, ligand-jump experiments reveal imperfect riboswitching of single mRNA molecules. Such complex ligand sensing by individual mRNA molecules rationalizes the nuanced ligand response observed during bulk mRNA translation.
Recent patents of nanopore DNA sequencing technology: progress and challenges.

PubMed

Zhou, Jianfeng; Xu, Bingqian

2010-11-01

DNA sequencing techniques witnessed fast development in the last decades, primarily driven by the Human Genome Project. Among the proposed new techniques, Nanopore was considered as a suitable candidate for the single DNA sequencing with ultrahigh speed and very low cost. Several fabrication and modification techniques have been developed to produce robust and well-defined nanopore devices. Many efforts have also been done to apply nanopore to analyze the properties of DNA molecules. By comparing with traditional sequencing techniques, nanopore has demonstrated its distinctive superiorities in main practical issues, such as sample preparation, sequencing speed, cost-effective and read-length. Although challenges still remain, recent researches in improving the capabilities of nanopore have shed a light to achieve its ultimate goal: Sequence individual DNA strand at single nucleotide level. This patent review briefly highlights recent developments and technological achievements for DNA analysis and sequencing at single molecule level, focusing on nanopore based methods.
Multiplex single-molecule interaction profiling of DNA-barcoded proteins.

PubMed

Gu, Liangcai; Li, Chao; Aach, John; Hill, David E; Vidal, Marc; Church, George M

2014-11-27

In contrast with advances in massively parallel DNA sequencing, high-throughput protein analyses are often limited by ensemble measurements, individual analyte purification and hence compromised quality and cost-effectiveness. Single-molecule protein detection using optical methods is limited by the number of spectrally non-overlapping chromophores. Here we introduce a single-molecular-interaction sequencing (SMI-seq) technology for parallel protein interaction profiling leveraging single-molecule advantages. DNA barcodes are attached to proteins collectively via ribosome display or individually via enzymatic conjugation. Barcoded proteins are assayed en masse in aqueous solution and subsequently immobilized in a polyacrylamide thin film to construct a random single-molecule array, where barcoding DNAs are amplified into in situ polymerase colonies (polonies) and analysed by DNA sequencing. This method allows precise quantification of various proteins with a theoretical maximum array density of over one million polonies per square millimetre. Furthermore, protein interactions can be measured on the basis of the statistics of colocalized polonies arising from barcoding DNAs of interacting proteins. Two demanding applications, G-protein coupled receptor and antibody-binding profiling, are demonstrated. SMI-seq enables 'library versus library' screening in a one-pot assay, simultaneously interrogating molecular binding affinity and specificity.
Single Molecule Visualization of Protein-DNA Complexes: Watching Machines at Work

NASA Astrophysics Data System (ADS)

Kowalczykowski, Stephen

2013-03-01

We can now watch individual proteins acting on single molecules of DNA. Such imaging provides unprecedented interrogation of fundamental biophysical processes. Visualization is achieved through the application of two complementary procedures. In one, single DNA molecules are attached to a polystyrene bead and are then captured by an optical trap. The DNA, a worm-like coil, is extended either by the force of solution flow in a micro-fabricated channel, or by capturing the opposite DNA end in a second optical trap. In the second procedure, DNA is attached by one end to a glass surface. The coiled DNA is elongated either by continuous solution flow or by subsequently tethering the opposite end to the surface. Protein action is visualized by fluorescent reporters: fluorescent dyes that bind double-stranded DNA (dsDNA), fluorescent biosensors for single-stranded DNA (ssDNA), or fluorescently-tagged proteins. Individual molecules are imaged using either epifluorescence microscopy or total internal reflection fluorescence (TIRF) microscopy. Using these approaches, we imaged the search for DNA sequence homology conducted by the RecA-ssDNA filament. The manner by which RecA protein finds a single homologous sequence in the genome had remained undefined for almost 30 years. Single-molecule imaging revealed that the search occurs through a mechanism termed ``intersegmental contact sampling,'' in which the randomly coiled structure of DNA is essential for reiterative sampling of DNA sequence identity: an example of parallel processing. In addition, the assembly of RecA filaments on single molecules of single-stranded DNA was visualized. Filament assembly requires nucleation of a protein dimer on DNA, and subsequent growth occurs via monomer addition. Furthermore, we discovered a class of proteins that catalyzed both nucleation and growth of filaments, revealing how the cell controls assembly of this protein-DNA complex.
Exploring Connectivity in Sequence Space of Functional RNA

NASA Technical Reports Server (NTRS)

Wei, Chenyu; Pohorille, Andrzej; Popovic, Milena; Ditzler, Mark

2017-01-01

Emergence of replicable genetic molecules was one of the marking points in the origin of life, evolution of which can be conceptualized as a walk through the space of all possible sequences. A theoretical concept of fitness landscape helps to understand evolutionary processes through assigning a value of fitness to each genotype. Then, evolution of a phenotype is viewed as a series of consecutive, single-point mutations. Natural selection biases evolution toward peaks of high fitness and away from valleys of low fitness. whereas neutral drift occurs in the sequence space without direction as mutations are introduced at random. Large networks of neutral or near-neutral mutations on a fitness landscape, especially for sufficiently long genomes, are possible or even inevitable. Their detection in experiments, however, has been elusive. Although a few near-neutral evolutionary pathways have been found, recent experimental evidence indicates landscapes consist of largely isolated islands. The generality of these results, however, is not clear, as the genome length or the fraction of functional molecules in the genotypic space might have been insufficient for the emergence of large, neutral networks. Thorough investigation on the structure of the fitness landscape is essential to understand the mechanisms of evolution of early genomes. RNA molecules are commonly assumed to play the pivotal role in the origin of genetic systems. They are widely believed to be early, if not the earliest, genetic and catalytic molecules, with abundant biochemical activities as aptamers and ribozymes, i.e. RNA molecules capable, respectively, to bind small molecules or catalyze chemical reactions. Here, we present results of our recent studies on the structure of the sequence space of RNA ligase ribozymes selected through in vitro evolution. Several hundred thousands of sequences active to a different degree were obtained by way of deep sequencing. Analysis of these sequences revealed several large clusters defined such that every sequence in a cluster can be reached from any other sequence in the same cluster through a series of single point mutations. Sequences in a single cluster appear to adopt more than one secondary structure. The mechanism of refolding within a single cluster was examined. To shed light on possible evolutionary paths in the space of ribozymes, the connectivity between clusters was investigated. The effect of length of RNA molecules on the structure of the fitness landscape and possible evolutionary paths was examined by way of comparing functional sequences of 20 and 80 nucleobases in length. It was found that sequences of different lengths shared secondary structure motifs that were presumed responsible for catalytic activity, with increasing complexity and global structural rearrangements emerging in longer molecules.
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum.

PubMed

VanBuren, Robert; Bryant, Doug; Edger, Patrick P; Tang, Haibao; Burgess, Diane; Challabathula, Dinakar; Spittle, Kristi; Hall, Richard; Gu, Jenny; Lyons, Eric; Freeling, Michael; Bartels, Dorothea; Ten Hallers, Boudewijn; Hastie, Alex; Michael, Todd P; Mockler, Todd C

2015-11-26

Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a 'near-complete' draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. The Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
Single-Molecule Denaturation Mapping of Genomic DNA in Nanofluidic Channels

NASA Astrophysics Data System (ADS)

Reisner, Walter; Larsen, Niels; Kristensen, Anders; Tegenfeldt, Jonas O.; Flyvbjerg, Henrik

2009-03-01

We have developed a new DNA barcoding technique based on the partial denaturation of extended fluorescently labeled DNA molecules. We partially melt DNA extended in nanofluidic channels via a combination of local heating and added chemical denaturants. The melted molecules, imaged via a standard fluorescence videomicroscopy setup, exhibit a nonuniform fluorescence profile corresponding to a series of local dips and peaks in the intensity trace along the stretched molecule. We show that this barcode is consistent with the presence of locally melted regions and can be explained by calculations of sequence-dependent melting probability. We believe this melting mapping technology is the first optically based single molecule technique sensitive to genome wide sequence variation that does not require an additional enzymatic labeling or restriction scheme.
Using Synthetic Nanopores for Single-Molecule Analyses: Detecting SNPs, Trapping DNA Molecules, and the Prospects for Sequencing DNA

ERIC Educational Resources Information Center

Dimitrov, Valentin V.

2009-01-01

This work focuses on studying properties of DNA molecules and DNA-protein interactions using synthetic nanopores, and it examines the prospects of sequencing DNA using synthetic nanopores. We have developed a method for discriminating between alleles that uses a synthetic nanopore to measure the binding of a restriction enzyme to DNA. There exists…
Thermoelectric effect and its dependence on molecular length and sequence in single DNA molecules.

PubMed

Li, Yueqi; Xiang, Limin; Palma, Julio L; Asai, Yoshihiro; Tao, Nongjian

2016-04-15

Studying the thermoelectric effect in DNA is important for unravelling charge transport mechanisms and for developing relevant applications of DNA molecules. Here we report a study of the thermoelectric effect in single DNA molecules. By varying the molecular length and sequence, we tune the charge transport in DNA to either a hopping- or tunnelling-dominated regimes. The thermoelectric effect is small and insensitive to the molecular length in the hopping regime. In contrast, the thermoelectric effect is large and sensitive to the length in the tunnelling regime. These findings indicate that one may control the thermoelectric effect in DNA by varying its sequence and length. We describe the experimental results in terms of hopping and tunnelling charge transport models.
Thermoelectric effect and its dependence on molecular length and sequence in single DNA molecules

PubMed Central

Li, Yueqi; Xiang, Limin; Palma, Julio L.; Asai, Yoshihiro; Tao, Nongjian

2016-01-01

Studying the thermoelectric effect in DNA is important for unravelling charge transport mechanisms and for developing relevant applications of DNA molecules. Here we report a study of the thermoelectric effect in single DNA molecules. By varying the molecular length and sequence, we tune the charge transport in DNA to either a hopping- or tunnelling-dominated regimes. The thermoelectric effect is small and insensitive to the molecular length in the hopping regime. In contrast, the thermoelectric effect is large and sensitive to the length in the tunnelling regime. These findings indicate that one may control the thermoelectric effect in DNA by varying its sequence and length. We describe the experimental results in terms of hopping and tunnelling charge transport models. PMID:27079152
Long-range barcode labeling-sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chen, Feng; Zhang, Tao; Singh, Kanwar K.

Methods for sequencing single large DNA molecules by clonal multiple displacement amplification using barcoded primers. Sequences are binned based on barcode sequences and sequenced using a microdroplet-based method for sequencing large polynucleotide templates to enable assembly of haplotype-resolved complex genomes and metagenomes.
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum

DOE Office of Scientific and Technical Information (OSTI.GOV)

VanBuren, Robert; Bryant, Doug; Edger, Patrick P.

Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly1. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetiummore » genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.« less
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum

DOE PAGES

VanBuren, Robert; Bryant, Doug; Edger, Patrick P.; ...

2015-11-11

Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly1. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetiummore » genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a ‘near-complete’ draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. As a result, the Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.« less
DNA confinement in nanochannels: physics and biological applications

NASA Astrophysics Data System (ADS)

Reisner, Walter; Pedersen, Jonas N.; Austin, Robert H.

2012-10-01

DNA is the central storage molecule of genetic information in the cell, and reading that information is a central problem in biology. While sequencing technology has made enormous advances over the past decade, there is growing interest in platforms that can readout genetic information directly from long single DNA molecules, with the ultimate goal of single-cell, single-genome analysis. Such a capability would obviate the need for ensemble averaging over heterogeneous cellular populations and eliminate uncertainties introduced by cloning and molecular amplification steps (thus enabling direct assessment of the genome in its native state). In this review, we will discuss how the information contained in genomic-length single DNA molecules can be accessed via physical confinement in nanochannels. Due to self-avoidance interactions, DNA molecules will stretch out when confined in nanochannels, creating a linear unscrolling of the genome along the channel for analysis. We will first review the fundamental physics of DNA nanochannel confinement—including the effect of varying ionic strength—and then discuss recent applications of these systems to genomic mapping. Apart from the intense biological interest in extracting linear sequence information from elongated DNA molecules, from a physics view these systems are fascinating as they enable probing of single-molecule conformation in environments with dimensions that intersect key physical length-scales in the 1 nm to 100 µm range.
DNA confinement in nanochannels: physics and biological applications.

PubMed

Reisner, Walter; Pedersen, Jonas N; Austin, Robert H

2012-10-01

DNA is the central storage molecule of genetic information in the cell, and reading that information is a central problem in biology. While sequencing technology has made enormous advances over the past decade, there is growing interest in platforms that can readout genetic information directly from long single DNA molecules, with the ultimate goal of single-cell, single-genome analysis. Such a capability would obviate the need for ensemble averaging over heterogeneous cellular populations and eliminate uncertainties introduced by cloning and molecular amplification steps (thus enabling direct assessment of the genome in its native state). In this review, we will discuss how the information contained in genomic-length single DNA molecules can be accessed via physical confinement in nanochannels. Due to self-avoidance interactions, DNA molecules will stretch out when confined in nanochannels, creating a linear unscrolling of the genome along the channel for analysis. We will first review the fundamental physics of DNA nanochannel confinement--including the effect of varying ionic strength--and then discuss recent applications of these systems to genomic mapping. Apart from the intense biological interest in extracting linear sequence information from elongated DNA molecules, from a physics view these systems are fascinating as they enable probing of single-molecule conformation in environments with dimensions that intersect key physical length-scales in the 1 nm to 100 µm range.
Rare Cell Detection by Single-Cell RNA Sequencing as Guided by Single-Molecule RNA FISH.

PubMed

Torre, Eduardo; Dueck, Hannah; Shaffer, Sydney; Gospocic, Janko; Gupte, Rohit; Bonasio, Roberto; Kim, Junhyong; Murray, John; Raj, Arjun

2018-02-28

Although single-cell RNA sequencing can reliably detect large-scale transcriptional programs, it is unclear whether it accurately captures the behavior of individual genes, especially those that express only in rare cells. Here, we use single-molecule RNA fluorescence in situ hybridization as a gold standard to assess trade-offs in single-cell RNA-sequencing data for detecting rare cell expression variability. We quantified the gene expression distribution for 26 genes that range from ubiquitous to rarely expressed and found that the correspondence between estimates across platforms improved with both transcriptome coverage and increased number of cells analyzed. Further, by characterizing the trade-off between transcriptome coverage and number of cells analyzed, we show that when the number of genes required to answer a given biological question is small, then greater transcriptome coverage is more important than analyzing large numbers of cells. More generally, our report provides guidelines for selecting quality thresholds for single-cell RNA-sequencing experiments aimed at rare cell analyses. Copyright © 2018 Elsevier Inc. All rights reserved.
Diagnostic Applications of Next Generation Sequencing in Immunogenetics and Molecular Oncology

PubMed Central

Grumbt, Barbara; Eck, Sebastian H.; Hinrichsen, Tanja; Hirv, Kaimo

2013-01-01

Summary With the introduction of the next generation sequencing (NGS) technologies, remarkable new diagnostic applications have been established in daily routine. Implementation of NGS is challenging in clinical diagnostics, but definite advantages and new diagnostic possibilities make the switch to the technology inevitable. In addition to the higher sequencing capacity, clonal sequencing of single molecules, multiplexing of samples, higher diagnostic sensitivity, workflow miniaturization, and cost benefits are some of the valuable features of the technology. After the recent advances, NGS emerged as a proven alternative for classical Sanger sequencing in the typing of human leukocyte antigens (HLA). By virtue of the clonal amplification of single DNA molecules ambiguous typing results can be avoided. Simultaneously, a higher sample throughput can be achieved by tagging of DNA molecules with multiplex identifiers and pooling of PCR products before sequencing. In our experience, up to 380 samples can be typed for HLA-A, -B, and -DRB1 in high-resolution during every sequencing run. In molecular oncology, NGS shows a markedly increased sensitivity in comparison to the conventional Sanger sequencing and is developing to the standard diagnostic tool in detection of somatic mutations in cancer cells with great impact on personalized treatment of patients. PMID:23922545
Reducing assembly complexity of microbial genomes with single-molecule sequencing

USDA-ARS?s Scientific Manuscript database

Genome assembly algorithms cannot fully reconstruct microbial chromosomes from the DNA reads output by first or second-generation sequencing instruments. Therefore, most genomes are left unfinished due to the significant resources required to manually close gaps left in the draft assemblies. Single-...
Nanopore-based fourth-generation DNA sequencing technology.

PubMed

Feng, Yanxiao; Zhang, Yuechuan; Ying, Cuifeng; Wang, Deqiang; Du, Chunlei

2015-02-01

Nanopore-based sequencers, as the fourth-generation DNA sequencing technology, have the potential to quickly and reliably sequence the entire human genome for less than $1000, and possibly for even less than $100. The single-molecule techniques used by this technology allow us to further study the interaction between DNA and protein, as well as between protein and protein. Nanopore analysis opens a new door to molecular biology investigation at the single-molecule scale. In this article, we have reviewed academic achievements in nanopore technology from the past as well as the latest advances, including both biological and solid-state nanopores, and discussed their recent and potential applications. Copyright © 2015 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.
Conformational Smear Characterization and Binning of Single-Molecule Conductance Measurements for Enhanced Molecular Recognition.

PubMed

Korshoj, Lee E; Afsari, Sepideh; Chatterjee, Anushree; Nagpal, Prashant

2017-11-01

Electronic conduction or charge transport through single molecules depends primarily on molecular structure and anchoring groups and forms the basis for a wide range of studies from molecular electronics to DNA sequencing. Several high-throughput nanoelectronic methods such as mechanical break junctions, nanopores, conductive atomic force microscopy, scanning tunneling break junctions, and static nanoscale electrodes are often used for measuring single-molecule conductance. In these measurements, "smearing" due to conformational changes and other entropic factors leads to large variances in the observed molecular conductance, especially in individual measurements. Here, we show a method for characterizing smear in single-molecule conductance measurements and demonstrate how binning measurements according to smear can significantly enhance the use of individual conductance measurements for molecular recognition. Using quantum point contact measurements on single nucleotides within DNA macromolecules, we demonstrate that the distance over which molecular junctions are maintained is a measure of smear, and the resulting variance in unbiased single measurements depends on this smear parameter. Our ability to identify individual DNA nucleotides at 20× coverage increases from 81.3% accuracy without smear analysis to 93.9% with smear characterization and binning (SCRIB). Furthermore, merely 7 conductance measurements (7× coverage) are needed to achieve 97.8% accuracy for DNA nucleotide recognition when only low molecular smear measurements are used, which represents a significant improvement over contemporary sequencing methods. These results have important implications in a broad range of molecular electronics applications from designing robust molecular switches to nanoelectronic DNA sequencing.

Real-time single-molecule electronic DNA sequencing by synthesis using polymer-tagged nucleotides on a nanopore array

PubMed Central

Fuller, Carl W.; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Bibillo, Arek; Stranges, P. Benjamin; Dorwart, Michael; Tao, Chuanjuan; Li, Zengmin; Guo, Wenjing; Shi, Shundi; Korenblum, Daniel; Trans, Andrew; Aguirre, Anne; Liu, Edward; Harada, Eric T.; Pollard, James; Bhat, Ashwini; Cech, Cynthia; Yang, Alexander; Arnold, Cleoma; Palla, Mirkó; Hovis, Jennifer; Chen, Roger; Morozova, Irina; Kalachikov, Sergey; Russo, James J.; Kasianowicz, John J.; Davis, Randy; Roever, Stefan; Church, George M.; Ju, Jingyue

2016-01-01

DNA sequencing by synthesis (SBS) offers a robust platform to decipher nucleic acid sequences. Recently, we reported a single-molecule nanopore-based SBS strategy that accurately distinguishes four bases by electronically detecting and differentiating four different polymer tags attached to the 5′-phosphate of the nucleotides during their incorporation into a growing DNA strand catalyzed by DNA polymerase. Further developing this approach, we report here the use of nucleotides tagged at the terminal phosphate with oligonucleotide-based polymers to perform nanopore SBS on an α-hemolysin nanopore array platform. We designed and synthesized several polymer-tagged nucleotides using tags that produce different electrical current blockade levels and verified they are active substrates for DNA polymerase. A highly processive DNA polymerase was conjugated to the nanopore, and the conjugates were complexed with primer/template DNA and inserted into lipid bilayers over individually addressable electrodes of the nanopore chip. When an incoming complementary-tagged nucleotide forms a tight ternary complex with the primer/template and polymerase, the tag enters the pore, and the current blockade level is measured. The levels displayed by the four nucleotides tagged with four different polymers captured in the nanopore in such ternary complexes were clearly distinguishable and sequence-specific, enabling continuous sequence determination during the polymerase reaction. Thus, real-time single-molecule electronic DNA sequencing data with single-base resolution were obtained. The use of these polymer-tagged nucleotides, combined with polymerase tethering to nanopores and multiplexed nanopore sensors, should lead to new high-throughput sequencing methods. PMID:27091962
Slowing down single-molecule trafficking through a protein nanopore reveals intermediates for peptide translocation

NASA Astrophysics Data System (ADS)

Mereuta, Loredana; Roy, Mahua; Asandei, Alina; Lee, Jong Kook; Park, Yoonkyung; Andricioaei, Ioan; Luchian, Tudor

2014-01-01

The microscopic details of how peptides translocate one at a time through nanopores are crucial determinants for transport through membrane pores and important in developing nano-technologies. To date, the translocation process has been too fast relative to the resolution of the single molecule techniques that sought to detect its milestones. Using pH-tuned single-molecule electrophysiology and molecular dynamics simulations, we demonstrate how peptide passage through the α-hemolysin protein can be sufficiently slowed down to observe intermediate single-peptide sub-states associated to distinct structural milestones along the pore, and how to control residence time, direction and the sequence of spatio-temporal state-to-state dynamics of a single peptide. Molecular dynamics simulations of peptide translocation reveal the time- dependent ordering of intermediate structures of the translocating peptide inside the pore at atomic resolution. Calculations of the expected current ratios of the different pore-blocking microstates and their time sequencing are in accord with the recorded current traces.
Single Molecule Spectroscopy of Amino Acids and Peptides by Recognition Tunneling

PubMed Central

Zhao, Yanan; Ashcroft, Brian; Zhang, Peiming; Liu, Hao; Sen, Suman; Song, Weisi; Im, JongOne; Gyarfas, Brett; Manna, Saikat; Biswas, Sovan; Borges, Chad; Lindsay, Stuart

2014-01-01

The human proteome has millions of protein variants due to alternative RNA splicing and post-translational modifications, and variants that are related to diseases are frequently present in minute concentrations. For DNA and RNA, low concentrations can be amplified using the polymerase chain reaction, but there is no such reaction for proteins. Therefore, the development of single molecule protein sequencing is a critical step in the search for protein biomarkers. Here we show that single amino acids can be identified by trapping the molecules between two electrodes that are coated with a layer of recognition molecules and measuring the electron tunneling current across the junction. A given molecule can bind in more than one way in the junction, and we therefore use a machine-learning algorithm to distinguish between the sets of electronic ‘fingerprints’ associated with each binding motif. With this recognition tunneling technique, we are able to identify D, L enantiomers, a methylated amino acid, isobaric isomers, and short peptides. The results suggest that direct electronic sequencing of single proteins could be possible by sequentially measuring the products of processive exopeptidase digestion, or by using a molecular motor to pull proteins through a tunnel junction integrated with a nanopore. PMID:24705512
Quantum-Sequencing: Biophysics of quantum tunneling through nucleic acids

NASA Astrophysics Data System (ADS)

Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

2014-03-01

Tunneling microscopy and spectroscopy has extensively been used in physical surface sciences to study quantum tunneling to measure electronic local density of states of nanomaterials and to characterize adsorbed species. Quantum-Sequencing (Q-Seq) is a new method based on tunneling microscopy for electronic sequencing of single molecule of nucleic acids. A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free single-molecule sequencing method. Here, we present the unique ``electronic fingerprints'' for all nucleotides on DNA and RNA using Q-Seq along their intrinsic biophysical parameters. We have analyzed tunneling spectra for the nucleotides at different pH conditions and analyzed the HOMO, LUMO and energy gap for all of them. In addition we show a number of biophysical parameters to further characterize all nucleobases (electron and hole transition voltage and energy barriers). These results highlight the robustness of Q-Seq as a technique for next-generation sequencing.
High processivity polymerases

DOEpatents

Shamoo, Yousif; Sun, Siyang

2014-06-10

Chimeric proteins comprising a sequence nonspecific single-stranded nucleic-acid-binding domain joined to a catalytic nucleic-acid-modifying domain are provided. Methods comprising contacting a nucleic acid molecule with a chimeric protein, as well as systems comprising a nucleic acid molecule, a chimeric protein, and an aqueous solution are also provided. The joining of sequence nonspecific single-stranded nucleic-acid-binding domain and a catalytic nucleic-acid-modifying domain in chimeric proteins, among other things, may prevent the separation of the two domains due to their weak association and thereby enhances processivity while maintaining fidelity.
Single Nucleobase Identification Using Biophysical Signatures from Nanoelectronic Quantum Tunneling.

PubMed

Korshoj, Lee E; Afsari, Sepideh; Khan, Sajida; Chatterjee, Anushree; Nagpal, Prashant

2017-03-01

Nanoelectronic DNA sequencing can provide an important alternative to sequencing-by-synthesis by reducing sample preparation time, cost, and complexity as a high-throughput next-generation technique with accurate single-molecule identification. However, sample noise and signature overlap continue to prevent high-resolution and accurate sequencing results. Probing the molecular orbitals of chemically distinct DNA nucleobases offers a path for facile sequence identification, but molecular entropy (from nucleotide conformations) makes such identification difficult when relying only on the energies of lowest-unoccupied and highest-occupied molecular orbitals (LUMO and HOMO). Here, nine biophysical parameters are developed to better characterize molecular orbitals of individual nucleobases, intended for single-molecule DNA sequencing using quantum tunneling of charges. For this analysis, theoretical models for quantum tunneling are combined with transition voltage spectroscopy to obtain measurable parameters unique to the molecule within an electronic junction. Scanning tunneling spectroscopy is then used to measure these nine biophysical parameters for DNA nucleotides, and a modified machine learning algorithm identified nucleobases. The new parameters significantly improve base calling over merely using LUMO and HOMO frontier orbital energies. Furthermore, high accuracies for identifying DNA nucleobases were observed at different pH conditions. These results have significant implications for developing a robust and accurate high-throughput nanoelectronic DNA sequencing technique. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Computational analysis of stochastic heterogeneity in PCR amplification efficiency revealed by single molecule barcoding

PubMed Central

Best, Katharine; Oakes, Theres; Heather, James M.; Shawe-Taylor, John; Chain, Benny

2015-01-01

The polymerase chain reaction (PCR) is one of the most widely used techniques in molecular biology. In combination with High Throughput Sequencing (HTS), PCR is widely used to quantify transcript abundance for RNA-seq, and in the context of analysis of T and B cell receptor repertoires. In this study, we combine DNA barcoding with HTS to quantify PCR output from individual target molecules. We develop computational tools that simulate both the PCR branching process itself, and the subsequent subsampling which typically occurs during HTS sequencing. We explore the influence of different types of heterogeneity on sequencing output, and compare them to experimental results where the efficiency of amplification is measured by barcodes uniquely identifying each molecule of starting template. Our results demonstrate that the PCR process introduces substantial amplification heterogeneity, independent of primer sequence and bulk experimental conditions. This heterogeneity can be attributed both to inherited differences between different template DNA molecules, and the inherent stochasticity of the PCR process. The results demonstrate that PCR heterogeneity arises even when reaction and substrate conditions are kept as constant as possible, and therefore single molecule barcoding is essential in order to derive reproducible quantitative results from any protocol combining PCR with HTS. PMID:26459131
Single-molecule dilution and multiple displacement amplification for molecular haplotyping.

PubMed

Paul, Philip; Apgar, Josh

2005-04-01

Separate haploid analysis is frequently required for heterozygous genotyping to resolve phase ambiguity or confirm allelic sequence. We demonstrate a technique of single-molecule dilution followed by multiple strand displacement amplification to haplotype polymorphic alleles. Dilution of DNA to haploid equivalency, or a single molecule, is a simple method for separating di-allelic DNA. Strand displacement amplification is a robust method for non-specific DNA expansion that employs random hexamers and phage polymerase Phi29 for double-stranded DNA displacement and primer extension, resulting in high processivity and exceptional product length. Single-molecule dilution was followed by strand displacement amplification to expand separated alleles to microgram quantities of DNA for more efficient haplotype analysis of heterozygous genes.
Digital RNA sequencing minimizes sequence-dependent bias and amplification noise with optimized single-molecule barcodes

PubMed Central

Shiroguchi, Katsuyuki; Jia, Tony Z.; Sims, Peter A.; Xie, X. Sunney

2012-01-01

RNA sequencing (RNA-Seq) is a powerful tool for transcriptome profiling, but is hampered by sequence-dependent bias and inaccuracy at low copy numbers intrinsic to exponential PCR amplification. We developed a simple strategy for mitigating these complications, allowing truly digital RNA-Seq. Following reverse transcription, a large set of barcode sequences is added in excess, and nearly every cDNA molecule is uniquely labeled by random attachment of barcode sequences to both ends. After PCR, we applied paired-end deep sequencing to read the two barcodes and cDNA sequences. Rather than counting the number of reads, RNA abundance is measured based on the number of unique barcode sequences observed for a given cDNA sequence. We optimized the barcodes to be unambiguously identifiable, even in the presence of multiple sequencing errors. This method allows counting with single-copy resolution despite sequence-dependent bias and PCR-amplification noise, and is analogous to digital PCR but amendable to quantifying a whole transcriptome. We demonstrated transcriptome profiling of Escherichia coli with more accurate and reproducible quantification than conventional RNA-Seq. PMID:22232676
Secondary structure prediction and structure-specific sequence analysis of single-stranded DNA.

PubMed

Dong, F; Allawi, H T; Anderson, T; Neri, B P; Lyamichev, V I

2001-08-01

DNA sequence analysis by oligonucleotide binding is often affected by interference with the secondary structure of the target DNA. Here we describe an approach that improves DNA secondary structure prediction by combining enzymatic probing of DNA by structure-specific 5'-nucleases with an energy minimization algorithm that utilizes the 5'-nuclease cleavage sites as constraints. The method can identify structural differences between two DNA molecules caused by minor sequence variations such as a single nucleotide mutation. It also demonstrates the existence of long-range interactions between DNA regions separated by >300 nt and the formation of multiple alternative structures by a 244 nt DNA molecule. The differences in the secondary structure of DNA molecules revealed by 5'-nuclease probing were used to design structure-specific probes for mutation discrimination that target the regions of structural, rather than sequence, differences. We also demonstrate the performance of structure-specific 'bridge' probes complementary to non-contiguous regions of the target molecule. The structure-specific probes do not require the high stringency binding conditions necessary for methods based on mismatch formation and permit mutation detection at temperatures from 4 to 37 degrees C. Structure-specific sequence analysis is applied for mutation detection in the Mycobacterium tuberculosis katG gene and for genotyping of the hepatitis C virus.
Single-Molecule Imaging of an in Vitro-Evolved RNA Aptamer Reveals Homogeneous Ligand Binding Kinetics

PubMed Central

2009-01-01

Many studies of RNA folding and catalysis have revealed conformational heterogeneity, metastable folding intermediates, and long-lived states with distinct catalytic activities. We have developed a single-molecule imaging approach for investigating the functional heterogeneity of in vitro-evolved RNA aptamers. Monitoring the association of fluorescently labeled ligands with individual RNA aptamer molecules has allowed us to record binding events over the course of multiple days, thus providing sufficient statistics to quantitatively define the kinetic properties at the single-molecule level. The ligand binding kinetics of the highly optimized RNA aptamer studied here displays a remarkable degree of uniformity and lack of memory. Such homogeneous behavior is quite different from the heterogeneity seen in previous single-molecule studies of naturally derived RNA and protein enzymes. The single-molecule methods we describe may be of use in analyzing the distribution of functional molecules in heterogeneous evolving populations or even in unselected samples of random sequences. PMID:19572753
Optimization of conditions to sequence long cDNAs from viruses

USDA-ARS?s Scientific Manuscript database

Fourth generation sequencing with the Minion nanopore sequencer provides opportunity to obtain deep coverage and long read for single molecules. This will benefit studies on RNA viruses. In the past, Sanger, Illumina, and Ion Torrent sequencing have been utilized to study RNA viruses. Both technique...
Highly sensitive detection of mutations in CHO cell recombinant DNA using multi-parallel single molecule real-time DNA sequencing.

PubMed

Cartwright, Joseph F; Anderson, Karin; Longworth, Joseph; Lobb, Philip; James, David C

2018-06-01

High-fidelity replication of biologic-encoding recombinant DNA sequences by engineered mammalian cell cultures is an essential pre-requisite for the development of stable cell lines for the production of biotherapeutics. However, immortalized mammalian cells characteristically exhibit an increased point mutation frequency compared to mammalian cells in vivo, both across their genomes and at specific loci (hotspots). Thus unforeseen mutations in recombinant DNA sequences can arise and be maintained within producer cell populations. These may affect both the stability of recombinant gene expression and give rise to protein sequence variants with variable bioactivity and immunogenicity. Rigorous quantitative assessment of recombinant DNA integrity should therefore form part of the cell line development process and be an essential quality assurance metric for instances where synthetic/multi-component assemblies are utilized to engineer mammalian cells, such as the assessment of recombinant DNA fidelity or the mutability of single-site integration target loci. Based on Pacific Biosciences (Menlo Park, CA) single molecule real-time (SMRT™) circular consensus sequencing (CCS) technology we developed a rDNA sequence analysis tool to process the multi-parallel sequencing of ∼40,000 single recombinant DNA molecules. After statistical filtering of raw sequencing data, we show that this analytical method is capable of detecting single point mutations in rDNA to a minimum single mutation frequency of 0.0042% (<1/24,000 bases). Using a stable CHO transfectant pool harboring a randomly integrated 5 kB plasmid construct encoding GFP we found that 28% of recombinant plasmid copies contained at least one low frequency (<0.3%) point mutation. These mutations were predominantly found in GC base pairs (85%) and that there was no positional bias in mutation across the plasmid sequence. There was no discernable difference between the mutation frequencies of coding and non-coding DNA. The putative ratio of non-synonymous and synonymous changes within the open reading frames (ORFs) in the plasmid sequence indicates that natural selection does not impact upon the prevalence of these mutations. Here we have demonstrated the abundance of mutations that fall outside of the reported range of detection of next generation sequencing (NGS) and second generation sequencing (SGS) platforms, providing a methodology capable of being utilized in cell line development platforms to identify the fidelity of recombinant genes throughout the production process. © 2018 Wiley Periodicals, Inc.
Unraveling secrets of telomeres: one molecule at a time

PubMed Central

Lin, Jiangguo; Kaur, Parminder; Countryman, Preston; Opresko, Patricia L.; Wang, Hong

2016-01-01

Telomeres play important roles in maintaining the stability of linear chromosomes. Telomere maintenance involves dynamic actions of multiple proteins interacting with long repetitive sequences and complex dynamic DNA structures, such as G-quadruplexes, T-loops and t-circles. Given the heterogeneity and complexity of telomeres, single-molecule approaches are essential to fully understand the structure-function relationships that govern telomere maintenance. In this review, we present a brief overview of the principles of single-molecule imaging and manipulation techniques. We then highlight results obtained from applying these single-molecule techniques for studying structure, dynamics and functions of G-quadruplexes, telomerase, and shelterin proteins. PMID:24569170
Mapping DNA polymerase errors by single-molecule sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lee, David F.; Lu, Jenny; Chang, Seungwoo

Genomic integrity is compromised by DNA polymerase replication errors, which occur in a sequence-dependent manner across the genome. Accurate and complete quantification of a DNA polymerase's error spectrum is challenging because errors are rare and difficult to detect. We report a high-throughput sequencing assay to map in vitro DNA replication errors at the single-molecule level. Unlike previous methods, our assay is able to rapidly detect a large number of polymerase errors at base resolution over any template substrate without quantification bias. To overcome the high error rate of high-throughput sequencing, our assay uses a barcoding strategy in which each replicationmore » product is tagged with a unique nucleotide sequence before amplification. Here, this allows multiple sequencing reads of the same product to be compared so that sequencing errors can be found and removed. We demonstrate the ability of our assay to characterize the average error rate, error hotspots and lesion bypass fidelity of several DNA polymerases.« less
Mapping DNA polymerase errors by single-molecule sequencing

DOE PAGES

Lee, David F.; Lu, Jenny; Chang, Seungwoo; ...

2016-05-16

Genomic integrity is compromised by DNA polymerase replication errors, which occur in a sequence-dependent manner across the genome. Accurate and complete quantification of a DNA polymerase's error spectrum is challenging because errors are rare and difficult to detect. We report a high-throughput sequencing assay to map in vitro DNA replication errors at the single-molecule level. Unlike previous methods, our assay is able to rapidly detect a large number of polymerase errors at base resolution over any template substrate without quantification bias. To overcome the high error rate of high-throughput sequencing, our assay uses a barcoding strategy in which each replicationmore » product is tagged with a unique nucleotide sequence before amplification. Here, this allows multiple sequencing reads of the same product to be compared so that sequencing errors can be found and removed. We demonstrate the ability of our assay to characterize the average error rate, error hotspots and lesion bypass fidelity of several DNA polymerases.« less
Rapid Sequencing of Complete env Genes from Primary HIV-1 Samples.

PubMed

Laird Smith, Melissa; Murrell, Ben; Eren, Kemal; Ignacio, Caroline; Landais, Elise; Weaver, Steven; Phung, Pham; Ludka, Colleen; Hepler, Lance; Caballero, Gemma; Pollner, Tristan; Guo, Yan; Richman, Douglas; Poignard, Pascal; Paxinos, Ellen E; Kosakovsky Pond, Sergei L; Smith, Davey M

2016-07-01

The ability to study rapidly evolving viral populations has been constrained by the read length of next-generation sequencing approaches and the sampling depth of single-genome amplification methods. Here, we develop and characterize a method using Pacific Biosciences' Single Molecule, Real-Time (SMRT®) sequencing technology to sequence multiple, intact full-length human immunodeficiency virus-1 env genes amplified from viral RNA populations circulating in blood, and provide computational tools for analyzing and visualizing these data.
Noninvasive prenatal testing for Wilson disease by use of circulating single-molecule amplification and resequencing technology (cSMART).

PubMed

Lv, Weigang; Wei, Xianda; Guo, Ruolan; Liu, Qin; Zheng, Yu; Chang, Jiazhen; Bai, Ting; Li, Haoxian; Zhang, Jianguang; Song, Zhuo; Cram, David S; Liang, Desheng; Wu, Lingqian

2015-01-01

Noninvasive prenatal testing (NIPT) for monogenic diseases by use of PCR-based strategies requires precise quantification of mutant fetal alleles circulating in the maternal plasma. The study describes the development and validation of a novel assay termed circulating single-molecule amplification and resequencing technology (cSMART) for counting single allelic molecules in plasma. Here we demonstrate the suitability of cSMART for NIPT, with Wilson Disease (WD) as proof of concept. We used Sanger and whole-exome sequencing to identify familial ATP7B (ATPase, Cu(++) transporting, β polypeptide) gene mutations. For cSMART, single molecules were tagged with unique barcodes and circularized, and alleles were targeted and replicated by inverse PCR. The unique single allelic molecules were identified by sequencing and counted, and the percentage of mutant alleles in the original maternal plasma sample was used to determine fetal genotypes. Four families with WD pedigrees consented to the study. Using Sanger and whole-exome sequencing, we mapped the pathogenic ATP7B mutations in each pedigree and confirmed the proband's original diagnosis of WD. After validation of cSMART with defined plasma models mimicking fetal inheritance of paternal, maternal, or both parental mutant alleles, we retrospectively showed in second pregnancies that the fetal genotypes assigned by invasive testing and NIPT were concordant. We developed a reliable and accurate NIPT assay that correctly diagnosed the fetal genotypes in 4 pregnancies at risk for WD. This novel technology has potential as a universal strategy for NIPT of other monogenic disorders, since it requires only knowledge of the parental pathogenic mutations. © 2014 American Association for Clinical Chemistry.
Long-read sequencing and de novo assembly of a Chinese genome

USDA-ARS?s Scientific Manuscript database

Short-read sequencing has enabled the de novo assembly of several individual human genomes, but with inherent limitations in characterizing repeat elements. Here we sequence a Chinese individual HX1 by single-molecule real-time (SMRT) long-read sequencing, construct a physical map by NanoChannel arr...
Efficient use of single molecule time traces to resolve kinetic rates, models and uncertainties

NASA Astrophysics Data System (ADS)

Schmid, Sonja; Hugel, Thorsten

2018-03-01

Single molecule time traces reveal the time evolution of unsynchronized kinetic systems. Especially single molecule Förster resonance energy transfer (smFRET) provides access to enzymatically important time scales, combined with molecular distance resolution and minimal interference with the sample. Yet the kinetic analysis of smFRET time traces is complicated by experimental shortcomings—such as photo-bleaching and noise. Here we recapitulate the fundamental limits of single molecule fluorescence that render the classic, dwell-time based kinetic analysis unsuitable. In contrast, our Single Molecule Analysis of Complex Kinetic Sequences (SMACKS) considers every data point and combines the information of many short traces in one global kinetic rate model. We demonstrate the potential of SMACKS by resolving the small kinetic effects caused by different ionic strengths in the chaperone protein Hsp90. These results show an unexpected interrelation between conformational dynamics and ATPase activity in Hsp90.

Designing robust watermark barcodes for multiplex long-read sequencing.

PubMed

Ezpeleta, Joaquín; Krsticevic, Flavia J; Bulacio, Pilar; Tapia, Elizabeth

2017-03-15

To attain acceptable sample misassignment rates, current approaches to multiplex single-molecule real-time sequencing require upstream quality improvement, which is obtained from multiple passes over the sequenced insert and significantly reduces the effective read length. In order to fully exploit the raw read length on multiplex applications, robust barcodes capable of dealing with the full single-pass error rates are needed. We present a method for designing sequencing barcodes that can withstand a large number of insertion, deletion and substitution errors and are suitable for use in multiplex single-molecule real-time sequencing. The manuscript focuses on the design of barcodes for full-length single-pass reads, impaired by challenging error rates in the order of 11%. The proposed barcodes can multiplex hundreds or thousands of samples while achieving sample misassignment probabilities as low as 10-7 under the above conditions, and are designed to be compatible with chemical constraints imposed by the sequencing process. Software tools for constructing watermark barcode sets and demultiplexing barcoded reads, together with example sets of barcodes and synthetic barcoded reads, are freely available at www.cifasis-conicet.gov.ar/ezpeleta/NS-watermark . ezpeleta@cifasis-conicet.gov.ar. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Enhanced sequencing coverage with digital droplet multiple displacement amplification

PubMed Central

Sidore, Angus M.; Lan, Freeman; Lim, Shaun W.; Abate, Adam R.

2016-01-01

Sequencing small quantities of DNA is important for applications ranging from the assembly of uncultivable microbial genomes to the identification of cancer-associated mutations. To obtain sufficient quantities of DNA for sequencing, the small amount of starting material must be amplified significantly. However, existing methods often yield errors or non-uniform coverage, reducing sequencing data quality. Here, we describe digital droplet multiple displacement amplification, a method that enables massive amplification of low-input material while maintaining sequence accuracy and uniformity. The low-input material is compartmentalized as single molecules in millions of picoliter droplets. Because the molecules are isolated in compartments, they amplify to saturation without competing for resources; this yields uniform representation of all sequences in the final product and, in turn, enhances the quality of the sequence data. We demonstrate the ability to uniformly amplify the genomes of single Escherichia coli cells, comprising just 4.7 fg of starting DNA, and obtain sequencing coverage distributions that rival that of unamplified material. Digital droplet multiple displacement amplification provides a simple and effective method for amplifying minute amounts of DNA for accurate and uniform sequencing. PMID:26704978
Resolving the Complexity of Human Skin Metagenomes Using Single-Molecule Sequencing

PubMed Central

Tsai, Yu-Chih; Deming, Clayton; Segre, Julia A.; Kong, Heidi H.; Korlach, Jonas

2016-01-01

ABSTRACT Deep metagenomic shotgun sequencing has emerged as a powerful tool to interrogate composition and function of complex microbial communities. Computational approaches to assemble genome fragments have been demonstrated to be an effective tool for de novo reconstruction of genomes from these communities. However, the resultant “genomes” are typically fragmented and incomplete due to the limited ability of short-read sequence data to assemble complex or low-coverage regions. Here, we use single-molecule, real-time (SMRT) sequencing to reconstruct a high-quality, closed genome of a previously uncharacterized Corynebacterium simulans and its companion bacteriophage from a skin metagenomic sample. Considerable improvement in assembly quality occurs in hybrid approaches incorporating short-read data, with even relatively small amounts of long-read data being sufficient to improve metagenome reconstruction. Using short-read data to evaluate strain variation of this C. simulans in its skin community at single-nucleotide resolution, we observed a dominant C. simulans strain with moderate allelic heterozygosity throughout the population. We demonstrate the utility of SMRT sequencing and hybrid approaches in metagenome quantitation, reconstruction, and annotation. PMID:26861018
Single Molecule Nano-Metronome

PubMed Central

Buranachai, Chittanon; McKinney, Sean A.; Ha, Taekjip

2008-01-01

We constructed a DNA-based nano-mechanical device called the nano-metronome. Our device is made by introducing complementary single stranded overhangs at the two arms of the DNA four-way junction. The ticking rates of this stochastic metronome depend on ion concentrations and can be changed by a set of DNA-based switches to deactivate/reactivate the sticky end. Since the device displays clearly distinguishable responses even with a single basepair difference, it may lead to a single molecule sensor of minute sequence differences of a target DNA. PMID:16522050
Toward the 1,000 dollars human genome.

PubMed

Bennett, Simon T; Barnes, Colin; Cox, Anthony; Davies, Lisa; Brown, Clive

2005-06-01

Revolutionary new technologies, capable of transforming the economics of sequencing, are providing an unparalleled opportunity to analyze human genetic variation comprehensively at the whole-genome level within a realistic timeframe and at affordable costs. Current estimates suggest that it would cost somewhere in the region of 30 million US dollars to sequence an entire human genome using Sanger-based sequencing, and on one machine it would take about 60 years. Solexa is widely regarded as a company with the necessary disruptive technology to be the first to achieve the ultimate goal of the so-called 1,000 dollars human genome - the conceptual cost-point needed for routine analysis of individual genomes. Solexa's technology is based on completely novel sequencing chemistry capable of sequencing billions of individual DNA molecules simultaneously, a base at a time, to enable highly accurate, low cost analysis of an entire human genome in a single experiment. When applied over a large enough genomic region, these new approaches to resequencing will enable the simultaneous detection and typing of known, as well as unknown, polymorphisms, and will also offer information about patterns of linkage disequilibrium in the population being studied. Technological progress, leading to the advent of single-molecule-based approaches, is beginning to dramatically drive down costs and increase throughput to unprecedented levels, each being several orders of magnitude better than that which is currently available. A new sequencing paradigm based on single molecules will be faster, cheaper and more sensitive, and will permit routine analysis at the whole-genome level.
Rapid Sequencing of Complete env Genes from Primary HIV-1 Samples

PubMed Central

Eren, Kemal; Ignacio, Caroline; Landais, Elise; Weaver, Steven; Phung, Pham; Ludka, Colleen; Hepler, Lance; Caballero, Gemma; Pollner, Tristan; Guo, Yan; Richman, Douglas; Poignard, Pascal; Paxinos, Ellen E.; Kosakovsky Pond, Sergei L.

2016-01-01

Abstract The ability to study rapidly evolving viral populations has been constrained by the read length of next-generation sequencing approaches and the sampling depth of single-genome amplification methods. Here, we develop and characterize a method using Pacific Biosciences’ Single Molecule, Real-Time (SMRT®) sequencing technology to sequence multiple, intact full-length human immunodeficiency virus-1 env genes amplified from viral RNA populations circulating in blood, and provide computational tools for analyzing and visualizing these data. PMID:29492273
SSPACE-LongRead: scaffolding bacterial draft genomes using long read sequence information

PubMed Central

2014-01-01

Background The recent introduction of the Pacific Biosciences RS single molecule sequencing technology has opened new doors to scaffolding genome assemblies in a cost-effective manner. The long read sequence information is promised to enhance the quality of incomplete and inaccurate draft assemblies constructed from Next Generation Sequencing (NGS) data. Results Here we propose a novel hybrid assembly methodology that aims to scaffold pre-assembled contigs in an iterative manner using PacBio RS long read information as a backbone. On a test set comprising six bacterial draft genomes, assembled using either a single Illumina MiSeq or Roche 454 library, we show that even a 50× coverage of uncorrected PacBio RS long reads is sufficient to drastically reduce the number of contigs. Comparisons to the AHA scaffolder indicate our strategy is better capable of producing (nearly) complete bacterial genomes. Conclusions The current work describes our SSPACE-LongRead software which is designed to upgrade incomplete draft genomes using single molecule sequences. We conclude that the recent advances of the PacBio sequencing technology and chemistry, in combination with the limited computational resources required to run our program, allow to scaffold genomes in a fast and reliable manner. PMID:24950923
Characterization of individual polynucleotide molecules using a membrane channel

NASA Technical Reports Server (NTRS)

Kasianowicz, J. J.; Brandin, E.; Branton, D.; Deamer, D. W.

1996-01-01

We show that an electric field can drive single-stranded RNA and DNA molecules through a 2.6-nm diameter ion channel in a lipid bilayer membrane. Because the channel diameter can accommodate only a single strand of RNA or DNA, each polymer traverses the membrane as an extended chain that partially blocks the channel. The passage of each molecule is detected as a transient decrease of ionic current whose duration is proportional to polymer length. Channel blockades can therefore be used to measure polynucleotide length. With further improvements, the method could in principle provide direct, high-speed detection of the sequence of bases in single molecules of DNA or RNA.
Complete genome sequence of Clavibacter michiganensis subsp. insidiosus R1-1 using PacBio single-molecule real-time technology

USDA-ARS?s Scientific Manuscript database

We report the complete genome sequence of Clavibacter michiganensis subsp. insidiosus R1-1 isolated in Minnesota, USA. The R1-1 genome, generated by de novo assembly of PacBio sequencing data, is the first complete genome sequence available for this subspecies....
Sequencing Technologies Panel at SFAF

DOE Office of Scientific and Technical Information (OSTI.GOV)

Turner, Steve; Fiske, Haley; Knight, Jim

2010-06-02

From left to right: Steve Turner of Pacific Biosciences, Haley Fiske of Illumina, Jim Knight of Roche, Michael Rhodes of Life Technologies and Peter Vander Horn of Life Technologies' Single Molecule Sequencing group discuss new sequencing technologies and applications on June 2, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM
Electrostatic melting in a single-molecule field-effect transistor with applications in genomic identification

PubMed Central

Vernick, Sefi; Trocchia, Scott M.; Warren, Steven B.; Young, Erik F.; Bouilly, Delphine; Gonzalez, Ruben L.; Nuckolls, Colin; Shepard, Kenneth L.

2017-01-01

The study of biomolecular interactions at the single-molecule level holds great potential for both basic science and biotechnology applications. Single-molecule studies often rely on fluorescence-based reporting, with signal levels limited by photon emission from single optical reporters. The point-functionalized carbon nanotube transistor, known as the single-molecule field-effect transistor, is a bioelectronics alternative based on intrinsic molecular charge that offers significantly higher signal levels for detection. Such devices are effective for characterizing DNA hybridization kinetics and thermodynamics and enabling emerging applications in genomic identification. In this work, we show that hybridization kinetics can be directly controlled by electrostatic bias applied between the device and the surrounding electrolyte. We perform the first single-molecule experiments demonstrating the use of electrostatics to control molecular binding. Using bias as a proxy for temperature, we demonstrate the feasibility of detecting various concentrations of 20-nt target sequences from the Ebolavirus nucleoprotein gene in a constant-temperature environment. PMID:28516911
Single molecule detection of nitric oxide enabled by d(AT)15 DNA adsorbed to near infrared fluorescent single-walled carbon nanotubes.

PubMed

Zhang, Jingqing; Boghossian, Ardemis A; Barone, Paul W; Rwei, Alina; Kim, Jong-Ho; Lin, Dahua; Heller, Daniel A; Hilmer, Andrew J; Nair, Nitish; Reuel, Nigel F; Strano, Michael S

2011-01-26

We report the selective detection of single nitric oxide (NO) molecules using a specific DNA sequence of d(AT)(15) oligonucleotides, adsorbed to an array of near-infrared fluorescent semiconducting single-walled carbon nanotubes (AT(15)-SWNT). While SWNT suspended with eight other variant DNA sequences show fluorescence quenching or enhancement from analytes such as dopamine, NADH, L-ascorbic acid, and riboflavin, d(AT)(15) imparts SWNT with a distinct selectivity toward NO. In contrast, the electrostatically neutral polyvinyl alcohol enables no response to nitric oxide, but exhibits fluorescent enhancement to other molecules in the tested library. For AT(15)-SWNT, a stepwise fluorescence decrease is observed when the nanotubes are exposed to NO, reporting the dynamics of single-molecule NO adsorption via SWNT exciton quenching. We describe these quenching traces using a birth-and-death Markov model, and the maximum likelihood estimator of adsorption and desorption rates of NO is derived. Applying the method to simulated traces indicates that the resulting error in the estimated rate constants is less than 5% under our experimental conditions, allowing for calibration using a series of NO concentrations. As expected, the adsorption rate is found to be linearly proportional to NO concentration, and the intrinsic single-site NO adsorption rate constant is 0.001 s(-1) μM NO(-1). The ability to detect nitric oxide quantitatively at the single-molecule level may find applications in new cellular assays for the study of nitric oxide carcinogenesis and chemical signaling, as well as medical diagnostics for inflammation.
Transforming single DNA molecules into fluorescent magnetic particles for detection and enumeration of genetic variations

PubMed Central

Dressman, Devin; Yan, Hai; Traverso, Giovanni; Kinzler, Kenneth W.; Vogelstein, Bert

2003-01-01

Many areas of biomedical research depend on the analysis of uncommon variations in individual genes or transcripts. Here we describe a method that can quantify such variation at a scale and ease heretofore unattainable. Each DNA molecule in a collection of such molecules is converted into a single magnetic particle to which thousands of copies of DNA identical in sequence to the original are bound. This population of beads then corresponds to a one-to-one representation of the starting DNA molecules. Variation within the original population of DNA molecules can then be simply assessed by counting fluorescently labeled particles via flow cytometry. This approach is called BEAMing on the basis of four of its principal components (beads, emulsion, amplification, and magnetics). Millions of individual DNA molecules can be assessed in this fashion with standard laboratory equipment. Moreover, specific variants can be isolated by flow sorting and used for further experimentation. BEAMing can be used for the identification and quantification of rare mutations as well as to study variations in gene sequences or transcripts in specific populations or tissues. PMID:12857956
Single Cell Total RNA Sequencing through Isothermal Amplification in Picoliter-Droplet Emulsion.

PubMed

Fu, Yusi; Chen, He; Liu, Lu; Huang, Yanyi

2016-11-15

Prevalent single cell RNA amplification and sequencing chemistries mainly focus on polyadenylated RNAs in eukaryotic cells by using oligo(dT) primers for reverse transcription. We develop a new RNA amplification method, "easier-seq", to reverse transcribe and amplify the total RNAs, both with and without polyadenylate tails, from a single cell for transcriptome sequencing with high efficiency, reproducibility, and accuracy. By distributing the reverse transcribed cDNA molecules into 1.5 × 10 5 aqueous droplets in oil, the cDNAs are isothermally amplified using random primers in each of these 65-pL reactors separately. This new method greatly improves the ease of single-cell RNA sequencing by reducing the experimental steps. Meanwhile, with less chance to induce errors, this method can easily maintain the quality of single-cell sequencing. In addition, this polyadenylate-tail-independent method can be seamlessly applied to prokaryotic cell RNA sequencing.
Helicos BioSciences.

PubMed

Milos, Patrice

2008-04-01

Helicos BioSciences Corporation is a life sciences company developing revolutionary new single molecule sequencing technology to provide the path to the US$1000 genome. True Single Molecule Sequencing (tSMS) will drive advancements in pharmacogenomics that can enable a better understanding of an individual's susceptibility to disease, develop more effective disease diagnoses and differentiate response to disease therapies. During 2007, genome-wide disease-association studies, the encylopedia of DNA elements (ENCODE) and the published genome sequence of two individuals have revealed human genome variation far more extensive than originally believed. These also demonstrated that common variations explain only a fraction of the genetic basis of disease. Therefore, the capability to understand an individual genome is critical in setting the foundation for the next great revolution in healthcare. Helicos is committed to this vision and will provide cost-effective genome sequencing and comprehensive analysis of the transcribed genome that can unlock the era of personalized healthcare.
Single-molecule protein sequencing through fingerprinting: computational assessment

NASA Astrophysics Data System (ADS)

Yao, Yao; Docter, Margreet; van Ginkel, Jetty; de Ridder, Dick; Joo, Chirlmin

2015-10-01

Proteins are vital in all biological systems as they constitute the main structural and functional components of cells. Recent advances in mass spectrometry have brought the promise of complete proteomics by helping draft the human proteome. Yet, this commonly used protein sequencing technique has fundamental limitations in sensitivity. Here we propose a method for single-molecule (SM) protein sequencing. A major challenge lies in the fact that proteins are composed of 20 different amino acids, which demands 20 molecular reporters. We computationally demonstrate that it suffices to measure only two types of amino acids to identify proteins and suggest an experimental scheme using SM fluorescence. When achieved, this highly sensitive approach will result in a paradigm shift in proteomics, with major impact in the biological and medical sciences.
Complete Genome Sequence of Clavibacter michiganensis subsp. insidiosus R1-1 Using PacBio Single-Molecule Real-Time Technology

PubMed Central

Lu, You; Samac, Deborah A.; Glazebrook, Jane

2015-01-01

We report here the complete genome sequence of Clavibacter michiganensis subsp. insidiosus R1-1, isolated in Minnesota, USA. The R1-1 genome, generated by a de novo assembly of PacBio sequencing data, is the first complete genome sequence available for this subspecies. PMID:25953184
UMI-tools: modeling sequencing errors in Unique Molecular Identifiers to improve quantification accuracy

PubMed Central

2017-01-01

Unique Molecular Identifiers (UMIs) are random oligonucleotide barcodes that are increasingly used in high-throughput sequencing experiments. Through a UMI, identical copies arising from distinct molecules can be distinguished from those arising through PCR amplification of the same molecule. However, bioinformatic methods to leverage the information from UMIs have yet to be formalized. In particular, sequencing errors in the UMI sequence are often ignored or else resolved in an ad hoc manner. We show that errors in the UMI sequence are common and introduce network-based methods to account for these errors when identifying PCR duplicates. Using these methods, we demonstrate improved quantification accuracy both under simulated conditions and real iCLIP and single-cell RNA-seq data sets. Reproducibility between iCLIP replicates and single-cell RNA-seq clustering are both improved using our proposed network-based method, demonstrating the value of properly accounting for errors in UMIs. These methods are implemented in the open source UMI-tools software package. PMID:28100584
Tracking single mRNA molecules in live cells

NASA Astrophysics Data System (ADS)

Moon, Hyungseok C.; Lee, Byung Hun; Lim, Kiseong; Son, Jae Seok; Song, Minho S.; Park, Hye Yoon

2016-06-01

mRNAs inside cells interact with numerous RNA-binding proteins, microRNAs, and ribosomes that together compose a highly heterogeneous population of messenger ribonucleoprotein (mRNP) particles. Perhaps one of the best ways to investigate the complex regulation of mRNA is to observe individual molecules. Single molecule imaging allows the collection of quantitative and statistical data on subpopulations and transient states that are otherwise obscured by ensemble averaging. In addition, single particle tracking reveals the sequence of events that occur in the formation and remodeling of mRNPs in real time. Here, we review the current state-of-the-art techniques in tagging, delivery, and imaging to track single mRNAs in live cells. We also discuss how these techniques are applied to extract dynamic information on the transcription, transport, localization, and translation of mRNAs. These studies demonstrate how single molecule tracking is transforming the understanding of mRNA regulation in live cells.
DNA sequence-dependent mechanics and protein-assisted bending in repressor-mediated loop formation

PubMed Central

Boedicker, James Q.; Garcia, Hernan G.; Johnson, Stephanie; Phillips, Rob

2014-01-01

As the chief informational molecule of life, DNA is subject to extensive physical manipulations. The energy required to deform double-helical DNA depends on sequence, and this mechanical code of DNA influences gene regulation, such as through nucleosome positioning. Here we examine the sequence-dependent flexibility of DNA in bacterial transcription factor-mediated looping, a context for which the role of sequence remains poorly understood. Using a suite of synthetic constructs repressed by the Lac repressor and two well-known sequences that show large flexibility differences in vitro, we make precise statistical mechanical predictions as to how DNA sequence influences loop formation and test these predictions using in vivo transcription and in vitro single-molecule assays. Surprisingly, sequence-dependent flexibility does not affect in vivo gene regulation. By theoretically and experimentally quantifying the relative contributions of sequence and the DNA-bending protein HU to DNA mechanical properties, we reveal that bending by HU dominates DNA mechanics and masks intrinsic sequence-dependent flexibility. Such a quantitative understanding of how mechanical regulatory information is encoded in the genome will be a key step towards a predictive understanding of gene regulation at single-base pair resolution. PMID:24231252

Theoretical electrical conductivity of hydrogen-bonded benzamide-derived molecules and single DNA bases.

PubMed

Chen, Xiang

2013-09-01

A benzamide molecule is used as a "reader" molecule to form hydrogen bonds with five single DNA bases, i.e., four normal single DNA bases A,T,C,G and one for 5methylC. The whole molecule is then attached to the gold surface so that a meta-molecule junction is formed. We calculate the transmission function and conductance for the five metal-molecule systems, with the implementation of density functional theory-based non-equilibrium Green function method. Our results show that each DNA base exhibits a unique conductance and most of them are on the pS level. The distinguishable conductance of each DNA base provides a way for the fast sequencing of DNA. We also investigate the dependence of conductivity of such a metal-molecule system on the hydrogen bond length between the "reader" molecule and DNA base, which shows that conductance follows an exponential decay as the hydrogen bond length increases, i.e., the conductivity is highly sensitive to the change in hydrogen bond length.
Nanomanipulation of Single RNA Molecules by Optical Tweezers

PubMed Central

Stephenson, William; Wan, Gorby; Tenenbaum, Scott A.; Li, Pan T. X.

2014-01-01

A large portion of the human genome is transcribed but not translated. In this post genomic era, regulatory functions of RNA have been shown to be increasingly important. As RNA function often depends on its ability to adopt alternative structures, it is difficult to predict RNA three-dimensional structures directly from sequence. Single-molecule approaches show potentials to solve the problem of RNA structural polymorphism by monitoring molecular structures one molecule at a time. This work presents a method to precisely manipulate the folding and structure of single RNA molecules using optical tweezers. First, methods to synthesize molecules suitable for single-molecule mechanical work are described. Next, various calibration procedures to ensure the proper operations of the optical tweezers are discussed. Next, various experiments are explained. To demonstrate the utility of the technique, results of mechanically unfolding RNA hairpins and a single RNA kissing complex are used as evidence. In these examples, the nanomanipulation technique was used to study folding of each structural domain, including secondary and tertiary, independently. Lastly, the limitations and future applications of the method are discussed. PMID:25177917
Method for rapid base sequencing in DNA and RNA with two base labeling

DOEpatents

Jett, J.H.; Keller, R.A.; Martin, J.C.; Posner, R.G.; Marrone, B.L.; Hammond, M.L.; Simpson, D.J.

1995-04-11

A method is described for rapid-base sequencing in DNA and RNA with two-base labeling and employing fluorescent detection of single molecules at two wavelengths. Bases modified to accept fluorescent labels are used to replicate a single DNA or RNA strand to be sequenced. The bases are then sequentially cleaved from the replicated strand, excited with a chosen spectrum of electromagnetic radiation, and the fluorescence from individual, tagged bases detected in the order of cleavage from the strand. 4 figures.
Method for rapid base sequencing in DNA and RNA with two base labeling

DOEpatents

Jett, James H.; Keller, Richard A.; Martin, John C.; Posner, Richard G.; Marrone, Babetta L.; Hammond, Mark L.; Simpson, Daniel J.

1995-01-01

Method for rapid-base sequencing in DNA and RNA with two-base labeling and employing fluorescent detection of single molecules at two wavelengths. Bases modified to accept fluorescent labels are used to replicate a single DNA or RNA strand to be sequenced. The bases are then sequentially cleaved from the replicated strand, excited with a chosen spectrum of electromagnetic radiation, and the fluorescence from individual, tagged bases detected in the order of cleavage from the strand.
RNase H-assisted RNA-primed rolling circle amplification for targeted RNA sequence detection.

PubMed

Takahashi, Hirokazu; Ohkawachi, Masahiko; Horio, Kyohei; Kobori, Toshiro; Aki, Tsunehiro; Matsumura, Yukihiko; Nakashimada, Yutaka; Okamura, Yoshiko

2018-05-17

RNA-primed rolling circle amplification (RPRCA) is a useful laboratory method for RNA detection; however, the detection of RNA is limited by the lack of information on 3'-terminal sequences. We uncovered that conventional RPRCA using pre-circularized probes could potentially detect the internal sequence of target RNA molecules in combination with RNase H. However, the specificity for mRNA detection was low, presumably due to non-specific hybridization of non-target RNA with the circular probe. To overcome this technical problem, we developed a method for detecting a sequence of interest in target RNA molecules via RNase H-assisted RPRCA using padlocked probes. When padlock probes are hybridized to the target RNA molecule, they are converted to the circular form by SplintR ligase. Subsequently, RNase H creates nick sites only in the hybridized RNA sequence, and single-stranded DNA is finally synthesized from the nick site by phi29 DNA polymerase. This method could specifically detect at least 10 fmol of the target RNA molecule without reverse transcription. Moreover, this method detected GFP mRNA present in 10 ng of total RNA isolated from Escherichia coli without background DNA amplification. Therefore, this method can potentially detect almost all types of RNA molecules without reverse transcription and reveal full-length sequence information.
Complete genome sequences of two strains of the meat spoilage bacterium Brochothrix thermosphacta isolated from ground chicken

USDA-ARS?s Scientific Manuscript database

Brochothrix thermosphacta is an important meat spoilage bacterium. Here we report the genome sequences of two strains of B. thermosphacta isolated from ground chicken. The genome sequences were determined using long-read PacBio single-molecule real-time (SMRT©) technology and are the first complete ...
Complete Genome Sequence of the Probiotic Strain Lactobacillus salivarius LPM01

PubMed Central

Codoñer, Francisco M.; Martinez-Blanch, Juan F.; Acevedo-Piérart, Marcelo; Ormeño, M. Loreto; Ramón, Daniel

2016-01-01

Lactobacillus salivarius LPM01 (DSM 22150) is a probiotic strain able to improve health status in immunocompromised people. Here, we report its complete genome sequence deciphered by PacBio single-molecule real-time (SMRT) technology. Analysis of the sequence may provide insights into its functional activity and safety assessment. PMID:27881545
Competition between B-Z and B-L transitions in a single DNA molecule: Computational studies

NASA Astrophysics Data System (ADS)

Kwon, Ah-Young; Nam, Gi-Moon; Johner, Albert; Kim, Seyong; Hong, Seok-Cheol; Lee, Nam-Kyung

2016-02-01

Under negative torsion, DNA adopts left-handed helical forms, such as Z-DNA and L-DNA. Using the random copolymer model developed for a wormlike chain, we represent a single DNA molecule with structural heterogeneity as a helical chain consisting of monomers which can be characterized by different helical senses and pitches. By Monte Carlo simulation, where we take into account bending and twist fluctuations explicitly, we study sequence dependence of B-Z transitions under torsional stress and tension focusing on the interaction with B-L transitions. We consider core sequences, (GC) n repeats or (TG) n repeats, which can interconvert between the right-handed B form and the left-handed Z form, imbedded in a random sequence, which can convert to left-handed L form with different (tension dependent) helical pitch. We show that Z-DNA formation from the (GC) n sequence is always supported by unwinding torsional stress but Z-DNA formation from the (TG) n sequence, which are more costly to convert but numerous, can be strongly influenced by the quenched disorder in the surrounding random sequence.
Biosensors for DNA sequence detection

NASA Technical Reports Server (NTRS)

Vercoutere, Wenonah; Akeson, Mark

2002-01-01

DNA biosensors are being developed as alternatives to conventional DNA microarrays. These devices couple signal transduction directly to sequence recognition. Some of the most sensitive and functional technologies use fibre optics or electrochemical sensors in combination with DNA hybridization. In a shift from sequence recognition by hybridization, two emerging single-molecule techniques read sequence composition using zero-mode waveguides or electrical impedance in nanoscale pores.
Complete Genome Sequence of Clavibacter michiganensis subsp. insidiosus R1-1 Using PacBio Single-Molecule Real-Time Technology.

PubMed

Lu, You; Samac, Deborah A; Glazebrook, Jane; Ishimaru, Carol A

2015-05-07

We report here the complete genome sequence of Clavibacter michiganensis subsp. insidiosus R1-1, isolated in Minnesota, USA. The R1-1 genome, generated by a de novo assembly of PacBio sequencing data, is the first complete genome sequence available for this subspecies. Copyright © 2015 Lu et al.
Incorporation of unique molecular identifiers in TruSeq adapters improves the accuracy of quantitative sequencing.

PubMed

Hong, Jungeui; Gresham, David

2017-11-01

Quantitative analysis of next-generation sequencing (NGS) data requires discriminating duplicate reads generated by PCR from identical molecules that are of unique origin. Typically, PCR duplicates are identified as sequence reads that align to the same genomic coordinates using reference-based alignment. However, identical molecules can be independently generated during library preparation. Misidentification of these molecules as PCR duplicates can introduce unforeseen biases during analyses. Here, we developed a cost-effective sequencing adapter design by modifying Illumina TruSeq adapters to incorporate a unique molecular identifier (UMI) while maintaining the capacity to undertake multiplexed, single-index sequencing. Incorporation of UMIs into TruSeq adapters (TrUMIseq adapters) enables identification of bona fide PCR duplicates as identically mapped reads with identical UMIs. Using TrUMIseq adapters, we show that accurate removal of PCR duplicates results in improved accuracy of both allele frequency (AF) estimation in heterogeneous populations using DNA sequencing and gene expression quantification using RNA-Seq.
ampliMethProfiler: a pipeline for the analysis of CpG methylation profiles of targeted deep bisulfite sequenced amplicons.

PubMed

Scala, Giovanni; Affinito, Ornella; Palumbo, Domenico; Florio, Ermanno; Monticelli, Antonella; Miele, Gennaro; Chiariotti, Lorenzo; Cocozza, Sergio

2016-11-25

CpG sites in an individual molecule may exist in a binary state (methylated or unmethylated) and each individual DNA molecule, containing a certain number of CpGs, is a combination of these states defining an epihaplotype. Classic quantification based approaches to study DNA methylation are intrinsically unable to fully represent the complexity of the underlying methylation substrate. Epihaplotype based approaches, on the other hand, allow methylation profiles of cell populations to be studied at the single molecule level. For such investigations, next-generation sequencing techniques can be used, both for quantitative and for epihaplotype analysis. Currently available tools for methylation analysis lack output formats that explicitly report CpG methylation profiles at the single molecule level and that have suited statistical tools for their interpretation. Here we present ampliMethProfiler, a python-based pipeline for the extraction and statistical epihaplotype analysis of amplicons from targeted deep bisulfite sequencing of multiple DNA regions. ampliMethProfiler tool provides an easy and user friendly way to extract and analyze the epihaplotype composition of reads from targeted bisulfite sequencing experiments. ampliMethProfiler is written in python language and requires a local installation of BLAST and (optionally) QIIME tools. It can be run on Linux and OS X platforms. The software is open source and freely available at http://amplimethprofiler.sourceforge.net .
Nanopore analysis of polymers in solution.

NASA Astrophysics Data System (ADS)

Deamer, David

2002-03-01

Nanopores represent a novel approach for investigating macromolecules in solution. Polymers that have been analyzed by this technique include polyethylene glycol (PEG), certain proteins and nucleic acids. The a-hemolysin pore inserted into lipid bilayers provides continuous non-gated ion current through a pore diameter of approximately 1.5 - 2 nm. Nucleic acid molecules can be driven through the pore by imposing a voltage across the supporting membrane. Single stranded, but not double stranded nucleic acids pass through in strict linear sequence from one end of the molecule to the other. While in the pore, the molecule reduces ionic current, and properties of the ionic current blockade such as duration, mean amplitude and modulations of amplitude provide information about structure and composition of the nucleic acid. For a given molecular species, the duration of the blockade is a function of chain length, and the rate of blockades is linearly related to concentration. More recent studies have shown that the a-hemolysin nanopore can discriminate between synthetic DNA molecules differing by a single base pair or even a single nucleotide. These results indicate that a nanopore may have the resolution required for nucleic acid sequencing applications.
Optical mapping and its potential for large-scale sequencing projects.

PubMed

Aston, C; Mishra, B; Schwartz, D C

1999-07-01

Physical mapping has been rediscovered as an important component of large-scale sequencing projects. Restriction maps provide landmark sequences at defined intervals, and high-resolution restriction maps can be assembled from ensembles of single molecules by optical means. Such optical maps can be constructed from both large-insert clones and genomic DNA, and are used as a scaffold for accurately aligning sequence contigs generated by shotgun sequencing.
Single-molecule analysis of DNA cross-links using nanopore technology

NASA Astrophysics Data System (ADS)

Wolna, Anna H.

The alpha-hemolysin (alpha-HL) protein ion channel is a potential next-generation sequencing platform that has been extensively used to study nucleic acids at a single-molecule level. After applying a potential across a lipid bilayer, the imbedded alpha-HL allows monitoring of the duration and current levels of DNA translocation and immobilization. Because this method does not require DNA amplification prior to sequencing, all the DNA damage present in the cell at any given time will be present during the sequencing experiment. The goal of this research is to determine if these damage sites give distinguishable current levels beyond those observed for the canonical nucleobases. Because DNA cross-links are one of the most prevalent types of DNA damage occurring in vivo, the blockage current levels were determined for thymine-dimers, guanine(C8)-thymine(N3) cross-links and platinum adducts. All of these cross-links give a different blockage current level compared to the undamaged strands when immobilized in the ion channel, and they all can easily translocate across the alpha-HL channel. Additionally, the alpha-HL nanopore technique presents a unique opportunity to study the effects of DNA cross-links, such as thymine-dimers, on the secondary structure of DNA G-quadruplexes folded from the human telomere sequence. Using this single-molecule nanopore technique we can detect subtle structural differences that cannot be easily addressed using conventional methods. The human telomere plays crucial roles in maintaining genome stability. In the presence of suitable cations, the repetitive 5'-TTAGGG human telomere sequence can fold into G-quadruplexes that adopt the hybrid fold in vivo. The telomere sequence is hypersensitive to UV-induced thymine-dimer (T=T) formation, and yet the presence of thymine dimers does not cause telomere shortening. The potential structural disruption and thermodynamic stability of the T=T-containing natural telomere sequences were studied to understand how this damage is tolerated in telomeric DNA. The alpha-HL experiments determined that T=Ts disrupt double-chain reversal loop formation but are well tolerated in edgewise and diagonal loops of the hybrid G-quadruplexes. These studies demonstrated the power of the alpha-HL ion channel to analyze DNA modifications and secondary structures at a single-molecule level.
Detection and quantitation of single nucleotide polymorphisms, DNA sequence variations, DNA mutations, DNA damage and DNA mismatches

DOEpatents

McCutchen-Maloney, Sandra L.

2002-01-01

DNA mutation binding proteins alone and as chimeric proteins with nucleases are used with solid supports to detect DNA sequence variations, DNA mutations and single nucleotide polymorphisms. The solid supports may be flow cytometry beads, DNA chips, glass slides or DNA dips sticks. DNA molecules are coupled to solid supports to form DNA-support complexes. Labeled DNA is used with unlabeled DNA mutation binding proteins such at TthMutS to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by binding which gives an increase in signal. Unlabeled DNA is utilized with labeled chimeras to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by nuclease activity of the chimera which gives a decrease in signal.
Single helically folded aromatic oligoamides that mimic the charge surface of double-stranded B-DNA

NASA Astrophysics Data System (ADS)

Ziach, Krzysztof; Chollet, Céline; Parissi, Vincent; Prabhakaran, Panchami; Marchivie, Mathieu; Corvaglia, Valentina; Bose, Partha Pratim; Laxmi-Reddy, Katta; Godde, Frédéric; Schmitter, Jean-Marie; Chaignepain, Stéphane; Pourquier, Philippe; Huc, Ivan

2018-05-01

Numerous essential biomolecular processes require the recognition of DNA surface features by proteins. Molecules mimicking these features could potentially act as decoys and interfere with pharmacologically or therapeutically relevant protein-DNA interactions. Although naturally occurring DNA-mimicking proteins have been described, synthetic tunable molecules that mimic the charge surface of double-stranded DNA are not known. Here, we report the design, synthesis and structural characterization of aromatic oligoamides that fold into single helical conformations and display a double helical array of negatively charged residues in positions that match the phosphate moieties in B-DNA. These molecules were able to inhibit several enzymes possessing non-sequence-selective DNA-binding properties, including topoisomerase 1 and HIV-1 integrase, presumably through specific foldamer-protein interactions, whereas sequence-selective enzymes were not inhibited. Such modular and synthetically accessible DNA mimics provide a versatile platform to design novel inhibitors of protein-DNA interactions.
Single-Stranded Condensation Stochastically Blocks G-Quadruplex Assembly in Human Telomeric RNA.

PubMed

Gutiérrez, Irene; Garavís, Miguel; de Lorenzo, Sara; Villasante, Alfredo; González, Carlos; Arias-Gonzalez, J Ricardo

2018-05-17

TERRA is an RNA molecule transcribed from human subtelomeric regions toward chromosome ends potentially involved in regulation of heterochromatin stability, semiconservative replication, and telomerase inhibition, among others. TERRA contains tandem repeats of the sequence GGGUUA, with a strong tendency to fold into a four-stranded arrangement known as a parallel G-quadruplex. Here, we demonstrate by using single-molecule force spectroscopy that this potential is limited by the inherent capacity of RNA to self-associate randomly and further condense into entropically more favorable structures. We stretched RNA constructions with more than four and less than eight hexanucleotide repeats, thus unable to form several G-quadruplexes in tandem, flanked by non-G-rich overhangs of random sequence by optical tweezers on a one by one basis. We found that condensed RNA stochastically blocks G-quadruplex folding pathways with a near 20% probability, a behavior that is not found in DNA analogous molecules.
De novo assembly and phasing of a Korean human genome.

PubMed

Seo, Jeong-Sun; Rhie, Arang; Kim, Junsoo; Lee, Sangjin; Sohn, Min-Hwan; Kim, Chang-Uk; Hastie, Alex; Cao, Han; Yun, Ji-Young; Kim, Jihye; Kuk, Junho; Park, Gun Hwa; Kim, Juhyeok; Ryu, Hanna; Kim, Jongbum; Roh, Mira; Baek, Jeonghun; Hunkapiller, Michael W; Korlach, Jonas; Shin, Jong-Yeon; Kim, Changhoon

2016-10-13

Advances in genome assembly and phasing provide an opportunity to investigate the diploid architecture of the human genome and reveal the full range of structural variation across population groups. Here we report the de novo assembly and haplotype phasing of the Korean individual AK1 (ref. 1) using single-molecule real-time sequencing, next-generation mapping, microfluidics-based linked reads, and bacterial artificial chromosome (BAC) sequencing approaches. Single-molecule sequencing coupled with next-generation mapping generated a highly contiguous assembly, with a contig N50 size of 17.9 Mb and a scaffold N50 size of 44.8 Mb, resolving 8 chromosomal arms into single scaffolds. The de novo assembly, along with local assemblies and spanning long reads, closes 105 and extends into 72 out of 190 euchromatic gaps in the reference genome, adding 1.03 Mb of previously intractable sequence. High concordance between the assembly and paired-end sequences from 62,758 BAC clones provides strong support for the robustness of the assembly. We identify 18,210 structural variants by direct comparison of the assembly with the human reference, identifying thousands of breakpoints that, to our knowledge, have not been reported before. Many of the insertions are reflected in the transcriptome and are shared across the Asian population. We performed haplotype phasing of the assembly with short reads, long reads and linked reads from whole-genome sequencing and with short reads from 31,719 BAC clones, thereby achieving phased blocks with an N50 size of 11.6 Mb. Haplotigs assembled from single-molecule real-time reads assigned to haplotypes on phased blocks covered 89% of genes. The haplotigs accurately characterized the hypervariable major histocompatability complex region as well as demonstrating allele configuration in clinically relevant genes such as CYP2D6. This work presents the most contiguous diploid human genome assembly so far, with extensive investigation of unreported and Asian-specific structural variants, and high-quality haplotyping of clinically relevant alleles for precision medicine.
Recognition Tunneling

PubMed Central

Lindsay, Stuart; He, Jin; Sankey, Otto; Hapala, Prokop; Jelinek, Pavel; Zhang, Peiming; Chang, Shuai; Huang, Shuo

2010-01-01

Single molecules in a tunnel junction can now be interrogated reliably using chemically-functionalized electrodes. Monitoring stochastic bonding fluctuations between a ligand bound to one electrode and its target bound to a second electrode (“tethered molecule-pair” configuration) gives insight into the nature of the intermolecular bonding at a single molecule-pair level, and defines the requirements for reproducible tunneling data. Simulations show that there is an instability in the tunnel gap at large currents, and this results in a multiplicity of contacts with a corresponding spread in the measured currents. At small currents (i.e. large gaps) the gap is stable, and functionalizing a pair of electrodes with recognition reagents (the “free analyte” configuration) can generate a distinct tunneling signal when an analyte molecule is trapped in the gap. This opens up a new interface between chemistry and electronics with immediate implications for rapid sequencing of single DNA molecules. PMID:20522930

SMRT sequencing of the Vitis vinifera cv. ‘Flame seedless’ genome using a SMRTbell-free library preparation from Swift Biosciences

USDA-ARS?s Scientific Manuscript database

Single Molecule Real-Time (SMRT) sequencing provides advantages to the sequencing of complex genomes. The long reads generated are superior for resolving complex genomic regions and provide highly contiguous de novo assemblies. Current SMRTbell libraries generate average read lengths of 10-15kb. How...
Complete Genome Sequence of the Probiotic Strain Lactobacillus salivarius LPM01.

PubMed

Chenoll, Empar; Codoñer, Francisco M; Martinez-Blanch, Juan F; Acevedo-Piérart, Marcelo; Ormeño, M Loreto; Ramón, Daniel; Genovés, Salvador

2016-11-23

Lactobacillus salivarius LPM01 (DSM 22150) is a probiotic strain able to improve health status in immunocompromised people. Here, we report its complete genome sequence deciphered by PacBio single-molecule real-time (SMRT) technology. Analysis of the sequence may provide insights into its functional activity and safety assessment. Copyright © 2016 Chenoll et al.
The structure of cell adhesion molecule uvomorulin. Insights into the molecular mechanism of Ca2+-dependent cell adhesion.

PubMed Central

Ringwald, M; Schuh, R; Vestweber, D; Eistetter, H; Lottspeich, F; Engel, J; Dölz, R; Jähnig, F; Epplen, J; Mayer, S

1987-01-01

We have determined the amino acid sequence of the Ca2+-dependent cell adhesion molecule uvomorulin as it appears on the cell surface. The extracellular part of the molecule exhibits three internally repeated domains of 112 residues which are most likely generated by gene duplication. Each of the repeated domains contains two highly conserved units which could represent putative Ca2+-binding sites. Secondary structure predictions suggest that the putative Ca2+-binding units are located in external loops at the surface of the protein. The protein sequence exhibits a single membrane-spanning region and a cytoplasmic domain. Sequence comparison reveals extensive homology to the chicken L-CAM. Both uvomorulin and L-CAM are identical in 65% of their entire amino acid sequence suggesting a common origin for both CAMs. Images Fig. 1. Fig. 4. Fig. 7. PMID:3501370
A survey of the sorghum transcriptome using single-molecule long reads

DOE PAGES

Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; ...

2016-06-24

Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novelmore » splice isoforms. Additionally, we uncover APA ofB11,000 expressed genes and more than 2,100 novel genes. Lastly, these results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism.« less
A survey of the sorghum transcriptome using single-molecule long reads

PubMed Central

Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; Ngam, Peter; Devitt, Nicholas; Schilkey, Faye; Ben-Hur, Asa; Reddy, Anireddy S. N.

2016-01-01

Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novel splice isoforms. Additionally, we uncover APA of ∼11,000 expressed genes and more than 2,100 novel genes. These results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism. PMID:27339290
Assessing the performance of the Oxford Nanopore Technologies MinION

PubMed Central

Laver, T.; Harrison, J.; O’Neill, P.A.; Moore, K.; Farbos, A.; Paszkiewicz, K.; Studholme, D.J.

2015-01-01

The Oxford Nanopore Technologies (ONT) MinION is a new sequencing technology that potentially offers read lengths of tens of kilobases (kb) limited only by the length of DNA molecules presented to it. The device has a low capital cost, is by far the most portable DNA sequencer available, and can produce data in real-time. It has numerous prospective applications including improving genome sequence assemblies and resolution of repeat-rich regions. Before such a technology is widely adopted, it is important to assess its performance and limitations in respect of throughput and accuracy. In this study we assessed the performance of the MinION by re-sequencing three bacterial genomes, with very different nucleotide compositions ranging from 28.6% to 70.7%; the high G + C strain was underrepresented in the sequencing reads. We estimate the error rate of the MinION (after base calling) to be 38.2%. Mean and median read lengths were 2 kb and 1 kb respectively, while the longest single read was 98 kb. The whole length of a 5 kb rRNA operon was covered by a single read. As the first nanopore-based single molecule sequencer available to researchers, the MinION is an exciting prospect; however, the current error rate limits its ability to compete with existing sequencing technologies, though we do show that MinION sequence reads can enhance contiguity of de novo assembly when used in conjunction with Illumina MiSeq data. PMID:26753127
Single-molecule nanopore enzymology

PubMed Central

Wloka, Carsten; Maglia, Giovanni

2017-01-01

Biological nanopores are a class of membrane proteins that open nanoscale water-conduits in biological membranes. When they are reconstituted in artificial membranes and a bias voltage is applied across the membrane, the ionic current passing through individual nanopores can be used to monitor chemical reactions, to recognize individual molecules and, of most interest, to sequence DNA. More recently, proteins and enzymes have started being analysed with nanopores. Monitoring enzymatic reactions with nanopores, i.e. nanopore enzymology, has the unique advantage that it allows long-timescale observations of native proteins at the single-molecule level. Here we describe the approaches and challenges in nanopore enzymology. PMID:28630164
Identification of Microbial Profile of Koji Using Single Molecule, Real-Time Sequencing Technology.

PubMed

Hui, Wenyan; Hou, Qiangchuan; Cao, Chenxia; Xu, Haiyan; Zhen, Yi; Kwok, Lai-Yu; Sun, Tiansong; Zhang, Heping; Zhang, Wenyi

2017-05-01

Koji is a kind of Japanese traditional fermented starter that has been used for centuries. Many fermented foods are made from koji, such as sake, miso, and soy sauce. This study used the single molecule real-time sequencing technology (SMRT) to investigate the bacterial and fungal microbiota of 3 Japanese koji samples. After SMRT analysis, a total of 39121 high-quality sequences were generated, including 14354 bacterial and 24767 fungal sequence reads. The high-quality gene sequences were assigned to 5 bacterial and 2 fungal plyla, dominated by Proteobacteria and Ascomycota, respectively. At the genus level, Ochrobactrum and Wickerhamomyces were the most abundant bacterial and fungal genera, respectively. The predominant bacterial and fungal species were Ochrobactrum lupini and Wickerhamomyces anomalus, respectively. Our study profiled the microbiota composition of 3 Japanese koji samples to the species level precision. The results may be useful for further development of traditional fermented products, especially optimization of koji preparation. Meanwhile, this study has demonstrated that SMRT is a robust tool for analyzing the microbial composition in food samples. © 2017 Institute of Food Technologists®.
DNA origami-based shape IDs for single-molecule nanomechanical genotyping

NASA Astrophysics Data System (ADS)

Zhang, Honglu; Chao, Jie; Pan, Dun; Liu, Huajie; Qiang, Yu; Liu, Ke; Cui, Chengjun; Chen, Jianhua; Huang, Qing; Hu, Jun; Wang, Lianhui; Huang, Wei; Shi, Yongyong; Fan, Chunhai

2017-04-01

Variations on DNA sequences profoundly affect how we develop diseases and respond to pathogens and drugs. Atomic force microscopy (AFM) provides a nanomechanical imaging approach for genetic analysis with nanometre resolution. However, unlike fluorescence imaging that has wavelength-specific fluorophores, the lack of shape-specific labels largely hampers widespread applications of AFM imaging. Here we report the development of a set of differentially shaped, highly hybridizable self-assembled DNA origami nanostructures serving as shape IDs for magnified nanomechanical imaging of single-nucleotide polymorphisms. Using these origami shape IDs, we directly genotype single molecules of human genomic DNA with an ultrahigh resolution of ~10 nm and the multiplexing ability. Further, we determine three types of disease-associated, long-range haplotypes in samples from the Han Chinese population. Single-molecule analysis allows robust haplotyping even for samples with low labelling efficiency. We expect this generic shape ID-based nanomechanical approach to hold great potential in genetic analysis at the single-molecule level.
DNA origami-based shape IDs for single-molecule nanomechanical genotyping

PubMed Central

Zhang, Honglu; Chao, Jie; Pan, Dun; Liu, Huajie; Qiang, Yu; Liu, Ke; Cui, Chengjun; Chen, Jianhua; Huang, Qing; Hu, Jun; Wang, Lianhui; Huang, Wei; Shi, Yongyong; Fan, Chunhai

2017-01-01

Variations on DNA sequences profoundly affect how we develop diseases and respond to pathogens and drugs. Atomic force microscopy (AFM) provides a nanomechanical imaging approach for genetic analysis with nanometre resolution. However, unlike fluorescence imaging that has wavelength-specific fluorophores, the lack of shape-specific labels largely hampers widespread applications of AFM imaging. Here we report the development of a set of differentially shaped, highly hybridizable self-assembled DNA origami nanostructures serving as shape IDs for magnified nanomechanical imaging of single-nucleotide polymorphisms. Using these origami shape IDs, we directly genotype single molecules of human genomic DNA with an ultrahigh resolution of ∼10 nm and the multiplexing ability. Further, we determine three types of disease-associated, long-range haplotypes in samples from the Han Chinese population. Single-molecule analysis allows robust haplotyping even for samples with low labelling efficiency. We expect this generic shape ID-based nanomechanical approach to hold great potential in genetic analysis at the single-molecule level. PMID:28382928
Development of Single-Molecule DNA Sequencing Platform Based on Single-Molecule Electrical Conductance

DTIC Science & Technology

2015-05-25

nanoparticles , Nature Nanotechnology 7, 197-203. 11. Dreaden, E. C., Alkilany, A. M., Huang, X. H., Murphy, C. J., and El-Sayed, M. A. (2012) The...13840-13851. 14. Llevot, A., and Astruc, D. (2012) Applications of vectorized gold nanoparticles to the diagnosis and therapy of cancer , Chem. Soc. Rev...caused by the injection of gold nanoparticles , Nanotechnology 21, 485102. 25. Dykman, L. A., Matora, L. Y., and Bogatyrev, V. A. (1996) Use of
Improved maize reference genome with single-molecule technologies.

PubMed

Jiao, Yinping; Peluso, Paul; Shi, Jinghua; Liang, Tiffany; Stitzer, Michelle C; Wang, Bo; Campbell, Michael S; Stein, Joshua C; Wei, Xuehong; Chin, Chen-Shan; Guill, Katherine; Regulski, Michael; Kumari, Sunita; Olson, Andrew; Gent, Jonathan; Schneider, Kevin L; Wolfgruber, Thomas K; May, Michael R; Springer, Nathan M; Antoniou, Eric; McCombie, W Richard; Presting, Gernot G; McMullen, Michael; Ross-Ibarra, Jeffrey; Dawe, R Kelly; Hastie, Alex; Rank, David R; Ware, Doreen

2017-06-22

Complete and accurate reference genomes and annotations provide fundamental tools for characterization of genetic and functional variation. These resources facilitate the determination of biological processes and support translation of research findings into improved and sustainable agricultural technologies. Many reference genomes for crop plants have been generated over the past decade, but these genomes are often fragmented and missing complex repeat regions. Here we report the assembly and annotation of a reference genome of maize, a genetic and agricultural model species, using single-molecule real-time sequencing and high-resolution optical mapping. Relative to the previous reference genome, our assembly features a 52-fold increase in contig length and notable improvements in the assembly of intergenic spaces and centromeres. Characterization of the repetitive portion of the genome revealed more than 130,000 intact transposable elements, allowing us to identify transposable element lineage expansions that are unique to maize. Gene annotations were updated using 111,000 full-length transcripts obtained by single-molecule real-time sequencing. In addition, comparative optical mapping of two other inbred maize lines revealed a prevalence of deletions in regions of low gene density and maize lineage-specific genes.
Correlation dynamics and enhanced signals for the identification of serial biomolecules and DNA bases.

PubMed

Ahmed, Towfiq; Haraldsen, Jason T; Rehr, John J; Di Ventra, Massimiliano; Schuller, Ivan; Balatsky, Alexander V

2014-03-28

Nanopore-based sequencing has demonstrated a significant potential for the development of fast, accurate, and cost-efficient fingerprinting techniques for next generation molecular detection and sequencing. We propose a specific multilayered graphene-based nanopore device architecture for the recognition of single biomolecules. Molecular detection and analysis can be accomplished through the detection of transverse currents as the molecule or DNA base translocates through the nanopore. To increase the overall signal-to-noise ratio and the accuracy, we implement a new 'multi-point cross-correlation' technique for identification of DNA bases or other molecules on the single molecular level. We demonstrate that the cross-correlations between each nanopore will greatly enhance the transverse current signal for each molecule. We implement first-principles transport calculations for DNA bases surveyed across a multilayered graphene nanopore system to illustrate the advantages of the proposed geometry. A time-series analysis of the cross-correlation functions illustrates the potential of this method for enhancing the signal-to-noise ratio. This work constitutes a significant step forward in facilitating fingerprinting of single biomolecules using solid state technology.
Rapid method to detect duplex formation in sequencing by hybridization methods

DOEpatents

Mirzabekov, A.D.; Timofeev, E.N.; Florentiev, V.L.; Kirillov, E.V.

1999-01-19

A method for determining the existence of duplexes of oligonucleotide complementary molecules is provided. A plurality of immobilized oligonucleotide molecules, each of a specific length and each having a specific base sequence, is contacted with complementary, single stranded oligonucleotide molecules to form a duplex. Each duplex facilitates intercalation of a fluorescent dye between the base planes of the duplex. The invention also provides for a method for constructing oligonucleotide matrices comprising confining light sensitive fluid to a surface and exposing the light-sensitive fluid to a light pattern. This causes the fluid exposed to the light to coalesce into discrete units and adhere to the surface. This places each of the units in contact with a set of different oligonucleotide molecules so as to allow the molecules to disperse into the units. 13 figs.
Rapid method to detect duplex formation in sequencing by hybridization methods

DOEpatents

Mirzabekov, Andrei Darievich; Timofeev, Edward Nikolaevich; Florentiev, Vladimer Leonidovich; Kirillov, Eugene Vladislavovich

1999-01-01

A method for determining the existence of duplexes of oligonucleotide complementary molecules is provided whereby a plurality of immobilized oligonucleotide molecules, each of a specific length and each having a specific base sequence, is contacted with complementary, single stranded oligonucleotide molecules to form a duplex so as to facilitate intercalation of a fluorescent dye between the base planes of the duplex. The invention also provides for a method for constructing oligonucleotide matrices comprising confining light sensitive fluid to a surface, exposing said light-sensitive fluid to a light pattern so as to cause the fluid exposed to the light to coalesce into discrete units and adhere to the surface; and contacting each of the units with a set of different oligonucleotide molecules so as to allow the molecules to disperse into the units.
Complete Genome Sequence of Lactobacillus rhamnosus Strain BPL5 (CECT 8800), a Probiotic for Treatment of Bacterial Vaginosis.

PubMed

Chenoll, Empar; Codoñer, Francisco M; Martinez-Blanch, Juan F; Ramón, Daniel; Genovés, Salvador; Menabrito, Marco

2016-04-21

ITALIC! Lactobacillus rhamnosusBPL5 (CECT 8800), is a probiotic strain suitable for the treatment of bacterial vaginosis. Here, we report its complete genome sequence deciphered by PacBio single-molecule real-time (SMRT) technology. Analysis of the sequence may provide insight into its functional activity. Copyright © 2016 Chenoll et al.
A reference bacterial genome dataset generated on the MinION™ portable single-molecule nanopore sequencer.

PubMed

Quick, Joshua; Quinlan, Aaron R; Loman, Nicholas J

2014-01-01

The MinION™ is a new, portable single-molecule sequencer developed by Oxford Nanopore Technologies. It measures four inches in length and is powered from the USB 3.0 port of a laptop computer. The MinION™ measures the change in current resulting from DNA strands interacting with a charged protein nanopore. These measurements can then be used to deduce the underlying nucleotide sequence. We present a read dataset from whole-genome shotgun sequencing of the model organism Escherichia coli K-12 substr. MG1655 generated on a MinION™ device during the early-access MinION™ Access Program (MAP). Sequencing runs of the MinION™ are presented, one generated using R7 chemistry (released in July 2014) and one using R7.3 (released in September 2014). Base-called sequence data are provided to demonstrate the nature of data produced by the MinION™ platform and to encourage the development of customised methods for alignment, consensus and variant calling, de novo assembly and scaffolding. FAST5 files containing event data within the HDF5 container format are provided to assist with the development of improved base-calling methods.
PHYSICAL MODEL FOR RECOGNITION TUNNELING

PubMed Central

Krstić, Predrag; Ashcroft, Brian; Lindsay, Stuart

2015-01-01

Recognition tunneling (RT) identifies target molecules trapped between tunneling electrodes functionalized with recognition molecules that serve as specific chemical linkages between the metal electrodes and the trapped target molecule. Possible applications include single molecule DNA and protein sequencing. This paper addresses several fundamental aspects of RT by multiscale theory, applying both all-atom and coarse-grained DNA models: (1) We show that the magnitude of the observed currents are consistent with the results of non-equilibrium Green's function calculations carried out on a solvated all-atom model. (2) Brownian fluctuations in hydrogen bond-lengths lead to current spikes that are similar to what is observed experimentally. (3) The frequency characteristics of these fluctuations can be used to identify the trapped molecules with a machine-learning algorithm, giving a theoretical underpinning to this new method of identifying single molecule signals. PMID:25650375
Hydrogel Droplet Microfluidics for High-Throughput Single Molecule/Cell Analysis.

PubMed

Zhu, Zhi; Yang, Chaoyong James

2017-01-17

Heterogeneity among individual molecules and cells has posed significant challenges to traditional bulk assays, due to the assumption of average behavior, which would lose important biological information in heterogeneity and result in a misleading interpretation. Single molecule/cell analysis has become an important and emerging field in biological and biomedical research for insights into heterogeneity between large populations at high resolution. Compared with the ensemble bulk method, single molecule/cell analysis explores the information on time trajectories, conformational states, and interactions of individual molecules/cells, all key factors in the study of chemical and biological reaction pathways. Various powerful techniques have been developed for single molecule/cell analysis, including flow cytometry, atomic force microscopy, optical and magnetic tweezers, single-molecule fluorescence spectroscopy, and so forth. However, some of them have the low-throughput issue that has to analyze single molecules/cells one by one. Flow cytometry is a widely used high-throughput technique for single cell analysis but lacks the ability for intercellular interaction study and local environment control. Droplet microfluidics becomes attractive for single molecule/cell manipulation because single molecules/cells can be individually encased in monodisperse microdroplets, allowing high-throughput analysis and manipulation with precise control of the local environment. Moreover, hydrogels, cross-linked polymer networks that swell in the presence of water, have been introduced into droplet microfluidic systems as hydrogel droplet microfluidics. By replacing an aqueous phase with a monomer or polymer solution, hydrogel droplets can be generated on microfluidic chips for encapsulation of single molecules/cells according to the Poisson distribution. The sol-gel transition property endows the hydrogel droplets with new functionalities and diversified applications in single molecule/cell analysis. The hydrogel can act as a 3D cell culture matrix to mimic the extracellular environment for long-term single cell culture, which allows further heterogeneity study in proliferation, drug screening, and metastasis at the single-cell level. The sol-gel transition allows reactions in solution to be performed rapidly and efficiently with product storage in the gel for flexible downstream manipulation and analysis. More importantly, controllable sol-gel regulation provides a new way to maintain phenotype-genotype linkages in the hydrogel matrix for high throughput molecular evolution. In this Account, we will review the hydrogel droplet generation on microfluidics, single molecule/cell encapsulation in hydrogel droplets, as well as the progress made by our group and others in the application of hydrogel droplet microfluidics for single molecule/cell analysis, including single cell culture, single molecule/cell detection, single cell sequencing, and molecular evolution.
Single molecule and single cell epigenomics.

PubMed

Hyun, Byung-Ryool; McElwee, John L; Soloway, Paul D

2015-01-15

Dynamically regulated changes in chromatin states are vital for normal development and can produce disease when they go awry. Accordingly, much effort has been devoted to characterizing these states under normal and pathological conditions. Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is the most widely used method to characterize where in the genome transcription factors, modified histones, modified nucleotides and chromatin binding proteins are found; bisulfite sequencing (BS-seq) and its variants are commonly used to characterize the locations of DNA modifications. Though very powerful, these methods are not without limitations. Notably, they are best at characterizing one chromatin feature at a time, yet chromatin features arise and function in combination. Investigators commonly superimpose separate ChIP-seq or BS-seq datasets, and then infer where chromatin features are found together. While these inferences might be correct, they can be misleading when the chromatin source has distinct cell types, or when a given cell type exhibits any cell to cell variation in chromatin state. These ambiguities can be eliminated by robust methods that directly characterize the existence and genomic locations of combinations of chromatin features in very small inputs of cells or ideally, single cells. Here we review single molecule epigenomic methods under development to overcome these limitations, the technical challenges associated with single molecule methods and their potential application to single cells. Copyright © 2014 Elsevier Inc. All rights reserved.

Single Molecule and Single Cell Epigenomics

PubMed Central

Hyun, Byung-Ryool; McElwee, John L.; Soloway, Paul D.

2014-01-01

Dynamically regulated changes in chromatin states are vital for normal development and can produce disease when they go awry. Accordingly, much effort has been devoted to characterizing these states under normal and pathological conditions. Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is the most widely used method to characterize where in the genome transcription factors, modified histones, modified nucleotides and chromatin binding proteins are found; bisulfite sequencing (BS-seq) and its variants are commonly used to characterize the locations of DNA modifications. Though very powerful, these methods are not without limitations. Notably, they are best at characterizing one chromatin feature at a time, yet chromatin features arise and function in combination. Investigators commonly superimpose separate ChIP-seq or BS-seq datasets, and then infer where chromatin features are found together. While these inferences might be correct, they can be misleading when the chromatin source has distinct cell types, or when a given cell type exhibits any cell to cell variation in chromatin state. These ambiguities can be eliminated by robust methods that directly characterize the existence and genomic locations of combinations of chromatin features in very small inputs of cells or ideally, single cells. Here we review single molecule epigenomic methods under development to overcome these limitations, the technical challenges associated with single molecule methods and their potential application to single cells. PMID:25204781
A new chicken genome assembly provides insight into avian genome structure

USDA-ARS?s Scientific Manuscript database

The importance of the Gallus gallus (chicken) as a model organism and agricultural animal merits a continuation of sequence assembly improvement efforts. We present a new version of the chicken genome assembly (Gallus_gallus-5.0; GCA_000002315.3) built from combined long single molecule sequencing t...
Additional annotation of the pig transcriptome using integrated Iso-seq and Illumina RNA-seq analysis

USDA-ARS?s Scientific Manuscript database

Alternative splicing is a well-known phenomenon that dramatically increases eukaryotic transcriptome diversity. The extent of mRNA isoform diversity among porcine tissues was assessed using Pacific Biosciences single-molecule long-read isoform sequencing (Iso-Seq) and Illumina short read sequencing ...
Joint analysis of bacterial DNA methylation, predicted promoter and regulation motifs for biological significance

USDA-ARS?s Scientific Manuscript database

Advances in long-read, single molecule real-time sequencing technology and analysis software over the last two years has enabled the efficient production of closed bacterial genome sequences. However, consistent annotation of these genomes has lagged behind the ability to create them, while the avai...
Single-molecule Protein Unfolding in Solid State Nanopores

PubMed Central

Talaga, David S.; Li, Jiali

2009-01-01

We use single silicon nitride nanopores to study folded, partially folded and unfolded single proteins by measuring their excluded volumes. The DNA-calibrated translocation signals of β-lactoglobulin and histidine-containing phosphocarrier protein match quantitatively with that predicted by a simple sum of the partial volumes of the amino acids in the polypeptide segment inside the pore when translocation stalls due to the primary charge sequence. Our analysis suggests that the majority of the protein molecules were linear or looped during translocation and that the electrical forces present under physiologically relevant potentials can unfold proteins. Our results show that the nanopore translocation signals are sensitive enough to distinguish the folding state of a protein and distinguish between proteins based on the excluded volume of a local segment of the polypeptide chain that transiently stalls in the nanopore due to the primary sequence of charges. PMID:19530678
Nanopores: A journey towards DNA sequencing

PubMed Central

Wanunu, Meni

2013-01-01

Much more than ever, nucleic acids are recognized as key building blocks in many of life's processes, and the science of studying these molecular wonders at the single-molecule level is thriving. A new method of doing so has been introduced in the mid 1990's. This method is exceedingly simple: a nanoscale pore that spans across an impermeable thin membrane is placed between two chambers that contain an electrolyte, and voltage is applied across the membrane using two electrodes. These conditions lead to a steady stream of ion flow across the pore. Nucleic acid molecules in solution can be driven through the pore, and structural features of the biomolecules are observed as measurable changes in the trans-membrane ion current. In essence, a nanopore is a high-throughput ion microscope and a single-molecule force apparatus. Nanopores are taking center stage as a tool that promises to read a DNA sequence, and this promise has resulted in overwhelming academic, industrial, and national interest. Regardless of the fate of future nanopore applications, in the process of this 16-year-long exploration, many studies have validated the indispensability of nanopores in the toolkit of single-molecule biophysics. This review surveys past and current studies related to nucleic acid biophysics, and will hopefully provoke a discussion of immediate and future prospects for the field. PMID:22658507
Identification of Biomolecular Building Blocks by Recognition Tunneling: Stride towards Nanopore Sequencing of Biomolecules

NASA Astrophysics Data System (ADS)

Sen, Suman

DNA, RNA and Protein are three pivotal biomolecules in human and other organisms, playing decisive roles in functionality, appearance, diseases development and other physiological phenomena. Hence, sequencing of these biomolecules acquires the prime interest in the scientific community. Single molecular identification of their building blocks can be done by a technique called Recognition Tunneling (RT) based on Scanning Tunneling Microscope (STM). A single layer of specially designed recognition molecule is attached to the STM electrodes, which trap the targeted molecules (DNA nucleoside monophosphates, RNA nucleoside monophosphates or amino acids) inside the STM nanogap. Depending on their different binding interactions with the recognition molecules, the analyte molecules generate stochastic signal trains accommodating their "electronic fingerprints". Signal features are used to detect the molecules using a machine learning algorithm and different molecules can be identified with significantly high accuracy. This, in turn, paves the way for rapid, economical nanopore sequencing platform, overcoming the drawbacks of Next Generation Sequencing (NGS) techniques. To read DNA nucleotides with high accuracy in an STM tunnel junction a series of nitrogen-based heterocycles were designed and examined to check their capabilities to interact with naturally occurring DNA nucleotides by hydrogen bonding in the tunnel junction. These recognition molecules are Benzimidazole, Imidazole, Triazole and Pyrrole. Benzimidazole proved to be best among them showing DNA nucleotide classification accuracy close to 99%. Also, Imidazole reader can read an abasic monophosphate (AP), a product from depurination or depyrimidination that occurs 10,000 times per human cell per day. In another study, I have investigated a new universal reader, 1-(2-mercaptoethyl)pyrene (Pyrene reader) based on stacking interactions, which should be more specific to the canonical DNA nucleosides. In addition, Pyrene reader showed higher DNA base-calling accuracy compare to Imidazole reader, the workhorse in our previous projects. In my other projects, various amino acids and RNA nucleoside monophosphates were also classified with significantly high accuracy using RT. Twenty naturally occurring amino acids and various RNA nucleosides (four canonical and two modified) were successfully identified. Thus, we envision nanopore sequencing biomolecules using Recognition Tunneling (RT) that should provide comprehensive betterment over current technologies in terms of time, chemical and instrumental cost and capability of de novo sequencing.
Coherent (photon) vs incoherent (current) detection of multidimensional optical signals from single molecules in open junctions

DOE Office of Scientific and Technical Information (OSTI.GOV)

Agarwalla, Bijay Kumar; Hua, Weijie; Zhang, Yu

2015-06-07

The nonlinear optical response of a current-carrying single molecule coupled to two metal leads and driven by a sequence of impulsive optical pulses with controllable phases and time delays is calculated. Coherent (stimulated, heterodyne) detection of photons and incoherent detection of the optically induced current are compared. Using a diagrammatic Liouville space superoperator formalism, the signals are recast in terms of molecular correlation functions which are then expanded in the many-body molecular states. Two dimensional signals in benzene-1,4-dithiol molecule show cross peaks involving charged states. The correlation between optical and charge current signal is also observed.
Universal digital high-resolution melt: a novel approach to broad-based profiling of heterogeneous biological samples.

PubMed

Fraley, Stephanie I; Hardick, Justin; Masek, Billie J; Jo Masek, Billie; Athamanolap, Pornpat; Rothman, Richard E; Gaydos, Charlotte A; Carroll, Karen C; Wakefield, Teresa; Wang, Tza-Huei; Yang, Samuel

2013-10-01

Comprehensive profiling of nucleic acids in genetically heterogeneous samples is important for clinical and basic research applications. Universal digital high-resolution melt (U-dHRM) is a new approach to broad-based PCR diagnostics and profiling technologies that can overcome issues of poor sensitivity due to contaminating nucleic acids and poor specificity due to primer or probe hybridization inaccuracies for single nucleotide variations. The U-dHRM approach uses broad-based primers or ligated adapter sequences to universally amplify all nucleic acid molecules in a heterogeneous sample, which have been partitioned, as in digital PCR. Extensive assay optimization enables direct sequence identification by algorithm-based matching of melt curve shape and Tm to a database of known sequence-specific melt curves. We show that single-molecule detection and single nucleotide sensitivity is possible. The feasibility and utility of U-dHRM is demonstrated through detection of bacteria associated with polymicrobial blood infection and microRNAs (miRNAs) associated with host response to infection. U-dHRM using broad-based 16S rRNA gene primers demonstrates universal single cell detection of bacterial pathogens, even in the presence of larger amounts of contaminating bacteria; U-dHRM using universally adapted Lethal-7 miRNAs in a heterogeneous mixture showcases the single copy sensitivity and single nucleotide specificity of this approach.
A Single Amino Acid Substitution in the v-Eyk Intracellular Domain Results in Activation of Stat3 and Enhances Cellular Transformation

PubMed Central

Besser, Daniel; Bromberg, Jacqueline F.; Darnell, James E.; Hanafusa, Hidesaburo

1999-01-01

The receptor tyrosine kinase Eyk, a member of the Axl/Tyro3 subfamily, activates the STAT pathway and transforms cells when constitutively activated. Here, we compared the potentials of the intracellular domains of Eyk molecules derived from c-Eyk and v-Eyk to transform rat 3Y1 fibroblasts. The v-Eyk molecule induced higher numbers of transformants in soft agar and stronger activation of Stat3; levels of Stat1 activation by the two Eyk molecules were similar. A mutation in the sequence Y933VPL, present in c-Eyk, to the v-Eyk sequence Y933VPQ led to increased activation of Stat3 and increased transformation efficiency. However, altering another sequence, Y862VNT, present in both Eyk molecules to F862VNT markedly decreased transformation without impairing Stat3 activation. These results indicate that activation of Stat3 enhances transformation efficiency and cooperates with another pathway to induce transformation. PMID:9891073
Genome Sequence of Bacillus cereus Strain TG1-6, a Plant-Beneficial Rhizobacterium That Is Highly Salt Tolerant

PubMed Central

2018-01-01

ABSTRACT The complete genome sequence of Bacillus cereus strain TG1-6, which is a highly salt-tolerant rhizobacterium that enhances plant tolerance to drought stress, is reported here. The sequencing process was performed based on a combination of pyrosequencing and single-molecule sequencing. The complete genome is estimated to be approximately 5.42 Mb, containing a total of 5,610 predicted protein-coding DNA sequences (CDSs). PMID:29748401
Genome Sequence of Bacillus megaterium Strain YC4-R4, a Plant Growth-Promoting Rhizobacterium Isolated from a High-Salinity Environment.

PubMed

Vílchez, Juan Ignacio; Tang, Qiming; Kaushal, Richa; Wang, Wei; Lv, Suhui; He, Danxia; Chu, Zhaoqing; Zhang, Heng; Liu, Renyi; Zhang, Huiming

2018-06-21

Here, we report the complete genome sequence for Bacillus megaterium strain YC4-R4, a highly salt-tolerant rhizobacterium that promotes growth in plants. The sequencing process was performed by combining pyrosequencing and single-molecule sequencing techniques. The complete genome is estimated to be approximately 5.44 Mb, containing a total of 5,673 predicted protein-coding DNA sequences (CDSs). Copyright © 2018 Vílchez et al.
Rapid method to detect duplex formation in sequencing by hybridization methods, a method for constructing containment structures for reagent interaction

DOEpatents

Mirzabekov, Andrei Darievich; Yershov, Gennadiy Moiseyevich; Guschin, Dmitry Yuryevich; Gemmell, Margaret Anne; Shick, Valentine V.; Proudnikov, Dmitri Y.; Timofeev, Edward N.

2002-01-01

A method for determining the existence of duplexes of oligonucleotide complementary molecules is provided whereby a plurality of immobilized oligonucleotide molecules, each of a specific length and each having a specific base sequence, is contacted with complementary, single stranded oligonucleotide molecules to form a duplex so as to facilitate intercalation of a fluorescent dye between the base planes of the duplex. The invention also provides for a method for constructing oligonucleotide matrices comprising confining light sensitive fluid to a surface, exposing said light-sensitive fluid to a light pattern so as to cause the fluid exposed to the light to polymerize into discrete units and adhere to the surface; and contacting each of the units with a set of different oligonucleotide molecules so as to allow the molecules to disperse into the units.
Potentials of single-cell biology in identification and validation of disease biomarkers.

PubMed

Niu, Furong; Wang, Diane C; Lu, Jiapei; Wu, Wei; Wang, Xiangdong

2016-09-01

Single-cell biology is considered a new approach to identify and validate disease-specific biomarkers. However, the concern raised by clinicians is how to apply single-cell measurements for clinical practice, translate the message of single-cell systems biology into clinical phenotype or explain alterations of single-cell gene sequencing and function in patient response to therapies. This study is to address the importance and necessity of single-cell gene sequencing in the identification and development of disease-specific biomarkers, the definition and significance of single-cell biology and single-cell systems biology in the understanding of single-cell full picture, the development and establishment of whole-cell models in the validation of targeted biological function and the figure and meaning of single-molecule imaging in single cell to trace intra-single-cell molecule expression, signal, interaction and location. We headline the important role of single-cell biology in the discovery and development of disease-specific biomarkers with a special emphasis on understanding single-cell biological functions, e.g. mechanical phenotypes, single-cell biology, heterogeneity and organization of genome function. We have reason to believe that such multi-dimensional, multi-layer, multi-crossing and stereoscopic single-cell biology definitely benefits the discovery and development of disease-specific biomarkers. © 2016 The Authors. Journal of Cellular and Molecular Medicine published by John Wiley & Sons Ltd and Foundation for Cellular and Molecular Medicine.
Analysis of Multiallelic CNVs by Emulsion Haplotype Fusion PCR.

PubMed

Tyson, Jess; Armour, John A L

2017-01-01

Emulsion-fusion PCR recovers long-range sequence information by combining products in cis from individual genomic DNA molecules. Emulsion droplets act as very numerous small reaction chambers in which different PCR products from a single genomic DNA molecule are condensed into short joint products, to unite sequences in cis from widely separated genomic sites. These products can therefore provide information about the arrangement of sequences and variants at a larger scale than established long-read sequencing methods. The method has been useful in defining the phase of variants in haplotypes, the typing of inversions, and determining the configuration of sequence variants in multiallelic CNVs. In this description we outline the rationale for the application of emulsion-fusion PCR methods to the analysis of multiallelic CNVs, and give practical details for our own implementation of the method in that context.
Subangstrom Measurements of Enzyme Function Using a Biological Nanopore, SPRNT.

PubMed

Laszlo, A H; Derrrington, I M; Gundlach, J H

2017-01-01

Nanopores are emerging as new single-molecule tools in the study of enzymes. Based on the progress in nanopore sequencing of DNA, a tool called Single-molecule Picometer Resolution Nanopore Tweezers (SPRNT) was developed to measure the movement of enzymes along DNA in real time. In this new method, an enzyme is loaded onto a DNA (or RNA) molecule. A single-stranded DNA end of this complex is drawn into a nanopore by an electrostatic potential that is applied across the pore. The single-stranded DNA passes through the pore's constriction until the enzyme comes into contact with the pore. Further progression of the DNA through the pore is then controlled by the enzyme. An ion current that flows through the pore's constriction is modulated by the DNA in the constriction. Analysis of ion current changes reveals the advance of the DNA with high spatiotemporal precision, thereby providing a real-time record of the enzyme's activity. Using an engineered version of the protein nanopore MspA, SPRNT has spatial resolution as small as 40pm at millisecond timescales, while simultaneously providing the DNA's sequence within the enzyme. In this chapter, SPRNT is introduced and its extraordinary potential is exemplified using the helicase Hel308. Two distinct substates are observed for each one-nucleotide advance; one of these about half-nucleotide long steps is ATP dependent and the other is ATP independent. The spatiotemporal resolution of this low-cost single-molecule technique lifts the study of enzymes to a new level of precision, enabling exploration of hitherto unobservable enzyme dynamics in real time. © 2017 Elsevier Inc. All rights reserved.
Solid-State and Biological Nanopore for Real-Time Sensing of Single Chemical and Sequencing of DNA.

PubMed

Haque, Farzin; Li, Jinghong; Wu, Hai-Chen; Liang, Xing-Jie; Guo, Peixuan

2013-02-01

Sensitivity and specificity are two most important factors to take into account for molecule sensing, chemical detection and disease diagnosis. A perfect sensitivity is to reach the level where a single molecule can be detected. An ideal specificity is to reach the level where the substance can be detected in the presence of many contaminants. The rapidly progressing nanopore technology is approaching this threshold. A wide assortment of biomotors and cellular pores in living organisms perform diverse biological functions. The elegant design of these transportation machineries has inspired the development of single molecule detection based on modulations of the individual current blockage events. The dynamic growth of nanotechnology and nanobiotechnology has stimulated rapid advances in the study of nanopore based instrumentation over the last decade, and inspired great interest in sensing of single molecules including ions, nucleotides, enantiomers, drugs, and polymers such as PEG, RNA, DNA, and polypeptides. This sensing technology has been extended to medical diagnostics and third generation high throughput DNA sequencing. This review covers current nanopore detection platforms including both biological pores and solid state counterparts. Several biological nanopores have been studied over the years, but this review will focus on the three best characterized systems including α-hemolysin and MspA, both containing a smaller channel for the detection of single-strand DNA, as well as bacteriophage phi29 DNA packaging motor connector that contains a larger channel for the passing of double stranded DNA. The advantage and disadvantage of each system are compared; their current and potential applications in nanomedicine, biotechnology, and nanotechnology are discussed.
Solid-State and Biological Nanopore for Real-Time Sensing of Single Chemical and Sequencing of DNA

PubMed Central

Haque, Farzin; Li, Jinghong; Wu, Hai-Chen; Liang, Xing-Jie; Guo, Peixuan

2013-01-01

Sensitivity and specificity are two most important factors to take into account for molecule sensing, chemical detection and disease diagnosis. A perfect sensitivity is to reach the level where a single molecule can be detected. An ideal specificity is to reach the level where the substance can be detected in the presence of many contaminants. The rapidly progressing nanopore technology is approaching this threshold. A wide assortment of biomotors and cellular pores in living organisms perform diverse biological functions. The elegant design of these transportation machineries has inspired the development of single molecule detection based on modulations of the individual current blockage events. The dynamic growth of nanotechnology and nanobiotechnology has stimulated rapid advances in the study of nanopore based instrumentation over the last decade, and inspired great interest in sensing of single molecules including ions, nucleotides, enantiomers, drugs, and polymers such as PEG, RNA, DNA, and polypeptides. This sensing technology has been extended to medical diagnostics and third generation high throughput DNA sequencing. This review covers current nanopore detection platforms including both biological pores and solid state counterparts. Several biological nanopores have been studied over the years, but this review will focus on the three best characterized systems including α-hemolysin and MspA, both containing a smaller channel for the detection of single-strand DNA, as well as bacteriophage phi29 DNA packaging motor connector that contains a larger channel for the passing of double stranded DNA. The advantage and disadvantage of each system are compared; their current and potential applications in nanomedicine, biotechnology, and nanotechnology are discussed. PMID:23504223
A single-molecule sequencing assay for the comprehensive profiling of T4 DNA ligase fidelity and bias during DNA end-joining.

PubMed

Potapov, Vladimir; Ong, Jennifer L; Langhorst, Bradley W; Bilotti, Katharina; Cahoon, Dan; Canton, Barry; Knight, Thomas F; Evans, Thomas C; Lohman, Gregory Js

2018-05-08

DNA ligases are key enzymes in molecular and synthetic biology that catalyze the joining of breaks in duplex DNA and the end-joining of DNA fragments. Ligation fidelity (discrimination against the ligation of substrates containing mismatched base pairs) and bias (preferential ligation of particular sequences over others) have been well-studied in the context of nick ligation. However, almost no data exist for fidelity and bias in end-joining ligation contexts. In this study, we applied Pacific Biosciences Single-Molecule Real-Time sequencing technology to directly sequence the products of a highly multiplexed ligation reaction. This method has been used to profile the ligation of all three-base 5'-overhangs by T4 DNA ligase under typical ligation conditions in a single experiment. We report the relative frequency of all ligation products with or without mismatches, the position-dependent frequency of each mismatch, and the surprising observation that 5'-TNA overhangs ligate extremely inefficiently compared to all other Watson-Crick pairings. The method can easily be extended to profile other ligases, end-types (e.g. blunt ends and overhangs of different lengths), and the effect of adjacent sequence on the ligation results. Further, the method has the potential to provide new insights into the thermodynamics of annealing and the kinetics of end-joining reactions.
Electronic Transport in Single-Stranded DNA Molecule Related to Huntington's Disease

NASA Astrophysics Data System (ADS)

Sarmento, R. G.; Silva, R. N. O.; Madeira, M. P.; Frazão, N. F.; Sousa, J. O.; Macedo-Filho, A.

2018-04-01

We report a numerical analysis of the electronic transport in single chain DNA molecule consisting of 182 nucleotides. The DNA chains studied were extracted from a segment of the human chromosome 4p16.3, which were modified by expansion of CAG (cytosine-adenine-guanine) triplet repeats to mimics Huntington's disease. The mutated DNA chains were connected between two platinum electrodes to analyze the relationship between charge propagation in the molecule and Huntington's disease. The computations were performed within a tight-binding model, together with a transfer matrix technique, to investigate the current-voltage (I-V) of 23 types of DNA sequence and compare them with the distributions of the related CAG repeat numbers with the disease. All DNA sequences studied have a characteristic behavior of a semiconductor. In addition, the results showed a direct correlation between the current-voltage curves and the distributions of the CAG repeat numbers, suggesting possible applications in the development of DNA-based biosensors for molecular diagnostics.

Single-molecule sequencing and conformational capture enable de novo mammalian reference genomes

USDA-ARS?s Scientific Manuscript database

Genome assemblies have been produced for numerous species as a result of advances in sequencing technologies. However, many of the assemblies are fragmented, with many gaps, ambiguities, and errors. We use the genome of the domestic goat (Capra hircus) to demonstrate current state of the art for ef...
SINGLE MOLECULE APPROACHES TO BIOLOGY, 2010 GORDON RESEARCH CONFERENCE, JUNE 27-JULY 2, 2010, ITALY

DOE Office of Scientific and Technical Information (OSTI.GOV)

Professor William Moerner

2010-07-09

The 2010 Gordon Conference on Single-Molecule Approaches to Biology focuses on cutting-edge research in single-molecule science. Tremendous technical developments have made it possible to detect, identify, track, and manipulate single biomolecules in an ambient environment or even in a live cell. Single-molecule approaches have changed the way many biological problems are addressed, and new knowledge derived from these approaches continues to emerge. The ability of single-molecule approaches to avoid ensemble averaging and to capture transient intermediates and heterogeneous behavior renders them particularly powerful in elucidating mechanisms of biomolecular machines: what they do, how they work individually, how they work together,more » and finally, how they work inside live cells. The burgeoning use of single-molecule methods to elucidate biological problems is a highly multidisciplinary pursuit, involving both force- and fluorescence-based methods, the most up-to-date advances in microscopy, innovative biological and chemical approaches, and nanotechnology tools. This conference seeks to bring together top experts in molecular and cell biology with innovators in the measurement and manipulation of single molecules, and will provide opportunities for junior scientists and graduate students to present their work in poster format and to exchange ideas with leaders in the field. A number of excellent poster presenters will be selected for short oral talks. Topics as diverse as single-molecule sequencing, DNA/RNA/protein interactions, folding machines, cellular biophysics, synthetic biology and bioengineering, force spectroscopy, new method developments, superresolution imaging in cells, and novel probes for single-molecule imaging will be on the program. Additionally, the collegial atmosphere of this Conference, with programmed discussion sessions as well as opportunities for informal gatherings in the afternoons and evenings in the beauty of the Il Ciocco site in Tuscany, provides an avenue for scientists from different disciplines to interact and brainstorm and promotes cross-disciplinary collaborations directed toward compelling biological problems.« less
A transcriptome atlas of rabbit revealed by PacBio single-molecule long-read sequencing.

PubMed

Chen, Shi-Yi; Deng, Feilong; Jia, Xianbo; Li, Cao; Lai, Song-Jia

2017-08-09

It is widely acknowledged that transcriptional diversity largely contributes to biological regulation in eukaryotes. Since the advent of second-generation sequencing technologies, a large number of RNA sequencing studies have considerably improved our understanding of transcriptome complexity. However, it still remains a huge challenge for obtaining full-length transcripts because of difficulties in the short read-based assembly. In the present study we employ PacBio single-molecule long-read sequencing technology for whole-transcriptome profiling in rabbit (Oryctolagus cuniculus). We totally obtain 36,186 high-confidence transcripts from 14,474 genic loci, among which more than 23% of genic loci and 66% of isoforms have not been annotated yet within the current reference genome. Furthermore, about 17% of transcripts are computationally revealed to be non-coding RNAs. Up to 24,797 alternative splicing (AS) and 11,184 alternative polyadenylation (APA) events are detected within this de novo constructed transcriptome, respectively. The results provide a comprehensive set of reference transcripts and hence contribute to the improved annotation of rabbit genome.
Comprehensive analysis of single molecule sequencing-derived complete genome and whole transcriptome of Hyposidra talaca nuclear polyhedrosis virus.

PubMed

Nguyen, Thong T; Suryamohan, Kushal; Kuriakose, Boney; Janakiraman, Vasantharajan; Reichelt, Mike; Chaudhuri, Subhra; Guillory, Joseph; Divakaran, Neethu; Rabins, P E; Goel, Ridhi; Deka, Bhabesh; Sarkar, Suman; Ekka, Preety; Tsai, Yu-Chih; Vargas, Derek; Santhosh, Sam; Mohan, Sangeetha; Chin, Chen-Shan; Korlach, Jonas; Thomas, George; Babu, Azariah; Seshagiri, Somasekar

2018-06-12

We sequenced the Hyposidra talaca NPV (HytaNPV) double stranded circular DNA genome using PacBio single molecule sequencing technology. We found that the HytaNPV genome is 139,089 bp long with a GC content of 39.6%. It encodes 141 open reading frames (ORFs) including the 37 baculovirus core genes, 25 genes conserved among lepidopteran baculoviruses, 72 genes known in baculovirus, and 7 genes unique to the HytaNPV genome. It is a group II alphabaculovirus that codes for the F protein and lacks the gp64 gene found in group I alphabaculovirus viruses. Using RNA-seq, we confirmed the expression of the ORFs identified in the HytaNPV genome. Phylogenetic analysis showed HytaNPV to be closest to BusuNPV, SujuNPV and EcobNPV that infect other tea pests, Buzura suppressaria, Sucra jujuba, and Ectropis oblique, respectively. We identified repeat elements and a conserved non-coding baculovirus element in the genome. Analysis of the putative promoter sequences identified motif consistent with the temporal expression of the genes observed in the RNA-seq data.
Single-molecule FRET studies of the cooperative and non-cooperative binding kinetics of the bacteriophage T4 single-stranded DNA binding protein (gp32) to ssDNA lattices at replication fork junctions

PubMed Central

Lee, Wonbae; Gillies, John P.; Jose, Davis; Israels, Brett A.; von Hippel, Peter H.; Marcus, Andrew H.

2016-01-01

Gene 32 protein (gp32) is the single-stranded (ss) DNA binding protein of the bacteriophage T4. It binds transiently and cooperatively to ssDNA sequences exposed during the DNA replication process and regulates the interactions of the other sub-assemblies of the replication complex during the replication cycle. We here use single-molecule FRET techniques to build on previous thermodynamic studies of gp32 binding to initiate studies of the dynamics of the isolated and cooperative binding of gp32 molecules within the replication complex. DNA primer/template (p/t) constructs are used as models to determine the effects of ssDNA lattice length, gp32 concentration, salt concentration, binding cooperativity and binding polarity at p/t junctions. Hidden Markov models (HMMs) and transition density plots (TDPs) are used to characterize the dynamics of the multi-step assembly pathway of gp32 at p/t junctions of differing polarity, and show that isolated gp32 molecules bind to their ssDNA targets weakly and dissociate quickly, while cooperatively bound dimeric or trimeric clusters of gp32 bind much more tightly, can ‘slide’ on ssDNA sequences, and exhibit binding dynamics that depend on p/t junction polarities. The potential relationships of these binding dynamics to interactions with other components of the T4 DNA replication complex are discussed. PMID:27694621
Sequence analysis of cultivated strawberry (Fragaria × ananassa Duch.) using microdissected single somatic chromosomes.

PubMed

Yanagi, Tomohiro; Shirasawa, Kenta; Terachi, Mayuko; Isobe, Sachiko

2017-01-01

Cultivated strawberry ( Fragaria × ananassa Duch.) has homoeologous chromosomes because of allo-octoploidy. For example, two homoeologous chromosomes that belong to different sub-genome of allopolyploids have similar base sequences. Thus, when conducting de novo assembly of DNA sequences, it is difficult to determine whether these sequences are derived from the same chromosome. To avoid the difficulties associated with homoeologous chromosomes and demonstrate the possibility of sequencing allopolyploids using single chromosomes, we conducted sequence analysis using microdissected single somatic chromosomes of cultivated strawberry. Three hundred and ten somatic chromosomes of the Japanese octoploid strawberry 'Reiko' were individually selected under a light microscope using a microdissection system. DNA from 288 of the dissected chromosomes was successfully amplified using a DNA amplification kit. Using next-generation sequencing, we decoded the base sequences of the amplified DNA segments, and on the basis of mapping, we identified DNA sequences from 144 samples that were best matched to the reference genomes of the octoploid strawberry, F. × ananassa , and the diploid strawberry, F. vesca . The 144 samples were classified into seven pseudo-molecules of F. vesca . The coverage rates of the DNA sequences from the single chromosome onto all pseudo-molecular sequences varied from 3 to 29.9%. We demonstrated an efficient method for sequence analysis of allopolyploid plants using microdissected single chromosomes. On the basis of our results, we believe that whole-genome analysis of allopolyploid plants can be enhanced using methodology that employs microdissected single chromosomes.
MethylViewer: computational analysis and editing for bisulfite sequencing and methyltransferase accessibility protocol for individual templates (MAPit) projects.

PubMed

Pardo, Carolina E; Carr, Ian M; Hoffman, Christopher J; Darst, Russell P; Markham, Alexander F; Bonthron, David T; Kladde, Michael P

2011-01-01

Bisulfite sequencing is a widely-used technique for examining cytosine DNA methylation at nucleotide resolution along single DNA strands. Probing with cytosine DNA methyltransferases followed by bisulfite sequencing (MAPit) is an effective technique for mapping protein-DNA interactions. Here, MAPit methylation footprinting with M.CviPI, a GC methyltransferase we previously cloned and characterized, was used to probe hMLH1 chromatin in HCT116 and RKO colorectal cancer cells. Because M.CviPI-probed samples contain both CG and GC methylation, we developed a versatile, visually-intuitive program, called MethylViewer, for evaluating the bisulfite sequencing results. Uniquely, MethylViewer can simultaneously query cytosine methylation status in bisulfite-converted sequences at as many as four different user-defined motifs, e.g. CG, GC, etc., including motifs with degenerate bases. Data can also be exported for statistical analysis and as publication-quality images. Analysis of hMLH1 MAPit data with MethylViewer showed that endogenous CG methylation and accessible GC sites were both mapped on single molecules at high resolution. Disruption of positioned nucleosomes on single molecules of the PHO5 promoter was detected in budding yeast using M.CviPII, increasing the number of enzymes available for probing protein-DNA interactions. MethylViewer provides an integrated solution for primer design and rapid, accurate and detailed analysis of bisulfite sequencing or MAPit datasets from virtually any biological or biochemical system.
Design and characterization of a nanopore-coupled polymerase for single-molecule DNA sequencing by synthesis on an electrode array

PubMed Central

Stranges, P. Benjamin; Palla, Mirkó; Kalachikov, Sergey; Nivala, Jeff; Dorwart, Michael; Trans, Andrew; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Tao, Chuanjuan; Morozova, Irina; Li, Zengmin; Shi, Shundi; Aberra, Aman; Arnold, Cleoma; Yang, Alexander; Aguirre, Anne; Harada, Eric T.; Korenblum, Daniel; Pollard, James; Bhat, Ashwini; Gremyachinskiy, Dmitriy; Bibillo, Arek; Chen, Roger; Davis, Randy; Russo, James J.; Fuller, Carl W.; Roever, Stefan; Ju, Jingyue; Church, George M.

2016-01-01

Scalable, high-throughput DNA sequencing is a prerequisite for precision medicine and biomedical research. Recently, we presented a nanopore-based sequencing-by-synthesis (Nanopore-SBS) approach, which used a set of nucleotides with polymer tags that allow discrimination of the nucleotides in a biological nanopore. Here, we designed and covalently coupled a DNA polymerase to an α-hemolysin (αHL) heptamer using the SpyCatcher/SpyTag conjugation approach. These porin–polymerase conjugates were inserted into lipid bilayers on a complementary metal oxide semiconductor (CMOS)-based electrode array for high-throughput electrical recording of DNA synthesis. The designed nanopore construct successfully detected the capture of tagged nucleotides complementary to a DNA base on a provided template. We measured over 200 tagged-nucleotide signals for each of the four bases and developed a classification method to uniquely distinguish them from each other and background signals. The probability of falsely identifying a background event as a true capture event was less than 1.2%. In the presence of all four tagged nucleotides, we observed sequential additions in real time during polymerase-catalyzed DNA synthesis. Single-polymerase coupling to a nanopore, in combination with the Nanopore-SBS approach, can provide the foundation for a low-cost, single-molecule, electronic DNA-sequencing platform. PMID:27729524
Using Multiorder Time-Correlation Functions (TCFs) To Elucidate Biomolecular Reaction Pathways from Microsecond Single-Molecule Fluorescence Experiments.

PubMed

Phelps, Carey; Israels, Brett; Marsh, Morgan C; von Hippel, Peter H; Marcus, Andrew H

2016-12-29

Recent advances in single-molecule fluorescence imaging have made it possible to perform measurements on microsecond time scales. Such experiments have the potential to reveal detailed information about the conformational changes in biological macromolecules, including the reaction pathways and dynamics of the rearrangements involved in processes, such as sequence-specific DNA "breathing" and the assembly of protein-nucleic acid complexes. Because microsecond-resolved single-molecule trajectories often involve "sparse" data, that is, they contain relatively few data points per unit time, they cannot be easily analyzed using the standard protocols that were developed for single-molecule experiments carried out with tens-of-millisecond time resolution and high "data density." Here, we describe a generalized approach, based on time-correlation functions, to obtain kinetic information from microsecond-resolved single-molecule fluorescence measurements. This approach can be used to identify short-lived intermediates that lie on reaction pathways connecting relatively long-lived reactant and product states. As a concrete illustration of the potential of this methodology for analyzing specific macromolecular systems, we accompany the theoretical presentation with the description of a specific biologically relevant example drawn from studies of reaction mechanisms of the assembly of the single-stranded DNA binding protein of the T4 bacteriophage replication complex onto a model DNA replication fork.
DNA nanomapping using CRISPR-Cas9 as a programmable nanoparticle.

PubMed

Mikheikin, Andrey; Olsen, Anita; Leslie, Kevin; Russell-Pavier, Freddie; Yacoot, Andrew; Picco, Loren; Payton, Oliver; Toor, Amir; Chesney, Alden; Gimzewski, James K; Mishra, Bud; Reed, Jason

2017-11-21

Progress in whole-genome sequencing using short-read (e.g., <150 bp), next-generation sequencing technologies has reinvigorated interest in high-resolution physical mapping to fill technical gaps that are not well addressed by sequencing. Here, we report two technical advances in DNA nanotechnology and single-molecule genomics: (1) we describe a labeling technique (CRISPR-Cas9 nanoparticles) for high-speed AFM-based physical mapping of DNA and (2) the first successful demonstration of using DVD optics to image DNA molecules with high-speed AFM. As a proof of principle, we used this new "nanomapping" method to detect and map precisely BCL2-IGH translocations present in lymph node biopsies of follicular lymphoma patents. This HS-AFM "nanomapping" technique can be complementary to both sequencing and other physical mapping approaches.
Single molecule detection with graphene and other two-dimensional materials: nanopores and beyond

PubMed Central

Arjmandi-Tash, Hadi; Belyaeva, Liubov A.

2016-01-01

Graphene and other two dimensional (2D) materials are currently integrated into nanoscaled devices that may – one day – sequence genomes. The challenge to solve is conceptually straightforward: cut a sheet out of a 2D material and use the edge of the sheet to scan an unfolded biomolecule from head to tail. As the scan proceeds – and because 2D materials are atomically thin – the information provided by the edge might be used to identify different segments – ideally single nucleotides – in the biomolecular strand. So far, the most efficient approach was to drill a nano-sized pore in the sheet and use this pore as a channel to guide and detect individual molecules by measuring the electrochemical ionic current. Nanoscaled gaps between two electrodes in 2D materials recently emerged as powerful alternatives to nanopores. This article reviews the current status and prospects of integrating 2D materials in nanopores, nanogaps and similar devices for single molecule biosensing applications. We discuss the pros and cons, the challenges, and the latest achievements in the field. To achieve high-throughput sequencing with 2D materials, interdisciplinary research is essential. PMID:26612268
Characterization of Hepatitis C Virus (HCV) Envelope Diversification from Acute to Chronic Infection within a Sexually Transmitted HCV Cluster by Using Single-Molecule, Real-Time Sequencing

PubMed Central

Ho, Cynthia K. Y.; Raghwani, Jayna; Koekkoek, Sylvie; Liang, Richard H.; Van der Meer, Jan T. M.; Van Der Valk, Marc; De Jong, Menno; Pybus, Oliver G.

2016-01-01

ABSTRACT In contrast to other available next-generation sequencing platforms, PacBio single-molecule, real-time (SMRT) sequencing has the advantage of generating long reads albeit with a relatively higher error rate in unprocessed data. Using this platform, we longitudinally sampled and sequenced the hepatitis C virus (HCV) envelope genome region (1,680 nucleotides [nt]) from individuals belonging to a cluster of sexually transmitted cases. All five subjects were coinfected with HIV-1 and a closely related strain of HCV genotype 4d. In total, 50 samples were analyzed by using SMRT sequencing. By using 7 passes of circular consensus sequencing, the error rate was reduced to 0.37%, and the median number of sequences was 612 per sample. A further reduction of insertions was achieved by alignment against a sample-specific reference sequence. However, in vitro recombination during PCR amplification could not be excluded. Phylogenetic analysis supported close relationships among HCV sequences from the four male subjects and subsequent transmission from one subject to his female partner. Transmission was characterized by a strong genetic bottleneck. Viral genetic diversity was low during acute infection and increased upon progression to chronicity but subsequently fluctuated during chronic infection, caused by the alternate detection of distinct coexisting lineages. SMRT sequencing combines long reads with sufficient depth for many phylogenetic analyses and can therefore provide insights into within-host HCV evolutionary dynamics without the need for haplotype reconstruction using statistical algorithms. IMPORTANCE Next-generation sequencing has revolutionized the study of genetically variable RNA virus populations, but for phylogenetic and evolutionary analyses, longer sequences than those generated by most available platforms, while minimizing the intrinsic error rate, are desired. Here, we demonstrate for the first time that PacBio SMRT sequencing technology can be used to generate full-length HCV envelope sequences at the single-molecule level, providing a data set with large sequencing depth for the characterization of intrahost viral dynamics. The selection of consensus reads derived from at least 7 full circular consensus sequencing rounds significantly reduced the intrinsic high error rate of this method. We used this method to genetically characterize a unique transmission cluster of sexually transmitted HCV infections, providing insight into the distinct evolutionary pathways in each patient over time and identifying the transmission-associated genetic bottleneck as well as fluctuations in viral genetic diversity over time, accompanied by dynamic shifts in viral subpopulations. PMID:28077634
Clonal evolution in breast cancer revealed by single nucleus genome sequencing.

PubMed

Wang, Yong; Waters, Jill; Leung, Marco L; Unruh, Anna; Roh, Whijae; Shi, Xiuqing; Chen, Ken; Scheet, Paul; Vattathil, Selina; Liang, Han; Multani, Asha; Zhang, Hong; Zhao, Rui; Michor, Franziska; Meric-Bernstam, Funda; Navin, Nicholas E

2014-08-14

Sequencing studies of breast tumour cohorts have identified many prevalent mutations, but provide limited insight into the genomic diversity within tumours. Here we developed a whole-genome and exome single cell sequencing approach called nuc-seq that uses G2/M nuclei to achieve 91% mean coverage breadth. We applied this method to sequence single normal and tumour nuclei from an oestrogen-receptor-positive (ER(+)) breast cancer and a triple-negative ductal carcinoma. In parallel, we performed single nuclei copy number profiling. Our data show that aneuploid rearrangements occurred early in tumour evolution and remained highly stable as the tumour masses clonally expanded. In contrast, point mutations evolved gradually, generating extensive clonal diversity. Using targeted single-molecule sequencing, many of the diverse mutations were shown to occur at low frequencies (<10%) in the tumour mass. Using mathematical modelling we found that the triple-negative tumour cells had an increased mutation rate (13.3×), whereas the ER(+) tumour cells did not. These findings have important implications for the diagnosis, therapeutic treatment and evolution of chemoresistance in breast cancer.
An improved assembly of the loblolly pine mega-genome using long-read single-molecule sequencing.

PubMed

Zimin, Aleksey V; Stevens, Kristian A; Crepeau, Marc W; Puiu, Daniela; Wegrzyn, Jill L; Yorke, James A; Langley, Charles H; Neale, David B; Salzberg, Steven L

2017-01-01

The 22-gigabase genome of loblolly pine (Pinus taeda) is one of the largest ever sequenced. The draft assembly published in 2014 was built entirely from short Illumina reads, with lengths ranging from 100 to 250 base pairs (bp). The assembly was quite fragmented, containing over 11 million contigs whose weighted average (N50) size was 8206 bp. To improve this result, we generated approximately 12-fold coverage in long reads using the Single Molecule Real Time sequencing technology developed at Pacific Biosciences. We assembled the long and short reads together using the MaSuRCA mega-reads assembly algorithm, which produced a substantially better assembly, P. taeda version 2.0. The new assembly has an N50 contig size of 25 361, more than three times as large as achieved in the original assembly, and an N50 scaffold size of 107 821, 61% larger than the previous assembly. © The Author 2017. Published by Oxford University Press.
Erratum to: An improved assembly of the loblolly pine mega-genome using long-read single-molecule sequencing.

PubMed

Zimin, Aleksey V; Stevens, Kristian A; Crepeau, Marc W; Puiu, Daniela; Wegrzyn, Jill L; Yorke, James A; Langley, Charles H; Neale, David B; Salzberg, Steven L

2017-10-01

The 22-gigabase genome of loblolly pine (Pinus taeda) is one of the largest ever sequenced. The draft assembly published in 2014 was built entirely from short Illumina reads, with lengths ranging from 100 to 250 base pairs (bp). The assembly was quite fragmented, containing over 11 million contigs whose weighted average (N50) size was 8206 bp. To improve this result, we generated approximately 12-fold coverage in long reads using the Single Molecule Real Time sequencing technology developed at Pacific Biosciences. We assembled the long and short reads together using the MaSuRCA mega-reads assembly algorithm, which produced a substantially better assembly, P. taeda version 2.0. The new assembly has an N50 contig size of 25 361, more than three times as large as achieved in the original assembly, and an N50 scaffold size of 107 821, 61% larger than the previous assembly. © The Authors 2017. Published by Oxford University Press.
Improving the representation of peptide-like inhibitor and antibiotic molecules in the Protein Data Bank

PubMed Central

Dutta, Shuchismita; Dimitropoulos, Dimitris; Feng, Zukang; Persikova, Irina; Sen, Sanchayita; Shao, Chenghua; Westbrook, John; Young, Jasmine; Zhuravleva, Marina A; Kleywegt, Gerard J; Berman, Helen M

2014-01-01

With the accumulation of a large number and variety of molecules in the Protein Data Bank (PDB) comes the need on occasion to review and improve their representation. The Worldwide PDB (wwPDB) partners have periodically updated various aspects of structural data representation to improve the integrity and consistency of the archive. The remediation effort described here was focused on improving the representation of peptide-like inhibitor and antibiotic molecules so that they can be easily identified and analyzed. Peptide-like inhibitors or antibiotics were identified in over 1000 PDB entries, systematically reviewed and represented either as peptides with polymer sequence or as single components. For the majority of the single-component molecules, their peptide-like composition was captured in a new representation, called the subcomponent sequence. A novel concept called “group” was developed for representing complex peptide-like antibiotics and inhibitors that are composed of multiple polymer and nonpolymer components. In addition, a reference dictionary was developed with detailed information about these peptide-like molecules to aid in their annotation, identification and analysis. Based on the experience gained in this remediation, guidelines, procedures, and tools were developed to annotate new depositions containing peptide-like inhibitors and antibiotics accurately and consistently. © 2013 Wiley Periodicals, Inc. Biopolymers 101: 659–668, 2014. PMID:24173824
Apigenin Impacts the Growth of the Gut Microbiota and Alters the Gene Expression of Enterococcus.

PubMed

Wang, Minqian; Firrman, Jenni; Zhang, Liqing; Arango-Argoty, Gustavo; Tomasula, Peggy; Liu, LinShu; Xiao, Weidong; Yam, Kit

2017-08-03

Apigenin is a major dietary flavonoid with many bioactivities, widely distributed in plants. Apigenin reaches the colon region intact and interacts there with the human gut microbiota, however there is little research on how apigenin affects the gut bacteria. This study investigated the effect of pure apigenin on human gut bacteria, at both the single strain and community levels. The effect of apigenin on the single gut bacteria strains Bacteroides galacturonicus , Bifidobacterium catenulatum , Lactobacillus rhamnosus GG, and Enterococcus caccae , was examined by measuring their anaerobic growth profiles. The effect of apigenin on a gut microbiota community was studied by culturing a fecal inoculum under in vitro conditions simulating the human ascending colon. 16S rRNA gene sequencing and GC-MS analysis quantified changes in the community structure. Single molecule RNA sequencing was used to reveal the response of Enterococcus caccae to apigenin. Enterococcus caccae was effectively inhibited by apigenin when cultured alone, however, the genus Enterococcus was enhanced when tested in a community setting. Single molecule RNA sequencing found that Enterococcus caccae responded to apigenin by up-regulating genes involved in DNA repair, stress response, cell wall synthesis, and protein folding. Taken together, these results demonstrate that apigenin affects both the growth and gene expression of Enterococcus caccae .
Ultrafast DNA sequencing on a microchip by a hybrid separation mechanism that gives 600 bases in 6.5 minutes.

PubMed

Fredlake, Christopher P; Hert, Daniel G; Kan, Cheuk-Wai; Chiesl, Thomas N; Root, Brian E; Forster, Ryan E; Barron, Annelise E

2008-01-15

To realize the immense potential of large-scale genomic sequencing after the completion of the second human genome (Venter's), the costs for the complete sequencing of additional genomes must be dramatically reduced. Among the technologies being developed to reduce sequencing costs, microchip electrophoresis is the only new technology ready to produce the long reads most suitable for the de novo sequencing and assembly of large and complex genomes. Compared with the current paradigm of capillary electrophoresis, microchip systems promise to reduce sequencing costs dramatically by increasing throughput, reducing reagent consumption, and integrating the many steps of the sequencing pipeline onto a single platform. Although capillary-based systems require approximately 70 min to deliver approximately 650 bases of contiguous sequence, we report sequencing up to 600 bases in just 6.5 min by microchip electrophoresis with a unique polymer matrix/adsorbed polymer wall coating combination. This represents a two-thirds reduction in sequencing time over any previously published chip sequencing result, with comparable read length and sequence quality. We hypothesize that these ultrafast long reads on chips can be achieved because the combined polymer system engenders a recently discovered "hybrid" mechanism of DNA electromigration, in which DNA molecules alternate rapidly between repeating through the intact polymer network and disrupting network entanglements to drag polymers through the solution, similar to dsDNA dynamics we observe in single-molecule DNA imaging studies. Most importantly, these results reveal the surprisingly powerful ability of microchip electrophoresis to provide ultrafast Sanger sequencing, which will translate to increased system throughput and reduced costs.
Ultrafast DNA sequencing on a microchip by a hybrid separation mechanism that gives 600 bases in 6.5 minutes

PubMed Central

Fredlake, Christopher P.; Hert, Daniel G.; Kan, Cheuk-Wai; Chiesl, Thomas N.; Root, Brian E.; Forster, Ryan E.; Barron, Annelise E.

2008-01-01

To realize the immense potential of large-scale genomic sequencing after the completion of the second human genome (Venter's), the costs for the complete sequencing of additional genomes must be dramatically reduced. Among the technologies being developed to reduce sequencing costs, microchip electrophoresis is the only new technology ready to produce the long reads most suitable for the de novo sequencing and assembly of large and complex genomes. Compared with the current paradigm of capillary electrophoresis, microchip systems promise to reduce sequencing costs dramatically by increasing throughput, reducing reagent consumption, and integrating the many steps of the sequencing pipeline onto a single platform. Although capillary-based systems require ≈70 min to deliver ≈650 bases of contiguous sequence, we report sequencing up to 600 bases in just 6.5 min by microchip electrophoresis with a unique polymer matrix/adsorbed polymer wall coating combination. This represents a two-thirds reduction in sequencing time over any previously published chip sequencing result, with comparable read length and sequence quality. We hypothesize that these ultrafast long reads on chips can be achieved because the combined polymer system engenders a recently discovered “hybrid” mechanism of DNA electromigration, in which DNA molecules alternate rapidly between reptating through the intact polymer network and disrupting network entanglements to drag polymers through the solution, similar to dsDNA dynamics we observe in single-molecule DNA imaging studies. Most importantly, these results reveal the surprisingly powerful ability of microchip electrophoresis to provide ultrafast Sanger sequencing, which will translate to increased system throughput and reduced costs. PMID:18184818
Dock 'n roll: folding of a silk-inspired polypeptide into an amyloid-like beta solenoid.

PubMed

Zhao, Binwu; Cohen Stuart, Martien A; Hall, Carol K

2016-04-20

Polypeptides containing the motif ((GA)mGX)n occur in silk and have a strong tendency to self-assemble. For example, polypeptides containing (GAGAGAGX)n, where X = G or H have been observed to form filaments; similar sequences but with X = Q have been used in the design of coat proteins (capsids) for artificial viruses. The structure of the (GAGAGAGX)m filaments has been proposed to be a stack of peptides in a β roll structure with the hydrophobic side chains pointing outwards (hydrophobic shell). Another possible configuration, a β roll or β solenoid structure which has its hydrophobic side chains buried inside (hydrophobic core) was, however, overlooked. We perform ground state analysis as well as atomic-level molecular dynamics simulations, both on single molecules and on two-molecule stacks of the silk-inspired sequence (GAGAGAGQ)10, to decide whether the hydrophobic core or the hydrophobic shell configuration is the most stable one. We find that a stack of two hydrophobic core molecules is energetically more favorable than a stack of two hydrophobic shell molecules. A shell molecule initially placed in a perfect β roll structure tends to rotate its strands, breaking in-plane hydrogen bonds and forming out-of-plane hydrogen bonds, while a core molecule stays in the β roll structure. The hydrophobic shell structure has type II' β turns whereas the core configuration has type II β turns; only the latter secondary structure agrees well with solid-state NMR experiments on a similar sequence (GA)15. We also observe that the core stack has a higher number of intra-molecular hydrogen bonds and a higher number of hydrogen bonds between stack and water than the shell stack. Hence, we conclude that the hydrophobic core configuration is the most likely structure. In the stacked state, each peptide has more intra-molecular hydrogen bonds than a single folded molecule, which suggests that stacking provides the extra stability needed for molecules to reach the folded state.

Nanochannel Device with Embedded Nanopore: a New Approach for Single-Molecule DNA Analysis and Manipulation

NASA Astrophysics Data System (ADS)

Zhang, Yuning; Reisner, Walter

2012-02-01

Nanopore and nanochannel based devices are robust methods for biomolecular sensing and single DNA manipulation. Nanopore-based DNA sensing has attractive features that make it a leading candidate as a single-molecule DNA sequencing technology. Nanochannel based extension of DNA, combined with enzymatic or denaturation-based barcoding schemes, is already a powerful approach for genome analysis. We believe that there is revolutionary potential in devices that combine nanochannels with nanpore detectors. In particular, due to the fast translocation of a DNA molecule through a standard nanopore configuration, there is an unfavorable trade-off between signal and sequence resolution. With a combined nanochannel-nanopore device, based on embedding a nanopore inside a nanochannel, we can in principle gain independent control over both DNA translocation speed and sensing signal, solving the key draw-back of the standard nanopore configuration. We will discuss our recent progress on device fabrication and characterization. In particular, we demonstrate that we can detect - using fluorescent microscopy - successful translocation of DNA from the nanochannel out through the nanopore, a possible method to 'select' a given barcode for further analysis. In particular, we show that in equilibrium DNA will not escape through an embedded sub-persistence length nanopore, suggesting that the embedded pore could be used as a nanoscale window through which to interrogate a nanochannel extended DNA molecule.
Nanochannel Device with Embedded Nanopore: a New Approach for Single-Molecule DNA Analysis and Manipulation

NASA Astrophysics Data System (ADS)

Zhang, Yuning; Reisner, Walter

2013-03-01

Nanopore and nanochannel based devices are robust methods for biomolecular sensing and single DNA manipulation. Nanopore-based DNA sensing has attractive features that make it a leading candidate as a single-molecule DNA sequencing technology. Nanochannel based extension of DNA, combined with enzymatic or denaturation-based barcoding schemes, is already a powerful approach for genome analysis. We believe that there is revolutionary potential in devices that combine nanochannels with embedded pore detectors. In particular, due to the fast translocation of a DNA molecule through a standard nanopore configuration, there is an unfavorable trade-off between signal and sequence resolution. With a combined nanochannel-nanopore device, based on embedding a pore inside a nanochannel, we can in principle gain independent control over both DNA translocation speed and sensing signal, solving the key draw-back of the standard nanopore configuration. We demonstrate that we can optically detect successful translocation of DNA from the nanochannel out through the nanopore, a possible method to 'select' a given barcode for further analysis. In particular, we show that in equilibrium DNA will not escape through an embedded sub-persistence length nanopore, suggesting that the pore could be used as a nanoscale window through which to interrogate a nanochannel extended DNA molecule. Furthermore, electrical measurements through the nanopore are performed, indicating that DNA sensing is feasible using the nanochannel-nanopore device.
Shotgun Optical Maps of the Whole Escherichia coli O157:H7 Genome

PubMed Central

Lim, Alex; Dimalanta, Eileen T.; Potamousis, Konstantinos D.; Yen, Galex; Apodoca, Jennifer; Tao, Chunhong; Lin, Jieyi; Qi, Rong; Skiadas, John; Ramanathan, Arvind; Perna, Nicole T.; Plunkett, Guy; Burland, Valerie; Mau, Bob; Hackett, Jeremiah; Blattner, Frederick R.; Anantharaman, Thomas S.; Mishra, Bhubaneswar; Schwartz, David C.

2001-01-01

We have constructed NheI and XhoI optical maps of Escherichia coli O157:H7 solely from genomic DNA molecules to provide a uniquely valuable scaffold for contig closure and sequence validation. E. coli O157:H7 is a common pathogen found in contaminated food and water. Our approach obviated the need for the analysis of clones, PCR products, and hybridizations, because maps were constructed from ensembles of single DNA molecules. Shotgun sequencing of bacterial genomes remains labor-intensive, despite advances in sequencing technology. This is partly due to manual intervention required during the last stages of finishing. The applicability of optical mapping to this problem was enhanced by advances in machine vision techniques that improved mapping throughput and created a path to full automation of mapping. Comparisons were made between maps and sequence data that characterized sequence gaps and guided nascent assemblies. PMID:11544203
A cost effective 5΄ selective single cell transcriptome profiling approach with improved UMI design

PubMed Central

Arguel, Marie-Jeanne; LeBrigand, Kevin; Paquet, Agnès; Ruiz García, Sandra; Zaragosi, Laure-Emmanuelle; Waldmann, Rainer

2017-01-01

Abstract Single cell RNA sequencing approaches are instrumental in studies of cell-to-cell variability. 5΄ selective transcriptome profiling approaches allow simultaneous definition of the transcription start size and have advantages over 3΄ selective approaches which just provide internal sequences close to the 3΄ end. The only currently existing 5΄ selective approach requires costly and labor intensive fragmentation and cell barcoding after cDNA amplification. We developed an optimized 5΄ selective workflow where all the cell indexing is done prior to fragmentation. With our protocol, cell indexing can be performed in the Fluidigm C1 microfluidic device, resulting in a significant reduction of cost and labor. We also designed optimized unique molecular identifiers that show less sequence bias and vulnerability towards sequencing errors resulting in an improved accuracy of molecule counting. We provide comprehensive experimental workflows for Illumina and Ion Proton sequencers that allow single cell sequencing in a cost range comparable to qPCR assays. PMID:27940562
Serogroup-level resolution of the “Super-7” Shiga toxin-producing Escherichia coli using nanopore single-molecule DNA sequencing

USDA-ARS?s Scientific Manuscript database

DNA sequencing and other DNA-based methods, such as PCR, are now broadly used for detection and identification of bacterial foodborne pathogens. For the identification of foodborne bacterial pathogens, it is important to make taxonomic assignments to the species, or even subspecies level. Long-read ...
Multiplex single-molecule interaction profiling of DNA barcoded proteins

PubMed Central

Gu, Liangcai; Li, Chao; Aach, John; Hill, David E.; Vidal, Marc; Church, George M.

2014-01-01

In contrast with advances in massively parallel DNA sequencing1, high-throughput protein analyses2-4 are often limited by ensemble measurements, individual analyte purification and hence compromised quality and cost-effectiveness. Single-molecule (SM) protein detection achieved using optical methods5 is limited by the number of spectrally nonoverlapping chromophores. Here, we introduce a single molecular interaction-sequencing (SMI-Seq) technology for parallel protein interaction profiling leveraging SM advantages. DNA barcodes are attached to proteins collectively via ribosome display6 or individually via enzymatic conjugation. Barcoded proteins are assayed en masse in aqueous solution and subsequently immobilized in a polyacrylamide (PAA) thin film to construct a random SM array, where barcoding DNAs are amplified into in situ polymerase colonies (polonies)7 and analyzed by DNA sequencing. This method allows precise quantification of various proteins with a theoretical maximum array density of over one million polonies per square millimeter. Furthermore, protein interactions can be measured based on the statistics of colocalized polonies arising from barcoding DNAs of interacting proteins. Two demanding applications, G-protein coupled receptor (GPCR) and antibody binding profiling, were demonstrated. SMI-Seq enables “library vs. library” screening in a one-pot assay, simultaneously interrogating molecular binding affinity and specificity. PMID:25252978
Antibody-Mediated Small Molecule Detection Using Programmable DNA-Switches.

PubMed

Rossetti, Marianna; Ippodrino, Rudy; Marini, Bruna; Palleschi, Giuseppe; Porchetta, Alessandro

2018-06-13

The development of rapid, cost-effective, and single-step methods for the detection of small molecules is crucial for improving the quality and efficiency of many applications ranging from life science to environmental analysis. Unfortunately, current methodologies still require multiple complex, time-consuming washing and incubation steps, which limit their applicability. In this work we present a competitive DNA-based platform that makes use of both programmable DNA-switches and antibodies to detect small target molecules. The strategy exploits both the advantages of proximity-based methods and structure-switching DNA-probes. The platform is modular and versatile and it can potentially be applied for the detection of any small target molecule that can be conjugated to a nucleic acid sequence. Here the rational design of programmable DNA-switches is discussed, and the sensitive, rapid, and single-step detection of different environmentally relevant small target molecules is demonstrated.
Comprehensive profiling and quantitation of oncogenic mutations in non-small cell lung carcinoma using single-molecule amplification and re-sequencing technology.

PubMed

Shi, Jian; Yuan, Meng; Wang, Zhan-Dong; Xu, Xiao-Li; Hong, Lei; Sun, Shenglin

2017-02-01

The carcinogenesis of non-small cell lung carcinoma has been found to associate with activating and resistant mutations in the tyrosine kinase domain of specific oncogenes. Here, we assessed the type, frequency, and abundance of epithelial growth factor receptor, KRAS, BRAF, and ALK mutations in 154 non-small cell lung carcinoma specimens using single-molecule amplification and re-sequencing technology. We found that epithelial growth factor receptor mutations were the most prevalent (44.2%), followed by KRAS (18.8%), ALK (7.8%), and BRAF (5.8%) mutations. The type and abundance of the mutations in tumor specimens appeared to be heterogeneous. Thus, we conclude that identification of clinically significant oncogenic mutations may improve the classification of patients and provide valuable information for determination of the therapeutic strategies.
A dynamic bead-based microarray for parallel DNA detection

NASA Astrophysics Data System (ADS)

Sochol, R. D.; Casavant, B. P.; Dueck, M. E.; Lee, L. P.; Lin, L.

2011-05-01

A microfluidic system has been designed and constructed by means of micromachining processes to integrate both microfluidic mixing of mobile microbeads and hydrodynamic microbead arraying capabilities on a single chip to simultaneously detect multiple bio-molecules. The prototype system has four parallel reaction chambers, which include microchannels of 18 × 50 µm2 cross-sectional area and a microfluidic mixing section of 22 cm length. Parallel detection of multiple DNA oligonucleotide sequences was achieved via molecular beacon probes immobilized on polystyrene microbeads of 16 µm diameter. Experimental results show quantitative detection of three distinct DNA oligonucleotide sequences from the Hepatitis C viral (HCV) genome with single base-pair mismatch specificity. Our dynamic bead-based microarray offers an effective microfluidic platform to increase parallelization of reactions and improve microbead handling for various biological applications, including bio-molecule detection, medical diagnostics and drug screening.
Amino acids 16-275 of minute virus of mice NS1 include a domain that specifically binds (ACCA)2-3-containing DNA.

PubMed

Mouw, M; Pintel, D J

1998-11-10

GST-NS1 purified from Escherichia coli and insect cells binds double-strand DNA in an (ACCA)2-3-dependent fashion under similar ionic conditions, independent of the presence of anti-NS1 antisera or exogenously supplied ATP and interacts with single-strand DNA and RNA in a sequence-independent manner. An amino-terminal domain (amino acids 1-275) of NS1 [GST-NS1(1-275)], representing 41% of the full-length NS1 molecule, includes a domain that binds double-strand DNA in a sequence-specific manner at levels comparable to full-length GST-NS1, as well as single-strand DNA and RNA in a sequence-independent manner. The deletion of 15 additional amino-terminal amino acids yielded a molecule [GST-NS1(1-275)] that maintained (ACCA)2-3-specific double-strand DNA binding; however, this molecule was more sensitive to increasing ionic conditions than full-length GST-NS1 and GST-NS1(1-275) and could not be demonstrated to bind single-strand nucleic acids. A quantitative filter binding assay showed that E. coli- and baculovirus-expressed GST-NS1 and E. coli GST-NS1(1-275) specifically bound double-strand DNA with similar equilibrium kinetics [as measured by their apparent equilibrium DNA binding constants (KD)], whereas GST-NS1(16-275) bound 4- to 8-fold less well. Copyright 1998 Academic Press.
Development of Scoring Functions for Antibody Sequence Assessment and Optimization

PubMed Central

Seeliger, Daniel

2013-01-01

Antibody development is still associated with substantial risks and difficulties as single mutations can radically change molecule properties like thermodynamic stability, solubility or viscosity. Since antibody generation methodologies cannot select and optimize for molecule properties which are important for biotechnological applications, careful sequence analysis and optimization is necessary to develop antibodies that fulfil the ambitious requirements of future drugs. While efforts to grab the physical principles of undesired molecule properties from the very bottom are becoming increasingly powerful, the wealth of publically available antibody sequences provides an alternative way to develop early assessment strategies for antibodies using a statistical approach which is the objective of this paper. Here, publically available sequences were used to develop heuristic potentials for the framework regions of heavy and light chains of antibodies of human and murine origin. The potentials take into account position dependent probabilities of individual amino acids but also conditional probabilities which are inevitable for sequence assessment and optimization. It is shown that the potentials derived from human sequences clearly distinguish between human sequences and sequences from mice and, hence, can be used as a measure of humaness which compares a given sequence with the phenotypic pool of human sequences instead of comparing sequence identities to germline genes. Following this line, it is demonstrated that, using the developed potentials, humanization of an antibody can be described as a simple mathematical optimization problem and that the in-silico generated framework variants closely resemble native sequences in terms of predicted immunogenicity. PMID:24204701
SMRT sequencing data for Garcinia mangostana L. variety Mesta.

PubMed

Midin, Mohd Razik; Loke, Kok-Keong; Madon, Maria; Nordin, Mohd Shukor; Goh, Hoe-Han; Mohd Noor, Normah

2017-06-01

The "Queen of Fruits" mangosteen ( Garcinia mangostana L.) produces commercially important fruits with desirable taste of flesh and pericarp rich in xanthones with medicinal properties. To date, only limited knowledge is available on the cytogenetics and genome sequences of a common variety of mangosteen (Abu Bakar et al., 2016 [1]). Here, we report the first single-molecule real-time (SMRT) sequencing data from whole genome sequencing of mangosteen of Mesta variety. Raw reads of the SMRT sequencing project can be obtained from SRA database with the accession numbers SRX2718652 until SRX2718659.
FANTOM5 CAGE profiles of human and mouse samples.

PubMed

Noguchi, Shuhei; Arakawa, Takahiro; Fukuda, Shiro; Furuno, Masaaki; Hasegawa, Akira; Hori, Fumi; Ishikawa-Kato, Sachi; Kaida, Kaoru; Kaiho, Ai; Kanamori-Katayama, Mutsumi; Kawashima, Tsugumi; Kojima, Miki; Kubosaki, Atsutaka; Manabe, Ri-Ichiroh; Murata, Mitsuyoshi; Nagao-Sato, Sayaka; Nakazato, Kenichi; Ninomiya, Noriko; Nishiyori-Sueki, Hiromi; Noma, Shohei; Saijyo, Eri; Saka, Akiko; Sakai, Mizuho; Simon, Christophe; Suzuki, Naoko; Tagami, Michihira; Watanabe, Shoko; Yoshida, Shigehiro; Arner, Peter; Axton, Richard A; Babina, Magda; Baillie, J Kenneth; Barnett, Timothy C; Beckhouse, Anthony G; Blumenthal, Antje; Bodega, Beatrice; Bonetti, Alessandro; Briggs, James; Brombacher, Frank; Carlisle, Ailsa J; Clevers, Hans C; Davis, Carrie A; Detmar, Michael; Dohi, Taeko; Edge, Albert S B; Edinger, Matthias; Ehrlund, Anna; Ekwall, Karl; Endoh, Mitsuhiro; Enomoto, Hideki; Eslami, Afsaneh; Fagiolini, Michela; Fairbairn, Lynsey; Farach-Carson, Mary C; Faulkner, Geoffrey J; Ferrai, Carmelo; Fisher, Malcolm E; Forrester, Lesley M; Fujita, Rie; Furusawa, Jun-Ichi; Geijtenbeek, Teunis B; Gingeras, Thomas; Goldowitz, Daniel; Guhl, Sven; Guler, Reto; Gustincich, Stefano; Ha, Thomas J; Hamaguchi, Masahide; Hara, Mitsuko; Hasegawa, Yuki; Herlyn, Meenhard; Heutink, Peter; Hitchens, Kelly J; Hume, David A; Ikawa, Tomokatsu; Ishizu, Yuri; Kai, Chieko; Kawamoto, Hiroshi; Kawamura, Yuki I; Kempfle, Judith S; Kenna, Tony J; Kere, Juha; Khachigian, Levon M; Kitamura, Toshio; Klein, Sarah; Klinken, S Peter; Knox, Alan J; Kojima, Soichi; Koseki, Haruhiko; Koyasu, Shigeo; Lee, Weonju; Lennartsson, Andreas; Mackay-Sim, Alan; Mejhert, Niklas; Mizuno, Yosuke; Morikawa, Hiromasa; Morimoto, Mitsuru; Moro, Kazuyo; Morris, Kelly J; Motohashi, Hozumi; Mummery, Christine L; Nakachi, Yutaka; Nakahara, Fumio; Nakamura, Toshiyuki; Nakamura, Yukio; Nozaki, Tadasuke; Ogishima, Soichi; Ohkura, Naganari; Ohno, Hiroshi; Ohshima, Mitsuhiro; Okada-Hatakeyama, Mariko; Okazaki, Yasushi; Orlando, Valerio; Ovchinnikov, Dmitry A; Passier, Robert; Patrikakis, Margaret; Pombo, Ana; Pradhan-Bhatt, Swati; Qin, Xian-Yang; Rehli, Michael; Rizzu, Patrizia; Roy, Sugata; Sajantila, Antti; Sakaguchi, Shimon; Sato, Hiroki; Satoh, Hironori; Savvi, Suzana; Saxena, Alka; Schmidl, Christian; Schneider, Claudio; Schulze-Tanzil, Gundula G; Schwegmann, Anita; Sheng, Guojun; Shin, Jay W; Sugiyama, Daisuke; Sugiyama, Takaaki; Summers, Kim M; Takahashi, Naoko; Takai, Jun; Tanaka, Hiroshi; Tatsukawa, Hideki; Tomoiu, Andru; Toyoda, Hiroo; van de Wetering, Marc; van den Berg, Linda M; Verardo, Roberto; Vijayan, Dipti; Wells, Christine A; Winteringham, Louise N; Wolvetang, Ernst; Yamaguchi, Yoko; Yamamoto, Masayuki; Yanagi-Mizuochi, Chiyo; Yoneda, Misako; Yonekura, Yohei; Zhang, Peter G; Zucchelli, Silvia; Abugessaisa, Imad; Arner, Erik; Harshbarger, Jayson; Kondo, Atsushi; Lassmann, Timo; Lizio, Marina; Sahin, Serkan; Sengstag, Thierry; Severin, Jessica; Shimoji, Hisashi; Suzuki, Masanori; Suzuki, Harukazu; Kawai, Jun; Kondo, Naoto; Itoh, Masayoshi; Daub, Carsten O; Kasukawa, Takeya; Kawaji, Hideya; Carninci, Piero; Forrest, Alistair R R; Hayashizaki, Yoshihide

2017-08-29

In the FANTOM5 project, transcription initiation events across the human and mouse genomes were mapped at a single base-pair resolution and their frequencies were monitored by CAGE (Cap Analysis of Gene Expression) coupled with single-molecule sequencing. Approximately three thousands of samples, consisting of a variety of primary cells, tissues, cell lines, and time series samples during cell activation and development, were subjected to a uniform pipeline of CAGE data production. The analysis pipeline started by measuring RNA extracts to assess their quality, and continued to CAGE library production by using a robotic or a manual workflow, single molecule sequencing, and computational processing to generate frequencies of transcription initiation. Resulting data represents the consequence of transcriptional regulation in each analyzed state of mammalian cells. Non-overlapping peaks over the CAGE profiles, approximately 200,000 and 150,000 peaks for the human and mouse genomes, were identified and annotated to provide precise location of known promoters as well as novel ones, and to quantify their activities.
FANTOM5 CAGE profiles of human and mouse samples

PubMed Central

Noguchi, Shuhei; Arakawa, Takahiro; Fukuda, Shiro; Furuno, Masaaki; Hasegawa, Akira; Hori, Fumi; Ishikawa-Kato, Sachi; Kaida, Kaoru; Kaiho, Ai; Kanamori-Katayama, Mutsumi; Kawashima, Tsugumi; Kojima, Miki; Kubosaki, Atsutaka; Manabe, Ri-ichiroh; Murata, Mitsuyoshi; Nagao-Sato, Sayaka; Nakazato, Kenichi; Ninomiya, Noriko; Nishiyori-Sueki, Hiromi; Noma, Shohei; Saijyo, Eri; Saka, Akiko; Sakai, Mizuho; Simon, Christophe; Suzuki, Naoko; Tagami, Michihira; Watanabe, Shoko; Yoshida, Shigehiro; Arner, Peter; Axton, Richard A.; Babina, Magda; Baillie, J. Kenneth; Barnett, Timothy C.; Beckhouse, Anthony G.; Blumenthal, Antje; Bodega, Beatrice; Bonetti, Alessandro; Briggs, James; Brombacher, Frank; Carlisle, Ailsa J.; Clevers, Hans C.; Davis, Carrie A.; Detmar, Michael; Dohi, Taeko; Edge, Albert S.B.; Edinger, Matthias; Ehrlund, Anna; Ekwall, Karl; Endoh, Mitsuhiro; Enomoto, Hideki; Eslami, Afsaneh; Fagiolini, Michela; Fairbairn, Lynsey; Farach-Carson, Mary C.; Faulkner, Geoffrey J.; Ferrai, Carmelo; Fisher, Malcolm E.; Forrester, Lesley M.; Fujita, Rie; Furusawa, Jun-ichi; Geijtenbeek, Teunis B.; Gingeras, Thomas; Goldowitz, Daniel; Guhl, Sven; Guler, Reto; Gustincich, Stefano; Ha, Thomas J.; Hamaguchi, Masahide; Hara, Mitsuko; Hasegawa, Yuki; Herlyn, Meenhard; Heutink, Peter; Hitchens, Kelly J.; Hume, David A.; Ikawa, Tomokatsu; Ishizu, Yuri; Kai, Chieko; Kawamoto, Hiroshi; Kawamura, Yuki I.; Kempfle, Judith S.; Kenna, Tony J.; Kere, Juha; Khachigian, Levon M.; Kitamura, Toshio; Klein, Sarah; Klinken, S. Peter; Knox, Alan J.; Kojima, Soichi; Koseki, Haruhiko; Koyasu, Shigeo; Lee, Weonju; Lennartsson, Andreas; Mackay-sim, Alan; Mejhert, Niklas; Mizuno, Yosuke; Morikawa, Hiromasa; Morimoto, Mitsuru; Moro, Kazuyo; Morris, Kelly J.; Motohashi, Hozumi; Mummery, Christine L.; Nakachi, Yutaka; Nakahara, Fumio; Nakamura, Toshiyuki; Nakamura, Yukio; Nozaki, Tadasuke; Ogishima, Soichi; Ohkura, Naganari; Ohno, Hiroshi; Ohshima, Mitsuhiro; Okada-Hatakeyama, Mariko; Okazaki, Yasushi; Orlando, Valerio; Ovchinnikov, Dmitry A.; Passier, Robert; Patrikakis, Margaret; Pombo, Ana; Pradhan-Bhatt, Swati; Qin, Xian-Yang; Rehli, Michael; Rizzu, Patrizia; Roy, Sugata; Sajantila, Antti; Sakaguchi, Shimon; Sato, Hiroki; Satoh, Hironori; Savvi, Suzana; Saxena, Alka; Schmidl, Christian; Schneider, Claudio; Schulze-Tanzil, Gundula G.; Schwegmann, Anita; Sheng, Guojun; Shin, Jay W.; Sugiyama, Daisuke; Sugiyama, Takaaki; Summers, Kim M.; Takahashi, Naoko; Takai, Jun; Tanaka, Hiroshi; Tatsukawa, Hideki; Tomoiu, Andru; Toyoda, Hiroo; van de Wetering, Marc; van den Berg, Linda M.; Verardo, Roberto; Vijayan, Dipti; Wells, Christine A.; Winteringham, Louise N.; Wolvetang, Ernst; Yamaguchi, Yoko; Yamamoto, Masayuki; Yanagi-Mizuochi, Chiyo; Yoneda, Misako; Yonekura, Yohei; Zhang, Peter G.; Zucchelli, Silvia; Abugessaisa, Imad; Arner, Erik; Harshbarger, Jayson; Kondo, Atsushi; Lassmann, Timo; Lizio, Marina; Sahin, Serkan; Sengstag, Thierry; Severin, Jessica; Shimoji, Hisashi; Suzuki, Masanori; Suzuki, Harukazu; Kawai, Jun; Kondo, Naoto; Itoh, Masayoshi; Daub, Carsten O.; Kasukawa, Takeya; Kawaji, Hideya; Carninci, Piero; Forrest, Alistair R.R.; Hayashizaki, Yoshihide

2017-01-01

In the FANTOM5 project, transcription initiation events across the human and mouse genomes were mapped at a single base-pair resolution and their frequencies were monitored by CAGE (Cap Analysis of Gene Expression) coupled with single-molecule sequencing. Approximately three thousands of samples, consisting of a variety of primary cells, tissues, cell lines, and time series samples during cell activation and development, were subjected to a uniform pipeline of CAGE data production. The analysis pipeline started by measuring RNA extracts to assess their quality, and continued to CAGE library production by using a robotic or a manual workflow, single molecule sequencing, and computational processing to generate frequencies of transcription initiation. Resulting data represents the consequence of transcriptional regulation in each analyzed state of mammalian cells. Non-overlapping peaks over the CAGE profiles, approximately 200,000 and 150,000 peaks for the human and mouse genomes, were identified and annotated to provide precise location of known promoters as well as novel ones, and to quantify their activities. PMID:28850106
Classification of DNA nucleotides with transverse tunneling currents

NASA Astrophysics Data System (ADS)

Nyvold Pedersen, Jonas; Boynton, Paul; Di Ventra, Massimiliano; Jauho, Antti-Pekka; Flyvbjerg, Henrik

2017-01-01

It has been theoretically suggested and experimentally demonstrated that fast and low-cost sequencing of DNA, RNA, and peptide molecules might be achieved by passing such molecules between electrodes embedded in a nanochannel. The experimental realization of this scheme faces major challenges, however. In realistic liquid environments, typical currents in tunneling devices are of the order of picoamps. This corresponds to only six electrons per microsecond, and this number affects the integration time required to do current measurements in real experiments. This limits the speed of sequencing, though current fluctuations due to Brownian motion of the molecule average out during the required integration time. Moreover, data acquisition equipment introduces noise, and electronic filters create correlations in time-series data. We discuss how these effects must be included in the analysis of, e.g., the assignment of specific nucleobases to current signals. As the signals from different molecules overlap, unambiguous classification is impossible with a single measurement. We argue that the assignment of molecules to a signal is a standard pattern classification problem and calculation of the error rates is straightforward. The ideas presented here can be extended to other sequencing approaches of current interest.
Single-cell template strand sequencing by Strand-seq enables the characterization of individual homologs.

PubMed

Sanders, Ashley D; Falconer, Ester; Hills, Mark; Spierings, Diana C J; Lansdorp, Peter M

2017-06-01

The ability to distinguish between genome sequences of homologous chromosomes in single cells is important for studies of copy-neutral genomic rearrangements (such as inversions and translocations), building chromosome-length haplotypes, refining genome assemblies, mapping sister chromatid exchange events and exploring cellular heterogeneity. Strand-seq is a single-cell sequencing technology that resolves the individual homologs within a cell by restricting sequence analysis to the DNA template strands used during DNA replication. This protocol, which takes up to 4 d to complete, relies on the directionality of DNA, in which each single strand of a DNA molecule is distinguished based on its 5'-3' orientation. Culturing cells in a thymidine analog for one round of cell division labels nascent DNA strands, allowing for their selective removal during genomic library construction. To preserve directionality of template strands, genomic preamplification is bypassed and labeled nascent strands are nicked and not amplified during library preparation. Each single-cell library is multiplexed for pooling and sequencing, and the resulting sequence data are aligned, mapping to either the minus or plus strand of the reference genome, to assign template strand states for each chromosome in the cell. The major adaptations to conventional single-cell sequencing protocols include harvesting of daughter cells after a single round of BrdU incorporation, bypassing of whole-genome amplification, and removal of the BrdU + strand during Strand-seq library preparation. By sequencing just template strands, the structure and identity of each homolog are preserved.
Single-molecule DNA detection with an engineered MspA protein nanopore

PubMed Central

Butler, Tom Z.; Pavlenok, Mikhail; Derrington, Ian M.; Niederweis, Michael; Gundlach, Jens H.

2008-01-01

Nanopores hold great promise as single-molecule analytical devices and biophysical model systems because the ionic current blockades they produce contain information about the identity, concentration, structure, and dynamics of target molecules. The porin MspA of Mycobacterium smegmatis has remarkable stability against environmental stresses and can be rationally modified based on its crystal structure. Further, MspA has a short and narrow channel constriction that is promising for DNA sequencing because it may enable improved characterization of short segments of a ssDNA molecule that is threaded through the pore. By eliminating the negative charge in the channel constriction, we designed and constructed an MspA mutant capable of electronically detecting and characterizing single molecules of ssDNA as they are electrophoretically driven through the pore. A second mutant with additional exchanges of negatively-charged residues for positively-charged residues in the vestibule region exhibited a factor of ≈20 higher interaction rates, required only half as much voltage to observe interaction, and allowed ssDNA to reside in the vestibule ≈100 times longer than the first mutant. Our results introduce MspA as a nanopore for nucleic acid analysis and highlight its potential as an engineerable platform for single-molecule detection and characterization applications. PMID:19098105
Accurate RNA consensus sequencing for high-fidelity detection of transcriptional mutagenesis-induced epimutations.

PubMed

Reid-Bayliss, Kate S; Loeb, Lawrence A

2017-08-29

Transcriptional mutagenesis (TM) due to misincorporation during RNA transcription can result in mutant RNAs, or epimutations, that generate proteins with altered properties. TM has long been hypothesized to play a role in aging, cancer, and viral and bacterial evolution. However, inadequate methodologies have limited progress in elucidating a causal association. We present a high-throughput, highly accurate RNA sequencing method to measure epimutations with single-molecule sensitivity. Accurate RNA consensus sequencing (ARC-seq) uniquely combines RNA barcoding and generation of multiple cDNA copies per RNA molecule to eliminate errors introduced during cDNA synthesis, PCR, and sequencing. The stringency of ARC-seq can be scaled to accommodate the quality of input RNAs. We apply ARC-seq to directly assess transcriptome-wide epimutations resulting from RNA polymerase mutants and oxidative stress.
Homogeneous assay of target molecules based on chemiluminescence resonance energy transfer (CRET) using DNAzyme-linked aptamers.

PubMed

Mun, Hyoyoung; Jo, Eun-Jung; Li, Taihua; Joung, Hyou-Arm; Hong, Dong-Gu; Shim, Won-Bo; Jung, Cheulhee; Kim, Min-Gon

2014-08-15

We have designed a single-stranded DNAzyme-aptamer sensor for homogeneous target molecular detection based on chemiluminescence resonance energy transfer (CRET). The structure of the engineered single-stranded DNA (ssDNA) includes the horseradish peroxidase (HRP)-like DNAzyme, optimum-length linker (10-mer-length DNA), and target-specific aptamer sequences. A quencher dye was modified at the 3' end of the aptamer sequence. The incorporation of hemin into the G-quadruplex structure of DNAzyme yields an active HRP-like activity that catalyzes luminol to generate a chemiluminescence (CL) signal. In the presence of target molecules, such as ochratoxin A (OTA), adenosine triphosphate (ATP), or thrombin, the aptamer sequence was folded due to the formation of the aptamer/analyte complex, which induced the quencher dye close to the DNAzyme structure. Consequently, the CRET occurred between a DNAzyme-catalyzed chemiluminescence reaction and the quencher dye. Our results showed that CRET-based DNAzyme-aptamer biosensing enabled specific OTA analysis with a limit of detection of 0.27ng/mL. The CRET platform needs no external light source and avoids autofluorescence and photobleaching, and target molecules can be detected specifically and sensitively in a homogeneous manner. Copyright © 2014 Elsevier B.V. All rights reserved.
Deep learning for single-molecule science

NASA Astrophysics Data System (ADS)

Albrecht, Tim; Slabaugh, Gregory; Alonso, Eduardo; Al-Arif, SM Masudur R.

2017-10-01

Exploring and making predictions based on single-molecule data can be challenging, not only due to the sheer size of the datasets, but also because a priori knowledge about the signal characteristics is typically limited and poor signal-to-noise ratio. For example, hypothesis-driven data exploration, informed by an expectation of the signal characteristics, can lead to interpretation bias or loss of information. Equally, even when the different data categories are known, e.g., the four bases in DNA sequencing, it is often difficult to know how to make best use of the available information content. The latest developments in machine learning (ML), so-called deep learning (DL) offer interesting, new avenues to address such challenges. In some applications, such as speech and image recognition, DL has been able to outperform conventional ML strategies and even human performance. However, to date DL has not been applied much in single-molecule science, presumably in part because relatively little is known about the ‘internal workings’ of such DL tools within single-molecule science as a field. In this Tutorial, we make an attempt to illustrate in a step-by-step guide how one of those, a convolutional neural network (CNN), may be used for base calling in DNA sequencing applications. We compare it with a SVM as a more conventional ML method, and discuss some of the strengths and weaknesses of the approach. In particular, a ‘deep’ neural network has many features of a ‘black box’, which has important implications on how we look at and interpret data.

Highly multiplexed subcellular RNA sequencing in situ

PubMed Central

Lee, Je Hyuk; Daugharthy, Evan R.; Scheiman, Jonathan; Kalhor, Reza; Ferrante, Thomas C.; Yang, Joyce L.; Terry, Richard; Jeanty, Sauveur S. F.; Li, Chao; Amamoto, Ryoji; Peters, Derek T.; Turczyk, Brian M.; Marblestone, Adam H.; Inverso, Samuel A.; Bernard, Amy; Mali, Prashant; Rios, Xavier; Aach, John; Church, George M.

2014-01-01

Understanding the spatial organization of gene expression with single nucleotide resolution requires localizing the sequences of expressed RNA transcripts within a cell in situ. Here we describe fluorescent in situ RNA sequencing (FISSEQ), in which stably cross-linked cDNA amplicons are sequenced within a biological sample. Using 30-base reads from 8,742 genes in situ, we examined RNA expression and localization in human primary fibroblasts using a simulated wound healing assay. FISSEQ is compatible with tissue sections and whole mount embryos, and reduces the limitations of optical resolution and noisy signals on single molecule detection. Our platform enables massively parallel detection of genetic elements, including gene transcripts and molecular barcodes, and can be used to investigate cellular phenotype, gene regulation, and environment in situ. PMID:24578530
Genome sequence determination and metagenomic characterization of a Dehalococcoides mixed culture grown on cis-1,2-dichloroethene.

PubMed

Yohda, Masafumi; Yagi, Osami; Takechi, Ayane; Kitajima, Mizuki; Matsuda, Hisashi; Miyamura, Naoaki; Aizawa, Tomoko; Nakajima, Mutsuyasu; Sunairi, Michio; Daiba, Akito; Miyajima, Takashi; Teruya, Morimi; Teruya, Kuniko; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Juan, Ayaka; Nakano, Kazuma; Aoyama, Misako; Terabayashi, Yasunobu; Satou, Kazuhito; Hirano, Takashi

2015-07-01

A Dehalococcoides-containing bacterial consortium that performed dechlorination of 0.20 mM cis-1,2-dichloroethene to ethene in 14 days was obtained from the sediment mud of the lotus field. To obtain detailed information of the consortium, the metagenome was analyzed using the short-read next-generation sequencer SOLiD 3. Matching the obtained sequence tags with the reference genome sequences indicated that the Dehalococcoides sp. in the consortium was highly homologous to Dehalococcoides mccartyi CBDB1 and BAV1. Sequence comparison with the reference sequence constructed from 16S rRNA gene sequences in a public database showed the presence of Sedimentibacter, Sulfurospirillum, Clostridium, Desulfovibrio, Parabacteroides, Alistipes, Eubacterium, Peptostreptococcus and Proteocatella in addition to Dehalococcoides sp. After further enrichment, the members of the consortium were narrowed down to almost three species. Finally, the full-length circular genome sequence of the Dehalococcoides sp. in the consortium, D. mccartyi IBARAKI, was determined by analyzing the metagenome with the single-molecule DNA sequencer PacBio RS. The accuracy of the sequence was confirmed by matching it to the tag sequences obtained by SOLiD 3. The genome is 1,451,062 nt and the number of CDS is 1566, which includes 3 rRNA genes and 47 tRNA genes. There exist twenty-eight RDase genes that are accompanied by the genes for anchor proteins. The genome exhibits significant sequence identity with other Dehalococcoides spp. throughout the genome, but there exists significant difference in the distribution RDase genes. The combination of a short-read next-generation DNA sequencer and a long-read single-molecule DNA sequencer gives detailed information of a bacterial consortium. Copyright © 2014 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Single molecule quantitation and sequencing of rare translocations using microfluidic nested digital PCR.

PubMed

Shuga, Joe; Zeng, Yong; Novak, Richard; Lan, Qing; Tang, Xiaojiang; Rothman, Nathaniel; Vermeulen, Roel; Li, Laiyu; Hubbard, Alan; Zhang, Luoping; Mathies, Richard A; Smith, Martyn T

2013-09-01

Cancers are heterogeneous and genetically unstable. New methods are needed that provide the sensitivity and specificity to query single cells at the genetic loci that drive cancer progression, thereby enabling researchers to study the progression of individual tumors. Here, we report the development and application of a bead-based hemi-nested microfluidic droplet digital PCR (dPCR) technology to achieve 'quantitative' measurement and single-molecule sequencing of somatically acquired carcinogenic translocations at extremely low levels (<10(-6)) in healthy subjects. We use this technique in our healthy study population to determine the overall concentration of the t(14;18) translocation, which is strongly associated with follicular lymphoma. The nested dPCR approach improves the detection limit to 1×10(-7) or lower while maintaining the analysis efficiency and specificity. Further, the bead-based dPCR enabled us to isolate and quantify the relative amounts of the various clonal forms of t(14;18) translocation in these subjects, and the single-molecule sensitivity and resolution of dPCR led to the discovery of new clonal forms of t(14;18) that were otherwise masked by the conventional quantitative PCR measurements. In this manner, we created a quantitative map for this carcinogenic mutation in this healthy population and identified the positions on chromosomes 14 and 18 where the vast majority of these t(14;18) events occur.
Instability of plasmid DNA sequences: macro and micro evolution of the antibiotic resistance plasmid R6-5.

PubMed

Timmis, K N; Cabello, F; Andrés, I; Nordheim, A; Burkhardt, H J; Cohen, S N

1978-11-16

Detailed examination of the structure of cloned DNA fragments of the R6-5 antibiotic resistance plasmid has revealed a substantial degree of polynucleotide sequence heterogeneity and indicates that sequence rearrangements in plasmids and possible other replicons occur more frequently than has hitherto been appreciated. The sequences changes in cloned R6-5 fragments were shown in some instances to have occurred prior to cloning, i.e. existing in the original population of R6-5 molecules that was obtained from a single bacterial clone and by several different criteria judged to be homogeneous, and in others to have occurred either during the cloning procedure or during subsequent propagation of hybrid molecules. The molecular changes that are described involved insertion/deletion of the previously characterized IS2 insertion element, formation of a new inverted repeat structure probably by duplication of a preexisting R6-5 DNA sequence, sequence inversion, and loss and gain of restriction endonuclease cleavage sites.
Diff-seq: A high throughput sequencing-based mismatch detection assay for DNA variant enrichment and discovery

PubMed Central

Karas, Vlad O; Sinnott-Armstrong, Nicholas A; Varghese, Vici; Shafer, Robert W; Greenleaf, William J; Sherlock, Gavin

2018-01-01

Abstract Much of the within species genetic variation is in the form of single nucleotide polymorphisms (SNPs), typically detected by whole genome sequencing (WGS) or microarray-based technologies. However, WGS produces mostly uninformative reads that perfectly match the reference, while microarrays require genome-specific reagents. We have developed Diff-seq, a sequencing-based mismatch detection assay for SNP discovery without the requirement for specialized nucleic-acid reagents. Diff-seq leverages the Surveyor endonuclease to cleave mismatched DNA molecules that are generated after cross-annealing of a complex pool of DNA fragments. Sequencing libraries enriched for Surveyor-cleaved molecules result in increased coverage at the variant sites. Diff-seq detected all mismatches present in an initial test substrate, with specific enrichment dependent on the identity and context of the variation. Application to viral sequences resulted in increased observation of variant alleles in a biologically relevant context. Diff-Seq has the potential to increase the sensitivity and efficiency of high-throughput sequencing in the detection of variation. PMID:29361139
Highly parallel single-molecule amplification approach based on agarose droplet polymerase chain reaction for efficient and cost-effective aptamer selection.

PubMed

Zhang, Wei Yun; Zhang, Wenhua; Liu, Zhiyuan; Li, Cong; Zhu, Zhi; Yang, Chaoyong James

2012-01-03

We have developed a novel method for efficiently screening affinity ligands (aptamers) from a complex single-stranded DNA (ssDNA) library by employing single-molecule emulsion polymerase chain reaction (PCR) based on the agarose droplet microfluidic technology. In a typical systematic evolution of ligands by exponential enrichment (SELEX) process, the enriched library is sequenced first, and tens to hundreds of aptamer candidates are analyzed via a bioinformatic approach. Possible candidates are then chemically synthesized, and their binding affinities are measured individually. Such a process is time-consuming, labor-intensive, inefficient, and expensive. To address these problems, we have developed a highly efficient single-molecule approach for aptamer screening using our agarose droplet microfluidic technology. Statistically diluted ssDNA of the pre-enriched library evolved through conventional SELEX against cancer biomarker Shp2 protein was encapsulated into individual uniform agarose droplets for droplet PCR to generate clonal agarose beads. The binding capacity of amplified ssDNA from each clonal bead was then screened via high-throughput fluorescence cytometry. DNA clones with high binding capacity and low K(d) were chosen as the aptamer and can be directly used for downstream biomedical applications. We have identified an ssDNA aptamer that selectively recognizes Shp2 with a K(d) of 24.9 nM. Compared to a conventional sequencing-chemical synthesis-screening work flow, our approach avoids large-scale DNA sequencing and expensive, time-consuming DNA synthesis of large populations of DNA candidates. The agarose droplet microfluidic approach is thus highly efficient and cost-effective for molecular evolution approaches and will find wide application in molecular evolution technologies, including mRNA display, phage display, and so on. © 2011 American Chemical Society
Crossovers are associated with mutation and biased gene conversion at recombination hotspots.

PubMed

Arbeithuber, Barbara; Betancourt, Andrea J; Ebner, Thomas; Tiemann-Boege, Irene

2015-02-17

Meiosis is a potentially important source of germline mutations, as sites of meiotic recombination experience recurrent double-strand breaks (DSBs). However, evidence for a local mutagenic effect of recombination from population sequence data has been equivocal, likely because mutation is only one of several forces shaping sequence variation. By sequencing large numbers of single crossover molecules obtained from human sperm for two recombination hotspots, we find direct evidence that recombination is mutagenic: Crossovers carry more de novo mutations than nonrecombinant DNA molecules analyzed for the same donors and hotspots. The observed mutations were primarily CG to TA transitions, with a higher frequency of transitions at CpG than non-CpGs sites. This enrichment of mutations at CpG sites at hotspots could predominate in methylated regions involving frequent single-stranded DNA processing as part of DSB repair. In addition, our data set provides evidence that GC alleles are preferentially transmitted during crossing over, opposing mutation, and shows that GC-biased gene conversion (gBGC) predominates over mutation in the sequence evolution of hotspots. These findings are consistent with the idea that gBGC could be an adaptation to counteract the mutational load of recombination.
Crossovers are associated with mutation and biased gene conversion at recombination hotspots

PubMed Central

Arbeithuber, Barbara; Betancourt, Andrea J.; Ebner, Thomas; Tiemann-Boege, Irene

2015-01-01

Meiosis is a potentially important source of germline mutations, as sites of meiotic recombination experience recurrent double-strand breaks (DSBs). However, evidence for a local mutagenic effect of recombination from population sequence data has been equivocal, likely because mutation is only one of several forces shaping sequence variation. By sequencing large numbers of single crossover molecules obtained from human sperm for two recombination hotspots, we find direct evidence that recombination is mutagenic: Crossovers carry more de novo mutations than nonrecombinant DNA molecules analyzed for the same donors and hotspots. The observed mutations were primarily CG to TA transitions, with a higher frequency of transitions at CpG than non-CpGs sites. This enrichment of mutations at CpG sites at hotspots could predominate in methylated regions involving frequent single-stranded DNA processing as part of DSB repair. In addition, our data set provides evidence that GC alleles are preferentially transmitted during crossing over, opposing mutation, and shows that GC-biased gene conversion (gBGC) predominates over mutation in the sequence evolution of hotspots. These findings are consistent with the idea that gBGC could be an adaptation to counteract the mutational load of recombination. PMID:25646453
Single-molecule DNA unzipping reveals asymmetric modulation of a transcription factor by its binding site sequence and context

PubMed Central

Rudnizky, Sergei; Khamis, Hadeel; Malik, Omri; Squires, Allison H; Meller, Amit; Melamed, Philippa

2018-01-01

Abstract Most functional transcription factor (TF) binding sites deviate from their ‘consensus’ recognition motif, although their sites and flanking sequences are often conserved across species. Here, we used single-molecule DNA unzipping with optical tweezers to study how Egr-1, a TF harboring three zinc fingers (ZF1, ZF2 and ZF3), is modulated by the sequence and context of its functional sites in the Lhb gene promoter. We find that both the core 9 bp bound to Egr-1 in each of the sites, and the base pairs flanking them, modulate the affinity and structure of the protein–DNA complex. The effect of the flanking sequences is asymmetric, with a stronger effect for the sequence flanking ZF3. Characterization of the dissociation time of Egr-1 revealed that a local, mechanical perturbation of the interactions of ZF3 destabilizes the complex more effectively than a perturbation of the ZF1 interactions. Our results reveal a novel role for ZF3 in the interaction of Egr-1 with other proteins and the DNA, providing insight on the regulation of Lhb and other genes by Egr-1. Moreover, our findings reveal the potential of small changes in DNA sequence to alter transcriptional regulation, and may shed light on the organization of regulatory elements at promoters. PMID:29253225
Single Molecule Bioelectronics and Their Application to Amplification-Free Measurement of DNA Lengths

PubMed Central

Gül, O. Tolga; Pugliese, Kaitlin M.; Choi, Yongki; Sims, Patrick C.; Pan, Deng; Rajapakse, Arith J.; Weiss, Gregory A.; Collins, Philip G.

2016-01-01

As biosensing devices shrink smaller and smaller, they approach a scale in which single molecule electronic sensing becomes possible. Here, we review the operation of single-enzyme transistors made using single-walled carbon nanotubes. These novel hybrid devices transduce the motions and catalytic activity of a single protein into an electronic signal for real-time monitoring of the protein’s activity. Analysis of these electronic signals reveals new insights into enzyme function and proves the electronic technique to be complementary to other single-molecule methods based on fluorescence. As one example of the nanocircuit technique, we have studied the Klenow Fragment (KF) of DNA polymerase I as it catalytically processes single-stranded DNA templates. The fidelity of DNA polymerases makes them a key component in many DNA sequencing techniques, and here we demonstrate that KF nanocircuits readily resolve DNA polymerization with single-base sensitivity. Consequently, template lengths can be directly counted from electronic recordings of KF’s base-by-base activity. After measuring as few as 20 copies, the template length can be determined with <1 base pair resolution, and different template lengths can be identified and enumerated in solutions containing template mixtures. PMID:27348011
Single Molecule Bioelectronics and Their Application to Amplification-Free Measurement of DNA Lengths.

PubMed

Gül, O Tolga; Pugliese, Kaitlin M; Choi, Yongki; Sims, Patrick C; Pan, Deng; Rajapakse, Arith J; Weiss, Gregory A; Collins, Philip G

2016-06-24

As biosensing devices shrink smaller and smaller, they approach a scale in which single molecule electronic sensing becomes possible. Here, we review the operation of single-enzyme transistors made using single-walled carbon nanotubes. These novel hybrid devices transduce the motions and catalytic activity of a single protein into an electronic signal for real-time monitoring of the protein's activity. Analysis of these electronic signals reveals new insights into enzyme function and proves the electronic technique to be complementary to other single-molecule methods based on fluorescence. As one example of the nanocircuit technique, we have studied the Klenow Fragment (KF) of DNA polymerase I as it catalytically processes single-stranded DNA templates. The fidelity of DNA polymerases makes them a key component in many DNA sequencing techniques, and here we demonstrate that KF nanocircuits readily resolve DNA polymerization with single-base sensitivity. Consequently, template lengths can be directly counted from electronic recordings of KF's base-by-base activity. After measuring as few as 20 copies, the template length can be determined with <1 base pair resolution, and different template lengths can be identified and enumerated in solutions containing template mixtures.
A septal chromosome segregator protein evolved into a conjugative DNA-translocator protein

PubMed Central

Sepulveda, Edgardo; Vogelmann, Jutta

2011-01-01

Streptomycetes, Gram-positive soil bacteria well known for the production of antibiotics feature a unique conjugative DNA transfer system. In contrast to classical conjugation which is characterized by the secretion of a pilot protein covalently linked to a single-stranded DNA molecule, in Streptomyces a double-stranded DNA molecule is translocated during conjugative transfer. This transfer involves a single plasmid encoded protein, TraB. A detailed biochemical and biophysical characterization of TraB, revealed a close relationship to FtsK, mediating chromosome segregation during bacterial cell division. TraB translocates plasmid DNA by recognizing 8-bp direct repeats located in a specific plasmid region clt. Similar sequences accidentally also occur on chromosomes and have been shown to be bound by TraB. We suggest that TraB mobilizes chromosomal genes by the interaction with these chromosomal clt-like sequences not relying on the integration of the conjugative plasmid into the chromosome. PMID:22479692
Nanosecond to submillisecond dynamics in dye-labeled single-stranded DNA, as revealed by ensemble measurements and photon statistics at single-molecule level.

PubMed

Kaji, Takahiro; Ito, Syoji; Iwai, Shigenori; Miyasaka, Hiroshi

2009-10-22

Single-molecule and ensemble time-resolved fluorescence measurements were applied for the investigation of the conformational dynamics of single-stranded DNA, ssDNA, connected with a fluorescein dye by a C6 linker, where the motions both of DNA and the C6 linker affect the geometry of the system. From the ensemble measurement of the fluorescence quenching via photoinduced electron transfer with a guanine base in the DNA sequence, three main conformations were found in aqueous solution: a conformation unaffected by the guanine base in the excited state lifetime of fluorescein, a conformation in which the fluorescence is dynamically quenched in the excited-state lifetime, and a conformation leading to rapid quenching via nonfluorescent complex. The analysis by using the parameters acquired from the ensemble measurements for interphoton time distribution histograms and FCS autocorrelations by the single-molecule measurement revealed that interconversion in these three conformations took place with two characteristic time constants of several hundreds of nanoseconds and tens of microseconds. The advantage of the combination use of the ensemble measurements with the single-molecule detections for rather complex dynamic motions is discussed by integrating the experimental results with those obtained by molecular dynamics simulation.
Detecting and Analyzing Genetic Recombination Using RDP4.

PubMed

Martin, Darren P; Murrell, Ben; Khoosal, Arjun; Muhire, Brejnev

2017-01-01

Recombination between nucleotide sequences is a major process influencing the evolution of most species on Earth. The evolutionary value of recombination has been widely debated and so too has its influence on evolutionary analysis methods that assume nucleotide sequences replicate without recombining. When nucleic acids recombine, the evolution of the daughter or recombinant molecule cannot be accurately described by a single phylogeny. This simple fact can seriously undermine the accuracy of any phylogenetics-based analytical approach which assumes that the evolutionary history of a set of recombining sequences can be adequately described by a single phylogenetic tree. There are presently a large number of available methods and associated computer programs for analyzing and characterizing recombination in various classes of nucleotide sequence datasets. Here we examine the use of some of these methods to derive and test recombination hypotheses using multiple sequence alignments.
Periodic Assembly of Nanospecies on Repetitive DNA Sequences Generated on Gold Nanoparticles by Rolling Circle Amplification

NASA Astrophysics Data System (ADS)

Zhao, Weian; Brook, Michael A.; Li, Yingfu

Periodical assembly of nanospecies is desirable for the construction of nanodevices. We provide a protocol for the preparation of a gold nanoparticle (AuNP)/DNA scaffold on which nanospecies can be assembled in a periodical manner. AuNP/DNA scaffold is prepared by growing long single-stranded DNA (ssDNA) molecules (typically hundreds of nanometers to a few microns in length) on AuNPs via rolling circle amplification (RCA). Since these long ssDNA molecules contain many repetitive sequence units, complementary DNA-attached nanospecies can be assembled through specific hybridization in a controllable and periodical manner.
The sequence of sequencers: The history of sequencing DNA

PubMed Central

Heather, James M.; Chain, Benjamin

2016-01-01

Determining the order of nucleic acid residues in biological samples is an integral component of a wide variety of research applications. Over the last fifty years large numbers of researchers have applied themselves to the production of techniques and technologies to facilitate this feat, sequencing DNA and RNA molecules. This time-scale has witnessed tremendous changes, moving from sequencing short oligonucleotides to millions of bases, from struggling towards the deduction of the coding sequence of a single gene to rapid and widely available whole genome sequencing. This article traverses those years, iterating through the different generations of sequencing technology, highlighting some of the key discoveries, researchers, and sequences along the way. PMID:26554401
Distinguishing Individual DNA Bases in a Network by Non-Resonant Tip-Enhanced Raman Scattering.

PubMed

Zhang, Rui; Zhang, Xianbiao; Wang, Huifang; Zhang, Yao; Jiang, Song; Hu, Chunrui; Zhang, Yang; Luo, Yi; Dong, Zhenchao

2017-05-08

The importance of identifying DNA bases at the single-molecule level is well recognized for many biological applications. Although such identification can be achieved by electrical measurements using special setups, it is still not possible to identify single bases in real space by optical means owing to the diffraction limit. Herein, we demonstrate the outstanding ability of scanning tunneling microscope (STM)-controlled non-resonant tip-enhanced Raman scattering (TERS) to unambiguously distinguish two individual complementary DNA bases (adenine and thymine) with a spatial resolution down to 0.9 nm. The distinct Raman fingerprints identified for the two molecules allow to differentiate in real space individual DNA bases in coupled base pairs. The demonstrated ability of non-resonant Raman scattering with super-high spatial resolution will significantly extend the applicability of TERS, opening up new routes for single-molecule DNA sequencing. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Single molecule real-time sequencing of Xanthomonas oryzae genomes reveals a dynamic structure and complex TAL (transcription activator-like) effector gene relationships

PubMed Central

Booher, Nicholas J.; Carpenter, Sara C. D.; Sebra, Robert P.; Wang, Li; Salzberg, Steven L.; Leach, Jan E.

2015-01-01

Pathogen-injected, direct transcriptional activators of host genes, TAL (transcription activator-like) effectors play determinative roles in plant diseases caused by Xanthomonas spp. A large domain of nearly identical, 33–35 aa repeats in each protein mediates DNA recognition. This modularity makes TAL effectors customizable and thus important also in biotechnology. However, the repeats render TAL effector (tal) genes nearly impossible to assemble using next-generation, short reads. Here, we demonstrate that long-read, single molecule real-time (SMRT) sequencing solves this problem. Taking an ensemble approach to first generate local, tal gene contigs, we correctly assembled de novo the genomes of two strains of the rice pathogen X. oryzae completed previously using the Sanger method and even identified errors in those references. Sequencing two more strains revealed a dynamic genome structure and a striking plasticity in tal gene content. Our results pave the way for population-level studies to inform resistance breeding, improve biotechnology and probe TAL effector evolution. PMID:27148456
Biophysics of protein-DNA interactions and chromosome organization

PubMed Central

Marko, John F.

2014-01-01

The function of DNA in cells depends on its interactions with protein molecules, which recognize and act on base sequence patterns along the double helix. These notes aim to introduce basic polymer physics of DNA molecules, biophysics of protein-DNA interactions and their study in single-DNA experiments, and some aspects of large-scale chromosome structure. Mechanisms for control of chromosome topology will also be discussed. PMID:25419039
Droplet Microfluidics for Compartmentalized Cell Lysis and Extension of DNA from Single-Cells

NASA Astrophysics Data System (ADS)

Zimny, Philip; Juncker, David; Reisner, Walter

Current single cell DNA analysis methods suffer from (i) bias introduced by the need for molecular amplification and (ii) limited ability to sequence repetitive elements, resulting in (iii) an inability to obtain information regarding long range genomic features. Recent efforts to circumvent these limitations rely on techniques for sensing single molecules of DNA extracted from single-cells. Here we demonstrate a droplet microfluidic approach for encapsulation and biochemical processing of single-cells inside alginate microparticles. In our approach, single-cells are first packaged inside the alginate microparticles followed by cell lysis, DNA purification, and labeling steps performed off-chip inside this microparticle system. The alginate microparticles are then introduced inside a micro/nanofluidic system where the alginate is broken down via a chelating buffer, releasing long DNA molecules which are then extended inside nanofluidic channels for analysis via standard mapping protocols.

An atypical CNG channel activated by a single cGMP molecule controls sperm chemotaxis.

PubMed

Bönigk, Wolfgang; Loogen, Astrid; Seifert, Reinhard; Kashikar, Nachiket; Klemm, Clementine; Krause, Eberhard; Hagen, Volker; Kremmer, Elisabeth; Strünker, Timo; Kaupp, U Benjamin

2009-10-27

Sperm of the sea urchin Arbacia punctulata can respond to a single molecule of chemoattractant released by an egg. The mechanism underlying this extreme sensitivity is unknown. Crucial signaling events in the response of A. punctulata sperm to chemoattractant include the rapid synthesis of the intracellular messenger guanosine 3',5'-monophosphate (cGMP) and the ensuing membrane hyperpolarization that results from the opening of potassium-selective cyclic nucleotide-gated (CNGK) channels. Here, we use calibrated photolysis of caged cGMP to show that approximately 45 cGMP molecules are generated during the response to a single molecule of chemoattractant. The CNGK channel can respond to such small cGMP changes because it is exquisitely sensitive to cGMP and activated in a noncooperative fashion. Like voltage-activated Ca(v) and Na(v) channels, the CNGK polypeptide consists of four homologous repeat sequences. Disabling each of the four cyclic nucleotide-binding sites through mutagenesis revealed that binding of a single cGMP molecule to repeat 3 is necessary and sufficient to activate the CNGK channel. Thus, CNGK has developed a mechanism of activation that is different from the activation of other CNG channels, which requires the cooperative binding of several ligands and operates in the micromolar rather than the nanomolar range.
Methods And Devices For Characterizing Duplex Nucleic Acid Molecules

DOEpatents

Akeson, Mark; Vercoutere, Wenonah; Haussler, David; Winters-Hilt, Stephen

2005-08-30

Methods and devices are provided for characterizing a duplex nucleic acid, e.g., a duplex DNA molecule. In the subject methods, a fluid conducting medium that includes a duplex nucleic acid molecule is contacted with a nanopore under the influence of an applied electric field and the resulting changes in current through the nanopore caused by the duplex nucleic acid molecule are monitored. The observed changes in current through the nanopore are then employed as a set of data values to characterize the duplex nucleic acid, where the set of data values may be employed in raw form or manipulated, e.g., into a current blockade profile. Also provided are nanopore devices for practicing the subject methods, where the subject nanopore devices are characterized by the presence of an algorithm which directs a processing means to employ monitored changes in current through a nanopore to characterize a duplex nucleic acid molecule responsible for the current changes. The subject methods and devices find use in a variety of applications, including, among other applications, the identification of an analyte duplex DNA molecule in a sample, the specific base sequence at a single nulceotide polymorphism (SNP), and the sequencing of duplex DNA molecules.
Molecular sled sequences are common in mammalian proteins.

PubMed

Xiong, Kan; Blainey, Paul C

2016-03-18

Recent work revealed a new class of molecular machines called molecular sleds, which are small basic molecules that bind and slide along DNA with the ability to carry cargo along DNA. Here, we performed biochemical and single-molecule flow stretching assays to investigate the basis of sliding activity in molecular sleds. In particular, we identified the functional core of pVIc, the first molecular sled characterized; peptide functional groups that control sliding activity; and propose a model for the sliding activity of molecular sleds. We also observed widespread DNA binding and sliding activity among basic polypeptide sequences that implicate mammalian nuclear localization sequences and many cell penetrating peptides as molecular sleds. These basic protein motifs exhibit weak but physiologically relevant sequence-nonspecific DNA affinity. Our findings indicate that many mammalian proteins contain molecular sled sequences and suggest the possibility that substantial undiscovered sliding activity exists among nuclear mammalian proteins. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments.

PubMed

Dabney, Jesse; Knapp, Michael; Glocke, Isabelle; Gansauge, Marie-Theres; Weihmann, Antje; Nickel, Birgit; Valdiosera, Cristina; García, Nuria; Pääbo, Svante; Arsuaga, Juan-Luis; Meyer, Matthias

2013-09-24

Although an inverse relationship is expected in ancient DNA samples between the number of surviving DNA fragments and their length, ancient DNA sequencing libraries are strikingly deficient in molecules shorter than 40 bp. We find that a loss of short molecules can occur during DNA extraction and present an improved silica-based extraction protocol that enables their efficient retrieval. In combination with single-stranded DNA library preparation, this method enabled us to reconstruct the mitochondrial genome sequence from a Middle Pleistocene cave bear (Ursus deningeri) bone excavated at Sima de los Huesos in the Sierra de Atapuerca, Spain. Phylogenetic reconstructions indicate that the U. deningeri sequence forms an early diverging sister lineage to all Western European Late Pleistocene cave bears. Our results prove that authentic ancient DNA can be preserved for hundreds of thousand years outside of permafrost. Moreover, the techniques presented enable the retrieval of phylogenetically informative sequences from samples in which virtually all DNA is diminished to fragments shorter than 50 bp.
Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments

PubMed Central

Dabney, Jesse; Knapp, Michael; Glocke, Isabelle; Gansauge, Marie-Theres; Weihmann, Antje; Nickel, Birgit; Valdiosera, Cristina; García, Nuria; Pääbo, Svante; Arsuaga, Juan-Luis; Meyer, Matthias

2013-01-01

Although an inverse relationship is expected in ancient DNA samples between the number of surviving DNA fragments and their length, ancient DNA sequencing libraries are strikingly deficient in molecules shorter than 40 bp. We find that a loss of short molecules can occur during DNA extraction and present an improved silica-based extraction protocol that enables their efficient retrieval. In combination with single-stranded DNA library preparation, this method enabled us to reconstruct the mitochondrial genome sequence from a Middle Pleistocene cave bear (Ursus deningeri) bone excavated at Sima de los Huesos in the Sierra de Atapuerca, Spain. Phylogenetic reconstructions indicate that the U. deningeri sequence forms an early diverging sister lineage to all Western European Late Pleistocene cave bears. Our results prove that authentic ancient DNA can be preserved for hundreds of thousand years outside of permafrost. Moreover, the techniques presented enable the retrieval of phylogenetically informative sequences from samples in which virtually all DNA is diminished to fragments shorter than 50 bp. PMID:24019490
Dock ’n Roll: Folding of a Silk-Inspired Polypeptide into an Amyloid-like Beta Solenoid

PubMed Central

Zhao, Binwu; Cohen Stuart, Martien A.; Hall, Carol K.

2016-01-01

Polypeptides containing the motif ((GA)mGX)n occur in silk (we refer to them as ‘silk-like’) and have a strong tendency to self-assemble. For example, polypeptides containing (GAGAGAGX)n, where X = G or H have been observed to form filaments; similar sequences but with X = Q have been used in the design of coat proteins (capsids) for artificial viruses. The structure of the (GAGAGAGX)m filaments has been proposed to be a stack of peptides in a β roll structure with the hydrophobic side chains pointing outwards (hydrophobic shell). Another possible configuration, a β roll or β solenoid structure which has its hydrophobic side chains buried inside (hydrophobic core) was, however, overlooked. We perform ground state analysis as well as atomic-level molecular dynamics simulations, both on single molecules and on two-molecule stacks of the silk-inspired sequence (GAGAGAGQ)10, to decide whether the hydrophobic core or the hydrophobic shell configuration is the most stable one. We find that a stack of two hydrophobic core molecules is energetically more favorable than a stack of two shell molecules. A shell molecule initially placed in a perfect β roll structure tends to rotate its strands, breaking in-plane hydrogen bonds and forming out-of-plane hydrogen bonds, while a core molecule stays in the β roll structure. The hydrophobic shell structure has type II’ β turns whereas the core configuration has type II β turns; only the latter secondary structure agrees well with solid-state NMR experiments on a similar sequence (GA)15. We also observe that the core stack has a higher number of intra-molecular hydrogen bonds and a higher number of hydrogen bonds between stack and water than the shell stack. Hence, we conclude that the hydrophobic core configuration is the most likely structure. In the stacked state, each peptide has more intra-molecular hydrogen bonds than a single folded molecule, which suggests that stacking provides the extra stability needed for molecules to reach the folded state. PMID:26947809
Atomic force microscope observation of branching in single transcript molecules derived from human cardiac muscle

NASA Astrophysics Data System (ADS)

Reed, Jason; Hsueh, Carlin; Mishra, Bud; Gimzewski, James K.

2008-09-01

We have used an atomic force microscope to examine a clinically derived sample of single-molecule gene transcripts, in the form of double-stranded cDNA, (c: complementary) obtained from human cardiac muscle without the use of polymerase chain reaction (PCR) amplification. We observed a log-normal distribution of transcript sizes, with most molecules being in the range of 0.4-7.0 kilobase pairs (kb) or 130-2300 nm in contour length, in accordance with the expected distribution of mRNA (m: messenger) sizes in mammalian cells. We observed novel branching structures not previously known to exist in cDNA, and which could have profound negative effects on traditional analysis of cDNA samples through cloning, PCR and DNA sequencing.
Direct Detection and Sequencing of Damaged DNA Bases

PubMed Central

2011-01-01

Products of various forms of DNA damage have been implicated in a variety of important biological processes, such as aging, neurodegenerative diseases, and cancer. Therefore, there exists great interest to develop methods for interrogating damaged DNA in the context of sequencing. Here, we demonstrate that single-molecule, real-time (SMRT®) DNA sequencing can directly detect damaged DNA bases in the DNA template - as a by-product of the sequencing method - through an analysis of the DNA polymerase kinetics that are altered by the presence of a modified base. We demonstrate the sequencing of several DNA templates containing products of DNA damage, including 8-oxoguanine, 8-oxoadenine, O6-methylguanine, 1-methyladenine, O4-methylthymine, 5-hydroxycytosine, 5-hydroxyuracil, 5-hydroxymethyluracil, or thymine dimers, and show that these base modifications can be readily detected with single-modification resolution and DNA strand specificity. We characterize the distinct kinetic signatures generated by these DNA base modifications. PMID:22185597
Direct detection and sequencing of damaged DNA bases.

PubMed

Clark, Tyson A; Spittle, Kristi E; Turner, Stephen W; Korlach, Jonas

2011-12-20

Products of various forms of DNA damage have been implicated in a variety of important biological processes, such as aging, neurodegenerative diseases, and cancer. Therefore, there exists great interest to develop methods for interrogating damaged DNA in the context of sequencing. Here, we demonstrate that single-molecule, real-time (SMRT®) DNA sequencing can directly detect damaged DNA bases in the DNA template - as a by-product of the sequencing method - through an analysis of the DNA polymerase kinetics that are altered by the presence of a modified base. We demonstrate the sequencing of several DNA templates containing products of DNA damage, including 8-oxoguanine, 8-oxoadenine, O6-methylguanine, 1-methyladenine, O4-methylthymine, 5-hydroxycytosine, 5-hydroxyuracil, 5-hydroxymethyluracil, or thymine dimers, and show that these base modifications can be readily detected with single-modification resolution and DNA strand specificity. We characterize the distinct kinetic signatures generated by these DNA base modifications.
Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.

PubMed

Chin, Chen-Shan; Alexander, David H; Marks, Patrick; Klammer, Aaron A; Drake, James; Heiner, Cheryl; Clum, Alicia; Copeland, Alex; Huddleston, John; Eichler, Evan E; Turner, Stephen W; Korlach, Jonas

2013-06-01

We present a hierarchical genome-assembly process (HGAP) for high-quality de novo microbial genome assemblies using only a single, long-insert shotgun DNA library in conjunction with Single Molecule, Real-Time (SMRT) DNA sequencing. Our method uses the longest reads as seeds to recruit all other reads for construction of highly accurate preassembled reads through a directed acyclic graph-based consensus procedure, which we follow with assembly using off-the-shelf long-read assemblers. In contrast to hybrid approaches, HGAP does not require highly accurate raw reads for error correction. We demonstrate efficient genome assembly for several microorganisms using as few as three SMRT Cell zero-mode waveguide arrays of sequencing and for BACs using just one SMRT Cell. Long repeat regions can be successfully resolved with this workflow. We also describe a consensus algorithm that incorporates SMRT sequencing primary quality values to produce de novo genome sequence exceeding 99.999% accuracy.
The sequence of sequencers: The history of sequencing DNA.

PubMed

Heather, James M; Chain, Benjamin

2016-01-01

Determining the order of nucleic acid residues in biological samples is an integral component of a wide variety of research applications. Over the last fifty years large numbers of researchers have applied themselves to the production of techniques and technologies to facilitate this feat, sequencing DNA and RNA molecules. This time-scale has witnessed tremendous changes, moving from sequencing short oligonucleotides to millions of bases, from struggling towards the deduction of the coding sequence of a single gene to rapid and widely available whole genome sequencing. This article traverses those years, iterating through the different generations of sequencing technology, highlighting some of the key discoveries, researchers, and sequences along the way. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Complete telomere-to-telomere de novo assembly of the Plasmodium falciparum genome through long-read (>11 kb), single molecule, real-time sequencing

PubMed Central

Vembar, Shruthi Sridhar; Seetin, Matthew; Lambert, Christine; Nattestad, Maria; Schatz, Michael C.; Baybayan, Primo; Scherf, Artur; Smith, Melissa Laird

2016-01-01

The application of next-generation sequencing to estimate genetic diversity of Plasmodium falciparum, the most lethal malaria parasite, has proved challenging due to the skewed AT-richness [∼80.6% (A + T)] of its genome and the lack of technology to assemble highly polymorphic subtelomeric regions that contain clonally variant, multigene virulence families (Ex: var and rifin). To address this, we performed amplification-free, single molecule, real-time sequencing of P. falciparum genomic DNA and generated reads of average length 12 kb, with 50% of the reads between 15.5 and 50 kb in length. Next, using the Hierarchical Genome Assembly Process, we assembled the P. falciparum genome de novo and successfully compiled all 14 nuclear chromosomes telomere-to-telomere. We also accurately resolved centromeres [∼90–99% (A + T)] and subtelomeric regions and identified large insertions and duplications that add extra var and rifin genes to the genome, along with smaller structural variants such as homopolymer tract expansions. Overall, we show that amplification-free, long-read sequencing combined with de novo assembly overcomes major challenges inherent to studying the P. falciparum genome. Indeed, this technology may not only identify the polymorphic and repetitive subtelomeric sequences of parasite populations from endemic areas but may also evaluate structural variation linked to virulence, drug resistance and disease transmission. PMID:27345719
Inhibition of Oncogenic functionality of STAT3 Protein by Membrane Anchoring

NASA Astrophysics Data System (ADS)

Liu, Baoxu; Fletcher, Steven; Gunning, Patrick; Gradinaru, Claudiu

2009-03-01

Signal Transducer and Activator of Transcription 3 (STAT3) protein plays an important role in oncogenic processes. A novel molecular therapeutic approach to inhibit the oncogenic functionality of STAT3 is to design a prenylated small peptide sequence which could sequester STAT3 to the plasma membrane. We have also developed a novel fluorescein derivative label (F-NAc), which is much more photostable compared to the popular fluorescein label FITC. Remarkably, the new dye shows fluorescent properties that are invariant over a wide pH range, which is advantageous for our application. We have shown that F-NAc is suitable for single-molecule measurements and its properties are not affected by ligation to biomolecules. The membrane localization via high-affinity prenylated small-molecule binding agents is studied by encapsulating FNAc-labeled STAT3 and inhibitors within a liposome model cell system. The dynamics of the interaction between the protein and the prenylated ligands is investigated at single molecule level. The efficiency and stability of the STAT3 anchoring in lipid membranes are addressed via quantitative confocal imaging and single-molecule spectroscopy using a custom-built multiparameter fluorescence microscope.
DNA sequencing with pyrophosphatase

DOEpatents

Tabor, S.; Richardson, C.C.

1996-03-12

A kit or solution is disclosed for use in extension of an oligonucleotide primer having a first single-stranded region on a template molecule and having a second single-stranded region homologous to the first single-stranded region. The first agent is able to cause extension of the first single-stranded region of the primer on the second single-stranded region of the template in a reaction mixture. The second agent is able to reduce the amount of pyrophosphate in the reaction mixture below the amount produced during the extension in the absence of the second agent.
DNA sequencing with pyrophosphatase

DOEpatents

Tabor, Stanley; Richardson, Charles C.

1996-03-12

A kit or solution for use in extension of an oligonucleotide primer having a first single-stranded region on a template molecule having a second single-stranded region homologous to the first single-stranded region, comprising a first agent able to cause extension of the first single-stranded region of the primer on the second single-stranded region of the template in a reaction mixture, and a second agent able to reduce the amount of pyrophosphate in the reaction mixture below the amount produced during the extension in the absence of the second agent.
A disruptive sequencer meets disruptive publishing.

PubMed

Loman, Nick; Goodwin, Sarah; Jansen, Hans; Loose, Matt

2015-01-01

Nanopore sequencing was recently made available to users in the form of the Oxford Nanopore MinION. Released to users through an early access programme, the MinION is made unique by its tiny form factor and ability to generate very long sequences from single DNA molecules. The platform is undergoing rapid evolution with three distinct nanopore types and five updates to library preparation chemistry in the last 18 months. To keep pace with the rapid evolution of this sequencing platform, and to provide a space where new analysis methods can be openly discussed, we present a new F1000Research channel devoted to updates to and analysis of nanopore sequence data.
Conformation and Aggregation of LKα14 Peptide in Bulk Water and at the Air/Water Interface.

PubMed

Dalgicdir, Cahit; Sayar, Mehmet

2015-12-10

Historically, the protein folding problem has mainly been associated with understanding the relationship between amino acid sequence and structure. However, it is known that both the conformation of individual molecules and their aggregation strongly depend on the environmental conditions. Here, we study the aggregation behavior of the model peptide LKα14 (with amino acid sequence LKKLLKLLKKLLKL) in bulk water and at the air/water interface. We start by a quantitative analysis of the conformational space of a single LKα14 in bulk water. Next, in order to analyze the aggregation tendency of LKα14, by using the umbrella sampling technique we calculate the potential of mean force for pulling a single peptide from an n-molecule aggregate. In agreement with the experimental results, our calculations yield the optimal aggregate size as four. This equilibrium state is achieved by two opposing forces: Coulomb repulsion between the lysine side chains and the reduction of solvent accessible hydrophobic surface area upon aggregation. At the vacuum/water interface, however, even dimers of LKα14 become marginally stable, and any larger aggregate falls apart instantaneously. Our results indicate that even though the interface is highly influential in stabilizing the α-helix conformation for a single molecule, it significantly reduces the attraction between two LKα14 peptides, along with their aggregation tendency.
Reading Out Single-Molecule Digital RNA and DNA Isothermal Amplification in Nanoliter Volumes with Unmodified Camera Phones

PubMed Central

2016-01-01

Digital single-molecule technologies are expanding diagnostic capabilities, enabling the ultrasensitive quantification of targets, such as viral load in HIV and hepatitis C infections, by directly counting single molecules. Replacing fluorescent readout with a robust visual readout that can be captured by any unmodified cell phone camera will facilitate the global distribution of diagnostic tests, including in limited-resource settings where the need is greatest. This paper describes a methodology for developing a visual readout system for digital single-molecule amplification of RNA and DNA by (i) selecting colorimetric amplification-indicator dyes that are compatible with the spectral sensitivity of standard mobile phones, and (ii) identifying an optimal ratiometric image-process for a selected dye to achieve a readout that is robust to lighting conditions and camera hardware and provides unambiguous quantitative results, even for colorblind users. We also include an analysis of the limitations of this methodology, and provide a microfluidic approach that can be applied to expand dynamic range and improve reaction performance, allowing ultrasensitive, quantitative measurements at volumes as low as 5 nL. We validate this methodology using SlipChip-based digital single-molecule isothermal amplification with λDNA as a model and hepatitis C viral RNA as a clinically relevant target. The innovative combination of isothermal amplification chemistry in the presence of a judiciously chosen indicator dye and ratiometric image processing with SlipChip technology allowed the sequence-specific visual readout of single nucleic acid molecules in nanoliter volumes with an unmodified cell phone camera. When paired with devices that integrate sample preparation and nucleic acid amplification, this hardware-agnostic approach will increase the affordability and the distribution of quantitative diagnostic and environmental tests. PMID:26900709
A Single-Molecule Barcoding System using Nanoslits for DNA Analysis

NASA Astrophysics Data System (ADS)

Jo, Kyubong; Schramm, Timothy M.; Schwartz, David C.

Single DNA molecule approaches are playing an increasingly central role in the analytical genomic sciences because single molecule techniques intrinsically provide individualized measurements of selected molecules, free from the constraints of bulk techniques, which blindly average noise and mask the presence of minor analyte components. Accordingly, a principal challenge that must be addressed by all single molecule approaches aimed at genome analysis is how to immobilize and manipulate DNA molecules for measurements that foster construction of large, biologically relevant data sets. For meeting this challenge, this chapter discusses an integrated approach for microfabricated and nanofabricated devices for the manipulation of elongated DNA molecules within nanoscale geometries. Ideally, large DNA coils stretch via nanoconfinement when channel dimensions are within tens of nanometers. Importantly, stretched, often immobilized, DNA molecules spanning hundreds of kilobase pairs are required by all analytical platforms working with large genomic substrates because imaging techniques acquire sequence information from molecules that normally exist in free solution as unrevealing random coils resembling floppy balls of yarn. However, nanoscale devices fabricated with sufficiently small dimensions fostering molecular stretching make these devices impractical because of the requirement of exotic fabrication technologies, costly materials, and poor operational efficiencies. In this chapter, such problems are addressed by discussion of a new approach to DNA presentation and analysis that establishes scaleable nanoconfinement conditions through reduction of ionic strength; stiffening DNA molecules thus enabling their arraying for analysis using easily fabricated devices that can also be mass produced. This new approach to DNA nanoconfinement is complemented by the development of a novel labeling scheme for reliable marking of individual molecules with fluorochrome labels, creating molecular barcodes, which are efficiently read using fluorescence resonance energy transfer techniques for minimizing noise from unincorporated labels. As such, our integrative approach for the realization of genomic analysis through nanoconfinement, named nanocoding, was demonstrated through the barcoding and mapping of bacterial artificial chromosomal molecules, thereby providing the basis for a high-throughput platform competent for whole genome investigations.
Denoising DNA deep sequencing data—high-throughput sequencing errors and their correction

PubMed Central

Laehnemann, David; Borkhardt, Arndt

2016-01-01

Characterizing the errors generated by common high-throughput sequencing platforms and telling true genetic variation from technical artefacts are two interdependent steps, essential to many analyses such as single nucleotide variant calling, haplotype inference, sequence assembly and evolutionary studies. Both random and systematic errors can show a specific occurrence profile for each of the six prominent sequencing platforms surveyed here: 454 pyrosequencing, Complete Genomics DNA nanoball sequencing, Illumina sequencing by synthesis, Ion Torrent semiconductor sequencing, Pacific Biosciences single-molecule real-time sequencing and Oxford Nanopore sequencing. There is a large variety of programs available for error removal in sequencing read data, which differ in the error models and statistical techniques they use, the features of the data they analyse, the parameters they determine from them and the data structures and algorithms they use. We highlight the assumptions they make and for which data types these hold, providing guidance which tools to consider for benchmarking with regard to the data properties. While no benchmarking results are included here, such specific benchmarks would greatly inform tool choices and future software development. The development of stand-alone error correctors, as well as single nucleotide variant and haplotype callers, could also benefit from using more of the knowledge about error profiles and from (re)combining ideas from the existing approaches presented here. PMID:26026159

DNA unzipping phase diagram calculated via replica theory.

PubMed

Roland, C Brian; Hatch, Kristi Adamson; Prentiss, Mara; Shakhnovich, Eugene I

2009-05-01

We show how single-molecule unzipping experiments can provide strong evidence that the zero-force melting transition of long molecules of natural dsDNA should be classified as a phase transition of the higher-order type (continuous). Toward this end, we study a statistical-mechanics model for the fluctuating structure of a long molecule of dsDNA, and compute the equilibrium phase diagram for the experiment in which the molecule is unzipped under applied force. We consider a perfect-matching dsDNA model, in which the loops are volume-excluding chains with arbitrary loop exponent c . We include stacking interactions, hydrogen bonds, and main-chain entropy. We include sequence heterogeneity at the level of random sequences; in particular, there is no correlation in the base-pairing (bp) energy from one sequence position to the next. We present heuristic arguments to demonstrate that the low-temperature macrostate does not exhibit degenerate ergodicity breaking. We use this claim to understand the results of our replica-theoretic calculation of the equilibrium properties of the system. As a function of temperature, we obtain the minimal force at which the molecule separates completely. This critical-force curve is a line in the temperature-force phase diagram that marks the regions where the molecule exists primarily as a double helix versus the region where the molecule exists as two separate strands. We compare our random-sequence model to magnetic tweezer experiments performed on the 48 502 bp genome of bacteriophage lambda . We find good agreement with the experimental data, which is restricted to temperatures between 24 and 50 degrees C . At higher temperatures, the critical-force curve of our random-sequence model is very different for that of the homogeneous-sequence version of our model. For both sequence models, the critical force falls to zero at the melting temperature T_{c} like |T-T_{c}|;{alpha} . For the homogeneous-sequence model, alpha=1/2 almost exactly, while for the random-sequence model, alpha approximately 0.9 . Importantly, the shape of the critical-force curve is connected, via our theory, to the manner in which the helix fraction falls to zero at T_{c} . The helix fraction is the property that is used to classify the melting transition as a type of phase transition. In our calculation, the shape of the critical-force curve holds strong evidence that the zero-force melting transition of long natural dsDNA should be classified as a higher-order (continuous) phase transition. Specifically, the order is 3rd or greater.
Complete Genome Sequence of ER2796, a DNA Methyltransferase-Deficient Strain of Escherichia coli K-12.

PubMed

Anton, Brian P; Mongodin, Emmanuel F; Agrawal, Sonia; Fomenkov, Alexey; Byrd, Devon R; Roberts, Richard J; Raleigh, Elisabeth A

2015-01-01

We report the complete sequence of ER2796, a laboratory strain of Escherichia coli K-12 that is completely defective in DNA methylation. Because of its lack of any native methylation, it is extremely useful as a host into which heterologous DNA methyltransferase genes can be cloned and the recognition sequences of their products deduced by Pacific Biosciences Single-Molecule Real Time (SMRT) sequencing. The genome was itself sequenced from a long-insert library using the SMRT platform, resulting in a single closed contig devoid of methylated bases. Comparison with K-12 MG1655, the first E. coli K-12 strain to be sequenced, shows an essentially co-linear relationship with no major rearrangements despite many generations of laboratory manipulation. The comparison revealed a total of 41 insertions and deletions, and 228 single base pair substitutions. In addition, the long-read approach facilitated the surprising discovery of four gene conversion events, three involving rRNA operons and one between two cryptic prophages. Such events thus contribute both to genomic homogenization and to bacteriophage diversification. As one of relatively few laboratory strains of E. coli to be sequenced, the genome also reveals the sequence changes underlying a number of classical mutant alleles including those affecting the various native DNA methylation systems.
Complete Genome Sequence of ER2796, a DNA Methyltransferase-Deficient Strain of Escherichia coli K-12

PubMed Central

Anton, Brian P.; Mongodin, Emmanuel F.; Agrawal, Sonia; Fomenkov, Alexey; Byrd, Devon R.; Roberts, Richard J.; Raleigh, Elisabeth A.

2015-01-01

We report the complete sequence of ER2796, a laboratory strain of Escherichia coli K-12 that is completely defective in DNA methylation. Because of its lack of any native methylation, it is extremely useful as a host into which heterologous DNA methyltransferase genes can be cloned and the recognition sequences of their products deduced by Pacific Biosciences Single-Molecule Real Time (SMRT) sequencing. The genome was itself sequenced from a long-insert library using the SMRT platform, resulting in a single closed contig devoid of methylated bases. Comparison with K-12 MG1655, the first E. coli K-12 strain to be sequenced, shows an essentially co-linear relationship with no major rearrangements despite many generations of laboratory manipulation. The comparison revealed a total of 41 insertions and deletions, and 228 single base pair substitutions. In addition, the long-read approach facilitated the surprising discovery of four gene conversion events, three involving rRNA operons and one between two cryptic prophages. Such events thus contribute both to genomic homogenization and to bacteriophage diversification. As one of relatively few laboratory strains of E. coli to be sequenced, the genome also reveals the sequence changes underlying a number of classical mutant alleles including those affecting the various native DNA methylation systems. PMID:26010885
Single-Molecule Sequencing Reveals Complex Genome Variation of Hepatitis B Virus during 15 Years of Chronic Infection following Liver Transplantation

PubMed Central

Betz-Stablein, B. D.; Töpfer, A.; Littlejohn, M.; Yuen, L.; Colledge, D.; Sozzi, V.; Angus, P.; Thompson, A.; Revill, P.; Beerenwinkel, N.; Warner, N.

2016-01-01

ABSTRACT Chronic hepatitis B (CHB) is prevalent worldwide. The infectious agent, hepatitis B virus (HBV), replicates via an RNA intermediate and is error prone, leading to the rapid generation of closely related but not identical viral variants, including those that can escape host immune responses and antiviral treatments. The complexity of CHB can be further enhanced by the presence of HBV variants with large deletions in the genome generated via splicing (spHBV variants). Although spHBV variants are incapable of autonomous replication, their replication is rescued by wild-type HBV. spHBV variants have been shown to enhance wild-type virus replication, and their prevalence increases with liver disease progression. Single-molecule deep sequencing was performed on whole HBV genomes extracted from samples, including the liver explant, longitudinally collected from a subject with CHB over a 15-year period after liver transplantation. By employing novel bioinformatics methods, this analysis showed that the dynamics of the viral population across a period of changing treatment regimens was complex. The spHBV variants detected in the liver explant remained present posttransplantation, and a highly diverse novel spHBV population as well as variants with multiple deletions in the pre-S genes emerged. The identification of novel mutations outside the HBV reverse transcriptase gene that co-occurred with known drug resistance-associated mutations highlights the relevance of using full-genome deep sequencing and supports the hypothesis that drug resistance involves interactions across the full length of the HBV genome. IMPORTANCE Single-molecule sequencing allowed the characterization, in unprecedented detail, of the evolution of HBV populations and offered unique insights into the dynamics of defective and spHBV variants following liver transplantation and complex treatment regimens. This analysis also showed the rapid adaptation of HBV populations to treatment regimens with evolving drug resistance phenotypes and evidence of purifying selection across the whole genome. Finally, the new open-source bioinformatics tools with the capacity to easily identify potential spliced variants from deep sequencing data are freely available. PMID:27252524
Direct, concurrent measurements of the forces and currents affecting DNA in a nanopore with comparable topography.

PubMed

Nelson, Edward M; Li, Hui; Timp, Gregory

2014-06-24

We report direct, concurrent measurements of the forces and currents associated with the translocation of a single-stranded DNA molecule tethered to the tip of an atomic force microscope (AFM) cantilever through synthetic pores with topagraphies comparable to the DNA. These measurements were performed to gauge the signal available for sequencing and the electric force required to impel a single molecule through synthetic nanopores ranging from 1.0 to 3.5 nm in diameter in silicon nitride membranes 6-10 nm thick. The measurements revealed that a molecule can slide relatively frictionlessly through a pore, but regular fluctuations are observed intermittently in the force (and the current) every 0.35-0.72 nm, which are attributed to individual nucleotides translating through the nanopore in a turnstile-like motion.
Nanopore arrays in a silicon membrane for parallel single-molecule detection: fabrication

NASA Astrophysics Data System (ADS)

Schmidt, Torsten; Zhang, Miao; Sychugov, Ilya; Roxhed, Niclas; Linnros, Jan

2015-08-01

Solid state nanopores enable translocation and detection of single bio-molecules such as DNA in buffer solutions. Here, sub-10 nm nanopore arrays in silicon membranes were fabricated by using electron-beam lithography to define etch pits and by using a subsequent electrochemical etching step. This approach effectively decouples positioning of the pores and the control of their size, where the pore size essentially results from the anodizing current and time in the etching cell. Nanopores with diameters as small as 7 nm, fully penetrating 300 nm thick membranes, were obtained. The presented fabrication scheme to form large arrays of nanopores is attractive for parallel bio-molecule sensing and DNA sequencing using optical techniques. In particular the signal-to-noise ratio is improved compared to other alternatives such as nitride membranes suffering from a high-luminescence background.
Nanopore arrays in a silicon membrane for parallel single-molecule detection: fabrication.

PubMed

Schmidt, Torsten; Zhang, Miao; Sychugov, Ilya; Roxhed, Niclas; Linnros, Jan

2015-08-07

Solid state nanopores enable translocation and detection of single bio-molecules such as DNA in buffer solutions. Here, sub-10 nm nanopore arrays in silicon membranes were fabricated by using electron-beam lithography to define etch pits and by using a subsequent electrochemical etching step. This approach effectively decouples positioning of the pores and the control of their size, where the pore size essentially results from the anodizing current and time in the etching cell. Nanopores with diameters as small as 7 nm, fully penetrating 300 nm thick membranes, were obtained. The presented fabrication scheme to form large arrays of nanopores is attractive for parallel bio-molecule sensing and DNA sequencing using optical techniques. In particular the signal-to-noise ratio is improved compared to other alternatives such as nitride membranes suffering from a high-luminescence background.
Single-Cell-Based Platform for Copy Number Variation Profiling through Digital Counting of Amplified Genomic DNA Fragments.

PubMed

Li, Chunmei; Yu, Zhilong; Fu, Yusi; Pang, Yuhong; Huang, Yanyi

2017-04-26

We develop a novel single-cell-based platform through digital counting of amplified genomic DNA fragments, named multifraction amplification (mfA), to detect the copy number variations (CNVs) in a single cell. Amplification is required to acquire genomic information from a single cell, while introducing unavoidable bias. Unlike prevalent methods that directly infer CNV profiles from the pattern of sequencing depth, our mfA platform denatures and separates the DNA molecules from a single cell into multiple fractions of a reaction mix before amplification. By examining the sequencing result of each fraction for a specific fragment and applying a segment-merge maximum likelihood algorithm to the calculation of copy number, we digitize the sequencing-depth-based CNV identification and thus provide a method that is less sensitive to the amplification bias. In this paper, we demonstrate a mfA platform through multiple displacement amplification (MDA) chemistry. When performing the mfA platform, the noise of MDA is reduced; therefore, the resolution of single-cell CNV identification can be improved to 100 kb. We can also determine the genomic region free of allelic drop-out with mfA platform, which is impossible for conventional single-cell amplification methods.
A simple procedure for parallel sequence analysis of both strands of 5'-labeled DNA.

PubMed

Razvi, F; Gargiulo, G; Worcel, A

1983-08-01

Ligation of a 5'-labeled DNA restriction fragment results in a circular DNA molecule carrying the two 32Ps at the reformed restriction site. Double digestions of the circular DNA with the original enzyme and a second restriction enzyme cleavage near the labeled site allows direct chemical sequencing of one 5'-labeled DNA strand. Similar double digestions, using an isoschizomer that cleaves differently at the 32P-labeled site, allows direct sequencing of the now 3'-labeled complementary DNA strand. It is possible to directly sequence both strands of cloned DNA inserts by using the above protocol and a multiple cloning site vector that provides the necessary restriction sites. The simultaneous and parallel visualization of both DNA strands eliminates sequence ambiguities. In addition, the labeled circular molecules are particularly useful for single-hit DNA cleavage studies and DNA footprint analysis. As an example, we show here an analysis of the micrococcal nuclease-induced breaks on the two strands of the somatic 5S RNA gene of Xenopus borealis, which suggests that the enzyme may recognize and cleave small AT-containing palindromes along the DNA helix.
Fluorescence In situ Hybridization: Cell-Based Genetic Diagnostic and Research Applications.

PubMed

Cui, Chenghua; Shu, Wei; Li, Peining

2016-01-01

Fluorescence in situ hybridization (FISH) is a macromolecule recognition technology based on the complementary nature of DNA or DNA/RNA double strands. Selected DNA strands incorporated with fluorophore-coupled nucleotides can be used as probes to hybridize onto the complementary sequences in tested cells and tissues and then visualized through a fluorescence microscope or an imaging system. This technology was initially developed as a physical mapping tool to delineate genes within chromosomes. Its high analytical resolution to a single gene level and high sensitivity and specificity enabled an immediate application for genetic diagnosis of constitutional common aneuploidies, microdeletion/microduplication syndromes, and subtelomeric rearrangements. FISH tests using panels of gene-specific probes for somatic recurrent losses, gains, and translocations have been routinely applied for hematologic and solid tumors and are one of the fastest-growing areas in cancer diagnosis. FISH has also been used to detect infectious microbias and parasites like malaria in human blood cells. Recent advances in FISH technology involve various methods for improving probe labeling efficiency and the use of super resolution imaging systems for direct visualization of intra-nuclear chromosomal organization and profiling of RNA transcription in single cells. Cas9-mediated FISH (CASFISH) allowed in situ labeling of repetitive sequences and single-copy sequences without the disruption of nuclear genomic organization in fixed or living cells. Using oligopaint-FISH and super-resolution imaging enabled in situ visualization of chromosome haplotypes from differentially specified single-nucleotide polymorphism loci. Single molecule RNA FISH (smRNA-FISH) using combinatorial labeling or sequential barcoding by multiple round of hybridization were applied to measure mRNA expression of multiple genes within single cells. Research applications of these single molecule single cells DNA and RNA FISH techniques have visualized intra-nuclear genomic structure and sub-cellular transcriptional dynamics of many genes and revealed their functions in various biological processes.
The methylome and virulence of bovine respiratory disease bacterial pathogens

USDA-ARS?s Scientific Manuscript database

With the advent of single molecule, real-time (SMRT®) sequencing, it is now possible to study complete microbial epigenomes. It has been known for decades that methylation and other types of epigenetic modifications in bacteria are responsible for much more than restriction-modification mechanics, b...
Accurate multiplex polony sequencing of an evolved bacterial genome.

PubMed

Shendure, Jay; Porreca, Gregory J; Reppas, Nikos B; Lin, Xiaoxia; McCutcheon, John P; Rosenbaum, Abraham M; Wang, Michael D; Zhang, Kun; Mitra, Robi D; Church, George M

2005-09-09

We describe a DNA sequencing technology in which a commonly available, inexpensive epifluorescence microscope is converted to rapid nonelectrophoretic DNA sequencing automation. We apply this technology to resequence an evolved strain of Escherichia coli at less than one error per million consensus bases. A cell-free, mate-paired library provided single DNA molecules that were amplified in parallel to 1-micrometer beads by emulsion polymerase chain reaction. Millions of beads were immobilized in a polyacrylamide gel and subjected to automated cycles of sequencing by ligation and four-color imaging. Cost per base was roughly one-ninth as much as that of conventional sequencing. Our protocols were implemented with off-the-shelf instrumentation and reagents.
Probes labelled with energy transfer coupled dyes

DOEpatents

Mathies, R.A.; Glazer, A.; Ju, J.

1997-11-18

Compositions are provided comprising sets of fluorescent labels carrying pairs of donor and acceptor dye molecules, designed for efficient excitation of the donors at a single wavelength and emission from the acceptor in each of the pairs at different wavelengths. The different molecules having different donor-acceptor pairs can be modified to have substantially the same mobility under separation conditions, by varying the distance between the donor and acceptor in a given pair. Particularly, the fluorescent compositions find use as labels in sequencing nucleic acids. 7 figs.
Fluorescent labels and their use in separations

DOEpatents

Mathies, Richard A.; Glazer, Alexander; Ju, Jingyue

1997-01-01

Compositions are provided comprising sets of fluorescent labels carrying pairs of donor and acceptor dye molecules, designed for efficient excitation of the donors at a single wavelength and emission from the acceptor in each of the pairs at different wavelengths. The different molecules having different donor-acceptor pairs can be modified to have substantially the same mobility under separation conditions, by varying the distance between the donor and acceptor in a given pair. Particularly, the fluorescent compositions find use as labels in sequencing nucleic acids.
Probes labelled with energy transfer coupled dyes

DOEpatents

Mathies, Richard A.; Glazer, Alexander; Ju, Jingyue

1997-01-01

Compositions are provided comprising sets of fluorescent labels carrying pairs of donor and acceptor dye molecules, designed for efficient excitation of the donors at a single wavelength and emission from the acceptor in each of the pairs at different wavelengths. The different molecules having different donor-acceptor pairs can be modified to have substantially the same mobility under separation conditions, by varying the distance between the donor and acceptor in a given pair. Particularly, the fluorescent compositions find use as labels in sequencing nucleic acids.
Schemes of detecting nuclear spin correlations by dynamical decoupling based quantum sensing

NASA Astrophysics Data System (ADS)

Ma, Wen-Long Ma; Liu, Ren-Bao

Single-molecule sensitivity of nuclear magnetic resonance (NMR) and angstrom resolution of magnetic resonance imaging (MRI) are the highest challenges in magnetic microscopy. Recent development in dynamical decoupling (DD) enhanced diamond quantum sensing has enabled NMR of single nuclear spins and nanoscale NMR. Similar to conventional NMR and MRI, current DD-based quantum sensing utilizes the frequency fingerprints of target nuclear spins. Such schemes, however, cannot resolve different nuclear spins that have the same noise frequency or differentiate different types of correlations in nuclear spin clusters. Here we show that the first limitation can be overcome by using wavefunction fingerprints of target nuclear spins, which is much more sensitive than the ''frequency fingerprints'' to weak hyperfine interaction between the targets and a sensor, while the second one can be overcome by a new design of two-dimensional DD sequences composed of two sets of periodic DD sequences with different periods, which can be independently set to match two different transition frequencies. Our schemes not only offer an approach to breaking the resolution limit set by ''frequency gradients'' in conventional MRI, but also provide a standard approach to correlation spectroscopy for single-molecule NMR.
LongISLND: in silico sequencing of lengthy and noisy datatypes

PubMed Central

Lau, Bayo; Mohiyuddin, Marghoob; Mu, John C.; Fang, Li Tai; Bani Asadi, Narges; Dallett, Carolina; Lam, Hugo Y. K.

2016-01-01

Summary: LongISLND is a software package designed to simulate sequencing data according to the characteristics of third generation, single-molecule sequencing technologies. The general software architecture is easily extendable, as demonstrated by the emulation of Pacific Biosciences (PacBio) multi-pass sequencing with P5 and P6 chemistries, producing data in FASTQ, H5, and the latest PacBio BAM format. We demonstrate its utility by downstream processing with consensus building and variant calling. Availability and Implementation: LongISLND is implemented in Java and available at http://bioinform.github.io/longislnd Contact: hugo.lam@roche.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27667791
Protein mechanics: from single molecules to functional biomaterials.

PubMed

Li, Hongbin; Cao, Yi

2010-10-19

Elastomeric proteins act as the essential functional units in a wide variety of biomechanical machinery and serve as the basic building blocks for biological materials that exhibit superb mechanical properties. These proteins provide the desired elasticity, mechanical strength, resilience, and toughness within these materials. Understanding the mechanical properties of elastomeric protein-based biomaterials is a multiscale problem spanning from the atomistic/molecular level to the macroscopic level. Uncovering the design principles of individual elastomeric building blocks is critical both for the scientific understanding of multiscale mechanics of biomaterials and for the rational engineering of novel biomaterials with desirable mechanical properties. The development of single-molecule force spectroscopy techniques has provided methods for characterizing mechanical properties of elastomeric proteins one molecule at a time. Single-molecule atomic force microscopy (AFM) is uniquely suited to this purpose. Molecular dynamic simulations, protein engineering techniques, and single-molecule AFM study have collectively revealed tremendous insights into the molecular design of single elastomeric proteins, which can guide the design and engineering of elastomeric proteins with tailored mechanical properties. Researchers are focusing experimental efforts toward engineering artificial elastomeric proteins with mechanical properties that mimic or even surpass those of natural elastomeric proteins. In this Account, we summarize our recent experimental efforts to engineer novel artificial elastomeric proteins and develop general and rational methodologies to tune the nanomechanical properties of elastomeric proteins at the single-molecule level. We focus on general design principles used for enhancing the mechanical stability of proteins. These principles include the development of metal-chelation-based general methodology, strategies to control the unfolding hierarchy of multidomain elastomeric proteins, and the design of novel elastomeric proteins that exhibit stimuli-responsive mechanical properties. Moving forward, we are now exploring the use of these artificial elastomeric proteins as building blocks of protein-based biomaterials. Ultimately, we would like to rationally tailor mechanical properties of elastomeric protein-based materials by programming the molecular sequence, and thus nanomechanical properties, of elastomeric proteins at the single-molecule level. This step would help bridge the gap between single protein mechanics and material biomechanics, revealing how the mechanical properties of individual elastomeric proteins are translated into the properties of macroscopic materials.
In silico Derivation of HLA-Specific Alloreactivity Potential from Whole Exome Sequencing of Stem-Cell Transplant Donors and Recipients: Understanding the Quantitative Immunobiology of Allogeneic Transplantation

PubMed Central

Jameson-Lee, Max; Koparde, Vishal; Griffith, Phil; Scalora, Allison F.; Sampson, Juliana K.; Khalid, Haniya; Sheth, Nihar U.; Batalo, Michael; Serrano, Myrna G.; Roberts, Catherine H.; Hess, Michael L.; Buck, Gregory A.; Neale, Michael C.; Manjili, Masoud H.; Toor, Amir Ahmed

2014-01-01

Donor T-cell mediated graft versus host (GVH) effects may result from the aggregate alloreactivity to minor histocompatibility antigens (mHA) presented by the human leukocyte antigen (HLA) molecules in each donor–recipient pair undergoing stem-cell transplantation (SCT). Whole exome sequencing has previously demonstrated a large number of non-synonymous single nucleotide polymorphisms (SNP) present in HLA-matched recipients of SCT donors (GVH direction). The nucleotide sequence flanking each of these SNPs was obtained and the amino acid sequence determined. All the possible nonameric peptides incorporating the variant amino acid resulting from these SNPs were interrogated in silico for their likelihood to be presented by the HLA class I molecules using the Immune Epitope Database stabilized matrix method (SMM) and NetMHCpan algorithms. The SMM algorithm predicted that a median of 18,396 peptides weakly bound HLA class I molecules in individual SCT recipients, and 2,254 peptides displayed strong binding. A similar library of presented peptides was identified when the data were interrogated using the NetMHCpan algorithm. The bioinformatic algorithm presented here demonstrates that there may be a high level of mHA variation in HLA-matched individuals, constituting a HLA-specific alloreactivity potential. PMID:25414699
In silico Derivation of HLA-Specific Alloreactivity Potential from Whole Exome Sequencing of Stem-Cell Transplant Donors and Recipients: Understanding the Quantitative Immunobiology of Allogeneic Transplantation.

PubMed

Jameson-Lee, Max; Koparde, Vishal; Griffith, Phil; Scalora, Allison F; Sampson, Juliana K; Khalid, Haniya; Sheth, Nihar U; Batalo, Michael; Serrano, Myrna G; Roberts, Catherine H; Hess, Michael L; Buck, Gregory A; Neale, Michael C; Manjili, Masoud H; Toor, Amir Ahmed

2014-01-01

Donor T-cell mediated graft versus host (GVH) effects may result from the aggregate alloreactivity to minor histocompatibility antigens (mHA) presented by the human leukocyte antigen (HLA) molecules in each donor-recipient pair undergoing stem-cell transplantation (SCT). Whole exome sequencing has previously demonstrated a large number of non-synonymous single nucleotide polymorphisms (SNP) present in HLA-matched recipients of SCT donors (GVH direction). The nucleotide sequence flanking each of these SNPs was obtained and the amino acid sequence determined. All the possible nonameric peptides incorporating the variant amino acid resulting from these SNPs were interrogated in silico for their likelihood to be presented by the HLA class I molecules using the Immune Epitope Database stabilized matrix method (SMM) and NetMHCpan algorithms. The SMM algorithm predicted that a median of 18,396 peptides weakly bound HLA class I molecules in individual SCT recipients, and 2,254 peptides displayed strong binding. A similar library of presented peptides was identified when the data were interrogated using the NetMHCpan algorithm. The bioinformatic algorithm presented here demonstrates that there may be a high level of mHA variation in HLA-matched individuals, constituting a HLA-specific alloreactivity potential.

Computational Approaches for Decoding Select Odorant-Olfactory Receptor Interactions Using Mini-Virtual Screening

PubMed Central

Harini, K.; Sowdhamini, Ramanathan

2015-01-01

Olfactory receptors (ORs) belong to the class A G-Protein Coupled Receptor superfamily of proteins. Unlike G-Protein Coupled Receptors, ORs exhibit a combinatorial response to odors/ligands. ORs display an affinity towards a range of odor molecules rather than binding to a specific set of ligands and conversely a single odorant molecule may bind to a number of olfactory receptors with varying affinities. The diversity in odor recognition is linked to the highly variable transmembrane domains of these receptors. The purpose of this study is to decode the odor-olfactory receptor interactions using in silico docking studies. In this study, a ligand (odor molecules) dataset of 125 molecules was used to carry out in silico docking using the GLIDE docking tool (SCHRODINGER Inc Pvt LTD). Previous studies, with smaller datasets of ligands, have shown that orthologous olfactory receptors respond to similarly-tuned ligands, but are dramatically different in their efficacy and potency. Ligand docking results were applied on homologous pairs (with varying sequence identity) of ORs from human and mouse genomes and ligand binding residues and the ligand profile differed among such related olfactory receptor sequences. This study revealed that homologous sequences with high sequence identity need not bind to the same/ similar ligand with a given affinity. A ligand profile has been obtained for each of the 20 receptors in this analysis which will be useful for expression and mutation studies on these receptors. PMID:26221959
Deep sequencing is an appropriate tool for the selection of unique Hepatitis C virus (HCV) variants after single genomic amplification.

PubMed

Guinoiseau, Thibault; Moreau, Alain; Hohnadel, Guillaume; Ngo-Giang-Huong, Nicole; Brulard, Celine; Vourc'h, Patrick; Goudeau, Alain; Gaudy-Graffin, Catherine

2017-01-01

Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus's but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies.
Deep sequencing is an appropriate tool for the selection of unique Hepatitis C virus (HCV) variants after single genomic amplification

PubMed Central

Guinoiseau, Thibault; Moreau, Alain; Hohnadel, Guillaume; Ngo-Giang-Huong, Nicole; Brulard, Celine; Vourc’h, Patrick; Goudeau, Alain; Gaudy-Graffin, Catherine

2017-01-01

Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus’s but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies. PMID:28362878
Sequencing of adenine in DNA by scanning tunneling microscopy

NASA Astrophysics Data System (ADS)

Tanaka, Hiroyuki; Taniguchi, Masateru

2017-08-01

The development of DNA sequencing technology utilizing the detection of a tunnel current is important for next-generation sequencer technologies based on single-molecule analysis technology. Using a scanning tunneling microscope, we previously reported that dI/dV measurements and dI/dV mapping revealed that the guanine base (purine base) of DNA adsorbed onto the Cu(111) surface has a characteristic peak at V s = -1.6 V. If, in addition to guanine, the other purine base of DNA, namely, adenine, can be distinguished, then by reading all the purine bases of each single strand of a DNA double helix, the entire base sequence of the original double helix can be determined due to the complementarity of the DNA base pair. Therefore, the ability to read adenine is important from the viewpoint of sequencing. Here, we report on the identification of adenine by STM topographic and spectroscopic measurements using a synthetic DNA oligomer and viral DNA.
Recombination-dependent replication and gene conversion homogenize repeat sequences and diversify plastid genome structure.

PubMed

Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K

2017-04-01

There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra-and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.
Unveiling the complexity of the maize transcriptome by single-molecule long-read sequencing

USDA-ARS?s Scientific Manuscript database

Zea mays is an important crop species and genetic model for elucidating transcriptional networks in plants. Uncertainties about the complete structure of mRNA transcripts, particularly with respect to alternatively spliced isoforms, limit the progress of research in this system. In this study, we us...
CoSMoS Unravels Mysteries of Transcription Initiation

PubMed Central

Gourse, Richard L.; Landick, Robert

2013-01-01

Using a fluorescence method called colocalization single-molecule spectroscopy (CoSMoS), Friedman and Gelles dissect the kinetics of transcription initiation at a bacterial promoter. Ultimately, CoSMoS could greatly aid the study of the effects of DNA sequence and transcription factors on both prokaryotic and eukaryotic promoters. PMID:22341438
Quantification of differential gene expression by multiplexed targeted resequencing of cDNA

PubMed Central

Arts, Peer; van der Raadt, Jori; van Gestel, Sebastianus H.C.; Steehouwer, Marloes; Shendure, Jay; Hoischen, Alexander; Albers, Cornelis A.

2017-01-01

Whole-transcriptome or RNA sequencing (RNA-Seq) is a powerful and versatile tool for functional analysis of different types of RNA molecules, but sample reagent and sequencing cost can be prohibitive for hypothesis-driven studies where the aim is to quantify differential expression of a limited number of genes. Here we present an approach for quantification of differential mRNA expression by targeted resequencing of complementary DNA using single-molecule molecular inversion probes (cDNA-smMIPs) that enable highly multiplexed resequencing of cDNA target regions of ∼100 nucleotides and counting of individual molecules. We show that accurate estimates of differential expression can be obtained from molecule counts for hundreds of smMIPs per reaction and that smMIPs are also suitable for quantification of relative gene expression and allele-specific expression. Compared with low-coverage RNA-Seq and a hybridization-based targeted RNA-Seq method, cDNA-smMIPs are a cost-effective high-throughput tool for hypothesis-driven expression analysis in large numbers of genes (10 to 500) and samples (hundreds to thousands). PMID:28474677
Mapping the yeast genome by melting in nanofluidic devices

NASA Astrophysics Data System (ADS)

Welch, Robert L.; Czolkos, Ilja; Sladek, Rob; Reisner, Walter

2012-02-01

Optical mapping of DNA provides large-scale genomic information that can be used to assemble contigs from next-generation sequencing, and to detect re-arrangements between single cells. A recent optical mapping technique called denaturation mapping has the unique advantage of using physical principles rather than the action of enzymes to probe genomic structure. The absence of reagents or reaction steps makes denaturation mapping simpler than other protocols. Denaturation mapping uses fluorescence microscopy to image the pattern of partial melting along a DNA molecule extended in a channel of cross-section ˜100nm at the heart of a nanofluidic device. We successfully aligned melting maps from single DNA molecules to a theoretical map of the yeast genome (11.6Mbp) to identify their location. By aligning hundreds of molecules we assembled a consensus melting map of the yeast genome with 95% coverage.
A clone-free, single molecule map of the domestic cow (Bos taurus) genome.

PubMed

Zhou, Shiguo; Goldstein, Steve; Place, Michael; Bechner, Michael; Patino, Diego; Potamousis, Konstantinos; Ravindran, Prabu; Pape, Louise; Rincon, Gonzalo; Hernandez-Ortiz, Juan; Medrano, Juan F; Schwartz, David C

2015-08-28

The cattle (Bos taurus) genome was originally selected for sequencing due to its economic importance and unique biology as a model organism for understanding other ruminants, or mammals. Currently, there are two cattle genome sequence assemblies (UMD3.1 and Btau4.6) from groups using dissimilar assembly algorithms, which were complemented by genetic and physical map resources. However, past comparisons between these assemblies revealed substantial differences. Consequently, such discordances have engendered ambiguities when using reference sequence data, impacting genomic studies in cattle and motivating construction of a new optical map resource--BtOM1.0--to guide comparisons and improvements to the current sequence builds. Accordingly, our comprehensive comparisons of BtOM1.0 against the UMD3.1 and Btau4.6 sequence builds tabulate large-to-immediate scale discordances requiring mediation. The optical map, BtOM1.0, spanning the B. taurus genome (Hereford breed, L1 Dominette 01449) was assembled from an optical map dataset consisting of 2,973,315 (439 X; raw dataset size before assembly) single molecule optical maps (Rmaps; 1 Rmap = 1 restriction mapped DNA molecule) generated by the Optical Mapping System. The BamHI map spans 2,575.30 Mb and comprises 78 optical contigs assembled by a combination of iterative (using the reference sequence: UMD3.1) and de novo assembly techniques. BtOM1.0 is a high-resolution physical map featuring an average restriction fragment size of 8.91 Kb. Comparisons of BtOM1.0 vs. UMD3.1, or Btau4.6, revealed that Btau4.6 presented far more discordances (7,463) vs. UMD3.1 (4,754). Overall, we found that Btau4.6 presented almost double the number of discordances than UMD3.1 across most of the 6 categories of sequence vs. map discrepancies, which are: COMPLEX (misassembly), DELs (extraneous sequences), INSs (missing sequences), ITs (Inverted/Translocated sequences), ECs (extra restriction cuts) and MCs (missing restriction cuts). Alignments of UMD3.1 and Btau4.6 to BtOM1.0 reveal discordances commensurate with previous reports, and affirm the NCBI's current designation of UMD3.1 sequence assembly as the "reference assembly" and the Btau4.6 as the "alternate assembly." The cattle genome optical map, BtOM1.0, when used as a comprehensive and largely independent guide, will greatly assist improvements to existing sequence builds, and later serve as an accurate physical scaffold for studies concerning the comparative genomics of cattle breeds.
Noise reduction in single time frame optical DNA maps

PubMed Central

Müller, Vilhelm; Westerlund, Fredrik

2017-01-01

In optical DNA mapping technologies sequence-specific intensity variations (DNA barcodes) along stretched and stained DNA molecules are produced. These “fingerprints” of the underlying DNA sequence have a resolution of the order one kilobasepairs and the stretching of the DNA molecules are performed by surface adsorption or nano-channel setups. A post-processing challenge for nano-channel based methods, due to local and global random movement of the DNA molecule during imaging, is how to align different time frames in order to produce reproducible time-averaged DNA barcodes. The current solutions to this challenge are computationally rather slow. With high-throughput applications in mind, we here introduce a parameter-free method for filtering a single time frame noisy barcode (snap-shot optical map), measured in a fraction of a second. By using only a single time frame barcode we circumvent the need for post-processing alignment. We demonstrate that our method is successful at providing filtered barcodes which are less noisy and more similar to time averaged barcodes. The method is based on the application of a low-pass filter on a single noisy barcode using the width of the Point Spread Function of the system as a unique, and known, filtering parameter. We find that after applying our method, the Pearson correlation coefficient (a real number in the range from -1 to 1) between the single time-frame barcode and the time average of the aligned kymograph increases significantly, roughly by 0.2 on average. By comparing to a database of more than 3000 theoretical plasmid barcodes we show that the capabilities to identify plasmids is improved by filtering single time-frame barcodes compared to the unfiltered analogues. Since snap-shot experiments and computational time using our method both are less than a second, this study opens up for high throughput optical DNA mapping with improved reproducibility. PMID:28640821
Sequence-Dependent Elasticity and Electrostatics of Single-Stranded DNA: Signatures of Base-Stacking

PubMed Central

McIntosh, Dustin B.; Duggan, Gina; Gouil, Quentin; Saleh, Omar A.

2014-01-01

Base-stacking is a key factor in the energetics that determines nucleic acid structure. We measure the tensile response of single-stranded DNA as a function of sequence and monovalent salt concentration to examine the effects of base-stacking on the mechanical and thermodynamic properties of single-stranded DNA. By comparing the elastic response of highly stacked poly(dA) and that of a polypyrimidine sequence with minimal stacking, we find that base-stacking in poly(dA) significantly enhances the polymer’s rigidity. The unstacking transition of poly(dA) at high force reveals that the intrinsic electrostatic tension on the molecule varies significantly more weakly on salt concentration than mean-field predictions. Further, we provide a model-independent estimate of the free energy difference between stacked poly(dA) and unstacked polypyrimidine, finding it to be ∼−0.25 kBT/base and nearly constant over three orders of magnitude in salt concentration. PMID:24507606
Self-sequencing of amino acids and origins of polyfunctional protocells

NASA Technical Reports Server (NTRS)

Fox, S. W.

1984-01-01

The role of proteins in the origin of living things is discussed. It has been experimentally established that amino acids can sequence themselves under simulated geological conditions with highly nonrandom products which accordingly contain diverse information. Multiple copies of each type of macromolecule are formed, resulting in greater power for any protoenzymic molecule than would accrue from a single copy of each type. Thermal proteins are readily incorporated into laboratory protocells. The experimental evidence for original polyfunctional protocells is discussed.
Two-Way Gold Nanoparticle Label-Free Sensing of Specific Sequence and Small Molecule Targets Using Switchable Concatemers.

PubMed

Zhu, Longjiao; Shao, Xiangli; Luo, Yunbo; Huang, Kunlung; Xu, Wentao

2017-05-19

A two-way colorimetric biosensor based on unmodified gold nanoparticles (GNPs) and a switchable double-stranded DNA (dsDNA) concatemer have been demonstrated. Two hairpin probes (H1 and H2) were first designed that provided the fuels to assemble the dsDNA concatemers via hybridization chain reaction (HCR). A functional hairpin (FH) was rationally designed to recognize the target sequences. All the hairpins contained a single-stranded DNA (ssDNA) loop and sticky end to prevent GNPs from salt-induced aggregation. In the presence of target sequence, the capture probe blocked in the FH recognizes the target to form a duplex DNA, which causes the release of the initiator probe by FH conformational change. This process then starts the alternate-opening of H1 and H2 through HCR, and dsDNA concatemers grow from the target sequence. As a result, unmodified GNPs undergo salt-induced aggregation because the formed dsDNA concatemers are stiffer and provide less stabilization. A light purple-to-blue color variation was observed in the bulk solution, termed the light-off sensing way. Furthermore, H1 ingeniously inserted an aptamer sequence to generate dsDNA concatemers with multiple small molecule binding sites. In the presence of small molecule targets, concatemers can be disassembled into mixtures with ssDNA sticky ends. A blue-to-purple reverse color variation was observed due to the regeneration of the ssDNA, termed the light-on way. The two-way biosensor can detect both nucleic acids and small molecule targets with one sensing device. This switchable sensing element is label-free, enzyme-free, and sophisticated-instrumentation-free. The detection limits of both targets were below nanomolar.
Direct Single-Molecule Observation of Mode and Geometry of RecA-Mediated Homology Search.

PubMed

Lee, Andrew J; Endo, Masayuki; Hobbs, Jamie K; Wälti, Christoph

2018-01-23

Genomic integrity, when compromised by accrued DNA lesions, is maintained through efficient repair via homologous recombination. For this process the ubiquitous recombinase A (RecA), and its homologues such as the human Rad51, are of central importance, able to align and exchange homologous sequences within single-stranded and double-stranded DNA in order to swap out defective regions. Here, we directly observe the widely debated mechanism of RecA homology searching at a single-molecule level using high-speed atomic force microscopy (HS-AFM) in combination with tailored DNA origami frames to present the reaction targets in a way suitable for AFM-imaging. We show that RecA nucleoprotein filaments move along DNA substrates via short-distance facilitated diffusions, or slides, interspersed with longer-distance random moves, or hops. Importantly, from the specific interaction geometry, we find that the double-stranded substrate DNA resides in the secondary DNA binding-site within the RecA nucleoprotein filament helical groove during the homology search. This work demonstrates that tailored DNA origami, in conjunction with HS-AFM, can be employed to reveal directly conformational and geometrical information on dynamic protein-DNA interactions which was previously inaccessible at an individual single-molecule level.
Nanopore arrays in a silicon membrane for parallel single-molecule detection: DNA translocation

NASA Astrophysics Data System (ADS)

Zhang, Miao; Schmidt, Torsten; Jemt, Anders; Sahlén, Pelin; Sychugov, Ilya; Lundeberg, Joakim; Linnros, Jan

2015-08-01

Optical nanopore sensing offers great potential in single-molecule detection, genotyping, or DNA sequencing for high-throughput applications. However, one of the bottle-necks for fluorophore-based biomolecule sensing is the lack of an optically optimized membrane with a large array of nanopores, which has large pore-to-pore distance, small variation in pore size and low background photoluminescence (PL). Here, we demonstrate parallel detection of single-fluorophore-labeled DNA strands (450 bps) translocating through an array of silicon nanopores that fulfills the above-mentioned requirements for optical sensing. The nanopore array was fabricated using electron beam lithography and anisotropic etching followed by electrochemical etching resulting in pore diameters down to ∼7 nm. The DNA translocation measurements were performed in a conventional wide-field microscope tailored for effective background PL control. The individual nanopore diameter was found to have a substantial effect on the translocation velocity, where smaller openings slow the translocation enough for the event to be clearly detectable in the fluorescence. Our results demonstrate that a uniform silicon nanopore array combined with wide-field optical detection is a promising alternative with which to realize massively-parallel single-molecule detection.
Fluorescence-based strategies to investigate the structure and dynamics of aptamer-ligand complexes

NASA Astrophysics Data System (ADS)

Perez-Gonzalez, Cibran; Lafontaine, Daniel; Penedo, J.

2016-08-01

In addition to the helical nature of double-stranded DNA and RNA, single-stranded oligonucleotides can arrange themselves into tridimensional structures containing loops, bulges, internal hairpins and many other motifs. This ability has been used for more than two decades to generate oligonucleotide sequences, so-called aptamers, that can recognize certain metabolites with high affinity and specificity. More recently, this library of artificially-generated nucleic acid aptamers has been expanded by the discovery that naturally occurring RNA sequences control bacterial gene expression in response to cellular concentration of a given metabolite. The application of fluorescence methods has been pivotal to characterize in detail the structure and dynamics of these aptamer-ligand complexes in solution. This is mostly due to the intrinsic high sensitivity of fluorescence methods and also to significant improvements in solid-phase synthesis, post-synthetic labelling strategies and optical instrumentation that took place during the last decade. In this work, we provide an overview of the most widely employed fluorescence methods to investigate aptamer structure and function by describing the use of aptamers labelled with a single dye in fluorescence quenching and anisotropy assays. The use of 2-aminopurine as a fluorescent analog of adenine to monitor local changes in structure and fluorescence resonance energy transfer (FRET) to follow long-range conformational changes is also covered in detail. The last part of the review is dedicated to the application of fluorescence techniques based on single-molecule microscopy, a technique that has revolutionized our understanding of nucleic acid structure and dynamics. We finally describe the advantages of monitoring ligand-binding and conformational changes, one molecule at a time, to decipher the complexity of regulatory aptamers and summarize the emerging folding and ligand-binding models arising from the application of these single-molecule FRET microscopy techniques.
Fluorescence-Based Strategies to Investigate the Structure and Dynamics of Aptamer-Ligand Complexes

PubMed Central

Perez-Gonzalez, Cibran; Lafontaine, Daniel A.; Penedo, J. Carlos

2016-01-01

In addition to the helical nature of double-stranded DNA and RNA, single-stranded oligonucleotides can arrange themselves into tridimensional structures containing loops, bulges, internal hairpins and many other motifs. This ability has been used for more than two decades to generate oligonucleotide sequences, so-called aptamers, that can recognize certain metabolites with high affinity and specificity. More recently, this library of artificially-generated nucleic acid aptamers has been expanded by the discovery that naturally occurring RNA sequences control bacterial gene expression in response to cellular concentration of a given metabolite. The application of fluorescence methods has been pivotal to characterize in detail the structure and dynamics of these aptamer-ligand complexes in solution. This is mostly due to the intrinsic high sensitivity of fluorescence methods and also to significant improvements in solid-phase synthesis, post-synthetic labeling strategies and optical instrumentation that took place during the last decade. In this work, we provide an overview of the most widely employed fluorescence methods to investigate aptamer structure and function by describing the use of aptamers labeled with a single dye in fluorescence quenching and anisotropy assays. The use of 2-aminopurine as a fluorescent analog of adenine to monitor local changes in structure and fluorescence resonance energy transfer (FRET) to follow long-range conformational changes is also covered in detail. The last part of the review is dedicated to the application of fluorescence techniques based on single-molecule microscopy, a technique that has revolutionized our understanding of nucleic acid structure and dynamics. We finally describe the advantages of monitoring ligand-binding and conformational changes, one molecule at a time, to decipher the complexity of regulatory aptamers and summarize the emerging folding and ligand-binding models arising from the application of these single-molecule FRET microscopy techniques. PMID:27536656
Assessing quality of Medicago sativa silage by monitoring bacterial composition with single molecule, real-time sequencing technology and various physiological parameters

PubMed Central

Bao, Weichen; Mi, Zhihui; Xu, Haiyan; Zheng, Yi; Kwok, Lai Yu; Zhang, Heping; Zhang, Wenyi

2016-01-01

The present study applied the PacBio single molecule, real-time sequencing technology (SMRT) in evaluating the quality of silage production. Specifically, we produced four types of Medicago sativa silages by using four different lactic acid bacteria-based additives (AD-I, AD-II, AD-III and AD-IV). We monitored the changes in pH, organic acids (including butyric acid, the ratio of acetic acid/lactic acid, γ-aminobutyric acid, 4-hyroxy benzoic acid and phenyl lactic acid), mycotoxins, and bacterial microbiota during silage fermentation. Our results showed that the use of the additives was beneficial to the silage fermentation by enhancing a general pH and mycotoxin reduction, while increasing the organic acids content. By SMRT analysis of the microbial composition in eight silage samples, we found that the bacterial species number and relative abundances shifted apparently after fermentation. Such changes were specific to the LAB species in the additives. Particularly, Bacillus megaterium was the initial dominant species in the raw materials; and after the fermentation process, Pediococcus acidilactici and Lactobacillus plantarum became the most prevalent species, both of which were intrinsically present in the LAB additives. Our data have demonstrated that the SMRT sequencing platform is applicable in assessing the quality of silage. PMID:27340760
Assessing quality of Medicago sativa silage by monitoring bacterial composition with single molecule, real-time sequencing technology and various physiological parameters.

PubMed

Bao, Weichen; Mi, Zhihui; Xu, Haiyan; Zheng, Yi; Kwok, Lai Yu; Zhang, Heping; Zhang, Wenyi

2016-06-24

The present study applied the PacBio single molecule, real-time sequencing technology (SMRT) in evaluating the quality of silage production. Specifically, we produced four types of Medicago sativa silages by using four different lactic acid bacteria-based additives (AD-I, AD-II, AD-III and AD-IV). We monitored the changes in pH, organic acids (including butyric acid, the ratio of acetic acid/lactic acid, γ-aminobutyric acid, 4-hyroxy benzoic acid and phenyl lactic acid), mycotoxins, and bacterial microbiota during silage fermentation. Our results showed that the use of the additives was beneficial to the silage fermentation by enhancing a general pH and mycotoxin reduction, while increasing the organic acids content. By SMRT analysis of the microbial composition in eight silage samples, we found that the bacterial species number and relative abundances shifted apparently after fermentation. Such changes were specific to the LAB species in the additives. Particularly, Bacillus megaterium was the initial dominant species in the raw materials; and after the fermentation process, Pediococcus acidilactici and Lactobacillus plantarum became the most prevalent species, both of which were intrinsically present in the LAB additives. Our data have demonstrated that the SMRT sequencing platform is applicable in assessing the quality of silage.

Molecular Bases of cyclodextrin Adapter Interactions with Engineered Protein Nanopores

DOE Office of Scientific and Technical Information (OSTI.GOV)

Banerjee, A.; Mikhailova, E; Cheley, S

2010-01-01

Engineered protein pores have several potential applications in biotechnology: as sensor elements in stochastic detection and ultrarapid DNA sequencing, as nanoreactors to observe single-molecule chemistry, and in the construction of nano- and micro-devices. One important class of pores contains molecular adapters, which provide internal binding sites for small molecules. Mutants of the {alpha}-hemolysin ({alpha}HL) pore that bind the adapter {beta}-cyclodextrin ({beta}CD) {approx}10{sup 4} times more tightly than the wild type have been obtained. We now use single-channel electrical recording, protein engineering including unnatural amino acid mutagenesis, and high-resolution x-ray crystallography to provide definitive structural information on these engineered protein nanoporesmore » in unparalleled detail.« less
Two dimensional molecular electronics spectroscopy for molecular fingerprinting, DNA sequencing, and cancerous DNA recognition.

PubMed

Rajan, Arunkumar Chitteth; Rezapour, Mohammad Reza; Yun, Jeonghun; Cho, Yeonchoo; Cho, Woo Jong; Min, Seung Kyu; Lee, Geunsik; Kim, Kwang S

2014-02-25

Laser-driven molecular spectroscopy of low spatial resolution is widely used, while electronic current-driven molecular spectroscopy of atomic scale resolution has been limited because currents provide only minimal information. However, electron transmission of a graphene nanoribbon on which a molecule is adsorbed shows molecular fingerprints of Fano resonances, i.e., characteristic features of frontier orbitals and conformations of physisorbed molecules. Utilizing these resonance profiles, here we demonstrate two-dimensional molecular electronics spectroscopy (2D MES). The differential conductance with respect to bias and gate voltages not only distinguishes different types of nucleobases for DNA sequencing but also recognizes methylated nucleobases which could be related to cancerous cell growth. This 2D MES could open an exciting field to recognize single molecule signatures at atomic resolution. The advantages of the 2D MES over the one-dimensional (1D) current analysis can be comparable to those of 2D NMR over 1D NMR analysis.
Unraveling Hydrophobic Interactions at the Molecular Scale Using Force Spectroscopy and Molecular Dynamics Simulations.

PubMed

Stock, Philipp; Monroe, Jacob I; Utzig, Thomas; Smith, David J; Shell, M Scott; Valtiner, Markus

2017-03-28

Interactions between hydrophobic moieties steer ubiquitous processes in aqueous media, including the self-organization of biologic matter. Recent decades have seen tremendous progress in understanding these for macroscopic hydrophobic interfaces. Yet, it is still a challenge to experimentally measure hydrophobic interactions (HIs) at the single-molecule scale and thus to compare with theory. Here, we present a combined experimental-simulation approach to directly measure and quantify the sequence dependence and additivity of HIs in peptide systems at the single-molecule scale. We combine dynamic single-molecule force spectroscopy on model peptides with fully atomistic, both equilibrium and nonequilibrium, molecular dynamics (MD) simulations of the same systems. Specifically, we mutate a flexible (GS) 5 peptide scaffold with increasing numbers of hydrophobic leucine monomers and measure the peptides' desorption from hydrophobic self-assembled monolayer surfaces. Based on the analysis of nonequilibrium work-trajectories, we measure an interaction free energy that scales linearly with 3.0-3.4 k B T per leucine. In good agreement, simulations indicate a similar trend with 2.1 k B T per leucine, while also providing a detailed molecular view into HIs. This approach potentially provides a roadmap for directly extracting qualitative and quantitative single-molecule interactions at solid/liquid interfaces in a wide range of fields, including interactions at biointerfaces and adhesive interactions in industrial applications.
LongISLND: in silico sequencing of lengthy and noisy datatypes.

PubMed

Lau, Bayo; Mohiyuddin, Marghoob; Mu, John C; Fang, Li Tai; Bani Asadi, Narges; Dallett, Carolina; Lam, Hugo Y K

2016-12-15

LongISLND is a software package designed to simulate sequencing data according to the characteristics of third generation, single-molecule sequencing technologies. The general software architecture is easily extendable, as demonstrated by the emulation of Pacific Biosciences (PacBio) multi-pass sequencing with P5 and P6 chemistries, producing data in FASTQ, H5, and the latest PacBio BAM format. We demonstrate its utility by downstream processing with consensus building and variant calling. LongISLND is implemented in Java and available at http://bioinform.github.io/longislnd CONTACT: hugo.lam@roche.comSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
DNA-Encoded Solid-Phase Synthesis: Encoding Language Design and Complex Oligomer Library Synthesis.

PubMed

MacConnell, Andrew B; McEnaney, Patrick J; Cavett, Valerie J; Paegel, Brian M

2015-09-14

The promise of exploiting combinatorial synthesis for small molecule discovery remains unfulfilled due primarily to the "structure elucidation problem": the back-end mass spectrometric analysis that significantly restricts one-bead-one-compound (OBOC) library complexity. The very molecular features that confer binding potency and specificity, such as stereochemistry, regiochemistry, and scaffold rigidity, are conspicuously absent from most libraries because isomerism introduces mass redundancy and diverse scaffolds yield uninterpretable MS fragmentation. Here we present DNA-encoded solid-phase synthesis (DESPS), comprising parallel compound synthesis in organic solvent and aqueous enzymatic ligation of unprotected encoding dsDNA oligonucleotides. Computational encoding language design yielded 148 thermodynamically optimized sequences with Hamming string distance ≥ 3 and total read length <100 bases for facile sequencing. Ligation is efficient (70% yield), specific, and directional over 6 encoding positions. A series of isomers served as a testbed for DESPS's utility in split-and-pool diversification. Single-bead quantitative PCR detected 9 × 10(4) molecules/bead and sequencing allowed for elucidation of each compound's synthetic history. We applied DESPS to the combinatorial synthesis of a 75,645-member OBOC library containing scaffold, stereochemical and regiochemical diversity using mixed-scale resin (160-μm quality control beads and 10-μm screening beads). Tandem DNA sequencing/MALDI-TOF MS analysis of 19 quality control beads showed excellent agreement (<1 ppt) between DNA sequence-predicted mass and the observed mass. DESPS synergistically unites the advantages of solid-phase synthesis and DNA encoding, enabling single-bead structural elucidation of complex compounds and synthesis using reactions normally considered incompatible with unprotected DNA. The widespread availability of inexpensive oligonucleotide synthesis, enzymes, DNA sequencing, and PCR make implementation of DESPS straightforward, and may prompt the chemistry community to revisit the synthesis of more complex and diverse libraries.
Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity

USDA-ARS?s Scientific Manuscript database

Although draft genomes are available for most agronomically important plant species, the majority are incomplete, highly fragmented, and often riddled with assembly and scaffolding errors. These assembly issues hinder advances in tool development for functional genomics and systems biology. Here we ...
Genes and Vocal Learning

ERIC Educational Resources Information Center

White, Stephanie A.

2010-01-01

Could a mutation in a single gene be the evolutionary lynchpin supporting the development of human language? A rare mutation in the molecule known as FOXP2 discovered in a human family seemed to suggest so, and its sequence phylogeny reinforced a Chomskian view that language emerged wholesale in humans. Spurred by this discovery, research in…
n-CoDeR concept: unique types of antibodies for diagnostic use and therapy.

PubMed

Carlsson, R; Söderlind, E

2001-05-01

The n-CoDeR recombinant antibody gene libraries are built on a single master framework, into which diverse in vivo-formed complementarity determining regions (CDRs) are allowed to recombine. These CDRs are sampled from in vivo-processed and proof-read gene sequences, thus ensuring an optimal level of correctly folded and functional molecules. By the modularized assembly process, up to six CDRs can be varied at the same time, providing a possibility for the creation of a hitherto undescribed genetic and functional variation. The n-CoDeR antibody gene libraries can be used to select highly specific, human antibody fragments with specificities to virtually any antigen, including carbohydrates and human self-proteins and with affinities down into the subnanomolar range. Furthermore, combining CDRs sampled from in vivo-processed sequences into a single framework result in molecules exhibiting a lower immunogenicity compared to normal human immunoglobulins, as determined by computer analyses. The distinguished features of the n-CoDeR libraries in the therapeutic and diagnostic areas are discussed.
Nanofluidic Device with Embedded Nanopore

NASA Astrophysics Data System (ADS)

Zhang, Yuning; Reisner, Walter

2014-03-01

Nanofluidic based devices are robust methods for biomolecular sensing and single DNA manipulation. Nanopore-based DNA sensing has attractive features that make it a leading candidate as a single-molecule DNA sequencing technology. Nanochannel based extension of DNA, combined with enzymatic or denaturation-based barcoding schemes, is already a powerful approach for genome analysis. We believe that there is revolutionary potential in devices that combine nanochannels with nanpore detectors. In particular, due to the fast translocation of a DNA molecule through a standard nanopore configuration, there is an unfavorable trade-off between signal and sequence resolution. With a combined nanochannel-nanopore device, based on embedding a nanopore inside a nanochannel, we can in principle gain independent control over both DNA translocation speed and sensing signal, solving the key draw-back of the standard nanopore configuration. We demonstrate that we can detect - using fluorescent microscopy - successful translocation of DNA from the nanochannel out through the nanopore, a possible method to 'select' a given barcode for further analysis. We also show that in equilibrium DNA will not escape through an embedded sub-persistence length nanopore until a certain voltage bias is added.
DNA - peptide polyelectrolyte complexes: Phase control by hybridization

NASA Astrophysics Data System (ADS)

Vieregg, Jeffrey; Lueckheide, Michael; Marciel, Amanda; Leon, Lorraine; Tirrell, Matthew

DNA is one of the most highly-charged molecules known, and interacts strongly with charged molecules in the cell. Condensation of long double-stranded DNA is one of the classic problems of biophysics, but the polyelectrolyte behavior of short and/or single-stranded nucleic acids has attracted far less study despite its importance for both biological and engineered systems. We report here studies of DNA oligonucleotides complexed with cationic peptides and polyamines. As seen previously for longer sequences, double-stranded oligonucleotides form solid precipitates, but single-stranded oligonucleotides instead undergo liquid-liquid phase separation to form coacervate droplets. Complexed oligonucleotides remain competent for hybridization, and display sequence-dependent environmental response. We observe similar behavior for RNA oligonucleotides, and methylphosphonate substitution of the DNA backbone indicates that nucleic acid charge density controls whether liquid or solid complexes are formed. Liquid-liquid phase separations of this type have been implicated in formation of membraneless organelles in vivo, and have been suggested as protocells in early life scenarios; oligonucleotides offer an excellent method to probe the physics controlling these phenomena.
Next-Generation Sequencing Platforms

NASA Astrophysics Data System (ADS)

Mardis, Elaine R.

2013-06-01

Automated DNA sequencing instruments embody an elegant interplay among chemistry, engineering, software, and molecular biology and have built upon Sanger's founding discovery of dideoxynucleotide sequencing to perform once-unfathomable tasks. Combined with innovative physical mapping approaches that helped to establish long-range relationships between cloned stretches of genomic DNA, fluorescent DNA sequencers produced reference genome sequences for model organisms and for the reference human genome. New types of sequencing instruments that permit amazing acceleration of data-collection rates for DNA sequencing have been developed. The ability to generate genome-scale data sets is now transforming the nature of biological inquiry. Here, I provide an historical perspective of the field, focusing on the fundamental developments that predated the advent of next-generation sequencing instruments and providing information about how these instruments work, their application to biological research, and the newest types of sequencers that can extract data from single DNA molecules.
Free energy minimization to predict RNA secondary structures and computational RNA design.

PubMed

Churkin, Alexander; Weinbrand, Lina; Barash, Danny

2015-01-01

Determining the RNA secondary structure from sequence data by computational predictions is a long-standing problem. Its solution has been approached in two distinctive ways. If a multiple sequence alignment of a collection of homologous sequences is available, the comparative method uses phylogeny to determine conserved base pairs that are more likely to form as a result of billions of years of evolution than by chance. In the case of single sequences, recursive algorithms that compute free energy structures by using empirically derived energy parameters have been developed. This latter approach of RNA folding prediction by energy minimization is widely used to predict RNA secondary structure from sequence. For a significant number of RNA molecules, the secondary structure of the RNA molecule is indicative of its function and its computational prediction by minimizing its free energy is important for its functional analysis. A general method for free energy minimization to predict RNA secondary structures is dynamic programming, although other optimization methods have been developed as well along with empirically derived energy parameters. In this chapter, we introduce and illustrate by examples the approach of free energy minimization to predict RNA secondary structures.
Extending the spectrum of DNA sequences retrieved from ancient bones and teeth

PubMed Central

Glocke, Isabelle; Meyer, Matthias

2017-01-01

The number of DNA fragments surviving in ancient bones and teeth is known to decrease with fragment length. Recent genetic analyses of Middle Pleistocene remains have shown that the recovery of extremely short fragments can prove critical for successful retrieval of sequence information from particularly degraded ancient biological material. Current sample preparation techniques, however, are not optimized to recover DNA sequences from fragments shorter than ∼35 base pairs (bp). Here, we show that much shorter DNA fragments are present in ancient skeletal remains but lost during DNA extraction. We present a refined silica-based DNA extraction method that not only enables efficient recovery of molecules as short as 25 bp but also doubles the yield of sequences from longer fragments due to improved recovery of molecules with single-strand breaks. Furthermore, we present strategies for monitoring inefficiencies in library preparation that may result from co-extraction of inhibitory substances during DNA extraction. The combination of DNA extraction and library preparation techniques described here substantially increases the yield of DNA sequences from ancient remains and provides access to a yet unexploited source of highly degraded DNA fragments. Our work may thus open the door for genetic analyses on even older material. PMID:28408382
Comparison of single-molecule sequencing and hybrid approaches for finishing the genome of Clostridium autoethanogenum and analysis of CRISPR systems in industrial relevant Clostridia

PubMed Central

2014-01-01

Background Clostridium autoethanogenum strain JA1-1 (DSM 10061) is an acetogen capable of fermenting CO, CO2 and H2 (e.g. from syngas or waste gases) into biofuel ethanol and commodity chemicals such as 2,3-butanediol. A draft genome sequence consisting of 100 contigs has been published. Results A closed, high-quality genome sequence for C. autoethanogenum DSM10061 was generated using only the latest single-molecule DNA sequencing technology and without the need for manual finishing. It is assigned to the most complex genome classification based upon genome features such as repeats, prophage, nine copies of the rRNA gene operons. It has a low G + C content of 31.1%. Illumina, 454, Illumina/454 hybrid assemblies were generated and then compared to the draft and PacBio assemblies using summary statistics, CGAL, QUAST and REAPR bioinformatics tools and comparative genomic approaches. Assemblies based upon shorter read DNA technologies were confounded by the large number repeats and their size, which in the case of the rRNA gene operons were ~5 kb. CRISPR (Clustered Regularly Interspaced Short Paloindromic Repeats) systems among biotechnologically relevant Clostridia were classified and related to plasmid content and prophages. Potential associations between plasmid content and CRISPR systems may have implications for historical industrial scale Acetone-Butanol-Ethanol (ABE) fermentation failures and future large scale bacterial fermentations. While C. autoethanogenum contains an active CRISPR system, no such system is present in the closely related Clostridium ljungdahlii DSM 13528. A common prophage inserted into the Arg-tRNA shared between the strains suggests a common ancestor. However, C. ljungdahlii contains several additional putative prophages and it has more than double the amount of prophage DNA compared to C. autoethanogenum. Other differences include important metabolic genes for central metabolism (as an additional hydrogenase and the absence of a phophoenolpyruvate synthase) and substrate utilization pathway (mannose and aromatics utilization) that might explain phenotypic differences between C. autoethanogenum and C. ljungdahlii. Conclusions Single molecule sequencing will be increasingly used to produce finished microbial genomes. The complete genome will facilitate comparative genomics and functional genomics and support future comparisons between Clostridia and studies that examine the evolution of plasmids, bacteriophage and CRISPR systems. PMID:24655715
Comparative genomic analysis of single-molecule sequencing and hybrid approaches for finishing the Clostridium autoethanogenum JA1-1 strain DSM 10061 genome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Brown, Steven D; Nagaraju, Shilpa; Utturkar, Sagar M

Background Clostridium autoethanogenum strain JA1-1 (DSM 10061) is an acetogen capable of fermenting CO, CO2 and H2 (e.g. from syngas or waste gases) into biofuel ethanol and commodity chemicals such as 2,3-butanediol. A draft genome sequence consisting of 100 contigs has been published. Results A closed, high-quality genome sequence for C. autoethanogenum DSM10061 was generated using only the latest single-molecule DNA sequencing technology and without the need for manual finishing. It is assigned to the most complex genome classification based upon genome features such as repeats, prophage, nine copies of the rRNA gene operons. It has a low G +more » C content of 31.1%. Illumina, 454, Illumina/454 hybrid assemblies were generated and then compared to the draft and PacBio assemblies using summary statistics, CGAL, QUAST and REAPR bioinformatics tools and comparative genomic approaches. Assemblies based upon shorter read DNA technologies were confounded by the large number repeats and their size, which in the case of the rRNA gene operons were ~5 kb. CRISPR (Clustered Regularly Interspaced Short Paloindromic Repeats) systems among biotechnologically relevant Clostridia were classified and related to plasmid content and prophages. Potential associations between plasmid content and CRISPR systems may have implications for historical industrial scale Acetone-Butanol-Ethanol (ABE) fermentation failures and future large scale bacterial fermentations. While C. autoethanogenum contains an active CRISPR system, no such system is present in the closely related Clostridium ljungdahlii DSM 13528. A common prophage inserted into the Arg-tRNA shared between the strains suggests a common ancestor. However, C. ljungdahlii contains several additional putative prophages and it has more than double the amount of prophage DNA compared to C. autoethanogenum. Other differences include important metabolic genes for central metabolism (as an additional hydrogenase and the absence of a phophoenolpyruvate synthase) and substrate utilization pathway (mannose and aromatics utilization) that might explain phenotypic differences between C. autoethanogenum and C. ljungdahlii. Conclusions Single molecule sequencing will be increasingly used to produce finished microbial genomes. The complete genome will facilitate comparative genomics and functional genomics and support future comparisons between Clostridia and studies that examine the evolution of plasmids, bacteriophage and CRISPR systems.« less
Changes in solvation during DNA binding and cleavage are critical to altered specificity of the EcoRI endonuclease

PubMed Central

Robinson, Clifford R.; Sligar, Stephen G.

1998-01-01

Restriction endonucleases such as EcoRI bind and cleave DNA with great specificity and represent a paradigm for protein–DNA interactions and molecular recognition. Using osmotic pressure to induce water release, we demonstrate the participation of bound waters in the sequence discrimination of substrate DNA by EcoRI. Changes in solvation can play a critical role in directing sequence-specific DNA binding by EcoRI and are also crucial in assisting site discrimination during catalysis. By measuring the volume change for complex formation, we show that at the cognate sequence (GAATTC) EcoRI binding releases about 70 fewer water molecules than binding at an alternate DNA sequence (TAATTC), which differs by a single base pair. EcoRI complexation with nonspecific DNA releases substantially less water than either of these specific complexes. In cognate substrates (GAATTC) kcat decreases as osmotic pressure is increased, indicating the binding of about 30 water molecules accompanies the cleavage reaction. For the alternate substrate (TAATTC), release of about 40 water molecules accompanies the reaction, indicated by a dramatic acceleration of the rate when osmotic pressure is raised. These large differences in solvation effects demonstrate that water molecules can be key players in the molecular recognition process during both association and catalytic phases of the EcoRI reaction, acting to change the specificity of the enzyme. For both the protein–DNA complex and the transition state, there may be substantial conformational differences between cognate and alternate sites, accompanied by significant alterations in hydration and solvent accessibility. PMID:9482860
Identification and DNA annotation of a plasmid isolated from Chromobacterium violaceum.

PubMed

Lima, Daniel C; Nyberg, Lena K; Westerlund, Fredrik; Batistuzzo de Medeiros, Silvia R

2018-03-28

Chromobacterium violaceum is a ß-proteobacterium found widely worldwide with important biotechnological properties and is associated to lethal sepsis in immune-depressed individuals. In this work, we report the discover, complete sequence and annotation of a plasmid detected in C. violaceum that has been unnoticed until now. We used DNA single-molecule analysis to confirm that the episome found was a circular molecule and then proceeded with NGS sequencing. After DNA annotation, we found that this extra-chromosomal DNA is probably a defective bacteriophage of approximately 44 kilobases, with 39 ORFs comprising, mostly hypothetical proteins. We also found DNA sequences that ensure proper plasmid replication and partitioning as well as a toxin addiction system. This report sheds light on the biology of this important species, helping us to understand the mechanisms by which C. violaceum endures to several harsh conditions. This discovery could also be a first step in the development of a DNA manipulation tool in this bacterium.
Formation of template-switching artifacts by linear amplification.

PubMed

Chakravarti, Dhrubajyoti; Mailander, Paula C

2008-07-01

Linear amplification is a method of synthesizing single-stranded DNA from either a single-stranded DNA or one strand of a double-stranded DNA. In this protocol, molecules of a single primer DNA are extended by multiple rounds of DNA synthesis at high temperature using thermostable DNA polymerases. Although linear amplification generates the intended full-length single-stranded product, it is more efficient over single-stranded templates than double-stranded templates. We analyzed linear amplification over single- or double-stranded mouse H-ras DNA (exon 1-2 region). The single-stranded H-ras template yielded only the intended product. However, when the double-stranded template was used, additional artifact products were observed. Increasing the concentration of the double-stranded template produced relatively higher amounts of these artifact products. One of the artifact DNA bands could be mapped and analyzed by sequencing. It contained three template-switching products. These DNAs were formed by incomplete DNA strand extension over the template strand, followed by switching to the complementary strand at a specific Ade nucleotide within a putative hairpin sequence, from which DNA synthesis continued over the complementary strand.
Fabrication and characterization of a solid state nanopore with self-aligned carbon nanoelectrodes for molecular detection

NASA Astrophysics Data System (ADS)

Spinney, Patrick; Collins, Scott D.; Howitt, David G.; Smith, Rosemary L.

2012-06-01

Rapid and cost-effective DNA sequencing is a pivotal prerequisite for the genomics era. Many of the recent advances in forensics, medicine, agriculture, taxonomy, and drug discovery have paralleled critical advances in DNA sequencing technology. Nanopore modalities for DNA sequencing have recently surfaced including the electrical interrogation of protein ion channels and/or solid-state nanopores during translocation of DNA. However to date, most of this work has met with mixed success. In this work, we present a unique nanofabrication strategy that realizes an artificial nanopore articulated with carbon electrodes to sense the current modulations during the transport of DNA through the nanopore. This embodiment overcomes most of the technical difficulties inherent in other artificial nanopore embodiments and present a versatile platform for the testing of DNA single nucleotide detection. Characterization of the device using gold nanoparticles, silica nanoparticles, lambda dsDNA and 16-mer ssDNA are presented. Although single molecule DNA sequencing is still not demonstrated, the device shows a path towards this goal.
Complete plastid genome sequence of goosegrass (Eleusine indica) and comparison with other Poaceae.

PubMed

Zhang, Hui; Hall, Nathan; McElroy, J Scott; Lowe, Elijah K; Goertzen, Leslie R

2017-02-05

Eleusine indica, also known as goosegrass, is a serious weed in at least 42 countries. In this paper we report the complete plastid genome sequence of goosegrass obtained by de novo assembly of paired-end and mate-paired reads generated by Illumina sequencing of total genomic DNA. The goosegrass plastome is a circular molecule of 135,151bp in length, consisting of two single-copy regions separated by a pair of inverted repeats (IRs) of 20,919 bases. The large (LSC) and the small (SSC) single-copy regions span 80,667 bases and 12,646 bases, respectively. The plastome of goosegrass has 38.19% GC content and includes 108 unique genes, of which 76 are protein-coding, 28 are transfer RNA, and 4 are ribosomal RNA. The goosegrass plastome sequence was compared to eight other species of Poaceae. Although generally conserved with respect to Poaceae, this genomic resource will be useful for evolutionary studies within this weed species and the genus Eleusine. Copyright © 2016. Published by Elsevier B.V.

Multidimensional optical spectroscopy of a single molecule in a current-carrying state

NASA Astrophysics Data System (ADS)

Rahav, S.; Mukamel, S.

2010-12-01

The nonlinear optical signals from an open system consisting of a molecule connected to metallic leads, in response to a sequence of impulsive pulses, are calculated using a superoperator formalism. Two detection schemes are considered: coherent stimulated emission and incoherent fluorescence. The two provide similar but not identical information. The necessary superoperator correlation functions are evaluated either by converting them to ordinary (Hilbert space) operators which are then expanded in many-body states, or by using Wick's theorem for superoperators to factorize them into nonequilibrium two point Green's functions. As an example we discuss a stimulated Raman process that shows resonances involving two different charge states of the molecule in the same signal.
The origin of the 5S ribosomal RNA molecule could have been caused by a single inverse duplication: strong evidence from its sequences.

PubMed

Branciamore, Sergio; Di Giulio, Massimo

2012-04-01

The secondary structure of the 5S ribosomal RNA (5S rRNA) molecule shows a high degree of symmetry. In order to explain the origin of this symmetry, it has been conjectured that one half of the 5S rRNA molecule was its precursor and that an indirect duplication of this precursor created the other half and thus the current symmetry of the molecule. Here, we have subjected to an empirical test both the indirect duplication model, analysing a total of 684 5S rRNA sequences for complementarity between the two halves of the 5S rRNA, and the direct duplication model analysing in this case the similarity between the two halves of this molecule. In intra- and inter-molecule and intra- and inter-domain comparisons, we find a high statistical support to the hypothesis of a complementarity relationship between the two halves of the 5S rRNA molecule, denying vice versa the hypothesis of similarity between these halves. Therefore, these observations corroborate the indirect duplication model at the expense of the direct duplication model, as reason of the origin of the 5S rRNA molecule. More generally, we discuss and favour the hypothesis that all RNAs and proteins, which present symmetry, did so through gene duplication and not by gradualistic accumulation of few monomers or segments of molecule into a gradualistic growth process. This would be the consequence of the very high propensity that nucleic acids have to be subjected to duplications.
Detection and interrogation of biomolecules via nanoscale probes: From fundamental physics to DNA sequencing

NASA Astrophysics Data System (ADS)

Zwolak, Michael

2013-03-01

A rapid and low-cost method to sequence DNA would revolutionize personalized medicine, where genetic information is used to diagnose, treat, and prevent diseases. There is a longstanding interest in nanopores as a platform for rapid interrogation of single DNA molecules. I will discuss a sequencing protocol based on the measurement of transverse electronic currents during the translocation of single-stranded DNA through nanopores. Using molecular dynamics simulations coupled to quantum mechanical calculations of the tunneling current, I will show that the DNA nucleotides are predicted to have distinguishable electronic signatures in experimentally realizable systems. Several recent experiments support our theoretical predictions. In addition to their possible impact in medicine and biology, the above methods offer ideal test beds to study open scientific issues in the relatively unexplored area at the interface between solids, liquids, and biomolecules at the nanometer length scale. http://mike.zwolak.org
Nanostructured Tip-Shaped Biosensors: Application of Six Sigma Approach for Enhanced Manufacturing.

PubMed

Kahng, Seong-Joong; Kim, Jong-Hoon; Chung, Jae-Hyun

2016-12-23

Nanostructured tip-shaped biosensors have drawn attention for biomolecule detection as they are promising for highly sensitive and specific detection of a target analyte. Using a nanostructured tip, the sensitivity is increased to identify individual molecules because of the high aspect ratio structure. Various detection methods, such as electrochemistry, fluorescence microcopy, and Raman spectroscopy, have been attempted to enhance the sensitivity and the specificity. Due to the confined path of electrons, electrochemical measurement using a nanotip enables the detection of single molecules. When an electric field is combined with capillary action and fluid flow, target molecules can be effectively concentrated onto a nanotip surface for detection. To enhance the concentration efficacy, a dendritic nanotip rather than a single tip could be used to detect target analytes, such as nanoparticles, cells, and DNA. However, reproducible fabrication with relation to specific detection remains a challenge due to the instability of a manufacturing method, resulting in inconsistent shape. In this paper, nanostructured biosensors are reviewed with our experimental results using dendritic nanotips for sequence specific detection of DNA. By the aid of the Six Sigma approach, the fabrication yield of dendritic nanotips increases from 20.0% to 86.6%. Using the nanotips, DNA is concentrated and detected in a sequence specific way with the detection limit equivalent to 1000 CFU/mL. The pros and cons of a nanotip biosensor are evaluated in conjunction with future prospects.
The Domino Way to Heterocycles

PubMed Central

Padwa, Albert; Bur, Scott K.

2007-01-01

Sequential transformations enable the facile synthesis of complex target molecules from simple building blocks in a single preparative step. Their value is amplified if they also create multiple stereogenic centers. In the ongoing search for new domino processes, emphasis is usually placed on sequential reactions which occur cleanly and without forming by-products. As a prerequisite for an ideally proceeding one-pot sequential transformation, the reactivity pattern of all participating components has to be such that each building block gets involved in a reaction only when it is supposed to do so. The development of sequences that combine transformations of fundamentally different mechanisms broadens the scope of such procedures in synthetic chemistry. This mini review contains a representative sampling from the last 15 years on the kinds of reactions that have been sequenced into cascades to produce heterocyclic molecules. PMID:17940591
Counterion accumulation effects on a suspension of DNA molecules: Equation of state and pressure-driven denaturation

NASA Astrophysics Data System (ADS)

Nicasio-Collazo, Luz Adriana; Delgado-González, Alexandra; Hernández-Lemus, Enrique; Castañeda-Priego, Ramón

2017-04-01

The study of the effects associated with the electrostatic properties of DNA is of fundamental importance to understand both its molecular properties at the single molecule level, like the rigidity of the chain, and its interaction with other charged bio-molecules, including other DNA molecules; such interactions are crucial to maintain the thermodynamic stability of the intra-cellular medium. In the present work, we combine the Poisson-Boltzmann mean-field theory with an irreversible thermodynamic approximation to analyze the effects of counterion accumulation inside DNA on both the denaturation profile of the chain and the equation of state of the suspension. To this end, we model the DNA molecule as a porous charged cylinder immersed in an aqueous solution. These thermo-electrostatic effects are explicitly studied in the particular case of some genes for which damage in their sequence is associated with diffuse large B-cell lymphoma.
Single-Molecule Denaturation Mapping of DNA in Nanofluidic Channels

NASA Astrophysics Data System (ADS)

Reisner, Walter; Larsen, Niels; Silahtaroglu, Asli; Kristensen, Anders; Tommerup, Niels; Tegenfeldt, Jonas O.; Flyvbjerg, Henrik

2010-03-01

Nanochannel based DNA stretching can serve as a platform for a new optical mapping technique based on measuring the pattern of partial melting along the extended molecules. We partially melt DNA extended in nanofluidic channels via a combination of local heating and added chemical denaturants. The melted molecules, imaged via a standard fluorescence videomicroscopy setup, exhibit a nonuniform fluorescence profile corresponding to a series of local dips and peaks in the intensity trace along the stretched molecule. We show that this barcode is consistent with the presence of locally melted regions along the molecule and can be explained by calculations of sequence-dependent melting probability. Specifically, we obtain experimental melting profiles for T4, T7, lambda-phage and bacterial artificial chromosome DNA (from human chromosome 12) and compare these profiles to theory. In addition, we demonstrate that the BAC melting profile can be used to align the BAC to its correct position on chromosome 12.
Fluorescence fluctuations analysis in nanoapertures: physical concepts and biological applications.

PubMed

Lenne, Pierre-François; Rigneault, Hervé; Marguet, Didier; Wenger, Jérôme

2008-11-01

During the past years, nanophotonics has provided new approaches to study the biological processes below the optical diffraction limit. How single molecules diffuse, bind and assemble can be studied now at the nanometric level, not only in solutions but also in complex and crowded environments such as in live cells. In this context fluorescence fluctuations spectroscopy is a unique tool since it has proven to be easy to use in combination with nanostructures, which are able to confine light in nanometric volumes. We review here recent advances in fluorescence fluctuations' analysis below the optical diffraction limit with a special focus on nanoapertures milled in metallic films. We discuss applications in the field of single-molecule detection, DNA sequencing and membrane organization, and underscore some potential perspectives of this new emerging technology.
CoSMoS unravels mysteries of transcription initiation.

PubMed

Gourse, Richard L; Landick, Robert

2012-02-17

Using a fluorescence method called colocalization single-molecule spectroscopy (CoSMoS), Friedman and Gelles dissect the kinetics of transcription initiation at a bacterial promoter. Ultimately, CoSMoS could greatly aid the study of the effects of DNA sequence and transcription factors on both prokaryotic and eukaryotic promoters. Copyright Â© 2012 Elsevier Inc. All rights reserved.
High-Throughput Block Optical DNA Sequence Identification.

PubMed

Sagar, Dodderi Manjunatha; Korshoj, Lee Erik; Hanson, Katrina Bethany; Chowdhury, Partha Pratim; Otoupal, Peter Britton; Chatterjee, Anushree; Nagpal, Prashant

2018-01-01

Optical techniques for molecular diagnostics or DNA sequencing generally rely on small molecule fluorescent labels, which utilize light with a wavelength of several hundred nanometers for detection. Developing a label-free optical DNA sequencing technique will require nanoscale focusing of light, a high-throughput and multiplexed identification method, and a data compression technique to rapidly identify sequences and analyze genomic heterogeneity for big datasets. Such a method should identify characteristic molecular vibrations using optical spectroscopy, especially in the "fingerprinting region" from ≈400-1400 cm -1 . Here, surface-enhanced Raman spectroscopy is used to demonstrate label-free identification of DNA nucleobases with multiplexed 3D plasmonic nanofocusing. While nanometer-scale mode volumes prevent identification of single nucleobases within a DNA sequence, the block optical technique can identify A, T, G, and C content in DNA k-mers. The content of each nucleotide in a DNA block can be a unique and high-throughput method for identifying sequences, genes, and other biomarkers as an alternative to single-letter sequencing. Additionally, coupling two complementary vibrational spectroscopy techniques (infrared and Raman) can improve block characterization. These results pave the way for developing a novel, high-throughput block optical sequencing method with lossy genomic data compression using k-mer identification from multiplexed optical data acquisition. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Molecular dynamics simulations and docking enable to explore the biophysical factors controlling the yields of engineered nanobodies.

PubMed

Soler, Miguel A; de Marco, Ario; Fortuna, Sara

2016-10-10

Nanobodies (VHHs) have proved to be valuable substitutes of conventional antibodies for molecular recognition. Their small size represents a precious advantage for rational mutagenesis based on modelling. Here we address the problem of predicting how Camelidae nanobody sequences can tolerate mutations by developing a simulation protocol based on all-atom molecular dynamics and whole-molecule docking. The method was tested on two sets of nanobodies characterized experimentally for their biophysical features. One set contained point mutations introduced to humanize a wild type sequence, in the second the CDRs were swapped between single-domain frameworks with Camelidae and human hallmarks. The method resulted in accurate scoring approaches to predict experimental yields and enabled to identify the structural modifications induced by mutations. This work is a promising tool for the in silico development of single-domain antibodies and opens the opportunity to customize single functional domains of larger macromolecules.
Molecular dynamics simulations and docking enable to explore the biophysical factors controlling the yields of engineered nanobodies

NASA Astrophysics Data System (ADS)

Soler, Miguel A.; De Marco, Ario; Fortuna, Sara

2016-10-01

Nanobodies (VHHs) have proved to be valuable substitutes of conventional antibodies for molecular recognition. Their small size represents a precious advantage for rational mutagenesis based on modelling. Here we address the problem of predicting how Camelidae nanobody sequences can tolerate mutations by developing a simulation protocol based on all-atom molecular dynamics and whole-molecule docking. The method was tested on two sets of nanobodies characterized experimentally for their biophysical features. One set contained point mutations introduced to humanize a wild type sequence, in the second the CDRs were swapped between single-domain frameworks with Camelidae and human hallmarks. The method resulted in accurate scoring approaches to predict experimental yields and enabled to identify the structural modifications induced by mutations. This work is a promising tool for the in silico development of single-domain antibodies and opens the opportunity to customize single functional domains of larger macromolecules.
Single-Molecule Kinetics Reveal Cation-Promoted DNA Duplex Formation Through Ordering of Single-Stranded Helices

PubMed Central

Dupuis, Nicholas F.; Holmstrom, Erik D.; Nesbitt, David J.

2013-01-01

In this work, the kinetics of short, fully complementary oligonucleotides are investigated at the single-molecule level. Constructs 6–9 bp in length exhibit single exponential kinetics over 2 orders of magnitude time for both forward (kon, association) and reverse (koff, dissociation) processes. Bimolecular rate constants for association are weakly sensitive to the number of basepairs in the duplex, with a 2.5-fold increase between 9 bp (k′on = 2.1(1) × 106 M−1 s−1) and 6 bp (k′on = 5.0(1) × 106 M−1 s−1) sequences. In sharp contrast, however, dissociation rate constants prove to be exponentially sensitive to sequence length, varying by nearly 600-fold over the same 9 bp (koff = 0.024 s−1) to 6 bp (koff = 14 s−1) range. The 8 bp sequence is explored in more detail, and the NaCl dependence of kon and koff is measured. Interestingly, konincreases by >40-fold (kon = 0.10(1) s−1 to 4.0(4) s−1 between [NaCl] = 25 mM and 1 M), whereas in contrast, koffdecreases by fourfold (0.72(3) s−1 to 0.17(7) s−1) over the same range of conditions. Thus, the equilibrium constant (Keq) increases by ≈160, largely due to changes in the association rate, kon. Finally, temperature-dependent measurements reveal that increased [NaCl] reduces the overall exothermicity (ΔΔH° > 0) of duplex formation, albeit by an amount smaller than the reduction in entropic penalty (−TΔΔS° < 0). This reduced entropic cost is attributed to a cation-facilitated preordering of the two single-stranded species, which lowers the association free-energy barrier and in turn accelerates the rate of duplex formation. PMID:23931323
Digital encoding of cellular mRNAs enabling precise and absolute gene expression measurement by single-molecule counting.

PubMed

Fu, Glenn K; Wilhelmy, Julie; Stern, David; Fan, H Christina; Fodor, Stephen P A

2014-03-18

We present a new approach for the sensitive detection and accurate quantitation of messenger ribonucleic acid (mRNA) gene transcripts in single cells. First, the entire population of mRNAs is encoded with molecular barcodes during reverse transcription. After amplification of the gene targets of interest, molecular barcodes are counted by sequencing or scored on a simple hybridization detector to reveal the number of molecules in the starting sample. Since absolute quantities are measured, calibration to standards is unnecessary, and many of the relative quantitation challenges such as polymerase chain reaction (PCR) bias are avoided. We apply the method to gene expression analysis of minute sample quantities and demonstrate precise measurements with sensitivity down to sub single-cell levels. The method is an easy, single-tube, end point assay utilizing standard thermal cyclers and PCR reagents. Accurate and precise measurements are obtained without any need for cycle-to-cycle intensity-based real-time monitoring or physical partitioning into multiple reactions (e.g., digital PCR). Further, since all mRNA molecules are encoded with molecular barcodes, amplification can be used to generate more material for multiple measurements and technical replicates can be carried out on limited samples. The method is particularly useful for small sample quantities, such as single-cell experiments. Digital encoding of cellular content preserves true abundance levels and overcomes distortions introduced by amplification.
Single-molecule sequencing and optical mapping yields an improved genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity.

PubMed

Edger, Patrick P; VanBuren, Robert; Colle, Marivi; Poorten, Thomas J; Wai, Ching Man; Niederhuth, Chad E; Alger, Elizabeth I; Ou, Shujun; Acharya, Charlotte B; Wang, Jie; Callow, Pete; McKain, Michael R; Shi, Jinghua; Collier, Chad; Xiong, Zhiyong; Mower, Jeffrey P; Slovin, Janet P; Hytönen, Timo; Jiang, Ning; Childs, Kevin L; Knapp, Steven J

2018-02-01

Although draft genomes are available for most agronomically important plant species, the majority are incomplete, highly fragmented, and often riddled with assembly and scaffolding errors. These assembly issues hinder advances in tool development for functional genomics and systems biology. Here we utilized a robust, cost-effective approach to produce high-quality reference genomes. We report a near-complete genome of diploid woodland strawberry (Fragaria vesca) using single-molecule real-time sequencing from Pacific Biosciences (PacBio). This assembly has a contig N50 length of ∼7.9 million base pairs (Mb), representing a ∼300-fold improvement of the previous version. The vast majority (>99.8%) of the assembly was anchored to 7 pseudomolecules using 2 sets of optical maps from Bionano Genomics. We obtained ∼24.96 Mb of sequence not present in the previous version of the F. vesca genome and produced an improved annotation that includes 1496 new genes. Comparative syntenic analyses uncovered numerous, large-scale scaffolding errors present in each chromosome in the previously published version of the F. vesca genome. Our results highlight the need to improve existing short-read based reference genomes. Furthermore, we demonstrate how genome quality impacts commonly used analyses for addressing both fundamental and applied biological questions. © The Authors 2017. Published by Oxford University Press.
Single molecule counting and assessment of random molecular tagging errors with transposable giga-scale error-correcting barcodes.

PubMed

Lau, Billy T; Ji, Hanlee P

2017-09-21

RNA-Seq measures gene expression by counting sequence reads belonging to unique cDNA fragments. Molecular barcodes commonly in the form of random nucleotides were recently introduced to improve gene expression measures by detecting amplification duplicates, but are susceptible to errors generated during PCR and sequencing. This results in false positive counts, leading to inaccurate transcriptome quantification especially at low input and single-cell RNA amounts where the total number of molecules present is minuscule. To address this issue, we demonstrated the systematic identification of molecular species using transposable error-correcting barcodes that are exponentially expanded to tens of billions of unique labels. We experimentally showed random-mer molecular barcodes suffer from substantial and persistent errors that are difficult to resolve. To assess our method's performance, we applied it to the analysis of known reference RNA standards. By including an inline random-mer molecular barcode, we systematically characterized the presence of sequence errors in random-mer molecular barcodes. We observed that such errors are extensive and become more dominant at low input amounts. We described the first study to use transposable molecular barcodes and its use for studying random-mer molecular barcode errors. Extensive errors found in random-mer molecular barcodes may warrant the use of error correcting barcodes for transcriptome analysis as input amounts decrease.
Circular replication-associated protein encoding DNA viruses identified in the faecal matter of various animals in New Zealand.

PubMed

Steel, Olivia; Kraberger, Simona; Sikorski, Alyssa; Young, Laura M; Catchpole, Ryan J; Stevens, Aaron J; Ladley, Jenny J; Coray, Dorien S; Stainton, Daisy; Dayaram, Anisha; Julian, Laurel; van Bysterveldt, Katherine; Varsani, Arvind

2016-09-01

In recent years, innovations in molecular techniques and sequencing technologies have resulted in a rapid expansion in the number of known viral sequences, in particular those with circular replication-associated protein (Rep)-encoding single-stranded (CRESS) DNA genomes. CRESS DNA viruses are present in the virome of many ecosystems and are known to infect a wide range of organisms. A large number of the recently identified CRESS DNA viruses cannot be classified into any known viral families, indicating that the current view of CRESS DNA viral sequence space is greatly underestimated. Animal faecal matter has proven to be a particularly useful source for sampling CRESS DNA viruses in an ecosystem, as it is cost-effective and non-invasive. In this study a viral metagenomic approach was used to explore the diversity of CRESS DNA viruses present in the faeces of domesticated and wild animals in New Zealand. Thirty-eight complete CRESS DNA viral genomes and two circular molecules (that may be defective molecules or single components of multicomponent genomes) were identified from forty-nine individual animal faecal samples. Based on shared genome organisations and sequence similarities, eighteen of the isolates were classified as gemycircularviruses and twelve isolates were classified as smacoviruses. The remaining eight isolates lack significant sequence similarity with any members of known CRESS DNA virus groups. This research adds significantly to our knowledge of CRESS DNA viral diversity in New Zealand, emphasising the prevalence of CRESS DNA viruses in nature, and reinforcing the suggestion that a large proportion of CRESS DNA viruses are yet to be identified. Copyright © 2016 Elsevier B.V. All rights reserved.
A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications.

PubMed

Haque, Ashraful; Engel, Jessica; Teichmann, Sarah A; Lönnberg, Tapio

2017-08-18

RNA sequencing (RNA-seq) is a genomic approach for the detection and quantitative analysis of messenger RNA molecules in a biological sample and is useful for studying cellular responses. RNA-seq has fueled much discovery and innovation in medicine over recent years. For practical reasons, the technique is usually conducted on samples comprising thousands to millions of cells. However, this has hindered direct assessment of the fundamental unit of biology-the cell. Since the first single-cell RNA-sequencing (scRNA-seq) study was published in 2009, many more have been conducted, mostly by specialist laboratories with unique skills in wet-lab single-cell genomics, bioinformatics, and computation. However, with the increasing commercial availability of scRNA-seq platforms, and the rapid ongoing maturation of bioinformatics approaches, a point has been reached where any biomedical researcher or clinician can use scRNA-seq to make exciting discoveries. In this review, we present a practical guide to help researchers design their first scRNA-seq studies, including introductory information on experimental hardware, protocol choice, quality control, data analysis and biological interpretation.
Sequence Dependent Interactions Between DNA and Single-Walled Carbon Nanotubes

NASA Astrophysics Data System (ADS)

Roxbury, Daniel

It is known that single-stranded DNA adopts a helical wrap around a single-walled carbon nanotube (SWCNT), forming a water-dispersible hybrid molecule. The ability to sort mixtures of SWCNTs based on chirality (electronic species) has recently been demonstrated using special short DNA sequences that recognize certain matching SWCNTs of specific chirality. This thesis investigates the intricacies of DNA-SWCNT sequence-specific interactions through both experimental and molecular simulation studies. The DNA-SWCNT binding strengths were experimentally quantified by studying the kinetics of DNA replacement by a surfactant on the surface of particular SWCNTs. Recognition ability was found to correlate strongly with measured binding strength, e.g. DNA sequence (TAT)4 was found to bind 20 times stronger to the (6,5)-SWCNT than sequence (TAT)4T. Next, using replica exchange molecular dynamics (REMD) simulations, equilibrium structures formed by (a) single-strands and (b) multiple-strands of 12-mer oligonucleotides adsorbed on various SWCNTs were explored. A number of structural motifs were discovered in which the DNA strand wraps around the SWCNT and 'stitches' to itself via hydrogen bonding. Great variability among equilibrium structures was observed and shown to be directly influenced by DNA sequence and SWCNT type. For example, the (6,5)-SWCNT DNA recognition sequence, (TAT)4, was found to wrap in a tight single-stranded right-handed helical conformation. In contrast, DNA sequence T12 forms a beta-barrel left-handed structure on the same SWCNT. These are the first theoretical indications that DNA-based SWCNT selectivity can arise on a molecular level. In a biomedical collaboration with the Mayo Clinic, pathways for DNA-SWCNT internalization into healthy human endothelial cells were explored. Through absorbance spectroscopy, TEM imaging, and confocal fluorescence microscopy, we showed that intracellular concentrations of SWCNTs far exceeded those of the incubation solution, which suggested an energy-dependent pathway. Additionally, by means of pharmacological inhibition and vector-induced gene knockout studies, the DNA-SWCNTs were shown to enter the cells via Rac1-mediated macropinocytosis.
Nanopore detection of DNA molecules in crowded neutral polymer solutions

NASA Astrophysics Data System (ADS)

Sharma, Rajesh Kumar; Dai, Liang; Doyle, Patrick; Garaj, Slaven

Nanopore sensing is a precise technique for analysis of the structure and dynamics of individual biomolecules in different environments, and has even become a prominent technique for next-gen DNA sequencing. In the nanopore sensor, an individual DNA molecule is electrophoretically translocated through a single, nanometer-scaled pore in a solid-state membrane separating two chambers filled with electrolyte. The conformation of the molecule is deduced from modulations in the ionic current through the pore during the translocation event. Using nanopores, we investigated the dynamics of the DNA molecules in a crowded solution of neutral polymers of different sizes and concentrations. The translocation dynamics depends significantly on the size and concentration of the polymers, as different contributions to the electrophoretic and entropic forces on the DNA molecules come into play. This setup offers an excellent, tuneable model-system for probing biologically relevant questions regarding the behaviour of DNA molecules in highly confined and crowded environments. Singapore-MIT Alliance for Research and Technology.

Slowing DNA Translocation in a Nanofluidic Field-Effect Transistor.

PubMed

Liu, Yifan; Yobas, Levent

2016-04-26

Here, we present an experimental demonstration of slowing DNA translocation across a nanochannel by modulating the channel surface charge through an externally applied gate bias. The experiments were performed on a nanofluidic field-effect transistor, which is a monolithic integrated platform featuring a 50 nm-diameter in-plane alumina nanocapillary whose entire length is surrounded by a gate electrode. The field-effect transistor behavior was validated on the gating of ionic conductance and protein transport. The gating of DNA translocation was subsequently studied by measuring discrete current dips associated with single λ-DNA translocation events under a source-to-drain bias of 1 V. The translocation speeds under various gate bias conditions were extracted by fitting event histograms of the measured translocation time to the first passage time distributions obtained from a simple 1D biased diffusion model. A positive gate bias was observed to slow the translocation of single λ-DNA chains markedly; the translocation speed was reduced by an order of magnitude from 18.4 mm/s obtained under a floating gate down to 1.33 mm/s under a positive gate bias of 9 V. Therefore, a dynamic and flexible regulation of the DNA translocation speed, which is vital for single-molecule sequencing, can be achieved on this device by simply tuning the gate bias. The device is realized in a conventional semiconductor microfabrication process without the requirement of advanced lithography, and can be potentially further developed into a compact electronic single-molecule sequencer.
Nanolock-Nanopore Facilitated Digital Diagnostics of Cancer Driver Mutation in Tumor Tissue.

PubMed

Wang, Yong; Tian, Kai; Shi, Ruicheng; Gu, Amy; Pennella, Michael; Alberts, Lindsey; Gates, Kent S; Li, Guangfu; Fan, Hongxin; Wang, Michael X; Gu, Li-Qun

2017-07-28

Cancer driver mutations are clinically significant biomarkers. In precision medicine, accurate detection of these oncogenic changes in patients would enable early diagnostics of cancer, individually tailored targeted therapy, and precise monitoring of treatment response. Here we investigated a novel nanolock-nanopore method for single-molecule detection of a serine/threonine protein kinase gene BRAF V600E mutation in tumor tissues of thyroid cancer patients. The method lies in a noncovalent, mutation sequence-specific nanolock. We found that the nanolock formed on the mutant allele/probe duplex can separate the duplex dehybridization procedure into two sequential steps in the nanopore. Remarkably, this stepwise unzipping kinetics can produce a unique nanopore electric marker, with which a single DNA molecule of the cancer mutant allele can be unmistakably identified in various backgrounds of the normal wild-type allele. The single-molecule sensitivity for mutant allele enables both binary diagnostics and quantitative analysis of mutation occurrence. In the current configuration, the method can detect the BRAF V600E mutant DNA lower than 1% in the tumor tissues. The nanolock-nanopore method can be adapted to detect a broad spectrum of both transversion and transition DNA mutations, with applications from diagnostics to targeted therapy.
DNA Translator and Aligner: HyperCard utilities to aid phylogenetic analysis of molecules.

PubMed

Eernisse, D J

1992-04-01

DNA Translator and Aligner are molecular phylogenetics HyperCard stacks for Macintosh computers. They manipulate sequence data to provide graphical gene mapping, conversions, translations and manual multiple-sequence alignment editing. DNA Translator is able to convert documented GenBank or EMBL documented sequences into linearized, rescalable gene maps whose gene sequences are extractable by clicking on the corresponding map button or by selection from a scrolling list. Provided gene maps, complete with extractable sequences, consist of nine metazoan, one yeast, and one ciliate mitochondrial DNAs and three green plant chloroplast DNAs. Single or multiple sequences can be manipulated to aid in phylogenetic analysis. Sequences can be translated between nucleic acids and proteins in either direction with flexible support of alternate genetic codes and ambiguous nucleotide symbols. Multiple aligned sequence output from diverse sources can be converted to Nexus, Hennig86 or PHYLIP format for subsequent phylogenetic analysis. Input or output alignments can be examined with Aligner, a convenient accessory stack included in the DNA Translator package. Aligner is an editor for the manual alignment of up to 100 sequences that toggles between display of matched characters and normal unmatched sequences. DNA Translator also generates graphic displays of amino acid coding and codon usage frequency relative to all other, or only synonymous, codons for approximately 70 select organism-organelle combinations. Codon usage data is compatible with spreadsheet or UWGCG formats for incorporation of additional molecules of interest. The complete package is available via anonymous ftp and is free for non-commercial uses.
Towards a molecular logic machine

NASA Astrophysics Data System (ADS)

Remacle, F.; Levine, R. D.

2001-06-01

Finite state logic machines can be realized by pump-probe spectroscopic experiments on an isolated molecule. The most elaborate setup, a Turing machine, can be programmed to carry out a specific computation. We argue that a molecule can be similarly programmed, and provide examples using two photon spectroscopies. The states of the molecule serve as the possible states of the head of the Turing machine and the physics of the problem determines the possible instructions of the program. The tape is written in an alphabet that allows the listing of the different pump and probe signals that are applied in a given experiment. Different experiments using the same set of molecular levels correspond to different tapes that can be read and processed by the same head and program. The analogy to a Turing machine is not a mechanical one and is not completely molecular because the tape is not part of the molecular machine. We therefore also discuss molecular finite state machines, such as sequential devices, for which the tape is not part of the machine. Nonmolecular tapes allow for quite long input sequences with a rich alphabet (at the level of 7 bits) and laser pulse shaping experiments provide concrete examples. Single molecule spectroscopies show that a single molecule can be repeatedly cycled through a logical operation.
Profiling of Oral Microbiota in Early Childhood Caries Using Single-Molecule Real-Time Sequencing

PubMed Central

Wang, Yuan; Zhang, Jie; Chen, Xi; Jiang, Wen; Wang, Sa; Xu, Lei; Tu, Yan; Zheng, Pei; Wang, Ying; Lin, Xiaolong; Chen, Hui

2017-01-01

Background: Alterations of oral microbiota are the main cause of the progression of caries. The goal of this study was to characterize the oral microbiota in childhood caries based on single-molecule real-time sequencing. Methods: A total of 21 preschoolers, aged 3–5 years old with severe early childhood caries, and 20 age-matched, caries-free children as controls were recruited. Saliva samples were collected, followed by DNA extraction, Pacbio sequencing, and phylogenetic analyses of the oral microbial communities. Results: Eight hundred and seventy six species derived from 13 known bacterial phyla and 110 genera were detected from 41 children using Pacbio sequencing. At the species level, 38 species, including Veillonella spp., Streptococcus spp., Prevotella spp., and Lactobacillus spp., showed higher abundance in the caries group compared to the caries-free group (p < 0.05). The core microbiota at the genus and species levels was more stable in the caries-free micro-ecological niche. At follow-up, oral examinations 6 months after sample collection, development of new dental caries was observed in 5 children (the transitional group) among the 21 caries free children. Compared with the caries-free children, in the transitional and caries groups, 6 species, which were more abundant in the caries-free group, exhibited a relatively low abundance in both the caries group and the transitional group (p < 0.05). We conclude that Abiotrophia spp., Neisseria spp., and Veillonella spp., might be associated with healthy oral microbial ecosystem. Prevotella spp., Lactobacillus spp., Dialister spp., and Filifactor spp. may be related to the pathogenesis and progression of dental caries. PMID:29187843
Profiling of Oral Microbiota in Early Childhood Caries Using Single-Molecule Real-Time Sequencing.

PubMed

Wang, Yuan; Zhang, Jie; Chen, Xi; Jiang, Wen; Wang, Sa; Xu, Lei; Tu, Yan; Zheng, Pei; Wang, Ying; Lin, Xiaolong; Chen, Hui

2017-01-01

Background: Alterations of oral microbiota are the main cause of the progression of caries. The goal of this study was to characterize the oral microbiota in childhood caries based on single-molecule real-time sequencing. Methods: A total of 21 preschoolers, aged 3-5 years old with severe early childhood caries, and 20 age-matched, caries-free children as controls were recruited. Saliva samples were collected, followed by DNA extraction, Pacbio sequencing, and phylogenetic analyses of the oral microbial communities. Results: Eight hundred and seventy six species derived from 13 known bacterial phyla and 110 genera were detected from 41 children using Pacbio sequencing. At the species level, 38 species, including Veillonella spp., Streptococcus spp., Prevotella spp., and Lactobacillus spp., showed higher abundance in the caries group compared to the caries-free group ( p < 0.05). The core microbiota at the genus and species levels was more stable in the caries-free micro-ecological niche. At follow-up, oral examinations 6 months after sample collection, development of new dental caries was observed in 5 children (the transitional group) among the 21 caries free children. Compared with the caries-free children, in the transitional and caries groups, 6 species, which were more abundant in the caries-free group, exhibited a relatively low abundance in both the caries group and the transitional group ( p < 0.05). We conclude that Abiotrophia spp., Neisseria spp., and Veillonella spp., might be associated with healthy oral microbial ecosystem. Prevotella spp., Lactobacillus spp., Dialister spp., and Filifactor spp. may be related to the pathogenesis and progression of dental caries.
T-cell receptor repertoire of human peripheral CD161hiTRAV1-2+ MAIT cells revealed by next generation sequencing and single cell analysis.

PubMed

Held, Kathrin; Beltrán, Eduardo; Moser, Markus; Hohlfeld, Reinhard; Dornmair, Klaus

2015-09-01

Mucosal-associated invariant T (MAIT) cells are a T-cell subset that expresses a conserved TRAV1-2 (Vα7.2) T-cell receptor (TCR) chain and the surface marker CD161. They are involved in the defence against microbes as they recognise small organic molecules of microbial origin that are presented by the non-classical MHC molecule 1 (MR1). MAIT cells express a semi-restricted TCR α chain with TRAV1-2 preferentially linked to TRAJ33, TRAJ12, or TRAJ20 which pairs with a limited set of β chains. To investigate the TCR repertoire of human CD161(hi)TRAV1-2(+) T cells in depth we analysed the α and β chains of this T-cell subset by next generation sequencing. Concomitantly we analysed 132 paired α and β chains from single cells to assess the αβ pairing preferences. We found that the CD161(hi)TRAV1-2(+) TCR repertoire in addition to the typical MAIT TCRs further contains polyclonal elements reminiscent of classical αβ T cells. Copyright © 2015 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
Evolution of Sphingomonad Gene Clusters Related to Pesticide Catabolism Revealed by Genome Sequence and Mobilomics of Sphingobium herbicidovorans MH

PubMed Central

Nielsen, Tue Kjærgaard; Rasmussen, Morten; Demanèche, Sandrine; Cecillon, Sébastien; Vogel, Timothy M.

2017-01-01

Abstract Bacterial degraders of chlorophenoxy herbicides have been isolated from various ecosystems, including pristine environments. Among these degraders, the sphingomonads constitute a prominent group that displays versatile xenobiotic-degradation capabilities. Four separate sequencing strategies were required to provide the complete sequence of the complex and plastic genome of the canonical chlorophenoxy herbicide-degrading Sphingobium herbicidovorans MH. The genome has an intricate organization of the chlorophenoxy-herbicide catabolic genes sdpA, rdpA, and cadABCD that encode the (R)- and (S)-enantiomer-specific 2,4-dichlorophenoxypropionate dioxygenases and four subunits of a Rieske non-heme iron oxygenase involved in 2-methyl-chlorophenoxyacetic acid degradation, respectively. Several major genomic rearrangements are proposed to help understand the evolution and mobility of these important genes and their genetic context. Single-strain mobilomic sequence analysis uncovered plasmids and insertion sequence-associated circular intermediates in this environmentally important bacterium and enabled the description of evolutionary models for pesticide degradation in strain MH and related organisms. The mobilome presented a complex mosaic of mobile genetic elements including four plasmids and several circular intermediate DNA molecules of insertion-sequence elements and transposons that are central to the evolution of xenobiotics degradation. Furthermore, two individual chromosomally integrated prophages were shown to excise and form free circular DNA molecules. This approach holds great potential for improving the understanding of genome plasticity, evolution, and microbial ecology. PMID:28961970
Drug-DNA interactions at single molecule level: A view with optical tweezers

NASA Astrophysics Data System (ADS)

Paramanathan, Thayaparan

Studies of small molecule--DNA interactions are essential for developing new drugs for challenging diseases like cancer and HIV. The main idea behind developing these molecules is to target and inhibit the reproduction of the tumor cells and infected cells. We mechanically manipulate single DNA molecule using optical tweezers to investigate two molecules that have complex and multiple binding modes. Mononuclear ruthenium complexes have been extensively studied as a test for rational drug design. Potential drug candidates should have high affinity to DNA and slow dissociation kinetics. To achieve this, motifs of the ruthenium complexes are altered. Our collaborators designed a dumb-bell shaped binuclear ruthenium complex that can only intercalate DNA by threading through its bases. Studying the binding properties of this complex in bulk studies took hours. By mechanically manipulating a single DNA molecule held with optical tweezers, we lower the barrier to thread and make it fast compared to the bulk experiments. Stretching single DNA molecules with different concentration of drug molecules and holding it at a constant force allows the binding to reach equilibrium. By this we can obtain the equilibrium fractional ligand binding and length of DNA at saturated binding. Fitting these results yields quantitative measurements of the binding thermodynamics and kinetics of this complex process. The second complex discussed in this study is Actinomycin D (ActD), a well studied anti-cancer agent that is used as a prototype for developing new generations of drugs. However, the biophysical basis of its activity is still unclear. Because ActD is known to intercalate double stranded DNA (dsDNA), it was assumed to block replication by stabilizing dsDNA in front of the replication fork. However, recent studies have shown that ActD binds with even higher affinity to imperfect duplexes and some sequences of single stranded DNA (ssDNA). We directly measure the on and off rates by stretching the DNA molecule to a certain force and holding it at constant force while adding the drug and then while washing off the drug. Our finding resolves the long lasting controversy of ActD binding modes, clearly showing that both the dsDNA binding and ssDNA binding converge to the same single mode. The result supports the hypothesis that the primary characteristic of ActD that contributes to its biological activity is its ability to inhibit cellular replication by binding to transcription bubbles and causing cell death.
Development of Solid-State Nanopore Technology for Life Detection

NASA Technical Reports Server (NTRS)

Bywaters, K. B.; Schmidt, H.; Vercoutere, W.; Deamer, D.; Hawkins, A. R.; Quinn, R. C.; Burton, A. S.; Mckay, C. P.

2017-01-01

Biomarkers for life on Earth are an important starting point to guide the search for life elsewhere. However, the search for life beyond Earth should incorporate technologies capable of recognizing an array of potential biomarkers beyond what we see on Earth, in order to minimize the risk of false negatives from life detection missions. With this in mind, charged linear polymers may be a universal signature for life, due to their ability to store information while also inherently reducing the tendency of complex tertiary structure formation that significantly inhibit replication. Thus, these molecules are attractive targets for biosignature detection as potential "self-sustaining chemical signatures." Examples of charged linear polymers, or polyelectrolytes, include deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) as well as synthetic polyelectrolytes that could potentially support life, including threose nucleic acid (TNA) and other xenonucleic acids (XNAs). Nanopore analysis is a novel technology that has been developed for singlemolecule sequencing with exquisite single nucleotide resolution which is also well-suited for analysis of polyelectrolyte molecules. Nanopore analysis has the ability to detect repeating sequences of electrical charges in organic linear polymers, and it is not molecule- specific (i.e. it is not restricted to only DNA or RNA). In this sense, it is a better life detection technique than approaches that are based on specific molecules, such as the polymerase chain reaction (PCR), which requires that the molecule being detected be composed of DNA.
Spontaneous Transport of Single-Stranded DNA through Graphene-MoS2 Heterostructure Nanopores.

PubMed

Luan, Binquan; Zhou, Ruhong

2018-04-24

The effective transport of a single-stranded DNA (ssDNA) molecule through a solid-state nanopore is essential to the future success of high-throughput and low-cost DNA sequencing. Compatible with current electric sensing technologies, here, we propose and demonstrate by molecular dynamics simulations the ssDNA transport through a quasi-two-dimensional nanopore in a heterostructure stacked together with different 2D materials, such as graphene and molybdenum disulfide (MoS 2 ). Due to different chemical potentials, U, of DNA bases on different 2D materials, it is energetically favorable for a ssDNA molecule to move from the low- U MoS 2 surface to the high- U graphene surface through a nanopore. With the proper attraction between the negatively charged phosphate group in each nucleotide and the positively charged Mo atoms exposed on the pore surface, the ssDNA molecule can be temporarily seized and released thereafter through a thermal activation, that is, a slow and possible nucleotide-by-nucleotide transport. A theoretical formulation is then developed for the free energy of the ssDNA transiting a heterostructure nanopore to properly characterize the non-equilibrium stick-slip-like motion of a ssDNA molecule.
Temporal and spatial regulation of mRNA export: Single particle RNA-imaging provides new tools and insights

PubMed Central

Heinrich, Stephanie; Derrer, Carina Patrizia; Lari, Azra; Weis, Karsten; Montpetit, Ben

2017-01-01

The transport of messenger RNAs (mRNAs) from the nucleus to cytoplasm is an essential step in the gene expression program of all eukaryotes. Recent technological advances in the areas of RNA-labeling, microscopy, and sequencing are leading to novel insights about mRNA biogenesis and export. This includes quantitative single molecule imaging (SMI) of RNA molecules in live cells, which is providing knowledge of the spatial and temporal dynamics of the export process. As this information becomes available, it leads to new questions, the reinterpretation of previous findings, and revised models of mRNA export. In this review, we will briefly highlight some of these recent findings and discuss how live cell SMI approaches may be used to further our current understanding of mRNA export and gene expression. PMID:28052353
Applications of biological pores in nanomedicine, sensing, and nanoelectronics

PubMed Central

Majd, Sheereen; Yusko, Erik C; Billeh, Yazan N; Macrae, Michael X; Yang, Jerry; Mayer, Michael

2011-01-01

Biological protein pores and pore-forming peptides can generate a pathway for the flux of ions and other charged or polar molecules across cellular membranes. In nature, these nanopores have diverse and essential functions that range from maintaining cell homeostasis and participating in cell signaling to activating or killing cells. The combination of the nanoscale dimensions and sophisticated – often regulated – functionality of these biological pores make them particularly attractive for the growing field of nanobiotechnology. Applications range from single-molecule sensing to drug delivery and targeted killing of malignant cells. Potential future applications may include the use of nanopores for single strand DNA sequencing and for generating bio-inspired, and possibly, biocompatible visual detection systems and batteries. This article reviews the current state of applications of pore-forming peptides and proteins in nanomedicine, sensing, and nanoelectronics. PMID:20561776
Identifying single bases in a DNA oligomer with electron tunnelling.

PubMed

Huang, Shuo; He, Jin; Chang, Shuai; Zhang, Peiming; Liang, Feng; Li, Shengqin; Tuchband, Michael; Fuhrmann, Alexander; Ros, Robert; Lindsay, Stuart

2010-12-01

It has been proposed that single molecules of DNA could be sequenced by measuring the physical properties of the bases as they pass through a nanopore. Theoretical calculations suggest that electron tunnelling can identify bases in single-stranded DNA without enzymatic processing, and it was recently experimentally shown that tunnelling can sense individual nucleotides and nucleosides. Here, we report that tunnelling electrodes functionalized with recognition reagents can identify a single base flanked by other bases in short DNA oligomers. The residence time of a single base in a recognition junction is on the order of a second, but pulling the DNA through the junction with a force of tens of piconewtons would yield reading speeds of tens of bases per second.
Lessons from single-cell transcriptome analysis of oxygen-sensing cells.

PubMed

Zhou, Ting; Matsunami, Hiroaki

2018-05-01

The advent of single-cell RNA-sequencing (RNA-Seq) technology has enabled transcriptome profiling of individual cells. Comprehensive gene expression analysis at the single-cell level has proven to be effective in characterizing the most fundamental aspects of cellular function and identity. This unbiased approach is revolutionary for small and/or heterogeneous tissues like oxygen-sensing cells in identifying key molecules. Here, we review the major methods of current single-cell RNA-Seq technology. We discuss how this technology has advanced the understanding of oxygen-sensing glomus cells in the carotid body and helped uncover novel oxygen-sensing cells and mechanisms in the mice olfactory system. We conclude by providing our perspective on future single-cell RNA-Seq research directed at oxygen-sensing cells.
Complete genome sequence of Pelosinus sp. strain UFO1 assembled using single-molecule real-time DNA sequencing technology

DOE PAGES

Brown, Steven D.; Utturkar, Sagar M.; Magnuson, Timothy S.; ...

2014-09-04

Pelosinus fermentans strain R7 was isolated from Russian kaolin clays as the type strain and it can reduce Fe(III) during fermentative growth (1). Draft genome sequences for P. fermentans R7 and four strains from Hanford, Washington, USA, have been published (2–4). The P. fermentans 16S rRNA sequence dominated the lactate-based enrichment cultures from three geochemically contrasting soils from the Melton Branch Watershed, Oak Ridge, Tennessee, USA (5) and also at another stimulated, uraniumcontaminated field site near Oak Ridge (6). For the current work, strain UFO1 was isolated from pristine sediments at a background field site in Oak Ridge and characterizedmore » as facilitating U(VI) reduction and precipitation with phosphate (7).« less
Switchable DNA interfaces for the highly sensitive detection of label-free DNA targets.

PubMed

Rant, Ulrich; Arinaga, Kenji; Scherer, Simon; Pringsheim, Erika; Fujita, Shozo; Yokoyama, Naoki; Tornow, Marc; Abstreiter, Gerhard

2007-10-30

We report a method to detect label-free oligonucleotide targets. The conformation of surface-tethered probe nucleic acids is modulated by alternating electric fields, which cause the molecules to extend away from or fold onto the biased surface. Binding (hybridization) of targets to the single-stranded probes results in a pronounced enhancement of the layer-height modulation amplitude, monitored optically in real time. The method features an exceptional detection limit of <3 x 10(8) bound targets per cm(2) sensor area. Single base-pair mismatches in the sequences of DNA complements may readily be identified; moreover, binding kinetics and binding affinities can be determined with high accuracy. When driving the DNA to oscillate at frequencies in the kHz regime, distinct switching kinetics are revealed for single- and double-stranded DNA. Molecular dynamics are used to identify the binding state of molecules according to their characteristic kinetic fingerprints by using a chip-compatible detection format.
Switchable DNA interfaces for the highly sensitive detection of label-free DNA targets

PubMed Central

Rant, Ulrich; Arinaga, Kenji; Scherer, Simon; Pringsheim, Erika; Fujita, Shozo; Yokoyama, Naoki; Tornow, Marc; Abstreiter, Gerhard

2007-01-01

We report a method to detect label-free oligonucleotide targets. The conformation of surface-tethered probe nucleic acids is modulated by alternating electric fields, which cause the molecules to extend away from or fold onto the biased surface. Binding (hybridization) of targets to the single-stranded probes results in a pronounced enhancement of the layer-height modulation amplitude, monitored optically in real time. The method features an exceptional detection limit of <3 × 108 bound targets per cm2 sensor area. Single base-pair mismatches in the sequences of DNA complements may readily be identified; moreover, binding kinetics and binding affinities can be determined with high accuracy. When driving the DNA to oscillate at frequencies in the kHz regime, distinct switching kinetics are revealed for single- and double-stranded DNA. Molecular dynamics are used to identify the binding state of molecules according to their characteristic kinetic fingerprints by using a chip-compatible detection format. PMID:17951434
Topological events in single molecules of E. coli DNA confined in nanochannels

PubMed Central

Reifenberger, Jeffrey G.; Dorfman, Kevin D.; Cao, Han

2015-01-01

We present experimental data concerning potential topological events such as folds, internal backfolds, and/or knots within long molecules of double-stranded DNA when they are stretched by confinement in a nanochannel. Genomic DNA from E. coli was labeled near the ‘GCTCTTC’ sequence with a fluorescently labeled dUTP analog and stained with the DNA intercalator YOYO. Individual long molecules of DNA were then linearized and imaged using methods based on the NanoChannel Array technology (Irys® System) available from BioNano Genomics. Data were collected on 189,153 molecules of length greater than 50 kilobases. A custom code was developed to search for abnormal intensity spikes in the YOYO backbone profile along the length of individual molecules. By correlating the YOYO intensity spikes with the aligned barcode pattern to the reference, we were able to correlate the bright intensity regions of YOYO with abnormal stretching in the molecule, which suggests these events were either a knot or a region of internal backfolding within the DNA. We interpret the results of our experiments involving molecules exceeding 50 kilobases in the context of existing simulation data for relatively short DNA, typically several kilobases. The frequency of these events is lower than the predictions from simulations, while the size of the events is larger than simulation predictions and often exceeds the molecular weight of the simulated molecules. We also identified DNA molecules that exhibit large, single folds as they enter the nanochannels. Overall, topological events occur at a low frequency (~7% of all molecules) and pose an easily surmountable obstacle for the practice of genome mapping in nanochannels. PMID:25991508
Method and apparatus for enhanced sequencing of complex molecules using surface-induced dissociation in conjunction with mass spectrometric analysis

DOEpatents

Laskin, Julia [Richland, WA; Futrell, Jean H [Richland, WA

2008-04-29

The invention relates to a method and apparatus for enhanced sequencing of complex molecules using surface-induced dissociation (SID) in conjunction with mass spectrometric analysis. Results demonstrate formation of a wide distribution of structure-specific fragments having wide sequence coverage useful for sequencing and identifying the complex molecules.

Selection and Characterization of Single Stranded DNA Aptamers for the Hormone Abscisic Acid

PubMed Central

Gonzalez, Victor M.; Millo, Enrico; Sturla, Laura; Vigliarolo, Tiziana; Bagnasco, Luca; Guida, Lucrezia; D'Arrigo, Cristina; De Flora, Antonio; Salis, Annalisa; Martin, Elena M.; Bellotti, Marta; Zocchi, Elena

2013-01-01

The hormone abscisic acid (ABA) is a small molecule involved in pivotal physiological functions in higher plants. Recently, ABA has been also identified as an endogenous hormone in mammals, regulating different cell functions including inflammatory processes, stem cell expansion, insulin release, and glucose uptake. Aptamers are short, single-stranded (ss) oligonucleotidesable to recognize target molecules with high affinity. The small size of the ABA molecule represented a challenge for aptamer development and the aim of this study was to develop specific anti-ABA DNA aptamers. Biotinylated abscisic acid (bio-ABA) was immobilized on streptavidin-coated magnetic beads. DNA aptamers against bio-ABA were selected with 7 iterative rounds of the systematic evolution of ligands by exponential enrichment method (SELEX), each round comprising incubation of the ABA-binding beads with the ssDNA sequences, DNA elution, electrophoresis, and polymerase chain reaction (PCR) amplification. The PCR product was cloned and sequenced. The binding affinity of several clones was determined using bio-ABA immobilized on streptavidin-coated plates. Aptamer 2 and aptamer 9 showed the highest binding affinity, with dissociation constants values of 0.98±0.14 μM and 0.80±0.07 μM, respectively. Aptamers 2 and 9 were also able to bind free, unmodified ABA and to discriminate between different ABA enantiomers and isomers. Our findings indicate that ssDNA aptamers can selectively bind ABA and could be used for the development of ABA quantitation assays. PMID:23971905
Nanostructured Tip-Shaped Biosensors: Application of Six Sigma Approach for Enhanced Manufacturing

PubMed Central

Kahng, Seong-Joong; Kim, Jong-Hoon; Chung, Jae-Hyun

2016-01-01

Nanostructured tip-shaped biosensors have drawn attention for biomolecule detection as they are promising for highly sensitive and specific detection of a target analyte. Using a nanostructured tip, the sensitivity is increased to identify individual molecules because of the high aspect ratio structure. Various detection methods, such as electrochemistry, fluorescence microcopy, and Raman spectroscopy, have been attempted to enhance the sensitivity and the specificity. Due to the confined path of electrons, electrochemical measurement using a nanotip enables the detection of single molecules. When an electric field is combined with capillary action and fluid flow, target molecules can be effectively concentrated onto a nanotip surface for detection. To enhance the concentration efficacy, a dendritic nanotip rather than a single tip could be used to detect target analytes, such as nanoparticles, cells, and DNA. However, reproducible fabrication with relation to specific detection remains a challenge due to the instability of a manufacturing method, resulting in inconsistent shape. In this paper, nanostructured biosensors are reviewed with our experimental results using dendritic nanotips for sequence specific detection of DNA. By the aid of the Six Sigma approach, the fabrication yield of dendritic nanotips increases from 20.0% to 86.6%. Using the nanotips, DNA is concentrated and detected in a sequence specific way with the detection limit equivalent to 1000 CFU/mL. The pros and cons of a nanotip biosensor are evaluated in conjunction with future prospects. PMID:28025540
SAbPred: a structure-based antibody prediction server

PubMed Central

Dunbar, James; Krawczyk, Konrad; Leem, Jinwoo; Marks, Claire; Nowak, Jaroslaw; Regep, Cristian; Georges, Guy; Kelm, Sebastian; Popovic, Bojana; Deane, Charlotte M.

2016-01-01

SAbPred is a server that makes predictions of the properties of antibodies focusing on their structures. Antibody informatics tools can help improve our understanding of immune responses to disease and aid in the design and engineering of therapeutic molecules. SAbPred is a single platform containing multiple applications which can: number and align sequences; automatically generate antibody variable fragment homology models; annotate such models with estimated accuracy alongside sequence and structural properties including potential developability issues; predict paratope residues; and predict epitope patches on protein antigens. The server is available at http://opig.stats.ox.ac.uk/webapps/sabpred. PMID:27131379
Materials and methods for stabilizing nanoparticles in salt solutions

DOEpatents

Robinson, David Bruce; Zuckermann, Ronald; Buffleben, George M.

2013-06-11

Sequence-specific polymers are proving to be a powerful approach to assembly and manipulation of matter on the nanometer scale. Ligands that are peptoids, or sequence-specific N-functional glycine oligomers, allow precise and flexible control over the arrangement of binding groups, steric spacers, charge, and other functionality. We have synthesized short peptoids that can prevent the aggregation of gold nanoparticles in high-salt environments including divalent salt, and allow co-adsorption of a single DNA molecule. This degree of precision and versatility is likely to prove essential in bottom-up assembly of nanostructures and in biomedical applications of nanomaterials.
miRanalyzer: a microRNA detection and analysis tool for next-generation sequencing experiments.

PubMed

Hackenberg, Michael; Sturm, Martin; Langenberger, David; Falcón-Pérez, Juan Manuel; Aransay, Ana M

2009-07-01

Next-generation sequencing allows now the sequencing of small RNA molecules and the estimation of their expression levels. Consequently, there will be a high demand of bioinformatics tools to cope with the several gigabytes of sequence data generated in each single deep-sequencing experiment. Given this scene, we developed miRanalyzer, a web server tool for the analysis of deep-sequencing experiments for small RNAs. The web server tool requires a simple input file containing a list of unique reads and its copy numbers (expression levels). Using these data, miRanalyzer (i) detects all known microRNA sequences annotated in miRBase, (ii) finds all perfect matches against other libraries of transcribed sequences and (iii) predicts new microRNAs. The prediction of new microRNAs is an especially important point as there are many species with very few known microRNAs. Therefore, we implemented a highly accurate machine learning algorithm for the prediction of new microRNAs that reaches AUC values of 97.9% and recall values of up to 75% on unseen data. The web tool summarizes all the described steps in a single output page, which provides a comprehensive overview of the analysis, adding links to more detailed output pages for each analysis module. miRanalyzer is available at http://web.bioinformatics.cicbiogune.es/microRNA/.
Biological nanopore MspA for DNA sequencing

NASA Astrophysics Data System (ADS)

Manrao, Elizabeth A.

Unlocking the information hidden in the human genome provides insight into the inner workings of complex biological systems and can be used to greatly improve health-care. In order to allow for widespread sequencing, new technologies are required that provide fast and inexpensive readings of DNA. Nanopore sequencing is a third generation DNA sequencing technology that is currently being developed to fulfill this need. In nanopore sequencing, a voltage is applied across a small pore in an electrolyte solution and the resulting ionic current is recorded. When DNA passes through the channel, the ionic current is partially blocked. If the DNA bases uniquely modulate the ionic current flowing through the channel, the time trace of the current can be related to the sequence of DNA passing through the pore. There are two main challenges to realizing nanopore sequencing: identifying a pore with sensitivity to single nucleotides and controlling the translocation of DNA through the pore so that the small single nucleotide current signatures are distinguishable from background noise. In this dissertation, I explore the use of Mycobacterium smegmatis porin A (MspA) for nanopore sequencing. In order to determine MspA's sensitivity to single nucleotides, DNA strands of various compositions are held in the pore as the resulting ionic current is measured. DNA is immobilized in MspA by attaching it to a large molecule which acts as an anchor. This technique confirms the single nucleotide resolution of the pore and additionally shows that MspA is sensitive to epigenetic modifications and single nucleotide polymorphisms. The forces from the electric field within MspA, the effective charge of nucleotides, and elasticity of DNA are estimated using a Freely Jointed Chain model of single stranded DNA. These results offer insight into the interactions of DNA within the pore. With the nucleotide sensitivity of MspA confirmed, a method is introduced to controllably pass DNA through the pore. Using a DNA polymerase, DNA strands are stepped through MspA one nucleotide at a time. The steps are observable as distinct levels on the ionic-current time-trace and are related to the DNA sequence. These experiments overcome the two fundamental challenges to realizing MspA nanopore sequencing and pave the way to the development of a commercial technology.
Distinguishing between Protein Dynamics and Dye Photophysics in Single-Molecule FRET Experiments

PubMed Central

Chung, Hoi Sung; Louis, John M.; Eaton, William A.

2010-01-01

Abstract Förster resonance energy transfer (FRET) efficiency distributions in single-molecule experiments contain both structural and dynamical information. Extraction of this information from these distributions requires a careful analysis of contributions from dye photophysics. To investigate how mechanisms other than FRET affect the distributions obtained by counting donor and acceptor photons, we have measured single-molecule fluorescence trajectories of a small α/β protein, i.e., protein GB1, undergoing two-state, folding/unfolding transitions. Alexa 488 donor and Alexa 594 acceptor dyes were attached to cysteines at positions 10 and 57 to yield two isomers—donor10/acceptor57 and donor57/acceptor10—which could not be separated in the purification. The protein was immobilized via binding of a histidine tag added to a linker sequence at the N-terminus to cupric ions embedded in a polyethylene-glycol–coated glass surface. The distribution of FRET efficiencies assembled from the trajectories is complex with widths for the individual peaks in large excess of that caused by shot noise. Most of this complexity can be explained by two interfering photophysical effects—a photoinduced red shift of the donor dye and differences in the quantum yield of the acceptor dye for the two isomers resulting from differences in quenching rate by the cupric ion. Measurements of steady-state polarization, calculation of the donor-acceptor cross-correlation function from photon trajectories, and comparison of the single molecule and ensemble kinetics all indicate that conformational distributions and dynamics do not contribute to the complexity. PMID:20159166
Distinguishing between protein dynamics and dye photophysics in single-molecule FRET experiments.

PubMed

Chung, Hoi Sung; Louis, John M; Eaton, William A

2010-02-17

Förster resonance energy transfer (FRET) efficiency distributions in single-molecule experiments contain both structural and dynamical information. Extraction of this information from these distributions requires a careful analysis of contributions from dye photophysics. To investigate how mechanisms other than FRET affect the distributions obtained by counting donor and acceptor photons, we have measured single-molecule fluorescence trajectories of a small alpha/beta protein, i.e., protein GB1, undergoing two-state, folding/unfolding transitions. Alexa 488 donor and Alexa 594 acceptor dyes were attached to cysteines at positions 10 and 57 to yield two isomers-donor(10)/acceptor(57) and donor(57)/acceptor(10)-which could not be separated in the purification. The protein was immobilized via binding of a histidine tag added to a linker sequence at the N-terminus to cupric ions embedded in a polyethylene-glycol-coated glass surface. The distribution of FRET efficiencies assembled from the trajectories is complex with widths for the individual peaks in large excess of that caused by shot noise. Most of this complexity can be explained by two interfering photophysical effects-a photoinduced red shift of the donor dye and differences in the quantum yield of the acceptor dye for the two isomers resulting from differences in quenching rate by the cupric ion. Measurements of steady-state polarization, calculation of the donor-acceptor cross-correlation function from photon trajectories, and comparison of the single molecule and ensemble kinetics all indicate that conformational distributions and dynamics do not contribute to the complexity. Copyright 2010 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Laser microtreatment for genetic manipulations and DNA diagnostics by a combination of microbeam and photonic tweezers (laser microbeam trap)

NASA Astrophysics Data System (ADS)

Greulich, Karl-Otto; Monajembashi, Shamci; Celeda, D.; Endlich, N.; Eickhoff, Holger; Hoyer, Carsten; Leitz, G.; Weber, Gerd; Scheef, J.; Rueterjans, H.

1994-12-01

Genomes of higher organisms are larger than one typically expects. For example, the DNA of a single human cell is almost two meters long, the DNA in the human body covers the distance Earth-Sun approximately 140 times. This is often not considered in typical molecular biological approaches for DNA diagnostics, where usually only DNA of the length of a gene is investigated. Also, one basic aspect of sequencing the human genome is not really solved: the problem how to prepare the huge amounts of DNA required. Approaches from biomedical optics combined with new developments in single molecule biotechnology may at least contribute some parts of the puzzle. A large genome can be partitioned into portions comprising approximately 1% of the whole DNA using a laser microbeam. The single DNA fragment can be amplified by the polymerase chain reaction in order to obtain a sufficient amount of molecules for conventional DNA diagnostics or for analysis by octanucleotide hybridization. When not amplified by biotechnological processes, the individual DNA molecule can be visualized in the light microscope and can be manipulated and dissected with the laser microbeam trap. The DNA probes obtained by single molecule biotechnology can be employed for fluorescence in situ introduced into plant cells and subcellular structures even when other techniques fail. Since the laser microbeam trap allows to work in the interior of a cell without opening it, subcellular structures can be manipulated. For example, in algae, such structures can be moved out of their original position and used to study intracellular viscosities.
Enzymatic repair of selected cross-linked homoduplex molecules enhances nuclear gene rescue from Pompeii and Herculaneum remains.

PubMed

Di Bernardo, Giovanni; Del Gaudio, Stefania; Cammarota, Marcella; Galderisi, Umberto; Cascino, Antonino; Cipollaro, Marilena

2002-02-15

Ancient DNA (aDNA) samples extracted from the bone remains of six equids buried by the Vesuvius eruption in 79 AD were investigated to test pre-amplification and enzymatic repair procedures designed to enhance the rescue of nuclear genes. The extracts, which proved all positive for Equidae mtDNA amplification, proved positive only four times out of 18 when tested for single-copy Equidae nuclear genes (epsilon globin, p53 and gamma interferon). Pre-amplification did not change the number of retrieved aDNA sequences but 10 times out of 14 enzymatic repair restored the amplifiability of the genes analysed, proving that repair increases the rate of successful rescue from 22 to alpha(lambda)mu(omicron)sigma(tau) 80%. These findings support the hypothesis that some of these cross-linked aDNA molecules, which are not completely separated when DNA is extracted under denaturing conditions, become homoduplex substrates for Pol I and/or T4 ligase action upon renaturation. aDNA authenticity is proved by the homology of the nucleotide sequences of loci tested to the corresponding modern Equidae sequences. Data also indicate that cross-linked homoduplex molecules selected by denaturation of the extract are repaired without any chimera formation. The general features of aDNA amplification with and without denaturation and enzymatic repair are discussed.
Enzymatic repair of selected cross-linked homoduplex molecules enhances nuclear gene rescue from Pompeii and Herculaneum remains

PubMed Central

Di Bernardo, Giovanni; Del Gaudio, Stefania; Cammarota, Marcella; Galderisi, Umberto; Cascino, Antonino; Cipollaro, Marilena

2002-01-01

Ancient DNA (aDNA) samples extracted from the bone remains of six equids buried by the Vesuvius eruption in 79 AD were investigated to test pre-amplification and enzymatic repair procedures designed to enhance the rescue of nuclear genes. The extracts, which proved all positive for Equidae mtDNA amplification, proved positive only four times out of 18 when tested for single-copy Equidae nuclear genes (ɛ globin, p53 and γ interferon). Pre-amplification did not change the number of retrieved aDNA sequences but 10 times out of 14 enzymatic repair restored the amplifiability of the genes analysed, proving that repair increases the rate of successful rescue from 22 to αλµοστ 80%. These findings support the hypothesis that some of these cross-linked aDNA molecules, which are not completely separated when DNA is extracted under denaturing conditions, become homoduplex substrates for Pol I and/or T4 ligase action upon renaturation. aDNA authenticity is proved by the homology of the nucleotide sequences of loci tested to the corresponding modern Equidae sequences. Data also indicate that cross-linked homoduplex molecules selected by denaturation of the extract are repaired without any chimera formation. The general features of aDNA amplification with and without denaturation and enzymatic repair are discussed. PMID:11842122
Biotechnological mass production of DNA origami

NASA Astrophysics Data System (ADS)

Praetorius, Florian; Kick, Benjamin; Behler, Karl L.; Honemann, Maximilian N.; Weuster-Botz, Dirk; Dietz, Hendrik

2017-12-01

DNA nanotechnology, in particular DNA origami, enables the bottom-up self-assembly of micrometre-scale, three-dimensional structures with nanometre-precise features. These structures are customizable in that they can be site-specifically functionalized or constructed to exhibit machine-like or logic-gating behaviour. Their use has been limited to applications that require only small amounts of material (of the order of micrograms), owing to the limitations of current production methods. But many proposed applications, for example as therapeutic agents or in complex materials, could be realized if more material could be used. In DNA origami, a nanostructure is assembled from a very long single-stranded scaffold molecule held in place by many short single-stranded staple oligonucleotides. Only the bacteriophage-derived scaffold molecules are amenable to scalable and efficient mass production; the shorter staple strands are obtained through costly solid-phase synthesis or enzymatic processes. Here we show that single strands of DNA of virtually arbitrary length and with virtually arbitrary sequences can be produced in a scalable and cost-efficient manner by using bacteriophages to generate single-stranded precursor DNA that contains target strand sequences interleaved with self-excising ‘cassettes’, with each cassette comprising two Zn2+-dependent DNA-cleaving DNA enzymes. We produce all of the necessary single strands of DNA for several DNA origami using shaker-flask cultures, and demonstrate end-to-end production of macroscopic amounts of a DNA origami nanorod in a litre-scale stirred-tank bioreactor. Our method is compatible with existing DNA origami design frameworks and retains the modularity and addressability of DNA origami objects that are necessary for implementing custom modifications using functional groups. With all of the production and purification steps amenable to scaling, we expect that our method will expand the scope of DNA nanotechnology in many areas of science and technology.
Biotechnological mass production of DNA origami.

PubMed

Praetorius, Florian; Kick, Benjamin; Behler, Karl L; Honemann, Maximilian N; Weuster-Botz, Dirk; Dietz, Hendrik

2017-12-06

DNA nanotechnology, in particular DNA origami, enables the bottom-up self-assembly of micrometre-scale, three-dimensional structures with nanometre-precise features. These structures are customizable in that they can be site-specifically functionalized or constructed to exhibit machine-like or logic-gating behaviour. Their use has been limited to applications that require only small amounts of material (of the order of micrograms), owing to the limitations of current production methods. But many proposed applications, for example as therapeutic agents or in complex materials, could be realized if more material could be used. In DNA origami, a nanostructure is assembled from a very long single-stranded scaffold molecule held in place by many short single-stranded staple oligonucleotides. Only the bacteriophage-derived scaffold molecules are amenable to scalable and efficient mass production; the shorter staple strands are obtained through costly solid-phase synthesis or enzymatic processes. Here we show that single strands of DNA of virtually arbitrary length and with virtually arbitrary sequences can be produced in a scalable and cost-efficient manner by using bacteriophages to generate single-stranded precursor DNA that contains target strand sequences interleaved with self-excising 'cassettes', with each cassette comprising two Zn 2+ -dependent DNA-cleaving DNA enzymes. We produce all of the necessary single strands of DNA for several DNA origami using shaker-flask cultures, and demonstrate end-to-end production of macroscopic amounts of a DNA origami nanorod in a litre-scale stirred-tank bioreactor. Our method is compatible with existing DNA origami design frameworks and retains the modularity and addressability of DNA origami objects that are necessary for implementing custom modifications using functional groups. With all of the production and purification steps amenable to scaling, we expect that our method will expand the scope of DNA nanotechnology in many areas of science and technology.
Human ribosomal RNA gene: nucleotide sequence of the transcription initiation region and comparison of three mammalian genes.

PubMed Central

Financsek, I; Mizumoto, K; Mishima, Y; Muramatsu, M

1982-01-01

The transcription initiation site of the human ribosomal RNA gene (rDNA) was located by using the single-strand specific nuclease protection method and by determining the first nucleotide of the in vitro capped 45S preribosomal RNA. The sequence of 1,211 nucleotides surrounding the initiation site was determined. The sequenced region was found to consist of 75% G and C and to contain a number of short direct and inverted repeats and palindromes. By comparison of the corresponding initiation regions of three mammalian species, several conserved sequences were found upstream and downstream from the transcription starting point. Two short A + T-rich sequences are present on human, mouse, and rat ribosomal RNA genes between the initiation site and 40 nucleotides upstream, and a C + T cluster is located at a position around -60. At and downstream from the initiation site, a common sequence, T-AG-C-T-G-A-C-A-C-G-C-T-G-T-C-C-T-CT-T, was found in the three genes from position -1 through +18. The strong conservation of these sequences suggests their functional significance in rDNA. The S1 nuclease protection experiments with cloned rDNA fragments indicated the presence in human 45S RNA of molecules several hundred nucleotides shorter than the supposed primary transcript. The first 19 nucleotides of these molecules appear identical--except for one mismatch--to the nucleotide sequence of the 5' end of a supposed early processing product of the mouse 45S RNA. Images PMID:6954460
Observing Holliday junction branch migration one step at a time

NASA Astrophysics Data System (ADS)

Ha, Taekjip

2004-03-01

During genetic recombination, two homologous DNA molecules undergo strand exchange to form a four-way DNA (Holliday) junction and the recognition and processing of this species by branch migration and junction resolving enzymes determine the outcome. We have used single molecule fluorescence techniques to study two intrinsic structural dynamics of the Holliday junction, stacking conformer transitions and spontaneous branch migration. Our studies show that the dynamics of branch migration, resolved with one base pair resolution, is determined by the stability of conformers which in turn depends on the local DNA sequences. Therefore, the energy landscape of Holliday junction branch migation is not uniform, but is rugged.
Sequence of a second gene encoding bovine submaxillary mucin: implication for mucin heterogeneity and cloning.

PubMed

Jiang, W; Woitach, J T; Gupta, D; Bhavanandan, V P

1998-10-20

Secreted epithelial mucins are extremely large and heterogeneous glycoproteins. We report the 5 kilobase DNA sequence of a second gene, BSM2, which encodes bovine submaxillary mucin. The determined nucleotide and deduced amino acid sequences of BSM2 are 95.2% and 92. 2% identical, respectively, to those of the previously described BSM1 gene isolated from the same cow. Further, the five predicted protein domains of the two genes are 100%, 94%, 93%, 77%, and 88% identical. Based on the above results, we propose that expression of multiple homologous core proteins from a single animal is a factor in generating diversity of saccharides in mucins and in providing resistance of the molecules to proteolysis. In addition, this work raises several important issues in mucin cloning such as assembling sequences from seemingly overlapping clones and deducing consensus sequences for nearly identical tandem repeats. Copyright 1998 Academic Press.
Quantitation of fetal DNA fraction in maternal plasma using circulating single molecule amplification and re-sequencing technology (cSMART).

PubMed

Song, Yijun; Zhou, Xiya; Huang, Saiqiong; Li, Xiaohong; Qi, Qingwei; Jiang, Yulin; Liu, Yiqian; Ma, Chengcheng; Li, Zhifeng; Xu, Mengnan; Cram, David S; Liu, Juntao

2016-05-01

Calculation of the fetal DNA fraction (FF) is important for reliable and accurate noninvasive prenatal testing (NIPT) for fetal genetic abnormalities. The aim of the study was to develop and validate a novel method for FF determination. FF was calculated using the chromosome Y (ChrY) sequence read assay and by circulating single molecule amplification and re-sequencing technology of 76 autosomal SNPs. By Pearson correlation for FF (4.73-22.11%) in 33 male pregnancy samples, the R(2) co-efficient for the 76-SNP versus the ChrY assay was 0.9572 (p<0.001). In addition, the co-efficient of variation (CV) of FF measurement by the 76-SNP assay was low (0.15-0.35). As a control, the FF measurement for four non-pregnant plasma samples was virtually zero. In prospective longitudinal studies of 14 women with normal pregnancies, FF generally increased with gestational age. However, in eight women (71%) there was a significant decrease in FF between the first trimester (11-13 weeks) and the second trimester (15-19 weeks), and this was attributable to significant maternal weight gain. The novel 76-SNP cSMART assay has the precision to accurately measure FF in all pregnancies at a detection threshold of 5%. Based on FF trends in individual pregnancies, our results suggest that the end of the first trimester may be a more optimal window for performing NIPT. Copyright © 2016 Elsevier B.V. All rights reserved.
Single cell and single molecule techniques for the analysis of the epigenome

NASA Astrophysics Data System (ADS)

Wallin, Christopher Benjamin

Epigenetic regulation is a critical biological process for the health and development of a cell. Epigenetic regulation is facilitated by covalent modifications to the underlying DNA and chromatin proteins. A fundamental understanding of these epigenetic modifications and their associated interactions at the molecular scale is necessary to explain phenomena including cellular identity, stem cell plasticity, and neoplastic transformation. It is widely known that abnormal epigenetic profiles have been linked to many diseases, most notably cancer. While the field of epigenetics has progressed rapidly with conventional techniques, significant advances remain to be made with respect to combinatoric analysis of epigenetic marks and single cell epigenetics. Therefore, in this dissertation, I will discuss our development of devices and methodologies to address these pertinent issues. First, we designed a preparatory polydimethylsiloxane (PDMS) microdevice for the extraction, purification, and stretching of human chromosomal DNA and chromatin from small cell populations down to a single cell. The valveless device captures cells by size exclusion within the micropillars, entraps the DNA or chromatin in the micropillars after cell lysis, purifies away the cellular debris, and fluorescently labels the DNA and/or chromatin all within a single reaction chamber. With the device, we achieve nearly 100% extraction efficiency of the DNA. The device is also used for in-channel immunostaining of chromatin followed by downstream single molecule chromatin analysis in nanochannels (SCAN). Second, using multi-color, time-correlated single molecule measurements in nanochannels, simultaneous coincidence detection of 2 epigenetic marks is demonstrated. Coincidence detection of 3 epigenetic marks is also established using a pulsed interleaved excitation scheme. With these two promising results, genome-wide quantification of epigenetic marks was pursued. Unfortunately, quantitative SCAN never materialized. Reasons for this, including poor signal to background, are explained in detail. Third, development of mobility-SCAN, an analytical technique for measuring and analyzing single molecules based on their fluorescent signature and their electrophoretic mobility in nanochannels is described. We use the technique to differentiate biomolecules from complex mixtures and derive parameters such as diffusion coefficients and effective charges. Finally, the device is used to detect binding interactions of various complexes similar to affinity capillary electrophoresis, but on a single molecule level. Fourth, we conclude by briefly discussing SCAN-sort, a technique to sort individual chromatin molecules based on their fluorescent emissions for further downstream analysis such as DNA sequencing. We demonstrate a 2-fold enrichment of chromatin from sorting and discuss possible system modifications for better performance in the future.
Assessing the utility of the Oxford Nanopore MinION for snake venom gland cDNA sequencing.

PubMed

Hargreaves, Adam D; Mulley, John F

2015-01-01

Portable DNA sequencers such as the Oxford Nanopore MinION device have the potential to be truly disruptive technologies, facilitating new approaches and analyses and, in some cases, taking sequencing out of the lab and into the field. However, the capabilities of these technologies are still being revealed. Here we show that single-molecule cDNA sequencing using the MinION accurately characterises venom toxin-encoding genes in the painted saw-scaled viper, Echis coloratus. We find the raw sequencing error rate to be around 12%, improved to 0-2% with hybrid error correction and 3% with de novo error correction. Our corrected data provides full coding sequences and 5' and 3' UTRs for 29 of 33 candidate venom toxins detected, far superior to Illumina data (13/40 complete) and Sanger-based ESTs (15/29). We suggest that, should the current pace of improvement continue, the MinION will become the default approach for cDNA sequencing in a variety of species.
Assessing the utility of the Oxford Nanopore MinION for snake venom gland cDNA sequencing

PubMed Central

Hargreaves, Adam D.

2015-01-01

Portable DNA sequencers such as the Oxford Nanopore MinION device have the potential to be truly disruptive technologies, facilitating new approaches and analyses and, in some cases, taking sequencing out of the lab and into the field. However, the capabilities of these technologies are still being revealed. Here we show that single-molecule cDNA sequencing using the MinION accurately characterises venom toxin-encoding genes in the painted saw-scaled viper, Echis coloratus. We find the raw sequencing error rate to be around 12%, improved to 0–2% with hybrid error correction and 3% with de novo error correction. Our corrected data provides full coding sequences and 5′ and 3′ UTRs for 29 of 33 candidate venom toxins detected, far superior to Illumina data (13/40 complete) and Sanger-based ESTs (15/29). We suggest that, should the current pace of improvement continue, the MinION will become the default approach for cDNA sequencing in a variety of species. PMID:26623194

Probing the dynamics of restriction endonuclease NgoMIV-DNA interaction by single-molecule FRET.

PubMed

Tutkus, Marijonas; Sasnauskas, Giedrius; Rutkauskas, Danielis

2017-12-01

Many type II restriction endonucleases require two copies of their recognition sequence for optimal activity. Concomitant binding of two DNA sites by such an enzyme produces a DNA loop. Here we exploit single-molecule Förster resonance energy transfer (smFRET) of surface-immobilized DNA fragments to study the dynamics of DNA looping induced by tetrameric endonuclease NgoMIV. We have employed a DNA fragment with two NgoMIV recognition sites and a FRET dye pair such that upon protein-induced DNA looping the dyes are brought to close proximity resulting in a FRET signal. The dynamics of DNA-NgoMIV interactions proved to be heterogeneous, with individual smFRET trajectories exhibiting broadly different average looped state durations. Distinct types of the dynamics were attributed to different types of DNA-protein complexes, mediated either by one NgoMIV tetramer simultaneously bound to two specific sites ("slow" trajectories) or by semi-specific interactions of two DNA-bound NgoMIV tetramers ("fast" trajectories), as well as to conformational heterogeneity of individual NgoMIV molecules. © 2017 Wiley Periodicals, Inc.
DNA Physical Mapping via the Controlled Translocation of Single Molecules through a 5-10nm Silicon Nitride Nanopore

NASA Astrophysics Data System (ADS)

Stein, Derek; Reisner, Walter; Jiang, Zhijun; Hagerty, Nick; Wood, Charles; Chan, Jason

2009-03-01

The ability to map the binding position of sequence-specific markers, including transcription-factors, protein-nucleic acids (PNAs) or deactivated restriction enzymes, along a single DNA molecule in a nanofluidic device would be of key importance for the life-sciences. Such markers could give an indication of the active genes at particular stage in a cell's transcriptional cycle, pinpoint the location of mutations or even provide a DNA barcode that could aid in genomics applications. We have developed a setup consisting of a 5-10 nm nanopore in a 20nm thick silicon nitride film coupled to an optical tweezer setup. The translocation of DNA across the nanopore can be detected via blockades in the electrical current through the pore. By anchoring one end of the translocating DNA to an optically trapped microsphere, we hope to stretch out the molecule in the nanopore and control the translocation speed, enabling us to slowly scan across the genome and detect changes in the baseline current due to the presence of bound markers.
Confined wormlike chains in external fields

NASA Astrophysics Data System (ADS)

Morrison, Greg

The confinement of biomolecules is ubiquitous in nature, such as the spatial constraints of viral encapsulation, histone binding, and chromosomal packing. Advances in microfluidics and nanopore fabrication have permitted powerful new tools in single molecule manipulation and gene sequencing through molecular confinement as well. In order to fully understand and exploit these systems, the ability to predict the structure of spatially confined molecules is essential. In this talk, I describe a mean field approach to determine the properties of stiff polymers confined to cylinders and slits, which is relevant for a variety of biological and experimental conditions. I show that this approach is able to not only reproduce known scaling laws for confined wormlike chains, but also provides an improvement over existing weakly bending rod approximations in determining the detailed chain properties (such as correlation functions). Using this approach, we also show that it is possible to study the effect of an externally applied tension or static electric field in a natural and analytically tractable way. These external perturbations can alter the scaling laws and introduce important new length scales into the system, relevant for histone unbinding and single-molecule analysis of DNA.
Ligase Detection Reaction Generation of Reverse Molecular Beacons for Near Real-Time Analysis of Bacterial Pathogens Using Single-Pair Fluorescence Resonance Energy Transfer and a Cyclic Olefin Copolymer Microfluidic Chip

PubMed Central

Peng, Zhiyong; Soper, Steven A.; Pingle, Maneesh R.; Barany, Francis; Davis, Lloyd M.

2015-01-01

Detection of pathogenic bacteria and viruses require strategies that can signal the presence of these targets in near real-time due to the potential threats created by rapid dissemination into water and/or food supplies. In this paper, we report an innovative strategy that can rapidly detect bacterial pathogens using reporter sequences found in their genome without requiring polymerase chain reaction (PCR). A pair of strain-specific primers was designed based on the 16S rRNA gene and were end-labeled with a donor (Cy5) or acceptor (Cy5.5) dye. In the presence of the target bacterium, the primers were joined using a ligase detection reaction (LDR) only when the primers were completely complementary to the target sequence to form a reverse molecular beacon (rMB), thus bringing Cy5 (donor) and Cy5.5 (acceptor) into close proximity to allow fluorescence resonance energy transfer (FRET) to occur. These rMBs were subsequently analyzed using single-molecule detection of the FRET pairs (single-pair FRET; spFRET). The LDR was performed using a continuous flow thermal cycling process configured in a cyclic olefin copolymer (COC) microfluidic device using either 2 or 20 thermal cycles. Single-molecule photon bursts from the resulting rMBs were detected on-chip and registered using a simple laser-induced fluorescence (LIF) instrument. The spFRET signatures from the target pathogens were reported in as little as 2.6 min using spFRET. PMID:21047095
Protein sequencing via nanopore based devices: a nanofluidics perspective

NASA Astrophysics Data System (ADS)

Chinappi, Mauro; Cecconi, Fabio

2018-05-01

Proteins perform a huge number of central functions in living organisms, thus all the new techniques allowing their precise, fast and accurate characterization at single-molecule level certainly represent a burst in proteomics with important biomedical impact. In this review, we describe the recent progresses in the developing of nanopore based devices for protein sequencing. We start with a critical analysis of the main technical requirements for nanopore protein sequencing, summarizing some ideas and methodologies that have recently appeared in the literature. In the last sections, we focus on the physical modelling of the transport phenomena occurring in nanopore based devices. The multiscale nature of the problem is discussed and, in this respect, some of the main possible computational approaches are illustrated.
Bio-recognitive photonics of a DNA-guided organic semiconductor

PubMed Central

Back, Seung Hyuk; Park, Jin Hyuk; Cui, Chunzhi; Ahn, Dong June

2016-01-01

Incorporation of duplex DNA with higher molecular weights has attracted attention for a new opportunity towards a better organic light-emitting diode (OLED) capability. However, biological recognition by OLED materials is yet to be addressed. In this study, specific oligomeric DNA–DNA recognition is successfully achieved by tri (8-hydroxyquinoline) aluminium (Alq3), an organic semiconductor. Alq3 rods crystallized with guidance from single-strand DNA molecules show, strikingly, a unique distribution of the DNA molecules with a shape of an ‘inverted' hourglass. The crystal's luminescent intensity is enhanced by 1.6-fold upon recognition of the perfect-matched target DNA sequence, but not in the case of a single-base mismatched one. The DNA–DNA recognition forming double-helix structure is identified to occur only in the rod's outer periphery. This study opens up new opportunities of Alq3, one of the most widely used OLED materials, enabling biological recognition. PMID:26725969
Single-molecule paleoenzymology probes the chemistry of resurrected enzymes

PubMed Central

Perez-Jimenez, Raul; Inglés-Prieto, Alvaro; Zhao, Zi-Ming; Sanchez-Romero, Inmaculada; Alegre-Cebollada, Jorge; Kosuri, Pallav; Garcia-Manyes, Sergi; Kappock, T. Joseph; Tanokura, Masaru; Holmgren, Arne; Sanchez-Ruiz, Jose M.; Gaucher, Eric A.; Fernandez, Julio M.

2011-01-01

A journey back in time is possible at the molecular level by reconstructing proteins from extinct organisms. Here we report the reconstruction, based on sequence predicted by phylogenetic analysis, of seven Precambrian thioredoxin enzymes (Trx), dating back between ~1.4 and ~4 billion years (Gyr). The reconstructed enzymes are up to 32° C more stable than modern enzymes and the oldest show significantly higher activity than extant ones at pH 5. We probed their mechanisms of reduction using single-molecule force spectroscopy. From the force-dependency of the rate of reduction of an engineered substrate, we conclude that ancient Trxs utilize chemical mechanisms of reduction similar to those of modern enzymes. While Trx enzymes have maintained their reductase chemistry unchanged, they have adapted over a 4 Gyr time span to the changes in temperature and ocean acidity that characterize the evolution of the global environment from ancient to modern Earth. PMID:21460845
Bio-recognitive photonics of a DNA-guided organic semiconductor.

PubMed

Back, Seung Hyuk; Park, Jin Hyuk; Cui, Chunzhi; Ahn, Dong June

2016-01-04

Incorporation of duplex DNA with higher molecular weights has attracted attention for a new opportunity towards a better organic light-emitting diode (OLED) capability. However, biological recognition by OLED materials is yet to be addressed. In this study, specific oligomeric DNA-DNA recognition is successfully achieved by tri (8-hydroxyquinoline) aluminium (Alq3), an organic semiconductor. Alq3 rods crystallized with guidance from single-strand DNA molecules show, strikingly, a unique distribution of the DNA molecules with a shape of an 'inverted' hourglass. The crystal's luminescent intensity is enhanced by 1.6-fold upon recognition of the perfect-matched target DNA sequence, but not in the case of a single-base mismatched one. The DNA-DNA recognition forming double-helix structure is identified to occur only in the rod's outer periphery. This study opens up new opportunities of Alq3, one of the most widely used OLED materials, enabling biological recognition.
Quantitative mass imaging of single biological macromolecules.

PubMed

Young, Gavin; Hundt, Nikolas; Cole, Daniel; Fineberg, Adam; Andrecka, Joanna; Tyler, Andrew; Olerinyova, Anna; Ansari, Ayla; Marklund, Erik G; Collier, Miranda P; Chandler, Shane A; Tkachenko, Olga; Allen, Joel; Crispin, Max; Billington, Neil; Takagi, Yasuharu; Sellers, James R; Eichmann, Cédric; Selenko, Philipp; Frey, Lukas; Riek, Roland; Galpin, Martin R; Struwe, Weston B; Benesch, Justin L P; Kukura, Philipp

2018-04-27

The cellular processes underpinning life are orchestrated by proteins and their interactions. The associated structural and dynamic heterogeneity, despite being key to function, poses a fundamental challenge to existing analytical and structural methodologies. We used interferometric scattering microscopy to quantify the mass of single biomolecules in solution with 2% sequence mass accuracy, up to 19-kilodalton resolution, and 1-kilodalton precision. We resolved oligomeric distributions at high dynamic range, detected small-molecule binding, and mass-imaged proteins with associated lipids and sugars. These capabilities enabled us to characterize the molecular dynamics of processes as diverse as glycoprotein cross-linking, amyloidogenic protein aggregation, and actin polymerization. Interferometric scattering mass spectrometry allows spatiotemporally resolved measurement of a broad range of biomolecular interactions, one molecule at a time. Copyright © 2018 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.
Bio-recognitive photonics of a DNA-guided organic semiconductor

NASA Astrophysics Data System (ADS)

Back, Seung Hyuk; Park, Jin Hyuk; Cui, Chunzhi; Ahn, Dong June

2016-01-01

Incorporation of duplex DNA with higher molecular weights has attracted attention for a new opportunity towards a better organic light-emitting diode (OLED) capability. However, biological recognition by OLED materials is yet to be addressed. In this study, specific oligomeric DNA-DNA recognition is successfully achieved by tri (8-hydroxyquinoline) aluminium (Alq3), an organic semiconductor. Alq3 rods crystallized with guidance from single-strand DNA molecules show, strikingly, a unique distribution of the DNA molecules with a shape of an `inverted' hourglass. The crystal's luminescent intensity is enhanced by 1.6-fold upon recognition of the perfect-matched target DNA sequence, but not in the case of a single-base mismatched one. The DNA-DNA recognition forming double-helix structure is identified to occur only in the rod's outer periphery. This study opens up new opportunities of Alq3, one of the most widely used OLED materials, enabling biological recognition.
Error-based Extraction of States and Energy Landscapes from Experimental Single-Molecule Time-Series

NASA Astrophysics Data System (ADS)

Taylor, J. Nicholas; Li, Chun-Biu; Cooper, David R.; Landes, Christy F.; Komatsuzaki, Tamiki

2015-03-01

Characterization of states, the essential components of the underlying energy landscapes, is one of the most intriguing subjects in single-molecule (SM) experiments due to the existence of noise inherent to the measurements. Here we present a method to extract the underlying state sequences from experimental SM time-series. Taking into account empirical error and the finite sampling of the time-series, the method extracts a steady-state network which provides an approximation of the underlying effective free energy landscape. The core of the method is the application of rate-distortion theory from information theory, allowing the individual data points to be assigned to multiple states simultaneously. We demonstrate the method's proficiency in its application to simulated trajectories as well as to experimental SM fluorescence resonance energy transfer (FRET) trajectories obtained from isolated agonist binding domains of the AMPA receptor, an ionotropic glutamate receptor that is prevalent in the central nervous system.
Applications of biological pores in nanomedicine, sensing, and nanoelectronics.

PubMed

Majd, Sheereen; Yusko, Erik C; Billeh, Yazan N; Macrae, Michael X; Yang, Jerry; Mayer, Michael

2010-08-01

Biological protein pores and pore-forming peptides can generate a pathway for the flux of ions and other charged or polar molecules across cellular membranes. In nature, these nanopores have diverse and essential functions that range from maintaining cell homeostasis and participating in cell signaling to activating or killing cells. The combination of the nanoscale dimensions and sophisticated - often regulated - functionality of these biological pores make them particularly attractive for the growing field of nanobiotechnology. Applications range from single-molecule sensing to drug delivery and targeted killing of malignant cells. Potential future applications may include the use of nanopores for single strand DNA sequencing and for generating bio-inspired, and possibly, biocompatible visual detection systems and batteries. This article reviews the current state of applications of pore-forming peptides and proteins in nanomedicine, sensing, and nanoelectronics. Copyright © 2010 Elsevier Ltd. All rights reserved.
Relationships between residue Voronoi volume and sequence conservation in proteins.

PubMed

Liu, Jen-Wei; Cheng, Chih-Wen; Lin, Yu-Feng; Chen, Shao-Yu; Hwang, Jenn-Kang; Yen, Shih-Chung

2018-02-01

Functional and biophysical constraints can cause different levels of sequence conservation in proteins. Previously, structural properties, e.g., relative solvent accessibility (RSA) and packing density of the weighted contact number (WCN), have been found to be related to protein sequence conservation (CS). The Voronoi volume has recently been recognized as a new structural property of the local protein structural environment reflecting CS. However, for surface residues, it is sensitive to water molecules surrounding the protein structure. Herein, we present a simple structural determinant termed the relative space of Voronoi volume (RSV); it uses the Voronoi volume and the van der Waals volume of particular residues to quantify the local structural environment. RSV (range, 0-1) is defined as (Voronoi volume-van der Waals volume)/Voronoi volume of the target residue. The concept of RSV describes the extent of available space for every protein residue. RSV and Voronoi profiles with and without water molecules (RSVw, RSV, VOw, and VO) were compared for 554 non-homologous proteins. RSV (without water) showed better Pearson's correlations with CS than did RSVw, VO, or VOw values. The mean correlation coefficient between RSV and CS was 0.51, which is comparable to the correlation between RSA and CS (0.49) and that between WCN and CS (0.56). RSV is a robust structural descriptor with and without water molecules and can quantitatively reflect evolutionary information in a single protein structure. Therefore, it may represent a practical structural determinant to study protein sequence, structure, and function relationships. Copyright © 2017 Elsevier B.V. All rights reserved.
Ranalexin. A novel antimicrobial peptide from bullfrog (Rana catesbeiana) skin, structurally related to the bacterial antibiotic, polymyxin.

PubMed

Clark, D P; Durell, S; Maloy, W L; Zasloff, M

1994-04-08

Antimicrobial peptides comprise a diverse class of molecules used in host defense by plants, insects, and animals. In this study we have isolated a novel antimicrobial peptide from the skin of the bullfrog, Rana catesbeiana. This 20 amino acid peptide, which we have termed Ranalexin, has the amino acid sequence: NH2-Phe-Leu-Gly-Gly-Leu-Ile-Lys-Ile-Val-Pro-Ala-Met-Ile-Cys-Ala-Val-Thr- Lys-Lys - Cys-COOH, and it contains a single intramolecular disulfide bond which forms a heptapeptide ring within the molecule. Structurally, Ranalexin resembles the bacterial antibiotic, polymyxin, which contains a similar heptapeptide ring. We have also cloned the cDNA for Ranalexin from a metamorphic R. catesbeiana tadpole cDNA library. Based on the cDNA sequence, it appears that Ranalexin is initially synthesized as a propeptide with a putative signal sequence and an acidic amino acid-rich region at its amino-terminal end. Interestingly, the putative signal sequence of the Ranalexin cDNA is strikingly similar to the signal sequence of opioid peptide precursors isolated from the skin of the South American frogs Phyllomedusa sauvagei and Phyllomedusa bicolor. Northern blot analysis and in situ hybridization experiments demonstrated that Ranalexin mRNA is first expressed in R. catesbeiana skin at metamorphosis and continues to be expressed into adulthood.
Digital DNA detection based on a compact optofluidic laser with ultra-low sample consumption.

PubMed

Lee, Wonsuk; Chen, Qiushu; Fan, Xudong; Yoon, Dong Ki

2016-11-29

DNA lasers self-amplify optical signals from a DNA analyte as well as thermodynamic differences between sequences, allowing quasi-digital DNA detection. However, these systems have drawbacks, such as relatively large sample consumption and complicated dye labelling. Moreover, although the lasing signal can detect the target DNA, it is superimposed on an unintended fluorescence background, which persists for non-target DNA samples as well. From an optical point of view, it is thus not truly digital detection and requires spectral analysis to identify the target. In this work, we propose and demonstrate an optofluidic laser that has a single layer of DNA molecules as the gain material. A target DNA produces intensive laser emission comparable to existing DNA lasers, while any unnecessary fluorescence background is successfully suppressed. As a result, the target DNA can be detected with a single laser pulse, in a truly digital manner. Since the DNA molecules cover only a single layer on the surface of the laser microcavity, the DNA sample consumption is a few orders of magnitude lower than that of existing DNA lasers. Furthermore, the DNA molecules are stained by simply immersing the microcavity in the intercalating dye solution, and thus the proposed DNA laser is free of any complex dye-labelling process prior to analysis.
Single-molecule imaging at high fluorophore concentrations by local activation of dye

DOE Office of Scientific and Technical Information (OSTI.GOV)

Geertsema, Hylkje J.; Mangel, Walter F.; Schulte, Aartje C.

Single-molecule fluorescence microscopy is a powerful approach to observe biomolecular interactions with high spatial and temporal resolution. Detecting fluorescent signals from individual, labeled proteins above high levels of background fluorescence remains challenging, however. For this reason, the concentrations of labeled proteins in in vitro assays are often kept low compared to their in vivo concentrations. Here, we present a new fluorescence imaging technique by which single fluorescent molecules can be observed in real time at high, physiologically relevant concentrations. The technique requires a protein and its macromolecular substrate to be labeled each with a different fluorophore. Then, making use ofmore » short-distance energy-transfer mechanisms, the fluorescence from only those proteins bound to their substrate are selectively activated. This approach is demonstrated by labeling a DNA substrate with an intercalating stain, exciting the stain, and using energy transfer from the stain to activate the fluorescence of only those labeled DNA-binding proteins bound to the DNA. Such an experimental design allowed us to observe the sequence-independent interaction of Cy5-labeled interferon-inducible protein 16 (IFI16) with DNA and the sliding via one-dimensional diffusion of Cy5-labeled adenovirus protease (pVIc-AVP) on DNA in the presence of a background of hundreds of nM Cy5 fluorophore.« less
Single-molecule imaging at high fluorophore concentrations by local activation of dye

DOE PAGES

Geertsema, Hylkje J.; Mangel, Walter F.; Schulte, Aartje C.; ...

2015-02-17

Single-molecule fluorescence microscopy is a powerful approach to observe biomolecular interactions with high spatial and temporal resolution. Detecting fluorescent signals from individual, labeled proteins above high levels of background fluorescence remains challenging, however. For this reason, the concentrations of labeled proteins in in vitro assays are often kept low compared to their in vivo concentrations. Here, we present a new fluorescence imaging technique by which single fluorescent molecules can be observed in real time at high, physiologically relevant concentrations. The technique requires a protein and its macromolecular substrate to be labeled each with a different fluorophore. Then, making use ofmore » short-distance energy-transfer mechanisms, the fluorescence from only those proteins bound to their substrate are selectively activated. This approach is demonstrated by labeling a DNA substrate with an intercalating stain, exciting the stain, and using energy transfer from the stain to activate the fluorescence of only those labeled DNA-binding proteins bound to the DNA. Such an experimental design allowed us to observe the sequence-independent interaction of Cy5-labeled interferon-inducible protein 16 (IFI16) with DNA and the sliding via one-dimensional diffusion of Cy5-labeled adenovirus protease (pVIc-AVP) on DNA in the presence of a background of hundreds of nM Cy5 fluorophore.« less
tRNAmodpred: a computational method for predicting posttranscriptional modifications in tRNAs

PubMed Central

Machnicka, Magdalena A.; Dunin-Horkawicz, Stanislaw; de Crécy-Lagard, Valerie; Bujnicki, Janusz M.

2016-01-01

tRNA molecules contain numerous chemically altered nucleosides, which are formed by enzymatic modification of the primary transcripts during the complex tRNA maturation process. Some of the modifications are introduced by single reactions, while other require complex series of reactions carried out by several different enzymes. The location and distribution of various types of modifications vary greatly between different tRNA molecules, organisms and organelles. We have developed a computational method tRNAmodpred, for predicting modifications in tRNA sequences. Briefly, our method takes as an input one or more unmodified tRNA sequences and a set of protein sequences corresponding to a proteome of a cell. Subsequently it identifies homologs of known tRNA modification enzymes in the proteome, predicts tRNA modification activities and maps them onto known pathways of RNA modification from the MODOMICS database. Thereby, theoretically possible modification pathways are identified, and products of these modification reactions are proposed for query tRNAs. This method allows for predicting modification patterns for newly sequenced genomes as well as for checking tentative modification status of tRNAs from one species treated with enzymes from another source, e.g. to predict the possible modifications of eukaryotic tRNAs expressed in bacteria. tRNAmodpred is freely available as web server at http://genesilico.pl/trnamodpred/. PMID:27016142
bpRNA: large-scale automated annotation and analysis of RNA secondary structure.

PubMed

Danaee, Padideh; Rouches, Mason; Wiley, Michelle; Deng, Dezhong; Huang, Liang; Hendrix, David

2018-05-09

While RNA secondary structure prediction from sequence data has made remarkable progress, there is a need for improved strategies for annotating the features of RNA secondary structures. Here, we present bpRNA, a novel annotation tool capable of parsing RNA structures, including complex pseudoknot-containing RNAs, to yield an objective, precise, compact, unambiguous, easily-interpretable description of all loops, stems, and pseudoknots, along with the positions, sequence, and flanking base pairs of each such structural feature. We also introduce several new informative representations of RNA structure types to improve structure visualization and interpretation. We have further used bpRNA to generate a web-accessible meta-database, 'bpRNA-1m', of over 100 000 single-molecule, known secondary structures; this is both more fully and accurately annotated and over 20-times larger than existing databases. We use a subset of the database with highly similar (≥90% identical) sequences filtered out to report on statistical trends in sequence, flanking base pairs, and length. Both the bpRNA method and the bpRNA-1m database will be valuable resources both for specific analysis of individual RNA molecules and large-scale analyses such as are useful for updating RNA energy parameters for computational thermodynamic predictions, improving machine learning models for structure prediction, and for benchmarking structure-prediction algorithms.
Identification of sequence-structure RNA binding motifs for SELEX-derived aptamers.

PubMed

Hoinka, Jan; Zotenko, Elena; Friedman, Adam; Sauna, Zuben E; Przytycka, Teresa M

2012-06-15

Systematic Evolution of Ligands by EXponential Enrichment (SELEX) represents a state-of-the-art technology to isolate single-stranded (ribo)nucleic acid fragments, named aptamers, which bind to a molecule (or molecules) of interest via specific structural regions induced by their sequence-dependent fold. This powerful method has applications in designing protein inhibitors, molecular detection systems, therapeutic drugs and antibody replacement among others. However, full understanding and consequently optimal utilization of the process has lagged behind its wide application due to the lack of dedicated computational approaches. At the same time, the combination of SELEX with novel sequencing technologies is beginning to provide the data that will allow the examination of a variety of properties of the selection process. To close this gap we developed, Aptamotif, a computational method for the identification of sequence-structure motifs in SELEX-derived aptamers. To increase the chances of identifying functional motifs, Aptamotif uses an ensemble-based approach. We validated the method using two published aptamer datasets containing experimentally determined motifs of increasing complexity. We were able to recreate the author's findings to a high degree, thus proving the capability of our approach to identify binding motifs in SELEX data. Additionally, using our new experimental dataset, we illustrate the application of Aptamotif to elucidate several properties of the selection process.

Evolution of Sphingomonad Gene Clusters Related to Pesticide Catabolism Revealed by Genome Sequence and Mobilomics of Sphingobium herbicidovorans MH.

PubMed

Nielsen, Tue Kjærgaard; Rasmussen, Morten; Demanèche, Sandrine; Cecillon, Sébastien; Vogel, Timothy M; Hansen, Lars Hestbjerg

2017-09-01

Bacterial degraders of chlorophenoxy herbicides have been isolated from various ecosystems, including pristine environments. Among these degraders, the sphingomonads constitute a prominent group that displays versatile xenobiotic-degradation capabilities. Four separate sequencing strategies were required to provide the complete sequence of the complex and plastic genome of the canonical chlorophenoxy herbicide-degrading Sphingobium herbicidovorans MH. The genome has an intricate organization of the chlorophenoxy-herbicide catabolic genes sdpA, rdpA, and cadABCD that encode the (R)- and (S)-enantiomer-specific 2,4-dichlorophenoxypropionate dioxygenases and four subunits of a Rieske non-heme iron oxygenase involved in 2-methyl-chlorophenoxyacetic acid degradation, respectively. Several major genomic rearrangements are proposed to help understand the evolution and mobility of these important genes and their genetic context. Single-strain mobilomic sequence analysis uncovered plasmids and insertion sequence-associated circular intermediates in this environmentally important bacterium and enabled the description of evolutionary models for pesticide degradation in strain MH and related organisms. The mobilome presented a complex mosaic of mobile genetic elements including four plasmids and several circular intermediate DNA molecules of insertion-sequence elements and transposons that are central to the evolution of xenobiotics degradation. Furthermore, two individual chromosomally integrated prophages were shown to excise and form free circular DNA molecules. This approach holds great potential for improving the understanding of genome plasticity, evolution, and microbial ecology. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Advantages of genome sequencing by long-read sequencer using SMRT technology in medical area.

PubMed

Nakano, Kazuma; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Ashimine, Noriko; Ohki, Shun; Shinzato, Misuzu; Minami, Maiko; Nakanishi, Tetsuhiro; Teruya, Kuniko; Satou, Kazuhito; Hirano, Takashi

2017-07-01

PacBio RS II is the first commercialized third-generation DNA sequencer able to sequence a single molecule DNA in real-time without amplification. PacBio RS II's sequencing technology is novel and unique, enabling the direct observation of DNA synthesis by DNA polymerase. PacBio RS II confers four major advantages compared to other sequencing technologies: long read lengths, high consensus accuracy, a low degree of bias, and simultaneous capability of epigenetic characterization. These advantages surmount the obstacle of sequencing genomic regions such as high/low G+C, tandem repeat, and interspersed repeat regions. Moreover, PacBio RS II is ideal for whole genome sequencing, targeted sequencing, complex population analysis, RNA sequencing, and epigenetics characterization. With PacBio RS II, we have sequenced and analyzed the genomes of many species, from viruses to humans. Herein, we summarize and review some of our key genome sequencing projects, including full-length viral sequencing, complete bacterial genome and almost-complete plant genome assemblies, and long amplicon sequencing of a disease-associated gene region. We believe that PacBio RS II is not only an effective tool for use in the basic biological sciences but also in the medical/clinical setting.
DNA Sequencing by Capillary Electrophoresis

PubMed Central

Karger, Barry L.; Guttman, Andras

2009-01-01

Sequencing of human and other genomes has been at the center of interest in the biomedical field over the past several decades and is now leading toward an era of personalized medicine. During this time, DNA sequencing methods have evolved from the labor intensive slab gel electrophoresis, through automated multicapillary electrophoresis systems using fluorophore labeling with multispectral imaging, to the “next generation” technologies of cyclic array, hybridization based, nanopore and single molecule sequencing. Deciphering the genetic blueprint and follow-up confirmatory sequencing of Homo sapiens and other genomes was only possible by the advent of modern sequencing technologies that was a result of step by step advances with a contribution of academics, medical personnel and instrument companies. While next generation sequencing is moving ahead at break-neck speed, the multicapillary electrophoretic systems played an essential role in the sequencing of the Human Genome, the foundation of the field of genomics. In this prospective, we wish to overview the role of capillary electrophoresis in DNA sequencing based in part of several of our articles in this journal. PMID:19517496
Single cell systems biology by super-resolution imaging and combinatorial labeling

PubMed Central

Lubeck, Eric; Cai, Long

2012-01-01

Fluorescence microscopy is a powerful quantitative tool for exploring regulatory networks in single cells. However, the number of molecular species that can be measured simultaneously is limited by the spectral separability of fluorophores. Here we demonstrate a simple but general strategy to drastically increase the capacity for multiplex detection of molecules in single cells by using optical super-resolution microscopy (SRM) and combinatorial labeling. As a proof of principle, we labeled mRNAs with unique combinations of fluorophores using Fluorescence in situ Hybridization (FISH), and resolved the sequences and combinations of fluorophores with SRM. We measured the mRNA levels of 32 genes simultaneously in single S. cerevisiae cells. These experiments demonstrate that combinatorial labeling and super-resolution imaging of single cells provides a natural approach to bring systems biology into single cells. PMID:22660740
Preface: Special Topic on Single-Molecule Biophysics

NASA Astrophysics Data System (ADS)

Makarov, Dmitrii E.; Schuler, Benjamin

2018-03-01

Single-molecule measurements are now almost routinely used to study biological systems and processes. The scope of this special topic emphasizes the physics side of single-molecule observations, with the goal of highlighting new developments in physical techniques as well as conceptual insights that single-molecule measurements bring to biophysics. This issue also comprises recent advances in theoretical physical models of single-molecule phenomena, interpretation of single-molecule signals, and fundamental areas of statistical mechanics that are related to single-molecule observations. A particular goal is to illustrate the increasing synergy between theory, simulation, and experiment in single-molecule biophysics.
GET-SERF, a new gradient encoded SERF experiment for the trivial edition of 1H-19F couplings

NASA Astrophysics Data System (ADS)

Di Pietro, Maria Enrica; Aroulanda, Christie; Merlet, Denis

2013-09-01

A new spatially encoded heteronuclear 1H-19F selective refocusing NMR experiment (GET-SERF) is proposed. This sequence allows editing in one single 2D experiment all couplings between a selected fluorine site and all the proton nuclei of the molecule. Its efficiency is illustrated in the case of diflunisal, a difluorinated anti-inflammatory drug, in isotropic and anisotropic media.
Infusion of Autologous Lysed Plasma Into the Baboon: Assessment of Coagulation, Platelet, and Pulmonary Function

DTIC Science & Technology

1993-06-03

obtained from whole blood collected into a commercially available tube containing thrombin and epsilon aminocaproic acid (Wellcome 44 Diagnostics...first proposed by Hall & Slayter in 1959 as an extended, multidomained molecule. Electron microscopy, amino acid sequencing and proteolytic studies have...Plasminogen (Figure 7) is a single chain, 88 kilodalton glycoprotein. It contains 790 amino acids , 24 disulfide bridges and five homologous triple loop
Fundamental Aspects of Single Molecule and Zeptomole Electroanalysis

DTIC Science & Technology

2018-04-01

objective of our research program was to provide the fundamental understanding required for using the principles of electroanalytical chemistry to detect...report is organized in terms of research in the individual co-PI laboratories. Figure 1. A probe DNA sequence (red) immobilized onto a nanoscale...were tested on both Au microelectrodes, an Au microband in a microfluidic device, and an Au microband in a microfluidic device in the presence of a
Single Molecule Study of the Intrinsically Disordered FG-Repeat Nucleoporin 153

PubMed Central

Milles, Sigrid; Lemke, Edward A.

2011-01-01

Nucleoporins (Nups), which are intrinsically disordered, form a selectivity filter inside the nuclear pore complex, taking a central role in the vital nucleocytoplasmic transport mechanism. These Nups display a complex and nonrandom amino-acid architecture of phenylalanine glycine (FG)-repeat clusters and intra-FG linkers. How such heterogeneous sequence composition relates to function and could give rise to a transport mechanism is still unclear. Here we describe a combined chemical biology and single-molecule fluorescence approach to study the large human Nup153 FG-domain. In order to obtain insights into the properties of this domain beyond the average behavior, we probed the end-to-end distance (RE) of several ∼50-residues long FG-repeat clusters in the context of the whole protein domain. Despite the sequence heterogeneity of these FG-clusters, we detected a reoccurring and consistent compaction from a relaxed coil behavior under denaturing conditions (RE/RE,RC = 0.99 ± 0.15 with RE,RC corresponding to ideal relaxed coil behavior) to a collapsed state under native conditions (RE/RE,RC = 0.79 ± 0.09). We then analyzed the properties of this protein on the supramolecular level, and determined that this human FG-domain was in fact able to form a hydrogel with physiological permeability barrier properties. PMID:21961597
The hepatitis C virus Core protein is a potent nucleic acid chaperone that directs dimerization of the viral (+) strand RNA in vitro

PubMed Central

Cristofari, Gaël; Ivanyi-Nagy, Roland; Gabus, Caroline; Boulant, Steeve; Lavergne, Jean-Pierre; Penin, François; Darlix, Jean-Luc

2004-01-01

The hepatitis C virus (HCV) is an important human pathogen causing chronic hepatitis, liver cirrhosis and hepatocellular carcinoma. HCV is an enveloped virus with a positive-sense, single-stranded RNA genome encoding a single polyprotein that is processed to generate viral proteins. Several hundred molecules of the structural Core protein are thought to coat the genome in the viral particle, as do nucleocapsid (NC) protein molecules in Retroviruses, another class of enveloped viruses containing a positive-sense RNA genome. Retroviral NC proteins also possess nucleic acid chaperone properties that play critical roles in the structural remodelling of the genome during retrovirus replication. This analogy between HCV Core and retroviral NC proteins prompted us to investigate the putative nucleic acid chaperoning properties of the HCV Core protein. Here we report that Core protein chaperones the annealing of complementary DNA and RNA sequences and the formation of the most stable duplex by strand exchange. These results show that the HCV Core is a nucleic acid chaperone similar to retroviral NC proteins. We also find that the Core protein directs dimerization of HCV (+) RNA 3′ untranslated region which is promoted by a conserved palindromic sequence possibly involved at several stages of virus replication. PMID:15141033
The hepatitis C virus Core protein is a potent nucleic acid chaperone that directs dimerization of the viral (+) strand RNA in vitro.

PubMed

Cristofari, Gaël; Ivanyi-Nagy, Roland; Gabus, Caroline; Boulant, Steeve; Lavergne, Jean-Pierre; Penin, François; Darlix, Jean-Luc

2004-01-01

The hepatitis C virus (HCV) is an important human pathogen causing chronic hepatitis, liver cirrhosis and hepatocellular carcinoma. HCV is an enveloped virus with a positive-sense, single-stranded RNA genome encoding a single polyprotein that is processed to generate viral proteins. Several hundred molecules of the structural Core protein are thought to coat the genome in the viral particle, as do nucleocapsid (NC) protein molecules in Retroviruses, another class of enveloped viruses containing a positive-sense RNA genome. Retroviral NC proteins also possess nucleic acid chaperone properties that play critical roles in the structural remodelling of the genome during retrovirus replication. This analogy between HCV Core and retroviral NC proteins prompted us to investigate the putative nucleic acid chaperoning properties of the HCV Core protein. Here we report that Core protein chaperones the annealing of complementary DNA and RNA sequences and the formation of the most stable duplex by strand exchange. These results show that the HCV Core is a nucleic acid chaperone similar to retroviral NC proteins. We also find that the Core protein directs dimerization of HCV (+) RNA 3' untranslated region which is promoted by a conserved palindromic sequence possibly involved at several stages of virus replication.
Electron-molecule scattering in a strong laser field: Two-center interference effects

NASA Astrophysics Data System (ADS)

Dakić, J.; Habibović, D.; Čerkić, A.; Busuladžić, M.; Milošević, D. B.

2017-10-01

Laser-assisted scattering of electrons on diatomic molecules is considered using the S -matrix theory within the second Born approximation. The first term of the expansion in powers of the scattering potential corresponds to the direct or single laser-assisted scattering of electrons on molecular targets, while the second term of this expansion corresponds to the laser-assisted rescattering or double scattering. The rescattered electrons may have considerably higher energies in the final state than those that scattered only once. For multicenter polyatomic molecules scattering and rescattering may happen at any center and in any order. All these cases contribute to the scattering amplitude and the interference of different contributions leads to an increase or a decrease of the differential cross section in particular electron energy regions. For diatomic molecules there are two such contributions for single scattering and four contributions for double scattering. Analyzing the spectra of the scattered electrons, we find two interesting effects. For certain molecular orientations, the plateaus in the electron energy spectrum, characteristic of laser-assisted electron-atom scattering, are replaced by a sequence of gradually declining maxima, caused by the two-center interference effects. The second effect is the appearance of symmetric U -shaped structures in the angle-resolved energy spectra, which are described very well by the analytical formulas we provide.
Molecular characterization of faba bean necrotic yellows viruses in Tunisia.

PubMed

Kraberger, Simona; Kumari, Safaa G; Najar, Asma; Stainton, Daisy; Martin, Darren P; Varsani, Arvind

2018-03-01

Faba bean necrotic yellows virus (FBNYV) (genus Nanovirus; family Nanoviridae) has a genome comprising eight individually encapsidated circular single-stranded DNA components. It has frequently been found infecting faba bean (Vicia faba L.) and chickpea (Cicer arietinum L.) in association with satellite molecules (alphasatellites). Genome sequences of FBNYV from Azerbaijan, Egypt, Iran, Morocco, Spain and Syria have been determined previously and we now report the first five genome sequences of FBNYV and associated alphasatellites from faba bean sampled in Tunisia. In addition, we have determined the genome sequences of two additional FBNYV isolates from chickpea plants sampled in Syria and Iran. All individual FBNYV genome component sequences that were determined here share > 84% nucleotide sequence identity with FBNYV sequences available in public databases, with the DNA-M component displaying the highest degree of diversity. As with other studied nanoviruses, recombination and genome component reassortment occurs frequently both between FBNYV genomes and between genomes of nanoviruses belonging to other species.
Torque measurements reveal sequence-specific cooperative transitions in supercoiled DNA

PubMed Central

Oberstrass, Florian C.; Fernandes, Louis E.; Bryant, Zev

2012-01-01

B-DNA becomes unstable under superhelical stress and is able to adopt a wide range of alternative conformations including strand-separated DNA and Z-DNA. Localized sequence-dependent structural transitions are important for the regulation of biological processes such as DNA replication and transcription. To directly probe the effect of sequence on structural transitions driven by torque, we have measured the torsional response of a panel of DNA sequences using single molecule assays that employ nanosphere rotational probes to achieve high torque resolution. The responses of Z-forming d(pGpC)n sequences match our predictions based on a theoretical treatment of cooperative transitions in helical polymers. “Bubble” templates containing 50–100 bp mismatch regions show cooperative structural transitions similar to B-DNA, although less torque is required to disrupt strand–strand interactions. Our mechanical measurements, including direct characterization of the torsional rigidity of strand-separated DNA, establish a framework for quantitative predictions of the complex torsional response of arbitrary sequences in their biological context. PMID:22474350
Directional rolling of positively charged nanoparticles along a flexibility gradient on long DNA molecules.

PubMed

Park, Suehyun; Joo, Heesun; Kim, Jun Soo

2018-01-31

Directing the motion of molecules/colloids in any specific direction is of great interest in many applications of chemistry, physics, and biological sciences, where regulated positioning or transportation of materials is highly desired. Using Brownian dynamics simulations of coarse-grained models of a long, double-stranded DNA molecule and positively charged nanoparticles, we observed that the motion of a single nanoparticle bound to and wrapped by the DNA molecule can be directed along a gradient of DNA local flexibility. The flexibility gradient is constructed along a 0.8 kilobase-pair DNA molecule such that local persistence length decreases gradually from 50 nm to 40 nm, mimicking a gradual change in sequence-dependent flexibility. Nanoparticles roll over a long DNA molecule from less flexible regions towards more flexible ones as a result of the decreasing energetic cost of DNA bending and wrapping. In addition, the rolling becomes slightly accelerated as the positive charge of nanoparticles decreases due to a lower free energy barrier of DNA detachment from charged nanoparticle for processive rolling. This study suggests that the variation in DNA local flexibility can be utilized in constructing and manipulating supramolecular assemblies of DNA molecules and nanoparticles in structural DNA nanotechnology.
Crystal structure of tandem type III fibronectin domains from Drosophila neuroglian at 2.0 A.

PubMed

Huber, A H; Wang, Y M; Bieber, A J; Bjorkman, P J

1994-04-01

We report the crystal structure of two adjacent fibronectin type III repeats from the Drosophila neural cell adhesion molecule neuroglian. Each domain consists of two antiparallel beta sheets and is folded topologically identically to single fibronectin type III domains from the extracellular matrix proteins tenascin and fibronectin. beta bulges and left-handed polyproline II helices disrupt the regular beta sheet structure of both neuroglian domains. The hydrophobic interdomain interface includes a metal-binding site, presumably involved in stabilizing the relative orientation between domains and predicted by sequence comparision to be present in the vertebrate homolog molecule L1. The neuroglian domains are related by a near perfect 2-fold screw axis along the longest molecular dimension. Using this relationship, a model for arrays of tandem fibronectin type III repeats in neuroglian and other molecules is proposed.
HLA genotyping by next-generation sequencing of complementary DNA.

PubMed

Segawa, Hidenobu; Kukita, Yoji; Kato, Kikuya

2017-11-28

Genotyping of the human leucocyte antigen (HLA) is indispensable for various medical treatments. However, unambiguous genotyping is technically challenging due to high polymorphism of the corresponding genomic region. Next-generation sequencing is changing the landscape of genotyping. In addition to high throughput of data, its additional advantage is that DNA templates are derived from single molecules, which is a strong merit for the phasing problem. Although most currently developed technologies use genomic DNA, use of cDNA could enable genotyping with reduced costs in data production and analysis. We thus developed an HLA genotyping system based on next-generation sequencing of cDNA. Each HLA gene was divided into 3 or 4 target regions subjected to PCR amplification and subsequent sequencing with Ion Torrent PGM. The sequence data were then subjected to an automated analysis. The principle of the analysis was to construct candidate sequences generated from all possible combinations of variable bases and arrange them in decreasing order of the number of reads. Upon collecting candidate sequences from all target regions, 2 haplotypes were usually assigned. Cases not assigned 2 haplotypes were forwarded to 4 additional processes: selection of candidate sequences applying more stringent criteria, removal of artificial haplotypes, selection of candidate sequences with a relaxed threshold for sequence matching, and countermeasure for incomplete sequences in the HLA database. The genotyping system was evaluated using 30 samples; the overall accuracy was 97.0% at the field 3 level and 98.3% at the G group level. With one sample, genotyping of DPB1 was not completed due to short read size. We then developed a method for complete sequencing of individual molecules of the DPB1 gene, using the molecular barcode technology. The performance of the automatic genotyping system was comparable to that of systems developed in previous studies. Thus, next-generation sequencing of cDNA is a viable option for HLA genotyping.
Click strategies for single-molecule protein fluorescence.

PubMed

Milles, Sigrid; Tyagi, Swati; Banterle, Niccolò; Koehler, Christine; VanDelinder, Virginia; Plass, Tilman; Neal, Adrian P; Lemke, Edward A

2012-03-21

Single-molecule methods have matured into central tools for studies in biology. Foerster resonance energy transfer (FRET) techniques, in particular, have been widely applied to study biomolecular structure and dynamics. The major bottleneck for a facile and general application of these studies arises from the need to label biological samples site-specifically with suitable fluorescent dyes. In this work, we present an optimized strategy combining click chemistry and the genetic encoding of unnatural amino acids (UAAs) to overcome this limitation for proteins. We performed a systematic study with a variety of clickable UAAs and explored their potential for high-resolution single-molecule FRET (smFRET). We determined all parameters that are essential for successful single-molecule studies, such as accessibility of the probes, expression yield of proteins, and quantitative labeling. Our multiparameter fluorescence analysis allowed us to gain new insights into the effects and photophysical properties of fluorescent dyes linked to various UAAs for smFRET measurements. This led us to determine that, from the extended tool set that we now present, genetically encoding propargyllysine has major advantages for state-of-the-art measurements compared to other UAAs. Using this optimized system, we present a biocompatible one-step dual-labeling strategy of the regulatory protein RanBP3 with full labeling position freedom. Our technique allowed us then to determine that the region encompassing two FxFG repeat sequences adopts a disordered but collapsed state. RanBP3 serves here as a prototypical protein that, due to its multiple cysteines, size, and partially disordered structure, is not readily accessible to any of the typical structure determination techniques such as smFRET, NMR, and X-ray crystallography.
Single molecule molecular inversion probes for targeted, high-accuracy detection of low-frequency variation.

PubMed

Hiatt, Joseph B; Pritchard, Colin C; Salipante, Stephen J; O'Roak, Brian J; Shendure, Jay

2013-05-01

The detection and quantification of genetic heterogeneity in populations of cells is fundamentally important to diverse fields, ranging from microbial evolution to human cancer genetics. However, despite the cost and throughput advances associated with massively parallel sequencing, it remains challenging to reliably detect mutations that are present at a low relative abundance in a given DNA sample. Here we describe smMIP, an assay that combines single molecule tagging with multiplex targeted capture to enable practical and highly sensitive detection of low-frequency or subclonal variation. To demonstrate the potential of the method, we simultaneously resequenced 33 clinically informative cancer genes in eight cell line and 45 clinical cancer samples. Single molecule tagging facilitated extremely accurate consensus calling, with an estimated per-base error rate of 8.4 × 10(-6) in cell lines and 2.6 × 10(-5) in clinical specimens. False-positive mutations in the single molecule consensus base-calls exhibited patterns predominantly consistent with DNA damage, including 8-oxo-guanine and spontaneous deamination of cytosine. Based on mixing experiments with cell line samples, sensitivity for mutations above 1% frequency was 83% with no false positives. At clinically informative sites, we identified seven low-frequency point mutations (0.2%-4.7%), including BRAF p.V600E (melanoma, 0.2% alternate allele frequency), KRAS p.G12V (lung, 0.6%), JAK2 p.V617F (melanoma, colon, two lung, 0.3%-1.4%), and NRAS p.Q61R (colon, 4.7%). We anticipate that smMIP will be broadly adoptable as a practical and effective method for accurately detecting low-frequency mutations in both research and clinical settings.
Single molecule molecular inversion probes for targeted, high-accuracy detection of low-frequency variation

PubMed Central

Hiatt, Joseph B.; Pritchard, Colin C.; Salipante, Stephen J.; O'Roak, Brian J.; Shendure, Jay

2013-01-01

The detection and quantification of genetic heterogeneity in populations of cells is fundamentally important to diverse fields, ranging from microbial evolution to human cancer genetics. However, despite the cost and throughput advances associated with massively parallel sequencing, it remains challenging to reliably detect mutations that are present at a low relative abundance in a given DNA sample. Here we describe smMIP, an assay that combines single molecule tagging with multiplex targeted capture to enable practical and highly sensitive detection of low-frequency or subclonal variation. To demonstrate the potential of the method, we simultaneously resequenced 33 clinically informative cancer genes in eight cell line and 45 clinical cancer samples. Single molecule tagging facilitated extremely accurate consensus calling, with an estimated per-base error rate of 8.4 × 10−6 in cell lines and 2.6 × 10−5 in clinical specimens. False-positive mutations in the single molecule consensus base-calls exhibited patterns predominantly consistent with DNA damage, including 8-oxo-guanine and spontaneous deamination of cytosine. Based on mixing experiments with cell line samples, sensitivity for mutations above 1% frequency was 83% with no false positives. At clinically informative sites, we identified seven low-frequency point mutations (0.2%–4.7%), including BRAF p.V600E (melanoma, 0.2% alternate allele frequency), KRAS p.G12V (lung, 0.6%), JAK2 p.V617F (melanoma, colon, two lung, 0.3%–1.4%), and NRAS p.Q61R (colon, 4.7%). We anticipate that smMIP will be broadly adoptable as a practical and effective method for accurately detecting low-frequency mutations in both research and clinical settings. PMID:23382536

Evidence for a Complex Class of Nonadenylated mRNA in Drosophila

PubMed Central

Zimmerman, J. Lynn; Fouts, David L.; Manning, Jerry E.

1980-01-01

The amount, by mass, of poly(A+) mRNA present in the polyribosomes of third-instar larvae of Drosophila melanogaster, and the relative contribution of the poly(A+) mRNA to the sequence complexity of total polysomal RNA, has been determined. Selective removal of poly(A+) mRNA from total polysomal RNA by use of either oligo-dT-cellulose, or poly(U)-sepharose affinity chromatography, revealed that only 0.15% of the mass of the polysomal RNA was present as poly(A+) mRNA. The present study shows that this RNA hybridized at saturation with 3.3% of the single-copy DNA in the Drosophila genome. After correction for asymmetric transcription and reactability of the DNA, 7.4% of the single-copy DNA in the Drosophila genome is represented in larval poly(A+) mRNA. This corresponds to 6.73 x 106 nucleotides of mRNA coding sequences, or approximately 5,384 diverse RNA sequences of average size 1,250 nucleotides. However, total polysomal RNA hybridizes at saturation to 10.9% of the single-copy DNA sequences. After correcting this value for asymmetric transcription and tracer DNA reactability, 24% of the single-copy DNA in Drosophila is represented in total polysomal RNA. This corresponds to 2.18 x 107 nucleotides of RNA coding sequences or 17,440 diverse RNA molecules of size 1,250 nucleotides. This value is 3.2 times greater than that observed for poly(A+) mRNA, and indicates that ≃69% of the polysomal RNA sequence complexity is contributed by nonadenylated RNA. Furthermore, if the number of different structural genes represented in total polysomal RNA is ≃1.7 x 104, then the number of genes expressed in third-instar larvae exceeds the number of chromomeres in Drosophila by about a factor of three. This numerology indicates that the number of chromomeres observed in polytene chromosomes does not reflect the number of structural gene sequences in the Drosophila genome. PMID:6777246
Single nucleotide primer extension to detect genetic diseases: Experimental application to hemophilia B (factor IX) and cystic fibrosis genes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kuppuswamy, M.N.; Hoffmann, J.W.; Spitzer, S.G.

1991-02-15

In this report, the authors describe an approach to detect the presence of abnormal alleles in those genetic diseases in which frequency of occurrence of the same mutation is high (e.g., hemophilia B). Initially, from each subject, the DNA fragment containing the putative mutation site is amplified by the polymerase chain reaction. For each fragment two reaction mixtures are then prepared. Each contains the amplified fragment, a primer (18-mer or longer) whose sequence is identical to the coding sequence of the normal gene immediately flanking the 5{prime} end of the mutation site, and either an {alpha}-{sup 32}P-labeled nucleotide corresponding tomore » the normal coding sequence at the mutation site or an {alpha}-{sup 32}P-labeled nucleotide corresponding to the mutant sequence. An essential feature of the present methodology is that the base immediately 3{prime} to the template-bound primer is one of those altered in the mutant, since in this way an extension of the primer by a single base will give an extended molecule characteristic of either the mutant or the wild type. The method is rapid and should be useful in carrier detection and prenatal diagnosis of every genetic disease with a known sequence variation.« less
Evidence of protein-free homology recognition in magnetic bead force-extension experiments

NASA Astrophysics Data System (ADS)

O'Lee, D. J.; Danilowicz, C.; Rochester, C.; Kornyshev, A. A.; Prentiss, M.

2016-07-01

Earlier theoretical studies have proposed that the homology-dependent pairing of large tracts of dsDNA may be due to physical interactions between homologous regions. Such interactions could contribute to the sequence-dependent pairing of chromosome regions that may occur in the presence or the absence of double-strand breaks. Several experiments have indicated the recognition of homologous sequences in pure electrolytic solutions without proteins. Here, we report single-molecule force experiments with a designed 60 kb long dsDNA construct; one end attached to a solid surface and the other end to a magnetic bead. The 60 kb constructs contain two 10 kb long homologous tracts oriented head to head, so that their sequences match if the two tracts fold on each other. The distance between the bead and the surface is measured as a function of the force applied to the bead. At low forces, the construct molecules extend substantially less than normal, control dsDNA, indicating the existence of preferential interaction between the homologous regions. The force increase causes no abrupt but continuous unfolding of the paired homologous regions. Simple semi-phenomenological models of the unfolding mechanics are proposed, and their predictions are compared with the data.
Re-polarization of nuclear spins using selective SABRE-INEPT.

PubMed

Knecht, Stephan; Kiryutin, Alexey S; Yurkovskaya, Alexandra V; Ivanov, Konstantin L

2018-02-01

A method is proposed for significant improvement of NMR pulse sequences used in high-field SABRE (Signal Amplification By Reversible Exchange) experiments. SABRE makes use of spin order transfer from parahydrogen (pH 2 , the H 2 molecule in its singlet spin state) to a substrate in a transient organometallic Ir-based complex. The technique proposed here utilizes "re-polarization", i.e., multiple application of an NMR pulse sequence used for spin order transfer. During re-polarization only the form of the substrate, which is bound to the complex, is excited by selective NMR pulses and the resulting polarization is transferred to the free substrate via chemical exchange. Owing to the fact that (i) only a small fraction of the substrate molecules is in the bound form and (ii) spin relaxation of the free substrate is slow, the re-polarization scheme provides greatly improved NMR signal enhancement, ε. For instance, when pyridine is used as a substrate, single use of the SABRE-INEPT sequence provides ε≈260 for 15 N nuclei, whereas SABRE-INEPT with re-polarization yields ε>2000. We anticipate that the proposed method is useful for achieving maximal NMR enhancement with spin hyperpolarization techniques. Copyright © 2017 Elsevier Inc. All rights reserved.
Re-polarization of nuclear spins using selective SABRE-INEPT

NASA Astrophysics Data System (ADS)

Knecht, Stephan; Kiryutin, Alexey S.; Yurkovskaya, Alexandra V.; Ivanov, Konstantin L.

2018-02-01

A method is proposed for significant improvement of NMR pulse sequences used in high-field SABRE (Signal Amplification By Reversible Exchange) experiments. SABRE makes use of spin order transfer from parahydrogen (pH2, the H2 molecule in its singlet spin state) to a substrate in a transient organometallic Ir-based complex. The technique proposed here utilizes "re-polarization", i.e., multiple application of an NMR pulse sequence used for spin order transfer. During re-polarization only the form of the substrate, which is bound to the complex, is excited by selective NMR pulses and the resulting polarization is transferred to the free substrate via chemical exchange. Owing to the fact that (i) only a small fraction of the substrate molecules is in the bound form and (ii) spin relaxation of the free substrate is slow, the re-polarization scheme provides greatly improved NMR signal enhancement, ε . For instance, when pyridine is used as a substrate, single use of the SABRE-INEPT sequence provides ε ≈ 260 for 15N nuclei, whereas SABRE-INEPT with re-polarization yields ε > 2000 . We anticipate that the proposed method is useful for achieving maximal NMR enhancement with spin hyperpolarization techniques.
Selection of a platinum-binding sequence in a loop of a four-helix bundle protein.

PubMed

Yagi, Sota; Akanuma, Satoshi; Kaji, Asumi; Niiro, Hiroya; Akiyama, Hayato; Uchida, Tatsuya; Yamagishi, Akihiko

2018-02-01

Protein-metal hybrids are functional materials with various industrial applications. For example, a redox enzyme immobilized on a platinum electrode is a key component of some biofuel cells and biosensors. To create these hybrid materials, protein molecules are bound to metal surfaces. Here, we report the selection of a novel platinum-binding sequence in a loop of a four-helix bundle protein, the Lac repressor four-helix protein (LARFH), an artificial protein in which four identical α-helices are connected via three identical loops. We created a genetic library in which the Ser-Gly-Gln-Gly-Gly-Ser sequence within the first inter-helical loop of LARFH was semi-randomly mutated. The library was then subjected to selection for platinum-binding affinity by using the T7 phage display method. The majority of the selected variants contained the Tyr-Lys-Arg-Gly-Tyr-Lys (YKRGYK) sequence in their randomized segment. We characterized the platinum-binding properties of mutant LARFH by using quartz crystal microbalance analysis. Mutant LARFH seemed to interact with platinum through its loop containing the YKRGYK sequence, as judged by the estimated exclusive area occupied by a single molecule. Furthermore, a 10-residue peptide containing the YKRGYK sequence bound to platinum with reasonably high affinity and basic side chains in the peptide were crucial in mediating this interaction. In conclusion, we have identified an amino acid sequence, YKRGYK, in the loop of a helix-loop-helix motif that shows high platinum-binding affinity. This sequence could be grafted into loops of other polypeptides as an approach to immobilize proteins on platinum electrodes for use as biosensors among other applications. Copyright © 2017 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Massively Parallel, Molecular Analysis Platform Developed Using a CMOS Integrated Circuit With Biological Nanopores

PubMed Central

Roever, Stefan

2012-01-01

A massively parallel, low cost molecular analysis platform will dramatically change the nature of protein, molecular and genomics research, DNA sequencing, and ultimately, molecular diagnostics. An integrated circuit (IC) with 264 sensors was fabricated using standard CMOS semiconductor processing technology. Each of these sensors is individually controlled with precision analog circuitry and is capable of single molecule measurements. Under electronic and software control, the IC was used to demonstrate the feasibility of creating and detecting lipid bilayers and biological nanopores using wild type α-hemolysin. The ability to dynamically create bilayers over each of the sensors will greatly accelerate pore development and pore mutation analysis. In addition, the noise performance of the IC was measured to be 30fA(rms). With this noise performance, single base detection of DNA was demonstrated using α-hemolysin. The data shows that a single molecule, electrical detection platform using biological nanopores can be operationalized and can ultimately scale to millions of sensors. Such a massively parallel platform will revolutionize molecular analysis and will completely change the field of molecular diagnostics in the future.
Quantifying short-lived events in multistate ionic current measurements.

PubMed

Balijepalli, Arvind; Ettedgui, Jessica; Cornio, Andrew T; Robertson, Joseph W F; Cheung, Kin P; Kasianowicz, John J; Vaz, Canute

2014-02-25

We developed a generalized technique to characterize polymer-nanopore interactions via single channel ionic current measurements. Physical interactions between analytes, such as DNA, proteins, or synthetic polymers, and a nanopore cause multiple discrete states in the current. We modeled the transitions of the current to individual states with an equivalent electrical circuit, which allowed us to describe the system response. This enabled the estimation of short-lived states that are presently not characterized by existing analysis techniques. Our approach considerably improves the range and resolution of single-molecule characterization with nanopores. For example, we characterized the residence times of synthetic polymers that are three times shorter than those estimated with existing algorithms. Because the molecule's residence time follows an exponential distribution, we recover nearly 20-fold more events per unit time that can be used for analysis. Furthermore, the measurement range was extended from 11 monomers to as few as 8. Finally, we applied this technique to recover a known sequence of single-stranded DNA from previously published ion channel recordings, identifying discrete current states with subpicoampere resolution.
DNA-templated synthesis of Pt nanoparticles on single-walled carbon nanotubes.

PubMed

Dong, Lifeng

2009-11-18

A series of electron microscopy characterizations demonstrate that single-stranded deoxyribonucleic acid (ssDNA) can bind to nanotube surfaces and disperse bundled single-walled carbon nanotubes (SWCNTs) into individual tubes. The ssDNA molecules on the nanotube surfaces demonstrate various morphologies, such as aggregated clusters and spiral wrapping around a nanotube with different pitches and spaces, indicating that the morphology of the SWCNT/DNA hybrids is not related solely to the base sequence of the ssDNA or the chirality or the diameter of the nanotubes. In addition to serving as a non-covalent dispersion agent, the ssDNA molecules bonded to the nanotube surface can provide addresses for localizing Pt(II) complexes along the nanotubes. The Pt nanoparticles obtained by a reduction of the Pt2+-DNA adducts are crystals with a size of < or =1-2 nm. These results expand our understanding of the interactions between ssDNA and SWCNTs and provide an efficient approach for positioning Pt and other metal particles, with uniform sizes and without aggregations, along the nanotube surfaces for applications in direct ethanol/methanol fuel cells and nanoscale electronics.
Developing Single-Molecule TPM Experiments for Direct Observation of Successful RecA-Mediated Strand Exchange Reaction

PubMed Central

Fan, Hsiu-Fang; Cox, Michael M.; Li, Hung-Wen

2011-01-01

RecA recombinases play a central role in homologous recombination. Once assembled on single-stranded (ss) DNA, RecA nucleoprotein filaments mediate the pairing of homologous DNA sequences and strand exchange processes. We have designed two experiments based on tethered particle motion (TPM) to investigate the fates of the invading and the outgoing strands during E. coli RecA-mediated pairing and strand exchange at the single-molecule level in the absence of force. TPM experiments measure the tethered bead Brownian motion indicative of the DNA tether length change resulting from RecA binding and dissociation. Experiments with beads labeled on either the invading strand or the outgoing strand showed that DNA pairing and strand exchange occurs successfully in the presence of either ATP or its non-hydrolyzable analog, ATPγS. The strand exchange rates and efficiencies are similar under both ATP and ATPγS conditions. In addition, the Brownian motion time-courses suggest that the strand exchange process progresses uni-directionally in the 5′-to-3′ fashion, using a synapse segment with a wide and continuous size distribution. PMID:21765895
Method to transform algae, materials therefor, and products produced thereby

DOEpatents

Dunahay, T.G.; Roessler, P.G.; Jarvis, E.E.

1997-08-26

Disclosed is a method to transform chlorophyll C-containing algae. The method includes introducing a recombinant molecule comprising a nucleic acid molecule encoding a dominant selectable marker operatively linked to an algal regulatory control sequence into a chlorophyll C-containing alga in such a manner that the marker is produced by the alga. In a preferred embodiment the algal regulatory control sequence is derived from a diatom and preferably Cyclotella cryptica. Also disclosed is a chimeric molecule having one or more regulatory control sequences derived from one or more chlorophyll C-containing algae operatively linked to a nucleic acid molecule encoding a selectable marker, an RNA molecule and/or a protein, wherein the nucleic acid molecule does not normally occur with one or more of the regulatory control sequences. Further, specifically disclosed are molecules pACCNPT10, pACCNPT4.8 and pACCNPT5.1. The methods and materials of the present invention provide the ability to accomplish stable genetic transformation of chlorophyll C-containing algae. 2 figs.
Method to transform algae, materials therefor, and products produced thereby

DOEpatents

Dunahay, Terri Goodman; Roessler, Paul G.; Jarvis, Eric E.

1997-01-01

Disclosed is a method to transform chlorophyll C-containing algae which includes introducing a recombinant molecule comprising a nucleic acid molecule encoding a dominant selectable marker operatively linked to an algal regulatory control sequence into a chlorophyll C-containing alga in such a manner that the marker is produced by the alga. In a preferred embodiment the algal regulatory control sequence is derived from a diatom and preferably Cyclotella cryptica. Also disclosed is a chimeric molecule having one or more regulatory control sequences derived from one or more chlorophyll C-containing algae operatively linked to a nucleic acid molecule encoding a selectable marker, an RNA molecule and/or a protein, wherein the nucleic acid molecule does not normally occur with one or more of the regulatory control sequences. Further specifically disclosed are molecules pACCNPT10, pACCNPT4.8 and pACCNPT5.1. The methods and materials of the present invention provide the ability to accomplish stable genetic transformation of chlorophyll C-containing algae.
Accuracy of maximum likelihood estimates of a two-state model in single-molecule FRET

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gopich, Irina V.

2015-01-21

Photon sequences from single-molecule Förster resonance energy transfer (FRET) experiments can be analyzed using a maximum likelihood method. Parameters of the underlying kinetic model (FRET efficiencies of the states and transition rates between conformational states) are obtained by maximizing the appropriate likelihood function. In addition, the errors (uncertainties) of the extracted parameters can be obtained from the curvature of the likelihood function at the maximum. We study the standard deviations of the parameters of a two-state model obtained from photon sequences with recorded colors and arrival times. The standard deviations can be obtained analytically in a special case when themore » FRET efficiencies of the states are 0 and 1 and in the limiting cases of fast and slow conformational dynamics. These results are compared with the results of numerical simulations. The accuracy and, therefore, the ability to predict model parameters depend on how fast the transition rates are compared to the photon count rate. In the limit of slow transitions, the key parameters that determine the accuracy are the number of transitions between the states and the number of independent photon sequences. In the fast transition limit, the accuracy is determined by the small fraction of photons that are correlated with their neighbors. The relative standard deviation of the relaxation rate has a “chevron” shape as a function of the transition rate in the log-log scale. The location of the minimum of this function dramatically depends on how well the FRET efficiencies of the states are separated.« less
Accuracy of maximum likelihood estimates of a two-state model in single-molecule FRET

PubMed Central

Gopich, Irina V.

2015-01-01

Photon sequences from single-molecule Förster resonance energy transfer (FRET) experiments can be analyzed using a maximum likelihood method. Parameters of the underlying kinetic model (FRET efficiencies of the states and transition rates between conformational states) are obtained by maximizing the appropriate likelihood function. In addition, the errors (uncertainties) of the extracted parameters can be obtained from the curvature of the likelihood function at the maximum. We study the standard deviations of the parameters of a two-state model obtained from photon sequences with recorded colors and arrival times. The standard deviations can be obtained analytically in a special case when the FRET efficiencies of the states are 0 and 1 and in the limiting cases of fast and slow conformational dynamics. These results are compared with the results of numerical simulations. The accuracy and, therefore, the ability to predict model parameters depend on how fast the transition rates are compared to the photon count rate. In the limit of slow transitions, the key parameters that determine the accuracy are the number of transitions between the states and the number of independent photon sequences. In the fast transition limit, the accuracy is determined by the small fraction of photons that are correlated with their neighbors. The relative standard deviation of the relaxation rate has a “chevron” shape as a function of the transition rate in the log-log scale. The location of the minimum of this function dramatically depends on how well the FRET efficiencies of the states are separated. PMID:25612692
Experimental single-strain mobilomics reveals events that shape pathogen emergence

DOE PAGES

Schoeniger, Joseph S.; Hudson, Corey M.; Bent, Zachary W.; ...

2016-07-04

Virulence and resistance genes carried on mobile DNAs such as genomic islands (GIs) and plasmids promote bacterial pathogen emergence. An early step in the mobilization of GIs is their excision, which produces both a circular form of the GI and a deletion site in the chromosome; circular forms have also been described for some bacterial insertion sequences (ISs). We demonstrate that the recombinant sequence produced at the junction of such circles, and their corresponding deletion sites, can be detected sensitively in high throughput sequencing data, using new computational methods that enable empirical discovery of new mobile DNAs. Applied to themore » rich mobilome of a single strain (Kpn2146) of the emerging multidrug-resistant pathogen Klebsiella pneumoniae, our approach detected circular junctions for six GIs and seven IS types (several of the latter not previously known to circularize). Our methods further revealed differential biology of multiple mobile DNAs, imprecision of integrases and transposases, and differential activity among identical IS copies for IS26, ISKpn18 and ISKpn21. Exonuclease was used to enrich for circular dsDNA molecules, and internal calibration with the native Kpn2146 plasmids showed that not all molecules bearing GI and IS circular junctions were circular dsDNAs. Transposition events were also detected, revealing replicon preference (ISKpn18 preferring a conjugative IncA/C2 plasmid), local action (IS26), regional preferences, selection (against capsule synthesis), and left-right IS end swapping. Efficient discovery and global characterization of numerous mobile elements per experiment will allow detailed accounting of bacterial evolution, explaining the new gene combinations that arise in emerging pathogens.« less
Reaching the Ionic Current Detection Limit in Silicon-Based Nanopores

NASA Astrophysics Data System (ADS)

Puster, Matthew; Rodriguez-Manzo, Julio Alejandro; Nicolai, Adrien; Meunier, Vincent; Drndic, Marija

2015-03-01

Solid-state nanopores act as single-molecule sensors whereby passage of an individual molecule in aqueous electrolyte through a nanopore is registered as a change in ionic conductance (ΔG). Future nanopore applications such as DNA sequencing at high bandwidth require high ΔG for optimal signal-to-noise ratio. Reducing the nanopore diameter and thickness increase ΔG. Molecule size limits the diameter, thus efforts concentrate on minimizing the thickness by thinning oxide/nitride films or using 2D materials. Weighted by electrolyte conductivity the highest ΔG reported to date for DNA translocations were obtained with nanopores made in oxide/nitride films. We present a controlled electron irradiation technique to thin such films to the limit of their stability, producing nanopores tailored to molecule size in amorphous Si with thicknesses less than 2 nm. We compare ΔG values with results found in the literature for DNA translocation through these nanopores, where access resistance becomes comparable to the resistance through the nanopore itself.
A Benchmark Study on Error Assessment and Quality Control of CCS Reads Derived from the PacBio RS

PubMed Central

Jiao, Xiaoli; Zheng, Xin; Ma, Liang; Kutty, Geetha; Gogineni, Emile; Sun, Qiang; Sherman, Brad T.; Hu, Xiaojun; Jones, Kristine; Raley, Castle; Tran, Bao; Munroe, David J.; Stephens, Robert; Liang, Dun; Imamichi, Tomozumi; Kovacs, Joseph A.; Lempicki, Richard A.; Huang, Da Wei

2013-01-01

PacBio RS, a newly emerging third-generation DNA sequencing platform, is based on a real-time, single-molecule, nano-nitch sequencing technology that can generate very long reads (up to 20-kb) in contrast to the shorter reads produced by the first and second generation sequencing technologies. As a new platform, it is important to assess the sequencing error rate, as well as the quality control (QC) parameters associated with the PacBio sequence data. In this study, a mixture of 10 prior known, closely related DNA amplicons were sequenced using the PacBio RS sequencing platform. After aligning Circular Consensus Sequence (CCS) reads derived from the above sequencing experiment to the known reference sequences, we found that the median error rate was 2.5% without read QC, and improved to 1.3% with an SVM based multi-parameter QC method. In addition, a De Novo assembly was used as a downstream application to evaluate the effects of different QC approaches. This benchmark study indicates that even though CCS reads are post error-corrected it is still necessary to perform appropriate QC on CCS reads in order to produce successful downstream bioinformatics analytical results. PMID:24179701
A Benchmark Study on Error Assessment and Quality Control of CCS Reads Derived from the PacBio RS.

PubMed

Jiao, Xiaoli; Zheng, Xin; Ma, Liang; Kutty, Geetha; Gogineni, Emile; Sun, Qiang; Sherman, Brad T; Hu, Xiaojun; Jones, Kristine; Raley, Castle; Tran, Bao; Munroe, David J; Stephens, Robert; Liang, Dun; Imamichi, Tomozumi; Kovacs, Joseph A; Lempicki, Richard A; Huang, Da Wei

2013-07-31

PacBio RS, a newly emerging third-generation DNA sequencing platform, is based on a real-time, single-molecule, nano-nitch sequencing technology that can generate very long reads (up to 20-kb) in contrast to the shorter reads produced by the first and second generation sequencing technologies. As a new platform, it is important to assess the sequencing error rate, as well as the quality control (QC) parameters associated with the PacBio sequence data. In this study, a mixture of 10 prior known, closely related DNA amplicons were sequenced using the PacBio RS sequencing platform. After aligning Circular Consensus Sequence (CCS) reads derived from the above sequencing experiment to the known reference sequences, we found that the median error rate was 2.5% without read QC, and improved to 1.3% with an SVM based multi-parameter QC method. In addition, a De Novo assembly was used as a downstream application to evaluate the effects of different QC approaches. This benchmark study indicates that even though CCS reads are post error-corrected it is still necessary to perform appropriate QC on CCS reads in order to produce successful downstream bioinformatics analytical results.
Method for performing site-specific affinity fractionation for use in DNA sequencing

DOEpatents

Mirzabekov, Andrei Darievich; Lysov, Yuri Petrovich; Dubley, Svetlana A.

1999-01-01

A method for fractionating and sequencing DNA via affinity interaction is provided comprising contacting cleaved DNA to a first array of oligonucleotide molecules to facilitate hybridization between said cleaved DNA and the molecules; extracting the hybridized DNA from the molecules; contacting said extracted hybridized DNA with a second array of oligonucleotide molecules, wherein the oligonucleotide molecules in the second array have specified base sequences that are complementary to said extracted hybridized DNA; and attaching labeled DNA to the second array of oligonucleotide molecules, wherein the labeled re-hybridized DNA have sequences that are complementary to the oligomers. The invention further provides a method for performing multi-step conversions of the chemical structure of compounds comprising supplying an array of polyacrylamide vessels separated by hydrophobic surfaces; immobilizing a plurality of reactants, such as enzymes, in the vessels so that each vessel contains one reactant; contacting the compounds to each of the vessels in a predetermined sequence and for a sufficient time to convert the compounds to a desired state; and isolating the converted compounds from said array.
Miniaturized reaction vessel system, method for performing site-specific biochemical reactions and affinity fractionation for use in DNA sequencing

DOEpatents

Mirzabekov, Andrei Darievich; Lysov, Yuri Petrovich; Dubley, Svetlana A.

2000-01-01

A method for fractionating and sequencing DNA via affinity interaction is provided comprising contacting cleaved DNA to a first array of oligonucleotide molecules to facilitate hybridization between said cleaved DNA and the molecules; extracting the hybridized DNA from the molecules; contacting said extracted hybridized DNA with a second array of oligonucleotide molecules, wherein the oligonucleotide molecules in the second array have specified base sequences that are complementary to said extracted hybridized DNA; and attaching labeled DNA to the second array of oligonucleotide molecules, wherein the labeled re-hybridized DNA have sequences that are complementary to the oligomers. The invention further provides a method for performing multi-step conversions of the chemical structure of compounds comprising supplying an array of polyacrylamide vessels separated by hydrophobic surfaces; immobilizing a plurality of reactants, such as enzymes, in the vessels so that each vessel contains one reactant; contacting the compounds to each of the vessels in a predetermined sequence and for a sufficient time to convert the compounds to a desired state; and isolating the converted compounds from said array.

Method for performing site-specific affinity fractionation for use in DNA sequencing

DOEpatents

Mirzabekov, A.D.; Lysov, Y.P.; Dubley, S.A.

1999-05-18

A method for fractionating and sequencing DNA via affinity interaction is provided comprising contacting cleaved DNA to a first array of oligonucleotide molecules to facilitate hybridization between the cleaved DNA and the molecules; extracting the hybridized DNA from the molecules; contacting the extracted hybridized DNA with a second array of oligonucleotide molecules, wherein the oligonucleotide molecules in the second array have specified base sequences that are complementary to the extracted hybridized DNA; and attaching labeled DNA to the second array of oligonucleotide molecules, wherein the labeled re-hybridized DNA have sequences that are complementary to the oligomers. The invention further provides a method for performing multi-step conversions of the chemical structure of compounds comprising supplying an array of polyacrylamide vessels separated by hydrophobic surfaces; immobilizing a plurality of reactants, such as enzymes, in the vessels so that each vessel contains one reactant; contacting the compounds to each of the vessels in a predetermined sequence and for a sufficient time to convert the compounds to a desired state; and isolating the converted compounds from the array. 14 figs.
Tethered particle analysis of supercoiled circular DNA using peptide nucleic acid handles.

PubMed

Norregaard, Kamilla; Andersson, Magnus; Nielsen, Peter Eigil; Brown, Stanley; Oddershede, Lene B

2014-09-01

This protocol describes how to monitor individual naturally supercoiled circular DNA plasmids bound via peptide nucleic acid (PNA) handles between a bead and a surface. The protocol was developed for single-molecule investigation of the dynamics of supercoiled DNA, and it allows the investigation of both the dynamics of the molecule itself and of its interactions with a regulatory protein. Two bis-PNA clamps designed to bind with extremely high affinity to predetermined homopurine sequence sites in supercoiled DNA are prepared: one conjugated with digoxigenin for attachment to an anti-digoxigenin-coated glass cover slide, and one conjugated with biotin for attachment to a submicron-sized streptavidin-coated polystyrene bead. Plasmids are constructed, purified and incubated with the PNA handles. The dynamics of the construct is analyzed by tracking the tethered bead using video microscopy: less supercoiling results in more movement, and more supercoiling results in less movement. In contrast to other single-molecule methodologies, the current methodology allows for studying DNA in its naturally supercoiled state with constant linking number and constant writhe. The protocol has potential for use in studying the influence of supercoils on the dynamics of DNA and its associated proteins, e.g., topoisomerase. The procedure takes ~4 weeks.
Direct observation of processive exoribonuclease motion using optical tweezers.

PubMed

Fazal, Furqan M; Koslover, Daniel J; Luisi, Ben F; Block, Steven M

2015-12-08

Bacterial RNases catalyze the turnover of RNA and are essential for gene expression and quality surveillance of transcripts. In Escherichia coli, the exoribonucleases RNase R and polynucleotide phosphorylase (PNPase) play critical roles in degrading RNA. Here, we developed an optical-trapping assay to monitor the translocation of individual enzymes along RNA-based substrates. Single-molecule records of motion reveal RNase R to be highly processive: one molecule can unwind over 500 bp of a structured substrate. However, enzyme progress is interrupted by pausing and stalling events that can slow degradation in a sequence-dependent fashion. We found that the distance traveled by PNPase through structured RNA is dependent on the A+U content of the substrate and that removal of its KH and S1 RNA-binding domains can reduce enzyme processivity without affecting the velocity. By a periodogram analysis of single-molecule records, we establish that PNPase takes discrete steps of six or seven nucleotides. These findings, in combination with previous structural and biochemical data, support an asymmetric inchworm mechanism for PNPase motion. The assay developed here for RNase R and PNPase is well suited to studies of other exonucleases and helicases.
The Conformations of Confined Polymers in an External Potential

NASA Astrophysics Data System (ADS)

Morrison, Greg

The confinement of biomolecules is ubiquitous in nature, such as the spatial constraints of viral encapsulation, histone binding, and chromosomal packing. Advances in microfluidics and nanopore fabrication have permitted powerful new tools in single molecule manipulation and gene sequencing through molecular confinement as well. In order to fully understand and exploit these systems, the ability to predict the structure of spatially confined molecules is essential. In this talk, I describe a mean field approach to determine the properties of stiff polymers confined to cylinders and slits, which is relevant for a variety of biological and experimental conditions. I show that this approach is able to not only reproduce known scaling laws for confined wormlike chains, but also provides an improvement over existing weakly bending rod approximations in determining the detailed chain properties (such as correlation functions). Using this approach, we also show that it is possible to study the effect of an externally applied tension or static electric field in a natural and analytically tractable way. These external perturbations can alter the scaling laws and introduce important new length scales into the system, relevant for histone unbinding and single-molecule analysis of DNA.
NetMHCpan, a Method for Quantitative Predictions of Peptide Binding to Any HLA-A and -B Locus Protein of Known Sequence

PubMed Central

Nielsen, Morten; Lundegaard, Claus; Blicher, Thomas; Lamberth, Kasper; Harndahl, Mikkel; Justesen, Sune; Røder, Gustav; Peters, Bjoern; Sette, Alessandro; Lund, Ole; Buus, Søren

2007-01-01

Background Binding of peptides to Major Histocompatibility Complex (MHC) molecules is the single most selective step in the recognition of pathogens by the cellular immune system. The human MHC class I system (HLA-I) is extremely polymorphic. The number of registered HLA-I molecules has now surpassed 1500. Characterizing the specificity of each separately would be a major undertaking. Principal Findings Here, we have drawn on a large database of known peptide-HLA-I interactions to develop a bioinformatics method, which takes both peptide and HLA sequence information into account, and generates quantitative predictions of the affinity of any peptide-HLA-I interaction. Prospective experimental validation of peptides predicted to bind to previously untested HLA-I molecules, cross-validation, and retrospective prediction of known HIV immune epitopes and endogenous presented peptides, all successfully validate this method. We further demonstrate that the method can be applied to perform a clustering analysis of MHC specificities and suggest using this clustering to select particularly informative novel MHC molecules for future biochemical and functional analysis. Conclusions Encompassing all HLA molecules, this high-throughput computational method lends itself to epitope searches that are not only genome- and pathogen-wide, but also HLA-wide. Thus, it offers a truly global analysis of immune responses supporting rational development of vaccines and immunotherapy. It also promises to provide new basic insights into HLA structure-function relationships. The method is available at http://www.cbs.dtu.dk/services/NetMHCpan. PMID:17726526
Fluorescence enhancement on silver nanostructures: studies of components of ribosomal translation in vitro

NASA Astrophysics Data System (ADS)

Mandecki, Wlodek; Bharill, Shashank; Borejdo, Julian; Cabral, Diana; Cooperman, Barry S.; Farrell, Ian; Fetter, Linus; Goldman, Emanuel; Gryczynski, Zygmunt; Jakubowski, Hieronim; Liu, Hanqing; Luchowski, Rafal; Matveeva, Evgenia; Pan, Dongli; Qin, Haiou; Tennant, Donald; Gryczynski, Ignacy

2008-02-01

Metallic particles, silver in particular, can significantly enhance the fluorescence of dye molecules in the immediate vicinity (5-20 nm) of the particle. This magnifying effect can be theoretically explained/predicted by considering the change of photonic mode density near the fluorophore due to coupling to the conducting surface. We are using this method to observe fluorescence from a single ribosomal particle in a project aimed at acquiring sequence information from the translating ribosome (NIH's $1000 Genome Initiative). Several quartz slides with silver nanostructures were made using electron beam lithography techniques. The structures were approximately 50 nm high silver tiles measuring 400-700 nm on the side, and were spaced differently over a total area of 1 mm x 1 mm on any given quartz slide. In a preliminary experiment, we coated this surface with the Alexa 647-labeled antibodies and collected single molecule images using the MicroTime 200 (PicoQuant) confocal system. We showed that the fluorescence intensity measured over the silver islands film was more than 100-fold higher than fluorescence from a comparable site on uncoated section of the quartz slide. No noticeable photobleaching was seen. The fluorescence lifetime was very short, about 200 ps or less (this is the resolution limit of the system). The method has great promise for investigations of biologically relevant single molecules.
A state space based approach to localizing single molecules from multi-emitter images.

PubMed

Vahid, Milad R; Chao, Jerry; Ward, E Sally; Ober, Raimund J

2017-01-28

Single molecule super-resolution microscopy is a powerful tool that enables imaging at sub-diffraction-limit resolution. In this technique, subsets of stochastically photoactivated fluorophores are imaged over a sequence of frames and accurately localized, and the estimated locations are used to construct a high-resolution image of the cellular structures labeled by the fluorophores. Available localization methods typically first determine the regions of the image that contain emitting fluorophores through a process referred to as detection. Then, the locations of the fluorophores are estimated accurately in an estimation step. We propose a novel localization method which combines the detection and estimation steps. The method models the given image as the frequency response of a multi-order system obtained with a balanced state space realization algorithm based on the singular value decomposition of a Hankel matrix, and determines the locations of intensity peaks in the image as the pole locations of the resulting system. The locations of the most significant peaks correspond to the locations of single molecules in the original image. Although the accuracy of the location estimates is reasonably good, we demonstrate that, by using the estimates as the initial conditions for a maximum likelihood estimator, refined estimates can be obtained that have a standard deviation close to the Cramér-Rao lower bound-based limit of accuracy. We validate our method using both simulated and experimental multi-emitter images.
Cucurbituril mediated single molecule detection and identification via recognition tunneling.

PubMed

Xiao, Bohuai; Liang, Feng; Liu, Simin; Im, JongOne; Li, Yunchuan; Liu, Jing; Zhang, Bintian; Zhou, Jianghao; He, Jin; Chang, Shuai

2018-06-08

Recognition tunneling (RT) is an emerging technique for investigating single molecules in a tunnel junction. We have previously demonstrated its capability of single molecule detection and identification, as well as probing the dynamics of intermolecular bonding at the single molecule level. Here by introducing cucurbituril as a new class of recognition molecule, we demonstrate a powerful platform for electronically investigating the host-guest chemistry at single molecule level. In this report, we first investigated the single molecule electrical properties of cucurbituril in a tunnel junction. Then we studied two model guest molecules, aminoferrocene and amantadine, which were encapsulated by cucurbituril. Small differences in conductance and lifetime can be recognized between the host-guest complexes with the inclusion of different guest molecules. By using a machine learning algorithm to classify the RT signals in a hyper dimensional space, the accuracy of guest molecule recognition can be significantly improved, suggesting the possibility of using cucurbituril molecule for single molecule identification. This work enables a new class of recognition molecule for RT technique and opens the door for detecting a vast variety of small molecules by electrical measurements.
Improved Analysis of Nanopore Sequence Data and Scanning Nanopore Techniques

NASA Astrophysics Data System (ADS)

Szalay, Tamas

The field of nanopore research has been driven by the need to inexpensively and rapidly sequence DNA. In order to help realize this goal, this thesis describes the PoreSeq algorithm that identifies and corrects errors in real-world nanopore sequencing data and improves the accuracy of de novo genome assembly with increasing coverage depth. The approach relies on modeling the possible sources of uncertainty that occur as DNA advances through the nanopore and then using this model to find the sequence that best explains multiple reads of the same region of DNA. PoreSeq increases nanopore sequencing read accuracy of M13 bacteriophage DNA from 85% to 99% at 100X coverage. We also use the algorithm to assemble E. coli with 30X coverage and the lambda genome at a range of coverages from 3X to 50X. Additionally, we classify sequence variants at an order of magnitude lower coverage than is possible with existing methods. This thesis also reports preliminary progress towards controlling the motion of DNA using two nanopores instead of one. The speed at which the DNA travels through the nanopore needs to be carefully controlled to facilitate the detection of individual bases. A second nanopore in close proximity to the first could be used to slow or stop the motion of the DNA in order to enable a more accurate readout. The fabrication process for a new pyramidal nanopore geometry was developed in order to facilitate the positioning of the nanopores. This thesis demonstrates that two of them can be placed close enough to interact with a single molecule of DNA, which is a prerequisite for being able to use the driving force of the pores to exert fine control over the motion of the DNA. Another strategy for reading the DNA is to trap it completely with one pore and to move the second nanopore instead. To that end, this thesis also shows that a single strand of immobilized DNA can be captured in a scanning nanopore and examined for a full hour, with data from many scans at many different voltages obtained in order to detect a bound protein placed partway along the molecule.
Isoform Sequencing and State-of-Art Applications for Unravelling Complexity of Plant Transcriptomes

PubMed Central

An, Dong; Li, Changsheng; Humbeck, Klaus

2018-01-01

Single-molecule real-time (SMRT) sequencing developed by PacBio, also called third-generation sequencing (TGS), offers longer reads than the second-generation sequencing (SGS). Given its ability to obtain full-length transcripts without assembly, isoform sequencing (Iso-Seq) of transcriptomes by PacBio is advantageous for genome annotation, identification of novel genes and isoforms, as well as the discovery of long non-coding RNA (lncRNA). In addition, Iso-Seq gives access to the direct detection of alternative splicing, alternative polyadenylation (APA), gene fusion, and DNA modifications. Such applications of Iso-Seq facilitate the understanding of gene structure, post-transcriptional regulatory networks, and subsequently proteomic diversity. In this review, we summarize its applications in plant transcriptome study, specifically pointing out challenges associated with each step in the experimental design and highlight the development of bioinformatic pipelines. We aim to provide the community with an integrative overview and a comprehensive guidance to Iso-Seq, and thus to promote its applications in plant research. PMID:29346292
Dwell-Time Distribution, Long Pausing and Arrest of Single-Ribosome Translation through the mRNA Duplex.

PubMed

Xie, Ping

2015-10-09

Proteins in the cell are synthesized by a ribosome translating the genetic information encoded on the single-stranded messenger RNA (mRNA). It has been shown that the ribosome can also translate through the duplex region of the mRNA by unwinding the duplex. Here, based on our proposed model of the ribosome translation through the mRNA duplex we study theoretically the distribution of dwell times of the ribosome translation through the mRNA duplex under the effect of a pulling force externally applied to the ends of the mRNA to unzip the duplex. We provide quantitative explanations of the available single molecule experimental data on the distribution of dwell times with both short and long durations, on rescuing of the long paused ribosomes by raising the pulling force to unzip the duplex, on translational arrests induced by the mRNA duplex and Shine-Dalgarno(SD)-like sequence in the mRNA. The functional consequences of the pauses or arrests caused by the mRNA duplex and the SD sequence are discussed and compared with those obtained from other types of pausing, such as those induced by "hungry" codons or interactions of specific sequences in the nascent chain with the ribosomal exit tunnel.
Dwell-Time Distribution, Long Pausing and Arrest of Single-Ribosome Translation through the mRNA Duplex

PubMed Central

Xie, Ping

2015-01-01

Proteins in the cell are synthesized by a ribosome translating the genetic information encoded on the single-stranded messenger RNA (mRNA). It has been shown that the ribosome can also translate through the duplex region of the mRNA by unwinding the duplex. Here, based on our proposed model of the ribosome translation through the mRNA duplex we study theoretically the distribution of dwell times of the ribosome translation through the mRNA duplex under the effect of a pulling force externally applied to the ends of the mRNA to unzip the duplex. We provide quantitative explanations of the available single molecule experimental data on the distribution of dwell times with both short and long durations, on rescuing of the long paused ribosomes by raising the pulling force to unzip the duplex, on translational arrests induced by the mRNA duplex and Shine-Dalgarno(SD)-like sequence in the mRNA. The functional consequences of the pauses or arrests caused by the mRNA duplex and the SD sequence are discussed and compared with those obtained from other types of pausing, such as those induced by “hungry” codons or interactions of specific sequences in the nascent chain with the ribosomal exit tunnel. PMID:26473825
Phylogenomic relationship of feijoa (Acca sellowiana (O.Berg) Burret) with other Myrtaceae based on complete chloroplast genome sequences.

PubMed

Machado, Lilian de Oliveira; Vieira, Leila do Nascimento; Stefenon, Valdir Marcos; Oliveira Pedrosa, Fábio de; Souza, Emanuel Maltempi de; Guerra, Miguel Pedro; Nodari, Rubens Onofre

2017-04-01

Given their distribution, importance, and richness, Myrtaceae species comprise a model system for studying the evolution of tropical plant diversity. In addition, chloroplast (cp) genome sequencing is an efficient tool for phylogenetic relationship studies. Feijoa [Acca sellowiana (O. Berg) Burret; CN: pineapple-guava] is a Myrtaceae species that occurs naturally in southern Brazil and northern Uruguay. Feijoa is known for its exquisite perfume and flavorful fruits, pharmacological properties, ornamental value and increasing economic relevance. In the present work, we reported the complete cp genome of feijoa. The feijoa cp genome is a circular molecule of 159,370 bp with a quadripartite structure containing two single copy regions, a Large Single Copy region (LSC 88,028 bp) and a Small Single Copy region (SSC 18,598 bp) separated by Inverted Repeat regions (IRs 26,372 bp). The genome structure, gene order, GC content and codon usage are similar to those of typical angiosperm cp genomes. When compared to other cp genome sequences of Myrtaceae, feijoa showed closest relationship with pitanga (Eugenia uniflora L.). Furthermore, a comparison of pitanga synonymous (Ks) and nonsynonymous (Ka) substitution rates revealed extremely low values. Maximum Likelihood and Bayesian Inference analyses produced phylogenomic trees identical in topology. These trees supported monophyly of three Myrtoideae clades.
Genome scaffolding and annotation for the pathogen vector Ixodes ricinus by ultra-long single molecule sequencing.

PubMed

Cramaro, Wibke J; Hunewald, Oliver E; Bell-Sakyi, Lesley; Muller, Claude P

2017-02-08

Global warming and other ecological changes have facilitated the expansion of Ixodes ricinus tick populations. Ixodes ricinus is the most important carrier of vector-borne pathogens in Europe, transmitting viruses, protozoa and bacteria, in particular Borrelia burgdorferi (sensu lato), the causative agent of Lyme borreliosis, the most prevalent vector-borne disease in humans in the Northern hemisphere. To faster control this disease vector, a better understanding of the I. ricinus tick is necessary. To facilitate such studies, we recently published the first reference genome of this highly prevalent pathogen vector. Here, we further extend these studies by scaffolding and annotating the first reference genome by using ultra-long sequencing reads from third generation single molecule sequencing. In addition, we present the first genome size estimation for I. ricinus ticks and the embryo-derived cell line IRE/CTVM19. 235,953 contigs were integrated into 204,904 scaffolds, extending the currently known genome lengths by more than 30% from 393 to 516 Mb and the N50 contig value by 87% from 1643 bp to a N50 scaffold value of 3067 bp. In addition, 25,263 sequences were annotated by comparison to the tick's North American relative Ixodes scapularis. After (conserved) hypothetical proteins, zinc finger proteins, secreted proteins and P450 coding proteins were the most prevalent protein categories annotated. Interestingly, more than 50% of the amino acid sequences matching the homology threshold had 95-100% identity to the corresponding I. scapularis gene models. The sequence information was complemented by the first genome size estimation for this species. Flow cytometry-based genome size analysis revealed a haploid genome size of 2.65Gb for I. ricinus ticks and 3.80 Gb for the cell line. We present a first draft sequence map of the I. ricinus genome based on a PacBio-Illumina assembly. The I. ricinus genome was shown to be 26% (500 Mb) larger than the genome of its American relative I. scapularis. Based on the genome size of 2.65 Gb we estimated that we covered about 67% of the non-repetitive sequences. Genome annotation will facilitate screening for specific molecular pathways in I. ricinus cells and provides an overview of characteristics and functions.
Origin of noncoding DNA sequences: molecular fossils of genome evolution

DOE Office of Scientific and Technical Information (OSTI.GOV)

Naora, H.; Miyahara, K.; Curnow, R.N.

The total amount of noncoding sequences on chromosomes of contemporary organisms varies significantly from species to species. The authors propose a hypothesis for the origin of these noncoding sequences that assumes that (i) an approx. 0.55-kilobase (kb)-long reading frame composed the primordial gene and (ii) a 20-kb-long single-stranded polynucleotide is the longest molecule (as a genome) that was polymerized at random and without a specific template in the primordial soup/cell. The statistical distribution of stop codons allows examination of the probability of generating reading frames of approx. 0.55 kb in this primordial polynucleotide. This analysis reveals that with three stopmore » codons, a run of at least 0.55-kb equivalent length of nonstop codons would occur in 4.6% of 20-kb-long polynucleotide molecules. They attempt to estimate the total amount of noncoding sequences that would be present on the chromosomes of contemporary species assuming that present-day chromosomes retain the prototype primordial genome structure. Theoretical estimates thus obtained for most eukaryotes do not differ significantly from those reported for these specific organisms, with only a few exceptions. Furthermore, analysis of possible stop-codon distributions suggests that life on earth would not exist, at least in its present form, had two or four stop codons been selected early in evolution.« less
Structure and specificity of the RNA-guided endonuclease Cas9 during DNA interrogation, target binding and cleavage

PubMed Central

Josephs, Eric A.; Kocak, D. Dewran; Fitzgibbon, Christopher J.; McMenemy, Joshua; Gersbach, Charles A.; Marszalek, Piotr E.

2015-01-01

CRISPR-associated endonuclease Cas9 cuts DNA at variable target sites designated by a Cas9-bound RNA molecule. Cas9's ability to be directed by single ‘guide RNA’ molecules to target nearly any sequence has been recently exploited for a number of emerging biological and medical applications. Therefore, understanding the nature of Cas9's off-target activity is of paramount importance for its practical use. Using atomic force microscopy (AFM), we directly resolve individual Cas9 and nuclease-inactive dCas9 proteins as they bind along engineered DNA substrates. High-resolution imaging allows us to determine their relative propensities to bind with different guide RNA variants to targeted or off-target sequences. Mapping the structural properties of Cas9 and dCas9 to their respective binding sites reveals a progressive conformational transformation at DNA sites with increasing sequence similarity to its target. With kinetic Monte Carlo (KMC) simulations, these results provide evidence of a ‘conformational gating’ mechanism driven by the interactions between the guide RNA and the 14th–17th nucleotide region of the targeted DNA, the stabilities of which we find correlate significantly with reported off-target cleavage rates. KMC simulations also reveal potential methodologies to engineer guide RNA sequences with improved specificity by considering the invasion of guide RNAs into targeted DNA duplex. PMID:26384421
Evaluation of the Kinetic Property of Single-Molecule Junctions by Tunneling Current Measurements.

PubMed

Harashima, Takanori; Hasegawa, Yusuke; Kiguchi, Manabu; Nishino, Tomoaki

2018-01-01

We investigated the formation and breaking of single-molecule junctions of two kinds of dithiol molecules by time-resolved tunneling current measurements in a metal nanogap. The resulting current trajectory was statistically analyzed to determine the single-molecule conductance and, more importantly, to reveal the kinetic property of the single-molecular junction. These results suggested that combining a measurement of the single-molecule conductance and statistical analysis is a promising method to uncover the kinetic properties of the single-molecule junction.
Comparative Genomic Analysis of Two Clonally Related Multidrug Resistant Mycobacterium tuberculosis by Single Molecule Real Time Sequencing.

PubMed

Leung, Kenneth Siu-Sing; Siu, Gilman Kit-Hang; Tam, Kingsley King-Gee; To, Sabrina Wai-Chi; Rajwani, Rahim; Ho, Pak-Leung; Wong, Samson Sai-Yin; Zhao, Wei W; Ma, Oliver Chiu-Kit; Yam, Wing-Cheong

2017-01-01

Background: Multidrug-resistant tuberculosis (MDR-TB) is posing a major threat to global TB control. In this study, we focused on two consecutive MDR-TB isolated from the same patient before and after the initiation of anti-TB treatment. To better understand the genomic characteristics of MDR-TB, Single Molecule Real-Time (SMRT) Sequencing and comparative genomic analyses was performed to identify mutations that contributed to the stepwise development of drug resistance and growth fitness in MDR-TB under in vivo challenge of anti-TB drugs. Result: Both pre-treatment and post-treatment strain demonstrated concordant phenotypic and genotypic susceptibility profiles toward rifampicin, pyrazinamide, streptomycin, fluoroquinolones, aminoglycosides, cycloserine, ethionamide, and para-aminosalicylic acid. However, although both strains carried identical missense mutations at rpoB S531L, inhA C-15T, and embB M306V, MYCOTB Sensititre assay showed that the post-treatment strain had 16-, 8-, and 4-fold elevation in the minimum inhibitory concentrations (MICs) toward rifabutin, isoniazid, and ethambutol respectively. The results have indicated the presence of additional resistant-related mutations governing the stepwise development of MDR-TB. Further comparative genomic analyses have identified three additional polymorphisms between the clinical isolates. These include a single nucleotide deletion at nucleotide position 360 of rv0888 in pre-treatment strain, and a missense mutation at rv3303c ( lpdA) V44I and a 6-bp inframe deletion at codon 67-68 in rv2071c ( cobM) in the post-treatment strain. Multiple sequence alignment showed that these mutations were occurring at highly conserved regions among pathogenic mycobacteria. Using structural-based and sequence-based algorithms, we further predicted that the mutations potentially have deleterious effect on protein function. Conclusion: This is the first study that compared the full genomes of two clonally-related MDR-TB clinical isolates during the course of anti-TB treatment. Our work has demonstrated the robustness of SMRT Sequencing in identifying mutations among MDR-TB clinical isolates. Comparative genome analysis also suggested novel mutations at rv0888, lpdA , and cobM that might explain the difference in antibiotic resistance and growth pattern between the two MDR-TB strains.
Rational design of DNA sequences for nanotechnology, microarrays and molecular computers using Eulerian graphs.

PubMed

Pancoska, Petr; Moravek, Zdenek; Moll, Ute M

2004-01-01

Nucleic acids are molecules of choice for both established and emerging nanoscale technologies. These technologies benefit from large functional densities of 'DNA processing elements' that can be readily manufactured. To achieve the desired functionality, polynucleotide sequences are currently designed by a process that involves tedious and laborious filtering of potential candidates against a series of requirements and parameters. Here, we present a complete novel methodology for the rapid rational design of large sets of DNA sequences. This method allows for the direct implementation of very complex and detailed requirements for the generated sequences, thus avoiding 'brute force' filtering. At the same time, these sequences have narrow distributions of melting temperatures. The molecular part of the design process can be done without computer assistance, using an efficient 'human engineering' approach by drawing a single blueprint graph that represents all generated sequences. Moreover, the method eliminates the necessity for extensive thermodynamic calculations. Melting temperature can be calculated only once (or not at all). In addition, the isostability of the sequences is independent of the selection of a particular set of thermodynamic parameters. Applications are presented for DNA sequence designs for microarrays, universal microarray zip sequences and electron transfer experiments.
Microdissection and molecular manipulation of single chromosomes in woody fruit trees with small chromosomes using pomelo (Citrus grandis) as a model. II. Cloning of resistance gene analogs from single chromosomes.

PubMed

Huang, D; Wu, W; Lu, L

2004-05-01

Amplification of resistance gene analogs (RGAs) is both a useful method for acquiring DNA markers closely linked to disease resistance (R) genes and a potential approach for the rapid cloning of R genes in plants. However, the screening of target sequences from among the numerous amplified RGAs can be very laborious. The amplification of RGAs from specific chromosomes could greatly reduce the number of RGAs to be screened and, consequently, speed up the identification of target RGAs. We have developed two methods for amplifying RGAs from single chromosomes. Method 1 uses products of Sau3A linker adaptor-mediated PCR (LAM-PCR) from a single chromosome as the templates for RGA amplification, while Method 2 directly uses a single chromosomal DNA molecule as the template. Using a pair of degenerate primers designed on the basis of the conserved nucleotide-binding-site motifs in many R genes, RGAs were successfully amplified from single chromosomes of pomelo using both these methods. Sequencing and cluster analysis of RGA clones obtained from single chromosomes revealed the number, type and organization of R-gene clusters on the chromosomes. We suggest that Method 1 is suitable for analyzing chromosomes that are unidentifiable under a microscope, while Method 2 is more appropriate when chromosomes can be clearly identified.

Controlled chain polymerisation and chemical soldering for single-molecule electronics.

PubMed

Okawa, Yuji; Akai-Kasaya, Megumi; Kuwahara, Yuji; Mandal, Swapan K; Aono, Masakazu

2012-05-21

Single functional molecules offer great potential for the development of novel nanoelectronic devices with capabilities beyond today's silicon-based devices. To realise single-molecule electronics, the development of a viable method for connecting functional molecules to each other using single conductive polymer chains is required. The method of initiating chain polymerisation using the tip of a scanning tunnelling microscope (STM) is very useful for fabricating single conductive polymer chains at designated positions and thereby wiring single molecules. In this feature article, developments in the controlled chain polymerisation of diacetylene compounds and the properties of polydiacetylene chains are summarised. Recent studies of "chemical soldering", a technique enabling the covalent connection of single polydiacetylene chains to single functional molecules, are also introduced. This represents a key step in advancing the development of single-molecule electronics.
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

DOEpatents

Gardner, Shea N [San Leandro, CA; Mariella, Jr., Raymond P.; Christian, Allen T [Tracy, CA; Young, Jennifer A [Berkeley, CA; Clague, David S [Livermore, CA

2011-01-18

A method of fabricating a DNA molecule of user-defined sequence. The method comprises the steps of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an even or odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths. In one embodiment starting sequence fragments are of different lengths, n, n+1, n+2, etc.
RNAiFold 2.0: a web server and software to design custom and Rfam-based RNA molecules.

PubMed

Garcia-Martin, Juan Antonio; Dotu, Ivan; Clote, Peter

2015-07-01

Several algorithms for RNA inverse folding have been used to design synthetic riboswitches, ribozymes and thermoswitches, whose activity has been experimentally validated. The RNAiFold software is unique among approaches for inverse folding in that (exhaustive) constraint programming is used instead of heuristic methods. For that reason, RNAiFold can generate all sequences that fold into the target structure or determine that there is no solution. RNAiFold 2.0 is a complete overhaul of RNAiFold 1.0, rewritten from the now defunct COMET language to C++. The new code properly extends the capabilities of its predecessor by providing a user-friendly pipeline to design synthetic constructs having the functionality of given Rfam families. In addition, the new software supports amino acid constraints, even for proteins translated in different reading frames from overlapping coding sequences; moreover, structure compatibility/incompatibility constraints have been expanded. With these features, RNAiFold 2.0 allows the user to design single RNA molecules as well as hybridization complexes of two RNA molecules. the web server, source code and linux binaries are publicly accessible at http://bioinformatics.bc.edu/clotelab/RNAiFold2.0. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Nothing in Evolution Makes Sense Except in the Light of Genomics: Read-Write Genome Evolution as an Active Biological Process.

PubMed

Shapiro, James A

2016-06-08

The 21st century genomics-based analysis of evolutionary variation reveals a number of novel features impossible to predict when Dobzhansky and other evolutionary biologists formulated the neo-Darwinian Modern Synthesis in the middle of the last century. These include three distinct realms of cell evolution; symbiogenetic fusions forming eukaryotic cells with multiple genome compartments; horizontal organelle, virus and DNA transfers; functional organization of proteins as systems of interacting domains subject to rapid evolution by exon shuffling and exonization; distributed genome networks integrated by mobile repetitive regulatory signals; and regulation of multicellular development by non-coding lncRNAs containing repetitive sequence components. Rather than single gene traits, all phenotypes involve coordinated activity by multiple interacting cell molecules. Genomes contain abundant and functional repetitive components in addition to the unique coding sequences envisaged in the early days of molecular biology. Combinatorial coding, plus the biochemical abilities cells possess to rearrange DNA molecules, constitute a powerful toolbox for adaptive genome rewriting. That is, cells possess "Read-Write Genomes" they alter by numerous biochemical processes capable of rapidly restructuring cellular DNA molecules. Rather than viewing genome evolution as a series of accidental modifications, we can now study it as a complex biological process of active self-modification.
Nothing in Evolution Makes Sense Except in the Light of Genomics: Read–Write Genome Evolution as an Active Biological Process

PubMed Central

Shapiro, James A.

2016-01-01

The 21st century genomics-based analysis of evolutionary variation reveals a number of novel features impossible to predict when Dobzhansky and other evolutionary biologists formulated the neo-Darwinian Modern Synthesis in the middle of the last century. These include three distinct realms of cell evolution; symbiogenetic fusions forming eukaryotic cells with multiple genome compartments; horizontal organelle, virus and DNA transfers; functional organization of proteins as systems of interacting domains subject to rapid evolution by exon shuffling and exonization; distributed genome networks integrated by mobile repetitive regulatory signals; and regulation of multicellular development by non-coding lncRNAs containing repetitive sequence components. Rather than single gene traits, all phenotypes involve coordinated activity by multiple interacting cell molecules. Genomes contain abundant and functional repetitive components in addition to the unique coding sequences envisaged in the early days of molecular biology. Combinatorial coding, plus the biochemical abilities cells possess to rearrange DNA molecules, constitute a powerful toolbox for adaptive genome rewriting. That is, cells possess “Read–Write Genomes” they alter by numerous biochemical processes capable of rapidly restructuring cellular DNA molecules. Rather than viewing genome evolution as a series of accidental modifications, we can now study it as a complex biological process of active self-modification. PMID:27338490
Underwound DNA under Tension: Structure, Elasticity, and Sequence-Dependent Behaviors

NASA Astrophysics Data System (ADS)

Sheinin, Maxim Y.; Forth, Scott; Marko, John F.; Wang, Michelle D.

2011-09-01

DNA melting under torsion plays an important role in a wide variety of cellular processes. In the present Letter, we have investigated DNA melting at the single-molecule level using an angular optical trap. By directly measuring force, extension, torque, and angle of DNA, we determined the structural and elastic parameters of torsionally melted DNA. Our data reveal that under moderate forces, the melted DNA assumes a left-handed structure as opposed to an open bubble conformation and is highly torsionally compliant. We have also discovered that at low forces melted DNA properties are highly dependent on DNA sequence. These results provide a more comprehensive picture of the global DNA force-torque phase diagram.
Computer simulation of gene detection without PCR by single molecule detection

NASA Astrophysics Data System (ADS)

Davis, Lloyd M.; Williams, John G.; Lamb, Don T.

1999-01-01

Pioneer Hi-Bred is developing a low-cost method for rapid screening of DNA, for use in research on elite crop seed genetics. Unamplified genomic DNA with the requisite base sequence is simultaneously labeled by two different colored fluorescent probes, which hybridize near the selected gene. Dual-channel single molecule detection (SMD) within a flow cell, then provides a sensitive and specific assay for the gene. The technique has been demonstrated using frequency- doubled Nd:YAG laser excitation of two visible-wavelength dyes. A prototype instrument employing infrared fluorophores and laser diodes for excitation has been developed. Here, we report results from a Monte Carlo simulation of the new instrument, in which experimentally determined photophysical parameters for candidate infrared dyes are used for parametric studies of experimental operating conditions. Fluorophore photostability is found to be a key factor in determining the instrument sensitivity. Most infrared dyes have poor photostability, resulting in inefficient SMD. However, the normalized cross-correlation function of the photon signals from each of the two channels can still yield a discernable peak, provided that the concentration of dual- labeled molecules is sufficiently high. Further, for low concentrations, processing of the two photon streams with Gaussian -weighted sliding sum digital filters and selection of simultaneously occurring peaks can also provide a sensitive indicator of the presence of dual-labeled molecules, although accidental coincidences must be considered in the interpretation of results.
Single-molecule fluorescence measurements reveal the reaction mechanisms of the core RISC, composed of human Argonaute 2 and a guide RNA.

PubMed

Jo, Myung Hyun; Song, Ji-Joon; Hohng, Sungchul

2015-12-01

In eukaryotes, small RNAs play important roles in both gene regulation and resistance to viral infection. Argonaute proteins have been identified as a key component of the effector complexes of various RNA-silencing pathways, but the mechanistic roles of Argonaute proteins in these pathways are not clearly understood. To address this question, we performed single-molecule fluorescence experiments using an RNA-induced silencing complex (core-RISC) composed of a small RNA and human Argonaute 2. We found that target binding of core-RISC starts at the seed region of the guide RNA. After target binding, four distinct reactions followed: target cleavage, transient binding, stable binding, and Argonaute unloading. Target cleavage required extensive sequence complementarity and accelerated core-RISC dissociation for recycling. In contrast, the stable binding of core-RISC to target RNAs required seed-match only, suggesting a potential explanation for the seed-match rule of microRNA (miRNA) target selection.
Single molecule study of the intrinsically disordered FG-repeat nucleoporin 153.

PubMed

Milles, Sigrid; Lemke, Edward A

2011-10-05

Nucleoporins (Nups), which are intrinsically disordered, form a selectivity filter inside the nuclear pore complex, taking a central role in the vital nucleocytoplasmic transport mechanism. These Nups display a complex and nonrandom amino-acid architecture of phenylalanine glycine (FG)-repeat clusters and intra-FG linkers. How such heterogeneous sequence composition relates to function and could give rise to a transport mechanism is still unclear. Here we describe a combined chemical biology and single-molecule fluorescence approach to study the large human Nup153 FG-domain. In order to obtain insights into the properties of this domain beyond the average behavior, we probed the end-to-end distance (R(E)) of several ∼50-residues long FG-repeat clusters in the context of the whole protein domain. Despite the sequence heterogeneity of these FG-clusters, we detected a reoccurring and consistent compaction from a relaxed coil behavior under denaturing conditions (R(E)/R(E,RC) = 0.99 ± 0.15 with R(E,RC) corresponding to ideal relaxed coil behavior) to a collapsed state under native conditions (R(E)/R(E,RC) = 0.79 ± 0.09). We then analyzed the properties of this protein on the supramolecular level, and determined that this human FG-domain was in fact able to form a hydrogel with physiological permeability barrier properties. Copyright © 2011 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Yeast prion architecture explains how proteins can be genes

NASA Astrophysics Data System (ADS)

Wickner, Reed

2013-03-01

Prions (infectious proteins) transmit information without an accompanying DNA or RNA. Most yeast prions are self-propagating amyloids that inactivate a normally functional protein. A single protein can become any of several prion variants, with different manifestations due to different amyloid structures. We showed that the yeast prion amyloids of Ure2p, Sup35p and Rnq1p are folded in-register parallel beta sheets using solid state NMR dipolar recoupling experiments, mass-per-filament-length measurements, and filament diameter measurements. The extent of beta sheet structure, measured by chemical shifts in solid-state NMR and acquired protease-resistance on amyloid formation, combined with the measured filament diameters, imply that the beta sheets must be folded along the long axis of the filament. We speculate that prion variants of a single protein sequence differ in the location of these folds. Favorable interactions between identical side chains must hold these structures in-register. The same interactions must guide an unstructured monomer joining the end of a filament to assume the same conformation as molecules already in the filament, with the turns at the same locations. In this way, a protein can template its own conformation, in analogy to the ability of a DNA molecule to template its sequence by specific base-pairing. Bldg. 8, Room 225, NIH, 8 Center Drive MSC 0830, Bethesda, MD 20892-0830, wickner@helix.nih.gov, 301-496-3452
Comprehensive profiling and quantitation of oncogenic mutations in non small-cell lung carcinoma using single molecule amplification and re-sequencing technology

PubMed Central

Jiang, Hong; Wang, Limin; Xu, Rujun; Shi, Yanbin; Zhang, Jianguang; Xu, Mengnan; Cram, David S.; Ma, Shenglin

2016-01-01

Activating and resistance mutations in the tyrosine kinase domain of several oncogenes are frequently associated with non-small cell lung carcinoma (NSCLC). In this study we assessed the frequency, type and abundance of EGFR, KRAS, BRAF, TP53 and ALK mutations in tumour specimens from 184 patients with early and late stage disease using single molecule amplification and re-sequencing technology (SMART). Based on modelling of EGFR mutations, the detection sensitivity of the SMART assay was at least 0.1%. Benchmarking EGFR mutation detection against the gold standard ARMS-PCR assay, SMART assay had a sensitivity and specificity of 98.7% and 99.0%. Amongst the 184 samples, EGFR mutations were the most prevalent (59.9%), followed by KRAS (16.9%), TP53 (12.7%), EML4-ALK fusions (6.3%) and BRAF (4.2%) mutations. The abundance and types of mutations in tumour specimens were extremely heterogeneous, involving either monoclonal (51.6%) or polyclonal (12.6%) mutation events. At the clinical level, although the spectrum of tumour mutation(s) was unique to each patient, the overall patterns in early or advanced stage disease were relatively similar. Based on these findings, we propose that personalized profiling and quantitation of clinically significant oncogenic mutations will allow better classification of patients according to tumour characteristics and provide clinicians with important ancillary information for treatment decision-making. PMID:27409166
Comprehensive profiling and quantitation of oncogenic mutations in non small-cell lung carcinoma using single molecule amplification and re-sequencing technology.

PubMed

Zhang, Shirong; Xia, Bing; Jiang, Hong; Wang, Limin; Xu, Rujun; Shi, Yanbin; Zhang, Jianguang; Xu, Mengnan; Cram, David S; Ma, Shenglin

2016-08-02

Activating and resistance mutations in the tyrosine kinase domain of several oncogenes are frequently associated with non-small cell lung carcinoma (NSCLC). In this study we assessed the frequency, type and abundance of EGFR, KRAS, BRAF, TP53 and ALK mutations in tumour specimens from 184 patients with early and late stage disease using single molecule amplification and re-sequencing technology (SMART). Based on modelling of EGFR mutations, the detection sensitivity of the SMART assay was at least 0.1%. Benchmarking EGFR mutation detection against the gold standard ARMS-PCR assay, SMART assay had a sensitivity and specificity of 98.7% and 99.0%. Amongst the 184 samples, EGFR mutations were the most prevalent (59.9%), followed by KRAS (16.9%), TP53 (12.7%), EML4-ALK fusions (6.3%) and BRAF (4.2%) mutations. The abundance and types of mutations in tumour specimens were extremely heterogeneous, involving either monoclonal (51.6%) or polyclonal (12.6%) mutation events. At the clinical level, although the spectrum of tumour mutation(s) was unique to each patient, the overall patterns in early or advanced stage disease were relatively similar. Based on these findings, we propose that personalized profiling and quantitation of clinically significant oncogenic mutations will allow better classification of patients according to tumour characteristics and provide clinicians with important ancillary information for treatment decision-making.
Chloroplast genome of Aconitum barbatum var. puberulum (Ranunculaceae) derived from CCS reads using the PacBio RS platform.

PubMed

Chen, Xiaochen; Li, Qiushi; Li, Ying; Qian, Jun; Han, Jianping

2015-01-01

The chloroplast genome (cp genome) of Aconitum barbatum var. puberulum was sequenced using the third-generation sequencing platform based on the single-molecule real-time (SMRT) sequencing approach. To our knowledge, this is the first reported complete cp genome of Aconitum, and we anticipate that it will have great value for phylogenetic studies of the Ranunculaceae family. In total, 23,498 CCS reads and 20,685,462 base pairs were generated, the mean read length was 880 bp, and the longest read was 2,261 bp. Genome coverage of 100% was achieved with a mean coverage of 132× and no gaps. The accuracy of the assembled genome is 99.973%; the assembly was validated using Sanger sequencing of six selected genes from the cp genome. The complete cp genome of A. barbatum var. puberulum is 156,749 bp in length, including a large single-copy region of 87,630 bp and a small single-copy region of 16,941 bp separated by two inverted repeats of 26,089 bp. The cp genome contains 130 genes, including 84 protein-coding genes, 34 tRNA genes and eight rRNA genes. Four forward, five inverted and eight tandem repeats were identified. According to the SSR analysis, the longest poly structure is a 20-T repeat. Our results presented in this paper will facilitate the phylogenetic studies and molecular authentication on Aconitum.
Chloroplast genome of Aconitum barbatum var. puberulum (Ranunculaceae) derived from CCS reads using the PacBio RS platform

PubMed Central

Chen, Xiaochen; Li, Qiushi; Li, Ying; Qian, Jun; Han, Jianping

2015-01-01

The chloroplast genome (cp genome) of Aconitum barbatum var. puberulum was sequenced using the third-generation sequencing platform based on the single-molecule real-time (SMRT) sequencing approach. To our knowledge, this is the first reported complete cp genome of Aconitum, and we anticipate that it will have great value for phylogenetic studies of the Ranunculaceae family. In total, 23,498 CCS reads and 20,685,462 base pairs were generated, the mean read length was 880 bp, and the longest read was 2,261 bp. Genome coverage of 100% was achieved with a mean coverage of 132× and no gaps. The accuracy of the assembled genome is 99.973%; the assembly was validated using Sanger sequencing of six selected genes from the cp genome. The complete cp genome of A. barbatum var. puberulum is 156,749 bp in length, including a large single-copy region of 87,630 bp and a small single-copy region of 16,941 bp separated by two inverted repeats of 26,089 bp. The cp genome contains 130 genes, including 84 protein-coding genes, 34 tRNA genes and eight rRNA genes. Four forward, five inverted and eight tandem repeats were identified. According to the SSR analysis, the longest poly structure is a 20-T repeat. Our results presented in this paper will facilitate the phylogenetic studies and molecular authentication on Aconitum. PMID:25705213
Isolation and sequence of partial cDNA clones of human L1: homology of human and rodent L1 in the cytoplasmic region.

PubMed

Harper, J R; Prince, J T; Healy, P A; Stuart, J K; Nauman, S J; Stallcup, W B

1991-03-01

We have isolated cDNA clones coding for the human homologue of the neuronal cell adhesion molecule L1. The nucleotide sequence of the cDNA clones and the deduced primary amino acid sequence of the carboxy terminal portion of the human L1 are homologous to the corresponding sequences of mouse L1 and rat NILE glycoprotein, with an especially high sequences identity in the cytoplasmic regions of the proteins. There is also protein sequence homology with the cytoplasmic region of the Drosophila cell adhesion molecule, neuroglian. The conservation of the cytoplasmic domain argues for an important functional role for this portion of the molecule.
Next generation sequencing and its applications in forensic genetics.

PubMed

Børsting, Claus; Morling, Niels

2015-09-01

It has been almost a decade since the first next generation sequencing (NGS) technologies emerged and quickly changed the way genetic research is conducted. Today, full genomes are mapped and published almost weekly and with ever increasing speed and decreasing costs. NGS methods and platforms have matured during the last 10 years, and the quality of the sequences has reached a level where NGS is used in clinical diagnostics of humans. Forensic genetic laboratories have also explored NGS technologies and especially in the last year, there has been a small explosion in the number of scientific articles and presentations at conferences with forensic aspects of NGS. These contributions have demonstrated that NGS offers new possibilities for forensic genetic case work. More information may be obtained from unique samples in a single experiment by analyzing combinations of markers (STRs, SNPs, insertion/deletions, mRNA) that cannot be analyzed simultaneously with the standard PCR-CE methods used today. The true variation in core forensic STR loci has been uncovered, and previously unknown STR alleles have been discovered. The detailed sequence information may aid mixture interpretation and will increase the statistical weight of the evidence. In this review, we will give an introduction to NGS and single-molecule sequencing, and we will discuss the possible applications of NGS in forensic genetics. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Quantitative analysis and prediction of G-quadruplex forming sequences in double-stranded DNA

PubMed Central

Kim, Minji; Kreig, Alex; Lee, Chun-Ying; Rube, H. Tomas; Calvert, Jacob; Song, Jun S.; Myong, Sua

2016-01-01

Abstract G-quadruplex (GQ) is a four-stranded DNA structure that can be formed in guanine-rich sequences. GQ structures have been proposed to regulate diverse biological processes including transcription, replication, translation and telomere maintenance. Recent studies have demonstrated the existence of GQ DNA in live mammalian cells and a significant number of potential GQ forming sequences in the human genome. We present a systematic and quantitative analysis of GQ folding propensity on a large set of 438 GQ forming sequences in double-stranded DNA by integrating fluorescence measurement, single-molecule imaging and computational modeling. We find that short minimum loop length and the thymine base are two main factors that lead to high GQ folding propensity. Linear and Gaussian process regression models further validate that the GQ folding potential can be predicted with high accuracy based on the loop length distribution and the nucleotide content of the loop sequences. Our study provides important new parameters that can inform the evaluation and classification of putative GQ sequences in the human genome. PMID:27095201
Primer-independent RNA sequencing with bacteriophage phi6 RNA polymerase and chain terminators.

PubMed

Makeyev, E V; Bamford, D H

2001-05-01

Here we propose a new general method for directly determining RNA sequence based on the use of the RNA-dependent RNA polymerase from bacteriophage phi6 and the chain terminators (RdRP sequencing). The following properties of the polymerase render it appropriate for this application: (1) the phi6 polymerase can replicate a number of single-stranded RNA templates in vitro. (2) In contrast to the primer-dependent DNA polymerases utilized in the sequencing procedure by Sanger et al. (Proc Natl Acad Sci USA, 1977, 74:5463-5467), it initiates nascent strand synthesis without a primer, starting the polymerization on the very 3'-terminus of the template. (3) The polymerase can incorporate chain-terminating nucleotide analogs into the nascent RNA chain to produce a set of base-specific termination products. Consequently, 3' proximal or even complete sequence of many target RNA molecules can be rapidly deduced without prior sequence information. The new technique proved useful for sequencing several synthetic ssRNA templates. Furthermore, using genomic segments of the bluetongue virus we show that RdRP sequencing can also be applied to naturally occurring dsRNA templates. This suggests possible uses of the method in the RNA virus research and diagnostics.
Electrons, Photons, and Force: Quantitative Single-Molecule Measurements from Physics to Biology

PubMed Central

2011-01-01

Single-molecule measurement techniques have illuminated unprecedented details of chemical behavior, including observations of the motion of a single molecule on a surface, and even the vibration of a single bond within a molecule. Such measurements are critical to our understanding of entities ranging from single atoms to the most complex protein assemblies. We provide an overview of the strikingly diverse classes of measurements that can be used to quantify single-molecule properties, including those of single macromolecules and single molecular assemblies, and discuss the quantitative insights they provide. Examples are drawn from across the single-molecule literature, ranging from ultrahigh vacuum scanning tunneling microscopy studies of adsorbate diffusion on surfaces to fluorescence studies of protein conformational changes in solution. PMID:21338175
Sources of PCR-induced distortions in high-throughput sequencing data sets

PubMed Central

Kebschull, Justus M.; Zador, Anthony M.

2015-01-01

PCR permits the exponential and sequence-specific amplification of DNA, even from minute starting quantities. PCR is a fundamental step in preparing DNA samples for high-throughput sequencing. However, there are errors associated with PCR-mediated amplification. Here we examine the effects of four important sources of error—bias, stochasticity, template switches and polymerase errors—on sequence representation in low-input next-generation sequencing libraries. We designed a pool of diverse PCR amplicons with a defined structure, and then used Illumina sequencing to search for signatures of each process. We further developed quantitative models for each process, and compared predictions of these models to our experimental data. We find that PCR stochasticity is the major force skewing sequence representation after amplification of a pool of unique DNA amplicons. Polymerase errors become very common in later cycles of PCR but have little impact on the overall sequence distribution as they are confined to small copy numbers. PCR template switches are rare and confined to low copy numbers. Our results provide a theoretical basis for removing distortions from high-throughput sequencing data. In addition, our findings on PCR stochasticity will have particular relevance to quantification of results from single cell sequencing, in which sequences are represented by only one or a few molecules. PMID:26187991

Single-molecule dynamics in nanofabricated traps

NASA Astrophysics Data System (ADS)

Cohen, Adam

2009-03-01

The Anti-Brownian Electrokinetic trap (ABEL trap) provides a means to immobilize a single fluorescent molecule in solution, without surface attachment chemistry. The ABEL trap works by tracking the Brownian motion of a single molecule, and applying feedback electric fields to induce an electrokinetic motion that approximately cancels the Brownian motion. We present a new design for the ABEL trap that allows smaller molecules to be trapped and more information to be extracted from the dynamics of a single molecule than was previously possible. In particular, we present strategies for extracting dynamically fluctuating mobilities and diffusion coefficients, as a means to probe dynamic changes in molecular charge and shape. If one trapped molecule is good, many trapped molecules are better. An array of single molecules in solution, each immobilized without surface attachment chemistry, provides an ideal test-bed for single-molecule analyses of intramolecular dynamics and intermolecular interactions. We present a technology for creating such an array, using a fused silica plate with nanofabricated dimples and a removable cover for sealing single molecules within the dimples. With this device one can watch the shape fluctuations of single molecules of DNA or study cooperative interactions in weakly associating protein complexes.
Specific and non-specific interactions of ParB with DNA: implications for chromosome segregation

PubMed Central

Taylor, James A.; Pastrana, Cesar L.; Butterer, Annika; Pernstich, Christian; Gwynn, Emma J.; Sobott, Frank; Moreno-Herrero, Fernando; Dillingham, Mark S.

2015-01-01

The segregation of many bacterial chromosomes is dependent on the interactions of ParB proteins with centromere-like DNA sequences called parS that are located close to the origin of replication. In this work, we have investigated the binding of Bacillus subtilis ParB to DNA in vitro using a variety of biochemical and biophysical techniques. We observe tight and specific binding of a ParB homodimer to the parS sequence. Binding of ParB to non-specific DNA is more complex and displays apparent positive co-operativity that is associated with the formation of larger, poorly defined, nucleoprotein complexes. Experiments with magnetic tweezers demonstrate that non-specific binding leads to DNA condensation that is reversible by protein unbinding or force. The condensed DNA structure is not well ordered and we infer that it is formed by many looping interactions between neighbouring DNA segments. Consistent with this view, ParB is also able to stabilize writhe in single supercoiled DNA molecules and to bridge segments from two different DNA molecules in trans. The experiments provide no evidence for the promotion of non-specific DNA binding and/or condensation events by the presence of parS sequences. The implications of these observations for chromosome segregation are discussed. PMID:25572315
Evidence of protein-free homology recognition in magnetic bead force–extension experiments

PubMed Central

(O’) Lee, D. J.; Danilowicz, C.; Rochester, C.; Prentiss, M.

2016-01-01

Earlier theoretical studies have proposed that the homology-dependent pairing of large tracts of dsDNA may be due to physical interactions between homologous regions. Such interactions could contribute to the sequence-dependent pairing of chromosome regions that may occur in the presence or the absence of double-strand breaks. Several experiments have indicated the recognition of homologous sequences in pure electrolytic solutions without proteins. Here, we report single-molecule force experiments with a designed 60 kb long dsDNA construct; one end attached to a solid surface and the other end to a magnetic bead. The 60 kb constructs contain two 10 kb long homologous tracts oriented head to head, so that their sequences match if the two tracts fold on each other. The distance between the bead and the surface is measured as a function of the force applied to the bead. At low forces, the construct molecules extend substantially less than normal, control dsDNA, indicating the existence of preferential interaction between the homologous regions. The force increase causes no abrupt but continuous unfolding of the paired homologous regions. Simple semi-phenomenological models of the unfolding mechanics are proposed, and their predictions are compared with the data. PMID:27493568
Topological Structure of the Space of Phenotypes: The Case of RNA Neutral Networks

PubMed Central

Aguirre, Jacobo; Buldú, Javier M.; Stich, Michael; Manrubia, Susanna C.

2011-01-01

The evolution and adaptation of molecular populations is constrained by the diversity accessible through mutational processes. RNA is a paradigmatic example of biopolymer where genotype (sequence) and phenotype (approximated by the secondary structure fold) are identified in a single molecule. The extreme redundancy of the genotype-phenotype map leads to large ensembles of RNA sequences that fold into the same secondary structure and can be connected through single-point mutations. These ensembles define neutral networks of phenotypes in sequence space. Here we analyze the topological properties of neutral networks formed by 12-nucleotides RNA sequences, obtained through the exhaustive folding of sequence space. A total of 412 sequences fragments into 645 subnetworks that correspond to 57 different secondary structures. The topological analysis reveals that each subnetwork is far from being random: it has a degree distribution with a well-defined average and a small dispersion, a high clustering coefficient, and an average shortest path between nodes close to its minimum possible value, i.e. the Hamming distance between sequences. RNA neutral networks are assortative due to the correlation in the composition of neighboring sequences, a feature that together with the symmetries inherent to the folding process explains the existence of communities. Several topological relationships can be analytically derived attending to structural restrictions and generic properties of the folding process. The average degree of these phenotypic networks grows logarithmically with their size, such that abundant phenotypes have the additional advantage of being more robust to mutations. This property prevents fragmentation of neutral networks and thus enhances the navigability of sequence space. In summary, RNA neutral networks show unique topological properties, unknown to other networks previously described. PMID:22028856
A graphene-based biosensing platform based on the release of DNA probes and rolling circle amplification.

PubMed

Liu, Meng; Song, Jinping; Shuang, Shaomin; Dong, Chuan; Brennan, John D; Li, Yingfu

2014-06-24

We report a versatile biosensing platform capable of achieving ultrasensitive detection of both small-molecule and macromolecular targets. The system features three components: reduced graphene oxide for its ability to adsorb single-stranded DNA molecules nonspecifically, DNA aptamers for their ability to bind reduced graphene oxide but undergo target-induced conformational changes that facilitate their release from the reduced graphene oxide surface, and rolling circle amplification (RCA) for its ability to amplify a primer-template recognition event into repetitive sequence units that can be easily detected. The key to the design is the tagging of a short primer to an aptamer sequence, which results in a small DNA probe that allows for both effective probe adsorption onto the reduced graphene oxide surface to mask the primer domain in the absence of the target, as well as efficient probe release in the presence of the target to make the primer available for template binding and RCA. We also made an observation that the circular template, which on its own does not cause a detectable level of probe release from the reduced graphene oxide, augments target-induced probe release. The synergistic release of DNA probes is interpreted to be a contributing factor for the high detection sensitivity. The broad utility of the platform is illustrated though engineering three different sensors that are capable of achieving ultrasensitive detection of a protein target, a DNA sequence and a small-molecule analyte. We envision that the approach described herein will find useful applications in the biological, medical, and environmental fields.
Reversible Aptamer-Au Plasmon Rulers for Secreted Single Molecules

DOE PAGES

Lee, Somin Eunice; Chen, Qian; Bhat, Ramray; ...

2015-06-03

Plasmon rulers, consisting of pairs of gold nanoparticles, allow single-molecule analysis without photobleaching or blinking; however, current plasmon rulers are irreversible, restricting detection to only single events. Here, we present a reversible plasmon ruler, comprised of coupled gold nanoparticles linked by a single aptamer, capable of binding individual secreted molecules with high specificity. We show that the binding of target secreted molecules to the reversible plasmon ruler is characterized by single-molecule sensitivity, high specificity, and reversibility. Lastly, such reversible plasmon rulers should enable dynamic and adaptive live-cell measurement of secreted single molecules in their local microenvironment.
Sequence-Mandated, Distinct Assembly of Giant Molecules

DOE PAGES

Zhang, Wei; Lu, Xinlin; Mao, Jialin; ...

2017-10-24

Although controlling the primary structure of synthetic polymers is itself a great challenge, the potential of sequence control for tailoring hierarchical structures remains to be exploited, especially in the creation of new and unconventional phases. A series of model amphiphilic chain-like giant molecules was designed and synthesized by interconnecting both hydrophobic and hydrophilic molecular nanoparticles in precisely defined sequence and composition to investigate their sequence-dependent phase structures. Not only compositional variation changed the self-assembled supramolecular phases, but also specific sequences induce unconventional phase formation, including Frank-Kasper phases. The formation mechanism was attributed to the conformational change driven by the collectivemore » hydrogen bonding and the sequence-mandated topology of the molecules. Lastly, these results show that sequence control in synthetic polymers can have a dramatic impact on polymer properties and self-assembly.« less
Sequence-Mandated, Distinct Assembly of Giant Molecules

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Wei; Lu, Xinlin; Mao, Jialin

Although controlling the primary structure of synthetic polymers is itself a great challenge, the potential of sequence control for tailoring hierarchical structures remains to be exploited, especially in the creation of new and unconventional phases. A series of model amphiphilic chain-like giant molecules was designed and synthesized by interconnecting both hydrophobic and hydrophilic molecular nanoparticles in precisely defined sequence and composition to investigate their sequence-dependent phase structures. Not only compositional variation changed the self-assembled supramolecular phases, but also specific sequences induce unconventional phase formation, including Frank-Kasper phases. The formation mechanism was attributed to the conformational change driven by the collectivemore » hydrogen bonding and the sequence-mandated topology of the molecules. Lastly, these results show that sequence control in synthetic polymers can have a dramatic impact on polymer properties and self-assembly.« less
Analysis of hepatitis C NS5A resistance associated polymorphisms using ultra deep single molecule real time (SMRT) sequencing.

PubMed

Bergfors, Assar; Leenheer, Daniël; Bergqvist, Anders; Ameur, Adam; Lennerstrand, Johan

2016-02-01

Development of Hepatitis C virus (HCV) resistance against direct-acting antivirals (DAAs), including NS5A inhibitors, is an obstacle to successful treatment of HCV when DAAs are used in sub-optimal combinations. Furthermore, it has been shown that baseline (pre-existing) resistance against DAAs is present in treatment naïve-patients and this will potentially complicate future treatment strategies in different HCV genotypes (GTs). Thus the aim was to detect low levels of NS5A resistant associated variants (RAVs) in a limited sample set of treatment-naïve patients of HCV GT1a and 3a, since such polymorphisms can display in vitro resistance as high as 60000 fold. Ultra-deep single molecule real time (SMRT) sequencing with the Pacific Biosciences (PacBio) RSII instrument was used to detect these RAVs. The SMRT sequencing was conducted on ten samples; three of them positive with Sanger sequencing (GT1a Q30H and Y93N, and GT3a Y93H), five GT1a samples, and two GT3a non-positive samples. The same methods were applied to the HCV GT1a H77-plasmid in a dilution series, in order to determine the error rates of replication, which in turn was used to determine the limit of detection (LOD), as defined by mean + 3SD, of minority variants down to 0.24%. We found important baseline NS5A RAVs at levels between 0.24 and 0.5%, which could potentially have clinical relevance. This new method with low level detection of baseline RAVs could be useful in predicting the most cost-efficient combination of DAA treatment, and reduce the treatment duration for an HCV infected individual. Copyright © 2015 Elsevier B.V. All rights reserved.
Single-Molecule Plasmon Sensing: Current Status and Future Prospects

PubMed Central

2017-01-01

Single-molecule detection has long relied on fluorescent labeling with high quantum-yield fluorophores. Plasmon-enhanced detection circumvents the need for labeling by allowing direct optical detection of weakly emitting and completely nonfluorescent species. This review focuses on recent advances in single molecule detection using plasmonic metal nanostructures as a sensing platform, particularly using a single particle–single molecule approach. In the past decade two mechanisms for plasmon-enhanced single-molecule detection have been demonstrated: (1) by plasmonically enhancing the emission of weakly fluorescent biomolecules, or (2) by monitoring shifts of the plasmon resonance induced by single-molecule interactions. We begin with a motivation regarding the importance of single molecule detection, and advantages plasmonic detection offers. We describe both detection mechanisms and discuss challenges and potential solutions. We finalize by highlighting the exciting possibilities in analytical chemistry and medical diagnostics. PMID:28762723
Single-molecule detection of epidermal growth factor receptor mutations in plasma by microfluidics digital PCR in non-small cell lung cancer patients.

PubMed

Yung, Tony K F; Chan, K C Allen; Mok, Tony S K; Tong, Joanna; To, Ka-Fai; Lo, Y M Dennis

2009-03-15

We aim to develop a digital PCR-based method for the quantitative detection of the two common epidermal growth factor receptor (EGFR) mutations (in-frame deletion at exon 19 and L858R at exon 21) in the plasma and tumor tissues of patients suffering from non-small cell lung cancers. These two mutations account for >85% of clinically important EGFR mutations associated with responsiveness to tyrosine kinase inhibitors. DNA samples were analyzed using a microfluidics system that simultaneously performed 9,180 PCRs at nanoliter scale. A single-mutant DNA molecule in a clinical specimen could be detected and the quantities of mutant and wild-type sequences were precisely determined. Exon 19 deletion and L858R mutation were detectable in 6 (17%) and 9 (26%) of 35 pretreatment plasma samples, respectively. When compared with the sequencing results of the tumor samples, the sensitivity and specificity of plasma EGFR mutation analysis were 92% and 100%, respectively. The plasma concentration of the mutant sequences correlated well with the clinical response. Decreased concentration was observed in all patients with partial or complete clinical remission, whereas persistence of mutation was observed in a patient with cancer progression. In one patient, tyrosine kinase inhibitor was stopped after an initial response and the tumor-associated EGFR mutation reemerged 4 weeks after stopping treatment. The sensitive detection and accurate quantification of low abundance EGFR mutations in tumor tissues and plasma by microfluidics digital PCR would be useful for predicting treatment response, monitoring disease progression and early detection of treatment failure associated with acquired drug resistance.
Deciphering hierarchical features in the energy landscape of adenylate kinase folding/unfolding

NASA Astrophysics Data System (ADS)

Taylor, J. Nicholas; Pirchi, Menahem; Haran, Gilad; Komatsuzaki, Tamiki

2018-03-01

Hierarchical features of the energy landscape of the folding/unfolding behavior of adenylate kinase, including its dependence on denaturant concentration, are elucidated in terms of single-molecule fluorescence resonance energy transfer (smFRET) measurements in which the proteins are encapsulated in a lipid vesicle. The core in constructing the energy landscape from single-molecule time-series across different denaturant concentrations is the application of rate-distortion theory (RDT), which naturally considers the effects of measurement noise and sampling error, in combination with change-point detection and the quantification of the FRET efficiency-dependent photobleaching behavior. Energy landscapes are constructed as a function of observation time scale, revealing multiple partially folded conformations at small time scales that are situated in a superbasin. As the time scale increases, these denatured states merge into a single basin, demonstrating the coarse-graining of the energy landscape as observation time increases. Because the photobleaching time scale is dependent on the conformational state of the protein, possible nonequilibrium features are discussed, and a statistical test for violation of the detailed balance condition is developed based on the state sequences arising from the RDT framework.
DNA interrogation by the CRISPR RNA-guided endonuclease Cas9.

PubMed

Sternberg, Samuel H; Redding, Sy; Jinek, Martin; Greene, Eric C; Doudna, Jennifer A

2014-03-06

The clustered regularly interspaced short palindromic repeats (CRISPR)-associated enzyme Cas9 is an RNA-guided endonuclease that uses RNA-DNA base-pairing to target foreign DNA in bacteria. Cas9-guide RNA complexes are also effective genome engineering agents in animals and plants. Here we use single-molecule and bulk biochemical experiments to determine how Cas9-RNA interrogates DNA to find specific cleavage sites. We show that both binding and cleavage of DNA by Cas9-RNA require recognition of a short trinucleotide protospacer adjacent motif (PAM). Non-target DNA binding affinity scales with PAM density, and sequences fully complementary to the guide RNA but lacking a nearby PAM are ignored by Cas9-RNA. Competition assays provide evidence that DNA strand separation and RNA-DNA heteroduplex formation initiate at the PAM and proceed directionally towards the distal end of the target sequence. Furthermore, PAM interactions trigger Cas9 catalytic activity. These results reveal how Cas9 uses PAM recognition to quickly identify potential target sites while scanning large DNA molecules, and to regulate scission of double-stranded DNA.
DNA interrogation by the CRISPR RNA-guided endonuclease Cas9

NASA Astrophysics Data System (ADS)

Sternberg, Samuel H.; Redding, Sy; Jinek, Martin; Greene, Eric C.; Doudna, Jennifer A.

2014-03-01

The clustered regularly interspaced short palindromic repeats (CRISPR)-associated enzyme Cas9 is an RNA-guided endonuclease that uses RNA-DNA base-pairing to target foreign DNA in bacteria. Cas9-guide RNA complexes are also effective genome engineering agents in animals and plants. Here we use single-molecule and bulk biochemical experiments to determine how Cas9-RNA interrogates DNA to find specific cleavage sites. We show that both binding and cleavage of DNA by Cas9-RNA require recognition of a short trinucleotide protospacer adjacent motif (PAM). Non-target DNA binding affinity scales with PAM density, and sequences fully complementary to the guide RNA but lacking a nearby PAM are ignored by Cas9-RNA. Competition assays provide evidence that DNA strand separation and RNA-DNA heteroduplex formation initiate at the PAM and proceed directionally towards the distal end of the target sequence. Furthermore, PAM interactions trigger Cas9 catalytic activity. These results reveal how Cas9 uses PAM recognition to quickly identify potential target sites while scanning large DNA molecules, and to regulate scission of double-stranded DNA.
Ordered array of CoPc-vacancies filled with single-molecule rotors

NASA Astrophysics Data System (ADS)

Xie, Zheng-Bo; Wang, Ya-Li; Tao, Min-Long; Sun, Kai; Tu, Yu-Bing; Yuan, Hong-Kuan; Wang, Jun-Zhong

2018-05-01

We report the highly ordered array of CoPc-vacancies and the single-molecule rotors inside the vacancies. When CoPc molecules are deposited on Cd(0001) at low-temperature, three types of molecular vacancies appeared randomly in the CoPc monolayer. Annealing the sample to higher temperature leads to the spontaneous phase separation and self-organized arrangement of the vacancies. Highly ordered arrays of two-molecule vacancies and single-molecule vacancies have been obtained. In particular, there is a rotating CoPc molecule inside each single-molecule vacancy, which constitutes the array of single-molecule rotors. These results provide a new routine to fabricate the nano-machines on a large scale.
Identifying active foraminifera in the Sea of Japan using metatranscriptomic approach

NASA Astrophysics Data System (ADS)

Lejzerowicz, Franck; Voltsky, Ivan; Pawlowski, Jan

2013-02-01

Metagenetics represents an efficient and rapid tool to describe environmental diversity patterns of microbial eukaryotes based on ribosomal DNA sequences. However, the results of metagenetic studies are often biased by the presence of extracellular DNA molecules that are persistent in the environment, especially in deep-sea sediment. As an alternative, short-lived RNA molecules constitute a good proxy for the detection of active species. Here, we used a metatranscriptomic approach based on RNA-derived (cDNA) sequences to study the diversity of the deep-sea benthic foraminifera and compared it to the metagenetic approach. We analyzed 257 ribosomal DNA and cDNA sequences obtained from seven sediments samples collected in the Sea of Japan at depths ranging from 486 to 3665 m. The DNA and RNA-based approaches gave a similar view of the taxonomic composition of foraminiferal assemblage, but differed in some important points. First, the cDNA dataset was dominated by sequences of rotaliids and robertiniids, suggesting that these calcareous species, some of which have been observed in Rose Bengal stained samples, are the most active component of foraminiferal community. Second, the richness of monothalamous (single-chambered) foraminifera was particularly high in DNA extracts from the deepest samples, confirming that this group of foraminifera is abundant but not necessarily very active in the deep-sea sediments. Finally, the high divergence of undetermined sequences in cDNA dataset indicate the limits of our database and lack of knowledge about some active but possibly rare species. Our study demonstrates the capability of the metatranscriptomic approach to detect active foraminiferal species and prompt its use in future high-throughput sequencing-based environmental surveys.
High Resolution Size Analysis of Fetal DNA in the Urine of Pregnant Women by Paired-End Massively Parallel Sequencing

PubMed Central

Tsui, Nancy B. Y.; Jiang, Peiyong; Chow, Katherine C. K.; Su, Xiaoxi; Leung, Tak Y.; Sun, Hao; Chan, K. C. Allen; Chiu, Rossa W. K.; Lo, Y. M. Dennis

2012-01-01

Background Fetal DNA in maternal urine, if present, would be a valuable source of fetal genetic material for noninvasive prenatal diagnosis. However, the existence of fetal DNA in maternal urine has remained controversial. The issue is due to the lack of appropriate technology to robustly detect the potentially highly degraded fetal DNA in maternal urine. Methodology We have used massively parallel paired-end sequencing to investigate cell-free DNA molecules in maternal urine. Catheterized urine samples were collected from seven pregnant women during the third trimester of pregnancies. We detected fetal DNA by identifying sequenced reads that contained fetal-specific alleles of the single nucleotide polymorphisms. The sizes of individual urinary DNA fragments were deduced from the alignment positions of the paired reads. We measured the fractional fetal DNA concentration as well as the size distributions of fetal and maternal DNA in maternal urine. Principal Findings Cell-free fetal DNA was detected in five of the seven maternal urine samples, with the fractional fetal DNA concentrations ranged from 1.92% to 4.73%. Fetal DNA became undetectable in maternal urine after delivery. The total urinary cell-free DNA molecules were less intact when compared with plasma DNA. Urinary fetal DNA fragments were very short, and the most dominant fetal sequences were between 29 bp and 45 bp in length. Conclusions With the use of massively parallel sequencing, we have confirmed the existence of transrenal fetal DNA in maternal urine, and have shown that urinary fetal DNA was heavily degraded. PMID:23118982
A Single Molecule Scaffold for the Maize Genome

PubMed Central

Zhou, Shiguo; Wei, Fusheng; Nguyen, John; Bechner, Mike; Potamousis, Konstantinos; Goldstein, Steve; Pape, Louise; Mehan, Michael R.; Churas, Chris; Pasternak, Shiran; Forrest, Dan K.; Wise, Roger; Ware, Doreen; Wing, Rod A.; Waterman, Michael S.; Livny, Miron; Schwartz, David C.

2009-01-01

About 85% of the maize genome consists of highly repetitive sequences that are interspersed by low-copy, gene-coding sequences. The maize community has dealt with this genomic complexity by the construction of an integrated genetic and physical map (iMap), but this resource alone was not sufficient for ensuring the quality of the current sequence build. For this purpose, we constructed a genome-wide, high-resolution optical map of the maize inbred line B73 genome containing >91,000 restriction sites (averaging 1 site/∼23 kb) accrued from mapping genomic DNA molecules. Our optical map comprises 66 contigs, averaging 31.88 Mb in size and spanning 91.5% (2,103.93 Mb/∼2,300 Mb) of the maize genome. A new algorithm was created that considered both optical map and unfinished BAC sequence data for placing 60/66 (2,032.42 Mb) optical map contigs onto the maize iMap. The alignment of optical maps against numerous data sources yielded comprehensive results that proved revealing and productive. For example, gaps were uncovered and characterized within the iMap, the FPC (fingerprinted contigs) map, and the chromosome-wide pseudomolecules. Such alignments also suggested amended placements of FPC contigs on the maize genetic map and proactively guided the assembly of chromosome-wide pseudomolecules, especially within complex genomic regions. Lastly, we think that the full integration of B73 optical maps with the maize iMap would greatly facilitate maize sequence finishing efforts that would make it a valuable reference for comparative studies among cereals, or other maize inbred lines and cultivars. PMID:19936062
Amino acid sequence of bovine muzzle epithelial desmocollin derived from cloned cDNA: a novel subtype of desmosomal cadherins.

PubMed

Koch, P J; Goldschmidt, M D; Walsh, M J; Zimbelmann, R; Schmelz, M; Franke, W W

1991-05-01

Desmosomes are cell-type-specific intercellular junctions found in epithelium, myocardium and certain other tissues. They consist of assemblies of molecules involved in the adhesion of specific cell types and in the anchorage of cell-type-specific cytoskeletal elements, the intermediate-size filaments, to the plasma membrane. To explore the individual desmosomal components and their functions we have isolated DNA clones encoding the desmosomal glycoprotein, desmocollin, using antibodies and a cDNA expression library from bovine muzzle epithelium. The cDNA-deduced amino-acid sequence of desmocollin (presently we cannot decide to which of the two desmocollins, DC I or DC II, this clone relates) defines a polypeptide with a calculated molecular weight of 85,000, with a single candidate sequence of 24 amino acids sufficiently long for a transmembrane arrangement, and an extracellular aminoterminal portion of 561 amino acid residues, compared to a cytoplasmic part of only 176 amino acids. Amino acid sequence comparisons have revealed that desmocollin is highly homologous to members of the cadherin family of cell adhesion molecules, including the previously sequenced desmoglein, another desmosome-specific cadherin. Using riboprobes derived from cDNAs for Northern-blot analyses, we have identified an mRNA of approximately 6 kb in stratified epithelia such as muzzle epithelium and tongue mucosa but not in two epithelial cell culture lines containing desmosomes and desmoplakins. The difference may indicate drastic differences in mRNA concentration or the existence of cell-type-specific desmocollin subforms. The molecular topology of desmocollin(s) is discussed in relation to possible functions of the individual molecular domains.
New small-molecule inhibitor class targeting human immunodeficiency virus type 1 virion maturation.

PubMed

Blair, Wade S; Cao, Joan; Fok-Seang, Juin; Griffin, Paul; Isaacson, Jason; Jackson, R Lynn; Murray, Edward; Patick, Amy K; Peng, Qinghai; Perros, Manos; Pickford, Chris; Wu, Hua; Butler, Scott L

2009-12-01

A new small-molecule inhibitor class that targets virion maturation was identified from a human immunodeficiency virus type 1 (HIV-1) antiviral screen. PF-46396, a representative molecule, exhibits antiviral activity against HIV-1 laboratory strains and clinical isolates in T-cell lines and peripheral blood mononuclear cells (PBMCs). PF-46396 specifically inhibits the processing of capsid (CA)/spacer peptide 1 (SP1) (p25), resulting in the accumulation of CA/SP1 (p25) precursor proteins and blocked maturation of the viral core particle. Viral variants resistant to PF-46396 contain a single amino acid substitution in HIV-1 CA sequences (CAI201V), distal to the CA/SP1 cleavage site in the primary structure, which we demonstrate is sufficient to confer significant resistance to PF-46396 and 3-O-(3',3'-dimethylsuccinyl) betulinic acid (DSB), a previously described maturation inhibitor. Conversely, a single amino substitution in SP1 (SP1A1V), which was previously associated with DSB in vitro resistance, was sufficient to confer resistance to DSB and PF-46396. Further, the CAI201V substitution restored CA/SP1 processing in HIV-1-infected cells treated with PF-46396 or DSB. Our results demonstrate that PF-46396 acts through a mechanism that is similar to DSB to inhibit the maturation of HIV-1 virions. To our knowledge, PF-46396 represents the first small-molecule HIV-1 maturation inhibitor that is distinct in chemical class from betulinic acid-derived maturation inhibitors (e.g., DSB), demonstrating that molecules of diverse chemical classes can inhibit this mechanism.

Molecular vibrations in metal-single-molecule-metal junctions

NASA Astrophysics Data System (ADS)

Yokota, Kazumichi; Taniguchi, Masateru; Kawai, Tomoji

2010-03-01

Molecular vibrations in a metal-single-molecule-metal junction were studied based on density functional theory using a single benzenedithiolate molecule connected between gold clusters. We found that the difference in vibrational energy between an isolated benzenedithiol and the single-molecule junction is less than 3% in the energy range above 540 cm -1, where sulfur atoms contribute little to molecular vibrations. The finding implies that we can predict the peak energy in the inelastic electron tunneling spectrum of the single-molecule junction in the high energy range by vibrational analyses of isolated molecules.
Diverse circular replication-associated protein encoding viruses circulating in invertebrates within a lake ecosystem.

PubMed

Dayaram, Anisha; Galatowitsch, Mark L; Argüello-Astorga, Gerardo R; van Bysterveldt, Katherine; Kraberger, Simona; Stainton, Daisy; Harding, Jon S; Roumagnac, Philippe; Martin, Darren P; Lefeuvre, Pierre; Varsani, Arvind

2016-04-01

Over the last five years next-generation sequencing has become a cost effective and efficient method for identifying known and unknown microorganisms. Access to this technique has dramatically changed the field of virology, enabling a wide range of environmental viral metagenome studies to be undertaken of organisms and environmental samples from polar to tropical regions. These studies have led to the discovery of hundreds of highly divergent single stranded DNA (ssDNA) virus-like sequences encoding replication-associated proteins. Yet, few studies have explored how viruses might be shared in an ecosystem through feeding relationships. Here we identify 169 circular molecules (160 CRESS DNA molecules, nine circular molecules) recovered from a New Zealand freshwater lake, that we have tentatively classified into 51 putatively novel species and five previously described species (DflaCV-3, -5, -6, -8, -10). The CRESS DNA viruses identified in this study were recovered from molluscs (Echyridella menzeisii, Musculium novaezelandiae, Potamopyrgus antipodarum and Physella acuta) and insect larvae (Procordulia grayi, Xanthocnemis zealandica, and Chironomus zealandicus) collected from Lake Sarah, as well as from the lake water and benthic sediments. Extensive diversity was observed across most CRESS DNA molecules recovered. The putative capsid protein of one viral species was found to be most similar to those of members of the Tombusviridae family, thus expanding the number of known RNA-DNA hybrid viruses in nature. We noted a strong association between the CRESS DNA viruses and circular molecules identified in the water and browser organisms (C. zealandicus, P. antipodarum and P. acuta), and between water sediments and undefended prey species (C. zealandicus). However, we were unable to find any significant correlation of viral assemblages to the potential feeding relationships of the host aquatic invertebrates. Copyright © 2016 Elsevier B.V. All rights reserved.
Physical Chemistry of Nucleic Acids

NASA Astrophysics Data System (ADS)

Tinoco, Ignacio

2002-10-01

The Watson-Crick double helix of DNA was first revealed in 1953. Since then a wide range of physical chemical methods have been applied to DNA and to its more versatile relative RNA to determine their structures and functions. My major goal is to predict the folded structure of any RNA from its sequence. We have used bulk and single-molecule measurements of thermodynamics and kinetics, plus various spectroscopic methods (UV absorption, optical rotation, circular dichroism, circular intensity differential scattering, fluorescence, NMR) to approach this goal.
Identification of conformational epitopes for human IgG on Chemotaxis inhibitory protein of Staphylococcus aureus

PubMed Central

Gustafsson, Erika; Haas, Pieter-Jan; Walse, Björn; Hijnen, Marcel; Furebring, Christina; Ohlin, Mats; van Strijp, Jos AG; van Kessel, Kok PM

2009-01-01

Background The Chemotaxis inhibitory protein of Staphylococcus aureus (CHIPS) blocks the Complement fragment C5a receptor (C5aR) and formylated peptide receptor (FPR) and is thereby a potent inhibitor of neutrophil chemotaxis and activation of inflammatory responses. The majority of the healthy human population has antibodies against CHIPS that have been shown to interfere with its function in vitro. The aim of this study was to define potential epitopes for human antibodies on the CHIPS surface. We also initiate the process to identify a mutated CHIPS molecule that is not efficiently recognized by preformed anti-CHIPS antibodies and retains anti-inflammatory activity. Results In this paper, we panned peptide displaying phage libraries against a pool of CHIPS specific affinity-purified polyclonal human IgG. The selected peptides could be divided into two groups of sequences. The first group was the most dominant with 36 of the 48 sequenced clones represented. Binding to human affinity-purified IgG was verified by ELISA for a selection of peptide sequences in phage format. For further analysis, one peptide was chemically synthesized and antibodies affinity-purified on this peptide were found to bind the CHIPS molecule as studied by ELISA and Surface Plasmon Resonance. Furthermore, seven potential conformational epitopes responsible for antibody recognition were identified by mapping phage selected peptide sequences on the CHIPS surface as defined in the NMR structure of the recombinant CHIPS31–121 protein. Mapped epitopes were verified by in vitro mutational analysis of the CHIPS molecule. Single mutations introduced in the proposed antibody epitopes were shown to decrease antibody binding to CHIPS. The biological function in terms of C5aR signaling was studied by flow cytometry. A few mutations were shown to affect this biological function as well as the antibody binding. Conclusion Conformational epitopes recognized by human antibodies have been mapped on the CHIPS surface and amino acid residues involved in both antibody and C5aR interaction could be defined. This information has implications for the development of an effective anti-inflammatory agent based on a functional CHIPS molecule with low interaction with human IgG. PMID:19284584
Chemical synthesis and characterization of branched oligodeoxyribonucleotides (bDNA) for use as signal amplifiers in nucleic acid quantification assays.

PubMed

Horn, T; Chang, C A; Urdea, M S

1997-12-01

The divergent synthesis of bDNA structures is described. This new type of branched DNA contains one unique oligonucleotide, the primary sequence, covalently attached through a comb-like branching network to many identical copies of a different oligonucleotide, the secondary sequence. The bDNA comb molecules were assembled on a solid support using parameters optimized for bDNA synthesis. The chemistry was used to synthesize bDNA comb molecules containing 15 secondary sequences. The bDNA comb molecules were elaborated by enzymatic ligation into branched amplification multimers, large bDNA molecules (a total of 1068 nt) containing an average of 36 repeated DNA oligomer sequences, each capable of hybridizing specifically to an alkaline phosphatase-labeled oligonucleotide. The bDNA comb molecules were characterized by electrophoretic methods and by controlled cleavage at periodate-cleavable moieties incorporated during synthesis. The branched amplification multimers have been used as signal amplifiers in nucleic acid quantification assays for detection of viral infection. It is possible to detect as few as 50 molecules with bDNA technology.
Chemical synthesis and characterization of branched oligodeoxyribonucleotides (bDNA) for use as signal amplifiers in nucleic acid quantification assays.

PubMed Central

Horn, T; Chang, C A; Urdea, M S

1997-01-01

The divergent synthesis of bDNA structures is described. This new type of branched DNA contains one unique oligonucleotide, the primary sequence, covalently attached through a comb-like branching network to many identical copies of a different oligonucleotide, the secondary sequence. The bDNA comb molecules were assembled on a solid support using parameters optimized for bDNA synthesis. The chemistry was used to synthesize bDNA comb molecules containing 15 secondary sequences. The bDNA comb molecules were elaborated by enzymatic ligation into branched amplification multimers, large bDNA molecules (a total of 1068 nt) containing an average of 36 repeated DNA oligomer sequences, each capable of hybridizing specifically to an alkaline phosphatase-labeled oligonucleotide. The bDNA comb molecules were characterized by electrophoretic methods and by controlled cleavage at periodate-cleavable moieties incorporated during synthesis. The branched amplification multimers have been used as signal amplifiers in nucleic acid quantification assays for detection of viral infection. It is possible to detect as few as 50 molecules with bDNA technology. PMID:9365266
Method for sequencing nucleic acid molecules

DOEpatents

Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

2006-06-06

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Method for sequencing nucleic acid molecules

DOEpatents

Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

2006-05-30

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
The Complete Chloroplast Genome of Banana (Musa acuminata, Zingiberales): Insight into Plastid Monocotyledon Evolution

PubMed Central

Martin, Guillaume; Baurens, Franc-Christophe; Cardi, Céline; Aury, Jean-Marc; D’Hont, Angélique

2013-01-01

Background Banana (genus Musa) is a crop of major economic importance worldwide. It is a monocotyledonous member of the Zingiberales, a sister group of the widely studied Poales. Most cultivated bananas are natural Musa inter-(sub-)specific triploid hybrids. A Musa acuminata reference nuclear genome sequence was recently produced based on sequencing of genomic DNA enriched in nucleus. Methodology/Principal Findings The Musa acuminata chloroplast genome was assembled with chloroplast reads extracted from whole-genome-shotgun sequence data. The Musa chloroplast genome is a circular molecule of 169,972 bp with a quadripartite structure containing two single copy regions, a Large Single Copy region (LSC, 88,338 bp) and a Small Single Copy region (SSC, 10,768 bp) separated by Inverted Repeat regions (IRs, 35,433 bp). Two forms of the chloroplast genome relative to the orientation of SSC versus LSC were found. The Musa chloroplast genome shows an extreme IR expansion at the IR/SSC boundary relative to the most common structures found in angiosperms. This expansion consists of the integration of three additional complete genes (rps15, ndhH and ycf1) and part of the ndhA gene. No such expansion has been observed in monocots so far. Simple Sequence Repeats were identified in the Musa chloroplast genome and a new set of Musa chloroplastic markers was designed. Conclusion The complete sequence of M. acuminata ssp malaccensis chloroplast we reported here is the first one for the Zingiberales order. As such it provides new insight in the evolution of the chloroplast of monocotyledons. In particular, it reinforces that IR/SSC expansion has occurred independently several times within monocotyledons. The discovery of new polymorphic markers within Musa chloroplast opens new perspectives to better understand the origin of cultivated triploid bananas. PMID:23840670
The complete chloroplast genome of banana (Musa acuminata, Zingiberales): insight into plastid monocotyledon evolution.

PubMed

Martin, Guillaume; Baurens, Franc-Christophe; Cardi, Céline; Aury, Jean-Marc; D'Hont, Angélique

2013-01-01

Banana (genus Musa) is a crop of major economic importance worldwide. It is a monocotyledonous member of the Zingiberales, a sister group of the widely studied Poales. Most cultivated bananas are natural Musa inter-(sub-)specific triploid hybrids. A Musa acuminata reference nuclear genome sequence was recently produced based on sequencing of genomic DNA enriched in nucleus. The Musa acuminata chloroplast genome was assembled with chloroplast reads extracted from whole-genome-shotgun sequence data. The Musa chloroplast genome is a circular molecule of 169,972 bp with a quadripartite structure containing two single copy regions, a Large Single Copy region (LSC, 88,338 bp) and a Small Single Copy region (SSC, 10,768 bp) separated by Inverted Repeat regions (IRs, 35,433 bp). Two forms of the chloroplast genome relative to the orientation of SSC versus LSC were found. The Musa chloroplast genome shows an extreme IR expansion at the IR/SSC boundary relative to the most common structures found in angiosperms. This expansion consists of the integration of three additional complete genes (rps15, ndhH and ycf1) and part of the ndhA gene. No such expansion has been observed in monocots so far. Simple Sequence Repeats were identified in the Musa chloroplast genome and a new set of Musa chloroplastic markers was designed. The complete sequence of M. acuminata ssp malaccensis chloroplast we reported here is the first one for the Zingiberales order. As such it provides new insight in the evolution of the chloroplast of monocotyledons. In particular, it reinforces that IR/SSC expansion has occurred independently several times within monocotyledons. The discovery of new polymorphic markers within Musa chloroplast opens new perspectives to better understand the origin of cultivated triploid bananas.
Discrete microfluidics for the isolation of circulating tumor cell subpopulations targeting fibroblast activation protein alpha and epithelial cell adhesion molecule.

PubMed

Witek, Małgorzata A; Aufforth, Rachel D; Wang, Hong; Kamande, Joyce W; Jackson, Joshua M; Pullagurla, Swathi R; Hupert, Mateusz L; Usary, Jerry; Wysham, Weiya Z; Hilliard, Dawud; Montgomery, Stephanie; Bae-Jump, Victoria; Carey, Lisa A; Gehrig, Paola A; Milowsky, Matthew I; Perou, Charles M; Soper, John T; Whang, Young E; Yeh, Jen Jen; Martin, George; Soper, Steven A

2017-01-01

Circulating tumor cells consist of phenotypically distinct subpopulations that originate from the tumor microenvironment. We report a circulating tumor cell dual selection assay that uses discrete microfluidics to select circulating tumor cell subpopulations from a single blood sample; circulating tumor cells expressing the established marker epithelial cell adhesion molecule and a new marker, fibroblast activation protein alpha, were evaluated. Both circulating tumor cell subpopulations were detected in metastatic ovarian, colorectal, prostate, breast, and pancreatic cancer patients and 90% of the isolated circulating tumor cells did not co-express both antigens. Clinical sensitivities of 100% showed substantial improvement compared to epithelial cell adhesion molecule selection alone. Owing to high purity (>80%) of the selected circulating tumor cells, molecular analysis of both circulating tumor cell subpopulations was carried out in bulk, including next generation sequencing, mutation analysis, and gene expression. Results suggested fibroblast activation protein alpha and epithelial cell adhesion molecule circulating tumor cells are distinct subpopulations and the use of these in concert can provide information needed to navigate through cancer disease management challenges.
A force-based, parallel assay for the quantification of protein-DNA interactions.

PubMed

Limmer, Katja; Pippig, Diana A; Aschenbrenner, Daniela; Gaub, Hermann E

2014-01-01

Analysis of transcription factor binding to DNA sequences is of utmost importance to understand the intricate regulatory mechanisms that underlie gene expression. Several techniques exist that quantify DNA-protein affinity, but they are either very time-consuming or suffer from possible misinterpretation due to complicated algorithms or approximations like many high-throughput techniques. We present a more direct method to quantify DNA-protein interaction in a force-based assay. In contrast to single-molecule force spectroscopy, our technique, the Molecular Force Assay (MFA), parallelizes force measurements so that it can test one or multiple proteins against several DNA sequences in a single experiment. The interaction strength is quantified by comparison to the well-defined rupture stability of different DNA duplexes. As a proof-of-principle, we measured the interaction of the zinc finger construct Zif268/NRE against six different DNA constructs. We could show the specificity of our approach and quantify the strength of the protein-DNA interaction.
Conformational Transitions and Stop-and-Go Nanopore Transport of Single Stranded DNA on Charged Graphene

PubMed Central

Shankla, Manish; Aksimentiev, Aleksei

2014-01-01

Control over interactions with biomolecules holds the key to applications of graphene in biotechnology. One such application is nanopore sequencing, where a DNA molecule is electrophoretically driven through a graphene nanopore. Here, we investigate how interactions of single-stranded DNA and a graphene membrane can be controlled by electrically biasing the membrane. The results of our molecular dynamics simulations suggest that electric charge on graphene can force a DNA homopolymer to adopt a range of strikingly different conformations. The conformational response is sensitive to even very subtle nucleotide modifications, such as DNA methylation. The speed of DNA motion through a graphene nanopore is strongly affected by the graphene charge: a positive charge accelerates the motion whereas a negative charge arrests it. As a possible application of the effect, we demonstrate stop-and-go transport of DNA controlled by the charge of graphene. Such on-demand transport of DNA is essential for realizing nanopore sequencing. PMID:25296960
Conformational transitions and stop-and-go nanopore transport of single-stranded DNA on charged graphene

NASA Astrophysics Data System (ADS)

Shankla, Manish; Aksimentiev, Aleksei

2014-10-01

Control over interactions with biomolecules holds the key to applications of graphene in biotechnology. One such application is nanopore sequencing, where a DNA molecule is electrophoretically driven through a graphene nanopore. Here we investigate how interactions of single-stranded DNA and a graphene membrane can be controlled by electrically biasing the membrane. The results of our molecular dynamics simulations suggest that electric charge on graphene can force a DNA homopolymer to adopt a range of strikingly different conformations. The conformational response is sensitive to even very subtle nucleotide modifications, such as DNA methylation. The speed of DNA motion through a graphene nanopore is strongly affected by the graphene charge: a positive charge accelerates the motion, whereas a negative charge arrests it. As a possible application of the effect, we demonstrate stop-and-go transport of DNA controlled by the charge of graphene. Such on-demand transport of DNA is essential for realizing nanopore sequencing.
Reconstitution of wild type viral DNA in simian cells transfected with early and late SV40 defective genomes.

PubMed

O'Neill, F J; Gao, Y; Xu, X

1993-11-01

The DNAs of polyomaviruses ordinarily exist as a single circular molecule of approximately 5000 base pairs. Variants of SV40, BKV and JCV have been described which contain two complementing defective DNA molecules. These defectives, which form a bipartite genome structure, contain either the viral early region or the late region. The defectives have the unique property of being able to tolerate variable sized reiterations of regulatory and terminus region sequences, and portions of the coding region. They can also exchange coding region sequences with other polyomaviruses. It has been suggested that the bipartite genome structure might be a stage in the evolution of polyomaviruses which can uniquely sustain genome and sequence diversity. However, it is not known if the regulatory and terminus region sequences are highly mutable. Also, it is not known if the bipartite genome structure is reversible and what the conditions might be which would favor restoration of the monomolecular genome structure. We addressed the first question by sequencing the reiterated regulatory and terminus regions of E- and L-SV40 DNAs. This revealed a large number of mutations in the regulatory regions of the defective genomes, including deletions, insertions, rearrangements and base substitutions. We also detected insertions and base substitutions in the T-antigen gene. We addressed the second question by introducing into permissive simian cells, E- and L-SV40 genomes which had been engineered to contain only a single regulatory region. Analysis of viral DNA from transfected cells demonstrated recombined genomes containing a wild type monomolecular DNA structure. However, the complete defectives, containing reiterated regulatory regions, could often compete away the wild type genomes. The recombinant monomolecular genomes were isolated, cloned and found to be infectious. All of the DNA alterations identified in one of the regulatory regions of E-SV40 DNA were present in the recombinant monomolecular genomes. These and other findings indicate that the bipartite genome state can sustain many mutations which wtSV40 cannot directly sustain. However, the mutations can later be introduced into the wild type genomes when the E- and L-SV40 DNAs recombine to generate a new monomolecular genome structure.
Threading DNA through nanopores for biosensing applications

NASA Astrophysics Data System (ADS)

Fyta, Maria

2015-07-01

This review outlines the recent achievements in the field of nanopore research. Nanopores are typically used in single-molecule experiments and are believed to have a high potential to realize an ultra-fast and very cheap genome sequencer. Here, the various types of nanopore materials, ranging from biological to 2D nanopores are discussed together with their advantages and disadvantages. These nanopores can utilize different protocols to read out the DNA nucleobases. Although, the first nanopore devices have reached the market, many still have issues which do not allow a full realization of a nanopore sequencer able to sequence the human genome in about a day. Ways to control the DNA, its dynamics and speed as the biomolecule translocates the nanopore in order to increase the signal-to-noise ratio in the reading-out process are examined in this review. Finally, the advantages, as well as the drawbacks in distinguishing the DNA nucleotides, i.e., the genetic information, are presented in view of their importance in the field of nanopore sequencing.
Effects of pre- and pro-sequence of thaumatin on the secretion by Pichia pastoris.

PubMed

Ide, Nobuyuki; Masuda, Tetsuya; Kitabatake, Naofumi

2007-11-23

Thaumatin is a 22-kDa sweet-tasting protein containing eight disulfide bonds. When thaumatin is expressed in Pichia pastoris using the thaumatin cDNA fused with both the alpha-factor signal sequence and the Kex2 protease cleavage site from Saccharomyces cerevisiae, the N-terminal sequence of the secreted thaumatin molecule is not processed correctly. To examine the role of the thaumatin cDNA-encoded N-terminal pre-sequence and C-terminal pro-sequence on the processing of thaumatin and efficiency of thaumatin production in P. pastoris, four expression plasmids with different pre-sequence and pro-sequence were constructed and transformed into P. pastoris. The transformants containing pre-thaumatin gene that has the native plant signal, secreted thaumatin molecules in the medium. The N-terminal amino acid sequence of the secreted thaumatin molecule was processed correctly. The production yield of thaumatin was not affected by the C-terminal pro-sequence, and the pro-sequence was not processed in P. pastoris, indicating that pro-sequence is not necessary for thaumatin synthesis.
Single-molecule detection: applications to ultrasensitive biochemical analysis

NASA Astrophysics Data System (ADS)

Castro, Alonso; Shera, E. Brooks

1995-06-01

Recent developments in laser-based detection of fluorescent molecules have made possible the implementation of very sensitive techniques for biochemical analysis. We present and discuss our experiments on the applications of our recently developed technique of single-molecule detection to the analysis of molecules of biological interest. These newly developed methods are capable of detecting and identifying biomolecules at the single-molecule level of sensitivity. In one case, identification is based on measuring fluorescence brightness from single molecules. In another, molecules are classified by determining their electrophoretic velocities.
Evolutionary origins of the emergent ST796 clone of vancomycin resistant Enterococcus faecium

PubMed Central

Buultjens, Andrew H.; Lam, Margaret M.C.; Ballard, Susan; Monk, Ian R.; Mahony, Andrew A.; Grabsch, Elizabeth A.; Grayson, M. Lindsay; Pang, Stanley; Coombs, Geoffrey W.; Robinson, J. Owen; Seemann, Torsten; Howden, Benjamin P.

2017-01-01

From early 2012, a novel clone of vancomycin resistant Enterococcus faecium (assigned the multi locus sequence type ST796) was simultaneously isolated from geographically separate hospitals in south eastern Australia and New Zealand. Here we describe the complete genome sequence of Ef_aus0233, a representative ST796 E. faecium isolate. We used PacBio single molecule real-time sequencing to establish a high quality, fully assembled genome comprising a circular chromosome of 2,888,087 bp and five plasmids. Comparison of Ef_aus0233 to other E. faecium genomes shows Ef_aus0233 is a member of the epidemic hospital-adapted lineage and has evolved from an ST555-like ancestral progenitor by the accumulation or modification of five mosaic plasmids and five putative prophage, acquisition of two cryptic genomic islands, accrued chromosomal single nucleotide polymorphisms and a 80 kb region of recombination, also gaining Tn1549 and Tn916, transposons conferring resistance to vancomycin and tetracycline respectively. The genomic dissection of this new clone presented here underscores the propensity of the hospital E. faecium lineage to change, presumably in response to the specific conditions of hospital and healthcare environments. PMID:28149688
A single base substitution in the coding region for neurophysin II associated with familial central diabetes insipidus.

PubMed Central

Ito, M; Mori, Y; Oiso, Y; Saito, H

1991-01-01

To elucidate the molecular mechanism of familial central diabetes insipidus (FDI), we sequenced the arginine vasopressin-neurophysin II (AVP-NPII) gene in 2 patients belonging to a pedigree that is consistent with an autosomal dominant mode of inheritance. 10 patients with idiopathic central diabetes insipidus (IDI) and 5 normals were also studied. The AVP-NPII gene, locating on chromosome 20, consists of three exons that encode putative signal peptide, AVP, NPII, and glycoprotein. Using polymerase chain reaction, fragments including the promoter region and all coding regions were amplified from genomic DNA and subjected to direct sequencing. Sequences of 10 patients with IDI were identical with those of normals, while in 2 patients with FDI, a single base substitution was detected in one of two alleles of the AVP-NPII gene, indicating they were heterozygotes for this mutation. It was a G----A transition at nucleotide position 1859 in the second exon, resulting in a substitution of Gly for Ser at amino acid position 57 in the NPII moiety. It was speculated that the mutated AVP-NPII precursor or the mutated NPII molecule, through their conformational changes, might be responsible for AVP deficiency. Images PMID:1840604

Nano-fabrication of molecular electronic junctions by targeted modification of metal-molecule bonds

NASA Astrophysics Data System (ADS)

Jafri, S. Hassan M.; Löfås, Henrik; Blom, Tobias; Wallner, Andreas; Grigoriev, Anton; Ahuja, Rajeev; Ottosson, Henrik; Leifer, Klaus

2015-09-01

Reproducibility, stability and the coupling between electrical and molecular properties are central challenges in the field of molecular electronics. The field not only needs devices that fulfill these criteria but they also need to be up-scalable to application size. In this work, few-molecule based electronics devices with reproducible electrical characteristics are demonstrated. Our previously reported 5 nm gold nanoparticles (AuNP) coated with ω-triphenylmethyl (trityl) protected 1,8-octanedithiol molecules are trapped in between sub-20 nm gap spacing gold nanoelectrodes forming AuNP-molecule network. When the trityl groups are removed, reproducible devices and stable Au-thiol junctions are established on both ends of the alkane segment. The resistance of more than 50 devices is reduced by orders of magnitude as well as a reduction of the spread in the resistance histogram is observed. By density functional theory calculations the orders of magnitude decrease in resistance can be explained and supported by TEM observations thus indicating that the resistance changes and strongly improved resistance spread are related to the establishment of reproducible and stable metal-molecule bonds. The same experimental sequence is carried out using 1,6-hexanedithiol functionalized AuNPs. The average resistances as a function of molecular length, demonstrated herein, are comparable to the one found in single molecule devices.
Single-Molecule Spectroscopy and Imaging Over the Decades

PubMed Central

Moerner, W. E.; Shechtman, Yoav; Wang, Quan

2016-01-01

As of 2015, it has been 26 years since the first optical detection and spectroscopy of single molecules in condensed matter. This area of science has expanded far beyond the early low temperature studies in crystals to include single molecules in cells, polymers, and in solution. The early steps relied upon high-resolution spectroscopy of inhomogeneously broadened optical absorption profiles of molecular impurities in solids at low temperatures. Spectral fine structure arising directly from the position-dependent fluctuations of the number of molecules in resonance led to the attainment of the single-molecule limit in 1989 using frequency-modulation laser spectroscopy. In the early 1990's, a variety of fascinating physical effects were observed for individual molecules, including imaging of the light from single molecules as well as observations of spectral diffusion, optical switching and the ability to select different single molecules in the same focal volume simply by tuning the pumping laser frequency. In the room temperature regime, researchers showed that bursts of light from single molecules could be detected in solution, leading to imaging and microscopy by a variety of methods. Studies of single copies of the green fluorescent protein also uncovered surprises, especially the blinking and photoinduced recovery of emitters, which stimulated further development of photoswitchable fluorescent protein labels. All of these early steps provided important fundamentals underpinning the development of super-resolution microscopy based on single-molecule localization and active control of emitting concentration. Current thrust areas include extensions to three-dimensional imaging with high precision, orientational analysis of single molecules, and direct measurements of photodynamics and transport properties for single molecules trapped in solution by suppression of Brownian motion. Without question, a huge variety of studies of single molecules performed by many talented scientists all over the world have extended our knowledge of the nanoscale and microscopic mechanisms previously hidden by ensemble averaging. PMID:26616210
Alphasatellitidae: a new family with two subfamilies for the classification of geminivirus- and nanovirus-associated alphasatellites.

PubMed

Briddon, Rob W; Martin, Darren P; Roumagnac, Philippe; Navas-Castillo, Jesús; Fiallo-Olivé, Elvira; Moriones, Enrique; Lett, Jean-Michel; Zerbini, F Murilo; Varsani, Arvind

2018-05-09

Nanoviruses and geminiviruses are circular, single stranded DNA viruses that infect many plant species around the world. Nanoviruses and certain geminiviruses that belong to the Begomovirus and Mastrevirus genera are associated with additional circular, single stranded DNA molecules (~ 1-1.4 kb) that encode a replication-associated protein (Rep). These Rep-encoding satellite molecules are commonly referred to as alphasatellites and here we communicate the establishment of the family Alphasatellitidae to which these have been assigned. Within the Alphasatellitidae family two subfamilies, Geminialphasatellitinae and Nanoalphasatellitinae, have been established to respectively accommodate the geminivirus- and nanovirus-associated alphasatellites. Whereas the pairwise nucleotide sequence identity distribution of all the known geminialphasatellites (n = 628) displayed a troughs at ~ 70% and 88% pairwise identity, that of the known nanoalphasatellites (n = 54) had a troughs at ~ 67% and ~ 80% pairwise identity. We use these pairwise identity values as thresholds together with phylogenetic analyses to establish four genera and 43 species of geminialphasatellites and seven genera and 19 species of nanoalphasatellites. Furthermore, a divergent alphasatellite associated with coconut foliar decay disease is assigned to a species but not a subfamily as it likely represents a new alphasatellite subfamily that could be established once other closely related molecules are discovered.
Discovery of DNA viruses in wild-caught mosquitoes using small RNA high throughput sequencing.

PubMed

Ma, Maijuan; Huang, Yong; Gong, Zhengda; Zhuang, Lu; Li, Cun; Yang, Hong; Tong, Yigang; Liu, Wei; Cao, Wuchun

2011-01-01

Mosquito-borne infectious diseases pose a severe threat to public health in many areas of the world. Current methods for pathogen detection and surveillance are usually dependent on prior knowledge of the etiologic agents involved. Hence, efficient approaches are required for screening wild mosquito populations to detect known and unknown pathogens. In this study, we explored the use of Next Generation Sequencing to identify viral agents in wild-caught mosquitoes. We extracted total RNA from different mosquito species from South China. Small 18-30 bp length RNA molecules were purified, reverse-transcribed into cDNA and sequenced using Illumina GAIIx instrumentation. Bioinformatic analyses to identify putative viral agents were conducted and the results confirmed by PCR. We identified a non-enveloped single-stranded DNA densovirus in the wild-caught Culex pipiens molestus mosquitoes. The majority of the viral transcripts (.>80% of the region) were covered by the small viral RNAs, with a few peaks of very high coverage obtained. The +/- strand sequence ratio of the small RNAs was approximately 7∶1, indicating that the molecules were mainly derived from the viral RNA transcripts. The small viral RNAs overlapped, enabling contig assembly of the viral genome sequence. We identified some small RNAs in the reverse repeat regions of the viral 5'- and 3' -untranslated regions where no transcripts were expected. Our results demonstrate for the first time that high throughput sequencing of small RNA is feasible for identifying viral agents in wild-caught mosquitoes. Our results show that it is possible to detect DNA viruses by sequencing the small RNAs obtained from insects, although the underlying mechanism of small viral RNA biogenesis is unclear. Our data and those of other researchers show that high throughput small RNA sequencing can be used for pathogen surveillance in wild mosquito vectors.
Quantifying Genome Editing Outcomes at Endogenous Loci using SMRT Sequencing

PubMed Central

Clark, Joseph; Punjya, Niraj; Sebastiano, Vittorio; Bao, Gang; Porteus, Matthew H

2014-01-01

SUMMARY Targeted genome editing with engineered nucleases has transformed the ability to introduce precise sequence modifications at almost any site within the genome. A major obstacle to probing the efficiency and consequences of genome editing is that no existing method enables the frequency of different editing events to be simultaneously measured across a cell population at any endogenous genomic locus. We have developed a novel method for quantifying individual genome editing outcomes at any site of interest using single molecule real time (SMRT) DNA sequencing. We show that this approach can be applied at various loci, using multiple engineered nuclease platforms including TALENs, RNA guided endonucleases (CRISPR/Cas9), and ZFNs, and in different cell lines to identify conditions and strategies in which the desired engineering outcome has occurred. This approach facilitates the evaluation of new gene editing technologies and permits sensitive quantification of editing outcomes in almost every experimental system used. PMID:24685129
Replacement of RNA hairpins by in vitro selected tetranucleotides.

PubMed Central

Dichtl, B; Pan, T; DiRenzo, A B; Uhlenbeck, O C

1993-01-01

An in vitro selection method based on the autolytic cleavage of yeast tRNA(Phe) by Pb2+ was applied to obtain tRNA derivatives with the anticodon hairpin replaced by four single-stranded nucleotides. Based on the rates of the site-specific cleavage by Pb2+ and the presence of a specific UV-induced crosslink, certain tetranucleotide sequences allow proper folding of the rest of the tRNA molecule, whereas others do not. One such successful tetramer sequence was also used to replace the acceptor stem of yeast tRNA(Phe) and the anticodon hairpin of E.coli tRNA(Phe) without disrupting folding. These experiments suggest that certain tetramers may be able to replace structurally nonessential hairpins in any RNA. Images PMID:7680121
Sequence-structure mapping errors in the PDB: OB-fold domains

PubMed Central

Venclovas, Česlovas; Ginalski, Krzysztof; Kang, Chulhee

2004-01-01

The Protein Data Bank (PDB) is the single most important repository of structural data for proteins and other biologically relevant molecules. Therefore, it is critically important to keep the PDB data, as much as possible, error-free. In this study, we have analyzed PDB crystal structures possessing oligonucleotide/oligosaccharide binding (OB)-fold, one of the highly populated folds, for the presence of sequence-structure mapping errors. Using energy-based structure quality assessment coupled with sequence analyses, we have found that there are at least five OB-structures in the PDB that have regions where sequences have been incorrectly mapped onto the structure. We have demonstrated that the combination of these computation techniques is effective not only in detecting sequence-structure mapping errors, but also in providing guidance to correct them. Namely, we have used results of computational analysis to direct a revision of X-ray data for one of the PDB entries containing a fairly inconspicuous sequence-structure mapping error. The revised structure has been deposited with the PDB. We suggest use of computational energy assessment and sequence analysis techniques to facilitate structure determination when homologs having known structure are available to use as a reference. Such computational analysis may be useful in either guiding the sequence-structure assignment process or verifying the sequence mapping within poorly defined regions. PMID:15133161
Basic quantitative polymerase chain reaction using real-time fluorescence measurements.

PubMed

Ares, Manuel

2014-10-01

This protocol uses quantitative polymerase chain reaction (qPCR) to measure the number of DNA molecules containing a specific contiguous sequence in a sample of interest (e.g., genomic DNA or cDNA generated by reverse transcription). The sample is subjected to fluorescence-based PCR amplification and, theoretically, during each cycle, two new duplex DNA molecules are produced for each duplex DNA molecule present in the sample. The progress of the reaction during PCR is evaluated by measuring the fluorescence of dsDNA-dye complexes in real time. In the early cycles, DNA duplication is not detected because inadequate amounts of DNA are made. At a certain threshold cycle, DNA-dye complexes double each cycle for 8-10 cycles, until the DNA concentration becomes so high and the primer concentration so low that the reassociation of the product strands blocks efficient synthesis of new DNA and the reaction plateaus. There are two types of measurements: (1) the relative change of the target sequence compared to a reference sequence and (2) the determination of molecule number in the starting sample. The first requires a reference sequence, and the second requires a sample of the target sequence with known numbers of the molecules of sequence to generate a standard curve. By identifying the threshold cycle at which a sample first begins to accumulate DNA-dye complexes exponentially, an estimation of the numbers of starting molecules in the sample can be extrapolated. © 2014 Cold Spring Harbor Laboratory Press.
Virology: The Next Generation from Digital PCR to Single Virion Genomics

DOE Office of Scientific and Technical Information (OSTI.GOV)

White, Richard A.; Brazelton De Cardenas, Jessica N.; Hayden, Randall T.

In the past 25 years, virology has had major technology breakthroughs stemming first from the introduction of nucleic acid amplification testing, but more recently from the use of next-generation sequencing, digital PCR, and the possibility of single virion genomics. These technologies have and will improve diagnosis and disease state monitoring in clinical settings, aid in environmental monitoring, and reveal the vast genetic potential of viruses. Using the principle of limiting dilution, digital PCR amplifies single molecules of DNA in highly partitioned endpoint reactions and reads each of those reactions as either positive or negative based on the presence or absencemore » of target fluorophore. In this review, digital PCR will be highlighted along with current studies, advantages/disadvantages, and future perspectives with regard to digital PCR, viral load testing, and the possibility of single virion genomics.« less
Spectroscopic characterization of Venus at the single molecule level.

PubMed

David, Charlotte C; Dedecker, Peter; De Cremer, Gert; Verstraeten, Natalie; Kint, Cyrielle; Michiels, Jan; Hofkens, Johan

2012-02-01

Venus is a recently developed, fast maturating, yellow fluorescent protein that has been used as a probe for in vivo applications. In the present work the photophysical characteristics of Venus were analyzed spectroscopically at the bulk and single molecule level. Through time-resolved single molecule measurements we found that single molecules of Venus display pronounced fluctuations in fluorescence emission, with clear fluorescence on- and off-times. These fluorescence intermittencies were found to occupy a broad range of time scales, ranging from milliseconds to several seconds. Such long off-times can complicate the analysis of single molecule counting experiments or single-molecule FRET experiments. This journal is © The Royal Society of Chemistry and Owner Societies 2012
Single molecule data under scrutiny. Comment on "Extracting physics of life at the molecular level: A review of single-molecule data analyses" by W. Colomb & S.K. Sarkar

NASA Astrophysics Data System (ADS)

Wohland, Thorsten

2015-06-01

Single Molecule Detection and Spectroscopy have grown from their first beginnings into mainstream, mature research areas that are widely applied in the biological sciences. However, despite the advances in technology and the application of many single molecule techniques even in in vivo settings, the data analysis of single molecule experiments is complicated by noise, systematic errors, and complex underlying processes that are only incompletely understood. Colomb and Sarkar provide in this issue an overview of single molecule experiments and the accompanying problems in data analysis, which have to be overcome for a proper interpretation of the experiments [1].
Single Molecule Electronics and Devices

PubMed Central

Tsutsui, Makusu; Taniguchi, Masateru

2012-01-01

The manufacture of integrated circuits with single-molecule building blocks is a goal of molecular electronics. While research in the past has been limited to bulk experiments on self-assembled monolayers, advances in technology have now enabled us to fabricate single-molecule junctions. This has led to significant progress in understanding electron transport in molecular systems at the single-molecule level and the concomitant emergence of new device concepts. Here, we review recent developments in this field. We summarize the methods currently used to form metal-molecule-metal structures and some single-molecule techniques essential for characterizing molecular junctions such as inelastic electron tunnelling spectroscopy. We then highlight several important achievements, including demonstration of single-molecule diodes, transistors, and switches that make use of electrical, photo, and mechanical stimulation to control the electron transport. We also discuss intriguing issues to be addressed further in the future such as heat and thermoelectric transport in an individual molecule. PMID:22969345
The complete DNA sequence of lymphocystis disease virus.

PubMed

Tidona, C A; Darai, G

1997-04-14

Lymphocystis disease virus (LCDV) is the causative agent of lymphocystis disease, which has been reported to occur in over 100 different fish species worldwide. LCDV is a member of the family Iridoviridae and the type species of the genus Lymphocystivirus. The virions contain a single linear double-stranded DNA molecule, which is circularly permuted, terminally redundant, and heavily methylated at cytosines in CpG sequences. The complete nucleotide sequence of LCDV-1 (flounder isolate) was determined by automated cycle sequencing and primer walking. The genome of LCDV-1 is 102.653 bp in length and contains 195 open reading frames with coding capacities ranging from 40 to 1199 amino acids. Computer-assisted analyses of the deduced amino acid sequences led to the identification of several putative gene products with significant homologies to entries in protein data banks, such as the two major subunits of the viral DNA-dependent RNA polymerase, DNA polymerase, several protein kinases, two subunits of the ribonucleoside diphosphate reductase, DNA methyltransferase, the viral major capsid protein, insulin-like growth factor, and tumor necrosis factor receptor homolog.
Reversible gating of smart plasmonic molecular traps using thermoresponsive polymers for single-molecule detection

PubMed Central

Zheng, Yuanhui; Soeriyadi, Alexander H.; Rosa, Lorenzo; Ng, Soon Hock; Bach, Udo; Justin Gooding, J.

2015-01-01

Single-molecule surface-enhanced Raman spectroscopy (SERS) has attracted increasing interest for chemical and biochemical sensing. Many conventional substrates have a broad distribution of SERS enhancements, which compromise reproducibility and result in slow response times for single-molecule detection. Here we report a smart plasmonic sensor that can reversibly trap a single molecule at hotspots for rapid single-molecule detection. The sensor was fabricated through electrostatic self-assembly of gold nanoparticles onto a gold/silica-coated silicon substrate, producing a high yield of uniformly distributed hotspots on the surface. The hotspots were isolated with a monolayer of a thermoresponsive polymer (poly(N-isopropylacrylamide)), which act as gates for molecular trapping at the hotspots. The sensor shows not only a good SERS reproducibility but also a capability to repetitively trap and release molecules for single-molecular sensing. The single-molecule sensitivity is experimentally verified using SERS spectral blinking and bianalyte methods. PMID:26549539
An evolution based biosensor receptor DNA sequence generation algorithm.

PubMed

Kim, Eungyeong; Lee, Malrey; Gatton, Thomas M; Lee, Jaewan; Zang, Yupeng

2010-01-01

A biosensor is composed of a bioreceptor, an associated recognition molecule, and a signal transducer that can selectively detect target substances for analysis. DNA based biosensors utilize receptor molecules that allow hybridization with the target analyte. However, most DNA biosensor research uses oligonucleotides as the target analytes and does not address the potential problems of real samples. The identification of recognition molecules suitable for real target analyte samples is an important step towards further development of DNA biosensors. This study examines the characteristics of DNA used as bioreceptors and proposes a hybrid evolution-based DNA sequence generating algorithm, based on DNA computing, to identify suitable DNA bioreceptor recognition molecules for stable hybridization with real target substances. The Traveling Salesman Problem (TSP) approach is applied in the proposed algorithm to evaluate the safety and fitness of the generated DNA sequences. This approach improves efficiency and stability for enhanced and variable-length DNA sequence generation and allows extension to generation of variable-length DNA sequences with diverse receptor recognition requirements.
Tunable graphene quantum point contact transistor for DNA detection and characterization

PubMed Central

Girdhar, Anuj; Sathe, Chaitanya; Schulten, Klaus; Leburton, Jean-Pierre

2015-01-01

A graphene membrane conductor containing a nanopore in a quantum point contact (QPC) geometry is a promising candidate to sense, and potentially sequence, DNA molecules translocating through the nanopore. Within this geometry, the shape, size, and position of the nanopore as well as the edge configuration influences the membrane conductance caused by the electrostatic interaction between the DNA nucleotides and the nanopore edge. It is shown that the graphene conductance variations resulting from DNA translocation can be enhanced by choosing a particular geometry as well as by modulating the graphene Fermi energy, which demonstrates the ability to detect conformational transformations of a double-stranded DNA, as well as the passage of individual base pairs of a single-stranded DNA molecule through the nanopore. PMID:25765702
Direct single-molecule dynamic detection of chemical reactions.

PubMed

Guan, Jianxin; Jia, Chuancheng; Li, Yanwei; Liu, Zitong; Wang, Jinying; Yang, Zhongyue; Gu, Chunhui; Su, Dingkai; Houk, Kendall N; Zhang, Deqing; Guo, Xuefeng

2018-02-01

Single-molecule detection can reveal time trajectories and reaction pathways of individual intermediates/transition states in chemical reactions and biological processes, which is of fundamental importance to elucidate their intrinsic mechanisms. We present a reliable, label-free single-molecule approach that allows us to directly explore the dynamic process of basic chemical reactions at the single-event level by using stable graphene-molecule single-molecule junctions. These junctions are constructed by covalently connecting a single molecule with a 9-fluorenone center to nanogapped graphene electrodes. For the first time, real-time single-molecule electrical measurements unambiguously show reproducible large-amplitude two-level fluctuations that are highly dependent on solvent environments in a nucleophilic addition reaction of hydroxylamine to a carbonyl group. Both theoretical simulations and ensemble experiments prove that this observation originates from the reversible transition between the reactant and a new intermediate state within a time scale of a few microseconds. These investigations open up a new route that is able to be immediately applied to probe fast single-molecule physics or biophysics with high time resolution, making an important contribution to broad fields beyond reaction chemistry.
Direct single-molecule dynamic detection of chemical reactions

PubMed Central

Guan, Jianxin; Jia, Chuancheng; Li, Yanwei; Liu, Zitong; Wang, Jinying; Yang, Zhongyue; Gu, Chunhui; Su, Dingkai; Houk, Kendall N.; Zhang, Deqing; Guo, Xuefeng

2018-01-01

Single-molecule detection can reveal time trajectories and reaction pathways of individual intermediates/transition states in chemical reactions and biological processes, which is of fundamental importance to elucidate their intrinsic mechanisms. We present a reliable, label-free single-molecule approach that allows us to directly explore the dynamic process of basic chemical reactions at the single-event level by using stable graphene-molecule single-molecule junctions. These junctions are constructed by covalently connecting a single molecule with a 9-fluorenone center to nanogapped graphene electrodes. For the first time, real-time single-molecule electrical measurements unambiguously show reproducible large-amplitude two-level fluctuations that are highly dependent on solvent environments in a nucleophilic addition reaction of hydroxylamine to a carbonyl group. Both theoretical simulations and ensemble experiments prove that this observation originates from the reversible transition between the reactant and a new intermediate state within a time scale of a few microseconds. These investigations open up a new route that is able to be immediately applied to probe fast single-molecule physics or biophysics with high time resolution, making an important contribution to broad fields beyond reaction chemistry. PMID:29487914
Linearisation of λDNA molecules by instantaneous variation of the trapping electrode voltage inside a micro-channel

NASA Astrophysics Data System (ADS)

Hanasaki, Itsuo; Yukimoto, Naoya; Uehara, Satoshi; Shintaku, Hirofumi; Kawano, Satoyuki

2015-04-01

Because long DNA molecules usually exist in random coil states due to the entropic effect, linearisation is required for devices equipped with nanopores where electrical sequencing is necessary during single-file translocation. We present a novel technique for linearising DNA molecules in a micro-channel. In our device, electrodes are embedded in the bottom surface of the channel. The application of a voltage induces the trapping of λDNA molecules on the positive electrode. An instantaneous voltage drop is used to put the λDNA molecules in a partly released state and the hydrodynamic force of the solution induces linearisation. Phenomena were directly observed using an optical microscopy system equipped with a high-speed camera and the linearisation principle was explored in detail. Furthermore, we estimate the tensile characteristics produced by the flow of the solution through a numerical model of a tethered polymer subject to a Poiseuille flow. The mean tensile force is in the range of 0.1-1 pN. This is sufficiently smaller than the structural transition point of λDNA but counterbalances the entropic elasticity that causes the random coil shape of λDNA molecules in solution. We show the important role of thermal fluctuation in the manipulation of molecules in solution and clarify the tensile conditions required for DNA linearisation using a combination of solution flow and voltage variation in a microchannel.
The effect of quercetin on genetic expression of the commensal gut microbes Bifidobacterium catenulatum, Enterococcus caccae and Ruminococcus gauvreauii.

PubMed

Firrman, Jenni; Liu, LinShu; Zhang, Liqing; Arango Argoty, Gustavo; Wang, Minqian; Tomasula, Peggy; Kobori, Masuko; Pontious, Sherri; Xiao, Weidong

2016-12-01

Quercetin is one of the most abundant polyphenols found in fruits and vegetables. The ability of the gut microbiota to metabolize quercetin has been previously documented; however, the effect that quercetin may have on commensal gut microbes remains unclear. In the present study, the effects of quercetin on the commensal gut microbes Ruminococcus gauvreauii, Bifidobacterium catenulatum and Enterococcus caccae were determined through evaluation of growth patterns and cell morphology, and analysis of genetic expression profiles between quercetin treated and non-treated groups using Single Molecule RNA sequencing via Helicos technology. Results of this study revealed that phenotypically, quercetin did not prevent growth of Ruminococcus gauvreauii, mildly suppressed growth of Bifidobacterium catenulatum, and moderately inhibited growth of Enterococcus caccae. Genetic analysis revealed that in response to quercetin, Ruminococcus gauvreauii down regulated genes responsible for protein folding, purine synthesis and metabolism. Bifidobacterium catenulatum increased expression of the ABC transport pathway and decreased metabolic pathways and cell wall synthesis. Enterococcus caccae upregulated genes responsible for energy production and metabolism, and downregulated pathways of stress response, translation and sugar transport. For the first time, the effect of quercetin on the growth and genetic expression of three different commensal gut bacteria was documented. The data provides insight into the interactions between genetic regulation and growth. This is also a unique demonstration of how RNA single molecule sequencing can be used to study the gut microbiota. Published by Elsevier Ltd.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.