Using Poisson mixed-effects model to quantify transcript-level gene expression in RNA-Seq.
Hu, Ming; Zhu, Yu; Taylor, Jeremy M G; Liu, Jun S; Qin, Zhaohui S
2012-01-01
RNA sequencing (RNA-Seq) is a powerful new technology for mapping and quantifying transcriptomes using ultra high-throughput next-generation sequencing technologies. Using deep sequencing, gene expression levels of all transcripts including novel ones can be quantified digitally. Although extremely promising, the massive amounts of data generated by RNA-Seq, substantial biases and uncertainty in short read alignment pose challenges for data analysis. In particular, large base-specific variation and between-base dependence make simple approaches, such as those that use averaging to normalize RNA-Seq data and quantify gene expressions, ineffective. In this study, we propose a Poisson mixed-effects (POME) model to characterize base-level read coverage within each transcript. The underlying expression level is included as a key parameter in this model. Since the proposed model is capable of incorporating base-specific variation as well as between-base dependence that affect read coverage profile throughout the transcript, it can lead to improved quantification of the true underlying expression level. POME can be freely downloaded at http://www.stat.purdue.edu/~yuzhu/pome.html. yuzhu@purdue.edu; zhaohui.qin@emory.edu Supplementary data are available at Bioinformatics online.
Woolhouse, Mark
2017-07-01
Transmissibility is the defining characteristic of infectious diseases. Quantifying transmission matters for understanding infectious disease epidemiology and designing evidence-based disease control programs. Tracing individual transmission events can be achieved by epidemiological investigation coupled with pathogen typing or genome sequencing. Individual infectiousness can be estimated by measuring pathogen loads, but few studies have directly estimated the ability of infected hosts to transmit to uninfected hosts. Individuals' opportunities to transmit infection are dependent on behavioral and other risk factors relevant given the transmission route of the pathogen concerned. Transmission at the population level can be quantified through knowledge of risk factors in the population or phylogeographic analysis of pathogen sequence data. Mathematical model-based approaches require estimation of the per capita transmission rate and basic reproduction number, obtained by fitting models to case data and/or analysis of pathogen sequence data. Heterogeneities in infectiousness, contact behavior, and susceptibility can have substantial effects on the epidemiology of an infectious disease, so estimates of only mean values may be insufficient. For some pathogens, super-shedders (infected individuals who are highly infectious) and super-spreaders (individuals with more opportunities to transmit infection) may be important. Future work on quantifying transmission should involve integrated analyses of multiple data sources.
Tillage and cropping sequence impacts on nitrogen cycling in dryland farming in eastern Montana, USA
USDA-ARS?s Scientific Manuscript database
Information on N cycling in dryland crops and soils as influenced by long-term tillage and cropping sequence is needed to quantify soil N sequestration, mineralization, and N balance to reduce N fertilization rate and N losses through soil processes. We evaluated the 21-yr effects of combinations of...
Peng, Xian; Yuan, Han; Chen, Wufan; Ding, Lei
2017-01-01
Continuous loop averaging deconvolution (CLAD) is one of the proven methods for recovering transient auditory evoked potentials (AEPs) in rapid stimulation paradigms, which requires an elaborated stimulus sequence design to attenuate impacts from noise in data. The present study aimed to develop a new metric in gauging a CLAD sequence in terms of noise gain factor (NGF), which has been proposed previously but with less effectiveness in the presence of pink (1/f) noise. We derived the new metric by explicitly introducing the 1/f model into the proposed time-continuous sequence. We selected several representative CLAD sequences to test their noise property on typical EEG recordings, as well as on five real CLAD electroencephalogram (EEG) recordings to retrieve the middle latency responses. We also demonstrated the merit of the new metric in generating and quantifying optimized sequences using a classic genetic algorithm. The new metric shows evident improvements in measuring actual noise gains at different frequencies, and better performance than the original NGF in various aspects. The new metric is a generalized NGF measurement that can better quantify the performance of a CLAD sequence, and provide a more efficient mean of generating CLAD sequences via the incorporation with optimization algorithms. The present study can facilitate the specific application of CLAD paradigm with desired sequences in the clinic. PMID:28414803
Droplet Digital™ PCR Next-Generation Sequencing Library QC Assay.
Heredia, Nicholas J
2018-01-01
Digital PCR is a valuable tool to quantify next-generation sequencing (NGS) libraries precisely and accurately. Accurately quantifying NGS libraries enable accurate loading of the libraries on to the sequencer and thus improve sequencing performance by reducing under and overloading error. Accurate quantification also benefits users by enabling uniform loading of indexed/barcoded libraries which in turn greatly improves sequencing uniformity of the indexed/barcoded samples. The advantages gained by employing the Droplet Digital PCR (ddPCR™) library QC assay includes the precise and accurate quantification in addition to size quality assessment, enabling users to QC their sequencing libraries with confidence.
Browning, J.V.; Miller, K.G.; McLaughlin, P.P.; Kominz, M.A.; Sugarman, P.J.; Monteverde, D.; Feigenson, M.D.; Hernandez, J.C.
2006-01-01
We use backstripping to quantify the roles of variations in global sea level (eustasy), subsidence, and sediment supply on the development of the Miocene stratigraphic record of the mid-Atlantic continental margin of the United States (New Jersey, Delaware, and Maryland). Eustasy is a primary influence on sequence patterns, determining the global template of sequences (i.e., times when sequences can be preserved) and explaining similarities in Miocene sequence architecture on margins throughout the world. Sequences can be correlated throughout the mid-Atlantic region with Sr-isotopic chronology (??0.6 m.y. to ??1.2 m.y.). Eight Miocene sequences correlate regionally and can be correlated to global ??18O increases, indicating glacioeustatic control. This margin is dominated by passive subsidence with little evidence for active tectonic overprints, except possibly in Maryland during the early Miocene. However, early Miocene sequences in New Jersey and Delaware display a patchwork distribution that is attributable to minor (tens of meters) intervals of excess subsidence. Backstripping quantifies that excess subsidence began in Delaware at ca. 21 Ma and continued until 12 Ma, with maximum rates from ca. 21-16 Ma. We attribute this enhanced subsidence to local flexural response to the progradation of thick sequences offshore and adjacent to this area. Removing this excess subsidence in Delaware yields a record that is remarkably similar to New Jersey eustatic estimates. We conclude that sea-level rise and fall is a first-order control on accommodation providing similar timing on all margins to the sequence record. Tectonic changes due to movement of the crust can overprint the record, resulting in large gaps in the stratigraphic record. Smaller differences in sequences can be attributed to local flexural loading effects, particularly in regions experiencing large-scale progradation. ?? 2006 Geological Society of America.
Garrido-Martín, Diego; Pazos, Florencio
2018-02-27
The exponential accumulation of new sequences in public databases is expected to improve the performance of all the approaches for predicting protein structural and functional features. Nevertheless, this was never assessed or quantified for some widely used methodologies, such as those aimed at detecting functional sites and functional subfamilies in protein multiple sequence alignments. Using raw protein sequences as only input, these approaches can detect fully conserved positions, as well as those with a family-dependent conservation pattern. Both types of residues are routinely used as predictors of functional sites and, consequently, understanding how the sequence content of the databases affects them is relevant and timely. In this work we evaluate how the growth and change with time in the content of sequence databases affect five sequence-based approaches for detecting functional sites and subfamilies. We do that by recreating historical versions of the multiple sequence alignments that would have been obtained in the past based on the database contents at different time points, covering a period of 20 years. Applying the methods to these historical alignments allows quantifying the temporal variation in their performance. Our results show that the number of families to which these methods can be applied sharply increases with time, while their ability to detect potentially functional residues remains almost constant. These results are informative for the methods' developers and final users, and may have implications in the design of new sequencing initiatives.
Continuous aesthetic judgment of image sequences.
Khaw, Mel W; Freedberg, David
2018-05-18
Perceptual judgments are said to be reference-dependent as they change on the basis of recent experiences. Here we quantify sequence effects within two types of aesthetic judgments: (i) individual ratings of single images (during self-paced trials) and (ii) continuous ratings of image sequences. As in the case of known contrast effects, trial-by-trial aesthetic responses are negatively correlated with judgments made toward the preceding image. During continuous judgment, a different type of bias is observed. The onset of change within a sequence introduces a persistent increase in ratings (relative to when the same images are judged in isolation). Furthermore, subjects indicate adjustment patterns and choices that selectively favor sequences that are rich in change. Sequence effects in aesthetic judgments thus differ greatly depending on the continuity and arrangement of presented stimuli. The effects highlighted here are important in understanding sustained aesthetic responses over time, such as those elicited during choreographic and musical arrangements. In contrast, standard measurements of aesthetic responses (over trials) may represent a series of distinct aesthetic experiences (e.g., viewing artworks in a museum). Copyright © 2018 Elsevier B.V. All rights reserved.
Sporadic frame dropping impact on quality perception
NASA Astrophysics Data System (ADS)
Pastrana-Vidal, Ricardo R.; Gicquel, Jean Charles; Colomes, Catherine; Cherifi, Hocine
2004-06-01
Over the past few years there has been an increasing interest in real time video services over packet networks. When considering quality, it is essential to quantify user perception of the received sequence. Severe motion discontinuities are one of the most common degradations in video streaming. The end-user perceives a jerky motion when the discontinuities are uniformly distributed over time and an instantaneous fluidity break is perceived when the motion loss is isolated or irregularly distributed. Bit rate adaptation techniques, transmission errors in the packet networks or restitution strategy could be the origin of this perceived jerkiness. In this paper we present a psychovisual experiment performed to quantify the effect of sporadically dropped pictures on the overall perceived quality. First, the perceptual detection thresholds of generated temporal discontinuities were measured. Then, the quality function was estimated in relation to a single frame dropping for different durations. Finally, a set of tests was performed to quantify the effect of several impairments distributed over time. We have found that the detection thresholds are content, duration and motion dependent. The assessment results show how quality is impaired by a single burst of dropped frames in a 10 sec sequence. The effect of several bursts of discarded frames, irregularly distributed over the time is also discussed.
Quantifying transfer after perceptual-motor sequence learning: how inflexible is implicit learning?
Sanchez, Daniel J; Yarnik, Eric N; Reber, Paul J
2015-03-01
Studies of implicit perceptual-motor sequence learning have often shown learning to be inflexibly tied to the training conditions during learning. Since sequence learning is seen as a model task of skill acquisition, limits on the ability to transfer knowledge from the training context to a performance context indicates important constraints on skill learning approaches. Lack of transfer across contexts has been demonstrated by showing that when task elements are changed following training, this leads to a disruption in performance. These results have typically been taken as suggesting that the sequence knowledge relies on integrated representations across task elements (Abrahamse, Jiménez, Verwey, & Clegg, Psychon Bull Rev 17:603-623, 2010a). Using a relatively new sequence learning task, serial interception sequence learning, three experiments are reported that quantify this magnitude of performance disruption after selectively manipulating individual aspects of motor performance or perceptual information. In Experiment 1, selective disruption of the timing or order of sequential actions was examined using a novel response manipulandum that allowed for separate analysis of these two motor response components. In Experiments 2 and 3, transfer was examined after selective disruption of perceptual information that left the motor response sequence intact. All three experiments provided quantifiable estimates of partial transfer to novel contexts that suggest some level of information integration across task elements. However, the ability to identify quantifiable levels of successful transfer indicates that integration is not all-or-none and that measurement sensitivity is a key in understanding sequence knowledge representations.
Dispositional mindfulness is associated with reduced implicit learning.
Stillman, Chelsea M; Feldman, Halley; Wambach, Caroline G; Howard, James H; Howard, Darlene V
2014-08-01
Behavioral and neuroimaging evidence suggest that mindfulness exerts its salutary effects by disengaging habitual processes supported by subcortical regions and increasing effortful control processes supported by the frontal lobes. Here we investigated whether individual differences in dispositional mindfulness relate to performance on implicit sequence learning tasks in which optimal learning may in fact be impeded by the engagement of effortful control processes. We report results from two studies where participants completed a widely used questionnaire assessing mindfulness and one of two implicit sequence learning tasks. Learning was quantified using two commonly used measures of sequence learning. In both studies we detected a negative relationship between mindfulness and sequence learning, and the relationship was consistent across both learning measures. Our results, the first to show a negative relationship between mindfulness and implicit sequence learning, suggest that the beneficial effects of mindfulness do not extend to all cognitive functions. Copyright © 2014 Elsevier Inc. All rights reserved.
Quantifying transfer after perceptual-motor sequence learning: how inflexible is implicit learning?
Sanchez, Daniel J.; Yarnik, Eric N.
2015-01-01
Studies of implicit perceptual-motor sequence learning have often shown learning to be inflexibly tied to the training conditions during learning. Since sequence learning is seen as a model task of skill acquisition, limits on the ability to transfer knowledge from the training context to a performance context indicates important constraints on skill learning approaches. Lack of transfer across contexts has been demonstrated by showing that when task elements are changed following training, this leads to a disruption in performance. These results have typically been taken as suggesting that the sequence knowledge relies on integrated representations across task elements (Abrahamse, Jiménez, Verwey, & Clegg, Psychon Bull Rev 17:603–623, 2010a). Using a relatively new sequence learning task, serial interception sequence learning, three experiments are reported that quantify this magnitude of performance disruption after selectively manipulating individual aspects of motor performance or perceptual information. In Experiment 1, selective disruption of the timing or order of sequential actions was examined using a novel response manipulandum that allowed for separate analysis of these two motor response components. In Experiments 2 and 3, transfer was examined after selective disruption of perceptual information that left the motor response sequence intact. All three experiments provided quantifiable estimates of partial transfer to novel contexts that suggest some level of information integration across task elements. However, the ability to identify quantifiable levels of successful transfer indicates that integration is not all-or-none and that measurement sensitivity is a key in understanding sequence knowledge representations. PMID:24668505
Wang, Edwin; Zou, Jinfeng; Zaman, Naif; Beitel, Lenore K; Trifiro, Mark; Paliouras, Miltiadis
2013-08-01
Recent tumor genome sequencing confirmed that one tumor often consists of multiple cell subpopulations (clones) which bear different, but related, genetic profiles such as mutation and copy number variation profiles. Thus far, one tumor has been viewed as a whole entity in cancer functional studies. With the advances of genome sequencing and computational analysis, we are able to quantify and computationally dissect clones from tumors, and then conduct clone-based analysis. Emerging technologies such as single-cell genome sequencing and RNA-Seq could profile tumor clones. Thus, we should reconsider how to conduct cancer systems biology studies in the genome sequencing era. We will outline new directions for conducting cancer systems biology by considering that genome sequencing technology can be used for dissecting, quantifying and genetically characterizing clones from tumors. Topics discussed in Part 1 of this review include computationally quantifying of tumor subpopulations; clone-based network modeling, cancer hallmark-based networks and their high-order rewiring principles and the principles of cell survival networks of fast-growing clones. Crown Copyright © 2013. Published by Elsevier Ltd. All rights reserved.
McNamara, Daniel E.; Gee, Lind; Benz, Harley M.; Chapman, Martin
2014-01-01
Ground shaking due to earthquakes in the eastern United States (EUS) is felt at significantly greater distances than in the western United States (WUS) and for some earthquakes it has been shown to display a strong preferential direction. Shaking intensity variation can be due to propagation path effects, source directivity, and/or site amplification. In this paper, we use S and Lg waves recorded from the 2011 central Virginia earthquake and aftershock sequence, in the Central Virginia Seismic Zone, to quantify attenuation as frequency‐dependent Q(f). In support of observations based on shaking intensity, we observe high Q values in the EUS relative to previous studies in the WUS with especially efficient propagation along the structural trend of the Appalachian mountains. Our analysis of Q(f) quantifies the path effects of the northeast‐trending felt distribution previously inferred from the U.S. Geological Survey (USGS) “Did You Feel It” data, historic intensity data, and the asymmetrical distribution of rockfalls and landslides.
HOMFLYPT polynomial is the best quantifier for topological cascades of vortex knots
NASA Astrophysics Data System (ADS)
Ricca, Renzo L.; Liu, Xin
2018-02-01
In this paper we derive and compare numerical sequences obtained by adapted polynomials such as HOMFLYPT, Jones and Alexander-Conway for the topological cascade of vortex torus knots and links that progressively untie by a single reconnection event at a time. Two cases are considered: the alternate sequence of knots and co-oriented links (with positive crossings) and the sequence of two-component links with oppositely oriented components (negative crossings). New recurrence equations are derived and sequences of numerical values are computed. In all cases the adapted HOMFLYPT polynomial proves to be the best quantifier for the topological cascade of torus knots and links.
A systematic comparison of error correction enzymes by next-generation sequencing
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lubock, Nathan B.; Zhang, Di; Sidore, Angus M.
Gene synthesis, the process of assembling genelength fragments from shorter groups of oligonucleotides (oligos), is becoming an increasingly important tool in molecular and synthetic biology. The length, quality and cost of gene synthesis are limited by errors produced during oligo synthesis and subsequent assembly. Enzymatic error correction methods are cost-effective means to ameliorate errors in gene synthesis. Previous analyses of these methods relied on cloning and Sanger sequencing to evaluate their efficiencies, limiting quantitative assessment. Here, we develop a method to quantify errors in synthetic DNA by next-generation sequencing. We analyzed errors in model gene assemblies and systematically compared sixmore » different error correction enzymes across 11 conditions. We find that ErrASE and T7 Endonuclease I are the most effective at decreasing average error rates (up to 5.8-fold relative to the input), whereas MutS is the best for increasing the number of perfect assemblies (up to 25.2-fold). We are able to quantify differential specificities such as ErrASE preferentially corrects C/G transversions whereas T7 Endonuclease I preferentially corrects A/T transversions. More generally, this experimental and computational pipeline is a fast, scalable and extensible way to analyze errors in gene assemblies, to profile error correction methods, and to benchmark DNA synthesis methods.« less
A systematic comparison of error correction enzymes by next-generation sequencing
Lubock, Nathan B.; Zhang, Di; Sidore, Angus M.; ...
2017-08-01
Gene synthesis, the process of assembling genelength fragments from shorter groups of oligonucleotides (oligos), is becoming an increasingly important tool in molecular and synthetic biology. The length, quality and cost of gene synthesis are limited by errors produced during oligo synthesis and subsequent assembly. Enzymatic error correction methods are cost-effective means to ameliorate errors in gene synthesis. Previous analyses of these methods relied on cloning and Sanger sequencing to evaluate their efficiencies, limiting quantitative assessment. Here, we develop a method to quantify errors in synthetic DNA by next-generation sequencing. We analyzed errors in model gene assemblies and systematically compared sixmore » different error correction enzymes across 11 conditions. We find that ErrASE and T7 Endonuclease I are the most effective at decreasing average error rates (up to 5.8-fold relative to the input), whereas MutS is the best for increasing the number of perfect assemblies (up to 25.2-fold). We are able to quantify differential specificities such as ErrASE preferentially corrects C/G transversions whereas T7 Endonuclease I preferentially corrects A/T transversions. More generally, this experimental and computational pipeline is a fast, scalable and extensible way to analyze errors in gene assemblies, to profile error correction methods, and to benchmark DNA synthesis methods.« less
Computational studies of sequence-specific driving forces in peptide self-assembly
NASA Astrophysics Data System (ADS)
Jeon, Joohyun
Peptides are biopolymers made from various sequences of twenty different types of amino acids, connected by peptide bonds. There are practically an infinite number of possible sequences and tremendous possible combinations of peptide-peptide interactions. Recently, an increasing number of studies have shown a stark variety of peptide self-assembled nanomaterials whose detailed structures depend on their sequences and environmental factors; these have end uses in medical and bio-electronic applications, for example. To understand the underlying physics of complex peptide self-assembly processes and to delineate sequence specific effects, in this study, I use various simulation tools spanning all-atom molecular dynamics to simple lattice models and quantify the balance of interactions in the peptide self-assembly processes. In contrast to the existing view that peptides' aggregation propensities are proportional to the net sequence hydrophobicity and inversely proportional to the net charge, I show the more nuanced effects of electrostatic interactions, including the cooperative effects between hydrophobic and electrostatic interactions. Notably, I suggest rather unexpected, yet important roles of entropies in the small scale oligomerization processes. Overall, this study broadens our understanding of the role of thermodynamic driving forces in peptide self-assembly.
msgbsR: An R package for analysing methylation-sensitive restriction enzyme sequencing data.
Mayne, Benjamin T; Leemaqz, Shalem Y; Buckberry, Sam; Rodriguez Lopez, Carlos M; Roberts, Claire T; Bianco-Miotto, Tina; Breen, James
2018-02-01
Genotyping-by-sequencing (GBS) or restriction-site associated DNA marker sequencing (RAD-seq) is a practical and cost-effective method for analysing large genomes from high diversity species. This method of sequencing, coupled with methylation-sensitive enzymes (often referred to as methylation-sensitive restriction enzyme sequencing or MRE-seq), is an effective tool to study DNA methylation in parts of the genome that are inaccessible in other sequencing techniques or are not annotated in microarray technologies. Current software tools do not fulfil all methylation-sensitive restriction sequencing assays for determining differences in DNA methylation between samples. To fill this computational need, we present msgbsR, an R package that contains tools for the analysis of methylation-sensitive restriction enzyme sequencing experiments. msgbsR can be used to identify and quantify read counts at methylated sites directly from alignment files (BAM files) and enables verification of restriction enzyme cut sites with the correct recognition sequence of the individual enzyme. In addition, msgbsR assesses DNA methylation based on read coverage, similar to RNA sequencing experiments, rather than methylation proportion and is a useful tool in analysing differential methylation on large populations. The package is fully documented and available freely online as a Bioconductor package ( https://bioconductor.org/packages/release/bioc/html/msgbsR.html ).
Leakey, Tatiana I; Zielinski, Jerzy; Siegfried, Rachel N; Siegel, Eric R; Fan, Chun-Yang; Cooney, Craig A
2008-06-01
DNA methylation at cytosines is a widely studied epigenetic modification. Methylation is commonly detected using bisulfite modification of DNA followed by PCR and additional techniques such as restriction digestion or sequencing. These additional techniques are either laborious, require specialized equipment, or are not quantitative. Here we describe a simple algorithm that yields quantitative results from analysis of conventional four-dye-trace sequencing. We call this method Mquant and we compare it with the established laboratory method of combined bisulfite restriction assay (COBRA). This analysis of sequencing electropherograms provides a simple, easily applied method to quantify DNA methylation at specific CpG sites.
Effects of rainfall and surface flow on chemical diffusion from soil to runoff water
USDA-ARS?s Scientific Manuscript database
Although basic processes of diffusion and convection have been used to quantify chemical transport from soil to surface runoff, there are little research results actually showing how these processes were affected by rainfall and surface flow. We developed a laboratory flow cell and a sequence of exp...
Moyle, Richard L.; Carvalhais, Lilia C.; Pretorius, Lara-Simone; Nowak, Ekaterina; Subramaniam, Gayathery; Dalton-Morgan, Jessica; Schenk, Peer M.
2017-01-01
Studies investigating the action of small RNAs on computationally predicted target genes require some form of experimental validation. Classical molecular methods of validating microRNA action on target genes are laborious, while approaches that tag predicted target sequences to qualitative reporter genes encounter technical limitations. The aim of this study was to address the challenge of experimentally validating large numbers of computationally predicted microRNA-target transcript interactions using an optimized, quantitative, cost-effective, and scalable approach. The presented method combines transient expression via agroinfiltration of Nicotiana benthamiana leaves with a quantitative dual luciferase reporter system, where firefly luciferase is used to report the microRNA-target sequence interaction and Renilla luciferase is used as an internal standard to normalize expression between replicates. We report the appropriate concentration of N. benthamiana leaf extracts and dilution factor to apply in order to avoid inhibition of firefly LUC activity. Furthermore, the optimal ratio of microRNA precursor expression construct to reporter construct and duration of the incubation period post-agroinfiltration were determined. The optimized dual luciferase assay provides an efficient, repeatable and scalable method to validate and quantify microRNA action on predicted target sequences. The optimized assay was used to validate five predicted targets of rice microRNA miR529b, with as few as six technical replicates. The assay can be extended to assess other small RNA-target sequence interactions, including assessing the functionality of an artificial miRNA or an RNAi construct on a targeted sequence. PMID:28979287
PARRoT- a homology-based strategy to quantify and compare RNA-sequencing from non-model organisms.
Gan, Ruei-Chi; Chen, Ting-Wen; Wu, Timothy H; Huang, Po-Jung; Lee, Chi-Ching; Yeh, Yuan-Ming; Chiu, Cheng-Hsun; Huang, Hsien-Da; Tang, Petrus
2016-12-22
Next-generation sequencing promises the de novo genomic and transcriptomic analysis of samples of interests. However, there are only a few organisms having reference genomic sequences and even fewer having well-defined or curated annotations. For transcriptome studies focusing on organisms lacking proper reference genomes, the common strategy is de novo assembly followed by functional annotation. However, things become even more complicated when multiple transcriptomes are compared. Here, we propose a new analysis strategy and quantification methods for quantifying expression level which not only generate a virtual reference from sequencing data, but also provide comparisons between transcriptomes. First, all reads from the transcriptome datasets are pooled together for de novo assembly. The assembled contigs are searched against NCBI NR databases to find potential homolog sequences. Based on the searched result, a set of virtual transcripts are generated and served as a reference transcriptome. By using the same reference, normalized quantification values including RC (read counts), eRPKM (estimated RPKM) and eTPM (estimated TPM) can be obtained that are comparable across transcriptome datasets. In order to demonstrate the feasibility of our strategy, we implement it in the web service PARRoT. PARRoT stands for Pipeline for Analyzing RNA Reads of Transcriptomes. It analyzes gene expression profiles for two transcriptome sequencing datasets. For better understanding of the biological meaning from the comparison among transcriptomes, PARRoT further provides linkage between these virtual transcripts and their potential function through showing best hits in SwissProt, NR database, assigning GO terms. Our demo datasets showed that PARRoT can analyze two paired-end transcriptomic datasets of approximately 100 million reads within just three hours. In this study, we proposed and implemented a strategy to analyze transcriptomes from non-reference organisms which offers the opportunity to quantify and compare transcriptome profiles through a homolog based virtual transcriptome reference. By using the homolog based reference, our strategy effectively avoids the problems that may cause from inconsistencies among transcriptomes. This strategy will shed lights on the field of comparative genomics for non-model organism. We have implemented PARRoT as a web service which is freely available at http://parrot.cgu.edu.tw .
USDA-ARS?s Scientific Manuscript database
Major whole genome sequencing projects promise to identify rare and causal variants within livestock species; however, the efficient selection of animals for sequencing remains a major problem within these surveys. The goal of this project was to develop a library of high accuracy genetic variants f...
Quantifying and Mitigating the Effect of Preferential Sampling on Phylodynamic Inference
Karcher, Michael D.; Palacios, Julia A.; Bedford, Trevor; Suchard, Marc A.; Minin, Vladimir N.
2016-01-01
Phylodynamics seeks to estimate effective population size fluctuations from molecular sequences of individuals sampled from a population of interest. One way to accomplish this task formulates an observed sequence data likelihood exploiting a coalescent model for the sampled individuals’ genealogy and then integrating over all possible genealogies via Monte Carlo or, less efficiently, by conditioning on one genealogy estimated from the sequence data. However, when analyzing sequences sampled serially through time, current methods implicitly assume either that sampling times are fixed deterministically by the data collection protocol or that their distribution does not depend on the size of the population. Through simulation, we first show that, when sampling times do probabilistically depend on effective population size, estimation methods may be systematically biased. To correct for this deficiency, we propose a new model that explicitly accounts for preferential sampling by modeling the sampling times as an inhomogeneous Poisson process dependent on effective population size. We demonstrate that in the presence of preferential sampling our new model not only reduces bias, but also improves estimation precision. Finally, we compare the performance of the currently used phylodynamic methods with our proposed model through clinically-relevant, seasonal human influenza examples. PMID:26938243
Kono, H; Saven, J G
2001-02-23
Combinatorial experiments provide new ways to probe the determinants of protein folding and to identify novel folding amino acid sequences. These types of experiments, however, are complicated both by enormous conformational complexity and by large numbers of possible sequences. Therefore, a quantitative computational theory would be helpful in designing and interpreting these types of experiment. Here, we present and apply a statistically based, computational approach for identifying the properties of sequences compatible with a given main-chain structure. Protein side-chain conformations are included in an atom-based fashion. Calculations are performed for a variety of similar backbone structures to identify sequence properties that are robust with respect to minor changes in main-chain structure. Rather than specific sequences, the method yields the likelihood of each of the amino acids at preselected positions in a given protein structure. The theory may be used to quantify the characteristics of sequence space for a chosen structure without explicitly tabulating sequences. To account for hydrophobic effects, we introduce an environmental energy that it is consistent with other simple hydrophobicity scales and show that it is effective for side-chain modeling. We apply the method to calculate the identity probabilities of selected positions of the immunoglobulin light chain-binding domain of protein L, for which many variant folding sequences are available. The calculations compare favorably with the experimentally observed identity probabilities.
Quantifying the relationship between sequence and three-dimensional structure conservation in RNA
2010-01-01
Background In recent years, the number of available RNA structures has rapidly grown reflecting the increased interest on RNA biology. Similarly to the studies carried out two decades ago for proteins, which gave the fundamental grounds for developing comparative protein structure prediction methods, we are now able to quantify the relationship between sequence and structure conservation in RNA. Results Here we introduce an all-against-all sequence- and three-dimensional (3D) structure-based comparison of a representative set of RNA structures, which have allowed us to quantitatively confirm that: (i) there is a measurable relationship between sequence and structure conservation that weakens for alignments resulting in below 60% sequence identity, (ii) evolution tends to conserve more RNA structure than sequence, and (iii) there is a twilight zone for RNA homology detection. Discussion The computational analysis here presented quantitatively describes the relationship between sequence and structure for RNA molecules and defines a twilight zone region for detecting RNA homology. Our work could represent the theoretical basis and limitations for future developments in comparative RNA 3D structure prediction. PMID:20550657
Riffle, Michael; Merrihew, Gennifer E; Jaschob, Daniel; Sharma, Vagisha; Davis, Trisha N; Noble, William S; MacCoss, Michael J
2015-11-01
Regulation of protein abundance is a critical aspect of cellular function, organism development, and aging. Alternative splicing may give rise to multiple possible proteoforms of gene products where the abundance of each proteoform is independently regulated. Understanding how the abundances of these distinct gene products change is essential to understanding the underlying mechanisms of many biological processes. Bottom-up proteomics mass spectrometry techniques may be used to estimate protein abundance indirectly by sequencing and quantifying peptides that are later mapped to proteins based on sequence. However, quantifying the abundance of distinct gene products is routinely confounded by peptides that map to multiple possible proteoforms. In this work, we describe a technique that may be used to help mitigate the effects of confounding ambiguous peptides and multiple proteoforms when quantifying proteins. We have applied this technique to visualize the distribution of distinct gene products for the whole proteome across 11 developmental stages of the model organism Caenorhabditis elegans. The result is a large multidimensional dataset for which web-based tools were developed for visualizing how translated gene products change during development and identifying possible proteoforms. The underlying instrument raw files and tandem mass spectra may also be downloaded. The data resource is freely available on the web at http://www.yeastrc.org/wormpes/ . Graphical Abstract ᅟ.
NASA Astrophysics Data System (ADS)
Riffle, Michael; Merrihew, Gennifer E.; Jaschob, Daniel; Sharma, Vagisha; Davis, Trisha N.; Noble, William S.; MacCoss, Michael J.
2015-11-01
Regulation of protein abundance is a critical aspect of cellular function, organism development, and aging. Alternative splicing may give rise to multiple possible proteoforms of gene products where the abundance of each proteoform is independently regulated. Understanding how the abundances of these distinct gene products change is essential to understanding the underlying mechanisms of many biological processes. Bottom-up proteomics mass spectrometry techniques may be used to estimate protein abundance indirectly by sequencing and quantifying peptides that are later mapped to proteins based on sequence. However, quantifying the abundance of distinct gene products is routinely confounded by peptides that map to multiple possible proteoforms. In this work, we describe a technique that may be used to help mitigate the effects of confounding ambiguous peptides and multiple proteoforms when quantifying proteins. We have applied this technique to visualize the distribution of distinct gene products for the whole proteome across 11 developmental stages of the model organism Caenorhabditis elegans. The result is a large multidimensional dataset for which web-based tools were developed for visualizing how translated gene products change during development and identifying possible proteoforms. The underlying instrument raw files and tandem mass spectra may also be downloaded. The data resource is freely available on the web at http://www.yeastrc.org/wormpes/.
In vivo MR detection of fluorine-labeled human MSC using the bSSFP sequence
Ribot, Emeline J; Gaudet, Jeffrey M; Chen, Yuhua; Gilbert, Kyle M; Foster, Paula J
2014-01-01
Mesenchymal stem cells (MSC) are used to restore deteriorated cell environments. There is a need to specifically track these cells following transplantation in order to evaluate different methods of implantation, to follow their migration within the body, and to quantify their accumulation at the target. Cellular magnetic resonance imaging (MRI) using fluorine-based nanoemulsions is a great means to detect these transplanted cells in vivo because of the high specificity for fluorine detection and the capability for precise quantification. This technique, however, has low sensitivity, necessitating improvement in MR sequences. To counteract this issue, the balanced steady-state free precession (bSSFP) imaging sequence can be of great interest due to the high signal-to-noise ratio (SNR). Furthermore, it can be applied to obtain 3D images within short acquisition times. In this paper, bSSFP provided accurate quantification of samples of the perfluorocarbon Cell Sense-labeled cells in vitro. Cell Sense was internalized by human MSC (hMSC) without adverse alterations in cell viability or differentiation into adipocytes/osteocytes. The bSSFP sequence was applied in vivo to track and quantify the signals from both Cell Sense-labeled and iron-labeled hMSC after intramuscular implantation. The fluorine signal was observed to decrease faster and more significantly than the volume of iron-associated voids, which points to the advantage of quantifying the fluorine signal and the complexity of quantifying signal loss due to iron. PMID:24748787
In vivo MR detection of fluorine-labeled human MSC using the bSSFP sequence.
Ribot, Emeline J; Gaudet, Jeffrey M; Chen, Yuhua; Gilbert, Kyle M; Foster, Paula J
2014-01-01
Mesenchymal stem cells (MSC) are used to restore deteriorated cell environments. There is a need to specifically track these cells following transplantation in order to evaluate different methods of implantation, to follow their migration within the body, and to quantify their accumulation at the target. Cellular magnetic resonance imaging (MRI) using fluorine-based nanoemulsions is a great means to detect these transplanted cells in vivo because of the high specificity for fluorine detection and the capability for precise quantification. This technique, however, has low sensitivity, necessitating improvement in MR sequences. To counteract this issue, the balanced steady-state free precession (bSSFP) imaging sequence can be of great interest due to the high signal-to-noise ratio (SNR). Furthermore, it can be applied to obtain 3D images within short acquisition times. In this paper, bSSFP provided accurate quantification of samples of the perfluorocarbon Cell Sense-labeled cells in vitro. Cell Sense was internalized by human MSC (hMSC) without adverse alterations in cell viability or differentiation into adipocytes/osteocytes. The bSSFP sequence was applied in vivo to track and quantify the signals from both Cell Sense-labeled and iron-labeled hMSC after intramuscular implantation. The fluorine signal was observed to decrease faster and more significantly than the volume of iron-associated voids, which points to the advantage of quantifying the fluorine signal and the complexity of quantifying signal loss due to iron.
Taggart, David J.; Camerlengo, Terry L.; Harrison, Jason K.; Sherrer, Shanen M.; Kshetry, Ajay K.; Taylor, John-Stephen; Huang, Kun; Suo, Zucai
2013-01-01
Cellular genomes are constantly damaged by endogenous and exogenous agents that covalently and structurally modify DNA to produce DNA lesions. Although most lesions are mended by various DNA repair pathways in vivo, a significant number of damage sites persist during genomic replication. Our understanding of the mutagenic outcomes derived from these unrepaired DNA lesions has been hindered by the low throughput of existing sequencing methods. Therefore, we have developed a cost-effective high-throughput short oligonucleotide sequencing assay that uses next-generation DNA sequencing technology for the assessment of the mutagenic profiles of translesion DNA synthesis catalyzed by any error-prone DNA polymerase. The vast amount of sequencing data produced were aligned and quantified by using our novel software. As an example, the high-throughput short oligonucleotide sequencing assay was used to analyze the types and frequencies of mutations upstream, downstream and at a site-specifically placed cis–syn thymidine–thymidine dimer generated individually by three lesion-bypass human Y-family DNA polymerases. PMID:23470999
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sullivan, T; Diak, A; Surucu, M
Purpose: The use of MR to plan and evaluate brachytherapy treatment for cervical cancer is increasing given the availability of MR conditional or safe applicators and MRI’s proven superiority to CT for characterizing soft tissue lesions. The titanium applicators, however, cause geometric distortions or imaging artifacts, which reduce the utility of MRI for dosimetry. We sought to quantify the observed volume of the same applicator on a previously optimized T2 sequence in comparison to the conventional T2 sequence and CT obtained for brachytherapy planning. Methods: Prior work with testing in phantoms showed that increases in readout bandwidth yielded reductions inmore » artifact area and distortion measurements even with voxel increases. Following IRB approval, nine patients with titanium tandem & ovoid applicator (Varian Medical Systems) in place were scanned with a standard periprocedural protocol which included sagittal T2 fast spin echo (FSE) acquisition (res 0.98×0.78×4.0 mm{sup 3}; BW 200Hz). An additional T2-weighted FSE sequence (res 0.98×0.98×3–4 mm{sup 3}; BW500Hz) with increased readout bandwidth, readout voxel size, and echo train length was added to the protocol. Volume measurements of the applicator (from tip to cervical stop) were hand-segmented in Velocity AI 3.1 (Velocity Medical Solutions) for the two T2 FSE sequences and a planning CT obtained shortly after MRI. Differences were analyzed using a paired t-test. Results: Average apparent volumes of the applicator on standard T2 sequence, decreased bandwidth T2 sequence and CT were 5.922±1.283 cm{sup 3}, 4.544±1.524 cm3, and 2.304±0.509 cm{sup 3} respectively. Conclusion: Apparent volumes of a brachytherapy applicator can be compared in vivo. The modified sequence results in decreased apparent size of the cervical applicator. Both MR sequence volumes were larger than the planning CT, which was expected. Future work will focus on the diagnostic quality of the new sequence and quantifying any geometric shifts after CT to MRI registration based on anatomical landmarks.« less
Using metabarcoding to reveal and quantify plant-pollinator interactions
Pornon, André; Escaravage, Nathalie; Burrus, Monique; Holota, Hélène; Khimoun, Aurélie; Mariette, Jérome; Pellizzari, Charlène; Iribar, Amaia; Etienne, Roselyne; Taberlet, Pierre; Vidal, Marie; Winterton, Peter; Zinger, Lucie; Andalo, Christophe
2016-01-01
Given the ongoing decline of both pollinators and plants, it is crucial to implement effective methods to describe complex pollination networks across time and space in a comprehensive and high-throughput way. Here we tested if metabarcoding may circumvent the limits of conventional methodologies in detecting and quantifying plant-pollinator interactions. Metabarcoding experiments on pollen DNA mixtures described a positive relationship between the amounts of DNA from focal species and the number of trnL and ITS1 sequences yielded. The study of pollen loads of insects captured in plant communities revealed that as compared to the observation of visits, metabarcoding revealed 2.5 times more plant species involved in plant-pollinator interactions. We further observed a tight positive relationship between the pollen-carrying capacities of insect taxa and the number of trnL and ITS1 sequences. The number of visits received per plant species also positively correlated to the number of their ITS1 and trnL sequences in insect pollen loads. By revealing interactions hard to observe otherwise, metabarcoding significantly enlarges the spatiotemporal observation window of pollination interactions. By providing new qualitative and quantitative information, metabarcoding holds great promise for investigating diverse facets of interactions and will provide a new perception of pollination networks as a whole. PMID:27255732
Quantifying Aluminum Crystal Size Part 2: The Model-Development Sequence
ERIC Educational Resources Information Center
Hjalmarson, Margret; Diefes-Dux, Heidi A.; Bowman, Keith; Zawojewski, Judith S.
2006-01-01
We have designed model-development sequences using a common context to provide authentic problem-solving experiences for first-year students. The model-development sequence takes a model-eliciting activity a step further by engaging students in the exploration and adaptation of a mathematical model (e.g., procedure, algorithm, method) for solving…
CapZyme-Seq Comprehensively Defines Promoter-Sequence Determinants for RNA 5' Capping with NAD.
Vvedenskaya, Irina O; Bird, Jeremy G; Zhang, Yuanchao; Zhang, Yu; Jiao, Xinfu; Barvík, Ivan; Krásný, Libor; Kiledjian, Megerditch; Taylor, Deanne M; Ebright, Richard H; Nickels, Bryce E
2018-05-03
Nucleoside-containing metabolites such as NAD + can be incorporated as 5' caps on RNA by serving as non-canonical initiating nucleotides (NCINs) for transcription initiation by RNA polymerase (RNAP). Here, we report CapZyme-seq, a high-throughput-sequencing method that employs NCIN-decapping enzymes NudC and Rai1 to detect and quantify NCIN-capped RNA. By combining CapZyme-seq with multiplexed transcriptomics, we determine efficiencies of NAD + capping by Escherichia coli RNAP for ∼16,000 promoter sequences. The results define preferred transcription start site (TSS) positions for NAD + capping and define a consensus promoter sequence for NAD + capping: HRRASWW (TSS underlined). By applying CapZyme-seq to E. coli total cellular RNA, we establish that sequence determinants for NCIN capping in vivo match the NAD + -capping consensus defined in vitro, and we identify and quantify NCIN-capped small RNAs (sRNAs). Our findings define the promoter-sequence determinants for NCIN capping with NAD + and provide a general method for analysis of NCIN capping in vitro and in vivo. Copyright © 2018 Elsevier Inc. All rights reserved.
Jiwaji, Meesbah; Sandison, Mairi E.; Reboud, Julien; Stevenson, Ross; Daly, Rónán; Barkess, Gráinne; Faulds, Karen; Kolch, Walter; Graham, Duncan; Girolami, Mark A.; Cooper, Jonathan M.; Pitt, Andrew R.
2014-01-01
Introduction Gene therapy continues to grow as an important area of research, primarily because of its potential in the treatment of disease. One significant area where there is a need for better understanding is in improving the efficiency of oligonucleotide delivery to the cell and indeed, following delivery, the characterization of the effects on the cell. Methods In this report, we compare different transfection reagents as delivery vehicles for gold nanoparticles functionalized with DNA oligonucleotides, and quantify their relative transfection efficiencies. The inhibitory properties of small interfering RNA (siRNA), single-stranded RNA (ssRNA) and single-stranded DNA (ssDNA) sequences targeted to human metallothionein hMT-IIa are also quantified in HeLa cells. Techniques used in this study include fluorescence and confocal microscopy, qPCR and Western analysis. Findings We show that the use of transfection reagents does significantly increase nanoparticle transfection efficiencies. Furthermore, siRNA, ssRNA and ssDNA sequences all have comparable inhibitory properties to ssDNA sequences immobilized onto gold nanoparticles. We also show that functionalized gold nanoparticles can co-localize with autophagosomes and illustrate other factors that can affect data collection and interpretation when performing studies with functionalized nanoparticles. Conclusions The desired outcome for biological knockdown studies is the efficient reduction of a specific target; which we demonstrate by using ssDNA inhibitory sequences targeted to human metallothionein IIa gene transcripts that result in the knockdown of both the mRNA transcript and the target protein. PMID:24926959
Winterfeld, Katrin; Quera, Vicenç; Winterfeld, Tobias; Ganss, Carolina
2018-01-01
Systematics is considered important for effective toothbrushing. A theoretical concept of systematics in toothbrushing and a validated index to quantify it using observational data is suggested. The index consists of three components: completeness (all areas of the dentition reached), isochronicity (all areas brushed equally long) and consistency (avoiding frequent alternations between areas). Toothbrushing should take a sufficient length of time; therefore, this parameter is part of the index value calculation. Quantitative data from video observations were used including the number of changes between areas, number of areas reached, absolute brushing time and brushing time per area. These data were fed into two algorithms that converted the behaviour into two index values (each with values between 0 and 1) and were summed as the Toothbrushing Systematics Index (TSI) value; 0 indicates completely unsystematic and 2 indicates perfectly systematic brushing. The index was developed using theoretical data. The data matrices revealed the highest values when all areas are reached and brushed equally long. Few changes occurred between the areas when the brushing duration was ≥90 s; the lowest values occurred under opposite conditions. Clinical applicability was tested with data from re-analysed videos from an earlier intervention study aiming to establish a pre-defined toothbrushing sequence. Subjects who fully adopted this sequence had a baseline TSI of 1.30±0.26, which increased to 1.74±0.09 after the intervention (p≤0.001). When the participants who only partially adopted the sequence were included, the respective values were 1.25±0.27 and 1.69±0.14 (p≤0.001). The suggested new TS-index can cover a variety of clinically meaningful variations of systematic brushing, validly quantifies the changes in toothbrushing systematics and has discriminative power. PMID:29708989
Enhanced nitrogen removal with spent mushroom compost in a sequencing batch reactor.
Yang, Yunlong; Tao, Xin; Lin, Ershu; Hu, Kaihui
2017-11-01
In order to remove nitrogen effectively from the wastewater with a low C/N ratio, the feasibility of using spent mushroom compost (SMC) hydrolysates as carbon sources for denitrification was investigated in a sequencing batch reactor (SBR). With SMCs supplement, the SBR performance was improved obviously within the 180days of operation. The total nitrogen removal was promoted from 46.9% to 81-89.4%, and no negative impact induced by different SMCs on the SBR system was observed. The abundance of functional genes including amoA, nirS/K, norB and nosZ in the active sludge was quantified by qPCR, and most of them elevated after SMC was fed. 16S rRNA gene high-throughput sequencing showed that the significant change in microbial community not only promoted pollutants removal but also benefited the stability of the reactor. Therefore, SMC could be an extremely promising carbon source used for nitrogen removal due to its cost-effective and efficient characteristics. Copyright © 2017 Elsevier Ltd. All rights reserved.
A force-based, parallel assay for the quantification of protein-DNA interactions.
Limmer, Katja; Pippig, Diana A; Aschenbrenner, Daniela; Gaub, Hermann E
2014-01-01
Analysis of transcription factor binding to DNA sequences is of utmost importance to understand the intricate regulatory mechanisms that underlie gene expression. Several techniques exist that quantify DNA-protein affinity, but they are either very time-consuming or suffer from possible misinterpretation due to complicated algorithms or approximations like many high-throughput techniques. We present a more direct method to quantify DNA-protein interaction in a force-based assay. In contrast to single-molecule force spectroscopy, our technique, the Molecular Force Assay (MFA), parallelizes force measurements so that it can test one or multiple proteins against several DNA sequences in a single experiment. The interaction strength is quantified by comparison to the well-defined rupture stability of different DNA duplexes. As a proof-of-principle, we measured the interaction of the zinc finger construct Zif268/NRE against six different DNA constructs. We could show the specificity of our approach and quantify the strength of the protein-DNA interaction.
Magnetic Moment Quantifications of Small Spherical Objects in MRI
Cheng, Yu-Chung N.; Hsieh, Ching-Yi; Tackett, Ronald; Kokeny, Paul; Regmi, Rajesh Kumar; Lawes, Gavin
2014-01-01
Purpose The purpose of this work is to develop a method for accurately quantifying effective magnetic moments of spherical-like small objects from magnetic resonance imaging (MRI). A standard 3D gradient echo sequence with only one echo time is intended for our approach to measure the effective magnetic moment of a given object of interest. Methods Our method sums over complex MR signals around the object and equates those sums to equations derived from the magnetostatic theory. With those equations, our method is able to determine the center of the object with subpixel precision. By rewriting those equations, the effective magnetic moment of the object becomes the only unknown to be solved. Each quantified effective magnetic moment has an uncertainty that is derived from the error propagation method. If the volume of the object can be measured from spin echo images, the susceptibility difference between the object and its surrounding can be further quantified from the effective magnetic moment. Numerical simulations, a variety of glass beads in phantom studies with different MR imaging parameters from a 1.5 T machine, and measurements from a SQUID (superconducting quantum interference device) based magnetometer have been conducted to test the robustness of our method. Results Quantified effective magnetic moments and susceptibility differences from different imaging parameters and methods all agree with each other within two standard deviations of estimated uncertainties. Conclusion An MRI method is developed to accurately quantify the effective magnetic moment of a given small object of interest. Most results are accurate within 10% of true values and roughly half of the total results are accurate within 5% of true values using very reasonable imaging parameters. Our method is minimally affected by the partial volume, dephasing, and phase aliasing effects. Our next goal is to apply this method to in vivo studies. PMID:25490517
Magnetic moment quantifications of small spherical objects in MRI.
Cheng, Yu-Chung N; Hsieh, Ching-Yi; Tackett, Ronald; Kokeny, Paul; Regmi, Rajesh Kumar; Lawes, Gavin
2015-07-01
The purpose of this work is to develop a method for accurately quantifying effective magnetic moments of spherical-like small objects from magnetic resonance imaging (MRI). A standard 3D gradient echo sequence with only one echo time is intended for our approach to measure the effective magnetic moment of a given object of interest. Our method sums over complex MR signals around the object and equates those sums to equations derived from the magnetostatic theory. With those equations, our method is able to determine the center of the object with subpixel precision. By rewriting those equations, the effective magnetic moment of the object becomes the only unknown to be solved. Each quantified effective magnetic moment has an uncertainty that is derived from the error propagation method. If the volume of the object can be measured from spin echo images, the susceptibility difference between the object and its surrounding can be further quantified from the effective magnetic moment. Numerical simulations, a variety of glass beads in phantom studies with different MR imaging parameters from a 1.5T machine, and measurements from a SQUID (superconducting quantum interference device) based magnetometer have been conducted to test the robustness of our method. Quantified effective magnetic moments and susceptibility differences from different imaging parameters and methods all agree with each other within two standard deviations of estimated uncertainties. An MRI method is developed to accurately quantify the effective magnetic moment of a given small object of interest. Most results are accurate within 10% of true values, and roughly half of the total results are accurate within 5% of true values using very reasonable imaging parameters. Our method is minimally affected by the partial volume, dephasing, and phase aliasing effects. Our next goal is to apply this method to in vivo studies. Copyright © 2015 Elsevier Inc. All rights reserved.
Segovia, Romulo; Shen, Yaoqing; Lujan, Scott A; Jones, Steven J M; Stirling, Peter C
2017-03-07
Gene-gene or gene-drug interactions are typically quantified using fitness as a readout because the data are continuous and easily measured in high throughput. However, to what extent fitness captures the range of other phenotypes that show synergistic effects is usually unknown. Using Saccharomyces cerevisiae and focusing on a matrix of DNA repair mutants and genotoxic drugs, we quantify 76 gene-drug interactions based on both mutation rate and fitness and find that these parameters are not connected. Independent of fitness defects, we identified six cases of synthetic hypermutation, where the combined effect of the drug and mutant on mutation rate was greater than predicted. One example occurred when yeast lacking RA D1 were exposed to cisplatin, and we characterized this interaction using whole-genome sequencing. Our sequencing results indicate mutagenesis by cisplatin in rad1 Δ cells appeared to depend almost entirely on interstrand cross-links at GpCpN motifs. Interestingly, our data suggest that the following base on the template strand dictates the addition of the mutated base. This result differs from cisplatin mutation signatures in XPF-deficient Caenorhabditis elegans and supports a model in which translesion synthesis polymerases perform a slippage and realignment extension across from the damaged base. Accordingly, DNA polymerase ζ activity was essential for mutagenesis in cisplatin-treated rad1 Δ cells. Together these data reveal the potential to gain new mechanistic insights from nonfitness measures of gene-drug interactions and extend the use of mutation accumulation and whole-genome sequencing analysis to define DNA repair mechanisms.
Zemla, Adam T; Lang, Dorothy M; Kostova, Tanya; Andino, Raul; Ecale Zhou, Carol L
2011-06-02
Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory--still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could help overcome these difficulties by facilitating the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV (structure-alignment sequence variability), a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus, and we demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique, or that share structural similarity with proteins that would be considered distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local structural alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein. StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected residues at a given sequence position. StralSV is provided as a web service at http://proteinmodel.org/AS2TS/STRALSV/.
Method for identifying and quantifying nucleic acid sequence aberrations
Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.
1998-01-01
A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.
Method for identifying and quantifying nucleic acid sequence aberrations
Lucas, J.N.; Straume, T.; Bogen, K.T.
1998-07-21
A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.
Niland, Courtney N.; Jankowsky, Eckhard; Harris, Michael E.
2016-01-01
Quantification of the specificity of RNA binding proteins and RNA processing enzymes is essential to understanding their fundamental roles in biological processes. High Throughput Sequencing Kinetics (HTS-Kin) uses high throughput sequencing and internal competition kinetics to simultaneously monitor the processing rate constants of thousands of substrates by RNA processing enzymes. This technique has provided unprecedented insight into the substrate specificity of the tRNA processing endonuclease ribonuclease P. Here, we investigate the accuracy and robustness of measurements associated with each step of the HTS-Kin procedure. We examine the effect of substrate concentration on the observed rate constant, determine the optimal kinetic parameters, and provide guidelines for reducing error in amplification of the substrate population. Importantly, we find that high-throughput sequencing, and experimental reproducibility contribute their own sources of error, and these are the main sources of imprecision in the quantified results when otherwise optimized guidelines are followed. PMID:27296633
76 FR 4920 - Government-Owned Inventions; Availability for Licensing
Federal Register 2010, 2011, 2012, 2013, 2014
2011-01-27
... appear to be distinct from each other. Available for licensing is a Plk1 ELISA assay using peptide... binding property, an easy and reliable ELISA assay has been developed to quantify Plk1 expression levels... sequence. ELISA assay to quantify Plk1 expression and kinase activity. Advantages: Rapid, highly sensitive...
Der Sarkissian, Clio; Allentoft, Morten E.; Ávila-Arcos, María C.; Barnett, Ross; Campos, Paula F.; Cappellini, Enrico; Ermini, Luca; Fernández, Ruth; da Fonseca, Rute; Ginolhac, Aurélien; Hansen, Anders J.; Jónsson, Hákon; Korneliussen, Thorfinn; Margaryan, Ashot; Martin, Michael D.; Moreno-Mayar, J. Víctor; Raghavan, Maanasa; Rasmussen, Morten; Velasco, Marcela Sandoval; Schroeder, Hannes; Schubert, Mikkel; Seguin-Orlando, Andaine; Wales, Nathan; Gilbert, M. Thomas P.; Willerslev, Eske; Orlando, Ludovic
2015-01-01
The past decade has witnessed a revolution in ancient DNA (aDNA) research. Although the field's focus was previously limited to mitochondrial DNA and a few nuclear markers, whole genome sequences from the deep past can now be retrieved. This breakthrough is tightly connected to the massive sequence throughput of next generation sequencing platforms and the ability to target short and degraded DNA molecules. Many ancient specimens previously unsuitable for DNA analyses because of extensive degradation can now successfully be used as source materials. Additionally, the analytical power obtained by increasing the number of sequence reads to billions effectively means that contamination issues that have haunted aDNA research for decades, particularly in human studies, can now be efficiently and confidently quantified. At present, whole genomes have been sequenced from ancient anatomically modern humans, archaic hominins, ancient pathogens and megafaunal species. Those have revealed important functional and phenotypic information, as well as unexpected adaptation, migration and admixture patterns. As such, the field of aDNA has entered the new era of genomics and has provided valuable information when testing specific hypotheses related to the past. PMID:25487338
High-Frame-Rate Doppler Ultrasound Using a Repeated Transmit Sequence
Podkowa, Anthony S.; Oelze, Michael L.; Ketterling, Jeffrey A.
2018-01-01
The maximum detectable velocity of high-frame-rate color flow Doppler ultrasound is limited by the imaging frame rate when using coherent compounding techniques. Traditionally, high quality ultrasonic images are produced at a high frame rate via coherent compounding of steered plane wave reconstructions. However, this compounding operation results in an effective downsampling of the slow-time signal, thereby artificially reducing the frame rate. To alleviate this effect, a new transmit sequence is introduced where each transmit angle is repeated in succession. This transmit sequence allows for direct comparison between low resolution, pre-compounded frames at a short time interval in ways that are resistent to sidelobe motion. Use of this transmit sequence increases the maximum detectable velocity by a scale factor of the transmit sequence length. The performance of this new transmit sequence was evaluated using a rotating cylindrical phantom and compared with traditional methods using a 15-MHz linear array transducer. Axial velocity estimates were recorded for a range of ±300 mm/s and compared to the known ground truth. Using these new techniques, the root mean square error was reduced from over 400 mm/s to below 50 mm/s in the high-velocity regime compared to traditional techniques. The standard deviation of the velocity estimate in the same velocity range was reduced from 250 mm/s to 30 mm/s. This result demonstrates the viability of the repeated transmit sequence methods in detecting and quantifying high-velocity flow. PMID:29910966
The bench scientist's guide to RNA-Seq analysis
USDA-ARS?s Scientific Manuscript database
RNA sequencing (RNA-Seq) is emerging as a highly accurate method to quantify transcript abundance. However, analyses of the large data sets obtained by sequencing the entire transcriptome of organisms have generally been performed by bioinformatic specialists. Here we outline a methods strategy desi...
Use of the alpha shape to quantify finite helical axis dispersion during simulated spine movements.
McLachlin, Stewart D; Bailey, Christopher S; Dunning, Cynthia E
2016-01-04
In biomechanical studies examining joint kinematics the most common measurement is range of motion (ROM), yet other techniques, such as the finite helical axis (FHA), may elucidate the changes in the 3D motion pathology more effectively. One of the deficiencies with the FHA technique is in quantifying the axes generated throughout a motion sequence. This study attempted to solve this issue via a computational geometric technique known as the alpha shape, which bounds a set of point data within a closed boundary similar to a convex hull. The purpose of this study was to use the alpha shape as an additional tool to visualize and quantify FHA dispersion between intact and injured cadaveric spine movements and compare these changes to the gold-standard ROM measurements. Flexion-extension, axial rotation, and lateral bending were simulated with five C5-C6 motion segments using a spinal loading simulator and Optotrak motion tracking system. Specimens were first tested intact followed by a simulated injury model. ROM and the FHAs were calculated post-hoc, with alpha shapes and convex hulls generated from the anatomic planar intercept points of the FHAs. While both ROM and the boundary shape areas increased with injury (p<0.05), no consistent geometric trends in the alpha shape growth were identified. The alpha shape area was sensitive to the alpha value chosen and values examined below 2.5 created more than one closed boundary. Ultimately, the alpha shape presents as a useful technique to quantify sequences of joint kinematics described by scatter plots such as FHA intercept data. Copyright © 2015. Published by Elsevier Ltd.
Leveraging Distant Relatedness to Quantify Human Mutation and Gene-Conversion Rates
Palamara, Pier Francesco; Francioli, Laurent C.; Wilton, Peter R.; Genovese, Giulio; Gusev, Alexander; Finucane, Hilary K.; Sankararaman, Sriram; Sunyaev, Shamil R.; de Bakker, Paul I.W.; Wakeley, John; Pe’er, Itsik; Price, Alkes L.
2015-01-01
The rate at which human genomes mutate is a central biological parameter that has many implications for our ability to understand demographic and evolutionary phenomena. We present a method for inferring mutation and gene-conversion rates by using the number of sequence differences observed in identical-by-descent (IBD) segments together with a reconstructed model of recent population-size history. This approach is robust to, and can quantify, the presence of substantial genotyping error, as validated in coalescent simulations. We applied the method to 498 trio-phased sequenced Dutch individuals and inferred a point mutation rate of 1.66 × 10−8 per base per generation and a rate of 1.26 × 10−9 for <20 bp indels. By quantifying how estimates varied as a function of allele frequency, we inferred the probability that a site is involved in non-crossover gene conversion as 5.99 × 10−6. We found that recombination does not have observable mutagenic effects after gene conversion is accounted for and that local gene-conversion rates reflect recombination rates. We detected a strong enrichment of recent deleterious variation among mismatching variants found within IBD regions and observed summary statistics of local sharing of IBD segments to closely match previously proposed metrics of background selection; however, we found no significant effects of selection on our mutation-rate estimates. We detected no evidence of strong variation of mutation rates in a number of genomic annotations obtained from several recent studies. Our analysis suggests that a mutation-rate estimate higher than that reported by recent pedigree-based studies should be adopted in the context of DNA-based demographic reconstruction. PMID:26581902
Evaluation of a compact tinnitus therapy by electrophysiological tinnitus decompensation measures.
Low, Yin Fen; Argstatter, Heike; Bolay, Hans Volker; Strauss, Daniel J
2008-01-01
Large-scale neural correlates of the tinnitus decompensation have been identified by using wavelet phase stability criteria of single sweep sequences of auditory late responses (ALRs). Our previous work showed that the synchronization stability in ALR sequences might be used for objective quantification of the tinnitus decompensation and attention which link to Jastreboff tinnitus model. In this study, we intend to provide an objective evaluation for quantifying the effect of music therapy in tinnitus patients. We examined neural correlates of the attentional mechanism in single sweep sequences of ALRs in chronic tinnitus patients who underwent compact therapy course by using the maximum entropy auditory paradigm. Results by our measure showed that the extent of differentiation between attended and unattended conditions improved significantly after the therapy. It is concluded that the wavelet phase synchronization stability of ALRs single sweeps can be used for the objective evaluation of tinnitus therapies, in this case the compact tinnitus music therapy.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zemla, A; Lang, D; Kostova, T
2010-11-29
Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory - still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could overcome these difficulties and facilitatemore » the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV, a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus and demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique or that shared structural similarity with structures that are distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein. StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected residues at a given sequence position.« less
Universal Entropy of Word Ordering Across Linguistic Families
Montemurro, Marcelo A.; Zanette, Damián H.
2011-01-01
Background The language faculty is probably the most distinctive feature of our species, and endows us with a unique ability to exchange highly structured information. In written language, information is encoded by the concatenation of basic symbols under grammatical and semantic constraints. As is also the case in other natural information carriers, the resulting symbolic sequences show a delicate balance between order and disorder. That balance is determined by the interplay between the diversity of symbols and by their specific ordering in the sequences. Here we used entropy to quantify the contribution of different organizational levels to the overall statistical structure of language. Methodology/Principal Findings We computed a relative entropy measure to quantify the degree of ordering in word sequences from languages belonging to several linguistic families. While a direct estimation of the overall entropy of language yielded values that varied for the different families considered, the relative entropy quantifying word ordering presented an almost constant value for all those families. Conclusions/Significance Our results indicate that despite the differences in the structure and vocabulary of the languages analyzed, the impact of word ordering in the structure of language is a statistical linguistic universal. PMID:21603637
Whale song analyses using bioinformatics sequence analysis approaches
NASA Astrophysics Data System (ADS)
Chen, Yian A.; Almeida, Jonas S.; Chou, Lien-Siang
2005-04-01
Animal songs are frequently analyzed using discrete hierarchical units, such as units, themes and songs. Because animal songs and bio-sequences may be understood as analogous, bioinformatics analysis tools DNA/protein sequence alignment and alignment-free methods are proposed to quantify the theme similarities of the songs of false killer whales recorded off northeast Taiwan. The eighteen themes with discrete units that were identified in an earlier study [Y. A. Chen, masters thesis, University of Charleston, 2001] were compared quantitatively using several distance metrics. These metrics included the scores calculated using the Smith-Waterman algorithm with the repeated procedure; the standardized Euclidian distance and the angle metrics based on word frequencies. The theme classifications based on different metrics were summarized and compared in dendrograms using cluster analyses. The results agree with earlier classifications derived by human observation qualitatively. These methods further quantify the similarities among themes. These methods could be applied to the analyses of other animal songs on a larger scale. For instance, these techniques could be used to investigate song evolution and cultural transmission quantifying the dissimilarities of humpback whale songs across different seasons, years, populations, and geographic regions. [Work supported by SC Sea Grant, and Ilan County Government, Taiwan.
NASA Astrophysics Data System (ADS)
Ciaramello, Frank M.; Hemami, Sheila S.
2009-02-01
Communication of American Sign Language (ASL) over mobile phones would be very beneficial to the Deaf community. ASL video encoded to achieve the rates provided by current cellular networks must be heavily compressed and appropriate assessment techniques are required to analyze the intelligibility of the compressed video. As an extension to a purely spatial measure of intelligibility, this paper quantifies the effect of temporal compression artifacts on sign language intelligibility. These artifacts can be the result of motion-compensation errors that distract the observer or frame rate reductions. They reduce the the perception of smooth motion and disrupt the temporal coherence of the video. Motion-compensation errors that affect temporal coherence are identified by measuring the block-level correlation between co-located macroblocks in adjacent frames. The impact of frame rate reductions was quantified through experimental testing. A subjective study was performed in which fluent ASL participants rated the intelligibility of sequences encoded at a range of 5 different frame rates and with 3 different levels of distortion. The subjective data is used to parameterize an objective intelligibility measure which is highly correlated with subjective ratings at multiple frame rates.
Iqbal, Zohaib; Wilson, Neil E; Thomas, M Albert
2017-07-24
1 H Magnetic Resonance Spectroscopic imaging (SI) is a powerful tool capable of investigating metabolism in vivo from mul- tiple regions. However, SI techniques are time consuming, and are therefore difficult to implement clinically. By applying non-uniform sampling (NUS) and compressed sensing (CS) reconstruction, it is possible to accelerate these scans while re- taining key spectral information. One recently developed method that utilizes this type of acceleration is the five-dimensional echo planar J-resolved spectroscopic imaging (5D EP-JRESI) sequence, which is capable of obtaining two-dimensional (2D) spectra from three spatial dimensions. The prior-knowledge fitting (ProFit) algorithm is typically used to quantify 2D spectra in vivo, however the effects of NUS and CS reconstruction on the quantitation results are unknown. This study utilized a simulated brain phantom to investigate the errors introduced through the acceleration methods. Errors (normalized root mean square error >15%) were found between metabolite concentrations after twelve-fold acceleration for several low concentra- tion (<2 mM) metabolites. The Cramér Rao lower bound% (CRLB%) values, which are typically used for quality control, were not reflective of the increased quantitation error arising from acceleration. Finally, occipital white (OWM) and gray (OGM) human brain matter were quantified in vivo using the 5D EP-JRESI sequence with eight-fold acceleration.
Quantifying Genome Editing Outcomes at Endogenous Loci using SMRT Sequencing
Clark, Joseph; Punjya, Niraj; Sebastiano, Vittorio; Bao, Gang; Porteus, Matthew H
2014-01-01
SUMMARY Targeted genome editing with engineered nucleases has transformed the ability to introduce precise sequence modifications at almost any site within the genome. A major obstacle to probing the efficiency and consequences of genome editing is that no existing method enables the frequency of different editing events to be simultaneously measured across a cell population at any endogenous genomic locus. We have developed a novel method for quantifying individual genome editing outcomes at any site of interest using single molecule real time (SMRT) DNA sequencing. We show that this approach can be applied at various loci, using multiple engineered nuclease platforms including TALENs, RNA guided endonucleases (CRISPR/Cas9), and ZFNs, and in different cell lines to identify conditions and strategies in which the desired engineering outcome has occurred. This approach facilitates the evaluation of new gene editing technologies and permits sensitive quantification of editing outcomes in almost every experimental system used. PMID:24685129
De Pittà, Cristiano; Bertolucci, Cristiano; Mazzotta, Gabriella M; Bernante, Filippo; Rizzo, Giorgia; De Nardi, Barbara; Pallavicini, Alberto; Lanfranchi, Gerolamo; Costa, Rodolfo
2008-01-01
Background Little is known about the genome sequences of Euphausiacea (krill) although these crustaceans are abundant components of the pelagic ecosystems in all oceans and used for aquaculture and pharmaceutical industry. This study reports the results of an expressed sequence tag (EST) sequencing project from different tissues of Euphausia superba (the Antarctic krill). Results We have constructed and sequenced five cDNA libraries from different Antarctic krill tissues: head, abdomen, thoracopods and photophores. We have identified 1.770 high-quality ESTs which were assembled into 216 overlapping clusters and 801 singletons resulting in a total of 1.017 non-redundant sequences. Quantitative RT-PCR analysis was performed to quantify and validate the expression levels of ten genes presenting different EST countings in krill tissues. In addition, bioinformatic screening of the non-redundant E. superba sequences identified 69 microsatellite containing ESTs. Clusters, consensuses and related similarity and gene ontology searches were organized in a dedicated E. superba database . Conclusion We defined the first tissue transcriptional signatures of E. superba based on functional categorization among the examined tissues. The analyses of annotated transcripts showed a higher similarity with genes from insects with respect to Malacostraca possibly as an effect of the limited number of Malacostraca sequences in the public databases. Our catalogue provides for the first time a genomic tool to investigate the biology of the Antarctic krill. PMID:18226200
Apigenin Impacts the Growth of the Gut Microbiota and Alters the Gene Expression of Enterococcus.
Wang, Minqian; Firrman, Jenni; Zhang, Liqing; Arango-Argoty, Gustavo; Tomasula, Peggy; Liu, LinShu; Xiao, Weidong; Yam, Kit
2017-08-03
Apigenin is a major dietary flavonoid with many bioactivities, widely distributed in plants. Apigenin reaches the colon region intact and interacts there with the human gut microbiota, however there is little research on how apigenin affects the gut bacteria. This study investigated the effect of pure apigenin on human gut bacteria, at both the single strain and community levels. The effect of apigenin on the single gut bacteria strains Bacteroides galacturonicus , Bifidobacterium catenulatum , Lactobacillus rhamnosus GG, and Enterococcus caccae , was examined by measuring their anaerobic growth profiles. The effect of apigenin on a gut microbiota community was studied by culturing a fecal inoculum under in vitro conditions simulating the human ascending colon. 16S rRNA gene sequencing and GC-MS analysis quantified changes in the community structure. Single molecule RNA sequencing was used to reveal the response of Enterococcus caccae to apigenin. Enterococcus caccae was effectively inhibited by apigenin when cultured alone, however, the genus Enterococcus was enhanced when tested in a community setting. Single molecule RNA sequencing found that Enterococcus caccae responded to apigenin by up-regulating genes involved in DNA repair, stress response, cell wall synthesis, and protein folding. Taken together, these results demonstrate that apigenin affects both the growth and gene expression of Enterococcus caccae .
Delaney, Nigel F.; Marx, Christopher J.
2012-01-01
Understanding evolutionary dynamics within microbial populations requires the ability to accurately follow allele frequencies through time. Here we present a rapid, cost-effective method (FREQ-Seq) that leverages Illumina next-generation sequencing for localized, quantitative allele frequency detection. Analogous to RNA-Seq, FREQ-Seq relies upon counts from the >105 reads generated per locus per time-point to determine allele frequencies. Loci of interest are directly amplified from a mixed population via two rounds of PCR using inexpensive, user-designed oligonucleotides and a bar-coded bridging primer system that can be regenerated in-house. The resulting bar-coded PCR products contain the adapters needed for Illumina sequencing, eliminating further library preparation. We demonstrate the utility of FREQ-Seq by determining the order and dynamics of beneficial alleles that arose as a microbial population, founded with an engineered strain of Methylobacterium, evolved to grow on methanol. Quantifying allele frequencies with minimal bias down to 1% abundance allowed effective analysis of SNPs, small in-dels and insertions of transposable elements. Our data reveal large-scale clonal interference during the early stages of adaptation and illustrate the utility of FREQ-Seq as a cost-effective tool for tracking allele frequencies in populations. PMID:23118913
Jakubec, David; Laskowski, Roman A.; Vondrasek, Jiri
2016-01-01
Decades of intensive experimental studies of the recognition of DNA sequences by proteins have provided us with a view of a diverse and complicated world in which few to no features are shared between individual DNA-binding protein families. The originally conceived direct readout of DNA residue sequences by amino acid side chains offers very limited capacity for sequence recognition, while the effects of the dynamic properties of the interacting partners remain difficult to quantify and almost impossible to generalise. In this work we investigated the energetic characteristics of all DNA residue—amino acid side chain combinations in the conformations found at the interaction interface in a very large set of protein—DNA complexes by the means of empirical potential-based calculations. General specificity-defining criteria were derived and utilised to look beyond the binding motifs considered in previous studies. Linking energetic favourability to the observed geometrical preferences, our approach reveals several additional amino acid motifs which can distinguish between individual DNA bases. Our results remained valid in environments with various dielectric properties. PMID:27384774
Vasudevan, John M; Logan, Andrew; Shultz, Rebecca; Koval, Jeffrey J; Roh, Eugene Y; Fredericson, Michael
2016-01-01
Aim. The purpose of this pilot study is to use surface electromyography to determine an individual athlete's typical muscle onset activation sequence when performing a golf or tennis forward swing and to use the method to assess to what degree the sequence is reproduced with common conditioning exercises and a machine designed for this purpose. Methods. Data for 18 healthy male subjects were collected for 15 muscles of the trunk and lower extremities. Data were filtered and processed to determine the average onset of muscle activation for each motion. A Spearman correlation estimated congruence of activation order between the swing and each exercise. Correlations of each group were pooled with 95% confidence intervals using a random effects meta-analytic strategy. Results. The averaged sequences differed among each athlete tested, but pooled correlations demonstrated a positive association between each exercise and the participants' natural muscle onset activation sequence. Conclusion. The selected training exercises and Turning Point™ device all partially reproduced our athletes' averaged muscle onset activation sequences for both sports. The results support consideration of a larger, adequately powered study using this method to quantify to what degree each of the selected exercises is appropriate for use in both golf and tennis.
Shultz, Rebecca; Fredericson, Michael
2016-01-01
Aim. The purpose of this pilot study is to use surface electromyography to determine an individual athlete's typical muscle onset activation sequence when performing a golf or tennis forward swing and to use the method to assess to what degree the sequence is reproduced with common conditioning exercises and a machine designed for this purpose. Methods. Data for 18 healthy male subjects were collected for 15 muscles of the trunk and lower extremities. Data were filtered and processed to determine the average onset of muscle activation for each motion. A Spearman correlation estimated congruence of activation order between the swing and each exercise. Correlations of each group were pooled with 95% confidence intervals using a random effects meta-analytic strategy. Results. The averaged sequences differed among each athlete tested, but pooled correlations demonstrated a positive association between each exercise and the participants' natural muscle onset activation sequence. Conclusion. The selected training exercises and Turning Point™ device all partially reproduced our athletes' averaged muscle onset activation sequences for both sports. The results support consideration of a larger, adequately powered study using this method to quantify to what degree each of the selected exercises is appropriate for use in both golf and tennis. PMID:27403454
Zhang, Huimin; He, Hongkui; Yu, Xiujuan; Xu, Zhaohui; Zhang, Zhizhou
2016-11-01
It remains an unsolved problem to quantify a natural microbial community by rapidly and conveniently measuring multiple species with functional significance. Most widely used high throughput next-generation sequencing methods can only generate information mainly for genus-level taxonomic identification and quantification, and detection of multiple species in a complex microbial community is still heavily dependent on approaches based on near full-length ribosome RNA gene or genome sequence information. In this study, we used near full-length rRNA gene library sequencing plus Primer-Blast to design species-specific primers based on whole microbial genome sequences. The primers were intended to be specific at the species level within relevant microbial communities, i.e., a defined genomics background. The primers were tested with samples collected from the Daqu (also called fermentation starters) and pit mud of a traditional Chinese liquor production plant. Sixteen pairs of primers were found to be suitable for identification of individual species. Among them, seven pairs were chosen to measure the abundance of microbial species through quantitative PCR. The combination of near full-length ribosome RNA gene library sequencing and Primer-Blast may represent a broadly useful protocol to quantify multiple species in complex microbial population samples with species-specific primers.
Bauer, Bianca S.; Forsyth, George W.; Sandmeyer, Lynne S.; Grahn, Bruce H.
2011-01-01
Mitochondrial transcription factor A (Tfam) has been implicated in the pathogenesis of retinal dysplasia in miniature schnauzer dogs and it has been proposed that affected dogs have altered mitochondrial numbers, size, and morphology. To test these hypotheses the Tfam gene of affected and normal miniature schnauzer dogs with retinal dysplasia was sequenced and lymphocyte mitochondria were quantified, measured, and the morphology was compared in normal and affected dogs using transmission electron microscopy. For Tfam sequencing, retina, retinal pigment epithelium (RPE), and whole blood samples were collected. Total RNA was isolated from the retina and RPE and reverse transcribed to make cDNA. Genomic DNA was extracted from white blood cell pellets obtained from the whole blood samples. The Tfam coding sequence, 5′ promoter region, intron1 and the 3′ non-coding sequence of normal and affected dogs were amplified using polymerase chain reaction (PCR), cloned and sequenced. For electron microscopy, lymphocytes from affected and normal dogs were photographed and the mitochondria within each cross-section were identified, quantified, and the mitochondrial area (μm2) per lymphocyte cross-section was calculated. Lastly, using a masked technique, mitochondrial morphology was compared between the 2 groups. Sequencing of the miniature schnauzer Tfam gene revealed no functional sequence variation between affected and normal dogs. Lymphocyte and mitochondrial area, mitochondrial quantification, and morphology assessment also revealed no significant difference between the 2 groups. Further investigation into other candidate genes or factors causing retinal dysplasia in the miniature schnauzer is warranted. PMID:21731185
Bauer, Bianca S; Forsyth, George W; Sandmeyer, Lynne S; Grahn, Bruce H
2011-04-01
Mitochondrial transcription factor A (Tfam) has been implicated in the pathogenesis of retinal dysplasia in miniature schnauzer dogs and it has been proposed that affected dogs have altered mitochondrial numbers, size, and morphology. To test these hypotheses the Tfam gene of affected and normal miniature schnauzer dogs with retinal dysplasia was sequenced and lymphocyte mitochondria were quantified, measured, and the morphology was compared in normal and affected dogs using transmission electron microscopy. For Tfam sequencing, retina, retinal pigment epithelium (RPE), and whole blood samples were collected. Total RNA was isolated from the retina and RPE and reverse transcribed to make cDNA. Genomic DNA was extracted from white blood cell pellets obtained from the whole blood samples. The Tfam coding sequence, 5' promoter region, intron1 and the 3' non-coding sequence of normal and affected dogs were amplified using polymerase chain reaction (PCR), cloned and sequenced. For electron microscopy, lymphocytes from affected and normal dogs were photographed and the mitochondria within each cross-section were identified, quantified, and the mitochondrial area (μm²) per lymphocyte cross-section was calculated. Lastly, using a masked technique, mitochondrial morphology was compared between the 2 groups. Sequencing of the miniature schnauzer Tfam gene revealed no functional sequence variation between affected and normal dogs. Lymphocyte and mitochondrial area, mitochondrial quantification, and morphology assessment also revealed no significant difference between the 2 groups. Further investigation into other candidate genes or factors causing retinal dysplasia in the miniature schnauzer is warranted.
Dynamic multiplexed analysis method using ion mobility spectrometer
Belov, Mikhail E [Richland, WA
2010-05-18
A method for multiplexed analysis using ion mobility spectrometer in which the effectiveness and efficiency of the multiplexed method is optimized by automatically adjusting rates of passage of analyte materials through an IMS drift tube during operation of the system. This automatic adjustment is performed by the IMS instrument itself after determining the appropriate levels of adjustment according to the method of the present invention. In one example, the adjustment of the rates of passage for these materials is determined by quantifying the total number of analyte molecules delivered to the ion trap in a preselected period of time, comparing this number to the charge capacity of the ion trap, selecting a gate opening sequence; and implementing the selected gate opening sequence to obtain a preselected rate of analytes within said IMS drift tube.
The genome sequence of a widespread apex Predator, the golden eagle (Aquila chrysaetos)
Jacqueline M. Doyle; Todd E. Katzner; Peter H. Bloom; Yanzhu Ji; Bhagya K. Wijayawardena; J. Andrew DeWoody; Ludovic Orlando
2014-01-01
Biologists routinely use molecular markers to identify conservation units, to quantify genetic connectivity, to estimate population sizes, and to identify targets of selection. Many imperiled eagle populations require such efforts and would benefit from enhanced genomic resources. We sequenced, assembled, and annotated the first eagle genome using DNA from a male...
Quantifying the evolution of individual scientific impact.
Sinatra, Roberta; Wang, Dashun; Deville, Pierre; Song, Chaoming; Barabási, Albert-László
2016-11-04
Despite the frequent use of numerous quantitative indicators to gauge the professional impact of a scientist, little is known about how scientific impact emerges and evolves in time. Here, we quantify the changes in impact and productivity throughout a career in science, finding that impact, as measured by influential publications, is distributed randomly within a scientist's sequence of publications. This random-impact rule allows us to formulate a stochastic model that uncouples the effects of productivity, individual ability, and luck and unveils the existence of universal patterns governing the emergence of scientific success. The model assigns a unique individual parameter Q to each scientist, which is stable during a career, and it accurately predicts the evolution of a scientist's impact, from the h-index to cumulative citations, and independent recognitions, such as prizes. Copyright © 2016, American Association for the Advancement of Science.
Automated regional analysis of B-mode ultrasound images of skeletal muscle movement
Darby, John; Costen, Nicholas; Loram, Ian D.
2012-01-01
To understand the functional significance of skeletal muscle anatomy, a method of quantifying local shape changes in different tissue structures during dynamic tasks is required. Taking advantage of the good spatial and temporal resolution of B-mode ultrasound imaging, we describe a method of automatically segmenting images into fascicle and aponeurosis regions and tracking movement of features, independently, in localized portions of each tissue. Ultrasound images (25 Hz) of the medial gastrocnemius muscle were collected from eight participants during ankle joint rotation (2° and 20°), isometric contractions (1, 5, and 50 Nm), and deep knee bends. A Kanade-Lucas-Tomasi feature tracker was used to identify and track any distinctive and persistent features within the image sequences. A velocity field representation of local movement was then found and subdivided between fascicle and aponeurosis regions using segmentations from a multiresolution active shape model (ASM). Movement in each region was quantified by interpolating the effect of the fields on a set of probes. ASM segmentation results were compared with hand-labeled data, while aponeurosis and fascicle movement were compared with results from a previously documented cross-correlation approach. ASM provided good image segmentations (<1 mm average error), with fully automatic initialization possible in sequences from seven participants. Feature tracking provided similar length change results to the cross-correlation approach for small movements, while outperforming it in larger movements. The proposed method provides the potential to distinguish between active and passive changes in muscle shape and model strain distributions during different movements/conditions and quantify nonhomogeneous strain along aponeuroses. PMID:22033532
Lossy compression of quality scores in genomic data.
Cánovas, Rodrigo; Moffat, Alistair; Turpin, Andrew
2014-08-01
Next-generation sequencing technologies are revolutionizing medicine. Data from sequencing technologies are typically represented as a string of bases, an associated sequence of per-base quality scores and other metadata, and in aggregate can require a large amount of space. The quality scores show how accurate the bases are with respect to the sequencing process, that is, how confident the sequencer is of having called them correctly, and are the largest component in datasets in which they are retained. Previous research has examined how to store sequences of bases effectively; here we add to that knowledge by examining methods for compressing quality scores. The quality values originate in a continuous domain, and so if a fidelity criterion is introduced, it is possible to introduce flexibility in the way these values are represented, allowing lossy compression over the quality score data. We present existing compression options for quality score data, and then introduce two new lossy techniques. Experiments measuring the trade-off between compression ratio and information loss are reported, including quantifying the effect of lossy representations on a downstream application that carries out single nucleotide polymorphism and insert/deletion detection. The new methods are demonstrably superior to other techniques when assessed against the spectrum of possible trade-offs between storage required and fidelity of representation. An implementation of the methods described here is available at https://github.com/rcanovas/libCSAM. rcanovas@student.unimelb.edu.au Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Model-based quality assessment and base-calling for second-generation sequencing data.
Bravo, Héctor Corrada; Irizarry, Rafael A
2010-09-01
Second-generation sequencing (sec-gen) technology can sequence millions of short fragments of DNA in parallel, making it capable of assembling complex genomes for a small fraction of the price and time of previous technologies. In fact, a recently formed international consortium, the 1000 Genomes Project, plans to fully sequence the genomes of approximately 1200 people. The prospect of comparative analysis at the sequence level of a large number of samples across multiple populations may be achieved within the next five years. These data present unprecedented challenges in statistical analysis. For instance, analysis operates on millions of short nucleotide sequences, or reads-strings of A,C,G, or T's, between 30 and 100 characters long-which are the result of complex processing of noisy continuous fluorescence intensity measurements known as base-calling. The complexity of the base-calling discretization process results in reads of widely varying quality within and across sequence samples. This variation in processing quality results in infrequent but systematic errors that we have found to mislead downstream analysis of the discretized sequence read data. For instance, a central goal of the 1000 Genomes Project is to quantify across-sample variation at the single nucleotide level. At this resolution, small error rates in sequencing prove significant, especially for rare variants. Sec-gen sequencing is a relatively new technology for which potential biases and sources of obscuring variation are not yet fully understood. Therefore, modeling and quantifying the uncertainty inherent in the generation of sequence reads is of utmost importance. In this article, we present a simple model to capture uncertainty arising in the base-calling procedure of the Illumina/Solexa GA platform. Model parameters have a straightforward interpretation in terms of the chemistry of base-calling allowing for informative and easily interpretable metrics that capture the variability in sequencing quality. Our model provides these informative estimates readily usable in quality assessment tools while significantly improving base-calling performance. © 2009, The International Biometric Society.
Aguirre, Jacobo; Buldú, Javier M; Manrubia, Susanna C
2009-12-01
Networks of selectively neutral genotypes underlie the evolution of populations of replicators in constant environments. Previous theoretical analysis predicted that such populations will evolve toward highly connected regions of the genome space. We first study the evolution of populations of replicators on simple networks and quantify how the transient time to equilibrium depends on the initial distribution of sequences on the neutral network, on the topological properties of the latter, and on the mutation rate. Second, network neutrality is broken through the introduction of an energy for each sequence. This allows to study the competition between two features (neutrality and energetic stability) relevant for survival and subjected to different selective pressures. In cases where the two features are negatively correlated, the population experiences sudden migrations in the genome space for values of the relevant parameters that we calculate. The numerical study of larger networks indicates that the qualitative behavior to be expected in more realistic cases is already seen in representative examples of small networks.
NASA Astrophysics Data System (ADS)
Aguirre, Jacobo; Buldú, Javier M.; Manrubia, Susanna C.
2009-12-01
Networks of selectively neutral genotypes underlie the evolution of populations of replicators in constant environments. Previous theoretical analysis predicted that such populations will evolve toward highly connected regions of the genome space. We first study the evolution of populations of replicators on simple networks and quantify how the transient time to equilibrium depends on the initial distribution of sequences on the neutral network, on the topological properties of the latter, and on the mutation rate. Second, network neutrality is broken through the introduction of an energy for each sequence. This allows to study the competition between two features (neutrality and energetic stability) relevant for survival and subjected to different selective pressures. In cases where the two features are negatively correlated, the population experiences sudden migrations in the genome space for values of the relevant parameters that we calculate. The numerical study of larger networks indicates that the qualitative behavior to be expected in more realistic cases is already seen in representative examples of small networks.
Quantifying chaotic dynamics from integrate-and-fire processes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pavlov, A. N.; Saratov State Technical University, Politehnicheskaya Str. 77, 410054 Saratov; Pavlova, O. N.
2015-01-15
Characterizing chaotic dynamics from integrate-and-fire (IF) interspike intervals (ISIs) is relatively easy performed at high firing rates. When the firing rate is low, a correct estimation of Lyapunov exponents (LEs) describing dynamical features of complex oscillations reflected in the IF ISI sequences becomes more complicated. In this work we discuss peculiarities and limitations of quantifying chaotic dynamics from IF point processes. We consider main factors leading to underestimated LEs and demonstrate a way of improving numerical determining of LEs from IF ISI sequences. We show that estimations of the two largest LEs can be performed using around 400 mean periodsmore » of chaotic oscillations in the regime of phase-coherent chaos. Application to real data is discussed.« less
Harvey, William T.; Benton, Donald J.; Gregory, Victoria; Hall, James P. J.; Daniels, Rodney S.; Bedford, Trevor; Haydon, Daniel T.; Hay, Alan J.; McCauley, John W.; Reeve, Richard
2016-01-01
Determining phenotype from genetic data is a fundamental challenge. Identification of emerging antigenic variants among circulating influenza viruses is critical to the vaccine virus selection process, with vaccine effectiveness maximized when constituents are antigenically similar to circulating viruses. Hemagglutination inhibition (HI) assay data are commonly used to assess influenza antigenicity. Here, sequence and 3-D structural information of hemagglutinin (HA) glycoproteins were analyzed together with corresponding HI assay data for former seasonal influenza A(H1N1) virus isolates (1997–2009) and reference viruses. The models developed identify and quantify the impact of eighteen amino acid substitutions on the antigenicity of HA, two of which were responsible for major transitions in antigenic phenotype. We used reverse genetics to demonstrate the causal effect on antigenicity for a subset of these substitutions. Information on the impact of substitutions allowed us to predict antigenic phenotypes of emerging viruses directly from HA gene sequence data and accuracy was doubled by including all substitutions causing antigenic changes over a model incorporating only the substitutions with the largest impact. The ability to quantify the phenotypic impact of specific amino acid substitutions should help refine emerging techniques that predict the evolution of virus populations from one year to the next, leading to stronger theoretical foundations for selection of candidate vaccine viruses. These techniques have great potential to be extended to other antigenically variable pathogens. PMID:27057693
Reiman, Mario; Laan, Maris; Rull, Kristiina; Sõber, Siim
2017-08-01
RNA degradation is a ubiquitous process that occurs in living and dead cells, as well as during handling and storage of extracted RNA. Reduced RNA quality caused by degradation is an established source of uncertainty for all RNA-based gene expression quantification techniques. RNA sequencing is an increasingly preferred method for transcriptome analyses, and dependence of its results on input RNA integrity is of significant practical importance. This study aimed to characterize the effects of varying input RNA integrity [estimated as RNA integrity number (RIN)] on transcript level estimates and delineate the characteristic differences between transcripts that differ in degradation rate. The study used ribodepleted total RNA sequencing data from a real-life clinically collected set ( n = 32) of human solid tissue (placenta) samples. RIN-dependent alterations in gene expression profiles were quantified by using DESeq2 software. Our results indicate that small differences in RNA integrity affect gene expression quantification by introducing a moderate and pervasive bias in expression level estimates that significantly affected 8.1% of studied genes. The rapidly degrading transcript pool was enriched in pseudogenes, short noncoding RNAs, and transcripts with extended 3' untranslated regions. Typical slowly degrading transcripts (median length, 2389 nt) represented protein coding genes with 4-10 exons and high guanine-cytosine content.-Reiman, M., Laan, M., Rull, K., Sõber, S. Effects of RNA integrity on transcript quantification by total RNA sequencing of clinically collected human placental samples. © FASEB.
Kuzmina, Maria L; Braukmann, Thomas W A; Fazekas, Aron J; Graham, Sean W; Dewaard, Stephanie L; Rodrigues, Anuar; Bennett, Bruce A; Dickinson, Timothy A; Saarela, Jeffery M; Catling, Paul M; Newmaster, Steven G; Percy, Diana M; Fenneman, Erin; Lauron-Moreau, Aurélien; Ford, Bruce; Gillespie, Lynn; Subramanyam, Ragupathy; Whitton, Jeannette; Jennings, Linda; Metsger, Deborah; Warne, Connor P; Brown, Allison; Sears, Elizabeth; Dewaard, Jeremy R; Zakharov, Evgeny V; Hebert, Paul D N
2017-12-01
Constructing complete, accurate plant DNA barcode reference libraries can be logistically challenging for large-scale floras. Here we demonstrate the promise and challenges of using herbarium collections for building a DNA barcode reference library for the vascular plant flora of Canada. Our study examined 20,816 specimens representing 5076 of 5190 vascular plant species in Canada (98%). For 98% of the specimens, at least one of the DNA barcode regions was recovered from the plastid loci rbcL and matK and from the nuclear ITS2 region. We used beta regression to quantify the effects of age, type of preservation, and taxonomic affiliation (family) on DNA sequence recovery. Specimen age and method of preservation had significant effects on sequence recovery for all markers, but influenced some families more (e.g., Boraginaceae) than others (e.g., Asteraceae). Our DNA barcode library represents an unparalleled resource for metagenomic and ecological genetic research working on temperate and arctic biomes. An observed decline in sequence recovery with specimen age may be associated with poor primer matches, intragenomic variation (for ITS2), or inhibitory secondary compounds in some taxa.
Kuzmina, Maria L.; Braukmann, Thomas W. A.; Fazekas, Aron J.; Graham, Sean W.; Dewaard, Stephanie L.; Rodrigues, Anuar; Bennett, Bruce A.; Dickinson, Timothy A.; Saarela, Jeffery M.; Catling, Paul M.; Newmaster, Steven G.; Percy, Diana M.; Fenneman, Erin; Lauron-Moreau, Aurélien; Ford, Bruce; Gillespie, Lynn; Subramanyam, Ragupathy; Whitton, Jeannette; Jennings, Linda; Metsger, Deborah; Warne, Connor P.; Brown, Allison; Sears, Elizabeth; Dewaard, Jeremy R.; Zakharov, Evgeny V.; Hebert, Paul D. N.
2017-01-01
Premise of the study: Constructing complete, accurate plant DNA barcode reference libraries can be logistically challenging for large-scale floras. Here we demonstrate the promise and challenges of using herbarium collections for building a DNA barcode reference library for the vascular plant flora of Canada. Methods: Our study examined 20,816 specimens representing 5076 of 5190 vascular plant species in Canada (98%). For 98% of the specimens, at least one of the DNA barcode regions was recovered from the plastid loci rbcL and matK and from the nuclear ITS2 region. We used beta regression to quantify the effects of age, type of preservation, and taxonomic affiliation (family) on DNA sequence recovery. Results: Specimen age and method of preservation had significant effects on sequence recovery for all markers, but influenced some families more (e.g., Boraginaceae) than others (e.g., Asteraceae). Discussion: Our DNA barcode library represents an unparalleled resource for metagenomic and ecological genetic research working on temperate and arctic biomes. An observed decline in sequence recovery with specimen age may be associated with poor primer matches, intragenomic variation (for ITS2), or inhibitory secondary compounds in some taxa. PMID:29299394
Quantifying Selection with Pool-Seq Time Series Data.
Taus, Thomas; Futschik, Andreas; Schlötterer, Christian
2017-11-01
Allele frequency time series data constitute a powerful resource for unraveling mechanisms of adaptation, because the temporal dimension captures important information about evolutionary forces. In particular, Evolve and Resequence (E&R), the whole-genome sequencing of replicated experimentally evolving populations, is becoming increasingly popular. Based on computer simulations several studies proposed experimental parameters to optimize the identification of the selection targets. No such recommendations are available for the underlying parameters selection strength and dominance. Here, we introduce a highly accurate method to estimate selection parameters from replicated time series data, which is fast enough to be applied on a genome scale. Using this new method, we evaluate how experimental parameters can be optimized to obtain the most reliable estimates for selection parameters. We show that the effective population size (Ne) and the number of replicates have the largest impact. Because the number of time points and sequencing coverage had only a minor effect, we suggest that time series analysis is feasible without major increase in sequencing costs. We anticipate that time series analysis will become routine in E&R studies. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Whale, Alexandra S; Devonshire, Alison S; Karlin-Neumann, George; Regan, Jack; Javier, Leanne; Cowen, Simon; Fernandez-Gonzalez, Ana; Jones, Gerwyn M; Redshaw, Nicholas; Beck, Julia; Berger, Andreas W; Combaret, Valérie; Dahl Kjersgaard, Nina; Davis, Lisa; Fina, Frederic; Forshew, Tim; Fredslund Andersen, Rikke; Galbiati, Silvia; González Hernández, Álvaro; Haynes, Charles A; Janku, Filip; Lacave, Roger; Lee, Justin; Mistry, Vilas; Pender, Alexandra; Pradines, Anne; Proudhon, Charlotte; Saal, Lao H; Stieglitz, Elliot; Ulrich, Bryan; Foy, Carole A; Parkes, Helen; Tzonev, Svilen; Huggett, Jim F
2017-02-07
This study tested the claim that digital PCR (dPCR) can offer highly reproducible quantitative measurements in disparate laboratories. Twenty-one laboratories measured four blinded samples containing different quantities of a KRAS fragment encoding G12D, an important genetic marker for guiding therapy of certain cancers. This marker is challenging to quantify reproducibly using quantitative PCR (qPCR) or next generation sequencing (NGS) due to the presence of competing wild type sequences and the need for calibration. Using dPCR, 18 laboratories were able to quantify the G12D marker within 12% of each other in all samples. Three laboratories appeared to measure consistently outlying results; however, proper application of a follow-up analysis recommendation rectified their data. Our findings show that dPCR has demonstrable reproducibility across a large number of laboratories without calibration. This could enable the reproducible application of molecular stratification to guide therapy and, potentially, for molecular diagnostics.
Localized structural frustration for evaluating the impact of sequence variants
Kumar, Sushant; Clarke, Declan; Gerstein, Mark
2016-01-01
Population-scale sequencing is increasingly uncovering large numbers of rare single-nucleotide variants (SNVs) in coding regions of the genome. The rarity of these variants makes it challenging to evaluate their deleteriousness with conventional phenotype–genotype associations. Protein structures provide a way of addressing this challenge. Previous efforts have focused on globally quantifying the impact of SNVs on protein stability. However, local perturbations may severely impact protein functionality without strongly disrupting global stability (e.g. in relation to catalysis or allostery). Here, we describe a workflow in which localized frustration, quantifying unfavorable local interactions, is employed as a metric to investigate such effects. Using this workflow on the Protein Databank, we find that frustration produces many immediately intuitive results: for instance, disease-related SNVs create stronger changes in localized frustration than non-disease related variants, and rare SNVs tend to disrupt local interactions to a larger extent than common variants. Less obviously, we observe that somatic SNVs associated with oncogenes and tumor suppressor genes (TSGs) induce very different changes in frustration. In particular, those associated with TSGs change the frustration more in the core than the surface (by introducing loss-of-function events), whereas those associated with oncogenes manifest the opposite pattern, creating gain-of-function events. PMID:27915290
Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes.
Haiminen, Niina; Feltus, F Alex; Parida, Laxmi
2011-04-15
We investigate if pooling BAC clones and sequencing the pools can provide for more accurate assembly of genome sequences than the "whole genome shotgun" (WGS) approach. Furthermore, we quantify this accuracy increase. We compare the pooled BAC and WGS approaches using in silico simulations. Standard measures of assembly quality focus on assembly size and fragmentation, which are desirable for large whole genome assemblies. We propose additional measures enabling easy and visual comparison of assembly quality, such as rearrangements and redundant sequence content, relative to the known target sequence. The best assembly quality scores were obtained using 454 coverage of 15× linear and 5× paired (3kb insert size) reads (15L-5P) on Arabidopsis. This regime gave similarly good results on four additional plant genomes of very different GC and repeat contents. BAC pooling improved assembly scores over WGS assembly, coverage and redundancy scores improving the most. BAC pooling works better than WGS, however, both require a physical map to order the scaffolds. Pool sizes up to 12Mbp work well, suggesting this pooling density to be effective in medium-scale re-sequencing applications such as targeted sequencing of QTL intervals for candidate gene discovery. Assuming the current Roche/454 Titanium sequencing limitations, a 12 Mbp region could be re-sequenced with a full plate of linear reads and a half plate of paired-end reads, yielding 15L-5P coverage after read pre-processing. Our simulation suggests that massively over-sequencing may not improve accuracy. Our scoring measures can be used generally to evaluate and compare results of simulated genome assemblies.
Effect of hot acid hydrolysis and hot chlorine dioxide stage on bleaching effluent biodegradability.
Gomes, C M; Colodette, J L; Delantonio, N R N; Mounteer, A H; Silva, C M
2007-01-01
The hot acid hydrolysis followed by chlorine dioxide (A/D*) and hot chlorine dioxide (D*) technologies have proven very useful for bleaching of eucalyptus kraft pulp. Although the characteristics and biodegradability of effluents from conventional chlorine dioxide bleaching are well known, such information is not yet available for effluents derived from hot acid hydrolysis and hot chorine dioxide bleaching. This study discusses the characteristics and biodegradability of such effluents. Combined whole effluents from the complete sequences DEpD, D*EpD, A/D*EpD and ADEpD, and from the pre-bleaching sequences DEp, D*Ep, A/D*Ep and ADEp were characterized by quantifying their colour, AOX and organic load (BOD, COD, TOC). These effluents were also evaluated for their treatability by simulation of an activated sludge system. It was concluded that treatment in the laboratory sequencing batch reactor was efficient for removal of COD, BOD and TOC of all effluents. However, colour increased after biological treatment, with the greatest increase found for the effluent produced using the AD technology. Biological treatment was less efficient at removing AOX of effluents from the sequences with D*, A/D* and AD as the first stages, when compared to the reference D stage; there was evidence of the lower treatability of these organochlorine compounds from these sequences.
Tracing cell lineages in videos of lens-free microscopy.
Rempfler, Markus; Stierle, Valentin; Ditzel, Konstantin; Kumar, Sanjeev; Paulitschke, Philipp; Andres, Bjoern; Menze, Bjoern H
2018-06-05
In vitro experiments with cultured cells are essential for studying their growth and migration pattern and thus, for gaining a better understanding of cancer progression and its treatment. Recent progress in lens-free microscopy (LFM) has rendered it an inexpensive tool for label-free, continuous live cell imaging, yet there is only little work on analysing such time-lapse image sequences. We propose (1) a cell detector for LFM images based on fully convolutional networks and residual learning, and (2) a probabilistic model based on moral lineage tracing that explicitly handles multiple detections and temporal successor hypotheses by clustering and tracking simultaneously. (3) We benchmark our method in terms of detection and tracking scores on a dataset of three annotated sequences of several hours of LFM, where we demonstrate our method to produce high quality lineages. (4) We evaluate its performance on a somewhat more challenging problem: estimating cell lineages from the LFM sequence as would be possible from a corresponding fluorescence microscopy sequence. We present experiments on 16 LFM sequences for which we acquired fluorescence microscopy in parallel and generated annotations from them. Finally, (5) we showcase our methods effectiveness for quantifying cell dynamics in an experiment with skin cancer cells. Copyright © 2018 Elsevier B.V. All rights reserved.
Similarity of Symbol Frequency Distributions with Heavy Tails
NASA Astrophysics Data System (ADS)
Gerlach, Martin; Font-Clos, Francesc; Altmann, Eduardo G.
2016-04-01
Quantifying the similarity between symbolic sequences is a traditional problem in information theory which requires comparing the frequencies of symbols in different sequences. In numerous modern applications, ranging from DNA over music to texts, the distribution of symbol frequencies is characterized by heavy-tailed distributions (e.g., Zipf's law). The large number of low-frequency symbols in these distributions poses major difficulties to the estimation of the similarity between sequences; e.g., they hinder an accurate finite-size estimation of entropies. Here, we show analytically how the systematic (bias) and statistical (fluctuations) errors in these estimations depend on the sample size N and on the exponent γ of the heavy-tailed distribution. Our results are valid for the Shannon entropy (α =1 ), its corresponding similarity measures (e.g., the Jensen-Shanon divergence), and also for measures based on the generalized entropy of order α . For small α 's, including α =1 , the errors decay slower than the 1 /N decay observed in short-tailed distributions. For α larger than a critical value α*=1 +1 /γ ≤2 , the 1 /N decay is recovered. We show the practical significance of our results by quantifying the evolution of the English language over the last two centuries using a complete α spectrum of measures. We find that frequent words change more slowly than less frequent words and that α =2 provides the most robust measure to quantify language change.
Supervised Learning for Detection of Duplicates in Genomic Sequence Databases.
Chen, Qingyu; Zobel, Justin; Zhang, Xiuzhen; Verspoor, Karin
2016-01-01
First identified as an issue in 1996, duplication in biological databases introduces redundancy and even leads to inconsistency when contradictory information appears. The amount of data makes purely manual de-duplication impractical, and existing automatic systems cannot detect duplicates as precisely as can experts. Supervised learning has the potential to address such problems by building automatic systems that learn from expert curation to detect duplicates precisely and efficiently. While machine learning is a mature approach in other duplicate detection contexts, it has seen only preliminary application in genomic sequence databases. We developed and evaluated a supervised duplicate detection method based on an expert curated dataset of duplicates, containing over one million pairs across five organisms derived from genomic sequence databases. We selected 22 features to represent distinct attributes of the database records, and developed a binary model and a multi-class model. Both models achieve promising performance; under cross-validation, the binary model had over 90% accuracy in each of the five organisms, while the multi-class model maintains high accuracy and is more robust in generalisation. We performed an ablation study to quantify the impact of different sequence record features, finding that features derived from meta-data, sequence identity, and alignment quality impact performance most strongly. The study demonstrates machine learning can be an effective additional tool for de-duplication of genomic sequence databases. All Data are available as described in the supplementary material.
Scaling features of noncoding DNA
NASA Technical Reports Server (NTRS)
Stanley, H. E.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Peng, C. K.; Simons, M.
1999-01-01
We review evidence supporting the idea that the DNA sequence in genes containing noncoding regions is correlated, and that the correlation is remarkably long range--indeed, base pairs thousands of base pairs distant are correlated. We do not find such a long-range correlation in the coding regions of the gene, and utilize this fact to build a Coding Sequence Finder Algorithm, which uses statistical ideas to locate the coding regions of an unknown DNA sequence. Finally, we describe briefly some recent work adapting to DNA the Zipf approach to analyzing linguistic texts, and the Shannon approach to quantifying the "redundancy" of a linguistic text in terms of a measurable entropy function, and reporting that noncoding regions in eukaryotes display a larger redundancy than coding regions. Specifically, we consider the possibility that this result is solely a consequence of nucleotide concentration differences as first noted by Bonhoeffer and his collaborators. We find that cytosine-guanine (CG) concentration does have a strong "background" effect on redundancy. However, we find that for the purine-pyrimidine binary mapping rule, which is not affected by the difference in CG concentration, the Shannon redundancy for the set of analyzed sequences is larger for noncoding regions compared to coding regions.
Siebert, Stefan; Robinson, Mark D; Tintori, Sophia C; Goetz, Freya; Helm, Rebecca R; Smith, Stephen A; Shaner, Nathan; Haddock, Steven H D; Dunn, Casey W
2011-01-01
We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing.
Siebert, Stefan; Robinson, Mark D.; Tintori, Sophia C.; Goetz, Freya; Helm, Rebecca R.; Smith, Stephen A.; Shaner, Nathan; Haddock, Steven H. D.; Dunn, Casey W.
2011-01-01
We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing. PMID:21829563
Rasmussen, Thomas A; McMahon, James; Chang, J Judy; Symons, Jori; Roche, Michael; Dantanarayana, Ashanti; Okoye, Afam; Hiener, Bonnie; Palmer, Sarah; Lee, Wen Shi; Kent, Stephen J; Van Der Weyden, Carrie; Prince, H Miles; Cameron, Paul U; Lewin, Sharon R
2017-08-24
To study the effects of alemtuzumab on HIV persistence in an HIV-infected individual on antiretroviral therapy (ART) with Sezary syndrome, a rare malignancy of CD4 T cells. Case report. Blood was collected 30 and 18 months prior to presentation with Sezary syndrome, at the time of presentation and during alemtuzumab. T-cell subsets in malignant (CD7-CD26-TCR-VBeta2+) and nonmalignant cells were quantified by flow cytometry. HIV-DNA in total CD4 T cells, in sorted malignant and nonmalignant CD4 T cells, was quantified by PCR and clonal expansion of HIV-DNA assessed by full-length next-generation sequencing. HIV-hepatitis B virus coinfection was diagnosed and antiretroviral therapy initiated 4 years prior to presentation with Sezary syndrome and primary cutaneous anaplastic large cell lymphoma. The patient received alemtuzumab 10 mg three times per week for 4 weeks but died 6 weeks post alemtuzumab. HIV-DNA was detected in nonmalignant but not in malignant CD4 T cells, consistent with expansion of a noninfected CD4 T-cell clone. Full-length HIV-DNA sequencing demonstrated multiple defective viruses but no identical or expanded sequences. Alemtuzumab extensively depleted T cells, including more than 1 log reduction in total T cells and more than 3 log reduction in CD4 T cells. Finally, alemtuzumab decreased HIV-DNA in CD4 T cells by 57% but HIV-DNA remained detectable at low levels even after depletion of nearly all CD4 T cells. Alemtuzumab extensively depleted multiple T-cell subsets and decreased the frequency of but did not eliminate HIV-infected CD4 T cells. Studying the effects on HIV persistence following immune recovery in HIV-infected individuals who require alemtuzumab for malignancy or in animal studies may provide further insights into novel cure strategies.
Nanopore-CMOS Interfaces for DNA Sequencing
Magierowski, Sebastian; Huang, Yiyun; Wang, Chengjie; Ghafar-Zadeh, Ebrahim
2016-01-01
DNA sequencers based on nanopore sensors present an opportunity for a significant break from the template-based incumbents of the last forty years. Key advantages ushered by nanopore technology include a simplified chemistry and the ability to interface to CMOS technology. The latter opportunity offers substantial promise for improvement in sequencing speed, size and cost. This paper reviews existing and emerging means of interfacing nanopores to CMOS technology with an emphasis on massively-arrayed structures. It presents this in the context of incumbent DNA sequencing techniques, reviews and quantifies nanopore characteristics and models and presents CMOS circuit methods for the amplification of low-current nanopore signals in such interfaces. PMID:27509529
Nanopore-CMOS Interfaces for DNA Sequencing.
Magierowski, Sebastian; Huang, Yiyun; Wang, Chengjie; Ghafar-Zadeh, Ebrahim
2016-08-06
DNA sequencers based on nanopore sensors present an opportunity for a significant break from the template-based incumbents of the last forty years. Key advantages ushered by nanopore technology include a simplified chemistry and the ability to interface to CMOS technology. The latter opportunity offers substantial promise for improvement in sequencing speed, size and cost. This paper reviews existing and emerging means of interfacing nanopores to CMOS technology with an emphasis on massively-arrayed structures. It presents this in the context of incumbent DNA sequencing techniques, reviews and quantifies nanopore characteristics and models and presents CMOS circuit methods for the amplification of low-current nanopore signals in such interfaces.
DNA sequence analysis with droplet-based microfluidics
Abate, Adam R.; Hung, Tony; Sperling, Ralph A.; Mary, Pascaline; Rotem, Assaf; Agresti, Jeremy J.; Weiner, Michael A.; Weitz, David A.
2014-01-01
Droplet-based microfluidic techniques can form and process micrometer scale droplets at thousands per second. Each droplet can house an individual biochemical reaction, allowing millions of reactions to be performed in minutes with small amounts of total reagent. This versatile approach has been used for engineering enzymes, quantifying concentrations of DNA in solution, and screening protein crystallization conditions. Here, we use it to read the sequences of DNA molecules with a FRET-based assay. Using probes of different sequences, we interrogate a target DNA molecule for polymorphisms. With a larger probe set, additional polymorphisms can be interrogated as well as targets of arbitrary sequence. PMID:24185402
Lung dynamic MRI deblurring using low-rank decomposition and dictionary learning.
Gou, Shuiping; Wang, Yueyue; Wu, Jiaolong; Lee, Percy; Sheng, Ke
2015-04-01
Lung dynamic MRI (dMRI) has emerged to be an appealing tool to quantify lung motion for both planning and treatment guidance purposes. However, this modality can result in blurry images due to intrinsically low signal-to-noise ratio in the lung and spatial/temporal interpolation. The image blurring could adversely affect the image processing that depends on the availability of fine landmarks. The purpose of this study is to reduce dMRI blurring using image postprocessing. To enhance the image quality and exploit the spatiotemporal continuity of dMRI sequences, a low-rank decomposition and dictionary learning (LDDL) method was employed to deblur lung dMRI and enhance the conspicuity of lung blood vessels. Fifty frames of continuous 2D coronal dMRI frames using a steady state free precession sequence were obtained from five subjects including two healthy volunteer and three lung cancer patients. In LDDL, the lung dMRI was decomposed into sparse and low-rank components. Dictionary learning was employed to estimate the blurring kernel based on the whole image, low-rank or sparse component of the first image in the lung MRI sequence. Deblurring was performed on the whole image sequences using deconvolution based on the estimated blur kernel. The deblurring results were quantified using an automated blood vessel extraction method based on the classification of Hessian matrix filtered images. Accuracy of automated extraction was calculated using manual segmentation of the blood vessels as the ground truth. In the pilot study, LDDL based on the blurring kernel estimated from the sparse component led to performance superior to the other ways of kernel estimation. LDDL consistently improved image contrast and fine feature conspicuity of the original MRI without introducing artifacts. The accuracy of automated blood vessel extraction was on average increased by 16% using manual segmentation as the ground truth. Image blurring in dMRI images can be effectively reduced using a low-rank decomposition and dictionary learning method using kernels estimated by the sparse component.
Gurney-Champion, Oliver J; Bruins Slot, Thijs; Lens, Eelco; van der Horst, Astrid; Klaassen, Remy; van Laarhoven, Hanneke W M; van Tienhoven, Geertjan; van Hooft, Jeanin E; Nederveen, Aart J; Bel, Arjan
2016-10-01
Biliary stents may cause susceptibility artifacts, gradient-induced artifacts, and radio frequency (RF) induced artifacts on magnetic resonance images, which can hinder accurate target volume delineation in radiotherapy. In this study, the authors investigated and quantified the magnitude of these artifacts for stents of different materials. Eight biliary stents made of nitinol, platinum-cored nitinol, stainless steel, or polyethylene from seven vendors, with different lengths (57-98 mm) and diameters (3.0-11.7 mm), were placed in a phantom. To quantify the susceptibility artifacts sequence-independently, ΔB0-maps and T2 ∗ -maps were acquired at 1.5 and 3 T. To study the effect of the gradient-induced artifacts at 3 T, signal decay in images obtained with maximum readout gradient-induced artifacts was compared to signal decay in reference scans. To quantify the RF induced artifacts at 3 T, B1-maps were acquired. Finally, ΔB0-maps and T2 ∗ -maps were acquired at 3 T of two pancreatic cancer patients who had received platinum-cored nitinol biliary stents. Outside the stent, susceptibility artifacts dominated the other artifacts. The stainless steel stent produced the largest susceptibility artifacts. The other stents caused decreased T2 ∗ up to 5.1 mm (1.5 T) and 8.5 mm (3 T) from the edge of the stent. For sequences with a higher bandwidth per voxel (1.5 T: BW vox > 275 Hz/voxel; 3 T: BW vox > 500 Hz/voxel), the B0-related susceptibility artifacts were negligible (<0.2 voxels). The polyethylene stent showed no artifacts. In vivo, the changes in B0 and T2 ∗ induced by the stent were larger than typical variations in B0 and T2 ∗ induced by anatomy when the stent was at an angle of 30° with the main magnetic field. Susceptibility artifacts were dominating over the other artifacts. The magnitudes of the susceptibility artifacts were determined sequence-independently. This method allows to include additional safety margins that ensure target irradiation.
Oshiki, Mamoru; Segawa, Takahiro; Ishii, Satoshi
2018-02-02
Various microorganisms play key roles in the Nitrogen (N) cycle. Quantitative PCR (qPCR) and PCR-amplicon sequencing of the N cycle functional genes allow us to analyze the abundance and diversity of microbes responsible in the N transforming reactions in various environmental samples. However, analysis of multiple target genes can be cumbersome and expensive. PCR-independent analysis, such as metagenomics and metatranscriptomics, is useful but expensive especially when we analyze multiple samples and try to detect N cycle functional genes present at relatively low abundance. Here, we present the application of microfluidic qPCR chip technology to simultaneously quantify and prepare amplicon sequence libraries for multiple N cycle functional genes as well as taxon-specific 16S rRNA gene markers for many samples. This approach, named as N cycle evaluation (NiCE) chip, was evaluated by using DNA from pure and artificially mixed bacterial cultures and by comparing the results with those obtained by conventional qPCR and amplicon sequencing methods. Quantitative results obtained by the NiCE chip were comparable to those obtained by conventional qPCR. In addition, the NiCE chip was successfully applied to examine abundance and diversity of N cycle functional genes in wastewater samples. Although non-specific amplification was detected on the NiCE chip, this could be overcome by optimizing the primer sequences in the future. As the NiCE chip can provide high-throughput format to quantify and prepare sequence libraries for multiple N cycle functional genes, this tool should advance our ability to explore N cycling in various samples. Importance. We report a novel approach, namely Nitrogen Cycle Evaluation (NiCE) chip by using microfluidic qPCR chip technology. By sequencing the amplicons recovered from the NiCE chip, we can assess diversities of the N cycle functional genes. The NiCE chip technology is applicable to analyze the temporal dynamics of the N cycle gene transcriptions in wastewater treatment bioreactors. The NiCE chip can provide high-throughput format to quantify and prepare sequence libraries for multiple N cycle functional genes. While there is a room for future improvement, this tool should significantly advance our ability to explore the N cycle in various environmental samples. Copyright © 2018 American Society for Microbiology.
Beyond DNA: integrating inclusive inheritance into an extended theory of evolution.
Danchin, Étienne; Charmantier, Anne; Champagne, Frances A; Mesoudi, Alex; Pujol, Benoit; Blanchet, Simon
2011-06-17
Many biologists are calling for an 'extended evolutionary synthesis' that would 'modernize the modern synthesis' of evolution. Biological information is typically considered as being transmitted across generations by the DNA sequence alone, but accumulating evidence indicates that both genetic and non-genetic inheritance, and the interactions between them, have important effects on evolutionary outcomes. We review the evidence for such effects of epigenetic, ecological and cultural inheritance and parental effects, and outline methods that quantify the relative contributions of genetic and non-genetic heritability to the transmission of phenotypic variation across generations. These issues have implications for diverse areas, from the question of missing heritability in human complex-trait genetics to the basis of major evolutionary transitions.
Assessing pooled BAC and whole genome shotgun strategies for assembly of complex genomes
2011-01-01
Background We investigate if pooling BAC clones and sequencing the pools can provide for more accurate assembly of genome sequences than the "whole genome shotgun" (WGS) approach. Furthermore, we quantify this accuracy increase. We compare the pooled BAC and WGS approaches using in silico simulations. Standard measures of assembly quality focus on assembly size and fragmentation, which are desirable for large whole genome assemblies. We propose additional measures enabling easy and visual comparison of assembly quality, such as rearrangements and redundant sequence content, relative to the known target sequence. Results The best assembly quality scores were obtained using 454 coverage of 15× linear and 5× paired (3kb insert size) reads (15L-5P) on Arabidopsis. This regime gave similarly good results on four additional plant genomes of very different GC and repeat contents. BAC pooling improved assembly scores over WGS assembly, coverage and redundancy scores improving the most. Conclusions BAC pooling works better than WGS, however, both require a physical map to order the scaffolds. Pool sizes up to 12Mbp work well, suggesting this pooling density to be effective in medium-scale re-sequencing applications such as targeted sequencing of QTL intervals for candidate gene discovery. Assuming the current Roche/454 Titanium sequencing limitations, a 12 Mbp region could be re-sequenced with a full plate of linear reads and a half plate of paired-end reads, yielding 15L-5P coverage after read pre-processing. Our simulation suggests that massively over-sequencing may not improve accuracy. Our scoring measures can be used generally to evaluate and compare results of simulated genome assemblies. PMID:21496274
Sharma, Aseem; Chatterjee, Arindam; Goyal, Manu; Parsons, Matthew S; Bartel, Seth
2015-04-01
Targeting redundancy within MRI can improve its cost-effective utilization. We sought to quantify potential redundancy in our brain MRI protocols. In this retrospective review, we aggregated 207 consecutive adults who underwent brain MRI and reviewed their medical records to document clinical indication, core diagnostic information provided by MRI, and its clinical impact. Contributory imaging abnormalities constituted positive core diagnostic information whereas absence of imaging abnormalities constituted negative core diagnostic information. The senior author selected core sequences deemed sufficient for extraction of core diagnostic information. For validating core sequences selection, four readers assessed the relative ease of extracting core diagnostic information from the core sequences. Potential redundancy was calculated by comparing the average number of core sequences to the average number of sequences obtained. Scanning had been performed using 9.4±2.8 sequences over 37.3±12.3 minutes. Core diagnostic information was deemed extractable from 2.1±1.1 core sequences, with an assumed scanning time of 8.6±4.8 minutes, reflecting a potential redundancy of 74.5%±19.1%. Potential redundancy was least in scans obtained for treatment planning (14.9%±25.7%) and highest in scans obtained for follow-up of benign diseases (81.4%±12.6%). In 97.4% of cases, all four readers considered core diagnostic information to be either easily extractable from core sequences or the ease to be equivalent to that from the entire study. With only one MRI lacking clinical impact (0.48%), overutilization did not seem to contribute to potential redundancy. High potential redundancy that can be targeted for more efficient scanner utilization exists in brain MRI protocols.
Assessing Species Diversity Using Metavirome Data: Methods and Challenges.
Herath, Damayanthi; Jayasundara, Duleepa; Ackland, David; Saeed, Isaam; Tang, Sen-Lin; Halgamuge, Saman
2017-01-01
Assessing biodiversity is an important step in the study of microbial ecology associated with a given environment. Multiple indices have been used to quantify species diversity, which is a key biodiversity measure. Measuring species diversity of viruses in different environments remains a challenge relative to measuring the diversity of other microbial communities. Metagenomics has played an important role in elucidating viral diversity by conducting metavirome studies; however, metavirome data are of high complexity requiring robust data preprocessing and analysis methods. In this review, existing bioinformatics methods for measuring species diversity using metavirome data are categorised broadly as either sequence similarity-dependent methods or sequence similarity-independent methods. The former includes a comparison of DNA fragments or assemblies generated in the experiment against reference databases for quantifying species diversity, whereas estimates from the latter are independent of the knowledge of existing sequence data. Current methods and tools are discussed in detail, including their applications and limitations. Drawbacks of the state-of-the-art method are demonstrated through results from a simulation. In addition, alternative approaches are proposed to overcome the challenges in estimating species diversity measures using metavirome data.
Blazejak, Anna; Schippers, Axel
2010-05-01
Sequences of members of the bacterial candidate division JS-1 and the classes Anaerolineae and Caldilineae of the phylum Chloroflexi are frequently found in 16S rRNA gene clone libraries obtained from marine sediments. Using a newly designed quantitative, real-time PCR assay, these bacterial groups were jointly quantified in samples from near-surface and deeply buried marine sediments from the Peru margin, the Black Sea, and a forearc basin off the island of Sumatra. In near-surface sediments, sequences of the JS-1 as well as Anaerolineae- and Caldilineae-related Bacteria were quantified with significantly lower 16S rRNA gene copy numbers than the sequences of total Bacteria. In contrast, in deeply buried sediments below approximately 1 m depth, similar quantities of the 16S rRNA gene copies of these specific groups and Bacteria were found. This finding indicates that JS-1 and Anaerolineae- and Caldilineae-related Bacteria might dominate the bacterial community in deeply buried marine sediments and thus seem to play an important ecological role in the deep biosphere.
Best, Katharine; Oakes, Theres; Heather, James M.; Shawe-Taylor, John; Chain, Benny
2015-01-01
The polymerase chain reaction (PCR) is one of the most widely used techniques in molecular biology. In combination with High Throughput Sequencing (HTS), PCR is widely used to quantify transcript abundance for RNA-seq, and in the context of analysis of T and B cell receptor repertoires. In this study, we combine DNA barcoding with HTS to quantify PCR output from individual target molecules. We develop computational tools that simulate both the PCR branching process itself, and the subsequent subsampling which typically occurs during HTS sequencing. We explore the influence of different types of heterogeneity on sequencing output, and compare them to experimental results where the efficiency of amplification is measured by barcodes uniquely identifying each molecule of starting template. Our results demonstrate that the PCR process introduces substantial amplification heterogeneity, independent of primer sequence and bulk experimental conditions. This heterogeneity can be attributed both to inherited differences between different template DNA molecules, and the inherent stochasticity of the PCR process. The results demonstrate that PCR heterogeneity arises even when reaction and substrate conditions are kept as constant as possible, and therefore single molecule barcoding is essential in order to derive reproducible quantitative results from any protocol combining PCR with HTS. PMID:26459131
Paliwoda, Rebecca E; Li, Feng; Reid, Michael S; Lin, Yanwen; Le, X Chris
2014-06-17
Functionalizing nanomaterials for diverse analytical, biomedical, and therapeutic applications requires determination of surface coverage (or density) of DNA on nanomaterials. We describe a sequential strand displacement beacon assay that is able to quantify specific DNA sequences conjugated or coconjugated onto gold nanoparticles (AuNPs). Unlike the conventional fluorescence assay that requires the target DNA to be fluorescently labeled, the sequential strand displacement beacon method is able to quantify multiple unlabeled DNA oligonucleotides using a single (universal) strand displacement beacon. This unique feature is achieved by introducing two short unlabeled DNA probes for each specific DNA sequence and by performing sequential DNA strand displacement reactions. Varying the relative amounts of the specific DNA sequences and spacing DNA sequences during their coconjugation onto AuNPs results in different densities of the specific DNA on AuNP, ranging from 90 to 230 DNA molecules per AuNP. Results obtained from our sequential strand displacement beacon assay are consistent with those obtained from the conventional fluorescence assays. However, labeling of DNA with some fluorescent dyes, e.g., tetramethylrhodamine, alters DNA density on AuNP. The strand displacement strategy overcomes this problem by obviating direct labeling of the target DNA. This method has broad potential to facilitate more efficient design and characterization of novel multifunctional materials for diverse applications.
Shiroguchi, Katsuyuki; Jia, Tony Z.; Sims, Peter A.; Xie, X. Sunney
2012-01-01
RNA sequencing (RNA-Seq) is a powerful tool for transcriptome profiling, but is hampered by sequence-dependent bias and inaccuracy at low copy numbers intrinsic to exponential PCR amplification. We developed a simple strategy for mitigating these complications, allowing truly digital RNA-Seq. Following reverse transcription, a large set of barcode sequences is added in excess, and nearly every cDNA molecule is uniquely labeled by random attachment of barcode sequences to both ends. After PCR, we applied paired-end deep sequencing to read the two barcodes and cDNA sequences. Rather than counting the number of reads, RNA abundance is measured based on the number of unique barcode sequences observed for a given cDNA sequence. We optimized the barcodes to be unambiguously identifiable, even in the presence of multiple sequencing errors. This method allows counting with single-copy resolution despite sequence-dependent bias and PCR-amplification noise, and is analogous to digital PCR but amendable to quantifying a whole transcriptome. We demonstrated transcriptome profiling of Escherichia coli with more accurate and reproducible quantification than conventional RNA-Seq. PMID:22232676
Review of road user costs and methods.
DOT National Transportation Integrated Search
2013-07-01
The South Dakota Department of Transportation (SDDOT) uses road user costs (RUC) to calculate incentive or disincentive compensation for contractors, quantify project-specific liquidated damages, select the ideal sequencing of a project, and forecast...
Tn5Prime, a Tn5 based 5' capture method for single cell RNA-seq.
Cole, Charles; Byrne, Ashley; Beaudin, Anna E; Forsberg, E Camilla; Vollmers, Christopher
2018-06-01
RNA-sequencing (RNA-seq) is a powerful technique to investigate and quantify entire transcriptomes. Recent advances in the field have made it possible to explore the transcriptomes of single cells. However, most widely used RNA-seq protocols fail to provide crucial information regarding transcription start sites. Here we present a protocol, Tn5Prime, that takes advantage of the Tn5 transposase-based Smart-seq2 protocol to create RNA-seq libraries that capture the 5' end of transcripts. The Tn5Prime method dramatically streamlines the 5' capture process and is both cost effective and reliable. By applying Tn5Prime to bulk RNA and single cell samples, we were able to define transcription start sites as well as quantify transcriptomes at high accuracy and reproducibility. Additionally, similar to 3' end-based high-throughput methods like Drop-seq and 10× Genomics Chromium, the 5' capture Tn5Prime method allows the introduction of cellular identifiers during reverse transcription, simplifying the analysis of large numbers of single cells. In contrast to 3' end-based methods, Tn5Prime also enables the assembly of the variable 5' ends of the antibody sequences present in single B-cell data. Therefore, Tn5Prime presents a robust tool for both basic and applied research into the adaptive immune system and beyond.
Localized structural frustration for evaluating the impact of sequence variants.
Kumar, Sushant; Clarke, Declan; Gerstein, Mark
2016-12-01
Population-scale sequencing is increasingly uncovering large numbers of rare single-nucleotide variants (SNVs) in coding regions of the genome. The rarity of these variants makes it challenging to evaluate their deleteriousness with conventional phenotype-genotype associations. Protein structures provide a way of addressing this challenge. Previous efforts have focused on globally quantifying the impact of SNVs on protein stability. However, local perturbations may severely impact protein functionality without strongly disrupting global stability (e.g. in relation to catalysis or allostery). Here, we describe a workflow in which localized frustration, quantifying unfavorable local interactions, is employed as a metric to investigate such effects. Using this workflow on the Protein Databank, we find that frustration produces many immediately intuitive results: for instance, disease-related SNVs create stronger changes in localized frustration than non-disease related variants, and rare SNVs tend to disrupt local interactions to a larger extent than common variants. Less obviously, we observe that somatic SNVs associated with oncogenes and tumor suppressor genes (TSGs) induce very different changes in frustration. In particular, those associated with TSGs change the frustration more in the core than the surface (by introducing loss-of-function events), whereas those associated with oncogenes manifest the opposite pattern, creating gain-of-function events. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
A rule of seven in Watson-Crick base-pairing of mismatched sequences.
Cisse, Ibrahim I; Kim, Hajin; Ha, Taekjip
2012-05-13
Sequence recognition through base-pairing is essential for DNA repair and gene regulation, but the basic rules governing this process remain elusive. In particular, the kinetics of annealing between two imperfectly matched strands is not well characterized, despite its potential importance in nucleic acid-based biotechnologies and gene silencing. Here we use single-molecule fluorescence to visualize the multiple annealing and melting reactions of two untethered strands inside a porous vesicle, allowing us to precisely quantify the annealing and melting rates. The data as a function of mismatch position suggest that seven contiguous base pairs are needed for rapid annealing of DNA and RNA. This phenomenological rule of seven may underlie the requirement for seven nucleotides of complementarity to seed gene silencing by small noncoding RNA and may help guide performance improvement in DNA- and RNA-based bio- and nanotechnologies, in which off-target effects can be detrimental.
Quantum-dot-based quantitative identification of pathogens in complex mixture
NASA Astrophysics Data System (ADS)
Lim, Sun Hee; Bestwater, Felix; Buchy, Philippe; Mardy, Sek; Yu, Alexey Dan Chin
2010-02-01
In the present study we describe sandwich design hybridization probes consisting of magnetic particles (MP) and quantum dots (QD) with target DNA, and their application in the detection of avian influenza virus (H5N1) sequences. Hybridization of 25-, 40-, and 100-mer target DNA with both probes was analyzed and quantified by flow cytometry and fluorescence microscopy on the scale of single particles. The following steps were used in the assay: (i) target selection by MP probes and (ii) target detection by QD probes. Hybridization efficiency between MP conjugated probes and target DNA hybrids was controlled by a fluorescent dye specific for nucleic acids. Fluorescence was detected by flow cytometry to distinguish differences in oligo sequences as short as 25-mer capturing in target DNA and by gel-electrophoresis in the case of QD probes. This report shows that effective manipulation and control of micro- and nanoparticles in hybridization assays is possible.
Weirather, Jason L.; Afshar, Pegah Tootoonchi; Clark, Tyson A.; Tseng, Elizabeth; Powers, Linda S.; Underwood, Jason G.; Zabner, Joseph; Korlach, Jonas; Wong, Wing Hung; Au, Kin Fai
2015-01-01
We developed an innovative hybrid sequencing approach, IDP-fusion, to detect fusion genes, determine fusion sites and identify and quantify fusion isoforms. IDP-fusion is the first method to study gene fusion events by integrating Third Generation Sequencing long reads and Second Generation Sequencing short reads. We applied IDP-fusion to PacBio data and Illumina data from the MCF-7 breast cancer cells. Compared with the existing tools, IDP-fusion detects fusion genes at higher precision and a very low false positive rate. The results show that IDP-fusion will be useful for unraveling the complexity of multiple fusion splices and fusion isoforms within tumorigenesis-relevant fusion genes. PMID:26040699
BayesPI-BAR: a new biophysical model for characterization of regulatory sequence variations
Wang, Junbai; Batmanov, Kirill
2015-01-01
Sequence variations in regulatory DNA regions are known to cause functionally important consequences for gene expression. DNA sequence variations may have an essential role in determining phenotypes and may be linked to disease; however, their identification through analysis of massive genome-wide sequencing data is a great challenge. In this work, a new computational pipeline, a Bayesian method for protein–DNA interaction with binding affinity ranking (BayesPI-BAR), is proposed for quantifying the effect of sequence variations on protein binding. BayesPI-BAR uses biophysical modeling of protein–DNA interactions to predict single nucleotide polymorphisms (SNPs) that cause significant changes in the binding affinity of a regulatory region for transcription factors (TFs). The method includes two new parameters (TF chemical potentials or protein concentrations and direct TF binding targets) that are neglected by previous methods. The new method is verified on 67 known human regulatory SNPs, of which 47 (70%) have predicted true TFs ranked in the top 10. Importantly, the performance of BayesPI-BAR, which uses principal component analysis to integrate multiple predictions from various TF chemical potentials, is found to be better than that of existing programs, such as sTRAP and is-rSNP, when evaluated on the same SNPs. BayesPI-BAR is a publicly available tool and is able to carry out parallelized computation, which helps to investigate a large number of TFs or SNPs and to detect disease-associated regulatory sequence variations in the sea of genome-wide noncoding regions. PMID:26202972
Revell, J D; Mirmehdi, M; McNally, D S
2004-04-01
We examine tissue deformations using non-invasive dynamic musculoskeletal ultrasonograhy, and quantify its performance on controlled in vitro gold standard (groundtruth) sequences followed by clinical in vivo data. The proposed approach employs a two-dimensional variable-sized block matching algorithm with a hierarchical full search. We extend this process by refining displacements to sub-pixel accuracy. We show by application that this technique yields quantitatively reliable results.
Statistical inference of the generation probability of T-cell receptors from sequence repertoires.
Murugan, Anand; Mora, Thierry; Walczak, Aleksandra M; Callan, Curtis G
2012-10-02
Stochastic rearrangement of germline V-, D-, and J-genes to create variable coding sequence for certain cell surface receptors is at the origin of immune system diversity. This process, known as "VDJ recombination", is implemented via a series of stochastic molecular events involving gene choices and random nucleotide insertions between, and deletions from, genes. We use large sequence repertoires of the variable CDR3 region of human CD4+ T-cell receptor beta chains to infer the statistical properties of these basic biochemical events. Because any given CDR3 sequence can be produced in multiple ways, the probability distribution of hidden recombination events cannot be inferred directly from the observed sequences; we therefore develop a maximum likelihood inference method to achieve this end. To separate the properties of the molecular rearrangement mechanism from the effects of selection, we focus on nonproductive CDR3 sequences in T-cell DNA. We infer the joint distribution of the various generative events that occur when a new T-cell receptor gene is created. We find a rich picture of correlation (and absence thereof), providing insight into the molecular mechanisms involved. The generative event statistics are consistent between individuals, suggesting a universal biochemical process. Our probabilistic model predicts the generation probability of any specific CDR3 sequence by the primitive recombination process, allowing us to quantify the potential diversity of the T-cell repertoire and to understand why some sequences are shared between individuals. We argue that the use of formal statistical inference methods, of the kind presented in this paper, will be essential for quantitative understanding of the generation and evolution of diversity in the adaptive immune system.
Li, Yantao; Fu, Tuo; Liu, Tao; Guo, Huaizu; Guo, Qingcheng; Xu, Jin; Zhang, Dapeng; Qian, Weizhu; Dai, Jianxin; Li, Bohua; Guo, Yajun; Hou, Sheng; Wang, Hao
2016-07-01
Nivolumab is a therapeutic fully human IgG4 antibody to programmed death 1 (PD-1). In this study, a nivolumab biosimilar, which was produced in our laboratory, was analyzed and characterized. Sequence variants that contain undesired amino acid sequences may cause concern during biosimilar bioprocess development. We found that low levels of sequence variants were detected in the heavy chain of the nivolumab biosimilar by ultra performance liquid chromatography (UPLC) and tandem mass spectrometry. It was further identified with UPLC-MS/MS by IdeS or trypsin digestion. The sequence variant was confirmed through addition of synthetic mutant peptide. Subsequently, the mixing base signal of normal and mutant sequence was detected through DNA sequencing. The relative levels of mutant A424V in the Fc region of the heavy chain have been detected and demonstrated to be 12.25% and 13.54%, via base peak intensity (BPI) and UV chromatography of the tryptic peptide mapping, respectively. A424V variant was also quantified by real-time PCR (RT-PCR) at the DNA and RNA level, which was 19.2% and 16.8%, respectively. The relative content of the mutant was consistent at the DNA, RNA and protein level, indicating that the A424V mutation may have little influence at transcriptional or translational levels. These results demonstrate that orthogonal state-of-the-art techniques such as LC- UV- MS and RT-PCR should be implemented to characterize recombinant proteins and cell lines for development of biosimilars. Our study suggests that it is important to establish an integrated and effective analytical method to monitor and characterize sequence variants during antibody drug development, especially for antibody biosimilar products.
Ochoa, David; García-Gutiérrez, Ponciano; Juan, David; Valencia, Alfonso; Pazos, Florencio
2013-01-27
A widespread family of methods for studying and predicting protein interactions using sequence information is based on co-evolution, quantified as similarity of phylogenetic trees. Part of the co-evolution observed between interacting proteins could be due to co-adaptation caused by inter-protein contacts. In this case, the co-evolution is expected to be more evident when evaluated on the surface of the proteins or the internal layers close to it. In this work we study the effect of incorporating information on predicted solvent accessibility to three methods for predicting protein interactions based on similarity of phylogenetic trees. We evaluate the performance of these methods in predicting different types of protein associations when trees based on positions with different characteristics of predicted accessibility are used as input. We found that predicted accessibility improves the results of two recent versions of the mirrortree methodology in predicting direct binary physical interactions, while it neither improves these methods, nor the original mirrortree method, in predicting other types of interactions. That improvement comes at no cost in terms of applicability since accessibility can be predicted for any sequence. We also found that predictions of protein-protein interactions are improved when multiple sequence alignments with a richer representation of sequences (including paralogs) are incorporated in the accessibility prediction.
Data-Driven Sequence of Changes to Anatomical Brain Connectivity in Sporadic Alzheimer's Disease.
Oxtoby, Neil P; Garbarino, Sara; Firth, Nicholas C; Warren, Jason D; Schott, Jonathan M; Alexander, Daniel C
2017-01-01
Model-based investigations of transneuronal spreading mechanisms in neurodegenerative diseases relate the pattern of pathology severity to the brain's connectivity matrix, which reveals information about how pathology propagates through the connectivity network. Such network models typically use networks based on functional or structural connectivity in young and healthy individuals, and only end-stage patterns of pathology, thereby ignoring/excluding the effects of normal aging and disease progression. Here, we examine the sequence of changes in the elderly brain's anatomical connectivity over the course of a neurodegenerative disease. We do this in a data-driven manner that is not dependent upon clinical disease stage, by using event-based disease progression modeling. Using data from the Alzheimer's Disease Neuroimaging Initiative dataset, we sequence the progressive decline of anatomical connectivity, as quantified by graph-theory metrics, in the Alzheimer's disease brain. Ours is the first single model to contribute to understanding all three of the nature, the location, and the sequence of changes to anatomical connectivity in the human brain due to Alzheimer's disease. Our experimental results reveal new insights into Alzheimer's disease: that degeneration of anatomical connectivity in the brain may be a viable, even early, biomarker and should be considered when studying such neurodegenerative diseases.
Jiang, Yue; Turinsky, Andrei L.; Brudno, Michael
2015-01-01
With the development of High-Throughput Sequencing (HTS) thousands of human genomes have now been sequenced. Whenever different studies analyze the same genome they usually agree on the amount of single-nucleotide polymorphisms, but differ dramatically on the number of insertion and deletion variants (indels). Furthermore, there is evidence that indels are often severely under-reported. In this manuscript we derive the total number of indel variants in a human genome by combining data from different sequencing technologies, while assessing the indel detection accuracy. Our estimate of approximately 1 million indels in a Yoruban genome is much higher than the results reported in several recent HTS studies. We identify two key sources of difficulties in indel detection: the insufficient coverage, read length or alignment quality; and the presence of repeats, including short interspersed elements and homopolymers/dimers. We quantify the effect of these factors on indel detection. The quality of sequencing data plays a major role in improving indel detection by HTS methods. However, many indels exist in long homopolymers and repeats, where their detection is severely impeded. The true number of indel events is likely even higher than our current estimates, and new techniques and technologies will be required to detect them. PMID:26130710
Directed evolution of the TALE N-terminal domain for recognition of all 5' bases.
Lamb, Brian M; Mercer, Andrew C; Barbas, Carlos F
2013-11-01
Transcription activator-like effector (TALE) proteins can be designed to bind virtually any DNA sequence. General guidelines for design of TALE DNA-binding domains suggest that the 5'-most base of the DNA sequence bound by the TALE (the N0 base) should be a thymine. We quantified the N0 requirement by analysis of the activities of TALE transcription factors (TALE-TF), TALE recombinases (TALE-R) and TALE nucleases (TALENs) with each DNA base at this position. In the absence of a 5' T, we observed decreases in TALE activity up to >1000-fold in TALE-TF activity, up to 100-fold in TALE-R activity and up to 10-fold reduction in TALEN activity compared with target sequences containing a 5' T. To develop TALE architectures that recognize all possible N0 bases, we used structure-guided library design coupled with TALE-R activity selections to evolve novel TALE N-terminal domains to accommodate any N0 base. A G-selective domain and broadly reactive domains were isolated and characterized. The engineered TALE domains selected in the TALE-R format demonstrated modularity and were active in TALE-TF and TALEN architectures. Evolved N-terminal domains provide effective and unconstrained TALE-based targeting of any DNA sequence as TALE binding proteins and designer enzymes.
Gijsbertse, Kaj; Goselink, Rianne; Lassche, Saskia; Nillesen, Maartje; Sprengers, André; Verdonschot, Nico; van Alfen, Nens; de Korte, Chris
2017-11-01
A need exists for biomarkers to diagnose, quantify and longitudinally follow facioscapulohumeral muscular dystrophy (FSHD) and many other neuromuscular disorders. Furthermore, the pathophysiological mechanisms leading to muscle weakness in most neuromuscular disorders are not completely understood. Dynamic ultrasound imaging (B-mode image sequences) in combination with speckle tracking is an easy, applicable and patient-friendly imaging tool to visualize and quantify muscle deformation. This dynamic information provides insight in the pathophysiological mechanisms and may help to distinguish the various stages of diseased muscle in FSHD. In this proof-of-principle study, we applied a speckle tracking technique to 2-D ultrasound image sequences to quantify the deformation of the tibialis anterior muscle in patients with FSHD and in healthy controls. The resulting deformation patterns were compared with muscle ultrasound echo intensity analysis (a measure of fat infiltration and dystrophy) and clinical outcome measures. Of the four FSHD patients, two patients had severe peroneal weakness and two patients had mild peroneal weakness on clinical examination. We found a markedly varied muscle deformation pattern between these groups: patients with severe peroneal weakness showed a different motion pattern of the tibialis anterior, with overall less displacement of the central tendon region, while healthy patients showed a non-uniform displacement pattern, with the central aponeurosis showing the largest displacement. Hence, dynamic muscle ultrasound of the tibialis anterior muscle in patients with FSHD revealed a distinctively different tissue deformation pattern among persons with and without tibialis anterior weakness. These findings could clarify the understanding of the pathophysiology of muscle weakness in FSHD patients. In addition, the change in muscle deformation shows good correlation with clinical measures and quantitative muscle ultrasound measurements. In conclusion, dynamic ultrasound in combination with speckle tracking allows the study of the effects of muscle pathology in relation to strength, force transmission and movement generation. Although further research is required, this technique can develop into a biomarker to quantify muscle disease severity. Copyright © 2017 World Federation for Ultrasound in Medicine & Biology. Published by Elsevier Inc. All rights reserved.
Inferring Fitness Effects from Time-Resolved Sequence Data with a Delay-Deterministic Model
Nené, Nuno R.; Dunham, Alistair S.; Illingworth, Christopher J. R.
2018-01-01
A common challenge arising from the observation of an evolutionary system over time is to infer the magnitude of selection acting upon a specific genetic variant, or variants, within the population. The inference of selection may be confounded by the effects of genetic drift in a system, leading to the development of inference procedures to account for these effects. However, recent work has suggested that deterministic models of evolution may be effective in capturing the effects of selection even under complex models of demography, suggesting the more general application of deterministic approaches to inference. Responding to this literature, we here note a case in which a deterministic model of evolution may give highly misleading inferences, resulting from the nondeterministic properties of mutation in a finite population. We propose an alternative approach that acts to correct for this error, and which we denote the delay-deterministic model. Applying our model to a simple evolutionary system, we demonstrate its performance in quantifying the extent of selection acting within that system. We further consider the application of our model to sequence data from an evolutionary experiment. We outline scenarios in which our model may produce improved results for the inference of selection, noting that such situations can be easily identified via the use of a regular deterministic model. PMID:29500183
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gallaher, Sean D.; Fitz-Gibbon, Sorel T.; Strenkert, Daniela
Chlamydomonas reinhardtii is a unicellular chlorophyte alga that is widely studied as a reference organism for understanding photosynthesis, sensory and motile cilia, and for development of an algal-based platform for producing biofuels and bio-products. Its highly repetitive, ~205-kbp circular chloroplast genome and ~15.8-kbp linear mitochondrial genome were sequenced prior to the advent of high-throughput sequencing technologies. Here, high coverage shotgun sequencing was used to assemble both organellar genomes de novo. These new genomes correct dozens of errors in the prior genome sequences and annotations. Gen-ome sequencing coverage indicates that each cell contains on average 83 copies of the chloroplast genomemore » and 130 copies of the mitochondrial genome. Using protocols and analyses optimized for organellar tran-scripts, RNA-Seq was used to quantify their relative abundances across 12 different growth conditions. Forty-six percent of total cellular mRNA is attributable to high expression from a few dozen chloroplast genes. RNA-Seq data were used to guide gene annotation, to demonstrate polycistronic gene expression, and to quantify splicing of psaA and psbA introns. In contrast to a conclusion from a recent study, we found that chloroplast transcripts are not edited. Unexpectedly, cytosine-rich polynucleotide tails were observed at the 3’-end of all mitochondrial transcripts. A comparative genomics analysis of eight laboratory strains and 11 wild isolates of C. reinhardtii identified 2658 variants in the organellargenomes, which is 1/10th as much genetic diversity as is found in the nucleus.« less
Ho, Jessica; Firmalino, Miracle Vania; Anbarani, Afarin Golabgir; Takesh, Thair; Epstein, Joel; Wilder-Smith, Petra
2017-02-01
The goal of this study was to identify the in vivo effects in patients with hyposalivation of a novel slowly dissolving adhering test disc upon on enamel remineralization, oral biofilm, salivary production, pH and buffering, gingival health, and on self-evaluation of oral well-being. Five subjects with xerostomia wore custom made retainers carrying 5 demineralized enamel chips for periods of 1 week each. In 1 study arm, subjects used the test agent plus oral hygiene self-care; in the other they used oral hygiene self-care only, with a 1 week washout in between arms. The treatment sequence was randomized. Before and after each study arm Plaque Index (PI), Gingival Index (GI) and Sulcus Bleeding Index (mSBI) were recorded. Clinical plaque staining was quantified using digital image analysis. Saliva production, pH and buffering capacity were recorded. Subjects completed a self-evaluation questionnaire for oral comfort. Enamel samples underwent standardized Knoop microhardness testing to quantify mineralization status. Plaque presence and clinical Plaque Indices decreased significantly with test agent use (p<0.05). Five-minute saliva production almost doubled 10 and 40 minutes after oral test disc insertion (significant, p<0.05). Salivary pH buffering improved in 4/5 subjects with disc use. All demineralized tooth samples re-hardened intraorally (p>0.05). The discs favorably impacted eating problems and dental sensitivity. Subjects were positive about disc flavor and mouth feel. Using established in vivo techniques, the effects of a novel product in xerostomic patients were evaluated and quantified. The adhering disc facilitated eating, reduced dental sensitivity, improved saliva production and buffering capacity, reduced plaque, and alleviated xerostomia symptoms. Xerostomia management is challenging. A novel dry mouth disc was effective in alleviating dry mouth symptoms.
Empirical fitness models for hepatitis C virus immunogen design
NASA Astrophysics Data System (ADS)
Hart, Gregory R.; Ferguson, Andrew L.
2015-12-01
Hepatitis C virus (HCV) afflicts 170 million people worldwide, 2%-3% of the global population, and kills 350 000 each year. Prophylactic vaccination offers the most realistic and cost effective hope of controlling this epidemic in the developing world where expensive drug therapies are not available. Despite 20 years of research, the high mutability of the virus and lack of knowledge of what constitutes effective immune responses have impeded development of an effective vaccine. Coupling data mining of sequence databases with spin glass models from statistical physics, we have developed a computational approach to translate clinical sequence databases into empirical fitness landscapes quantifying the replicative capacity of the virus as a function of its amino acid sequence. These landscapes explicitly connect viral genotype to phenotypic fitness, and reveal vulnerable immunological targets within the viral proteome that can be exploited to rationally design vaccine immunogens. We have recovered the empirical fitness landscape for the HCV RNA-dependent RNA polymerase (protein NS5B) responsible for viral genome replication, and validated the predictions of our model by demonstrating excellent accord with experimental measurements and clinical observations. We have used our landscapes to perform exhaustive in silico screening of 16.8 million T-cell immunogen candidates to identify 86 optimal formulations. By reducing the search space of immunogen candidates by over five orders of magnitude, our approach can offer valuable savings in time, expense, and labor for experimental vaccine development and accelerate the search for a HCV vaccine. Abbreviations: HCV—hepatitis C virus, HLA—human leukocyte antigen, CTL—cytotoxic T lymphocyte, NS5B—nonstructural protein 5B, MSA—multiple sequence alignment, PEG-IFN—pegylated interferon.
Genomes: At the edge of chaos with maximum information capacity
NASA Astrophysics Data System (ADS)
Kong, Sing-Guan; Chen, Hong-Da; Torda, Andrew; Lee, H. C.
2016-12-01
We propose an order index, ϕ, which quantifies the notion of “life at the edge of chaos” when applied to genome sequences. It maps genomes to a number from 0 (random and of infinite length) to 1 (fully ordered) and applies regardless of sequence length and base composition. The 786 complete genomic sequences in GenBank were found to have ϕ values in a very narrow range, 0.037 ± 0.027. We show this implies that genomes are halfway towards being completely random, namely, at the edge of chaos. We argue that this narrow range represents the neighborhood of a fixed-point in the space of sequences, and genomes are driven there by the dynamics of a robust, predominantly neutral evolution process.
Messmer, Bradley T; Raphael, Benjamin J; Aerni, Sarah J; Widhopf, George F; Rassenti, Laura Z; Gribben, John G; Kay, Neil E; Kipps, Thomas J
2009-01-01
The leukemia cells of unrelated patients with chronic lymphocytic leukemia (CLL) display a restricted repertoire of immunoglobulin (Ig) gene rearrangements with preferential usage of certain Ig gene segments. We developed a computational method to rigorously quantify biases in Ig sequence similarity in large patient databases and to identify groups of patients with unusual levels of sequence similarity. We applied our method to sequences from 1577 CLL patients through the CLL Research Consortium (CRC), and identified 67 similarity groups into which roughly 20% of all patients could be assigned. Immunoglobulin light chain class was highly correlated within all groups and light chain gene usage was similar within sets. Surprisingly, over 40% of the identified groups were composed of somatically mutated genes. This study significantly expands the evidence that antigen selection shapes the Ig repertoire in CLL. PMID:18640719
Messmer, Bradley T; Raphael, Benjamin J; Aerni, Sarah J; Widhopf, George F; Rassenti, Laura Z; Gribben, John G; Kay, Neil E; Kipps, Thomas J
2009-03-01
The leukemia cells of unrelated patients with chronic lymphocytic leukemia (CLL) display a restricted repertoire of immunoglobulin (Ig) gene rearrangements with preferential usage of certain Ig gene segments. We developed a computational method to rigorously quantify biases in Ig sequence similarity in large patient databases and to identify groups of patients with unusual levels of sequence similarity. We applied our method to sequences from 1577 CLL patients through the CLL Research Consortium (CRC), and identified 67 similarity groups into which roughly 20% of all patients could be assigned. Immunoglobulin light chain class was highly correlated within all groups and light chain gene usage was similar within sets. Surprisingly, over 40% of the identified groups were composed of somatically mutated genes. This study significantly expands the evidence that antigen selection shapes the Ig repertoire in CLL.
SU-F-T-540: Comprehensive Fluence Delivery Optimization with Multileaf Collimation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Weppler, S; Villarreal-Barajas, J; Department of Medical Physics, Tom Baker Cancer Center, Calgary, Alberta
2016-06-15
Purpose: Multileaf collimator (MLC) leaf sequencing is performed via commercial black-box implementations, on which a user has limited to no access. We have developed an explicit, generic MLC sequencing model to serve as a tool for future investigations of fluence map optimization, fluence delivery optimization, and rotational collimator delivery methods. Methods: We have developed a novel, comprehensive model to effectively account for a variety of transmission and penumbra effects previously treated on an ad hoc basis in the literature. As the model is capable of quantifying a variety of effects, we utilize the asymmetric leakage intensity across each leaf tomore » deliver fluence maps with pixel size smaller than the narrowest leaf width. Developed using linear programming and mixed integer programming formulations, the model is implemented using state of the art open-source solvers. To demonstrate the versatility of the algorithm, a graphical user interface (GUI) was developed in MATLAB capable of accepting custom leaf specifications and transmission parameters. As a preliminary proof-ofconcept, we have sequenced the leaves of a Varian 120 Leaf Millennium MLC for five prostate cancer patient fields and one head and neck field. Predetermined fluence maps have been processed by data smoothing methods to obtain pixel sizes of 2.5 cm{sup 2}. The quality of output was analyzed using computer simulations. Results: For the prostate fields, an average root mean squared error (RMSE) of 0.82 and gamma (0.5mm/0.5%) of 91.4% were observed compared to RMSE and gamma (0.5mm/0.5%) values of 7.04 and 34.0% when the leakage considerations were omitted. Similar results were observed for the head and neck case. Conclusion: A model to sequence MLC leaves to optimality has been proposed. Future work will involve extensive testing and evaluation of the method on clinical MLCs and comparison with black-box leaf sequencing algorithms currently used by commercial treatment planning systems.« less
The US EPA has patented a mold ID technology (#6,387,652) licensed by 15 companies in the US and EU. This technology is based upon DNA sequences. In conjunction with HUD, this technology will be used in a National Survey of Homes.
Chávez Montes, Ricardo A; de Fátima Rosas-Cárdenas, Flor; De Paoli, Emanuele; Accerbi, Monica; Rymarquis, Linda A; Mahalingam, Gayathri; Marsch-Martínez, Nayelli; Meyers, Blake C; Green, Pamela J; de Folter, Stefan
2014-04-23
Small RNAs are pivotal regulators of gene expression that guide transcriptional and post-transcriptional silencing mechanisms in eukaryotes, including plants. Here we report a comprehensive atlas of sRNA and miRNA from 3 species of algae and 31 representative species across vascular plants, including non-model plants. We sequence and quantify sRNAs from 99 different tissues or treatments across species, resulting in a data set of over 132 million distinct sequences. Using miRBase mature sequences as a reference, we identify the miRNA sequences present in these libraries. We apply diverse profiling methods to examine critical sRNA and miRNA features, such as size distribution, tissue-specific regulation and sequence conservation between species, as well as to predict putative new miRNA sequences. We also develop database resources, computational analysis tools and a dedicated website, http://smallrna.udel.edu/. This study provides new insights on plant sRNAs and miRNAs, and a foundation for future studies.
Simulating nailfold capillaroscopy sequences to evaluate algorithms for blood flow estimation.
Tresadern, P A; Berks, M; Murray, A K; Dinsdale, G; Taylor, C J; Herrick, A L
2013-01-01
The effects of systemic sclerosis (SSc)--a disease of the connective tissue causing blood flow problems that can require amputation of the fingers--can be observed indirectly by imaging the capillaries at the nailfold, though taking quantitative measures such as blood flow to diagnose the disease and monitor its progression is not easy. Optical flow algorithms may be applied, though without ground truth (i.e. known blood flow) it is hard to evaluate their accuracy. We propose an image model that generates realistic capillaroscopy videos with known flow, and use this model to quantify the effect of flow rate, cell density and contrast (among others) on estimated flow. This resource will help researchers to design systems that are robust under real-world conditions.
Killian, Tobias; Dickopf, Steffen; Haas, Alexander K; Kirstenpfad, Claudia; Mayer, Klaus; Brinkmann, Ulrich
2017-11-13
We have devised an effective and robust method for the characterization of gene-editing events. The efficacy of editing-mediated mono- and bi-allelic gene inactivation and integration events is quantified based on colony counts. The combination of diphtheria toxin (DT) and puromycin (PM) selection enables analyses of 10,000-100,000 individual cells, assessing hundreds of clones with inactivated genes per experiment. Mono- and bi-allelic gene inactivation is differentiated by DT resistance, which occurs only upon bi-allelic inactivation. PM resistance indicates integration. The robustness and generalizability of the method were demonstrated by quantifying the frequency of gene inactivation and cassette integration under different editing approaches: CRISPR/Cas9-mediated complete inactivation was ~30-50-fold more frequent than cassette integration. Mono-allelic inactivation without integration occurred >100-fold more frequently than integration. Assessment of gRNA length confirmed 20mers to be most effective length for inactivation, while 16-18mers provided the highest overall integration efficacy. The overall efficacy was ~2-fold higher for CRISPR/Cas9 than for zinc-finger nuclease and was significantly increased upon modulation of non-homologous end joining or homology-directed repair. The frequencies and ratios of editing events were similar for two different DPH genes (independent of the target sequence or chromosomal location), which indicates that the optimization parameters identified with this method can be generalized.
Microfluidic single-cell whole-transcriptome sequencing.
Streets, Aaron M; Zhang, Xiannian; Cao, Chen; Pang, Yuhong; Wu, Xinglong; Xiong, Liang; Yang, Lu; Fu, Yusi; Zhao, Liang; Tang, Fuchou; Huang, Yanyi
2014-05-13
Single-cell whole-transcriptome analysis is a powerful tool for quantifying gene expression heterogeneity in populations of cells. Many techniques have, thus, been recently developed to perform transcriptome sequencing (RNA-Seq) on individual cells. To probe subtle biological variation between samples with limiting amounts of RNA, more precise and sensitive methods are still required. We adapted a previously developed strategy for single-cell RNA-Seq that has shown promise for superior sensitivity and implemented the chemistry in a microfluidic platform for single-cell whole-transcriptome analysis. In this approach, single cells are captured and lysed in a microfluidic device, where mRNAs with poly(A) tails are reverse-transcribed into cDNA. Double-stranded cDNA is then collected and sequenced using a next generation sequencing platform. We prepared 94 libraries consisting of single mouse embryonic cells and technical replicates of extracted RNA and thoroughly characterized the performance of this technology. Microfluidic implementation increased mRNA detection sensitivity as well as improved measurement precision compared with tube-based protocols. With 0.2 M reads per cell, we were able to reconstruct a majority of the bulk transcriptome with 10 single cells. We also quantified variation between and within different types of mouse embryonic cells and found that enhanced measurement precision, detection sensitivity, and experimental throughput aided the distinction between biological variability and technical noise. With this work, we validated the advantages of an early approach to single-cell RNA-Seq and showed that the benefits of combining microfluidic technology with high-throughput sequencing will be valuable for large-scale efforts in single-cell transcriptome analysis.
Erceg, Jelena; Saunders, Timothy E.; Girardot, Charles; Devos, Damien P.; Hufnagel, Lars; Furlong, Eileen E. M.
2014-01-01
Deciphering the specific contribution of individual motifs within cis-regulatory modules (CRMs) is crucial to understanding how gene expression is regulated and how this process is affected by sequence variation. But despite vast improvements in the ability to identify where transcription factors (TFs) bind throughout the genome, we are limited in our ability to relate information on motif occupancy to function from sequence alone. Here, we engineered 63 synthetic CRMs to systematically assess the relationship between variation in the content and spacing of motifs within CRMs to CRM activity during development using Drosophila transgenic embryos. In over half the cases, very simple elements containing only one or two types of TF binding motifs were capable of driving specific spatio-temporal patterns during development. Different motif organizations provide different degrees of robustness to enhancer activity, ranging from binary on-off responses to more subtle effects including embryo-to-embryo and within-embryo variation. By quantifying the effects of subtle changes in motif organization, we were able to model biophysical rules that explain CRM behavior and may contribute to the spatial positioning of CRM activity in vivo. For the same enhancer, the effects of small differences in motif positions varied in developmentally related tissues, suggesting that gene expression may be more susceptible to sequence variation in one tissue compared to another. This result has important implications for human eQTL studies in which many associated mutations are found in cis-regulatory regions, though the mechanism for how they affect tissue-specific gene expression is often not understood. PMID:24391522
Bautista-de Los Santos, Quyen Melina; Schroeder, Joanna L; Blakemore, Oliver; Moses, Jonathan; Haffey, Mark; Sloan, William; Pinto, Ameet J
2016-03-01
High-throughput and deep DNA sequencing, particularly amplicon sequencing, is being increasingly utilized to reveal spatial and temporal dynamics of bacterial communities in drinking water systems. Whilst the sampling and methodological biases associated with PCR and sequencing have been studied in other environments, they have not been quantified for drinking water. These biases are likely to have the greatest effect on the ability to characterize subtle spatio-temporal patterns influenced by process/environmental conditions. In such cases, intra-sample variability may swamp any underlying small, systematic variation. To evaluate this, we undertook a study with replication at multiple levels including sampling sites, sample collection, PCR amplification, and high throughput sequencing of 16S rRNA amplicons. The variability inherent to the PCR amplification and sequencing steps is significant enough to mask differences between bacterial communities from replicate samples. This was largely driven by greater variability in detection of rare bacteria (relative abundance <0.01%) across PCR/sequencing replicates as compared to replicate samples. Despite this, we captured significant changes in bacterial community over diurnal time-scales and find that the extent and pattern of diurnal changes is specific to each sampling location. Further, we find diurnal changes in bacterial community arise due to differences in the presence/absence of the low abundance bacteria and changes in the relative abundance of dominant bacteria. Finally, we show that bacterial community composition is significantly different across sampling sites for time-periods during which there are typically rapid changes in water use. This suggests hydraulic changes (driven by changes in water demand) contribute to shaping the bacterial community in bulk drinking water over diurnal time-scales. Copyright © 2015 Elsevier Ltd. All rights reserved.
Dong, Yun-Wei; Liao, Ming-Ling; Meng, Xian-Liang; Somero, George N
2018-02-06
Orthologous proteins of species adapted to different temperatures exhibit differences in stability and function that are interpreted to reflect adaptive variation in structural "flexibility." However, quantifying flexibility and comparing flexibility across proteins has remained a challenge. To address this issue, we examined temperature effects on cytosolic malate dehydrogenase (cMDH) orthologs from differently thermally adapted congeners of five genera of marine molluscs whose field body temperatures span a range of ∼60 °C. We describe consistent patterns of convergent evolution in adaptation of function [temperature effects on K M of cofactor (NADH)] and structural stability (rate of heat denaturation of activity). To determine how these differences depend on flexibilities of overall structure and of regions known to be important in binding and catalysis, we performed molecular dynamics simulation (MDS) analyses. MDS analyses revealed a significant negative correlation between adaptation temperature and heat-induced increase of backbone atom movements [root mean square deviation (rmsd) of main-chain atoms]. Root mean square fluctuations (RMSFs) of movement by individual amino acid residues varied across the sequence in a qualitatively similar pattern among orthologs. Regions of sequence involved in ligand binding and catalysis-termed mobile regions 1 and 2 (MR1 and MR2), respectively-showed the largest values for RMSF. Heat-induced changes in RMSF values across the sequence and, importantly, in MR1 and MR2 were greatest in cold-adapted species. MDS methods are shown to provide powerful tools for examining adaptation of enzymes by providing a quantitative index of protein flexibility and identifying sequence regions where adaptive change in flexibility occurs.
Effect of PRESS and STEAM sequences on magnetic resonance spectroscopic liver fat quantification.
Hamilton, Gavin; Middleton, Michael S; Bydder, Mark; Yokoo, Takeshi; Schwimmer, Jeffrey B; Kono, Yuko; Patton, Heather M; Lavine, Joel E; Sirlin, Claude B
2009-07-01
To compare PRESS and STEAM MR spectroscopy for assessment of liver fat in human subjects. Single-voxel (20 x 20 x 20 mm) PRESS and STEAM spectra were obtained at 1.5T in 49 human subjects with known or suspected fatty liver disease. PRESS and STEAM sequences were obtained with fixed TR (1500 msec) and different TE (five PRESS spectra between TE 30-70 msec, five STEAM spectra between TE 20-60 msec). Spectra were quantified and T2 and T2-corrected peak area were calculated by different techniques. The values were compared for PRESS and STEAM. Water T2 values from PRESS and STEAM were not significantly different (P = 0.33). Fat peak T2s were 25%-50% shorter on PRESS than on STEAM (P < 0.02 for all comparisons) and there was no correlation between T2s of individual peaks. PRESS systematically overestimated the relative fat peak areas (by 7%-263%) compared to STEAM (P < 0.005 for all comparisons). The peak area given by PRESS was more dependent on the T2-correction technique than STEAM. Measured liver fat depends on the MRS sequence used. Compared to STEAM, PRESS underestimates T2 values of fat, overestimates fat fraction, and provides a less consistent fat fraction estimate, probably due to J coupling effects. (c) 2009 Wiley-Liss, Inc.
Fu, Liezhen; Wen, Luan; Luu, Nga; Shi, Yun-Bo
2016-01-01
Genome editing with designer nucleases such as TALEN and CRISPR/Cas enzymes has broad applications. Delivery of these designer nucleases into organisms induces various genetic mutations including deletions, insertions and nucleotide substitutions. Characterizing those mutations is critical for evaluating the efficacy and specificity of targeted genome editing. While a number of methods have been developed to identify the mutations, none other than sequencing allows the identification of the most desired mutations, i.e., out-of-frame insertions/deletions that disrupt genes. Here we report a simple and efficient method to visualize and quantify the efficiency of genomic mutations induced by genome-editing. Our approach is based on the expression of a two-color fusion protein in a vector that allows the insertion of the edited region in the genome in between the two color moieties. We show that our approach not only easily identifies developing animals with desired mutations but also efficiently quantifies the mutation rate in vivo. Furthermore, by using LacZα and GFP as the color moieties, our approach can even eliminate the need for a fluorescent microscope, allowing the analysis with simple bright field visualization. Such an approach will greatly simplify the screen for effective genome-editing enzymes and identify the desired mutant cells/animals. PMID:27748423
Revisiting Statistical Aspects of Nuclear Material Accounting
Burr, T.; Hamada, M. S.
2013-01-01
Nuclear material accounting (NMA) is the only safeguards system whose benefits are routinely quantified. Process monitoring (PM) is another safeguards system that is increasingly used, and one challenge is how to quantify its benefit. This paper considers PM in the role of enabling frequent NMA, which is referred to as near-real-time accounting (NRTA). We quantify NRTA benefits using period-driven and data-driven testing. Period-driven testing makes a decision to alarm or not at fixed periods. Data-driven testing decides as the data arrives whether to alarm or continue testing. The difference between period-driven and datad-riven viewpoints is illustrated by using one-year andmore » two-year periods. For both one-year and two-year periods, period-driven NMA using once-per-year cumulative material unaccounted for (CUMUF) testing is compared to more frequent Shewhart and joint sequential cusum testing using either MUF or standardized, independently transformed MUF (SITMUF) data. We show that the data-driven viewpoint is appropriate for NRTA and that it can be used to compare safeguards effectiveness. In addition to providing period-driven and data-driven viewpoints, new features include assessing the impact of uncertainty in the estimated covariance matrix of the MUF sequence and the impact of both random and systematic measurement errors.« less
Ayala, D I; Chen, J C; Bugarel, M; Loneragan, G H; den Bakker, H C; Kottapalli, K R; Brashears, M M; Nightingale, K K
2018-04-17
Lactobacillus animalis NP51 is a direct-fed microbial strain (DFM) extensively used as a pre-harvest food safety mitigation in feedlot cattle due to its antagonistic effects against human foodborne pathogens such as Salmonella and Escherichia coli O157:H7. NP51 not only promotes overall gut health but interferes with the ability of these pathogens to colonize the gastrointestinal tract of cattle. As a result, NP51 reduces fecal shedding of Salmonella and E. coli O157:H7 in cattle presented for harvest and the load of these pathogens that enter the human food chain. Cattle are administered a high dose (1 × 10 9 CFU/head/day) of NP51 to reduce fecal shedding of foodborne pathogens. Ensiled animal feedstuffs naturally contain a high load of lactic acid bacteria (LAB) and it is not possible to detect and quantify the level of a specific LAB strain (e.g., NP51) in this matrix using traditional microbiological culture. The purpose of this study was to develop a molecular method to detect and quantify viable populations of a specific LAB strain (e.g., NP51) in cattle feedstuffs. The NP51 whole genome sequence was aligned with closely related LAB clustering within the same well-supported clade in a LAB phylogeny derived from 30 conserved amino acid encoding sequence to identify orthologs. A sequence encoding recombinational DNA repair protein RecT was found to be unique to NP51 and used to design primers and a probe for molecular detection and quantification of NP51. The primers and probe were confirmed to be specific to NP51 in vitro. Total RNA was extracted from silage samples, including samples naturally inoculated in the field and control samples that were artificially spiked with a range of NP51 concentrations in the laboratory. Reverse-transcriptase quantitative real-time (RT-qRTi) PCR was used to quantify cDNA copies in samples and cycle threshold (Ct) values were compared to a standard curve to estimate NP51 concentrations. Our results indicate this novel molecular method is suitable to confirm the presence and estimate the concentration of a specific LAB strain in animal feedstuffs containing high background levels of LAB. Copyright © 2018. Published by Elsevier B.V.
Stephenson, Kathryn E.; Neubauer, George H.; Reimer, Ulf; ...
2014-11-14
An effective vaccine against human immunodeficiency virus type 1 (HIV-1) will have to provide protection against a vast array of different HIV-1 strains. Current methods to measure HIV-1-specific binding antibodies following immunization typically focus on determining the magnitude of antibody responses, but the epitope diversity of antibody responses has remained largely unexplored. Here we describe the development of a global HIV-1 peptide microarray that contains 6564 peptides from across the HIV-1 proteome and covers the majority of HIV-1 sequences in the Los Alamos National Laboratory global HIV-1 sequence database. Using this microarray, we quantified the magnitude, breadth, and depth ofmore » IgG binding to linear HIV-1 sequences in HIV-1-infected humans and HIV-1-vaccinated humans, rhesus monkeys and guinea pigs. The microarray measured potentially important differences in antibody epitope diversity, particularly regarding the depth of epitope variants recognized at each binding site. Our data suggest that the global HIV-1 peptide microarray may be a useful tool for both preclinical and clinical HIV-1 research.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stephenson, Kathryn E.; Neubauer, George H.; Reimer, Ulf
An effective vaccine against human immunodeficiency virus type 1 (HIV-1) will have to provide protection against a vast array of different HIV-1 strains. Current methods to measure HIV-1-specific binding antibodies following immunization typically focus on determining the magnitude of antibody responses, but the epitope diversity of antibody responses has remained largely unexplored. Here we describe the development of a global HIV-1 peptide microarray that contains 6564 peptides from across the HIV-1 proteome and covers the majority of HIV-1 sequences in the Los Alamos National Laboratory global HIV-1 sequence database. Using this microarray, we quantified the magnitude, breadth, and depth ofmore » IgG binding to linear HIV-1 sequences in HIV-1-infected humans and HIV-1-vaccinated humans, rhesus monkeys and guinea pigs. The microarray measured potentially important differences in antibody epitope diversity, particularly regarding the depth of epitope variants recognized at each binding site. Our data suggest that the global HIV-1 peptide microarray may be a useful tool for both preclinical and clinical HIV-1 research.« less
Woods, J P; Heinecke, E L; Goldman, W E
1998-04-01
We developed an efficient electrotransformation system for the pathogenic fungus Histoplasma capsulatum and used it to examine the effects of features of the transforming DNA on transformation efficiency and fate of the transforming DNA and to demonstrate fungal expression of two recombinant Escherichia coli genes, hph and lacZ. Linearized DNA and plasmids containing Histoplasma telomeric sequences showed the greatest transformation efficiencies, while the plasmid vector had no significant effect, nor did the derivation of the selectable URA5 marker (native Histoplasma gene or a heterologous Podospora anserina gene). Electrotransformation resulted in more frequent multimerization, other modification, or possibly chromosomal integration of transforming telomeric plasmids when saturating amounts of DNA were used, but this effect was not observed with smaller amounts of transforming DNA. We developed another selection system using a hygromycin B resistance marker from plasmid pAN7-1, consisting of the E. coli hph gene flanked by Aspergillus nidulans promoter and terminator sequences. Much of the heterologous fungal sequences could be removed without compromising function in H. capsulatum, allowing construction of a substantially smaller effective marker fragment. Transformation efficiency increased when nonselective conditions were maintained for a time after electrotransformation before selection with the protein synthesis inhibitor hygromycin B was imposed. Finally, we constructed a readily detectable and quantifiable reporter gene by fusing Histoplasma URA5 with E. coli lacZ, resulting in expression of functional beta-galactosidase in H. capsulatum. Demonstration of expression of bacterial genes as effective selectable markers and reporters, together with a highly efficient electrotransformation system, provide valuable approaches for molecular genetic analysis and manipulation of H. capsulatum, which have proven useful for examination of targeted gene disruption, regulated gene expression, and potential virulence determinants in this fungus.
Optimizing complex phenotypes through model-guided multiplex genome engineering
Kuznetsov, Gleb; Goodman, Daniel B.; Filsinger, Gabriel T.; ...
2017-05-25
Here, we present a method for identifying genomic modifications that optimize a complex phenotype through multiplex genome engineering and predictive modeling. We apply our method to identify six single nucleotide mutations that recover 59% of the fitness defect exhibited by the 63-codon E. coli strain C321.ΔA. By introducing targeted combinations of changes in multiplex we generate rich genotypic and phenotypic diversity and characterize clones using whole-genome sequencing and doubling time measurements. Regularized multivariate linear regression accurately quantifies individual allelic effects and overcomes bias from hitchhiking mutations and context-dependence of genome editing efficiency that would confound other strategies.
Optimizing complex phenotypes through model-guided multiplex genome engineering
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kuznetsov, Gleb; Goodman, Daniel B.; Filsinger, Gabriel T.
Here, we present a method for identifying genomic modifications that optimize a complex phenotype through multiplex genome engineering and predictive modeling. We apply our method to identify six single nucleotide mutations that recover 59% of the fitness defect exhibited by the 63-codon E. coli strain C321.ΔA. By introducing targeted combinations of changes in multiplex we generate rich genotypic and phenotypic diversity and characterize clones using whole-genome sequencing and doubling time measurements. Regularized multivariate linear regression accurately quantifies individual allelic effects and overcomes bias from hitchhiking mutations and context-dependence of genome editing efficiency that would confound other strategies.
Probabilistic metrology or how some measurement outcomes render ultra-precise estimates
NASA Astrophysics Data System (ADS)
Calsamiglia, J.; Gendra, B.; Muñoz-Tapia, R.; Bagan, E.
2016-10-01
We show on theoretical grounds that, even in the presence of noise, probabilistic measurement strategies (which have a certain probability of failure or abstention) can provide, upon a heralded successful outcome, estimates with a precision that exceeds the deterministic bounds for the average precision. This establishes a new ultimate bound on the phase estimation precision of particular measurement outcomes (or sequence of outcomes). For probe systems subject to local dephasing, we quantify such precision limit as a function of the probability of failure that can be tolerated. Our results show that the possibility of abstaining can set back the detrimental effects of noise.
Weirather, Jason L; Afshar, Pegah Tootoonchi; Clark, Tyson A; Tseng, Elizabeth; Powers, Linda S; Underwood, Jason G; Zabner, Joseph; Korlach, Jonas; Wong, Wing Hung; Au, Kin Fai
2015-10-15
We developed an innovative hybrid sequencing approach, IDP-fusion, to detect fusion genes, determine fusion sites and identify and quantify fusion isoforms. IDP-fusion is the first method to study gene fusion events by integrating Third Generation Sequencing long reads and Second Generation Sequencing short reads. We applied IDP-fusion to PacBio data and Illumina data from the MCF-7 breast cancer cells. Compared with the existing tools, IDP-fusion detects fusion genes at higher precision and a very low false positive rate. The results show that IDP-fusion will be useful for unraveling the complexity of multiple fusion splices and fusion isoforms within tumorigenesis-relevant fusion genes. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Tome, Jacob M; Ozer, Abdullah; Pagano, John M; Gheba, Dan; Schroth, Gary P; Lis, John T
2014-06-01
RNA-protein interactions play critical roles in gene regulation, but methods to quantitatively analyze these interactions at a large scale are lacking. We have developed a high-throughput sequencing-RNA affinity profiling (HiTS-RAP) assay by adapting a high-throughput DNA sequencer to quantify the binding of fluorescently labeled protein to millions of RNAs anchored to sequenced cDNA templates. Using HiTS-RAP, we measured the affinity of mutagenized libraries of GFP-binding and NELF-E-binding aptamers to their respective targets and identified critical regions of interaction. Mutations additively affected the affinity of the NELF-E-binding aptamer, whose interaction depended mainly on a single-stranded RNA motif, but not that of the GFP aptamer, whose interaction depended primarily on secondary structure.
A cloud-based workflow to quantify transcript-expression levels in public cancer compendia
Tatlow, PJ; Piccolo, Stephen R.
2016-01-01
Public compendia of sequencing data are now measured in petabytes. Accordingly, it is infeasible for researchers to transfer these data to local computers. Recently, the National Cancer Institute began exploring opportunities to work with molecular data in cloud-computing environments. With this approach, it becomes possible for scientists to take their tools to the data and thereby avoid large data transfers. It also becomes feasible to scale computing resources to the needs of a given analysis. We quantified transcript-expression levels for 12,307 RNA-Sequencing samples from the Cancer Cell Line Encyclopedia and The Cancer Genome Atlas. We used two cloud-based configurations and examined the performance and cost profiles of each configuration. Using preemptible virtual machines, we processed the samples for as little as $0.09 (USD) per sample. As the samples were processed, we collected performance metrics, which helped us track the duration of each processing step and quantified computational resources used at different stages of sample processing. Although the computational demands of reference alignment and expression quantification have decreased considerably, there remains a critical need for researchers to optimize preprocessing steps. We have stored the software, scripts, and processed data in a publicly accessible repository (https://osf.io/gqrz9). PMID:27982081
MR imaging of a novel NOE-mediated magnetization transfer with water in rat brain at 9.4 T.
Zhang, Xiao-Yong; Wang, Feng; Jin, Tao; Xu, Junzhong; Xie, Jingping; Gochberg, Daniel F; Gore, John C; Zu, Zhongliang
2017-08-01
To detect, map, and quantify a novel nuclear Overhauser enhancement (NOE)-mediated magnetization transfer (MT) with water at approximately -1.6 ppm [NOE(-1.6)] in rat brain using MRI. Continuous wave MT sequences with a variety of radiofrequency irradiation powers were optimized to achieve the maximum contrast of this NOE(-1.6) effect at 9.4 T. The distribution of effect magnitudes, resonance frequency offsets, and line widths in healthy rat brains and the differences of the effect between tumors and contralateral normal brains were imaged and quantified using a multi-Lorentzian fitting method. MR measurements on reconstituted model phospholipids as well as two cell lines (HEK293 and 9L) were also performed to investigate the possible molecular origin of this NOE. Our results suggest that the NOE(-1.6) effect can be detected reliably in rat brain. Pixel-wise fittings demonstrated the regional variations of the effect. Measurements in a rodent tumor model showed that the amplitude of NOE(-1.6) in brain tumor was significantly diminished compared with that in normal brain tissue. Measurements of reconstituted phospholipids suggest that this effect may originate from choline phospholipids. NOE(-1.6) could be used as a new biomarker for the detection of brain tumor. Magn Reson Med 78:588-597, 2017. © 2016 International Society for Magnetic Resonance in Medicine. © 2016 International Society for Magnetic Resonance in Medicine.
AfterQC: automatic filtering, trimming, error removing and quality control for fastq data.
Chen, Shifu; Huang, Tanxiao; Zhou, Yanqing; Han, Yue; Xu, Mingyan; Gu, Jia
2017-03-14
Some applications, especially those clinical applications requiring high accuracy of sequencing data, usually have to face the troubles caused by unavoidable sequencing errors. Several tools have been proposed to profile the sequencing quality, but few of them can quantify or correct the sequencing errors. This unmet requirement motivated us to develop AfterQC, a tool with functions to profile sequencing errors and correct most of them, plus highly automated quality control and data filtering features. Different from most tools, AfterQC analyses the overlapping of paired sequences for pair-end sequencing data. Based on overlapping analysis, AfterQC can detect and cut adapters, and furthermore it gives a novel function to correct wrong bases in the overlapping regions. Another new feature is to detect and visualise sequencing bubbles, which can be commonly found on the flowcell lanes and may raise sequencing errors. Besides normal per cycle quality and base content plotting, AfterQC also provides features like polyX (a long sub-sequence of a same base X) filtering, automatic trimming and K-MER based strand bias profiling. For each single or pair of FastQ files, AfterQC filters out bad reads, detects and eliminates sequencer's bubble effects, trims reads at front and tail, detects the sequencing errors and corrects part of them, and finally outputs clean data and generates HTML reports with interactive figures. AfterQC can run in batch mode with multiprocess support, it can run with a single FastQ file, a single pair of FastQ files (for pair-end sequencing), or a folder for all included FastQ files to be processed automatically. Based on overlapping analysis, AfterQC can estimate the sequencing error rate and profile the error transform distribution. The results of our error profiling tests show that the error distribution is highly platform dependent. Much more than just another new quality control (QC) tool, AfterQC is able to perform quality control, data filtering, error profiling and base correction automatically. Experimental results show that AfterQC can help to eliminate the sequencing errors for pair-end sequencing data to provide much cleaner outputs, and consequently help to reduce the false-positive variants, especially for the low-frequency somatic mutations. While providing rich configurable options, AfterQC can detect and set all the options automatically and require no argument in most cases.
Status of the Microbial Census
Schloss, Patrick D.; Handelsman, Jo
2004-01-01
Over the past 20 years, more than 78,000 16S rRNA gene sequences have been deposited in GenBank and the Ribosomal Database Project, making the 16S rRNA gene the most widely studied gene for reconstructing bacterial phylogeny. While there is a general appreciation that these sequences are largely unique and derived from diverse species of bacteria, there has not been a quantitative attempt to describe the extent of sequencing efforts to date. We constructed rarefaction curves for each bacterial phylum and for the entire bacterial domain to assess the current state of sampling and the relative taxonomic richness of each phylum. This analysis quantifies the general sense among microbiologists that we are a long way from a complete census of the bacteria on Earth. Moreover, the analysis indicates that current sampling strategies might not be the most effective ones to describe novel diversity because there remain numerous phyla that are globally distributed yet poorly sampled. Based on the current level of sampling, it is not possible to estimate the total number of bacterial species on Earth, but the minimum species richness is 35,498. Considering previous global species richness estimates of 107 to 109, we are certain that this estimate will increase with additional sequencing efforts. The data support previous calls for extensive surveys of multiple chemically disparate environments and of specific phylogenetic groups to advance the census most rapidly. PMID:15590780
Directed evolution of the TALE N-terminal domain for recognition of all 5′ bases
Lamb, Brian M.; Mercer, Andrew C.; Barbas, Carlos F.
2013-01-01
Transcription activator-like effector (TALE) proteins can be designed to bind virtually any DNA sequence. General guidelines for design of TALE DNA-binding domains suggest that the 5′-most base of the DNA sequence bound by the TALE (the N0 base) should be a thymine. We quantified the N0 requirement by analysis of the activities of TALE transcription factors (TALE-TF), TALE recombinases (TALE-R) and TALE nucleases (TALENs) with each DNA base at this position. In the absence of a 5′ T, we observed decreases in TALE activity up to >1000-fold in TALE-TF activity, up to 100-fold in TALE-R activity and up to 10-fold reduction in TALEN activity compared with target sequences containing a 5′ T. To develop TALE architectures that recognize all possible N0 bases, we used structure-guided library design coupled with TALE-R activity selections to evolve novel TALE N-terminal domains to accommodate any N0 base. A G-selective domain and broadly reactive domains were isolated and characterized. The engineered TALE domains selected in the TALE-R format demonstrated modularity and were active in TALE-TF and TALEN architectures. Evolved N-terminal domains provide effective and unconstrained TALE-based targeting of any DNA sequence as TALE binding proteins and designer enzymes. PMID:23980031
Bacterial taxa–area and distance–decay relationships in marine environments
Zinger, L; Boetius, A; Ramette, A
2014-01-01
The taxa–area relationship (TAR) and the distance–decay relationship (DDR) both describe spatial turnover of taxa and are central patterns of biodiversity. Here, we compared TAR and DDR of bacterial communities across different marine realms and ecosystems at the global scale. To obtain reliable global estimates for both relationships, we quantified the poorly assessed effects of sequencing depth, rare taxa removal and number of sampling sites. Slope coefficients of bacterial TARs were within the range of those of plants and animals, whereas slope coefficients of bacterial DDR were much lower. Slope coefficients were mostly affected by removing rare taxa and by the number of sampling sites considered in the calculations. TAR and DDR slope coefficients were overestimated at sequencing depth <4000 sequences per sample. Noticeably, bacterial TAR and DDR patterns did not correlate with each other both within and across ecosystem types, suggesting that (i) TAR cannot be directly derived from DDR and (ii) TAR and DDR may be influenced by different ecological factors. Nevertheless, we found marine bacterial TAR and DDR to be steeper in ecosystems associated with high environmental heterogeneity or spatial isolation, namely marine sediments and coastal environments compared with pelagic ecosystems. Hence, our study provides information on macroecological patterns of marine bacteria, as well as methodological and conceptual insights, at a time when biodiversity surveys increasingly make use of high-throughput sequencing technologies. PMID:24460915
Estorninho, Megan; Gibson, Vivienne B; Kronenberg-Versteeg, Deborah; Liu, Yuk-Fun; Ni, Chester; Cerosaletti, Karen; Peakman, Mark
2013-12-01
Extensive diversity in the human repertoire of TCRs for Ag is both a cornerstone of effective adaptive immunity that enables host protection against a multiplicity of pathogens and a weakness that gives rise to potential pathological self-reactivity. The complexity arising from diversity makes detection and tracking of single Ag-specific CD4 T cells (ASTs) involved in these immune responses challenging. We report a tandem, multistep process to quantify rare TCRβ-chain variable sequences of ASTs in large polyclonal populations. The approach combines deep high-throughput sequencing (HTS) within functional CD4 T cell compartments, such as naive/memory cells, with shallow, multiple identifier-based HTS of ASTs identified by activation marker upregulation after short-term Ag stimulation in vitro. We find that clonotypes recognizing HLA class II-restricted epitopes of both pathogen-derived Ags and self-Ags are oligoclonal and typically private. Clonotype tracking within an individual reveals private AST clonotypes resident in the memory population, as would be expected, representing clonal expansions (identical nucleotide sequence; "ultraprivate"). Other AST clonotypes share CDR3β amino acid sequences through convergent recombination and are found in memory populations of multiple individuals. Tandem HTS-based clonotyping will facilitate studying AST dynamics, epitope spreading, and repertoire changes that arise postvaccination and following Ag-specific immunotherapies for cancer and autoimmune disease.
Miller, John J; Eackles, Michael S.; Stauffer, Jay R; King, Timothy L.
2015-01-01
We characterized variation within the mitochondrial genomes of the invasive silver carp (Hypophthalmichthys molitrix) and bighead carp (H. nobilis) from the Mississippi River drainage by mapping our Next-Generation sequences to their publicly available genomes. Variant detection resulted in 338 single-nucleotide polymorphisms for H. molitrix and 39 for H. nobilis. The much greater genetic variation in H. molitrix mitochondria relative to H. nobilis may be indicative of a greater North American female effective population size of the former. When variation was quantified by gene, many tRNA loci appear to have little or no variability based on our results whereas protein-coding regions were more frequently polymorphic. These results provide biologists with additional regions of DNA to be used as markers to study the invasion dynamics of these species.
Rathe, Susan K; Moriarity, Branden S; Stoltenberg, Christopher B; Kurata, Morito; Aumann, Natalie K; Rahrmann, Eric P; Bailey, Natashay J; Melrose, Ellen G; Beckmann, Dominic A; Liska, Chase R; Largaespada, David A
2014-08-13
The evolution from microarrays to transcriptome deep-sequencing (RNA-seq) and from RNA interference to gene knockouts using Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) and Transcription Activator-Like Effector Nucleases (TALENs) has provided a new experimental partnership for identifying and quantifying the effects of gene changes on drug resistance. Here we describe the results from deep-sequencing of RNA derived from two cytarabine (Ara-C) resistance acute myeloid leukemia (AML) cell lines, and present CRISPR and TALEN based methods for accomplishing complete gene knockout (KO) in AML cells. We found protein modifying loss-of-function mutations in Dck in both Ara-C resistant cell lines. CRISPR and TALEN-based KO of Dck dramatically increased the IC₅₀ of Ara-C and introduction of a DCK overexpression vector into Dck KO clones resulted in a significant increase in Ara-C sensitivity. This effort demonstrates the power of using transcriptome analysis and CRISPR/TALEN-based KOs to identify and verify genes associated with drug resistance.
Potential energy landscapes identify the information-theoretic nature of the epigenome
Jenkinson, Garrett; Pujadas, Elisabet; Goutsias, John; Feinberg, Andrew P.
2017-01-01
Epigenetics studies genomic modifications carrying information independent of DNA sequence heritable through cell division. In 1940, Waddington coined the term “epigenetic landscape” as a metaphor for pluripotency and differentiation, but methylation landscapes have not yet been rigorously computed. By using principles of statistical physics and information theory, we derive epigenetic energy landscapes from whole-genome bisulfite sequencing data that allow us to quantify methylation stochasticity genome-wide using Shannon’s entropy and associate entropy with chromatin structure. Moreover, we consider the Jensen-Shannon distance between sample-specific energy landscapes as a measure of epigenetic dissimilarity and demonstrate its effectiveness for discerning epigenetic differences. By viewing methylation maintenance as a communications system, we introduce methylation channels and show that higher-order chromatin organization can be predicted from their informational properties. Our results provide a fundamental understanding of the information-theoretic nature of the epigenome that leads to a powerful approach for studying its role in disease and aging. PMID:28346445
Vaccine escape in 2013-4 and the hydropathic evolution of glycoproteins of A/H3N2 viruses
NASA Astrophysics Data System (ADS)
Phillips, J. C.
2016-08-01
More virulent strains of influenza virus subtypes H1N1 appeared widely in 2007 and H3N2 in 2011, and especially 2013-4, when the effectiveness of the H3N2 vaccine decreased by more than a factor of two. The amino acid differences of neuraminidase from prior less virulent strains appear to be small (<1%) when tabulated through sequence alignments and counting site identities and similarities. Here we show how analyzing fractal hydropathic forces responsible for neuraminidase globular compaction and modularity quantifies the mutational origins of increased virulence. It also predicts vaccine escape and specifies optimized targets for the 2015 H3N2 vaccine. Unlike some earlier methods based on measuring hemagglutinin antigenic drift, which take several years, cover only a few candidate strains, and are ambiguous, the new methods are timely and can be completed, using NCBI and GISAID amino acid sequences only, in a few days.
Inferring Fitness Effects from Time-Resolved Sequence Data with a Delay-Deterministic Model.
Nené, Nuno R; Dunham, Alistair S; Illingworth, Christopher J R
2018-05-01
A common challenge arising from the observation of an evolutionary system over time is to infer the magnitude of selection acting upon a specific genetic variant, or variants, within the population. The inference of selection may be confounded by the effects of genetic drift in a system, leading to the development of inference procedures to account for these effects. However, recent work has suggested that deterministic models of evolution may be effective in capturing the effects of selection even under complex models of demography, suggesting the more general application of deterministic approaches to inference. Responding to this literature, we here note a case in which a deterministic model of evolution may give highly misleading inferences, resulting from the nondeterministic properties of mutation in a finite population. We propose an alternative approach that acts to correct for this error, and which we denote the delay-deterministic model. Applying our model to a simple evolutionary system, we demonstrate its performance in quantifying the extent of selection acting within that system. We further consider the application of our model to sequence data from an evolutionary experiment. We outline scenarios in which our model may produce improved results for the inference of selection, noting that such situations can be easily identified via the use of a regular deterministic model. Copyright © 2018 Nené et al.
Łabaj, Paweł P; Leparc, Germán G; Linggi, Bryan E; Markillie, Lye Meng; Wiley, H Steven; Kreil, David P
2011-07-01
Measurement precision determines the power of any analysis to reliably identify significant signals, such as in screens for differential expression, independent of whether the experimental design incorporates replicates or not. With the compilation of large-scale RNA-Seq datasets with technical replicate samples, however, we can now, for the first time, perform a systematic analysis of the precision of expression level estimates from massively parallel sequencing technology. This then allows considerations for its improvement by computational or experimental means. We report on a comprehensive study of target identification and measurement precision, including their dependence on transcript expression levels, read depth and other parameters. In particular, an impressive recall of 84% of the estimated true transcript population could be achieved with 331 million 50 bp reads, with diminishing returns from longer read lengths and even less gains from increased sequencing depths. Most of the measurement power (75%) is spent on only 7% of the known transcriptome, however, making less strongly expressed transcripts harder to measure. Consequently, <30% of all transcripts could be quantified reliably with a relative error<20%. Based on established tools, we then introduce a new approach for mapping and analysing sequencing reads that yields substantially improved performance in gene expression profiling, increasing the number of transcripts that can reliably be quantified to over 40%. Extrapolations to higher sequencing depths highlight the need for efficient complementary steps. In discussion we outline possible experimental and computational strategies for further improvements in quantification precision. rnaseq10@boku.ac.at
DNA sequence-dependent mechanics and protein-assisted bending in repressor-mediated loop formation
Boedicker, James Q.; Garcia, Hernan G.; Johnson, Stephanie; Phillips, Rob
2014-01-01
As the chief informational molecule of life, DNA is subject to extensive physical manipulations. The energy required to deform double-helical DNA depends on sequence, and this mechanical code of DNA influences gene regulation, such as through nucleosome positioning. Here we examine the sequence-dependent flexibility of DNA in bacterial transcription factor-mediated looping, a context for which the role of sequence remains poorly understood. Using a suite of synthetic constructs repressed by the Lac repressor and two well-known sequences that show large flexibility differences in vitro, we make precise statistical mechanical predictions as to how DNA sequence influences loop formation and test these predictions using in vivo transcription and in vitro single-molecule assays. Surprisingly, sequence-dependent flexibility does not affect in vivo gene regulation. By theoretically and experimentally quantifying the relative contributions of sequence and the DNA-bending protein HU to DNA mechanical properties, we reveal that bending by HU dominates DNA mechanics and masks intrinsic sequence-dependent flexibility. Such a quantitative understanding of how mechanical regulatory information is encoded in the genome will be a key step towards a predictive understanding of gene regulation at single-base pair resolution. PMID:24231252
Hockenberry, Adam J; Pah, Adam R; Jewett, Michael C; Amaral, Luís A N
2017-01-01
Studies dating back to the 1970s established that sequence complementarity between the anti-Shine-Dalgarno (aSD) sequence on prokaryotic ribosomes and the 5' untranslated region of mRNAs helps to facilitate translation initiation. The optimal location of aSD sequence binding relative to the start codon, the full extents of the aSD sequence and the functional form of the relationship between aSD sequence complementarity and translation efficiency have not been fully resolved. Here, we investigate these relationships by leveraging the sequence diversity of endogenous genes and recently available genome-wide estimates of translation efficiency. We show that-after accounting for predicted mRNA structure-aSD sequence complementarity increases the translation of endogenous mRNAs by roughly 50%. Further, we observe that this relationship is nonlinear, with translation efficiency maximized for mRNAs with intermediate levels of aSD sequence complementarity. The mechanistic insights that we observe are highly robust: we find nearly identical results in multiple datasets spanning three distantly related bacteria. Further, we verify our main conclusions by re-analysing a controlled experimental dataset. © 2017 The Authors.
Cho, Jin-Young; Lee, Hyoung-Joo; Jeong, Seul-Ki; Paik, Young-Ki
2017-12-01
Mass spectrometry (MS) is a widely used proteome analysis tool for biomedical science. In an MS-based bottom-up proteomic approach to protein identification, sequence database (DB) searching has been routinely used because of its simplicity and convenience. However, searching a sequence DB with multiple variable modification options can increase processing time, false-positive errors in large and complicated MS data sets. Spectral library searching is an alternative solution, avoiding the limitations of sequence DB searching and allowing the detection of more peptides with high sensitivity. Unfortunately, this technique has less proteome coverage, resulting in limitations in the detection of novel and whole peptide sequences in biological samples. To solve these problems, we previously developed the "Combo-Spec Search" method, which uses manually multiple references and simulated spectral library searching to analyze whole proteomes in a biological sample. In this study, we have developed a new analytical interface tool called "Epsilon-Q" to enhance the functions of both the Combo-Spec Search method and label-free protein quantification. Epsilon-Q performs automatically multiple spectral library searching, class-specific false-discovery rate control, and result integration. It has a user-friendly graphical interface and demonstrates good performance in identifying and quantifying proteins by supporting standard MS data formats and spectrum-to-spectrum matching powered by SpectraST. Furthermore, when the Epsilon-Q interface is combined with the Combo-Spec search method, called the Epsilon-Q system, it shows a synergistic function by outperforming other sequence DB search engines for identifying and quantifying low-abundance proteins in biological samples. The Epsilon-Q system can be a versatile tool for comparative proteome analysis based on multiple spectral libraries and label-free quantification.
Modular synthetic inverters from zinc finger proteins and small RNAs
Hsia, Justin; Holtz, William J.; Maharbiz, Michel M.; ...
2016-02-17
Synthetic zinc finger proteins (ZFPs) can be created to target promoter DNA sequences, repressing transcription. The binding of small RNA (sRNA) to ZFP mRNA creates an ultrasensitive response to generate higher effective Hill coefficients. Here we combined three “off the shelf” ZFPs and three sRNAs to create new modular inverters in E. coli and quantify their behavior using induction fold. We found a general ordering of the effects of the ZFPs and sRNAs on induction fold that mostly held true when combining these parts. We then attempted to construct a ring oscillator using our new inverters. In conclusion, our chosenmore » parts performed insufficiently to create oscillations, but we include future directions for improvement upon our work presented here.« less
Simulating Nailfold Capillaroscopy Sequences to Evaluate Algorithms for Blood Flow Estimation
Tresadern, P. A.; Berks, M.; Murray, A. K.; Dinsdale, G.; Taylor, C. J.; Herrick, A. L.
2016-01-01
The effects of systemic sclerosis (SSc) – a disease of the connective tissue causing blood flow problems that can require amputation of the fingers – can be observed indirectly by imaging the capillaries at the nailfold, though taking quantitative measures such as blood flow to diagnose the disease and monitor its progression is not easy. Optical flow algorithms may be applied, though without ground truth (i.e. known blood flow) it is hard to evaluate their accuracy. We propose an image model that generates realistic capillaroscopy videos with known flow, and use this model to quantify the effect of flow rate, cell density and contrast (among others) on estimated flow. This resource will help researchers to design systems that are robust under real-world conditions. PMID:24110268
Quantifying the buildup in extent and complexity of free exploration in mice
Benjamini, Yoav; Fonio, Ehud; Galili, Tal; Havkin, Gregor Z.; Golani, Ilan
2011-01-01
To obtain a perspective on an animal's own functional world, we study its behavior in situations that allow the animal to regulate the growth rate of its behavior and provide us with the opportunity to quantify its moment-by-moment developmental dynamics. Thus, we are able to show that mouse exploratory behavior consists of sequences of repeated motion: iterative processes that increase in extent and complexity, whose presumed function is a systematic active management of input acquired during the exploration of a novel environment. We use this study to demonstrate our approach to quantifying behavior: targeting aspects of behavior that are shown to be actively managed by the animal, and using measures that are discriminative across strains and treatments and replicable across laboratories. PMID:21383149
Thomas, Carole D.; Walczak, Christine; Kaffy, Julia; Pontikis, Renée; Jouanneau, Jacqueline; Volk, Andreas
2006-01-01
Abstract Combretastatin A4 phosphate (CA4P) causes rapid disruption of the tumor vasculature and is currently being evaluated for antivascular therapy. We describe the initial results obtained with a noninvasive multi-parametric magnetic resonance imaging (MRI) approach to assess the early effects of CA4P on rat bladder tumors implanted on nude mice. MRI (4.7 T) comprised a fast spin-echo sequence for growth curve assessment; a multislice multiecho sequence for T2 measurement before, 15 minutes after, and 24 hours after CA4P (100 mg/kg); and a fast T2w* gradient-echo sequence to assess MR signal modification under carbogen breathing before, 35 minutes after, and 24 hours after CA4P. The tumor fraction with increased T2w* signal intensity under carbogen (T+) was used to quantify CA4P effect on functional vasculature. CA4P slowed tumor growth over 24 hours and accelerated necrosis development. T+ decrease was observed already at 35 minutes post-CA4P. Early T2 increase was observed in regions becoming necrotic at 24 hours post-CA4P, as confirmed by high T2 and histology. These regions exhibited, under carbogen, a switch from T2w* signal increase before CA4P to a decrease post-CA4P. The combination of carbogen-based functional MRI and T2 measurement may be useful for the early follow-up of antivascular therapy without the administration of contrast agents. PMID:16867221
Thomas, Carole D; Walczak, Christine; Kaffy, Julia; Pontikis, Renée; Jouanneau, Jacqueline; Volk, Andreas
2006-07-01
Combretastatin A4 phosphate (CA4P) causes rapid disruption of the tumor vasculature and is currently being evaluated for antivascular therapy. We describe the initial results obtained with a noninvasive multiparametric magnetic resonance imaging (MRI) approach to assess the early effects of CA4P on rat bladder tumors implanted on nude mice. MRI (4.7 T) comprised a fast spin-echo sequence for growth curve assessment; a multislice multiecho sequence for T2 measurement before, 15 minutes after, and 24 hours after CA4P (100 mg/kg); and a fast T2w* gradient-echo sequence to assess MR signal modification under carbogen breathing before, 35 minutes after, and 24 hours after CA4P. The tumor fraction with increased T2w* signal intensity under carbogen (T+) was used to quantify CA4P effect on functional vasculature. CA4P slowed tumor growth over 24 hours and accelerated necrosis development. T+ decrease was observed already at 35 minutes post-CA4P. Early T2 increase was observed in regions becoming necrotic at 24 hours post-CA4P, as confirmed by high T2 and histology. These regions exhibited, under carbogen, a switch from T2w* signal increase before CA4P to a decrease postCA4P. The combination of carbogen-based functional MRI and T2 measurement may be useful for the early follow-up of antivascular therapy without the administration of contrast agents.
Effective evaluation of privacy protection techniques in visible and thermal imagery
NASA Astrophysics Data System (ADS)
Nawaz, Tahir; Berg, Amanda; Ferryman, James; Ahlberg, Jörgen; Felsberg, Michael
2017-09-01
Privacy protection may be defined as replacing the original content in an image region with a (less intrusive) content having modified target appearance information to make it less recognizable by applying a privacy protection technique. Indeed, the development of privacy protection techniques also needs to be complemented with an established objective evaluation method to facilitate their assessment and comparison. Generally, existing evaluation methods rely on the use of subjective judgments or assume a specific target type in image data and use target detection and recognition accuracies to assess privacy protection. An annotation-free evaluation method that is neither subjective nor assumes a specific target type is proposed. It assesses two key aspects of privacy protection: "protection" and "utility." Protection is quantified as an appearance similarity, and utility is measured as a structural similarity between original and privacy-protected image regions. We performed an extensive experimentation using six challenging datasets (having 12 video sequences), including a new dataset (having six sequences) that contains visible and thermal imagery. The new dataset is made available online for the community. We demonstrate effectiveness of the proposed method by evaluating six image-based privacy protection techniques and also show comparisons of the proposed method over existing methods.
Sequence and Solution Effects on the Prevalence of d-Isomers Produced by Deamidation
2017-01-01
Deamidation of asparagine is a spontaneous and irreversible post-translational modification associated with a growing list of human diseases. While pervasive, deamidation is often overlooked because it represents a relatively minor chemical change. Structural and functional characterization of this modification is complicated because deamidation of asparagine yields four isomeric forms of Asp. Herein, radical directed dissociation (RDD), in conjunction with mass spectrometry, is used to identify and quantify all four isomers in a series of model peptides that were subjected to various deamidation conditions. Although primary sequence significantly influences the rate of deamidation, it has little impact on the relative proportions of the product isomers. Furthermore, the addition of ammonia can be used to increase the rate of deamidation without significantly perturbing isomer populations. Conversely, external factors such as buffer conditions and temperature alter product distributions but exhibit less dramatic effects on the deamidation rate. Strikingly, the common laboratory and biologically significant bicarbonate buffer is found to strongly promote racemization, yielding increased amounts of d-Asp and d-isoAsp. These outcomes following deamidation have broad implications in human aging and should be considered during the development of protein-based therapeutics. PMID:28984444
CpG methylation increases the DNA binding of 9-aminoacridine carboxamide Pt analogues.
Kava, Hieronimus W; Murray, Vincent
2016-10-01
This study investigated the effect of CpG methylation on the DNA binding of cisplatin analogues with an attached aminoacridine intercalator. DNA-targeted 9-aminoacridine carboxamide Pt complexes are known to bind at 5'-CpG sequences. Their binding to methylated and non-methylated 5'-CpG sequences was determined and compared with cisplatin. The damage profiles of each platinum compound were quantified via a polymerase stop assay with fluorescently labelled primers and capillary electrophoresis. Methylation at 5'-CpG was shown to significantly increase the binding intensity for the 9-aminoacridine carboxamide compounds, whereas no significant increase was found for cisplatin. 5'-CpG methylation had the largest effect on the 9-ethanolamine-acridine carboxamide Pt complex, followed by the 9-aminoacridine carboxamide Pt complex and the 7-fluoro complex. The methylation state of a cell's genome is important in maintaining normal gene expression, and is often aberrantly altered in cancer cells. An analogue of cisplatin which differentially targets methylated DNA may be able to improve its therapeutic activity, or alter its range of targets and evade the chemoresistance which hampers cisplatin efficacy in clinical use. Copyright © 2016 Elsevier Ltd. All rights reserved.
Perea Palazón, R J; Solé Arqués, M; Prat González, S; de Caralt Robira, T M; Cibeira López, M T; Ortiz Pérez, J T
2015-01-01
Cardiac magnetic resonance imaging is considered the reference technique for characterizing myocardial tissue; for example, T2-weighted sequences make it possible to evaluate areas of edema or myocardial inflammation. However, traditional sequences have many limitations and provide only qualitative information. Moreover, traditional sequences depend on the reference to remote myocardium or skeletal muscle, which limits their ability to detect and quantify diffuse myocardial damage. Recently developed magnetic resonance myocardial mapping techniques enable quantitative assessment of parameters indicative of edema. These techniques have proven better than traditional sequences both in acute cardiomyopathy and in acute ischemic heart disease. This article synthesizes current developments in T2 mapping as well as their clinical applications and limitations. Copyright © 2014 SERAM. Published by Elsevier España, S.L.U. All rights reserved.
Li, S; Cullen, D; Hjort, M; Spear, R; Andrews, J H
1996-01-01
Aureobasidium pullulans, a cosmopolitan yeast-like fungus, colonizes leaf surfaces and has potential as a biocontrol agent of pathogens. To assess the feasibility of rRNA as a target for A. pullulans-specific oligonucleotide probes, we compared the nucleotide sequences of the small-subunit rRNA (18S) genes of 12 geographically diverse A. pullulans strains. Extreme sequence conservation was observed. The consensus A. pullulans sequence was compared with other fungal sequences to identify potential probes. A 21-mer probe which hybridized to the 12 A. pullulans strains but not to 98 other fungi, including 82 isolates from the phylloplane, was identified. A 17-mer highly specific for Cladosporium herbarum was also identified. These probes have potential in monitoring and quantifying fungi in leaf surface and other microbial communities. PMID:8633850
Wong, Wing-Cheong; Ng, Hong-Kiat; Tantoso, Erwin; Soong, Richie; Eisenhaber, Frank
2018-02-12
Though earlier works on modelling transcript abundance from vertebrates to lower eukaroytes have specifically singled out the Zip's law, the observed distributions often deviate from a single power-law slope. In hindsight, while power-laws of critical phenomena are derived asymptotically under the conditions of infinite observations, real world observations are finite where the finite-size effects will set in to force a power-law distribution into an exponential decay and consequently, manifests as a curvature (i.e., varying exponent values) in a log-log plot. If transcript abundance is truly power-law distributed, the varying exponent signifies changing mathematical moments (e.g., mean, variance) and creates heteroskedasticity which compromises statistical rigor in analysis. The impact of this deviation from the asymptotic power-law on sequencing count data has never truly been examined and quantified. The anecdotal description of transcript abundance being almost Zipf's law-like distributed can be conceptualized as the imperfect mathematical rendition of the Pareto power-law distribution when subjected to the finite-size effects in the real world; This is regardless of the advancement in sequencing technology since sampling is finite in practice. Our conceptualization agrees well with our empirical analysis of two modern day NGS (Next-generation sequencing) datasets: an in-house generated dilution miRNA study of two gastric cancer cell lines (NUGC3 and AGS) and a publicly available spike-in miRNA data; Firstly, the finite-size effects causes the deviations of sequencing count data from Zipf's law and issues of reproducibility in sequencing experiments. Secondly, it manifests as heteroskedasticity among experimental replicates to bring about statistical woes. Surprisingly, a straightforward power-law correction that restores the distribution distortion to a single exponent value can dramatically reduce data heteroskedasticity to invoke an instant increase in signal-to-noise ratio by 50% and the statistical/detection sensitivity by as high as 30% regardless of the downstream mapping and normalization methods. Most importantly, the power-law correction improves concordance in significant calls among different normalization methods of a data series averagely by 22%. When presented with a higher sequence depth (4 times difference), the improvement in concordance is asymmetrical (32% for the higher sequencing depth instance versus 13% for the lower instance) and demonstrates that the simple power-law correction can increase significant detection with higher sequencing depths. Finally, the correction dramatically enhances the statistical conclusions and eludes the metastasis potential of the NUGC3 cell line against AGS of our dilution analysis. The finite-size effects due to undersampling generally plagues transcript count data with reproducibility issues but can be minimized through a simple power-law correction of the count distribution. This distribution correction has direct implication on the biological interpretation of the study and the rigor of the scientific findings. This article was reviewed by Oliviero Carugo, Thomas Dandekar and Sandor Pongor.
Aschard, Hugues; Cattoir, Vincent; Yoder-Himes, Deborah; Lory, Stephen; Pier, Gerald B.
2013-01-01
High-throughput sequencing of transposon (Tn) libraries created within entire genomes identifies and quantifies the contribution of individual genes and operons to the fitness of organisms in different environments. We used insertion-sequencing (INSeq) to analyze the contribution to fitness of all non-essential genes in the chromosome of Pseudomonas aeruginosa strain PA14 based on a library of ∼300,000 individual Tn insertions. In vitro growth in LB provided a baseline for comparison with the survival of the Tn insertion strains following 6 days of colonization of the murine gastrointestinal tract as well as a comparison with Tn-inserts subsequently able to systemically disseminate to the spleen following induction of neutropenia. Sequencing was performed following DNA extraction from the recovered bacteria, digestion with the MmeI restriction enzyme that hydrolyzes DNA 16 bp away from the end of the Tn insert, and fractionation into oligonucleotides of 1,200–1,500 bp that were prepared for high-throughput sequencing. Changes in frequency of Tn inserts into the P. aeruginosa genome were used to quantify in vivo fitness resulting from loss of a gene. 636 genes had <10 sequencing reads in LB, thus defined as unable to grow in this medium. During in vivo infection there were major losses of strains with Tn inserts in almost all known virulence factors, as well as respiration, energy utilization, ion pumps, nutritional genes and prophages. Many new candidates for virulence factors were also identified. There were consistent changes in the recovery of Tn inserts in genes within most operons and Tn insertions into some genes enhanced in vivo fitness. Strikingly, 90% of the non-essential genes were required for in vivo survival following systemic dissemination during neutropenia. These experiments resulted in the identification of the P. aeruginosa strain PA14 genes necessary for optimal survival in the mucosal and systemic environments of a mammalian host. PMID:24039572
Cavusoglu, M; Ciloglu, T; Serinagaoglu, Y; Kamasak, M; Erogul, O; Akcam, T
2008-08-01
In this paper, 'snore regularity' is studied in terms of the variations of snoring sound episode durations, separations and average powers in simple snorers and in obstructive sleep apnoea (OSA) patients. The goal was to explore the possibility of distinguishing among simple snorers and OSA patients using only sleep sound recordings of individuals and to ultimately eliminate the need for spending a whole night in the clinic for polysomnographic recording. Sequences that contain snoring episode durations (SED), snoring episode separations (SES) and average snoring episode powers (SEP) were constructed from snoring sound recordings of 30 individuals (18 simple snorers and 12 OSA patients) who were also under polysomnographic recording in Gülhane Military Medical Academy Sleep Studies Laboratory (GMMA-SSL), Ankara, Turkey. Snore regularity is quantified in terms of mean, standard deviation and coefficient of variation values for the SED, SES and SEP sequences. In all three of these sequences, OSA patients' data displayed a higher variation than those of simple snorers. To exclude the effects of slow variations in the base-line of these sequences, new sequences that contain the coefficient of variation of the sample values in a 'short' signal frame, i.e., short time coefficient of variation (STCV) sequences, were defined. The mean, the standard deviation and the coefficient of variation values calculated from the STCV sequences displayed a stronger potential to distinguish among simple snorers and OSA patients than those obtained from the SED, SES and SEP sequences themselves. Spider charts were used to jointly visualize the three parameters, i.e., the mean, the standard deviation and the coefficient of variation values of the SED, SES and SEP sequences, and the corresponding STCV sequences as two-dimensional plots. Our observations showed that the statistical parameters obtained from the SED and SES sequences, and the corresponding STCV sequences, possessed a strong potential to distinguish among simple snorers and OSA patients, both marginally, i.e., when the parameters are examined individually, and jointly. The parameters obtained from the SEP sequences and the corresponding STCV sequences, on the other hand, did not have a strong discrimination capability. However, the joint behaviour of these parameters showed some potential to distinguish among simple snorers and OSA patients.
USDA-ARS?s Scientific Manuscript database
Quantifying magnitudes and frequencies of rainless times between storms (TBS), or storm occurrence, is required for generating continuous sequences of precipitation for modeling inputs to small watershed models for conservation studies. Two parameters characterize TBS, minimum TBS (MTBS) and averag...
Not all (possibly) “random” sequences are created equal
Pincus, Steve; Kalman, Rudolf E.
1997-01-01
The need to assess the randomness of a single sequence, especially a finite sequence, is ubiquitous, yet is unaddressed by axiomatic probability theory. Here, we assess randomness via approximate entropy (ApEn), a computable measure of sequential irregularity, applicable to single sequences of both (even very short) finite and infinite length. We indicate the novelty and facility of the multidimensional viewpoint taken by ApEn, in contrast to classical measures. Furthermore and notably, for finite length, finite state sequences, one can identify maximally irregular sequences, and then apply ApEn to quantify the extent to which given sequences differ from maximal irregularity, via a set of deficit (defm) functions. The utility of these defm functions which we show allows one to considerably refine the notions of probabilistic independence and normality, is featured in several studies, including (i) digits of e, π, √2, and √3, both in base 2 and in base 10, and (ii) sequences given by fractional parts of multiples of irrationals. We prove companion analytic results, which also feature in a discussion of the role and validity of the almost sure properties from axiomatic probability theory insofar as they apply to specified sequences and sets of sequences (in the physical world). We conclude by relating the present results and perspective to both previous and subsequent studies. PMID:11038612
Sakakura, Chouhei; Hamada, Takuo; Miyagawa, Koji; Nishio, Minoru; Miyashita, Atushi; Nagata, Hiroyuki; Ida, Hiroshi; Yazumi, Shujiro; Otsuji, Eigo; Chiba, Tsutomu; Ito, Kosei; Ito, Yoshiaki
2009-07-01
Using real-time quantitative methylation-specific PCR (RTQ-MSP), methylated RUNX3 sequences were quantified and the fractional concentrations of circulating tumor DNA in serum were determined, along with peripheral blood cells collected preoperatively, intraoperatively and postoperatively from 65 patients with gastric cancer. RTQ-MSP was sufficiently sensitive to detect RUNX3 methylation. Quantitative MSP data were expressed in terms of the methylation index, which was defined as the relative amount of methylated RUNX3 sequences divided by the concentration of methylated actin. High levels of methylated RUNX3 sequences were detected in the peripheral circulation of 29% (19 of 65) of the gastric cancer patients. The RUNX3 methylation index was concordant with cancer stage, histology, lymphatic and vascular invasion, and was more sensitive than carcinoembryonic antigen (CEA) as a biomarker. Twenty-nine percent (19 out of 65) of preoperative serum samples had methylated RUNX3 sequences, ranging from 5.2 to 1625955 (median quantity=43 m-index, sensitivity 95.5%, specificity 62.5%, AUC 0.8651). After surgical resection, the median RUNX3 methylation index in serum significantly decreased. These results demonstrate the clinical usefulness and effectiveness of peripheral blood RTQ-MSP for detecting and monitoring gastric cancer after treatment. Furthermore, 5 out of the 30 preoperative control samples of benign disease (cases of panperitonitis due to acute appendicitis or cholecystitis) showed transient RUNX3 methylation which decreased after the operation in accordance with recovery. Quantification of epigenetic changes in serum RUNX3 methylation using RTQ-MSP is useful for the detection and monitoring of gastric cancer.
Chan, Rachel W; von Deuster, Constantin; Giese, Daniel; Stoeck, Christian T; Harmer, Jack; Aitken, Andrew P; Atkinson, David; Kozerke, Sebastian
2014-07-01
Diffusion tensor imaging (DTI) of moving organs is gaining increasing attention but robust performance requires sequence modifications and dedicated correction methods to account for system imperfections. In this study, eddy currents in the "unipolar" Stejskal-Tanner and the velocity-compensated "bipolar" spin-echo diffusion sequences were investigated and corrected for using a magnetic field monitoring approach in combination with higher-order image reconstruction. From the field-camera measurements, increased levels of second-order eddy currents were quantified in the unipolar sequence relative to the bipolar diffusion sequence while zeroth and linear orders were found to be similar between both sequences. Second-order image reconstruction based on field-monitoring data resulted in reduced spatial misalignment artifacts and residual displacements of less than 0.43 mm and 0.29 mm (in the unipolar and bipolar sequences, respectively) after second-order eddy-current correction. Results demonstrate the need for second-order correction in unipolar encoding schemes but also show that bipolar sequences benefit from second-order reconstruction to correct for incomplete intrinsic cancellation of eddy-currents. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Tracking the Invasion of Small Numbers of Cells in Paper-Based Assays with Quantitative PCR.
Truong, Andrew S; Lochbaum, Christian A; Boyce, Matthew W; Lockett, Matthew R
2015-11-17
Paper-based scaffolds are an attractive material for culturing mammalian cells in a three-dimensional environment. There are a number of previously published studies, which utilize these scaffolds to generate models of aortic valves, cardiac ischemia and reperfusion, and solid tumors. These models have largely relied on fluorescence imaging and microscopy to quantify cells in the scaffolds. We present here a polymerase chain reaction (PCR)-based method, capable of quantifying multiple cell types in a single culture with the aid of DNA barcodes: unique sequences of DNA introduced to the genome of individual cells or cell types through lentiviral transduction. PCR-based methods are highly specific and are amenable to high-throughput and multiplexed analyses. To validate this method, we engineered two different breast cancer lines to constitutively express either a green or red fluorescent protein. These cells lines allowed us to directly compare the ability of fluorescence imaging (of the fluorescent proteins) and qPCR (of the unique DNA sequences of the fluorescent proteins) to quantify known numbers of cells in the paper based-scaffolds. We also used both methods to quantify the distribution of these breast cell lines in homotypic and heterotypic invasion assays. In the paper-based invasion assays, a single sheet of paper containing cells suspended in a hydrogel was sandwiched between sheets of paper containing only hydrogel. The stack was incubated, and the cells invaded the adjacent layers. The individual sheets of the invasion assay were then destacked and the number of cells in each layer quantified. Our results show both methods can accurately detect cell populations of greater than 500 cells. The qPCR method can repeatedly and accurately detect as few as 50 cells, allowing small populations of highly invasive cells to be detected and differentiated from other cell types.
Richardson, C A; Hides, J A; Wilson, S; Stanton, W; Snijders, C J
2004-07-01
The antigravity muscles of the lumbo-pelvic region, especially transversus abdominis (TrA), are important for the protection and support of the weightbearing joints. Measures of TrA function (the response to the postural cue of drawing in the abdominal wall) have been developed and quantified using magnetic resonance imaging (MRI). Cross-sections through the trunk allowed muscle contraction as well as the large fascial attachments of the TrA to be visualized. The cross sectional area (CSA) of the deep musculo-fascial system was measured at rest and in the contracted state, using static images as well as a cine sequence. In this developmental study, MRI measures were undertaken on a small sample of low back pain (LBP) and non LBP subjects. Results demonstrated that, in non LBP subjects, the draw in action produced a symmetrical deep musculo-fascial "corset" which encircles the abdomen. This study demonstrated a difference in this "corset" measure between subjects with and without LBP. These measures may also prove useful to quantify the effect of unloading in bedrest and microgravity exposure.
Spagnol, Stephen T.; Dahl, Kris Noel
2016-01-01
The linear sequence of DNA encodes access to the complete set of proteins that carry out cellular functions. Yet, much of the functionality appropriate for each cell is nested within layers of dynamic regulation and organization, including a hierarchy of chromatin structural states and spatial arrangement within the nucleus. There remain limitations in our understanding of gene expression within the context of nuclear organization from an inability to characterize hierarchical chromatin organization in situ. Here we demonstrate the use of fluorescence lifetime imaging microscopy (FLIM) to quantify and spatially resolve chromatin condensation state using cell-permeable, DNA-binding dyes (Hoechst 33342 and PicoGreen). Through in vitro and in situ experiments we demonstrate the sensitivity of fluorescence lifetime to condensation state through the mechanical effects that accompany the structural changes and are reflected through altered viscosity. The establishment of FLIM for resolving and quantifying chromatin condensation state opens the door for single-measurement mechanical studies of the nucleus and for characterizing the role of genome structure and organization in nuclear processes that accompany physiological and pathological changes. PMID:26765322
Hatae, Ryusuke; Yoshimoto, Koji; Kuga, Daisuke; Akagi, Yojiro; Murata, Hideki; Suzuki, Satoshi O.; Mizoguchi, Masahiro; Iihara, Koji
2016-01-01
High resolution melting (HRM) is a simple and rapid method for screening mutations. It offers various advantages for clinical diagnostic applications. Conventional HRM analysis often yields equivocal results, especially for surgically obtained tissues. We attempted to improve HRM analyses for more effective applications to clinical diagnostics. HRM analyses were performed for IDH1R132 and IDH2R172 mutations in 192 clinical glioma samples in duplicate and these results were compared with sequencing results. BRAFV600E mutations were analyzed in 52 additional brain tumor samples. The melting profiles were used for differential calculus analyses. Negative second derivative plots revealed additional peaks derived from heteroduplexes in PCR products that contained mutations; this enabled unequivocal visual discrimination of the mutations. We further developed a numerical expression, the HRM-mutation index (MI), to quantify the heteroduplex-derived peak of the mutational curves. Using this expression, all IDH1 mutation statuses matched those ascertained by sequencing, with the exception of three samples. These discordant results were all derived from the misinterpretation of sequencing data. The effectiveness of our approach was further validated by analyses of IDH2R172 and BRAFV600E mutations. The present analytical method enabled an unequivocal and objective HRM analysis and is suitable for reliable mutation scanning in surgically obtained glioma tissues. This approach could facilitate molecular diagnostics in clinical environments. PMID:27529619
Du, Jiang; Ma, Guolin; Li, Shihong; Carl, Michael; Szeverenyi, Nikolaus M; VandenBerg, Scott; Corey-Bloom, Jody; Bydder, Graeme M
2014-01-01
White matter of the brain contains a majority of long T2 components as well as a minority of short T2 components. These are not detectable using clinical magnetic resonance imaging (MRI) sequences with conventional echo times (TEs). In this study we used ultrashort echo time (UTE) sequences to investigate the ultrashort T2 components in white matter of the brain and quantify their T2*s and relative proton densities (RPDs) (relative to water with a proton density of 100%) using a clinical whole body 3T scanner. An adiabatic inversion recovery prepared dual echo UTE (IR-dUTE) sequence was used for morphological imaging of the ultrashort T2 components in white matter. IR-dUTE acquisitions at a constant TR of 1000 ms and a series of TIs were performed to determine the optimal TI which corresponded to the minimum signal to noise ratio (SNR) in white matter of the brain on the second echo image. T2*s of the ultrashort T2 components were quantified using mono-exponential decay fitting of the IR-dUTE signal at a series of TEs. RPD was quantified by comparing IR-dUTE signal of the ultrashort T2 components with that of a rubber phantom. Nine healthy volunteers were studied. The IR-dUTE sequence provided excellent image contrast for the ultrashort T2 components in white matter of the brain with a mean signal to noise ratio of 18.7 ± 3.7 and a contrast to noise ratio of 14.6 ± 2.4 between the ultrashort T2 white matter and gray matter in a 4.4 min scan time with a nominal voxel size of 1.25×1.25×5.0 mm3. On average a T2* value of 0.42 ± 0.08 ms and a RPD of 4.05 ± 0.88% were demonstrated for the ultrashort T2 components in white matter of the brain of healthy volunteers at 3T. PMID:24188809
NASA Astrophysics Data System (ADS)
Lekmine, G.; Auradou, H.; Pessel, M.; Rayner, J. L.
2017-04-01
Cross-borehole ERT imaging was tested to quantify the average velocity and transport parameters of tracer plumes in saturated porous media. Seven tracer tests were performed at different flow rates and monitored by either a vertical or horizontal dipole-dipole ERT sequence. These sequences were tested to reconstruct the shape and temporally follow the spread of the tracer plumes through a background regularization procedure. Data sets were inverted with the same inversion parameters and 2D model sections of resistivity ratios were converted to tracer concentrations. Both array types provided an accurate estimation of the average pore velocity vz. The total mass Mtot recovered was always overestimated by the horizontal dipole-dipole and underestimated by the vertical dipole-dipole. The vertical dipole-dipole was however reliable to quantify the longitudinal dispersivity λz, while the horizontal dipole-dipole returned better estimation for the transverse component λx. λ and Mtot were mainly influenced by the 2D distribution of the cumulated electrical sensitivity and the Shadow Effects induced by the third dimension. The size reduction of the edge of the plume was also related to the inability of the inversion process to reconstruct sharp resistivity contrasts at the interface. Smoothing was counterbalanced by a non-realistic rise of the ERT concentrations around the centre of mass returning overpredicted total masses. A sensitivity analysis on the cementation factor m and the porosity ϕ demonstrated that a change in one of these parameters by 8% involved non negligible variations by 30 and 40% of the dispersion coefficients and mass recovery.
Akbari, Atena; Abbasi-Rad, Shahrokh; Rad, Hamidreza Saligheh
2016-02-01
Large pores of human cortical bone (>30μm) are filled with fluids, essentially consisting of water, suggesting that cortical bone free water can be considered as a reliable surrogate measure of cortical bone porosity and hence quality. Signal from such pores can be reliably captured using Short Echo Time (STE) pulse sequence with echo-time in the range of 1-1.5msec (which should be judiciously selected correspond to T2(⁎) value of free water molecules). Furthermore, it is well-known that cortical bone T1-relaxivity is a function of its geometry, suggesting that cortical bone free water increases with age. In this work, we quantified cortical bone free water longitudinal relaxation time (T1) by a Dual-TR technique using STE pulse sequence. In the sequel, we investigated relationship between STE-derived cortical bone free water T1-values and age in a group of healthy volunteers (thirty subjects covering the age range of 20-70years) at 1.5T. Preliminary results showed that cortical bone free water T1 highly correlates with age (r(2)=0.73, p<0.0001), representing cortical bone free water T1 as a reliable indicator of cortical bone porosity and age-related deterioration. It can be concluded that STE-MRI can be utilized as proper alternative in quantifying cortical bone porosity parameters in-vivo, with the advantages of widespread clinical availability and being cost-effective. Copyright © 2015 Elsevier Inc. All rights reserved.
Ambur, Ole Herman; Frye, Stephan A.; Nilsen, Mariann; Hovland, Eirik; Tønjum, Tone
2012-01-01
Transformation is a complex process that involves several interactions from the binding and uptake of naked DNA to homologous recombination. Some actions affect transformation favourably whereas others act to limit it. Here, meticulous manipulation of a single type of transforming DNA allowed for quantifying the impact of three different mediators of meningococcal transformation: NlaIV restriction, homologous recombination and the DNA Uptake Sequence (DUS). In the wildtype, an inverse relationship between the transformation frequency and the number of NlaIV restriction sites in DNA was observed when the transforming DNA harboured a heterologous region for selection (ermC) but not when the transforming DNA was homologous with only a single nucleotide heterology. The influence of homologous sequence in transforming DNA was further studied using plasmids with a small interruption or larger deletions in the recombinogenic region and these alterations were found to impair transformation frequency. In contrast, a particularly potent positive driver of DNA uptake in Neisseria sp. are short DUS in the transforming DNA. However, the molecular mechanism(s) responsible for DUS specificity remains unknown. Increasing the number of DUS in the transforming DNA was here shown to exert a positive effect on transformation. Furthermore, an influence of variable placement of DUS relative to the homologous region in the donor DNA was documented for the first time. No effect of altering the orientation of DUS was observed. These observations suggest that DUS is important at an early stage in the recognition of DNA, but does not exclude the existence of more than one level of DUS specificity in the sequence of events that constitute transformation. New knowledge on the positive and negative drivers of transformation may in a larger perspective illuminate both the mechanisms and the evolutionary role(s) of one of the most conserved mechanisms in nature: homologous recombination. PMID:22768309
Thomas, David; Finan, Chris; Newport, Melanie J; Jones, Susan
2015-10-01
The complexity of DNA can be quantified using estimates of entropy. Variation in DNA complexity is expected between the promoters of genes with different transcriptional mechanisms; namely housekeeping (HK) and tissue specific (TS). The former are transcribed constitutively to maintain general cellular functions, and the latter are transcribed in restricted tissue and cells types for specific molecular events. It is known that promoter features in the human genome are related to tissue specificity, but this has been difficult to quantify on a genomic scale. If entropy effectively quantifies DNA complexity, calculating the entropies of HK and TS gene promoters as profiles may reveal significant differences. Entropy profiles were calculated for a total dataset of 12,003 human gene promoters and for 501 housekeeping (HK) and 587 tissue specific (TS) human gene promoters. The mean profiles show the TS promoters have a significantly lower entropy (p<2.2e-16) than HK gene promoters. The entropy distributions for the 3 datasets show that promoter entropies could be used to identify novel HK genes. Functional features comprise DNA sequence patterns that are non-random and hence they have lower entropies. The lower entropy of TS gene promoters can be explained by a higher density of positive and negative regulatory elements, required for genes with complex spatial and temporary expression. Copyright © 2015 Elsevier Ltd. All rights reserved.
Numerical Analysis of the SCHOLAR Supersonic Combustor
NASA Technical Reports Server (NTRS)
Rodriguez, Carlos G.; Cutler, Andrew D.
2003-01-01
The SCHOLAR scramjet experiment is the subject of an ongoing numerical investigation. The facility nozzle and combustor were solved separate and sequentially, with the exit conditions of the former used as inlet conditions for the latter. A baseline configuration for the numerical model was compared with the available experimental data. It was found that ignition-delay was underpredicted and fuel-plume penetration overpredicted, while the pressure rise was close to experimental values. In addition, grid-convergence by means of grid-sequencing could not be established. The effects of the different turbulence parameters were quantified. It was found that it was not possible to simultaneously predict the three main parameters of this flow: pressure-rise, ignition-delay, and fuel-plume penetration.
Nucleic Acid-Based Approaches for Detection of Viral Hepatitis
Behzadi, Payam; Ranjbar, Reza; Alavian, Seyed Moayed
2014-01-01
Context: To determining suitable nucleic acid diagnostics for individual viral hepatitis agent, an extensive search using related keywords was done in major medical library and data were collected, categorized, and summarized in different sections. Results: Various types of molecular biology tools can be used to detect and quantify viral genomic elements and analyze the sequences. These molecular assays are proper technologies for rapidly detecting viral agents with high accuracy, high sensitivity, and high specificity. Nonetheless, the application of each diagnostic method is completely dependent on viral agent. Conclusions: Despite rapidity, automation, accuracy, cost-effectiveness, high sensitivity, and high specificity of molecular techniques, each type of molecular technology has its own advantages and disadvantages. PMID:25789132
Entanglement of two individual neutral atoms using Rydberg blockade.
Wilk, T; Gaëtan, A; Evellin, C; Wolters, J; Miroshnychenko, Y; Grangier, P; Browaeys, A
2010-01-08
We report the generation of entanglement between two individual 87Rb atoms in hyperfine ground states |F=1,M=1> and |F=2,M=2> which are held in two optical tweezers separated by 4 microm. Our scheme relies on the Rydberg blockade effect which prevents the simultaneous excitation of the two atoms to a Rydberg state. The entangled state is generated in about 200 ns using pulsed two-photon excitation. We quantify the entanglement by applying global Raman rotations on both atoms. We measure that 61% of the initial pairs of atoms are still present at the end of the entangling sequence. These pairs are in the target entangled state with a fidelity of 0.75.
Cao, Shenglong; Hua, Ya; Keep, Richard F; Chaudhary, Neeraj; Xi, Guohua
2018-04-01
Brain iron overload is a key factor causing brain injury after intracerebral hemorrhage (ICH). This study quantified brain iron levels after ICH with magnetic resonance imaging R2* mapping. The effect of minocycline on iron overload and ICH-induced brain injury in aged rats was also determined. Aged (18 months old) male Fischer 344 rats had an intracerebral injection of autologous blood or saline, and brain iron levels were measured by magnetic resonance imaging R2* mapping. Some ICH rats were treated with minocycline or vehicle. The rats were euthanized at days 7 and 28 after ICH, and brains were used for immunohistochemistry and Western blot analyses. Magnetic resonance imaging (T2-weighted, T2* gradient-echo, and R2* mapping) sequences were performed at different time points. ICH-induced brain iron overload in the perihematomal area could be quantified by R2* mapping. Minocycline treatment reduced brain iron accumulation, T2* lesion volume, iron-handling protein upregulation, neuronal cell death, and neurological deficits ( P <0.05). Magnetic resonance imaging R2* mapping is a reliable and noninvasive method, which can quantitatively measure brain iron levels after ICH. Minocycline reduced ICH-related perihematomal iron accumulation and brain injury in aged rats. © 2018 American Heart Association, Inc.
Chen, Siyu; Li, Ke; Cao, Wenqing; Wang, Jia; Zhao, Tong; Huan, Qing; Yang, Yu-Fei; Wu, Shaohuan; Qian, Wenfeng
2017-01-01
Abstract Codon usage bias (CUB) refers to the observation that synonymous codons are not used equally frequently in a genome. CUB is stronger in more highly expressed genes, a phenomenon commonly explained by stronger natural selection on translational accuracy and/or efficiency among these genes. Nevertheless, this phenomenon could also occur if CUB regulates gene expression at the mRNA level, a hypothesis that has not been tested until recently. Here, we attempt to quantify the impact of synonymous mutations on mRNA level in yeast using 3,556 synonymous variants of a heterologous gene encoding green fluorescent protein (GFP) and 523 synonymous variants of an endogenous gene TDH3. We found that mRNA level was positively correlated with CUB among these synonymous variants, demonstrating a direct role of CUB in regulating transcript concentration, likely via regulating mRNA degradation rate, as our additional experiments suggested. More importantly, we quantified the effects of individual synonymous mutations on mRNA level and found them dependent on 1) CUB and 2) mRNA secondary structure, both in proximal sequence contexts. Our study reveals the pleiotropic effects of synonymous codon usage and provides an additional explanation for the well-known correlation between CUB and gene expression level. PMID:28961875
ERIC Educational Resources Information Center
Busey, Thomas; Yu, Chen; Wyatte, Dean; Vanderkolk, John
2013-01-01
Perceptual tasks such as object matching, mammogram interpretation, mental rotation, and satellite imagery change detection often require the assignment of correspondences to fuse information across views. We apply techniques developed for machine translation to the gaze data recorded from a complex perceptual matching task modeled after…
Deng, Yufeng; Rouze, Ned C.; Palmeri, Mark L.; Nightingale, Kathryn R.
2017-01-01
Ultrasound elasticity imaging has been developed over the last decade to estimate tissue stiffness. Shear wave elasticity imaging (SWEI) quantifies tissue stiffness by measuring the speed of propagating shear waves following acoustic radiation force excitation. This work presents the sequencing and data processing protocols of SWEI using a Verasonics system. The selection of the sequence parameters in a Verasonics programming script is discussed in detail. The data processing pipeline to calculate group shear wave speed (SWS), including tissue motion estimation, data filtering, and SWS estimation is demonstrated. In addition, the procedures for calibration of beam position, scanner timing, and transducer face heating are provided to avoid SWS measurement bias and transducer damage. PMID:28092508
Error catastrophe and phase transition in the empirical fitness landscape of HIV
NASA Astrophysics Data System (ADS)
Hart, Gregory R.; Ferguson, Andrew L.
2015-03-01
We have translated clinical sequence databases of the p6 HIV protein into an empirical fitness landscape quantifying viral replicative capacity as a function of the amino acid sequence. We show that the viral population resides close to a phase transition in sequence space corresponding to an "error catastrophe" beyond which there is lethal accumulation of mutations. Our model predicts that the phase transition may be induced by drug therapies that elevate the mutation rate, or by forcing mutations at particular amino acids. Applying immune pressure to any combination of killer T-cell targets cannot induce the transition, providing a rationale for why the viral protein can exist close to the error catastrophe without sustaining fatal fitness penalties due to adaptive immunity.
Evaluating, Comparing, and Interpreting Protein Domain Hierarchies
2014-01-01
Abstract Arranging protein domain sequences hierarchically into evolutionarily divergent subgroups is important for investigating evolutionary history, for speeding up web-based similarity searches, for identifying sequence determinants of protein function, and for genome annotation. However, whether or not a particular hierarchy is optimal is often unclear, and independently constructed hierarchies for the same domain can often differ significantly. This article describes methods for statistically evaluating specific aspects of a hierarchy, for probing the criteria underlying its construction and for direct comparisons between hierarchies. Information theoretical notions are used to quantify the contributions of specific hierarchical features to the underlying statistical model. Such features include subhierarchies, sequence subgroups, individual sequences, and subgroup-associated signature patterns. Underlying properties are graphically displayed in plots of each specific feature's contributions, in heat maps of pattern residue conservation, in “contrast alignments,” and through cross-mapping of subgroups between hierarchies. Together, these approaches provide a deeper understanding of protein domain functional divergence, reveal uncertainties caused by inconsistent patterns of sequence conservation, and help resolve conflicts between competing hierarchies. PMID:24559108
Genome-wide mapping of autonomous promoter activity in human cells
van Arensbergen, Joris; FitzPatrick, Vincent D.; de Haas, Marcel; Pagie, Ludo; Sluimer, Jasper; Bussemaker, Harmen J.; van Steensel, Bas
2017-01-01
Previous methods to systematically characterize sequence-intrinsic activity of promoters have been limited by relatively low throughput and the length of sequences that could be tested. Here we present Survey of Regulatory Elements (SuRE), a method to assay more than 108 DNA fragments, each 0.2–2kb in size, for their ability to drive transcription autonomously. In SuRE, a plasmid library is constructed of random genomic fragments upstream of a 20bp barcode and decoded by paired-end sequencing. This library is then transfected into cells and transcribed barcodes are quantified in the RNA by high throughput sequencing. When applied to the human genome, we achieved a 55-fold genome coverage, allowing us to map autonomous promoter activity genome-wide. By computational modeling we delineated subregions within promoters that are relevant for their activity. For instance, we show that antisense promoter transcription is generally dependent on the sense core promoter sequences, and that most enhancers and several families of repetitive elements act as autonomous transcription initiation sites. PMID:28024146
Duan, Chuanren; Cui, Yamin; Zhao, Yi; Zhai, Jun; Zhang, Baoyun; Zhang, Kun; Sun, Da; Chen, Hang
2016-10-01
A genetic marker within the 16S rRNA gene of Faecalibacterium was identified for use in a quantitative PCR (qPCR) assay to detect swine faecal contamination in water. A total of 146,038 bacterial sequences were obtained using 454 pyrosequencing. By comparative bioinformatics analysis of Faecalibacterium sequences with those of numerous swine and other animal species, swine-specific Faecalibacterium 16S rRNA gene sequences were identified and Polymerase Chain Okabe (PCR) primer sets designed and tested against faecal DNA samples from swine and non-swine sources. Two PCR primer sets, PFB-1 and PFB-2, showed the highest specificity to swine faecal waste and had no cross-reaction with other animal samples. PFB-1 and PFB-2 amplified 16S rRNA gene sequences from 50 samples of swine with positive ratios of 86 and 90%, respectively. We compared swine-specific Faecalibacterium qPCR assays for the purpose of quantifying the newly identified markers. The quantification limits (LOQs) of PFB-1 and PFB-2 markers in environmental water were 6.5 and 2.9 copies per 100 ml, respectively. Of the swine-associated assays tested, PFB-2 was more sensitive in detecting the swine faecal waste and quantifying the microbial load. Furthermore, the microbial abundance and diversity of the microbiomes of swine and other animal faeces were estimated using operational taxonomic units (OTUs). The species specificity was demonstrated for the microbial populations present in various animal faeces. Copyright © 2016 Elsevier Ltd. All rights reserved.
Comte, Alexia; Gräfenhan, Tom; Links, Matthew G; Hemmingsen, Sean M; Dumonceaux, Tim J
2017-01-01
We examined the epiphytic microbiome of cereal grain using the universal barcode chaperonin-60 (cpn60). Microbial community profiling of seed washes containing DNA extracts prepared from field-grown cereal grain detected sequences from a fungus identified only to Class Sordariomycetes. To identify the fungal sequence and to improve the reference database, we determined cpn60 sequences from field-collected and reference strains of the ergot fungus, Claviceps purpurea. These data allowed us to identify this fungal sequence as deriving from C. purpurea, and suggested that C. purpurea DNA is readily detectable on agricultural commodities, including those for which ergot was not identified as a grading factor. To get a sense of the prevalence and level of C. purpurea DNA in cereal grains, we developed a quantitative PCR assay based on the fungal internal transcribed spacer (ITS) and applied it to 137 samples from the 2014 crop year. The amount of Claviceps DNA quantified correlated strongly with the proportion of ergot sclerotia identified in each grain lot, although there was evidence that non-target organisms were responsible for some false positives with the ITS-based assay. We therefore developed a cpn60-targeted loop-mediated isothermal amplification assay and applied it to the same grain wash samples. The time to positive displayed a significant, inverse correlation to ergot levels determined by visual ratings. These results indicate that both laboratory-based and field-adaptable molecular diagnostic assays can be used to detect and quantify pathogen load in bulk commodities using cereal grain washes.
Comte, Alexia; Gräfenhan, Tom; Links, Matthew G.; Hemmingsen, Sean M.
2017-01-01
We examined the epiphytic microbiome of cereal grain using the universal barcode chaperonin-60 (cpn60). Microbial community profiling of seed washes containing DNA extracts prepared from field-grown cereal grain detected sequences from a fungus identified only to Class Sordariomycetes. To identify the fungal sequence and to improve the reference database, we determined cpn60 sequences from field-collected and reference strains of the ergot fungus, Claviceps purpurea. These data allowed us to identify this fungal sequence as deriving from C. purpurea, and suggested that C. purpurea DNA is readily detectable on agricultural commodities, including those for which ergot was not identified as a grading factor. To get a sense of the prevalence and level of C. purpurea DNA in cereal grains, we developed a quantitative PCR assay based on the fungal internal transcribed spacer (ITS) and applied it to 137 samples from the 2014 crop year. The amount of Claviceps DNA quantified correlated strongly with the proportion of ergot sclerotia identified in each grain lot, although there was evidence that non-target organisms were responsible for some false positives with the ITS-based assay. We therefore developed a cpn60-targeted loop-mediated isothermal amplification assay and applied it to the same grain wash samples. The time to positive displayed a significant, inverse correlation to ergot levels determined by visual ratings. These results indicate that both laboratory-based and field-adaptable molecular diagnostic assays can be used to detect and quantify pathogen load in bulk commodities using cereal grain washes. PMID:28257512
Schilmiller, Anthony L; Miner, Dennis P; Larson, Matthew; McDowell, Eric; Gang, David R; Wilkerson, Curtis; Last, Robert L
2010-07-01
Shotgun proteomics analysis allows hundreds of proteins to be identified and quantified from a single sample at relatively low cost. Extensive DNA sequence information is a prerequisite for shotgun proteomics, and it is ideal to have sequence for the organism being studied rather than from related species or accessions. While this requirement has limited the set of organisms that are candidates for this approach, next generation sequencing technologies make it feasible to obtain deep DNA sequence coverage from any organism. As part of our studies of specialized (secondary) metabolism in tomato (Solanum lycopersicum) trichomes, 454 sequencing of cDNA was combined with shotgun proteomics analyses to obtain in-depth profiles of genes and proteins expressed in leaf and stem glandular trichomes of 3-week-old plants. The expressed sequence tag and proteomics data sets combined with metabolite analysis led to the discovery and characterization of a sesquiterpene synthase that produces beta-caryophyllene and alpha-humulene from E,E-farnesyl diphosphate in trichomes of leaf but not of stem. This analysis demonstrates the utility of combining high-throughput cDNA sequencing with proteomics experiments in a target tissue. These data can be used for dissection of other biochemical processes in these specialized epidermal cells.
Schilmiller, Anthony L.; Miner, Dennis P.; Larson, Matthew; McDowell, Eric; Gang, David R.; Wilkerson, Curtis; Last, Robert L.
2010-01-01
Shotgun proteomics analysis allows hundreds of proteins to be identified and quantified from a single sample at relatively low cost. Extensive DNA sequence information is a prerequisite for shotgun proteomics, and it is ideal to have sequence for the organism being studied rather than from related species or accessions. While this requirement has limited the set of organisms that are candidates for this approach, next generation sequencing technologies make it feasible to obtain deep DNA sequence coverage from any organism. As part of our studies of specialized (secondary) metabolism in tomato (Solanum lycopersicum) trichomes, 454 sequencing of cDNA was combined with shotgun proteomics analyses to obtain in-depth profiles of genes and proteins expressed in leaf and stem glandular trichomes of 3-week-old plants. The expressed sequence tag and proteomics data sets combined with metabolite analysis led to the discovery and characterization of a sesquiterpene synthase that produces β-caryophyllene and α-humulene from E,E-farnesyl diphosphate in trichomes of leaf but not of stem. This analysis demonstrates the utility of combining high-throughput cDNA sequencing with proteomics experiments in a target tissue. These data can be used for dissection of other biochemical processes in these specialized epidermal cells. PMID:20431087
Scalable whole-exome sequencing of cell-free DNA reveals high concordance with metastatic tumors.
Adalsteinsson, Viktor A; Ha, Gavin; Freeman, Samuel S; Choudhury, Atish D; Stover, Daniel G; Parsons, Heather A; Gydush, Gregory; Reed, Sarah C; Rotem, Denisse; Rhoades, Justin; Loginov, Denis; Livitz, Dimitri; Rosebrock, Daniel; Leshchiner, Ignaty; Kim, Jaegil; Stewart, Chip; Rosenberg, Mara; Francis, Joshua M; Zhang, Cheng-Zhong; Cohen, Ofir; Oh, Coyin; Ding, Huiming; Polak, Paz; Lloyd, Max; Mahmud, Sairah; Helvie, Karla; Merrill, Margaret S; Santiago, Rebecca A; O'Connor, Edward P; Jeong, Seong H; Leeson, Rachel; Barry, Rachel M; Kramkowski, Joseph F; Zhang, Zhenwei; Polacek, Laura; Lohr, Jens G; Schleicher, Molly; Lipscomb, Emily; Saltzman, Andrea; Oliver, Nelly M; Marini, Lori; Waks, Adrienne G; Harshman, Lauren C; Tolaney, Sara M; Van Allen, Eliezer M; Winer, Eric P; Lin, Nancy U; Nakabayashi, Mari; Taplin, Mary-Ellen; Johannessen, Cory M; Garraway, Levi A; Golub, Todd R; Boehm, Jesse S; Wagle, Nikhil; Getz, Gad; Love, J Christopher; Meyerson, Matthew
2017-11-06
Whole-exome sequencing of cell-free DNA (cfDNA) could enable comprehensive profiling of tumors from blood but the genome-wide concordance between cfDNA and tumor biopsies is uncertain. Here we report ichorCNA, software that quantifies tumor content in cfDNA from 0.1× coverage whole-genome sequencing data without prior knowledge of tumor mutations. We apply ichorCNA to 1439 blood samples from 520 patients with metastatic prostate or breast cancers. In the earliest tested sample for each patient, 34% of patients have ≥10% tumor-derived cfDNA, sufficient for standard coverage whole-exome sequencing. Using whole-exome sequencing, we validate the concordance of clonal somatic mutations (88%), copy number alterations (80%), mutational signatures, and neoantigens between cfDNA and matched tumor biopsies from 41 patients with ≥10% cfDNA tumor content. In summary, we provide methods to identify patients eligible for comprehensive cfDNA profiling, revealing its applicability to many patients, and demonstrate high concordance of cfDNA and metastatic tumor whole-exome sequencing.
Association of Amine-Receptor DNA Sequence Variants with Associative Learning in the Honeybee.
Lagisz, Malgorzata; Mercer, Alison R; de Mouzon, Charlotte; Santos, Luana L S; Nakagawa, Shinichi
2016-03-01
Octopamine- and dopamine-based neuromodulatory systems play a critical role in learning and learning-related behaviour in insects. To further our understanding of these systems and resulting phenotypes, we quantified DNA sequence variations at six loci coding octopamine-and dopamine-receptors and their association with aversive and appetitive learning traits in a population of honeybees. We identified 79 polymorphic sequence markers (mostly SNPs and a few insertions/deletions) located within or close to six candidate genes. Intriguingly, we found that levels of sequence variation in the protein-coding regions studied were low, indicating that sequence variation in the coding regions of receptor genes critical to learning and memory is strongly selected against. Non-coding and upstream regions of the same genes, however, were less conserved and sequence variations in these regions were weakly associated with between-individual differences in learning-related traits. While these associations do not directly imply a specific molecular mechanism, they suggest that the cross-talk between dopamine and octopamine signalling pathways may influence olfactory learning and memory in the honeybee.
Espin‐Garcia, Osvaldo; Craiu, Radu V.
2017-01-01
ABSTRACT We evaluate two‐phase designs to follow‐up findings from genome‐wide association study (GWAS) when the cost of regional sequencing in the entire cohort is prohibitive. We develop novel expectation‐maximization‐based inference under a semiparametric maximum likelihood formulation tailored for post‐GWAS inference. A GWAS‐SNP (where SNP is single nucleotide polymorphism) serves as a surrogate covariate in inferring association between a sequence variant and a normally distributed quantitative trait (QT). We assess test validity and quantify efficiency and power of joint QT‐SNP‐dependent sampling and analysis under alternative sample allocations by simulations. Joint allocation balanced on SNP genotype and extreme‐QT strata yields significant power improvements compared to marginal QT‐ or SNP‐based allocations. We illustrate the proposed method and evaluate the sensitivity of sample allocation to sampling variation using data from a sequencing study of systolic blood pressure. PMID:29239496
Factors influencing success of clinical genome sequencing across a broad spectrum of disorders
Lise, Stefano; Broxholme, John; Cazier, Jean-Baptiste; Rimmer, Andy; Kanapin, Alexander; Lunter, Gerton; Fiddy, Simon; Allan, Chris; Aricescu, A. Radu; Attar, Moustafa; Babbs, Christian; Becq, Jennifer; Beeson, David; Bento, Celeste; Bignell, Patricia; Blair, Edward; Buckle, Veronica J; Bull, Katherine; Cais, Ondrej; Cario, Holger; Chapel, Helen; Copley, Richard R; Cornall, Richard; Craft, Jude; Dahan, Karin; Davenport, Emma E; Dendrou, Calliope; Devuyst, Olivier; Fenwick, Aimée L; Flint, Jonathan; Fugger, Lars; Gilbert, Rodney D; Goriely, Anne; Green, Angie; Greger, Ingo H.; Grocock, Russell; Gruszczyk, Anja V; Hastings, Robert; Hatton, Edouard; Higgs, Doug; Hill, Adrian; Holmes, Chris; Howard, Malcolm; Hughes, Linda; Humburg, Peter; Johnson, David; Karpe, Fredrik; Kingsbury, Zoya; Kini, Usha; Knight, Julian C; Krohn, Jonathan; Lamble, Sarah; Langman, Craig; Lonie, Lorne; Luck, Joshua; McCarthy, Davis; McGowan, Simon J; McMullin, Mary Frances; Miller, Kerry A; Murray, Lisa; Németh, Andrea H; Nesbit, M Andrew; Nutt, David; Ormondroyd, Elizabeth; Oturai, Annette Bang; Pagnamenta, Alistair; Patel, Smita Y; Percy, Melanie; Petousi, Nayia; Piazza, Paolo; Piret, Sian E; Polanco-Echeverry, Guadalupe; Popitsch, Niko; Powrie, Fiona; Pugh, Chris; Quek, Lynn; Robbins, Peter A; Robson, Kathryn; Russo, Alexandra; Sahgal, Natasha; van Schouwenburg, Pauline A; Schuh, Anna; Silverman, Earl; Simmons, Alison; Sørensen, Per Soelberg; Sweeney, Elizabeth; Taylor, John; Thakker, Rajesh V; Tomlinson, Ian; Trebes, Amy; Twigg, Stephen RF; Uhlig, Holm H; Vyas, Paresh; Vyse, Tim; Wall, Steven A; Watkins, Hugh; Whyte, Michael P; Witty, Lorna; Wright, Ben; Yau, Chris; Buck, David; Humphray, Sean; Ratcliffe, Peter J; Bell, John I; Wilkie, Andrew OM; Bentley, David; Donnelly, Peter; McVean, Gilean
2015-01-01
To assess factors influencing the success of whole genome sequencing for mainstream clinical diagnosis, we sequenced 217 individuals from 156 independent cases across a broad spectrum of disorders in whom prior screening had identified no pathogenic variants. We quantified the number of candidate variants identified using different strategies for variant calling, filtering, annotation and prioritisation. We found that jointly calling variants across samples, filtering against both local and external databases, deploying multiple annotation tools and using familial transmission above biological plausibility contributed to accuracy. Overall, we identified disease causing variants in 21% of cases, rising to 34% (23/68) for Mendelian disorders and 57% (8/14) in trios. We also discovered 32 potentially clinically actionable variants in 18 genes unrelated to the referral disorder, though only four were ultimately considered reportable. Our results demonstrate the value of genome sequencing for routine clinical diagnosis, but also highlight many outstanding challenges. PMID:25985138
Quantifying low-frequency revertants in oral poliovirus vaccine using next generation sequencing.
Sarcey, Eric; Serres, Aurélie; Tindy, Fabrice; Chareyre, Audrey; Ng, Siemon; Nicolas, Marine; Vetter, Emmanuelle; Bonnevay, Thierry; Abachin, Eric; Mallet, Laurent
2017-08-01
Spontaneous reversion to neurovirulence of live attenuated oral poliovirus vaccine (OPV) serotype 3 (chiefly involving the n.472U>C mutation), must be monitored during production to ensure vaccine safety and consistency. Mutant analysis by polymerase chain reaction and restriction enzyme cleavage (MAPREC) has long been endorsed by the World Health Organization as the preferred in vitro test for this purpose; however, it requires radiolabeling, which is no longer supported by many laboratories. We evaluated the performance and suitability of next generation sequencing (NGS) as an alternative to MAPREC. The linearity of NGS was demonstrated at revertant concentrations equivalent to the study range of 0.25%-1.5%. NGS repeatability and intermediate precision were comparable across all tested samples, and NGS was highly reproducible, irrespective of sequencing platform or analysis software used. NGS was performed on OPV serotype 3 working seed lots and monovalent bulks (n=21) that were previously tested using MAPREC, and which covered the representative range of vaccine production. Percentages of 472-C revertants identified by NGS and MAPREC were comparable and highly correlated (r≥0.80), with a Pearson correlation coefficient of 0.95585 (p<0.0001). NGS demonstrated statistically equivalent performance to that of MAPREC for quantifying low-frequency OPV serotype 3 revertants, and offers a valid alternative to MAPREC. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
Reconstructing a herbivore’s diet using a novel rbcL DNA mini-barcode for plants
Erickson, David L.; Reed, Elizabeth; Ramachandran, Padmini; Bourg, Norman; McShea, William J.; Ottesen, Andrea
2017-01-01
Next Generation Sequencing and the application of metagenomic analyses can be used to answer questions about animal diet choice and study the consequences of selective foraging by herbivores. The quantification of herbivore diet choice with respect to native versus exotic plant species is particularly relevant given concerns of invasive species establishment and their effects on ecosystems. While increased abundance of white-tailed deer (Odocoileus virginianus) appears to correlate with increased incidence of invasive plant species, data supporting a causal link is scarce. We used a metabarcoding approach (PCR amplicons of the plant rbcL gene) to survey the diet of white-tailed deer (fecal samples), from a forested site in Warren County, Virginia with a comprehensive plant species inventory and corresponding reference collection of plant barcode and chloroplast sequences. We sampled fecal pellet piles and extracted DNA from 12 individual deer in October 2014. These samples were compared to a reference DNA library of plant species collected within the study area. For 72 % of the amplicons, we were able to assign taxonomy at the species level, which provides for the first time—sufficient taxonomic resolution to quantify the relative frequency at which native and exotic plant species are being consumed by white-tailed deer. For each of the 12 individual deer we collected three subsamples from the same fecal sample, resulting in sequencing 36 total samples. Using Qiime, we quantified the plant DNA found in all 36 samples, and found that variance within samples was less than variance between samples (F = 1.73, P = 0.004), indicating additional subsamples may not be necessary. Species level diversity ranged from 60 to 93 OTUs per individual and nearly 70 % of all plant sequences recovered were from native plant species. The number of species detected did reduce significantly (range 4–12) when we excluded species whose OTU composed <1 % of each sample’s total. When compared to the abundance of native and non-natives plants inventoried in the local community, our results support the observation that white-tailed deer have strong foraging preferences, but these preferences were not consistent for species in either class. Deer forage behaviour may favour some exotic species, but not all.
Reconstructing a herbivore's diet using a novel rbcL DNA mini-barcode for plants.
Erickson, David L; Reed, Elizabeth; Ramachandran, Padmini; Bourg, Norman A; McShea, William J; Ottesen, Andrea
2017-05-01
Next Generation Sequencing and the application of metagenomic analyses can be used to answer questions about animal diet choice and study the consequences of selective foraging by herbivores. The quantification of herbivore diet choice with respect to native versus exotic plant species is particularly relevant given concerns of invasive species establishment and their effects on ecosystems. While increased abundance of white-tailed deer ( Odocoileus virginianus ) appears to correlate with increased incidence of invasive plant species, data supporting a causal link is scarce. We used a metabarcoding approach (PCR amplicons of the plant rbc L gene) to survey the diet of white-tailed deer (fecal samples), from a forested site in Warren County, Virginia with a comprehensive plant species inventory and corresponding reference collection of plant barcode and chloroplast sequences. We sampled fecal pellet piles and extracted DNA from 12 individual deer in October 2014. These samples were compared to a reference DNA library of plant species collected within the study area. For 72 % of the amplicons, we were able to assign taxonomy at the species level, which provides for the first time-sufficient taxonomic resolution to quantify the relative frequency at which native and exotic plant species are being consumed by white-tailed deer. For each of the 12 individual deer we collected three subsamples from the same fecal sample, resulting in sequencing 36 total samples. Using Qiime, we quantified the plant DNA found in all 36 samples, and found that variance within samples was less than variance between samples ( F = 1.73, P = 0.004), indicating additional subsamples may not be necessary. Species level diversity ranged from 60 to 93 OTUs per individual and nearly 70 % of all plant sequences recovered were from native plant species. The number of species detected did reduce significantly (range 4-12) when we excluded species whose OTU composed <1 % of each sample's total. When compared to the abundance of native and non-natives plants inventoried in the local community, our results support the observation that white-tailed deer have strong foraging preferences, but these preferences were not consistent for species in either class. Deer forage behaviour may favour some exotic species, but not all.
Reconstructing a herbivore’s diet using a novel rbcL DNA mini-barcode for plants
Erickson, David L.; Reed, Elizabeth; Ramachandran, Padmini; Bourg, Norman A.; Ottesen, Andrea
2017-01-01
Abstract Next Generation Sequencing and the application of metagenomic analyses can be used to answer questions about animal diet choice and study the consequences of selective foraging by herbivores. The quantification of herbivore diet choice with respect to native versus exotic plant species is particularly relevant given concerns of invasive species establishment and their effects on ecosystems. While increased abundance of white-tailed deer (Odocoileus virginianus) appears to correlate with increased incidence of invasive plant species, data supporting a causal link is scarce. We used a metabarcoding approach (PCR amplicons of the plant rbcL gene) to survey the diet of white-tailed deer (fecal samples), from a forested site in Warren County, Virginia with a comprehensive plant species inventory and corresponding reference collection of plant barcode and chloroplast sequences. We sampled fecal pellet piles and extracted DNA from 12 individual deer in October 2014. These samples were compared to a reference DNA library of plant species collected within the study area. For 72 % of the amplicons, we were able to assign taxonomy at the species level, which provides for the first time—sufficient taxonomic resolution to quantify the relative frequency at which native and exotic plant species are being consumed by white-tailed deer. For each of the 12 individual deer we collected three subsamples from the same fecal sample, resulting in sequencing 36 total samples. Using Qiime, we quantified the plant DNA found in all 36 samples, and found that variance within samples was less than variance between samples (F = 1.73, P = 0.004), indicating additional subsamples may not be necessary. Species level diversity ranged from 60 to 93 OTUs per individual and nearly 70 % of all plant sequences recovered were from native plant species. The number of species detected did reduce significantly (range 4–12) when we excluded species whose OTU composed <1 % of each sample’s total. When compared to the abundance of native and non-natives plants inventoried in the local community, our results support the observation that white-tailed deer have strong foraging preferences, but these preferences were not consistent for species in either class. Deer forage behaviour may favour some exotic species, but not all. PMID:28533898
A Cyber-Attack Detection Model Based on Multivariate Analyses
NASA Astrophysics Data System (ADS)
Sakai, Yuto; Rinsaka, Koichiro; Dohi, Tadashi
In the present paper, we propose a novel cyber-attack detection model based on two multivariate-analysis methods to the audit data observed on a host machine. The statistical techniques used here are the well-known Hayashi's quantification method IV and cluster analysis method. We quantify the observed qualitative audit event sequence via the quantification method IV, and collect similar audit event sequence in the same groups based on the cluster analysis. It is shown in simulation experiments that our model can improve the cyber-attack detection accuracy in some realistic cases where both normal and attack activities are intermingled.
Detecting Nano-Scale Vibrations in Rotating Devices by Using Advanced Computational Methods
del Toro, Raúl M.; Haber, Rodolfo E.; Schmittdiel, Michael C.
2010-01-01
This paper presents a computational method for detecting vibrations related to eccentricity in ultra precision rotation devices used for nano-scale manufacturing. The vibration is indirectly measured via a frequency domain analysis of the signal from a piezoelectric sensor attached to the stationary component of the rotating device. The algorithm searches for particular harmonic sequences associated with the eccentricity of the device rotation axis. The detected sequence is quantified and serves as input to a regression model that estimates the eccentricity. A case study presents the application of the computational algorithm during precision manufacturing processes. PMID:22399918
Widespread Site-Dependent Buffering of Human Regulatory Polymorphism
Kutyavin, Tanya; Stamatoyannopoulos, John A.
2012-01-01
The average individual is expected to harbor thousands of variants within non-coding genomic regions involved in gene regulation. However, it is currently not possible to interpret reliably the functional consequences of genetic variation within any given transcription factor recognition sequence. To address this, we comprehensively analyzed heritable genome-wide binding patterns of a major sequence-specific regulator (CTCF) in relation to genetic variability in binding site sequences across a multi-generational pedigree. We localized and quantified CTCF occupancy by ChIP-seq in 12 related and unrelated individuals spanning three generations, followed by comprehensive targeted resequencing of the entire CTCF–binding landscape across all individuals. We identified hundreds of variants with reproducible quantitative effects on CTCF occupancy (both positive and negative). While these effects paralleled protein–DNA recognition energetics when averaged, they were extensively buffered by striking local context dependencies. In the significant majority of cases buffering was complete, resulting in silent variants spanning every position within the DNA recognition interface irrespective of level of binding energy or evolutionary constraint. The prevalence of complex partial or complete buffering effects severely constrained the ability to predict reliably the impact of variation within any given binding site instance. Surprisingly, 40% of variants that increased CTCF occupancy occurred at positions of human–chimp divergence, challenging the expectation that the vast majority of functional regulatory variants should be deleterious. Our results suggest that, even in the presence of “perfect” genetic information afforded by resequencing and parallel studies in multiple related individuals, genomic site-specific prediction of the consequences of individual variation in regulatory DNA will require systematic coupling with empirical functional genomic measurements. PMID:22457641
The Development of Voiceless Sibilant Fricatives in Putonghua-Speaking Children
ERIC Educational Resources Information Center
Li, Fangfang; Munson, Benjamin
2016-01-01
Purpose The aims of the present study are (a) to quantify the developmental sequence of fricative mastery in Putonghua-speaking children and discuss the observed pattern in relation to existing theoretical positions, and (b) to describe the acquisition of the fine-articulatory/acoustic details of fricatives in the multidimensional acoustic space.…
ERIC Educational Resources Information Center
Valasek, Mark A.; Repa, Joyce J.
2005-01-01
In recent years, real-time polymerase chain reaction (PCR) has emerged as a robust and widely used methodology for biological investigation because it can detect and quantify very small amounts of specific nucleic acid sequences. As a research tool, a major application of this technology is the rapid and accurate assessment of changes in gene…
Lee, Soohyun; Seo, Chae Hwa; Alver, Burak Han; Lee, Sanghyuk; Park, Peter J
2015-09-03
RNA-seq has been widely used for genome-wide expression profiling. RNA-seq data typically consists of tens of millions of short sequenced reads from different transcripts. However, due to sequence similarity among genes and among isoforms, the source of a given read is often ambiguous. Existing approaches for estimating expression levels from RNA-seq reads tend to compromise between accuracy and computational cost. We introduce a new approach for quantifying transcript abundance from RNA-seq data. EMSAR (Estimation by Mappability-based Segmentation And Reclustering) groups reads according to the set of transcripts to which they are mapped and finds maximum likelihood estimates using a joint Poisson model for each optimal set of segments of transcripts. The method uses nearly all mapped reads, including those mapped to multiple genes. With an efficient transcriptome indexing based on modified suffix arrays, EMSAR minimizes the use of CPU time and memory while achieving accuracy comparable to the best existing methods. EMSAR is a method for quantifying transcripts from RNA-seq data with high accuracy and low computational cost. EMSAR is available at https://github.com/parklab/emsar.
Measuring information-based energy and temperature of literary texts
NASA Astrophysics Data System (ADS)
Chang, Mei-Chu; Yang, Albert C.-C.; Eugene Stanley, H.; Peng, C.-K.
2017-02-01
We apply a statistical method, information-based energy, to quantify informative symbolic sequences. To apply this method to literary texts, it is assumed that different words with different occurrence frequencies are at different energy levels, and that the energy-occurrence frequency distribution obeys a Boltzmann distribution. The temperature within the Boltzmann distribution can be an indicator for the author's writing capacity as the repertory of thoughts. The relative temperature of a text is obtained by comparing the energy-occurrence frequency distributions of words collected from one text versus from all texts of the same author. Combining the relative temperature with the Shannon entropy as the text complexity, the information-based energy of the text is defined and can be viewed as a quantitative evaluation of an author's writing performance. We demonstrate the method by analyzing two authors, Shakespeare in English and Jin Yong in Chinese, and find that their well-known works are associated with higher information-based energies. This method can be used to measure the creativity level of a writer's work in linguistics, and can also quantify symbolic sequences in different systems.
Anthropogenic Litter in Urban Freshwater Ecosystems: Distribution and Microbial Interactions
Hoellein, Timothy; Rojas, Miguel; Pink, Adam; Gasior, Joseph; Kelly, John
2014-01-01
Accumulation of anthropogenic litter (i.e. garbage; AL) and its ecosystem effects in marine environments are well documented. Rivers receive AL from terrestrial habitats and represent a major source of AL to marine environments, but AL is rarely studied within freshwater ecosystems. Our objectives were to 1) quantify AL density in urban freshwaters, 2) compare AL abundance among freshwater, terrestrial, and marine ecosystems, and 3) characterize the activity and composition of AL biofilms in freshwater habitats. We quantified AL from the Chicago River and Chicago's Lake Michigan shoreline, and found that AL abundance in Chicago freshwater ecosystems was comparable to previously reported data for marine and terrestrial ecosystems, although AL density and composition differed among habitats. To assess microbial interactions with AL, we incubated AL and natural substrates in 3 freshwater ecosystems, quantified biofilm metabolism as gross primary production (GPP) and community respiration (CR), and characterized biofilm bacterial community composition via high-throughput sequencing of 16S rRNA genes. The main driver of biofilm community composition was incubation location (e.g., river vs pond), but there were some significant differences in biofilm composition and metabolism among substrates. For example, biofilms on organic substrates (cardboard and leaves) had lower GPP than hard substrates (glass, plastic, aluminum and tiles). In addition, bacterial communities on organic substrates were distinct in composition from those on hard substrates, with higher relative abundances of bacteria associated with cellulose decomposition. Finally, we used our results to develop a conceptual diagram designed to unite the study of AL in terrestrial and freshwater environments with the well-established field of marine debris research. We suggest this broad perspective will be useful for future studies which synthesize AL sources, ecosystem effects, and fate across multiple ecosystem types, and will benefit management and reduction of global AL accumulations. PMID:24955768
Anthropogenic litter in urban freshwater ecosystems: distribution and microbial interactions.
Hoellein, Timothy; Rojas, Miguel; Pink, Adam; Gasior, Joseph; Kelly, John
2014-01-01
Accumulation of anthropogenic litter (i.e. garbage; AL) and its ecosystem effects in marine environments are well documented. Rivers receive AL from terrestrial habitats and represent a major source of AL to marine environments, but AL is rarely studied within freshwater ecosystems. Our objectives were to 1) quantify AL density in urban freshwaters, 2) compare AL abundance among freshwater, terrestrial, and marine ecosystems, and 3) characterize the activity and composition of AL biofilms in freshwater habitats. We quantified AL from the Chicago River and Chicago's Lake Michigan shoreline, and found that AL abundance in Chicago freshwater ecosystems was comparable to previously reported data for marine and terrestrial ecosystems, although AL density and composition differed among habitats. To assess microbial interactions with AL, we incubated AL and natural substrates in 3 freshwater ecosystems, quantified biofilm metabolism as gross primary production (GPP) and community respiration (CR), and characterized biofilm bacterial community composition via high-throughput sequencing of 16S rRNA genes. The main driver of biofilm community composition was incubation location (e.g., river vs pond), but there were some significant differences in biofilm composition and metabolism among substrates. For example, biofilms on organic substrates (cardboard and leaves) had lower GPP than hard substrates (glass, plastic, aluminum and tiles). In addition, bacterial communities on organic substrates were distinct in composition from those on hard substrates, with higher relative abundances of bacteria associated with cellulose decomposition. Finally, we used our results to develop a conceptual diagram designed to unite the study of AL in terrestrial and freshwater environments with the well-established field of marine debris research. We suggest this broad perspective will be useful for future studies which synthesize AL sources, ecosystem effects, and fate across multiple ecosystem types, and will benefit management and reduction of global AL accumulations.
Ultrafast Comparison of Personal Genomes via Precomputed Genome Fingerprints.
Glusman, Gustavo; Mauldin, Denise E; Hood, Leroy E; Robinson, Max
2017-01-01
We present an ultrafast method for comparing personal genomes. We transform the standard genome representation (lists of variants relative to a reference) into "genome fingerprints" via locality sensitive hashing. The resulting genome fingerprints can be meaningfully compared even when the input data were obtained using different sequencing technologies, processed using different pipelines, represented in different data formats and relative to different reference versions. Furthermore, genome fingerprints are robust to up to 30% missing data. Because of their reduced size, computation on the genome fingerprints is fast and requires little memory. For example, we could compute all-against-all pairwise comparisons among the 2504 genomes in the 1000 Genomes data set in 67 s at high quality (21 μs per comparison, on a single processor), and achieved a lower quality approximation in just 11 s. Efficient computation enables scaling up a variety of important genome analyses, including quantifying relatedness, recognizing duplicative sequenced genomes in a set, population reconstruction, and many others. The original genome representation cannot be reconstructed from its fingerprint, effectively decoupling genome comparison from genome interpretation; the method thus has significant implications for privacy-preserving genome analytics.
Machczyńska, Joanna; Zimny, Janusz; Bednarek, Piotr Tomasz
2015-10-01
Plant regeneration via in vitro culture can induce genetic and epigenetic variation; however, the extent of such changes in triticale is not yet understood. In the present study, metAFLP, a variation of methylation-sensitive amplified fragment length polymorphism analysis, was used to investigate tissue culture-induced variation in triticale regenerants derived from four distinct genotypes using androgenesis and somatic embryogenesis. The metAFLP technique enabled identification of both sequence and DNA methylation pattern changes in a single experiment. Moreover, it was possible to quantify subtle effects such as sequence variation, demethylation, and de novo methylation, which affected 19, 5.5, 4.5% of sites, respectively. Comparison of variation in different genotypes and with different in vitro regeneration approaches demonstrated that both the culture technique and genetic background of donor plants affected tissue culture-induced variation. The results showed that the metAFLP approach could be used for quantification of tissue culture-induced variation and provided direct evidence that in vitro plant regeneration could cause genetic and epigenetic variation.
Quantification of differential gene expression by multiplexed targeted resequencing of cDNA
Arts, Peer; van der Raadt, Jori; van Gestel, Sebastianus H.C.; Steehouwer, Marloes; Shendure, Jay; Hoischen, Alexander; Albers, Cornelis A.
2017-01-01
Whole-transcriptome or RNA sequencing (RNA-Seq) is a powerful and versatile tool for functional analysis of different types of RNA molecules, but sample reagent and sequencing cost can be prohibitive for hypothesis-driven studies where the aim is to quantify differential expression of a limited number of genes. Here we present an approach for quantification of differential mRNA expression by targeted resequencing of complementary DNA using single-molecule molecular inversion probes (cDNA-smMIPs) that enable highly multiplexed resequencing of cDNA target regions of ∼100 nucleotides and counting of individual molecules. We show that accurate estimates of differential expression can be obtained from molecule counts for hundreds of smMIPs per reaction and that smMIPs are also suitable for quantification of relative gene expression and allele-specific expression. Compared with low-coverage RNA-Seq and a hybridization-based targeted RNA-Seq method, cDNA-smMIPs are a cost-effective high-throughput tool for hypothesis-driven expression analysis in large numbers of genes (10 to 500) and samples (hundreds to thousands). PMID:28474677
Using DNA-labelled nano- and microparticles to track particle transport in the environment
NASA Astrophysics Data System (ADS)
McNew, Coy; Wang, Chaozi; Dahlke, Helen; Lyon, Steve; Walter, Todd
2017-04-01
By utilizing bio-molecular nanotechnology developed for nano-medicines and drug delivery, we are able to produce DNA-labelled nano- and microparticle tracers for use in a myriad of environmental systems. The use of custom sequenced DNA allows for the fabrication of an enormous number of uniquely labelled tracers with identical transport properties (approximately 1.61 x 1060 unique sequences), each independently quantifiable, that can be applied simultaneously in any hydrologic system. By controlling the fabrication procedure to produce particles of custom size and charge, we are able to tag each size-charge combination uniquely in order to directly probe the effect of these variables on the transport properties of the particles. Here we present our methods for fabrication, extraction, and analysis of the DNA nano- and microparticle tracers, along with results from several successful applications of the tracers, including transport and retention analysis at the lab, continuum, and field scales. To date, our DNA-labelled nano- and microparticle tracers have proved useful in surface and subsurface water applications, soil retention, and even subglacial flow pathways. The range of potential applications continue to prove nearly limitless.
Refinetti, Paulo; Morgenthaler, Stephan; Ekstrøm, Per O
2016-07-01
Cycling temperature capillary electrophoresis has been optimised for mutation detection in 76% of the mitochondrial genome. The method was tested on a mixed sample and compared to mutation detection by next generation sequencing. Out of 152 fragments 90 were concordant, 51 discordant and in 11 were semi-concordant. Dilution experiments show that cycling capillary electrophoresis has a detection limit of 1-3%. The detection limit of routine next generation sequencing was in the ranges of 15 to 30%. Cycling temperature capillary electrophoresis detect and accurate quantify mutations at a fraction of the cost and time required to perform a next generation sequencing analysis. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Implications of Secondary Aftershocks for Failure Processes
NASA Astrophysics Data System (ADS)
Gross, S. J.
2001-12-01
When a seismic sequence with more than one mainshock or an unusually large aftershock occurs, there is a compound aftershock sequence. The secondary aftershocks need not have exactly the same decay as the primary sequence, with the differences having implications for the failure process. When the stress step from the secondary mainshock is positive but not large enough to cause immediate failure of all the remaining primary aftershocks, failure processes which involve accelerating slip will produce secondary aftershocks that decay more rapidly than primary aftershocks. This is because the primary aftershocks are an accelerated version of the background seismicity, and secondary aftershocks are an accelerated version of the primary aftershocks. Real stress perturbations may be negative, and heterogeneities in mainshock stress fields mean that the real world situation is quite complicated. I will first describe and verify my picture of secondary aftershock decay with reference to a simple numerical model of slipping faults which obeys rate and state dependent friction and lacks stress heterogeneity. With such a model, it is possible to generate secondary aftershock sequences with perturbed decay patterns, quantify those patterns, and develop an analysis technique capable of correcting for the effect in real data. The secondary aftershocks are defined in terms of frequency linearized time s(T), which is equal to the number of primary aftershocks expected by a time T, $ s ≡ ∫ t=0T n(t) dt, where the start time t=0 is the time of the primary aftershock, and the primary aftershock decay function n(t) is extrapolated forward to the times of the secondary aftershocks. In the absence of secondary sequences the function s(T)$ re-scales the time so that approximately one event occurs per new time unit; the aftershock sequence is gone. If this rescaling is applied in the presence of a secondary sequence, the secondary sequence is shaped like a primary aftershock sequence, and can be fit by the same modeling techniques applied to simple sequences. The later part of the presentation will concern the decay of Hector Mine aftershocks as influenced by the Landers aftershocks. Although attempts to predict the abundance of Hector aftershocks based on stress overlap analysis are not very successful, the analysis does do a good job fitting the decay of secondary sequences.
NASA Astrophysics Data System (ADS)
Emmett, Jeremy; Murphy, Jim
2016-10-01
Structural and compositional variability in the layering sequences comprising Mars' polar layered terrains (PLT's) is likely explained by orbital-forced climatic variations in the sedimentary cycles of water ice and dust from which they formed [1]. The PLT's therefore contain a direct, extensive record of the recent climate history of Mars encoded in their structure and stratigraphy, but deciphering this record requires understanding the depositional history of their dust and water ice constituents. 3D Mars atmosphere modeling enables direct simulation of atmospheric dynamics, aerosol transport and quantification of surface accumulation for a range of past and present orbital configurations. By quantifying the net yearly polar deposition rates of water ice and dust under Mars' current and past orbital configurations characteristic of the last several millions of years, and integrating these into the present with a time-stepping model, the formation history of the north and south PLT's will be investigated, further constraining their age and composition, and, if reproducible, revealing the processes responsible for prominent features and stratigraphy observed within the deposits. Simulating the formation of the deposits by quantifying net deposition rates during past orbital epochs and integrating these into the present, effectively 'rebuilding' the terrains, could aid in understanding deeper stratigraphic trends, correlating between geographically-separated deposits, explaining the presence and shapes of large-scale polar features, and correlating stratigraphy with geological time. Quantification of the magnitude and geographical distribution of surface aerosol accumulation will build on the work of previous GCM-based investigations [3]. Construction and analysis of hypothetical stratigraphic sequences in the PLT's will draw from previous climate-controlled stratigraphy methodologies [2,4], but will utilize GCM-derived net deposition rates to model orbital influences on sedimentation and erosion.[1] Milkovich S.M. and Head J. W. (2005) JGR, 110. [2] Laskar J.B. and Mustard J.F. (2002) Nature, 419, 375-377 [3] Newman C.E. et al. (2005) Icarus, 174, 135-160. [4] Hvidberg C.S. et al. (2012) Icarus, 221, 405-419.
Milton, James A.; Patole, Samson; Yin, Huabing; Xiao, Qiang; Brown, Tom; Melvin, Tracy
2013-01-01
Although strategies for the immobilization of DNA oligonucleotides onto surfaces for bioanalytical and top-down bio-inspired nanobiofabrication approaches are well developed, the effect of introducing spacer molecules between the surface and the DNA oligonucleotide for the hybridization of nanoparticle–DNA conjugates has not been previously assessed in a quantitative manner. The hybridization efficiency of DNA oligonucleotides end-labelled with gold nanoparticles (1.4 or 10 nm diameter) with DNA sequences conjugated to silicon surfaces via hexaethylene glycol phosphate diester oligomer spacers (0, 1, 2, 6 oligomers) was found to be independent of spacer length. To quantify both the density of DNA strands attached to the surfaces and hybridization with the surface-attached DNA, new methodologies have been developed. Firstly, a simple approach based on fluorescence has been developed for determination of the immobilization density of DNA oligonucleotides. Secondly, an approach using mass spectrometry has been created to establish (i) the mean number of DNA oligonucleotides attached to the gold nanoparticles and (ii) the hybridization density of nanoparticle–oligonucleotide conjugates with the silicon surface–attached complementary sequence. These methods and results will be useful for application with nanosensors, the self-assembly of nanoelectronic devices and the attachment of nanoparticles to biomolecules for single-molecule biophysical studies. PMID:23361467
DNA interaction with platinum-based cytostatics revealed by DNA sequencing.
Smerkova, Kristyna; Vaculovic, Tomas; Vaculovicova, Marketa; Kynicky, Jindrich; Brtnicky, Martin; Eckschlager, Tomas; Stiborova, Marie; Hubalek, Jaromir; Adam, Vojtech
2017-12-15
The main mechanism of action of platinum-based cytostatic drugs - cisplatin, oxaliplatin and carboplatin - is the formation of DNA cross-links, which restricts the transcription due to the disability of DNA to enter the active site of the polymerase. The polymerase chain reaction (PCR) was employed as a simplified model of the amplification process in the cell nucleus. PCR with fluorescently labelled dideoxynucleotides commonly employed for DNA sequencing was used to monitor the effect of platinum-based cytostatics on DNA in terms of decrease in labeling efficiency dependent on a presence of the DNA-drug cross-link. It was found that significantly different amounts of the drugs - cisplatin (0.21 μg/mL), oxaliplatin (5.23 μg/mL), and carboplatin (71.11 μg/mL) - were required to cause the same quenching effect (50%) on the fluorescent labelling of 50 μg/mL of DNA. Moreover, it was found that even though the amounts of the drugs was applied to the reaction mixture differing by several orders of magnitude, the amount of incorporated platinum, quantified by inductively coupled plasma mass spectrometry, was in all cases at the level of tenths of μg per 5 μg of DNA. Copyright © 2017 Elsevier Inc. All rights reserved.
Jobst-Schwan, Tilman; Schmidt, Johanna Magdalena; Schneider, Ronen; Hoogstraten, Charlotte A; Ullmann, Jeremy F P; Schapiro, David; Majmundar, Amar J; Kolb, Amy; Eddy, Kaitlyn; Shril, Shirlee; Braun, Daniela A; Poduri, Annapurna; Hildebrandt, Friedhelm
2018-01-01
Until recently, morpholino oligonucleotides have been widely employed in zebrafish as an acute and efficient loss-of-function assay. However, off-target effects and reproducibility issues when compared to stable knockout lines have compromised their further use. Here we employed an acute CRISPR/Cas approach using multiple single guide RNAs targeting simultaneously different positions in two exemplar genes (osgep or tprkb) to increase the likelihood of generating mutations on both alleles in the injected F0 generation and to achieve a similar effect as morpholinos but with the reproducibility of stable lines. This multi single guide RNA approach resulted in median likelihoods for at least one mutation on each allele of >99% and sgRNA specific insertion/deletion profiles as revealed by deep-sequencing. Immunoblot showed a significant reduction for Osgep and Tprkb proteins. For both genes, the acute multi-sgRNA knockout recapitulated the microcephaly phenotype and reduction in survival that we observed previously in stable knockout lines, though milder in the acute multi-sgRNA knockout. Finally, we quantify the degree of mutagenesis by deep sequencing, and provide a mathematical model to quantitate the chance for a biallelic loss-of-function mutation. Our findings can be generalized to acute and stable CRISPR/Cas targeting for any zebrafish gene of interest.
Takei, Hiraku; Morishita, Soji; Araki, Marito; Edahiro, Yoko; Sunami, Yoshitaka; Hironaka, Yumi; Noda, Naohiro; Sekiguchi, Yuji; Tsuneda, Satoshi; Ohsaka, Akimichi; Komatsu, Norio
2014-01-01
A gain-of-function mutation in the myeloproliferative leukemia virus (MPL) gene, which encodes the thrombopoietin receptor, has been identified in patients with essential thrombocythemia and primary myelofibrosis, subgroups of classic myeloproliferative neoplasms (MPNs). The presence of MPL gene mutations is a critical diagnostic criterion for these diseases. Here, we developed a rapid, simple, and cost-effective method of detecting two major MPL mutations, MPLW515L/K, in a single PCR assay; we termed this method DARMS (dual amplification refractory mutation system)-PCR. DARMS-PCR is designed to produce three different PCR products corresponding to MPLW515L, MPLW515K, and all MPL alleles. The amplicons are later detected and quantified using a capillary sequencer to determine the relative frequencies of the mutant and wild-type alleles. Applying DARMS-PCR to human specimens, we successfully identified MPL mutations in MPN patients, with the exception of patients bearing mutant allele frequencies below the detection limit (5%) of this method. The MPL mutant allele frequencies determined using DARMS-PCR correlated strongly with the values determined using deep sequencing. Thus, we demonstrated the potential of DARMS-PCR to detect MPL mutations and determine the allele frequencies in a timely and cost-effective manner. PMID:25144224
Takei, Hiraku; Morishita, Soji; Araki, Marito; Edahiro, Yoko; Sunami, Yoshitaka; Hironaka, Yumi; Noda, Naohiro; Sekiguchi, Yuji; Tsuneda, Satoshi; Ohsaka, Akimichi; Komatsu, Norio
2014-01-01
A gain-of-function mutation in the myeloproliferative leukemia virus (MPL) gene, which encodes the thrombopoietin receptor, has been identified in patients with essential thrombocythemia and primary myelofibrosis, subgroups of classic myeloproliferative neoplasms (MPNs). The presence of MPL gene mutations is a critical diagnostic criterion for these diseases. Here, we developed a rapid, simple, and cost-effective method of detecting two major MPL mutations, MPLW515L/K, in a single PCR assay; we termed this method DARMS (dual amplification refractory mutation system)-PCR. DARMS-PCR is designed to produce three different PCR products corresponding to MPLW515L, MPLW515K, and all MPL alleles. The amplicons are later detected and quantified using a capillary sequencer to determine the relative frequencies of the mutant and wild-type alleles. Applying DARMS-PCR to human specimens, we successfully identified MPL mutations in MPN patients, with the exception of patients bearing mutant allele frequencies below the detection limit (5%) of this method. The MPL mutant allele frequencies determined using DARMS-PCR correlated strongly with the values determined using deep sequencing. Thus, we demonstrated the potential of DARMS-PCR to detect MPL mutations and determine the allele frequencies in a timely and cost-effective manner.
Inducing protein aggregation by extensional flow
Dobson, John; Kumar, Amit; Willis, Leon F.; Tuma, Roman; Higazi, Daniel R.; Turner, Richard; Lowe, David C.; Ashcroft, Alison E.; Radford, Sheena E.; Kapur, Nikil
2017-01-01
Relative to other extrinsic factors, the effects of hydrodynamic flow fields on protein stability and conformation remain poorly understood. Flow-induced protein remodeling and/or aggregation is observed both in Nature and during the large-scale industrial manufacture of proteins. Despite its ubiquity, the relationships between the type and magnitude of hydrodynamic flow, a protein’s structure and stability, and the resultant aggregation propensity are unclear. Here, we assess the effects of a defined and quantified flow field dominated by extensional flow on the aggregation of BSA, β2-microglobulin (β2m), granulocyte colony stimulating factor (G-CSF), and three monoclonal antibodies (mAbs). We show that the device induces protein aggregation after exposure to an extensional flow field for 0.36–1.8 ms, at concentrations as low as 0.5 mg mL−1. In addition, we reveal that the extent of aggregation depends on the applied strain rate and the concentration, structural scaffold, and sequence of the protein. Finally we demonstrate the in situ labeling of a buried cysteine residue in BSA during extensional stress. Together, these data indicate that an extensional flow readily unfolds thermodynamically and kinetically stable proteins, exposing previously sequestered sequences whose aggregation propensity determines the probability and extent of aggregation. PMID:28416674
NASA Astrophysics Data System (ADS)
Awasthi, Amit; Hothi, Navjot; Kaur, Prabhjot; Singh, Nirankar; Chakraborty, Monojit; Bansal, Sangeeta
2017-12-01
Atmospheric composition of ambient air consists of different gases in definite proportion that affect the earth's climate and its ecological system. Due to varied anthropogenic reasons, this composition is changed, which ultimately have an impact on the health of living beings. For survival, the human respiratory system is one of the sensitive systems, which is easily and closely affected by the change in atmospheric composition of an external environment. Many studies have been conducted to quantify the effects of atmospheric pollution on human health by using different approaches. This article presents different scenario of studies conducted to evaluate the effects on different human groups. Differences between the studies conducted using spirometry and survey methods are presented in this article to extract a better sequence between these two methodologies. Many studies have been conducted to measure the respiratory status by evaluating the respiratory symptoms and hospital admissions. Limited numbers of studies are found with repeated spirometry on the same subjects for long duration to nullify the error arising due to decrease in efforts by the same subjects during manoeuvre of pulmonary function tests. Present study reveals the importance of methodological sequencing in order to obtain more authentic and reliable results. This study suggests that impacts of deteriorating atmospheric composition on human health can be more significantly studied if spirometry is done after survey analysis. The article also proposes that efficiency and authenticity of surveys involving health impacts will increase, if medical data information of patients is saved in hospitals in a proper format.
Levin-Karp, Ayelet; Barenholz, Uri; Bareia, Tasneem; Dayagi, Michal; Zelcbuch, Lior; Antonovsky, Niv; Noor, Elad; Milo, Ron
2013-06-21
Translational coupling is the interdependence of translation efficiency of neighboring genes encoded within an operon. The degree of coupling may be quantified by measuring how the translation rate of a gene is modulated by the translation rate of its upstream gene. Translational coupling was observed in prokaryotic operons several decades ago, but the quantitative range of modulation translational coupling leads to and the factors governing this modulation were only partially characterized. In this study, we systematically quantify and characterize translational coupling in E. coli synthetic operons using a library of plasmids carrying fluorescent reporter genes that are controlled by a set of different ribosome binding site (RBS) sequences. The downstream gene expression level is found to be enhanced by the upstream gene expression via translational coupling with the enhancement level varying from almost no coupling to over 10-fold depending on the upstream gene's sequence. Additionally, we find that the level of translational coupling in our system is similar between the second and third locations in the operon. The coupling depends on the distance between the stop codon of the upstream gene and the start codon of the downstream gene. This study is the first to systematically and quantitatively characterize translational coupling in a synthetic E. coli operon. Our analysis will be useful in accurate manipulation of gene expression in synthetic biology and serves as a step toward understanding the mechanisms involved in translational expression modulation.
Digital gene expression analysis of the zebra finch genome
2010-01-01
Background In order to understand patterns of adaptation and molecular evolution it is important to quantify both variation in gene expression and nucleotide sequence divergence. Gene expression profiling in non-model organisms has recently been facilitated by the advent of massively parallel sequencing technology. Here we investigate tissue specific gene expression patterns in the zebra finch (Taeniopygia guttata) with special emphasis on the genes of the major histocompatibility complex (MHC). Results Almost 2 million 454-sequencing reads from cDNA of six different tissues were assembled and analysed. A total of 11,793 zebra finch transcripts were represented in this EST data, indicating a transcriptome coverage of about 65%. There was a positive correlation between the tissue specificity of gene expression and non-synonymous to synonymous nucleotide substitution ratio of genes, suggesting that genes with a specialised function are evolving at a higher rate (or with less constraint) than genes with a more general function. In line with this, there was also a negative correlation between overall expression levels and expression specificity of contigs. We found evidence for expression of 10 different genes related to the MHC. MHC genes showed relatively tissue specific expression levels and were in general primarily expressed in spleen. Several MHC genes, including MHC class I also showed expression in brain. Furthermore, for all genes with highest levels of expression in spleen there was an overrepresentation of several gene ontology terms related to immune function. Conclusions Our study highlights the usefulness of next-generation sequence data for quantifying gene expression in the genome as a whole as well as in specific candidate genes. Overall, the data show predicted patterns of gene expression profiles and molecular evolution in the zebra finch genome. Expression of MHC genes in particular, corresponds well with expression patterns in other vertebrates. PMID:20359325
Pan, Keyao; Deem, Michael W.
2011-01-01
Many viruses evolve rapidly. For example, haemagglutinin (HA) of the H3N2 influenza A virus evolves to escape antibody binding. This evolution of the H3N2 virus means that people who have previously been exposed to an influenza strain may be infected by a newly emerged virus. In this paper, we use Shannon entropy and relative entropy to measure the diversity and selection pressure by an antibody in each amino acid site of H3 HA between the 1992–1993 season and the 2009–2010 season. Shannon entropy and relative entropy are two independent state variables that we use to characterize H3N2 evolution. The entropy method estimates future H3N2 evolution and migration using currently available H3 HA sequences. First, we show that the rate of evolution increases with the virus diversity in the current season. The Shannon entropy of the sequence in the current season predicts relative entropy between sequences in the current season and those in the next season. Second, a global migration pattern of H3N2 is assembled by comparing the relative entropy flows of sequences sampled in China, Japan, the USA and Europe. We verify this entropy method by describing two aspects of historical H3N2 evolution. First, we identify 54 amino acid sites in HA that have evolved in the past to evade the immune system. Second, the entropy method shows that epitopes A and B on the top of HA evolve most vigorously to escape antibody binding. Our work provides a novel entropy-based method to predict and quantify future H3N2 evolution and to describe the evolutionary history of H3N2. PMID:21543352
Optimisation of DNA extraction from the crustacean Daphnia
Athanasio, Camila Gonçalves; Chipman, James K.; Viant, Mark R.
2016-01-01
Daphnia are key model organisms for mechanistic studies of phenotypic plasticity, adaptation and microevolution, which have led to an increasing demand for genomics resources. A key step in any genomics analysis, such as high-throughput sequencing, is the availability of sufficient and high quality DNA. Although commercial kits exist to extract genomic DNA from several species, preparation of high quality DNA from Daphnia spp. and other chitinous species can be challenging. Here, we optimise methods for tissue homogenisation, DNA extraction and quantification customised for different downstream analyses (e.g., LC-MS/MS, Hiseq, mate pair sequencing or Nanopore). We demonstrate that if Daphnia magna are homogenised as whole animals (including the carapace), absorbance-based DNA quantification methods significantly over-estimate the amount of DNA, resulting in using insufficient starting material for experiments, such as preparation of sequencing libraries. This is attributed to the high refractive index of chitin in Daphnia’s carapace at 260 nm. Therefore, unless the carapace is removed by overnight proteinase digestion, the extracted DNA should be quantified with fluorescence-based methods. However, overnight proteinase digestion will result in partial fragmentation of DNA therefore the prepared DNA is not suitable for downstream methods that require high molecular weight DNA, such as PacBio, mate pair sequencing and Nanopore. In conclusion, we found that the MasterPure DNA purification kit, coupled with grinding of frozen tissue, is the best method for extraction of high molecular weight DNA as long as the extracted DNA is quantified with fluorescence-based methods. This method generated high yield and high molecular weight DNA (3.10 ± 0.63 ng/µg dry mass, fragments >60 kb), free of organic contaminants (phenol, chloroform) and is suitable for large number of downstream analyses. PMID:27190714
Three-Dimensional Transgenic Cell Models to Quantify Space Genotoxic Effects
NASA Technical Reports Server (NTRS)
Gonda, S.; Wu, H.; Pingerelli, P.; Glickman, B.
2000-01-01
In this paper we describe a three-dimensional, multicellular tissue-equivalent model, produced in NASA-designed, rotating wall bioreactors using mammalian cells engineered for genomic containment of mUltiple copies of defined target genes for genotoxic assessment. The Rat 2(lambda) fibroblasts (Stratagene, Inc.) were genetically engineered to contain high-density target genes for mutagenesis. Stable three-dimensional, multicellular spheroids were formed when human mammary epithelial cells and Rat 2(lambda) fibroblasts were cocultured on Cytodex 3 Beads in a rotating wall bioreactor. The utility of this spheroidal model for genotoxic assessment was indicated by a linear dose response curve and by results of gene sequence analysis of mutant clones from 400micron diameter spheroids following low-dose, high-energy, neon radiation exposure
Elevation of Proenkephalin 143-183 in Cerebrospinal Fluid in Moyamoya Disease.
Yokoyama, Kinya; Maruwaka, Mikio; Yoshikawa, Kazuhiro; Araki, Yoshio; Okamoto, Sho; Sumitomo, Masaki; Kawamura, Akino; Sakamoto, Yusuke; Shimizu, Kenzo; Izumi, Takashi; Wakabayashi, Toshihiko
2018-01-01
In moyamoya disease (MMD), the causes of differences in clinical features between children and adults and of the dramatic temporal changes in moyamoya vessels are poorly understood. We previously discovered elevated levels of m/z 4588 and m/z 4473 peptides in cerebrospinal fluid (CSF) in patients with MMD. This study examined the amino acid sequences of these peptides and quantified in specimens. The m/z 4588 and m/z 4473 peptides in CSF from patients with MMD were purified and concentrated by high-performance liquid chromatography and ultrafiltration. Liquid chromatography coupled with tandem mass spectrometry analysis was performed to identify the amino acid sequences of these peptides. We quantified these peptides in samples using sandwich enzyme-linked immunosorbent assay, and concentrations in CSF were compared between MMD (n = 40, 19 male; median age, 37 years) and non-MMD intracranial disease (n = 40, 19 male; median age, 39 years) as controls. These peptides were identified as proenkephalin 143-183 (PENK 143-183). The concentration of PENK 143-183 was significantly greater in patients with MMD (median, 8,270 pmol/L) than control patients (median, 3,760 pmol/L; P < 0.001) and decreased in an age-dependent manner in MMD (r = -0.57; P < 0.001). The area under the receiver operating characteristic curve in children (age <18 years) was 0.885 (95% confidence interval 0.741-1). The correlation between proenkephalin concentration and temporal changes in moyamoya vessels was suggested. Proenkephalin 143-183 in CSF may offer a helpful diagnostic biomarker in pediatric MMD. The effect of enkephalin peptides through opioid growth factor receptor or delta opioid receptor might be associated with the pathophysiology of MMD. Copyright © 2017 Elsevier Inc. All rights reserved.
Desmarais, Samantha M.; Leitner, Thomas; Barron, Annelise E.
2012-01-01
DNA barcodes are short, unique ssDNA primers that “mark” individual biomolecules. To gain better understanding of biophysical parameters constraining primer-dimer formation between primers that incorporate barcode sequences, we have developed a capillary electrophoresis method that utilizes drag-tag-DNA conjugates to quantify dimerization risk between primer-barcode pairs. Results obtained with this unique free-solution conjugate electrophoresis (FSCE) approach are useful as quantitatively precise input data to parameterize computation models of dimerization risk. A set of fluorescently labeled, model primer-barcode conjugates were designed with complementary regions of differing lengths to quantify heterodimerization as a function of temperature. Primer-dimer cases comprised two 30-mer primers, one of which was covalently conjugated to a lab-made, chemically synthesized poly-N-methoxyethylglycine drag-tag, which reduced electrophoretic mobility of ssDNA to distinguish it from ds primer-dimers. The drag-tags also provided a shift in mobility for the dsDNA species, which allowed us to quantitate primer-dimer formation. In the experimental studies, pairs of oligonucleotide primer-barcodes with fully or partially complementary sequences were annealed, and then separated by free-solution conjugate CE at different temperatures, to assess effects on primer-dimer formation. When less than 30 out of 30 basepairs were bonded, dimerization was inversely correlated to temperature. Dimerization occurred when more than 15 consecutive basepairs formed, yet non-consecutive basepairs did not create stable dimers even when 20 out of 30 possible basepairs bonded. The use of free-solution electrophoresis in combination with a peptoid drag-tag and different fluorophores enabled precise separation of short DNA fragments to establish a new mobility shift assay for detection of primer-dimer formation. PMID:22331820
Skeletal development in the African elephant and ossification timing in placental mammals
Hautier, Lionel; Stansfield, Fiona J.; Allen, W. R. Twink; Asher, Robert J.
2012-01-01
We provide here unique data on elephant skeletal ontogeny. We focus on the sequence of cranial and post-cranial ossification events during growth in the African elephant (Loxodonta africana). Previous analyses on ossification sequences in mammals have focused on monotremes, marsupials, boreoeutherian and xenarthran placentals. Here, we add data on ossification sequences in an afrotherian. We use two different methods to quantify sequence heterochrony: the sequence method and event-paring/Parsimov. Compared with other placentals, elephants show late ossifications of the basicranium, manual and pedal phalanges, and early ossifications of the ischium and metacarpals. Moreover, ossification in elephants starts very early and progresses rapidly. Specifically, the elephant exhibits the same percentage of bones showing an ossification centre at the end of the first third of its gestation period as the mouse and hamster have close to birth. Elephants show a number of features of their ossification patterns that differ from those of other placental mammals. The pattern of the initiation of the ossification evident in the African elephant underscores a possible correlation between the timing of ossification onset and gestation time throughout mammals. PMID:22298853
SAMSA2: a standalone metatranscriptome analysis pipeline.
Westreich, Samuel T; Treiber, Michelle L; Mills, David A; Korf, Ian; Lemay, Danielle G
2018-05-21
Complex microbial communities are an area of growing interest in biology. Metatranscriptomics allows researchers to quantify microbial gene expression in an environmental sample via high-throughput sequencing. Metatranscriptomic experiments are computationally intensive because the experiments generate a large volume of sequence data and each sequence must be compared with reference sequences from thousands of organisms. SAMSA2 is an upgrade to the original Simple Annotation of Metatranscriptomes by Sequence Analysis (SAMSA) pipeline that has been redesigned for standalone use on a supercomputing cluster. SAMSA2 is faster due to the use of the DIAMOND aligner, and more flexible and reproducible because it uses local databases. SAMSA2 is available with detailed documentation, and example input and output files along with examples of master scripts for full pipeline execution. SAMSA2 is a rapid and efficient metatranscriptome pipeline for analyzing large RNA-seq datasets in a supercomputing cluster environment. SAMSA2 provides simplified output that can be examined directly or used for further analyses, and its reference databases may be upgraded, altered or customized to fit the needs of any experiment.
Quantifying the Level of Inquiry in a Reformed Introductory Geology Lab Course
ERIC Educational Resources Information Center
Moss, Elizabeth; Cervato, Cinzia
2016-01-01
As part of a campus-wide effort to transform introductory science courses to be more engaging and more accurately convey the excitement of discovery in science, the curriculum of an introductory physical geology lab course was redesigned. What had been a series of ''cookbook'' lab activities was transformed into a sequence of activities based on…
USDA-ARS?s Scientific Manuscript database
We studied the mechanisms of azole resistance in the grapevine powdery mildew fungus, Erysiphe necator, by quantifying the sensitivity to myclobutanil (EC50) in 65 isolates from the eastern U.S. and 12 from Chile. From each isolate, we sequenced the gene for sterol 14a-demethylase (CYP51), and measu...
The effects of mutational processes and selection on driver mutations across cancer types.
Temko, Daniel; Tomlinson, Ian P M; Severini, Simone; Schuster-Böckler, Benjamin; Graham, Trevor A
2018-05-10
Epidemiological evidence has long associated environmental mutagens with increased cancer risk. However, links between specific mutation-causing processes and the acquisition of individual driver mutations have remained obscure. Here we have used public cancer sequencing data from 11,336 cancers of various types to infer the independent effects of mutation and selection on the set of driver mutations in a cancer type. First, we detect associations between a range of mutational processes, including those linked to smoking, ageing, APOBEC and DNA mismatch repair (MMR) and the presence of key driver mutations across cancer types. Second, we quantify differential selection between well-known alternative driver mutations, including differences in selection between distinct mutant residues in the same gene. These results show that while mutational processes have a large role in determining which driver mutations are present in a cancer, the role of selection frequently dominates.
BioShuttle-mediated Plasmid Transfer
Braun, Klaus; von Brasch, Leonie; Pipkorn, Ruediger; Ehemann, Volker; Jenne, Juergen; Spring, Herbert; Debus, Juergen; Didinger, Bernd; Rittgen, Werner; Waldeck, Waldemar
2007-01-01
An efficient gene transfer into target tissues and cells is needed for safe and effective treatment of genetic diseases like cancer. In this paper, we describe the development of a transport system and show its ability for transporting plasmids. This non-viral peptide-based BioShuttle-mediated transfer system consists of a nuclear localization address sequence realizing the delivery of the plasmid phNIS-IRES-EGFP coding for two independent reporter genes into nuclei of HeLa cells. The quantification of the transfer efficiency was achieved by measurements of the sodium iodide symporter activity. EGFP gene expression was measured with Confocal Laser Scanning Microscopy and quantified with biostatistical methods by analysis of the frequency of the amplitude distribution in the CLSM images. The results demonstrate that the “BioShuttle”-Technology is an appropriate tool for an effective transfer of genetic material carried by a plasmid. PMID:18026568
The genetic landscape of a physical interaction
Diss, Guillaume
2018-01-01
A key question in human genetics and evolutionary biology is how mutations in different genes combine to alter phenotypes. Efforts to systematically map genetic interactions have mostly made use of gene deletions. However, most genetic variation consists of point mutations of diverse and difficult to predict effects. Here, by developing a new sequencing-based protein interaction assay – deepPCA – we quantified the effects of >120,000 pairs of point mutations on the formation of the AP-1 transcription factor complex between the products of the FOS and JUN proto-oncogenes. Genetic interactions are abundant both in cis (within one protein) and trans (between the two molecules) and consist of two classes – interactions driven by thermodynamics that can be predicted using a three-parameter global model, and structural interactions between proximally located residues. These results reveal how physical interactions generate quantitatively predictable genetic interactions. PMID:29638215
Male germplasm in relation to environmental conditions: synoptic focus on DNA
Jenkins, Jill A.; Tiersch, Terrence R.; Green, Christopher C.
2011-01-01
Wild animals are generally more sensitive than humans to environmental stressors, thus they act as sentinels for resource degradation. Sublethal stress is generally manifested first at the sub-organismal level, where immune systems are compromised, reproductive success is reduced, and genetic integrity is altered. Biomarkers - variables quantifiably responsive to changes in the environment - provide useful information to resource managers and regulatory agencies. Biomarkers of sperm quality are proving useful in this capacity, as well as in artificial breeding. Cellular and molecular bioassays can help to determine mechanisms of action of deleterious agents, predict fertility and reproductive potential, and model population-wide and community level effects. A sequence of biomarker assays can be tailored to fit species of concern, to study physiological effects responsive to known contamination events, and can be selectively applied to fresh, thawed, and fixed samples, as well as those shipped to the laboratory from field sites.
Swallow Event Sequencing: Comparing Healthy Older and Younger Adults.
Herzberg, Erica G; Lazarus, Cathy L; Steele, Catriona M; Molfenter, Sonja M
2018-04-23
Previous research has established that a great deal of variation exists in the temporal sequence of swallowing events for healthy adults. Yet, the impact of aging on swallow event sequence is not well understood. Kendall et al. (Dysphagia 18(2):85-91, 2003) suggested there are 4 obligatory paired-event sequences in swallowing. We directly compared adherence to these sequences, as well as event latencies, and quantified the percentage of unique sequences in two samples of healthy adults: young (< 45) and old (> 65). The 8 swallowing events that contribute to the sequences were reliably identified from videofluoroscopy in a sample of 23 healthy seniors (10 male, mean age 74.7) and 20 healthy young adults (10 male, mean age 31.5) with no evidence of penetration-aspiration or post-swallow residue. Chi-square analyses compared the proportions of obligatory pairs and unique sequences by age group. Compared to the older subjects, younger subjects had significantly lower adherence to two obligatory sequences: Upper Esophageal Sphincter (UES) opening occurs before (or simultaneous with) the bolus arriving at the UES and UES maximum distention occurs before maximum pharyngeal constriction. The associated latencies were significantly different between age groups as well. Further, significantly fewer unique swallow sequences were observed in the older group (61%) compared with the young (82%) (χ 2 = 31.8; p < 0.001). Our findings suggest that paired swallow event sequences may not be robust across the age continuum and that variation in swallow sequences appears to decrease with aging. These findings provide normative references for comparisons to older individuals with dysphagia.
McMahon, B H; Manore, C A; Hyman, J M; LaBute, M X; Fair, J M
2014-01-01
We present and characterize a multi-host epidemic model of Rift Valley fever (RVF) virus in East Africa with geographic spread on a network, rule-based mitigation measures, and mosquito infection and population dynamics. Susceptible populations are depleted by disease and vaccination and are replenished with the birth of new animals. We observe that the severity of the epidemics is strongly correlated with the duration of the rainy season and that even severe epidemics are abruptly terminated when the rain stops. Because naturally acquired herd immunity is established, total mortality across 25 years is relatively insensitive to many mitigation approaches. Strong reductions in cattle mortality are expected, however, with sufficient reduction in population densities of either vectors or susceptible (ie. unvaccinated) hosts. A better understanding of RVF epidemiology would result from serology surveys to quantify the importance of herd immunity in epidemic control, and sequencing of virus from representative animals to quantify the realative importance of transportation and local reservoirs in nucleating yearly epidemics. Our results suggest that an effective multi-layered mitigation strategy would include vector control, movement control, and vaccination of young animals yearly, even in the absence of expected rainfall.
Enzyme and microbial sensors for environmental monitoring
NASA Astrophysics Data System (ADS)
Wollenberger, U.; Neumann, B.; Scheller, Frieder W.
1993-03-01
Biosensors employing the biocatalyst on a different level of integration have been developed for monitoring environmental pollution. These probes range from laboratory specimen to commercial detectors applied to analyzers. This paper presents a selection of recent developments on amperometric enzyme and microbial biosensors. A monoenzymatic bulk type carbon electrode is described for biosensing organic hydroperoxides in aqueous solutions. Here, peroxidase is immobilized within the electrode body and the direct electron transfer between electrode and enzyme is measured. Both, reversible and irreversible inhibitors of acetylcholinesterase have been quantified by using a kinetically controlled acetylcholine enzyme sequence electrode. The inhibitory effect of pesticides such as butoxycarboxime, dimethoate, and trichlorfon could be quantified within 6 min in micrometers olar concentrations. Different multi-enzyme electrodes have been developed for the determination of inorganic phosphate. These sensors represent examples of sequentially acting enzymes in combination with enzymatic analyte recycling. Using this type of amplification nanomolar concentrations could be measured. A very fast responding microbial sensor for biological oxygen demand has been developed by immobilizing Trichosporon cutaneum onto an oxygen electrode. With this whole cell sensor waste water can be assayed with a sample frequency of 20 per hour and a working stability of more than 30 days.
Deciphering the distance to antibiotic resistance for the pneumococcus using genome sequencing data
Mobegi, Fredrick M.; Cremers, Amelieke J. H.; de Jonge, Marien I.; Bentley, Stephen D.; van Hijum, Sacha A. F. T.; Zomer, Aldert
2017-01-01
Advances in genome sequencing technologies and genome-wide association studies (GWAS) have provided unprecedented insights into the molecular basis of microbial phenotypes and enabled the identification of the underlying genetic variants in real populations. However, utilization of genome sequencing in clinical phenotyping of bacteria is challenging due to the lack of reliable and accurate approaches. Here, we report a method for predicting microbial resistance patterns using genome sequencing data. We analyzed whole genome sequences of 1,680 Streptococcus pneumoniae isolates from four independent populations using GWAS and identified probable hotspots of genetic variation which correlate with phenotypes of resistance to essential classes of antibiotics. With the premise that accumulation of putative resistance-conferring SNPs, potentially in combination with specific resistance genes, precedes full resistance, we retrogressively surveyed the hotspot loci and quantified the number of SNPs and/or genes, which if accumulated would confer full resistance to an otherwise susceptible strain. We name this approach the ‘distance to resistance’. It can be used to identify the creep towards complete antibiotics resistance in bacteria using genome sequencing. This approach serves as a basis for the development of future sequencing-based methods for predicting resistance profiles of bacterial strains in hospital microbiology and public health settings. PMID:28205635
Lucas Lledó, José Ignacio; Cáceres, Mario
2013-01-01
One of the most used techniques to study structural variation at a genome level is paired-end mapping (PEM). PEM has the advantage of being able to detect balanced events, such as inversions and translocations. However, inversions are still quite difficult to predict reliably, especially from high-throughput sequencing data. We simulated realistic PEM experiments with different combinations of read and library fragment lengths, including sequencing errors and meaningful base-qualities, to quantify and track down the origin of false positives and negatives along sequencing, mapping, and downstream analysis. We show that PEM is very appropriate to detect a wide range of inversions, even with low coverage data. However, % of inversions located between segmental duplications are expected to go undetected by the most common sequencing strategies. In general, longer DNA libraries improve the detectability of inversions far better than increments of the coverage depth or the read length. Finally, we review the performance of three algorithms to detect inversions —SVDetect, GRIAL, and VariationHunter—, identify common pitfalls, and reveal important differences in their breakpoint precisions. These results stress the importance of the sequencing strategy for the detection of structural variants, especially inversions, and offer guidelines for the design of future genome sequencing projects. PMID:23637806
Vergani, Stefano; Korsunsky, Ilya; Mazzarello, Andrea Nicola; Ferrer, Gerardo; Chiorazzi, Nicholas; Bagnara, Davide
2017-01-01
Efficient and accurate high-throughput DNA sequencing of the adaptive immune receptor repertoire (AIRR) is necessary to study immune diversity in healthy subjects and disease-related conditions. The high complexity and diversity of the AIRR coupled with the limited amount of starting material, which can compromise identification of the full biological diversity makes such sequencing particularly challenging. AIRR sequencing protocols often fail to fully capture the sampled AIRR diversity, especially for samples containing restricted numbers of B lymphocytes. Here, we describe a library preparation method for immunoglobulin sequencing that results in an exhaustive full-length repertoire where virtually every sampled B-cell is sequenced. This maximizes the likelihood of identifying and quantifying the entire IGHV-D-J repertoire of a sample, including the detection of rearrangements present in only one cell in the starting population. The methodology establishes the importance of circumventing genetic material dilution in the preamplification phases and incorporates the use of certain described concepts: (1) balancing the starting material amount and depth of sequencing, (2) avoiding IGHV gene-specific amplification, and (3) using Unique Molecular Identifier. Together, this methodology is highly efficient, in particular for detecting rare rearrangements in the sampled population and when only a limited amount of starting material is available.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Moseley, D; Dave, M
Purpose: Use Varian TrueBeam Developer mode to quantify the mechanical limits of the couch and to simulate 4D respiratory motion. Methods: An in-house MATLAB based GUI was created to make the BEAM XML files. The couch was moved in a triangular wave in the S/I direction with varying amplitudes (1mm, 5mm, 10mm, and 50mm) and periods (3s, 6s, and 9s). The periods were determined by specifying the speed. The theoretical positions were compared to the values recorded by the machine at 50 Hz. HD videos were taken for certain tests as external validation. 4D Respiratory motion was simulated by anmore » A/P MV beam being delivered while the couch moved in an elliptical manner. The ellipse had a major axis of 2 cm (S/I) and a minor axis of 1 cm (A/P). Results: The path planned by the TrueBeam deviated from the theoretical triangular form as the speed increased. Deviations were noticed starting at a speed of 3.33 cm/s (50mm amplitude, 6s period). The greatest deviation occurred in the 50mm- 3s sequence with a correlation value of −0.13 and a 27% time increase; the plan essentially became out of phase. Excluding these two, the plans had correlation values of 0.99. The elliptical sequence effectively simulated a respiratory pattern with a period of 6s. The period could be controlled by changing the speeds or the dose rate. Conclusion: The work first shows the quantification of the mechanical limits of the couch and the speeds at which the proposed plans begin to deviate. These limits must be kept in mind when programming other couch sequences. The methodology can be used to quantify the limits of other axes. Furthermore, the work shows the possibility of creating 4D respiratory simulations without using specialized phantoms or motion-platforms. This can be further developed to program patient-specific breathing patterns.« less
Martinuzzi, Claudia; Pastorino, Lorenza; Andreotti, Virginia; Garuti, Anna; Minuto, Michele; Fiocca, Roberto; Bianchi-Scarrà, Giovanna; Ghiorzo, Paola; Grillo, Federica; Mastracci, Luca
2016-09-01
The optimal method for BRAF mutation detection remains to be determined despite advances in molecular detection techniques. The aim of this study was to compare, against classical Sanger sequencing, the diagnostic performance of two of the most recently developed, highly sensitive methods: BRAF V600E immunohistochemistry (IHC) and peptide nucleic-acid (PNA)-clamp qPCR. BRAF exon 15 mutations were searched in formalin-fixed paraffin-embedded tissues from 86 papillary thyroid carcinoma using the three methods. The limits of detection of Sanger sequencing in borderline or discordant cases were quantified by next generation sequencing. BRAF mutations were found in 74.4 % of cases by PNA, in 71 % of cases by IHC, and in 64 % of cases by Sanger sequencing. Complete concordance for the three methods was observed in 80 % of samples. Better concordance was observed with the combination of two methods, particularly PNA and IHC (59/64) (92 %), while the combination of PNA and Sanger was concordant in 55 cases (86 %). Sensitivity of the three methods was 99 % for PNA, 94.2 % for IHC, and 89.5 % for Sanger. Our data show that IHC could be used as a cost-effective, first-line method for BRAF V600E detection in daily practice, followed by PNA analysis in negative or uninterpretable cases, as the most efficient method. PNA-clamp quantitative PCR is highly sensitive and complementary to IHC as it also recognizes other mutations besides V600E and it is suitable for diagnostic purposes.
Salimnia, H; Fairfax, M R; Chandrasekar, P H
2014-12-01
Cytomegalovirus (CMV) causes significant morbidity and mortality in solid organ and bone marrow transplant recipients. DNA vaccines can provide both humoral and cellular immunity without exposing immune-compromised persons to replication-competent CMV. We studied the kinetics of CMV vaccine DNA in plasma. The samples were obtained from vaccine recipients who were enrolled in a double-blinded, placebo-controlled clinical trial of an intramuscular, plasmid-based, bivalent DNA vaccine for CMV in stem cell transplant recipients. Residual specimens on patients enrolled in the vaccine trial were saved until the trial was unblinded and published. Quantitative real-time polymerase chain reaction (PCR) was used to detect and quantify CMV glycoprotein B (gB) DNA in plasma from 4 recipients of the vaccine. The melting temperature of the vaccine gB amplicon was 62.4°C, compared to 68.8°C, which is seen with the wild-type virus. Sequence analysis revealed that there were 3 mismatches between the fluorescent resonance energy transfer probe and the vaccine DNA sequence. Because preemptive treatment of CMV disease in stem cell transplant patients is based on quantitative PCR analysis of viral sequences in plasma, it is important that vaccine sequences not be confused with those in wild-type virus. Confusion could lead to treatment with toxic medications, potentially compromising the transplant. Effects of PCR target choice and amplicon detection techniques on patient management and vaccine trials are discussed. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Lukes, Julius; Paris, Zdenek; Regmi, Sandesh; Breitling, Reinhard; Mureev, Sergey; Kushnir, Susanna; Pyatkov, Konstantin; Jirků, Milan; Alexandrov, Kirill A
2006-08-01
To investigate the influence of sequence context of translation initiation codon on translation efficiency in Kinetoplastida, we constructed a library of expression plasmids randomized in the three nucleotides prefacing ATG of a reporter gene encoding enhanced green fluorescent protein (EGFP). All 64 possible combinations of pre-ATG triplets were individually stably integrated into the rDNA locus of Leishmania tarentolae and the resulting cell lines were assessed for EGFP expression. The expression levels were quantified directly by measuring the fluorescence of EGFP protein in living cells and confirmed by Western blotting. We observed a strong influence of the pre-ATG triplet on the level of protein expression over a 20-fold range. To understand the degree of evolutionary conservation of the observed effect, we transformed Phytomonas serpens, a trypanosomatid parasite of plants, with a subset of the constructs. The pattern of translational efficiency mediated by individual pre-ATG triplets in this species was similar to that observed in L. tarentolae. However, the pattern of translational efficiency of two other proteins (red fluorescent protein and tetracycline repressor) containing selected pre-ATG triplets did not correlate with either EGFP or each other. Thus, we conclude that a conserved mechanism of translation initiation site selection exists in kinetoplastids that is strongly influenced not only by the pre-ATG sequences but also by the coding region of the gene.
How Many Protein Sequences Fold to a Given Structure? A Coevolutionary Analysis.
Tian, Pengfei; Best, Robert B
2017-10-17
Quantifying the relationship between protein sequence and structure is key to understanding the protein universe. A fundamental measure of this relationship is the total number of amino acid sequences that can fold to a target protein structure, known as the "sequence capacity," which has been suggested as a proxy for how designable a given protein fold is. Although sequence capacity has been extensively studied using lattice models and theory, numerical estimates for real protein structures are currently lacking. In this work, we have quantitatively estimated the sequence capacity of 10 proteins with a variety of different structures using a statistical model based on residue-residue co-evolution to capture the variation of sequences from the same protein family. Remarkably, we find that even for the smallest protein folds, such as the WW domain, the number of foldable sequences is extremely large, exceeding the Avogadro constant. In agreement with earlier theoretical work, the calculated sequence capacity is positively correlated with the size of the protein, or better, the density of contacts. This allows the absolute sequence capacity of a given protein to be approximately predicted from its structure. On the other hand, the relative sequence capacity, i.e., normalized by the total number of possible sequences, is an extremely tiny number and is strongly anti-correlated with the protein length. Thus, although there may be more foldable sequences for larger proteins, it will be much harder to find them. Lastly, we have correlated the evolutionary age of proteins in the CATH database with their sequence capacity as predicted by our model. The results suggest a trade-off between the opposing requirements of high designability and the likelihood of a novel fold emerging by chance. Published by Elsevier Inc.
Scaling in nature: From DNA through heartbeats to weather
NASA Astrophysics Data System (ADS)
Havlin, S.; Buldyrev, S. V.; Bunde, A.; Goldberger, A. L.; Ivanov, P. Ch.; Peng, C.-K.; Stanley, H. E.
1999-12-01
The purpose of this talk is to describe some recent progress in applying scaling concepts to various systems in nature. We review several systems characterized by scaling laws such as DNA sequences, heartbeat rates and weather variations. We discuss the finding that the exponent α quantifying the scaling in DNA in smaller for coding than for noncoding sequences. We also discuss the application of fractal scaling analysis to the dynamics of heartbeat regulation, and report the recent finding that the scaling exponent α is smaller during sleep periods compared to wake periods. We also discuss the recent findings that suggest a universal scaling exponent characterizing the weather fluctuations.
Fractals in biology and medicine
NASA Technical Reports Server (NTRS)
Havlin, S.; Buldyrev, S. V.; Goldberger, A. L.; Mantegna, R. N.; Ossadnik, S. M.; Peng, C. K.; Simons, M.; Stanley, H. E.
1995-01-01
Our purpose is to describe some recent progress in applying fractal concepts to systems of relevance to biology and medicine. We review several biological systems characterized by fractal geometry, with a particular focus on the long-range power-law correlations found recently in DNA sequences containing noncoding material. Furthermore, we discuss the finding that the exponent alpha quantifying these long-range correlations ("fractal complexity") is smaller for coding than for noncoding sequences. We also discuss the application of fractal scaling analysis to the dynamics of heartbeat regulation, and report the recent finding that the normal heart is characterized by long-range "anticorrelations" which are absent in the diseased heart.
Scaling in nature: from DNA through heartbeats to weather
NASA Technical Reports Server (NTRS)
Havlin, S.; Buldyrev, S. V.; Bunde, A.; Goldberger, A. L.; Peng, C. K.; Stanley, H. E.
1999-01-01
The purpose of this report is to describe some recent progress in applying scaling concepts to various systems in nature. We review several systems characterized by scaling laws such as DNA sequences, heartbeat rates and weather variations. We discuss the finding that the exponent alpha quantifying the scaling in DNA in smaller for coding than for noncoding sequences. We also discuss the application of fractal scaling analysis to the dynamics of heartbeat regulation, and report the recent finding that the scaling exponent alpha is smaller during sleep periods compared to wake periods. We also discuss the recent findings that suggest a universal scaling exponent characterizing the weather fluctuations.
Complex multifractal nature in Mycobacterium tuberculosis genome
Mandal, Saurav; Roychowdhury, Tanmoy; Chirom, Keilash; Bhattacharya, Alok; Brojen Singh, R. K.
2017-01-01
The mutifractal and long range correlation (C(r)) properties of strings, such as nucleotide sequence can be a useful parameter for identification of underlying patterns and variations. In this study C(r) and multifractal singularity function f(α) have been used to study variations in the genomes of a pathogenic bacteria Mycobacterium tuberculosis. Genomic sequences of M. tuberculosis isolates displayed significant variations in C(r) and f(α) reflecting inherent differences in sequences among isolates. M. tuberculosis isolates can be categorised into different subgroups based on sensitivity to drugs, these are DS (drug sensitive isolates), MDR (multi-drug resistant isolates) and XDR (extremely drug resistant isolates). C(r) follows significantly different scaling rules in different subgroups of isolates, but all the isolates follow one parameter scaling law. The richness in complexity of each subgroup can be quantified by the measures of multifractal parameters displaying a pattern in which XDR isolates have highest value and lowest for drug sensitive isolates. Therefore C(r) and multifractal functions can be useful parameters for analysis of genomic sequences. PMID:28440326
Ionita-Laza, Iuliana; Ottman, Ruth
2011-11-01
The recent progress in sequencing technologies makes possible large-scale medical sequencing efforts to assess the importance of rare variants in complex diseases. The results of such efforts depend heavily on the use of efficient study designs and analytical methods. We introduce here a unified framework for association testing of rare variants in family-based designs or designs based on unselected affected individuals. This framework allows us to quantify the enrichment in rare disease variants in families containing multiple affected individuals and to investigate the optimal design of studies aiming to identify rare disease variants in complex traits. We show that for many complex diseases with small values for the overall sibling recurrence risk ratio, such as Alzheimer's disease and most cancers, sequencing affected individuals with a positive family history of the disease can be extremely advantageous for identifying rare disease variants. In contrast, for complex diseases with large values of the sibling recurrence risk ratio, sequencing unselected affected individuals may be preferable.
International Standards for Genomes, Transcriptomes, and Metagenomes
Mason, Christopher E.; Afshinnekoo, Ebrahim; Tighe, Scott; Wu, Shixiu; Levy, Shawn
2017-01-01
Challenges and biases in preparing, characterizing, and sequencing DNA and RNA can have significant impacts on research in genomics across all kingdoms of life, including experiments in single-cells, RNA profiling, and metagenomics (across multiple genomes). Technical artifacts and contamination can arise at each point of sample manipulation, extraction, sequencing, and analysis. Thus, the measurement and benchmarking of these potential sources of error are of paramount importance as next-generation sequencing (NGS) projects become more global and ubiquitous. Fortunately, a variety of methods, standards, and technologies have recently emerged that improve measurements in genomics and sequencing, from the initial input material to the computational pipelines that process and annotate the data. Here we review current standards and their applications in genomics, including whole genomes, transcriptomes, mixed genomic samples (metagenomes), and the modified bases within each (epigenomes and epitranscriptomes). These standards, tools, and metrics are critical for quantifying the accuracy of NGS methods, which will be essential for robust approaches in clinical genomics and precision medicine. PMID:28337071
NASA Astrophysics Data System (ADS)
Yasuda, Jun; Yoshizawa, Shin; Umemura, Shin-ichiro
2016-07-01
Sonodynamic treatment is a method of treating cancer using reactive oxygen species (ROS) generated by cavitation bubbles in collaboration with a sonosensitizer at a target tissue. In this treatment method, both localized ROS generation and ROS generation with high efficiency are important. In this study, a triggered high-intensity focused ultrasound (HIFU) sequence, which consists of a short, extremely high intensity pulse immediately followed by a long, moderate-intensity burst, was employed for the efficient generation of ROS. In experiments, a solution sealed in a chamber was exposed to a triggered HIFU sequence. Then, the distribution of generated ROS was observed by the luminol reaction, and the amount of generated ROS was quantified using KI method. As a result, the localized ROS generation was demonstrated by light emission from the luminol reaction. Moreover, it was demonstrated that the triggered HIFU sequence has higher efficiency of ROS generation by both the KI method and the luminol reaction emission.
Complex multifractal nature in Mycobacterium tuberculosis genome
NASA Astrophysics Data System (ADS)
Mandal, Saurav; Roychowdhury, Tanmoy; Chirom, Keilash; Bhattacharya, Alok; Brojen Singh, R. K.
2017-04-01
The mutifractal and long range correlation (C(r)) properties of strings, such as nucleotide sequence can be a useful parameter for identification of underlying patterns and variations. In this study C(r) and multifractal singularity function f(α) have been used to study variations in the genomes of a pathogenic bacteria Mycobacterium tuberculosis. Genomic sequences of M. tuberculosis isolates displayed significant variations in C(r) and f(α) reflecting inherent differences in sequences among isolates. M. tuberculosis isolates can be categorised into different subgroups based on sensitivity to drugs, these are DS (drug sensitive isolates), MDR (multi-drug resistant isolates) and XDR (extremely drug resistant isolates). C(r) follows significantly different scaling rules in different subgroups of isolates, but all the isolates follow one parameter scaling law. The richness in complexity of each subgroup can be quantified by the measures of multifractal parameters displaying a pattern in which XDR isolates have highest value and lowest for drug sensitive isolates. Therefore C(r) and multifractal functions can be useful parameters for analysis of genomic sequences.
Espin-Garcia, Osvaldo; Craiu, Radu V; Bull, Shelley B
2018-02-01
We evaluate two-phase designs to follow-up findings from genome-wide association study (GWAS) when the cost of regional sequencing in the entire cohort is prohibitive. We develop novel expectation-maximization-based inference under a semiparametric maximum likelihood formulation tailored for post-GWAS inference. A GWAS-SNP (where SNP is single nucleotide polymorphism) serves as a surrogate covariate in inferring association between a sequence variant and a normally distributed quantitative trait (QT). We assess test validity and quantify efficiency and power of joint QT-SNP-dependent sampling and analysis under alternative sample allocations by simulations. Joint allocation balanced on SNP genotype and extreme-QT strata yields significant power improvements compared to marginal QT- or SNP-based allocations. We illustrate the proposed method and evaluate the sensitivity of sample allocation to sampling variation using data from a sequencing study of systolic blood pressure. © 2017 The Authors. Genetic Epidemiology Published by Wiley Periodicals, Inc.
Palmisano, Aldo N.; Winton, James R.; Dickhoff, Walton W.
1999-01-01
We cloned and sequenced a chinook salmon Hsp90 cDNA; sequence analysis shows it to be Hsp90??. Phylogenetic analysis supports the hypothesis that ?? and ?? paralogs of Hsp90 arose as a result of a gene duplication event and that they diverged early in the evolution of vertebrates, before tetrapods separated from the teleost lineage. Among several differences distinguishing poikilothermic Hsp90?? sequences from their bird and mammal orthologs, the teleost versions specifically lack a characteristic QTQDQP phosphorylation site near the N-terminus. We used the cDNA to develop an RNA (Northern) blot to quantify cellular Hsp90 mRNA levels. Chinook salmon embryonic (CHSE-214) cells responded to heat shock with a rapid rise in Hsp90 mRNA through 4 h, followed by a gradual decline over the next 20 h. Hsp90 mRNA level may be useful as a stress indicator, especially in a laboratory setting or in response to acute heat stress.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Glodzik, Dominik; Morganella, Sandro; Davies, Helen
Somatic rearrangements contribute to the mutagenized landscape of cancer genomes. Here, we systematically interrogated rearrangements in 560 breast cancers by using a piecewise constant fitting approach. We identified 33 hotspots of large (>100 kb) tandem duplications, a mutational signature associated with homologous-recombination-repair deficiency. Notably, these tandem-duplication hotspots were enriched in breast cancer germline susceptibility loci (odds ratio (OR) = 4.28) and breast-specific 'super-enhancer' regulatory elements (OR = 3.54). These hotspots may be sites of selective susceptibility to double-strand-break damage due to high transcriptional activity or, through incrementally increasing copy number, may be sites of secondary selective pressure. Furthermore, the transcriptomicmore » consequences ranged from strong individual oncogene effects to weak but quantifiable multigene expression effects. We thus present a somatic-rearrangement mutational process affecting coding sequences and noncoding regulatory elements and contributing a continuum of driver consequences, from modest to strong effects, thereby supporting a polygenic model of cancer development.« less
Glodzik, Dominik; Morganella, Sandro; Davies, Helen; ...
2017-01-23
Somatic rearrangements contribute to the mutagenized landscape of cancer genomes. Here, we systematically interrogated rearrangements in 560 breast cancers by using a piecewise constant fitting approach. We identified 33 hotspots of large (>100 kb) tandem duplications, a mutational signature associated with homologous-recombination-repair deficiency. Notably, these tandem-duplication hotspots were enriched in breast cancer germline susceptibility loci (odds ratio (OR) = 4.28) and breast-specific 'super-enhancer' regulatory elements (OR = 3.54). These hotspots may be sites of selective susceptibility to double-strand-break damage due to high transcriptional activity or, through incrementally increasing copy number, may be sites of secondary selective pressure. Furthermore, the transcriptomicmore » consequences ranged from strong individual oncogene effects to weak but quantifiable multigene expression effects. We thus present a somatic-rearrangement mutational process affecting coding sequences and noncoding regulatory elements and contributing a continuum of driver consequences, from modest to strong effects, thereby supporting a polygenic model of cancer development.« less
NASA Astrophysics Data System (ADS)
Lock, S. S. M.; Lau, K. K.; Lock Sow Mei, Irene; Shariff, A. M.; Yeong, Y. F.; Bustam, A. M.
2017-08-01
A sequence of molecular modelling procedure has been proposed to simulate experimentally validated membrane structure characterizing the effect of CO2 plasticization, whereby it can be subsequently employed to elucidate the depression in glass transition temperature (Tg ). Based on the above motivation, unswollen and swollen Polysulfone membrane structures with different CO2 loadings have been constructed, whereby the accuracy has been validated through good compliance with experimentally measured physical properties. It is found that the presence of CO2 constitutes to enhancement in polymeric chain relaxation, which consequently promotes the enlargement of molecular spacing and causes dilation in the membrane matrix. A series of glass transition temperature treatment has been conducted on the verified molecular structure to elucidate the effect of CO2 loadings to the depression in Tg induced by plasticization. Subsequently, a modified Michealis-Menten (M-M) function has been implemented to quantify the effect of CO2 loading attributed to plasticization towards Tg .
Shah, Dheeraj; Singh, Meenakshi; Gupta, Piyush; Faridi, M M A
2014-03-01
The aim of the present study was to evaluate whether the order of complementary feeding in relation to breast-feeding affects breast milk, semisolid, or total energy intake in infants. The present study was designed as a randomized crossover trial. The study was conducted in a tertiary care hospital. The study participants were 25 healthy infants between the ages of 7 and 11 months who were exclusively breast-fed for at least 6 months and were now receiving complementary foods for at least 1 month in addition to breast-feeding. Infants were randomized to follow a sequence of either complementary feeding before breast-feeding (sequence A) or complementary feeding after breast-feeding (sequence B) for the first day (24 hours) of the study period using simple randomization. For the next day, the sequence was reversed for each child. All babies received 3 actively fed complementary food meals per day (morning, afternoon, and evening). A semisolid study diet was prepared in the hospital by cooking rice and pulse with oil using a standard method, ensuring the energy density of at least 0.6 kcal/g. The infants were allowed ad libitum breast-feeding during the observation period. Semisolid intake was directly measured and breast milk intake was quantified by test weighing method. Energy intake from complementary foods was calculated from the product of energy density of the diet served on that day and the total amount consumed. The total energy intake and energy intake from breast milk and complementary foods between the 2 sequences were compared. The mean (standard deviation) energy intake from breast milk during 12 hours of daytime by following sequence A (complementary feeding before breast-feeding) was 132.0 (67.4) kcal in comparison with 135.9 (56.2) kcal in sequence B, which was not statistically different (P = 0.83). The mean (standard deviation) energy consumed from semisolids in sequences A and B was also comparable (88.6 [75.5] kcal vs. 85.5 [89.7] kcal; P = 0.58). The total energy intake during daytime in sequence A was 220.6 (96.2) kcal in comparison with 221.5 (94.0) kcal in sequence B, which was also comparable (P = 0.97). The results related to energy intake through breast milk and total energy intake were not different when insensible losses during feeding were adjusted in both groups. Altering the sequence of complementary feeding in relation to breast-feeding does not affect total energy intake.
Lossius, Andreas; Johansen, Jorunn N; Vartdal, Frode; Robins, Harlan; Jūratė Šaltytė, Benth; Holmøy, Trygve; Olweus, Johanna
2014-11-01
Epstein-Barr virus (EBV) has long been suggested as a pathogen in multiple sclerosis (MS). Here, we used high-throughput sequencing to determine the diversity, compartmentalization, persistence, and EBV-reactivity of the T-cell receptor (TCR) repertoires in MS. TCR-β genes were sequenced in paired samples of cerebrospinal fluid (CSF) and blood from patients with MS and controls with other inflammatory neurological diseases. The TCR repertoires were highly diverse in both compartments and patient groups. Expanded T-cell clones, represented by TCR-β sequences >0.1%, were of different identity in CSF and blood of MS patients, and persisted for more than a year. Reference TCR-β libraries generated from peripheral blood T cells reactive against autologous EBV-transformed B cells were highly enriched for public EBV-specific sequences and were used to quantify EBV-reactive TCR-β sequences in CSF. TCR-β sequences of EBV-reactive CD8+ T cells, including several public EBV-specific sequences, were intrathecally enriched in MS patients only, whereas those of EBV-reactive CD4+ T cells were also enriched in CSF of controls. These data provide evidence for a clonally diverse, yet compartmentalized and persistent, intrathecal T-cell response in MS. The presented strategy links TCR sequence to intrathecal T-cell specificity, demonstrating enrichment of EBV-reactive CD8+ T cells in MS. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Herlofsen, Sarah R; Bryne, Jan Christian; Høiby, Torill; Wang, Li; Issner, Robbyn; Zhang, Xiaolan; Coyne, Michael J; Boyle, Patrick; Gu, Hongcang; Meza-Zepeda, Leonardo A; Collas, Philippe; Mikkelsen, Tarjei S; Brinchmann, Jan E
2013-02-15
For safe clinical application of engineered cartilage made from mesenchymal stem cells (MSCs), molecular mechanisms for chondrogenic differentiation must be known in detail. Changes in gene expression and extracellular matrix synthesis have been extensively studied, but the epigenomic modifications underlying these changes have not been described. To this end we performed whole-genome chromatin immunoprecipitation and deep sequencing to quantify six histone modifications, reduced representation bisulphite sequencing to quantify DNA methylation and mRNA microarrays to quantify gene expression before and after 7 days of chondrogenic differentiation of MSCs in an alginate scaffold. To add to the clinical relevance of our observations, the study is based on primary bone marrow-derived MSCs from four donors, allowing us to investigate inter-individual variations. We see two levels of relationship between epigenetic marking and gene expression. First, a large number of genes ontogenetically linked to MSC properties and the musculoskeletal system are epigenetically prepatterned by moderate changes in H3K4me3 and H3K9ac near transcription start sites. Most of these genes remain transcriptionally unaltered. Second, transcriptionally upregulated genes, more closely associated with chondrogenesis, are marked by H3K36me3 in gene bodies, highly increased H3K4me3 and H3K9ac on promoters and 5' end of genes, and increased H3K27ac and H3K4me1 marking in at least one enhancer region per upregulated gene. Within the 7-day time frame, changes in promoter DNA methylation do not correlate significantly with changes in gene expression. Inter-donor variability analysis shows high level of similarity between the donors for this data set. Histone modifications, rather than DNA methylation, provide the primary epigenetic control of early differentiation of MSCs towards the chondrogenic lineage.
The human genome initiative: a statement of need.
Watson, J D
1991-10-15
We may never have a complete understanding of the complex dynamics of the human organism, but we can and should know all our genes and begin to understand their role in the diseases that diminish our lives. A 15-year program has been projected and the specific, quantifiable goals in mapping and sequencing are outlined. Ethical, legal, and social implications are also discussed.
Molecular methods for rapidly quantifying defined Bacteroidales species from the human gastrointestinal tract may have important clinical and environmental applications, ranging from diagnosis of infections to fecal source tracking in surface waters. In this study, sequences from...
Garland, Ellen C; Noad, Michael J; Goldizen, Anne W; Lilley, Matthew S; Rekdahl, Melinda L; Garrigue, Claire; Constantine, Rochelle; Daeschler Hauser, Nan; Poole, M Michael; Robbins, Jooke
2013-01-01
Humpback whales have a continually evolving vocal sexual display, or "song," that appears to undergo both evolutionary and "revolutionary" change. All males within a population adhere to the current content and arrangement of the song. Populations within an ocean basin share similarities in their songs; this sharing is complex as multiple variations of the song (song types) may be present within a region at any one time. To quantitatively investigate the similarity of song types, songs were compared at both the individual singer and population level using the Levenshtein distance technique and cluster analysis. The highly stereotyped sequences of themes from the songs of 211 individuals from populations within the western and central South Pacific region from 1998 through 2008 were grouped together based on the percentage of song similarity, and compared to qualitatively assigned song types. The analysis produced clusters of highly similar songs that agreed with previous qualitative assignments. Each cluster contained songs from multiple populations and years, confirming the eastward spread of song types and their progressive evolution through the study region. Quantifying song similarity and exchange will assist in understanding broader song dynamics and contribute to the use of vocal displays as population identifiers.
Quantifying cortical surface harmonic deformation with stereovision during open cranial neurosurgery
NASA Astrophysics Data System (ADS)
Ji, Songbai; Fan, Xiaoyao; Roberts, David W.; Paulsen, Keith D.
2012-02-01
Cortical surface harmonic motion during open cranial neurosurgery is well observed in image-guided neurosurgery. Recently, we quantified cortical surface deformation noninvasively with synchronized blood pressure pulsation (BPP) from a sequence of stereo image pairs using optical flow motion tracking. With three subjects, we found the average cortical surface displacement can reach more than 1 mm and in-plane principal strains of up to 7% relative to the first image pair. In addition, the temporal changes in deformation and strain were in concert with BPP and patient respiration [1]. However, because deformation was essentially computed relative to an arbitrary reference, comparing cortical surface deformation at different times was not possible. In this study, we extend the technique developed earlier by establishing a more reliable reference profile of the cortical surface for each sequence of stereo image acquisitions. Specifically, fast Fourier transform (FFT) was applied to the dynamic cortical surface deformation, and the fundamental frequencies corresponding to patient respiration and BPP were identified, which were used to determine the number of image acquisitions for use in averaging cortical surface images. This technique is important because it potentially allows in vivo characterization of soft tissue biomechanical properties using intraoperative stereovision and motion tracking.
MR fingerprinting using fast imaging with steady state precession (FISP) with spiral readout.
Jiang, Yun; Ma, Dan; Seiberlich, Nicole; Gulani, Vikas; Griswold, Mark A
2015-12-01
This study explores the possibility of using gradient echo-based sequences other than balanced steady-state free precession (bSSFP) in the magnetic resonance fingerprinting (MRF) framework to quantify the relaxation parameters . An MRF method based on a fast imaging with steady-state precession (FISP) sequence structure is presented. A dictionary containing possible signal evolutions with physiological range of T1 and T2 was created using the extended phase graph formalism according to the acquisition parameters. The proposed method was evaluated in a phantom and a human brain. T1 , T2 , and proton density were quantified directly from the undersampled data by the pattern recognition algorithm. T1 and T2 values from the phantom demonstrate that the results of MRF FISP are in good agreement with the traditional gold-standard methods. T1 and T2 values in brain are within the range of previously reported values. MRF-FISP enables a fast and accurate quantification of the relaxation parameters. It is immune to the banding artifact of bSSFP due to B0 inhomogeneities, which could improve the ability to use MRF for applications beyond brain imaging. © 2014 Wiley Periodicals, Inc.
MR Fingerprinting Using Fast Imaging with Steady State Precession (FISP) with Spiral Readout
Jiang, Yun; Ma, Dan; Seiberlich, Nicole; Gulani, Vikas; Griswold, Mark A.
2015-01-01
Purpose This study explores the possibility of using gradient echo based sequences other than bSSFP in the magnetic resonance fingerprinting (MRF) framework to quantify the relaxation parameters. Methods An MRF method based on a fast imaging with steady state precession (FISP) sequence structure is presented. A dictionary containing possible signal evolutions with physiological range of T1 and T2 was created using the extended phase graph (EPG) formalism according to the acquisition parameters. The proposed method was evaluated in a phantom and a human brain. T1, T2 and proton density were quantified directly from the undersampled data by the pattern recognition algorithm. Results T1 and T2 values from the phantom demonstrate that the results of MRF FISP are in good agreement with the traditional gold-standard methods. T1 and T2 values in brain are within the range of previously reported values. Conclusion MRF FISP enables a fast and accurate quantification of the relaxation parameters, while is immune to the banding artifact of bSSFP due to B0 inhomogeneities, which could improve the ability to use MRF for applications beyond brain imaging. PMID:25491018
Leray, Matthieu; Knowlton, Nancy
2017-01-01
DNA metabarcoding, the PCR-based profiling of natural communities, is becoming the method of choice for biodiversity monitoring because it circumvents some of the limitations inherent to traditional ecological surveys. However, potential sources of bias that can affect the reproducibility of this method remain to be quantified. The interpretation of differences in patterns of sequence abundance and the ecological relevance of rare sequences remain particularly uncertain. Here we used one artificial mock community to explore the significance of abundance patterns and disentangle the effects of two potential biases on data reproducibility: indexed PCR primers and random sampling during Illumina MiSeq sequencing. We amplified a short fragment of the mitochondrial Cytochrome c Oxidase Subunit I (COI) for a single mock sample containing equimolar amounts of total genomic DNA from 34 marine invertebrates belonging to six phyla. We used seven indexed broad-range primers and sequenced the resulting library on two consecutive Illumina MiSeq runs. The total number of Operational Taxonomic Units (OTUs) was ∼4 times higher than expected based on the composition of the mock sample. Moreover, the total number of reads for the 34 components of the mock sample differed by up to three orders of magnitude. However, 79 out of 86 of the unexpected OTUs were represented by <10 sequences that did not appear consistently across replicates. Our data suggest that random sampling of rare OTUs (e.g., small associated fauna such as parasites) accounted for most of variation in OTU presence-absence, whereas biases associated with indexed PCRs accounted for a larger amount of variation in relative abundance patterns. These results suggest that random sampling during sequencing leads to the low reproducibility of rare OTUs. We suggest that the strategy for handling rare OTUs should depend on the objectives of the study. Systematic removal of rare OTUs may avoid inflating diversity based on common β descriptors but will exclude positive records of taxa that are functionally important. Our results further reinforce the need for technical replicates (parallel PCR and sequencing from the same sample) in metabarcoding experimental designs. Data reproducibility should be determined empirically as it will depend upon the sequencing depth, the type of sample, the sequence analysis pipeline, and the number of replicates. Moreover, estimating relative biomasses or abundances based on read counts remains elusive at the OTU level.
NASA Astrophysics Data System (ADS)
Gallet, F.; Bolmont, E.; Mathis, S.; Charbonnel, C.; Amard, L.
2017-08-01
Context. Star-planet interactions must be taken into account in stellar models to understand the dynamical evolution of close-in planets. The dependence of the tidal interactions on the structural and rotational evolution of the star is of particular importance and should be correctly treated. Aims: We quantify how tidal dissipation in the convective envelope of rotating low-mass stars evolves from the pre-main sequence up to the red-giant branch depending on the initial stellar mass. We investigate the consequences of this evolution on planetary orbital evolution. Methods: We couple the tidal dissipation formalism previously described to the stellar evolution code STAREVOL and apply this coupling to rotating stars with masses between 0.3 and 1.4 M⊙. As a first step, this formalism assumes a simplified bi-layer stellar structure with corresponding averaged densities for the radiative core and the convective envelope. We use a frequency-averaged treatment of the dissipation of tidal inertial waves in the convection zone (but neglect the dissipation of tidal gravity waves in the radiation zone). In addition, we generalize a recent work by following the orbital evolution of close-in planets using the new tidal dissipation predictions for advanced phases of stellar evolution. Results: On the pre-main sequence the evolution of tidal dissipation is controlled by the evolution of the internal structure of the contracting star. On the main sequence it is strongly driven by the variation of surface rotation that is impacted by magnetized stellar winds braking. The main effect of taking into account the rotational evolution of the stars is to lower the tidal dissipation strength by about four orders of magnitude on the main sequence, compared to a normalized dissipation rate that only takes into account structural changes. Conclusions: The evolution of the dissipation strongly depends on the evolution of the internal structure and rotation of the star. From the pre-main sequence up to the tip of the red-giant branch, it varies by several orders of magnitude, with strong consequences for the orbital evolution of close-in massive planets. These effects are the strongest during the pre-main sequence, implying that the planets are mainly sensitive to the star's early history.
Zoutman, Willem H; Nell, Rogier J; Versluis, Mieke; van Steenderen, Debby; Lalai, Rajshri N; Out-Luiting, Jacoba J; de Lange, Mark J; Vermeer, Maarten H; Langerak, Anton W; van der Velden, Pieter A
2017-03-01
Quantifying T cells accurately in a variety of tissues of benign, inflammatory, or malignant origin can be of great importance in a variety of clinical applications. Flow cytometry and immunohistochemistry are considered to be gold-standard methods for T-cell quantification. However, these methods require fresh, frozen, or fixated cells and tissue of a certain quality. In addition, conventional and droplet digital PCR (ddPCR), whether followed by deep sequencing techniques, have been used to elucidate T-cell content by focusing on rearranged T-cell receptor (TCR) genes. These approaches typically target the whole TCR repertoire, thereby supplying additional information about TCR use. We alternatively developed and validated two novel generic single duplex ddPCR assays to quantify T cells accurately by measuring loss of specific germline TCR loci and compared them with flow cytometry-based quantification. These assays target sequences between the Dδ2 and Dδ3 genes (TRD locus) and Dβ1 and Jβ1.1 genes (TRB locus) that become deleted systematically early during lymphoid differentiation. Because these ddPCR assays require small amounts of DNA instead of freshly isolated, frozen, or fixated material, initially unanalyzable (scarce) specimens can be assayed from now on, supplying valuable information about T-cell content. Our ddPCR method provides a novel and sensitive way for quantifying T cells relatively fast, accurate, and independent of the cellular context. Copyright © 2017 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Fetterhoff, Dustin; Kraft, Robert A.; Sandler, Roman A.; Opris, Ioan; Sexton, Cheryl A.; Marmarelis, Vasilis Z.; Hampson, Robert E.; Deadwyler, Sam A.
2015-01-01
Fractality, represented as self-similar repeating patterns, is ubiquitous in nature and the brain. Dynamic patterns of hippocampal spike trains are known to exhibit multifractal properties during working memory processing; however, it is unclear whether the multifractal properties inherent to hippocampal spike trains reflect active cognitive processing. To examine this possibility, hippocampal neuronal ensembles were recorded from rats before, during and after a spatial working memory task following administration of tetrahydrocannabinol (THC), a memory-impairing component of cannabis. Multifractal detrended fluctuation analysis was performed on hippocampal interspike interval sequences to determine characteristics of monofractal long-range temporal correlations (LRTCs), quantified by the Hurst exponent, and the degree/magnitude of multifractal complexity, quantified by the width of the singularity spectrum. Our results demonstrate that multifractal firing patterns of hippocampal spike trains are a marker of functional memory processing, as they are more complex during the working memory task and significantly reduced following administration of memory impairing THC doses. Conversely, LRTCs are largest during resting state recordings, therefore reflecting different information compared to multifractality. In order to deepen conceptual understanding of multifractal complexity and LRTCs, these measures were compared to classical methods using hippocampal frequency content and firing variability measures. These results showed that LRTCs, multifractality, and theta rhythm represent independent processes, while delta rhythm correlated with multifractality. Taken together, these results provide a novel perspective on memory function by demonstrating that the multifractal nature of spike trains reflects hippocampal microcircuit activity that can be used to detect and quantify cognitive, physiological, and pathological states. PMID:26441562
Yao, Qiu-Mei; Zhou, Jiao; Gale, Robert Peter; Li, Jin-Lan; Li, Ling-Di; Li, Ning; Chen, Shan-Shan; Ruan, Guo-Rui
2015-10-01
Calreticulin (CALR) mutations were recently identified in a substantial proportion of persons with essential thrombocythemia (ET) and with primary myelofibrosis (PMF) without JAK2(V617F). Consequently rapid, sensitive, and specific methods to detect and quantify these mutations are needed. We studied samples from 1088 persons with myeloproliferative neoplasms (MPNs) including 421 JAK2(V617F) negative subjects with ET, PMF, polycythemia vera (PV), chronic myeloid leukemia (CML) and hyper-eosinophilic syndrome (HES). Detection of CALR exon 9 mutations was done by PCR amplification followed by fragment length analysis and direct sequencing. Dilution assays were used to determine CALR mutant allele burden. We detected CALR mutations in blood and bone marrow samples from 152 subjects with ET and with PMF but not in samples from normal or persons with PV, CML, or HES. CALR mutant peaks were distinct from wild-type peaks and dilution experiments indicated a sensitivity level of 0.5-5% for a CALR mutant allele in a wild-type background. Diverse types of mutations were detected including deletions, insertions, and complex indels. All mutations were confirmed by direct sequencing. We also used dilution experiments to quantify mutant allele burden. We were able to reproducibly detect mutant allele levels as low 5% (0.5-5%) in a wild-type background. PCR amplification followed by fragment length analysis is a rapid, sensitive, and specific method for screening persons with MPNs for CALR mutations, especially those with ET and PMF and for estimating mutant allele burden.
Error Analysis of Deep Sequencing of Phage Libraries: Peptides Censored in Sequencing
Matochko, Wadim L.; Derda, Ratmir
2013-01-01
Next-generation sequencing techniques empower selection of ligands from phage-display libraries because they can detect low abundant clones and quantify changes in the copy numbers of clones without excessive selection rounds. Identification of errors in deep sequencing data is the most critical step in this process because these techniques have error rates >1%. Mechanisms that yield errors in Illumina and other techniques have been proposed, but no reports to date describe error analysis in phage libraries. Our paper focuses on error analysis of 7-mer peptide libraries sequenced by Illumina method. Low theoretical complexity of this phage library, as compared to complexity of long genetic reads and genomes, allowed us to describe this library using convenient linear vector and operator framework. We describe a phage library as N × 1 frequency vector n = ||ni||, where ni is the copy number of the ith sequence and N is the theoretical diversity, that is, the total number of all possible sequences. Any manipulation to the library is an operator acting on n. Selection, amplification, or sequencing could be described as a product of a N × N matrix and a stochastic sampling operator (S a). The latter is a random diagonal matrix that describes sampling of a library. In this paper, we focus on the properties of S a and use them to define the sequencing operator (S e q). Sequencing without any bias and errors is S e q = S a IN, where IN is a N × N unity matrix. Any bias in sequencing changes IN to a nonunity matrix. We identified a diagonal censorship matrix (C E N), which describes elimination or statistically significant downsampling, of specific reads during the sequencing process. PMID:24416071
Photovoltaic System Modeling. Uncertainty and Sensitivity Analyses
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hansen, Clifford W.; Martin, Curtis E.
2015-08-01
We report an uncertainty and sensitivity analysis for modeling AC energy from ph otovoltaic systems . Output from a PV system is predicted by a sequence of models. We quantify u ncertainty i n the output of each model using empirical distribution s of each model's residuals. We propagate uncertainty through the sequence of models by sampli ng these distributions to obtain a n empirical distribution of a PV system's output. We consider models that: (1) translate measured global horizontal, direct and global diffuse irradiance to plane - of - array irradiance; (2) estimate effective irradiance; (3) predict cell temperature;more » (4) estimate DC voltage, current and power ; (5) reduce DC power for losses due to inefficient maximum power point tracking or mismatch among modules; and (6) convert DC to AC power . O ur analysis consider s a notional PV system com prising an array of FirstSolar FS - 387 modules and a 250 kW AC inverter ; we use measured irradiance and weather at Albuquerque, NM. We found the uncertainty in PV syste m output to be relatively small, on the order of 1% for daily energy. We found that unce rtainty in the models for POA irradiance and effective irradiance to be the dominant contributors to uncertainty in predicted daily energy. Our analysis indicates that efforts to reduce the uncertainty in PV system output predictions may yield the greatest improvements by focusing on the POA and effective irradiance models.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Maingi, R.; Canik, J. M.; Bell, R. E.
A sequence of H-mode discharges with increasing levels of pre-discharge lithium evaporation (‘dose’) was conducted in high triangularity and elongation boundary shape in NSTX. Energy confinement increased, and recycling decreased with increasing lithium dose, similar to a previous lithium dose scan in medium triangularity and elongation plasmas. Data-constrained SOLPS interpretive modeling quantified the edge transport change: the electron particle diffusivity decreased by 10-30x. The electron thermal diffusivity decreased by 4x just inside the top of the pedestal, but increased by up to 5x very near the separatrix. These results provide a baseline expectation for lithium benefits in NSTX-U, which ismore » optimized for a boundary shape similar to the one in this experiment.« less
Maingi, R.; Canik, J. M.; Bell, R. E.; ...
2016-07-19
A sequence of H-mode discharges with increasing levels of pre-discharge lithium evaporation (‘dose’) was conducted in high triangularity and elongation boundary shape in NSTX. Energy confinement increased, and recycling decreased with increasing lithium dose, similar to a previous lithium dose scan in medium triangularity and elongation plasmas. Data-constrained SOLPS interpretive modeling quantified the edge transport change: the electron particle diffusivity decreased by 10-30x. The electron thermal diffusivity decreased by 4x just inside the top of the pedestal, but increased by up to 5x very near the separatrix. These results provide a baseline expectation for lithium benefits in NSTX-U, which ismore » optimized for a boundary shape similar to the one in this experiment.« less
A longitudinal analysis of the relationship between fertility timing and schooling.
Stange, Kevin
2011-08-01
This article quantifies the contribution of pre-treatment dynamic selection to the relationship between fertility timing and postsecondary attainment, after controlling for a rich set of predetermined characteristics. Eventual mothers and nonmothers are matched using their predicted birth hazard rate, which shares the desirable properties of a propensity score but in a multivalued treatment setting. I find that eventual mothers and matched nonmothers enter college at the same rate, but their educational paths diverge well before the former become pregnant. This pre-pregnancy divergence creates substantial differences in ultimate educational attainment that cannot possibly be due to the childbirth itself. Controls for predetermined characteristics and fixed effects do not address this form of dynamic selection bias. A dynamic model of the simultaneous childbirth-education sequencing decision is necessary to address it.
Garland, Ellen C; Rendell, Luke; Lilley, Matthew S; Poole, M Michael; Allen, Jenny; Noad, Michael J
2017-07-01
Identifying and quantifying variation in vocalizations is fundamental to advancing our understanding of processes such as speciation, sexual selection, and cultural evolution. The song of the humpback whale (Megaptera novaeangliae) presents an extreme example of complexity and cultural evolution. It is a long, hierarchically structured vocal display that undergoes constant evolutionary change. Obtaining robust metrics to quantify song variation at multiple scales (from a sound through to population variation across the seascape) is a substantial challenge. Here, the authors present a method to quantify song similarity at multiple levels within the hierarchy. To incorporate the complexity of these multiple levels, the calculation of similarity is weighted by measurements of sound units (lower levels within the display) to bridge the gap in information between upper and lower levels. Results demonstrate that the inclusion of weighting provides a more realistic and robust representation of song similarity at multiple levels within the display. This method permits robust quantification of cultural patterns and processes that will also contribute to the conservation management of endangered humpback whale populations, and is applicable to any hierarchically structured signal sequence.
Quantifying domain-ligand affinities and specificities by high-throughput holdup assay
Vincentelli, Renaud; Luck, Katja; Poirson, Juline; Polanowska, Jolanta; Abdat, Julie; Blémont, Marilyne; Turchetto, Jeremy; Iv, François; Ricquier, Kevin; Straub, Marie-Laure; Forster, Anne; Cassonnet, Patricia; Borg, Jean-Paul; Jacob, Yves; Masson, Murielle; Nominé, Yves; Reboul, Jérôme; Wolff, Nicolas; Charbonnier, Sebastian; Travé, Gilles
2015-01-01
Many protein interactions are mediated by small linear motifs interacting specifically with defined families of globular domains. Quantifying the specificity of a motif requires measuring and comparing its binding affinities to all its putative target domains. To this aim, we developed the high-throughput holdup assay, a chromatographic approach that can measure up to a thousand domain-motif equilibrium binding affinities per day. Extracts of overexpressed domains are incubated with peptide-coated resins and subjected to filtration. Binding affinities are deduced from microfluidic capillary electrophoresis of flow-throughs. After benchmarking the approach on 210 PDZ-peptide pairs with known affinities, we determined the affinities of two viral PDZ-binding motifs derived from Human Papillomavirus E6 oncoproteins for 209 PDZ domains covering 79% of the human PDZome. We obtained exquisite sequence-dependent binding profiles, describing quantitatively the PDZome recognition specificity of each motif. This approach, applicable to many categories of domain-ligand interactions, has a wide potential for quantifying the specificities of interactomes. PMID:26053890
Quantification of pilot workload via instrument scan
NASA Technical Reports Server (NTRS)
Tole, J. R.; Stephens, A. T.; Harris, R. L., Sr.; Ephrath, A.
1982-01-01
The use of visual scanning behavior as an indicator of pilot workload is described. The relationship between level of performance on a constant piloting task under simulated IFR conditions, the skill of the pilot the level of mental workload induced by an additional verbal task imposed on the basic control task, and visual scanning behavior is investigated. An increase in fixation dwell times, especially on the primary instrument with increased mental loading is indicated. Skilled subjects 'stared' less under increased loading than did novice pilots. Sequences of instrument fixations were also examined. The percentage occurrence of the subject's most used sequences decreased with increased task difficulty for novice subjects but not for highly skilled subjects. Entropy rate (bits/sec) of the sequence of fixations was also used to quantify the scan pattern. It consistently decreased for most subjects as the four loading levels used increased.
High-throughput sequencing methods to study neuronal RNA-protein interactions.
Ule, Jernej
2009-12-01
UV-cross-linking and RNase protection, combined with high-throughput sequencing, have provided global maps of RNA sites bound by individual proteins or ribosomes. Using a stringent purification protocol, UV-CLIP (UV-cross-linking and immunoprecipitation) was able to identify intronic and exonic sites bound by splicing regulators in mouse brain tissue. Ribosome profiling has been used to quantify ribosome density on budding yeast mRNAs under different environmental conditions. Post-transcriptional regulation in neurons requires high spatial and temporal precision, as is evident from the role of localized translational control in synaptic plasticity. It remains to be seen if the high-throughput methods can be applied quantitatively to study the dynamics of RNP (ribonucleoprotein) remodelling in specific neuronal populations during the neurodegenerative process. It is certain, however, that applications of new biochemical techniques followed by high-throughput sequencing will continue to provide important insights into the mechanisms of neuronal post-transcriptional regulation.
Methyl-CpG island-associated genome signature tags
Dunn, John J
2014-05-20
Disclosed is a method for analyzing the organismic complexity of a sample through analysis of the nucleic acid in the sample. In the disclosed method, through a series of steps, including digestion with a type II restriction enzyme, ligation of capture adapters and linkers and digestion with a type IIS restriction enzyme, genome signature tags are produced. The sequences of a statistically significant number of the signature tags are determined and the sequences are used to identify and quantify the organisms in the sample. Various embodiments of the invention described herein include methods for using single point genome signature tags to analyze the related families present in a sample, methods for analyzing sequences associated with hyper- and hypo-methylated CpG islands, methods for visualizing organismic complexity change in a sampling location over time and methods for generating the genome signature tag profile of a sample of fragmented DNA.
Genomic investigations of evolutionary dynamics and epistasis in microbial evolution experiments.
Jerison, Elizabeth R; Desai, Michael M
2015-12-01
Microbial evolution experiments enable us to watch adaptation in real time, and to quantify the repeatability and predictability of evolution by comparing identical replicate populations. Further, we can resurrect ancestral types to examine changes over evolutionary time. Until recently, experimental evolution has been limited to measuring phenotypic changes, or to tracking a few genetic markers over time. However, recent advances in sequencing technology now make it possible to extensively sequence clones or whole-population samples from microbial evolution experiments. Here, we review recent work exploiting these techniques to understand the genomic basis of evolutionary change in experimental systems. We first focus on studies that analyze the dynamics of genome evolution in microbial systems. We then survey work that uses observations of sequence evolution to infer aspects of the underlying fitness landscape, concentrating on the epistatic interactions between mutations and the constraints these interactions impose on adaptation. Copyright © 2015 Elsevier Ltd. All rights reserved.
Barcode extension for analysis and reconstruction of structures
NASA Astrophysics Data System (ADS)
Myhrvold, Cameron; Baym, Michael; Hanikel, Nikita; Ong, Luvena L.; Gootenberg, Jonathan S.; Yin, Peng
2017-03-01
Collections of DNA sequences can be rationally designed to self-assemble into predictable three-dimensional structures. The geometric and functional diversity of DNA nanostructures created to date has been enhanced by improvements in DNA synthesis and computational design. However, existing methods for structure characterization typically image the final product or laboriously determine the presence of individual, labelled strands using gel electrophoresis. Here we introduce a new method of structure characterization that uses barcode extension and next-generation DNA sequencing to quantitatively measure the incorporation of every strand into a DNA nanostructure. By quantifying the relative abundances of distinct DNA species in product and monomer bands, we can study the influence of geometry and sequence on assembly. We have tested our method using 2D and 3D DNA brick and DNA origami structures. Our method is general and should be extensible to a wide variety of DNA nanostructures.
Barcode extension for analysis and reconstruction of structures.
Myhrvold, Cameron; Baym, Michael; Hanikel, Nikita; Ong, Luvena L; Gootenberg, Jonathan S; Yin, Peng
2017-03-13
Collections of DNA sequences can be rationally designed to self-assemble into predictable three-dimensional structures. The geometric and functional diversity of DNA nanostructures created to date has been enhanced by improvements in DNA synthesis and computational design. However, existing methods for structure characterization typically image the final product or laboriously determine the presence of individual, labelled strands using gel electrophoresis. Here we introduce a new method of structure characterization that uses barcode extension and next-generation DNA sequencing to quantitatively measure the incorporation of every strand into a DNA nanostructure. By quantifying the relative abundances of distinct DNA species in product and monomer bands, we can study the influence of geometry and sequence on assembly. We have tested our method using 2D and 3D DNA brick and DNA origami structures. Our method is general and should be extensible to a wide variety of DNA nanostructures.
Régoudis, Estelle; Pélandakis, Michel
2016-02-01
The amoeba-flagellate Naegleria fowleri is a causative agent of primary amoebic meningoencephalitis (PAM). This thermophilic species occurs worldwide and tends to proliferate in warm aquatic environment. The PAM cases remain rare but this infection is mostly fatal. Here, we describe a single copy region which has been cloned and sequenced, and was used for both conventional and real-time PCR. Targeting a single-copy DNA sequence allows to directly quantify the N. fowleri cells. The real-time PCR results give a detection limit of 1 copy per reaction with high reproducibility without the need of a Taqman probe. This procedure is of interest as compared to other procedures which are mostly based on the detection of multi-copy DNA associated with a Taqman probe. Copyright © 2015 Elsevier Inc. All rights reserved.
Barcode extension for analysis and reconstruction of structures
Myhrvold, Cameron; Baym, Michael; Hanikel, Nikita; Ong, Luvena L; Gootenberg, Jonathan S; Yin, Peng
2017-01-01
Collections of DNA sequences can be rationally designed to self-assemble into predictable three-dimensional structures. The geometric and functional diversity of DNA nanostructures created to date has been enhanced by improvements in DNA synthesis and computational design. However, existing methods for structure characterization typically image the final product or laboriously determine the presence of individual, labelled strands using gel electrophoresis. Here we introduce a new method of structure characterization that uses barcode extension and next-generation DNA sequencing to quantitatively measure the incorporation of every strand into a DNA nanostructure. By quantifying the relative abundances of distinct DNA species in product and monomer bands, we can study the influence of geometry and sequence on assembly. We have tested our method using 2D and 3D DNA brick and DNA origami structures. Our method is general and should be extensible to a wide variety of DNA nanostructures. PMID:28287117
DNA enrichment approaches to identify unauthorized genetically modified organisms (GMOs).
Arulandhu, Alfred J; van Dijk, Jeroen P; Dobnik, David; Holst-Jensen, Arne; Shi, Jianxin; Zel, Jana; Kok, Esther J
2016-07-01
With the increased global production of different genetically modified (GM) plant varieties, chances increase that unauthorized GM organisms (UGMOs) may enter the food chain. At the same time, the detection of UGMOs is a challenging task because of the limited sequence information that will generally be available. PCR-based methods are available to detect and quantify known UGMOs in specific cases. If this approach is not feasible, DNA enrichment of the unknown adjacent sequences of known GMO elements is one way to detect the presence of UGMOs in a food or feed product. These enrichment approaches are also known as chromosome walking or gene walking (GW). In recent years, enrichment approaches have been coupled with next generation sequencing (NGS) analysis and implemented in, amongst others, the medical and microbiological fields. The present review will provide an overview of these approaches and an evaluation of their applicability in the identification of UGMOs in complex food or feed samples.
Lee, Seungeun; Yamamoto, Naomichi
2015-12-01
This study characterized the accuracy of high-throughput amplicon sequencing to identify species within the genus Aspergillus. To this end, we sequenced the internal transcribed spacer 1 (ITS1), β-tubulin (BenA), and calmodulin (CaM) gene encoding sequences as DNA markers from eight reference Aspergillus strains with known identities using 300-bp sequencing on the Illumina MiSeq platform, and compared them with the BLASTn outputs. The identifications with the sequences longer than 250 bp were accurate at the section rank, with some ambiguities observed at the species rank due to mostly cross detection of sibling species. Additionally, in silico analysis was performed to predict the identification accuracy for all species in the genus Aspergillus, where 107, 210, and 187 species were predicted to be identifiable down to the species rank based on ITS1, BenA, and CaM, respectively. Finally, air filter samples were analysed to quantify the relative abundances of Aspergillus species in outdoor air. The results were reproducible across biological duplicates both at the species and section ranks, but not strongly correlated between ITS1 and BenA, suggesting the Aspergillus detection can be taxonomically biased depending on the selection of the DNA markers and/or primers. Copyright © 2015 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.
Schouten, Jan P.; McElgunn, Cathal J.; Waaijer, Raymond; Zwijnenburg, Danny; Diepvens, Filip; Pals, Gerard
2002-01-01
We describe a new method for relative quantification of 40 different DNA sequences in an easy to perform reaction requiring only 20 ng of human DNA. Applications shown of this multiplex ligation-dependent probe amplification (MLPA) technique include the detection of exon deletions and duplications in the human BRCA1, MSH2 and MLH1 genes, detection of trisomies such as Down’s syndrome, characterisation of chromosomal aberrations in cell lines and tumour samples and SNP/mutation detection. Relative quantification of mRNAs by MLPA will be described elsewhere. In MLPA, not sample nucleic acids but probes added to the samples are amplified and quantified. Amplification of probes by PCR depends on the presence of probe target sequences in the sample. Each probe consists of two oligonucleotides, one synthetic and one M13 derived, that hybridise to adjacent sites of the target sequence. Such hybridised probe oligonucleotides are ligated, permitting subsequent amplification. All ligated probes have identical end sequences, permitting simultaneous PCR amplification using only one primer pair. Each probe gives rise to an amplification product of unique size between 130 and 480 bp. Probe target sequences are small (50–70 nt). The prerequisite of a ligation reaction provides the opportunity to discriminate single nucleotide differences. PMID:12060695
Schouten, Jan P; McElgunn, Cathal J; Waaijer, Raymond; Zwijnenburg, Danny; Diepvens, Filip; Pals, Gerard
2002-06-15
We describe a new method for relative quantification of 40 different DNA sequences in an easy to perform reaction requiring only 20 ng of human DNA. Applications shown of this multiplex ligation-dependent probe amplification (MLPA) technique include the detection of exon deletions and duplications in the human BRCA1, MSH2 and MLH1 genes, detection of trisomies such as Down's syndrome, characterisation of chromosomal aberrations in cell lines and tumour samples and SNP/mutation detection. Relative quantification of mRNAs by MLPA will be described elsewhere. In MLPA, not sample nucleic acids but probes added to the samples are amplified and quantified. Amplification of probes by PCR depends on the presence of probe target sequences in the sample. Each probe consists of two oligonucleotides, one synthetic and one M13 derived, that hybridise to adjacent sites of the target sequence. Such hybridised probe oligonucleotides are ligated, permitting subsequent amplification. All ligated probes have identical end sequences, permitting simultaneous PCR amplification using only one primer pair. Each probe gives rise to an amplification product of unique size between 130 and 480 bp. Probe target sequences are small (50-70 nt). The prerequisite of a ligation reaction provides the opportunity to discriminate single nucleotide differences.
HIV Type 1 Transmission Networks Among Men Having Sex with Men and Heterosexuals in Kenya
Faria, Nuno Rodrigues; Hassan, Amin; Hamers, Raph L.; Mutua, Gaudensia; Anzala, Omu; Mandaliya, Kishor; Cane, Patricia; Berkley, James A.; Rinke de Wit, Tobias F.; Wallis, Carole; Graham, Susan M.; Price, Matthew A.; Coutinho, Roel A.; Sanders, Eduard J.
2014-01-01
Abstract We performed a molecular phylogenetic study on HIV-1 polymerase sequences of men who have sex with men (MSM) and heterosexual patient samples in Kenya to characterize any observed HIV-1 transmission networks. HIV-1 polymerase sequences were obtained from samples in Nairobi and coastal Kenya from 84 MSM, 226 other men, and 364 women from 2005 to 2010. Using Bayesian phylogenetics, we tested whether sequences clustered by sexual orientation and geographic location. In addition, we used trait diffusion analyses to identify significant epidemiological links and to quantify the number of transmissions between risk groups. Finally, we compared 84 MSM sequences with all HIV-1 sequences available online at GenBank. Significant clustering of sequences from MSM at both coastal Kenya and Nairobi was found, with evidence of HIV-1 transmission between both locations. Although a transmission pair between a coastal MSM and woman was confirmed, no significant HIV-1 transmission was evident between MSM and the comparison population for the predominant subtype A (60%). However, a weak but significant link was evident when studying all subtypes together. GenBank comparison did not reveal other important transmission links. Our data suggest infrequent intermingling of MSM and heterosexual HIV-1 epidemics in Kenya. PMID:23947948
Hwang, Young Sun; Seo, Minseok; Choi, Hee Jung; Kim, Sang Kyung; Kim, Heebal; Han, Jae Yong
2018-04-01
The chicken is a valuable model organism, especially in evolutionary and embryology research because its embryonic development occurs in the egg. However, despite its scientific importance, no transcriptome data have been generated for deciphering the early developmental stages of the chicken because of practical and technical constraints in accessing pre-oviposited embryos. Here, we determine the entire transcriptome of pre-oviposited avian embryos, including oocyte, zygote, and intrauterine embryos from Eyal-giladi and Kochav stage I (EGK.I) to EGK.X collected using a noninvasive approach for the first time. We also compare RNA-sequencing data obtained using a bulked embryo sequencing and single embryo/cell sequencing technique. The raw sequencing data were preprocessed with two genome builds, Galgal4 and Galgal5, and the expression of 17,108 and 26,102 genes was quantified in the respective builds. There were some differences between the two techniques, as well as between the two genome builds, and these were affected by the emergence of long intergenic noncoding RNA annotations. The first transcriptome datasets of pre-oviposited early chicken embryos based on bulked and single embryo sequencing techniques will serve as a valuable resource for investigating early avian embryogenesis, for comparative studies among vertebrates, and for novel gene annotation in the chicken genome.
Accuracy of taxonomy prediction for 16S rRNA and fungal ITS sequences
2018-01-01
Prediction of taxonomy for marker gene sequences such as 16S ribosomal RNA (rRNA) is a fundamental task in microbiology. Most experimentally observed sequences are diverged from reference sequences of authoritatively named organisms, creating a challenge for prediction methods. I assessed the accuracy of several algorithms using cross-validation by identity, a new benchmark strategy which explicitly models the variation in distances between query sequences and the closest entry in a reference database. When the accuracy of genus predictions was averaged over a representative range of identities with the reference database (100%, 99%, 97%, 95% and 90%), all tested methods had ≤50% accuracy on the currently-popular V4 region of 16S rRNA. Accuracy was found to fall rapidly with identity; for example, better methods were found to have V4 genus prediction accuracy of ∼100% at 100% identity but ∼50% at 97% identity. The relationship between identity and taxonomy was quantified as the probability that a rank is the lowest shared by a pair of sequences with a given pair-wise identity. With the V4 region, 95% identity was found to be a twilight zone where taxonomy is highly ambiguous because the probabilities that the lowest shared rank between pairs of sequences is genus, family, order or class are approximately equal. PMID:29682424
ScaffoldSeq: Software for characterization of directed evolution populations.
Woldring, Daniel R; Holec, Patrick V; Hackel, Benjamin J
2016-07-01
ScaffoldSeq is software designed for the numerous applications-including directed evolution analysis-in which a user generates a population of DNA sequences encoding for partially diverse proteins with related functions and would like to characterize the single site and pairwise amino acid frequencies across the population. A common scenario for enzyme maturation, antibody screening, and alternative scaffold engineering involves naïve and evolved populations that contain diversified regions, varying in both sequence and length, within a conserved framework. Analyzing the diversified regions of such populations is facilitated by high-throughput sequencing platforms; however, length variability within these regions (e.g., antibody CDRs) encumbers the alignment process. To overcome this challenge, the ScaffoldSeq algorithm takes advantage of conserved framework sequences to quickly identify diverse regions. Beyond this, unintended biases in sequence frequency are generated throughout the experimental workflow required to evolve and isolate clones of interest prior to DNA sequencing. ScaffoldSeq software uniquely handles this issue by providing tools to quantify and remove background sequences, cluster similar protein families, and dampen the impact of dominant clones. The software produces graphical and tabular summaries for each region of interest, allowing users to evaluate diversity in a site-specific manner as well as identify epistatic pairwise interactions. The code and detailed information are freely available at http://research.cems.umn.edu/hackel. Proteins 2016; 84:869-874. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
The comet assay in human biomonitoring.
Anderson, Diana; Dhawan, Alok; Laubenthal, Julian
2013-01-01
Human biomonitoring studies aim to identify potential exposures to environmental, occupational, or lifestyle toxicants in human populations and are commonly used by public health decision makers to predict disease risk. The Comet assay measures changes in genomic stability and is one of the most reliable biomarkers to indicate early biological effects, and therefore accepted by various governmental regulatory agencies. The appeal of the Comet assay lies in its relative simplicity, rapidity, sensitivity, and economic efficiency. Furthermore, the assay is known for its broad versatility, as it can be applied to virtually any human cell and easily adapted in order to detect particular biomarkers of interest, such as DNA repair capacity or single- and double-strand breaks. In a standard experiment, isolated single cells are first embedded in agarose, and then lysed in high-salt solutions in order to remove all cellular contents except the DNA attached to a nuclear scaffold. Subsequent electrophoresis results in accumulation of undamaged DNA sequences at the proximity of the nuclear scaffold, while damaged sequences migrate towards the anode. When visualized with fluorochromes, these migrated DNA fragments resemble a comet tail and can be quantified for their intensity and shape according to internationally drafted guidelines.
Aryee, Martin J.; Jaffe, Andrew E.; Corrada-Bravo, Hector; Ladd-Acosta, Christine; Feinberg, Andrew P.; Hansen, Kasper D.; Irizarry, Rafael A.
2014-01-01
Motivation: The recently released Infinium HumanMethylation450 array (the ‘450k’ array) provides a high-throughput assay to quantify DNA methylation (DNAm) at ∼450 000 loci across a range of genomic features. Although less comprehensive than high-throughput sequencing-based techniques, this product is more cost-effective and promises to be the most widely used DNAm high-throughput measurement technology over the next several years. Results: Here we describe a suite of computational tools that incorporate state-of-the-art statistical techniques for the analysis of DNAm data. The software is structured to easily adapt to future versions of the technology. We include methods for preprocessing, quality assessment and detection of differentially methylated regions from the kilobase to the megabase scale. We show how our software provides a powerful and flexible development platform for future methods. We also illustrate how our methods empower the technology to make discoveries previously thought to be possible only with sequencing-based methods. Availability and implementation: http://bioconductor.org/packages/release/bioc/html/minfi.html. Contact: khansen@jhsph.edu; rafa@jimmy.harvard.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24478339
Ultrafast Comparison of Personal Genomes via Precomputed Genome Fingerprints
Glusman, Gustavo; Mauldin, Denise E.; Hood, Leroy E.; Robinson, Max
2017-01-01
We present an ultrafast method for comparing personal genomes. We transform the standard genome representation (lists of variants relative to a reference) into “genome fingerprints” via locality sensitive hashing. The resulting genome fingerprints can be meaningfully compared even when the input data were obtained using different sequencing technologies, processed using different pipelines, represented in different data formats and relative to different reference versions. Furthermore, genome fingerprints are robust to up to 30% missing data. Because of their reduced size, computation on the genome fingerprints is fast and requires little memory. For example, we could compute all-against-all pairwise comparisons among the 2504 genomes in the 1000 Genomes data set in 67 s at high quality (21 μs per comparison, on a single processor), and achieved a lower quality approximation in just 11 s. Efficient computation enables scaling up a variety of important genome analyses, including quantifying relatedness, recognizing duplicative sequenced genomes in a set, population reconstruction, and many others. The original genome representation cannot be reconstructed from its fingerprint, effectively decoupling genome comparison from genome interpretation; the method thus has significant implications for privacy-preserving genome analytics. PMID:29018478
Schneider, Ronen; Hoogstraten, Charlotte A.; Schapiro, David; Majmundar, Amar J.; Kolb, Amy; Eddy, Kaitlyn; Shril, Shirlee; Braun, Daniela A.; Poduri, Annapurna
2018-01-01
Until recently, morpholino oligonucleotides have been widely employed in zebrafish as an acute and efficient loss-of-function assay. However, off-target effects and reproducibility issues when compared to stable knockout lines have compromised their further use. Here we employed an acute CRISPR/Cas approach using multiple single guide RNAs targeting simultaneously different positions in two exemplar genes (osgep or tprkb) to increase the likelihood of generating mutations on both alleles in the injected F0 generation and to achieve a similar effect as morpholinos but with the reproducibility of stable lines. This multi single guide RNA approach resulted in median likelihoods for at least one mutation on each allele of >99% and sgRNA specific insertion/deletion profiles as revealed by deep-sequencing. Immunoblot showed a significant reduction for Osgep and Tprkb proteins. For both genes, the acute multi-sgRNA knockout recapitulated the microcephaly phenotype and reduction in survival that we observed previously in stable knockout lines, though milder in the acute multi-sgRNA knockout. Finally, we quantify the degree of mutagenesis by deep sequencing, and provide a mathematical model to quantitate the chance for a biallelic loss-of-function mutation. Our findings can be generalized to acute and stable CRISPR/Cas targeting for any zebrafish gene of interest. PMID:29346415
Lu, Ruipeng; Mucaki, Eliseos J; Rogan, Peter K
2017-03-17
Data from ChIP-seq experiments can derive the genome-wide binding specificities of transcription factors (TFs) and other regulatory proteins. We analyzed 765 ENCODE ChIP-seq peak datasets of 207 human TFs with a novel motif discovery pipeline based on recursive, thresholded entropy minimization. This approach, while obviating the need to compensate for skewed nucleotide composition, distinguishes true binding motifs from noise, quantifies the strengths of individual binding sites based on computed affinity and detects adjacent cofactor binding sites that coordinate with the targets of primary, immunoprecipitated TFs. We obtained contiguous and bipartite information theory-based position weight matrices (iPWMs) for 93 sequence-specific TFs, discovered 23 cofactor motifs for 127 TFs and revealed six high-confidence novel motifs. The reliability and accuracy of these iPWMs were determined via four independent validation methods, including the detection of experimentally proven binding sites, explanation of effects of characterized SNPs, comparison with previously published motifs and statistical analyses. We also predict previously unreported TF coregulatory interactions (e.g. TF complexes). These iPWMs constitute a powerful tool for predicting the effects of sequence variants in known binding sites, performing mutation analysis on regulatory SNPs and predicting previously unrecognized binding sites and target genes. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Relative Pose Estimation Using Image Feature Triplets
NASA Astrophysics Data System (ADS)
Chuang, T. Y.; Rottensteiner, F.; Heipke, C.
2015-03-01
A fully automated reconstruction of the trajectory of image sequences using point correspondences is turning into a routine practice. However, there are cases in which point features are hardly detectable, cannot be localized in a stable distribution, and consequently lead to an insufficient pose estimation. This paper presents a triplet-wise scheme for calibrated relative pose estimation from image point and line triplets, and investigates the effectiveness of the feature integration upon the relative pose estimation. To this end, we employ an existing point matching technique and propose a method for line triplet matching in which the relative poses are resolved during the matching procedure. The line matching method aims at establishing hypotheses about potential minimal line matches that can be used for determining the parameters of relative orientation (pose estimation) of two images with respect to the reference one; then, quantifying the agreement using the estimated orientation parameters. Rather than randomly choosing the line candidates in the matching process, we generate an associated lookup table to guide the selection of potential line matches. In addition, we integrate the homologous point and line triplets into a common adjustment procedure. In order to be able to also work with image sequences the adjustment is formulated in an incremental manner. The proposed scheme is evaluated with both synthetic and real datasets, demonstrating its satisfactory performance and revealing the effectiveness of image feature integration.
Analysis of financial time series using multiscale entropy based on skewness and kurtosis
NASA Astrophysics Data System (ADS)
Xu, Meng; Shang, Pengjian
2018-01-01
There is a great interest in studying dynamic characteristics of the financial time series of the daily stock closing price in different regions. Multi-scale entropy (MSE) is effective, mainly in quantifying the complexity of time series on different time scales. This paper applies a new method for financial stability from the perspective of MSE based on skewness and kurtosis. To better understand the superior coarse-graining method for the different kinds of stock indexes, we take into account the developmental characteristics of the three continents of Asia, North America and European stock markets. We study the volatility of different financial time series in addition to analyze the similarities and differences of coarsening time series from the perspective of skewness and kurtosis. A kind of corresponding relationship between the entropy value of stock sequences and the degree of stability of financial markets, were observed. The three stocks which have particular characteristics in the eight piece of stock sequences were discussed, finding the fact that it matches the result of applying the MSE method to showing results on a graph. A comparative study is conducted to simulate over synthetic and real world data. Results show that the modified method is more effective to the change of dynamics and has more valuable information. The result is obtained at the same time, finding the results of skewness and kurtosis discrimination is obvious, but also more stable.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Finley, Adam J.; Matt, Sean P., E-mail: af472@exeter.ac.uk
Cool stars with outer convective envelopes are observed to have magnetic fields with a variety of geometries, which on large scales are dominated by a combination of the lowest-order fields such as the dipole, quadrupole, and octupole modes. Magnetized stellar wind outflows are primarily responsible for the loss of angular momentum from these objects during the main sequence. Previous works have shown the reduced effectiveness of the stellar wind braking mechanism with increasingly complex but singular magnetic field geometries. In this paper, we quantify the impact of mixed dipolar and quadrupolar fields on the spin-down torque using 50 MHD simulationsmore » with mixed fields, along with 10 each of the pure geometries. The simulated winds include a wide range of magnetic field strength and reside in the slow-rotator regime. We find that the stellar wind braking torque from our combined geometry cases is well described by a broken power-law behavior, where the torque scaling with field strength can be predicted by the dipole component alone or the quadrupolar scaling utilizing the total field strength. The simulation results can be scaled and apply to all main-sequence cool stars. For solar parameters, the lowest-order component of the field (dipole in this paper) is the most significant in determining the angular momentum loss.« less
Infrared imaging to quantify the effects of nicotine-induced vasoconstriction in humans
NASA Astrophysics Data System (ADS)
Brunner, Siegfried; Kargel, Christian
2009-05-01
Smoking is the most significant source of preventable morbidity and premature mortality worldwide (WHO-2008). One of the many effects of nicotine is vasoconstriction which is triggered by the autonomic nervous system. The constriction of blood vessels e.g. of the skin's vascular bed is responsible for a decrease of the supply with oxygen and nutrients and a lowering of the skin temperature. We used infrared imaging to quantify temperature decreases caused by cigarette smoking in the extremities of smokers and also monitored heart rate as well as blood pressure. The results - including thermograms showing "temporary amputations" of the fingertips due to a significant temperature drop - can help increase the awareness of the dangers of smoking and the success of withdrawal programs. Surprisingly, in our control persons (3 brave non-smoking volunteers who smoked a cigarette) we also found temperature increases suggesting that vasodilation (widening of blood vessels) was provoked by cigarettes. To verify this unexpected finding and eliminate effects from the 4000 chemical compounds in the smoke, we repeated the experiment following a stringent protocol ruling out physiological and psychological influences with 9 habitual smokers and 17 nonsmokers who all chew gums with 2 mg of nicotine. Task-optimized digital image processing techniques (target detection, image-registration and -segmentation) were applied to the acquired infrared image sequences to automatically yield temperature plots of the fingers and palm. In this paper we present the results of our study in detail and show that smokers and non-smokers respond differently to the administration of nicotine.
Groves, Benjamin; Kuchina, Anna; Rosenberg, Alexander B.; Jojic, Nebojsa; Fields, Stanley; Seelig, Georg
2017-01-01
Our ability to predict protein expression from DNA sequence alone remains poor, reflecting our limited understanding of cis-regulatory grammar and hampering the design of engineered genes for synthetic biology applications. Here, we generate a model that predicts the protein expression of the 5′ untranslated region (UTR) of mRNAs in the yeast Saccharomyces cerevisiae. We constructed a library of half a million 50-nucleotide-long random 5′ UTRs and assayed their activity in a massively parallel growth selection experiment. The resulting data allow us to quantify the impact on protein expression of Kozak sequence composition, upstream open reading frames (uORFs), and secondary structure. We trained a convolutional neural network (CNN) on the random library and showed that it performs well at predicting the protein expression of both a held-out set of the random 5′ UTRs as well as native S. cerevisiae 5′ UTRs. The model additionally was used to computationally evolve highly active 5′ UTRs. We confirmed experimentally that the great majority of the evolved sequences led to higher protein expression rates than the starting sequences, demonstrating the predictive power of this model. PMID:29097404
Identifying strains that contribute to complex diseases through the study of microbial inheritance
Faith, Jeremiah J.; Colombel, Jean-Frédéric; Gordon, Jeffrey I.
2015-01-01
It has been 35 y since Carl Woese reported in PNAS how sequencing ribosomal RNA genes could be used to distinguish the three domains of life on Earth. During the past decade, 16S rDNA sequencing has enabled the now frequent enumeration of bacterial communities that populate the bodies of humans representing different ages, cultural traditions, and health states. A challenge going forward is to quantify the contributions of community members to wellness, disease risk, and disease pathogenesis. Here, we explore a theoretical framework for studies of the inheritance of bacterial strains and discuss the advantages and disadvantages of various study designs for assessing the contribution of strains to complex diseases. PMID:25576328
Revell, James; Mirmehdi, Majid; McNally, Donal
2005-06-01
We present the development and validation of an image based speckle tracking methodology, for determining temporal two-dimensional (2-D) axial and lateral displacement and strain fields from ultrasound video streams. We refine a multiple scale region matching approach incorporating novel solutions to known speckle tracking problems. Key contributions include automatic similarity measure selection to adapt to varying speckle density, quantifying trajectory fields, and spatiotemporal elastograms. Results are validated using tissue mimicking phantoms and in vitro data, before applying them to in vivo musculoskeletal ultrasound sequences. The method presented has the potential to improve clinical knowledge of tendon pathology from carpel tunnel syndrome, inflammation from implants, sport injuries, and many others.
Chew, David S. H.; Choi, Kwok Pui; Leung, Ming-Ying
2005-01-01
Many empirical studies show that there are unusual clusters of palindromes, closely spaced direct and inverted repeats around the replication origins of herpesviruses. In this paper, we introduce two new scoring schemes to quantify the spatial abundance of palindromes in a genomic sequence. Based on these scoring schemes, a computational method to predict the locations of replication origins is developed. When our predictions are compared with 39 known or annotated replication origins in 19 herpesviruses, close to 80% of the replication origins are located within 2% of the genome length. A list of predicted locations of replication origins in all the known herpesviruses with complete genome sequences is reported. PMID:16141192
Periodic, chaotic, and doubled earthquake recurrence intervals on the deep San Andreas Fault
Shelly, David R.
2010-01-01
Earthquake recurrence histories may provide clues to the timing of future events, but long intervals between large events obscure full recurrence variability. In contrast, small earthquakes occur frequently, and recurrence intervals are quantifiable on a much shorter time scale. In this work, I examine an 8.5-year sequence of more than 900 recurring low-frequency earthquake bursts composing tremor beneath the San Andreas fault near Parkfield, California. These events exhibit tightly clustered recurrence intervals that, at times, oscillate between ~3 and ~6 days, but the patterns sometimes change abruptly. Although the environments of large and low-frequency earthquakes are different, these observations suggest that similar complexity might underlie sequences of large earthquakes.
Wang, Jiahui; Fan, Zheng; Vandenborne, Krista; Walter, Glenn; Shiloh-Malawsky, Yael; An, Hongyu; Kornegay, Joe N; Styner, Martin A
2013-09-01
Golden retriever muscular dystrophy (GRMD) is a widely used canine model of Duchenne muscular dystrophy (DMD). Recent studies have shown that magnetic resonance imaging (MRI) can be used to non-invasively detect consistent changes in both DMD and GRMD. In this paper, we propose a semiautomated system to quantify MRI biomarkers of GRMD. Our system was applied to a database of 45 MRI scans from 8 normal and 10 GRMD dogs in a longitudinal natural history study. We first segmented six proximal pelvic limb muscles using a semiautomated full muscle segmentation method. We then performed preprocessing, including intensity inhomogeneity correction, spatial registration of different image sequences, intensity calibration of T2-weighted and T2-weighted fat-suppressed images, and calculation of MRI biomarker maps. Finally, for each of the segmented muscles, we automatically measured MRI biomarkers of muscle volume, intensity statistics over MRI biomarker maps, and statistical image texture features. The muscle volume and the mean intensities in T2 value, fat, and water maps showed group differences between normal and GRMD dogs. For the statistical texture biomarkers, both the histogram and run-length matrix features showed obvious group differences between normal and GRMD dogs. The full muscle segmentation showed significantly less error and variability in the proposed biomarkers when compared to the standard, limited muscle range segmentation. The experimental results demonstrated that this quantification tool could reliably quantify MRI biomarkers in GRMD dogs, suggesting that it would also be useful for quantifying disease progression and measuring therapeutic effect in DMD patients.
Brannon, Erin E.; Suorsa, Kristina I.; Wilson, Dawn K.
2014-01-01
Objective To evaluate and quantify the evidence for health promotion interventions in children and adolescents. Method 96 independent samples of smoking, physical activity, and diet studies were included. Outcomes included both objective and self-reports of health behavior, as well as proxy measures such as fitness. Results The aggregated effect was significant (g = .20, 95% confidence interval [CI] = 0.08–0.32, n = 96). A significant effect of intervention was observed at approximately 1-year follow-up (g = .07, 95% CI = 0.02–0.14, n = 20). The greatest risk of bias was failure to blind outcome assessment, which occurred in 21% of studies. Most studies lacked sufficient detail to determine the quality of their randomization sequence (58%). Additional concerns about risk of bias for individual studies were minimal. Overall, the quality of this finding was moderate using the Grading of Recommendations Assessment, Development, and Evaluation criteria. Conclusion Health promotion interventions are effective for modifying health behavior; however, effect sizes are small. PMID:24934246
NASA Technical Reports Server (NTRS)
Baker, D. N.; Bame, S. J.; Mccomas, D. J.; Zwickl, R. D.; Slavin, J. A.; Smith, E. J.
1987-01-01
Examination of many individual event periods in the ISEE 3 deep-tail data set has suggested that magnetospheric substorms produce a characteristic pattern of effects in the distant magnetotail. During the growth, or tail-energy-storage phase of substorms, the magnetotail appears to grow diametrically in size, often by many earth radii. Subsequently, after the substorm expansive phase onset at earth, the distant tail undergoes a sequence of plasma, field, and energetic-particle variations as large-scale plasmoids move rapidly down the tail following their disconnection from the near-earth plasma sheet. ISEE 3 data are appropriate for the study of these effects since the spacecraft remained fixed within the nominal tail location for long periods. Using newly available auroral electrojet indices (AE and AL) and Geo particle data to time substorm onsets at earth, superposed epoch analyses of ISEE 3 and near-earth data prior to, and following, substorm expansive phase onsets have been performed. These analyses quantify and extend substantially the understanding of the deep-tail pattern of response to global substorm-induced dynamical effects.
Mason, Christopher E.; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M.; Kallen, Roland G.; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B.
2010-01-01
Location analysis for estrogen receptor-α (ERα)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERα-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: <10% and 10–20% nucleotide deviation from the canonical ERE sequence. We demonstrate that ∼50% of all ERα-bound loci do not have a discernable ERE and show that most ERα-bound EREs are not perfect consensus EREs. Approximately one-third of all ERα-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERα-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERα binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers. PMID:20047966
Mason, Christopher E; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M; Kallen, Roland G; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B
2010-04-01
Location analysis for estrogen receptor-alpha (ERalpha)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERalpha-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: <10% and 10-20% nucleotide deviation from the canonical ERE sequence. We demonstrate that approximately 50% of all ERalpha-bound loci do not have a discernable ERE and show that most ERalpha-bound EREs are not perfect consensus EREs. Approximately one-third of all ERalpha-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERalpha-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERalpha binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers.
Archaeal β diversity patterns under the seafloor along geochemical gradients
NASA Astrophysics Data System (ADS)
Koyano, Hitoshi; Tsubouchi, Taishi; Kishino, Hirohisa; Akutsu, Tatsuya
2014-09-01
Recently, deep drilling into the seafloor has revealed that there are vast sedimentary ecosystems of diverse microorganisms, particularly archaea, in subsurface areas. We investigated the β diversity patterns of archaeal communities in sediment layers under the seafloor and their determinants. This study was accomplished by analyzing large environmental samples of 16S ribosomal RNA gene sequences and various geochemical data collected from a sediment core of 365.3 m, obtained by drilling into the seafloor off the east coast of the Shimokita Peninsula. To extract the maximum amount of information from these environmental samples, we first developed a method for measuring β diversity using sequence data by applying probability theory on a set of strings developed by two of the authors in a previous publication. We introduced an index of β diversity between sequence populations from which the sequence data were sampled. We then constructed an estimator of the β diversity index based on the sequence data and demonstrated that it converges to the β diversity index between sequence populations with probability of 1 as the number of sampled sequences increases. Next, we applied this new method to quantify β diversities between archaeal sequence populations under the seafloor and constructed a quantitative model of the estimated β diversity patterns. Nearly 90% of the variation in the archaeal β diversity was explained by a model that included as variables the differences in the abundances of chlorine, iodine, and carbon between the sediment layers.
Heuristic reusable dynamic programming: efficient updates of local sequence alignment.
Hong, Changjin; Tewfik, Ahmed H
2009-01-01
Recomputation of the previously evaluated similarity results between biological sequences becomes inevitable when researchers realize errors in their sequenced data or when the researchers have to compare nearly similar sequences, e.g., in a family of proteins. We present an efficient scheme for updating local sequence alignments with an affine gap model. In principle, using the previous matching result between two amino acid sequences, we perform a forward-backward alignment to generate heuristic searching bands which are bounded by a set of suboptimal paths. Given a correctly updated sequence, we initially predict a new score of the alignment path for each contour to select the best candidates among them. Then, we run the Smith-Waterman algorithm in this confined space. Furthermore, our heuristic alignment for an updated sequence shows that it can be further accelerated by using reusable dynamic programming (rDP), our prior work. In this study, we successfully validate "relative node tolerance bound" (RNTB) in the pruned searching space. Furthermore, we improve the computational performance by quantifying the successful RNTB tolerance probability and switch to rDP on perturbation-resilient columns only. In our searching space derived by a threshold value of 90 percent of the optimal alignment score, we find that 98.3 percent of contours contain correctly updated paths. We also find that our method consumes only 25.36 percent of the runtime cost of sparse dynamic programming (sDP) method, and to only 2.55 percent of that of a normal dynamic programming with the Smith-Waterman algorithm.
Effects of eddy currents on selective spectral editing experiments at 3T.
Oeltzschner, Georg; Snoussi, Karim; Puts, Nicolaas A; Mikkelsen, Mark; Harris, Ashley D; Pradhan, Subechhya; Tsapkini, Kyrana; Schär, Michael; Barker, Peter B; Edden, Richard A E
2018-03-01
To investigate frequency-offset effects in edited magnetic resonance spectroscopy (MRS) experiments arising from B 0 eddy currents. Macromolecule-suppressed (MM-suppressed) γ-aminobutyric acid (GABA)-edited experiments were performed at 3T. Saturation-offset series of MEGA-PRESS experiments were performed in phantoms, in order to investigate different aspects of the relationship between the effective editing frequencies and eddy currents associated with gradient pulses in the sequence. Difference integrals were quantified for each series, and the offset dependence of the integrals was analyzed to quantify the difference in frequency (Δf) between the actual vs. nominal expected saturation frequency. Saturation-offset N-acetyl-aspartate-phantom experiments show that Δf varied with voxel orientation, ranging from 10.4 Hz (unrotated) to 6.4 Hz (45° rotation about the caudal-cranial axis) and 0.4 Hz (45° rotation about left-right axis), indicating that gradient-related B 0 eddy currents vary with crusher-gradient orientation. Fixing the crusher-gradient coordinate-frame substantially reduced the orientation dependence of Δf (to ∼2 Hz). Water-suppression crusher gradients also introduced a frequency offset, with Δf = 0.6 Hz ("excitation" water suppression), compared to 10.2 Hz (no water suppression). In vivo spectra showed a negative edited "GABA" signal, suggesting Δf on the order of 10 Hz; with fixed crusher-gradient coordinate-frame, the expected positive edited "GABA" signal was observed. Eddy currents associated with pulsed field gradients may have a considerable impact on highly frequency-selective spectral-editing experiments, such as MM-suppressed GABA editing at 3T. Careful selection of crusher gradient orientation may ameliorate these effects. 2 Technical Efficacy: Stage 1 J. Magn. Reson. Imaging 2018;47:673-681. © 2017 International Society for Magnetic Resonance in Medicine.
Anderson, Tavis K; Laegreid, William W; Cerutti, Francesco; Osorio, Fernando A; Nelson, Eric A; Christopher-Hennings, Jane; Goldberg, Tony L
2012-06-15
The extraordinary genetic and antigenic variability of RNA viruses is arguably the greatest challenge to the development of broadly effective vaccines. No single viral variant can induce sufficiently broad immunity, and incorporating all known naturally circulating variants into one multivalent vaccine is not feasible. Furthermore, no objective strategies currently exist to select actual viral variants that should be included or excluded in polyvalent vaccines. To address this problem, we demonstrate a method based on graph theory that quantifies the relative importance of viral variants. We demonstrate our method through application to the envelope glycoprotein gene of a particularly diverse RNA virus of pigs: porcine reproductive and respiratory syndrome virus (PRRSV). Using distance matrices derived from sequence nucleotide difference, amino acid difference and evolutionary distance, we constructed viral networks and used common network statistics to assign each sequence an objective ranking of relative 'importance'. To validate our approach, we use an independent published algorithm to score our top-ranked wild-type variants for coverage of putative T-cell epitopes across the 9383 sequences in our dataset. Top-ranked viruses achieve significantly higher coverage than low-ranked viruses, and top-ranked viruses achieve nearly equal coverage as a synthetic mosaic protein constructed in silico from the same set of 9383 sequences. Our approach relies on the network structure of PRRSV but applies to any diverse RNA virus because it identifies subsets of viral variants that are most important to overall viral diversity. We suggest that this method, through the objective quantification of variant importance, provides criteria for choosing viral variants for further characterization, diagnostics, surveillance and ultimately polyvalent vaccine development.
Ziegel, Rebecca; Shallop, Anthony; Jones, Roger; Tretyakova, Natalia
2003-04-01
The tobacco specific pulmonary carcinogen 4-(methylnitrosamino)-1-(3-pyridyl)-1-butanone (NNK) is metabolically activated to electrophilic species that form methyl and pyridyloxobutyl adducts with genomic DNA, including O(6)-methylguanine, N7-methylguanine, and O(6)-[4-oxo-4-(3-pyridyl)butyl]guanine. If not repaired, these lesions could lead to mutations and the initiation of cancer. Previous studies used ligation-mediated polymerase chain reaction (LMPCR) in combination with PAGE to examine the distribution of NNK-induced strand breaks and alkali labile lesions (e.g., N7-methylguanine) within gene sequences. However, LMPCR cannot be used to establish the distribution patterns of highly promutagenic O(6)-methylguanine and O(6)-[4-oxo-4-(3-pyridyl)butyl]guanine adducts of NNK. We have developed methods based on stable isotope labeling HPLC-electrospray ionization tandem mass spectrometry (HPLC-ESI MS/MS) that enable us to accurately quantify NNK-induced adducts at defined sites within DNA sequences. In the present study, the formation of N7-methylguanine, O(6)-methylguanine, and O(6)-[4-oxo-4-(3-pyridyl)butyl]guanine adducts at specific positions within a K-ras gene-derived double-stranded DNA sequence (5'-G(1)G(2)AG(3)CTG(4)G(5)TG(6)G(7)CG(8)TA G(9)G(10)C-3') was investigated following treatment with activated NNK metabolites. All three lesions preferentially formed at the second position of codon 12 (GGT), the major mutational hotspot for G-->A and G-->T base substitutions observed in smoking-induced lung tumors. Therefore, our data support the involvement of NNK and other tobacco specific nitrosamines in mutagenesis and carcinogenesis.
Alkio, Merianne; Jonas, Uwe; Sprink, Thorben; van Nocker, Steven; Knoche, Moritz
2012-01-01
Background and Aims The cuticular membrane (CM) of Prunus avium (sweet cherry) and other fleshy fruit is under stress. Previous research indicates that the resultant strain promotes microscopic cuticular cracking. Microcracks impair the function of the CM as a barrier against pathogens and uncontrolled water loss/uptake. Stress and strain result from a cessation of CM deposition during early development, while the fruit surface continues to expand. The cessation of CM deposition, in turn, may be related to an early downregulation of CM-related genes. The aims of this study were to identify genes potentially involved in CM formation in sweet cherry fruit and to quantify their expression levels. Methods Fruit growth and CM deposition were quantified weekly from anthesis to maturity and rates of CM deposition were calculated. Sequences of genes expressed in the sweet cherry fruit skin (exocarp) were generated using high-throughput sequencing of cDNA and de novo assembly and analysed using bioinformatics tools. Relative mRNA levels of selected genes were quantified in the exocarp and fruit flesh (mesocarp) weekly using reverse transcriptase-quantitative real-time PCR and compared with the calculated CM deposition rate over time. Key Results The rate of CM deposition peaked at 93 (±5) μg per fruit d−1 about 19 d after anthesis. Based on sequence analyses, 18 genes were selected as potentially involved in CM formation. Selected sweet cherry genes shared up to 100 and 98 % similarity with the respective Prunus persica (peach) and Arabidopsis thaliana genes. Expression of 13 putative CM-related genes was restricted to the exocarp and correlated positively with the CM deposition rate. Conclusions The results support the view that the cessation of CM deposition during early sweet cherry fruit development is accounted for by a downregulation of genes involved in CM deposition. Genes that merit further investigation include PaWINA, PaWINB, PaLipase, PaLTPG1, PaATT1, PaLCR, PaGPAT4/8, PaLACS2, PaLACS1 and PaCER1. PMID:22610921
Dubois, Benjamin; Bertin, Pierre; Muhovski, Yordan; Escarnot, Emmanuelle; Mingeot, Dominique
2017-01-01
Celiac disease (CD) is caused by specific sequences of gluten proteins found in cereals such as bread wheat ( Triticum aestivum ssp. aestivum ) and spelt ( T. aestivum ssp. spelta ). Among them, the α-gliadins display the highest immunogenicity, with four T-cell stimulatory epitopes. The toxicity of each epitope sequence can be reduced or even suppressed according to the allelic form of each sequence. One way to address the CD problem would be to make use of this allelic variability in breeding programs to develop safe varieties, but tools to track the presence of toxic epitopes are required. The objective of this study was to develop a tool to accurately detect and quantify the immunogenic content of expressed α-gliadins of spelt and bread wheat. Four TaqMan probes that only hybridize to the canonical-i.e. toxic-form of each of the four epitopes were developed and their specificity was demonstrated. Six TaqMan probes targeting stable reference genes were also developed and constitute a tool to normalize qPCR data. The probes were used to measure the epitope expression levels of 11 contrasted spelt accessions and three ancestral diploid accessions of bread wheat and spelt. A high expression variability was highlighted among epitopes and among accessions, especially in Asian spelts, which showed lower epitope expression levels than the other spelts. Some discrepancies were identified between the canonical epitope expression level and the global amount of expressed α-gliadins, which makes the designed TaqMan probes a useful tool to quantify the immunogenic potential independently of the global amount of expressed α-gliadins. The results obtained in this study provide useful tools to study the immunogenic potential of expressed α-gliadin sequences from Triticeae accessions such as spelt and bread wheat. The application of the designed probes to contrasted spelt accessions revealed a high variability and interesting low canonical epitope expression levels in the Asian spelt accessions studied.
Zink, Jean-Vincent; Souteyrand, Philippe; Guis, Sandrine; Chagnaud, Christophe; Fur, Yann Le; Militianu, Daniela; Mattei, Jean-Pierre; Rozenbaum, Michael; Rosner, Itzhak; Guye, Maxime; Bernard, Monique; Bendahan, David
2015-01-01
AIM: To quantify the wrist cartilage cross-sectional area in humans from a 3D magnetic resonance imaging (MRI) dataset and to assess the corresponding reproducibility. METHODS: The study was conducted in 14 healthy volunteers (6 females and 8 males) between 30 and 58 years old and devoid of articular pain. Subjects were asked to lie down in the supine position with the right hand positioned above the pelvic region on top of a home-built rigid platform attached to the scanner bed. The wrist was wrapped with a flexible surface coil. MRI investigations were performed at 3T (Verio-Siemens) using volume interpolated breath hold examination (VIBE) and dual echo steady state (DESS) MRI sequences. Cartilage cross sectional area (CSA) was measured on a slice of interest selected from a 3D dataset of the entire carpus and metacarpal-phalangeal areas on the basis of anatomical criteria using conventional image processing radiology software. Cartilage cross-sectional areas between opposite bones in the carpal region were manually selected and quantified using a thresholding method. RESULTS: Cartilage CSA measurements performed on a selected predefined slice were 292.4 ± 39 mm2 using the VIBE sequence and slightly lower, 270.4 ± 50.6 mm2, with the DESS sequence. The inter (14.1%) and intra (2.4%) subject variability was similar for both MRI methods. The coefficients of variation computed for the repeated measurements were also comparable for the VIBE (2.4%) and the DESS (4.8%) sequences. The carpus length averaged over the group was 37.5 ± 2.8 mm with a 7.45% between-subjects coefficient of variation. Of note, wrist cartilage CSA measured with either the VIBE or the DESS sequences was linearly related to the carpal bone length. The variability between subjects was significantly reduced to 8.4% when the CSA was normalized with respect to the carpal bone length. CONCLUSION: The ratio between wrist cartilage CSA and carpal bone length is a highly reproducible standardized measurement which normalizes the natural diversity between individuals. PMID:26396941
NASA Astrophysics Data System (ADS)
Boss, Andreas; Martirosian, Petros; Artunc, Ferruh; Risler, Teut; Claussen, Claus D.; Schlemmer, Heinz-Peter; Schick, Fritz
2007-03-01
Purpose: As the MR contrast-medium gadobutrol is completely eliminated via glomerular filtration, the glomerular filtration rate (GFR) can be quantified after bolus-injection of gadobutrol and complete mixing in the extracellular fluid volume (ECFV) by measuring the signal decrease within the liver parenchyma. Two different navigator-gated single-shot saturation-recovery sequences have been tested for suitability of GFR quantification: a TurboFLASH and a TrueFISP readout technique. Materials and Methods: Ten healthy volunteers (mean age 26.1+/-3.6) were equally devided in two subgroups. After bolus-injection of 0.05 mmol/kg gadobutrol, coronal single-slice images of the liver were recorded every 4-5 seconds during free breathing using either the TurboFLASH or the TrueFISP technique. Time-intensity curves were determined from manually drawn regions-of-interest over the liver parenchyma. Both sequences were subsequently evaluated regarding signal to noise ratio (SNR) and the behaviour of signal intensity curves. The calculated GFR values were compared to an iopromide clearance gold standard. Results: The TrueFISP sequence exhibited a 3.4-fold higher SNR as compared to the TurboFLASH sequence and markedly lower variability of the recorded time-intensity curves. The calculated mean GFR values were 107.0+/-16.1 ml/min/1.73m2 (iopromide: 92.1+/-14.5 ml/min/1.73m2) for the TrueFISP technique and 125.6+/-24.1 ml/min/1.73m2 (iopromide: 97.7+/-6.3 ml/min/1.73m2) for the TurboFLASH approach. The mean paired differences with TrueFISP was lower (15.0 ml/min/1.73m2) than in the TurboFLASH method (27.9 ml/min/1.73m2). Conclusion: The global GFR can be quantified via measurement of gadobutrol clearance from the ECFV. A saturation-recovery TrueFISP sequence allows for more reliable GFR quantification as a saturation recovery TurboFLASH technique.
Demetriou, Eleni; Tachrount, Mohamed; Zaiss, Moritz; Shmueli, Karin; Golay, Xavier
2018-03-05
To develop a new MRI technique to rapidly measure exchange rates in CEST MRI. A novel pulse sequence for measuring chemical exchange rates through a progressive saturation recovery process, called PRO-QUEST (progressive saturation for quantifying exchange rates using saturation times), has been developed. Using this method, the water magnetization is sampled under non-steady-state conditions, and off-resonance saturation is interleaved with the acquisition of images obtained through a Look-Locker type of acquisition. A complete theoretical framework has been set up, and simple equations to obtain the exchange rates have been derived. A reduction of scan time from 58 to 16 minutes has been obtained using PRO-QUEST versus the standard QUEST. Maps of both T 1 of water and B 1 can simply be obtained by repetition of the sequence without off-resonance saturation pulses. Simulations and calculated exchange rates from experimental data using amino acids such as glutamate, glutamine, taurine, and alanine were compared and found to be in good agreement. The PRO-QUEST sequence was also applied on healthy and infarcted rats after 24 hours, and revealed that imaging specificity to ischemic acidification during stroke was substantially increased relative to standard amide proton transfer-weighted imaging. Because of the reduced scan time and insensitivity to nonchemical exchange factors such as direct water saturation, PRO-QUEST can serve as an excellent alternative for researchers and clinicians interested to map pH changes in vivo. © 2018 International Society for Magnetic Resonance in Medicine.
Quantifying selection in evolving populations using time-resolved genetic data
NASA Astrophysics Data System (ADS)
Illingworth, Christopher J. R.; Mustonen, Ville
2013-01-01
Methods which uncover the molecular basis of the adaptive evolution of a population address some important biological questions. For example, the problem of identifying genetic variants which underlie drug resistance, a question of importance for the treatment of pathogens, and of cancer, can be understood as a matter of inferring selection. One difficulty in the inference of variants under positive selection is the potential complexity of the underlying evolutionary dynamics, which may involve an interplay between several contributing processes, including mutation, recombination and genetic drift. A source of progress may be found in modern sequencing technologies, which confer an increasing ability to gather information about evolving populations, granting a window into these complex processes. One particularly interesting development is the ability to follow evolution as it happens, by whole-genome sequencing of an evolving population at multiple time points. We here discuss how to use time-resolved sequence data to draw inferences about the evolutionary dynamics of a population under study. We begin by reviewing our earlier analysis of a yeast selection experiment, in which we used a deterministic evolutionary framework to identify alleles under selection for heat tolerance, and to quantify the selection acting upon them. Considering further the use of advanced intercross lines to measure selection, we here extend this framework to cover scenarios of simultaneous recombination and selection, and of two driver alleles with multiple linked neutral, or passenger, alleles, where the driver pair evolves under an epistatic fitness landscape. We conclude by discussing the limitations of the approach presented and outlining future challenges for such methodologies.
Flibotte, Stephane; Moerman, Donald G
2008-10-21
Microarray comparative genomic hybridization (CGH) is currently one of the most powerful techniques to measure DNA copy number in large genomes. In humans, microarray CGH is widely used to assess copy number variants in healthy individuals and copy number aberrations associated with various diseases, syndromes and disease susceptibility. In model organisms such as Caenorhabditis elegans (C. elegans) the technique has been applied to detect mutations, primarily deletions, in strains of interest. Although various constraints on oligonucleotide properties have been suggested to minimize non-specific hybridization and improve the data quality, there have been few experimental validations for CGH experiments. For genomic regions where strict design filters would limit the coverage it would also be useful to quantify the expected loss in data quality associated with relaxed design criteria. We have quantified the effects of filtering various oligonucleotide properties by measuring the resolving power for detecting deletions in the human and C. elegans genomes using NimbleGen microarrays. Approximately twice as many oligonucleotides are typically required to be affected by a deletion in human DNA samples in order to achieve the same statistical confidence as one would observe for a deletion in C. elegans. Surprisingly, the ability to detect deletions strongly depends on the oligonucleotide 15-mer count, which is defined as the sum of the genomic frequency of all the constituent 15-mers within the oligonucleotide. A similarity level above 80% to non-target sequences over the length of the probe produces significant cross-hybridization. We recommend the use of a fairly large melting temperature window of up to 10 degrees C, the elimination of repeat sequences, the elimination of homopolymers longer than 5 nucleotides, and a threshold of -1 kcal/mol on the oligonucleotide self-folding energy. We observed very little difference in data quality when varying the oligonucleotide length between 50 and 70, and even when using an isothermal design strategy. We have determined experimentally the effects of varying several key oligonucleotide microarray design criteria for detection of deletions in C. elegans and humans with NimbleGen's CGH technology. Our oligonucleotide design recommendations should be applicable for CGH analysis in most species.
Classification of HCV and HIV-1 Sequences with the Branching Index
Hraber, Peter; Kuiken, Carla; Waugh, Mark; Geer, Shaun; Bruno, William J.; Leitner, Thomas
2009-01-01
SUMMARY Classification of viral sequences should be fast, objective, accurate, and reproducible. Most methods that classify sequences use either pairwise distances or phylogenetic relations, but cannot discern when a sequence is unclassifiable. The branching index (BI) combines distance and phylogeny methods to compute a ratio that quantifies how closely a query sequence clusters with a subtype clade. In the hypothesis-testing framework of statistical inference, the BI is compared with a threshold to test whether sufficient evidence exists for the query sequence to be classified among known sequences. If above the threshold, the null hypothesis of no support for the subtype relation is rejected and the sequence is taken as belonging to the subtype clade with which it clusters on the tree. This study evaluates statistical properties of the branching index for subtype classification in HCV and HIV-1. Pairs of BI values with known positive and negative test results were computed from 10,000 random fragments of reference alignments. Sampled fragments were of sufficient length to contain phylogenetic signal that groups reference sequences together properly into subtype clades. For HCV, a threshold BI of 0.71 yields 95.1% agreement with reference subtypes, with equal false positive and false negative rates. For HIV-1, a threshold of 0.66 yields 93.5% agreement. Higher thresholds can be used where lower false positive rates are required. In synthetic recombinants, regions without breakpoints are recognized accurately; regions with breakpoints do not uniquely represent any known subtype. Web-based services for viral subtype classification with the branching index are available online. PMID:18753218
Network harness: bundles of routes in public transport networks
NASA Astrophysics Data System (ADS)
Berche, B.; von Ferber, C.; Holovatch, T.
2009-12-01
Public transport routes sharing the same grid of streets and tracks are often found to proceed in parallel along shorter or longer sequences of stations. Similar phenomena are observed in other networks built with space consuming links such as cables, vessels, pipes, neurons, etc. In the case of public transport networks (PTNs) this behavior may be easily worked out on the basis of sequences of stations serviced by each route. To quantify this behavior we use the recently introduced notion of network harness. It is described by the harness distribution P(r, s): the number of sequences of s consecutive stations that are serviced by r parallel routes. For certain PTNs that we have analyzed we observe that the harness distribution may be described by power laws. These power laws indicate a certain level of organization and planning which may be driven by the need to minimize the costs of infrastructure and secondly by the fact that points of interest tend to be clustered in certain locations of a city. This effect may be seen as a result of the strong interdependence of the evolutions of both the city and its PTN. To further investigate the significance of the empirical results we have studied one- and two-dimensional models of randomly placed routes modeled by different types of walks. While in one dimension an analytic treatment was successful, the two dimensional case was studied by simulations showing that the empirical results for real PTNs deviate significantly from those expected for randomly placed routes.
Rojo, Liliana; Muhlia-Almazan, Adriana; Saborowski, Reinhard; García-Carreño, Fernando
2010-11-01
Acid digestive proteinases were studied in the gastric fluids of two species of clawed lobster (Homarus americanus and Homarus gammarus). An active protein was identified in both species as aspartic proteinase by specific inhibition with pepstatin A. It was confirmed as cathepsin D by mass mapping, N-terminal, and full-length cDNA sequencing. Both lobster species transcribed two cathepsin D mRNAs: cathepsin D1 and cathepsin D2. Cathepsin D1 mRNA was detected only in the midgut gland, suggesting its function as a digestive enzyme. Cathepsin D2 mRNA was found in the midgut gland, gonads, and muscle. The deduced amino acid sequence of cathepsin D1 and cathepsin D2 possesses two catalytic DTG active-site motifs, the hallmark of aspartic proteinases. The putatively active cathepsin D1 has a molecular mass of 36.4 kDa and a calculated pI of 4.14 and possesses three potential glycosylation sites. The sequences showed highest similarities with cathepsin D from insects but also with another crustacean cathepsin D. Cathepsin D1 transcripts were quantified during a starvation period using real-time qPCR. In H. americanus, 15 days of starvation did not cause significant changes, but subsequent feeding caused a 2.5-fold increase. In H. gammarus, starvation caused a 40% reduction in cathepsin D1 mRNA, and no effect was observed with subsequent feeding.
Metagenetic Sequencing of Zooplankton Communities in the High-Diversity Central North Pacific
NASA Astrophysics Data System (ADS)
Matthews, S. A.; van Woudenberg, L.; Iacchei, M.; Lenz, P. H.; Goetze, E.
2016-02-01
Marine zooplankton are important intermediate trophic level consumers in the ocean, and the subtropical North Pacific holds global maxima in species diversity for these communities. Zooplankton assemblages in this region include several species complexes, with many understudied and morphologically cryptic species. We used metagenetic sequencing to characterize zooplankton community composition across depth (0-1500m) at an open ocean time series site in the central North Pacific (Station ALOHA), using depth-stratified 1m2 MOCNESS samples that were size fractionated into 5 size classes (0.2-0.5 mm, 0.5-1 mm, 1-2 mm, 2-5 mm, >5 mm). Our goals were to quantify the fraction of the community that is currently undescribed, identify taxonomic groups that contain large numbers of undescribed species and may be important to biogeochemical cycling in the ocean, and establish a metagenetic method that can be used to effectively characterize the species richness of epipelagic and mesopelagic communities in this region. Amplicons from several DNA loci, including mitochondrial cytochrome c oxidase subunit I and 12S rRNA, and nuclear 18S and 28S rRNA genes were sequenced on the MiSeq Illumina platform to characterize community composition. We evaluate species composition across metagenetic marker regions, pelagic depth zones, day and night-time MOCNESS tows, and compare our findings with prior species lists from the region. Our results are an important contribution to establishing standardized metagenetic methods for marine zooplankton communities.
Petrie, Matthew D; Peters, Debra P C; Yao, Jin; Blair, John M; Burruss, Nathan D; Collins, Scott L; Derner, Justin D; Gherardi, Laureano A; Hendrickson, John R; Sala, Osvaldo E; Starks, Patrick J; Steiner, Jean L
2018-05-01
There is considerable uncertainty in the magnitude and direction of changes in precipitation associated with climate change, and ecosystem responses are also uncertain. Multiyear periods of above- and below-average rainfall may foretell consequences of changes in rainfall regime. We compiled long-term aboveground net primary productivity (ANPP) and precipitation (PPT) data for eight North American grasslands, and quantified relationships between ANPP and PPT at each site, and in 1-3 year periods of above- and below-average rainfall for mesic, semiarid cool, and semiarid warm grassland types. Our objective was to improve understanding of ANPP dynamics associated with changing climatic conditions by contrasting PPT-ANPP relationships in above- and below-average PPT years to those that occurred during sequences of multiple above- and below-average years. We found differences in PPT-ANPP relationships in above- and below-average years compared to long-term site averages, and variation in ANPP not explained by PPT totals that likely are attributed to legacy effects. The correlation between ANPP and current- and prior-year conditions changed from year to year throughout multiyear periods, with some legacy effects declining, and new responses emerging. Thus, ANPP in a given year was influenced by sequences of conditions that varied across grassland types and climates. Most importantly, the influence of prior-year ANPP often increased with the length of multiyear periods, whereas the influence of the amount of current-year PPT declined. Although the mechanisms by which a directional change in the frequency of above- and below-average years imposes a persistent change in grassland ANPP require further investigation, our results emphasize the importance of legacy effects on productivity for sequences of above- vs. below-average years, and illustrate the utility of long-term data to examine these patterns. © 2018 John Wiley & Sons Ltd.
Sodium and T1ρ MRI for molecular and diagnostic imaging of articular cartilage†
Borthakur, Arijitt; Mellon, Eric; Niyogi, Sampreet; Witschey, Walter; Kneeland, J. Bruce; Reddy, Ravinder
2010-01-01
In this article, both sodium magnetic resonance (MR) and T1ρ relaxation mapping aimed at measuring molecular changes in cartilage for the diagnostic imaging of osteoarthritis are reviewed. First, an introduction to structure of cartilage, its degeneration in osteoarthritis (OA) and an outline of diagnostic imaging methods in quantifying molecular changes and early diagnostic aspects of cartilage degeneration are described. The sodium MRI section begins with a brief overview of the theory of sodium NMR of biological tissues and is followed by a section on multiple quantum filters that can be used to quantify both bi-exponential relaxation and residual quadrupolar interaction. Specifically, (i) the rationale behind the use of sodium MRI in quantifying proteoglycan (PG) changes, (ii) validation studies using biochemical assays, (iii) studies on human OA specimens, (iv) results on animal models and (v) clinical imaging protocols are reviewed. Results demonstrating the feasibility of quantifying PG in OA patients and comparison with that in healthy subjects are also presented. The section concludes with the discussion of advantages and potential issues with sodium MRI and the impact of new technological advancements (e.g. ultra-high field scanners and parallel imaging methods). In the theory section on T1ρ, a brief description of (i) principles of measuring T1ρ relaxation, (ii) pulse sequences for computing T1ρ relaxation maps, (iii) issues regarding radio frequency power deposition, (iv) mechanisms that contribute to T1ρ in biological tissues and (v) effects of exchange and dipolar interaction on T1ρ dispersion are discussed. Correlation of T1ρ relaxation rate with macromolecular content and biomechanical properties in cartilage specimens subjected to trypsin and cytokine-induced glycosaminoglycan depletion and validation against biochemical assay and histopathology are presented. Experimental T1ρ data from osteoarthritic specimens, animal models, healthy human subjects and as well from osteoarthritic patients are provided. The current status of T1ρ relaxation mapping of cartilage and future directions is also discussed. PMID:17075961
Scheven, U M
2013-12-01
This paper describes a new variant of established stimulated echo pulse sequences, and an analytical method for determining diffusion or dispersion coefficients for Gaussian or non-Gaussian displacement distributions. The unipolar displacement encoding PFGSTE sequence uses trapezoidal gradient pulses of equal amplitude g and equal ramp rates throughout while sampling positive and negative halves of q-space. Usefully, the equal gradient amplitudes and gradient ramp rates help to reduce the impact of experimental artefacts caused by residual amplifier transients, eddy currents, or ferromagnetic hysteresis in components of the NMR magnet. The pulse sequence was validated with measurements of diffusion in water and of dispersion in flow through a packing of spheres. The analytical method introduced here permits the robust determination of the variance of non-Gaussian, dispersive displacement distributions. The noise sensitivity of the analytical method is shown to be negligible, using a demonstration experiment with a non-Gaussian longitudinal displacement distribution, measured on flow through a packing of mono-sized spheres. Copyright © 2013 Elsevier Inc. All rights reserved.
Fast multiclonal clusterization of V(D)J recombinations from high-throughput sequencing.
Giraud, Mathieu; Salson, Mikaël; Duez, Marc; Villenet, Céline; Quief, Sabine; Caillault, Aurélie; Grardel, Nathalie; Roumier, Christophe; Preudhomme, Claude; Figeac, Martin
2014-05-28
V(D)J recombinations in lymphocytes are essential for immunological diversity. They are also useful markers of pathologies. In leukemia, they are used to quantify the minimal residual disease during patient follow-up. However, the full breadth of lymphocyte diversity is not fully understood. We propose new algorithms that process high-throughput sequencing (HTS) data to extract unnamed V(D)J junctions and gather them into clones for quantification. This analysis is based on a seed heuristic and is fast and scalable because in the first phase, no alignment is performed with germline database sequences. The algorithms were applied to TR γ HTS data from a patient with acute lymphoblastic leukemia, and also on data simulating hypermutations. Our methods identified the main clone, as well as additional clones that were not identified with standard protocols. The proposed algorithms provide new insight into the analysis of high-throughput sequencing data for leukemia, and also to the quantitative assessment of any immunological profile. The methods described here are implemented in a C++ open-source program called Vidjil.
Automatic prediction of protein domains from sequence information using a hybrid learning system.
Nagarajan, Niranjan; Yona, Golan
2004-06-12
We describe a novel method for detecting the domain structure of a protein from sequence information alone. The method is based on analyzing multiple sequence alignments that are derived from a database search. Multiple measures are defined to quantify the domain information content of each position along the sequence and are combined into a single predictor using a neural network. The output is further smoothed and post-processed using a probabilistic model to predict the most likely transition positions between domains. The method was assessed using the domain definitions in SCOP and CATH for proteins of known structure and was compared with several other existing methods. Our method performs well both in terms of accuracy and sensitivity. It improves significantly over the best methods available, even some of the semi-manual ones, while being fully automatic. Our method can also be used to suggest and verify domain partitions based on structural data. A few examples of predicted domain definitions and alternative partitions, as suggested by our method, are also discussed. An online domain-prediction server is available at http://biozon.org/tools/domains/
Enzyme-free detection and quantification of double-stranded nucleic acids.
Feuillie, Cécile; Merheb, Maxime Mohamad; Gillet, Benjamin; Montagnac, Gilles; Hänni, Catherine; Daniel, Isabelle
2012-08-01
We have developed a fully enzyme-free SERRS hybridization assay for specific detection of double-stranded DNA sequences. Although all DNA detection methods ranging from PCR to high-throughput sequencing rely on enzymes, this method is unique for being totally non-enzymatic. The efficiency of enzymatic processes is affected by alterations, modifications, and/or quality of DNA. For instance, a limitation of most DNA polymerases is their inability to process DNA damaged by blocking lesions. As a result, enzymatic amplification and sequencing of degraded DNA often fail. In this study we succeeded in detecting and quantifying, within a mixture, relative amounts of closely related double-stranded DNA sequences from Rupicapra rupicapra (chamois) and Capra hircus (goat). The non-enzymatic SERRS assay presented here is the corner stone of a promising approach to overcome the failure of DNA polymerase when DNA is too degraded or when the concentration of polymerase inhibitors is too high. It is the first time double-stranded DNA has been detected with a truly non-enzymatic SERRS-based method. This non-enzymatic, inexpensive, rapid assay is therefore a breakthrough in nucleic acid detection.
Neural correlates of nesting behavior in zebra finches (Taeniopygia guttata).
Hall, Zachary J; Bertin, Marion; Bailey, Ida E; Meddle, Simone L; Healy, Susan D
2014-05-01
Nest building in birds involves a behavioral sequence (nest material collection and deposition in the nest) that offers a unique model for addressing how the brain sequences motor actions. In this study, we identified brain regions involved in nesting behavior in male and female zebra finches (Taeniopygia guttata). We used Fos immunohistochemistry to quantify production of the immediate early gene protein product Fos (a molecular indicator of neuronal activity) in the brain correlated this expression with the variation in nesting behavior. Using this technique, we found that neural circuitry involved in motor sequencing, social behavior, reward and motivation were active during nesting. Within pairs of nesting birds, the number of times a male picked up or deposited nesting material and the amount of time a female spent in the nest explained the variation in Fos expression in the anterior motor pathway, social behavior network, and reward neural circuits. Identification of the brain regions that are involved in nesting enables us to begin studying the roles of motor sequencing, context, and reward in construction behavior at the neural level. Copyright © 2014 Elsevier B.V. All rights reserved.
Neural correlates of nesting behavior in zebra finches (Taeniopygia guttata)
Hall, Zachary J.; Bertin, Marion; Bailey, Ida E.; Meddle, Simone L.; Healy, Susan D.
2014-01-01
Nest building in birds involves a behavioral sequence (nest material collection and deposition in the nest) that offers a unique model for addressing how the brain sequences motor actions. In this study, we identified brain regions involved in nesting behavior in male and female zebra finches (Taeniopygia guttata). We used Fos immunohistochemistry to quantify production of the immediate early gene protein product Fos (a molecular indicator of neuronal activity) in the brain correlated this expression with the variation in nesting behavior. Using this technique, we found that neural circuitry involved in motor sequencing, social behavior, reward and motivation were active during nesting. Within pairs of nesting birds, the number of times a male picked up or deposited nesting material and the amount of time a female spent in the nest explained the variation in Fos expression in the anterior motor pathway, social behavior network, and reward neural circuits. Identification of the brain regions that are involved in nesting enables us to begin studying the roles of motor sequencing, context, and reward in construction behavior at the neural level. PMID:24508238
Rare Cell Detection by Single-Cell RNA Sequencing as Guided by Single-Molecule RNA FISH.
Torre, Eduardo; Dueck, Hannah; Shaffer, Sydney; Gospocic, Janko; Gupte, Rohit; Bonasio, Roberto; Kim, Junhyong; Murray, John; Raj, Arjun
2018-02-28
Although single-cell RNA sequencing can reliably detect large-scale transcriptional programs, it is unclear whether it accurately captures the behavior of individual genes, especially those that express only in rare cells. Here, we use single-molecule RNA fluorescence in situ hybridization as a gold standard to assess trade-offs in single-cell RNA-sequencing data for detecting rare cell expression variability. We quantified the gene expression distribution for 26 genes that range from ubiquitous to rarely expressed and found that the correspondence between estimates across platforms improved with both transcriptome coverage and increased number of cells analyzed. Further, by characterizing the trade-off between transcriptome coverage and number of cells analyzed, we show that when the number of genes required to answer a given biological question is small, then greater transcriptome coverage is more important than analyzing large numbers of cells. More generally, our report provides guidelines for selecting quality thresholds for single-cell RNA-sequencing experiments aimed at rare cell analyses. Copyright © 2018 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Yusung, E-mail: yusung-kim@uiowa.edu; Muruganandham, Manickam; Modrick, Joseph M.
Purpose: The aim of this study was to characterize the levels of artifacts and distortions of titanium applicators on 3.0-Tesla magnetic resonance imaging (MRI). Methods and Materials: Fletcher-Suit-Delclos-style tandem and ovoids (T and O) and tandem and ring applicator (T and R) were examined. The quality assurance (QA) phantoms for each applicator were designed and filled with copper sulphate solution (1.5 g/l). The artifacts were quantified with the registration of corresponding computed tomography (CT) images. A favorable MR sequence was searched in terms of artifacts. Using the sequence, the artifacts were determined. The geometric distortions induced by the applicators weremore » quantified through each registration of CT and MRI without applicators. The artifacts of T and O were also evaluated on in vivo MRI datasets of 5 patients. Results: T1-weighted MRI with 1-mm slice thickness was found as a favorable MR sequence. Applying the sequence, the artifacts at the tandem tip of T and O and T and R were determined as 1.5 {+-} 0.5 mm in a superior direction in phantom studies. In the ovoids of T and O, we found artifacts less than 1.5 {+-} 0.5 mm. The artifacts of a T and O tandem in vivo were found as less than 2.6 {+-} 1.3 mm on T1-weighted MRI, whereas less than 6.9 {+-} 3.4 mm on T2-weighted MRI. No more than 1.2 {+-} 0.6 mm (3.0 {+-} 1.5 mm) of distortions, due to a titanium applicator, were measured on T1-weighted MRI (T2-). Conclusion: In 3.0-Tesla MRI, we found the artifact widths at the tip of tandem were less than 1.5 {+-} 0.5 mm for both T and O and T and R when using T1-weighted MRI in phantom studies. However, exclusive 3.0-Tesla MRI-guided brachytherapy planning with a titanium applicator should be cautiously implemented.« less
Low-Cost HIV-1 Diagnosis and Quantification in Dried Blood Spots by Real Time PCR
Mehta, Nishaki; Trzmielina, Sonia; Nonyane, Bareng A. S.; Eliot, Melissa N.; Lin, Rongheng; Foulkes, Andrea S.; McNeal, Kristina; Ammann, Arthur; Eulalievyolo, Vindu; Sullivan, John L.; Luzuriaga, Katherine; Somasundaran, Mohan
2009-01-01
Background Rapid and cost-effective methods for HIV-1 diagnosis and viral load monitoring would greatly enhance the clinical management of HIV-1 infected adults and children in limited-resource settings. Recent recommendations to treat perinatally infected infants within the first year of life are feasible only if early diagnosis is routinely available. Dried blood spots (DBS) on filter paper are an easy and convenient way to collect and transport blood samples. A rapid and cost effective method to diagnose and quantify HIV-1 from DBS is urgently needed to facilitate early diagnosis of HIV-1 infection and monitoring of antiretroviral therapy. Methods and Findings We have developed a real-time LightCycler (rtLC) PCR assay to detect and quantify HIV-1 from DBS. HIV-1 RNA extracted from DBS was amplified in a one-step, single-tube system using primers specific for long-terminal repeat sequences that are conserved across all HIV-1 clades. SYBR Green dye was used to quantify PCR amplicons and HIV-1 RNA copy numbers were determined from a standard curve generated using serially diluted known copies of HIV-1 RNA. This assay detected samples across clades, has a dynamic range of 5 log10, and %CV <8% up to 4 log10 dilution. Plasma HIV-1 RNA copy numbers obtained using this method correlated well with the Roche Ultrasensitive (r = 0.91) and branched DNA (r = 0.89) assays. The lower limit of detection (95%) was estimated to be 136 copies. The rtLC DBS assay was 2.5 fold rapid as well as 40-fold cheaper when compared to commercial assays. Adaptation of the assay into other real-time systems demonstrated similar performance. Conclusions The accuracy, reliability, genotype inclusivity and affordability, along with the small volumes of blood required for the assay suggest that the rtLC DBS assay will be useful for early diagnosis and monitoring of pediatric HIV-1 infection in resource-limited settings. PMID:19503790
Low-cost HIV-1 diagnosis and quantification in dried blood spots by real time PCR.
Mehta, Nishaki; Trzmielina, Sonia; Nonyane, Bareng A S; Eliot, Melissa N; Lin, Rongheng; Foulkes, Andrea S; McNeal, Kristina; Ammann, Arthur; Eulalievyolo, Vindu; Sullivan, John L; Luzuriaga, Katherine; Somasundaran, Mohan
2009-06-05
Rapid and cost-effective methods for HIV-1 diagnosis and viral load monitoring would greatly enhance the clinical management of HIV-1 infected adults and children in limited-resource settings. Recent recommendations to treat perinatally infected infants within the first year of life are feasible only if early diagnosis is routinely available. Dried blood spots (DBS) on filter paper are an easy and convenient way to collect and transport blood samples. A rapid and cost effective method to diagnose and quantify HIV-1 from DBS is urgently needed to facilitate early diagnosis of HIV-1 infection and monitoring of antiretroviral therapy. We have developed a real-time LightCycler (rtLC) PCR assay to detect and quantify HIV-1 from DBS. HIV-1 RNA extracted from DBS was amplified in a one-step, single-tube system using primers specific for long-terminal repeat sequences that are conserved across all HIV-1 clades. SYBR Green dye was used to quantify PCR amplicons and HIV-1 RNA copy numbers were determined from a standard curve generated using serially diluted known copies of HIV-1 RNA. This assay detected samples across clades, has a dynamic range of 5 log(10), and %CV <8% up to 4 log(10) dilution. Plasma HIV-1 RNA copy numbers obtained using this method correlated well with the Roche Ultrasensitive (r = 0.91) and branched DNA (r = 0.89) assays. The lower limit of detection (95%) was estimated to be 136 copies. The rtLC DBS assay was 2.5 fold rapid as well as 40-fold cheaper when compared to commercial assays. Adaptation of the assay into other real-time systems demonstrated similar performance. The accuracy, reliability, genotype inclusivity and affordability, along with the small volumes of blood required for the assay suggest that the rtLC DBS assay will be useful for early diagnosis and monitoring of pediatric HIV-1 infection in resource-limited settings.
NASA Astrophysics Data System (ADS)
Mao, X.; Harkness, M.; Lee, M. D.; Mack, E. E.; Dworatzek, S.; Acheson, C.; McCarty, P.; Barry, D. A.; Gerhard, J. I.
2006-12-01
SABRE (Source Area BioREmediation) is a public-private consortium whose charter is to determine if enhanced anaerobic bioremediation can result in effective and quantifiable treatment of chlorinated solvent DNAPL source areas. The focus of this 4-year, $5.7 million research project is a field site in the United Kingdom containing a TCE DNAPL source area. In preparation, a microcosm study was performed to determine the optimal combination of factors to support reductive dechlorination of TCE in site soil and groundwater. The study consisted of 168 bottles distributed between four laboratories (Dupont, GE, SiREM, and Terra Systems) and tested the impact of six carbon substrates (lactate, acetate, methanol, SRS (soybean oil), hexanol, butyl acetate), bioaugmentation with KB-1 bacterial culture, three TCE levels (100 mg/L, 400 mg/L, and 800 mg/L) and two sulphate levels (200 mg/L, >500 mg/L) on TCE dechlorination. This research presents a numerical model designed to simulate the main processes occurring in the microcosms, including substrate fermentation, sequential dechlorination, toxic inhibition, and the influence of sulphate concentration. In calibrating the model to over 60 of the microcosm experiments, lumped parameters were employed to quantify the effect of key factors on the conversion rate of each chlorinated ethene in the TCE degradation sequence. Results quantify the benefit (i.e., increased stepwise dechlorination rate) due to both bioaugmentation and the presence of higher sulphate concentrations. Competitive inhibition is found to increase in significance as TCE concentrations increase; however, inclusion of Haldane inhibition is not supported. Over a wide range of experimental conditions and dechlorination steps, SRS appears to induce relatively little hydrogen limitation, thereby facilitating relatively quick conversion of TCE to ethene. In general, hydrogen limitation is found to increase with increasing TCE concentration and with bioaugmentation, and is most pronounced in the dechlorination of TCE to DCE.
Terekhov, Maxim; Krummenacker, Jan; Denysenkov, Vasyl; Gerz, Kathrin; Prisner, Thomas; Schreiber, Laura Maria
2016-03-01
Overhauser dynamic nuclear polarization (DNP) allows the production of liquid hyperpolarized substrate inside the MRI magnet bore as well as its administration in continuous flow mode to acquire MR images with enhanced signal-to-noise ratio. We implemented inversion recovery preparation in order to improve contrast-to-noise ratio and to quantify the overall imaging performance of Overhauser DNP-enhanced MRI. The negative enhancement created by DNP in combination with inversion recovery (IR) preparation allows canceling selectively the signal originated from Boltzmann magnetization and visualizing only hyperpolarized fluid. The theoretical model describing gain of MR image intensity produced by steady-state continuous flow DNP hyperpolarized magnetization was established and proved experimentally. A precise quantification of signal originated purely from DNP hyperpolarization was achieved. A temperature effect on longitudinal relaxation had to be taken into account to fit experimental results with numerical prediction. Using properly adjusted IR preparation, the complete zeroing of thermal background magnetization was achieved, providing an essential increase of contrast-to-noise ratio of DNP-hyperpolarized water images. To quantify and optimize the steady-state conditions for MRI with continuous flow DNP, an approach similar to that incorporating transient-state thermal magnetization equilibrium in spoiled fast field echo imaging sequences can be used. © 2015 Wiley Periodicals, Inc.
NASA Astrophysics Data System (ADS)
Pan, Feng; Pachepsky, Yakov A.; Guber, Andrey K.; McPherson, Brian J.; Hill, Robert L.
2012-01-01
SummaryUnderstanding streamflow patterns in space and time is important for improving flood and drought forecasting, water resources management, and predictions of ecological changes. Objectives of this work include (a) to characterize the spatial and temporal patterns of streamflow using information theory-based measures at two thoroughly-monitored agricultural watersheds located in different hydroclimatic zones with similar land use, and (b) to elucidate and quantify temporal and spatial scale effects on those measures. We selected two USDA experimental watersheds to serve as case study examples, including the Little River experimental watershed (LREW) in Tifton, Georgia and the Sleepers River experimental watershed (SREW) in North Danville, Vermont. Both watersheds possess several nested sub-watersheds and more than 30 years of continuous data records of precipitation and streamflow. Information content measures (metric entropy and mean information gain) and complexity measures (effective measure complexity and fluctuation complexity) were computed based on the binary encoding of 5-year streamflow and precipitation time series data. We quantified patterns of streamflow using probabilities of joint or sequential appearances of the binary symbol sequences. Results of our analysis illustrate that information content measures of streamflow time series are much smaller than those for precipitation data, and the streamflow data also exhibit higher complexity, suggesting that the watersheds effectively act as filters of the precipitation information that leads to the observed additional complexity in streamflow measures. Correlation coefficients between the information-theory-based measures and time intervals are close to 0.9, demonstrating the significance of temporal scale effects on streamflow patterns. Moderate spatial scale effects on streamflow patterns are observed with absolute values of correlation coefficients between the measures and sub-watershed area varying from 0.2 to 0.6 in the two watersheds. We conclude that temporal effects must be evaluated and accounted for when the information theory-based methods are used for performance evaluation and comparison of hydrological models.
Dong, Lianhua; Meng, Ying; Wang, Jing; Liu, Yingying
2014-02-01
DNA reference materials of certified value have a critical function in many analytical processes of DNA measurement. Quantification of amoA genes in ammonia oxidizing bacteria (AOB) and archaea (AOA), and of nirS and nosZ genes in the denitrifiers is very important for determining their distribution and abundance in the natural environment. A plasmid reference material containing nirS, nosZ, amoA-AOB, and amoA-AOA is developed to provide a DNA standard with copy number concentration for ensuring comparability and reliability of quantification of these genes. Droplet digital PCR (ddPCR) was evaluated for characterization of the plasmid reference material. The result revealed that restriction endonuclease digestion of plasmids can improve amplification efficiency and minimize the measurement bias of ddPCR. Compared with the conformation of the plasmid, the size of the DNA fragment containing the target sequence and the location of the restriction site relative to the target sequence are not significant factors affecting plasmid quantification by ddPCR. Liquid chromatography-isotope dilution mass spectrometry (LC-IDMS) was used to provide independent data for quantifying the plasmid reference material. The copy number concentration of the digested plasmid determined by ddPCR agreed well with that determined by LC-IDMS, improving both the accuracy and reliability of the plasmid reference material. The reference value, with its expanded uncertainty (k = 2), of the plasmid reference material was determined to be (5.19 ± 0.41) × 10(9) copies μL(-1) by averaging the results of two independent measurements. Consideration of the factors revealed in this study can improve the reliability and accuracy of ddPCR; thus, this method has the potential to accurately quantify DNA reference materials.
Signal enhancement ratio (SER) quantified from breast DCE-MRI and breast cancer risk
NASA Astrophysics Data System (ADS)
Wu, Shandong; Kurland, Brenda F.; Berg, Wendie A.; Zuley, Margarita L.; Jankowitz, Rachel C.; Sumkin, Jules; Gur, David
2015-03-01
Breast magnetic resonance imaging (MRI) is recommended as an adjunct to mammography for women who are considered at elevated risk of developing breast cancer. As a key component of breast MRI, dynamic contrast-enhanced MRI (DCE-MRI) uses a contrast agent to provide high intensity contrast between breast tissues, making it sensitive to tissue composition and vascularity. Breast DCE-MRI characterizes certain physiologic properties of breast tissue that are potentially related to breast cancer risk. Studies have shown that increased background parenchymal enhancement (BPE), which is the contrast enhancement occurring in normal cancer-unaffected breast tissues in post-contrast sequences, predicts increased breast cancer risk. Signal enhancement ratio (SER) computed from pre-contrast and post-contrast sequences in DCE-MRI measures change in signal intensity due to contrast uptake over time and is a measure of contrast enhancement kinetics. SER quantified in breast tumor has been shown potential as a biomarker for characterizing tumor response to treatments. In this work we investigated the relationship between quantitative measures of SER and breast cancer risk. A pilot retrospective case-control study was performed using a cohort of 102 women, consisting of 51 women who had diagnosed with unilateral breast cancer and 51 matched controls (by age and MRI date) with a unilateral biopsy-proven benign lesion. SER was quantified using fully-automated computerized algorithms and three SER-derived quantitative volume measures were compared between the cancer cases and controls using logistic regression analysis. Our preliminary results showed that SER is associated with breast cancer risk, after adjustment for the Breast Imaging Reporting and Data System (BI-RADS)-based mammographic breast density measures. This pilot study indicated that SER has potential for use as a risk factor for breast cancer risk assessment in women at elevated risk of developing breast cancer.
Gallium plasmonic nanoparticles for label-free DNA and single nucleotide polymorphism sensing
NASA Astrophysics Data System (ADS)
Marín, Antonio García; García-Mendiola, Tania; Bernabeu, Cristina Navio; Hernández, María Jesús; Piqueras, Juan; Pau, Jose Luis; Pariente, Félix; Lorenzo, Encarnación
2016-05-01
A label-free DNA and single nucleotide polymorphism (SNP) sensing method is described. It is based on the use of the pseudodielectric function of gallium plasmonic nanoparticles (GaNPs) deposited on Si (100) substrates under reversal of the polarization handedness condition. Under this condition, the pseudodielectric function is extremely sensitive to changes in the surrounding medium of the nanoparticle surface providing an excellent sensing platform competitive to conventional surface plasmon resonance. DNA sensing has been carried out by immobilizing a thiolated capture probe sequence from Helicobacter pylori onto GaNP/Si substrates; complementary target sequences of Helicobacter pylori can be quantified over the range of 10 pM to 3.0 nM with a detection limit of 6.0 pM and a linear correlation coefficient of R2 = 0.990. The selectivity of the device allows the detection of a single nucleotide polymorphism (SNP) in a specific sequence of Helicobacter pylori, without the need for a hybridization suppressor in solution such as formamide. Furthermore, it also allows the detection of this sequence in the presence of other pathogens, such as Escherichia coli in the sample. The broad applicability of the system was demonstrated by the detection of a specific gene mutation directly associated with cystic fibrosis in large genomic DNA isolated from blood cells.A label-free DNA and single nucleotide polymorphism (SNP) sensing method is described. It is based on the use of the pseudodielectric function of gallium plasmonic nanoparticles (GaNPs) deposited on Si (100) substrates under reversal of the polarization handedness condition. Under this condition, the pseudodielectric function is extremely sensitive to changes in the surrounding medium of the nanoparticle surface providing an excellent sensing platform competitive to conventional surface plasmon resonance. DNA sensing has been carried out by immobilizing a thiolated capture probe sequence from Helicobacter pylori onto GaNP/Si substrates; complementary target sequences of Helicobacter pylori can be quantified over the range of 10 pM to 3.0 nM with a detection limit of 6.0 pM and a linear correlation coefficient of R2 = 0.990. The selectivity of the device allows the detection of a single nucleotide polymorphism (SNP) in a specific sequence of Helicobacter pylori, without the need for a hybridization suppressor in solution such as formamide. Furthermore, it also allows the detection of this sequence in the presence of other pathogens, such as Escherichia coli in the sample. The broad applicability of the system was demonstrated by the detection of a specific gene mutation directly associated with cystic fibrosis in large genomic DNA isolated from blood cells. Electronic supplementary information (ESI) available. See DOI: 10.1039/c6nr00926c
DOE Office of Scientific and Technical Information (OSTI.GOV)
Aldemir, Tunc; Denning, Richard; Catalyurek, Umit
Reduction in safety margin can be expected as passive structures and components undergo degradation with time. Limitations in the traditional probabilistic risk assessment (PRA) methodology constrain its value as an effective tool to address the impact of aging effects on risk and for quantifying the impact of aging management strategies in maintaining safety margins. A methodology has been developed to address multiple aging mechanisms involving large numbers of components (with possibly statistically dependent failures) within the PRA framework in a computationally feasible manner when the sequencing of events is conditioned on the physical conditions predicted in a simulation environment, suchmore » as the New Generation System Code (NGSC) concept. Both epistemic and aleatory uncertainties can be accounted for within the same phenomenological framework and maintenance can be accounted for in a coherent fashion. The framework accommodates the prospective impacts of various intervention strategies such as testing, maintenance, and refurbishment. The methodology is illustrated with several examples.« less
Photo-excited charge carriers suppress sub-terahertz phonon mode in silicon at room temperature
Liao, Bolin; Maznev, A. A.; Nelson, Keith A.; ...
2016-10-12
There is a growing interest in the mode-by-mode understanding of electron and phonon transport for improving energy conversion technologies, such as thermoelectrics and photovoltaics. Whereas remarkable progress has been made in probing phonon–phonon interactions, it has been a challenge to directly measure electron–phonon interactions at the single-mode level, especially their effect on phonon transport above cryogenic temperatures. Here in this paper, we use three-pulse photoacoustic spectroscopy to investigate the damping of a single sub-terahertz coherent phonon mode by free charge carriers in silicon at room temperature. Building on conventional pump–probe photoacoustic spectroscopy, we introduce an additional laser pulse to opticallymore » generate charge carriers, and carefully design temporal sequence of the three pulses to unambiguously quantify the scattering rate of a single-phonon mode due to the electron–phonon interaction. Our results confirm predictions from first-principles simulations and indicate the importance of the often-neglected effect of electron–phonon interaction on phonon transport in doped semiconductors.« less
A phase transition in energy-filtered RNA secondary structures.
Han, Hillary S W; Reidys, Christian M
2012-10-01
In this article we study the effect of energy parameters on minimum free energy (mfe) RNA secondary structures. Employing a simplified combinatorial energy model that is only dependent on the diagram representation and is not sequence-specific, we prove the following dichotomy result. Mfe structures derived via the Turner energy parameters contain only finitely many complex irreducible substructures, and just minor parameter changes produce a class of mfe structures that contain a large number of small irreducibles. We localize the exact point at which the distribution of irreducibles experiences this phase transition from a discrete limit to a central limit distribution and, subsequently, put our result into the context of quantifying the effect of sparsification of the folding of these respective mfe structures. We show that the sparsification of realistic mfe structures leads to a constant time and space reduction, and that the sparsification of the folding of structures with modified parameters leads to a linear time and space reduction. We, furthermore, identify the limit distribution at the phase transition as a Rayleigh distribution.
Deciphering the Mechanism of Alternative Cleavage and Polyadenylation in Mantle Cell Lymphoma (MCL)
2013-10-01
also has human firefly luciferase cloned within the same reporter system allowing for intra-plasmid normalization of transfection eliminating problems...collaboration with Dr. Wei Li, a Bioinformaticist from Baylor College of Medicine whose lab specializes in developing complex algorithms to analyze genome...wide sequencing data. Dr. Wei Li and his postdoctoral fellow, Dr. Zheng Xia developed a customized algorithm that is able to detect and quantify
Quantitative interaction proteomics using mass spectrometry.
Wepf, Alexander; Glatter, Timo; Schmidt, Alexander; Aebersold, Ruedi; Gstaiger, Matthias
2009-03-01
We present a mass spectrometry-based strategy for the absolute quantification of protein complex components isolated through affinity purification. We quantified bait proteins via isotope-labeled reference peptides corresponding to an affinity tag sequence and prey proteins by label-free correlational quantification using the precursor ion signal intensities of proteotypic peptides generated in reciprocal purifications. We used this method to quantitatively analyze interaction stoichiometries in the human protein phosphatase 2A network.
Mechanisms of force production during linear accelerations in bluegill sunfish Lepomis macrochirus
NASA Astrophysics Data System (ADS)
Tytell, Eric D.; Wise, Tyler N.; Boden, Alexandra L.; Sanders, Erin K.; Schwalbe, Margot A. B.
2016-11-01
In nature, fish rarely swim steadily. Although unsteady behaviors are common, we know little about how fish change their swimming kinematics for routine accelerations, and how these changes affect the fluid dynamic forces and the wake produced. To study force production during acceleration, particle image velocimetry was used to quantify the wake of bluegill sunfish Lepomis macrochirus and to estimate the pressure field during linear accelerations and steady swimming. We separated "steady" and "unsteady" trials and quantified the forward acceleration using inertial measurement units. Compared to steady sequences, unsteady sequences had larger accelerations and higher body amplitudes. The wake consisted of single vortices shed during each tail movement (a '2S' wake). The structure did not change during acceleration, but the circulation of the vortices increased, resulting in larger forces. A fish swimming unsteadily produced significantly more force than the same fish swimming steadily, even when the accelerations were the same. This increase is likely due to increased added mass during unsteady swimming, as a result of the larger body amplitude. Pressure estimates suggest that the increase in force is correlated with more low pressure regions on the anterior body. This work was supported by ARO W911NF-14-1-0494 and NSF RCN-PLS 1062052.
Walder, Florian; Schlaeppi, Klaus; Wittwer, Raphaël; Held, Alain Y; Vogelgsang, Susanne; van der Heijden, Marcel G A
2017-01-01
Fusarium head blight, caused by fungi from the genus Fusarium , is one of the most harmful cereal diseases, resulting not only in severe yield losses but also in mycotoxin contaminated and health-threatening grains. Fusarium head blight is caused by a diverse set of species that have different host ranges, mycotoxin profiles and responses to agricultural practices. Thus, understanding the composition of Fusarium communities in the field is crucial for estimating their impact and also for the development of effective control measures. Up to now, most molecular tools that monitor Fusarium communities on plants are limited to certain species and do not distinguish other plant associated fungi. To close these gaps, we developed a sequencing-based community profiling methodology for crop-associated fungi with a focus on the genus Fusarium . By analyzing a 1600 bp long amplicon spanning the highly variable segments ITS and D1-D3 of the ribosomal operon by PacBio SMRT sequencing, we were able to robustly quantify Fusarium down to species level through clustering against reference sequences. The newly developed methodology was successfully validated in mock communities and provided similar results as the culture-based assessment of Fusarium communities by seed health tests in grain samples from different crop species. Finally, we exemplified the newly developed methodology in a field experiment with a wheat-maize crop sequence under different cover crop and tillage regimes. We analyzed wheat straw residues, cover crop shoots and maize grains and we could reveal that the cover crop hairy vetch ( Vicia villosa ) acts as a potent alternative host for Fusarium (OTU F.ave/tri ) showing an eightfold higher relative abundance compared with other cover crop treatments. Moreover, as the newly developed methodology also allows to trace other crop-associated fungi, we found that vetch and green fallow hosted further fungal plant pathogens including Zymoseptoria tritici . Thus, besides their beneficial traits, cover crops can also entail phytopathological risks by acting as alternative hosts for Fusarium and other noxious plant pathogens. The newly developed sequencing based methodology is a powerful diagnostic tool to trace Fusarium in combination with other fungi associated to different crop species.
Walder, Florian; Schlaeppi, Klaus; Wittwer, Raphaël; Held, Alain Y.; Vogelgsang, Susanne; van der Heijden, Marcel G. A.
2017-01-01
Fusarium head blight, caused by fungi from the genus Fusarium, is one of the most harmful cereal diseases, resulting not only in severe yield losses but also in mycotoxin contaminated and health-threatening grains. Fusarium head blight is caused by a diverse set of species that have different host ranges, mycotoxin profiles and responses to agricultural practices. Thus, understanding the composition of Fusarium communities in the field is crucial for estimating their impact and also for the development of effective control measures. Up to now, most molecular tools that monitor Fusarium communities on plants are limited to certain species and do not distinguish other plant associated fungi. To close these gaps, we developed a sequencing-based community profiling methodology for crop-associated fungi with a focus on the genus Fusarium. By analyzing a 1600 bp long amplicon spanning the highly variable segments ITS and D1–D3 of the ribosomal operon by PacBio SMRT sequencing, we were able to robustly quantify Fusarium down to species level through clustering against reference sequences. The newly developed methodology was successfully validated in mock communities and provided similar results as the culture-based assessment of Fusarium communities by seed health tests in grain samples from different crop species. Finally, we exemplified the newly developed methodology in a field experiment with a wheat-maize crop sequence under different cover crop and tillage regimes. We analyzed wheat straw residues, cover crop shoots and maize grains and we could reveal that the cover crop hairy vetch (Vicia villosa) acts as a potent alternative host for Fusarium (OTU F.ave/tri) showing an eightfold higher relative abundance compared with other cover crop treatments. Moreover, as the newly developed methodology also allows to trace other crop-associated fungi, we found that vetch and green fallow hosted further fungal plant pathogens including Zymoseptoria tritici. Thus, besides their beneficial traits, cover crops can also entail phytopathological risks by acting as alternative hosts for Fusarium and other noxious plant pathogens. The newly developed sequencing based methodology is a powerful diagnostic tool to trace Fusarium in combination with other fungi associated to different crop species. PMID:29234337
NASA Astrophysics Data System (ADS)
Piao, Lin; Fu, Zuntao
2016-11-01
Cross-correlation between pairs of variables takes multi-time scale characteristic, and it can be totally different on different time scales (changing from positive correlation to negative one), e.g., the associations between mean air temperature and relative humidity over regions to the east of Taihang mountain in China. Therefore, how to correctly unveil these correlations on different time scales is really of great importance since we actually do not know if the correlation varies with scales in advance. Here, we compare two methods, i.e. Detrended Cross-Correlation Analysis (DCCA for short) and Pearson correlation, in quantifying scale-dependent correlations directly to raw observed records and artificially generated sequences with known cross-correlation features. Studies show that 1) DCCA related methods can indeed quantify scale-dependent correlations, but not Pearson method; 2) the correlation features from DCCA related methods are robust to contaminated noises, however, the results from Pearson method are sensitive to noise; 3) the scale-dependent correlation results from DCCA related methods are robust to the amplitude ratio between slow and fast components, while Pearson method may be sensitive to the amplitude ratio. All these features indicate that DCCA related methods take some advantages in correctly quantifying scale-dependent correlations, which results from different physical processes.
Piao, Lin; Fu, Zuntao
2016-11-09
Cross-correlation between pairs of variables takes multi-time scale characteristic, and it can be totally different on different time scales (changing from positive correlation to negative one), e.g., the associations between mean air temperature and relative humidity over regions to the east of Taihang mountain in China. Therefore, how to correctly unveil these correlations on different time scales is really of great importance since we actually do not know if the correlation varies with scales in advance. Here, we compare two methods, i.e. Detrended Cross-Correlation Analysis (DCCA for short) and Pearson correlation, in quantifying scale-dependent correlations directly to raw observed records and artificially generated sequences with known cross-correlation features. Studies show that 1) DCCA related methods can indeed quantify scale-dependent correlations, but not Pearson method; 2) the correlation features from DCCA related methods are robust to contaminated noises, however, the results from Pearson method are sensitive to noise; 3) the scale-dependent correlation results from DCCA related methods are robust to the amplitude ratio between slow and fast components, while Pearson method may be sensitive to the amplitude ratio. All these features indicate that DCCA related methods take some advantages in correctly quantifying scale-dependent correlations, which results from different physical processes.
MISTICA: Minimum Spanning Tree-based Coarse Image Alignment for Microscopy Image Sequences
Ray, Nilanjan; McArdle, Sara; Ley, Klaus; Acton, Scott T.
2016-01-01
Registration of an in vivo microscopy image sequence is necessary in many significant studies, including studies of atherosclerosis in large arteries and the heart. Significant cardiac and respiratory motion of the living subject, occasional spells of focal plane changes, drift in the field of view, and long image sequences are the principal roadblocks. The first step in such a registration process is the removal of translational and rotational motion. Next, a deformable registration can be performed. The focus of our study here is to remove the translation and/or rigid body motion that we refer to here as coarse alignment. The existing techniques for coarse alignment are unable to accommodate long sequences often consisting of periods of poor quality images (as quantified by a suitable perceptual measure). Many existing methods require the user to select an anchor image to which other images are registered. We propose a novel method for coarse image sequence alignment based on minimum weighted spanning trees (MISTICA) that overcomes these difficulties. The principal idea behind MISTICA is to re-order the images in shorter sequences, to demote nonconforming or poor quality images in the registration process, and to mitigate the error propagation. The anchor image is selected automatically making MISTICA completely automated. MISTICA is computationally efficient. It has a single tuning parameter that determines graph width, which can also be eliminated by way of additional computation. MISTICA outperforms existing alignment methods when applied to microscopy image sequences of mouse arteries. PMID:26415193
MISTICA: Minimum Spanning Tree-Based Coarse Image Alignment for Microscopy Image Sequences.
Ray, Nilanjan; McArdle, Sara; Ley, Klaus; Acton, Scott T
2016-11-01
Registration of an in vivo microscopy image sequence is necessary in many significant studies, including studies of atherosclerosis in large arteries and the heart. Significant cardiac and respiratory motion of the living subject, occasional spells of focal plane changes, drift in the field of view, and long image sequences are the principal roadblocks. The first step in such a registration process is the removal of translational and rotational motion. Next, a deformable registration can be performed. The focus of our study here is to remove the translation and/or rigid body motion that we refer to here as coarse alignment. The existing techniques for coarse alignment are unable to accommodate long sequences often consisting of periods of poor quality images (as quantified by a suitable perceptual measure). Many existing methods require the user to select an anchor image to which other images are registered. We propose a novel method for coarse image sequence alignment based on minimum weighted spanning trees (MISTICA) that overcomes these difficulties. The principal idea behind MISTICA is to reorder the images in shorter sequences, to demote nonconforming or poor quality images in the registration process, and to mitigate the error propagation. The anchor image is selected automatically making MISTICA completely automated. MISTICA is computationally efficient. It has a single tuning parameter that determines graph width, which can also be eliminated by the way of additional computation. MISTICA outperforms existing alignment methods when applied to microscopy image sequences of mouse arteries.
Substrate sequence selectivity of APOBEC3A implicates intra-DNA interactions.
Silvas, Tania V; Hou, Shurong; Myint, Wazo; Nalivaika, Ellen; Somasundaran, Mohan; Kelch, Brian A; Matsuo, Hiroshi; Kurt Yilmaz, Nese; Schiffer, Celia A
2018-05-14
The APOBEC3 (A3) family of human cytidine deaminases is renowned for providing a first line of defense against many exogenous and endogenous retroviruses. However, the ability of these proteins to deaminate deoxycytidines in ssDNA makes A3s a double-edged sword. When overexpressed, A3s can mutate endogenous genomic DNA resulting in a variety of cancers. Although the sequence context for mutating DNA varies among A3s, the mechanism for substrate sequence specificity is not well understood. To characterize substrate specificity of A3A, a systematic approach was used to quantify the affinity for substrate as a function of sequence context, length, secondary structure, and solution pH. We identified the A3A ssDNA binding motif as (T/C)TC(A/G), which correlated with enzymatic activity. We also validated that A3A binds RNA in a sequence specific manner. A3A bound tighter to substrate binding motif within a hairpin loop compared to linear oligonucleotide, suggesting A3A affinity is modulated by substrate structure. Based on these findings and previously published A3A-ssDNA co-crystal structures, we propose a new model with intra-DNA interactions for the molecular mechanism underlying A3A sequence preference. Overall, the sequence and structural preferences identified for A3A leads to a new paradigm for identifying A3A's involvement in mutation of endogenous or exogenous DNA.
Bidirectional Retroviral Integration Site PCR Methodology and Quantitative Data Analysis Workflow.
Suryawanshi, Gajendra W; Xu, Song; Xie, Yiming; Chou, Tom; Kim, Namshin; Chen, Irvin S Y; Kim, Sanggu
2017-06-14
Integration Site (IS) assays are a critical component of the study of retroviral integration sites and their biological significance. In recent retroviral gene therapy studies, IS assays, in combination with next-generation sequencing, have been used as a cell-tracking tool to characterize clonal stem cell populations sharing the same IS. For the accurate comparison of repopulating stem cell clones within and across different samples, the detection sensitivity, data reproducibility, and high-throughput capacity of the assay are among the most important assay qualities. This work provides a detailed protocol and data analysis workflow for bidirectional IS analysis. The bidirectional assay can simultaneously sequence both upstream and downstream vector-host junctions. Compared to conventional unidirectional IS sequencing approaches, the bidirectional approach significantly improves IS detection rates and the characterization of integration events at both ends of the target DNA. The data analysis pipeline described here accurately identifies and enumerates identical IS sequences through multiple steps of comparison that map IS sequences onto the reference genome and determine sequencing errors. Using an optimized assay procedure, we have recently published the detailed repopulation patterns of thousands of Hematopoietic Stem Cell (HSC) clones following transplant in rhesus macaques, demonstrating for the first time the precise time point of HSC repopulation and the functional heterogeneity of HSCs in the primate system. The following protocol describes the step-by-step experimental procedure and data analysis workflow that accurately identifies and quantifies identical IS sequences.
Community detection in sequence similarity networks based on attribute clustering
Chowdhary, Janamejaya; Loeffler, Frank E.; Smith, Jeremy C.
2017-07-24
Networks are powerful tools for the presentation and analysis of interactions in multi-component systems. A commonly studied mesoscopic feature of networks is their community structure, which arises from grouping together similar nodes into one community and dissimilar nodes into separate communities. Here in this paper, the community structure of protein sequence similarity networks is determined with a new method: Attribute Clustering Dependent Communities (ACDC). Sequence similarity has hitherto typically been quantified by the alignment score or its expectation value. However, pair alignments with the same score or expectation value cannot thus be differentiated. To overcome this deficiency, the method constructs,more » for pair alignments, an extended alignment metric, the link attribute vector, which includes the score and other alignment characteristics. Rescaling components of the attribute vectors qualitatively identifies a systematic variation of sequence similarity within protein superfamilies. The problem of community detection is then mapped to clustering the link attribute vectors, selection of an optimal subset of links and community structure refinement based on the partition density of the network. ACDC-predicted communities are found to be in good agreement with gold standard sequence databases for which the "ground truth" community structures (or families) are known. ACDC is therefore a community detection method for sequence similarity networks based entirely on pair similarity information. A serial implementation of ACDC is available from https://cmb.ornl.gov/resources/developments« less
Bypassing bacterial infection in phage display by sequencing DNA released from phage particles.
Villequey, Camille; Kong, Xu-Dong; Heinis, Christian
2017-11-01
Phage display relies on a bacterial infection step in which the phage particles are replicated to perform multiple affinity selection rounds and to enable the identification of isolated clones by DNA sequencing. While this process is efficient for wild-type phage, the bacterial infection rate of phage with mutant or chemically modified coat proteins can be low. For example, a phage mutant with a disulfide-free p3 coat protein, used for the selection of bicyclic peptides, has a more than 100-fold reduced infection rate compared to the wild-type. A potential strategy for bypassing the bacterial infection step is to directly sequence DNA extracted from phage particles after a single round of phage panning using high-throughput sequencing. In this work, we have quantified the fraction of phage clones that can be identified by directly sequencing DNA from phage particles. The results show that the DNA of essentially all of the phage particles can be 'decoded', and that the sequence coverage for mutants equals that of amplified DNA extracted from cells infected with wild-type phage. This procedure is particularly attractive for selections with phage that have a compromised infection capacity, and it may allow phage display to be performed with particles that are not infective at all. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Evaluating the role of coherent delocalized phonon-like modes in DNA cyclization
Alexandrov, Ludmil B.; Rasmussen, Kim Ã.; Bishop, Alan R.; ...
2017-08-29
The innate flexibility of a DNA sequence is quantified by the Jacobson-Stockmayer’s J-factor, which measures the propensity for DNA loop formation. Recent studies of ultra-short DNA sequences revealed a discrepancy of up to six orders of magnitude between experimentally measured and theoretically predicted J-factors. These large differences suggest that, in addition to the elastic moduli of the double helix, other factors contribute to loop formation. We develop a new theoretical model that explores how coherent delocalized phonon-like modes in DNA provide single-stranded ”flexible hinges” to assist in loop formation. We also combine the Czapla-Swigon-Olson structural model of DNA with ourmore » extended Peyrard-Bishop-Dauxois model and, without changing any of the parameters of the two models, apply this new computational framework to 86 experimentally characterized DNA sequences. Our results demonstrate that the new computational framework can predict J-factors within an order of magnitude of experimental measurements for most ultra-short DNA sequences, while continuing to accurately describe the J-factors of longer sequences. Furthermore, we demonstrate that our computational framework can be used to describe the cyclization of DNA sequences that contain a base pair mismatch. Overall, our results support the conclusion that coherent delocalized phonon-like modes play an important role in DNA cyclization.« less
Community detection in sequence similarity networks based on attribute clustering
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chowdhary, Janamejaya; Loeffler, Frank E.; Smith, Jeremy C.
Networks are powerful tools for the presentation and analysis of interactions in multi-component systems. A commonly studied mesoscopic feature of networks is their community structure, which arises from grouping together similar nodes into one community and dissimilar nodes into separate communities. Here in this paper, the community structure of protein sequence similarity networks is determined with a new method: Attribute Clustering Dependent Communities (ACDC). Sequence similarity has hitherto typically been quantified by the alignment score or its expectation value. However, pair alignments with the same score or expectation value cannot thus be differentiated. To overcome this deficiency, the method constructs,more » for pair alignments, an extended alignment metric, the link attribute vector, which includes the score and other alignment characteristics. Rescaling components of the attribute vectors qualitatively identifies a systematic variation of sequence similarity within protein superfamilies. The problem of community detection is then mapped to clustering the link attribute vectors, selection of an optimal subset of links and community structure refinement based on the partition density of the network. ACDC-predicted communities are found to be in good agreement with gold standard sequence databases for which the "ground truth" community structures (or families) are known. ACDC is therefore a community detection method for sequence similarity networks based entirely on pair similarity information. A serial implementation of ACDC is available from https://cmb.ornl.gov/resources/developments« less
Evaluating the role of coherent delocalized phonon-like modes in DNA cyclization
DOE Office of Scientific and Technical Information (OSTI.GOV)
Alexandrov, Ludmil B.; Rasmussen, Kim Ã.; Bishop, Alan R.
The innate flexibility of a DNA sequence is quantified by the Jacobson-Stockmayer’s J-factor, which measures the propensity for DNA loop formation. Recent studies of ultra-short DNA sequences revealed a discrepancy of up to six orders of magnitude between experimentally measured and theoretically predicted J-factors. These large differences suggest that, in addition to the elastic moduli of the double helix, other factors contribute to loop formation. We develop a new theoretical model that explores how coherent delocalized phonon-like modes in DNA provide single-stranded ”flexible hinges” to assist in loop formation. We also combine the Czapla-Swigon-Olson structural model of DNA with ourmore » extended Peyrard-Bishop-Dauxois model and, without changing any of the parameters of the two models, apply this new computational framework to 86 experimentally characterized DNA sequences. Our results demonstrate that the new computational framework can predict J-factors within an order of magnitude of experimental measurements for most ultra-short DNA sequences, while continuing to accurately describe the J-factors of longer sequences. Furthermore, we demonstrate that our computational framework can be used to describe the cyclization of DNA sequences that contain a base pair mismatch. Overall, our results support the conclusion that coherent delocalized phonon-like modes play an important role in DNA cyclization.« less
Evolution of epigenetic regulation in vertebrate genomes
Lowdon, Rebecca F.; Jang, Hyo Sik; Wang, Ting
2016-01-01
Empirical models of sequence evolution have spurred progress in the field of evolutionary genetics for decades. We are now realizing the importance and complexity of the eukaryotic epigenome. While epigenome analysis has been applied to genomes from single cell eukaryotes to human, comparative analyses are still relatively few, and computational algorithms to quantify epigenome evolution remain scarce. Accordingly, a quantitative model of epigenome evolution remains to be established. Here we review the comparative epigenomics literature and synthesize its overarching themes. We also suggest one mechanism, transcription factor binding site turnover, which relates sequence evolution to epigenetic conservation or divergence. Lastly, we propose a framework for how the field can move forward to build a coherent quantitative model of epigenome evolution. PMID:27080453
Luo, Si-Wei; Liang, Zhi; Wu, Jia-Rui
2017-01-01
Quantitatively detecting correlations of multiple protein-protein interactions (PPIs) in vivo is a big challenge. Here we introduce a novel method, termed Protein-interactome Footprinting (PiF), to simultaneously measure multiple PPIs in one cell. The principle of PiF is that each target physical PPI in the interactome is simultaneously transcoded into a specific DNA sequence based on dimerization of the target proteins fused with DNA-binding domains. The interaction intensity of each target protein is quantified as the copy number of the specific DNA sequences bound by each fusion protein dimers. Using PiF, we quantitatively reveal dynamic patterns of PPIs and their correlation network in E. coli two-component systems. PMID:28338015
Bauer, Jan Stefan; Noël, Peter Benjamin; Vollhardt, Christiane; Much, Daniela; Degirmenci, Saliha; Brunner, Stefanie; Rummeny, Ernst Josef; Hauner, Hans
2015-01-01
Purpose MR might be well suited to obtain reproducible and accurate measures of fat tissues in infants. This study evaluates MR-measurements of adipose tissue in young infants in vitro and in vivo. Material and Methods MR images of ten phantoms simulating subcutaneous fat of an infant’s torso were obtained using a 1.5T MR scanner with and without simulated breathing. Scans consisted of a cartesian water-suppression turbo spin echo (wsTSE) sequence, and a PROPELLER wsTSE sequence. Fat volume was quantified directly and by MR imaging using k-means clustering and threshold-based segmentation procedures to calculate accuracy in vitro. Whole body MR was obtained in sleeping young infants (average age 67±30 days). This study was approved by the local review board. All parents gave written informed consent. To obtain reproducibility in vivo, cartesian and PROPELLER wsTSE sequences were repeated in seven and four young infants, respectively. Overall, 21 repetitions were performed for the cartesian sequence and 13 repetitions for the PROPELLER sequence. Results In vitro accuracy errors depended on the chosen segmentation procedure, ranging from 5.4% to 76%, while the sequence showed no significant influence. Artificial breathing increased the minimal accuracy error to 9.1%. In vivo reproducibility errors for total fat volume of the sleeping infants ranged from 2.6% to 3.4%. Neither segmentation nor sequence significantly influenced reproducibility. Conclusion With both cartesian and PROPELLER sequences an accurate and reproducible measure of body fat was achieved. Adequate segmentation was mandatory for high accuracy. PMID:25706876
Bauer, Jan Stefan; Noël, Peter Benjamin; Vollhardt, Christiane; Much, Daniela; Degirmenci, Saliha; Brunner, Stefanie; Rummeny, Ernst Josef; Hauner, Hans
2015-01-01
MR might be well suited to obtain reproducible and accurate measures of fat tissues in infants. This study evaluates MR-measurements of adipose tissue in young infants in vitro and in vivo. MR images of ten phantoms simulating subcutaneous fat of an infant's torso were obtained using a 1.5T MR scanner with and without simulated breathing. Scans consisted of a cartesian water-suppression turbo spin echo (wsTSE) sequence, and a PROPELLER wsTSE sequence. Fat volume was quantified directly and by MR imaging using k-means clustering and threshold-based segmentation procedures to calculate accuracy in vitro. Whole body MR was obtained in sleeping young infants (average age 67±30 days). This study was approved by the local review board. All parents gave written informed consent. To obtain reproducibility in vivo, cartesian and PROPELLER wsTSE sequences were repeated in seven and four young infants, respectively. Overall, 21 repetitions were performed for the cartesian sequence and 13 repetitions for the PROPELLER sequence. In vitro accuracy errors depended on the chosen segmentation procedure, ranging from 5.4% to 76%, while the sequence showed no significant influence. Artificial breathing increased the minimal accuracy error to 9.1%. In vivo reproducibility errors for total fat volume of the sleeping infants ranged from 2.6% to 3.4%. Neither segmentation nor sequence significantly influenced reproducibility. With both cartesian and PROPELLER sequences an accurate and reproducible measure of body fat was achieved. Adequate segmentation was mandatory for high accuracy.
Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome
Margulies, Elliott H.; Cooper, Gregory M.; Asimenos, George; Thomas, Daryl J.; Dewey, Colin N.; Siepel, Adam; Birney, Ewan; Keefe, Damian; Schwartz, Ariel S.; Hou, Minmei; Taylor, James; Nikolaev, Sergey; Montoya-Burgos, Juan I.; Löytynoja, Ari; Whelan, Simon; Pardi, Fabio; Massingham, Tim; Brown, James B.; Bickel, Peter; Holmes, Ian; Mullikin, James C.; Ureta-Vidal, Abel; Paten, Benedict; Stone, Eric A.; Rosenbloom, Kate R.; Kent, W. James; Bouffard, Gerard G.; Guan, Xiaobin; Hansen, Nancy F.; Idol, Jacquelyn R.; Maduro, Valerie V.B.; Maskeri, Baishali; McDowell, Jennifer C.; Park, Morgan; Thomas, Pamela J.; Young, Alice C.; Blakesley, Robert W.; Muzny, Donna M.; Sodergren, Erica; Wheeler, David A.; Worley, Kim C.; Jiang, Huaiyang; Weinstock, George M.; Gibbs, Richard A.; Graves, Tina; Fulton, Robert; Mardis, Elaine R.; Wilson, Richard K.; Clamp, Michele; Cuff, James; Gnerre, Sante; Jaffe, David B.; Chang, Jean L.; Lindblad-Toh, Kerstin; Lander, Eric S.; Hinrichs, Angie; Trumbower, Heather; Clawson, Hiram; Zweig, Ann; Kuhn, Robert M.; Barber, Galt; Harte, Rachel; Karolchik, Donna; Field, Matthew A.; Moore, Richard A.; Matthewson, Carrie A.; Schein, Jacqueline E.; Marra, Marco A.; Antonarakis, Stylianos E.; Batzoglou, Serafim; Goldman, Nick; Hardison, Ross; Haussler, David; Miller, Webb; Pachter, Lior; Green, Eric D.; Sidow, Arend
2007-01-01
A key component of the ongoing ENCODE project involves rigorous comparative sequence analyses for the initially targeted 1% of the human genome. Here, we present orthologous sequence generation, alignment, and evolutionary constraint analyses of 23 mammalian species for all ENCODE targets. Alignments were generated using four different methods; comparisons of these methods reveal large-scale consistency but substantial differences in terms of small genomic rearrangements, sensitivity (sequence coverage), and specificity (alignment accuracy). We describe the quantitative and qualitative trade-offs concomitant with alignment method choice and the levels of technical error that need to be accounted for in applications that require multisequence alignments. Using the generated alignments, we identified constrained regions using three different methods. While the different constraint-detecting methods are in general agreement, there are important discrepancies relating to both the underlying alignments and the specific algorithms. However, by integrating the results across the alignments and constraint-detecting methods, we produced constraint annotations that were found to be robust based on multiple independent measures. Analyses of these annotations illustrate that most classes of experimentally annotated functional elements are enriched for constrained sequences; however, large portions of each class (with the exception of protein-coding sequences) do not overlap constrained regions. The latter elements might not be under primary sequence constraint, might not be constrained across all mammals, or might have expendable molecular functions. Conversely, 40% of the constrained sequences do not overlap any of the functional elements that have been experimentally identified. Together, these findings demonstrate and quantify how many genomic functional elements await basic molecular characterization. PMID:17567995
Fragmentation of whole-transcriptome RNA using E. coli RNase III.
Ares, Manuel
2013-05-01
High-throughput sequencing (HTS) methods can provide short sequence reads for many millions of individual molecules in a sample, allowing the use of sequencing to measure the abundance of RNA molecules. To quantify the amount of a particular sequence in a sample of large RNAs (e.g., mRNAs), it is important to fragment the RNA into short pieces that can be ligated to oligonucleotides that allow polymerase chain reaction (PCR) amplification and sequencing. The most desired end structure of RNA for such ligation steps is a 5' phosphate and a 3' OH. Thus, enzymes that leave these groups after cleavage are of particular utility, avoiding the need to dephosphorylate the 3' end with phosphatases or phosphorylate the 5' end with kinase before proceeding. One such enzyme, RNase III, is widely available. Although it primarily cuts duplex RNA, this specificity is salt- and concentration-dependent, and many RNAs that lack strong extended duplexes are nonetheless susceptible to cleavage at many spots. RNA fragmentation by RNase III does not seem to grossly affect the distribution of RNA sequencing reads. Thus, it has become a standard method for creating nominally representative pools of transcriptome sequences with 5' phosphates and 3' OH for library construction. Three steps in preparing fragmented transcriptome RNA for sequencing library construction are described here: (1) fragmenting the RNA with RNase III to the extent that ~60-100-nucleotide fragments are created, (2) purifying the RNA from the RNase III reaction, and (3) analyzing the digestion products for their suitability in library production.
The effectiveness of position- and composition-specific gap costs for protein similarity searches.
Stojmirović, Aleksandar; Gertz, E Michael; Altschul, Stephen F; Yu, Yi-Kuo
2008-07-01
The flexibility in gap cost enjoyed by hidden Markov models (HMMs) is expected to afford them better retrieval accuracy than position-specific scoring matrices (PSSMs). We attempt to quantify the effect of more general gap parameters by separately examining the influence of position- and composition-specific gap scores, as well as by comparing the retrieval accuracy of the PSSMs constructed using an iterative procedure to that of the HMMs provided by Pfam and SUPERFAMILY, curated ensembles of multiple alignments. We found that position-specific gap penalties have an advantage over uniform gap costs. We did not explore optimizing distinct uniform gap costs for each query. For Pfam, PSSMs iteratively constructed from seeds based on HMM consensus sequences perform equivalently to HMMs that were adjusted to have constant gap transition probabilities, albeit with much greater variance. We observed no effect of composition-specific gap costs on retrieval performance. These results suggest possible improvements to the PSI-BLAST protein database search program. The scripts for performing evaluations are available upon request from the authors.
Gd-EOB-DTPA-Enhanced MR Guidance in Thermal Ablation of Liver Malignancies
Rosenberg, Christian; Jahn, Andrea; Pickartz, Tilman; Wahnschaffe, Ulrich; Patrzyk, Maciej; Hosten, Norbert
2014-01-01
Objective To evaluate the potency of Gd-EOB-DTPA to support hepatic catheter placement in laser ablation procedures by quantifying time-dependent delineation effects for instrumentation and target tumor within liver parenchyma. Monitoring potential influence on online MR thermometry during the ablation procedure is a secondary aim. Materials and Methods 30 cases of MR-guided laser ablation were performed after i.v. bolus injection of gadoxetic acid (0.025 mmol/Kg Gd-EOB-DTPA; Bayer Healthcare, Berlin, Germany). T1-weighted GRE sequences were used for applicator guidance (FLASH 3D) in the catheter placement phase and for therapy monitoring (FLASH 2D) in the therapy phase. SNR and consecutive CNR values were measured for elements of interest plotted over time both for catheter placement and therapy phase and compared with a non-contrast control group of 19 earlier cases. Statistical analysis was realized using the paired Wilcoxon test. Results Sustainable signal elevation of liver parenchyma in the contrast-enhanced group was sufficient to silhouette both target tumor and applicator against the liver. Differences in time dependent CNR alteration were highly significant between contrast-enhanced and non-contrast interventions for parenchyma and target on the one hand (p = 0.020) and parenchyma and instrument on the other hand (p = 0.002). Effects lasted for the whole procedure (monitoring up to 60 min) and were specific for the contrast-enhanced group. Contrasting maxima were seen after median 30 (applicator) and 38 (tumor) minutes, in the potential core time of a multineedle procedure. Contrast influence on T1 thermometry for real-time monitoring of thermal impact was not significant (p = 0.068–0.715). Conclusion Results strongly support anticipated promotive effects of Gd-EOB-DTPA for MR-guided percutaneous liver interventions by proving and quantifying the delineating effects for therapy-relevant elements in the procedure. Time benefit, cost effectiveness and oncologic outcome of the described beneficiary effects will have to be part of further investigations. PMID:25541950
Gd-EOB-DTPA-enhanced MR guidance in thermal ablation of liver malignancies.
Rosenberg, Christian; Jahn, Andrea; Pickartz, Tilman; Wahnschaffe, Ulrich; Patrzyk, Maciej; Hosten, Norbert
2014-01-01
To evaluate the potency of Gd-EOB-DTPA to support hepatic catheter placement in laser ablation procedures by quantifying time-dependent delineation effects for instrumentation and target tumor within liver parenchyma. Monitoring potential influence on online MR thermometry during the ablation procedure is a secondary aim. 30 cases of MR-guided laser ablation were performed after i.v. bolus injection of gadoxetic acid (0.025 mmol/Kg Gd-EOB-DTPA; Bayer Healthcare, Berlin, Germany). T1-weighted GRE sequences were used for applicator guidance (FLASH 3D) in the catheter placement phase and for therapy monitoring (FLASH 2D) in the therapy phase. SNR and consecutive CNR values were measured for elements of interest plotted over time both for catheter placement and therapy phase and compared with a non-contrast control group of 19 earlier cases. Statistical analysis was realized using the paired Wilcoxon test. Sustainable signal elevation of liver parenchyma in the contrast-enhanced group was sufficient to silhouette both target tumor and applicator against the liver. Differences in time dependent CNR alteration were highly significant between contrast-enhanced and non-contrast interventions for parenchyma and target on the one hand (p = 0.020) and parenchyma and instrument on the other hand (p = 0.002). Effects lasted for the whole procedure (monitoring up to 60 min) and were specific for the contrast-enhanced group. Contrasting maxima were seen after median 30 (applicator) and 38 (tumor) minutes, in the potential core time of a multineedle procedure. Contrast influence on T1 thermometry for real-time monitoring of thermal impact was not significant (p = 0.068-0.715). Results strongly support anticipated promotive effects of Gd-EOB-DTPA for MR-guided percutaneous liver interventions by proving and quantifying the delineating effects for therapy-relevant elements in the procedure. Time benefit, cost effectiveness and oncologic outcome of the described beneficiary effects will have to be part of further investigations.
Jenkinson, Garrett; Abante, Jordi; Feinberg, Andrew P; Goutsias, John
2018-03-07
DNA methylation is a stable form of epigenetic memory used by cells to control gene expression. Whole genome bisulfite sequencing (WGBS) has emerged as a gold-standard experimental technique for studying DNA methylation by producing high resolution genome-wide methylation profiles. Statistical modeling and analysis is employed to computationally extract and quantify information from these profiles in an effort to identify regions of the genome that demonstrate crucial or aberrant epigenetic behavior. However, the performance of most currently available methods for methylation analysis is hampered by their inability to directly account for statistical dependencies between neighboring methylation sites, thus ignoring significant information available in WGBS reads. We present a powerful information-theoretic approach for genome-wide modeling and analysis of WGBS data based on the 1D Ising model of statistical physics. This approach takes into account correlations in methylation by utilizing a joint probability model that encapsulates all information available in WGBS methylation reads and produces accurate results even when applied on single WGBS samples with low coverage. Using the Shannon entropy, our approach provides a rigorous quantification of methylation stochasticity in individual WGBS samples genome-wide. Furthermore, it utilizes the Jensen-Shannon distance to evaluate differences in methylation distributions between a test and a reference sample. Differential performance assessment using simulated and real human lung normal/cancer data demonstrate a clear superiority of our approach over DSS, a recently proposed method for WGBS data analysis. Critically, these results demonstrate that marginal methods become statistically invalid when correlations are present in the data. This contribution demonstrates clear benefits and the necessity of modeling joint probability distributions of methylation using the 1D Ising model of statistical physics and of quantifying methylation stochasticity using concepts from information theory. By employing this methodology, substantial improvement of DNA methylation analysis can be achieved by effectively taking into account the massive amount of statistical information available in WGBS data, which is largely ignored by existing methods.
Quantifiers are incrementally interpreted in context, more than less
Urbach, Thomas P.; DeLong, Katherine A.; Kutas, Marta
2015-01-01
Language interpretation is often assumed to be incremental. However, our studies of quantifier expressions in isolated sentences found N400 event-related brain potential (ERP) evidence for partial but not full immediate quantifier interpretation (Urbach & Kutas, 2010). Here we tested similar quantifier expressions in pragmatically supporting discourse contexts (Alex was an unusual toddler. Most/Few kids prefer sweets/vegetables…) while participants made plausibility judgments (Experiment 1) or read for comprehension (Experiment 2). Control Experiments 3A (plausibility) and 3B (comprehension) removed the discourse contexts. Quantifiers always modulated typical and/or atypical word N400 amplitudes. However, only the real-time N400 effects only in Experiment 2 mirrored offline quantifier and typicality crossover interaction effects for plausibility ratings and cloze probabilities. We conclude that quantifier expressions can be interpreted fully and immediately, though pragmatic and task variables appear to impact the speed and/or depth of quantifier interpretation. PMID:26005285
Numerical modelling of the formation of fibrous bedding-parallel veins
NASA Astrophysics Data System (ADS)
Torremans, Koen; Muchez, Philippe; Sintubin, Manuel
2014-05-01
Bedding-parallel veins with a fibrous infill oriented orthogonal to the vein wall, are often observed in fine-grained metasedimentary sequences. Several mechanisms have been proposed for their formation, mostly with respect to effects of fluid overpressures and anisotropy of the host-rock fabric in order to explain the inferred extensional failure with sub-vertical opening. Abundant pre-folding, bedding-parallel fibrous dolomite veins are found associated with the Nkana-Mindola stratiform Cu-Co deposit in Zambia. The goal of this study is to better understand the formation mechanisms of these veins and to explain their particular spatial and thickness distribution, with respect to failure of transversely isotropic rocks. The spatial distribution and thickness variation of these veins was quantified during a field campaign in thirteen line transects perpendicular to undeformed veins in underground crosscuts. The fibrous dolomite veins studied are not related to lithological contrasts, but to a strong bedding-parallel shaly fabric, typical for the black shale facies of the Copperbelt Orebody Member. The host rock can hence be considered as transversely isotropic. Growth morphologies vary from antitaxial with a pronounced median surface to asymmetric syntaxial, always with small but quantifiable growth competition. A microstructural fabric study reveals that the undeformed dolomite veins show low-tortuosity vein walls and quantifiable growth competition. Here, we use a Discrete Element Method numerical modelling approach with ESyS-Particle (http://launchpad.net/esys-particle) to simulate the observed properties of the veins. Calibrated numerical specimens with a transversely isotropic matrix are repeatedly brought to failure under constant strain rates by changing the effective strain rates at model boundaries. After each fracture event, fractures in the numerical model are filled with cohesive vein material and the experiment is repeated. By systematically varying stress states, fluid pressures and mechanical properties of materials (host rock, vein infill and interface), we attempt to reproduce the characteristics of spatial distribution and thickness variation of the veins. Four parameter sets of mechanical micro-properties are defined in the models, essentially yielding (1) a competent and (2) incompetent matrix, (3) a vein material and (4) a vein-matrix interface. Each combination of parameters and particle packings is calibrated to fit a predetermined Mohr-Coulomb type failure envelope, via an automated calibration procedure. Preliminary tests already show that by varying these parameters, we are able to simulate realistically distributed cracking through crack-seal processes. Different types of veins and vein generations can be modelled, ranging from single veins, over crack-seal veins to anastomosing veins, by varying the mechanical strength of competent and incompetent matrix, vein and interface material. Further results of this approach will be presented. We will discuss our results with respect to mechanisms proposed in the literature for bedding-parallel, fibrous veins in metasedimentary rock sequences.
NASA Astrophysics Data System (ADS)
Cheng, Ryan; Morcos, Faruck; Levine, Herbert; Onuchic, Jose
2014-03-01
An important challenge in biology is to distinguish the subset of residues that allow bacterial two-component signaling (TCS) proteins to preferentially interact with their correct TCS partner such that they can bind and transfer signal. Detailed knowledge of this information would allow one to search sequence-space for mutations that can systematically tune the signal transmission between TCS partners as well as re-encode a TCS protein to preferentially transfer signals to a non-partner. Motivated by the notion that this detailed information is found in sequence data, we explore the mutual sequence co-evolution between signaling partners to infer how mutations can positively or negatively alter their interaction. Using Direct Coupling Analysis (DCA) for determining evolutionarily conserved interprotein interactions, we apply a DCA-based metric to quantify mutational changes in the interaction between TCS proteins and demonstrate that it accurately correlates with experimental mutagenesis studies probing the mutational change in the in vitro phosphotransfer. Our methodology serves as a potential framework for the rational design of TCS systems as well as a framework for the system-level study of protein-protein interactions in sequence-rich systems. This research has been supported by the NSF INSPIRE award MCB-1241332 and by the CTBP sponsored by the NSF (Grant PHY-1308264).
Bartram, Jack; Mountjoy, Edward; Brooks, Tony; Hancock, Jeremy; Williamson, Helen; Wright, Gary; Moppett, John; Goulden, Nick; Hubank, Mike
2016-07-01
High-throughput sequencing (HTS) (next-generation sequencing) of the rearranged Ig and T-cell receptor genes promises to be less expensive and more sensitive than current methods of monitoring minimal residual disease (MRD) in patients with acute lymphoblastic leukemia. However, the adoption of new approaches by clinical laboratories requires careful evaluation of all potential sources of error and the development of strategies to ensure the highest accuracy. Timely and efficient clinical use of HTS platforms will depend on combining multiple samples (multiplexing) in each sequencing run. Here we examine the Ig heavy-chain gene HTS on the Illumina MiSeq platform for MRD. We identify errors associated with multiplexing that could potentially impact the accuracy of MRD analysis. We optimize a strategy that combines high-purity, sequence-optimized oligonucleotides, dual indexing, and an error-aware demultiplexing approach to minimize errors and maximize sensitivity. We present a probability-based, demultiplexing pipeline Error-Aware Demultiplexer that is suitable for all MiSeq strategies and accurately assigns samples to the correct identifier without excessive loss of data. Finally, using controls quantified by digital PCR, we show that HTS-MRD can accurately detect as few as 1 in 10(6) copies of specific leukemic MRD. Crown Copyright © 2016. Published by Elsevier Inc. All rights reserved.
Modelling dynamics with context-free grammars
NASA Astrophysics Data System (ADS)
García-Huerta, Juan-M.; Jiménez-Hernández, Hugo; Herrera-Navarro, Ana-M.; Hernández-Díaz, Teresa; Terol-Villalobos, Ivan
2014-03-01
This article presents a strategy to model the dynamics performed by vehicles in a freeway. The proposal consists on encode the movement as a set of finite states. A watershed-based segmentation is used to localize regions with high-probability of motion. Each state represents a proportion of a camera projection in a two-dimensional space, where each state is associated to a symbol, such that any combination of symbols is expressed as a language. Starting from a sequence of symbols through a linear algorithm a free-context grammar is inferred. This grammar represents a hierarchical view of common sequences observed into the scene. Most probable grammar rules express common rules associated to normal movement behavior. Less probable rules express themselves a way to quantify non-common behaviors and they might need more attention. Finally, all sequences of symbols that does not match with the grammar rules, may express itself uncommon behaviors (abnormal). The grammar inference is built with several sequences of images taken from a freeway. Testing process uses the sequence of symbols emitted by the scenario, matching the grammar rules with common freeway behaviors. The process of detect abnormal/normal behaviors is managed as the task of verify if any word generated by the scenario is recognized by the grammar.
Hu, Jiazhi; Meyers, Robin M; Dong, Junchao; Panchakshari, Rohit A; Alt, Frederick W; Frock, Richard L
2016-05-01
Unbiased, high-throughput assays for detecting and quantifying DNA double-stranded breaks (DSBs) across the genome in mammalian cells will facilitate basic studies of the mechanisms that generate and repair endogenous DSBs. They will also enable more applied studies, such as those to evaluate the on- and off-target activities of engineered nucleases. Here we describe a linear amplification-mediated high-throughput genome-wide sequencing (LAM-HTGTS) method for the detection of genome-wide 'prey' DSBs via their translocation in cultured mammalian cells to a fixed 'bait' DSB. Bait-prey junctions are cloned directly from isolated genomic DNA using LAM-PCR and unidirectionally ligated to bridge adapters; subsequent PCR steps amplify the single-stranded DNA junction library in preparation for Illumina Miseq paired-end sequencing. A custom bioinformatics pipeline identifies prey sequences that contribute to junctions and maps them across the genome. LAM-HTGTS differs from related approaches because it detects a wide range of broken end structures with nucleotide-level resolution. Familiarity with nucleic acid methods and next-generation sequencing analysis is necessary for library generation and data interpretation. LAM-HTGTS assays are sensitive, reproducible, relatively inexpensive, scalable and straightforward to implement with a turnaround time of <1 week.
Insights in connecting phenotypes in bacteria to coevolutionary information
NASA Astrophysics Data System (ADS)
Cheng, Ryan; Morcos, Faruck; Hayes, Ryan; Helm, Rodney; Levine, Herbert; Onuchic, Jose
It has long been known that protein sequences are far from random. These sequences have been evolutionarily selected to maintain their ability to fold into stable, three-dimensional folded structures as well as their ability to form macromolecular assemblies, perform catalytic functions, etc. For these reasons, there exist quantifiable mutational patterns in the collection of sequence data for a protein family arising from the need to maintain favorable residue-residue interactions to facilitate folding as well as cellular function. Here, we focus on studying the correlated mutational patterns that give rise to interaction specificity in bacterial two-component signaling (TCS) systems. TCS proteins have evolved to be able to preferentially bind and transfer a phosphate group to their signaling partner while avoiding phosphotransfer with non-partners. We infer a Potts model Hamiltonian governing the correlated mutational patterns that are observed in the sequence data of TCS partners and apply this model to recently published in vivo mutational data. Our findings further support the notion that statistical models built from sequence data can be used to predict bacterial phenotypes as well as engineer interaction specificity between non-partner TCS proteins. This research has been supported by the NSF INSPIRE Award (MCB-1241332) and by the CTBP sponsored by the NSF (Grant PHY- 1427654).
cyclostratigraphy, sequence stratigraphy and organic matter accumulation mechanism
NASA Astrophysics Data System (ADS)
Cong, F.; Li, J.
2016-12-01
The first member of Maokou Formation of Sichuan basin is composed of well preserved carbonate ramp couplets of limestone and marlstone/shale. It acts as one of the potential shale gas source rock, and is suitable for time-series analysis. We conducted time-series analysis to identify high-frequency sequences, reconstruct high-resolution sedimentation rate, estimate detailed primary productivity for the first time in the study intervals and discuss organic matter accumulation mechanism of source rock under sequence stratigraphic framework.Using the theory of cyclostratigraphy and sequence stratigraphy, the high-frequency sequences of one outcrop profile and one drilling well are identified. Two third-order sequences and eight fourth-order sequences are distinguished on outcrop profile based on the cycle stacking patterns. For drilling well, sequence boundary and four system tracts is distinguished by "integrated prediction error filter analysis" (INPEFA) of Gamma-ray logging data, and eight fourth-order sequences is identified by 405ka long eccentricity curve in depth domain which is quantified and filtered by integrated analysis of MTM spectral analysis, evolutive harmonic analysis (EHA), evolutive average spectral misfit (eASM) and band-pass filtering. It suggests that high-frequency sequences correlate well with Milankovitch orbital signals recorded in sediments, and it is applicable to use cyclostratigraphy theory in dividing high-frequency(4-6 orders) sequence stratigraphy.High-resolution sedimentation rate is reconstructed through the study interval by tracking the highly statistically significant short eccentricity component (123ka) revealed by EHA. Based on sedimentation rate, measured TOC and density data, the burial flux, delivery flux and primary productivity of organic carbon was estimated. By integrating redox proxies, we can discuss the controls on organic matter accumulation by primary production and preservation under the high-resolution sequence stratigraphic framework. Results show that high average organic carbon contents in the study interval are mainly attributed to high primary production. The results also show a good correlation between high organic carbon accumulation and intervals of transgression.
Auty, Harriet; Cleaveland, Sarah; Malele, Imna; Masoy, Joseph; Lembo, Tiziana; Bessell, Paul; Torr, Stephen; Picozzi, Kim; Welburn, Susan C.
2016-01-01
Background Identifying hosts of blood-feeding insect vectors is crucial in understanding their role in disease transmission. Rhodesian human African trypanosomiasis (rHAT), also known as acute sleeping sickness is caused by Trypanosoma brucei rhodesiense and transmitted by tsetse flies. The disease is commonly associated with wilderness areas of east and southern Africa. Such areas hold a diverse range of species which form communities of hosts for disease maintenance. The relative importance of different wildlife hosts remains unclear. This study quantified tsetse feeding preferences in a wilderness area of great host species richness, Serengeti National Park, Tanzania, assessing tsetse feeding and host density contemporaneously. Methods Glossina swynnertoni and G. pallidipes were collected from six study sites. Bloodmeal sources were identified through matching Cytochrome B sequences amplified from bloodmeals from recently fed flies to published sequences. Densities of large mammal species in each site were quantified, and feeding indices calculated to assess the relative selection or avoidance of each host species by tsetse. Results The host species most commonly identified in G. swynnertoni bloodmeals, warthog (94/220), buffalo (48/220) and giraffe (46/220), were found at relatively low densities (3-11/km2) and fed on up to 15 times more frequently than expected by their relative density. Wildebeest, zebra, impala and Thomson’s gazelle, found at the highest densities, were never identified in bloodmeals. Commonly identified hosts for G. pallidipes were buffalo (26/46), giraffe (9/46) and elephant (5/46). Conclusions This study is the first to quantify tsetse host range by molecular analysis of tsetse diet with simultaneous assessment of host density in a wilderness area. Although G. swynnertoni and G. pallidipes can feed on a range of species, they are highly selective. Many host species are rarely fed on, despite being present in areas where tsetse are abundant. These feeding patterns, along with the ability of key host species to maintain and transmit T. b. rhodesiense, drive the epidemiology of rHAT in wilderness areas. PMID:27706167
A computerized MRI biomarker quantification scheme for a canine model of Duchenne muscular dystrophy
Wang, Jiahui; Fan, Zheng; Vandenborne, Krista; Walter, Glenn; Shiloh-Malawsky, Yael; An, Hongyu; Kornegay, Joe N.; Styner, Martin A.
2015-01-01
Purpose Golden retriever muscular dystrophy (GRMD) is a widely used canine model of Duchenne muscular dystrophy (DMD). Recent studies have shown that magnetic resonance imaging (MRI) can be used to non-invasively detect consistent changes in both DMD and GRMD. In this paper, we propose a semi-automated system to quantify MRI biomarkers of GRMD. Methods Our system was applied to a database of 45 MRI scans from 8 normal and 10 GRMD dogs in a longitudinal natural history study. We first segmented six proximal pelvic limb muscles using two competing schemes: 1) standard, limited muscle range segmentation and 2) semi-automatic full muscle segmentation. We then performed pre-processing, including: intensity inhomogeneity correction, spatial registration of different image sequences, intensity calibration of T2-weighted (T2w) and T2-weighted fat suppressed (T2fs) images, and calculation of MRI biomarker maps. Finally, for each of the segmented muscles, we automatically measured MRI biomarkers of muscle volume and intensity statistics over MRI biomarker maps, and statistical image texture features. Results The muscle volume and the mean intensities in T2 value, fat, and water maps showed group differences between normal and GRMD dogs. For the statistical texture biomarkers, both the histogram and run-length matrix features showed obvious group differences between normal and GRMD dogs. The full muscle segmentation shows significantly less error and variability in the proposed biomarkers when compared to the standard, limited muscle range segmentation. Conclusion The experimental results demonstrated that this quantification tool can reliably quantify MRI biomarkers in GRMD dogs, suggesting that it would also be useful for quantifying disease progression and measuring therapeutic effect in DMD patients. PMID:23299128
The Alphabet Soup of HIV Reservoir Markers.
Sharaf, Radwa R; Li, Jonathan Z
2017-04-01
Despite the success of antiretroviral therapy in suppressing HIV, life-long therapy is required to avoid HIV reactivation from long-lived viral reservoirs. Currently, there is intense interest in searching for therapeutic interventions that can purge the viral reservoir to achieve complete remission in HIV patients off antiretroviral therapy. The evaluation of such interventions relies on our ability to accurately and precisely measure the true size of the viral reservoir. In this review, we assess the most commonly used HIV reservoir assays, as a clear understanding of the strengths and weaknesses of each is vital for the accurate interpretation of results and for the development of improved assays. The quantification of intracellular or plasma HIV RNA or DNA levels remains the most commonly used tests for the characterization of the viral reservoir. While cost-effective and high-throughput, these assays are not able to differentiate between replication-competent or defective fractions or quantify the number of infected cells. Viral outgrowth assays provide a lower bound for the fraction of cells that can produce infectious virus, but these assays are laborious, expensive and substantially underestimate the potential reservoir of replication-competent provirus. Newer assays are now available that seek to overcome some of these problems, including full-length proviral sequencing, inducible HIV RNA assays, ultrasensitive p24 assays and murine adoptive transfer techniques. The development and evaluation of strategies for HIV remission rely upon our ability to accurately and precisely quantify the size of the remaining viral reservoir. At this time, all current HIV reservoir assays have drawbacks such that combinations of assays are generally needed to gain a more comprehensive view of the viral reservoir. The development of novel, rapid, high-throughput assays that can sensitively quantify the levels of the replication-competent HIV reservoir is still needed.
Tills, Oliver; Bitterli, Tabitha; Culverhouse, Phil; Spicer, John I; Rundle, Simon
2013-02-01
Motion analysis is one of the tools available to biologists to extract biologically relevant information from image datasets and has been applied to a diverse range of organisms. The application of motion analysis during early development presents a challenge, as embryos often exhibit complex, subtle and diverse movement patterns. A method of motion analysis able to holistically quantify complex embryonic movements could be a powerful tool for fields such as toxicology and developmental biology to investigate whole organism stress responses. Here we assessed whether motion analysis could be used to distinguish the effects of stressors on three early developmental stages of each of three species: (i) the zebrafish Danio rerio (stages 19 h, 21.5 h and 33 h exposed to 1.5% ethanol and a salinity of 5); (ii) the African clawed toad Xenopus laevis (stages 24, 32 and 34 exposed to a salinity of 20); and iii) the pond snail Radix balthica (stages E3, E4, E6, E9 and E11 exposed to salinities of 5, 10 and 15). Image sequences were analysed using Sparse Optic Flow and the resultant frame-to-frame motion parameters were analysed using Discrete Fourier Transform to quantify the distribution of energy at different frequencies. This spectral frequency dataset was then used to construct a Bray-Curtis similarity matrix and differences in movement patterns between embryos in this matrix were tested for using ANOSIM. Spectral frequency analysis of these motion parameters was able to distinguish stage-specific effects of environmental stressors in most cases, including Xenopus laevis at stages 24, 32 and 34 exposed to a salinity of 20, Danio rerio at 33 hpf exposed to 1.5% ethanol, and Radix balthica at stages E4, E9 and E11 exposed to salinities of 5, 10 and 15. This technique was better able to distinguish embryos exposed to stressors than analysis of manual quantification of movement and within species distinguished most of the developmental stages studied in the control treatments. This innovative use of motion analysis incorporates data quantifying embryonic movements at a range of frequencies and so provides an holistic analysis of an embryo's movement patterns. This technique has potential applications for quantifying embryonic responses to environmental stressors such as exposure to pharmaceuticals or pollutants, and also as an automated tool for developmental staging of embryos.
Effect of continuous vs pulsed iontophoresis of treprostinil on skin blood flow.
Kotzki, Sylvain; Roustit, Matthieu; Arnaud, Claire; Godin-Ribuot, Diane; Cracowski, Jean-Luc
2015-05-25
Systemic sclerosis (SSc) is a rare disease affecting digital microcirculation, leading to finger ulcers and in some cases to amputation. Prostacyclin analogues can be used intravenously but their therapeutic effect is counterbalanced by potentially serious vasodilatation-induced side effects. Iontophoresis of treprostinil could be a promising local therapeutic alternative for SSc-related digital ulcers. Iontophoretic drug delivery is complex, and whether continuous or periodic current should be used remains debated. The objective of the present work is to compare the effect of continuous vs pulsed iontophoresis of treprostinil in rats. Treprostinil (0.64 mM and 0.064 mM) and NaCl were delivered by cathodal iontophoresis onto the hindquarters of anaesthetized rats. Three protocols delivering the same quantity of current were compared: one was continuous (100 μA during 20 min) and two were periodic (B: twenty 1-min cycles with 200 μA during 30 s followed by 30 s Off; and C: twenty 1-min cycles with 600 μA during 10s followed by 50s Off) (n=8 for each protocol with each concentration). Skin blood flow was quantified using laser Doppler imaging and skin resistance was calculated with Ohm's law. All protocols induced a significant increase in skin blood flow. At the lower concentration (0.064 mM treprostinil) the pulsed 10/50 sequence significantly enhanced cutaneous blood flow (Table 1; Fig. 1B) compared to continuous iontophoresis or the 30/30 sequence. We noted that the pulsed iontophoresis of NaCl (10/50 sequence) induced a significant early increase in cutaneous blood flow in comparison with continuous iontophoresis. Skin resistance measures were negatively correlated with current intensity delivered. In conclusion, pulsed iontophoresis of treprostinil with a 10 s/50 s (On/Off) protocol at 600 μA increases the efficacy of iontophoresis at 0.064 mM but not at a tenfold higher concentration. Pulsed iontophoresis could be used to optimize treprostinil iontophoresis, to provide similar efficacy with decreased costs, and should now be tested on humans. Copyright © 2015 Elsevier B.V. All rights reserved.
Evaluation of liver fat in the presence of iron with MRI using T2* correction: a clinical approach.
Henninger, Benjamin; Benjamin, Henninger; Kremser, Christian; Christian, Kremser; Rauch, Stefan; Stefan, Rauch; Eder, Robert; Robert, Eder; Judmaier, Werner; Werner, Judmaier; Zoller, Heinz; Heinz, Zoller; Michaely, Henrik; Henrik, Michaely; Schocke, Michael; Michael, Schocke
2013-06-01
To assess magnetic resonance imaging (MRI) with conventional chemical shift-based sequences with and without T2* correction for the evaluation of steatosis hepatitis (SH) in the presence of iron. Thirty-one patients who underwent MRI and liver biopsy because of clinically suspected diffuse liver disease were retrospectively analysed. The signal intensity (SI) was calculated in co-localised regions of interest (ROIs) using conventional spoiled gradient-echo T1 FLASH in-phase and opposed-phase (IP/OP). T2* relaxation time was recorded in a fat-saturated multi-echo-gradient-echo sequence. The fat fraction (FF) was calculated with non-corrected and T2*-corrected SIs. Results were correlated with liver biopsy. There was significant difference (P < 0.001) between uncorrected and T2* corrected FF in patients with SH and concomitant hepatic iron overload (HIO). Using 5 % as a threshold resulted in eight false negative results with uncorrected FF whereas T2* corrected FF lead to true positive results in 5/8 patients. ROC analysis calculated three threshold values (8.97 %, 5.3 % and 3.92 %) for T2* corrected FF with accuracy 84 %, sensitivity 83-91 % and specificity 63-88 %. FF with T2* correction is accurate for the diagnosis of hepatic fat in the presence of HIO. Findings of our study suggest the use of IP/OP imaging in combination with T2* correction. • Magnetic resonance helps quantify both iron and fat content within the liver • T2* correction helps to predict the correct diagnosis of steatosis hepatitis • "Fat fraction" from T2*-corrected chemical shift-based sequences accurately quantifies hepatic fat • "Fat fraction" without T2* correction underestimates hepatic fat with iron overload.
Comprehensive Analysis of Protein Modifications by Top-down Mass Spectrometry
Zhang, Han; Ge, Ying
2012-01-01
Mass spectrometry (MS)-based proteomics is playing an increasingly important role in cardiovascular research. Proteomics includes not only identification and quantification of proteins, but also the characterization of protein modifications such as post-translational modifications and sequence variants. The conventional bottom-up approach, involving proteolytic digestion of proteins into small peptides prior to MS analysis, is routinely used for protein identification and quantification with high throughput and automation. Nevertheless, it has limitations in the analysis of protein modifications mainly due to the partial sequence coverage and loss of connections among modifications on disparate portions of a protein. An alternative approach, top-down MS, has emerged as a powerful tool for the analysis of protein modifications. The top-down approach analyzes whole proteins directly, providing a “bird’s eye” view of all existing modifications. Subsequently, each modified protein form can be isolated and fragmented in the mass spectrometer to locate the modification site. The incorporation of the non-ergodic dissociation methods such as electron capture dissociation (ECD) greatly enhances the top-down capabilities. ECD is especially useful for mapping labile post-translational modifications which are well-preserved during the ECD fragmentation process. Top-down MS with ECD has been successfully applied to cardiovascular research with the unique advantages in unraveling the molecular complexity, quantifying modified protein forms, complete mapping of modifications with full sequence coverage, discovering unexpected modifications, and identifying and quantifying positional isomers and determining the order of multiple modifications. Nevertheless, top-down MS still needs to overcome some technical challenges to realize its full potential. Herein, we reviewed the advantages and challenges of top-down methodology with a focus on its application in cardiovascular research. PMID:22187450
Properties of Protein Drug Target Classes
Bull, Simon C.; Doig, Andrew J.
2015-01-01
Accurate identification of drug targets is a crucial part of any drug development program. We mined the human proteome to discover properties of proteins that may be important in determining their suitability for pharmaceutical modulation. Data was gathered concerning each protein’s sequence, post-translational modifications, secondary structure, germline variants, expression profile and drug target status. The data was then analysed to determine features for which the target and non-target proteins had significantly different values. This analysis was repeated for subsets of the proteome consisting of all G-protein coupled receptors, ion channels, kinases and proteases, as well as proteins that are implicated in cancer. Machine learning was used to quantify the proteins in each dataset in terms of their potential to serve as a drug target. This was accomplished by first inducing a random forest that could distinguish between its targets and non-targets, and then using the random forest to quantify the drug target likeness of the non-targets. The properties that can best differentiate targets from non-targets were primarily those that are directly related to a protein’s sequence (e.g. secondary structure). Germline variants, expression levels and interactions between proteins had minimal discriminative power. Overall, the best indicators of drug target likeness were found to be the proteins’ hydrophobicities, in vivo half-lives, propensity for being membrane bound and the fraction of non-polar amino acids in their sequences. In terms of predicting potential targets, datasets of proteases, ion channels and cancer proteins were able to induce random forests that were highly capable of distinguishing between targets and non-targets. The non-target proteins predicted to be targets by these random forests comprise the set of the most suitable potential future drug targets, and should therefore be prioritised when building a drug development programme. PMID:25822509
Highly sensitive and unbiased approach for elucidating antibody repertoires
Lin, Sherry G.; Ba, Zhaoqing; Du, Zhou; Zhang, Yu; Hu, Jiazhi; Alt, Frederick W.
2016-01-01
Developing B lymphocytes undergo V(D)J recombination to assemble germ-line V, D, and J gene segments into exons that encode the antigen-binding variable region of Ig heavy (H) and light (L) chains. IgH and IgL chains associate to form the B-cell receptor (BCR), which, upon antigen binding, activates B cells to secrete BCR as an antibody. Each of the huge number of clonally independent B cells expresses a unique set of IgH and IgL variable regions. The ability of V(D)J recombination to generate vast primary B-cell repertoires results from a combinatorial assortment of large numbers of different V, D, and J segments, coupled with diversification of the junctions between them to generate the complementary determining region 3 (CDR3) for antigen contact. Approaches to evaluate in depth the content of primary antibody repertoires and, ultimately, to study how they are further molded by secondary mutation and affinity maturation processes are of great importance to the B-cell development, vaccine, and antibody fields. We now describe an unbiased, sensitive, and readily accessible assay, referred to as high-throughput genome-wide translocation sequencing-adapted repertoire sequencing (HTGTS-Rep-seq), to quantify antibody repertoires. HTGTS-Rep-seq quantitatively identifies the vast majority of IgH and IgL V(D)J exons, including their unique CDR3 sequences, from progenitor and mature mouse B lineage cells via the use of specific J primers. HTGTS-Rep-seq also accurately quantifies DJH intermediates and V(D)J exons in either productive or nonproductive configurations. HTGTS-Rep-seq should be useful for studies of human samples, including clonal B-cell expansions, and also for following antibody affinity maturation processes. PMID:27354528
NASA Astrophysics Data System (ADS)
Wheeler, A.; Freiwald, A.; Hebbeln, D.; Swennen, R.; van Weering, T.; de Haas, H.; Dorschel, B.
2007-12-01
Up to now the carbonate stored in carbonate mounds has not been considered in any global carbonate budget or linked to any global carbon budget involving greenhouse gases. A major challenge exists to quantify the amount and flux of carbon stored by these newly discovered areas of enhanced carbonate accumulation in intermediate water depth. Furthermore, investigations so far reveal that all mounds possess different growth histories depending on the environmental setting and the involved faunal associations. Unfortunately, existing cores only penetrated the upper few meters of the mounds thus limiting mound research to the very late stage of mound development. Access to the longer sequences preserved in giant carbonate mounds was overcome in May 2005 when the IODP Expedition 307 (Porcupine Mound Drilling) recovered complete sedimentary records from one 155 m high "Challenger Mound" in the Porcupine Seabight west off Ireland. Furthermore, EU-FP projects have revealed late stage history of giant mounds in different settings showing that different mounds respond in different ways to environmental forcing factors with no one mound being typical of all. CARBONATE will drill complete sequences through a number of mounds in differing environmental settings using the portable drill rig MeBo (University of Bremen). By understanding how biogeochemical processes control the development of these carbonate mounds and their response to climate change, we will make an important step in quantifying their role as mid-latitude carbonate sinks. In the end, a better understanding of the processes involved in mound formation and development may also result in new views on fossil analogues many of which are less accessible hydrocarbon reservoirs.
Manual Dexterity in Schizophrenia—A Neglected Clinical Marker?
Térémetz, Maxime; Carment, Loïc; Brénugat-Herne, Lindsay; Croca, Marta; Bleton, Jean-Pierre; Krebs, Marie-Odile; Maier, Marc A.; Amado, Isabelle; Lindberg, Påvel G.
2017-01-01
Impaired manual dexterity is commonly observed in schizophrenia. However, a quantitative description of key sensorimotor components contributing to impaired dexterity is lacking. Whether the key components of dexterity are differentially affected and how they relate to clinical characteristics also remains unclear. We quantified the degree of dexterity in 35 stabilized patients with schizophrenia and in 20 age-matched control subjects using four visuomotor tasks: (i) force tracking to quantify visuomotor precision, (ii) sequential finger tapping to measure motor sequence recall, (iii) single-finger tapping to assess temporal regularity, and (iv) multi-finger tapping to measure independence of finger movements. Diverse clinical and neuropsychological tests were also applied. A patient subgroup (N = 15) participated in a 14-week cognitive remediation protocol and was assessed before and after remediation. Compared to control subjects, patients with schizophrenia showed greater error in force tracking, poorer recall of tapping sequences, decreased tapping regularity, and reduced degree of finger individuation. A composite performance measure discriminated patients from controls with sensitivity = 0.79 and specificity = 0.9. Aside from force-tracking error, no other dexterity components correlated with antipsychotic medication. In patients, some dexterity components correlated with neurological soft signs, Positive and Negative Syndrome Scale (PANSS), or neuropsychological scores. This suggests differential cognitive contributions to these components. Cognitive remediation lead to significant improvement in PANSS, tracking error, and sequence recall (without change in medication). These findings show that multiple aspects of sensorimotor control contribute to impaired manual dexterity in schizophrenia. Only visuomotor precision was related to antipsychotic medication. Good diagnostic accuracy and responsiveness to treatment suggest that manual dexterity may represent a useful clinical marker in schizophrenia. PMID:28740470
Krebs, Arnaud R; Dessus-Babus, Sophie; Burger, Lukas; Schübeler, Dirk
2014-09-26
The majority of mammalian promoters are CpG islands; regions of high CG density that require protection from DNA methylation to be functional. Importantly, how sequence architecture mediates this unmethylated state remains unclear. To address this question in a comprehensive manner, we developed a method to interrogate methylation states of hundreds of sequence variants inserted at the same genomic site in mouse embryonic stem cells. Using this assay, we were able to quantify the contribution of various sequence motifs towards the resulting DNA methylation state. Modeling of this comprehensive dataset revealed that CG density alone is a minor determinant of their unmethylated state. Instead, these data argue for a principal role for transcription factor binding sites, a prediction confirmed by testing synthetic mutant libraries. Taken together, these findings establish the hierarchy between the two cis-encoded mechanisms that define the DNA methylation state and thus the transcriptional competence of CpG islands.
Activity Catalog Tool (ACT) user manual, version 2.0
NASA Technical Reports Server (NTRS)
Segal, Leon D.; Andre, Anthony D.
1994-01-01
This report comprises the user manual for version 2.0 of the Activity Catalog Tool (ACT) software program, developed by Leon D. Segal and Anthony D. Andre in cooperation with NASA Ames Aerospace Human Factors Research Division, FLR branch. ACT is a software tool for recording and analyzing sequences of activity over time that runs on the Macintosh platform. It was designed as an aid for professionals who are interested in observing and understanding human behavior in field settings, or from video or audio recordings of the same. Specifically, the program is aimed at two primary areas of interest: human-machine interactions and interactions between humans. The program provides a means by which an observer can record an observed sequence of events, logging such parameters as frequency and duration of particular events. The program goes further by providing the user with a quantified description of the observed sequence, through application of a basic set of statistical routines, and enables merging and appending of several files and more extensive analysis of the resultant data.
Cai, Na; Bigdeli, Tim B; Kretzschmar, Warren W; Li, Yihan; Liang, Jieqin; Hu, Jingchu; Peterson, Roseann E; Bacanu, Silviu; Webb, Bradley Todd; Riley, Brien; Li, Qibin; Marchini, Jonathan; Mott, Richard; Kendler, Kenneth S; Flint, Jonathan
2017-02-14
The China, Oxford and Virginia Commonwealth University Experimental Research on Genetic Epidemiology (CONVERGE) project on Major Depressive Disorder (MDD) sequenced 11,670 female Han Chinese at low-coverage (1.7X), providing the first large-scale whole genome sequencing resource representative of the largest ethnic group in the world. Samples are collected from 58 hospitals from 23 provinces around China. We are able to call 22 million high quality single nucleotide polymorphisms (SNP) from the nuclear genome, representing the largest SNP call set from an East Asian population to date. We use these variants for imputation of genotypes across all samples, and this has allowed us to perform a successful genome wide association study (GWAS) on MDD. The utility of these data can be extended to studies of genetic ancestry in the Han Chinese and evolutionary genetics when integrated with data from other populations. Molecular phenotypes, such as copy number variations and structural variations can be detected, quantified and analysed in similar ways.
Buenrostro, Jason D.; Chircus, Lauren M.; Araya, Carlos L.; Layton, Curtis J.; Chang, Howard Y.; Snyder, Michael P.; Greenleaf, William J.
2015-01-01
RNA-protein interactions drive fundamental biological processes and are targets for molecular engineering, yet quantitative and comprehensive understanding of the sequence determinants of affinity remains limited. Here we repurpose a high-throughput sequencing instrument to quantitatively measure binding and dissociation of MS2 coat protein to >107 RNA targets generated on a flow-cell surface by in situ transcription and inter-molecular tethering of RNA to DNA. We decompose the binding energy contributions from primary and secondary RNA structure, finding that differences in affinity are often driven by sequence-specific changes in association rates. By analyzing the biophysical constraints and modeling mutational paths describing the molecular evolution of MS2 from low- to high-affinity hairpins, we quantify widespread molecular epistasis, and a long-hypothesized structure-dependent preference for G:U base pairs over C:A intermediates in evolutionary trajectories. Our results suggest that quantitative analysis of RNA on a massively parallel array (RNAMaP) relationships across molecular variants. PMID:24727714
Population genomics of intrapatient HIV-1 evolution
Zanini, Fabio; Brodin, Johanna; Thebo, Lina; Lanz, Christa; Bratt, Göran; Albert, Jan; Neher, Richard A
2015-01-01
Many microbial populations rapidly adapt to changing environments with multiple variants competing for survival. To quantify such complex evolutionary dynamics in vivo, time resolved and genome wide data including rare variants are essential. We performed whole-genome deep sequencing of HIV-1 populations in 9 untreated patients, with 6-12 longitudinal samples per patient spanning 5-8 years of infection. The data can be accessed and explored via an interactive web application. We show that patterns of minor diversity are reproducible between patients and mirror global HIV-1 diversity, suggesting a universal landscape of fitness costs that control diversity. Reversions towards the ancestral HIV-1 sequence are observed throughout infection and account for almost one third of all sequence changes. Reversion rates depend strongly on conservation. Frequent recombination limits linkage disequilibrium to about 100bp in most of the genome, but strong hitch-hiking due to short range linkage limits diversity. DOI: http://dx.doi.org/10.7554/eLife.11282.001 PMID:26652000
Schoone, G J; Oskam, L; Kroon, N C; Schallig, H D; Omar, S A
2000-11-01
A quantitative nucleic acid sequence-based amplification (QT-NASBA) assay for the detection of Plasmodium parasites has been developed. Primers and probes were selected on the basis of the sequence of the small-subunit rRNA gene. Quantification was achieved by coamplification of the RNA in the sample with one modified in vitro RNA as a competitor in a single-tube NASBA reaction. Parasite densities ranging from 10 to 10(8) Plasmodium falciparum parasites per ml could be demonstrated and quantified in whole blood. This is approximately 1,000 times more sensitive than conventional microscopy analysis of thick blood smears. Comparison of the parasite densities obtained by microscopy and QT-NASBA with 120 blood samples from Kenyan patients with clinical malaria revealed that for 112 of 120 (93%) of the samples results were within a 1-log difference. QT-NASBA may be especially useful for the detection of low parasite levels in patients with early-stage malaria and for the monitoring of the efficacy of drug treatment.
Metastasizing patent claims on BRCA1.
Kepler, Thomas B; Crossman, Colin; Cook-Deegan, Robert
2010-05-01
Many patents make claims on DNA sequences; some include claims on oligonucleotides related to the primary patented gene. We used bioinformatics to quantify the reach of one such claim from patent 4,747,282 on BRCA1. We find that human chromosome 1 (which does not contain BRCA1) contains over 300,000 oligonucleotides covered by this claim, and that 80% of cDNA and mRNA sequences contributed to GenBank before the patent application was filed also contain at least one claimed oligonucleotide. Any "isolated" DNA molecules that include such 15 bp nucleotide sequences would fall under the claim as granted by the US Patent and Trademark Office. Anyone making, using, selling, or importing such a molecule for any purpose within the United States would thus be infringing the claim. This claim and others like it turn out, on examination, to be surprisingly broad, and if enforced would have substantial implications for medical practice and scientific research. Copyright 2010 Elsevier Inc. All rights reserved.
2011-01-01
Background Wheat flour is one of the world's major food ingredients, in part because of the unique end-use qualities conferred by the abundant glutamine- and proline-rich gluten proteins. Many wheat flour proteins also present dietary problems for consumers with celiac disease or wheat allergies. Despite the importance of these proteins it has been particularly challenging to use MS/MS to distinguish the many proteins in a flour sample and relate them to gene sequences. Results Grain from the extensively characterized spring wheat cultivar Triticum aestivum 'Butte 86' was milled to white flour from which proteins were extracted, then separated and quantified by 2-DE. Protein spots were identified by separate digestions with three proteases, followed by tandem mass spectrometry analysis of the peptides. The spectra were used to interrogate an improved protein sequence database and results were integrated using the Scaffold program. Inclusion of cultivar specific sequences in the database greatly improved the results, and 233 spots were identified, accounting for 93.1% of normalized spot volume. Identified proteins were assigned to 157 wheat sequences, many for proteins unique to wheat and nearly 40% from Butte 86. Alpha-gliadins accounted for 20.4% of flour protein, low molecular weight glutenin subunits 18.0%, high molecular weight glutenin subunits 17.1%, gamma-gliadins 12.2%, omega-gliadins 10.5%, amylase/protease inhibitors 4.1%, triticins 1.6%, serpins 1.6%, purinins 0.9%, farinins 0.8%, beta-amylase 0.5%, globulins 0.4%, other enzymes and factors 1.9%, and all other 3%. Conclusions This is the first successful effort to identify the majority of abundant flour proteins for a single wheat cultivar, relate them to individual gene sequences and estimate their relative levels. Many genes for wheat flour proteins are not expressed, so this study represents further progress in describing the expressed wheat genome. Use of cultivar-specific contigs helped to overcome the difficulties of matching peptides to gene sequences for members of highly similar, rapidly evolving storage protein families. Prospects for simplifying this process for routine analyses are discussed. The ability to measure expression levels for individual flour protein genes complements information gained from efforts to sequence the wheat genome and is essential for studies of effects of environment on gene expression. PMID:21314956
Sundby, Øyvind Heiberg; Høiseth, Lars Øivind; Mathiesen, Iacob; Weedon-Fekjær, Harald; Sundhagen, Jon O; Hisdal, Jonny
2017-01-01
Intermittent negative pressure (INP) applied to the lower leg and foot increases foot perfusion in healthy volunteers. The aim of the present study was to describe the effects of INP to the lower leg and foot on foot macro- and microcirculation in patients with lower extremity peripheral arterial disease (PAD). In this experimental study, we analyzed foot circulation during INP in 20 patients [median (range): 75 (63-84yrs)] with PAD. One leg was placed inside an air-tight vacuum chamber connected to an INP-generator. During application of INP (alternating 10s of -40mmHg/7s of atmospheric pressure), we continuously recorded blood flow velocity in a distal foot artery (ultrasound Doppler), skin blood flow on the pulp of the first toes (laser Doppler), heart rate (ECG), and systemic blood pressure (Finometer). After a 5-min baseline sequence (no pressure), a 10-min INP sequence was applied, followed by 5-min post-INP (no pressure). To compare and quantify blood flow fluctuations between sequences, we calculated cumulative up-and-down fluctuations in arterial blood flow velocity per minute. Onset of INP induced an increase in arterial flow velocity and skin blood flow. Peak blood flow velocity was reached 3s after the onset of negative pressure, and increased 46% [(95% CI 36-57), P<0.001] above baseline. Peak skin blood flow was reached 2s after the onset of negative pressure, and increased 89% (95% CI 48-130), P<0.001) above baseline. Cumulative fluctuations per minute were significantly higher during INP-sequences compared to baseline [21 (95% CI 12-30)cm/s/min to 41 (95% CI 32-51)cm/s/min, P<0.001]. Mean INP blood flow velocity increased significantly ~12% above mean baseline blood flow velocity [(6.7 (95% CI 5.2-8.3)cm/s to 7.5 (95% CI 5.9-9.1)cm/s, P = 0.03)]. INP increases foot macro- and microcirculatory flow pulsatility in patients with PAD. Additionally, application of INP resulted in increased mean arterial blood flow velocity.
Høiseth, Lars Øivind; Mathiesen, Iacob; Weedon-Fekjær, Harald; Sundhagen, Jon O.; Hisdal, Jonny
2017-01-01
Background Intermittent negative pressure (INP) applied to the lower leg and foot increases foot perfusion in healthy volunteers. The aim of the present study was to describe the effects of INP to the lower leg and foot on foot macro- and microcirculation in patients with lower extremity peripheral arterial disease (PAD). Methods In this experimental study, we analyzed foot circulation during INP in 20 patients [median (range): 75 (63-84yrs)] with PAD. One leg was placed inside an air-tight vacuum chamber connected to an INP-generator. During application of INP (alternating 10s of -40mmHg/7s of atmospheric pressure), we continuously recorded blood flow velocity in a distal foot artery (ultrasound Doppler), skin blood flow on the pulp of the first toes (laser Doppler), heart rate (ECG), and systemic blood pressure (Finometer). After a 5-min baseline sequence (no pressure), a 10-min INP sequence was applied, followed by 5-min post-INP (no pressure). To compare and quantify blood flow fluctuations between sequences, we calculated cumulative up-and-down fluctuations in arterial blood flow velocity per minute. Results Onset of INP induced an increase in arterial flow velocity and skin blood flow. Peak blood flow velocity was reached 3s after the onset of negative pressure, and increased 46% [(95% CI 36–57), P<0.001] above baseline. Peak skin blood flow was reached 2s after the onset of negative pressure, and increased 89% (95% CI 48–130), P<0.001) above baseline. Cumulative fluctuations per minute were significantly higher during INP-sequences compared to baseline [21 (95% CI 12–30)cm/s/min to 41 (95% CI 32–51)cm/s/min, P<0.001]. Mean INP blood flow velocity increased significantly ~12% above mean baseline blood flow velocity [(6.7 (95% CI 5.2–8.3)cm/s to 7.5 (95% CI 5.9–9.1)cm/s, P = 0.03)]. Conclusion INP increases foot macro- and microcirculatory flow pulsatility in patients with PAD. Additionally, application of INP resulted in increased mean arterial blood flow velocity. PMID:28591174
A Statistical Framework for Microbial Source Attribution
DOE Office of Scientific and Technical Information (OSTI.GOV)
Velsko, S P; Allen, J E; Cunningham, C T
2009-04-28
This report presents a general approach to inferring transmission and source relationships among microbial isolates from their genetic sequences. The outbreak transmission graph (also called the transmission tree or transmission network) is the fundamental structure which determines the statistical distributions relevant to source attribution. The nodes of this graph are infected individuals or aggregated sub-populations of individuals in which transmitted bacteria or viruses undergo clonal expansion, leading to a genetically heterogeneous population. Each edge of the graph represents a transmission event in which one or a small number of bacteria or virions infects another node thus increasing the size ofmore » the transmission network. Recombination and re-assortment events originate in nodes which are common to two distinct networks. In order to calculate the probability that one node was infected by another, given the observed genetic sequences of microbial isolates sampled from them, we require two fundamental probability distributions. The first is the probability of obtaining the observed mutational differences between two isolates given that they are separated by M steps in a transmission network. The second is the probability that two nodes sampled randomly from an outbreak transmission network are separated by M transmission events. We show how these distributions can be obtained from the genetic sequences of isolates obtained by sampling from past outbreaks combined with data from contact tracing studies. Realistic examples are drawn from the SARS outbreak of 2003, the FMDV outbreak in Great Britain in 2001, and HIV transmission cases. The likelihood estimators derived in this report, and the underlying probability distribution functions required to calculate them possess certain compelling general properties in the context of microbial forensics. These include the ability to quantify the significance of a sequence 'match' or 'mismatch' between two isolates; the ability to capture non-intuitive effects of network structure on inferential power, including the 'small world' effect; the insensitivity of inferences to uncertainties in the underlying distributions; and the concept of rescaling, i.e. ability to collapse sub-networks into single nodes and examine transmission inferences on the rescaled network.« less
Douglas, M R; Davis, M A; Amarello, M; Smith, J J; Schuett, G W; Herrmann, H-W; Holycross, A T; Douglas, M E
2016-04-01
Ecosystems transition quickly in the Anthropocene, whereas biodiversity adapts more slowly. Here we simulated a shifting woodland ecosystem on the Colorado Plateau of western North America by using as its proxy over space and time the fundamental niche of the Arizona black rattlesnake (Crotalus cerberus). We found an expansive (= end-of-Pleistocene) range that contracted sharply (= present), but is blocked topographically by Grand Canyon/Colorado River as it shifts predictably northwestward under moderate climate change (= 2080). Vulnerability to contemporary wildfire was quantified from available records, with forested area reduced more than 27% over 13 years. Both 'ecosystem metrics' underscore how climate and wildfire are rapidly converting the Plateau ecosystem into novel habitat. To gauge potential effects on C. cerberus, we derived a series of relevant 'conservation metrics' (i.e. genetic variability, dispersal capacity, effective population size) by sequencing 118 individuals across 846 bp of mitochondrial (mt)DNA-ATPase8/6. We identified five significantly different clades (net sequence divergence = 2.2%) isolated by drainage/topography, with low dispersal (F ST = 0.82) and small sizes (2N ef = 5.2). Our compiled metrics (i.e. small-populations, topographic-isolation, low-dispersal versus conserved-niche, vulnerable-ecosystem, dispersal barriers) underscore the susceptibility of this woodland specialist to a climate and wildfire tandem. We offer adaptive management scenarios that may counterbalance these metrics and avoid the extirpation of this and other highly specialized, relictual woodland clades.
Różycki, Bartosz; Cazade, Pierre-André; O'Mahony, Shane; Thompson, Damien; Cieplak, Marek
2017-08-16
Cellulosomes are large multi-protein catalysts produced by various anaerobic microorganisms to efficiently degrade plant cell-wall polysaccharides down into simple sugars. X-ray and physicochemical structural characterisations show that cellulosomes are composed of numerous protein domains that are connected by unstructured polypeptide segments, yet the properties and possible roles of these 'linker' peptides are largely unknown. We have performed coarse-grained and all-atom molecular dynamics computer simulations of a number of cellulosomal linkers of different lengths and compositions. Our data demonstrates that the effective stiffness of the linker peptides, as quantified by the equilibrium fluctuations in the end-to-end distances, depends primarily on the length of the linker and less so on the specific amino acid sequence. The presence of excluded volume - provided by the domains that are connected - dampens the motion of the linker residues and reduces the effective stiffness of the linkers. Simultaneously, the presence of the linkers alters the conformations of the protein domains that are connected. We demonstrate that short, stiff linkers induce significant rearrangements in the folded domains of the mini-cellulosome composed of endoglucanase Cel8A in complex with scaffoldin ScafT (Cel8A-ScafT) of Clostridium thermocellum as well as in a two-cohesin system derived from the scaffoldin ScaB of Acetivibrio cellulolyticus. We give experimentally testable predictions on structural changes in protein domains that depend on the length of linkers.
Wang, Qian; Yin, Bin-Cheng; Ye, Bang-Ce
2016-06-15
MicroRNAs (miRNAs), functioning as oncogenes or tumor suppressors, play significant regulatory roles in regulating gene expression and become as biomarkers for disease diagnostics and therapeutics. In this work, we have coupled a polydopamine (PDA) nanosphere-assisted chemiluminescence resonance energy transfer (CRET) platform and a duplex-specific nuclease (DSN)-assisted signal amplification strategy to develop a novel method for specific miRNA detection. With the assistance of hemin, luminol, and H2O2, the horseradish peroxidase (HRP)-mimicking G-rich sequence in the sensing probe produces chemiluminescence, which is quickly quenched by the CRET effect between PDA as energy acceptor and excited luminol as energy donor. The target miRNA triggers DSN to partially degrade the sensing probe in the DNA-miRNA heteroduplex to repeatedly release G-quadruplex formed by G-rich sequence from PDA for the production of chemiluminescence. The method allows quantitative detection of target miRNA in the range of 80 pM-50 nM with a detection limit of 49.6 pM. The method also shows excellent specificity to discriminate single-base differences, and can accurately quantify miRNA in biological samples, with good agreement with the result from a commercial miRNA detection kit. The procedure requires no organic dyes or labels, and is a simple and cost-effective method for miRNA detection for early clinical diagnosis. Copyright © 2016 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Barnhart, William D.; Lohman, Rowena B.; Mellors, Robert J.
2013-10-01
We present a catalog of interferometric synthetic aperture radar (InSAR) constraints on deformation that occurred during earthquake sequences in southern Iran between 1992 and 2011, and explore the implications on the accommodation of large-scale continental convergence between Saudi Arabia and Eurasia within the Zagros Mountains. The Zagros Mountains, a salt-laden fold-and-thrust belt involving ~10 km of sedimentary rocks overlying Precambrian basement rocks, have formed as a result of ongoing continental collision since 10-20 Ma that is currently occurring at a rate of ~3 cm/yr. We first demonstrate that there is a biased misfit in earthquake locations in global catalogs that likely results from neglect of 3-D velocity structure. Previous work involving two M ~ 6 earthquakes with well-recorded aftershocks has shown that the deformation observed with InSAR may represent triggered slip on faults much shallower than the primary earthquake, which likely occurred within the basement rocks (>10 km depth). We explore the hypothesis that most of the deformation observed with InSAR spanning earthquake sequences is also due to shallow, triggered slip above a deeper earthquake, effectively doubling the moment release for each event. We quantify the effects that this extra moment release would have on the discrepancy between seismically and geodetically constrained moment rates in the region, finding that even with the extra triggered fault slip, significant aseismic deformation during the interseismic period is necessary to fully explain the convergence between Eurasia and Saudi Arabia.
Douglas, M. R.; Davis, M. A.; Amarello, M.; Smith, J. J.; Schuett, G. W.; Herrmann, H.-W.; Holycross, A. T.; Douglas, M. E.
2016-01-01
Ecosystems transition quickly in the Anthropocene, whereas biodiversity adapts more slowly. Here we simulated a shifting woodland ecosystem on the Colorado Plateau of western North America by using as its proxy over space and time the fundamental niche of the Arizona black rattlesnake (Crotalus cerberus). We found an expansive (= end-of-Pleistocene) range that contracted sharply (= present), but is blocked topographically by Grand Canyon/Colorado River as it shifts predictably northwestward under moderate climate change (= 2080). Vulnerability to contemporary wildfire was quantified from available records, with forested area reduced more than 27% over 13 years. Both ‘ecosystem metrics' underscore how climate and wildfire are rapidly converting the Plateau ecosystem into novel habitat. To gauge potential effects on C. cerberus, we derived a series of relevant ‘conservation metrics' (i.e. genetic variability, dispersal capacity, effective population size) by sequencing 118 individuals across 846 bp of mitochondrial (mt)DNA-ATPase8/6. We identified five significantly different clades (net sequence divergence = 2.2%) isolated by drainage/topography, with low dispersal (FST = 0.82) and small sizes (2Nef = 5.2). Our compiled metrics (i.e. small-populations, topographic-isolation, low-dispersal versus conserved-niche, vulnerable-ecosystem, dispersal barriers) underscore the susceptibility of this woodland specialist to a climate and wildfire tandem. We offer adaptive management scenarios that may counterbalance these metrics and avoid the extirpation of this and other highly specialized, relictual woodland clades. PMID:27152218
Gordon, Adam S; Tabor, Holly K; Johnson, Andrew D; Snively, Beverly M; Assimes, Themistocles L; Auer, Paul L; Ioannidis, John P A; Peters, Ulrike; Robinson, Jennifer G; Sucheston, Lara E; Wang, Danxin; Sotoodehnia, Nona; Rotter, Jerome I; Psaty, Bruce M; Jackson, Rebecca D; Herrington, David M; O'Donnell, Christopher J; Reiner, Alexander P; Rich, Stephen S; Rieder, Mark J; Bamshad, Michael J; Nickerson, Deborah A
2014-04-15
The study of genetic influences on drug response and efficacy ('pharmacogenetics') has existed for over 50 years. Yet, we still lack a complete picture of how genetic variation, both common and rare, affects each individual's responses to medications. Exome sequencing is a promising alternative method for pharmacogenetic discovery as it provides information on both common and rare variation in large numbers of individuals. Using exome data from 2203 AA and 4300 Caucasian individuals through the NHLBI Exome Sequencing Project, we conducted a survey of coding variation within 12 Cytochrome P450 (CYP) genes that are collectively responsible for catalyzing nearly 75% of all known Phase I drug oxidation reactions. In addition to identifying many polymorphisms with known pharmacogenetic effects, we discovered over 730 novel nonsynonymous alleles across the 12 CYP genes of interest. These alleles include many with diverse functional effects such as premature stop codons, aberrant splicesites and mutations at conserved active site residues. Our analysis considering both novel, predicted functional alleles as well as known, actionable CYP alleles reveals that rare, deleterious variation contributes markedly to the overall burden of pharmacogenetic alleles within the populations considered, and that the contribution of rare variation to this burden is over three times greater in AA individuals as compared with Caucasians. While most of these impactful alleles are individually rare, 7.6-11.7% of individuals interrogated in the study carry at least one newly described potentially deleterious alleles in a major drug-metabolizing CYP.
The Urban Forest Effects (UFORE) model: quantifying urban forest structure and functions
David J. Nowak; Daniel E. Crane
2000-01-01
The Urban Forest Effects (UFORE) computer model was developed to help managers and researchers quantify urban forest structure and functions. The model quantifies species composition and diversity, diameter distribution, tree density and health, leaf area, leaf biomass, and other structural characteristics; hourly volatile organic compound emissions (emissions that...
Beng, Kingsly Chuo; Tomlinson, Kyle W.; Shen, Xian Hui; Surget-Groba, Yann; Hughes, Alice C.; Corlett, Richard T.; Slik, J. W. Ferry
2016-01-01
Metabarcoding potentially offers a rapid and cheap method of monitoring biodiversity, but real-world applications are few. We investigated its utility in studying patterns of litter arthropod diversity and composition in the tropics. We collected litter arthropods from 35 matched forest-plantation sites across Xishuangbanna, southwestern China. A new primer combination and the MiSeq platform were used to amplify and sequence a wide variety of litter arthropods using simulated and real-world communities. Quality filtered reads were clustered into 3,624 MOTUs at ≥97% similarity and the taxonomy of each MOTU was predicted. We compared diversity and compositional differences between forests and plantations (rubber and tea) for all MOTUs and for eight arthropod groups. We obtained ~100% detection rate after in silico sequencing six mock communities with known arthropod composition. Ordination showed that rubber, tea and forest communities formed distinct clusters. α-diversity declined significantly between forests and adjacent plantations for more arthropod groups in rubber than tea, and diversity of order Orthoptera increased significantly in tea. Turnover was higher in forests than plantations, but patterns differed among groups. Metabarcoding is useful for quantifying diversity patterns of arthropods under different land-uses and the MiSeq platform is effective for arthropod metabarcoding in the tropics. PMID:27112993
Impact of imaging measurements on response assessment in glioblastoma clinical trials
Reardon, David A.; Ballman, Karla V.; Buckner, Jan C.; Chang, Susan M.; Ellingson, Benjamin M.
2014-01-01
We provide historical and scientific guidance on imaging response assessment for incorporation into clinical trials to stimulate effective and expedited drug development for recurrent glioblastoma by addressing 3 fundamental questions: (i) What is the current validation status of imaging response assessment, and when are we confident assessing response using today's technology? (ii) What imaging technology and/or response assessment paradigms can be validated and implemented soon, and how will these technologies provide benefit? (iii) Which imaging technologies need extensive testing, and how can they be prospectively validated? Assessment of T1 +/− contrast, T2/FLAIR, diffusion, and perfusion-imaging sequences are routine and provide important insight into underlying tumor activity. Nonetheless, utility of these data within and across patients, as well as across institutions, are limited by challenges in quantifying measurements accurately and lack of consistent and standardized image acquisition parameters. Currently, there exists a critical need to generate guidelines optimizing and standardizing MRI sequences for neuro-oncology patients. Additionally, more accurate differentiation of confounding factors (pseudoprogression or pseudoresponse) may be valuable. Although promising, diffusion MRI, perfusion MRI, MR spectroscopy, and amino acid PET require extensive standardization and validation. Finally, additional techniques to enhance response assessment, such as digital T1 subtraction maps, warrant further investigation. PMID:25313236
Seismoelectric data processing for surface surveys of shallow targets
Haines, S.S.; Guitton, A.; Biondi, B.
2007-01-01
The utility of the seismoelectric method relies on the development of methods to extract the signal of interest from background and source-generated coherent noise that may be several orders-of-magnitude stronger. We compare data processing approaches to develop a sequence of preprocessing and signal/noise separation and to quantify the noise level from which we can extract signal events. Our preferred sequence begins with the removal of power line harmonic noise and the use of frequency filters to minimize random and source-generated noise. Mapping to the linear Radon domain with an inverse process incorporating a sparseness constraint provides good separation of signal from noise, though it is ineffective on noise that shows the same dip as the signal. Similarly, the seismoelectric signal and noise do not separate cleanly in the Fourier domain, so f-k filtering can not remove all of the source-generated noise and it also disrupts signal amplitude patterns. We find that prediction-error filters provide the most effective method to separate signal and noise, while also preserving amplitude information, assuming that adequate pattern models can be determined for the signal and noise. These Radon-domain and prediction-error-filter methods successfully separate signal from <33 dB stronger noise in our test data. ?? 2007 Society of Exploration Geophysicists.
Mei, Meng; Zhai, Chao; Li, Xinzhi; Zhou, Yu; Peng, Wenfang; Ma, Lixin; Wang, Qinhong; Iverson, Brent L; Zhang, Guimin; Yi, Li
2017-12-15
An endoplasmic reticulum (ER) retention sequence (ERS) is a characteristic short sequence that mediates protein retention in the ER of eukaryotic cells. However, little is known about the detailed molecular mechanism involved in ERS-mediated protein ER retention. Using a new surface display-based fluorescence technique that effectively quantifies ERS-promoted protein ER retention within Saccharomyces cerevisiae cells, we performed comprehensive ERS analyses. We found that the length, type of amino acid residue, and additional residues at positions -5 and -6 of the C-terminal HDEL motif all determined the retention of ERS in the yeast ER. Moreover, the biochemical results guided by structure simulation revealed that aromatic residues (Phe-54, Trp-56, and other aromatic residues facing the ER lumen) in both the ERS (at positions -6 and -4) and its receptor, Erd2, jointly determined their interaction with each other. Our studies also revealed that this aromatic residue interaction might lead to the discriminative recognition of HDEL or KDEL as ERS in yeast or human cells, respectively. Our findings expand the understanding of ERS-mediated residence of proteins in the ER and may guide future research into protein folding, modification, and translocation affected by ER retention. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Jessen, Leon Eyrich; Hoof, Ilka; Lund, Ole; Nielsen, Morten
2013-07-01
Identifying which mutation(s) within a given genotype is responsible for an observable phenotype is important in many aspects of molecular biology. Here, we present SigniSite, an online application for subgroup-free residue-level genotype-phenotype correlation. In contrast to similar methods, SigniSite does not require any pre-definition of subgroups or binary classification. Input is a set of protein sequences where each sequence has an associated real number, quantifying a given phenotype. SigniSite will then identify which amino acid residues are significantly associated with the data set phenotype. As output, SigniSite displays a sequence logo, depicting the strength of the phenotype association of each residue and a heat-map identifying 'hot' or 'cold' regions. SigniSite was benchmarked against SPEER, a state-of-the-art method for the prediction of specificity determining positions (SDP) using a set of human immunodeficiency virus protease-inhibitor genotype-phenotype data and corresponding resistance mutation scores from the Stanford University HIV Drug Resistance Database, and a data set of protein families with experimentally annotated SDPs. For both data sets, SigniSite was found to outperform SPEER. SigniSite is available at: http://www.cbs.dtu.dk/services/SigniSite/.
H-Bond Self-Assembly: Folding versus Duplex Formation.
Núñez-Villanueva, Diego; Iadevaia, Giulia; Stross, Alexander E; Jinks, Michael A; Swain, Jonathan A; Hunter, Christopher A
2017-05-17
Linear oligomers equipped with complementary H-bond donor (D) and acceptor (A) sites can interact via intermolecular H-bonds to form duplexes or fold via intramolecular H-bonds. These competing equilibria have been quantified using NMR titration and dilution experiments for seven systems featuring different recognition sites and backbones. For all seven architectures, duplex formation is observed for homo-sequence 2-mers (AA·DD) where there are no competing folding equilibria. The corresponding hetero-sequence AD 2-mers also form duplexes, but the observed self-association constants are strongly affected by folding equilibria in the monomeric states. When the backbone is flexible (five or more rotatable bonds separating the recognition sites), intramolecular H-bonding is favored, and the folded state is highly populated. For these systems, the stability of the AD·AD duplex is 1-2 orders of magnitude lower than that of the corresponding AA·DD duplex. However, for three architectures which have more rigid backbones (fewer than five rotatable bonds), intramolecular interactions are not observed, and folding does not compete with duplex formation. These systems are promising candidates for the development of longer, mixed-sequence synthetic information molecules that show sequence-selective duplex formation.
Barreda-García, Susana; González-Álvarez, María José; de-Los-Santos-Álvarez, Noemí; Palacios-Gutiérrez, Juan José; Miranda-Ordieres, Arturo J; Lobo-Castañón, María Jesús
2015-06-15
A highly sensitive and robust method for the quantification of specific DNA sequences based on coupling asymmetric helicase-dependent DNA amplification to electrochemical detection is described. This method relies on the entrapment of the amplified ssDNA sequences on magnetic beads followed by a post-amplification hybridization assay to provide an added degree of specificity. As a proof-of-concept a 84-bases long sequence specific of Mycobacterium tuberculosis is amplified at 65°C, providing 3×10(6) amplification after 90 min. Using this system 0.5 aM, corresponding to 15 copies of the target gene in 50 µL of sample, can be successfully detected and reliably quantified under isothermal conditions in less than 4h. The assay has been applied to the detection of M. tuberculosis in sputum, pleural fluid and urine samples. Besides this application, the proposed assays is a powerful and general tool for molecular diagnostic that can be applied to the detection of other specific DNA sequences, taking full advantage of the plethora of genomic information now available. Copyright © 2014 Elsevier B.V. All rights reserved.
Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans
Slatkin, Montgomery
2016-01-01
When sequencing an ancient DNA sample from a hominin fossil, DNA from present-day humans involved in excavation and extraction will be sequenced along with the endogenous material. This type of contamination is problematic for downstream analyses as it will introduce a bias towards the population of the contaminating individual(s). Quantifying the extent of contamination is a crucial step as it allows researchers to account for possible biases that may arise in downstream genetic analyses. Here, we present an MCMC algorithm to co-estimate the contamination rate, sequencing error rate and demographic parameters—including drift times and admixture rates—for an ancient nuclear genome obtained from human remains, when the putative contaminating DNA comes from present-day humans. We assume we have a large panel representing the putative contaminant population (e.g. European, East Asian or African). The method is implemented in a C++ program called ‘Demographic Inference with Contamination and Error’ (DICE). We applied it to simulations and genome data from ancient Neanderthals and modern humans. With reasonable levels of genome sequence coverage (>3X), we find we can recover accurate estimates of all these parameters, even when the contamination rate is as high as 50%. PMID:27049965
Spliced synthetic genes as internal controls in RNA sequencing experiments.
Hardwick, Simon A; Chen, Wendy Y; Wong, Ted; Deveson, Ira W; Blackburn, James; Andersen, Stacey B; Nielsen, Lars K; Mattick, John S; Mercer, Tim R
2016-09-01
RNA sequencing (RNA-seq) can be used to assemble spliced isoforms, quantify expressed genes and provide a global profile of the transcriptome. However, the size and diversity of the transcriptome, the wide dynamic range in gene expression and inherent technical biases confound RNA-seq analysis. We have developed a set of spike-in RNA standards, termed 'sequins' (sequencing spike-ins), that represent full-length spliced mRNA isoforms. Sequins have an entirely artificial sequence with no homology to natural reference genomes, but they align to gene loci encoded on an artificial in silico chromosome. The combination of multiple sequins across a range of concentrations emulates alternative splicing and differential gene expression, and it provides scaling factors for normalization between samples. We demonstrate the use of sequins in RNA-seq experiments to measure sample-specific biases and determine the limits of reliable transcript assembly and quantification in accompanying human RNA samples. In addition, we have designed a complementary set of sequins that represent fusion genes arising from rearrangements of the in silico chromosome to aid in cancer diagnosis. RNA sequins provide a qualitative and quantitative reference with which to navigate the complexity of the human transcriptome.
Parenchymal signal intensity in 3-T body MRI of dogs with hematopoietic neoplasia.
Feeney, Daniel A; Sharkey, Leslie C; Steward, Susan M; Bahr, Katherine L; Henson, Michael S; Ito, Daisuke; O'Brien, Timothy D; Jessen, Carl R; Husbands, Brian D; Borgatti, Antonella; Modiano, Jaime F
2013-04-01
We performed a preliminary study involving 10 dogs to assess the applicability of body MRI for staging of canine diffuse hematopoietic neoplasia. T1-weighted (before and after intravenous gadolinium), T2-weighted, in-phase, out-of-phase, and short tau inversion recovery pulse sequences were used. By using digital region of interest (ROI) and visual comparison techniques, relative parenchymal organ (medial iliac lymph nodes, liver, spleen, kidney cortex, and kidney medulla) signal intensity was quantified as less than, equal to, or greater than that of skeletal muscle in 2 clinically normal young adult dogs and 10 dogs affected with either B-cell lymphoma (n = 7) or myelodysplastic syndrome (n = 3). Falciform fat and urinary bladder were evaluated to provide additional perspective regarding signal intensity from the pulse sequences. Dogs with nonfocal disease could be distinguished from normal dogs according to both the visual and ROI signal-intensity relationships. In normal dogs, liver signal intensity on the T2-weighted sequence was greater than that of skeletal muscle by using either the visual or ROI approach. However in affected dogs, T2-weighted liver signal intensity was less than that of skeletal muscle by using either the ROI approach (10 of 10 dogs) or the visual approach (9 of 10 dogs). These findings suggest that the comparison of relative signal intensity among organs may have merit as a research model for infiltrative parenchymal disease (ROI approach) or metabolic effects of disease; this comparison may have practical clinical applicability (visual comparison approach) as well.
Parenchymal Signal Intensity in 3-T Body MRI of Dogs with Hematopoietic Neoplasia
Feeney, Daniel A; Sharkey, Leslie C; Steward, Susan M; Bahr, Katherine L; Henson, Michael S; Ito, Daisuke; O'Brien, Timothy D; Jessen, Carl R; Husbands, Brian D; Borgatti, Antonella; Modiano, Jaime F
2013-01-01
We performed a preliminary study involving 10 dogs to assess the applicability of body MRI for staging of canine diffuse hematopoietic neoplasia. T1-weighted (before and after intravenous gadolinium), T2-weighted, in-phase, out-of-phase, and short tau inversion recovery pulse sequences were used. By using digital region of interest (ROI) and visual comparison techniques, relative parenchymal organ (medial iliac lymph nodes, liver, spleen, kidney cortex, and kidney medulla) signal intensity was quantified as less than, equal to, or greater than that of skeletal muscle in 2 clinically normal young adult dogs and 10 dogs affected with either B-cell lymphoma (n = 7) or myelodysplastic syndrome (n = 3). Falciform fat and urinary bladder were evaluated to provide additional perspective regarding signal intensity from the pulse sequences. Dogs with nonfocal disease could be distinguished from normal dogs according to both the visual and ROI signal-intensity relationships. In normal dogs, liver signal intensity on the T2-weighted sequence was greater than that of skeletal muscle by using either the visual or ROI approach. However in affected dogs, T2-weighted liver signal intensity was less than that of skeletal muscle by using either the ROI approach (10 of 10 dogs) or the visual approach (9 of 10 dogs). These findings suggest that the comparison of relative signal intensity among organs may have merit as a research model for infiltrative parenchymal disease (ROI approach) or metabolic effects of disease; this comparison may have practical clinical applicability (visual comparison approach) as well. PMID:23582424
Comparing the efficacy of metronome beeps and stepping stones to adjust gait: steps to follow!
Bank, Paulina J M; Roerdink, Melvyn; Peper, C E
2011-03-01
Acoustic metronomes and visual targets have been used in rehabilitation practice to improve pathological gait. In addition, they may be instrumental in evaluating and training instantaneous gait adjustments. The aim of this study was to compare the efficacy of two cue types in inducing gait adjustments, viz. acoustic temporal cues in the form of metronome beeps and visual spatial cues in the form of projected stepping stones. Twenty healthy elderly (aged 63.2 ± 3.6 years) were recruited to walk on an instrumented treadmill at preferred speed and cadence, paced by either metronome beeps or projected stepping stones. Gait adaptations were induced using two manipulations: by perturbing the sequence of cues and by imposing switches from one cueing type to the other. Responses to these manipulations were quantified in terms of step-length and step-time adjustments, the percentage correction achieved over subsequent steps, and the number of steps required to restore the relation between gait and the beeps or stepping stones. The results showed that perturbations in a sequence of stepping stones were overcome faster than those in a sequence of metronome beeps. In switching trials, switching from metronome beeps to stepping stones was achieved faster than vice versa, indicating that gait was influenced more strongly by the stepping stones than the metronome beeps. Together these results revealed that, in healthy elderly, the stepping stones induced gait adjustments more effectively than did the metronome beeps. Potential implications for the use of metronome beeps and stepping stones in gait rehabilitation practice are discussed.
Litter Breakdown and Microbial Succession on Two Submerged Leaf Species in a Small Forested Stream
Newman, Molli M.; Liles, Mark R.; Feminella, Jack W.
2015-01-01
Microbial succession during leaf breakdown was investigated in a small forested stream in west-central Georgia, USA, using multiple culture-independent techniques. Red maple (Acer rubrum) and water oak (Quercus nigra) leaf litter were incubated in situ for 128 days, and litter breakdown was quantified by ash-free dry mass (AFDM) method and microbial assemblage composition using phospholipid fatty acid analysis (PLFA), ribosomal intergenic spacer analysis (RISA), denaturing gradient gel electrophoresis (DGGE), and bar-coded next-generation sequencing of 16S rRNA gene amplicons. Leaf breakdown was faster for red maple than water oak. PLFA revealed a significant time effect on microbial lipid profiles for both leaf species. Microbial assemblages on maple contained a higher relative abundance of bacterial lipids than oak, and oak microbial assemblages contained higher relative abundance of fungal lipids than maple. RISA showed that incubation time was more important in structuring bacterial assemblages than leaf physicochemistry. DGGE profiles revealed high variability in bacterial assemblages over time, and sequencing of DGGE-resolved amplicons indicated several taxa present on degrading litter. Next-generation sequencing revealed temporal shifts in dominant taxa within the phylum Proteobacteria, whereas γ-Proteobacteria dominated pre-immersion and α- and β-Proteobacteria dominated after 1 month of instream incubation; the latter groups contain taxa that are predicted to be capable of using organic material to fuel further breakdown. Our results suggest that incubation time is more important than leaf species physicochemistry in influencing leaf litter microbial assemblage composition, and indicate the need for investigation into seasonal and temporal dynamics of leaf litter microbial assemblage succession. PMID:26098687
Quantitative microbiome profiling links gut community variation to microbial load.
Vandeputte, Doris; Kathagen, Gunter; D'hoe, Kevin; Vieira-Silva, Sara; Valles-Colomer, Mireia; Sabino, João; Wang, Jun; Tito, Raul Y; De Commer, Lindsey; Darzi, Youssef; Vermeire, Séverine; Falony, Gwen; Raes, Jeroen
2017-11-23
Current sequencing-based analyses of faecal microbiota quantify microbial taxa and metabolic pathways as fractions of the sample sequence library generated by each analysis. Although these relative approaches permit detection of disease-associated microbiome variation, they are limited in their ability to reveal the interplay between microbiota and host health. Comparative analyses of relative microbiome data cannot provide information about the extent or directionality of changes in taxa abundance or metabolic potential. If microbial load varies substantially between samples, relative profiling will hamper attempts to link microbiome features to quantitative data such as physiological parameters or metabolite concentrations. Saliently, relative approaches ignore the possibility that altered overall microbiota abundance itself could be a key identifier of a disease-associated ecosystem configuration. To enable genuine characterization of host-microbiota interactions, microbiome research must exchange ratios for counts. Here we build a workflow for the quantitative microbiome profiling of faecal material, through parallelization of amplicon sequencing and flow cytometric enumeration of microbial cells. We observe up to tenfold differences in the microbial loads of healthy individuals and relate this variation to enterotype differentiation. We show how microbial abundances underpin both microbiota variation between individuals and covariation with host phenotype. Quantitative profiling bypasses compositionality effects in the reconstruction of gut microbiota interaction networks and reveals that the taxonomic trade-off between Bacteroides and Prevotella is an artefact of relative microbiome analyses. Finally, we identify microbial load as a key driver of observed microbiota alterations in a cohort of patients with Crohn's disease, here associated with a low-cell-count Bacteroides enterotype (as defined through relative profiling).
NASA Astrophysics Data System (ADS)
Platiša, Ljiljana; Goossens, Bart; Vansteenkiste, Ewout; Badano, Aldo; Philips, Wilfried
2010-02-01
Clinical practice is rapidly moving in the direction of volumetric imaging. Often, radiologists interpret these images in liquid crystal displays at browsing rates of 30 frames per second or higher. However, recent studies suggest that the slow response of the display can compromise image quality. In order to quantify the temporal effect of medical displays on detection performance, we investigate two designs of a multi-slice channelized Hotelling observer (msCHO) model in the task of detecting a single-slice signal in multi-slice simulated images. The design of msCHO models is inspired by simplifying assumptions about how humans observe while viewing in the stack-browsing mode. For comparison, we consider a standard CHO applied only on the slice where the signal is located, recently used in a similar study. We refer to it as a single-slice CHO (ssCHO). Overall, our results confirm previous findings that the slow response of displays degrades the detection performance of the observers. More specifically, the observed performance range of msCHO designs is higher compared to the ssCHO suggesting that the extent and rate of degradation, though significant, may be less drastic than previously estimated by the ssCHO. Especially, the difference between msCHO and ssCHO is more significant for higher browsing speeds than for slow image sequences or static images. This, together with their design criteria driven by the assumptions about humans, makes the msCHO models promising candidates for further studies aimed at building anthropomorphic observer models for the stack-mode image presentation.
Connecting Taxon-Specific Microbial Activities to Carbon Cycling in the Rhizosphere
NASA Astrophysics Data System (ADS)
Hungate, B. A.; Morrissey, E.; Schwartz, E.; Dijkstra, P.; Blazewicz, S.; Pett-Ridge, J.; Koch, G. W.; Marks, J.; Koch, B.; McHugh, T. A.; Mau, R. L.; Hayer, M.
2016-12-01
Plant carbon inputs influence microbial growth in the rhizosphere, but the quantitative details of these effects are not well understood, nor are their consequences for carbon cycling in the rhizosphere. With a new pulse of carbon input to soil, which microbial taxa increase their growth rates, and by how much? Do any microbial taxa respond negatively? And how does the extra carbon addition alter the utilization of other resources, including other carbon sources, as well as inorganic nitrogen? This talk will present new research using quantitative stable isotope probing that reveals the distribution of growth responses among microbial taxa, from positive to neutral to negative, and how these growth responses are associated with various substrates. For example, decomposition of soil C in response to added labile carbon occurred as a phylogenetically-diverse majority of taxa shifted toward soil C use for growth. In contrast, bacteria with suppressed growth or that relied directly on glucose for growth clustered strongly by phylogeny. These results suggest that priming is a prototypical response of bacteria to sustained labile C addition, consistent with the widespread occurrence of the priming effect in nature. These results also illustrate the potential power of molecular tools and models that seek to estimate metrics directly relevant to quantitative ecology and biogeochemistry, moreso than is the standard currently in microbial ecology. Tools that estimate growth rate, mortality rate, and rates of substrate use - all quantified with the taxonomic precision afforded by modern sequencing - provide a foundation for quantifying the biogeochemical significance of microbial biodiversity, and a more complete understanding of the rich ecosystem of the rhizosphere.
Toyama, Hironori; Kajisa, Tsuyoshi; Tagane, Shuichiro; Mase, Keiko; Chhang, Phourin; Samreth, Vanna; Ma, Vuthy; Sokh, Heng; Ichihashi, Ryuji; Onoda, Yusuke; Mizoue, Nobuya; Yahara, Tetsukazu
2015-01-01
Ecological communities including tropical rainforest are rapidly changing under various disturbances caused by increasing human activities. Recently in Cambodia, illegal logging and clear-felling for agriculture have been increasing. Here, we study the effects of logging, mortality and recruitment of plot trees on phylogenetic community structure in 32 plots in Kampong Thom, Cambodia. Each plot was 0.25 ha; 28 plots were established in primary evergreen forests and four were established in secondary dry deciduous forests. Measurements were made in 1998, 2000, 2004 and 2010, and logging, recruitment and mortality of each tree were recorded. We estimated phylogeny using rbcL and matK gene sequences and quantified phylogenetic α and β diversity. Within communities, logging decreased phylogenetic diversity, and increased overall phylogenetic clustering and terminal phylogenetic evenness. Between communities, logging increased phylogenetic similarity between evergreen and deciduous plots. On the other hand, recruitment had opposite effects both within and between communities. The observed patterns can be explained by environmental homogenization under logging. Logging is biased to particular species and larger diameter at breast height, and forest patrol has been effective in decreasing logging. PMID:25561669
Toyama, Hironori; Kajisa, Tsuyoshi; Tagane, Shuichiro; Mase, Keiko; Chhang, Phourin; Samreth, Vanna; Ma, Vuthy; Sokh, Heng; Ichihashi, Ryuji; Onoda, Yusuke; Mizoue, Nobuya; Yahara, Tetsukazu
2015-02-19
Ecological communities including tropical rainforest are rapidly changing under various disturbances caused by increasing human activities. Recently in Cambodia, illegal logging and clear-felling for agriculture have been increasing. Here, we study the effects of logging, mortality and recruitment of plot trees on phylogenetic community structure in 32 plots in Kampong Thom, Cambodia. Each plot was 0.25 ha; 28 plots were established in primary evergreen forests and four were established in secondary dry deciduous forests. Measurements were made in 1998, 2000, 2004 and 2010, and logging, recruitment and mortality of each tree were recorded. We estimated phylogeny using rbcL and matK gene sequences and quantified phylogenetic α and β diversity. Within communities, logging decreased phylogenetic diversity, and increased overall phylogenetic clustering and terminal phylogenetic evenness. Between communities, logging increased phylogenetic similarity between evergreen and deciduous plots. On the other hand, recruitment had opposite effects both within and between communities. The observed patterns can be explained by environmental homogenization under logging. Logging is biased to particular species and larger diameter at breast height, and forest patrol has been effective in decreasing logging. © 2015 The Author(s) Published by the Royal Society. All rights reserved.
Interactions between genetic variation and cellular environment in skeletal muscle gene expression.
Taylor, D Leland; Knowles, David A; Scott, Laura J; Ramirez, Andrea H; Casale, Francesco Paolo; Wolford, Brooke N; Guan, Li; Varshney, Arushi; Albanus, Ricardo D'Oliveira; Parker, Stephen C J; Narisu, Narisu; Chines, Peter S; Erdos, Michael R; Welch, Ryan P; Kinnunen, Leena; Saramies, Jouko; Sundvall, Jouko; Lakka, Timo A; Laakso, Markku; Tuomilehto, Jaakko; Koistinen, Heikki A; Stegle, Oliver; Boehnke, Michael; Birney, Ewan; Collins, Francis S
2018-01-01
From whole organisms to individual cells, responses to environmental conditions are influenced by genetic makeup, where the effect of genetic variation on a trait depends on the environmental context. RNA-sequencing quantifies gene expression as a molecular trait, and is capable of capturing both genetic and environmental effects. In this study, we explore opportunities of using allele-specific expression (ASE) to discover cis-acting genotype-environment interactions (GxE)-genetic effects on gene expression that depend on an environmental condition. Treating 17 common, clinical traits as approximations of the cellular environment of 267 skeletal muscle biopsies, we identify 10 candidate environmental response expression quantitative trait loci (reQTLs) across 6 traits (12 unique gene-environment trait pairs; 10% FDR per trait) including sex, systolic blood pressure, and low-density lipoprotein cholesterol. Although using ASE is in principle a promising approach to detect GxE effects, replication of such signals can be challenging as validation requires harmonization of environmental traits across cohorts and a sufficient sampling of heterozygotes for a transcribed SNP. Comprehensive discovery and replication will require large human transcriptome datasets, or the integration of multiple transcribed SNPs, coupled with standardized clinical phenotyping.
Model-free aftershock forecasts constructed from similar sequences in the past
NASA Astrophysics Data System (ADS)
van der Elst, N.; Page, M. T.
2017-12-01
The basic premise behind aftershock forecasting is that sequences in the future will be similar to those in the past. Forecast models typically use empirically tuned parametric distributions to approximate past sequences, and project those distributions into the future to make a forecast. While parametric models do a good job of describing average outcomes, they are not explicitly designed to capture the full range of variability between sequences, and can suffer from over-tuning of the parameters. In particular, parametric forecasts may produce a high rate of "surprises" - sequences that land outside the forecast range. Here we present a non-parametric forecast method that cuts out the parametric "middleman" between training data and forecast. The method is based on finding past sequences that are similar to the target sequence, and evaluating their outcomes. We quantify similarity as the Poisson probability that the observed event count in a past sequence reflects the same underlying intensity as the observed event count in the target sequence. Event counts are defined in terms of differential magnitude relative to the mainshock. The forecast is then constructed from the distribution of past sequences outcomes, weighted by their similarity. We compare the similarity forecast with the Reasenberg and Jones (RJ95) method, for a set of 2807 global aftershock sequences of M≥6 mainshocks. We implement a sequence-specific RJ95 forecast using a global average prior and Bayesian updating, but do not propagate epistemic uncertainty. The RJ95 forecast is somewhat more precise than the similarity forecast: 90% of observed sequences fall within a factor of two of the median RJ95 forecast value, whereas the fraction is 85% for the similarity forecast. However, the surprise rate is much higher for the RJ95 forecast; 10% of observed sequences fall in the upper 2.5% of the (Poissonian) forecast range. The surprise rate is less than 3% for the similarity forecast. The similarity forecast may be useful to emergency managers and non-specialists when confidence or expertise in parametric forecasting may be lacking. The method makes over-tuning impossible, and minimizes the rate of surprises. At the least, this forecast constitutes a useful benchmark for more precisely tuned parametric forecasts.
Dutta, Sanjib; Koide, Akiko; Koide, Shohei
2008-01-01
Stability evaluation of many mutants can lead to a better understanding of the sequence determinants of a structural motif and of factors governing protein stability and protein evolution. The traditional biophysical analysis of protein stability is low throughput, limiting our ability to widely explore the sequence space in a quantitative manner. In this study, we have developed a high-throughput library screening method for quantifying stability changes, which is based on protein fragment reconstitution and yeast surface display. Our method exploits the thermodynamic linkage between protein stability and fragment reconstitution and the ability of the yeast surface display technique to quantitatively evaluate protein-protein interactions. The method was applied to a fibronectin type III (FN3) domain. Characterization of fragment reconstitution was facilitated by the co-expression of two FN3 fragments, thus establishing a "yeast surface two-hybrid" method. Importantly, our method does not rely on competition between clones and thus eliminates a common limitation of high-throughput selection methods in which the most stable variants are predominantly recovered. Thus, it allows for the isolation of sequences that exhibits a desired level of stability. We identified over one hundred unique sequences for a β-bulge motif, which was significantly more informative than natural sequences of the FN3 family in revealing the sequence determinants for the β-bulge. Our method provides a powerful means to rapidly assess stability of many variants, to systematically assess contribution of different factors to protein stability and to enhance protein stability. PMID:18674545
An optimized protocol for generation and analysis of Ion Proton sequencing reads for RNA-Seq.
Yuan, Yongxian; Xu, Huaiqian; Leung, Ross Ka-Kit
2016-05-26
Previous studies compared running cost, time and other performance measures of popular sequencing platforms. However, comprehensive assessment of library construction and analysis protocols for Proton sequencing platform remains unexplored. Unlike Illumina sequencing platforms, Proton reads are heterogeneous in length and quality. When sequencing data from different platforms are combined, this can result in reads with various read length. Whether the performance of the commonly used software for handling such kind of data is satisfactory is unknown. By using universal human reference RNA as the initial material, RNaseIII and chemical fragmentation methods in library construction showed similar result in gene and junction discovery number and expression level estimated accuracy. In contrast, sequencing quality, read length and the choice of software affected mapping rate to a much larger extent. Unspliced aligner TMAP attained the highest mapping rate (97.27 % to genome, 86.46 % to transcriptome), though 47.83 % of mapped reads were clipped. Long reads could paradoxically reduce mapping in junctions. With reference annotation guide, the mapping rate of TopHat2 significantly increased from 75.79 to 92.09 %, especially for long (>150 bp) reads. Sailfish, a k-mer based gene expression quantifier attained highly consistent results with that of TaqMan array and highest sensitivity. We provided for the first time, the reference statistics of library preparation methods, gene detection and quantification and junction discovery for RNA-Seq by the Ion Proton platform. Chemical fragmentation performed equally well with the enzyme-based one. The optimal Ion Proton sequencing options and analysis software have been evaluated.
Concrete thawing studied by single-point ramped imaging.
Prado, P J; Balcom, B J; Beyea, S D; Armstrong, R L; Bremner, T W
1997-12-01
A series of two-dimensional images of proton distribution in a hardened concrete sample has been obtained during the thawing process (from -50 degrees C up to 11 degrees C). The SPRITE sequence is optimal for this study given the characteristic short relaxation times of water in this porous media (T2* < 200 micros and T1 < 3.6 ms). The relaxation parameters of the sample were determined in order to optimize the time efficiency of the sequence, permitting a 4-scan 64 x 64 acquisition in under 3 min. The image acquisition is fast on the time scale of the temperature evolution of the specimen. The frozen water distribution is quantified through a position based study of the image contrast. A multiple point acquisition method is presented and the signal sensitivity improvement is discussed.
2010-01-01
Background Usual methods for inferring species boundaries from molecular sequence data rely either on gene trees or on population genetic analyses. Another way of delimiting species, based on a view of species as "fields for recombination" (FFRs) characterized by mutual allelic exclusivity, was suggested in 1995 by Doyle. Here we propose to use haplowebs (haplotype networks with additional connections between haplotypes found co-occurring in heterozygous individuals) to visualize and delineate single-locus FFRs (sl-FFRs). Furthermore, we introduce a method to quantify the reliability of putative species boundaries according to the number of independent markers that support them, and illustrate this approach with a case study of taxonomically difficult corals of the genus Pocillopora collected around Clipperton Island (far eastern Pacific). Results One haploweb built from intron sequences of the ATP synthase β subunit gene revealed the presence of two sl-FFRs among our 74 coral samples, whereas a second one built from ITS sequences turned out to be composed of four sl-FFRs. As a third independent marker, we performed a combined analysis of two regions of the mitochondrial genome: since haplowebs are not suited to analyze non-recombining markers, individuals were sorted into four haplogroups according to their mitochondrial sequences. Among all possible bipartitions of our set of samples, thirteen were supported by at least one molecular dataset, none by two and only one by all three datasets: this congruent pattern obtained from independent nuclear and mitochondrial markers indicates that two species of Pocillopora are present in Clipperton. Conclusions Our approach builds on Doyle's method and extends it by introducing an intuitive, user-friendly graphical representation and by proposing a conceptual framework to analyze and quantify the congruence between sl-FFRs obtained from several independent markers. Like delineation methods based on population-level statistical approaches, our method can distinguish closely-related species that have not yet reached reciprocal monophyly at most or all of their loci; like tree-based approaches, it can yield meaningful conclusions using a number of independent markers as low as three. Future efforts will aim to develop programs that speed up the construction of haplowebs from FASTA sequence alignments and help perform the congruence analysis outlined in this article. PMID:21118572
Flot, Jean-François; Couloux, Arnaud; Tillier, Simon
2010-11-30
Usual methods for inferring species boundaries from molecular sequence data rely either on gene trees or on population genetic analyses. Another way of delimiting species, based on a view of species as "fields for recombination" (FFRs) characterized by mutual allelic exclusivity, was suggested in 1995 by Doyle. Here we propose to use haplowebs (haplotype networks with additional connections between haplotypes found co-occurring in heterozygous individuals) to visualize and delineate single-locus FFRs (sl-FFRs). Furthermore, we introduce a method to quantify the reliability of putative species boundaries according to the number of independent markers that support them, and illustrate this approach with a case study of taxonomically difficult corals of the genus Pocillopora collected around Clipperton Island (far eastern Pacific). One haploweb built from intron sequences of the ATP synthase β subunit gene revealed the presence of two sl-FFRs among our 74 coral samples, whereas a second one built from ITS sequences turned out to be composed of four sl-FFRs. As a third independent marker, we performed a combined analysis of two regions of the mitochondrial genome: since haplowebs are not suited to analyze non-recombining markers, individuals were sorted into four haplogroups according to their mitochondrial sequences. Among all possible bipartitions of our set of samples, thirteen were supported by at least one molecular dataset, none by two and only one by all three datasets: this congruent pattern obtained from independent nuclear and mitochondrial markers indicates that two species of Pocillopora are present in Clipperton. Our approach builds on Doyle's method and extends it by introducing an intuitive, user-friendly graphical representation and by proposing a conceptual framework to analyze and quantify the congruence between sl-FFRs obtained from several independent markers. Like delineation methods based on population-level statistical approaches, our method can distinguish closely-related species that have not yet reached reciprocal monophyly at most or all of their loci; like tree-based approaches, it can yield meaningful conclusions using a number of independent markers as low as three. Future efforts will aim to develop programs that speed up the construction of haplowebs from FASTA sequence alignments and help perform the congruence analysis outlined in this article.
NASA Astrophysics Data System (ADS)
Closek, C. J.; Langevin, S.; Burge, C. A.; Crosson, L.; White, S.; Friedman, C. S.
2016-02-01
Withering syndrome (WS), caused by the bacterium Candidatus Xenohaliotis californiensis, a Rickettsia-like organism (RLO), infects many species of abalone. Black abalone (Haliotis cracherodii), one of two endangered species of abalone, has experienced high population losses along the California coast due to WS. Recently, we observed reduced pathogenicity and mortality events in RLO-infected abalone when a novel bacteriophage (phage) was also present. To better understand phage-bacterium dynamics and develop more informative diagnostic tools, we sequenced the genome of the novel phage associated with the RLO responsible for WS. Metagenomic sequencing libraries were prepared with extracted genomic DNA from two experimentally infected H. cracherodii and phage sequences were enriched using hydroxyapatite chromatography normalization. Normalized libraries were individually barcoded and sequenced with Illumina MiSeq. Raw sequence reads were processed using VIrominer and de novo assembly produced one single phage-like contig (35.7Kb) from the experimentally infected abalone. This highly divergent genome had closest homology with a virus associated with abalone shriveling syndrome (SS). Of the 34 predicted ORFs, overlapping homology with the SS virus ranged from 20-72%, demonstrating the phage sequenced is genetically distinct from any known phage. The phage-like sequences represented a significant portion of the total reads sequenced ( 2 million of the 12 million paired-end reads; 17%) and we obtained 94,000X coverage across the novel phage genome. Beyond characterization of this novel phage, which appears to reduce pathogenicity of the RLO, the genome enabled us to develop quantitative PCR and in situ hybridization assays as diagnostic tools. These tools allow us to detect and quantify this phage in the endangered H. cracherodii.
Thomas, Austen C; Jarman, Simon N; Haman, Katherine H; Trites, Andrew W; Deagle, Bruce E
2014-08-01
Ecologists are increasingly interested in quantifying consumer diets based on food DNA in dietary samples and high-throughput sequencing of marker genes. It is tempting to assume that food DNA sequence proportions recovered from diet samples are representative of consumer's diet proportions, despite the fact that captive feeding studies do not support that assumption. Here, we examine the idea of sequencing control materials of known composition along with dietary samples in order to correct for technical biases introduced during amplicon sequencing and biological biases such as variable gene copy number. Using the Ion Torrent PGM(©) , we sequenced prey DNA amplified from scats of captive harbour seals (Phoca vitulina) fed a constant diet including three fish species in known proportions. Alongside, we sequenced a prey tissue mix matching the seals' diet to generate tissue correction factors (TCFs). TCFs improved the diet estimates (based on sequence proportions) for all species and reduced the average estimate error from 28 ± 15% (uncorrected) to 14 ± 9% (TCF-corrected). The experimental design also allowed us to infer the magnitude of prey-specific digestion biases and calculate digestion correction factors (DCFs). The DCFs were compared with possible proxies for differential digestion (e.g. fish protein%, fish lipid%) revealing a strong relationship between the DCFs and percent lipid of the fish prey, suggesting prey-specific corrections based on lipid content would produce accurate diet estimates in this study system. These findings demonstrate the value of parallel sequencing of food tissue mixtures in diet studies and offer new directions for future research in quantitative DNA diet analysis. © 2013 John Wiley & Sons Ltd.
Effect of extreme data loss on heart rate signals quantified by entropy analysis
NASA Astrophysics Data System (ADS)
Li, Yu; Wang, Jun; Li, Jin; Liu, Dazhao
2015-02-01
The phenomenon of data loss always occurs in the analysis of large databases. Maintaining the stability of analysis results in the event of data loss is very important. In this paper, we used a segmentation approach to generate a synthetic signal that is randomly wiped from data according to the Gaussian distribution and the exponential distribution of the original signal. Then, the logistic map is used as verification. Finally, two methods of measuring entropy-base-scale entropy and approximate entropy-are comparatively analyzed. Our results show the following: (1) Two key parameters-the percentage and the average length of removed data segments-can change the sequence complexity according to logistic map testing. (2) The calculation results have preferable stability for base-scale entropy analysis, which is not sensitive to data loss. (3) The loss percentage of HRV signals should be controlled below the range (p = 30 %), which can provide useful information in clinical applications.
A study of material damping in large space structures
NASA Technical Reports Server (NTRS)
Highsmith, A. L.; Allen, D. H.
1989-01-01
A constitutive model was developed for predicting damping as a function of damage in continuous fiber reinforced laminated composites. The damage model is a continuum formulation, and uses internal state variables to quantify damage and its subsequent effect on material response. The model is sensitive to the stacking sequence of the laminate. Given appropriate baseline data from unidirectional material, and damping as a function of damage in one crossply laminate, damage can be predicted as a function of damage in other crossply laminates. Agreement between theory and experiment was quite good. A micromechanics model was also developed for examining the influence of damage on damping. This model explicitly includes crack surfaces. The model provides reasonable predictions of bending stiffness as a function of damage. Damping predictions are not in agreement with the experiment. This is thought to be a result of dissipation mechanisms such as friction, which are not presently included in the analysis.
Ambiguities in the retrieval of rain rates from radar returns at attenuating wavelengths
NASA Technical Reports Server (NTRS)
Haddad, Z. S.; Im, E.; Durden, S. L.
1993-01-01
It is well-known that there are significant deterministic ambiguities inherent in trying to determine the particular rain rate profile which produced some given sequence of air- or space-borne radar echo powers at a single attenuating frequency. We quantify these ambiguities mathematically, and examine their effect on various proposed rain-rate profile retrieval algorithms. When the given data consist of a single radiometer measurement together with a single-look-angle single-frequency set of range-compressed echo powers, we show that several substantially different rain profiles can realistically be considered solutions. On the other hand, if the data consist of a single-look-angle two-frequency set of echo powers, the inversion problem generically has a unique solution. We note that traditional 'back-of-the-envelope' arguments can be quite misleading in assessing the extent of the ambiguity, even in the simplest cases.
Conservation genetics and genomics of amphibians and reptiles.
Shaffer, H Bradley; Gidiş, Müge; McCartney-Melstad, Evan; Neal, Kevin M; Oyamaguchi, Hilton M; Tellez, Marisa; Toffelmier, Erin M
2015-01-01
Amphibians and reptiles as a group are often secretive, reach their greatest diversity often in remote tropical regions, and contain some of the most endangered groups of organisms on earth. Particularly in the past decade, genetics and genomics have been instrumental in the conservation biology of these cryptic vertebrates, enabling work ranging from the identification of populations subject to trade and exploitation, to the identification of cryptic lineages harboring critical genetic variation, to the analysis of genes controlling key life history traits. In this review, we highlight some of the most important ways that genetic analyses have brought new insights to the conservation of amphibians and reptiles. Although genomics has only recently emerged as part of this conservation tool kit, several large-scale data sources, including full genomes, expressed sequence tags, and transcriptomes, are providing new opportunities to identify key genes, quantify landscape effects, and manage captive breeding stocks of at-risk species.
[Hydrologic variability and sensitivity based on Hurst coefficient and Bartels statistic].
Lei, Xu; Xie, Ping; Wu, Zi Yi; Sang, Yan Fang; Zhao, Jiang Yan; Li, Bin Bin
2018-04-01
Due to the global climate change and frequent human activities in recent years, the pure stochastic components of hydrological sequence is mixed with one or several of the variation ingredients, including jump, trend, period and dependency. It is urgently needed to clarify which indices should be used to quantify the degree of their variability. In this study, we defined the hydrological variability based on Hurst coefficient and Bartels statistic, and used Monte Carlo statistical tests to test and analyze their sensitivity to different variants. When the hydrological sequence had jump or trend variation, both Hurst coefficient and Bartels statistic could reflect the variation, with the Hurst coefficient being more sensitive to weak jump or trend variation. When the sequence had period, only the Bartels statistic could detect the mutation of the sequence. When the sequence had a dependency, both the Hurst coefficient and the Bartels statistics could reflect the variation, with the latter could detect weaker dependent variations. For the four variations, both the Hurst variability and Bartels variability increased with the increases of variation range. Thus, they could be used to measure the variation intensity of the hydrological sequence. We analyzed the temperature series of different weather stations in the Lancang River basin. Results showed that the temperature of all stations showed the upward trend or jump, indicating that the entire basin had experienced warming in recent years and the temperature variability in the upper and lower reaches was much higher. This case study showed the practicability of the proposed method.
pico-PLAZA, a genome database of microbial photosynthetic eukaryotes.
Vandepoele, Klaas; Van Bel, Michiel; Richard, Guilhem; Van Landeghem, Sofie; Verhelst, Bram; Moreau, Hervé; Van de Peer, Yves; Grimsley, Nigel; Piganeau, Gwenael
2013-08-01
With the advent of next generation genome sequencing, the number of sequenced algal genomes and transcriptomes is rapidly growing. Although a few genome portals exist to browse individual genome sequences, exploring complete genome information from multiple species for the analysis of user-defined sequences or gene lists remains a major challenge. pico-PLAZA is a web-based resource (http://bioinformatics.psb.ugent.be/pico-plaza/) for algal genomics that combines different data types with intuitive tools to explore genomic diversity, perform integrative evolutionary sequence analysis and study gene functions. Apart from homologous gene families, multiple sequence alignments, phylogenetic trees, Gene Ontology, InterPro and text-mining functional annotations, different interactive viewers are available to study genome organization using gene collinearity and synteny information. Different search functions, documentation pages, export functions and an extensive glossary are available to guide non-expert scientists. To illustrate the versatility of the platform, different case studies are presented demonstrating how pico-PLAZA can be used to functionally characterize large-scale EST/RNA-Seq data sets and to perform environmental genomics. Functional enrichments analysis of 16 Phaeodactylum tricornutum transcriptome libraries offers a molecular view on diatom adaptation to different environments of ecological relevance. Furthermore, we show how complementary genomic data sources can easily be combined to identify marker genes to study the diversity and distribution of algal species, for example in metagenomes, or to quantify intraspecific diversity from environmental strains. © 2013 John Wiley & Sons Ltd and Society for Applied Microbiology.
Hobbs, Jennifer A; Towal, R Blythe; Hartmann, Mitra J Z
2015-08-01
Analysis of natural scene statistics has been a powerful approach for understanding neural coding in the auditory and visual systems. In the field of somatosensation, it has been more challenging to quantify the natural tactile scene, in part because somatosensory signals are so tightly linked to the animal's movements. The present work takes a step towards quantifying the natural tactile scene for the rat vibrissal system by simulating rat whisking motions to systematically investigate the probabilities of whisker-object contact in naturalistic environments. The simulations permit an exhaustive search through the complete space of possible contact patterns, thereby allowing for the characterization of the patterns that would most likely occur during long sequences of natural exploratory behavior. We specifically quantified the probabilities of 'concomitant contact', that is, given that a particular whisker makes contact with a surface during a whisk, what is the probability that each of the other whiskers will also make contact with the surface during that whisk? Probabilities of concomitant contact were quantified in simulations that assumed increasingly naturalistic conditions: first, the space of all possible head poses; second, the space of behaviorally preferred head poses as measured experimentally; and third, common head poses in environments such as cages and burrows. As environments became more naturalistic, the probability distributions shifted from exhibiting a 'row-wise' structure to a more diagonal structure. Results also reveal that the rat appears to use motor strategies (e.g. head pitches) that generate contact patterns that are particularly well suited to extract information in the presence of uncertainty. © 2015. Published by The Company of Biologists Ltd.
Monneuse, Jean-Marc; Sugano, Madeleine; Becue, Thierry; Santoni, Véronique; Hem, Sonia; Rossignol, Michel
2011-05-01
Plant membranes bear a variety of transporters belonging to multigene families that are affected by environmental and nutritional conditions. In addition, they often display high-sequence identity, making difficult in-depth investigation by current shot-gun strategies. In this study, we set up a targeted proteomics approach aimed at identifying and quantifying within single experiments the five major proton pumps of the autoinhibited H(+) ATPases (AHA) family, the 13 plasma membrane intrinsic proteins (PIP) water channels (PIPs), and ten members of ammonium transporters (AMTs) and nitrate transporter (NRT) families. Proteotypic peptides were selected and isotopically labeled heavy versions were used for technical optimization and for quantification of the corresponding light version in biological samples. This approach allowed to quantify simultaneously nine PIPs in leaf membranes and 13 PIPs together with three autoinhibited H(+) ATPases, two ammonium transporters, and two NRTs in root membranes. Similarly, it was used to investigate the effect of a salt stress on the expression of these latter 20 transporters in roots. These novel isoform-specific data were compared with published transcriptome information and revealed a close correlation between PIP isoforms and transcripts levels. The obtained resource is reusable and can be expanded to other transporter families for large-scale profiling of membrane transporters. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Heterogeneous recurrence monitoring and control of nonlinear stochastic processes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Hui, E-mail: huiyang@usf.edu; Chen, Yun
Recurrence is one of the most common phenomena in natural and engineering systems. Process monitoring of dynamic transitions in nonlinear and nonstationary systems is more concerned with aperiodic recurrences and recurrence variations. However, little has been done to investigate the heterogeneous recurrence variations and link with the objectives of process monitoring and anomaly detection. Notably, nonlinear recurrence methodologies are based on homogeneous recurrences, which treat all recurrence states in the same way as black dots, and non-recurrence is white in recurrence plots. Heterogeneous recurrences are more concerned about the variations of recurrence states in terms of state properties (e.g., valuesmore » and relative locations) and the evolving dynamics (e.g., sequential state transitions). This paper presents a novel approach of heterogeneous recurrence analysis that utilizes a new fractal representation to delineate heterogeneous recurrence states in multiple scales, including the recurrences of both single states and multi-state sequences. Further, we developed a new set of heterogeneous recurrence quantifiers that are extracted from fractal representation in the transformed space. To that end, we integrated multivariate statistical control charts with heterogeneous recurrence analysis to simultaneously monitor two or more related quantifiers. Experimental results on nonlinear stochastic processes show that the proposed approach not only captures heterogeneous recurrence patterns in the fractal representation but also effectively monitors the changes in the dynamics of a complex system.« less
Aftershock collapse vulnerability assessment of reinforced concrete frame structures
Raghunandan, Meera; Liel, Abbie B.; Luco, Nicolas
2015-01-01
In a seismically active region, structures may be subjected to multiple earthquakes, due to mainshock–aftershock phenomena or other sequences, leaving no time for repair or retrofit between the events. This study quantifies the aftershock vulnerability of four modern ductile reinforced concrete (RC) framed buildings in California by conducting incremental dynamic analysis of nonlinear MDOF analytical models. Based on the nonlinear dynamic analysis results, collapse and damage fragility curves are generated for intact and damaged buildings. If the building is not severely damaged in the mainshock, its collapse capacity is unaffected in the aftershock. However, if the building is extensively damaged in the mainshock, there is a significant reduction in its collapse capacity in the aftershock. For example, if an RC frame experiences 4% or more interstory drift in the mainshock, the median capacity to resist aftershock shaking is reduced by about 40%. The study also evaluates the effectiveness of different measures of physical damage observed in the mainshock-damaged buildings for predicting the reduction in collapse capacity of the damaged building in subsequent aftershocks. These physical damage indicators for the building are chosen such that they quantify the qualitative red tagging (unsafe for occupation) criteria employed in post-earthquake evaluation of RC frames. The results indicated that damage indicators related to the drift experienced by the damaged building best predicted the reduced aftershock collapse capacities for these ductile structures.
Comparison of timing and force control of foot tapping between elderly and young subjects.
Takimoto, Koji; Takebayashi, Hideaki; Miyamoto, Kenzo; Takuma, Yutaka; Inoue, Yoshikazu; Miyamoto, Shoko; Okabe, Takao; Okuda, Takahiro; Kaba, Hideto
2016-06-01
[Purpose] To examine the ability of young and elderly individuals to control the timing and force of periodic sequential foot tapping. [Subjects and Methods] Participants were 10 young (age, 22.1 ± 4.3 years) and 10 elderly individuals (74.8 ± 6.7 years) who were healthy and active. The foot tapping task consisted of practice (stimulus-synchronized tapping with visual feedback) and recall trials (self-paced tapping without visual feedback), periodically performed in this order, at 500-, 1,000-, and 2,000-ms target interstimulus-onset intervals, with a target force of 20% maximum voluntary contraction of the ankle plantar-flexor muscle. [Results] The coefficients of variation of force and intertap interval, used for quantifying the steadiness of the trials, were significantly greater in the elderly than in the young individuals. At the 500-ms interstimulus-onset interval, age-related effects were observed on the normalized mean absolute error of force, which was used to quantify the accuracy of the trials. The coefficients of variation of intertap interval for elderly individuals were significantly greater in the practice than in the recall trials at the 500- and 1,000-ms interstimulus-onset intervals. [Conclusion] The elderly individuals exhibited greater force and timing variability than the young individuals and showed impaired visuomotor processing during foot tapping sequences.
Targeted RNA-Sequencing with Competitive Multiplex-PCR Amplicon Libraries
Blomquist, Thomas M.; Crawford, Erin L.; Lovett, Jennie L.; Yeo, Jiyoun; Stanoszek, Lauren M.; Levin, Albert; Li, Jia; Lu, Mei; Shi, Leming; Muldrew, Kenneth; Willey, James C.
2013-01-01
Whole transcriptome RNA-sequencing is a powerful tool, but is costly and yields complex data sets that limit its utility in molecular diagnostic testing. A targeted quantitative RNA-sequencing method that is reproducible and reduces the number of sequencing reads required to measure transcripts over the full range of expression would be better suited to diagnostic testing. Toward this goal, we developed a competitive multiplex PCR-based amplicon sequencing library preparation method that a) targets only the sequences of interest and b) controls for inter-target variation in PCR amplification during library preparation by measuring each transcript native template relative to a known number of synthetic competitive template internal standard copies. To determine the utility of this method, we intentionally selected PCR conditions that would cause transcript amplification products (amplicons) to converge toward equimolar concentrations (normalization) during library preparation. We then tested whether this approach would enable accurate and reproducible quantification of each transcript across multiple library preparations, and at the same time reduce (through normalization) total sequencing reads required for quantification of transcript targets across a large range of expression. We demonstrate excellent reproducibility (R2 = 0.997) with 97% accuracy to detect 2-fold change using External RNA Controls Consortium (ERCC) reference materials; high inter-day, inter-site and inter-library concordance (R2 = 0.97–0.99) using FDA Sequencing Quality Control (SEQC) reference materials; and cross-platform concordance with both TaqMan qPCR (R2 = 0.96) and whole transcriptome RNA-sequencing following “traditional” library preparation using Illumina NGS kits (R2 = 0.94). Using this method, sequencing reads required to accurately quantify more than 100 targeted transcripts expressed over a 107-fold range was reduced more than 10,000-fold, from 2.3×109 to 1.4×105 sequencing reads. These studies demonstrate that the competitive multiplex-PCR amplicon library preparation method presented here provides the quality control, reproducibility, and reduced sequencing reads necessary for development and implementation of targeted quantitative RNA-sequencing biomarkers in molecular diagnostic testing. PMID:24236095
Kaschani, Farnusch; Graupner, Nadine; Grossmann, Lars; Jensen, Manfred; Ninck, Sabrina; Schulz, Florian; Rahmann, Sven; Boenigk, Jens; Kaiser, Markus
2017-01-01
Aquatic environments are heavily impacted by human activities including climate warming and the introduction of xenobiotics. Due to the application of silver nanoparticles as bactericidal agent the introduction of silver into the environment strongly has increased during the past years. Silver ions affect the primary metabolism of algae, in particular photosynthesis. Mixotrophic algae are an interesting test case as they do not exclusively rely on photosynthesis which may attenuate the harmful effect of silver. In order to study the effect of silver ions on mixotrophs, cultures of the chrysophyte Poterioochromonas malhamensis were treated in a replicate design in light and darkness with silver nitrate at a sub-lethal concentration. At five time points samples were taken for the identification and quantitation of proteins by mass spectrometry. In our analysis, relative quantitative protein mass spectrometry has shown to be a useful tool for functional analyses in conjunction with transcriptome reference sequences. A total of 3,952 proteins in 63 samples were identified and quantified, mapping to 4,829 transcripts of the sequenced and assembled transcriptome. Among them, 720 and 104 proteins performing various cellular functions were differentially expressed after eight days in light versus darkness and after three days of silver treatment, respectively. Specifically pathways of the energy and primary carbon metabolism were differentially affected by light and the utilization of expensive reactions hints to an energy surplus of P. malhamensis under light conditions. The excess energy is not invested in growth, but in the synthesis of storage metabolites. The effects of silver were less explicit, observable especially in the dark treatments where the light effect could not mask coinciding but weaker effects of silver. Photosynthesis, particularly the light harvesting complexes, and several sulphur containing enzymes were affected presumably due to a direct interference with the silver ions, mainly affecting energy supply. PMID:28056027
Microbial Community Profiling of Human Saliva Using Shotgun Metagenomic Sequencing
Hasan, Nur A.; Young, Brian A.; Minard-Smith, Angela T.; Saeed, Kelly; Li, Huai; Heizer, Esley M.; McMillan, Nancy J.; Isom, Richard; Abdullah, Abdul Shakur; Bornman, Daniel M.; Faith, Seth A.; Choi, Seon Young; Dickens, Michael L.; Cebula, Thomas A.; Colwell, Rita R.
2014-01-01
Human saliva is clinically informative of both oral and general health. Since next generation shotgun sequencing (NGS) is now widely used to identify and quantify bacteria, we investigated the bacterial flora of saliva microbiomes of two healthy volunteers and five datasets from the Human Microbiome Project, along with a control dataset containing short NGS reads from bacterial species representative of the bacterial flora of human saliva. GENIUS, a system designed to identify and quantify bacterial species using unassembled short NGS reads was used to identify the bacterial species comprising the microbiomes of the saliva samples and datasets. Results, achieved within minutes and at greater than 90% accuracy, showed more than 175 bacterial species comprised the bacterial flora of human saliva, including bacteria known to be commensal human flora but also Haemophilus influenzae, Neisseria meningitidis, Streptococcus pneumoniae, and Gamma proteobacteria. Basic Local Alignment Search Tool (BLASTn) analysis in parallel, reported ca. five times more species than those actually comprising the in silico sample. Both GENIUSand BLAST analyses of saliva samples identified major genera comprising the bacterial flora of saliva, but GENIUS provided a more precise description of species composition, identifying to strain in most cases and delivered results at least 10,000 times faster. Therefore, GENIUS offers a facile and accurate system for identification and quantification of bacterial species and/or strains in metagenomic samples. PMID:24846174
Assessment of alveolar bone marrow fat content using 15 T MRI.
Cortes, Arthur Rodriguez Gonzalez; Cohen, Ouri; Zhao, Ming; Aoki, Eduardo Massaharu; Ribeiro, Rodrigo Alves; Abu Nada, Lina; Costa, Claudio; Arita, Emiko Saito; Tamimi, Faleh; Ackerman, Jerome L
2018-03-01
Bone marrow fat is inversely correlated with bone mineral density. The aim of this study is to present a method to quantify alveolar bone marrow fat content using a 15 T magnetic resonance imaging (MRI) scanner. A 15 T MRI scanner with a 13-mm inner diameter loop-gap radiofrequency coil was used to scan seven 3-mm diameter alveolar bone biopsy specimens. A 3-D gradient-echo relaxation time (T1)-weighted pulse sequence was chosen to obtain images. All images were obtained with a voxel size (58 µm 3 ) sufficient to resolve trabecular spaces. Automated volume of the bone marrow fat content and derived bone volume fraction (BV/TV) were calculated. Results were compared with actual BV/TV obtained from micro-computed tomography (CT) scans. Mean fat tissue volume was 20.1 ± 11%. There was a significantly strong inverse correlation between fat tissue volume and BV/TV (r = -0.68; P = .045). Furthermore, there was a strong agreement between BV/TV derived from MRI and obtained with micro-CT (interclass correlation coefficient = 0.92; P = .001). Bone marrow fat of small alveolar bone biopsy specimens can be quantified with sufficient spatial resolution using an ultra-high-field MRI scanner and a T1-weighted pulse sequence. Copyright © 2017 Elsevier Inc. All rights reserved.
Cassler, M; Peterson, C L; Ledger, A; Pomponi, S A; Wright, A E; Winegar, R; McCarthy, P J; Lopez, J V
2008-04-01
In this report, real-time quantitative PCR (TaqMan qPCR) of the small subunit (SSU) 16S-like rRNA molecule, a universal phylogenetic marker, was used to quantify the relative abundance of individual bacterial members of a diverse, yet mostly unculturable, microbial community from a marine sponge. Molecular phylogenetic analyses of bacterial communities derived from Caribbean Lithistid sponges have shown a wide diversity of microbes that included at least six major subdivisions; however, very little overlap was observed between the culturable and unculturable microbial communities. Based on sequence data of three culture-independent Lithistid-derived representative bacteria, we designed probe/primer sets for TaqMan qPCR to quantitatively characterize selected microbial residents in a Lithistid sponge, Vetulina, metagenome. TaqMan assays included specificity testing, DNA limit of detection analysis, and quantification of specific microbial rRNA sequences such as Nitrospira-like microbes and Actinobacteria up to 172 million copies per microgram per Lithistid sponge metagenome. By contrast, qPCR amplification with probes designed for common previously cultured sponge-associated bacteria in the genera Rheinheimera and Marinomonas and a representative of the CFB group resulted in only minimal detection of the Rheiheimera in total DNA extracted from the sponge. These data verify that a large portion of the microbial community within Lithistid sponges may consist of currently unculturable microorganisms.
Fast 3D 1H MRSI of the Corticospinal Tract in Pediatric Brain
Kim, Dong-Hyun; Gu, Meng; Cunningham, Charles; Chen, Albert; Baumer, Fiona; Glenn, Orit A.; Vigneron, Daniel B.; Spielman, Daniel Mark; Barkovich, Anthony James
2010-01-01
Purpose: To develop a 1H magnetic resonance spectroscopic imaging (MRSI) sequence that can be used to image infants/children at 3T and by combining it with diffusion tensor imaging (DTI) tractography, extract relevant metabolic information corresponding to the corticospinal tract (CST). Materials and Methods: A fast 3D MRSI sequence was developed for pediatric neuroimaging at 3T using spiral k-space readout and dual band RF pulses (32 × 32 × 8 cm field of view [FOV], 1 cc iso-resolution, TR/TE = 1500/130, 6:24 minute scan). Using DTI tractography to identify the motor tracts, spectra were extracted from the CSTs and quantified. Initial data from infants/children with suspected motor delay (n = 5) and age-matched controls (n = 3) were collected and N-acetylaspartate (NAA) ratios were quantified. Results: The average signal-to-noise ratio of the NAA peak from the studies was ≈22. Metabolite profiles were successfully acquired from the CST by using DTI tractography. Decreased NAA ratios in those with motor delay compared to controls of ≈10% at the CST were observed. Conclusion: A fast and robust 3D MRSI technique targeted for pediatric neuroimaging has been developed. By combining with DTI tractography, metabolic information from the CSTs can be retrieved and estimated. By combining DTI and 3D MRSI, spectral information from various tracts can be obtained and processed. PMID:19097091
Orellana, Luis H.; Rodriguez-R, Luis M.; Konstantinidis, Konstantinos T.
2016-10-07
Functional annotation of metagenomic and metatranscriptomic data sets relies on similarity searches based on e-value thresholds resulting in an unknown number of false positive and negative matches. To overcome these limitations, we introduce ROCker, aimed at identifying position-specific, most-discriminant thresholds in sliding windows along the sequence of a target protein, accounting for non-discriminative domains shared by unrelated proteins. ROCker employs the receiver operating characteristic (ROC) curve to minimize false discovery rate (FDR) and calculate the best thresholds based on how simulated shotgun metagenomic reads of known composition map onto well-curated reference protein sequences and thus, differs from HMM profiles andmore » related methods. We showcase ROCker using ammonia monooxygenase (amoA) and nitrous oxide reductase (nosZ) genes, mediating oxidation of ammonia and the reduction of the potent greenhouse gas, N 2O, to inert N 2, respectively. ROCker typically showed 60-fold lower FDR when compared to the common practice of using fixed e-values. Previously uncounted ‘atypical’ nosZ genes were found to be two times more abundant, on average, than their typical counterparts in most soil metagenomes and the abundance of bacterial amoA was quantified against the highly-related particulate methane monooxygenase (pmoA). Therefore, ROCker can reliably detect and quantify target genes in short-read metagenomes.« less
Watzinger, Franz; Hörth, Elfriede; Lion, Thomas
2001-01-01
Despite the recent introduction of real-time PCR methods, competitive PCR techniques continue to play an important role in nucleic acid quantification because of the significantly lower cost of equipment and consumables. Here we describe a shifted restriction-site competitive PCR (SRS-cPCR) assay based on a modified type of competitor. The competitor fragments are designed to contain a recognition site for a restriction endonuclease that is also present in the target sequence to be quantified, but in a different position. Upon completion of the PCR, the amplicons are digested in the same tube with a single restriction enzyme, without the need to purify PCR products. The generated competitor- and target-specific restriction fragments display different sizes, and can be readily separated by electrophoresis and quantified by image analysis. Suboptimal digestion affects competitor- and target-derived amplicons to the same extent, thus eliminating the problem of incorrect quantification as a result of incomplete digestion of PCR products. We have established optimized conditions for a panel of 20 common restriction endonucleases permitting efficient digestion in PCR buffer. It is possible, therefore, to find a suitable restriction site for competitive PCR in virtually any sequence of interest. The assay presented is inexpensive, widely applicable, and permits reliable and accurate quantification of nucleic acid targets. PMID:11376164
DOE Office of Scientific and Technical Information (OSTI.GOV)
Orellana, Luis H.; Rodriguez-R, Luis M.; Konstantinidis, Konstantinos T.
Functional annotation of metagenomic and metatranscriptomic data sets relies on similarity searches based on e-value thresholds resulting in an unknown number of false positive and negative matches. To overcome these limitations, we introduce ROCker, aimed at identifying position-specific, most-discriminant thresholds in sliding windows along the sequence of a target protein, accounting for non-discriminative domains shared by unrelated proteins. ROCker employs the receiver operating characteristic (ROC) curve to minimize false discovery rate (FDR) and calculate the best thresholds based on how simulated shotgun metagenomic reads of known composition map onto well-curated reference protein sequences and thus, differs from HMM profiles andmore » related methods. We showcase ROCker using ammonia monooxygenase (amoA) and nitrous oxide reductase (nosZ) genes, mediating oxidation of ammonia and the reduction of the potent greenhouse gas, N 2O, to inert N 2, respectively. ROCker typically showed 60-fold lower FDR when compared to the common practice of using fixed e-values. Previously uncounted ‘atypical’ nosZ genes were found to be two times more abundant, on average, than their typical counterparts in most soil metagenomes and the abundance of bacterial amoA was quantified against the highly-related particulate methane monooxygenase (pmoA). Therefore, ROCker can reliably detect and quantify target genes in short-read metagenomes.« less
2017-01-01
Abstract Functional annotation of metagenomic and metatranscriptomic data sets relies on similarity searches based on e-value thresholds resulting in an unknown number of false positive and negative matches. To overcome these limitations, we introduce ROCker, aimed at identifying position-specific, most-discriminant thresholds in sliding windows along the sequence of a target protein, accounting for non-discriminative domains shared by unrelated proteins. ROCker employs the receiver operating characteristic (ROC) curve to minimize false discovery rate (FDR) and calculate the best thresholds based on how simulated shotgun metagenomic reads of known composition map onto well-curated reference protein sequences and thus, differs from HMM profiles and related methods. We showcase ROCker using ammonia monooxygenase (amoA) and nitrous oxide reductase (nosZ) genes, mediating oxidation of ammonia and the reduction of the potent greenhouse gas, N2O, to inert N2, respectively. ROCker typically showed 60-fold lower FDR when compared to the common practice of using fixed e-values. Previously uncounted ‘atypical’ nosZ genes were found to be two times more abundant, on average, than their typical counterparts in most soil metagenomes and the abundance of bacterial amoA was quantified against the highly-related particulate methane monooxygenase (pmoA). Therefore, ROCker can reliably detect and quantify target genes in short-read metagenomes. PMID:28180325
Quantification of fetal heart rate regularity using symbolic dynamics
NASA Astrophysics Data System (ADS)
van Leeuwen, P.; Cysarz, D.; Lange, S.; Geue, D.; Groenemeyer, D.
2007-03-01
Fetal heart rate complexity was examined on the basis of RR interval time series obtained in the second and third trimester of pregnancy. In each fetal RR interval time series, short term beat-to-beat heart rate changes were coded in 8bit binary sequences. Redundancies of the 28 different binary patterns were reduced by two different procedures. The complexity of these sequences was quantified using the approximate entropy (ApEn), resulting in discrete ApEn values which were used for classifying the sequences into 17 pattern sets. Also, the sequences were grouped into 20 pattern classes with respect to identity after rotation or inversion of the binary value. There was a specific, nonuniform distribution of the sequences in the pattern sets and this differed from the distribution found in surrogate data. In the course of gestation, the number of sequences increased in seven pattern sets, decreased in four and remained unchanged in six. Sequences that occurred less often over time, both regular and irregular, were characterized by patterns reflecting frequent beat-to-beat reversals in heart rate. They were also predominant in the surrogate data, suggesting that these patterns are associated with stochastic heart beat trains. Sequences that occurred more frequently over time were relatively rare in the surrogate data. Some of these sequences had a high degree of regularity and corresponded to prolonged heart rate accelerations or decelerations which may be associated with directed fetal activity or movement or baroreflex activity. Application of the pattern classes revealed that those sequences with a high degree of irregularity correspond to heart rate patterns resulting from complex physiological activity such as fetal breathing movements. The results suggest that the development of the autonomic nervous system and the emergence of fetal behavioral states lead to increases in not only irregular but also regular heart rate patterns. Using symbolic dynamics to examine the cardiovascular system may thus lead to new insight with respect to fetal development.
Protein 3D Structure Computed from Evolutionary Sequence Variation
Sheridan, Robert; Hopf, Thomas A.; Pagnani, Andrea; Zecchina, Riccardo; Sander, Chris
2011-01-01
The evolutionary trajectory of a protein through sequence space is constrained by its function. Collections of sequence homologs record the outcomes of millions of evolutionary experiments in which the protein evolves according to these constraints. Deciphering the evolutionary record held in these sequences and exploiting it for predictive and engineering purposes presents a formidable challenge. The potential benefit of solving this challenge is amplified by the advent of inexpensive high-throughput genomic sequencing. In this paper we ask whether we can infer evolutionary constraints from a set of sequence homologs of a protein. The challenge is to distinguish true co-evolution couplings from the noisy set of observed correlations. We address this challenge using a maximum entropy model of the protein sequence, constrained by the statistics of the multiple sequence alignment, to infer residue pair couplings. Surprisingly, we find that the strength of these inferred couplings is an excellent predictor of residue-residue proximity in folded structures. Indeed, the top-scoring residue couplings are sufficiently accurate and well-distributed to define the 3D protein fold with remarkable accuracy. We quantify this observation by computing, from sequence alone, all-atom 3D structures of fifteen test proteins from different fold classes, ranging in size from 50 to 260 residues., including a G-protein coupled receptor. These blinded inferences are de novo, i.e., they do not use homology modeling or sequence-similar fragments from known structures. The co-evolution signals provide sufficient information to determine accurate 3D protein structure to 2.7–4.8 Å Cα-RMSD error relative to the observed structure, over at least two-thirds of the protein (method called EVfold, details at http://EVfold.org). This discovery provides insight into essential interactions constraining protein evolution and will facilitate a comprehensive survey of the universe of protein structures, new strategies in protein and drug design, and the identification of functional genetic variants in normal and disease genomes. PMID:22163331
Gaiser, Maria Rita; Skorokhod, Alexander; Gransheier, Diana; Weide, Benjamin; Koch, Winfried; Schif, Birgit; Enk, Alexander; Garbe, Claus; Bauer, Jürgen
2017-01-01
The incidence of melanoma, particularly in older patients, has steadily increased over the past few decades. Activating mutations of BRAF, the majority occurring in BRAFV600, are frequently detected in melanoma; however, the prognostic significance remains unclear. This study aimed to define the probability and distribution of BRAFV600 mutations, and the clinico-pathological factors that may affect BRAF mutation status, in patients with advanced melanoma using next-generation sequencing. This was a non-interventional, retrospective study of BRAF mutation testing at two German centers, in Heidelberg and Tübingen. Archival tumor samples from patients with histologically confirmed melanoma (stage IIIB, IIIC, IV) were analyzed using PCR amplification and deep sequencing. Clinical, histological, and mutation data were collected. The statistical influence of patient- and tumor-related characteristics on BRAFV600 mutation status was assessed using multiple logistic regression (MLR) and a prediction profiler. BRAFV600 mutation status was assessed in 453 samples. Mutations were detected in 57.6% of patients (n = 261), with 48.1% (n = 102) at the Heidelberg site and 66.0% (n = 159) at the Tübingen site. The decreasing influence of increasing age on mutation probability was quantified. A main effects MLR model identified age (p = 0.0001), center (p = 0.0004), and melanoma subtype (p = 0.014) as significantly influencing BRAFV600 mutation probability; ultraviolet (UV) exposure showed a statistical trend (p = 0.1419). An interaction model of age versus other variables showed that center (p<0.0001) and melanoma subtype (p = 0.0038) significantly influenced BRAF mutation probability; age had a statistically significant effect only as part of an interaction with both UV exposure (p = 0.0110) and melanoma subtype (p = 0.0134). This exploratory study highlights that testing center, melanoma subtype, and age in combination with UV exposure and melanoma subtype significantly influence BRAFV600 mutation probability in patients with melanoma. Further validation of this model, in terms of reproducibility and broader relevance, is required.
2015-01-01
Do negative quantifiers like “few” reduce people’s ability to rapidly evaluate incoming language with respect to world knowledge? Previous research has addressed this question by examining whether online measures of quantifier comprehension match the “final” interpretation reflected in verification judgments. However, these studies confounded quantifier valence with its impact on the unfolding expectations for upcoming words, yielding mixed results. In the current event-related potentials study, participants read negative and positive quantifier sentences matched on cloze probability and on truth-value (e.g., “Most/Few gardeners plant their flowers during the spring/winter for best results”). Regardless of whether participants explicitly verified the sentences or not, true-positive quantifier sentences elicited reduced N400s compared with false-positive quantifier sentences, reflecting the facilitated semantic retrieval of words that render a sentence true. No such facilitation was seen in negative quantifier sentences. However, mixed-effects model analyses (with cloze value and truth-value as continuous predictors) revealed that decreasing cloze values were associated with an interaction pattern between truth-value and quantifier, whereas increasing cloze values were associated with more similar truth-value effects regardless of quantifier. Quantifier sentences are thus understood neither always in 2 sequential stages, nor always in a partial-incremental fashion, nor always in a maximally incremental fashion. Instead, and in accordance with prediction-based views of sentence comprehension, quantifier sentence comprehension depends on incorporation of quantifier meaning into an online, knowledge-based prediction for upcoming words. Fully incremental quantifier interpretation occurs when quantifiers are incorporated into sufficiently strong online predictions for upcoming words. PMID:26375784
Brossard, Patrick; Weitkunat, Rolf; Poux, Valerie; Lama, Nicola; Haziza, Christelle; Picavet, Patrick; Baker, Gizelle; Lüdicke, Frank
2017-10-01
Two open-label randomized cross-over studies in Japanese smokers investigated the single-use nicotine pharmacokinetic profile of the Tobacco Heating System (THS) 2.2, cigarettes (CC) and nicotine replacement therapy (Gum). In each study, one on the regular and one on the menthol variants of the THS and CC, both using Gum as reference, 62 subjects were randomized to four sequences: Sequence 1: THS - CC (n = 22); Sequence 2: CC - THS (n = 22); Sequence 3: THS - Gum (n = 9); Sequence 4: Gum - THS (n = 9). Plasma nicotine concentrations were measured in 16 blood samples collected over 24 h after single use. Maximal nicotine concentration (C max ) and area under the curve from start of product use to time of last quantifiable concentration (AUC 0-last ) were similar between THS and CC in both studies, with ratios varying from 88 to 104% for C max and from 96 to 98% for AUC 0-last . Urge-to-smoke total scores were comparable between THS and CC. The THS nicotine pharmacokinetic profile was close to CC, with similar levels of urge-to-smoke. This suggests that THS can satisfy smokers and be a viable alternative to cigarettes for adult smokers who want to continue using tobacco. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Statistical and linguistic features of DNA sequences
NASA Technical Reports Server (NTRS)
Havlin, S.; Buldyrev, S. V.; Goldberger, A. L.; Mantegna, R. N.; Peng, C. K.; Simons, M.; Stanley, H. E.
1995-01-01
We present evidence supporting the idea that the DNA sequence in genes containing noncoding regions is correlated, and that the correlation is remarkably long range--indeed, base pairs thousands of base pairs distant are correlated. We do not find such a long-range correlation in the coding regions of the gene. We resolve the problem of the "non-stationary" feature of the sequence of base pairs by applying a new algorithm called Detrended Fluctuation Analysis (DFA). We address the claim of Voss that there is no difference in the statistical properties of coding and noncoding regions of DNA by systematically applying the DFA algorithm, as well as standard FFT analysis, to all eukaryotic DNA sequences (33 301 coding and 29 453 noncoding) in the entire GenBank database. We describe a simple model to account for the presence of long-range power-law correlations which is based upon a generalization of the classic Levy walk. Finally, we describe briefly some recent work showing that the noncoding sequences have certain statistical features in common with natural languages. Specifically, we adapt to DNA the Zipf approach to analyzing linguistic texts, and the Shannon approach to quantifying the "redundancy" of a linguistic text in terms of a measurable entropy function. We suggest that noncoding regions in plants and invertebrates may display a smaller entropy and larger redundancy than coding regions, further supporting the possibility that noncoding regions of DNA may carry biological information.
Cultivating microbial dark matter in benzene-degrading methanogenic consortia.
Luo, Fei; Devine, Cheryl E; Edwards, Elizabeth A
2016-09-01
The microbes responsible for anaerobic benzene biodegradation remain poorly characterized. In this study, we identified and quantified microbial populations in a series of 16 distinct methanogenic, benzene-degrading enrichment cultures using a combination of traditional 16S rRNA clone libraries (four cultures), pyrotag 16S rRNA amplicon sequencing (11 cultures), metagenome sequencing (1 culture) and quantitative polymerase chain reaction (qPCR; 12 cultures). An operational taxonomic unit (OTU) from the Deltaproteobacteria designated ORM2 that is only 84% to 86% similar to Syntrophus or Desulfobacterium spp. was consistently identified in all enrichment cultures, and typically comprised more than half of the bacterial sequences. In addition to ORM2, a sequence belonging to Parcubacteria (candidate division OD1) identified from the metagenome data was the only other OTU common to all the cultures surveyed. Culture transfers (1% and 0.1%) were made in the presence and absence of benzene, and the abundance of ORM2, OD1 and other OTUs was tracked over 415 days using qPCR. ORM2 sequence abundance increased only when benzene was present, while the abundance of OD1 and other OTUs increased even in the absence of benzene. Deltaproteobacterium ORM2 is unequivocally the benzene-metabolizing population. This study also hints at laboratory cultivation conditions for a member of the widely distributed yet uncultivated Parcubacteria (OD1). © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.
Smeragliuolo, Anna H.; Long, John Davis; Bumanlag, Silverio Joseph; He, Victor; Lampe, Anna
2017-01-01
The objective of this study was to determine whether kinematic data collected by the Microsoft Kinect 2 (MK2) could be used to quantify postural stability in healthy subjects. Twelve subjects were recruited for the project, and were instructed to perform a sequence of simple postural stability tasks. The movement sequence was performed as subjects were seated on top of a force platform, and the MK2 was positioned in front of them. This sequence of tasks was performed by each subject under three different postural conditions: “both feet on the ground” (1), “One foot off the ground” (2), and “both feet off the ground” (3). We compared force platform and MK2 data to quantify the degree to which the MK2 was returning reliable data across subjects. We then applied a novel machine-learning paradigm to the MK2 data in order to determine the extent to which data from the MK2 could be used to reliably classify different postural conditions. Our initial comparison of force plate and MK2 data showed a strong agreement between the two devices, with strong Pearson correlations between the trunk centroids “Spine_Mid” (0.85 ± 0.06), “Neck” (0.86 ± 0.07) and “Head” (0.87 ± 0.07), and the center of pressure centroid inferred by the force platform. Mean accuracy for the machine learning classifier from MK2 was 97.0%, with a specific classification accuracy breakdown of 90.9%, 100%, and 100% for conditions 1 through 3, respectively. Mean accuracy for the machine learning classifier derived from the force platform data was lower at 84.4%. We conclude that data from the MK2 has sufficient information content to allow us to classify sequences of tasks being performed under different levels of postural stability. Future studies will focus on validating this protocol on large populations of individuals with actual balance impairments in order to create a toolkit that is clinically validated and available to the medical community. PMID:28196139
Dehbandi, Behdad; Barachant, Alexandre; Smeragliuolo, Anna H; Long, John Davis; Bumanlag, Silverio Joseph; He, Victor; Lampe, Anna; Putrino, David
2017-01-01
The objective of this study was to determine whether kinematic data collected by the Microsoft Kinect 2 (MK2) could be used to quantify postural stability in healthy subjects. Twelve subjects were recruited for the project, and were instructed to perform a sequence of simple postural stability tasks. The movement sequence was performed as subjects were seated on top of a force platform, and the MK2 was positioned in front of them. This sequence of tasks was performed by each subject under three different postural conditions: "both feet on the ground" (1), "One foot off the ground" (2), and "both feet off the ground" (3). We compared force platform and MK2 data to quantify the degree to which the MK2 was returning reliable data across subjects. We then applied a novel machine-learning paradigm to the MK2 data in order to determine the extent to which data from the MK2 could be used to reliably classify different postural conditions. Our initial comparison of force plate and MK2 data showed a strong agreement between the two devices, with strong Pearson correlations between the trunk centroids "Spine_Mid" (0.85 ± 0.06), "Neck" (0.86 ± 0.07) and "Head" (0.87 ± 0.07), and the center of pressure centroid inferred by the force platform. Mean accuracy for the machine learning classifier from MK2 was 97.0%, with a specific classification accuracy breakdown of 90.9%, 100%, and 100% for conditions 1 through 3, respectively. Mean accuracy for the machine learning classifier derived from the force platform data was lower at 84.4%. We conclude that data from the MK2 has sufficient information content to allow us to classify sequences of tasks being performed under different levels of postural stability. Future studies will focus on validating this protocol on large populations of individuals with actual balance impairments in order to create a toolkit that is clinically validated and available to the medical community.
Uncertainty and sensitivity analysis for photovoltaic system modeling.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hansen, Clifford W.; Pohl, Andrew Phillip; Jordan, Dirk
2013-12-01
We report an uncertainty and sensitivity analysis for modeling DC energy from photovoltaic systems. We consider two systems, each comprised of a single module using either crystalline silicon or CdTe cells, and located either at Albuquerque, NM, or Golden, CO. Output from a PV system is predicted by a sequence of models. Uncertainty in the output of each model is quantified by empirical distributions of each model's residuals. We sample these distributions to propagate uncertainty through the sequence of models to obtain an empirical distribution for each PV system's output. We considered models that: (1) translate measured global horizontal, directmore » and global diffuse irradiance to plane-of-array irradiance; (2) estimate effective irradiance from plane-of-array irradiance; (3) predict cell temperature; and (4) estimate DC voltage, current and power. We found that the uncertainty in PV system output to be relatively small, on the order of 1% for daily energy. Four alternative models were considered for the POA irradiance modeling step; we did not find the choice of one of these models to be of great significance. However, we observed that the POA irradiance model introduced a bias of upwards of 5% of daily energy which translates directly to a systematic difference in predicted energy. Sensitivity analyses relate uncertainty in the PV system output to uncertainty arising from each model. We found that the residuals arising from the POA irradiance and the effective irradiance models to be the dominant contributors to residuals for daily energy, for either technology or location considered. This analysis indicates that efforts to reduce the uncertainty in PV system output should focus on improvements to the POA and effective irradiance models.« less
Tran-Duy, An; Boonen, Annelies; Kievit, Wietske; van Riel, Piet L C M; van de Laar, Mart A F J; Severens, Johan L
2014-10-01
Management of rheumatoid arthritis (RA) is characterised by a sequence of disease-modifying antirheumatic drugs (DMARDs) and biological response modifiers (BRMs). In most of the Western countries, the drug sequences are determined based on disease activity and treatment history of the patients. A model for realistic patient outcomes should reflect the treatment pathways relevant for patients with specific characteristics. This study aimed at developing a model that could simulate long-term patient outcomes and cost effectiveness of treatment strategies with and without inclusion of BRMs following a clinical guideline for treatment decisions. Discrete event simulation taking into account patient characteristics and treatment history was used for model development. Treatment effect on disease activity, costs, health utilities and times to events were estimated using Dutch observational studies. Long-term progression of physical functioning was quantified using a linear mixed-effects model. Costs and health utilities were estimated using two-part models. The treatment strategy recommended by the Dutch Society for Rheumatology where both DMARDs and BRMs were available (Strategy 2) was compared with the treatment strategy without BRMs (Strategy 1). Ten thousand theoretical patients were tracked individually until death. In the probabilistic sensitivity analysis, Monte Carlo simulations were performed with 1,000 sets of parameters sampled from appropriate probability distributions. The simulated changes over time in disease activity and physical functioning were plausible. The incremental cost per quality-adjusted life-year gained of Strategy 2 compared with Strategy 1 was
Characterization of the Core and Caste-Specific Microbiota in the Termite, Reticulitermes flavipes
Benjamino, Jacquelynn; Graf, Joerg
2016-01-01
The hindgut of the termite Reticulitermes flavipes harbors a complex symbiotic community consisting of protists, bacteria, and archaea. These symbionts aid in the digestion of lignocellulose from the termite’s wood meal. Termite hindguts were sampled and the V4 hyper-variable region of the 16S rRNA gene was sequenced and analyzed from individual termites. The core microbiota of worker termites consisted of 69 OTUs at the 97% identity level, grouped into 16 taxa, and together accounted for 67.05% of the sequences from the bacterial community. The core was dominated by Treponema, which contained 36 different OTUs and accounted for ∼32% of the sequences, which suggests Treponema sp. have an important impact on the overall physiology in the hindgut. Bray–Curtis beta diversity metrics showed that hindgut samples from termites of the same colony were more similar to each other than to samples from other colonies despite possessing a core that accounted for the majority of the sequences. The specific tasks and dietary differences of the termite castes could have an effect on the composition of the microbial community. The hindgut microbiota of termites from the alate castes differed from the worker caste with significantly lower abundances of Treponema and Endomicrobia, which dominated the hindgut microbiota in workers and soldiers. Protist abundances were also quantified in the same samples using qPCR of the 18S rRNA gene. Parabasalia abundances dropped significantly in the winged alates and the Oxymonadida abundances dropped in both alate castes. These data suggest that the changes in diet or overall host physiology affected the protist and bacterial populations in the hindgut. The in-depth bacterial characterization and protist quantification in this study sheds light on the potential community dynamics within the R. flavipes hindgut and identified a large and complex core microbiota in termites obtained from multiple colonies and castes. PMID:26925043
The spread of hepatitis C virus genotype 1a in North America: a retrospective phylogenetic study.
Joy, Jeffrey B; McCloskey, Rosemary M; Nguyen, Thuy; Liang, Richard H; Khudyakov, Yury; Olmstead, Andrea; Krajden, Mel; Ward, John W; Harrigan, P Richard; Montaner, Julio S G; Poon, Art F Y
2016-06-01
The timing of the initial spread of hepatitis C virus genotype 1a in North America is controversial. In particular, how and when hepatitis C virus reached extraordinary prevalence in specific demographic groups remains unclear. We quantified, using all available hepatitis C virus sequence data and phylodynamic methods, the timing of the spread of hepatitis C virus genotype 1a in North America. We screened 45 316 publicly available sequences of hepatitis C virus genotype 1a for location and genotype, and then did phylogenetic analyses of available North American sequences from five hepatitis C virus genes (E1, E2, NS2, NS4B, NS5B), with an emphasis on including as many sequences with early collection dates as possible. We inferred the historical population dynamics of this epidemic for all five gene regions using Bayesian skyline plots. Most of the spread of genotype 1a in North America occurred before 1965, and the hepatitis C virus epidemic has undergone relatively little expansion since then. The effective population size of the North American epidemic stabilised around 1960. These results were robust across all five gene regions analysed, although analyses of each gene separately show substantial variation in estimates of the timing of the early exponential growth, ranging roughly from 1940 for NS2, to 1965 for NS4B. The expansion of genotype 1a before 1965 suggests that nosocomial or iatrogenic factors rather than past sporadic behavioural risk (ie, experimentation with injection drug use, unsafe tattooing, high risk sex, travel to high endemic areas) were key contributors to the hepatitis C virus epidemic in North America. Our results might reduce stigmatisation around screening and diagnosis, potentially increasing rates of screening and treatment for hepatitis C virus. The Canadian Institutes of Health Research, Michael Smith Foundation for Health Research, and BC Centre for Excellence in HIV/AIDS. Copyright © 2016 Elsevier Ltd. All rights reserved.
Sodium and T1rho MRI for molecular and diagnostic imaging of articular cartilage.
Borthakur, Arijitt; Mellon, Eric; Niyogi, Sampreet; Witschey, Walter; Kneeland, J Bruce; Reddy, Ravinder
2006-11-01
In this article, both sodium magnetic resonance (MR) and T1rho relaxation mapping aimed at measuring molecular changes in cartilage for the diagnostic imaging of osteoarthritis are reviewed. First, an introduction to structure of cartilage, its degeneration in osteoarthritis (OA) and an outline of diagnostic imaging methods in quantifying molecular changes and early diagnostic aspects of cartilage degeneration are described. The sodium MRI section begins with a brief overview of the theory of sodium NMR of biological tissues and is followed by a section on multiple quantum filters that can be used to quantify both bi-exponential relaxation and residual quadrupolar interaction. Specifically, (i) the rationale behind the use of sodium MRI in quantifying proteoglycan (PG) changes, (ii) validation studies using biochemical assays, (iii) studies on human OA specimens, (iv) results on animal models and (v) clinical imaging protocols are reviewed. Results demonstrating the feasibility of quantifying PG in OA patients and comparison with that in healthy subjects are also presented. The section concludes with the discussion of advantages and potential issues with sodium MRI and the impact of new technological advancements (e.g. ultra-high field scanners and parallel imaging methods). In the theory section on T1rho, a brief description of (i) principles of measuring T1rho relaxation, (ii) pulse sequences for computing T1rho relaxation maps, (iii) issues regarding radio frequency power deposition, (iv) mechanisms that contribute to T1rho in biological tissues and (v) effects of exchange and dipolar interaction on T1rho dispersion are discussed. Correlation of T1rho relaxation rate with macromolecular content and biomechanical properties in cartilage specimens subjected to trypsin and cytokine-induced glycosaminoglycan depletion and validation against biochemical assay and histopathology are presented. Experimental T1rho data from osteoarthritic specimens, animal models, healthy human subjects and as well from osteoarthritic patients are provided. The current status of T1rho relaxation mapping of cartilage and future directions is also discussed. Copyright 2006 John Wiley & Sons, Ltd.
The influence of ferucarbotran on the chondrogenesis of human mesenchymal stem cells
Henning, Tobias D; Sutton, Elizabeth J; Kim, Anne; Golovko, Daniel; Horvai, Andrew; Ackerman, Larry; Sennino, Barbara; McDonald, Donald; Lotz, Jeffrey; Daldrup-Link, Heike E
2010-01-01
For in vivo applications of magnetically labeled stem cells, biological effects of the labeling procedure have to be precluded. This study evaluates the effect of different Ferucarbotran cell labeling protocols on chondrogenic differentiation of human mesenchymal stem cells (hMSC) as well as their implications for MR imaging. hMSC were labeled with Ferucarbotran using various protocols: Cells were labeled with 100μg Fe/ml for 4h and 18h and additional samples were cultured for 6 or 12 days after the 18-hour labeling. Supplementary samples were labeled by transfection with protamine sulfate. Iron uptake was quantified by ICP-spectrometry and labeled cells were investigated by transmission electron microscopy and by immunostaining for ferucarbotran. The differentiation potential of labeled cells was compared to unlabeled controls by staining with alcian blue and hematoxylin & eosin, then quantified by measurements of glucosaminoglycans (GAG). Contrast agent effect at 3T was investigated on day 1 and day 14 of chondrogenic differentiation by measuring signal-to-noise ratios on T2-SE and T2*-GE-sequences. Iron uptake was significant for all labeling protocols (p< 0.05). The uptake was highest after transfection with protamine sulfate (25.65 ± 3.96 pg/cell) and lowest at an incubation time of 4h without transfection (3.21 ± 0.21 pg/cell). While chondrogenic differentiation was decreased using all labeling protocols, the decrease in GAG synthesis was not significant after labeling for 4h without transfection. After labeling by simple incubation, chondrogenesis was found to be dose-dependent. MR imaging showed markedly lower SNR values of all labeled cells compared to the unlabeled controls. This contrast agent effect persisted for 14 days and the duration of differentiation. Magnetic labeling of hMSC with ferucarbotran inhibits chondrogenesis in a dose-dependent manner when using simple incubation techniques. When decreasing the incubation time to 4h, inhibition of chondrogenesis was not significant. PMID:19670250
Incremental comprehension of spoken quantifier sentences: Evidence from brain potentials.
Freunberger, Dominik; Nieuwland, Mante S
2016-09-01
Do people incrementally incorporate the meaning of quantifier expressions to understand an unfolding sentence? Most previous studies concluded that quantifiers do not immediately influence how a sentence is understood based on the observation that online N400-effects differed from offline plausibility judgments. Those studies, however, used serial visual presentation (SVP), which involves unnatural reading. In the current ERP-experiment, we presented spoken positive and negative quantifier sentences ("Practically all/practically no postmen prefer delivering mail, when the weather is good/bad during the day"). Different from results obtained in a previously reported SVP-study (Nieuwland, 2016) sentence truth-value N400 effects occurred in positive and negative quantifier sentences alike, reflecting fully incremental quantifier comprehension. This suggests that the prosodic information available during spoken language comprehension supports the generation of online predictions for upcoming words and that, at least for quantifier sentences, comprehension of spoken language may proceed more incrementally than comprehension during SVP reading. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Wilkie, Joel R.; Matuszak, Martha M.; Feng, Mary; Moran, Jean M.; Fraass, Benedick A.
2013-01-01
Purpose: Plan degradation resulting from compromises made to enhance delivery efficiency is an important consideration for intensity modulated radiation therapy (IMRT) treatment plans. IMRT optimization and/or multileaf collimator (MLC) sequencing schemes can be modified to generate more efficient treatment delivery, but the effect those modifications have on plan quality is often difficult to quantify. In this work, the authors present a method for quantitative assessment of overall plan quality degradation due to tradeoffs between delivery efficiency and treatment plan quality, illustrated using comparisons between plans developed allowing different numbers of intensity levels in IMRT optimization and/or MLC sequencing for static segmental MLC IMRT plans. Methods: A plan quality degradation method to evaluate delivery efficiency and plan quality tradeoffs was developed and used to assess planning for 14 prostate and 12 head and neck patients treated with static IMRT. Plan quality was evaluated using a physician's predetermined “quality degradation” factors for relevant clinical plan metrics associated with the plan optimization strategy. Delivery efficiency and plan quality were assessed for a range of optimization and sequencing limitations. The “optimal” (baseline) plan for each case was derived using a clinical cost function with an unlimited number of intensity levels. These plans were sequenced with a clinical MLC leaf sequencer which uses >100 segments, assuring delivered intensities to be within 1% of the optimized intensity pattern. Each patient's optimal plan was also sequenced limiting the number of intensity levels (20, 10, and 5), and then separately optimized with these same numbers of intensity levels. Delivery time was measured for all plans, and direct evaluation of the tradeoffs between delivery time and plan degradation was performed. Results: When considering tradeoffs, the optimal number of intensity levels depends on the treatment site and on the stage in the process at which the levels are limited. The cost of improved delivery efficiency, in terms of plan quality degradation, increased as the number of intensity levels in the sequencer or optimizer decreased. The degradation was more substantial for the head and neck cases relative to the prostate cases, particularly when fewer than 20 intensity levels were used. Plan quality degradation was less severe when the number of intensity levels was limited in the optimizer rather than the sequencer. Conclusions: Analysis of plan quality degradation allows for a quantitative assessment of the compromises in clinical plan quality as delivery efficiency is improved, in order to determine the optimal delivery settings. The technique is based on physician-determined quality degradation factors and can be extended to other clinical situations where investigation of various tradeoffs is warranted. PMID:23822412
Keisam, Santosh; Romi, Wahengbam; Ahmed, Giasuddin; Jeyaram, Kumaraswamy
2016-09-27
Cultivation-independent investigation of microbial ecology is biased by the DNA extraction methods used. We aimed to quantify those biases by comparative analysis of the metagenome mined from four diverse naturally fermented foods (bamboo shoot, milk, fish, soybean) using eight different DNA extraction methods with different cell lysis principles. Our findings revealed that the enzymatic lysis yielded higher eubacterial and yeast metagenomic DNA from the food matrices compared to the widely used chemical and mechanical lysis principles. Further analysis of the bacterial community structure by Illumina MiSeq amplicon sequencing revealed a high recovery of lactic acid bacteria by the enzymatic lysis in all food types. However, Bacillaceae, Acetobacteraceae, Clostridiaceae and Proteobacteria were more abundantly recovered when mechanical and chemical lysis principles were applied. The biases generated due to the differential recovery of operational taxonomic units (OTUs) by different DNA extraction methods including DNA and PCR amplicons mix from different methods have been quantitatively demonstrated here. The different methods shared only 29.9-52.0% of the total OTUs recovered. Although similar comparative research has been performed on other ecological niches, this is the first in-depth investigation of quantifying the biases in metagenome mining from naturally fermented foods.