Sample records for sequence characteristics required

  1. Methods for determining the genetic affinity of microorganisms and viruses

    NASA Technical Reports Server (NTRS)

    Fox, George E. (Inventor); Willson, III, Richard C. (Inventor); Zhang, Zhengdong (Inventor)

    2012-01-01

    Selecting which sub-sequences in a database of nucleic acid such as 16S rRNA are highly characteristic of particular groupings of bacteria, microorganisms, fungi, etc. on a substantially phylogenetic tree. Also applicable to viruses comprising viral genomic RNA or DNA. A catalogue of highly characteristic sequences identified by this method is assembled to establish the genetic identity of an unknown organism. The characteristic sequences are used to design nucleic acid hybridization probes that include the characteristic sequence or its complement, or are derived from one or more characteristic sequences. A plurality of these characteristic sequences is used in hybridization to determine the phylogenetic tree position of the organism(s) in a sample. Those target organisms represented in the original sequence database and sufficient characteristic sequences can identify to the species or subspecies level. Oligonucleotide arrays of many probes are especially preferred. A hybridization signal can comprise fluorescence, chemiluminescence, or isotopic labeling, etc.; or sequences in a sample can be detected by direct means, e.g. mass spectrometry. The method's characteristic sequences can also be used to design specific PCR primers. The method uniquely identifies the phylogenetic affinity of an unknown organism without requiring prior knowledge of what is present in the sample. Even if the organism has not been previously encountered, the method still provides useful information about which phylogenetic tree bifurcation nodes encompass the organism.

  2. Global Performance Characterization of the Three Burn Trans-Earth Injection Maneuver Sequence over the Lunar Nodal Cycle

    NASA Technical Reports Server (NTRS)

    Williams, Jacob; Davis, Elizabeth C.; Lee, David E.; Condon, Gerald L.; Dawn, Tim

    2009-01-01

    The Orion spacecraft will be required to perform a three-burn trans-Earth injection (TEI) maneuver sequence to return to Earth from low lunar orbit. The origin of this approach lies in the Constellation Program requirements for access to any lunar landing site location combined with anytime lunar departure. This paper documents the development of optimized databases used to rapidly model the performance requirements of the TEI three-burn sequence for an extremely large number of mission cases. It also discusses performance results for lunar departures covering a complete 18.6 year lunar nodal cycle as well as general characteristics of the optimized three-burn TEI sequence.

  3. A machine learning model to determine the accuracy of variant calls in capture-based next generation sequencing.

    PubMed

    van den Akker, Jeroen; Mishne, Gilad; Zimmer, Anjali D; Zhou, Alicia Y

    2018-04-17

    Next generation sequencing (NGS) has become a common technology for clinical genetic tests. The quality of NGS calls varies widely and is influenced by features like reference sequence characteristics, read depth, and mapping accuracy. With recent advances in NGS technology and software tools, the majority of variants called using NGS alone are in fact accurate and reliable. However, a small subset of difficult-to-call variants that still do require orthogonal confirmation exist. For this reason, many clinical laboratories confirm NGS results using orthogonal technologies such as Sanger sequencing. Here, we report the development of a deterministic machine-learning-based model to differentiate between these two types of variant calls: those that do not require confirmation using an orthogonal technology (high confidence), and those that require additional quality testing (low confidence). This approach allows reliable NGS-based calling in a clinical setting by identifying the few important variant calls that require orthogonal confirmation. We developed and tested the model using a set of 7179 variants identified by a targeted NGS panel and re-tested by Sanger sequencing. The model incorporated several signals of sequence characteristics and call quality to determine if a variant was identified at high or low confidence. The model was tuned to eliminate false positives, defined as variants that were called by NGS but not confirmed by Sanger sequencing. The model achieved very high accuracy: 99.4% (95% confidence interval: +/- 0.03%). It categorized 92.2% (6622/7179) of the variants as high confidence, and 100% of these were confirmed to be present by Sanger sequencing. Among the variants that were categorized as low confidence, defined as NGS calls of low quality that are likely to be artifacts, 92.1% (513/557) were found to be not present by Sanger sequencing. This work shows that NGS data contains sufficient characteristics for a machine-learning-based model to differentiate low from high confidence variants. Additionally, it reveals the importance of incorporating site-specific features as well as variant call features in such a model.

  4. Statistical properties of filtered pseudorandom digital sequences formed from the sum of maximum-length sequences

    NASA Technical Reports Server (NTRS)

    Wallace, G. R.; Weathers, G. D.; Graf, E. R.

    1973-01-01

    The statistics of filtered pseudorandom digital sequences called hybrid-sum sequences, formed from the modulo-two sum of several maximum-length sequences, are analyzed. The results indicate that a relation exists between the statistics of the filtered sequence and the characteristic polynomials of the component maximum length sequences. An analysis procedure is developed for identifying a large group of sequences with good statistical properties for applications requiring the generation of analog pseudorandom noise. By use of the analysis approach, the filtering process is approximated by the convolution of the sequence with a sum of unit step functions. A parameter reflecting the overall statistical properties of filtered pseudorandom sequences is derived. This parameter is called the statistical quality factor. A computer algorithm to calculate the statistical quality factor for the filtered sequences is presented, and the results for two examples of sequence combinations are included. The analysis reveals that the statistics of the signals generated with the hybrid-sum generator are potentially superior to the statistics of signals generated with maximum-length generators. Furthermore, fewer calculations are required to evaluate the statistics of a large group of hybrid-sum generators than are required to evaluate the statistics of the same size group of approximately equivalent maximum-length sequences.

  5. Sequence independent amplification of DNA

    DOEpatents

    Bohlander, S.K.

    1998-03-24

    The present invention is a rapid sequence-independent amplification procedure (SIA). Even minute amounts of DNA from various sources can be amplified independent of any sequence requirements of the DNA or any a priori knowledge of any sequence characteristics of the DNA to be amplified. This method allows, for example, the sequence independent amplification of microdissected chromosomal material and the reliable construction of high quality fluorescent in situ hybridization (FISH) probes from YACs or from other sources. These probes can be used to localize YACs on metaphase chromosomes but also--with high efficiency--in interphase nuclei. 25 figs.

  6. Sequence independent amplification of DNA

    DOEpatents

    Bohlander, Stefan K.

    1998-01-01

    The present invention is a rapid sequence-independent amplification procedure (SIA). Even minute amounts of DNA from various sources can be amplified independent of any sequence requirements of the DNA or any a priori knowledge of any sequence characteristics of the DNA to be amplified. This method allows, for example the sequence independent amplification of microdissected chromosomal material and the reliable construction of high quality fluorescent in situ hybridization (FISH) probes from YACs or from other sources. These probes can be used to localize YACs on metaphase chromosomes but also--with high efficiency--in interphase nuclei.

  7. Spectrum requirements for dedicated short range communications (DSRC) : public safety and commercial applications

    DOT National Transportation Integrated Search

    1996-07-01

    This is the third in a sequence of papers that present the factors involved in identifying the : radio frequency spectrum required for both current and future DSRC operations. Since the : proposed applications, signal characteristics and channel allo...

  8. Deriving video content type from HEVC bitstream semantics

    NASA Astrophysics Data System (ADS)

    Nightingale, James; Wang, Qi; Grecos, Christos; Goma, Sergio R.

    2014-05-01

    As network service providers seek to improve customer satisfaction and retention levels, they are increasingly moving from traditional quality of service (QoS) driven delivery models to customer-centred quality of experience (QoE) delivery models. QoS models only consider metrics derived from the network however, QoE models also consider metrics derived from within the video sequence itself. Various spatial and temporal characteristics of a video sequence have been proposed, both individually and in combination, to derive methods of classifying video content either on a continuous scale or as a set of discrete classes. QoE models can be divided into three broad categories, full reference, reduced reference and no-reference models. Due to the need to have the original video available at the client for comparison, full reference metrics are of limited practical value in adaptive real-time video applications. Reduced reference metrics often require metadata to be transmitted with the bitstream, while no-reference metrics typically operate in the decompressed domain at the client side and require significant processing to extract spatial and temporal features. This paper proposes a heuristic, no-reference approach to video content classification which is specific to HEVC encoded bitstreams. The HEVC encoder already makes use of spatial characteristics to determine partitioning of coding units and temporal characteristics to determine the splitting of prediction units. We derive a function which approximates the spatio-temporal characteristics of the video sequence by using the weighted averages of the depth at which the coding unit quadtree is split and the prediction mode decision made by the encoder to estimate spatial and temporal characteristics respectively. Since the video content type of a sequence is determined by using high level information parsed from the video stream, spatio-temporal characteristics are identified without the need for full decoding and can be used in a timely manner to aid decision making in QoE oriented adaptive real time streaming.

  9. Centromere-Like Regions in the Budding Yeast Genome

    PubMed Central

    Lefrançois, Philippe; Auerbach, Raymond K.; Yellman, Christopher M.; Roeder, G. Shirleen; Snyder, Michael

    2013-01-01

    Accurate chromosome segregation requires centromeres (CENs), the DNA sequences where kinetochores form, to attach chromosomes to microtubules. In contrast to most eukaryotes, which have broad centromeres, Saccharomyces cerevisiae possesses sequence-defined point CENs. Chromatin immunoprecipitation followed by sequencing (ChIP–Seq) reveals colocalization of four kinetochore proteins at novel, discrete, non-centromeric regions, especially when levels of the centromeric histone H3 variant, Cse4 (a.k.a. CENP-A or CenH3), are elevated. These regions of overlapping protein binding enhance the segregation of plasmids and chromosomes and have thus been termed Centromere-Like Regions (CLRs). CLRs form in close proximity to S. cerevisiae CENs and share characteristics typical of both point and regional CENs. CLR sequences are conserved among related budding yeasts. Many genomic features characteristic of CLRs are also associated with these conserved homologous sequences from closely related budding yeasts. These studies provide general and important insights into the origin and evolution of centromeres. PMID:23349633

  10. Fundamental Bounds for Sequence Reconstruction from Nanopore Sequencers.

    PubMed

    Magner, Abram; Duda, Jarosław; Szpankowski, Wojciech; Grama, Ananth

    2016-06-01

    Nanopore sequencers are emerging as promising new platforms for high-throughput sequencing. As with other technologies, sequencer errors pose a major challenge for their effective use. In this paper, we present a novel information theoretic analysis of the impact of insertion-deletion (indel) errors in nanopore sequencers. In particular, we consider the following problems: (i) for given indel error characteristics and rate, what is the probability of accurate reconstruction as a function of sequence length; (ii) using replicated extrusion (the process of passing a DNA strand through the nanopore), what is the number of replicas needed to accurately reconstruct the true sequence with high probability? Our results provide a number of important insights: (i) the probability of accurate reconstruction of a sequence from a single sample in the presence of indel errors tends quickly (i.e., exponentially) to zero as the length of the sequence increases; and (ii) replicated extrusion is an effective technique for accurate reconstruction. We show that for typical distributions of indel errors, the required number of replicas is a slow function (polylogarithmic) of sequence length - implying that through replicated extrusion, we can sequence large reads using nanopore sequencers. Moreover, we show that in certain cases, the required number of replicas can be related to information-theoretic parameters of the indel error distributions.

  11. Outcomes of Cleft Palate Repair in Patients with Pierre Robin Sequence: A Matched Case-Control Study.

    PubMed

    Hardwicke, Joseph T; Richards, Helen; Cafferky, Louise; Underwood, Imogen; ter Horst, Britt; Slator, Rona

    2016-03-01

    Pierre Robin sequence results from a cascade of events that occur during embryologic development and frequently presents with cleft palate. Some studies have shown speech outcomes to be worse in patients with Pierre Robin sequence after cleft palate repair. A cohort of Pierre Robin sequence patients who all required an airway intervention and nasogastric feeding in the neonatal period were identified and speech outcomes assessed at 5 years of age. A cleft- and sex-matched non-Pierre Robin sequence, cleft palate-only comparison group was also identified from the same institution and study period. A total of 24 patients with Pierre Robin sequence that required airway and nutritional support in the neonatal period were matched for age, sex, and cleft type to a group of 24 non-Pierre Robin sequence cleft patients. There was no significant difference in the incidence of oronasal fistula between the groups. Secondary surgery for velopharyngeal incompetence was significantly more (p = 0.017) in the Pierre Robin sequence group, who also had significantly greater nasality (p = 0.031) and cleft speech characteristic (p = 0.023) scores. The authors hypothesize that other factors may exist in Pierre Robin sequence that may lead to poor speech outcomes. The authors would suggest counseling parents of children with Pierre Robin sequence that have required a neonatal airway intervention, that speech development may be poorer than in other children with cleft palate, and that these children will have a significantly higher incidence of secondary speech surgery. Risk, II.

  12. Hidden Markov models of biological primary sequence information.

    PubMed Central

    Baldi, P; Chauvin, Y; Hunkapiller, T; McClure, M A

    1994-01-01

    Hidden Markov model (HMM) techniques are used to model families of biological sequences. A smooth and convergent algorithm is introduced to iteratively adapt the transition and emission parameters of the models from the examples in a given family. The HMM approach is applied to three protein families: globins, immunoglobulins, and kinases. In all cases, the models derived capture the important statistical characteristics of the family and can be used for a number of tasks, including multiple alignments, motif detection, and classification. For K sequences of average length N, this approach yields an effective multiple-alignment algorithm which requires O(KN2) operations, linear in the number of sequences. PMID:8302831

  13. Comparison of the genomic sequence of the microminipig, a novel breed of swine, with the genomic database for conventional pig.

    PubMed

    Miura, Naoki; Kucho, Ken-Ichi; Noguchi, Michiko; Miyoshi, Noriaki; Uchiumi, Toshiki; Kawaguchi, Hiroaki; Tanimoto, Akihide

    2014-01-01

    The microminipig, which weighs less than 10 kg at an early stage of maturity, has been reported as a potential experimental model animal. Its extremely small size and other distinct characteristics suggest the possibility of a number of differences between the genome of the microminipig and that of conventional pigs. In this study, we analyzed the genomes of two healthy microminipigs using a next-generation sequencer SOLiD™ system. We then compared the obtained genomic sequences with a genomic database for the domestic pig (Sus scrofa). The mapping coverage of sequenced tag from the microminipig to conventional pig genomic sequences was greater than 96% and we detected no clear, substantial genomic variance from these data. The results may indicate that the distinct characteristics of the microminipig derive from small-scale alterations in the genome, such as Single Nucleotide Polymorphisms or translational modifications, rather than large-scale deletion or insertion polymorphisms. Further investigation of the entire genomic sequence of the microminipig with methods enabling deeper coverage is required to elucidate the genetic basis of its distinct phenotypic traits. Copyright © 2014 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.

  14. The transfer of movement sequences: effects of decreased and increased load.

    PubMed

    Muehlbauer, Thomas; Panzer, Stefan; Shea, Charles H

    2007-06-01

    A number of recent experiments have demonstrated that a movement structure develops during the course of learning a movement sequence that provides the basis for transfer. After learning a movement sequence participants have been shown to be able to effectively produce the sequence when movement demands require that the sequence be rescaled in amplitude or produced with an unpractised set of effectors. The purpose of the present experiment was to determine whether participants, after learning a complex 16-element movement sequence with a 0.567-kg load, could also effectively produce the sequence when the load was decreased (0.0 kg) or increased (1.134 kg). The results indicated that participants were able to effectively compensate for decreased and increased load with virtually no changes in performance characteristics (displacement, velocity, acceleration, and pattern of element durations) while electromyographic (EMG) signals demonstrated that smaller (reduced load) or larger forces (increased load) were spontaneously generated to compensate for the change in load. The muscle activation patterns of the biceps and triceps as well as the level of coactivation appeared to be generally upscaled to generate and dissipate the changes in force requirement needed to compensate for the increased load.

  15. Terminal Duplex Stability and Nucleotide Identity Differentially Control siRNA Loading and Activity in RNA Interference

    PubMed Central

    Angart, Phillip A.; Carlson, Rebecca J.; Adu-Berchie, Kwasi

    2016-01-01

    Efficient short interfering RNA (siRNA)-mediated gene silencing requires selection of a sequence that is complementary to the intended target and possesses sequence and structural features that encourage favorable functional interactions with the RNA interference (RNAi) pathway proteins. In this study, we investigated how terminal sequence and structural characteristics of siRNAs contribute to siRNA strand loading and silencing activity and how these characteristics ultimately result in a functionally asymmetric duplex in cultured HeLa cells. Our results reiterate that the most important characteristic in determining siRNA activity is the 5′ terminal nucleotide identity. Our findings further suggest that siRNA loading is controlled principally by the hybridization stability of the 5′ terminus (Nucleotides: 1–2) of each siRNA strand, independent of the opposing terminus. Postloading, RNA-induced silencing complex (RISC)–specific activity was found to be improved by lower hybridization stability in the 5′ terminus (Nucleotides: 3–4) of the loaded siRNA strand and greater hybridization stability toward the 3′ terminus (Nucleotides: 17–18). Concomitantly, specific recognition of the 5′ terminal nucleotide sequence by human Argonaute 2 (Ago2) improves RISC half-life. These findings indicate that careful selection of siRNA sequences can maximize both the loading and the specific activity of the intended guide strand. PMID:27399870

  16. Assessing Diversity of DNA Structure-Related Sequence Features in Prokaryotic Genomes

    PubMed Central

    Huang, Yongjie; Mrázek, Jan

    2014-01-01

    Prokaryotic genomes are diverse in terms of their nucleotide and oligonucleotide composition as well as presence of various sequence features that can affect physical properties of the DNA molecule. We present a survey of local sequence patterns which have a potential to promote non-canonical DNA conformations (i.e. different from standard B-DNA double helix) and interpret the results in terms of relationships with organisms' habitats, phylogenetic classifications, and other characteristics. Our present work differs from earlier similar surveys not only by investigating a wider range of sequence patterns in a large number of genomes but also by using a more realistic null model to assess significant deviations. Our results show that simple sequence repeats and Z-DNA-promoting patterns are generally suppressed in prokaryotic genomes, whereas palindromes and inverted repeats are over-represented. Representation of patterns that promote Z-DNA and intrinsic DNA curvature increases with increasing optimal growth temperature (OGT), and decreases with increasing oxygen requirement. Additionally, representations of close direct repeats, palindromes and inverted repeats exhibit clear negative trends with increasing OGT. The observed relationships with environmental characteristics, particularly OGT, suggest possible evolutionary scenarios of structural adaptation of DNA to particular environmental niches. PMID:24408877

  17. Development and Evaluation of a Performance Modeling Flight Test Approach Based on Quasi Steady-State Maneuvers

    NASA Technical Reports Server (NTRS)

    Yechout, T. R.; Braman, K. B.

    1984-01-01

    The development, implementation and flight test evaluation of a performance modeling technique which required a limited amount of quasisteady state flight test data to predict the overall one g performance characteristics of an aircraft. The concept definition phase of the program include development of: (1) the relationship for defining aerodynamic characteristics from quasi steady state maneuvers; (2) a simplified in flight thrust and airflow prediction technique; (3) a flight test maneuvering sequence which efficiently provided definition of baseline aerodynamic and engine characteristics including power effects on lift and drag; and (4) the algorithms necessary for cruise and flight trajectory predictions. Implementation of the concept include design of the overall flight test data flow, definition of instrumentation system and ground test requirements, development and verification of all applicable software and consolidation of the overall requirements in a flight test plan.

  18. Alabama Course of Study: Humanities, K-12. Bulletin 1983, No. 16.

    ERIC Educational Resources Information Center

    Alabama State Dept. of Education, Montgomery.

    A scope and sequence for incorporating humanities into the existing K-12 curriculum contains 8 sections. Following an introduction, the first section outlines characteristics of an effective humanities program. The second and third sections contain teacher and student objectives for a humanities program, minimum requirements, and alternatives for…

  19. 49 CFR 236.1011 - PTC Implementation Plan content requirements.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... the following risk factors by track segment: (i) Segment traffic characteristics such as typical...; (4) How, to the extent practical, the PTC system will be implemented to address areas of greater risk to the public and railroad employees before areas of lesser risk; (5) The sequence and schedule in...

  20. Production of Supra-regular Spatial Sequences by Macaque Monkeys.

    PubMed

    Jiang, Xinjian; Long, Tenghai; Cao, Weicong; Li, Junru; Dehaene, Stanislas; Wang, Liping

    2018-06-18

    Understanding and producing embedded sequences in language, music, or mathematics, is a central characteristic of our species. These domains are hypothesized to involve a human-specific competence for supra-regular grammars, which can generate embedded sequences that go beyond the regular sequences engendered by finite-state automata. However, is this capacity truly unique to humans? Using a production task, we show that macaque monkeys can be trained to produce time-symmetrical embedded spatial sequences whose formal description requires supra-regular grammars or, equivalently, a push-down stack automaton. Monkeys spontaneously generalized the learned grammar to novel sequences, including longer ones, and could generate hierarchical sequences formed by an embedding of two levels of abstract rules. Compared to monkeys, however, preschool children learned the grammars much faster using a chunking strategy. While supra-regular grammars are accessible to nonhuman primates through extensive training, human uniqueness may lie in the speed and learning strategy with which they are acquired. Copyright © 2018 Elsevier Ltd. All rights reserved.

  1. Cool circumstellar matter around nearby main-sequence stars

    NASA Technical Reports Server (NTRS)

    Walker, H. J.; Wolstencroft, R. D.

    1988-01-01

    Stars are presented which have characteristics similar to Vega and other main-sequence stars with cool dust disks, based on the IRAS Point Source Catalog fluxes. The objects are selected to have a 60-micron/100-micron ratio similar to Vega, Beta Pic, Alpha PsA, and Epsilon Eri, and they are also required to show evidence of extension in the IRAS Working Survey Database. The fluxes are modeled using a blackbody energy distribution. The temperatures derived range from 50 to 650 K. The diameters of the dust disks observed by IRAS are estimated.

  2. A Selective-Echo Method for Chemical-Shift Imaging of Two-Component Systems

    NASA Astrophysics Data System (ADS)

    Gerald, Rex E., II; Krasavin, Anatoly O.; Botto, Robert E.

    A simple and effective method for selectively imaging either one of two chemical species in a two-component system is presented and demonstrated experimentally. The pulse sequence employed, selective- echo chemical- shift imaging (SECSI), is a hybrid (frequency-selective/ T1-contrast) technique that is executed in a short period of time, utilizes the full Boltzmann magnetization of each chemical species to form the corresponding image, and requires only hard pulses of quadrature phase. This approach provides a direct and unambiguous representation of the spatial distribution of the two chemical species. In addition, the performance characteristics and the advantages of the SECSI sequence are compared on a common basis to those of other pulse sequences.

  3. Local alignment of two-base encoded DNA sequence

    PubMed Central

    Homer, Nils; Merriman, Barry; Nelson, Stanley F

    2009-01-01

    Background DNA sequence comparison is based on optimal local alignment of two sequences using a similarity score. However, some new DNA sequencing technologies do not directly measure the base sequence, but rather an encoded form, such as the two-base encoding considered here. In order to compare such data to a reference sequence, the data must be decoded into sequence. The decoding is deterministic, but the possibility of measurement errors requires searching among all possible error modes and resulting alignments to achieve an optimal balance of fewer errors versus greater sequence similarity. Results We present an extension of the standard dynamic programming method for local alignment, which simultaneously decodes the data and performs the alignment, maximizing a similarity score based on a weighted combination of errors and edits, and allowing an affine gap penalty. We also present simulations that demonstrate the performance characteristics of our two base encoded alignment method and contrast those with standard DNA sequence alignment under the same conditions. Conclusion The new local alignment algorithm for two-base encoded data has substantial power to properly detect and correct measurement errors while identifying underlying sequence variants, and facilitating genome re-sequencing efforts based on this form of sequence data. PMID:19508732

  4. Nucleotide sequence of the Saccharomyces cerevisiae PUT4 proline-permease-encoding gene: similarities between CAN1, HIP1 and PUT4 permeases.

    PubMed

    Vandenbol, M; Jauniaux, J C; Grenson, M

    1989-11-15

    The complete nucleotide (nt) sequence of the PUT4 gene, whose product is required for high-affinity proline active transport in the yeast Saccharomyces cerevisiae, is presented. The sequence contains a single long open reading frame of 1881 nt, encoding a polypeptide with a calculated Mr of 68,795. The predicted protein is strongly hydrophobic and exhibits six potential glycosylation sites. Its hydropathy profile suggests the presence of twelve membrane-spanning regions flanked by hydrophilic N- and C-terminal domains. The N terminus does not resemble signal sequences found in secreted proteins. These features are characteristic of integral membrane proteins catalyzing translocation of ligands across cellular membranes. Protein sequence comparisons indicate strong resemblance to the arginine and histidine permeases of S. cerevisiae, but no marked sequence similarity to the proline permease of Escherichia coli or to other known prokaryotic or eukaryotic transport proteins. The strong similarity between the three yeast amino acid permeases suggests a common ancestor for the three proteins.

  5. Retroviral DNA Integration Directed by HIV Integration Protein in Vitro

    NASA Astrophysics Data System (ADS)

    Bushman, Frederic D.; Fujiwara, Tamio; Craigie, Robert

    1990-09-01

    Efficient retroviral growth requires integration of a DNA copy of the viral RNA genome into a chromosome of the host. As a first step in analyzing the mechanism of integration of human immunodeficiency virus (HIV) DNA, a cell-free system was established that models the integration reaction. The in vitro system depends on the HIV integration (IN) protein, which was partially purified from insect cells engineered to express IN protein in large quantities. Integration was detected in a biological assay that scores the insertion of a linear DNA containing HIV terminal sequences into a λ DNA target. Some integration products generated in this assay contained five-base pair duplications of the target DNA at the recombination junctions, a characteristic of HIV integration in vivo; the remaining products contained aberrant junctional sequences that may have been produced in a variation of the normal reaction. These results indicate that HIV IN protein is the only viral protein required to insert model HIV DNA sequences into a target DNA in vitro.

  6. Spatio-Temporal Structure, Path Characteristics, and Perceptual Grouping in Immediate Serial Spatial Recall

    PubMed Central

    De Lillo, Carlo; Kirby, Melissa; Poole, Daniel

    2016-01-01

    Immediate serial spatial recall measures the ability to retain sequences of locations in short-term memory and is considered the spatial equivalent of digit span. It is tested by requiring participants to reproduce sequences of movements performed by an experimenter or displayed on a monitor. Different organizational factors dramatically affect serial spatial recall but they are often confounded or underspecified. Untangling them is crucial for the characterization of working-memory models and for establishing the contribution of structure and memory capacity to spatial span. We report five experiments assessing the relative role and independence of factors that have been reported in the literature. Experiment 1 disentangled the effects of spatial clustering and path-length by manipulating the distance of items displayed on a touchscreen monitor. Long-path sequences segregated by spatial clusters were compared with short-path sequences not segregated by clusters. Recall was more accurate for sequences segregated by clusters independently from path-length. Experiment 2 featured conditions where temporal pauses were introduced between or within cluster boundaries during the presentation of sequences with the same paths. Thus, the temporal structure of the sequences was either consistent or inconsistent with a hierarchical representation based on segmentation by spatial clusters but the effect of structure could not be confounded with effects of path-characteristics. Pauses at cluster boundaries yielded more accurate recall, as predicted by a hierarchical model. In Experiment 3, the systematic manipulation of sequence structure, path-length, and presence of path-crossings of sequences showed that structure explained most of the variance, followed by the presence/absence of path-crossings, and path-length. Experiments 4 and 5 replicated the results of the previous experiments in immersive virtual reality navigation tasks where the viewpoint of the observer changed dynamically during encoding and recall. This suggested that the effects of structure in spatial span are not dependent on perceptual grouping processes induced by the aerial view of the stimulus array typically afforded by spatial recall tasks. These results demonstrate the independence of coding strategies based on structure from effects of path characteristics and perceptual grouping in immediate serial spatial recall. PMID:27891101

  7. DNASynth: a software application to optimization of artificial gene synthesis

    NASA Astrophysics Data System (ADS)

    Muczyński, Jan; Nowak, Robert M.

    2017-08-01

    DNASynth is a client-server software application in which the client runs in a web browser. The aim of this program is to support and optimize process of artificial gene synthesizing using Ligase Chain Reaction. Thanks to LCR it is possible to obtain DNA strand coding defined by user peptide. The DNA sequence is calculated by optimization algorithm that consider optimal codon usage, minimal energy of secondary structures and minimal number of required LCR. Additionally absence of sequences characteristic for defined by user set of restriction enzymes is guaranteed. The presented software was tested on synthetic and real data.

  8. Designing deep sequencing experiments: detecting structural variation and estimating transcript abundance.

    PubMed

    Bashir, Ali; Bansal, Vikas; Bafna, Vineet

    2010-06-18

    Massively parallel DNA sequencing technologies have enabled the sequencing of several individual human genomes. These technologies are also being used in novel ways for mRNA expression profiling, genome-wide discovery of transcription-factor binding sites, small RNA discovery, etc. The multitude of sequencing platforms, each with their unique characteristics, pose a number of design challenges, regarding the technology to be used and the depth of sequencing required for a particular sequencing application. Here we describe a number of analytical and empirical results to address design questions for two applications: detection of structural variations from paired-end sequencing and estimating mRNA transcript abundance. For structural variation, our results provide explicit trade-offs between the detection and resolution of rearrangement breakpoints, and the optimal mix of paired-read insert lengths. Specifically, we prove that optimal detection and resolution of breakpoints is achieved using a mix of exactly two insert library lengths. Furthermore, we derive explicit formulae to determine these insert length combinations, enabling a 15% improvement in breakpoint detection at the same experimental cost. On empirical short read data, these predictions show good concordance with Illumina 200 bp and 2 Kbp insert length libraries. For transcriptome sequencing, we determine the sequencing depth needed to detect rare transcripts from a small pilot study. With only 1 Million reads, we derive corrections that enable almost perfect prediction of the underlying expression probability distribution, and use this to predict the sequencing depth required to detect low expressed genes with greater than 95% probability. Together, our results form a generic framework for many design considerations related to high-throughput sequencing. We provide software tools http://bix.ucsd.edu/projects/NGS-DesignTools to derive platform independent guidelines for designing sequencing experiments (amount of sequencing, choice of insert length, mix of libraries) for novel applications of next generation sequencing.

  9. Classification of viral zoonosis through receptor pattern analysis.

    PubMed

    Bae, Se-Eun; Son, Hyeon Seok

    2011-04-13

    Viral zoonosis, the transmission of a virus from its primary vertebrate reservoir species to humans, requires ubiquitous cellular proteins known as receptor proteins. Zoonosis can occur not only through direct transmission from vertebrates to humans, but also through intermediate reservoirs or other environmental factors. Viruses can be categorized according to genotype (ssDNA, dsDNA, ssRNA and dsRNA viruses). Among them, the RNA viruses exhibit particularly high mutation rates and are especially problematic for this reason. Most zoonotic viruses are RNA viruses that change their envelope proteins to facilitate binding to various receptors of host species. In this study, we sought to predict zoonotic propensity through the analysis of receptor characteristics. We hypothesized that the major barrier to interspecies virus transmission is that receptor sequences vary among species--in other words, that the specific amino acid sequence of the receptor determines the ability of the viral envelope protein to attach to the cell. We analysed host-cell receptor sequences for their hydrophobicity/hydrophilicity characteristics. We then analysed these properties for similarities among receptors of different species and used a statistical discriminant analysis to predict the likelihood of transmission among species. This study is an attempt to predict zoonosis through simple computational analysis of receptor sequence differences. Our method may be useful in predicting the zoonotic potential of newly discovered viral strains.

  10. Self-Organizing Hidden Markov Model Map (SOHMMM).

    PubMed

    Ferles, Christos; Stafylopatis, Andreas

    2013-12-01

    A hybrid approach combining the Self-Organizing Map (SOM) and the Hidden Markov Model (HMM) is presented. The Self-Organizing Hidden Markov Model Map (SOHMMM) establishes a cross-section between the theoretic foundations and algorithmic realizations of its constituents. The respective architectures and learning methodologies are fused in an attempt to meet the increasing requirements imposed by the properties of deoxyribonucleic acid (DNA), ribonucleic acid (RNA), and protein chain molecules. The fusion and synergy of the SOM unsupervised training and the HMM dynamic programming algorithms bring forth a novel on-line gradient descent unsupervised learning algorithm, which is fully integrated into the SOHMMM. Since the SOHMMM carries out probabilistic sequence analysis with little or no prior knowledge, it can have a variety of applications in clustering, dimensionality reduction and visualization of large-scale sequence spaces, and also, in sequence discrimination, search and classification. Two series of experiments based on artificial sequence data and splice junction gene sequences demonstrate the SOHMMM's characteristics and capabilities. Copyright © 2013 Elsevier Ltd. All rights reserved.

  11. An evolution based biosensor receptor DNA sequence generation algorithm.

    PubMed

    Kim, Eungyeong; Lee, Malrey; Gatton, Thomas M; Lee, Jaewan; Zang, Yupeng

    2010-01-01

    A biosensor is composed of a bioreceptor, an associated recognition molecule, and a signal transducer that can selectively detect target substances for analysis. DNA based biosensors utilize receptor molecules that allow hybridization with the target analyte. However, most DNA biosensor research uses oligonucleotides as the target analytes and does not address the potential problems of real samples. The identification of recognition molecules suitable for real target analyte samples is an important step towards further development of DNA biosensors. This study examines the characteristics of DNA used as bioreceptors and proposes a hybrid evolution-based DNA sequence generating algorithm, based on DNA computing, to identify suitable DNA bioreceptor recognition molecules for stable hybridization with real target substances. The Traveling Salesman Problem (TSP) approach is applied in the proposed algorithm to evaluate the safety and fitness of the generated DNA sequences. This approach improves efficiency and stability for enhanced and variable-length DNA sequence generation and allows extension to generation of variable-length DNA sequences with diverse receptor recognition requirements.

  12. Use of eluted peptide sequence data to identify the binding characteristics of peptides to the insulin-dependent diabetes susceptibility allele HLA-DQ8 (DQ 3.2).

    PubMed

    Godkin, A; Friede, T; Davenport, M; Stevanovic, S; Willis, A; Jewell, D; Hill, A; Rammensee, H G

    1997-06-01

    HLA-DQ8 (A1*0301, B1*0302) and -DQ2 (A1*0501, B1*0201) are both associated with diseases such as insulin-dependent diabetes mellitus and coeliac disease. We used the technique of pool sequencing to look at the requirements of peptides binding to HLA-DQ8, and combined these data with naturally sequenced ligands and in vitro binding assays to describe a novel motif for HLA-DQ8. The motif, which has the same basic format as many HLA-DR molecules, consists of four or five anchor regions, in the positions from the N-terminus of the binding core of n, n + 3, n + 5/6 and n + 8, i.e. P1, P4, P6/7 and P9. P1 and P9 require negative or polar residues, with mainly aliphatic residues at P4 and P6/7. The features of the HLA-DQ8 motif were then compared to a pool sequence of peptides eluted from HLA-DQ2. A consensus motif for the binding of a common peptide which may be involved in disease pathogenesis is described. Neither of the disease-associated alleles HLA-DQ2 and -DQ8 have Asp at position 57 of the beta-chain. This Asp, if present, may form a salt bridge with an Arg at position 79 of the alpha-chain and so alter the binding specificity of P9. HLA-DQ2 and -DQ8 both appear to prefer negatively charged amino acids at P9. In contrast, HLA-DQ7 (A1*0301, B1*0301), which is not associated with diabetes, has Asp at beta 57, allowing positively charged amino acids at P9. This analysis of the sequence features of DQ-binding peptides suggests molecular characteristics which may be useful to predict epitopes involved in disease pathogenesis.

  13. More Genetic Engineering With Cloned Hemoglobin Genes

    NASA Technical Reports Server (NTRS)

    Bailey, James E.

    1992-01-01

    Cells modified to enhance growth and production of proteins. Method for enhancing both growth of micro-organisms in vitro and production of various proteins or metalbolites in these micro-organisms provides for incorporation of selected chromosomal or extrachormosomal deoxyribonucleic acid (DNA) sequences into micro-organisms from other cells or from artificial sources. Incorporated DNA includes parts encoding desired product(s) or characteristic(s) of cells and parts that control expression of productor characteristic-encoding parts in response to variations in environment. Extended method enables increased research into growth of organisms in oxygen-poor environments. Industrial applications found in enhancement of processing steps requiring oxygen in fermentation, enzymatic degradation, treatment of wastes containing toxic chemicals, brewing, and some oxidative chemical reactions.

  14. SEQ-REVIEW: A tool for reviewing and checking spacecraft sequences

    NASA Astrophysics Data System (ADS)

    Maldague, Pierre F.; El-Boushi, Mekki; Starbird, Thomas J.; Zawacki, Steven J.

    1994-11-01

    A key component of JPL's strategy to make space missions faster, better and cheaper is the Advanced Multi-Mission Operations System (AMMOS), a ground software intensive system currently in use and in further development. AMMOS intends to eliminate the cost of re-engineering a ground system for each new JPL mission. This paper discusses SEQ-REVIEW, a component of AMMOS that was designed to facilitate and automate the task of reviewing and checking spacecraft sequences. SEQ-REVIEW is a smart browser for inspecting files created by other sequence generation tools in the AMMOS system. It can parse sequence-related files according to a computer-readable version of a 'Software Interface Specification' (SIS), which is a standard document for defining file formats. It lets users display one or several linked files and check simple constraints using a Basic-like 'Little Language'. SEQ-REVIEW represents the first application of the Quality Function Development (QFD) method to sequence software development at JPL. The paper will show how the requirements for SEQ-REVIEW were defined and converted into a design based on object-oriented principles. The process starts with interviews of potential users, a small but diverse group that spans multiple disciplines and 'cultures'. It continues with the development of QFD matrices that related product functions and characteristics to user-demanded qualities. These matrices are then turned into a formal Software Requirements Document (SRD). The process concludes with the design phase, in which the CRC (Class, Responsibility, Collaboration) approach was used to convert requirements into a blueprint for the final product.

  15. SEQ-REVIEW: A tool for reviewing and checking spacecraft sequences

    NASA Technical Reports Server (NTRS)

    Maldague, Pierre F.; El-Boushi, Mekki; Starbird, Thomas J.; Zawacki, Steven J.

    1994-01-01

    A key component of JPL's strategy to make space missions faster, better and cheaper is the Advanced Multi-Mission Operations System (AMMOS), a ground software intensive system currently in use and in further development. AMMOS intends to eliminate the cost of re-engineering a ground system for each new JPL mission. This paper discusses SEQ-REVIEW, a component of AMMOS that was designed to facilitate and automate the task of reviewing and checking spacecraft sequences. SEQ-REVIEW is a smart browser for inspecting files created by other sequence generation tools in the AMMOS system. It can parse sequence-related files according to a computer-readable version of a 'Software Interface Specification' (SIS), which is a standard document for defining file formats. It lets users display one or several linked files and check simple constraints using a Basic-like 'Little Language'. SEQ-REVIEW represents the first application of the Quality Function Development (QFD) method to sequence software development at JPL. The paper will show how the requirements for SEQ-REVIEW were defined and converted into a design based on object-oriented principles. The process starts with interviews of potential users, a small but diverse group that spans multiple disciplines and 'cultures'. It continues with the development of QFD matrices that related product functions and characteristics to user-demanded qualities. These matrices are then turned into a formal Software Requirements Document (SRD). The process concludes with the design phase, in which the CRC (Class, Responsibility, Collaboration) approach was used to convert requirements into a blueprint for the final product.

  16. Full-Length Venom Protein cDNA Sequences from Venom-Derived mRNA: Exploring Compositional Variation and Adaptive Multigene Evolution

    PubMed Central

    Modahl, Cassandra M.; Mackessy, Stephen P.

    2016-01-01

    Envenomation of humans by snakes is a complex and continuously evolving medical emergency, and treatment is made that much more difficult by the diverse biochemical composition of many venoms. Venomous snakes and their venoms also provide models for the study of molecular evolutionary processes leading to adaptation and genotype-phenotype relationships. To compare venom complexity and protein sequences, venom gland transcriptomes are assembled, which usually requires the sacrifice of snakes for tissue. However, toxin transcripts are also present in venoms, offering the possibility of obtaining cDNA sequences directly from venom. This study provides evidence that unknown full-length venom protein transcripts can be obtained from the venoms of multiple species from all major venomous snake families. These unknown venom protein cDNAs are obtained by the use of primers designed from conserved signal peptide sequences within each venom protein superfamily. This technique was used to assemble a partial venom gland transcriptome for the Middle American Rattlesnake (Crotalus simus tzabcan) by amplifying sequences for phospholipases A2, serine proteases, C-lectins, and metalloproteinases from within venom. Phospholipase A2 sequences were also recovered from the venoms of several rattlesnakes and an elapid snake (Pseudechis porphyriacus), and three-finger toxin sequences were recovered from multiple rear-fanged snake species, demonstrating that the three major clades of advanced snakes (Elapidae, Viperidae, Colubridae) have stable mRNA present in their venoms. These cDNA sequences from venom were then used to explore potential activities derived from protein sequence similarities and evolutionary histories within these large multigene superfamilies. Venom-derived sequences can also be used to aid in characterizing venoms that lack proteomic profiles and identify sequence characteristics indicating specific envenomation profiles. This approach, requiring only venom, provides access to cDNA sequences in the absence of living specimens, even from commercial venom sources, to evaluate important regional differences in venom composition and to study snake venom protein evolution. PMID:27280639

  17. Implementation of Objective PASC-Derived Taxon Demarcation Criteria for Official Classification of Filoviruses.

    PubMed

    Bào, Yīmíng; Amarasinghe, Gaya K; Basler, Christopher F; Bavari, Sina; Bukreyev, Alexander; Chandran, Kartik; Dolnik, Olga; Dye, John M; Ebihara, Hideki; Formenty, Pierre; Hewson, Roger; Kobinger, Gary P; Leroy, Eric M; Mühlberger, Elke; Netesov, Sergey V; Patterson, Jean L; Paweska, Janusz T; Smither, Sophie J; Takada, Ayato; Towner, Jonathan S; Volchkov, Viktor E; Wahl-Jensen, Victoria; Kuhn, Jens H

    2017-05-11

    The mononegaviral family Filoviridae has eight members assigned to three genera and seven species. Until now, genus and species demarcation were based on arbitrarily chosen filovirus genome sequence divergence values (≈50% for genera, ≈30% for species) and arbitrarily chosen phenotypic virus or virion characteristics. Here we report filovirus genome sequence-based taxon demarcation criteria using the publicly accessible PAirwise Sequencing Comparison (PASC) tool of the US National Center for Biotechnology Information (Bethesda, MD, USA). Comparison of all available filovirus genomes in GenBank using PASC revealed optimal genus demarcation at the 55-58% sequence diversity threshold range for genera and at the 23-36% sequence diversity threshold range for species. Because these thresholds do not change the current official filovirus classification, these values are now implemented as filovirus taxon demarcation criteria that may solely be used for filovirus classification in case additional data are absent. A near-complete, coding-complete, or complete filovirus genome sequence will now be required to allow official classification of any novel "filovirus." Classification of filoviruses into existing taxa or determining the need for novel taxa is now straightforward and could even become automated using a presented algorithm/flowchart rooted in RefSeq (type) sequences.

  18. A 'new lease of life': FnCpf1 possesses DNA cleavage activity for genome editing in human cells.

    PubMed

    Tu, Mengjun; Lin, Li; Cheng, Yilu; He, Xiubin; Sun, Huihui; Xie, Haihua; Fu, Junhao; Liu, Changbao; Li, Jin; Chen, Ding; Xi, Haitao; Xue, Dongyu; Liu, Qi; Zhao, Junzhao; Gao, Caixia; Song, Zongming; Qu, Jia; Gu, Feng

    2017-11-02

    Cpf1 nucleases were recently reported to be highly specific and programmable nucleases with efficiencies comparable to those of SpCas9. AsCpf1 and LbCpf1 require a single crRNA and recognize a 5'-TTTN-3' protospacer adjacent motif (PAM) at the 5' end of the protospacer for genome editing. For widespread application in precision site-specific human genome editing, the range of sequences that AsCpf1 and LbCpf1 can recognize is limited due to the size of this PAM. To address this limitation, we sought to identify a novel Cpf1 nuclease with simpler PAM requirements. Specifically, here we sought to test and engineer FnCpf1, one reported Cpf1 nuclease (FnCpf1) only requires 5'-TTN-3' as a PAM but does not exhibit detectable levels of nuclease-induced indels at certain locus in human cells. Surprisingly, we found that FnCpf1 possesses DNA cleavage activity in human cells at multiple loci. We also comprehensively and quantitatively examined various FnCpf1 parameters in human cells, including spacer sequence, direct repeat sequence and the PAM sequence. Our study identifies FnCpf1 as a new member of the Cpf1 family for human genome editing with distinctive characteristics, which shows promise as a genome editing tool with the potential for both research and therapeutic applications. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  19. A ‘new lease of life’: FnCpf1 possesses DNA cleavage activity for genome editing in human cells

    PubMed Central

    Tu, Mengjun; Lin, Li; Cheng, Yilu; He, Xiubin; Sun, Huihui; Xie, Haihua; Fu, Junhao; Liu, Changbao; Li, Jin; Chen, Ding; Xi, Haitao; Xue, Dongyu; Liu, Qi; Zhao, Junzhao; Gao, Caixia; Song, Zongming; Qu, Jia

    2017-01-01

    Abstract Cpf1 nucleases were recently reported to be highly specific and programmable nucleases with efficiencies comparable to those of SpCas9. AsCpf1 and LbCpf1 require a single crRNA and recognize a 5′-TTTN-3′ protospacer adjacent motif (PAM) at the 5′ end of the protospacer for genome editing. For widespread application in precision site-specific human genome editing, the range of sequences that AsCpf1 and LbCpf1 can recognize is limited due to the size of this PAM. To address this limitation, we sought to identify a novel Cpf1 nuclease with simpler PAM requirements. Specifically, here we sought to test and engineer FnCpf1, one reported Cpf1 nuclease (FnCpf1) only requires 5′-TTN-3′ as a PAM but does not exhibit detectable levels of nuclease-induced indels at certain locus in human cells. Surprisingly, we found that FnCpf1 possesses DNA cleavage activity in human cells at multiple loci. We also comprehensively and quantitatively examined various FnCpf1 parameters in human cells, including spacer sequence, direct repeat sequence and the PAM sequence. Our study identifies FnCpf1 as a new member of the Cpf1 family for human genome editing with distinctive characteristics, which shows promise as a genome editing tool with the potential for both research and therapeutic applications. PMID:28977650

  20. A Bioinformatic Strategy for the Detection, Classification and Analysis of Bacterial Autotransporters

    PubMed Central

    Celik, Nermin; Webb, Chaille T.; Leyton, Denisse L.; Holt, Kathryn E.; Heinz, Eva; Gorrell, Rebecca; Kwok, Terry; Naderer, Thomas; Strugnell, Richard A.; Speed, Terence P.; Teasdale, Rohan D.; Likić, Vladimir A.; Lithgow, Trevor

    2012-01-01

    Autotransporters are secreted proteins that are assembled into the outer membrane of bacterial cells. The passenger domains of autotransporters are crucial for bacterial pathogenesis, with some remaining attached to the bacterial surface while others are released by proteolysis. An enigma remains as to whether autotransporters should be considered a class of secretion system, or simply a class of substrate with peculiar requirements for their secretion. We sought to establish a sensitive search protocol that could identify and characterize diverse autotransporters from bacterial genome sequence data. The new sequence analysis pipeline identified more than 1500 autotransporter sequences from diverse bacteria, including numerous species of Chlamydiales and Fusobacteria as well as all classes of Proteobacteria. Interrogation of the proteins revealed that there are numerous classes of passenger domains beyond the known proteases, adhesins and esterases. In addition the barrel-domain-a characteristic feature of autotransporters-was found to be composed from seven conserved sequence segments that can be arranged in multiple ways in the tertiary structure of the assembled autotransporter. One of these conserved motifs overlays the targeting information required for autotransporters to reach the outer membrane. Another conserved and diagnostic motif maps to the linker region between the passenger domain and barrel-domain, indicating it as an important feature in the assembly of autotransporters. PMID:22905239

  1. Down-regulation of Rab5 decreases characteristics associated with maintenance of cell transformation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Silva, Patricio; Soto, Nicolás; Díaz, Jorge

    2015-08-21

    The early endosomal protein Rab5 is highly expressed in tumor samples, although a causal relationship between Rab5 expression and cell transformation has not been established. Here, we report the functional effects of targeting endogenous Rab5 with specific shRNA sequences in different tumor cell lines. Rab5 down-regulation in B16-F10 cells decreased tumor formation by subcutaneous injection into C57/BL6 mice. Accordingly, Rab5 targeting in B16-F10 and A549, but not MDA-MB-231 cells was followed by decreased cell proliferation, increased apoptosis and decreased anchorage-independent growth. These findings suggest that Rab5 expression is required to maintain characteristics associated with cell transformation. - Highlights: • Rab5more » is important to the maintenance of cell transformation characteristics. • Down-regulation of Rab5 decreases cell proliferation and increases apoptosis in different cancer cells. • Rab5 is required for anchorage-independent growth and tumorigenicity in-vivo.« less

  2. A Window Into Clinical Next-Generation Sequencing-Based Oncology Testing Practices.

    PubMed

    Nagarajan, Rakesh; Bartley, Angela N; Bridge, Julia A; Jennings, Lawrence J; Kamel-Reid, Suzanne; Kim, Annette; Lazar, Alexander J; Lindeman, Neal I; Moncur, Joel; Rai, Alex J; Routbort, Mark J; Vasalos, Patricia; Merker, Jason D

    2017-12-01

    - Detection of acquired variants in cancer is a paradigm of precision medicine, yet little has been reported about clinical laboratory practices across a broad range of laboratories. - To use College of American Pathologists proficiency testing survey results to report on the results from surveys on next-generation sequencing-based oncology testing practices. - College of American Pathologists proficiency testing survey results from more than 250 laboratories currently performing molecular oncology testing were used to determine laboratory trends in next-generation sequencing-based oncology testing. - These presented data provide key information about the number of laboratories that currently offer or are planning to offer next-generation sequencing-based oncology testing. Furthermore, we present data from 60 laboratories performing next-generation sequencing-based oncology testing regarding specimen requirements and assay characteristics. The findings indicate that most laboratories are performing tumor-only targeted sequencing to detect single-nucleotide variants and small insertions and deletions, using desktop sequencers and predesigned commercial kits. Despite these trends, a diversity of approaches to testing exists. - This information should be useful to further inform a variety of topics, including national discussions involving clinical laboratory quality systems, regulation and oversight of next-generation sequencing-based oncology testing, and precision oncology efforts in a data-driven manner.

  3. High-Throughput Block Optical DNA Sequence Identification.

    PubMed

    Sagar, Dodderi Manjunatha; Korshoj, Lee Erik; Hanson, Katrina Bethany; Chowdhury, Partha Pratim; Otoupal, Peter Britton; Chatterjee, Anushree; Nagpal, Prashant

    2018-01-01

    Optical techniques for molecular diagnostics or DNA sequencing generally rely on small molecule fluorescent labels, which utilize light with a wavelength of several hundred nanometers for detection. Developing a label-free optical DNA sequencing technique will require nanoscale focusing of light, a high-throughput and multiplexed identification method, and a data compression technique to rapidly identify sequences and analyze genomic heterogeneity for big datasets. Such a method should identify characteristic molecular vibrations using optical spectroscopy, especially in the "fingerprinting region" from ≈400-1400 cm -1 . Here, surface-enhanced Raman spectroscopy is used to demonstrate label-free identification of DNA nucleobases with multiplexed 3D plasmonic nanofocusing. While nanometer-scale mode volumes prevent identification of single nucleobases within a DNA sequence, the block optical technique can identify A, T, G, and C content in DNA k-mers. The content of each nucleotide in a DNA block can be a unique and high-throughput method for identifying sequences, genes, and other biomarkers as an alternative to single-letter sequencing. Additionally, coupling two complementary vibrational spectroscopy techniques (infrared and Raman) can improve block characterization. These results pave the way for developing a novel, high-throughput block optical sequencing method with lossy genomic data compression using k-mer identification from multiplexed optical data acquisition. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  4. A Data Type for Efficient Representation of Other Data Types

    NASA Technical Reports Server (NTRS)

    James, Mark

    2008-01-01

    A self-organizing, monomorphic data type denoted a sequence has been conceived to address certain concerns that arise in programming parallel computers. A sequence in the present sense can be regarded abstractly as a vector, set, bag, queue, or other construct. Heretofore, in programming a parallel computer, it has been necessary for the programmer to state explicitly, at the outset, what parts of the program and the underlying data structures must be represented in parallel form. Not only is this requirement not optimal from the perspective of implementation; it entails an additional requirement that the programmer have intimate understanding of the underlying parallel structure. The present sequence data type overcomes both the implementation and parallel structure obstacles. In so doing, the sequence data type provides unified means by which the programmer can represent a data structure for natural and automatic decomposition to a parallel computing architecture. Sequences exhibit the behavioral and structural characteristics of vectors, but the underlying representations are automatically synthesized from combinations of programmers advice and execution use metrics. Sequences can vary bidirectionally between sparseness and density, making them excellent choices for many kinds of algorithms. The novelty and benefit of this behavior lies in the fact that it can relieve programmers of the details of implementations. The creation of a sequence enables decoupling of a conceptual representation from an implementation. The underlying representation of a sequence is a hybrid of representations composed of vectors, linked lists, connected blocks, and hash tables. The internal structure of a sequence can automatically change from time to time on the basis of how it is being used. Those portions of a sequence where elements have not been added or removed can be as efficient as vectors. As elements are inserted and removed in a given portion, then different methods are utilized to provide both an access and memory strategy that is optimized for that portion and the use to which it is put.

  5. SELEX and SHAPE reveal that sequence motifs and an extended hairpin in the 5' portion of Turnip crinkle virus satellite RNA C mediate fitness in plants.

    PubMed

    Bayne, Charlie F; Widawski, Max E; Gao, Feng; Masab, Mohammed H; Chattopadhyay, Maitreyi; Murawski, Allison M; Sansevere, Robert M; Lerner, Bryan D; Castillo, Rinaldys J; Griesman, Trevor; Fu, Jiantao; Hibben, Jennifer K; Garcia-Perez, Alma D; Simon, Anne E; Kushner, David B

    2018-07-01

    Noncoding RNAs use their sequence and/or structure to mediate function(s). The 5' portion (166 nt) of the 356-nt noncoding satellite RNA C (satC) of Turnip crinkle virus (TCV) was previously modeled to contain a central region with two stem-loops (H6 and H7) and a large connecting hairpin (H2). We now report that in vivo functional selection (SELEX) experiments assessing sequence/structure requirements in H2, H6, and H7 reveal that H6 loop sequence motifs were recovered at nonrandom rates and only some residues are proposed to base-pair with accessible complementary sequences within the 5' central region. In vitro SHAPE of SELEX winners indicates that the central region is heavily base-paired, such that along with the lower stem and H2 region, one extensive hairpin exists composing the entire 5' region. As these SELEX winners are highly fit, these characteristics facilitate satRNA amplification in association with TCV in plants. Copyright © 2018 Elsevier Inc. All rights reserved.

  6. Momentum management strategy during Space Station buildup

    NASA Technical Reports Server (NTRS)

    Bishop, Lynda; Malchow, Harvey; Hattis, Philip

    1988-01-01

    The use of momentum storage devices to control effectors for Space Station attitude control throughout the buildup sequence is discussed. Particular attention is given to the problem of providing satisfactory management of momentum storage effectors throughout buildup while experiencing variable torque loading. Continuous and discrete control strategies are compared and the effects of alternative control moment gyro strategies on peak momentum storage requirements and on commanded maneuver characteristics are described.

  7. Does order matter? Investigating the effect of sequence on glance duration during on-road driving

    PubMed Central

    Roberts, Shannon C.; Reimer, Bryan; Mehler, Bruce

    2017-01-01

    Previous literature has shown that vehicle crash risks increases as drivers’ off-road glance duration increases. Many factors influence drivers’ glance duration such as individual differences, driving environment, or task characteristics. Theories and past studies suggest that glance duration increases as the task progresses, but the exact relationship between glance sequence and glance durations is not fully understood. The purpose of this study was to examine the effect of glance sequence on glance duration among drivers completing a visual-manual radio tuning task and an auditory-vocal based multi-modal navigation entry task. Eighty participants drove a vehicle on urban highways while completing radio tuning and navigation entry tasks. Forty participants drove under an experimental protocol that required three button presses followed by rotation of a tuning knob to complete the radio tuning task while the other forty participants completed the task with one less button press. Multiple statistical analyses were conducted to measure the effect of glance sequence on glance duration. Results showed that across both tasks and a variety of statistical tests, glance sequence had inconsistent effects on glance duration—the effects varied according to the number of glances, task type, and data set that was being evaluated. Results suggest that other aspects of the task as well as interface design effect glance duration and should be considered in the context of examining driver attention or lack thereof. All in all, interface design and task characteristics have a more influential impact on glance duration than glance sequence, suggesting that classical design considerations impacting driver attention, such as the size and location of buttons, remain fundamental in designing in-vehicle interfaces. PMID:28158301

  8. Rescaled earthquake recurrence time statistics: application to microrepeaters

    NASA Astrophysics Data System (ADS)

    Goltz, Christian; Turcotte, Donald L.; Abaimov, Sergey G.; Nadeau, Robert M.; Uchida, Naoki; Matsuzawa, Toru

    2009-01-01

    Slip on major faults primarily occurs during `characteristic' earthquakes. The recurrence statistics of characteristic earthquakes play an important role in seismic hazard assessment. A major problem in determining applicable statistics is the short sequences of characteristic earthquakes that are available worldwide. In this paper, we introduce a rescaling technique in which sequences can be superimposed to establish larger numbers of data points. We consider the Weibull and log-normal distributions, in both cases we rescale the data using means and standard deviations. We test our approach utilizing sequences of microrepeaters, micro-earthquakes which recur in the same location on a fault. It seems plausible to regard these earthquakes as a miniature version of the classic characteristic earthquakes. Microrepeaters are much more frequent than major earthquakes, leading to longer sequences for analysis. In this paper, we present results for the analysis of recurrence times for several microrepeater sequences from Parkfield, CA as well as NE Japan. We find that, once the respective sequence can be considered to be of sufficient stationarity, the statistics can be well fitted by either a Weibull or a log-normal distribution. We clearly demonstrate this fact by our technique of rescaled combination. We conclude that the recurrence statistics of the microrepeater sequences we consider are similar to the recurrence statistics of characteristic earthquakes on major faults.

  9. SequenceCEROSENE: a computational method and web server to visualize spatial residue neighborhoods at the sequence level.

    PubMed

    Heinke, Florian; Bittrich, Sebastian; Kaiser, Florian; Labudde, Dirk

    2016-01-01

    To understand the molecular function of biopolymers, studying their structural characteristics is of central importance. Graphics programs are often utilized to conceive these properties, but with the increasing number of available structures in databases or structure models produced by automated modeling frameworks this process requires assistance from tools that allow automated structure visualization. In this paper a web server and its underlying method for generating graphical sequence representations of molecular structures is presented. The method, called SequenceCEROSENE (color encoding of residues obtained by spatial neighborhood embedding), retrieves the sequence of each amino acid or nucleotide chain in a given structure and produces a color coding for each residue based on three-dimensional structure information. From this, color-highlighted sequences are obtained, where residue coloring represent three-dimensional residue locations in the structure. This color encoding thus provides a one-dimensional representation, from which spatial interactions, proximity and relations between residues or entire chains can be deduced quickly and solely from color similarity. Furthermore, additional heteroatoms and chemical compounds bound to the structure, like ligands or coenzymes, are processed and reported as well. To provide free access to SequenceCEROSENE, a web server has been implemented that allows generating color codings for structures deposited in the Protein Data Bank or structure models uploaded by the user. Besides retrieving visualizations in popular graphic formats, underlying raw data can be downloaded as well. In addition, the server provides user interactivity with generated visualizations and the three-dimensional structure in question. Color encoded sequences generated by SequenceCEROSENE can aid to quickly perceive the general characteristics of a structure of interest (or entire sets of complexes), thus supporting the researcher in the initial phase of structure-based studies. In this respect, the web server can be a valuable tool, as users are allowed to process multiple structures, quickly switch between results, and interact with generated visualizations in an intuitive manner. The SequenceCEROSENE web server is available at https://biosciences.hs-mittweida.de/seqcerosene.

  10. How should Fitts' Law be applied to human-computer interaction?

    NASA Technical Reports Server (NTRS)

    Gillan, D. J.; Holden, K.; Adam, S.; Rudisill, M.; Magee, L.

    1992-01-01

    The paper challenges the notion that any Fitts' Law model can be applied generally to human-computer interaction, and proposes instead that applying Fitts' Law requires knowledge of the users' sequence of movements, direction of movement, and typical movement amplitudes as well as target sizes. Two experiments examined a text selection task with sequences of controlled movements (point-click and point-drag). For the point-click sequence, a Fitts' Law model that used the diagonal across the text object in the direction of pointing (rather than the horizontal extent of the text object) as the target size provided the best fit for the pointing time data, whereas for the point-drag sequence, a Fitts' Law model that used the vertical size of the text object as the target size gave the best fit. Dragging times were fitted well by Fitts' Law models that used either the vertical or horizontal size of the terminal character in the text object. Additional results of note were that pointing in the point-click sequence was consistently faster than in the point-drag sequence, and that pointing in either sequence was consistently faster than dragging. The discussion centres around the need to define task characteristics before applying Fitts' Law to an interface design or analysis, analyses of pointing and of dragging, and implications for interface design.

  11. Spread Spectrum Receiver Electromagnetic Interference (EMI) Test Guide

    NASA Technical Reports Server (NTRS)

    Wheeler, M. L.

    1998-01-01

    The objective of this test guide is to document appropriate unit level test methods and techniques for the performance of EMI testing of Direct Sequence (DS) spread spectrum receivers. Consideration of EMI test methods tailored for spread spectrum receivers utilizing frequency spreading, techniques other than direct sequence (such as frequency hopping, frequency chirping, and various hybrid methods) is beyond the scope of this test guide development program and is not addressed as part of this document EMI test requirements for NASA programs are primarily developed based on the requirements contained in MIL-STD-46 1 D (or earlier revisions of MIL-STD-46 1). The corresponding test method guidelines for the MIL-STD-461 D tests are provided in MIL-STD-462D. These test methods are well documented with the exception of the receiver antenna port susceptibility tests (intermodulation, cross modulation, and rejection of undesired signals) which must be tailored to the specific type of receiver that is being tested. Thus, test methods addressed in this guide consist only of antenna port tests designed to evaluate receiver susceptibility characteristics. MIL-STD-462D should be referred for guidance pertaining to test methods for EMI tests other than the antenna port tests. The scope of this test guide includes: (1) a discussion of generic DS receiver performance characteristics; (2) a summary of S-band TDRSS receiver operation; (3) a discussion of DS receiver EMI susceptibility mechanisms and characteristics; (4) a summary of military standard test guidelines; (5) recommended test approach and methods; and (6) general conclusions and recommendations for future studies in the area of spread spectrum receiver testing.

  12. Integration deficiencies associated with continuous limb movement sequences in Parkinson's disease.

    PubMed

    Park, Jin-Hoon; Stelmach, George E

    2009-11-01

    The present study examined the extent to which Parkinson's disease (PD) influences integration of continuous limb movement sequences. Eight patients with idiopathic PD and 8 age-matched normal subjects were instructed to perform repetitive sequential aiming movements to specified targets under three-accuracy constraints: 1) low accuracy (W = 7 cm) - minimal accuracy constraint, 2) high accuracy (W = 0.64 cm) - maximum accuracy constraint, and 3) mixed accuracy constraint - one target of high accuracy and another target of low accuracy. The characteristic of sequential movements in the low accuracy condition was mostly cyclical, whereas in the high accuracy condition it was discrete in both groups. When the accuracy constraint was mixed, the sequential movements were executed by assembling discrete and cyclical movements in both groups, suggesting that for PD patients the capability to combine discrete and cyclical movements to meet a task requirement appears to be intact. However, such functional linkage was not as pronounced as was in normal subjects. Close examination of movement from the mixed accuracy condition revealed marked movement hesitations in the vicinity of the large target in PD patients, resulting in a bias toward discrete movement. These results suggest that PD patients may have deficits in ongoing planning and organizing processes during movement execution when the tasks require to assemble various accuracy requirements into more complex movement sequences.

  13. Noncoding sequence classification based on wavelet transform analysis: part II

    NASA Astrophysics Data System (ADS)

    Paredes, O.; Strojnik, M.; Romo-Vázquez, R.; Vélez-Pérez, H.; Ranta, R.; Garcia-Torales, G.; Scholl, M. K.; Morales, J. A.

    2017-09-01

    DNA sequences in human genome can be divided into the coding and noncoding ones. We hypothesize that the characteristic periodicities of the noncoding sequences are related to their function. We describe the procedure to identify these characteristic periodicities using the wavelet analysis. Our results show that three groups of noncoding sequences, each one with different biological function, may be differentiated by their wavelet coefficients within specific frequency range.

  14. Is the extraction by Whatman FTA filter matrix technology and sequencing of large ribosomal subunit D1-D2 region sufficient for identification of clinical fungi?

    PubMed

    Kiraz, Nuri; Oz, Yasemin; Aslan, Huseyin; Erturan, Zayre; Ener, Beyza; Akdagli, Sevtap Arikan; Muslumanoglu, Hamza; Cetinkaya, Zafer

    2015-10-01

    Although conventional identification of pathogenic fungi is based on the combination of tests evaluating their morphological and biochemical characteristics, they can fail to identify the less common species or the differentiation of closely related species. In addition these tests are time consuming, labour-intensive and require experienced personnel. We evaluated the feasibility and sufficiency of DNA extraction by Whatman FTA filter matrix technology and DNA sequencing of D1-D2 region of the large ribosomal subunit gene for identification of clinical isolates of 21 yeast and 160 moulds in our clinical mycology laboratory. While the yeast isolates were identified at species level with 100% homology, 102 (63.75%) clinically important mould isolates were identified at species level, 56 (35%) isolates at genus level against fungal sequences existing in DNA databases and two (1.25%) isolates could not be identified. Consequently, Whatman FTA filter matrix technology was a useful method for extraction of fungal DNA; extremely rapid, practical and successful. Sequence analysis strategy of D1-D2 region of the large ribosomal subunit gene was found considerably sufficient in identification to genus level for the most clinical fungi. However, the identification to species level and especially discrimination of closely related species may require additional analysis. © 2015 Blackwell Verlag GmbH.

  15. Robust one-Tube Ω-PCR Strategy Accelerates Precise Sequence Modification of Plasmids for Functional Genomics

    PubMed Central

    Chen, Letian; Wang, Fengpin; Wang, Xiaoyu; Liu, Yao-Guang

    2013-01-01

    Functional genomics requires vector construction for protein expression and functional characterization of target genes; therefore, a simple, flexible and low-cost molecular manipulation strategy will be highly advantageous for genomics approaches. Here, we describe a Ω-PCR strategy that enables multiple types of sequence modification, including precise insertion, deletion and substitution, in any position of a circular plasmid. Ω-PCR is based on an overlap extension site-directed mutagenesis technique, and is named for its characteristic Ω-shaped secondary structure during PCR. Ω-PCR can be performed either in two steps, or in one tube in combination with exonuclease I treatment. These strategies have wide applications for protein engineering, gene function analysis and in vitro gene splicing. PMID:23335613

  16. Rapid Flow Cytometry-Based Test for the Diagnosis of Lipopolysaccharide Responsive Beige-Like Anchor (LRBA) Deficiency.

    PubMed

    Gámez-Díaz, Laura; Sigmund, Elena C; Reiser, Veronika; Vach, Werner; Jung, Sophie; Grimbacher, Bodo

    2018-01-01

    The diagnosis of lipopolysaccharide-responsive beige-like-anchor-protein (LRBA) deficiency currently relies on gene sequencing approaches that do not support a timely diagnosis and clinical management. We developed a rapid and sensitive test for clinical implementation based on the detection of LRBA protein by flow cytometry in peripheral blood cells after stimulation. LRBA protein was assessed in a prospective cohort of 54 healthy donors and 57 patients suspected of LRBA deficiency. Receiver operating characteristics analysis suggested an LRBA:MFI ratio cutoff point of 2.6 to identify LRBA-deficient patients by FACS with 94% sensitivity and 80% specificity and to discriminate them from patients with a similar clinical picture but other disease-causing mutations. This easy flow cytometry-based assay allows a fast screening of patients with suspicion of LRBA deficiency reducing therefore the number of patients requiring LRBA sequencing and accelerating the treatment implementation. Detection of biallelic mutations in LRBA is however required for a definitive diagnosis.

  17. Widespread and Persistent Populations of a Major New Marine Actinomycete Taxon in Ocean Sediments

    PubMed Central

    Mincer, Tracy J.; Jensen, Paul R.; Kauffman, Christopher A.; Fenical, William

    2002-01-01

    A major taxon of obligate marine bacteria within the order Actinomycetales has been discovered from ocean sediments. Populations of these bacteria (designated MAR 1) are persistent and widespread, spanning at least three distinct ocean systems. In this study, 212 actinomycete isolates possessing MAR 1 morphologies were examined and all but two displayed an obligate requirement of seawater for growth. Forty-five of these isolates, representing all observed seawater-requiring morphotypes, were partially sequenced and found to share characteristic small-subunit rRNA signature nucleotides between positions 207 and 468 (Escherichia coli numbering). Phylogenetic characterization of seven representative isolates based on almost complete sequences of genes encoding 16S rRNA (16S ribosomal DNA) yielded a monophyletic clade within the family Micromonosporaceae and suggests novelty at the genus level. This is the first evidence for the existence of widespread populations of obligate marine actinomycetes. Organic extracts from cultured members of this new group exhibit remarkable biological activity, suggesting that they represent a prolific resource for biotechnological applications. PMID:12324350

  18. Sequence fingerprints distinguish erroneous from correct predictions of intrinsically disordered protein regions.

    PubMed

    Saravanan, Konda Mani; Dunker, A Keith; Krishnaswamy, Sankaran

    2017-12-27

    More than 60 prediction methods for intrinsically disordered proteins (IDPs) have been developed over the years, many of which are accessible on the World Wide Web. Nearly, all of these predictors give balanced accuracies in the ~65%-~80% range. Since predictors are not perfect, further studies are required to uncover the role of amino acid residues in native IDP as compared to predicted IDP regions. In the present work, we make use of sequences of 100% predicted IDP regions, false positive disorder predictions, and experimentally determined IDP regions to distinguish the characteristics of native versus predicted IDP regions. A higher occurrence of asparagine is observed in sequences of native IDP regions but not in sequences of false positive predictions of IDP regions. The occurrences of certain combinations of amino acids at the pentapeptide level provide a distinguishing feature in the IDPs with respect to globular proteins. The distinguishing features presented in this paper provide insights into the sequence fingerprints of amino acid residues in experimentally determined as compared to predicted IDP regions. These observations and additional work along these lines should enable the development of improvements in the accuracy of disorder prediction algorithm.

  19. Shuttle cryogenics supply system. Optimization study. Volume 5 B-4: Programmers manual for space shuttle orbit injection analysis (SOPSA)

    NASA Technical Reports Server (NTRS)

    1973-01-01

    A computer program for space shuttle orbit injection propulsion system analysis (SOPSA) is described to show the operational characteristics and the computer system requirements. The program was developed as an analytical tool to aid in the preliminary design of propellant feed systems for the space shuttle orbiter main engines. The primary purpose of the program is to evaluate the propellant tank ullage pressure requirements imposed by the need to accelerate propellants rapidly during the engine start sequence. The SOPSA program will generate parametric feed system pressure histories and weight data for a range of nominal feedline sizes.

  20. MRI and MRA of spinal cord arteriovenous shunts.

    PubMed

    Condette-Auliac, Stéphanie; Boulin, Anne; Roccatagliata, Luca; Coskun, Oguzhan; Guieu, Stéphanie; Guedin, Pierre; Rodesch, Georges

    2014-12-01

    The purpose of this review is to describe the diagnostic criteria for spinal cord arteriovenous shunts (SCAVSs) when using magnetic resonance imaging (MRI) and magnetic resonance angiography (MRA), and to discuss the extent to which the different MRI and MRA sequences and technical parameters provide the information that is required to diagnose these lesions properly. SCAVSs are divided into four groups according to location (paraspinal, epidural, dural, or intradural) and type (fistula or nidus); each type of lesion is described. SCAVSs are responsible for neurological symptoms due to spinal cord or nerve root involvement. MRI is usually the first examination performed when a spinal cord lesion is suspected. Recognition of the image characteristics of vascular lesions is mandatory if useful sequences are to be performed-especially MRA sequences. Because the treatment of SCAVSs relies mainly on endovascular therapies, MRI and MRA help with the planning of the angiographic procedure. We explain the choice of MRA sequences and parameters, the advantages and pitfalls to be aware of in order to obtain the best visualization, and the analysis of each lesion. © 2014 Wiley Periodicals, Inc.

  1. Evolutionary and biophysical relationships among the papillomavirus E2 proteins.

    PubMed

    Blakaj, Dukagjin M; Fernandez-Fuentes, Narcis; Chen, Zigui; Hegde, Rashmi; Fiser, Andras; Burk, Robert D; Brenowitz, Michael

    2009-01-01

    Infection by human papillomavirus (HPV) may result in clinical conditions ranging from benign warts to invasive cancer. The HPV E2 protein represses oncoprotein transcription and is required for viral replication. HPV E2 binds to palindromic DNA sequences of highly conserved four base pair sequences flanking an identical length variable 'spacer'. E2 proteins directly contact the conserved but not the spacer DNA. Variation in naturally occurring spacer sequences results in differential protein affinity that is dependent on their sensitivity to the spacer DNA's unique conformational and/or dynamic properties. This article explores the biophysical character of this core viral protein with the goal of identifying characteristics that associated with risk of virally caused malignancy. The amino acid sequence, 3d structure and electrostatic features of the E2 protein DNA binding domain are highly conserved; specific interactions with DNA binding sites have also been conserved. In contrast, the E2 protein's transactivation domain does not have extensive surfaces of highly conserved residues. Rather, regions of high conservation are localized to small surface patches. Implications to cancer biology are discussed.

  2. Genome-Wide Identification and Comparative Analysis of Albumin Family in Vertebrates

    PubMed Central

    Li, Shugang; Cao, Yiping; Geng, Fang

    2017-01-01

    Albumins are the most well-known globular proteins, and the most typical representatives are the serum albumins. However, less attention was paid to the albumin family, except for the human and bovine serum albumin. To characterize the features of albumin family, we have mined all the putative albumin proteins from the available genome sequences. The results showed that albumin is widely distributed in vertebrates, but not present in the bacteria and archaea. The phylogenetic analysis of vertebrate albumin family implied an evolutionary relationship between members of serum albumin, α-fetoprotein, vitamin D–binding protein, and afamin. Meanwhile, a new member from the albumin family was found, namely, extracellular matrix protein 1. The structural analysis revealed that the motifs for forming the internal disulfide bonds are highly conserved in the albumin family, despite the low overall sequence identity across the family. The domain arrangement of albumin proteins indicated that most of vertebrate albumins contain 3 characteristic domains, arising from 2 evolutionary patterns. And a significant trend has been observed that the albumin proteins in higher vertebrate species tend to possess more characteristic domains. This study has provided the fundamental information required for achieving a better understanding of the albumin distribution, phylogenetic relationship, characteristic motif, structure, and new insights into the evolutionary pattern. PMID:28680266

  3. Identification of characteristic oligonucleotides in the bacterial 16S ribosomal RNA sequence dataset

    NASA Technical Reports Server (NTRS)

    Zhang, Zhengdong; Willson, Richard C.; Fox, George E.

    2002-01-01

    MOTIVATION: The phylogenetic structure of the bacterial world has been intensively studied by comparing sequences of 16S ribosomal RNA (16S rRNA). This database of sequences is now widely used to design probes for the detection of specific bacteria or groups of bacteria one at a time. The success of such methods reflects the fact that there are local sequence segments that are highly characteristic of particular organisms or groups of organisms. It is not clear, however, the extent to which such signature sequences exist in the 16S rRNA dataset. A better understanding of the numbers and distribution of highly informative oligonucleotide sequences may facilitate the design of hybridization arrays that can characterize the phylogenetic position of an unknown organism or serve as the basis for the development of novel approaches for use in bacterial identification. RESULTS: A computer-based algorithm that characterizes the extent to which any individual oligonucleotide sequence in 16S rRNA is characteristic of any particular bacterial grouping was developed. A measure of signature quality, Q(s), was formulated and subsequently calculated for every individual oligonucleotide sequence in the size range of 5-11 nucleotides and for 15mers with reference to each cluster and subcluster in a 929 organism representative phylogenetic tree. Subsequently, the perfect signature sequences were compared to the full set of 7322 sequences to see how common false positives were. The work completed here establishes beyond any doubt that highly characteristic oligonucleotides exist in the bacterial 16S rRNA sequence dataset in large numbers. Over 16,000 15mers were identified that might be useful as signatures. Signature oligonucleotides are available for over 80% of the nodes in the representative tree.

  4. Comparison of Intracellular "Ca. Endomicrobium Trichonymphae" Genomovars Illuminates the Requirement and Decay of Defense Systems against Foreign DNA.

    PubMed

    Izawa, Kazuki; Kuwahara, Hirokazu; Kihara, Kumiko; Yuki, Masahiro; Lo, Nathan; Itoh, Takehiko; Ohkuma, Moriya; Hongoh, Yuichi

    2016-10-13

    "Candidatus Endomicrobium trichonymphae" (Bacteria; Elusimicrobia) is an obligate intracellular symbiont of the cellulolytic protist genus Trichonympha in the termite gut. A previous genome analysis of "Ca Endomicrobium trichonymphae" phylotype Rs-D17 (genomovar Ri2008), obtained from a Trichonympha agilis cell in the gut of the termite Reticulitermes speratus, revealed that its genome is small (1.1 Mb) and contains many pseudogenes; it is in the course of reductive genome evolution. Here we report the complete genome sequence of another Rs-D17 genomovar, Ti2015, obtained from a different T. agilis cell present in an R. speratus gut. These two genomovars share most intact protein-coding genes and pseudogenes, showing 98.6% chromosome sequence similarity. However, characteristic differences were found in their defense systems, which comprised restriction-modification and CRISPR/Cas systems. The repertoire of intact restriction-modification systems differed between the genomovars, and two of the three CRISPR/Cas loci in genomovar Ri2008 are pseudogenized or missing in genomovar Ti2015. These results suggest relaxed selection pressure for maintaining these defense systems. Nevertheless, the remaining CRISPR/Cas system in each genomovar appears to be active; none of the "spacer" sequences (112 in Ri2008 and 128 in Ti2015) were shared whereas the "repeat" sequences were identical. Furthermore, we obtained draft genomes of three additional endosymbiotic Endomicrobium phylotypes from different host protist species, and discovered multiple, intact CRISPR/Cas systems in each genome. Collectively, unlike bacteriome endosymbionts in insects, the Endomicrobium endosymbionts of termite-gut protists appear to require defense against foreign DNA, although the required level of defense has likely been reduced during their intracellular lives. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  5. YAMAT-seq: an efficient method for high-throughput sequencing of mature transfer RNAs

    PubMed Central

    Shigematsu, Megumi; Honda, Shozo; Loher, Phillipe; Telonis, Aristeidis G.; Rigoutsos, Isidore

    2017-01-01

    Abstract Besides translation, transfer RNAs (tRNAs) play many non-canonical roles in various biological pathways and exhibit highly variable expression profiles. To unravel the emerging complexities of tRNA biology and molecular mechanisms underlying them, an efficient tRNA sequencing method is required. However, the rigid structure of tRNA has been presenting a challenge to the development of such methods. We report the development of Y-shaped Adapter-ligated MAture TRNA sequencing (YAMAT-seq), an efficient and convenient method for high-throughput sequencing of mature tRNAs. YAMAT-seq circumvents the issue of inefficient adapter ligation, a characteristic of conventional RNA sequencing methods for mature tRNAs, by employing the efficient and specific ligation of Y-shaped adapter to mature tRNAs using T4 RNA Ligase 2. Subsequent cDNA amplification and next-generation sequencing successfully yield numerous mature tRNA sequences. YAMAT-seq has high specificity for mature tRNAs and high sensitivity to detect most isoacceptors from minute amount of total RNA. Moreover, YAMAT-seq shows quantitative capability to estimate expression levels of mature tRNAs, and has high reproducibility and broad applicability for various cell lines. YAMAT-seq thus provides high-throughput technique for identifying tRNA profiles and their regulations in various transcriptomes, which could play important regulatory roles in translation and other biological processes. PMID:28108659

  6. Protein classification using modified n-grams and skip-grams.

    PubMed

    Islam, S M Ashiqul; Heil, Benjamin J; Kearney, Christopher Michel; Baker, Erich J

    2018-05-01

    Classification by supervised machine learning greatly facilitates the annotation of protein characteristics from their primary sequence. However, the feature generation step in this process requires detailed knowledge of attributes used to classify the proteins. Lack of this knowledge risks the selection of irrelevant features, resulting in a faulty model. In this study, we introduce a supervised protein classification method with a novel means of automating the work-intensive feature generation step via a Natural Language Processing (NLP)-dependent model, using a modified combination of n-grams and skip-grams (m-NGSG). A meta-comparison of cross-validation accuracy with twelve training datasets from nine different published studies demonstrates a consistent increase in accuracy of m-NGSG when compared to contemporary classification and feature generation models. We expect this model to accelerate the classification of proteins from primary sequence data and increase the accessibility of protein characteristic prediction to a broader range of scientists. m-NGSG is freely available at Bitbucket: https://bitbucket.org/sm_islam/mngsg/src. A web server is available at watson.ecs.baylor.edu/ngsg. erich_baker@baylor.edu. Supplementary data are available at Bioinformatics online.

  7. A Nonparametric Approach For Representing Interannual Dependence In Monthly Streamflow Sequences

    NASA Astrophysics Data System (ADS)

    Sharma, A.; Oneill, R.

    The estimation of risks associated with water management plans requires generation of synthetic streamflow sequences. The mathematical algorithms used to generate these sequences at monthly time scales are found lacking in two main respects: inability in preserving dependence attributes particularly at large (seasonal to interannual) time lags; and, a poor representation of observed distributional characteristics, in partic- ular, representation of strong assymetry or multimodality in the probability density function. Proposed here is an alternative that naturally incorporates both observed de- pendence and distributional attributes in the generated sequences. Use of a nonpara- metric framework provides an effective means for representing the observed proba- bility distribution, while the use of a Svariable kernelT ensures accurate modeling of & cedil;streamflow data sets that contain a substantial number of zero flow values. A careful selection of prior flows imparts the appropriate short-term memory, while use of an SaggregateT flow variable allows representation of interannual dependence. The non- & cedil;parametric simulation model is applied to monthly flows from the Beaver River near Beaver, Utah, USA, and the Burrendong dam inflows, New South Wales, Australia. Results indicate that while the use of traditional simulation approaches leads to an inaccurate representation of dependence at long (annual and interannual) time scales, the proposed model can simulate both short and long-term dependence. As a result, the proposed model ensures a significantly improved representation of reservoir storage statistics, particularly for systems influenced by long droughts. It is important to note that the proposed method offers a simpler and better alternative to conventional dis- aggregation models as: (a) a separate annual flow series is not required, (b) stringent assumptions relating annual and monthly flows are not needed, and (c) the method does not require the specification of a "water year", instead ensuring that the sum of any sequence of flows lasting twelve months will result in the type of dependence that is observed in the historical annual flow series.

  8. Probability of coding of a DNA sequence: an algorithm to predict translated reading frames from their thermodynamic characteristics.

    PubMed Central

    Tramontano, A; Macchiato, M F

    1986-01-01

    An algorithm to determine the probability that a reading frame codifies for a protein is presented. It is based on the results of our previous studies on the thermodynamic characteristics of a translated reading frame. We also develop a prediction procedure to distinguish between coding and non-coding reading frames. The procedure is based on the characteristics of the putative product of the DNA sequence and not on periodicity characteristics of the sequence, so the prediction is not biased by the presence of overlapping translated reading frames or by the presence of translated reading frames on the complementary DNA strand. PMID:3753761

  9. Simulation of spatial and temporal properties of aftershocks by means of the fiber bundle model

    NASA Astrophysics Data System (ADS)

    Monterrubio-Velasco, Marisol; Zúñiga, F. R.; Márquez-Ramírez, Victor Hugo; Figueroa-Soto, Angel

    2017-11-01

    The rupture processes of any heterogeneous material constitute a complex physical problem. Earthquake aftershocks show temporal and spatial behaviors which are consequence of the heterogeneous stress distribution and multiple rupturing following the main shock. This process is difficult to model deterministically due to the number of parameters and physical conditions, which are largely unknown. In order to shed light on the minimum requirements for the generation of aftershock clusters, in this study, we perform a simulation of the main features of such a complex process by means of a fiber bundle (FB) type model. The FB model has been widely used to analyze the fracture process in heterogeneous materials. It is a simple but powerful tool that allows modeling the main characteristics of a medium such as the brittle shallow crust of the earth. In this work, we incorporate spatial properties, such as the Coulomb stress change pattern, which help simulate observed characteristics of aftershock sequences. In particular, we introduce a parameter ( P) that controls the probability of spatial distribution of initial loads. Also, we use a "conservation" parameter ( π), which accounts for the load dissipation of the system, and demonstrate its influence on the simulated spatio-temporal patterns. Based on numerical results, we find that P has to be in the range 0.06 < P < 0.30, whilst π needs to be limited by a very narrow range ( 0.60 < π < 0.66) in order to reproduce aftershocks pattern characteristics which resemble those of observed sequences. This means that the system requires a small difference in the spatial distribution of initial stress, and a very particular fraction of load transfer in order to generate realistic aftershocks.

  10. Recognition of prokaryotic and eukaryotic promoters using convolutional deep learning neural networks.

    PubMed

    Umarov, Ramzan Kh; Solovyev, Victor V

    2017-01-01

    Accurate computational identification of promoters remains a challenge as these key DNA regulatory regions have variable structures composed of functional motifs that provide gene-specific initiation of transcription. In this paper we utilize Convolutional Neural Networks (CNN) to analyze sequence characteristics of prokaryotic and eukaryotic promoters and build their predictive models. We trained a similar CNN architecture on promoters of five distant organisms: human, mouse, plant (Arabidopsis), and two bacteria (Escherichia coli and Bacillus subtilis). We found that CNN trained on sigma70 subclass of Escherichia coli promoter gives an excellent classification of promoters and non-promoter sequences (Sn = 0.90, Sp = 0.96, CC = 0.84). The Bacillus subtilis promoters identification CNN model achieves Sn = 0.91, Sp = 0.95, and CC = 0.86. For human, mouse and Arabidopsis promoters we employed CNNs for identification of two well-known promoter classes (TATA and non-TATA promoters). CNN models nicely recognize these complex functional regions. For human promoters Sn/Sp/CC accuracy of prediction reached 0.95/0.98/0,90 on TATA and 0.90/0.98/0.89 for non-TATA promoter sequences, respectively. For Arabidopsis we observed Sn/Sp/CC 0.95/0.97/0.91 (TATA) and 0.94/0.94/0.86 (non-TATA) promoters. Thus, the developed CNN models, implemented in CNNProm program, demonstrated the ability of deep learning approach to grasp complex promoter sequence characteristics and achieve significantly higher accuracy compared to the previously developed promoter prediction programs. We also propose random substitution procedure to discover positionally conserved promoter functional elements. As the suggested approach does not require knowledge of any specific promoter features, it can be easily extended to identify promoters and other complex functional regions in sequences of many other and especially newly sequenced genomes. The CNNProm program is available to run at web server http://www.softberry.com.

  11. Resources and costs for microbial sequence analysis evaluated using virtual machines and cloud computing.

    PubMed

    Angiuoli, Samuel V; White, James R; Matalka, Malcolm; White, Owen; Fricke, W Florian

    2011-01-01

    The widespread popularity of genomic applications is threatened by the "bioinformatics bottleneck" resulting from uncertainty about the cost and infrastructure needed to meet increasing demands for next-generation sequence analysis. Cloud computing services have been discussed as potential new bioinformatics support systems but have not been evaluated thoroughly. We present benchmark costs and runtimes for common microbial genomics applications, including 16S rRNA analysis, microbial whole-genome shotgun (WGS) sequence assembly and annotation, WGS metagenomics and large-scale BLAST. Sequence dataset types and sizes were selected to correspond to outputs typically generated by small- to midsize facilities equipped with 454 and Illumina platforms, except for WGS metagenomics where sampling of Illumina data was used. Automated analysis pipelines, as implemented in the CloVR virtual machine, were used in order to guarantee transparency, reproducibility and portability across different operating systems, including the commercial Amazon Elastic Compute Cloud (EC2), which was used to attach real dollar costs to each analysis type. We found considerable differences in computational requirements, runtimes and costs associated with different microbial genomics applications. While all 16S analyses completed on a single-CPU desktop in under three hours, microbial genome and metagenome analyses utilized multi-CPU support of up to 120 CPUs on Amazon EC2, where each analysis completed in under 24 hours for less than $60. Representative datasets were used to estimate maximum data throughput on different cluster sizes and to compare costs between EC2 and comparable local grid servers. Although bioinformatics requirements for microbial genomics depend on dataset characteristics and the analysis protocols applied, our results suggests that smaller sequencing facilities (up to three Roche/454 or one Illumina GAIIx sequencer) invested in 16S rRNA amplicon sequencing, microbial single-genome and metagenomics WGS projects can achieve cost-efficient bioinformatics support using CloVR in combination with Amazon EC2 as an alternative to local computing centers.

  12. Resources and Costs for Microbial Sequence Analysis Evaluated Using Virtual Machines and Cloud Computing

    PubMed Central

    Angiuoli, Samuel V.; White, James R.; Matalka, Malcolm; White, Owen; Fricke, W. Florian

    2011-01-01

    Background The widespread popularity of genomic applications is threatened by the “bioinformatics bottleneck” resulting from uncertainty about the cost and infrastructure needed to meet increasing demands for next-generation sequence analysis. Cloud computing services have been discussed as potential new bioinformatics support systems but have not been evaluated thoroughly. Results We present benchmark costs and runtimes for common microbial genomics applications, including 16S rRNA analysis, microbial whole-genome shotgun (WGS) sequence assembly and annotation, WGS metagenomics and large-scale BLAST. Sequence dataset types and sizes were selected to correspond to outputs typically generated by small- to midsize facilities equipped with 454 and Illumina platforms, except for WGS metagenomics where sampling of Illumina data was used. Automated analysis pipelines, as implemented in the CloVR virtual machine, were used in order to guarantee transparency, reproducibility and portability across different operating systems, including the commercial Amazon Elastic Compute Cloud (EC2), which was used to attach real dollar costs to each analysis type. We found considerable differences in computational requirements, runtimes and costs associated with different microbial genomics applications. While all 16S analyses completed on a single-CPU desktop in under three hours, microbial genome and metagenome analyses utilized multi-CPU support of up to 120 CPUs on Amazon EC2, where each analysis completed in under 24 hours for less than $60. Representative datasets were used to estimate maximum data throughput on different cluster sizes and to compare costs between EC2 and comparable local grid servers. Conclusions Although bioinformatics requirements for microbial genomics depend on dataset characteristics and the analysis protocols applied, our results suggests that smaller sequencing facilities (up to three Roche/454 or one Illumina GAIIx sequencer) invested in 16S rRNA amplicon sequencing, microbial single-genome and metagenomics WGS projects can achieve cost-efficient bioinformatics support using CloVR in combination with Amazon EC2 as an alternative to local computing centers. PMID:22028928

  13. Combustion Stability Characteristics of the Project Morpheus Liquid Oxygen / Liquid Methane Main Engine

    NASA Technical Reports Server (NTRS)

    Melcher, John C.; Morehead, Robert L.

    2014-01-01

    The project Morpheus liquid oxygen (LOX) / liquid methane (LCH4) main engine is a Johnson Space Center (JSC) designed 5,000 lbf-thrust, 4:1 throttling, pressure-fed cryogenic engine using an impinging element injector design. The engine met or exceeded all performance requirements without experiencing any in- ight failures, but the engine exhibited acoustic-coupled combustion instabilities during sea-level ground-based testing. First tangential (1T), rst radial (1R), 1T1R, and higher order modes were triggered by conditions during the Morpheus vehicle derived low chamber pressure startup sequence. The instability was never observed to initiate during mainstage, even at low power levels. Ground-interaction acoustics aggravated the instability in vehicle tests. Analysis of more than 200 hot re tests on the Morpheus vehicle and Stennis Space Center (SSC) test stand showed a relationship between ignition stability and injector/chamber pressure. The instability had the distinct characteristic of initiating at high relative injection pressure drop at low chamber pressure during the start sequence. Data analysis suggests that the two-phase density during engine start results in a high injection velocity, possibly triggering the instabilities predicted by the Hewitt stability curves. Engine ignition instability was successfully mitigated via a higher-chamber pressure start sequence (e.g., 50% power level vs 30%) and operational propellant start temperature limits that maintained \\cold LOX" and \\warm methane" at the engine inlet. The main engine successfully demonstrated 4:1 throttling without chugging during mainstage, but chug instabilities were observed during some engine shutdown sequences at low injector pressure drop, especially during vehicle landing.

  14. Design and manufacture of wheels for a dual-mode (manned - automatic) lunar surface roving vehicle. Volume 2: Proposed test plan

    NASA Technical Reports Server (NTRS)

    1970-01-01

    A developmental test plan for the wheel and wheel drive assembly of the dual-mode (manned/automated) lunar surface roving vehicle is presented. The tests cover performance, as well as critical environmental characteristics. Insofar as practical, the environmental conditions imposed will be in the sequence expected during the hardware's life from storage through the lunar mission. Test procedures are described for static load deflection and endurance tests. Soft soil tests to determine mobility characteristics including drawbar-pull and thrust vs slip, and motion resistance for various wheel loads are also discussed. Test designs for both ambient and thermal vacuum conditions are described. Facility, transducer, and instrumentation requirements are outlined.

  15. 37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...

  16. 37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...

  17. 37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...

  18. 37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...

  19. 37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...

  20. ARResT/AssignSubsets: a novel application for robust subclassification of chronic lymphocytic leukemia based on B cell receptor IG stereotypy.

    PubMed

    Bystry, Vojtech; Agathangelidis, Andreas; Bikos, Vasilis; Sutton, Lesley Ann; Baliakas, Panagiotis; Hadzidimitriou, Anastasia; Stamatopoulos, Kostas; Darzentas, Nikos

    2015-12-01

    An ever-increasing body of evidence supports the importance of B cell receptor immunoglobulin (BcR IG) sequence restriction, alias stereotypy, in chronic lymphocytic leukemia (CLL). This phenomenon accounts for ∼30% of studied cases, one in eight of which belong to major subsets, and extends beyond restricted sequence patterns to shared biologic and clinical characteristics and, generally, outcome. Thus, the robust assignment of new cases to major CLL subsets is a critical, and yet unmet, requirement. We introduce a novel application, ARResT/AssignSubsets, which enables the robust assignment of BcR IG sequences from CLL patients to major stereotyped subsets. ARResT/AssignSubsets uniquely combines expert immunogenetic sequence annotation from IMGT/V-QUEST with curation to safeguard quality, statistical modeling of sequence features from more than 7500 CLL patients, and results from multiple perspectives to allow for both objective and subjective assessment. We validated our approach on the learning set, and evaluated its real-world applicability on a new representative dataset comprising 459 sequences from a single institution. ARResT/AssignSubsets is freely available on the web at http://bat.infspire.org/arrest/assignsubsets/ nikos.darzentas@gmail.com. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  1. A Novel Computational Strategy to Identify A-to-I RNA Editing Sites by RNA-Seq Data: De Novo Detection in Human Spinal Cord Tissue

    PubMed Central

    Picardi, Ernesto; Gallo, Angela; Galeano, Federica; Tomaselli, Sara; Pesole, Graziano

    2012-01-01

    RNA editing is a post-transcriptional process occurring in a wide range of organisms. In human brain, the A-to-I RNA editing, in which individual adenosine (A) bases in pre-mRNA are modified to yield inosine (I), is the most frequent event. Modulating gene expression, RNA editing is essential for cellular homeostasis. Indeed, its deregulation has been linked to several neurological and neurodegenerative diseases. To date, many RNA editing sites have been identified by next generation sequencing technologies employing massive transcriptome sequencing together with whole genome or exome sequencing. While genome and transcriptome reads are not always available for single individuals, RNA-Seq data are widespread through public databases and represent a relevant source of yet unexplored RNA editing sites. In this context, we propose a simple computational strategy to identify genomic positions enriched in novel hypothetical RNA editing events by means of a new two-steps mapping procedure requiring only RNA-Seq data and no a priori knowledge of RNA editing characteristics and genomic reads. We assessed the suitability of our procedure by confirming A-to-I candidates using conventional Sanger sequencing and performing RNA-Seq as well as whole exome sequencing of human spinal cord tissue from a single individual. PMID:22957051

  2. Retrotransposon Capture Sequencing (RC-Seq): A Targeted, High-Throughput Approach to Resolve Somatic L1 Retrotransposition in Humans.

    PubMed

    Sanchez-Luque, Francisco J; Richardson, Sandra R; Faulkner, Geoffrey J

    2016-01-01

    Mobile genetic elements (MGEs) are of critical importance in genomics and developmental biology. Polymorphic and somatic MGE insertions have the potential to impact the phenotype of an individual, depending on their genomic locations and functional consequences. However, the identification of polymorphic and somatic insertions among the plethora of copies residing in the genome presents a formidable technical challenge. Whole genome sequencing has the potential to address this problem; however, its efficacy depends on the abundance of cells carrying the new insertion. Robust detection of somatic insertions present in only a subset of cells within a given sample can also be prohibitively expensive due to a requirement for high sequencing depth. Here, we describe retrotransposon capture sequencing (RC-seq), a sequence capture approach in which Illumina libraries are enriched for fragments containing the 5' and 3' termini of specific MGEs. RC-seq allows the detection of known polymorphic insertions present in an individual, as well as the identification of rare or private germline insertions not previously described. Furthermore, RC-seq can be used to detect and characterize somatic insertions, providing a valuable tool to elucidate the extent and characteristics of MGE activity in healthy tissues and in various disease states.

  3. Relationships between functional genes in Lactobacillus delbrueckii ssp. bulgaricus isolates and phenotypic characteristics associated with fermentation time and flavor production in yogurt elucidated using multilocus sequence typing.

    PubMed

    Liu, Wenjun; Yu, Jie; Sun, Zhihong; Song, Yuqin; Wang, Xueni; Wang, Hongmei; Wuren, Tuoya; Zha, Musu; Menghe, Bilige; Heping, Zhang

    2016-01-01

    Lactobacillus delbrueckii ssp. bulgaricus (L. bulgaricus) is well known for its worldwide application in yogurt production. Flavor production and acid producing are considered as the most important characteristics for starter culture screening. To our knowledge this is the first study applying functional gene sequence multilocus sequence typing technology to predict the fermentation and flavor-producing characteristics of yogurt-producing bacteria. In the present study, phenotypic characteristics of 35 L. bulgaricus strains were quantified during the fermentation of milk to yogurt and during its subsequent storage; these included fermentation time, acidification rate, pH, titratable acidity, and flavor characteristics (acetaldehyde concentration). Furthermore, multilocus sequence typing analysis of 7 functional genes associated with fermentation time, acid production, and flavor formation was done to elucidate the phylogeny and genetic evolution of the same L. bulgaricus isolates. The results showed that strains significantly differed in fermentation time, acidification rate, and acetaldehyde production. Combining functional gene sequence analysis with phenotypic characteristics demonstrated that groups of strains established using genotype data were consistent with groups identified based on their phenotypic traits. This study has established an efficient and rapid molecular genotyping method to identify strains with good fermentation traits; this has the potential to replace time-consuming conventional methods based on direct measurement of phenotypic traits. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  4. YAMAT-seq: an efficient method for high-throughput sequencing of mature transfer RNAs.

    PubMed

    Shigematsu, Megumi; Honda, Shozo; Loher, Phillipe; Telonis, Aristeidis G; Rigoutsos, Isidore; Kirino, Yohei

    2017-05-19

    Besides translation, transfer RNAs (tRNAs) play many non-canonical roles in various biological pathways and exhibit highly variable expression profiles. To unravel the emerging complexities of tRNA biology and molecular mechanisms underlying them, an efficient tRNA sequencing method is required. However, the rigid structure of tRNA has been presenting a challenge to the development of such methods. We report the development of Y-shaped Adapter-ligated MAture TRNA sequencing (YAMAT-seq), an efficient and convenient method for high-throughput sequencing of mature tRNAs. YAMAT-seq circumvents the issue of inefficient adapter ligation, a characteristic of conventional RNA sequencing methods for mature tRNAs, by employing the efficient and specific ligation of Y-shaped adapter to mature tRNAs using T4 RNA Ligase 2. Subsequent cDNA amplification and next-generation sequencing successfully yield numerous mature tRNA sequences. YAMAT-seq has high specificity for mature tRNAs and high sensitivity to detect most isoacceptors from minute amount of total RNA. Moreover, YAMAT-seq shows quantitative capability to estimate expression levels of mature tRNAs, and has high reproducibility and broad applicability for various cell lines. YAMAT-seq thus provides high-throughput technique for identifying tRNA profiles and their regulations in various transcriptomes, which could play important regulatory roles in translation and other biological processes. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  5. Oligo kernels for datamining on biological sequences: a case study on prokaryotic translation initiation sites

    PubMed Central

    Meinicke, Peter; Tech, Maike; Morgenstern, Burkhard; Merkl, Rainer

    2004-01-01

    Background Kernel-based learning algorithms are among the most advanced machine learning methods and have been successfully applied to a variety of sequence classification tasks within the field of bioinformatics. Conventional kernels utilized so far do not provide an easy interpretation of the learnt representations in terms of positional and compositional variability of the underlying biological signals. Results We propose a kernel-based approach to datamining on biological sequences. With our method it is possible to model and analyze positional variability of oligomers of any length in a natural way. On one hand this is achieved by mapping the sequences to an intuitive but high-dimensional feature space, well-suited for interpretation of the learnt models. On the other hand, by means of the kernel trick we can provide a general learning algorithm for that high-dimensional representation because all required statistics can be computed without performing an explicit feature space mapping of the sequences. By introducing a kernel parameter that controls the degree of position-dependency, our feature space representation can be tailored to the characteristics of the biological problem at hand. A regularized learning scheme enables application even to biological problems for which only small sets of example sequences are available. Our approach includes a visualization method for transparent representation of characteristic sequence features. Thereby importance of features can be measured in terms of discriminative strength with respect to classification of the underlying sequences. To demonstrate and validate our concept on a biochemically well-defined case, we analyze E. coli translation initiation sites in order to show that we can find biologically relevant signals. For that case, our results clearly show that the Shine-Dalgarno sequence is the most important signal upstream a start codon. The variability in position and composition we found for that signal is in accordance with previous biological knowledge. We also find evidence for signals downstream of the start codon, previously introduced as transcriptional enhancers. These signals are mainly characterized by occurrences of adenine in a region of about 4 nucleotides next to the start codon. Conclusions We showed that the oligo kernel can provide a valuable tool for the analysis of relevant signals in biological sequences. In the case of translation initiation sites we could clearly deduce the most discriminative motifs and their positional variation from example sequences. Attractive features of our approach are its flexibility with respect to oligomer length and position conservation. By means of these two parameters oligo kernels can easily be adapted to different biological problems. PMID:15511290

  6. Visualization of genome signatures of eukaryote genomes by batch-learning self-organizing map with a special emphasis on Drosophila genomes.

    PubMed

    Abe, Takashi; Hamano, Yuta; Ikemura, Toshimichi

    2014-01-01

    A strategy of evolutionary studies that can compare vast numbers of genome sequences is becoming increasingly important with the remarkable progress of high-throughput DNA sequencing methods. We previously established a sequence alignment-free clustering method "BLSOM" for di-, tri-, and tetranucleotide compositions in genome sequences, which can characterize sequence characteristics (genome signatures) of a wide range of species. In the present study, we generated BLSOMs for tetra- and pentanucleotide compositions in approximately one million sequence fragments derived from 101 eukaryotes, for which almost complete genome sequences were available. BLSOM recognized phylotype-specific characteristics (e.g., key combinations of oligonucleotide frequencies) in the genome sequences, permitting phylotype-specific clustering of the sequences without any information regarding the species. In our detailed examination of 12 Drosophila species, the correlation between their phylogenetic classification and the classification on the BLSOMs was observed to visualize oligonucleotides diagnostic for species-specific clustering.

  7. NF-E2 and GATA binding motifs are required for the formation of DNase I hypersensitive site 4 of the human beta-globin locus control region.

    PubMed Central

    Stamatoyannopoulos, J A; Goodwin, A; Joyce, T; Lowrey, C H

    1995-01-01

    The beta-like globin genes require the upstream locus control region (LCR) for proper expression. The active elements of the LCR coincide with strong erythroid-specific DNase I-hypersensitive sites (HSs). We have used 5' HS4 as a model to study the formation of these HSs. Previously, we identified a 101 bp element that is required for the formation of this HS. This element binds six proteins in vitro. We now report a mutational analysis of the HS4 HS-forming element (HSFE). This analysis indicates that binding sites for the hematopoietic transcription factors NF-E2 and GATA-1 are required for the formation of the characteristic chromatin structure of the HS following stable transfection into murine erythroleukemia cells. Similarly arranged NF-E2 and GATA binding sites are present in the other HSs of the human LCR, as well as in the homologous mouse and goat sequences and the chicken beta-globin enhancer. A combination of DNase I and micrococcal nuclease sensitivity assays indicates that the characteristic erythroid-specific hypersensitivity of HS4 to DNase I is the result of tissue-specific alterations in both nucleosome positioning and tertiary DNA structure. Images PMID:7828582

  8. Reclassification of Actinobacillus muris as Muribacter muris gen. nov., comb. nov.

    PubMed

    Nicklas, Werner; Bisgaard, Magne; Aalbæk, Bent; Kuhnert, Peter; Christensen, Henrik

    2015-10-01

    To reinvestigate the taxonomy of [Actinobacillus] muris, 474 strains, mainly from mice and rats, were characterized by phenotype and 130 strains selected for genotypic characterization by 16S rRNA and partial rpoB gene sequencing. The type strain was further investigated by whole-genome sequencing. Phylogenetic analysis of the DNA sequences showed one monophyletic group with intragroup similarities of 96.7 and 97.2 % for the 16S rRNA and rpoB genes, respectively. The highest 16S rRNA gene sequence similarity to a taxon with a validly published name outside the group was 95.9 %, to the type strain of [Pasteurella] pneumotropica. The closest related taxon based on rpoB sequence comparison was 'Haemophilus influenzae-murium', with 88.4 % similarity. A new genus and a new combination, Muribacter muris gen. nov., comb. nov., are proposed based on a distinct phylogenetic position based on 16S rRNA and rpoB gene sequence comparisons, with major divergence from the existing genera of the family Pasteurellaceae. The new genus has the characteristics of [A.] muris with the emendation that acid formation from ( - )-d-mannitol and hydrolysis of aesculin are variable, while the α-glucosidase test is positive. There is no requirement for exogenously supplied NAD (V factor) for the majority of strains investigated; however, one strain was found to require NAD. The major fatty acids of the type strain of Muribacter muris were C14 : 0, C14 : 0 3-OH/iso-C16 : 1 I, C16 : 1ω7c and C16 : 0, which is in line with most genera of the Pasteurellaceae. The type strain of Muribacter muris is CCUG 16938T ( = NCTC 12432T = ATCC 49577T).

  9. Nephrocalcinosis (Enamel Renal Syndrome) Caused by Autosomal Recessive FAM20A Mutations

    PubMed Central

    Jaureguiberry, Graciana; De la Dure-Molla, Muriel; Parry, David; Quentric, Mickael; Himmerkus, Nina; Koike, Toshiyasu; Poulter, James; Klootwijk, Enriko; Robinette, Steven L.; Howie, Alexander J.; Patel, Vaksha; Figueres, Marie-Lucile; Stanescu, Horia C.; Issler, Naomi; Nicholson, Jeremy K.; Bockenhauer, Detlef; Laing, Christopher; Walsh, Stephen B.; McCredie, David A.; Povey, Sue; Asselin, Audrey; Picard, Arnaud; Coulomb, Aurore; Medlar, Alan J.; Bailleul-Forestier, Isabelle; Verloes, Alain; Le Caignec, Cedric; Roussey, Gwenaelle; Guiol, Julien; Isidor, Bertrand; Logan, Clare; Shore, Roger; Johnson, Colin; Inglehearn, Christopher; Al-Bahlani, Suhaila; Schmittbuhl, Matthieu; Clauss, François; Huckert, Mathilde; Laugel, Virginie; Ginglinger, Emmanuelle; Pajarola, Sandra; Spartà, Giuseppina; Bartholdi, Deborah; Rauch, Anita; Addor, Marie-Claude; Yamaguti, Paulo M.; Safatle, Heloisa P.; Acevedo, Ana Carolina; Martelli-Júnior, Hercílio; dos Santos Netos, Pedro E.; Coletta, Ricardo D.; Gruessel, Sandra; Sandmann, Carolin; Ruehmann, Denise; Langman, Craig B.; Scheinman, Steven J.; Ozdemir-Ozenen, Didem; Hart, Thomas C.; Hart, P. Suzanne; Neugebauer, Ute; Schlatter, Eberhard; Houillier, Pascal; Gahl, William A.; Vikkula, Miikka; Bloch-Zupan, Agnès; Bleich, Markus; Kitagawa, Hiroshi; Unwin, Robert J.; Mighell, Alan; Berdal, Ariane; Kleta, Robert

    2013-01-01

    Background/Aims Calcium homeostasis requires regulated cellular and interstitial systems interacting to modulate the activity and movement of this ion. Disruption of these systems in the kidney results in nephrocalcinosis and nephrolithiasis, important medical problems whose pathogenesis is incompletely understood. Methods We investigated 25 patients from 16 families with unexplained nephrocalcinosis and characteristic dental defects (amelogenesis imperfecta, gingival hyperplasia, impaired tooth eruption). To identify the causative gene, we performed genome-wide linkage analysis, exome capture, next-generation sequencing, and Sanger sequencing. Results All patients had bi-allelic FAM20A mutations segregating with the disease; 20 different mutations were identified. Conclusions This au-tosomal recessive disorder, also known as enamel renal syndrome, of FAM20A causes nephrocalcinosis and amelogenesis imperfecta. We speculate that all individuals with biallelic FAM20A mutations will eventually show nephrocalcinosis. PMID:23434854

  10. Operating Characteristics of the Implicit Learning System Supporting Serial Interception Sequence Learning

    ERIC Educational Resources Information Center

    Sanchez, Daniel J.; Reber, Paul J.

    2012-01-01

    The memory system that supports implicit perceptual-motor sequence learning relies on brain regions that operate separately from the explicit, medial temporal lobe memory system. The implicit learning system therefore likely has distinct operating characteristics and information processing constraints. To attempt to identify the limits of the…

  11. Detecting novel genes with sparse arrays

    PubMed Central

    Haiminen, Niina; Smit, Bart; Rautio, Jari; Vitikainen, Marika; Wiebe, Marilyn; Martinez, Diego; Chee, Christine; Kunkel, Joe; Sanchez, Charles; Nelson, Mary Anne; Pakula, Tiina; Saloheimo, Markku; Penttilä, Merja; Kivioja, Teemu

    2014-01-01

    Species-specific genes play an important role in defining the phenotype of an organism. However, current gene prediction methods can only efficiently find genes that share features such as sequence similarity or general sequence characteristics with previously known genes. Novel sequencing methods and tiling arrays can be used to find genes without prior information and they have demonstrated that novel genes can still be found from extensively studied model organisms. Unfortunately, these methods are expensive and thus are not easily applicable, e.g., to finding genes that are expressed only in very specific conditions. We demonstrate a method for finding novel genes with sparse arrays, applying it on the 33.9 Mb genome of the filamentous fungus Trichoderma reesei. Our computational method does not require normalisations between arrays and it takes into account the multiple-testing problem typical for analysis of microarray data. In contrast to tiling arrays, that use overlapping probes, only one 25mer microarray oligonucleotide probe was used for every 100 b. Thus, only relatively little space on a microarray slide was required to cover the intergenic regions of a genome. The analysis was done as a by-product of a conventional microarray experiment with no additional costs. We found at least 23 good candidates for novel transcripts that could code for proteins and all of which were expressed at high levels. Candidate genes were found to neighbour ire1 and cre1 and many other regulatory genes. Our simple, low-cost method can easily be applied to finding novel species-specific genes without prior knowledge of their sequence properties. PMID:20691772

  12. The twin-arginine translocation pathway of Mycobacterium smegmatis is functional and required for the export of mycobacterial beta-lactamases.

    PubMed

    McDonough, Justin A; Hacker, Kari E; Flores, Anthony R; Pavelka, Martin S; Braunstein, Miriam

    2005-11-01

    The twin-arginine translocation (Tat) pathway exports folded proteins across the bacterial cytoplasmic membrane and is responsible for the proper extracytoplasmic localization of proteins involved in a variety of cellular functions, including pathogenesis. The Mycobacterium tuberculosis and Mycobacterium smegmatis genomes contain open reading frames with homology to components of the Tat export system (TatABC) as well as potential Tat-exported proteins possessing N-terminal signal sequences with the characteristic twin-arginine motif. Due to the importance of exported virulence factors in the pathogenesis of M. tuberculosis and the limited understanding of mycobacterial protein export systems, we sought to determine the functional nature of the Tat export pathway in mycobacteria. Here we describe phenotypic analyses of DeltatatA and DeltatatC deletion mutants of M. smegmatis, which demonstrated that tatA and tatC encode components of a functional Tat system capable of exporting characteristic Tat substrates. Both mutants displayed a growth defect on agar medium and hypersensitivity to sodium dodecyl sulfate. The mutants were also defective in the export of active beta-lactamases of M. smegmatis (BlaS) and M. tuberculosis (BlaC), both of which possess twin-arginine signal sequences. The Tat-dependent nature of BlaC was further revealed by mutation of the twin-arginine motif. Finally, we demonstrated that replacement of the native signal sequence of BlaC with the predicted Tat signal sequences of M. tuberculosis phospholipase C proteins (PlcA and PlcB) resulted in the Tat-dependent export of an enzymatically active 'BlaC. Thus, 'BlaC can be used as a genetic reporter for Tat-dependent export in mycobacteria.

  13. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-10-29

    ... DEPARTMENT OF COMMERCE Patent and Trademark Office Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request... Patent applications that contain nucleotide and/or amino acid sequence disclosures must include a copy of...

  14. Liquid rocket combustion computer model with distributed energy release. DER computer program documentation and user's guide, volume 1

    NASA Technical Reports Server (NTRS)

    Combs, L. P.

    1974-01-01

    A computer program for analyzing rocket engine performance was developed. The program is concerned with the formation, distribution, flow, and combustion of liquid sprays and combustion product gases in conventional rocket combustion chambers. The capabilities of the program to determine the combustion characteristics of the rocket engine are described. Sample data code sheets show the correct sequence and formats for variable values and include notes concerning options to bypass the input of certain data. A seperate list defines the variables and indicates their required dimensions.

  15. Wideband propagation measurement system using spread spectrum signaling and TDRS

    NASA Technical Reports Server (NTRS)

    Jenkins, Jeffrey D.; Fan, Yiping; Osborne, William P.

    1995-01-01

    In this paper, a wideband propagation measurement system, which consisted of a ground-based transmitter, a mobile receiver, and a data acquisition system, was constructed. This system has been employed in a study of the characteristics of different propagation environments, such as urban, suburban and rural areas, by using a pseudonoise spreading sequence transmitted over NASA's Tracking and Data Relay Satellite System. The hardware and software tests showed that it met overall system requirements and it was very robust during a 3-month-long outdoor data collection experiment.

  16. Personalized Cancer Medicine: Molecular Diagnostics, Predictive biomarkers, and Drug Resistance

    PubMed Central

    Gonzalez de Castro, D; Clarke, P A; Al-Lazikani, B; Workman, P

    2013-01-01

    The progressive elucidation of the molecular pathogenesis of cancer has fueled the rational development of targeted drugs for patient populations stratified by genetic characteristics. Here we discuss general challenges relating to molecular diagnostics and describe predictive biomarkers for personalized cancer medicine. We also highlight resistance mechanisms for epidermal growth factor receptor (EGFR) kinase inhibitors in lung cancer. We envisage a future requiring the use of longitudinal genome sequencing and other omics technologies alongside combinatorial treatment to overcome cellular and molecular heterogeneity and prevent resistance caused by clonal evolution. PMID:23361103

  17. Fast discovery and visualization of conserved regions in DNA sequences using quasi-alignment

    PubMed Central

    2013-01-01

    Background Next Generation Sequencing techniques are producing enormous amounts of biological sequence data and analysis becomes a major computational problem. Currently, most analysis, especially the identification of conserved regions, relies heavily on Multiple Sequence Alignment and its various heuristics such as progressive alignment, whose run time grows with the square of the number and the length of the aligned sequences and requires significant computational resources. In this work, we present a method to efficiently discover regions of high similarity across multiple sequences without performing expensive sequence alignment. The method is based on approximating edit distance between segments of sequences using p-mer frequency counts. Then, efficient high-throughput data stream clustering is used to group highly similar segments into so called quasi-alignments. Quasi-alignments have numerous applications such as identifying species and their taxonomic class from sequences, comparing sequences for similarities, and, as in this paper, discovering conserved regions across related sequences. Results In this paper, we show that quasi-alignments can be used to discover highly similar segments across multiple sequences from related or different genomes efficiently and accurately. Experiments on a large number of unaligned 16S rRNA sequences obtained from the Greengenes database show that the method is able to identify conserved regions which agree with known hypervariable regions in 16S rRNA. Furthermore, the experiments show that the proposed method scales well for large data sets with a run time that grows only linearly with the number and length of sequences, whereas for existing multiple sequence alignment heuristics the run time grows super-linearly. Conclusion Quasi-alignment-based algorithms can detect highly similar regions and conserved areas across multiple sequences. Since the run time is linear and the sequences are converted into a compact clustering model, we are able to identify conserved regions fast or even interactively using a standard PC. Our method has many potential applications such as finding characteristic signature sequences for families of organisms and studying conserved and variable regions in, for example, 16S rRNA. PMID:24564200

  18. Fast discovery and visualization of conserved regions in DNA sequences using quasi-alignment.

    PubMed

    Nagar, Anurag; Hahsler, Michael

    2013-01-01

    Next Generation Sequencing techniques are producing enormous amounts of biological sequence data and analysis becomes a major computational problem. Currently, most analysis, especially the identification of conserved regions, relies heavily on Multiple Sequence Alignment and its various heuristics such as progressive alignment, whose run time grows with the square of the number and the length of the aligned sequences and requires significant computational resources. In this work, we present a method to efficiently discover regions of high similarity across multiple sequences without performing expensive sequence alignment. The method is based on approximating edit distance between segments of sequences using p-mer frequency counts. Then, efficient high-throughput data stream clustering is used to group highly similar segments into so called quasi-alignments. Quasi-alignments have numerous applications such as identifying species and their taxonomic class from sequences, comparing sequences for similarities, and, as in this paper, discovering conserved regions across related sequences. In this paper, we show that quasi-alignments can be used to discover highly similar segments across multiple sequences from related or different genomes efficiently and accurately. Experiments on a large number of unaligned 16S rRNA sequences obtained from the Greengenes database show that the method is able to identify conserved regions which agree with known hypervariable regions in 16S rRNA. Furthermore, the experiments show that the proposed method scales well for large data sets with a run time that grows only linearly with the number and length of sequences, whereas for existing multiple sequence alignment heuristics the run time grows super-linearly. Quasi-alignment-based algorithms can detect highly similar regions and conserved areas across multiple sequences. Since the run time is linear and the sequences are converted into a compact clustering model, we are able to identify conserved regions fast or even interactively using a standard PC. Our method has many potential applications such as finding characteristic signature sequences for families of organisms and studying conserved and variable regions in, for example, 16S rRNA.

  19. SP-Designer: a user-friendly program for designing species-specific primer pairs from DNA sequence alignments.

    PubMed

    Villard, Pierre; Malausa, Thibaut

    2013-07-01

    SP-Designer is an open-source program providing a user-friendly tool for the design of specific PCR primer pairs from a DNA sequence alignment containing sequences from various taxa. SP-Designer selects PCR primer pairs for the amplification of DNA from a target species on the basis of several criteria: (i) primer specificity, as assessed by interspecific sequence polymorphism in the annealing regions, (ii) the biochemical characteristics of the primers and (iii) the intended PCR conditions. SP-Designer generates tables, detailing the primer pair and PCR characteristics, and a FASTA file locating the primer sequences in the original sequence alignment. SP-Designer is Windows-compatible and freely available from http://www2.sophia.inra.fr/urih/sophia_mart/sp_designer/info_sp_designer.php. © 2013 John Wiley & Sons Ltd.

  20. Multi-site Stochastic Simulation of Daily Streamflow with Markov Chain and KNN Algorithm

    NASA Astrophysics Data System (ADS)

    Mathai, J.; Mujumdar, P.

    2017-12-01

    A key focus of this study is to develop a method which is physically consistent with the hydrologic processes that can capture short-term characteristics of daily hydrograph as well as the correlation of streamflow in temporal and spatial domains. In complex water resource systems, flow fluctuations at small time intervals require that discretisation be done at small time scales such as daily scales. Also, simultaneous generation of synthetic flows at different sites in the same basin are required. We propose a method to equip water managers with a streamflow generator within a stochastic streamflow simulation framework. The motivation for the proposed method is to generate sequences that extend beyond the variability represented in the historical record of streamflow time series. The method has two steps: In step 1, daily flow is generated independently at each station by a two-state Markov chain, with rising limb increments randomly sampled from a Gamma distribution and the falling limb modelled as exponential recession and in step 2, the streamflow generated in step 1 is input to a nonparametric K-nearest neighbor (KNN) time series bootstrap resampler. The KNN model, being data driven, does not require assumptions on the dependence structure of the time series. A major limitation of KNN based streamflow generators is that they do not produce new values, but merely reshuffle the historical data to generate realistic streamflow sequences. However, daily flow generated using the Markov chain approach is capable of generating a rich variety of streamflow sequences. Furthermore, the rising and falling limbs of daily hydrograph represent different physical processes, and hence they need to be modelled individually. Thus, our method combines the strengths of the two approaches. We show the utility of the method and improvement over the traditional KNN by simulating daily streamflow sequences at 7 locations in the Godavari River basin in India.

  1. Structural requirements for recognition of the HLA-Dw14 class II epitope: A key HLA determinant associated with rheumatoid arthritis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hiraiwa, Akikazu; Yamanaka, Katsuo; Kwok, W.W.

    Although HLA genes have been shown to be associated with certain diseases, the basis for this association is unknown. Recent studies, however, have documented patterns of nucleotide sequence variation among some HLA genes associated with a particular disease. For rheumatoid arthritis, HLA genes in most patients have a shared nucleotide sequence encoding a key structural element of an HLA class II polypeptide; this sequence element is critical for the interaction of the HLA molecule with antigenic peptides and with responding T cells, suggestive of a direct role for this sequence element in disease susceptibility. The authors describe the serological andmore » cellular immunologic characteristics encoded by this rheumatoid arthritis-associated sequence element. Site-directed mutagenesis of the DRB1 gene was used to define amino acids critical for antibody and T-cell recognition of this structural element, focusing on residues that distinguish the rheumatoid arthritis-associated alleles Dw4 and Dw14 from a closely related allele, Dw10, not associated with disease. Both the gain and loss of rheumatoid arthritis-associated epitopes were highly dependent on three residues within a discrete domain of the HLA-DR molecule. Recognition was most strongly influenced by the following amino acids (in order): 70 > 71 > 67. Some alloreactive T-cell clones were also influenced by amino acid variation in portions of the DR molecule lying outside the shared sequence element.« less

  2. Operating characteristics of the implicit learning system supporting serial interception sequence learning.

    PubMed

    Sanchez, Daniel J; Reber, Paul J

    2012-04-01

    The memory system that supports implicit perceptual-motor sequence learning relies on brain regions that operate separately from the explicit, medial temporal lobe memory system. The implicit learning system therefore likely has distinct operating characteristics and information processing constraints. To attempt to identify the limits of the implicit sequence learning mechanism, participants performed the serial interception sequence learning (SISL) task with covertly embedded repeating sequences that were much longer than most previous studies: ranging from 30 to 60 (Experiment 1) and 60 to 90 (Experiment 2) items in length. Robust sequence-specific learning was observed for sequences up to 80 items in length, extending the known capacity of implicit sequence learning. In Experiment 3, 12-item repeating sequences were embedded among increasing amounts of irrelevant nonrepeating sequences (from 20 to 80% of training trials). Despite high levels of irrelevant trials, learning occurred across conditions. A comparison of learning rates across all three experiments found a surprising degree of constancy in the rate of learning regardless of sequence length or embedded noise. Sequence learning appears to be constant with the logarithm of the number of sequence repetitions practiced during training. The consistency in learning rate across experiments and conditions implies that the mechanisms supporting implicit sequence learning are not capacity-constrained by very long sequences nor adversely affected by high rates of irrelevant sequences during training.

  3. Purification, characterization, and sequencing of antimicrobial peptides, Cy-AMP1, Cy-AMP2, and Cy-AMP3, from the Cycad (Cycas revoluta) seeds.

    PubMed

    Yokoyama, Seiya; Kato, Kouji; Koba, Atsuko; Minami, Yuji; Watanabe, Keiichi; Yagi, Fumio

    2008-12-01

    Novel antimicrobial peptides (AMP), designated Cy-AMP1, Cy-AMP2, and Cy-AMP3, were purified from seeds of the cycad (Cycas revoluta) by a CM cellulofine column, ion-exchange HPLC on SP COSMOGEL, and reverse-phase HPLC. They had molecular masses of 4583.2 Da, 4568.9 Da and 9275.8 Da, respectively, by MALDI-TOF MS analysis. Half of the amino acid residues of Cy-AMP1 and Cy-AMP2 were cysteine, glycine and proline, and their sequences were similar. The sequence of Cy-AMP3 showed high homology to various lipid transfer proteins. For Cy-AMP1 and Cy-AMP2, the concentrations of peptides required for 50% inhibition (IC(50)) of the growth of plant pathogenic fungi, Gram-positive and Gram-negative bacteria were 7.0-8.9 microg/ml. The Cy-AMP3 had weak antimicrobial activity. The structural and antimicrobial characteristics of Cy-AMP1 and Cy-AMP2 indicated that they are a novel type of antimicrobial peptide belonging to a plant defensin family.

  4. Molecular characterization of infectious bursal disease viruses from Pakistan.

    PubMed

    Shabbir, Muhammad Zubair; Ali, Muhammad; Abbas, Muhammad; Chaudhry, Umer Naveed; Zia-Ur-Rehman; Munir, Muhammad

    2016-07-01

    Since the first report of infectious bursal disease in Pakistan in 1987, outbreaks have been common even in vaccinated flocks. Despite appropriate administration of vaccines, concerns arise if the circulating strains are different from the ones used in the vaccine. Here, we sequenced the hypervariable region (HVR) of the VP2 gene of circulating strains of infectious bursal disease virus (IBDV) originating from outbreaks (n = 4) in broiler flocks in Pakistan. Nucleotide sequencing followed by phylogeny and deduced amino acid sequence analysis showed the circulating strains to be very virulent (vv) and identified characteristic residues at position 222 (A), 242 (I), 256 (I), 294 (I) and 299 (S). In addition, a substitution at positions 221 (Q→H) was found to be exclusive to Pakistani strains in our analysis, although a larger dataset is required to confirm this finding. Compared to vaccine strains that are commonly used in Pakistan, substitution mutations were found at key amino acid positions in VP2 that may be responsible for potential changes in neutralization epitopes and vaccine failure.

  5. Identification of human-to-human transmissibility factors in PB2 proteins of influenza A by large-scale mutual information analysis

    PubMed Central

    Miotto, Olivo; Heiny, AT; Tan, Tin Wee; August, J Thomas; Brusic, Vladimir

    2008-01-01

    Background The identification of mutations that confer unique properties to a pathogen, such as host range, is of fundamental importance in the fight against disease. This paper describes a novel method for identifying amino acid sites that distinguish specific sets of protein sequences, by comparative analysis of matched alignments. The use of mutual information to identify distinctive residues responsible for functional variants makes this approach highly suitable for analyzing large sets of sequences. To support mutual information analysis, we developed the AVANA software, which utilizes sequence annotations to select sets for comparison, according to user-specified criteria. The method presented was applied to an analysis of influenza A PB2 protein sequences, with the objective of identifying the components of adaptation to human-to-human transmission, and reconstructing the mutation history of these components. Results We compared over 3,000 PB2 protein sequences of human-transmissible and avian isolates, to produce a catalogue of sites involved in adaptation to human-to-human transmission. This analysis identified 17 characteristic sites, five of which have been present in human-transmissible strains since the 1918 Spanish flu pandemic. Sixteen of these sites are located in functional domains, suggesting they may play functional roles in host-range specificity. The catalogue of characteristic sites was used to derive sequence signatures from historical isolates. These signatures, arranged in chronological order, reveal an evolutionary timeline for the adaptation of the PB2 protein to human hosts. Conclusion By providing the most complete elucidation to date of the functional components participating in PB2 protein adaptation to humans, this study demonstrates that mutual information is a powerful tool for comparative characterization of sequence sets. In addition to confirming previously reported findings, several novel characteristic sites within PB2 are reported. Sequence signatures generated using the characteristic sites catalogue characterize concisely the adaptation characteristics of individual isolates. Evolutionary timelines derived from signatures of early human influenza isolates suggest that characteristic variants emerged rapidly, and remained remarkably stable through subsequent pandemics. In addition, the signatures of human-infecting H5N1 isolates suggest that this avian subtype has low pandemic potential at present, although it presents more human adaptation components than most avian subtypes. PMID:18315849

  6. Universal and idiosyncratic characteristic lengths in bacterial genomes

    NASA Astrophysics Data System (ADS)

    Junier, Ivan; Frémont, Paul; Rivoire, Olivier

    2018-05-01

    In condensed matter physics, simplified descriptions are obtained by coarse-graining the features of a system at a certain characteristic length, defined as the typical length beyond which some properties are no longer correlated. From a physics standpoint, in vitro DNA has thus a characteristic length of 300 base pairs (bp), the Kuhn length of the molecule beyond which correlations in its orientations are typically lost. From a biology standpoint, in vivo DNA has a characteristic length of 1000 bp, the typical length of genes. Since bacteria live in very different physico-chemical conditions and since their genomes lack translational invariance, whether larger, universal characteristic lengths exist is a non-trivial question. Here, we examine this problem by leveraging the large number of fully sequenced genomes available in public databases. By analyzing GC content correlations and the evolutionary conservation of gene contexts (synteny) in hundreds of bacterial chromosomes, we conclude that a fundamental characteristic length around 10–20 kb can be defined. This characteristic length reflects elementary structures involved in the coordination of gene expression, which are present all along the genome of nearly all bacteria. Technically, reaching this conclusion required us to implement methods that are insensitive to the presence of large idiosyncratic genomic features, which may co-exist along these fundamental universal structures.

  7. A Quantitative Tool to Distinguish Isobaric Leucine and Isoleucine Residues for Mass Spectrometry-Based De Novo Monoclonal Antibody Sequencing

    NASA Astrophysics Data System (ADS)

    Poston, Chloe N.; Higgs, Richard E.; You, Jinsam; Gelfanova, Valentina; Hale, John E.; Knierman, Michael D.; Siegel, Robert; Gutierrez, Jesus A.

    2014-07-01

    De novo sequencing by mass spectrometry (MS) allows for the determination of the complete amino acid (AA) sequence of a given protein based on the mass difference of detected ions from MS/MS fragmentation spectra. The technique relies on obtaining specific masses that can be attributed to characteristic theoretical masses of AAs. A major limitation of de novo sequencing by MS is the inability to distinguish between the isobaric residues leucine (Leu) and isoleucine (Ile). Incorrect identification of Ile as Leu or vice versa often results in loss of activity in recombinant antibodies. This functional ambiguity is commonly resolved with costly and time-consuming AA mutation and peptide sequencing experiments. Here, we describe a set of orthogonal biochemical protocols, which experimentally determine the identity of Ile or Leu residues in monoclonal antibodies (mAb) based on the selectivity that leucine aminopeptidase shows for n-terminal Leu residues and the cleavage preference for Leu by chymotrypsin. The resulting observations are combined with germline frequencies and incorporated into a logistic regression model, called Predictor for Xle Sites (PXleS) to provide a statistical likelihood for the identity of Leu at an ambiguous site. We demonstrate that PXleS can generate a probability for an Xle site in mAbs with 96% accuracy. The implementation of PXleS precludes the expression of several possible sequences and, therefore, reduces the overall time and resources required to go from spectra generation to a biologically active sequence for a mAb when an Ile or Leu residue is in question.

  8. A quantitative tool to distinguish isobaric leucine and isoleucine residues for mass spectrometry-based de novo monoclonal antibody sequencing.

    PubMed

    Poston, Chloe N; Higgs, Richard E; You, Jinsam; Gelfanova, Valentina; Hale, John E; Knierman, Michael D; Siegel, Robert; Gutierrez, Jesus A

    2014-07-01

    De novo sequencing by mass spectrometry (MS) allows for the determination of the complete amino acid (AA) sequence of a given protein based on the mass difference of detected ions from MS/MS fragmentation spectra. The technique relies on obtaining specific masses that can be attributed to characteristic theoretical masses of AAs. A major limitation of de novo sequencing by MS is the inability to distinguish between the isobaric residues leucine (Leu) and isoleucine (Ile). Incorrect identification of Ile as Leu or vice versa often results in loss of activity in recombinant antibodies. This functional ambiguity is commonly resolved with costly and time-consuming AA mutation and peptide sequencing experiments. Here, we describe a set of orthogonal biochemical protocols, which experimentally determine the identity of Ile or Leu residues in monoclonal antibodies (mAb) based on the selectivity that leucine aminopeptidase shows for n-terminal Leu residues and the cleavage preference for Leu by chymotrypsin. The resulting observations are combined with germline frequencies and incorporated into a logistic regression model, called Predictor for Xle Sites (PXleS) to provide a statistical likelihood for the identity of Leu at an ambiguous site. We demonstrate that PXleS can generate a probability for an Xle site in mAbs with 96% accuracy. The implementation of PXleS precludes the expression of several possible sequences and, therefore, reduces the overall time and resources required to go from spectra generation to a biologically active sequence for a mAb when an Ile or Leu residue is in question.

  9. Genetic and DNA sequence analysis of the kanamycin resistance transposon Tn903.

    PubMed Central

    Grindley, N D; Joyce, C M

    1980-01-01

    The kanamycin resistance transposon Tn903 consists of a unique region of about 1000 base pairs bounded by a pair of 1050-base-pair inverted repeat sequences. Each repeat contains two Pvu II endonuclease cleavage sites separated by 520 base pairs. We have constructed derivatives of Tn903 in which this 520-base-pair fragment is deleted from one or both repeats. Those derivatives that lack both 520-base-pair fragments cannot transpose, whereas those that lack just one remain transposition proficient. One such transposable derivative, Tn903 delta I, has been selected for further study. We have determined the sequence of the intact inverted repeat. The 18 base pairs at each end are identical and inverted relative to one another, a structure characteristic of insertion sequences. Additional experiments indicate that a single inverted repeat from Tn903 can, in fact, transpose; we propose that this element be called IS903. To correlate the DNA sequence with genetic activities, we have created mutations by inserting a 10-base-pair DNA fragment at several sites within the intact repeat of Tn903 delta 1, and we have examined the effect of such insertions on transposability. The results suggest that IS903 encodes a 307-amino-acid polypeptide (a "transposase") that is absolutely required for transposition of IS903 or Tn903. Images PMID:6261245

  10. Phonotactic Probability of Brand Names: I'd buy that!

    PubMed Central

    Vitevitch, Michael S.; Donoso, Alexander J.

    2011-01-01

    Psycholinguistic research shows that word-characteristics influence the speed and accuracy of various language-related processes. Analogous characteristics of brand names influence the retrieval of product information and the perception of risks associated with that product. In the present experiment we examined how phonotactic probability—the frequency with which phonological segments and sequences of segments appear in a word—might influence consumer behavior. Participants rated brand names that varied in phonotactic probability on the likelihood that they would buy the product. Participants indicated that they were more likely to purchase a product if the brand name was comprised of common segments and sequences of segments rather than less common segments and sequences of segments. This result suggests that word-characteristics may influence higher-level cognitive processes, in addition to language-related processes. Furthermore, the benefits of using objective measures of word characteristics in the design of brand names are discussed. PMID:21870135

  11. A space-efficient algorithm for local similarities.

    PubMed

    Huang, X Q; Hardison, R C; Miller, W

    1990-10-01

    Existing dynamic-programming algorithms for identifying similar regions of two sequences require time and space proportional to the product of the sequence lengths. Often this space requirement is more limiting than the time requirement. We describe a dynamic-programming local-similarity algorithm that needs only space proportional to the sum of the sequence lengths. The method can also find repeats within a single long sequence. To illustrate the algorithm's potential, we discuss comparison of a 73,360 nucleotide sequence containing the human beta-like globin gene cluster and a corresponding 44,594 nucleotide sequence for rabbit, a problem well beyond the capabilities of other dynamic-programming software.

  12. Automated identification of complementarity determining regions (CDRs) reveals peculiar characteristics of CDRs and B cell epitopes.

    PubMed

    Ofran, Yanay; Schlessinger, Avner; Rost, Burkhard

    2008-11-01

    Exact identification of complementarity determining regions (CDRs) is crucial for understanding and manipulating antigenic interactions. One way to do this is by marking residues on the antibody that interact with B cell epitopes on the antigen. This, of course, requires identification of B cell epitopes, which could be done by marking residues on the antigen that bind to CDRs, thus requiring identification of CDRs. To circumvent this vicious circle, existing tools for identifying CDRs are based on sequence analysis or general biophysical principles. Often, these tools, which are based on partial data, fail to agree on the boundaries of the CDRs. Herein we present an automated procedure for identifying CDRs and B cell epitopes using consensus structural regions that interact with the antigens in all known antibody-protein complexes. Consequently, we provide the first comprehensive analysis of all CDR-epitope complexes of known three-dimensional structure. The CDRs we identify only partially overlap with the regions suggested by existing methods. We found that the general physicochemical properties of both CDRs and B cell epitopes are rather peculiar. In particular, only four amino acids account for most of the sequence of CDRs, and several types of amino acids almost never appear in them. The secondary structure content and the conservation of B cell epitopes are found to be different than previously thought. These characteristics of CDRs and epitopes may be instrumental in choosing which residues to mutate in experimental search for epitopes. They may also assist in computational design of antibodies and in predicting B cell epitopes.

  13. Evolution of infectious hematopoietic necrosis virus (IHNV), a fish rhabdovirus, in Europe over 20 years: implications for control.

    PubMed

    Enzmann, Peter-Joachim; Castric, Jeannette; Bovo, Giuseppe; Thiery, Richard; Fichtner, Dieter; Schütze, Heike; Wahli, Thomas

    2010-02-24

    The fish pathogenic rhabdovirus infectious hematopoietic necrosis virus (IHNV) causes substantial losses in European aquaculture. IHNV was first detected in Europe in 1987 and has since undergone considerable spread. Phylogenetic analyses of the full G-gene sequences of 73 isolates obtained from 4 countries in Europe (France, n = 18; Italy, 9; Switzerland, 4; Germany, 42) enable determination of the evolution of the virus in Europe since the first detection, and identification of characteristic changes within the G-genes of European strains. Further, the database allows us to analyse the pathways of distribution in Europe over time. The results suggest that in most of the recent cases, spread of IHNV was related to trade of infected fish. The data further demonstrate that knowledge of the sequence is required to determine the source of infections in farms.

  14. Single-Cell Sequencing Technology in Oncology: Applications for Clinical Therapies and Research.

    PubMed

    Ye, Baixin; Gao, Qingping; Zeng, Zhi; Stary, Creed M; Jian, Zhihong; Xiong, Xiaoxing; Gu, Lijuan

    2016-01-01

    Cellular heterogeneity is a fundamental characteristic of many cancers. A lack of cellular homogeneity contributes to difficulty in designing targeted oncological therapies. Therefore, the development of novel methods to determine and characterize oncologic cellular heterogeneity is a critical next step in the development of novel cancer therapies. Single-cell sequencing (SCS) technology has been recently employed for analyzing the genetic polymorphisms of individual cells at the genome-wide level. SCS requires (1) precise isolation of the single cell of interest; (2) isolation and amplification of genetic material; and (3) descriptive analysis of genomic, transcriptomic, and epigenomic data. In addition to targeted analysis of single cells isolated from tumor biopsies, SCS technology may be applied to circulating tumor cells, which may aid in predicting tumor progression and metastasis. In this paper, we provide an overview of SCS technology and review the current literature on the potential application of SCS to clinical oncology and research.

  15. DNA-based identification of forensically important species of Sarcophagidae (Insecta: Diptera) from Rio de Janeiro, Brazil.

    PubMed

    Napoleão, K S; Mello-Patiu, C A; Oliveira-Costa, J; Takiya, D M; Silva, R; Moura-Neto, R S

    2016-05-06

    Sarcophagidae, or flesh flies, are of great importance in forensic entomology, but their effective application requires precise taxonomic identification, which relies almost exclusively on characteristics of the male genitalia. Given that female flies and larvae are most abundant in animal carcasses or on corpses, precise morphological identification can be difficult; therefore, DNA sequencing can be an additional tool for use in taxonomic identification. This paper analyzes part of the mitochondrial cytochrome c oxidase subunit I (COI) gene from three Sarcophagidae species of forensic importance in the City of Rio de Janeiro: Oxysarcodexia fluminensis, Peckia chrysostoma, and Peckia intermutans. COI fragments of 400 bp from 36 specimens of these three species were sequenced. No intraspecific differences were found among specimens of O. fluminensis, but P. chrysostoma and P. intermutans each had two haplotypes, ranging from 0 to 0.7%. The interspecific divergence was 8.5-11.6%, corroborating previously reported findings.

  16. Method for assigning sites to projected generic nuclear power plants

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Holter, G.M.; Purcell, W.L.; Shutz, M.E.

    1986-07-01

    Pacific Northwest Laboratory developed a method for forecasting potential locations and startup sequences of nuclear power plants that will be required in the future but have not yet been specifically identified by electric utilities. Use of the method results in numerical ratings for potential nuclear power plant sites located in each of the 10 federal energy regions. The rating for each potential site is obtained from numerical factors assigned to each of 5 primary siting characteristics: (1) cooling water availability, (2) site land area, (3) power transmission land area, (4) proximity to metropolitan areas, and (5) utility plans for themore » site. The sequence of plant startups in each federal energy region is obtained by use of the numerical ratings and the forecasts of generic nuclear power plant startups obtained from the EIA Middle Case electricity forecast. Sites are assigned to generic plants in chronological order according to startup date.« less

  17. Distinctive and Complementary MS2 Fragmentation Characteristics for Identification of Sulfated Sialylated N-Glycopeptides by nanoLC-MS/MS Workflow

    NASA Astrophysics Data System (ADS)

    Kuo, Chu-Wei; Guu, Shih-Yun; Khoo, Kay-Hooi

    2018-04-01

    High sensitivity identification of sulfated glycans carried on specific sites of glycoproteins is an important requisite for investigation of molecular recognition events involved in diverse biological processes. However, aiming for resolving site-specific glycosylation of sulfated glycopeptides by direct LC-MS2 sequencing is technically most challenging. Other than the usual limiting factors such as lower abundance and ionization efficiency compared to analysis of non-glycosylated peptides, confident identification of sulfated glycopeptides among the more abundant non-sulfated glycopeptides requires additional considerations in the selective enrichment and detection strategies. Metal oxide has been applied to enrich phosphopeptides and sialylated glycopeptides, but its use to capture sulfated glycopeptides has not been investigated. Likewise, various complementary MS2 fragmentation modes have yet to be tested against sialylated and non-sialylated sulfoglycopeptides due to limited appropriate sample availability. In this study, we have investigated the feasibility of sequencing tryptic sulfated N-glycopeptide and its MS2 fragmentation characteristics by first optimizing the enrichment methods to allow efficient LC-MS detection and MS2 analysis by a combination of CID, HCD, ETD, and EThcD on hybrid and tribrid Orbitrap instruments. Characteristic sulfated glyco-oxonium ions and direct loss of sulfite from precursors were detected as evidences of sulfate modification. It is anticipated that the technical advances demonstrated in this study would allow a feasible extension of our sulfoglycomic analysis to sulfoglycoproteomics. [Figure not available: see fulltext.

  18. Biologically important conformational features of DNA as interpreted by quantum mechanics and molecular mechanics computations of its simple fragments.

    PubMed

    Poltev, V; Anisimov, V M; Dominguez, V; Gonzalez, E; Deriabina, A; Garcia, D; Rivas, F; Polteva, N A

    2018-02-01

    Deciphering the mechanism of functioning of DNA as the carrier of genetic information requires identifying inherent factors determining its structure and function. Following this path, our previous DFT studies attributed the origin of unique conformational characteristics of right-handed Watson-Crick duplexes (WCDs) to the conformational profile of deoxydinucleoside monophosphates (dDMPs) serving as the minimal repeating units of DNA strand. According to those findings, the directionality of the sugar-phosphate chain and the characteristic ranges of dihedral angles of energy minima combined with the geometric differences between purines and pyrimidines determine the dependence on base sequence of the three-dimensional (3D) structure of WCDs. This work extends our computational study to complementary deoxydinucleotide-monophosphates (cdDMPs) of non-standard conformation, including those of Z-family, Hoogsteen duplexes, parallel-stranded structures, and duplexes with mispaired bases. For most of these systems, except Z-conformation, computations closely reproduce experimental data within the tolerance of characteristic limits of dihedral parameters for each conformation family. Computation of cdDMPs with Z-conformation reveals that their experimental structures do not correspond to the internal energy minimum. This finding establishes the leading role of external factors in formation of the Z-conformation. Energy minima of cdDMPs of non-Watson-Crick duplexes demonstrate different sequence-dependence features than those known for WCDs. The obtained results provide evidence that the biologically important regularities of 3D structure distinguish WCDs from duplexes having non-Watson-Crick nucleotide pairing.

  19. Cyber infrastructure for Fusarium: three integrated platforms supporting strain identification, phylogenetics, comparative genomics and knowledge sharing.

    PubMed

    Park, Bongsoo; Park, Jongsun; Cheong, Kyeong-Chae; Choi, Jaeyoung; Jung, Kyongyong; Kim, Donghan; Lee, Yong-Hwan; Ward, Todd J; O'Donnell, Kerry; Geiser, David M; Kang, Seogchan

    2011-01-01

    The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate species identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on well-preserved culture collections, have established a robust foundation for Fusarium classification. Genomes of four Fusarium species have been published with more being currently sequenced. The Cyber infrastructure for Fusarium (CiF; http://www.fusariumdb.org/) was built to support archiving and utilization of rapidly increasing data and knowledge and consists of Fusarium-ID, Fusarium Comparative Genomics Platform (FCGP) and Fusarium Community Platform (FCP). The Fusarium-ID archives phylogenetic marker sequences from most known species along with information associated with characterized isolates and supports strain identification and phylogenetic analyses. The FCGP currently archives five genomes from four species. Besides supporting genome browsing and analysis, the FCGP presents computed characteristics of multiple gene families and functional groups. The Cart/Favorite function allows users to collect sequences from Fusarium-ID and the FCGP and analyze them later using multiple tools without requiring repeated copying-and-pasting of sequences. The FCP is designed to serve as an online community forum for sharing and preserving accumulated experience and knowledge to support future research and education.

  20. A motif detection and classification method for peptide sequences using genetic programming.

    PubMed

    Tomita, Yasuyuki; Kato, Ryuji; Okochi, Mina; Honda, Hiroyuki

    2008-08-01

    An exploration of common rules (property motifs) in amino acid sequences has been required for the design of novel sequences and elucidation of the interactions between molecules controlled by the structural or physical environment. In the present study, we developed a new method to search property motifs that are common in peptide sequence data. Our method comprises the following two characteristics: (i) the automatic determination of the position and length of common property motifs by calculating the physicochemical similarity of amino acids, and (ii) the quick and effective exploration of motif candidates that discriminates the positives and negatives by the introduction of genetic programming (GP). Our method was evaluated by two types of model data sets. First, the intentionally buried property motifs were searched in the artificially derived peptide data containing intentionally buried property motifs. As a result, the expected property motifs were correctly extracted by our algorithm. Second, the peptide data that interact with MHC class II molecules were analyzed as one of the models of biologically active peptides with buried motifs in various lengths. Twofold MHC class II binding peptides were identified with the rule using our method, compared to the existing scoring matrix method. In conclusion, our GP based motif searching approach enabled to obtain knowledge of functional aspects of the peptides without any prior knowledge.

  1. Cyber infrastructure for Fusarium: three integrated platforms supporting strain identification, phylogenetics, comparative genomics and knowledge sharing

    PubMed Central

    Park, Bongsoo; Park, Jongsun; Cheong, Kyeong-Chae; Choi, Jaeyoung; Jung, Kyongyong; Kim, Donghan; Lee, Yong-Hwan; Ward, Todd J.; O'Donnell, Kerry; Geiser, David M.; Kang, Seogchan

    2011-01-01

    The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate species identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on well-preserved culture collections, have established a robust foundation for Fusarium classification. Genomes of four Fusarium species have been published with more being currently sequenced. The Cyber infrastructure for Fusarium (CiF; http://www.fusariumdb.org/) was built to support archiving and utilization of rapidly increasing data and knowledge and consists of Fusarium-ID, Fusarium Comparative Genomics Platform (FCGP) and Fusarium Community Platform (FCP). The Fusarium-ID archives phylogenetic marker sequences from most known species along with information associated with characterized isolates and supports strain identification and phylogenetic analyses. The FCGP currently archives five genomes from four species. Besides supporting genome browsing and analysis, the FCGP presents computed characteristics of multiple gene families and functional groups. The Cart/Favorite function allows users to collect sequences from Fusarium-ID and the FCGP and analyze them later using multiple tools without requiring repeated copying-and-pasting of sequences. The FCP is designed to serve as an online community forum for sharing and preserving accumulated experience and knowledge to support future research and education. PMID:21087991

  2. TnSeq of Mycobacterium tuberculosis clinical isolates reveals strain-specific antibiotic liabilities

    PubMed Central

    Carey, Allison F.; Rock, Jeremy M.; Krieger, Inna V.; Gagneux, Sebastien; Sacchettini, James C.; Fortune, Sarah M.

    2018-01-01

    Once considered a phenotypically monomorphic bacterium, there is a growing body of work demonstrating heterogeneity among Mycobacterium tuberculosis (Mtb) strains in clinically relevant characteristics, including virulence and response to antibiotics. However, the genetic and molecular basis for most phenotypic differences among Mtb strains remains unknown. To investigate the basis of strain variation in Mtb, we performed genome-wide transposon mutagenesis coupled with next-generation sequencing (TnSeq) for a panel of Mtb clinical isolates and the reference strain H37Rv to compare genetic requirements for in vitro growth across these strains. We developed an analytic approach to identify quantitative differences in genetic requirements between these genetically diverse strains, which vary in genomic structure and gene content. Using this methodology, we found differences between strains in their requirements for genes involved in fundamental cellular processes, including redox homeostasis and central carbon metabolism. Among the genes with differential requirements were katG, which encodes the activator of the first-line antitubercular agent isoniazid, and glcB, which encodes malate synthase, the target of a novel small-molecule inhibitor. Differences among strains in their requirement for katG and glcB predicted differences in their response to these antimicrobial agents. Importantly, these strain-specific differences in antibiotic response could not be predicted by genetic variants identified through whole genome sequencing or by gene expression analysis. Our results provide novel insight into the basis of variation among Mtb strains and demonstrate that TnSeq is a scalable method to predict clinically important phenotypic differences among Mtb strains. PMID:29505613

  3. Comparative transgenic analysis of enhancers from the human SHOX and mouse Shox2 genomic regions.

    PubMed

    Rosin, Jessica M; Abassah-Oppong, Samuel; Cobb, John

    2013-08-01

    Disruption of presumptive enhancers downstream of the human SHOX gene (hSHOX) is a frequent cause of the zeugopodal limb defects characteristic of Léri-Weill dyschondrosteosis (LWD). The closely related mouse Shox2 gene (mShox2) is also required for limb development, but in the more proximal stylopodium. In this study, we used transgenic mice in a comparative approach to characterize enhancer sequences in the hSHOX and mShox2 genomic regions. Among conserved noncoding elements (CNEs) that function as enhancers in vertebrate genomes, those that are maintained near paralogous genes are of particular interest given their ancient origins. Therefore, we first analyzed the regulatory potential of a genomic region containing one such duplicated CNE (dCNE) downstream of mShox2 and hSHOX. We identified a strong limb enhancer directly adjacent to the mShox2 dCNE that recapitulates the expression pattern of the endogenous gene. Interestingly, this enhancer requires sequences only conserved in the mammalian lineage in order to drive strong limb expression, whereas the more deeply conserved sequences of the dCNE function as a neural enhancer. Similarly, we found that a conserved element downstream of hSHOX (CNE9) also functions as a neural enhancer in transgenic mice. However, when the CNE9 transgenic construct was enlarged to include adjacent, non-conserved sequences frequently deleted in LWD patients, the transgene drove expression in the zeugopodium of the limbs. Therefore, both hSHOX and mShox2 limb enhancers are coupled to distinct neural enhancers. This is the first report demonstrating the activity of cis-regulatory elements from the hSHOX and mShox2 genomic regions in mammalian embryos.

  4. Detection of soft-tissue sarcoma recurrence: added value of functional MR imaging techniques at 3.0 T.

    PubMed

    Del Grande, Filippo; Subhawong, Ty; Weber, Kristy; Aro, Michael; Mugera, Charles; Fayad, Laura M

    2014-05-01

    To determine the added value of functional magnetic resonance (MR) sequences (dynamic contrast material-enhanced [DCE] and quantitative diffusion-weighted [DW] imaging with apparent diffusion coefficient [ADC] mapping) for the detection of recurrent soft-tissue sarcomas following surgical resection. This retrospective study was approved by the institutional review board. The requirement to obtain informed consent was waived. Thirty-seven patients referred for postoperative surveillance after resection of soft-tissue sarcoma (35 with high-grade sarcoma) were studied. Imaging at 3.0 T included conventional (T1-weighted, fluid-sensitive, and contrast-enhanced T1-weighted imaging) and functional (DCE MR imaging, DW imaging with ADC mapping) sequences. Recurrences were confirmed with biopsy or resection. A disease-free state was determined with at least 6 months of follow-up. Two readers independently recorded the signal and morphologic characteristics with conventional sequences, the presence or absence of arterial enhancement at DCE MR imaging, and ADCs of the surgical bed. The accuracy of conventional MR imaging in the detection of recurrence was compared with that with the addition of functional sequences. The Fisher exact and Wilcoxon rank sum tests were used to define the accuracy of imaging features, the Cohen κ and Lin interclass correlation were used to define interobserver variability, and receiver operating characteristic analysis was used to define a threshold to detect recurrence and assess reader confidence after the addition of functional imaging to conventional sequences. There were six histologically proved recurrences in 37 patients. Sensitivity and specificity of MR imaging in the detection of tumor recurrence were 100% (six of six patients) and 52% (16 of 31 patients), respectively, with conventional sequences, 100% (six of six patients) and 97% (30 of 31 patients) with the addition of DCE MR imaging, and 60% (three of five patients) and 97% (30 of 31 patients) with the addition of DW imaging and ADC mapping. The average ADC of recurrence (1.08 mm(2)/sec ± 0.19) was significantly different from those of postoperative scarring (0.9 mm(2)/sec ± 0.00) and hematomas (2.34 mm(2)/sec ± 0.72) (P = .03 for both). The addition of functional MR sequences to a routine MR protocol, in particular DCE MR imaging, offers a specificity of more than 95% for distinguishing recurrent sarcoma from postsurgical scarring.

  5. Conceptual design of a moving belt radiator shuttle-attached experiments: Technical requirement Document

    NASA Technical Reports Server (NTRS)

    Aguilar, Jerry L.

    1989-01-01

    The technical requirements for a shuttle-attached Moving Belt Radiator (MBR) experiment are defined. The MBR is an advanced radiator concept in which a rotating belt radiates thermal energy to space. The requirements for integrating the MBR experiment in the shuttle bay are discussed. Requirements for the belt material and working fluid are outlined along with some possible options. The proposed size and relationship to a full scale Moving Belt Radiator are defined. The experiment is defined with the primary goal of dynamic testing and a secondary goal of demonstrating the sealing and heat transfer characteristics. A perturbation system which will simulate a docking maneuver or other type of short term acceleration is proposed for inclusion in the experimental apparatus. A deployment and retraction capability which will aid in evaluating the dynamics of a belt during such a maneuver is also described. The proposed test sequence for the experiment is presented. Details of the conceptual design are not presented herein, but rather in a separate Final Report.

  6. Universal Recurrence Time Statistics of Characteristic Earthquakes

    NASA Astrophysics Data System (ADS)

    Goltz, C.; Turcotte, D. L.; Abaimov, S.; Nadeau, R. M.

    2006-12-01

    Characteristic earthquakes are defined to occur quasi-periodically on major faults. Do recurrence time statistics of such earthquakes follow a particular statistical distribution? If so, which one? The answer is fundamental and has important implications for hazard assessment. The problem cannot be solved by comparing the goodness of statistical fits as the available sequences are too short. The Parkfield sequence of M ≍ 6 earthquakes, one of the most extensive reliable data sets available, has grown to merely seven events with the last earthquake in 2004, for example. Recently, however, advances in seismological monitoring and improved processing methods have unveiled so-called micro-repeaters, micro-earthquakes which recur exactly in the same location on a fault. It seems plausible to regard these earthquakes as a miniature version of the classic characteristic earthquakes. Micro-repeaters are much more frequent than major earthquakes, leading to longer sequences for analysis. Due to their recent discovery, however, available sequences contain less than 20 events at present. In this paper we present results for the analysis of recurrence times for several micro-repeater sequences from Parkfield and adjacent regions. To improve the statistical significance of our findings, we combine several sequences into one by rescaling the individual sets by their respective mean recurrence intervals and Weibull exponents. This novel approach of rescaled combination yields the most extensive data set possible. We find that the resulting statistics can be fitted well by an exponential distribution, confirming the universal applicability of the Weibull distribution to characteristic earthquakes. A similar result is obtained from rescaled combination, however, with regard to the lognormal distribution.

  7. An object-oriented mobile health system with usability features.

    PubMed

    Escarfullet, Krystle; Moore, Cantera; Tucker, Shari; Wei, June

    2012-01-01

    Mobile health (m-health) comprises the concept of utilising mobile devices to carry out the task of viewing electronic medical records, reserving medical appointments with a patient's medical provider and electronically refilling prescriptions. This paper aims at developing a m-health system to improve usability from a user's perspective. Specifically, it first developed a m-health model by logically linking characteristics of the m-health system together based on information flows. Then, the system requirements were collected by using a developed questionnaire. These requirements were structured and further in-depth analysis was conducted by using an object-oriented approach based on unified modelling language, such as use-case, sequence and analysis class diagrams. This research will be beneficial to decision makers and developers in the mobile healthcare industry.

  8. Quantiprot - a Python package for quantitative analysis of protein sequences.

    PubMed

    Konopka, Bogumił M; Marciniak, Marta; Dyrka, Witold

    2017-07-17

    The field of protein sequence analysis is dominated by tools rooted in substitution matrices and alignments. A complementary approach is provided by methods of quantitative characterization. A major advantage of the approach is that quantitative properties defines a multidimensional solution space, where sequences can be related to each other and differences can be meaningfully interpreted. Quantiprot is a software package in Python, which provides a simple and consistent interface to multiple methods for quantitative characterization of protein sequences. The package can be used to calculate dozens of characteristics directly from sequences or using physico-chemical properties of amino acids. Besides basic measures, Quantiprot performs quantitative analysis of recurrence and determinism in the sequence, calculates distribution of n-grams and computes the Zipf's law coefficient. We propose three main fields of application of the Quantiprot package. First, quantitative characteristics can be used in alignment-free similarity searches, and in clustering of large and/or divergent sequence sets. Second, a feature space defined by quantitative properties can be used in comparative studies of protein families and organisms. Third, the feature space can be used for evaluating generative models, where large number of sequences generated by the model can be compared to actually observed sequences.

  9. 40 CFR 86.1230-96 - Test sequence; general requirements.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... Petroleum Gas-Fueled and Methanol-Fueled Heavy-Duty Vehicles § 86.1230-96 Test sequence; general requirements. (a)(1) Gasoline- and methanol-fueled vehicles. The test sequence shown in figure M96-1 of this...

  10. 40 CFR 86.1230-96 - Test sequence; general requirements.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... Petroleum Gas-Fueled and Methanol-Fueled Heavy-Duty Vehicles § 86.1230-96 Test sequence; general requirements. (a)(1) Gasoline- and methanol-fueled vehicles. The test sequence shown in figure M96-1 of this...

  11. Characteristics and management of Enterobacteriaceae harboring IMP-4 or IMP-8 carbapenemase in a tertiary hospital.

    PubMed

    Pang, Feng; Jia, Xiu-Qin; Song, Zhen-Zhu; Li, Yan-Hua; Wang, Bin; Zhao, Qi-Gang; Wang, Chuan-Xin; Zhang, Yi; Wang, Le-Xin

    2016-03-01

    The emergence of Enterobacteriaceae harboring IMP-4 or IMP-8 carbapenemases is rare. We report an occurrence of Enterobacteriaceae harboring IMP-4 or IMP-8 carbapenemases in a Chinese tertiary care hospital from November 2010 to December 2012. The clinical characteristics of 30 patients were described. The genetic relationship of isolates was determined by pulsed-field gel electrophoresis (PFGE). Carbapenemases were detected by modified Hodge test (MHT) and polymerase chain reactions (PCRs). Amplicons were sequenced and blasted to determine the genotype. Most infected patients were from intensive care unit and had complex and serious underlying illnesses requiring mechanical ventilation. PFGE revealed that Klebsiella pneumoniae showed two major PFGE types. Two Klebsiella oxytoca had an indistinguishable PFGE pattern, while four Enterobacter cloacae were different strains. The sequencing studies showed Enterobacteriaceae harboring IMP-4 or IMP-8 carbapenemase in the 23 infected patients. The majority of patients had infections with the carbapenemase-producing Enterobacteriaceae (CPE) strain, most were successfully treated with a range of antibiotics and discharged. It is important to maintain a high index of suspicion to screen for carbapenemase-producing Enterobacteriaceae strains. Rapid identification of these strains and implementation of stringent procedures are the key to prevent major outbreaks in a hospital setting.

  12. Spatio-temporal Variations of Characteristic Repeating Earthquake Sequences along the Middle America Trench in Mexico

    NASA Astrophysics Data System (ADS)

    Dominguez, L. A.; Taira, T.; Hjorleifsdottir, V.; Santoyo, M. A.

    2015-12-01

    Repeating earthquake sequences are sets of events that are thought to rupture the same area on the plate interface and thus provide nearly identical waveforms. We systematically analyzed seismic records from 2001 through 2014 to identify repeating earthquakes with highly correlated waveforms occurring along the subduction zone of the Cocos plate. Using the correlation coefficient (cc) and spectral coherency (coh) of the vertical components as selection criteria, we found a set of 214 sequences whose waveforms exceed cc≥95% and coh≥95%. Spatial clustering along the trench shows large variations in repeating earthquakes activity. Particularly, the rupture zone of the M8.1, 1985 earthquake shows an almost absence of characteristic repeating earthquakes, whereas the Guerrero Gap zone and the segment of the trench close to the Guerrero-Oaxaca border shows a significantly larger number of repeating earthquakes sequences. Furthermore, temporal variations associated to stress changes due to major shows episodes of unlocking and healing of the interface. Understanding the different components that control the location and recurrence time of characteristic repeating sequences is a key factor to pinpoint areas where large megathrust earthquakes may nucleate and consequently to improve the seismic hazard assessment.

  13. A user-friendly, menu-driven, language-free laser characteristics curves graphing program for desk-top IBM PC compatible computers

    NASA Technical Reports Server (NTRS)

    Klutz, Glenn

    1989-01-01

    A facility was established that uses collected data and feeds it into mathematical models that generate improved data arrays by correcting for various losses, base line drift, and conversion to unity scaling. These developed data arrays have headers and other identifying information affixed and are subsequently stored in a Laser Materials and Characteristics data base which is accessible to various users. The two part data base: absorption - emission spectra and tabulated data, is developed around twelve laser models. The tabulated section of the data base is divided into several parts: crystalline, optical, mechanical, and thermal properties; aborption and emission spectra information; chemical name and formulas; and miscellaneous. A menu-driven, language-free graphing program will reduce and/or remove the requirement that users become competent FORTRAN programmers and the concomitant requirement that they also spend several days to a few weeks becoming conversant with the GEOGRAF library and sequence of calls and the continual refreshers of both. The work included becoming thoroughly conversant with or at least very familiar with GEOGRAF by GEOCOMP Corp. The development of the graphing program involved trial runs of the various callable library routines on dummy data in order to become familiar with actual implementation and sequencing. This was followed by trial runs with actual data base files and some additional data from current research that was not in the data base but currently needed graphs. After successful runs, with dummy and real data, using actual FORTRAN instructions steps were undertaken to develop the menu-driven language-free implementation of a program which would require the user only know how to use microcomputers. The user would simply be responding to items displayed on the video screen. To assist the user in arriving at the optimum values needed for a specific graph, a paper, and pencil check list was made available to use on the trial runs.

  14. Biological sequence compression algorithms.

    PubMed

    Matsumoto, T; Sadakane, K; Imai, H

    2000-01-01

    Today, more and more DNA sequences are becoming available. The information about DNA sequences are stored in molecular biology databases. The size and importance of these databases will be bigger and bigger in the future, therefore this information must be stored or communicated efficiently. Furthermore, sequence compression can be used to define similarities between biological sequences. The standard compression algorithms such as gzip or compress cannot compress DNA sequences, but only expand them in size. On the other hand, CTW (Context Tree Weighting Method) can compress DNA sequences less than two bits per symbol. These algorithms do not use special structures of biological sequences. Two characteristic structures of DNA sequences are known. One is called palindromes or reverse complements and the other structure is approximate repeats. Several specific algorithms for DNA sequences that use these structures can compress them less than two bits per symbol. In this paper, we improve the CTW so that characteristic structures of DNA sequences are available. Before encoding the next symbol, the algorithm searches an approximate repeat and palindrome using hash and dynamic programming. If there is a palindrome or an approximate repeat with enough length then our algorithm represents it with length and distance. By using this preprocessing, a new program achieves a little higher compression ratio than that of existing DNA-oriented compression algorithms. We also describe new compression algorithm for protein sequences.

  15. A Study of the Comparative Effectiveness of Zoology Prerequisites at Slippery Rock State College.

    ERIC Educational Resources Information Center

    Morrison, William Sechler

    This study compared the effectiveness of three sequences of prerequisite courses required before taking zoology. Sequence 1 prerequisite courses consisted of general biology and human biology; Sequence 2 consisted of general biology; and Sequence 3 required cell biology. Zoology students in the spring of 1972 were pretest and a posttest. The mean…

  16. Transcriptional "silencer" element in rat repetitive sequences associated with the rat insulin 1 gene locus.

    PubMed Central

    Laimins, L; Holmgren-König, M; Khoury, G

    1986-01-01

    The enhancer elements from either simian virus 40 or murine sarcoma virus activate the expression of a transfected rat insulin 1 (rI1) gene when placed within 2.0 kilobases or less of the rI1 gene cap site. Inclusion of 4.0 kilobases of upstream rI1 sequence, however, results in a substantial reduction in the enhancer-dependent insulin gene expression. These observations suggested that a negative transcriptional regulatory element was present between 2.0 and 4.0 kilobases of the rI1 sequence. To test this notion, we employed a heterologous enhancer-dependent transcription assay in which the simian virus 40 72-base-pair repeat is linked to a human beta-globin gene. Addition of the upstream rI1 element to this system decreased the level of enhancer-dependent beta-globin transcription by a factor of 5 to 15. This rI1 "silencer" element functions in a manner relatively independent of position and orientation and requires a cis-dependent relationship to the transcription unit on which it acts. Thus, the silencer sequence seems to have a number of the characteristics of enhancer elements, and we suggest that it may function by the converse of the enhancer mechanism. The rI1 silencer sequence was identified as a member of a long interspersed rat repetitive family. Thus, a potential role for certain repetitive sequences interspersed throughout the eukaryotic genome may be to regulate gene expression by retaining transcriptional activity within defined domains. Images PMID:3010279

  17. Variability and repertoire size of T-cell receptor V alpha gene segments.

    PubMed

    Becker, D M; Pattern, P; Chien, Y; Yokota, T; Eshhar, Z; Giedlin, M; Gascoigne, N R; Goodnow, C; Wolf, R; Arai, K

    The immune system of higher organisms is composed largely of two distinct cell types, B lymphocytes and T lymphocytes, each of which is independently capable of recognizing an enormous number of distinct entities through their antigen receptors; surface immunoglobulin in the case of the former, and the T-cell receptor (TCR) in the case of the latter. In both cell types, the genes encoding the antigen receptors consist of multiple gene segments which recombine during maturation to produce many possible peptides. One striking difference between B- and T-cell recognition that has not yet been resolved by the structural data is the fact that T cells generally require a major histocompatibility determinant together with an antigen whereas, in most cases, antibodies recognize antigen alone. Recently, we and others have found that a series of TCR V beta gene sequences show conservation of many of the same residues that are conserved between heavy- and light-chain immunoglobulin V regions, and these V beta sequences are predicted to have an immunoglobulin-like secondary structure. To extend these studies, we have isolated and sequenced eight additional alpha-chain complementary cDNA clones and compared them with published sequences. Analyses of these sequences, reported here, indicate that V alpha regions have many of the characteristics of V beta gene segments but differ in that they almost always occur as cross-hybridizing gene families. We conclude that there may be very different selective pressures operating on V alpha and V beta sequences and that the V alpha repertoire may be considerably larger than that of V beta.

  18. Alu sequence involvement in transcriptional insulation of the keratin 18 gene in transgenic mice.

    PubMed Central

    Thorey, I S; Ceceña, G; Reynolds, W; Oshima, R G

    1993-01-01

    The human keratin 18 (K18) gene is expressed in a variety of adult simple epithelial tissues, including liver, intestine, lung, and kidney, but is not normally found in skin, muscle, heart, spleen, or most of the brain. Transgenic animals derived from the cloned K18 gene express the transgene in appropriate tissues at levels directly proportional to the copy number and independently of the sites of integration. We have investigated in transgenic mice the dependence of K18 gene expression on the distal 5' and 3' flanking sequences and upon the RNA polymerase III promoter of an Alu repetitive DNA transcription unit immediately upstream of the K18 promoter. Integration site-independent expression of tandemly duplicated K18 transgenes requires the presence of either an 825-bp fragment of the 5' flanking sequence or the 3.5-kb 3' flanking sequence. Mutation of the RNA polymerase III promoter of the Alu element within the 825-bp fragment abolishes copy number-dependent expression in kidney but does not abolish integration site-independent expression when assayed in the absence of the 3' flanking sequence of the K18 gene. The characteristics of integration site-independent expression and copy number-dependent expression are separable. In addition, the formation of the chromatin state of the K18 gene, which likely restricts the tissue-specific expression of this gene, is not dependent upon the distal flanking sequences of the 10-kb K18 gene but rather may depend on internal regulatory regions of the gene. Images PMID:7692231

  19. BEST: Improved Prediction of B-Cell Epitopes from Antigen Sequences

    PubMed Central

    Gao, Jianzhao; Faraggi, Eshel; Zhou, Yaoqi; Ruan, Jishou; Kurgan, Lukasz

    2012-01-01

    Accurate identification of immunogenic regions in a given antigen chain is a difficult and actively pursued problem. Although accurate predictors for T-cell epitopes are already in place, the prediction of the B-cell epitopes requires further research. We overview the available approaches for the prediction of B-cell epitopes and propose a novel and accurate sequence-based solution. Our BEST (B-cell Epitope prediction using Support vector machine Tool) method predicts epitopes from antigen sequences, in contrast to some method that predict only from short sequence fragments, using a new architecture based on averaging selected scores generated from sliding 20-mers by a Support Vector Machine (SVM). The SVM predictor utilizes a comprehensive and custom designed set of inputs generated by combining information derived from the chain, sequence conservation, similarity to known (training) epitopes, and predicted secondary structure and relative solvent accessibility. Empirical evaluation on benchmark datasets demonstrates that BEST outperforms several modern sequence-based B-cell epitope predictors including ABCPred, method by Chen et al. (2007), BCPred, COBEpro, BayesB, and CBTOPE, when considering the predictions from antigen chains and from the chain fragments. Our method obtains a cross-validated area under the receiver operating characteristic curve (AUC) for the fragment-based prediction at 0.81 and 0.85, depending on the dataset. The AUCs of BEST on the benchmark sets of full antigen chains equal 0.57 and 0.6, which is significantly and slightly better than the next best method we tested. We also present case studies to contrast the propensity profiles generated by BEST and several other methods. PMID:22761950

  20. Newborn Screening in the Era of Precision Medicine.

    PubMed

    Yang, Lan; Chen, Jiajia; Shen, Bairong

    2017-01-01

    As newborn screening success stories gained general confirmation during the past 50 years, scientists quickly discovered diagnostic tests for a host of genetic disorders that could be treated at birth. Outstanding progress in sequencing technologies over the last two decades has made it possible to comprehensively profile newborn screening (NBS) and identify clinically relevant genomic alterations. With the rapid developments in whole-genome sequencing (WGS) and whole-exome sequencing (WES) recently, we can detect newborns at the genomic level and be able to direct the appropriate diagnosis to the different individuals at the appropriate time, which is also encompassed in the concept of precision medicine. Besides, we can develop novel interventions directed at the molecular characteristics of genetic diseases in newborns. The implementation of genomics in NBS programs would provide an effective premise for the identification of the majority of genetic aberrations and primarily help in accurate guidance in treatment and better prediction. However, there are some debate correlated with the widespread application of genome sequencing in NBS due to some major concerns such as clinical analysis, result interpretation, storage of sequencing data, and communication of clinically relevant mutations to pediatricians and parents, along with the ethical, legal, and social implications (so-called ELSI). This review is focused on these critical issues and concerns about the expanding role of genomics in NBS for precision medicine. If WGS or WES is to be incorporated into NBS practice, considerations about these challenges should be carefully regarded and tackled properly to adapt the requirement of genome sequencing in the era of precision medicine.

  1. Hierarchical Traces for Reduced NSM Memory Requirements

    NASA Astrophysics Data System (ADS)

    Dahl, Torbjørn S.

    This paper presents work on using hierarchical long term memory to reduce the memory requirements of nearest sequence memory (NSM) learning, a previously published, instance-based reinforcement learning algorithm. A hierarchical memory representation reduces the memory requirements by allowing traces to share common sub-sequences. We present moderated mechanisms for estimating discounted future rewards and for dealing with hidden state using hierarchical memory. We also present an experimental analysis of how the sub-sequence length affects the memory compression achieved and show that the reduced memory requirements do not effect the speed of learning. Finally, we analyse and discuss the persistence of the sub-sequences independent of specific trace instances.

  2. Intrusion Detection in Control Systems using Sequence Characteristics

    NASA Astrophysics Data System (ADS)

    Kiuchi, Mai; Onoda, Takashi

    Intrusion detection is considered effective in control systems. Sequences of the control application behavior observed in the communication, such as the order of the control device to be controlled, are important in control systems. However, most intrusion detection systems do not effectively reflect sequences in the application layer into the detection rules. In our previous work, we considered utilizing sequences for intrusion detection in control systems, and demonstrated the usefulness of sequences for intrusion detection. However, manually writing the detection rules for a large system can be difficult, so using machine learning methods becomes feasible. Also, in the case of control systems, there have been very few observed cyber attacks, so we have very little knowledge of the attack data that should be used to train the intrusion detection system. In this paper, we use an approach that combines CRF (Conditional Random Field) considering the sequence of the system, thus able to reflect the characteristics of control system sequences into the intrusion detection system, and also does not need the knowledge of attack data to construct the detection rules.

  3. A Partial Least Squares Based Procedure for Upstream Sequence Classification in Prokaryotes.

    PubMed

    Mehmood, Tahir; Bohlin, Jon; Snipen, Lars

    2015-01-01

    The upstream region of coding genes is important for several reasons, for instance locating transcription factor, binding sites, and start site initiation in genomic DNA. Motivated by a recently conducted study, where multivariate approach was successfully applied to coding sequence modeling, we have introduced a partial least squares (PLS) based procedure for the classification of true upstream prokaryotic sequence from background upstream sequence. The upstream sequences of conserved coding genes over genomes were considered in analysis, where conserved coding genes were found by using pan-genomics concept for each considered prokaryotic species. PLS uses position specific scoring matrix (PSSM) to study the characteristics of upstream region. Results obtained by PLS based method were compared with Gini importance of random forest (RF) and support vector machine (SVM), which is much used method for sequence classification. The upstream sequence classification performance was evaluated by using cross validation, and suggested approach identifies prokaryotic upstream region significantly better to RF (p-value < 0.01) and SVM (p-value < 0.01). Further, the proposed method also produced results that concurred with known biological characteristics of the upstream region.

  4. Precise determination, cross-recognition, and functional analysis of the double-strand origins of the rolling-circle replication plasmids in haloarchaea.

    PubMed

    Zhou, Ligang; Zhou, Meixian; Sun, Chaomin; Han, Jing; Lu, Qiuhe; Zhou, Jian; Xiang, Hua

    2008-08-01

    The precise nick site in the double-strand origin (DSO) of pZMX201, a 1,668-bp rolling-circle replication (RCR) plasmid from the haloarchaeon Natrinema sp. CX2021, was determined by electron microscopy and DSO mapping. In this plasmid, DSO nicking occurred between residues C404 and G405 within a heptanucleotide sequence (TCTC/GGC) located in the stem region of an imperfect hairpin structure. This nick site sequence was conserved among the haloarchaeal RCR plasmids, including pNB101, suggesting that the DSO nick site might be the same for all members of this plasmid family. Interestingly, the DSOs of pZMX201 and pNB101 were found to be cross-recognized in RCR initiation and termination in a hybrid plasmid system. Mutation analysis of the DSO from pZMX201 (DSO(Z)) in this hybrid plasmid system revealed that: (i) the nucleotides in the middle of the conserved TCTCGGC sequence play more-important roles in the initiation and termination process; (ii) the left half of the hairpin structure is required for initiation but not for termination; and (iii) a 36-bp sequence containing TCTCGGC and the downstream sequence is essential and sufficient for termination. In conclusion, these haloarchaeal plasmids, with novel features that are different from the characteristics of both single-stranded DNA phages and bacterial RCR plasmids, might serve as a good model for studying the evolution of RCR replicons.

  5. Molecular testing for familial hypercholesterolaemia-associated mutations in a UK-based cohort: development of an NGS-based method and comparison with multiplex polymerase chain reaction and oligonucleotide arrays.

    PubMed

    Reiman, Anne; Pandey, Sarojini; Lloyd, Kate L; Dyer, Nigel; Khan, Mike; Crockard, Martin; Latten, Mark J; Watson, Tracey L; Cree, Ian A; Grammatopoulos, Dimitris K

    2016-11-01

    Background Detection of disease-associated mutations in patients with familial hypercholesterolaemia is crucial for early interventions to reduce risk of cardiovascular disease. Screening for these mutations represents a methodological challenge since more than 1200 different causal mutations in the low-density lipoprotein receptor has been identified. A number of methodological approaches have been developed for screening by clinical diagnostic laboratories. Methods Using primers targeting, the low-density lipoprotein receptor, apolipoprotein B, and proprotein convertase subtilisin/kexin type 9, we developed a novel Ion Torrent-based targeted re-sequencing method. We validated this in a West Midlands-UK small cohort of 58 patients screened in parallel with other mutation-targeting methods, such as multiplex polymerase chain reaction (Elucigene FH20), oligonucleotide arrays (Randox familial hypercholesterolaemia array) or the Illumina next-generation sequencing platform. Results In this small cohort, the next-generation sequencing method achieved excellent analytical performance characteristics and showed 100% and 89% concordance with the Randox array and the Elucigene FH20 assay. Investigation of the discrepant results identified two cases of mutation misclassification of the Elucigene FH20 multiplex polymerase chain reaction assay. A number of novel mutations not previously reported were also identified by the next-generation sequencing method. Conclusions Ion Torrent-based next-generation sequencing can deliver a suitable alternative for the molecular investigation of familial hypercholesterolaemia patients, especially when comprehensive mutation screening for rare or unknown mutations is required.

  6. Variation of b and p values from aftershocks sequences along the Mexican subduction zone and their relation to plate characteristics

    NASA Astrophysics Data System (ADS)

    Ávila-Barrientos, L.; Zúñiga, F. R.; Rodríguez-Pérez, Q.; Guzmán-Speziale, M.

    2015-11-01

    Aftershock sequences along the Mexican subduction margin (between coordinates 110ºW and 91ºW) were analyzed by means of the p value from the Omori-Utsu relation and the b value from the Gutenberg-Richter relation. We focused on recent medium to large (Mw > 5.6) events considered susceptible of generating aftershock sequences suitable for analysis. The main goal was to try to find a possible correlation between aftershock parameters and plate characteristics, such as displacement rate, age and segmentation. The subduction regime of Mexico is one of the most active regions of the world with a high frequency of occurrence of medium to large events and plate characteristics change along the subduction margin. Previous studies have observed differences in seismic source characteristics at the subduction regime, which may indicate a difference in rheology and possible segmentation. The results of the analysis of the aftershock sequences indicate a slight tendency for p values to decrease from west to east with increasing of plate age although a statistical significance is undermined by the small number of aftershocks in the sequences, a particular feature distinctive of the region as compared to other world subduction regimes. The b values show an opposite, increasing trend towards the east even though the statistical significance is not enough to warrant the validation of such a trend. A linear regression between both parameters provides additional support for the inverse relation. Moreover, we calculated the seismic coupling coefficient, showing a direct relation with the p and b values. While we cannot undoubtedly confirm the hypothesis that aftershock generation depends on certain tectonic characteristics (age, thickness, temperature), our results do not reject it thus encouraging further study into this question.

  7. The Genome Sequence of Avibacterium paragallinarum Strain CL Has a Large Repertoire of Insertion Sequence Elements.

    PubMed

    Horta-Valerdi, Guillermo; Sanchez-Alonso, Maria Patricia; Perez-Marquez, Victor M; Negrete-Abascal, Erasmo; Vaca-Pacheco, Sergio; Hernandez-Gonzalez, Ismael; Gomez-Lunar, Zulema; Olmedo-Álvarez, Gabriela; Vázquez-Cruz, Candelario

    2017-04-13

    The draft genome sequence of Avibacterium paragallinarum strain CL serovar C is reported here. The genome comprises 154 contigs corresponding to 2.4 Mb with 41% G+C content and many insertion sequence (IS) elements, a characteristic not previously reported in A. paragallinarum . Copyright © 2017 Horta-Valerdi et al.

  8. Dynamics of actin evolution in dinoflagellates.

    PubMed

    Kim, Sunju; Bachvaroff, Tsvetan R; Handy, Sara M; Delwiche, Charles F

    2011-04-01

    Dinoflagellates have unique nuclei and intriguing genome characteristics with very high DNA content making complete genome sequencing difficult. In dinoflagellates, many genes are found in multicopy gene families, but the processes involved in the establishment and maintenance of these gene families are poorly understood. Understanding the dynamics of gene family evolution in dinoflagellates requires comparisons at different evolutionary scales. Studies of closely related species provide fine-scale information relative to species divergence, whereas comparisons of more distantly related species provides broad context. We selected the actin gene family as a highly expressed conserved gene previously studied in dinoflagellates. Of the 142 sequences determined in this study, 103 were from the two closely related species, Dinophysis acuminata and D. caudata, including full length and partial cDNA sequences as well as partial genomic amplicons. For these two Dinophysis species, at least three types of sequences could be identified. Most copies (79%) were relatively similar and in nucleotide trees, the sequences formed two bushy clades corresponding to the two species. In comparisons within species, only eight to ten nucleotide differences were found between these copies. The two remaining types formed clades containing sequences from both species. One type included the most similar sequences in between-species comparisons with as few as 12 nucleotide differences between species. The second type included the most divergent sequences in comparisons between and within species with up to 93 nucleotide differences between sequences. In all the sequences, most variation occurred in synonymous sites or the 5' UnTranslated Region (UTR), although there was still limited amino acid variation between most sequences. Several potential pseudogenes were found (approximately 10% of all sequences depending on species) with incomplete open reading frames due to frameshifts or early stop codons. Overall, variation in the actin gene family fits best with the "birth and death" model of evolution based on recent duplications, pseudogenes, and incomplete lineage sorting. Divergence between species was similar to variation within species, so that actin may be too conserved to be useful for phylogenetic estimation of closely related species.

  9. A population study of the minicircles in Trypanosoma cruzi: predicting guide RNAs in the absence of empirical RNA editing.

    PubMed

    Thomas, Sean; Martinez, L L Isadora Trejo; Westenberger, Scott J; Sturm, Nancy R

    2007-05-24

    The structurally complex network of minicircles and maxicircles comprising the mitochondrial DNA of kinetoplastids mirrors the complexity of the RNA editing process that is required for faithful expression of encrypted maxicircle genes. Although a few of the guide RNAs that direct this editing process have been discovered on maxicircles, guide RNAs are mostly found on the minicircles. The nuclear and maxicircle genomes have been sequenced and assembled for Trypanosoma cruzi, the causative agent of Chagas disease, however the complement of 1.4-kb minicircles, carrying four guide RNA genes per molecule in this parasite, has been less thoroughly characterised. Fifty-four CL Brener and 53 Esmeraldo strain minicircle sequence reads were extracted from T. cruzi whole genome shotgun sequencing data. With these sequences and all published T. cruzi minicircle sequences, 108 unique guide RNAs from all known T. cruzi minicircle sequences and two guide RNAs from the CL Brener maxicircle were predicted using a local alignment algorithm and mapped onto predicted or experimentally determined sequences of edited maxicircle open reading frames. For half of the sequences no statistically significant guide RNA could be assigned. Likely positions of these unidentified gRNAs in T. cruzi minicircle sequences are estimated using a simple Hidden Markov Model. With the local alignment predictions as a standard, the HMM had an ~85% chance of correctly identifying at least 20 nucleotides of guide RNA from a given minicircle sequence. Inter-minicircle recombination was documented. Variable regions contain species-specific areas of distinct nucleotide preference. Two maxicircle guide RNA genes were found. The identification of new minicircle sequences and the further characterization of all published minicircles are presented, including the first observation of recombination between minicircles. Extrapolation suggests a level of 4% recombinants in the population, supporting a relatively high recombination rate that may serve to minimize the persistence of gRNA pseudogenes. Characteristic nucleotide preferences observed within variable regions provide potential clues regarding the transcription and maturation of T. cruzi guide RNAs. Based on these preferences, a method of predicting T. cruzi guide RNAs using only primary minicircle sequence data was created.

  10. Sequence of Changes in Maize Responding to Soil Water Deficit and Related Critical Thresholds

    PubMed Central

    Ma, Xueyan; He, Qijin; Zhou, Guangsheng

    2018-01-01

    The sequence of changes in crop responding to soil water deficit and related critical thresholds are essential for better drought damage classification and drought monitoring indicators. This study was aimed to investigate the critical thresholds of maize growth and physiological characteristics responding to changing soil water and to reveal the sequence of changes in maize responding to soil water deficit both in seedling and jointing stages based on 2-year’s maize field experiment responding to six initial soil water statuses conducted in 2013 and 2014. Normal distribution tolerance limits were newly adopted to identify critical thresholds of maize growth and physiological characteristics to a wide range of soil water status. The results showed that in both stages maize growth characteristics related to plant water status [stem moisture content (SMC) and leaf moisture content (LMC)], leaf gas exchange [net photosynthetic rate (Pn), transpiration rate (Tr), and stomatal conductance (Gs)], and leaf area were sensitive to soil water deficit, while biomass-related characteristics were less sensitive. Under the concurrent weather conditions and agronomic managements, the critical soil water thresholds in terms of relative soil moisture of 0–30 cm depth (RSM) of maize SMC, LMC, net Pn, Tr, Gs, and leaf area were 72, 65, 62, 60, 58, and 46%, respectively, in seedling stage, and 64, 64, 51, 53, 48, and 46%, respectively, in jointing stage. It indicated that there is a sequence of changes in maize responding to soil water deficit, i.e., their response sequences as soil water deficit intensified: SMC ≥ LMC > leaf gas exchange > leaf area in both stages. This sequence of changes in maize responding to soil water deficit and related critical thresholds may be better indicators of damage classification and drought monitoring. PMID:29765381

  11. Use of Fe(III) as an electron acceptor to recover previously uncultured hyperthermophiles: isolation and characterization of Geothermobacterium ferrireducens gen. nov., sp. nov.

    PubMed

    Kashefi, Kazem; Holmes, Dawn E; Reysenbach, Anna-Louise; Lovley, Derek R

    2002-04-01

    It has recently been recognized that the ability to use Fe(III) as a terminal electron acceptor is a highly conserved characteristic in hyperthermophilic microorganisms. This suggests that it may be possible to recover as-yet-uncultured hyperthermophiles in pure culture if Fe(III) is used as an electron acceptor. As part of a study of the microbial diversity of the Obsidian Pool area in Yellowstone National Park, Wyo., hot sediment samples were used as the inoculum for enrichment cultures in media containing hydrogen as the sole electron donor and poorly crystalline Fe(III) oxide as the electron acceptor. A pure culture was recovered on solidified, Fe(III) oxide medium. The isolate, designated FW-1a, is a hyperthermophilic anaerobe that grows exclusively by coupling hydrogen oxidation to the reduction of poorly crystalline Fe(III) oxide. Organic carbon is not required for growth. Magnetite is the end product of Fe(III) oxide reduction under the culture conditions evaluated. The cells are rod shaped, about 0.5 microm by 1.0 to 1.2 microm, and motile and have a single flagellum. Strain FW-1a grows at circumneutral pH, at freshwater salinities, and at temperatures of between 65 and 100 degrees C with an optimum of 85 to 90 degrees C. To our knowledge this is the highest temperature optimum of any organism in the Bacteria. Analysis of the 16S ribosomal DNA (rDNA) sequence of strain FW-1a places it within the Bacteria, most closely related to abundant but uncultured microorganisms whose 16S rDNA sequences have been previously recovered from Obsidian Pool and a terrestrial hot spring in Iceland. While previous studies inferred that the uncultured microorganisms with these 16S rDNA sequences were sulfate-reducing organisms, the physiology of the strain FW-1a, which does not reduce sulfate, indicates that these organisms are just as likely to be Fe(III) reducers. These results further demonstrate that Fe(III) may be helpful for recovering as-yet-uncultured microorganisms from hydrothermal environments and illustrate that caution must be used in inferring the physiological characteristics of at least some thermophilic microorganisms solely from 16S rDNA sequences. Based on both its 16S rDNA sequence and physiological characteristics, strain FW-1a represents a new genus among the Bacteria. The name Geothermobacterium ferrireducens gen. nov., sp. nov., is proposed (ATCC BAA-426).

  12. Shallow-seated explosions in the construction of the Motukorea tuff ring (Auckland, New Zealand): Evidence from lithic and sedimentary characteristics

    NASA Astrophysics Data System (ADS)

    Agustín-Flores, Javier; Németh, Károly; Cronin, Shane J.; Lindsay, Jan M.; Kereszturi, Gábor

    2015-10-01

    At least 52 eruption centres are scattered within the 360 km2 Auckland Volcanic Field (AVF). Motukorea, now an island in the Waitemata Harbour, is one of 39 AVF volcanoes that experienced a phreatomagmatic explosive phase, before a magmatic phase. The volcano erupted through a 200-300 m-thick, consolidated, mudstone/sandstone sequence of the Miocene Waitemata Group, which overlies the Waipapa Terrane greywacke basement. Detailed field descriptions of the sedimentary characteristics of the early phreatomagmatic deposits were carried out, along with examination of lithics. The ejecta ring deposit comprises 55 to 60 vol.% lithics, of which Waitemata Group fragments constitute approximately 90 vol.%, whereas < 10 vol.% are Waipapa fragments, suggesting a dominance of shallow fragmentation. The sedimentary characteristics of the stratigraphic sequence at Motukorea suggest a dominance of wet surges at the beginning of the eruption with progression into drier sequences upwards. This is reflected in increasing inter-bedded juvenile-pyroclast-dominated fall deposits up-sequence. These characteristics are attributed to the changing hydrogeological conditions within the diatreme and the host rocks. These findings shed light on the eruption dynamics of phreatomagmatic eruptions through consolidated rocks in the AVF and enable the depiction of a scenario of future eruptions within the field in similar substrates.

  13. Inducible Alkylation of DNA by a Quinone Methide-Peptide Nucleic Acid Conjugate†

    PubMed Central

    Liu, Yang; Rokita, Steven E.

    2012-01-01

    The reversibility of alkylation by a quinone methide intermediate (QM) avoids the irreversible consumption that plagues most reagents based on covalent chemistry and allows for site specific reaction that is controlled by the thermodynamics rather than kinetics of target association. This characteristic was originally examined with an oligonucleotide QM conjugate but broad application depends on alternative derivatives that are compatible with a cellular environment. Now, a peptide nucleic acid (PNA) derivative has been constructed and shown to exhibit an equivalent ability to delivery the reactive QM in a controlled manner. This new conjugate demonstrates high selectivity for a complementary sequence of DNA even when challenged with an alternative sequence containing a single T/T mismatch. Alkylation of non-complementary sequences is only possible when a template strand is present to co-localize the conjugate and its target. For efficient alkylation in this example, a single-stranded region of the target is required adjacent to the QM conjugate. Most importantly, the intrastrand self adducts formed between the PNA and its attached QM remained active and reversible over more than eight days in aqueous solution prior to reaction with a chosen target added subsequently. PMID:22243337

  14. Quantifying Transmission.

    PubMed

    Woolhouse, Mark

    2017-07-01

    Transmissibility is the defining characteristic of infectious diseases. Quantifying transmission matters for understanding infectious disease epidemiology and designing evidence-based disease control programs. Tracing individual transmission events can be achieved by epidemiological investigation coupled with pathogen typing or genome sequencing. Individual infectiousness can be estimated by measuring pathogen loads, but few studies have directly estimated the ability of infected hosts to transmit to uninfected hosts. Individuals' opportunities to transmit infection are dependent on behavioral and other risk factors relevant given the transmission route of the pathogen concerned. Transmission at the population level can be quantified through knowledge of risk factors in the population or phylogeographic analysis of pathogen sequence data. Mathematical model-based approaches require estimation of the per capita transmission rate and basic reproduction number, obtained by fitting models to case data and/or analysis of pathogen sequence data. Heterogeneities in infectiousness, contact behavior, and susceptibility can have substantial effects on the epidemiology of an infectious disease, so estimates of only mean values may be insufficient. For some pathogens, super-shedders (infected individuals who are highly infectious) and super-spreaders (individuals with more opportunities to transmit infection) may be important. Future work on quantifying transmission should involve integrated analyses of multiple data sources.

  15. Computing Platforms for Big Biological Data Analytics: Perspectives and Challenges.

    PubMed

    Yin, Zekun; Lan, Haidong; Tan, Guangming; Lu, Mian; Vasilakos, Athanasios V; Liu, Weiguo

    2017-01-01

    The last decade has witnessed an explosion in the amount of available biological sequence data, due to the rapid progress of high-throughput sequencing projects. However, the biological data amount is becoming so great that traditional data analysis platforms and methods can no longer meet the need to rapidly perform data analysis tasks in life sciences. As a result, both biologists and computer scientists are facing the challenge of gaining a profound insight into the deepest biological functions from big biological data. This in turn requires massive computational resources. Therefore, high performance computing (HPC) platforms are highly needed as well as efficient and scalable algorithms that can take advantage of these platforms. In this paper, we survey the state-of-the-art HPC platforms for big biological data analytics. We first list the characteristics of big biological data and popular computing platforms. Then we provide a taxonomy of different biological data analysis applications and a survey of the way they have been mapped onto various computing platforms. After that, we present a case study to compare the efficiency of different computing platforms for handling the classical biological sequence alignment problem. At last we discuss the open issues in big biological data analytics.

  16. Investigation of the Iterative Phase Retrieval Algorithm for Interferometric Applications

    NASA Astrophysics Data System (ADS)

    Gombkötő, Balázs; Kornis, János

    2010-04-01

    Sequentially recorded intensity patterns reflected from a coherently illuminated diffuse object can be used to reconstruct the complex amplitude of the scattered beam. Several iterative phase retrieval algorithms are known in the literature to obtain the initially unknown phase from these longitudinally displaced intensity patterns. When two sequences are recorded in two different states of a centimeter sized object in optical setups that are similar to digital holographic interferometry-but omitting the reference wave-, displacement, deformation, or shape measurement is theoretically possible. To do this, the retrieved phase pattern should contain information not only about the intensities and locations of the point sources of the object surface, but their relative phase as well. Not only experiments require strict mechanical precision to record useful data, but even in simulations several parameters influence the capabilities of iterative phase retrieval, such as object to camera distance range, uniform or varying camera step sequence, speckle field characteristics, and sampling. Experiments were done to demonstrate this principle with an as large as 5×5 cm sized deformable object as well. Good initial results were obtained in an imaging setup, where the intensity pattern sequences were recorded near the image plane.

  17. Genotyping-by-sequencing (GBS) revealed molecular genetic diversity of Iranian wheat landraces and cultivars

    USDA-ARS?s Scientific Manuscript database

    Genetic diversity is an essential resource for breeders to improve new cultivars with desirable characteristics. Recently genotyping-by-sequencing (GBS), a next generation sequencing (NGS) based technology that can simplify complex genomes, has been used as a high-throughput and cost-effective molec...

  18. 47 CFR 2.201 - Emission, modulation, and transmission characteristics.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... carrier is amplitude-modulated (including cases where sub-carriers are angle-modulated): —Double-sideband... is amplitude and angle-modulated either simultaneously or in a pre-established sequence D (5) Emission of pulses: 1 —Sequence of unmodulated pulses P —A sequence of pulses: —Modulated in amplitude K...

  19. 47 CFR 2.201 - Emission, modulation, and transmission characteristics.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... carrier is amplitude-modulated (including cases where sub-carriers are angle-modulated): —Double-sideband... is amplitude and angle-modulated either simultaneously or in a pre-established sequence D (5) Emission of pulses: 1 —Sequence of unmodulated pulses P —A sequence of pulses: —Modulated in amplitude K...

  20. 47 CFR 2.201 - Emission, modulation, and transmission characteristics.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... carrier is amplitude-modulated (including cases where sub-carriers are angle-modulated): —Double-sideband... is amplitude and angle-modulated either simultaneously or in a pre-established sequence D (5) Emission of pulses: 1 —Sequence of unmodulated pulses P —A sequence of pulses: —Modulated in amplitude K...

  1. Work Sequences of Women During the Family Life Cycle

    ERIC Educational Resources Information Center

    Young, Christabel M.

    1978-01-01

    Identifies main work sequences of women during the first three stages of marriage and considers the influence of level of education, birthplace, and year of marriage on work sequence. An A.I.D. analysis illustrates characteristics of women most likely to adopt a given pattern of work. (Author)

  2. Visualisation of the mechanosensitive channel of large conductance in bacteria using confocal microscopy.

    PubMed

    Norman, Christel; Liu, Zhen-Wei; Rigby, Paul; Raso, Albert; Petrov, Yevgeniy; Martinac, Boris

    2005-07-01

    The mechanosensitive channel of large conductance (MscL) plays an important role in the survival of bacterial cells to hypo-osmotic shock. This channel has been extensively studied and its sequence, structure and electrophysiological characteristics are well known. Here we present a method to visualise MscL in living bacteria using confocal microscopy. By creating a gene fusion between mscl and the gene encoding the green fluorescent protein (GFP) we were able to express the fusion protein MscL-GFP in bacteria. We show that MscL-GFP is present in the cytoplasmic membrane and forms functional channels. These channels have the same characteristics as wild-type MscL, except that they require more pressure to open. This method could prove an interesting, non-invasive, tool to study the localisation and the regulation of expression of MscL in bacteria.

  3. The recurrence sequences via Sylvester matrices

    NASA Astrophysics Data System (ADS)

    Karaduman, Erdal; Deveci, Ömür

    2017-07-01

    In this work, we define the Pell-Jacobsthal-Slyvester sequence and the Jacobsthal-Pell-Slyvester sequence by using the Slyvester matrices which are obtained from the characteristic polynomials of the Pell and Jacobsthal sequences and then, we study the sequences defined modulo m. Also, we obtain the cyclic groups and the semigroups from the generating matrices of these sequences when read modulo m and then, we derive the relationships among the orders of the cyclic groups and the periods of the sequences. Furthermore, we redefine Pell-Jacobsthal-Slyvester sequence and the Jacobsthal-Pell-Slyvester sequence by means of the elements of the groups and then, we examine them in the finite groups.

  4. An in-silico insight into the characteristics of β-propeller phytase.

    PubMed

    Mathew, Akash; Verma, Anukriti; Gaur, Smriti

    2014-06-01

    Phytase is an enzyme that is found extensively in the plant kingdom and in some species of bacteria and fungi. This paper identifies and analyses the available full length sequences of β-propeller phytases (BPP). BPP was chosen due to its potential applicability in the field of aquaculture. The sequences were obtained from the Uniprot database and subject to various online bioinformatics tools to elucidate the physio-chemical characteristics, secondary structures and active site compositions of BPP. Protparam and SOPMA were used to analyse the physiochemical and secondary structure characteristics, while the Expasy online modelling tool and CASTp were used to model the 3-D structure and identify the active sites of the BPP sequences. The amino acid compositions of the four sequences were compared and composed in a graphical format to identify similarities and highlight the potentially important amino acids that form the active site of BPP. This study aims to analyse BPP and contribute to the clarification of the molecular mechanism involved in the enzyme activity of BPP and contribute in part to the possibility of constructing a synthetic version of BPP.

  5. Genotyping of Echinococcus granulosus from domestic animals and humans from Ardabil Province, northwest Iran.

    PubMed

    Pezeshki, A; Akhlaghi, L; Sharbatkhori, M; Razmjou, E; Oormazdi, H; Mohebali, M; Meamar, A R

    2013-12-01

    Cystic echinococcosis is endemic in Iran, particularly in Ardabil Province, where it causes health and economic problems. The genetic pattern of Echinococcus granulosus has been determined in most parts of Iran, except in this area. In the present investigation, 55 larval isolates were collected from humans (11), sheep (19), goats (4) and cattle (21). For analysis of the genetic characteristics of E. granulosus isolates, DNA sequencing of mitochondrial cytochrome c oxidase subunit 1 (cox1) and NADH dehydrogenase subunit 1 (nad1) genes was applied. Fifty isolates were successfully analysed, with 92% (46) and 8% (4) identified as G1 and G3 genotypes, respectively. The sequence analyses of the isolates displayed nine characteristic profiles in cox1 sequences and eight characteristic profiles in nad1 sequences. Based on these results, the sheep strain (G1 genotype) was the most prevalent in humans, sheep, goats and cattle. The buffalo strain (G3 genotype) was not only demonstrated in sheep (1 isolate) and cattle (1 isolate), but also for the first time in two human isolates. These findings will provide information for local control of echinococcosis.

  6. Application of next generation sequencing in clinical microbiology and infection prevention.

    PubMed

    Deurenberg, Ruud H; Bathoorn, Erik; Chlebowicz, Monika A; Couto, Natacha; Ferdous, Mithila; García-Cobos, Silvia; Kooistra-Smid, Anna M D; Raangs, Erwin C; Rosema, Sigrid; Veloo, Alida C M; Zhou, Kai; Friedrich, Alexander W; Rossen, John W A

    2017-02-10

    Current molecular diagnostics of human pathogens provide limited information that is often not sufficient for outbreak and transmission investigation. Next generation sequencing (NGS) determines the DNA sequence of a complete bacterial genome in a single sequence run, and from these data, information on resistance and virulence, as well as information for typing is obtained, useful for outbreak investigation. The obtained genome data can be further used for the development of an outbreak-specific screening test. In this review, a general introduction to NGS is presented, including the library preparation and the major characteristics of the most common NGS platforms, such as the MiSeq (Illumina) and the Ion PGM™ (ThermoFisher). An overview of the software used for NGS data analyses used at the medical microbiology diagnostic laboratory in the University Medical Center Groningen in The Netherlands is given. Furthermore, applications of NGS in the clinical setting are described, such as outbreak management, molecular case finding, characterization and surveillance of pathogens, rapid identification of bacteria using the 16S-23S rRNA region, taxonomy, metagenomics approaches on clinical samples, and the determination of the transmission of zoonotic micro-organisms from animals to humans. Finally, we share our vision on the use of NGS in personalised microbiology in the near future, pointing out specific requirements. Copyright © 2016 The Author(s). Published by Elsevier B.V. All rights reserved.

  7. Reprint of "Application of next generation sequencing in clinical microbiology and infection prevention".

    PubMed

    Deurenberg, Ruud H; Bathoorn, Erik; Chlebowicz, Monika A; Couto, Natacha; Ferdous, Mithila; García-Cobos, Silvia; Kooistra-Smid, Anna M D; Raangs, Erwin C; Rosema, Sigrid; Veloo, Alida C M; Zhou, Kai; Friedrich, Alexander W; Rossen, John W A

    2017-05-20

    Current molecular diagnostics of human pathogens provide limited information that is often not sufficient for outbreak and transmission investigation. Next generation sequencing (NGS) determines the DNA sequence of a complete bacterial genome in a single sequence run, and from these data, information on resistance and virulence, as well as information for typing is obtained, useful for outbreak investigation. The obtained genome data can be further used for the development of an outbreak-specific screening test. In this review, a general introduction to NGS is presented, including the library preparation and the major characteristics of the most common NGS platforms, such as the MiSeq (Illumina) and the Ion PGM™ (ThermoFisher). An overview of the software used for NGS data analyses used at the medical microbiology diagnostic laboratory in the University Medical Center Groningen in The Netherlands is given. Furthermore, applications of NGS in the clinical setting are described, such as outbreak management, molecular case finding, characterization and surveillance of pathogens, rapid identification of bacteria using the 16S-23S rRNA region, taxonomy, metagenomics approaches on clinical samples, and the determination of the transmission of zoonotic micro-organisms from animals to humans. Finally, we share our vision on the use of NGS in personalised microbiology in the near future, pointing out specific requirements. Copyright © 2017. Published by Elsevier B.V.

  8. Complete mitochondrial genome of Platevindex sp. (Gastropoda: Pulmonata: Systellommatophora: Onchidiidae).

    PubMed

    Liu, Chen; Shen, He Ding; Zhou, Na

    2016-01-01

    The complete mitochondrial genome sequence of Platevindex sp. is firstly described in the article. The mitogenome (13,908 bp) contains 22 tRNA genes, 2 ribosomal RNA genes and 13 protein-coding genes, and 1 putative control region (CR). CR is not well characterized due to lack of discrete conserved sequence blocks. This characteristic is similar with CRs of other invertebrate mitochondrial genomes. The characteristic is the typical bivalvia mitochondrial gene composition.

  9. Geologic Mapping of the Meridiani Region of Mars

    NASA Technical Reports Server (NTRS)

    DiAchille, G.; Hynek, B. M.

    2009-01-01

    The Mars Exploration Rover Opportunity observed an upper layer of a more than 600-m-thick sequence of light toned outcrops that characterize the Meridiani region of Mars. Results from the rover analyses have shown that the bedrock contains mineral and textural characteristics that require at least the interaction of, and possibly an overall formation by, water-related mechanisms in order to be explained [1]. Additionally, remote sensing studies of the region have suggested that the rocks sampled in places by the MER rover consist of many distinct layers extending over an area of more than 3 10(exp 5) sq km spanning 20deg of longitude [2].

  10. Noncoding sequence classification based on wavelet transform analysis: part I

    NASA Astrophysics Data System (ADS)

    Paredes, O.; Strojnik, M.; Romo-Vázquez, R.; Vélez Pérez, H.; Ranta, R.; Garcia-Torales, G.; Scholl, M. K.; Morales, J. A.

    2017-09-01

    DNA sequences in human genome can be divided into the coding and noncoding ones. Coding sequences are those that are read during the transcription. The identification of coding sequences has been widely reported in literature due to its much-studied periodicity. Noncoding sequences represent the majority of the human genome. They play an important role in gene regulation and differentiation among the cells. However, noncoding sequences do not exhibit periodicities that correlate to their functions. The ENCODE (Encyclopedia of DNA elements) and Epigenomic Roadmap Project projects have cataloged the human noncoding sequences into specific functions. We study characteristics of noncoding sequences with wavelet analysis of genomic signals.

  11. The influence of phonological priming on variability in articulation

    NASA Astrophysics Data System (ADS)

    Babel, Molly E.; Munson, Benjamin

    2004-05-01

    Previous research [Sevald and Dell, Cognition 53, 91-127 (1994)] has found that reiterant sequences of CVC words are produced more quickly when the prime word and target word share VC sequences (i.e., sequences like sit sick) than when they are identical (sequences like sick sick). Even slower production rates are found when primes and targets share a CV sequence (sequences like kick sick). These data have been used to support a model of speech production in which lexical items and their constituent phonemes are activated sequentially. The current experiment investigated whether phonological priming also influences variability in the acoustic characteristics of words. Specifically, we examined whether greater variability in the acoustic characteristics of target words was noted in the CV-related prime context than in the identical-prime context, and whether less variability was noted in the VC-related context. Thirty adult subjects with typical speech, language, and hearing ability produced reiterant two-word sequences that varied in their phonological similarity. The duration, first, and second formant frequencies of the target-words' vowels were measured. Preliminary analyses indicate that phonological priming does not have a systematic effect on variability in these acoustic parameters.

  12. How Incidental Sequence Learning Creates Reportable Knowledge: The Role of Unexpected Events

    ERIC Educational Resources Information Center

    Runger, Dennis; Frensch, Peter A.

    2008-01-01

    Research on incidental sequence learning typically is concerned with the characteristics of implicit or nonconscious learning. In this article, the authors aim to elucidate the cognitive mechanisms that contribute to the generation of explicit, reportable sequence knowledge. According to the unexpected-event hypothesis (P. A. Frensch, H. Haider,…

  13. Taxonomic evaluation of Streptomyces albus and related species using multilocus sequence analysis

    USDA-ARS?s Scientific Manuscript database

    In phylogenetic analyses of the genus Streptomyces using 16S rRNA gene sequences, Streptomyces albus subsp. albus NRRL B-1811T formed a cluster with 5 other species having identical or nearly identical 16S rRNA gene sequences. Moreover, the morphological and physiological characteristics of these ot...

  14. Synthetic Spike-in Standards Improve Run-Specific Systematic Error Analysis for DNA and RNA Sequencing

    PubMed Central

    Zook, Justin M.; Samarov, Daniel; McDaniel, Jennifer; Sen, Shurjo K.; Salit, Marc

    2012-01-01

    While the importance of random sequencing errors decreases at higher DNA or RNA sequencing depths, systematic sequencing errors (SSEs) dominate at high sequencing depths and can be difficult to distinguish from biological variants. These SSEs can cause base quality scores to underestimate the probability of error at certain genomic positions, resulting in false positive variant calls, particularly in mixtures such as samples with RNA editing, tumors, circulating tumor cells, bacteria, mitochondrial heteroplasmy, or pooled DNA. Most algorithms proposed for correction of SSEs require a data set used to calculate association of SSEs with various features in the reads and sequence context. This data set is typically either from a part of the data set being “recalibrated” (Genome Analysis ToolKit, or GATK) or from a separate data set with special characteristics (SysCall). Here, we combine the advantages of these approaches by adding synthetic RNA spike-in standards to human RNA, and use GATK to recalibrate base quality scores with reads mapped to the spike-in standards. Compared to conventional GATK recalibration that uses reads mapped to the genome, spike-ins improve the accuracy of Illumina base quality scores by a mean of 5 Phred-scaled quality score units, and by as much as 13 units at CpG sites. In addition, since the spike-in data used for recalibration are independent of the genome being sequenced, our method allows run-specific recalibration even for the many species without a comprehensive and accurate SNP database. We also use GATK with the spike-in standards to demonstrate that the Illumina RNA sequencing runs overestimate quality scores for AC, CC, GC, GG, and TC dinucleotides, while SOLiD has less dinucleotide SSEs but more SSEs for certain cycles. We conclude that using these DNA and RNA spike-in standards with GATK improves base quality score recalibration. PMID:22859977

  15. [Magnetic resonance for the study of osteosarcoma].

    PubMed

    Spina, V; Romagnoli, R; Manfrini, M; Cerofolini, E; Capanna, R; Gaiani, L; Calandra Buonaura, P; Picci, P; Campanacci, M

    1991-01-01

    The authors report their experience with MR imaging in the study of osteosarcoma. Two main elements were evaluated: signal characteristics and loco-regional staging. Seventy-one patients were studied: 65 of them had central long-bone osteosarcoma, and 6 had telangiectatic long-bone osteosarcoma. T1- and T2-weighted spin-echo sequences were employed and all cases were scanned on 3 planes (sagittal, coronal, and axial). In 28 patients MR imaging was performed both before and after preoperative chemotherapy. The obtained data were compared to surgical and pathological findings. With the exception of the typical signal patterns of quite-osteoblastic osteosarcoma (which presents with low signal on both T1- and T2-weighted sequences), no particular signal features were observed which could help distinguish the different types of osteosarcoma. MR imaging is the method of choice in loco-regional staging for, in our series, it allowed a rational and adequate surgical planning. For this purpose, at least a longitudinal T1- and an axial T2-weighted images are required.

  16. Non-B-Form DNA Is Enriched at Centromeres

    PubMed Central

    Henikoff, Steven

    2018-01-01

    Abstract Animal and plant centromeres are embedded in repetitive “satellite” DNA, but are thought to be epigenetically specified. To define genetic characteristics of centromeres, we surveyed satellite DNA from diverse eukaryotes and identified variation in <10-bp dyad symmetries predicted to adopt non-B-form conformations. Organisms lacking centromeric dyad symmetries had binding sites for sequence-specific DNA-binding proteins with DNA-bending activity. For example, human and mouse centromeres are depleted for dyad symmetries, but are enriched for non-B-form DNA and are associated with binding sites for the conserved DNA-binding protein CENP-B, which is required for artificial centromere function but is paradoxically nonessential. We also detected dyad symmetries and predicted non-B-form DNA structures at neocentromeres, which form at ectopic loci. We propose that centromeres form at non-B-form DNA because of dyad symmetries or are strengthened by sequence-specific DNA binding proteins. This may resolve the CENP-B paradox and provide a general basis for centromere specification. PMID:29365169

  17. GMO quantification: valuable experience and insights for the future.

    PubMed

    Milavec, Mojca; Dobnik, David; Yang, Litao; Zhang, Dabing; Gruden, Kristina; Zel, Jana

    2014-10-01

    Cultivation and marketing of genetically modified organisms (GMOs) have been unevenly adopted worldwide. To facilitate international trade and to provide information to consumers, labelling requirements have been set up in many countries. Quantitative real-time polymerase chain reaction (qPCR) is currently the method of choice for detection, identification and quantification of GMOs. This has been critically assessed and the requirements for the method performance have been set. Nevertheless, there are challenges that should still be highlighted, such as measuring the quantity and quality of DNA, and determining the qPCR efficiency, possible sequence mismatches, characteristics of taxon-specific genes and appropriate units of measurement, as these remain potential sources of measurement uncertainty. To overcome these problems and to cope with the continuous increase in the number and variety of GMOs, new approaches are needed. Statistical strategies of quantification have already been proposed and expanded with the development of digital PCR. The first attempts have been made to use new generation sequencing also for quantitative purposes, although accurate quantification of the contents of GMOs using this technology is still a challenge for the future, and especially for mixed samples. New approaches are needed also for the quantification of stacks, and for potential quantification of organisms produced by new plant breeding techniques.

  18. Detection of Mycobacterium bovis in formalin-fixed, paraffin-embedded tissues of cattle and elk by PCR amplification of an IS6110 sequence specific for Mycobacterium tuberculosis complex organisms.

    PubMed

    Miller, J; Jenny, A; Rhyan, J; Saari, D; Suarez, D

    1997-07-01

    A presumptive diagnosis of tuberculosis can be made if a tissue has characteristic histopathologic changes and acid-fast organisms. However, definitive diagnosis requires culture and species identification of the causative mycobacterium, a process that takes several weeks to complete. The purpose of work reported here was to determine if formalin-fixed, paraffin-embedded tissues could be tested by polymerase chain reaction (PCR) to provide a more rapid diagnosis of tuberculosis. Nondecalcified tissues from cases of tuberculosis in cattle and elk (Cervus elaphus) were examined. The primers used for PCR amplified a 123-bp fragment of IS6110, an insertion sequence that is specific for organisms in the Mycobacterium tuberculosis complex (M. tuberculosis, M. bovis, M. microti, M. africanum). The PCR test detected this sequence in tissues from 92 of 99 (93%) tuberculosis cases, including 3 of 4 elk. In 80 tissues, the positive results were obtained using material prepared by immersion of paraffin sections in water containing a detergent, followed by alternating boil/freeze cycles. The remaining positive results were obtained with DNA isolated from the crude tissue extracts by proteinase K digestion and phenol/chloroform purification. Accuracy of the IS6110 PCR test was demonstrated by negative test results on 31 tissues that had either nonmycobacterial granulomas or granulomatous lesions caused by other mycobacteria (M. paratuberculosis or M. avium). The findings of this study show that a PCR test usually can provide a rapid diagnosis of tuberculosis when it is applied to paraffin sections that have characteristic lesions and acid-fast organisms.

  19. Building information models for astronomy projects

    NASA Astrophysics Data System (ADS)

    Ariño, Javier; Murga, Gaizka; Campo, Ramón; Eletxigerra, Iñigo; Ampuero, Pedro

    2012-09-01

    A Building Information Model is a digital representation of physical and functional characteristics of a building. BIMs represent the geometrical characteristics of the Building, but also properties like bills of quantities, definition of COTS components, status of material in the different stages of the project, project economic data, etc. The BIM methodology, which is well established in the Architecture Engineering and Construction (AEC) domain for conventional buildings, has been brought one step forward in its application for Astronomical/Scientific facilities. In these facilities steel/concrete structures have high dynamic and seismic requirements, M&E installations are complex and there is a large amount of special equipment and mechanisms involved as a fundamental part of the facility. The detail design definition is typically implemented by different design teams in specialized design software packages. In order to allow the coordinated work of different engineering teams, the overall model, and its associated engineering database, is progressively integrated using a coordination and roaming software which can be used before starting construction phase for checking interferences, planning the construction sequence, studying maintenance operation, reporting to the project office, etc. This integrated design & construction approach will allow to efficiently plan construction sequence (4D). This is a powerful tool to study and analyze in detail alternative construction sequences and ideally coordinate the work of different construction teams. In addition engineering, construction and operational database can be linked to the virtual model (6D), what gives to the end users a invaluable tool for the lifecycle management, as all the facility information can be easily accessed, added or replaced. This paper presents the BIM methodology as implemented by IDOM with the E-ELT and ATST Enclosures as application examples.

  20. GEITLERINEMA SPECIES (OSCILLATORIALES, CYANOBACTERIA) REVEALED BY CELLULAR MORPHOLOGY, ULTRASTRUCTURE, AND DNA SEQUENCING(1).

    PubMed

    Do Carmo Bittencourt-Oliveira, Maria; Do Nascimento Moura, Ariadne; De Oliveira, Mariana Cabral; Sidnei Massola, Nelson

    2009-06-01

    Geitlerinema amphibium (C. Agardh ex Gomont) Anagn. and G. unigranulatum (Rama N. Singh) Komárek et M. T. P. Azevedo are morphologically close species with characteristics frequently overlapping. Ten strains of Geitlerinema (six of G. amphibium and four of G. unigranulatum) were analyzed by DNA sequencing and transmission electronic and optical microscopy. Among the investigated strains, the two species were not separated with respect to cellular dimensions, and cellular width was the most varying characteristic. The number and localization of granules, as well as other ultrastructural characteristics, did not provide a means to discriminate between the two species. The two species were not separated either by geography or environment. These results were further corroborated by the analysis of the cpcB-cpcA intergenic spacer (PC-IGS) sequences. Given the fact that morphology is very uniform, plus the coexistence of these populations in the same habitat, it would be nearly impossible to distinguish between them in nature. On the other hand, two of the analyzed strains were distinct from all others based on the PC-IGS sequences, in spite of their morphological similarity. PC-IGS sequences indicate that these two strains could be a different species of Geitlerinema. Using morphology, cell ultrastructure, and PC-IGS sequences, it is not possible to distinguish G. amphibium and G. unigranulatum. Therefore, they should be treated as one species, G. unigranulatum as a synonym of G. amphibium. © 2009 Phycological Society of America.

  1. Shuttle OFT Level C navigation requirements

    NASA Technical Reports Server (NTRS)

    1980-01-01

    Detailed requirements for the orbital operations computer loads, OPS 2, and OPS 8 are given. These requirements represent the total on-orbit/rendezvous navigation baseline requirements for the following principal functions: on-orbital/rendezvous navigation sequencer; on-orbit/rendezvous UPP sequencer; on-orbit rendezvous navigation; on-orbit prediction; on-orbit user parameter processing; and landing Site update.

  2. Deletion mapping of the Aequorea victoria green fluorescent protein.

    PubMed

    Dopf, J; Horiagon, T M

    1996-01-01

    Aequorea victoria green fluorescent protein (GFP) is a promising fluorescent marker which is active in a diverse array of prokaryotic and eukaryotic organisms. A key feature underlying the versatility of GFP is its capacity to undergo heterocyclic chromophore formation by cyclization of a tripeptide present in its primary sequence and thereby acquiring fluorescent activity in a variety of intracellular environments. In order to define further the primary structure requirements for chromophore formation and fluorescence in GFP, a series of N- and C-terminal GFP deletion variant expression vectors were created using the polymerase chain reaction. Scanning spectrofluorometric analyses of crude soluble protein extracts derived from eleven GFP expression constructs revealed that amino acid (aa) residues 2-232, of a total of 238 aa in the native protein, were required for the characteristic emission and absorption spectra of native GFP. Heterocyclic chromophore formation was assayed by comparing the absorption spectrum of GFP deletion variants over the 300-500-nm range to the absorption spectra of full-length GFP and GFP deletion variants missing the chromophore substrate domain from the primary sequence. GFP deletion variants lacking fluorescent activity showed no evidence of heterocyclic ring structure formation when the soluble extracts of their bacterial expression hosts were studied at pH 7.9. These observations suggest that the primary structure requirements for the fluorescent activity of GFP are relatively extensive and are compatible with the view that much of the primary structure serves an autocatalytic function.

  3. Complete mitochondrial genome of the larch hawk moth, Sphinx morio (Lepidoptera: Sphingidae).

    PubMed

    Kim, Min Jee; Choi, Sei-Woong; Kim, Iksoo

    2013-12-01

    The larch hawk moth, Sphinx morio, belongs to the lepidopteran family Sphingidae that has long been studied as a family of model insects in a diverse field. In this study, we describe the complete mitochondrial genome (mitogenome) sequences of the species in terms of general genomic features and characteristic short repetitive sequences found in the A + T-rich region. The 15,299-bp-long genome consisted of a typical set of genes (13 protein-coding genes, 2 rRNA genes, and 22 tRNA genes) and one major non-coding A + T-rich region, with the typical arrangement found in Lepidoptera. The 316-bp-long A + T-rich region located between srRNA and tRNA(Met) harbored the conserved sequence blocks that are typically found in lepidopteran insects. Additionally, the A + T-rich region of S. morio contained three characteristic repeat sequences that are rarely found in Lepidoptera: two identical 12-bp repeat, three identical 5-bp-long tandem repeat, and six nearly identical 5-6 bp long repeat sequences.

  4. Characterization of GM events by insert knowledge adapted re-sequencing approaches

    PubMed Central

    Yang, Litao; Wang, Congmao; Holst-Jensen, Arne; Morisset, Dany; Lin, Yongjun; Zhang, Dabing

    2013-01-01

    Detection methods and data from molecular characterization of genetically modified (GM) events are needed by stakeholders of public risk assessors and regulators. Generally, the molecular characteristics of GM events are incomprehensively revealed by current approaches and biased towards detecting transformation vector derived sequences. GM events are classified based on available knowledge of the sequences of vectors and inserts (insert knowledge). Herein we present three insert knowledge-adapted approaches for characterization GM events (TT51-1 and T1c-19 rice as examples) based on paired-end re-sequencing with the advantages of comprehensiveness, accuracy, and automation. The comprehensive molecular characteristics of two rice events were revealed with additional unintended insertions comparing with the results from PCR and Southern blotting. Comprehensive transgene characterization of TT51-1 and T1c-19 is shown to be independent of a priori knowledge of the insert and vector sequences employing the developed approaches. This provides an opportunity to identify and characterize also unknown GM events. PMID:24088728

  5. Characterization of GM events by insert knowledge adapted re-sequencing approaches.

    PubMed

    Yang, Litao; Wang, Congmao; Holst-Jensen, Arne; Morisset, Dany; Lin, Yongjun; Zhang, Dabing

    2013-10-03

    Detection methods and data from molecular characterization of genetically modified (GM) events are needed by stakeholders of public risk assessors and regulators. Generally, the molecular characteristics of GM events are incomprehensively revealed by current approaches and biased towards detecting transformation vector derived sequences. GM events are classified based on available knowledge of the sequences of vectors and inserts (insert knowledge). Herein we present three insert knowledge-adapted approaches for characterization GM events (TT51-1 and T1c-19 rice as examples) based on paired-end re-sequencing with the advantages of comprehensiveness, accuracy, and automation. The comprehensive molecular characteristics of two rice events were revealed with additional unintended insertions comparing with the results from PCR and Southern blotting. Comprehensive transgene characterization of TT51-1 and T1c-19 is shown to be independent of a priori knowledge of the insert and vector sequences employing the developed approaches. This provides an opportunity to identify and characterize also unknown GM events.

  6. Characterization of Bacteroides forsythus Strains from Cat and Dog Bite Wounds in Humans and Comparison with Monkey and Human Oral Strains

    PubMed Central

    Hudspeth, M. K.; Gerardo, S. Hunt; Maiden, M. F. J.; Citron, D. M.; Goldstein, E. J. C.

    1999-01-01

    Bacteroides forsythus strains recovered from cat and dog bite wound infections in humans (n = 3), monkey oral strains (n = 3), and the human oral ATCC 43037 type strain were characterized by using phenotypic characteristics, enzymatic tests, whole cell fatty acid analysis, sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) analysis, PCR fingerprinting, and 16S rDNA (genes coding for rRNA) sequencing. All three bite wound isolates grew on brucella agar supplemented with 5% sheep blood, vitamin K1, and hemin. These strains, unlike the ATCC strain and previously described monkey oral and human clinical strains, did not require N-acetylmuramic acid supplementation for growth as pure cultures. However, their phenotypic characteristics, except for catalase production, were similar to those of previously identified strains. PCR fingerprinting analysis showed differences in band patterns from the ATCC strain. Also, SDS-PAGE and whole cell fatty acid analysis indicated that the dog and cat bite wound strains were similar but not identical to the human B. forsythus ATCC 43037 type strain and the monkey oral strains. The rDNA sequence analysis indicated that the three bite wound isolates had 99.93% homology with each other and 98.9 and 99.22% homology with the human ATCC 43037 and monkey oral strains, respectively. These results suggest that there are host-specific variations within each group. PMID:10325363

  7. [Imaging characteristics of PROPELLER T2-weighted imaging].

    PubMed

    Goto, Masami; Aoki, Shigeki; Hayashi, Naoto; Mori, Harushi; Watanabe, Yasushi; Ino, Kenji; Satake, Yoshirou; Nishida, Katuji; Sato, Haruo; Iida, Kyouhito; Mima, Kazuo; Ohtomo, Kuni

    2004-11-01

    As the PROPELLER sequence is a combination of the radial scan and fast-spin-echo (FSE) sequence, it can be considered an FSE sequence with a motion correlation. However, there are some differences between PROPELLER and FSE owing to differences in k-space trajectory. We clarified the imaging characteristics of PROPELLER T2-weighted imaging (T2WI) for different parameters in comparison with usual FSE T2WI. When the same parameters were used, PROPELLER T2WI showed a higher signal-to-noise ratio (SNR) and lower spatial resolution than usual FSE. Effective echo time (TE) changed with different echo train lengths (ETL) or different bandwidths on PROPELLER, and imaging contrast changed accordingly to be more effective.

  8. Method for rapid base sequencing in DNA and RNA

    DOEpatents

    Jett, J.H.; Keller, R.A.; Martin, J.C.; Moyzis, R.K.; Ratliff, R.L.; Shera, E.B.; Stewart, C.C.

    1987-10-07

    A method is provided for the rapid base sequencing of DNA or RNA fragments wherein a single fragment of DNA or RNA is provided with identifiable bases and suspended in a moving flow stream. An exonuclease sequentially cleaves individual bases from the end of the suspended fragment. The moving flow stream maintains the cleaved bases in an orderly train for subsequent detection and identification. In a particular embodiment, individual bases forming the DNA or RNA fragments are individually tagged with a characteristic fluorescent dye. The train of bases is then excited to fluorescence with an output spectrum characteristic of the individual bases. Accordingly, the base sequence of the original DNA or RNA fragment can be reconstructed. 2 figs.

  9. Method for rapid base sequencing in DNA and RNA

    DOEpatents

    Jett, J.H.; Keller, R.A.; Martin, J.C.; Moyzis, R.K.; Ratliff, R.L.; Shera, E.B.; Stewart, C.C.

    1990-10-09

    A method is provided for the rapid base sequencing of DNA or RNA fragments wherein a single fragment of DNA or RNA is provided with identifiable bases and suspended in a moving flow stream. An exonuclease sequentially cleaves individual bases from the end of the suspended fragment. The moving flow stream maintains the cleaved bases in an orderly train for subsequent detection and identification. In a particular embodiment, individual bases forming the DNA or RNA fragments are individually tagged with a characteristic fluorescent dye. The train of bases is then excited to fluorescence with an output spectrum characteristic of the individual bases. Accordingly, the base sequence of the original DNA or RNA fragment can be reconstructed. 2 figs.

  10. Method for rapid base sequencing in DNA and RNA

    DOEpatents

    Jett, James H.; Keller, Richard A.; Martin, John C.; Moyzis, Robert K.; Ratliff, Robert L.; Shera, E. Brooks; Stewart, Carleton C.

    1990-01-01

    A method is provided for the rapid base sequencing of DNA or RNA fragments wherein a single fragment of DNA or RNA is provided with identifiable bases and suspended in a moving flow stream. An exonuclease sequentially cleaves individual bases from the end of the suspended fragment. The moving flow stream maintains the cleaved bases in an orderly train for subsequent detection and identification. In a particular embodiment, individual bases forming the DNA or RNA fragments are individually tagged with a characteristic fluorescent dye. The train of bases is then excited to fluorescence with an output spectrum characteristic of the individual bases. Accordingly, the base sequence of the original DNA or RNA fragment can be reconstructed.

  11. Impact of exogenous sequences on the characteristics of an epidemic type 2 recombinant vaccine-derived poliovirus.

    PubMed

    Riquet, Franck B; Blanchard, Claire; Jegouic, Sophie; Balanant, Jean; Guillot, Sophie; Vibet, Marie-Anne; Rakoto-Andrianarivelo, Mala; Delpeyroux, Francis

    2008-09-01

    Pathogenic circulating vaccine-derived polioviruses (cVDPVs) have become a major obstacle to the successful completion of the global polio eradication program. Most cVDPVs are recombinant between the oral poliovirus vaccine (OPV) and human enterovirus species C (HEV-C). To study the role of HEV-C sequences in the phenotype of cVDPVs, we generated a series of recombinants between a Madagascar cVDPV isolate and its parental OPV type 2 strain. Results indicated that the HEV-C sequences present in this cVDPV contribute to its characteristics, including pathogenicity, suggesting that interspecific recombination contributes to the phenotypic biodiversity of polioviruses and may favor the emergence of cVDPVs.

  12. Theoretical modeling of masking DNA application in aptamer-facilitated biomarker discovery.

    PubMed

    Cherney, Leonid T; Obrecht, Natalia M; Krylov, Sergey N

    2013-04-16

    In aptamer-facilitated biomarker discovery (AptaBiD), aptamers are selected from a library of random DNA (or RNA) sequences for their ability to specifically bind cell-surface biomarkers. The library is incubated with intact cells, and cell-bound DNA molecules are separated from those unbound and amplified by the polymerase chain reaction (PCR). The partitioning/amplification cycle is repeated multiple times while alternating target cells and control cells. Efficient aptamer selection in AptaBiD relies on the inclusion of masking DNA within the cell and library mixture. Masking DNA lacks primer regions for PCR amplification and is typically taken in excess to the library. The role of masking DNA within the selection mixture is to outcompete any nonspecific binding sequences within the initial library, thus allowing specific DNA sequences (i.e., aptamers) to be selected more efficiently. Efficient AptaBiD requires an optimum ratio of masking DNA to library DNA, at which aptamers still bind specific binding sites but nonaptamers within the library do not bind nonspecific binding sites. Here, we have developed a mathematical model that describes the binding processes taking place within the equilibrium mixture of masking DNA, library DNA, and target cells. An obtained mathematical solution allows one to estimate the concentration of masking DNA that is required to outcompete the library DNA at a desirable ratio of bound masking DNA to bound library DNA. The required concentration depends on concentrations of the library and cells as well as on unknown cell characteristics. These characteristics include the concentration of total binding sites on the cell surface, N, and equilibrium dissociation constants, K(nsL) and K(nsM), for nonspecific binding of the library DNA and masking DNA, respectively. We developed a theory that allows the determination of N, K(nsL), and K(nsM) based on measurements of EC50 values for cells mixed separately with the library and masking DNA (EC50 is the concentration of fluorescently labeled DNA at which half of the maximum fluorescence signal from DNA-bound cells is reached). We also obtained expressions for signals from bound DNA (measured by flow cytometry) in terms of N, K(nsL), and K(nsM). These expressions can be used for the verification of N, K(nsL), and K(nsM) values found from EC50 measurements. The developed procedure was applied to MCF-7 breast cancer cells, and corresponding values of N, K(nsL), and K(nsM) were established for the first time. The concentration of masking DNA required for AptaBiD with MCF-7 breast cancer cells was also estimated.

  13. Improving the accuracy of protein stability predictions with multistate design using a variety of backbone ensembles.

    PubMed

    Davey, James A; Chica, Roberto A

    2014-05-01

    Multistate computational protein design (MSD) with backbone ensembles approximating conformational flexibility can predict higher quality sequences than single-state design with a single fixed backbone. However, it is currently unclear what characteristics of backbone ensembles are required for the accurate prediction of protein sequence stability. In this study, we aimed to improve the accuracy of protein stability predictions made with MSD by using a variety of backbone ensembles to recapitulate the experimentally measured stability of 85 Streptococcal protein G domain β1 sequences. Ensembles tested here include an NMR ensemble as well as those generated by molecular dynamics (MD) simulations, by Backrub motions, and by PertMin, a new method that we developed involving the perturbation of atomic coordinates followed by energy minimization. MSD with the PertMin ensembles resulted in the most accurate predictions by providing the highest number of stable sequences in the top 25, and by correctly binning sequences as stable or unstable with the highest success rate (≈90%) and the lowest number of false positives. The performance of PertMin ensembles is due to the fact that their members closely resemble the input crystal structure and have low potential energy. Conversely, the NMR ensemble as well as those generated by MD simulations at 500 or 1000 K reduced prediction accuracy due to their low structural similarity to the crystal structure. The ensembles tested herein thus represent on- or off-target models of the native protein fold and could be used in future studies to design for desired properties other than stability. Copyright © 2013 Wiley Periodicals, Inc.

  14. Prevalence of the F-type lectin domain.

    PubMed

    Bishnoi, Ritika; Khatri, Indu; Subramanian, Srikrishna; Ramya, T N C

    2015-08-01

    F-type lectins are fucolectins with characteristic fucose and calcium-binding sequence motifs and a unique lectin fold (the "F-type" fold). F-type lectins are phylogenetically widespread with selective distribution. Several eukaryotic F-type lectins have been biochemically and structurally characterized, and the F-type lectin domain (FLD) has also been studied in the bacterial proteins, Streptococcus mitis lectinolysin and Streptococcus pneumoniae SP2159. However, there is little knowledge about the extent of occurrence of FLDs and their domain organization, especially, in bacteria. We have now mined the extensive genomic sequence information available in the public databases with sensitive sequence search techniques in order to exhaustively survey prokaryotic and eukaryotic FLDs. We report 437 FLD sequence clusters (clustered at 80% sequence identity) from eukaryotic, eubacterial and viral proteins. Domain architectures are diverse but mostly conserved in closely related organisms, and domain organizations of bacterial FLD-containing proteins are very different from their eukaryotic counterparts, suggesting unique specialization of FLDs to suit different requirements. Several atypical phylogenetic associations hint at lateral transfer. Among eukaryotes, we observe an expansion of FLDs in terms of occurrence and domain organization diversity in the taxa Mollusca, Hemichordata and Branchiostomi, perhaps coinciding with greater emphasis on innate immune strategies in these organisms. The naturally occurring FLDs with diverse domain organizations that we have identified here will be useful for future studies aimed at creating designer molecular platforms for directing desired biological activities to fucosylated glycoconjugates in target niches. © The Author 2015. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  15. Sequence requirement of the ade6-4095 meiotic recombination hotspot in Schizosaccharomyces pombe.

    PubMed

    Foulis, Steven J; Fowler, Kyle R; Steiner, Walter W

    2018-02-01

    Homologous recombination occurs at a greatly elevated frequency in meiosis compared to mitosis and is initiated by programmed double-strand DNA breaks (DSBs). DSBs do not occur at uniform frequency throughout the genome in most organisms, but occur preferentially at a limited number of sites referred to as hotspots. The location of hotspots have been determined at nucleotide-level resolution in both the budding and fission yeasts, and while several patterns have emerged regarding preferred locations for DSB hotspots, it remains unclear why particular sites experience DSBs at much higher frequency than other sites with seemingly similar properties. Short sequence motifs, which are often sites for binding of transcription factors, are known to be responsible for a number of hotspots. In this study we identified the minimum sequence required for activity of one of such motif identified in a screen of random sequences capable of producing recombination hotspots. The experimentally determined sequence, GGTCTRGACC, closely matches the previously inferred sequence. Full hotspot activity requires an effective sequence length of 9.5 bp, whereas moderate activity requires an effective sequence length of approximately 8.2 bp and shows significant association with DSB hotspots. In combination with our previous work, this result is consistent with a large number of different sequence motifs capable of producing recombination hotspots, and supports a model in which hotspots can be rapidly regenerated by mutation as they are lost through recombination.

  16. Sequence specificity of the human mRNA N6-adenosine methylase in vitro.

    PubMed Central

    Harper, J E; Miceli, S M; Roberts, R J; Manley, J L

    1990-01-01

    N6-adenosine methylation is a frequent modification of mRNAs and their precursors, but little is known about the mechanism of the reaction or the function of the modification. To explore these questions, we developed conditions to examine N6-adenosine methylase activity in HeLa cell nuclear extracts. Transfer of the methyl group from S-[3H methyl]-adenosylmethionine to unlabeled random copolymer RNA substrates of varying ribonucleotide composition revealed a substrate specificity consistent with a previously deduced consensus sequence, Pu[G greater than A]AC[A/C/U]. 32-P labeled RNA substrates of defined sequence were used to examine the minimum sequence requirements for methylation. Each RNA was 20 nucleotides long, and contained either the core consensus sequence GGACU, or some variation of this sequence. RNAs containing GGACU, either in single or multiple copies, were good substrates for methylation, whereas RNAs containing single base substitutions within the GGACU sequence gave dramatically reduced methylation. These results demonstrate that the N6-adenosine methylase has a strict sequence specificity, and that there is no requirement for extended sequences or secondary structures for methylation. Recognition of this sequence does not require an RNA component, as micrococcal nuclease pretreatment of nuclear extracts actually increased methylation efficiency. Images PMID:2216767

  17. A Correlational Analysis of the Effects of Learner and Linear Programming Characteristics on Learning Programmed Instruction. Final Report.

    ERIC Educational Resources Information Center

    Seibert, Warren F.; Reid, Christopher J.

    Learning and retention may be influenced by subtle instructional stimulus characteristics and certain visual memory aptitudes. Ten stimulus characteristics were chosen for study; 50 sequences of programed instructional material were specially written to conform to sampled values of each stimulus characteristic. Seventy-three freshman subjects…

  18. Pseudomonas sp. strain CA5 (a selenite-reducing bacterium) 16S rRNA gene complete sequence. National Institute of Health, National Center for Biotechnology Information, GenBank sequence. Accession FJ422810.1.

    USDA-ARS?s Scientific Manuscript database

    This study used 1321 base pair 16S rRNA gene sequence methods to confirm the phylogenetic position of a soil isolate as a bacterium belonging to the genus Pesudomonas sp. Morphological, biochemical characteristics, and fatty acid profiles are consistent with the 16S rRNA gene sequence identification...

  19. Plant MicroRNA Prediction by Supervised Machine Learning Using C5.0 Decision Trees.

    PubMed

    Williams, Philip H; Eyles, Rod; Weiller, Georg

    2012-01-01

    MicroRNAs (miRNAs) are nonprotein coding RNAs between 20 and 22 nucleotides long that attenuate protein production. Different types of sequence data are being investigated for novel miRNAs, including genomic and transcriptomic sequences. A variety of machine learning methods have successfully predicted miRNA precursors, mature miRNAs, and other nonprotein coding sequences. MirTools, mirDeep2, and miRanalyzer require "read count" to be included with the input sequences, which restricts their use to deep-sequencing data. Our aim was to train a predictor using a cross-section of different species to accurately predict miRNAs outside the training set. We wanted a system that did not require read-count for prediction and could therefore be applied to short sequences extracted from genomic, EST, or RNA-seq sources. A miRNA-predictive decision-tree model has been developed by supervised machine learning. It only requires that the corresponding genome or transcriptome is available within a sequence window that includes the precursor candidate so that the required sequence features can be collected. Some of the most critical features for training the predictor are the miRNA:miRNA(∗) duplex energy and the number of mismatches in the duplex. We present a cross-species plant miRNA predictor with 84.08% sensitivity and 98.53% specificity based on rigorous testing by leave-one-out validation.

  20. Comparison of MR imaging sequences for liver and head and neck interventions: is there a single optimal sequence for all purposes?

    PubMed

    Boll, Daniel T; Lewin, Jonathan S; Duerk, Jeffrey L; Aschoff, Andrik J; Merkle, Elmar M

    2004-05-01

    To compare the appropriate pulse sequences for interventional device guidance during magnetic resonance (MR) imaging at 0.2 T and to evaluate the dependence of sequence selection on the anatomic region of the procedure. Using a C-arm 0.2 T system, four interventional MR sequences were applied in 23 liver cases and during MR-guided neck interventions in 13 patients. The imaging protocol consisted of: multislice turbo spin echo (TSE) T2w, sequential-slice fast imaging with steady precession (FISP), a time-reversed version of FISP (PSIF), and FISP with balanced gradients in all spatial directions (True-FISP) sequences. Vessel conspicuity was rated and contrast-to-noise ratio (CNR) was calculated for each sequence and a differential receiver operating characteristic was performed. Liver findings were detected in 96% using the TSE sequence. PSIF, FISP, and True-FISP imaging showed lesions in 91%, 61%, and 65%, respectively. The TSE sequence offered the best CNR, followed by PSIF imaging. Differential receiver operating characteristic analysis also rated TSE and PSIF to be the superior sequences. Lesions in the head and neck were detected in all cases by TSE and FISP, in 92% using True-FISP, and in 84% using PSIF. True-FISP offered the best CNR, followed by TSE imaging. Vessels appeared bright on FISP and True-FISP imaging and dark on the other sequences. In interventional MR imaging, no single sequence fits all purposes. Image guidance for interventional MR during liver procedures is best achieved by PSIF or TSE, whereas biopsies in the head and neck are best performed using FISP or True-FISP sequences.

  1. Considerations in video playback design: using optic flow analysis to examine motion characteristics of live and computer-generated animation sequences.

    PubMed

    Woo, Kevin L; Rieucau, Guillaume

    2008-07-01

    The increasing use of the video playback technique in behavioural ecology reveals a growing need to ensure better control of the visual stimuli that focal animals experience. Technological advances now allow researchers to develop computer-generated animations instead of using video sequences of live-acting demonstrators. However, care must be taken to match the motion characteristics (speed and velocity) of the animation to the original video source. Here, we presented a tool based on the use of an optic flow analysis program to measure the resemblance of motion characteristics of computer-generated animations compared to videos of live-acting animals. We examined three distinct displays (tail-flick (TF), push-up body rock (PUBR), and slow arm wave (SAW)) exhibited by animations of Jacky dragons (Amphibolurus muricatus) that were compared to the original video sequences of live lizards. We found no significant differences between the motion characteristics of videos and animations across all three displays. Our results showed that our animations are similar the speed and velocity features of each display. Researchers need to ensure that similar motion characteristics in animation and video stimuli are represented, and this feature is a critical component in the future success of the video playback technique.

  2. [Infection and molecular characteristics of Giardia in clinical diarrheal patients].

    PubMed

    Liu, Hua; Shen, Yu-juan; Zhang, Yu-mei; Wang, Bin; Liu, Hui; Cao, Jian-ping

    2015-04-01

    To initially understand the infection status and the molecular characteristics of Giardia in clinical diarrheal patients. A total of 95 stool samples were collected from the clinical diarrheal patients admitted in a hospital in Shanghai from May to July, 2014, and the Giardia cysts in the samples were examined by an optical microscope. Then the tpi gene of Giardia in the positive samples were amplified by using the nested-PCR method, and the PCR products were sequenced and analyzed by using BLAST, ClustalX 1.83, and the phylogenetic tree was drawn by using MEGA6.0 software. Only one patient was infected with Giardia and the positive detection rate was 1.05%. The Giardia cysts in the fecal specimen were seen clearly under the microscope. Through the identification by PCR, the amplified fragment was about 530 bp, and the sequencing analysis indicated it was Giardia and which was further identified as assemblage B by drawing phylogenetic tree based on tpi gene. Meanwhile, the sequence had 100% homology with the reported sequence from huian (KF271445). Giardia infection can occur in the clinical diarrheal patients. The study could provide more data for understanding the genetic characteristics of Giardia and the epidemiological study of giardiasis.

  3. Hepatitis Delta Antigen Requires a Flexible Quasi-Double-Stranded RNA Structure To Bind and Condense Hepatitis Delta Virus RNA in a Ribonucleoprotein Complex

    PubMed Central

    Griffin, Brittany L.; Chasovskikh, Sergey; Dritschilo, Anatoly

    2014-01-01

    ABSTRACT The circular genome and antigenome RNAs of hepatitis delta virus (HDV) form characteristic unbranched, quasi-double-stranded RNA secondary structures in which short double-stranded helical segments are interspersed with internal loops and bulges. The ribonucleoprotein complexes (RNPs) formed by these RNAs with the virus-encoded protein hepatitis delta antigen (HDAg) perform essential roles in the viral life cycle, including viral replication and virion formation. Little is understood about the formation and structure of these complexes and how they function in these key processes. Here, the specific RNA features required for HDAg binding and the topology of the complexes formed were investigated. Selective 2′OH acylation analyzed by primer extension (SHAPE) applied to free and HDAg-bound HDV RNAs indicated that the characteristic secondary structure of the RNA is preserved when bound to HDAg. Notably, the analysis indicated that predicted unpaired positions in the RNA remained dynamic in the RNP. Analysis of the in vitro binding activity of RNAs in which internal loops and bulges were mutated and of synthetically designed RNAs demonstrated that the distinctive secondary structure, not the primary RNA sequence, is the major determinant of HDAg RNA binding specificity. Atomic force microscopy analysis of RNPs formed in vitro revealed complexes in which the HDV RNA is substantially condensed by bending or wrapping. Our results support a model in which the internal loops and bulges in HDV RNA contribute flexibility to the quasi-double-stranded structure that allows RNA bending and condensing by HDAg. IMPORTANCE RNA-protein complexes (RNPs) formed by the hepatitis delta virus RNAs and protein, HDAg, perform critical roles in virus replication. Neither the structures of these RNPs nor the RNA features required to form them have been characterized. HDV RNA is unusual in that it forms an unbranched quasi-double-stranded structure in which short base-paired segments are interspersed with internal loops and bulges. We analyzed the role of the HDV RNA sequence and secondary structure in the formation of a minimal RNP and visualized the structure of this RNP using atomic force microscopy. Our results indicate that HDAg does not recognize the primary sequence of the RNA; rather, the principle contribution of unpaired bases in HDV RNA to HDAg binding is to allow flexibility in the unbranched quasi-double-stranded RNA structure. Visualization of RNPs by atomic force microscopy indicated that the RNA is significantly bent or condensed in the complex. PMID:24741096

  4. Study design requirements for RNA sequencing-based breast cancer diagnostics.

    PubMed

    Mer, Arvind Singh; Klevebring, Daniel; Grönberg, Henrik; Rantalainen, Mattias

    2016-02-01

    Sequencing-based molecular characterization of tumors provides information required for individualized cancer treatment. There are well-defined molecular subtypes of breast cancer that provide improved prognostication compared to routine biomarkers. However, molecular subtyping is not yet implemented in routine breast cancer care. Clinical translation is dependent on subtype prediction models providing high sensitivity and specificity. In this study we evaluate sample size and RNA-sequencing read requirements for breast cancer subtyping to facilitate rational design of translational studies. We applied subsampling to ascertain the effect of training sample size and the number of RNA sequencing reads on classification accuracy of molecular subtype and routine biomarker prediction models (unsupervised and supervised). Subtype classification accuracy improved with increasing sample size up to N = 750 (accuracy = 0.93), although with a modest improvement beyond N = 350 (accuracy = 0.92). Prediction of routine biomarkers achieved accuracy of 0.94 (ER) and 0.92 (Her2) at N = 200. Subtype classification improved with RNA-sequencing library size up to 5 million reads. Development of molecular subtyping models for cancer diagnostics requires well-designed studies. Sample size and the number of RNA sequencing reads directly influence accuracy of molecular subtyping. Results in this study provide key information for rational design of translational studies aiming to bring sequencing-based diagnostics to the clinic.

  5. Impacts of vegetation change on groundwater recharge

    NASA Astrophysics Data System (ADS)

    Bond, W. J.; Verburg, K.; Smith, C. J.

    2003-12-01

    Vegetation change is the accepted cause of increasing river salt concentrations and the salinisation of millions of hectares of farm land in Australia. Replacement of perennial native vegetation by annual crops and pastures following European settlement has altered the water balance causing increased groundwater recharge and mobilising the naturally saline groundwater. The Redesigning Agriculture for Australian Landscapes Program, of which the work described here is a part, was established to develop agricultural practices that are more attuned to the delicate water balance described above. Results of field measurements will be presented that contrast the water balance characteristics of native vegetation with those of conventional agricultural plants, and indicate the functional characteristics required of new agricultural practices to reduce recharge. New agricultural practices may comprise different management of current crops and pastures, or may involve introducing totally new species. In either case, long-term testing is required to examine their impact on recharge over a long enough climate record to encompass the natural variability of rainfall that is characteristic of most Australian farming regions. Field experimentation therefore needs to be complemented and extended by computer simulation. This requires a modelling approach that is more robust than conventional crop modelling because (a) it needs to be sensitive enough to predict small changes in the residual recharge term, (b) it needs to be able to simulate a variety of vegetation in different sequences, (c) it needs to be able to simulate continuously for several decades of input data, and (d) it therefore needs to be able to simulate the period between crops, which often has a critical impact on recharge. The APSIM simulation framework will be used to illustrate these issues and to explore the effect of different vegetation combinations on recharge.

  6. Brain metastasis and treatment

    PubMed Central

    Ahluwalia, Manmeet S.; Vogelbaum, Michael V.; Chao, Samuel T.

    2014-01-01

    Despite major therapeutic advances in the management of patients with systemic malignancies, management of brain metastases remains a significant challenge. These patients often require multidisciplinary care that includes surgical resection, radiation therapy, chemotherapy, and targeted therapies. Complex decisions about the sequencing of therapies to control extracranial and intracranial disease require input from neurosurgeons, radiation oncologists, and medical/neuro-oncologists. With advances in understanding of the biology of brain metastases, molecularly defined disease subsets and the advent of targeted therapy as well as immunotherapeutic agents offer promise. Future care of these patients will entail tailoring treatment based on host (performance status and age) and tumor (molecular cytogenetic characteristics, number of metastases, and extracranial disease status) factors. Considerable work involving preclinical models and better clinical trial designs that focus not only on effective control of tumor but also on quality of life and neurocognition needs to be done to improve the outcome of these patients. PMID:25580268

  7. The building blocks of a 'Liveable Neighbourhood': Identifying the key performance indicators for walking of an operational planning policy in Perth, Western Australia.

    PubMed

    Hooper, Paula; Knuiman, Matthew; Foster, Sarah; Giles-Corti, Billie

    2015-11-01

    Planning policy makers are requesting clearer guidance on the key design features required to build neighbourhoods that promote active living. Using a backwards stepwise elimination procedure (logistic regression with generalised estimating equations adjusting for demographic characteristics, self-selection factors, stage of construction and scale of development) this study identified specific design features (n=16) from an operational planning policy ("Liveable Neighbourhoods") that showed the strongest associations with walking behaviours (measured using the Neighbourhood Physical Activity Questionnaire). The interacting effects of design features on walking behaviours were also investigated. The urban design features identified were grouped into the "building blocks of a Liveable Neighbourhood", reflecting the scale, importance and sequencing of the design and implementation phases required to create walkable, pedestrian friendly developments. Copyright © 2015 Elsevier Ltd. All rights reserved.

  8. High density bit transition requirements versus the effects on BCH error correcting code. [bit synchronization

    NASA Technical Reports Server (NTRS)

    Ingels, F. M.; Schoggen, W. O.

    1982-01-01

    The design to achieve the required bit transition density for the Space Shuttle high rate multiplexes (HRM) data stream of the Space Laboratory Vehicle is reviewed. It contained a recommended circuit approach, specified the pseudo random (PN) sequence to be used and detailed the properties of the sequence. Calculations showing the probability of failing to meet the required transition density were included. A computer simulation of the data stream and PN cover sequence was provided. All worst case situations were simulated and the bit transition density exceeded that required. The Preliminary Design Review and the critical Design Review are documented. The Cover Sequence Generator (CSG) Encoder/Decoder design was constructed and demonstrated. The demonstrations were successful. All HRM and HRDM units incorporate the CSG encoder or CSG decoder as appropriate.

  9. CROSS-DISCIPLINARY PHYSICS AND RELATED AREAS OF SCIENCE AND TECHNOLOGY: Characteristics of alternating current hopping conductivity in DNA sequences

    NASA Astrophysics Data System (ADS)

    Ma, Song-Shan; Xu, Hui; Wang, Huan-You; Guo, Rui

    2009-08-01

    This paper presents a model to describe alternating current (AC) conductivity of DNA sequences, in which DNA is considered as a one-dimensional (1D) disordered system, and electrons transport via hopping between localized states. It finds that AC conductivity in DNA sequences increases as the frequency of the external electric field rises, and it takes the form of øac(ω) ~ ω2 ln2(1/ω). Also AC conductivity of DNA sequences increases with the increase of temperature, this phenomenon presents characteristics of weak temperature-dependence. Meanwhile, the AC conductivity in an off-diagonally correlated case is much larger than that in the uncorrelated case of the Anderson limit in low temperatures, which indicates that the off-diagonal correlations in DNA sequences have a great effect on the AC conductivity, while at high temperature the off-diagonal correlations no longer play a vital role in electric transport. In addition, the proportion of nucleotide pairs p also plays an important role in AC electron transport of DNA sequences. For p < 0.5, the conductivity of DNA sequence decreases with the increase of p, while for p >= 0.5, the conductivity increases with the increase of p.

  10. Breaking Lander-Waterman’s Coverage Bound

    PubMed Central

    Nashta-ali, Damoun; Motahari, Seyed Abolfazl; Hosseinkhalaj, Babak

    2016-01-01

    Lander-Waterman’s coverage bound establishes the total number of reads required to cover the whole genome of size G bases. In fact, their bound is a direct consequence of the well-known solution to the coupon collector’s problem which proves that for such genome, the total number of bases to be sequenced should be O(G ln G). Although the result leads to a tight bound, it is based on a tacit assumption that the set of reads are first collected through a sequencing process and then are processed through a computation process, i.e., there are two different machines: one for sequencing and one for processing. In this paper, we present a significant improvement compared to Lander-Waterman’s result and prove that by combining the sequencing and computing processes, one can re-sequence the whole genome with as low as O(G) sequenced bases in total. Our approach also dramatically reduces the required computational power for the combined process. Simulation results are performed on real genomes with different sequencing error rates. The results support our theory predicting the log G improvement on coverage bound and corresponding reduction in the total number of bases required to be sequenced. PMID:27806058

  11. Genomics dataset on unclassified published organism (patent US 7547531).

    PubMed

    Khan Shawan, Mohammad Mahfuz Ali; Hasan, Md Ashraful; Hossain, Md Mozammel; Hasan, Md Mahmudul; Parvin, Afroza; Akter, Salina; Uddin, Kazi Rasel; Banik, Subrata; Morshed, Mahbubul; Rahman, Md Nazibur; Rahman, S M Badier

    2016-12-01

    Nucleotide (DNA) sequence analysis provides important clues regarding the characteristics and taxonomic position of an organism. With the intention that, DNA sequence analysis is very crucial to learn about hierarchical classification of that particular organism. This dataset (patent US 7547531) is chosen to simplify all the complex raw data buried in undisclosed DNA sequences which help to open doors for new collaborations. In this data, a total of 48 unidentified DNA sequences from patent US 7547531 were selected and their complete sequences were retrieved from NCBI BioSample database. Quick response (QR) code of those DNA sequences was constructed by DNA BarID tool. QR code is useful for the identification and comparison of isolates with other organisms. AT/GC content of the DNA sequences was determined using ENDMEMO GC Content Calculator, which indicates their stability at different temperature. The highest GC content was observed in GP445188 (62.5%) which was followed by GP445198 (61.8%) and GP445189 (59.44%), while lowest was in GP445178 (24.39%). In addition, New England BioLabs (NEB) database was used to identify cleavage code indicating the 5, 3 and blunt end and enzyme code indicating the methylation site of the DNA sequences was also shown. These data will be helpful for the construction of the organisms' hierarchical classification, determination of their phylogenetic and taxonomic position and revelation of their molecular characteristics.

  12. Mini-midi-mito: adapting the amplification and sequencing strategy of mtDNA to the degradation state of crime scene samples.

    PubMed

    Berger, Cordula; Parson, Walther

    2009-06-01

    The degradation state of some biological traces recovered from the crime scene requires the amplification of very short fragments to attain a useful mitochondrial (mt)DNA sequence. We have previously introduced two mini-multiplex assays that amplify 10 overlapping control region (CR) fragments in two separate multiplex PCRs, which brought successful CR consensus sequences from even highly degraded DNA extracts. This procedure requires a total of 20 sequencing reactions per sample, which is laborious and cost intensive. For only moderately degraded samples that we encounter more frequently with typical mtDNA casework material, we developed two new multiplex assays that use a subset of the mini-amplicon primers but embrace larger fragments (midis) and require only 10 sequencing reactions to build a double-stranded CR consensus sequence. We used a preceding mtDNA quantitation step by real-time PCR with two different target fragments (143 and 283 bp) that roughly correspond to the average fragment sizes of the different multiplex approaches to estimate size-dependent mtDNA quantities and to aid the choice of the appropriate PCR multiplexes with respect to quality of the results and required costs.

  13. Influenza virus sequence feature variant type analysis: evidence of a role for NS1 in influenza virus host range restriction.

    PubMed

    Noronha, Jyothi M; Liu, Mengya; Squires, R Burke; Pickett, Brett E; Hale, Benjamin G; Air, Gillian M; Galloway, Summer E; Takimoto, Toru; Schmolke, Mirco; Hunt, Victoria; Klem, Edward; García-Sastre, Adolfo; McGee, Monnie; Scheuermann, Richard H

    2012-05-01

    Genetic drift of influenza virus genomic sequences occurs through the combined effects of sequence alterations introduced by a low-fidelity polymerase and the varying selective pressures experienced as the virus migrates through different host environments. While traditional phylogenetic analysis is useful in tracking the evolutionary heritage of these viruses, the specific genetic determinants that dictate important phenotypic characteristics are often difficult to discern within the complex genetic background arising through evolution. Here we describe a novel influenza virus sequence feature variant type (Flu-SFVT) approach, made available through the public Influenza Research Database resource (www.fludb.org), in which variant types (VTs) identified in defined influenza virus protein sequence features (SFs) are used for genotype-phenotype association studies. Since SFs have been defined for all influenza virus proteins based on known structural, functional, and immune epitope recognition properties, the Flu-SFVT approach allows the rapid identification of the molecular genetic determinants of important influenza virus characteristics and their connection to underlying biological functions. We demonstrate the use of the SFVT approach to obtain statistical evidence for effects of NS1 protein sequence variations in dictating influenza virus host range restriction.

  14. Hepatozoon silvestris sp. nov.: morphological and molecular characterization of a new species of Hepatozoon (Adeleorina: Hepatozoidae) from the European wild cat (Felis silvestris silvestris).

    PubMed

    Hodžić, Adnan; Alić, Amer; Prašović, Senad; Otranto, Domenico; Baneth, Gad; Duscher, Georg Gerhard

    2017-04-01

    Based on morphological and genetic characteristics, we describe a new species of Hepatozoon in the European wild cat (Felis silvestris silvestris), herein named Hepatozoon silvestris sp. nov. The study also provides the first data on the occurrence of H. felis in this wild felid. Hepatozoon meronts were observed in multiple cross-sections of different organs of four (44%) cats. Additionally, extracellular forms, resembling mature gamonts of Hepatozoon, were found in the spleen and myocardium of two cats. Furthermore, tissues of six animals (67%) were positive by PCR. Hepatozoon felis was identified infecting one cat (11%), whereas the 18S rRNA sequences of the remaining five cats (56%) were identical, but distinct from the sequences of H. felis. Phylogenetic analyses revealed that those sequences form a highly supported clade distant from other Hepatozoon spp. Future studies should include domestic cats from the areas where the wild cats positive for H. silvestris sp. nov. were found, in order to investigate their potential role to serve as intermediate hosts of this newly described species. Identification of its definitive host(s) and experimental transmission studies are required for elucidating the full life cycle of this parasite and the possible alternative routes of its transmission.

  15. Novel techniques for data decomposition and load balancing for parallel processing of vision systems: Implementation and evaluation using a motion estimation system

    NASA Technical Reports Server (NTRS)

    Choudhary, Alok Nidhi; Leung, Mun K.; Huang, Thomas S.; Patel, Janak H.

    1989-01-01

    Computer vision systems employ a sequence of vision algorithms in which the output of an algorithm is the input of the next algorithm in the sequence. Algorithms that constitute such systems exhibit vastly different computational characteristics, and therefore, require different data decomposition techniques and efficient load balancing techniques for parallel implementation. However, since the input data for a task is produced as the output data of the previous task, this information can be exploited to perform knowledge based data decomposition and load balancing. Presented here are algorithms for a motion estimation system. The motion estimation is based on the point correspondence between the involved images which are a sequence of stereo image pairs. Researchers propose algorithms to obtain point correspondences by matching feature points among stereo image pairs at any two consecutive time instants. Furthermore, the proposed algorithms employ non-iterative procedures, which results in saving considerable amounts of computation time. The system consists of the following steps: (1) extraction of features; (2) stereo match of images in one time instant; (3) time match of images from consecutive time instants; (4) stereo match to compute final unambiguous points; and (5) computation of motion parameters.

  16. A novel class of small RNAs bind to MILI protein in mouse testes.

    PubMed

    Aravin, Alexei; Gaidatzis, Dimos; Pfeffer, Sébastien; Lagos-Quintana, Mariana; Landgraf, Pablo; Iovino, Nicola; Morris, Patricia; Brownstein, Michael J; Kuramochi-Miyagawa, Satomi; Nakano, Toru; Chien, Minchen; Russo, James J; Ju, Jingyue; Sheridan, Robert; Sander, Chris; Zavolan, Mihaela; Tuschl, Thomas

    2006-07-13

    Small RNAs bound to Argonaute proteins recognize partially or fully complementary nucleic acid targets in diverse gene-silencing processes. A subgroup of the Argonaute proteins--known as the 'Piwi family'--is required for germ- and stem-cell development in invertebrates, and two Piwi members--MILI and MIWI--are essential for spermatogenesis in mouse. Here we describe a new class of small RNAs that bind to MILI in mouse male germ cells, where they accumulate at the onset of meiosis. The sequences of the over 1,000 identified unique molecules share a strong preference for a 5' uridine, but otherwise cannot be readily classified into sequence families. Genomic mapping of these small RNAs reveals a limited number of clusters, suggesting that these RNAs are processed from long primary transcripts. The small RNAs are 26-31 nucleotides (nt) in length--clearly distinct from the 21-23 nt of microRNAs (miRNAs) or short interfering RNAs (siRNAs)--and we refer to them as 'Piwi-interacting RNAs' or piRNAs. Orthologous human chromosomal regions also give rise to small RNAs with the characteristics of piRNAs, but the cloned sequences are distinct. The identification of this new class of small RNAs provides an important starting point to determine the molecular function of Piwi proteins in mammalian spermatogenesis.

  17. RNA Polymerase III promoter screen uncovers a novel noncoding RNA family conserved in Caenorhabditis and other clade V nematodes.

    PubMed

    Gruber, Andreas R

    2014-07-10

    RNA Polymerase III is a highly specialized enzyme complex responsible for the transcription of a very distinct set of housekeeping noncoding RNAs including tRNAs, 7SK snRNA, Y RNAs, U6 snRNA, and the RNA components of RNaseP and RNaseMRP. In this work we have utilized the conserved promoter structure of known RNA Polymerase III transcripts consisting of characteristic sequence elements termed proximal sequence elements (PSE) A and B and a TATA-box to uncover a novel RNA Polymerase III-transcribed, noncoding RNA family found to be conserved in Caenorhabditis as well as other clade V nematode species. Homology search in combination with detailed sequence and secondary structure analysis revealed that members of this novel ncRNA family evolve rapidly, and only maintain a potentially functional small stem structure that links the 5' end to the very 3' end of the transcript and a small hairpin structure at the 3' end. This is most likely required for efficient transcription termination. In addition, our study revealed evidence that canonical C/D box snoRNAs are also transcribed from a PSE A-PSE B-TATA-box promoter in Caenorhabditis elegans. Copyright © 2014 Elsevier B.V. All rights reserved.

  18. Independence of amplitude-frequency and phase calibrations in an SSVEP-based BCI using stepping delay flickering sequences.

    PubMed

    Chang, Hsiang-Chih; Lee, Po-Lei; Lo, Men-Tzung; Lee, I-Hui; Yeh, Ting-Kuang; Chang, Chun-Yen

    2012-05-01

    This study proposes a steady-state visual evoked potential (SSVEP)-based brain-computer interface (BCI) independent of amplitude-frequency and phase calibrations. Six stepping delay flickering sequences (SDFSs) at 32-Hz flickering frequency were used to implement a six-command BCI system. EEG signals recorded from Oz position were first filtered within 29-35 Hz, segmented based on trigger events of SDFSs to obtain SDFS epochs, and then stored separately in epoch registers. An epoch-average process suppressed the inter-SDFS interference. For each detection point, the latest six SDFS epochs in each epoch register were averaged and the normalized power of averaged responses was calculated. The visual target that induced the maximum normalized power was identified as the visual target. Eight subjects were recruited in this study. All subjects were requested to produce the "563241" command sequence four times. The averaged accuracy, command transfer interval, and information transfer rate (mean ± std.) values for all eight subjects were 97.38 ± 5.97%, 3.56 ± 0.68 s, and 42.46 ± 11.17 bits/min, respectively. The proposed system requires no calibration in either the amplitude-frequency characteristic or the reference phase of SSVEP which may provide an efficient and reliable channel for the neuromuscular disabled to communicate with external environments.

  19. Regional stochastic generation of streamflows using an ARIMA (1,0,1) process and disaggregation

    USGS Publications Warehouse

    Armbruster, Jeffrey T.

    1979-01-01

    An ARIMA (1,0,1) model was calibrated and used to generate long annual flow sequences at three sites in the Juniata River basin, Pennsylvania. The model preserves the mean, variance, and cross correlations of the observed station data. In addition, it has a desirable blend of both high and low frequency characteristics and therefore is capable of preserving the Hurst coefficient, h. The generated annual flows are disaggregated into monthly sequences using a modification of the Valencia-Schaake model. The low-flow frequency and flow duration characteristics of the generated monthly flows, with length equal to the historical data, compare favorably with the historical data. Once the models were verified, 100-year sequences were generated and analyzed for their low flow characteristics. One-, three- and six- month low-flow frequencies at recurrence intervals greater than 10 years are generally found to be lower than flow computed from the historical flows. A method is proposed for synthesizing flows at ungaged sites. (Kosco-USGS)

  20. Anaerobic sequencing batch reactor in pilot scale for treatment of tofu industry wastewater

    NASA Astrophysics Data System (ADS)

    Rahayu, Suparni Setyowati; Purwanto, Budiyono

    2015-12-01

    The small industry of tofu production process releases the waste water without being processed first, and the wastewater is directly discharged into water. In this study, Anaerobic Sequencing Batch Reactor in Pilot Scale for Treatment of Tofu Industry was developed through an anaerobic process to produce biogas as one kind of environmentally friendly renewable energy which can be developed into the countryside. The purpose of this study was to examine the fundamental characteristics of organic matter elimination of industrial wastewater with small tofu effective method and utilize anaerobic active sludge with Anaerobic Sequencing Bath Reactor (ASBR) to get rural biogas as an energy source. The first factor is the amount of the active sludge concentration which functions as the decomposers of organic matter and controlling selectivity allowance to degrade organic matter. The second factor is that HRT is the average period required substrate to react with the bacteria in the Anaerobic Sequencing Bath Reactor (ASBR).The results of processing the waste of tofu production industry using ASBR reactor with active sludge additions as starter generates cumulative volume of 5814.4 mL at HRT 5 days so that in this study it is obtained the conversion 0.16 L of CH4/g COD and produce biogas containing of CH4: 81.23% and CO2: 16.12%. The wastewater treatment of tofu production using ASBR reactor is able to produce renewable energy that has economic value as well as environmentally friendly by nature.

  1. ECHO: A reference-free short-read error correction algorithm

    PubMed Central

    Kao, Wei-Chun; Chan, Andrew H.; Song, Yun S.

    2011-01-01

    Developing accurate, scalable algorithms to improve data quality is an important computational challenge associated with recent advances in high-throughput sequencing technology. In this study, a novel error-correction algorithm, called ECHO, is introduced for correcting base-call errors in short-reads, without the need of a reference genome. Unlike most previous methods, ECHO does not require the user to specify parameters of which optimal values are typically unknown a priori. ECHO automatically sets the parameters in the assumed model and estimates error characteristics specific to each sequencing run, while maintaining a running time that is within the range of practical use. ECHO is based on a probabilistic model and is able to assign a quality score to each corrected base. Furthermore, it explicitly models heterozygosity in diploid genomes and provides a reference-free method for detecting bases that originated from heterozygous sites. On both real and simulated data, ECHO is able to improve the accuracy of previous error-correction methods by several folds to an order of magnitude, depending on the sequence coverage depth and the position in the read. The improvement is most pronounced toward the end of the read, where previous methods become noticeably less effective. Using a whole-genome yeast data set, it is demonstrated here that ECHO is capable of coping with nonuniform coverage. Also, it is shown that using ECHO to perform error correction as a preprocessing step considerably facilitates de novo assembly, particularly in the case of low-to-moderate sequence coverage depth. PMID:21482625

  2. Library construction for next-generation sequencing: Overviews and challenges

    PubMed Central

    Head, Steven R.; Komori, H. Kiyomi; LaMere, Sarah A.; Whisenant, Thomas; Van Nieuwerburgh, Filip; Salomon, Daniel R.; Ordoukhanian, Phillip

    2014-01-01

    High-throughput sequencing, also known as next-generation sequencing (NGS), has revolutionized genomic research. In recent years, NGS technology has steadily improved, with costs dropping and the number and range of sequencing applications increasing exponentially. Here, we examine the critical role of sequencing library quality and consider important challenges when preparing NGS libraries from DNA and RNA sources. Factors such as the quantity and physical characteristics of the RNA or DNA source material as well as the desired application (i.e., genome sequencing, targeted sequencing, RNA-seq, ChIP-seq, RIP-seq, and methylation) are addressed in the context of preparing high quality sequencing libraries. In addition, the current methods for preparing NGS libraries from single cells are also discussed. PMID:24502796

  3. Curriculum Mapping in Higher Education: A Case Study and Proposed Content Scope and Sequence Mapping Tool

    ERIC Educational Resources Information Center

    Arafeh, Sousan

    2016-01-01

    Best practice in curriculum development and implementation requires that discipline-based standards or requirements embody both curricular and programme scopes and sequences. Ensuring these are present and aligned in course/programme content, activities and assessments to support student success requires formalised and systematised review and…

  4. Plastid-targeting peptides from the chlorarachniophyte Bigelowiella natans.

    PubMed

    Rogers, Matthew B; Archibald, John M; Field, Matthew A; Li, Catherine; Striepen, Boris; Keeling, Patrick J

    2004-01-01

    Chlorarachniophytes are marine amoeboflagellate protists that have acquired their plastid (chloroplast) through secondary endosymbiosis with a green alga. Like other algae, most of the proteins necessary for plastid function are encoded in the nuclear genome of the secondary host. These proteins are targeted to the organelle using a bipartite leader sequence consisting of a signal peptide (allowing entry in to the endomembrane system) and a chloroplast transit peptide (for transport across the chloroplast envelope membranes). We have examined the leader sequences from 45 full-length predicted plastid-targeted proteins from the chlorarachniophyte Bigelowiella natans with the goal of understanding important features of these sequences and possible conserved motifs. The chemical characteristics of these sequences were compared with a set of 10 B. natans endomembrane-targeted proteins and 38 cytosolic or nuclear proteins, which show that the signal peptides are similar to those of most other eukaryotes, while the transit peptides differ from those of other algae in some characteristics. Consistent with this, the leader sequence from one B. natans protein was tested for function in the apicomplexan parasite, Toxoplasma gondii, and shown to direct the secretion of the protein.

  5. Design and Research of the Sewage Treatment Control System

    NASA Astrophysics Data System (ADS)

    Chu, J.; Hu, W. W.

    Due to the rapid development of China's economy, the water pollution has become a problem that we have to face. In particular, how to deal with industrial wastewater has become a top priority. In wastewater treatment, the control system based on PLC has met the design requirement in real-time, reliability, precision and so on. The integration of sequence control and process control in PLC, has the characteristics of high reliability, simple network, convenient and flexible use. PLC is a powerful tool for small and medium-sized industrial automation. Therefore, the sewage treatment control system take PLC as the core of control system, can nicely solve the problem of industrial wastewater in a certain extent.

  6. The evolution of phase holographic imaging from a research idea to publicly traded company

    NASA Astrophysics Data System (ADS)

    Egelberg, Peter

    2018-02-01

    Recognizing the value and unmet need for label-free kinetic cell analysis, Phase Holograhic Imaging defines its market segment as automated, easy to use and affordable time-lapse cytometry. The process of developing new technology, meeting customer expectations, sources of corporate funding and R&D adjustments prompted by field experience will be reviewed. Additionally, it is discussed how relevant biological information can be extracted from a sequence of quantitative phase images, with negligible user assistance and parameter tweaking, to simultaneously provide cell culture characteristics such as cell growth rate, viability, division rate, mitosis duration, phagocytosis rate, migration, motility and cell-cell adherence without requiring any artificial cell manipulation.

  7. Involvement of Sp1 and Microsatellite Repressor Sequences in the Transcriptional Control of the Human CD30 Gene

    PubMed Central

    Croager, Emma J.; Gout, Alexander M.; Abraham, Lawrence J.

    2000-01-01

    CD30, as a member of the tumor necrosis factor (TNF) receptor family, is expressed on the surface of activated lymphoid cells. CD30 overexpression is a characteristic of lymphoproliferative diseases such as Hodgkin’s/non-Hodgkin’s lymphomas, embryonal carcinoma, and a number of Th2-associated diseases. The CD30 gene has been mapped to a region of the murine genome that is involved in susceptibility to systemic lupus erythematosus. Functionally, CD30 may play a role in the deletion of autoreactive T cells. We were interested in determining the molecular nature of CD30 overexpression. Sequence comparison has revealed significant identity between the TATA-less human and murine CD30 promoters; they share a number of common consensus binding motifs. Transfection assays identified three regions of transcriptional importance; the region between position −1.2 kb and −336 bp, containing a CCAT microsatellite sequence, a conserved Sp1 site at positions −43 to −38, and a downstream promoter element (DPE) at positions +24 to +29. EMSA and DNase I footprinting showed specific DNA-protein interactions of the CD30 promoter with the Sp1 site and the CCAT repeat region. The DPE element was shown to be essential for start site selection. We conclude that the conserved Sp1 site at −43 to −38 is associated with maximum reporter gene activity, the DPE element is required for start site selection, and the CCAT tetranucleotide repeats act to repress transcription. We also have shown that the microsatellite is multiallelic, when we screened a random healthy population. Further studies are required to determine whether microsatellite instability in the repressor predisposes susceptible individuals to CD30 overexpression. PMID:10793083

  8. Application of advanced cytometric and molecular technologies to minimal residual disease monitoring

    NASA Astrophysics Data System (ADS)

    Leary, James F.; He, Feng; Reece, Lisa M.

    2000-04-01

    Minimal residual disease monitoring presents a number of theoretical and practical challenges. Recently it has been possible to meet some of these challenges by combining a number of new advanced biotechnologies. To monitor the number of residual tumor cells requires complex cocktails of molecular probes that collectively provide sensitivities of detection on the order of one residual tumor cell per million total cells. Ultra-high-speed, multi parameter flow cytometry is capable of analyzing cells at rates in excess of 100,000 cells/sec. Residual tumor selection marker cocktails can be optimized by use of receiver operating characteristic analysis. New data minimizing techniques when combined with multi variate statistical or neural network classifications of tumor cells can more accurately predict residual tumor cell frequencies. The combination of these techniques can, under at least some circumstances, detect frequencies of tumor cells as low as one cell in a million with an accuracy of over 98 percent correct classification. Detection of mutations in tumor suppressor genes requires insolation of these rare tumor cells and single-cell DNA sequencing. Rare residual tumor cells can be isolated at single cell level by high-resolution single-cell cell sorting. Molecular characterization of tumor suppressor gene mutations can be accomplished using a combination of single- cell polymerase chain reaction amplification of specific gene sequences followed by TA cloning techniques and DNA sequencing. Mutations as small as a single base pair in a tumor suppressor gene of a single sorted tumor cell have been detected using these methods. Using new amplification procedures and DNA micro arrays it should be possible to extend the capabilities shown in this paper to screening of multiple DNA mutations in tumor suppressor and other genes on small numbers of sorted metastatic tumor cells.

  9. Magnetic resonance imaging for the detection, localisation, and characterisation of prostate cancer: recommendations from a European consensus meeting.

    PubMed

    Dickinson, Louise; Ahmed, Hashim U; Allen, Clare; Barentsz, Jelle O; Carey, Brendan; Futterer, Jurgen J; Heijmink, Stijn W; Hoskin, Peter J; Kirkham, Alex; Padhani, Anwar R; Persad, Raj; Puech, Philippe; Punwani, Shonit; Sohaib, Aslam S; Tombal, Bertrand; Villers, Arnauld; van der Meulen, Jan; Emberton, Mark

    2011-04-01

    Multiparametric magnetic resonance imaging (mpMRI) may have a role in detecting clinically significant prostate cancer in men with raised serum prostate-specific antigen levels. Variations in technique and the interpretation of images have contributed to inconsistency in its reported performance characteristics. Our aim was to make recommendations on a standardised method for the conduct, interpretation, and reporting of prostate mpMRI for prostate cancer detection and localisation. A consensus meeting of 16 European prostate cancer experts was held that followed the UCLA-RAND Appropriateness Method and facilitated by an independent chair. Before the meeting, 520 items were scored for "appropriateness" by panel members, discussed face to face, and rescored. Agreement was reached in 67% of 260 items related to imaging sequence parameters. T2-weighted, dynamic contrast-enhanced, and diffusion-weighted MRI were the key sequences incorporated into the minimum requirements. Consensus was also reached on 54% of 260 items related to image interpretation and reporting, including features of malignancy on individual sequences. A 5-point scale was agreed on for communicating the probability of malignancy, with a minimum of 16 prostatic regions of interest, to include a pictorial representation of suspicious foci. Limitations relate to consensus methodology. Dominant personalities are known to affect the opinions of the group and were countered by a neutral chairperson. Consensus was reached on a number of areas related to the conduct, interpretation, and reporting of mpMRI for the detection, localisation, and characterisation of prostate cancer. Before optimal dissemination of this technology, these outcomes will require formal validation in prospective trials. Copyright © 2010 European Association of Urology. Published by Elsevier B.V. All rights reserved.

  10. A molecular model for illegitimate recombination in Bacillus subtilis.

    PubMed

    Temeyer, K B; Hopkins, K M; Chapman, L F

    1991-01-01

    The recombinant DNA junctions at which pUB110 and Bacillus subtilis chromosomal DNA were joined to form the plasmid pKBT1 were cloned and sequenced. From the sequencing data we conclude that the pUB110 sequence is intact in the pair of cloned pKBT1 fragments and pTL12 sequences are not present. A molecular model for the formation of pKBT1 based on structural motifs characteristic of the joint sites is presented.

  11. Sequences required for induction of neurotensin receptor gene expression during neuronal differentiation of N1E-115 neuroblastoma cells.

    PubMed

    Tavares, D; Tully, K; Dobner, P R

    1999-10-15

    The promoter region of the mouse high affinity neurotensin receptor (Ntr-1) gene was characterized, and sequences required for expression in neuroblastoma cell lines that express high affinity NT-binding sites were characterized. Me(2)SO-induced neuronal differentiation of N1E-115 neuroblastoma cells increased both the expression of the endogenous Ntr-1 gene and reporter genes driven by NTR-1 promoter sequences by 3-4-fold. Deletion analysis revealed that an 83-base pair promoter region containing the transcriptional start site is required for Me(2)SO activation. Detailed mutational analysis of this region revealed that a CACCC box and the central region of a large GC-rich palindrome are the crucial cis-regulatory elements required for Me(2)SO induction. The CACCC box is bound by at least one factor that is induced upon Me(2)SO treatment of N1E-115 cells. The Me(2)SO effect was found to be both selective and cell type-restricted. Basal expression in the neuroblastoma cell lines required a distinct set of sequences, including an Sp1-like sequence, and a sequence resembling an NGFI-A-binding site; however, a more distal 5' sequence was found to repress basal activity in N1E-115 cells. These results provide evidence that Ntr-1 gene regulation involves both positive and negative regulatory elements located in the 5'-flanking region and that Ntr-1 gene activation involves the coordinate activation or induction of several factors, including a CACCC box binding complex.

  12. Study on the Evolution of Genes Mutation Related With Gastrointestinal Stromal Tumors

    ClinicalTrials.gov

    2012-01-05

    Full Gene Sequences of c-KIT、PDGFRA and DOG1 Are Analyzed With the Screening-sequencing Approach; Investigate the Characteristics and Variations Associated With the Different Gene Mutations of c-KIT、PDGFRA and DOG1 in GIST Patients

  13. Complete genome sequence analysis of a duck circovirus from Guangxi pockmark ducks.

    PubMed

    Xie, Liji; Xie, Zhixun; Zhao, Guangyuan; Liu, Jiabo; Pang, Yaoshan; Deng, Xianwen; Xie, Zhiqin; Fan, Qing

    2012-12-01

    We report here the complete genomic sequence of a novel duck circovirus (DuCV) strain, GX1104, isolated from Guangxi pockmark ducks in Guangxi, China. The whole nucleotide sequence had the highest homology (97.2%) with the sequence of strain TC/2002 (GenBank accession number AY394721.1) and had a low homology (76.8% to 78.6%) with the sequences of other strains isolated from China, Germany, and the United States. This report will help to understand the epidemiology and molecular characteristics of Guangxi pockmark duck circovirus in southern China.

  14. A Homologue of an Operon Required for DNA Transfer in Agrobacterium Is Required in Brucella abortus for Virulence and Intracellular Multiplication

    PubMed Central

    Sieira, Rodrigo; Comerci, Diego J.; Sánchez, Daniel O.; Ugalde, Rodolfo A.

    2000-01-01

    As part of a Brucella abortus 2308 genome project carried out in our laboratory, we identified, cloned, and sequenced a genomic DNA fragment containing a locus (virB) highly homologous to bacterial type IV secretion systems. The B. abortus virB locus is a collinear arrangement of 13 open reading frames (ORFs). Between virB1 and virB2 and downstream of ORF12, two degenerated, palindromic repeat sequences characteristic of Brucella intergenic regions were found. Gene reporter studies demonstrated that the B. abortus virB locus constitutes an operon transcribed from virB1 which is turned on during the stationary phase of growth. A B. abortus polar virB1 mutant failed to replicate in HeLa cells, indicating that the virB operon plays a critical role in intracellular multiplication. Mutants with polar and nonpolar mutations introduced in virB10 showed different behaviors in mice and in the HeLa cell infection assay, suggesting that virB10 per se is necessary for the correct function of this type IV secretion apparatus. Mouse infection assays demonstrated that the virB operon constitutes a major determinant of B. abortus virulence. It is suggested that putative effector molecules secreted by this type IV secretion system determine routing of B. abortus to an endoplasmic reticulum-related replication compartment. PMID:10940027

  15. The AWA1 Gene Is Required for the Foam-Forming Phenotype and Cell Surface Hydrophobicity of Sake Yeast

    PubMed Central

    Shimoi, Hitoshi; Sakamoto, Kazutoshi; Okuda, Masaki; Atthi, Ratchanee; Iwashita, Kazuhiro; Ito, Kiyoshi

    2002-01-01

    Sake, a traditional alcoholic beverage in Japan, is brewed with sake yeasts, which are classified as Saccharomyces cerevisiae. Almost all sake yeasts form a thick foam layer on sake mash during the fermentation process because of their cell surface hydrophobicity, which increases the cells' affinity for bubbles. To reduce the amount of foam, nonfoaming mutants were bred from foaming sake yeasts. Nonfoaming mutants have hydrophilic cell surfaces and no affinity for bubbles. We have cloned a gene from a foam-forming sake yeast that confers foaming ability to a nonfoaming mutant. This gene was named AWA1 and structures of the gene and its product were analyzed. The N- and C-terminal regions of Awa1p have the characteristic sequences of a glycosylphosphatidylinositol anchor protein. The entire protein is rich in serine and threonine residues and has a lot of repetitive sequences. These results suggest that Awa1p is localized in the cell wall. This was confirmed by immunofluorescence microscopy and Western blotting analysis using hemagglutinin-tagged Awa1p. Moreover, an awa1 disruptant of sake yeast was hydrophilic and showed a nonfoaming phenotype in sake mash. We conclude that Awa1p is a cell wall protein and is required for the foam-forming phenotype and the cell surface hydrophobicity of sake yeast. PMID:11916725

  16. Analyses of Evolutionary Characteristics of the Hemagglutinin-Esterase Gene of Influenza C Virus during a Period of 68 Years Reveals Evolutionary Patterns Different from Influenza A and B Viruses.

    PubMed

    Furuse, Yuki; Matsuzaki, Yoko; Nishimura, Hidekazu; Oshitani, Hitoshi

    2016-11-26

    Infections with the influenza C virus causing respiratory symptoms are common, particularly among children. Since isolation and detection of the virus are rarely performed, compared with influenza A and B viruses, the small number of available sequences of the virus makes it difficult to analyze its evolutionary dynamics. Recently, we reported the full genome sequence of 102 strains of the virus. Here, we exploited the data to elucidate the evolutionary characteristics and phylodynamics of the virus compared with influenza A and B viruses. Along with our data, we obtained public sequence data of the hemagglutinin-esterase gene of the virus; the dataset consists of 218 unique sequences of the virus collected from 14 countries between 1947 and 2014. Informatics analyses revealed that (1) multiple lineages have been circulating globally; (2) there have been weak and infrequent selective bottlenecks; (3) the evolutionary rate is low because of weak positive selection and a low capability to induce mutations; and (4) there is no significant positive selection although a few mutations affecting its antigenicity have been induced. The unique evolutionary dynamics of the influenza C virus must be shaped by multiple factors, including virological, immunological, and epidemiological characteristics.

  17. Analyses of Evolutionary Characteristics of the Hemagglutinin-Esterase Gene of Influenza C Virus during a Period of 68 Years Reveals Evolutionary Patterns Different from Influenza A and B Viruses

    PubMed Central

    Furuse, Yuki; Matsuzaki, Yoko; Nishimura, Hidekazu; Oshitani, Hitoshi

    2016-01-01

    Infections with the influenza C virus causing respiratory symptoms are common, particularly among children. Since isolation and detection of the virus are rarely performed, compared with influenza A and B viruses, the small number of available sequences of the virus makes it difficult to analyze its evolutionary dynamics. Recently, we reported the full genome sequence of 102 strains of the virus. Here, we exploited the data to elucidate the evolutionary characteristics and phylodynamics of the virus compared with influenza A and B viruses. Along with our data, we obtained public sequence data of the hemagglutinin-esterase gene of the virus; the dataset consists of 218 unique sequences of the virus collected from 14 countries between 1947 and 2014. Informatics analyses revealed that (1) multiple lineages have been circulating globally; (2) there have been weak and infrequent selective bottlenecks; (3) the evolutionary rate is low because of weak positive selection and a low capability to induce mutations; and (4) there is no significant positive selection although a few mutations affecting its antigenicity have been induced. The unique evolutionary dynamics of the influenza C virus must be shaped by multiple factors, including virological, immunological, and epidemiological characteristics. PMID:27898037

  18. Simian virus 40 major late promoter: an upstream DNA sequence required for efficient in vitro transcription.

    PubMed Central

    Brady, J; Radonovich, M; Thoren, M; Das, G; Salzman, N P

    1984-01-01

    We have previously identified an 11-base DNA sequence, 5'-G-G-T-A-C-C-T-A-A-C-C-3' (simian virus 40 [SV40] map position 294 to 304), which is important in the control of SV40 late RNA expression in vitro and in vivo (Brady et al., Cell 31:625-633, 1982). We report here the identification of another domain of the SV40 late promoter. A series of mutants with deletions extending from SV40 map position 0 to 300 was prepared by nuclease BAL 31 treatment. The cloned templates were then analyzed for efficiency and accuracy of late SV40 RNA expression in the Manley in vitro transcription system. Our studies showed that, in addition to the promoter domain near map position 300, there are essential DNA sequences between nucleotide positions 74 and 95 that are required for efficient expression of late SV40 RNA. Included in this SV40 DNA sequence were two of the six GGGCGG SV40 repeat sequences and an 11-nucleotide segment which showed strong homology with the upstream sequences required for the efficient in vitro and in vivo expression of the histone H2A gene. This upstream promoter sequence supported transcription with the same efficiency even when it was moved 72 nucleotides closer to the major late cap site. In vitro promoter competition analysis demonstrated that the upstream promoter sequence, independent of the 294 to 304 promoter element, is capable of binding polymerase-transcription factors required for SV40 late gene transcription. Finally, we show that DNA sequences which control the specificity of RNA initiation at nucleotide 325 lie downstream of map position 294. Images PMID:6321950

  19. Molecular characterization of an ependymin precursor from goldfish brain.

    PubMed

    Königstorfer, A; Sterrer, S; Eckerskorn, C; Lottspeich, F; Schmidt, R; Hoffmann, W

    1989-01-01

    Ependymins are thought to be implicated in fundamental processes involved in plasticity of the goldfish CNS. Gas-phase sequencing of purified ependymins beta and gamma revealed that they share the same N-terminal sequence. Each sequence displays microheterogeneities at several positions. Based on the protein sequences obtained, we constructed synthetic oligonucleotides and used them as hybridization probes for screening cDNA libraries of goldfish brain. In this article we describe the full-length sequence of a mRNA encoding a precursor of ependymins. A cleavable signal sequence characteristic of secretory proteins is located at the N-terminal end, followed directly by the ependymin sequence. Also, two potential N-glycosylation sites were detected. A computer search revealed that ependymins form a novel family of unique proteins.

  20. Single-cell genome sequencing at ultra-high-throughput with microfluidic droplet barcoding.

    PubMed

    Lan, Freeman; Demaree, Benjamin; Ahmed, Noorsher; Abate, Adam R

    2017-07-01

    The application of single-cell genome sequencing to large cell populations has been hindered by technical challenges in isolating single cells during genome preparation. Here we present single-cell genomic sequencing (SiC-seq), which uses droplet microfluidics to isolate, fragment, and barcode the genomes of single cells, followed by Illumina sequencing of pooled DNA. We demonstrate ultra-high-throughput sequencing of >50,000 cells per run in a synthetic community of Gram-negative and Gram-positive bacteria and fungi. The sequenced genomes can be sorted in silico based on characteristic sequences. We use this approach to analyze the distributions of antibiotic-resistance genes, virulence factors, and phage sequences in microbial communities from an environmental sample. The ability to routinely sequence large populations of single cells will enable the de-convolution of genetic heterogeneity in diverse cell populations.

  1. Evidence that Altered Cis Element Spacing Affects PpsR Mediated Redox Control of Photosynthesis Gene Expression in Rubrivivax gelatinosus.

    PubMed

    Shimizu, Takayuki; Cheng, Zhuo; Matsuura, Katsumi; Masuda, Shinji; Bauer, Carl E

    2015-01-01

    PpsR is a major regulator of photosynthesis gene expression among all characterized purple photosynthetic bacteria. This transcription regulator has been extensively characterized in Rhodobacter (Rba.) capsulatus and Rba. sphaeroides which are members of the α-proteobacteria lineage. In this study, we have investigated the biochemical properties and mutational effects of a ppsR deletion strain in the β-proteobacterium Rubrivivax (Rvi.) gelatinosus in order to reveal phylogenetically conserved mechanisms and species-specific characteristics. A deletion of the ppsR gene resulted in de-repression of photosystem synthesis showing that PpsR functions as a repressor of photosynthesis genes in this species. We also constructed a Rvi. gelatinosus PpsR mutant in which a conserved cysteine at position 436 was changed to an alanine to examine whether or not this residue is important for sensing redox, as reported in Rhodobacter species. Surprisingly, the Cys436 Ala mutant retained the ability to repress photosynthesis gene expression under aerobic conditions, suggesting that PpsR from Rvi. gelatinosus has different redox-responding characteristics. Furthermore, biochemical analyses demonstrated that Rvi. gelatinosus PpsR only shows redox-dependent binding to promoters with 9-bp spacing, but not 8-bp spacing, between two PpsR-recognition sequences. These results indicate that redox-dependent binding of PpsR requires appropriate cis configuration of PpsR target sequences in Rvi. gelatinosus. These results also indicate that PpsR homologs from different species regulate photosynthesis genes with altered biochemical properties.

  2. Marine Structural Biomaterials in Medical Biomimicry.

    PubMed

    Green, David W; Lee, Jong-Min; Jung, Han-Sung

    2015-10-01

    Marine biomaterials display properties, behaviors, and functions that have not been artificially matched in relation to their hierarchical construction, crack-stopping properties, growth adaptation, and energy efficiency. The discovery and understanding of such features that are characteristic of natural biomaterials can be used to manufacture more energy-efficient and lightweight materials. However, a more detailed understanding of the design of natural biomaterials with good performance and the mechanism of their design is required. Far-reaching biomolecular characterization of biomaterials and biostructures from the ocean world is possible with sophisticated analytical methods, such as whole-genome RNA-seq, and de novo transcriptome sequencing and mass spectrophotometry-based sequencing. In combination with detailed material characterization, the elements in newly discovered biomaterials and their properties can be reconstituted into biomimetic or bio-inspired materials. A major aim of harnessing marine biomaterials is their translation into biomimetic counterparts. To achieve full translation, the genome, proteome, and hierarchical material characteristics, and their profiles in space and time, have to be associated to allow for smooth biomimetic translation. In this article, we highlight the novel science of marine biomimicry from a materials perspective. We focus on areas of material design and fabrication that have excelled in marine biological models, such as embedded interfaces, chiral organization, and the use of specialized composite material-on-material designs. Our emphasis is primarily on key materials with high value in healthcare in which we evaluate their future prospects. Marine biomaterials are among the most exquisite and powerful aspects in materials science today.

  3. Quantum sequencing: opportunities and challenges

    NASA Astrophysics Data System (ADS)

    di Ventra, Massimiliano

    Personalized or precision medicine refers to the ability of tailoring drugs to the specific genome and transcriptome of each individual. It is however not yet feasible due the high costs and slow speed of present DNA sequencing methods. I will discuss a sequencing protocol that requires the measurement of the distributions of transverse tunneling currents during the translocation of single-stranded DNA into nanochannels. I will show that such a quantum sequencing approach can reach unprecedented speeds, without requiring any chemical preparation, amplification or labeling. I will discuss recent experiments that support these theoretical predictions, the advantages of this approach over other sequencing methods, and stress the challenges that need to be overcome to render it commercially viable.

  4. SEED 2: a user-friendly platform for amplicon high-throughput sequencing data analyses.

    PubMed

    Vetrovský, Tomáš; Baldrian, Petr; Morais, Daniel; Berger, Bonnie

    2018-02-14

    Modern molecular methods have increased our ability to describe microbial communities. Along with the advances brought by new sequencing technologies, we now require intensive computational resources to make sense of the large numbers of sequences continuously produced. The software developed by the scientific community to address this demand, although very useful, require experience of the command-line environment, extensive training and have steep learning curves, limiting their use. We created SEED 2, a graphical user interface for handling high-throughput amplicon-sequencing data under Windows operating systems. SEED 2 is the only sequence visualizer that empowers users with tools to handle amplicon-sequencing data of microbial community markers. It is suitable for any marker genes sequences obtained through Illumina, IonTorrent or Sanger sequencing. SEED 2 allows the user to process raw sequencing data, identify specific taxa, produce of OTU-tables, create sequence alignments and construct phylogenetic trees. Standard dual core laptops with 8 GB of RAM can handle ca. 8 million of Illumina PE 300 bp sequences, ca. 4GB of data. SEED 2 was implemented in Object Pascal and uses internal functions and external software for amplicon data processing. SEED 2 is a freeware software, available at http://www.biomed.cas.cz/mbu/lbwrf/seed/ as a self-contained file, including all the dependencies, and does not require installation. Supplementary data contain a comprehensive list of supported functions. daniel.morais@biomed.cas.cz. Supplementary data are available at Bioinformatics online. © The Author(s) 2018. Published by Oxford University Press.

  5. Improvement of energy efficiency via spectrum optimization of excitation sequence for multichannel simultaneously triggered airborne sonar system

    NASA Astrophysics Data System (ADS)

    Meng, Qing-Hao; Yao, Zhen-Jing; Peng, Han-Yang

    2009-12-01

    Both the energy efficiency and correlation characteristics are important in airborne sonar systems to realize multichannel ultrasonic transducers working together. High energy efficiency can increase echo energy and measurement range, and sharp autocorrelation and flat cross correlation can help eliminate cross-talk among multichannel transducers. This paper addresses energy efficiency optimization under the premise that cross-talk between different sonar transducers can be avoided. The nondominated sorting genetic algorithm-II is applied to optimize both the spectrum and correlation characteristics of the excitation sequence. The central idea of the spectrum optimization is to distribute most of the energy of the excitation sequence within the frequency band of the sonar transducer; thus, less energy is filtered out by the transducers. Real experiments show that a sonar system consisting of eight-channel Polaroid 600 series electrostatic transducers excited with 2 ms optimized pulse-position-modulation sequences can work together without cross-talk and can measure distances up to 650 cm with maximal 1% relative error.

  6. Effects of dispense equipment sequence on process start-up defects

    NASA Astrophysics Data System (ADS)

    Brakensiek, Nick; Sevegney, Michael

    2013-03-01

    Photofluid dispense systems within coater/developer tools have been designed with the intent to minimize cost of ownership to the end user. Waste and defect minimization, dispense quality and repeatability, and ease of use are all desired characteristics. One notable change within commercially available systems is the sequence in which process fluid encounters dispense pump and filtration elements. Traditionally, systems adopted a pump-first sequence, where fluid is "pushed through" a point-of-use filter just prior to dispensing on the wafer. Recently, systems configured in a pump-last scheme have become available, where fluid is "pulled through" the filter, into the pump, and then is subsequently dispensed. The present work constitutes a comparative evaluation of the two equipment sequences with regard to the aforementioned characteristics that impact cost of ownership. Additionally, removal rating and surface chemistry (i.e., hydrophilicity) of the point-of-use filter are varied in order to evaluate their influence on system start-up and defects.

  7. Targeted sequencing of plant genomes

    Treesearch

    Mark D. Huynh

    2014-01-01

    Next-generation sequencing (NGS) has revolutionized the field of genetics by providing a means for fast and relatively affordable sequencing. With the advancement of NGS, wholegenome sequencing (WGS) has become more commonplace. However, sequencing an entire genome is still not cost effective or even beneficial in all cases. In studies that do not require a whole-...

  8. A Simulation of DNA Sequencing Utilizing 3M Post-It[R] Notes

    ERIC Educational Resources Information Center

    Christensen, Doug

    2009-01-01

    An inexpensive and equipment free approach to teaching the technical aspects of DNA sequencing. The activity described requires an instructor with a familiarity of DNA sequencing technology but provides a straight forward method of teaching the technical aspects of sequencing in the absence of expensive sequencing equipment. The final sequence…

  9. Adaptive efficient compression of genomes

    PubMed Central

    2012-01-01

    Modern high-throughput sequencing technologies are able to generate DNA sequences at an ever increasing rate. In parallel to the decreasing experimental time and cost necessary to produce DNA sequences, computational requirements for analysis and storage of the sequences are steeply increasing. Compression is a key technology to deal with this challenge. Recently, referential compression schemes, storing only the differences between a to-be-compressed input and a known reference sequence, gained a lot of interest in this field. However, memory requirements of the current algorithms are high and run times often are slow. In this paper, we propose an adaptive, parallel and highly efficient referential sequence compression method which allows fine-tuning of the trade-off between required memory and compression speed. When using 12 MB of memory, our method is for human genomes on-par with the best previous algorithms in terms of compression ratio (400:1) and compression speed. In contrast, it compresses a complete human genome in just 11 seconds when provided with 9 GB of main memory, which is almost three times faster than the best competitor while using less main memory. PMID:23146997

  10. [Rocuronium and sugammadex in emergency medicine: requirements of a muscle relaxant for rapid sequence induction].

    PubMed

    Luxen, J; Trentzsch, H; Urban, B

    2014-04-01

    The required characteristics of neuromuscular blockers for rapid sequence induction (RSI) are clearly defined: nearly immediate effectiveness and short duration of effect. These demands are not only necessary for ideal conditions of quick endotracheal intubation without mask-bag intermediate ventilation but are also essential to enable a quick return to sufficient spontaneous breathing in case of a cannot intubate cannot ventilate situation. Until recently only succinylcholine had these characteristics; however, a considerable number of dangerous side effects and contraindications had to be accepted. In 1996, rocuronium was introduced, which was capable of immediately establishing good intubation conditions similar to succinylcholine. However, the median duration of effect is 45-60 min and it therefore contains a risk if the patient cannot be ventilated and oxygenated. Therefore, rocuronium is considered a good alternative but not a complete substitute for succinylcholine. The introduction of sugammadex in 2008 for quick reversal of rocuronium changed matters. Comparative studies from the past 4 years dealing with rocuronium/sugammadex versus uccinylcholine in RSI showed that rocuronium and sugammadex combined enabled a significantly faster return to sufficient spontaneous ventilation in emergency situations and also proved that the use of rocuronium significantly reduced the degree of desaturation during the interval between injection and ventilation postintubation. rocuronium used in hospital is a very good substitute for succinylcholine as a neuromuscular blocker during RSI as long as sugammadex is at hand for reversal. It remains to be considered that in a situation with severe problems of the airway and breathing, which are the main preclinical indications for intubation, a forward strategy for ventilation of the patient is the only acceptable way in most cases and the return to spontaneous breathing is not an alternative. Therefore, the value of sugammadex and also of succinylcholine is limited for these situations. Additionally, economic factors such as storage conditions for rocuronium and the cost of sugammadex must also be considered.

  11. Pathology of serrated colorectal lesions.

    PubMed

    Bateman, Adrian C

    2014-10-01

    The concept of serrated colorectal neoplasia has become recognised as a key process in the development of colorectal cancer (CRC) and an important alternative pathway to malignancy compared with the long established ‘adenoma-carcinoma’ sequence. Increasing recognition of the morphological spectrum of serrated lesions has occurred in parallel with elucidation of the distinct molecular genetic characteristics of progression from normal mucosa, via the ‘serrated pathway’, to CRC. Some of these lesions can be difficult to identify at colonoscopy. Challenges for pathologists include the requirement for accurate recognition of the forms of serrated lesions that are associated with a significant risk of malignant progression and therefore the need for widely disseminated reproducible criteria for their diagnosis. Alongside this process, pathologists and endoscopists need to formulate clear guidelines for the management of patients with these lesions, particularly with respect to the optimal follow-up intervals. This review provides practical guidance for the recognition of these lesions by pathologists, a discussion of ‘serrated adenocarcinoma’ and an insight into the distinct molecular genetic alterations that are seen in this spectrum of lesions in comparison to those that characterise the classic ‘adenoma-carcinoma’ sequence.

  12. An Overview on Prenatal Screening for Chromosomal Aberrations.

    PubMed

    Hixson, Lucas; Goel, Srishti; Schuber, Paul; Faltas, Vanessa; Lee, Jessica; Narayakkadan, Anjali; Leung, Ho; Osborne, Jim

    2015-10-01

    This article is a review of current and emerging methods used for prenatal detection of chromosomal aneuploidies. Chromosomal anomalies in the developing fetus can occur in any pregnancy and lead to death prior to or shortly after birth or to costly lifelong disabilities. Early detection of fetal chromosomal aneuploidies, an atypical number of certain chromosomes, can help parents evaluate their pregnancy options. Current diagnostic methods include maternal serum sampling or nuchal translucency testing, which are minimally invasive diagnostics, but lack sensitivity and specificity. The gold standard, karyotyping, requires amniocentesis or chorionic villus sampling, which are highly invasive and can cause abortions. In addition, many of these methods have long turnaround times, which can cause anxiety in mothers. Next-generation sequencing of fetal DNA in maternal blood enables minimally invasive, sensitive, and reasonably rapid analysis of fetal chromosomal anomalies and can be of clinical utility to parents. This review covers traditional methods and next-generation sequencing techniques for diagnosing aneuploidies in terms of clinical utility, technological characteristics, and market potential. © 2015 Society for Laboratory Automation and Screening.

  13. Estimation of bladder wall location in ultrasound images.

    PubMed

    Topper, A K; Jernigan, M E

    1991-05-01

    A method of automatically estimating the location of the bladder wall in ultrasound images is proposed. Obtaining this estimate is intended to be the first stage in the development of an automatic bladder volume calculation system. The first step in the bladder wall estimation scheme involves globally processing the images using standard image processing techniques to highlight the bladder wall. Separate processing sequences are required to highlight the anterior bladder wall and the posterior bladder wall. The sequence to highlight the anterior bladder wall involves Gaussian smoothing and second differencing followed by zero-crossing detection. Median filtering followed by thresholding and gradient detection is used to highlight as much of the rest of the bladder wall as was visible in the original images. Then a 'bladder wall follower'--a line follower with rules based on the characteristics of ultrasound imaging and the anatomy involved--is applied to the processed images to estimate the bladder wall location by following the portions of the bladder wall which are highlighted and filling in the missing segments. The results achieved using this scheme are presented.

  14. Geometric phase coded metasurface: from polarization dependent directive electromagnetic wave scattering to diffusion-like scattering.

    PubMed

    Chen, Ke; Feng, Yijun; Yang, Zhongjie; Cui, Li; Zhao, Junming; Zhu, Bo; Jiang, Tian

    2016-10-24

    Ultrathin metasurface compromising various sub-wavelength meta-particles offers promising advantages in controlling electromagnetic wave by spatially manipulating the wavefront characteristics across the interface. The recently proposed digital coding metasurface could even simplify the design and optimization procedures due to the digitalization of the meta-particle geometry. However, current attempts to implement the digital metasurface still utilize several structural meta-particles to obtain certain electromagnetic responses, and requiring time-consuming optimization especially in multi-bits coding designs. In this regard, we present herein utilizing geometric phase based single structured meta-particle with various orientations to achieve either 1-bit or multi-bits digital metasurface. Particular electromagnetic wave scattering patterns dependent on the incident polarizations can be tailored by the encoded metasurfaces with regular sequences. On the contrast, polarization insensitive diffusion-like scattering can also been successfully achieved by digital metasurface encoded with randomly distributed coding sequences leading to substantial suppression of backward scattering in a broadband microwave frequency. The proposed digital metasurfaces provide simple designs and reveal new opportunities for controlling electromagnetic wave scattering with or without polarization dependence.

  15. Planar Covariation of Hindlimb and Forelimb Elevation Angles during Terrestrial and Aquatic Locomotion of Dogs

    PubMed Central

    Catavitello, Giovanna; Ivanenko, Yuri P.; Lacquaniti, Francesco

    2015-01-01

    The rich repertoire of locomotor behaviors in quadrupedal animals requires flexible inter-limb and inter-segmental coordination. Here we studied the kinematic coordination of different gaits (walk, trot, gallop, and swim) of six dogs (Canis lupus familiaris) and, in particular, the planar covariation of limb segment elevation angles. The results showed significant variations in the relative duration of rearward limb movement, amplitude of angular motion, and inter-limb coordination, with gait patterns ranging from a lateral sequence of footfalls during walking to a diagonal sequence in swimming. Despite these differences, the planar law of inter-segmental coordination was maintained across different gaits in both forelimbs and hindlimbs. Notably, phase relationships and orientation of the covariation plane were highly limb specific, consistent with the functional differences in their neural control. Factor analysis of published muscle activity data also demonstrated differences in the characteristic timing of basic activation patterns of the forelimbs and hindlimbs. Overall, the results demonstrate that the planar covariation of inter-segmental coordination has emerged for both fore- and hindlimbs and all gaits, although in a limb-specific manner. PMID:26218076

  16. Magnetoencephalographic Signals Identify Stages in Real-Life Decision Processes

    PubMed Central

    Braeutigam, Sven; Stins, John F.; Rose, Steven P. R.; Swithenby, Stephen J.; Ambler, Tim

    2001-01-01

    We used magnetoencephalography (MEG) to study the dynamics of neural responses in eight subjects engaged in shopping for day-to-day items from supermarket shelves. This behavior not only has personal and economic importance but also provides an example of an experience that is both personal and shared between individuals. The shopping experience enables the exploration of neural mechanisms underlying choice based on complex memories. Choosing among different brands of closely related products activated a robust sequence of signals within the first second after the presentation of the choice images. This sequence engaged first the visual cortex (80-100 ms), then as the images were analyzed, predominantly the left temporal regions (310-340 ms). At longer latency, characteristic neural activetion was found in motor speech areas (500-520 ms) for images requiring low salience choices with respect to previous (brand) memory, and in right parietal cortex for high salience choices (850-920 ms). We argue that the neural processes associated with the particular brand-choice stimulus can be separated into identifiable stages through observation of MEG responses and knowledge of functional anatomy. PMID:12018772

  17. Geometric phase coded metasurface: from polarization dependent directive electromagnetic wave scattering to diffusion-like scattering

    PubMed Central

    Chen, Ke; Feng, Yijun; Yang, Zhongjie; Cui, Li; Zhao, Junming; Zhu, Bo; Jiang, Tian

    2016-01-01

    Ultrathin metasurface compromising various sub-wavelength meta-particles offers promising advantages in controlling electromagnetic wave by spatially manipulating the wavefront characteristics across the interface. The recently proposed digital coding metasurface could even simplify the design and optimization procedures due to the digitalization of the meta-particle geometry. However, current attempts to implement the digital metasurface still utilize several structural meta-particles to obtain certain electromagnetic responses, and requiring time-consuming optimization especially in multi-bits coding designs. In this regard, we present herein utilizing geometric phase based single structured meta-particle with various orientations to achieve either 1-bit or multi-bits digital metasurface. Particular electromagnetic wave scattering patterns dependent on the incident polarizations can be tailored by the encoded metasurfaces with regular sequences. On the contrast, polarization insensitive diffusion-like scattering can also been successfully achieved by digital metasurface encoded with randomly distributed coding sequences leading to substantial suppression of backward scattering in a broadband microwave frequency. The proposed digital metasurfaces provide simple designs and reveal new opportunities for controlling electromagnetic wave scattering with or without polarization dependence. PMID:27775064

  18. Development and use of molecular markers: past and present.

    PubMed

    Grover, Atul; Sharma, P C

    2016-01-01

    Molecular markers, due to their stability, cost-effectiveness and ease of use provide an immensely popular tool for a variety of applications including genome mapping, gene tagging, genetic diversity diversity, phylogenetic analysis and forensic investigations. In the last three decades, a number of molecular marker techniques have been developed and exploited worldwide in different systems. However, only a handful of these techniques, namely RFLPs, RAPDs, AFLPs, ISSRs, SSRs and SNPs have received global acceptance. A recent revolution in DNA sequencing techniques has taken the discovery and application of molecular markers to high-throughput and ultrahigh-throughput levels. Although, the choice of marker will obviously depend on the targeted use, microsatellites, SNPs and genotyping by sequencing (GBS) largely fulfill most of the user requirements. Further, modern transcriptomic and functional markers will lead the ventures onto high-density genetic map construction, identification of QTLs, breeding and conservation strategies in times to come in combination with other high throughput techniques. This review presents an overview of different marker technologies and their variants with a comparative account of their characteristic features and applications.

  19. Probabilistic models of eukaryotic evolution: time for integration

    PubMed Central

    Lartillot, Nicolas

    2015-01-01

    In spite of substantial work and recent progress, a global and fully resolved picture of the macroevolutionary history of eukaryotes is still under construction. This concerns not only the phylogenetic relations among major groups, but also the general characteristics of the underlying macroevolutionary processes, including the patterns of gene family evolution associated with endosymbioses, as well as their impact on the sequence evolutionary process. All these questions raise formidable methodological challenges, calling for a more powerful statistical paradigm. In this direction, model-based probabilistic approaches have played an increasingly important role. In particular, improved models of sequence evolution accounting for heterogeneities across sites and across lineages have led to significant, although insufficient, improvement in phylogenetic accuracy. More recently, one main trend has been to move away from simple parametric models and stepwise approaches, towards integrative models explicitly considering the intricate interplay between multiple levels of macroevolutionary processes. Such integrative models are in their infancy, and their application to the phylogeny of eukaryotes still requires substantial improvement of the underlying models, as well as additional computational developments. PMID:26323768

  20. The importance of genetic verification for determination of Atlantic salmon in north Pacific waters

    USGS Publications Warehouse

    Nielsen, J.L.; Williams, I.; Sage, G.K.; Zimmerman, C.E.

    2003-01-01

    Genetic analyses of two unknown but putative Atlantic salmon Salmo salar captured in the Copper River drainage, Alaska, demonstrated the need for validation of morphologically unusual fishes. Mitochondrial DNA sequences (control region and cytochrome b) and data from two nuclear genes [first internal transcribed spacer (ITS-1) sequence and growth hormone (GH1) amplification product] indicated that the fish caught in fresh water on the Martin River was a coho salmon Oncorhynchus kisutch, while the other fish caught in the intertidal zone of the Copper River delta near Grass Island was an Atlantic salmon. Determination of unusual or cryptic fish based on limited physical characteristics and expected seasonal spawning run timing will add to the controversy over farmed Atlantic salmon and their potential effects on native Pacific species. It is clear that determination of all putative collections of Atlantic salmon found in Pacific waters requires validation. Due to uncertainty of fish identification in the field using plastic morphometric characters, it is recommended that genetic analyses be part of the validation process. ?? 2003 The Fisheries Society of the British Isles.

  1. Pressure-induced structural transformations and polymerization in ThC2

    PubMed Central

    Guo, Yongliang; Yu, Cun; Lin, Jun; Wang, Changying; Ren, Cuilan; Sun, Baoxing; Huai, Ping; Xie, Ruobing; Ke, Xuezhi; Zhu, Zhiyuan; Xu, Hongjie

    2017-01-01

    Thorium-carbon systems have been thought as promising nuclear fuel for Generation IV reactors which require high-burnup and safe nuclear fuel. Existing knowledge on thorium carbides under extreme condition remains insufficient and some is controversial due to limited studies. Here we systematically predict all stable structures of thorium dicarbide (ThC2) under the pressure ranging from ambient to 300 GPa by merging ab initio total energy calculations and unbiased structure searching method, which are in sequence of C2/c, C2/m, Cmmm, Immm and P6/mmm phases. Among these phases, the C2/m is successfully observed for the first time via in situ synchrotron XRD measurements, which exhibits an excellent structural correspondence to our theoretical predictions. The transition sequence and the critical pressures are predicted. The calculated results also reveal the polymerization behaviors of the carbon atoms and the corresponding characteristic C-C bonding under various pressures. Our work provides key information on the fundamental material behavior and insights into the underlying mechanisms that lay the foundation for further exploration and application of ThC2. PMID:28383571

  2. Pressure-induced structural transformations and polymerization in ThC2

    NASA Astrophysics Data System (ADS)

    Guo, Yongliang; Yu, Cun; Lin, Jun; Wang, Changying; Ren, Cuilan; Sun, Baoxing; Huai, Ping; Xie, Ruobing; Ke, Xuezhi; Zhu, Zhiyuan; Xu, Hongjie

    2017-04-01

    Thorium-carbon systems have been thought as promising nuclear fuel for Generation IV reactors which require high-burnup and safe nuclear fuel. Existing knowledge on thorium carbides under extreme condition remains insufficient and some is controversial due to limited studies. Here we systematically predict all stable structures of thorium dicarbide (ThC2) under the pressure ranging from ambient to 300 GPa by merging ab initio total energy calculations and unbiased structure searching method, which are in sequence of C2/c, C2/m, Cmmm, Immm and P6/mmm phases. Among these phases, the C2/m is successfully observed for the first time via in situ synchrotron XRD measurements, which exhibits an excellent structural correspondence to our theoretical predictions. The transition sequence and the critical pressures are predicted. The calculated results also reveal the polymerization behaviors of the carbon atoms and the corresponding characteristic C-C bonding under various pressures. Our work provides key information on the fundamental material behavior and insights into the underlying mechanisms that lay the foundation for further exploration and application of ThC2.

  3. Pressure-induced structural transformations and polymerization in ThC2.

    PubMed

    Guo, Yongliang; Yu, Cun; Lin, Jun; Wang, Changying; Ren, Cuilan; Sun, Baoxing; Huai, Ping; Xie, Ruobing; Ke, Xuezhi; Zhu, Zhiyuan; Xu, Hongjie

    2017-04-06

    Thorium-carbon systems have been thought as promising nuclear fuel for Generation IV reactors which require high-burnup and safe nuclear fuel. Existing knowledge on thorium carbides under extreme condition remains insufficient and some is controversial due to limited studies. Here we systematically predict all stable structures of thorium dicarbide (ThC 2 ) under the pressure ranging from ambient to 300 GPa by merging ab initio total energy calculations and unbiased structure searching method, which are in sequence of C2/c, C2/m, Cmmm, Immm and P6/mmm phases. Among these phases, the C2/m is successfully observed for the first time via in situ synchrotron XRD measurements, which exhibits an excellent structural correspondence to our theoretical predictions. The transition sequence and the critical pressures are predicted. The calculated results also reveal the polymerization behaviors of the carbon atoms and the corresponding characteristic C-C bonding under various pressures. Our work provides key information on the fundamental material behavior and insights into the underlying mechanisms that lay the foundation for further exploration and application of ThC 2 .

  4. Taxonomic evaluation of Streptomyces albus and related species using multilocus sequence analysis and proposals to emend the description of Streptomyces albus and describe Streptomyces pathocidini sp. nov

    USDA-ARS?s Scientific Manuscript database

    In phylogenetic analyses of the genus Streptomyces using 16S rRNA gene sequences, Streptomyces albus subsp. albus NRRL B-1811T forms a cluster with 5 other species having identical or nearly identical 16S rRNA gene sequences. Moreover, the morphological and physiological characteristics of these oth...

  5. A catalog of aftershock sequences in Greece (1971 1997): Their spatial and temporal characteristics

    NASA Astrophysics Data System (ADS)

    Drakatos, George; Latoussakis, John

    A complete catalog of aftershock sequences is provided for main earthquakes with ML 5.0, which occurred in the area of Greece and surrounding regions the last twenty-seven years. The Monthly Bulletins of the Institute of Geodynamics (National Observatory of Athens) have been used as data source. In order to get a homogeneous catalog, several selection criteria have been applied and hence a catalog of 44 aftershock sequences is compiled. The relations between the duration of the sequence, the number of aftershocks, the magnitude of the largest aftershock and its delay time from the main shock as well as the subsurface rupture length versus the magnitude of the main shock are calculated. The results show that linearity exists between the subsurface rupture length and the magnitude of the main shock independent of the slip type, as well as between the magnitude of the main shock (M) and its largest aftershock (Ma). The mean difference M-Ma is almost one unit. In the 40% of the analyzed sequences, the largest aftershock occurred within one day after the main shock.The fact that the aftershock sequences show the same behavior for earthquakes that occur in the same region supports the theory that the spatial and temporal characteristics are strongly related to the stress distribution of the fault area.

  6. Transferring the Characteristics of Naturally Occurring and Biased Antibody Repertoires to Human Antibody Libraries by Trapping CDRH3 Sequences

    PubMed Central

    Venet, Sophie; Ravn, Ulla; Buatois, Vanessa; Gueneau, Franck; Calloud, Sébastien; Kosco-Vilbois, Marie; Fischer, Nicolas

    2012-01-01

    Antibody repertoires are characterized by diversity as they vary not only amongst individuals and post antigen exposure but also differ significantly between vertebrate species. Such plasticity can be exploited to generate human antibody libraries featuring hallmarks of these diverse repertoires. In this study, the focus was to capture CDRH3 sequences, as this region generally accounts for most of the interaction energy with antigen. Sequences from human as well as non-human sources were successfully integrated into human antibody libraries. Next generation sequencing of these libraries proved that the CDRH3 lengths and amino acid composition corresponded to the species of origin. Specific CDRH3 sequences, biased towards the recognition of a model antigen either by immunizing mice or by selecting with phage display, were then integrated into another set of libraries. From these antigen biased libraries, highly potent antibodies were more frequently isolated, indicating that the characteristics of an immune repertoire is transferrable via CDRH3 sequences into a human antibody library. Taken together, these data demonstrate that the properties of naturally or experimentally biased repertoires can be effectively harnessed for the generation of targeted human antibody libraries, substantially increasing the probability of isolating antibodies suitable for therapeutic and diagnostic applications. PMID:22937053

  7. The HIP1 initiator element plays a role in determining the in vitro requirement of the dihydrofolate reductase gene promoter for the C-terminal domain of RNA polymerase II.

    PubMed

    Buermeyer, A B; Thompson, N E; Strasheim, L A; Burgess, R R; Farnham, P J

    1992-05-01

    We examined the ability of purified RNA polymerase (RNAP) II lacking the carboxy-terminal heptapeptide repeat domain (CTD), called RNAP IIB, to transcribe a variety of promoters in HeLa extracts in which endogenous RNAP II activity was inhibited with anti-CTD monoclonal antibodies. Not all promoters were efficiently transcribed by RNAP IIB, and transcription did not correlate with the in vitro strength of the promoter or with the presence of a consensus TATA box. This was best illustrated by the GC-rich, non-TATA box promoters of the bidirectional dihydrofolate reductase (DHFR)-REP-encoding locus. Whereas the REP promoter was transcribed by RNAP IIB, the DHFR promoter remained inactive after addition of RNAP IIB to the antibody-inhibited reactions. However, both promoters were efficiently transcribed when purified RNAP with an intact CTD was added. We analyzed a series of promoter deletions to identify which cis elements determine the requirement for the CTD of RNAP II. All of the promoter deletions of both DHFR and REP retained the characteristics of their respective full-length promoters, suggesting that the information necessary to specify the requirement for the CTD is contained within approximately 65 bp near the initiation site. Furthermore, a synthetic minimal promoter of DHFR, consisting of a single binding site for Sp1 and a binding site for the HIP1 initiator cloned into a bacterial vector sequence, required RNAP II with an intact CTD for activity in vitro. Since the synthetic minimal promoter of DHFR and the smallest REP promoter deletion are both activated by Sp1, the differential response in this assay does not result from upstream activators. However, the sequences around the start sites of DHFR and REP are not similar and our data suggest that they bind different proteins. Therefore, we propose that specific initiator elements are important for determination of the requirement of some promoters for the CTD.

  8. Standardized Metadata for Human Pathogen/Vector Genomic Sequences

    PubMed Central

    Dugan, Vivien G.; Emrich, Scott J.; Giraldo-Calderón, Gloria I.; Harb, Omar S.; Newman, Ruchi M.; Pickett, Brett E.; Schriml, Lynn M.; Stockwell, Timothy B.; Stoeckert, Christian J.; Sullivan, Dan E.; Singh, Indresh; Ward, Doyle V.; Yao, Alison; Zheng, Jie; Barrett, Tanya; Birren, Bruce; Brinkac, Lauren; Bruno, Vincent M.; Caler, Elizabet; Chapman, Sinéad; Collins, Frank H.; Cuomo, Christina A.; Di Francesco, Valentina; Durkin, Scott; Eppinger, Mark; Feldgarden, Michael; Fraser, Claire; Fricke, W. Florian; Giovanni, Maria; Henn, Matthew R.; Hine, Erin; Hotopp, Julie Dunning; Karsch-Mizrachi, Ilene; Kissinger, Jessica C.; Lee, Eun Mi; Mathur, Punam; Mongodin, Emmanuel F.; Murphy, Cheryl I.; Myers, Garry; Neafsey, Daniel E.; Nelson, Karen E.; Nierman, William C.; Puzak, Julia; Rasko, David; Roos, David S.; Sadzewicz, Lisa; Silva, Joana C.; Sobral, Bruno; Squires, R. Burke; Stevens, Rick L.; Tallon, Luke; Tettelin, Herve; Wentworth, David; White, Owen; Will, Rebecca; Wortman, Jennifer; Zhang, Yun; Scheuermann, Richard H.

    2014-01-01

    High throughput sequencing has accelerated the determination of genome sequences for thousands of human infectious disease pathogens and dozens of their vectors. The scale and scope of these data are enabling genotype-phenotype association studies to identify genetic determinants of pathogen virulence and drug/insecticide resistance, and phylogenetic studies to track the origin and spread of disease outbreaks. To maximize the utility of genomic sequences for these purposes, it is essential that metadata about the pathogen/vector isolate characteristics be collected and made available in organized, clear, and consistent formats. Here we report the development of the GSCID/BRC Project and Sample Application Standard, developed by representatives of the Genome Sequencing Centers for Infectious Diseases (GSCIDs), the Bioinformatics Resource Centers (BRCs) for Infectious Diseases, and the U.S. National Institute of Allergy and Infectious Diseases (NIAID), part of the National Institutes of Health (NIH), informed by interactions with numerous collaborating scientists. It includes mapping to terms from other data standards initiatives, including the Genomic Standards Consortium’s minimal information (MIxS) and NCBI’s BioSample/BioProjects checklists and the Ontology for Biomedical Investigations (OBI). The standard includes data fields about characteristics of the organism or environmental source of the specimen, spatial-temporal information about the specimen isolation event, phenotypic characteristics of the pathogen/vector isolated, and project leadership and support. By modeling metadata fields into an ontology-based semantic framework and reusing existing ontologies and minimum information checklists, the application standard can be extended to support additional project-specific data fields and integrated with other data represented with comparable standards. The use of this metadata standard by all ongoing and future GSCID sequencing projects will provide a consistent representation of these data in the BRC resources and other repositories that leverage these data, allowing investigators to identify relevant genomic sequences and perform comparative genomics analyses that are both statistically meaningful and biologically relevant. PMID:24936976

  9. Standardized metadata for human pathogen/vector genomic sequences.

    PubMed

    Dugan, Vivien G; Emrich, Scott J; Giraldo-Calderón, Gloria I; Harb, Omar S; Newman, Ruchi M; Pickett, Brett E; Schriml, Lynn M; Stockwell, Timothy B; Stoeckert, Christian J; Sullivan, Dan E; Singh, Indresh; Ward, Doyle V; Yao, Alison; Zheng, Jie; Barrett, Tanya; Birren, Bruce; Brinkac, Lauren; Bruno, Vincent M; Caler, Elizabet; Chapman, Sinéad; Collins, Frank H; Cuomo, Christina A; Di Francesco, Valentina; Durkin, Scott; Eppinger, Mark; Feldgarden, Michael; Fraser, Claire; Fricke, W Florian; Giovanni, Maria; Henn, Matthew R; Hine, Erin; Hotopp, Julie Dunning; Karsch-Mizrachi, Ilene; Kissinger, Jessica C; Lee, Eun Mi; Mathur, Punam; Mongodin, Emmanuel F; Murphy, Cheryl I; Myers, Garry; Neafsey, Daniel E; Nelson, Karen E; Nierman, William C; Puzak, Julia; Rasko, David; Roos, David S; Sadzewicz, Lisa; Silva, Joana C; Sobral, Bruno; Squires, R Burke; Stevens, Rick L; Tallon, Luke; Tettelin, Herve; Wentworth, David; White, Owen; Will, Rebecca; Wortman, Jennifer; Zhang, Yun; Scheuermann, Richard H

    2014-01-01

    High throughput sequencing has accelerated the determination of genome sequences for thousands of human infectious disease pathogens and dozens of their vectors. The scale and scope of these data are enabling genotype-phenotype association studies to identify genetic determinants of pathogen virulence and drug/insecticide resistance, and phylogenetic studies to track the origin and spread of disease outbreaks. To maximize the utility of genomic sequences for these purposes, it is essential that metadata about the pathogen/vector isolate characteristics be collected and made available in organized, clear, and consistent formats. Here we report the development of the GSCID/BRC Project and Sample Application Standard, developed by representatives of the Genome Sequencing Centers for Infectious Diseases (GSCIDs), the Bioinformatics Resource Centers (BRCs) for Infectious Diseases, and the U.S. National Institute of Allergy and Infectious Diseases (NIAID), part of the National Institutes of Health (NIH), informed by interactions with numerous collaborating scientists. It includes mapping to terms from other data standards initiatives, including the Genomic Standards Consortium's minimal information (MIxS) and NCBI's BioSample/BioProjects checklists and the Ontology for Biomedical Investigations (OBI). The standard includes data fields about characteristics of the organism or environmental source of the specimen, spatial-temporal information about the specimen isolation event, phenotypic characteristics of the pathogen/vector isolated, and project leadership and support. By modeling metadata fields into an ontology-based semantic framework and reusing existing ontologies and minimum information checklists, the application standard can be extended to support additional project-specific data fields and integrated with other data represented with comparable standards. The use of this metadata standard by all ongoing and future GSCID sequencing projects will provide a consistent representation of these data in the BRC resources and other repositories that leverage these data, allowing investigators to identify relevant genomic sequences and perform comparative genomics analyses that are both statistically meaningful and biologically relevant.

  10. Characteristics of HIV-infected U.S. Army soldiers linked in molecular transmission clusters, 2001-2012

    PubMed Central

    Jagodzinski, Linda L.; Liu, Ying; Pham, Peter T.; Kijak, Gustavo H.; Tovanabutra, Sodsai; McCutchan, Francine E.; Scoville, Stephanie L.; Cersovsky, Steven B.; Michael, Nelson L.; Scott, Paul T.; Peel, Sheila A.

    2017-01-01

    Objective Recent surveillance data suggests the United States (U.S.) Army HIV epidemic is concentrated among men who have sex with men. To identify potential targets for HIV prevention strategies, the relationship between demographic and clinical factors and membership within transmission clusters based on baseline pol sequences of HIV-infected Soldiers from 2001 through 2012 were analyzed. Methods We conducted a retrospective analysis of baseline partial pol sequences, demographic and clinical characteristics available for all Soldiers in active service and newly-diagnosed with HIV-1 infection from January 1, 2001 through December 31, 2012. HIV-1 subtype designations and transmission clusters were identified from phylogenetic analysis of sequences. Univariate and multivariate logistic regression models were used to evaluate and adjust for the association between characteristics and cluster membership. Results Among 518 of 995 HIV-infected Soldiers with available partial pol sequences, 29% were members of a transmission cluster. Assignment to a southern U.S. region at diagnosis and year of diagnosis were independently associated with cluster membership after adjustment for other significant characteristics (p<0.10) of age, race, year of diagnosis, region of duty assignment, sexually transmitted infections, last negative HIV test, antiretroviral therapy, and transmitted drug resistance. Subtyping of the pol fragment indicated HIV-1 subtype B infection predominated (94%) among HIV-infected Soldiers. Conclusion These findings identify areas to explore as HIV prevention targets in the U.S. Army. An increased frequency of current force testing may be justified, especially among Soldiers assigned to duty in installations with high local HIV prevalence such as southern U.S. states. PMID:28759645

  11. Automatic Debugging Support for UML Designs

    NASA Technical Reports Server (NTRS)

    Schumann, Johann; Swanson, Keith (Technical Monitor)

    2001-01-01

    Design of large software systems requires rigorous application of software engineering methods covering all phases of the software process. Debugging during the early design phases is extremely important, because late bug-fixes are expensive. In this paper, we describe an approach which facilitates debugging of UML requirements and designs. The Unified Modeling Language (UML) is a set of notations for object-orient design of a software system. We have developed an algorithm which translates requirement specifications in the form of annotated sequence diagrams into structured statecharts. This algorithm detects conflicts between sequence diagrams and inconsistencies in the domain knowledge. After synthesizing statecharts from sequence diagrams, these statecharts usually are subject to manual modification and refinement. By using the "backward" direction of our synthesis algorithm. we are able to map modifications made to the statechart back into the requirements (sequence diagrams) and check for conflicts there. Fed back to the user conflicts detected by our algorithm are the basis for deductive-based debugging of requirements and domain theory in very early development stages. Our approach allows to generate explanations oil why there is a conflict and which parts of the specifications are affected.

  12. Fast single-pass alignment and variant calling using sequencing data

    USDA-ARS?s Scientific Manuscript database

    Sequencing research requires efficient computation. Few programs use already known information about DNA variants when aligning sequence data to the reference map. New program findmap.f90 reads the previous variant list before aligning sequence, calling variant alleles, and summing the allele counts...

  13. Mariner 9 mapping science sequence design.

    NASA Technical Reports Server (NTRS)

    Goldman, A. M., Jr.

    1973-01-01

    The primary mission of Mariner 9 was to map the Martian surface. This paper discusses in detail the design of the mapping science sequences which were executed by the spacecraft in sixty days and during which over eighty percent of the surface was photographed. The sequence design was influenced by many factors: experimenter scientific objectives, instrument capabilities, spacecraft capabilities, orbit characteristics, and data return rates, which are illustrated graphically. Typical orbits are depicted for each of the three different mapping phases lasting twenty days. Examples of typical orbital sequence plans prepared daily during mission operations are given.

  14. Modern Computational Techniques for the HMMER Sequence Analysis

    PubMed Central

    2013-01-01

    This paper focuses on the latest research and critical reviews on modern computing architectures, software and hardware accelerated algorithms for bioinformatics data analysis with an emphasis on one of the most important sequence analysis applications—hidden Markov models (HMM). We show the detailed performance comparison of sequence analysis tools on various computing platforms recently developed in the bioinformatics society. The characteristics of the sequence analysis, such as data and compute-intensive natures, make it very attractive to optimize and parallelize by using both traditional software approach and innovated hardware acceleration technologies. PMID:25937944

  15. Influence of stacking sequence on scattering characteristics of the fundamental anti-symmetric Lamb wave at through holes in composite laminates.

    PubMed

    Veidt, Martin; Ng, Ching-Tai

    2011-03-01

    This paper investigates the scattering characteristics of the fundamental anti-symmetric (A(0)) Lamb wave at through holes in composite laminates. Three-dimensional (3D) finite element (FE) simulations and experimental measurements are used to study the physical phenomenon. Unidirectional, bidirectional, and quasi-isotropic composite laminates are considered in the study. The influence of different hole diameter to wavelength aspect ratios and different stacking sequences on wave scattering characteristics are investigated. The results show that amplitudes and directivity distribution of the scattered Lamb wave depend on these parameters. In the case of quasi-isotropic composite laminates, the scattering directivity patterns are dominated by the fiber orientation of the outer layers and are quite different for composite laminates with the same number of laminae but different stacking sequence. The study provides improved physical insight into the scattering phenomena at through holes in composite laminates, which is essential to develop, validate, and optimize guided wave damage detection and characterization techniques. © 2011 Acoustical Society of America

  16. Reference voltage calculation method based on zero-sequence component optimisation for a regional compensation DVR

    NASA Astrophysics Data System (ADS)

    Jian, Le; Cao, Wang; Jintao, Yang; Yinge, Wang

    2018-04-01

    This paper describes the design of a dynamic voltage restorer (DVR) that can simultaneously protect several sensitive loads from voltage sags in a region of an MV distribution network. A novel reference voltage calculation method based on zero-sequence voltage optimisation is proposed for this DVR to optimise cost-effectiveness in compensation of voltage sags with different characteristics in an ungrounded neutral system. Based on a detailed analysis of the characteristics of voltage sags caused by different types of faults and the effect of the wiring mode of the transformer on these characteristics, the optimisation target of the reference voltage calculation is presented with several constraints. The reference voltages under all types of voltage sags are calculated by optimising the zero-sequence component, which can reduce the degree of swell in the phase-to-ground voltage after compensation to the maximum extent and can improve the symmetry degree of the output voltages of the DVR, thereby effectively increasing the compensation ability. The validity and effectiveness of the proposed method are verified by simulation and experimental results.

  17. Biosynthesis of riboflavin: an unusual riboflavin synthase of Methanobacterium thermoautotrophicum.

    PubMed Central

    Eberhardt, S; Korn, S; Lottspeich, F; Bacher, A

    1997-01-01

    Riboflavin synthase was purified by a factor of about 1,500 from cell extract of Methanobacterium thermoautotrophicum. The enzyme had a specific activity of about 2,700 nmol mg(-1) h(-1) at 65 degrees C, which is relatively low compared to those of riboflavin synthases of eubacteria and yeast. Amino acid sequences obtained after proteolytic cleavage had no similarity with known riboflavin synthases. The gene coding for riboflavin synthase (designated ribC) was subsequently cloned by marker rescue with a ribC mutant of Escherichia coli. The ribC gene of M. thermoautotrophicum specifies a protein of 153 amino acid residues. The predicted amino acid sequence agrees with the information gleaned from Edman degradation of the isolated protein and shows 67% identity with the sequence predicted for the unannotated reading frame MJ1184 of Methanococcus jannaschii. The ribC gene is adjacent to a cluster of four genes with similarity to the genes cbiMNQO of Salmonella typhimurium, which form part of the cob operon (this operon contains most of the genes involved in the biosynthesis of vitamin B12). The amino acid sequence predicted by the ribC gene of M. thermoautotrophicum shows no similarity whatsoever to the sequences of riboflavin synthases of eubacteria and yeast. Most notably, the M. thermoautotrophicum protein does not show the internal sequence homology characteristic of eubacterial and yeast riboflavin synthases. The protein of M. thermoautotrophicum can be expressed efficiently in a recombinant E. coli strain. The specific activity of the purified, recombinant protein is 1,900 nmol mg(-1) h(-1) at 65 degrees C. In contrast to riboflavin synthases from eubacteria and fungi, the methanobacterial enzyme has an absolute requirement for magnesium ions. The 5' phosphate of 6,7-dimethyl-8-ribityllumazine does not act as a substrate. The findings suggest that riboflavin synthase has evolved independently in eubacteria and methanobacteria. PMID:9139911

  18. From sequencer to supercomputer: an automatic pipeline for managing and processing next generation sequencing data.

    PubMed

    Camerlengo, Terry; Ozer, Hatice Gulcin; Onti-Srinivasan, Raghuram; Yan, Pearlly; Huang, Tim; Parvin, Jeffrey; Huang, Kun

    2012-01-01

    Next Generation Sequencing is highly resource intensive. NGS Tasks related to data processing, management and analysis require high-end computing servers or even clusters. Additionally, processing NGS experiments requires suitable storage space and significant manual interaction. At The Ohio State University's Biomedical Informatics Shared Resource, we designed and implemented a scalable architecture to address the challenges associated with the resource intensive nature of NGS secondary analysis built around Illumina Genome Analyzer II sequencers and Illumina's Gerald data processing pipeline. The software infrastructure includes a distributed computing platform consisting of a LIMS called QUEST (http://bisr.osumc.edu), an Automation Server, a computer cluster for processing NGS pipelines, and a network attached storage device expandable up to 40TB. The system has been architected to scale to multiple sequencers without requiring additional computing or labor resources. This platform provides demonstrates how to manage and automate NGS experiments in an institutional or core facility setting.

  19. Integration of Temporal and Ordinal Information During Serial Interception Sequence Learning

    PubMed Central

    Gobel, Eric W.; Sanchez, Daniel J.; Reber, Paul J.

    2011-01-01

    The expression of expert motor skills typically involves learning to perform a precisely timed sequence of movements (e.g., language production, music performance, athletic skills). Research examining incidental sequence learning has previously relied on a perceptually-cued task that gives participants exposure to repeating motor sequences but does not require timing of responses for accuracy. Using a novel perceptual-motor sequence learning task, learning a precisely timed cued sequence of motor actions is shown to occur without explicit instruction. Participants learned a repeating sequence through practice and showed sequence-specific knowledge via a performance decrement when switched to an unfamiliar sequence. In a second experiment, the integration of representation of action order and timing sequence knowledge was examined. When either action order or timing sequence information was selectively disrupted, performance was reduced to levels similar to completely novel sequences. Unlike prior sequence-learning research that has found timing information to be secondary to learning action sequences, when the task demands require accurate action and timing information, an integrated representation of these types of information is acquired. These results provide the first evidence for incidental learning of fully integrated action and timing sequence information in the absence of an independent representation of action order, and suggest that this integrative mechanism may play a material role in the acquisition of complex motor skills. PMID:21417511

  20. Light-modulated abundance of an mRNA encoding a calmodulin-regulated, chromatin-associated NTPase in pea

    NASA Technical Reports Server (NTRS)

    Hsieh, H. L.; Tong, C. G.; Thomas, C.; Roux, S. J.

    1996-01-01

    A CDNA encoding a 47 kDa nucleoside triphosphatase (NTPase) that is associated with the chromatin of pea nuclei has been cloned and sequenced. The translated sequence of the cDNA includes several domains predicted by known biochemical properties of the enzyme, including five motifs characteristic of the ATP-binding domain of many proteins, several potential casein kinase II phosphorylation sites, a helix-turn-helix region characteristic of DNA-binding proteins, and a potential calmodulin-binding domain. The deduced primary structure also includes an N-terminal sequence that is a predicted signal peptide and an internal sequence that could serve as a bipartite-type nuclear localization signal. Both in situ immunocytochemistry of pea plumules and immunoblots of purified cell fractions indicate that most of the immunodetectable NTPase is within the nucleus, a compartment proteins typically reach through nuclear pores rather than through the endoplasmic reticulum pathway. The translated sequence has some similarity to that of human lamin C, but not high enough to account for the earlier observation that IgG against human lamin C binds to the NTPase in immunoblots. Northern blot analysis shows that the NTPase MRNA is strongly expressed in etiolated plumules, but only poorly or not at all in the leaf and stem tissues of light-grown plants. Accumulation of NTPase mRNA in etiolated seedlings is stimulated by brief treatments with both red and far-red light, as is characteristic of very low-fluence phytochrome responses. Southern blotting with pea genomic DNA indicates the NTPase is likely to be encoded by a single gene.

  1. Metagenomics of rumen bacteriophage from thirteen lactating dairy cattle

    PubMed Central

    2013-01-01

    Background The bovine rumen hosts a diverse and complex community of Eukarya, Bacteria, Archea and viruses (including bacteriophage). The rumen viral population (the rumen virome) has received little attention compared to the rumen microbial population (the rumen microbiome). We used massively parallel sequencing of virus like particles to investigate the diversity of the rumen virome in thirteen lactating Australian Holstein dairy cattle all housed in the same location, 12 of which were sampled on the same day. Results Fourteen putative viral sequence fragments over 30 Kbp in length were assembled and annotated. Many of the putative genes in the assembled contigs showed no homology to previously annotated genes, highlighting the large amount of work still required to fully annotate the functions encoded in viral genomes. The abundance of the contig sequences varied widely between animals, even though the cattle were of the same age, stage of lactation and fed the same diets. Additionally the twelve animals which were co-habited shared a number of their dominant viral contigs. We compared the functional characteristics of our bovine viromes with that of other viromes, as well as rumen microbiomes. At the functional level, we found strong similarities between all of the viral samples, which were highly distinct from the rumen microbiome samples. Conclusions Our findings suggest a large amount of between animal variation in the bovine rumen virome and that co-habiting animals may have more similar viromes than non co-habited animals. We report the deepest sequencing to date of the rumen virome. This work highlights the enormous amount of novelty and variation present in the rumen virome. PMID:24180266

  2. Simplifying complex sequence information: a PCP-consensus protein binds antibodies against all four Dengue serotypes.

    PubMed

    Bowen, David M; Lewis, Jessica A; Lu, Wenzhe; Schein, Catherine H

    2012-09-14

    Designing proteins that reflect the natural variability of a pathogen is essential for developing novel vaccines and drugs. Flaviviruses, including Dengue (DENV) and West Nile (WNV), evolve rapidly and can "escape" neutralizing monoclonal antibodies by mutation. Designing antigens that represent many distinct strains is important for DENV, where infection with a strain from one of the four serotypes may lead to severe hemorrhagic disease on subsequent infection with a strain from another serotype. Here, a DENV physicochemical property (PCP)-consensus sequence was derived from 671 unique sequences from the Flavitrack database. PCP-consensus proteins for domain 3 of the envelope protein (EdomIII) were expressed from synthetic genes in Escherichia coli. The ability of the purified consensus proteins to bind polyclonal antibodies generated in response to infection with strains from each of the four DENV serotypes was determined. The initial consensus protein bound antibodies from DENV-1-3 in ELISA and Western blot assays. This sequence was altered in 3 steps to incorporate regions of maximum variability, identified as significant changes in the PCPs, characteristic of DENV-4 strains. The final protein was recognized by antibodies against all four serotypes. Two amino acids essential for efficient binding to all DENV antibodies are part of a discontinuous epitope previously defined for a neutralizing monoclonal antibody. The PCP-consensus method can significantly reduce the number of experiments required to define a multivalent antigen, which is particularly important when dealing with pathogens that must be tested at higher biosafety levels. Copyright © 2012 Elsevier Ltd. All rights reserved.

  3. TaqMan Real-Time PCR Assays To Assess Arbuscular Mycorrhizal Responses to Field Manipulation of Grassland Biodiversity: Effects of Soil Characteristics, Plant Species Richness, and Functional Traits▿ †

    PubMed Central

    König, Stephan; Wubet, Tesfaye; Dormann, Carsten F.; Hempel, Stefan; Renker, Carsten; Buscot, François

    2010-01-01

    Large-scale (temporal and/or spatial) molecular investigations of the diversity and distribution of arbuscular mycorrhizal fungi (AMF) require considerable sampling efforts and high-throughput analysis. To facilitate such efforts, we have developed a TaqMan real-time PCR assay to detect and identify AMF in environmental samples. First, we screened the diversity in clone libraries, generated by nested PCR, of the nuclear ribosomal DNA internal transcribed spacer (ITS) of AMF in environmental samples. We then generated probes and forward primers based on the detected sequences, enabling AMF sequence type-specific detection in TaqMan multiplex real-time PCR assays. In comparisons to conventional clone library screening and Sanger sequencing, the TaqMan assay approach provided similar accuracy but higher sensitivity with cost and time savings. The TaqMan assays were applied to analyze the AMF community composition within plots of a large-scale plant biodiversity manipulation experiment, the Jena Experiment, primarily designed to investigate the interactive effects of plant biodiversity on element cycling and trophic interactions. The results show that environmental variables hierarchically shape AMF communities and that the sequence type spectrum is strongly affected by previous land use and disturbance, which appears to favor disturbance-tolerant members of the genus Glomus. The AMF species richness of disturbance-associated communities can be largely explained by richness of plant species and plant functional groups, while plant productivity and soil parameters appear to have only weak effects on the AMF community. PMID:20418424

  4. Unique Phylogenetic Lineage Found in the Fusarium-like Clade after Re-examining BCCM/IHEM Fungal Culture Collection Material

    PubMed Central

    De Cremer, Koen; Piérard, Denis; Hendrickx, Marijke

    2016-01-01

    Recently, the Fusarium genus has been narrowed based upon phylogenetic analyses and a Fusarium-like clade was adopted. The few species of the Fusarium-like clade were moved to new, re-installed or existing genera or provisionally retained as "Fusarium." Only a limited number of reference strains and DNA marker sequences are available for this clade and not much is known about its actual species diversity. Here, we report six strains, preserved by the Belgian fungal culture collection BCCM/IHEM as a Fusarium species, that belong to the Fusarium-like clade. They showed a slow growth and produced pionnotes, typical morphological characteristics of many Fusarium-like species. Multilocus sequencing with comparative sequence analyses in GenBank and phylogenetic analyses, using reference sequences of type material, confirmed that they were indeed member of the Fusarium-like clade. One strain was identified as "Fusarium" ciliatum whereas another strain was identified as Fusicolla merismoides. The four remaining strains were shown to represent a unique phylogenetic lineage in the Fusarium-like clade and were also found morphologically distinct from other members of the Fusarium-like clade. Based upon phylogenetic considerations, a new genus, Pseudofusicolla gen. nov., and a new species, Pseudofusicolla belgica sp. nov., were installed for this lineage. A formal description is provided in this study. Additional sampling will be required to gather isolates other than the historical strains presented in the present study as well as to further reveal the actual species diversity in the Fusarium-like clade. PMID:27790062

  5. Efficient burst image compression using H.265/HEVC

    NASA Astrophysics Data System (ADS)

    Roodaki-Lavasani, Hoda; Lainema, Jani

    2014-02-01

    New imaging use cases are emerging as more powerful camera hardware is entering consumer markets. One family of such use cases is based on capturing multiple pictures instead of just one when taking a photograph. That kind of a camera operation allows e.g. selecting the most successful shot from a sequence of images, showing what happened right before or after the shot was taken or combining the shots by computational means to improve either visible characteristics of the picture (such as dynamic range or focus) or the artistic aspects of the photo (e.g. by superimposing pictures on top of each other). Considering that photographic images are typically of high resolution and quality and the fact that these kind of image bursts can consist of at least tens of individual pictures, an efficient compression algorithm is desired. However, traditional video coding approaches fail to provide the random access properties these use cases require to achieve near-instantaneous access to the pictures in the coded sequence. That feature is critical to allow users to browse the pictures in an arbitrary order or imaging algorithms to extract desired pictures from the sequence quickly. This paper proposes coding structures that provide such random access properties while achieving coding efficiency superior to existing image coders. The results indicate that using HEVC video codec with a single reference picture fixed for the whole sequence can achieve nearly as good compression as traditional IPPP coding structures. It is also shown that the selection of the reference frame can further improve the coding efficiency.

  6. Culture-Independent Analysis of Aerosol Microbiology in a Metropolitan Subway System

    PubMed Central

    Robertson, Charles E.; Baumgartner, Laura K.; Harris, J. Kirk; Peterson, Kristen L.; Stevens, Mark J.; Frank, Daniel N.

    2013-01-01

    The goal of this study was to determine the composition and diversity of microorganisms associated with bioaerosols in a heavily trafficked metropolitan subway environment. We collected bioaerosols by fluid impingement on several New York City subway platforms and associated sites in three sampling sessions over a 1.5-year period. The types and quantities of aerosolized microorganisms were determined by culture-independent phylogenetic analysis of small-subunit rRNA gene sequences by using both Sanger (universal) and pyrosequencing (bacterial) technologies. Overall, the subway bacterial composition was relatively simple; only 26 taxonomic families made up ∼75% of the sequences determined. The microbiology was more or less similar throughout the system and with time and was most similar to outdoor air, consistent with highly efficient air mixing in the system. Identifiable bacterial sequences indicated that the subway aerosol assemblage was composed of a mixture of genera and species characteristic of soil, environmental water, and human skin commensal bacteria. Eukaryotic diversity was mainly fungal, dominated by organisms of types associated with wood rot. Human skin bacterial species (at 99% rRNA sequence identity) included the Staphylococcus spp. Staphylococcus epidermidis (the most abundant and prevalent commensal of the human integument), S. hominis, S. cohnii, S. caprae, and S. haemolyticus, all well-documented human commensal bacteria. We encountered no organisms of public health concern. This study is the most extensive culture-independent survey of subway microbiota so far and puts in place pre-event information required for any bioterrorism surveillance activities or monitoring of the microbiological impact of recent subway flooding events. PMID:23542619

  7. Culture-independent analysis of aerosol microbiology in a metropolitan subway system.

    PubMed

    Robertson, Charles E; Baumgartner, Laura K; Harris, J Kirk; Peterson, Kristen L; Stevens, Mark J; Frank, Daniel N; Pace, Norman R

    2013-06-01

    The goal of this study was to determine the composition and diversity of microorganisms associated with bioaerosols in a heavily trafficked metropolitan subway environment. We collected bioaerosols by fluid impingement on several New York City subway platforms and associated sites in three sampling sessions over a 1.5-year period. The types and quantities of aerosolized microorganisms were determined by culture-independent phylogenetic analysis of small-subunit rRNA gene sequences by using both Sanger (universal) and pyrosequencing (bacterial) technologies. Overall, the subway bacterial composition was relatively simple; only 26 taxonomic families made up ~75% of the sequences determined. The microbiology was more or less similar throughout the system and with time and was most similar to outdoor air, consistent with highly efficient air mixing in the system. Identifiable bacterial sequences indicated that the subway aerosol assemblage was composed of a mixture of genera and species characteristic of soil, environmental water, and human skin commensal bacteria. Eukaryotic diversity was mainly fungal, dominated by organisms of types associated with wood rot. Human skin bacterial species (at 99% rRNA sequence identity) included the Staphylococcus spp. Staphylococcus epidermidis (the most abundant and prevalent commensal of the human integument), S. hominis, S. cohnii, S. caprae, and S. haemolyticus, all well-documented human commensal bacteria. We encountered no organisms of public health concern. This study is the most extensive culture-independent survey of subway microbiota so far and puts in place pre-event information required for any bioterrorism surveillance activities or monitoring of the microbiological impact of recent subway flooding events.

  8. Driving on the surface of Mars with the rover sequencing and visualization program

    NASA Technical Reports Server (NTRS)

    Wright, J.; Hartman, F.; Cooper, B.; Maxwell, S.; Yen, J.; Morrison, J.

    2005-01-01

    Operating a rover on Mars is not possible using teleoperations due to the distance involved and the bandwith limitations. To operate these rovers requires sophisticated tools to make operators knowledgeable of the terrain, hazards, features of interest, and rover state and limitations, and to support building command sequences and rehearsing expected operations. This paper discusses how the Rover Sequencing and Visualization program and a small set of associated tools support this requirement.

  9. Increasing Success Rates in Developmental Math: The Complementary Role of Individual and Institutional Characteristics

    ERIC Educational Resources Information Center

    Fong, Kristen E.; Melguizo, Tatiana; Prather, George

    2015-01-01

    This study tracks students' progression through developmental math sequences and defines progression as both attempting and passing each level of the sequence. A model of successful progression in developmental education was built utilizing individual-, institutional-, and developmental math-level factors. Employing step-wise logistic regression…

  10. Function-Based Algorithms for Biological Sequences

    ERIC Educational Resources Information Center

    Mohanty, Pragyan Sheela P.

    2015-01-01

    Two problems at two different abstraction levels of computational biology are studied. At the molecular level, efficient pattern matching algorithms in DNA sequences are presented. For gene order data, an efficient data structure is presented capable of storing all gene re-orderings in a systematic manner. A common characteristic of presented…

  11. The Genome Sequence of a Type ST239 Methicillin-Resistant Staphylococcus aureus Isolate from a Malaysian Hospital

    PubMed Central

    Lee, LS; Teh, LK; Zainuddin, ZF; Salleh, MZ

    2014-01-01

    We report the genome sequence of a healthcare-associated MRSA type ST239 clone isolated from a patient with septicemia in Malaysia. This clone typifies the characteristics of ST239 lineage, including resistance to multiple antibiotics and antiseptics. PMID:25197474

  12. A survey and evaluations of histogram-based statistics in alignment-free sequence comparison.

    PubMed

    Luczak, Brian B; James, Benjamin T; Girgis, Hani Z

    2017-12-06

    Since the dawn of the bioinformatics field, sequence alignment scores have been the main method for comparing sequences. However, alignment algorithms are quadratic, requiring long execution time. As alternatives, scientists have developed tens of alignment-free statistics for measuring the similarity between two sequences. We surveyed tens of alignment-free k-mer statistics. Additionally, we evaluated 33 statistics and multiplicative combinations between the statistics and/or their squares. These statistics are calculated on two k-mer histograms representing two sequences. Our evaluations using global alignment scores revealed that the majority of the statistics are sensitive and capable of finding similar sequences to a query sequence. Therefore, any of these statistics can filter out dissimilar sequences quickly. Further, we observed that multiplicative combinations of the statistics are highly correlated with the identity score. Furthermore, combinations involving sequence length difference or Earth Mover's distance, which takes the length difference into account, are always among the highest correlated paired statistics with identity scores. Similarly, paired statistics including length difference or Earth Mover's distance are among the best performers in finding the K-closest sequences. Interestingly, similar performance can be obtained using histograms of shorter words, resulting in reducing the memory requirement and increasing the speed remarkably. Moreover, we found that simple single statistics are sufficient for processing next-generation sequencing reads and for applications relying on local alignment. Finally, we measured the time requirement of each statistic. The survey and the evaluations will help scientists with identifying efficient alternatives to the costly alignment algorithm, saving thousands of computational hours. The source code of the benchmarking tool is available as Supplementary Materials. © The Author 2017. Published by Oxford University Press.

  13. Quantizing and sampling considerations in digital phased-locked loops

    NASA Technical Reports Server (NTRS)

    Hurst, G. T.; Gupta, S. C.

    1974-01-01

    The quantizer problem is first considered. The conditions under which the uniform white sequence model for the quantizer error is valid are established independent of the sampling rate. An equivalent spectral density is defined for the quantizer error resulting in an effective SNR value. This effective SNR may be used to determine quantized performance from infinitely fine quantized results. Attention is given to sampling rate considerations. Sampling rate characteristics of the digital phase-locked loop (DPLL) structure are investigated for the infinitely fine quantized system. The predicted phase error variance equation is examined as a function of the sampling rate. Simulation results are presented and a method is described which enables the minimum required sampling rate to be determined from the predicted phase error variance equations.

  14. Energy balance of stellar coronae. III - Effect of stellar mass and radius

    NASA Technical Reports Server (NTRS)

    Hammer, R.

    1984-01-01

    A homologous transformation is derived which permits the application of the numerical coronal models of Hammer from a star with solar mass and radius to other stars. This scaling requires a few approximations concerning the lower boundary conditions and the temperature dependence of the conductivity and emissivity. These approximations are discussed and found to be surprisingly mild. Therefore, the scaling of the coronal models to other stars is rather accurate; it is found to be particularly accurate for main-sequence stars. The transformation is used to derive an equation that gives the maximum temperature of open coronal regions as a function of stellar mass and radius, the coronal heating flux, and the characteristic damping length over which the corona is heated.

  15. Biochemical characteristics of a free cyanide and total nitrogen assimilating Fusarium oxysporum EKT01/02 isolate from cyanide contaminated soil.

    PubMed

    Akinpelu, Enoch A; Adetunji, Adewole T; Ntwampe, Seteno K O; Nchu, Felix; Mekuto, Lukhanyo

    2017-10-01

    Sustainability of nutrient requirements for microbial proliferation on a large scale is a challenge in bioremediation processes. This article presents data on biochemical properties of a free cyanide resistant and total nitrogen assimilating fungal isolate from the rhizosphere of Zea mays (maize) growing in soil contaminated with a cyanide-based pesticide. DNA extracted from this isolate were PCR amplified using universal primers; TEF1-α and ITS. The raw sequence files are available on the NCBI database. Characterisation using biochemical data was obtained using colorimetric reagents analysed with VITEK ® 2 software version 7.01. The data will be informative in selection of biocatalyst for environmental engineering application.

  16. Modeling Explosion Induced Aftershocks

    NASA Astrophysics Data System (ADS)

    Kroll, K.; Ford, S. R.; Pitarka, A.; Walter, W. R.; Richards-Dinger, K. B.

    2017-12-01

    Many traditional earthquake-explosion discrimination tools are based on properties of the seismic waveform or their spectral components. Common discrimination methods include estimates of body wave amplitude ratios, surface wave magnitude scaling, moment tensor characteristics, and depth. Such methods are limited by station coverage and noise. Ford and Walter (2010) proposed an alternate discrimination method based on using properties of aftershock sequences as a means of earthquakeexplosion differentiation. Previous studies have shown that explosion sources produce fewer aftershocks that are generally smaller in magnitude compared to aftershocks of similarly sized earthquake sources (Jarpe et al., 1994, Ford and Walter, 2010). It has also been suggested that the explosion-induced aftershocks have smaller Gutenberg- Richter b-values (Ryall and Savage, 1969) and that their rates decay faster than a typical Omori-like sequence (Gross, 1996). To discern whether these observations are generally true of explosions or are related to specific site conditions (e.g. explosion proximity to active faults, tectonic setting, crustal stress magnitudes) would require a thorough global analysis. Such a study, however, is hindered both by lack of evenly distributed explosion-sources and the availability of global seismicity data. Here, we employ two methods to test the efficacy of explosions at triggering aftershocks under a variety of physical conditions. First, we use the earthquake rate equations from Dieterich (1994) to compute the rate of aftershocks related to an explosion source assuming a simple spring-slider model. We compare seismicity rates computed with these analytical solutions to those produced by the 3D, multi-cycle earthquake simulator, RSQSim. We explore the relationship between geological conditions and the characteristics of the resulting explosion-induced aftershock sequence. We also test hypothesis that aftershock generation is dependent upon the frequency content of the passing dynamic seismic waves as suggested by Parsons and Velasco (2009). Lastly, we compare all results of explosion-induced aftershocks with aftershocks generated by similarly sized earthquake sources. Prepared by LLNL under Contract DE-AC52-07NA27344.

  17. Testudinibacter aquarius gen. nov., sp. nov., a member of the family Pasteurellaceae isolated from the oral cavity of freshwater turtles.

    PubMed

    Hansen, Mie Johanne; Pennanen, Elin Anna Erica; Bojesen, Anders Miki; Christensen, Henrik; Bertelsen, Mads Frost

    2016-02-01

    A total of 13 Pasteurellaceae isolates from healthy freshwater turtles were characterized by genotypic and phenotypic tests. Phylogenetic analysis of partial 16S rRNA and rpoB gene sequences showed that the isolates investigated formed a monophyletic group. The closest related species based on 16S rRNA gene sequencing was Chelonobacter oris CCUG 55632T with 94.4 % similarity and the closest related species based on rpoB gene sequence comparison was [Pasteurella] testudinis CCUG 19802T with 91.5 % similarity. All the investigated isolates exhibited phenotypic characteristics of the family Pasteurellaceae. However, they could be separated from existing genera of the Pasteurellaceae by the following test results: indole, ornithine decarboxylase and Voges-Proskauer positive; and methyl red, urease and PNPG (α-glucosidase) negative. No X- or V-factor requirement was observed. A zone of β-haemolysis surrounded the colonies after 24 h of incubation on bovine blood agar at 37 °C. Acid was produced from l-arabinose, dulcitol, d-mannitol, sucrose and trehalose. Representative strain ELNT2xT had a fatty acid profile that was characteristic for members of the Pasteurellaceae. ELNT2xT expressed only one respiratory quinone, ubiquinone-8 (100 %). The DNA G+C content of strain ELNT2xT was 42.8 mol%. On the basis of both phylogenetic and phenotypic evidence, it is proposed that the strains should be classified as representatives of a novel species of a new genus, Testudinibacter aquarius gen. nov., sp. nov. The type strain of Testudinibacter aquarius is ELNT2xT ( = CCUG 65146T = DSM 28140T), which was isolated from the oral cavity of a captive eastern long-necked turtle (Chelodina longicollis) in Denmark in 2012.

  18. "Hook"-calibration of GeneChip-microarrays: theory and algorithm.

    PubMed

    Binder, Hans; Preibisch, Stephan

    2008-08-29

    : The improvement of microarray calibration methods is an essential prerequisite for quantitative expression analysis. This issue requires the formulation of an appropriate model describing the basic relationship between the probe intensity and the specific transcript concentration in a complex environment of competing interactions, the estimation of the magnitude these effects and their correction using the intensity information of a given chip and, finally the development of practicable algorithms which judge the quality of a particular hybridization and estimate the expression degree from the intensity values. : We present the so-called hook-calibration method which co-processes the log-difference (delta) and -sum (sigma) of the perfect match (PM) and mismatch (MM) probe-intensities. The MM probes are utilized as an internal reference which is subjected to the same hybridization law as the PM, however with modified characteristics. After sequence-specific affinity correction the method fits the Langmuir-adsorption model to the smoothed delta-versus-sigma plot. The geometrical dimensions of this so-called hook-curve characterize the particular hybridization in terms of simple geometric parameters which provide information about the mean non-specific background intensity, the saturation value, the mean PM/MM-sensitivity gain and the fraction of absent probes. This graphical summary spans a metrics system for expression estimates in natural units such as the mean binding constants and the occupancy of the probe spots. The method is single-chip based, i.e. it separately uses the intensities for each selected chip. : The hook-method corrects the raw intensities for the non-specific background hybridization in a sequence-specific manner, for the potential saturation of the probe-spots with bound transcripts and for the sequence-specific binding of specific transcripts. The obtained chip characteristics in combination with the sensitivity corrected probe-intensity values provide expression estimates scaled in natural units which are given by the binding constants of the particular hybridization.

  19. Idiopathic and diabetic skeletal muscle necrosis: evaluation by magnetic resonance imaging.

    PubMed

    Kattapuram, Taj M; Suri, Rajeev; Rosol, Michael S; Rosenberg, Andrew E; Kattapuram, Susan V

    2005-04-01

    Idiopathic and diabetic-associated muscle necrosis are similar, uncommon clinical entities requiring conservative management and minimal intervention to avoid complications and prolonged hospitalization. An early noninvasive diagnosis is therefore essential. We evaluated the magnetic resonance imaging (MRI) characteristics of muscle necrosis in 14 patients, in eight of whom the diagnoses were confirmed histologically. Two experienced musculoskeletal radiologists performed retrospective evaluations of the MRI studies of 14 patients with the diagnoses of skeletal muscle infarction. In 10 cases gadolinium-enhanced (T1-weighted fat-suppressed) sequences were available along with T1-weighted, T2-weighted images and STIR sequences, while in four cases contrast-enhanced images were not available. Eight patients had underlying diabetes and in six patients the cause of the myonecrosis was considered idiopathic. T1-weighted images demonstrated isointense swelling of the involved muscle, with mildly displaced fascial planes. There was effacement of the fat signal intensity within the muscle. Fat-suppressed T2-weighted images showed diffuse heterogeneous high signal intensity in the muscles suggestive of edema. Perifascial fluid collection was seen in eight cases. Subcutaneous edema was present in seven patients. Following intravenous gadolinium administration, MRI demonstrated a focal area of heterogeneously enhancing mass with peripheral enhancement. Within this focal lesion, linear dark areas were seen with serpentine enhancing streaks separating them in eight cases. In two cases, a central relatively nonenhancing mass with irregular margins and peripheral enhancement was noted. The peripheral enhancement involved a significant part of the muscle. No focal fluid collection was noted. We believe that the constellation of imaging findings on T1- and T2-weighted images and post-gadolinium sequences is highly suggestive of muscle necrosis. We consider certain specific findings on gadolinium-enhanced images to be characteristic. The findings reported here should provide radiologists with useful information in making the diagnosis of skeletal muscle necrosis without resorting to invasive procedures.

  20. Sustainable Design of EPA's Campus in Research Triangle Park, NC—Environmental Performance Specifications in Construction Contracts—Section 01450 Sequence of Finishes Installation

    EPA Pesticide Factsheets

    Learn more about the special construction scheduling/sequencing requirements and procedures necessary to assure achievement of designed Indoor Air Quality (IAQ) levels for the completed project required by the EPA IAQ Program.

  1. The influence of food consistency on chewing rate and muscular work.

    PubMed

    van der Bilt, A; Abbink, J H

    2017-11-01

    Food properties influence the parameters of the masticatory process, such as jaw movement, muscle activity and chewing rate. Firm foods will require more muscle activity than softer foods. However, the influence of food hardness on chewing rate is ambiguous as both slower and higher chewing rates have been reported for harder foods. Rheological characteristics of the food, such as plasticity and elasticity, may help to explain differences in chewing rate. The aim of our study was to determine the influence of food properties on chewing rate and muscular work in five phases of a chewing sequence. Eighty-four participants chewed on five foods, which strongly differed in consistency. Chewing gum was used as a reference food. The phase in the chewing sequence had a large significant effect on cycle duration for the five foods. A significant decrease in cycle duration at the beginning of chewing was followed by an increase in later phases, leading to U-shaped curves. Food type had a small effect on the average cycle duration. However, large significant differences in cycle duration were observed between the foods at the beginning of a chewing sequence. In that phase, the firm foods were chewed much slower than the soft foods. Muscular work was significantly influenced by both chewing phase and food type. Copyright © 2017 Elsevier Ltd. All rights reserved.

  2. Questioning short-term memory and its measurement: Why digit span measures long-term associative learning.

    PubMed

    Jones, Gary; Macken, Bill

    2015-11-01

    Traditional accounts of verbal short-term memory explain differences in performance for different types of verbal material by reference to inherent characteristics of the verbal items making up memory sequences. The role of previous experience with sequences of different types is ostensibly controlled for either by deliberate exclusion or by presenting multiple trials constructed from different random permutations. We cast doubt on this general approach in a detailed analysis of the basis for the robust finding that short-term memory for digit sequences is superior to that for other sequences of verbal material. Specifically, we show across four experiments that this advantage is not due to inherent characteristics of digits as verbal items, nor are individual digits within sequences better remembered than other types of individual verbal items. Rather, the advantage for digit sequences stems from the increased frequency, compared to other verbal material, with which digits appear in random sequences in natural language, and furthermore, relatively frequent digit sequences support better short-term serial recall than less frequent ones. We also provide corpus-based computational support for the argument that performance in a short-term memory setting is a function of basic associative learning processes operating on the linguistic experience of the rememberer. The experimental and computational results raise questions not only about the role played by measurement of digit span in cognition generally, but also about the way in which long-term memory processes impact on short-term memory functioning. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.

  3. The genome sequence and transcriptome of Potentilla micrantha and their comparison to Fragaria vesca (the woodland strawberry)

    PubMed Central

    Moretto, Marco; Barghini, Elena; Mascagni, Flavia; Natali, Lucia; Brilli, Matteo; Lomsadze, Alexandre; Sonego, Paolo; Giongo, Lara; Alonge, Michael; Velasco, Riccardo; Varotto, Claudio; Šurbanovski, Nada; Borodovsky, Mark; Ward, Judson A; Engelen, Kristof; Cavallini, Andrea; Cestaro, Alessandro

    2018-01-01

    Abstract Background The genus Potentilla is closely related to that of Fragaria, the economically important strawberry genus. Potentilla micrantha is a species that does not develop berries but shares numerous morphological and ecological characteristics with Fragaria vesca. These similarities make P. micrantha an attractive choice for comparative genomics studies with F. vesca. Findings In this study, the P. micrantha genome was sequenced and annotated, and RNA-Seq data from the different developmental stages of flowering and fruiting were used to develop a set of gene predictions. A 327 Mbp sequence and annotation of the genome of P. micrantha, spanning 2674 sequence contigs, with an N50 size of 335,712, estimated to cover 80% of the total genome size of the species was developed. The genus Potentilla has a characteristically larger genome size than Fragaria, but the recovered sequence scaffolds were remarkably collinear at the micro-syntenic level with the genome of F. vesca, its closest sequenced relative. A total of 33,602 genes were predicted, and 95.1% of bench-marking universal single-copy orthologous genes were complete within the presented sequence. Thus, we argue that the majority of the gene-rich regions of the genome have been sequenced. Conclusions Comparisons of RNA-Seq data from the stages of floral and fruit development revealed genes differentially expressed between P. micrantha and F. vesca.The data presented are a valuable resource for future studies of berry development in Fragaria and the Rosaceae and they also shed light on the evolution of genome size and organization in this family. PMID:29659812

  4. The genome sequence and transcriptome of Potentilla micrantha and their comparison to Fragaria vesca (the woodland strawberry).

    PubMed

    Buti, Matteo; Moretto, Marco; Barghini, Elena; Mascagni, Flavia; Natali, Lucia; Brilli, Matteo; Lomsadze, Alexandre; Sonego, Paolo; Giongo, Lara; Alonge, Michael; Velasco, Riccardo; Varotto, Claudio; Šurbanovski, Nada; Borodovsky, Mark; Ward, Judson A; Engelen, Kristof; Cavallini, Andrea; Cestaro, Alessandro; Sargent, Daniel James

    2018-04-01

    The genus Potentilla is closely related to that of Fragaria, the economically important strawberry genus. Potentilla micrantha is a species that does not develop berries but shares numerous morphological and ecological characteristics with Fragaria vesca. These similarities make P. micrantha an attractive choice for comparative genomics studies with F. vesca. In this study, the P. micrantha genome was sequenced and annotated, and RNA-Seq data from the different developmental stages of flowering and fruiting were used to develop a set of gene predictions. A 327 Mbp sequence and annotation of the genome of P. micrantha, spanning 2674 sequence contigs, with an N50 size of 335,712, estimated to cover 80% of the total genome size of the species was developed. The genus Potentilla has a characteristically larger genome size than Fragaria, but the recovered sequence scaffolds were remarkably collinear at the micro-syntenic level with the genome of F. vesca, its closest sequenced relative. A total of 33,602 genes were predicted, and 95.1% of bench-marking universal single-copy orthologous genes were complete within the presented sequence. Thus, we argue that the majority of the gene-rich regions of the genome have been sequenced. Comparisons of RNA-Seq data from the stages of floral and fruit development revealed genes differentially expressed between P. micrantha and F. vesca.The data presented are a valuable resource for future studies of berry development in Fragaria and the Rosaceae and they also shed light on the evolution of genome size and organization in this family.

  5. Drinking from the Fire Hose: Why the Flight Management System Can Be Hard to Train and Difficult to Use

    NASA Technical Reports Server (NTRS)

    Sherry, Lance; Feary, Michael; Polson, Peter; Fennell, Karl

    2003-01-01

    The Flight Management Computer (FMC) and its interface, the Multi-function Control and Display Unit (MCDU) have been identified by researchers and airlines as difficult to train and use. Specifically, airline pilots have described the "drinking from the fire-hose" effect during training. Previous research has identified memorized action sequences as a major factor in a user s ability to learn and operate complex devices. This paper discusses the use of a method to examine the quantity of memorized action sequences required to perform a sample of 102 tasks, using features of the Boeing 777 Flight Management Computer Interface. The analysis identified a large number of memorized action sequences that must be learned during training and then recalled during line operations. Seventy-five percent of the tasks examined require recall of at least one memorized action sequence. Forty-five percent of the tasks require recall of a memorized action sequence and occur infrequently. The large number of memorized action sequences may provide an explanation for the difficulties in training and usage of the automation. Based on these findings, implications for training and the design of new user-interfaces are discussed.

  6. Identification of an EMS-induced causal mutation in a gene required for boron-mediated root development by low-coverage genome re-sequencing in Arabidopsis

    PubMed Central

    Tabata, Ryo; Kamiya, Takehiro; Shigenobu, Shuji; Yamaguchi, Katsushi; Yamada, Masashi; Hasebe, Mitsuyasu; Fujiwara, Toru; Sawa, Shinichiro

    2013-01-01

    Next-generation sequencing (NGS) technologies enable the rapid production of an enormous quantity of sequence data. These powerful new technologies allow the identification of mutations by whole-genome sequencing. However, most reported NGS-based mapping methods, which are based on bulked segregant analysis, are costly and laborious. To address these limitations, we designed a versatile NGS-based mapping method that consists of a combination of low- to medium-coverage multiplex SOLiD (Sequencing by Oligonucleotide Ligation and Detection) and classical genetic rough mapping. Using only low to medium coverage reduces the SOLiD sequencing costs and, since just 10 to 20 mutant F2 plants are required for rough mapping, the operation is simple enough to handle in a laboratory with limited space and funding. As a proof of principle, we successfully applied this method to identify the CTR1, which is involved in boron-mediated root development, from among a population of high boron requiring Arabidopsis thaliana mutants. Our work demonstrates that this NGS-based mapping method is a moderately priced and versatile method that can readily be applied to other model organisms. PMID:23104114

  7. 40 CFR 92.124 - Test sequence; general requirements.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    .... (e) Pre-test engine measurements (e.g., idle and throttle notch speeds, fuel flows, etc.), pre-test engine performance checks (e.g., verification of engine power, etc.) and pre-test system calibrations (e... 40 Protection of Environment 21 2012-07-01 2012-07-01 false Test sequence; general requirements...

  8. 40 CFR 92.124 - Test sequence; general requirements.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    .... (e) Pre-test engine measurements (e.g., idle and throttle notch speeds, fuel flows, etc.), pre-test engine performance checks (e.g., verification of engine power, etc.) and pre-test system calibrations (e... 40 Protection of Environment 20 2014-07-01 2013-07-01 true Test sequence; general requirements. 92...

  9. 40 CFR 92.124 - Test sequence; general requirements.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    .... (e) Pre-test engine measurements (e.g., idle and throttle notch speeds, fuel flows, etc.), pre-test engine performance checks (e.g., verification of engine power, etc.) and pre-test system calibrations (e... 40 Protection of Environment 20 2010-07-01 2010-07-01 false Test sequence; general requirements...

  10. 40 CFR 92.124 - Test sequence; general requirements.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    .... (e) Pre-test engine measurements (e.g., idle and throttle notch speeds, fuel flows, etc.), pre-test engine performance checks (e.g., verification of engine power, etc.) and pre-test system calibrations (e... 40 Protection of Environment 21 2013-07-01 2013-07-01 false Test sequence; general requirements...

  11. 40 CFR 92.124 - Test sequence; general requirements.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    .... (e) Pre-test engine measurements (e.g., idle and throttle notch speeds, fuel flows, etc.), pre-test engine performance checks (e.g., verification of engine power, etc.) and pre-test system calibrations (e... 40 Protection of Environment 20 2011-07-01 2011-07-01 false Test sequence; general requirements...

  12. Neural-network-designed pulse sequences for robust control of singlet-triplet qubits

    NASA Astrophysics Data System (ADS)

    Yang, Xu-Chen; Yung, Man-Hong; Wang, Xin

    2018-04-01

    Composite pulses are essential for universal manipulation of singlet-triplet spin qubits. In the absence of noise, they are required to perform arbitrary single-qubit operations due to the special control constraint of a singlet-triplet qubit, while in a noisy environment, more complicated sequences have been developed to dynamically correct the error. Tailoring these sequences typically requires numerically solving a set of nonlinear equations. Here we demonstrate that these pulse sequences can be generated by a well-trained, double-layer neural network. For sequences designed for the noise-free case, the trained neural network is capable of producing almost exactly the same pulses known in the literature. For more complicated noise-correcting sequences, the neural network produces pulses with slightly different line shapes, but the robustness against noises remains comparable. These results indicate that the neural network can be a judicious and powerful alternative to existing techniques in developing pulse sequences for universal fault-tolerant quantum computation.

  13. Elimination sequence optimization for SPAR

    NASA Technical Reports Server (NTRS)

    Hogan, Harry A.

    1986-01-01

    SPAR is a large-scale computer program for finite element structural analysis. The program allows user specification of the order in which the joints of a structure are to be eliminated since this order can have significant influence over solution performance, in terms of both storage requirements and computer time. An efficient elimination sequence can improve performance by over 50% for some problems. Obtaining such sequences, however, requires the expertise of an experienced user and can take hours of tedious effort to affect. Thus, an automatic elimination sequence optimizer would enhance productivity by reducing the analysts' problem definition time and by lowering computer costs. Two possible methods for automating the elimination sequence specifications were examined. Several algorithms based on the graph theory representations of sparse matrices were studied with mixed results. Significant improvement in the program performance was achieved, but sequencing by an experienced user still yields substantially better results. The initial results provide encouraging evidence that the potential benefits of such an automatic sequencer would be well worth the effort.

  14. LLNL Genomic Assessment: Viral and Bacterial Sequencing Needs for TMTI, Tier 1 Report

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Slezak, T; Borucki, M; Lenhoff, R

    2009-09-29

    The Lawrence Livermore National Lab Bioinformatics group has recently taken on a role in DTRA's Transformation Medical Technologies Initiative (TMTI). The high-level goal of TMTI is to accelerate the development of broad-spectrum countermeasures. To achieve those goals, TMTI has a near term need to obtain more sequence information across a large range of pathogens, near neighbors, and across a broad geographical and host range. Our role in this project is to research available sequence data for the organisms of interest and identify critical microbial sequence and knowledge gaps that need to be filled to meet TMTI objectives. This effort includes:more » (1) assessing current genomic sequence for each agent including phylogenetic and geographical diversity, host range, date of isolation range, virulence, sequence availability of key near neighbors, and other characteristics; (2) identifying Subject Matter Experts (SME's) and potential holders of isolate collections, contacting appropriate SME's with known expertise and isolate collections to obtain information on isolate availability and specific recommendations; (3) identifying sequence as well as knowledge gaps (eg virulence, host range, and antibiotic resistance determinants); (4) providing specific recommendations as to the most valuable strains to be placed on the DTRA sequencing queue. We acknowledge that criteria for prioritization of isolates for sequencing falls into two categories aligning with priority queues 1 and 2 as described in the summary. (Priority queue 0 relates to DTRA operational isolates whose availability is not predictable in advance.) 1. Selection of isolates that appear to have likelihood to provide information on virulence and antibiotic resistance. This will include sequence of known virulent strains. Particularly valuable would be virulent strains that have genetically similar yet avirulent, or non human transmissible, counterparts that can be used for comparison to help identify key virulence or host range genes. This approach will provide information that can be used by structural biologists to help develop therapeutics and vaccines. We have pointed out such high priority strains of which we are aware, and note that if any such isolates should be discovered, they will rise to the top priority. We anticipate difficulty locating samples with unusual resistance phenotypes, in particular. Sequencing strategies for isolates in queue 1 should aim for as complete finishing status as possible, since high-quality initial annotation (gene-calling) will be necessary for the follow-on protein structure analyses contributing to countermeasure development. Queue 2 for sequencing determination will be more dynamic than queue 1, and samples will be added to it as they become available to the TMTI program. 2. Selection of isolates that will provide broader information about diversity and phylogenetics and aid in specific detection as well as forensics. This approach focuses on sequencing of isolates that will provide better resolution of variants that are (or were) circulating in nature. The finishing strategy for queue 2 does not require complete closing with annotation. This queue is more static, as there is considerable phylogenetic data, and in this report we have sought to reveal gaps and make suggestions to fill them given existing sequence data and strain information. In this report we identify current sequencing gaps in both priority queue categories. Note that this is most applicable to the bacterial pathogens, as most viruses are by default in queue 1. The Phase I focus of this project is on viral hemorrhagic fever viruses and Category A bacterial agents as defined to us by TMTI. We have carried out individual analyses on each species of interest, and these are included as chapters in this report. Viruses and bacteria are biologically very distinct from each other and require different methods of analysis and criteria for sequencing prioritization. Therefore, we will describe our methods, analyses and conclusions separately for each category.« less

  15. Isolation and characterization of a virus infecting the freshwater algae Chrysochromulina parva

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mirza, S.F.; Staniewski, M.A.; Short, C.M.

    Water samples from Lake Ontario, Canada were tested for lytic activity against the freshwater haptophyte algae Chrysochromulina parva. A filterable lytic agent was isolated and identified as a virus via transmission electron microscopy and molecular methods. The virus, CpV-BQ1, is icosahedral, ca. 145 nm in diameter, assembled within the cytoplasm, and has a genome size of ca. 485 kb. Sequences obtained through PCR-amplification of DNA polymerase (polB) genes clustered among sequences from the family Phycodnaviridae, whereas major capsid protein (MCP) sequences clustered among sequences from either the Phycodnaviridae or Mimiviridae. Based on quantitative molecular assays, C. parva's abundance in Lakemore » Ontario was relatively stable, yet CpV-BQ1's abundance was variable suggesting complex virus-host dynamics. This study demonstrates that CpV-BQ1 is a member of the proposed order Megavirales with characteristics of both phycodnaviruses and mimiviruses indicating that, in addition to its complex ecological dynamics, it also has a complex evolutionary history. - Highlights: • A virus infecting the algae C. parva was isolated from Lake Ontario. • Virus characteristics demonstrated that this novel virus is an NCLDV. • The virus's polB sequence suggests taxonomic affiliation with the Phycodnaviridae. • The virus's capsid protein sequences also suggest Mimiviridae ancestry. • Surveys of host and virus natural abundances revealed complex host–virus dynamics.« less

  16. Identification of Cis-Acting Promoter Elements in Cold- and Dehydration-Induced Transcriptional Pathways in Arabidopsis, Rice, and Soybean

    PubMed Central

    Maruyama, Kyonoshin; Todaka, Daisuke; Mizoi, Junya; Yoshida, Takuya; Kidokoro, Satoshi; Matsukura, Satoko; Takasaki, Hironori; Sakurai, Tetsuya; Yamamoto, Yoshiharu Y.; Yoshiwara, Kyouko; Kojima, Mikiko; Sakakibara, Hitoshi; Shinozaki, Kazuo; Yamaguchi-Shinozaki, Kazuko

    2012-01-01

    The genomes of three plants, Arabidopsis (Arabidopsis thaliana), rice (Oryza sativa), and soybean (Glycine max), have been sequenced, and their many genes and promoters have been predicted. In Arabidopsis, cis-acting promoter elements involved in cold- and dehydration-responsive gene expression have been extensively analysed; however, the characteristics of such cis-acting promoter sequences in cold- and dehydration-inducible genes of rice and soybean remain to be clarified. In this study, we performed microarray analyses using the three species, and compared characteristics of identified cold- and dehydration-inducible genes. Transcription profiles of the cold- and dehydration-responsive genes were similar among these three species, showing representative upregulated (dehydrin/LEA) and downregulated (photosynthesis-related) genes. All (46 = 4096) hexamer sequences in the promoters of the three species were investigated, revealing the frequency of conserved sequences in cold- and dehydration-inducible promoters. A core sequence of the abscisic acid-responsive element (ABRE) was the most conserved in dehydration-inducible promoters of all three species, suggesting that transcriptional regulation for dehydration-inducible genes is similar among these three species, with the ABRE-dependent transcriptional pathway. In contrast, for cold-inducible promoters, the conserved hexamer sequences were diversified among these three species, suggesting the existence of diverse transcriptional regulatory pathways for cold-inducible genes among the species. PMID:22184637

  17. Shotgun Protein Sequencing with Meta-contig Assembly*

    PubMed Central

    Guthals, Adrian; Clauser, Karl R.; Bandeira, Nuno

    2012-01-01

    Full-length de novo sequencing from tandem mass (MS/MS) spectra of unknown proteins such as antibodies or proteins from organisms with unsequenced genomes remains a challenging open problem. Conventional algorithms designed to individually sequence each MS/MS spectrum are limited by incomplete peptide fragmentation or low signal to noise ratios and tend to result in short de novo sequences at low sequencing accuracy. Our shotgun protein sequencing (SPS) approach was developed to ameliorate these limitations by first finding groups of unidentified spectra from the same peptides (contigs) and then deriving a consensus de novo sequence for each assembled set of spectra (contig sequences). But whereas SPS enables much more accurate reconstruction of de novo sequences longer than can be recovered from individual MS/MS spectra, it still requires error-tolerant matching to homologous proteins to group smaller contig sequences into full-length protein sequences, thus limiting its effectiveness on sequences from poorly annotated proteins. Using low and high resolution CID and high resolution HCD MS/MS spectra, we address this limitation with a Meta-SPS algorithm designed to overlap and further assemble SPS contigs into Meta-SPS de novo contig sequences extending as long as 100 amino acids at over 97% accuracy without requiring any knowledge of homologous protein sequences. We demonstrate Meta-SPS using distinct MS/MS data sets obtained with separate enzymatic digestions and discuss how the remaining de novo sequencing limitations relate to MS/MS acquisition settings. PMID:22798278

  18. Shotgun protein sequencing with meta-contig assembly.

    PubMed

    Guthals, Adrian; Clauser, Karl R; Bandeira, Nuno

    2012-10-01

    Full-length de novo sequencing from tandem mass (MS/MS) spectra of unknown proteins such as antibodies or proteins from organisms with unsequenced genomes remains a challenging open problem. Conventional algorithms designed to individually sequence each MS/MS spectrum are limited by incomplete peptide fragmentation or low signal to noise ratios and tend to result in short de novo sequences at low sequencing accuracy. Our shotgun protein sequencing (SPS) approach was developed to ameliorate these limitations by first finding groups of unidentified spectra from the same peptides (contigs) and then deriving a consensus de novo sequence for each assembled set of spectra (contig sequences). But whereas SPS enables much more accurate reconstruction of de novo sequences longer than can be recovered from individual MS/MS spectra, it still requires error-tolerant matching to homologous proteins to group smaller contig sequences into full-length protein sequences, thus limiting its effectiveness on sequences from poorly annotated proteins. Using low and high resolution CID and high resolution HCD MS/MS spectra, we address this limitation with a Meta-SPS algorithm designed to overlap and further assemble SPS contigs into Meta-SPS de novo contig sequences extending as long as 100 amino acids at over 97% accuracy without requiring any knowledge of homologous protein sequences. We demonstrate Meta-SPS using distinct MS/MS data sets obtained with separate enzymatic digestions and discuss how the remaining de novo sequencing limitations relate to MS/MS acquisition settings.

  19. A measurement of disorder in binary sequences

    NASA Astrophysics Data System (ADS)

    Gong, Longyan; Wang, Haihong; Cheng, Weiwen; Zhao, Shengmei

    2015-03-01

    We propose a complex quantity, AL, to characterize the degree of disorder of L-length binary symbolic sequences. As examples, we respectively apply it to typical random and deterministic sequences. One kind of random sequences is generated from a periodic binary sequence and the other is generated from the logistic map. The deterministic sequences are the Fibonacci and Thue-Morse sequences. In these analyzed sequences, we find that the modulus of AL, denoted by |AL | , is a (statistically) equivalent quantity to the Boltzmann entropy, the metric entropy, the conditional block entropy and/or other quantities, so it is a useful quantitative measure of disorder. It can be as a fruitful index to discern which sequence is more disordered. Moreover, there is one and only one value of |AL | for the overall disorder characteristics. It needs extremely low computational costs. It can be easily experimentally realized. From all these mentioned, we believe that the proposed measure of disorder is a valuable complement to existing ones in symbolic sequences.

  20. A Shellcode Detection Method Based on Full Native API Sequence and Support Vector Machine

    NASA Astrophysics Data System (ADS)

    Cheng, Yixuan; Fan, Wenqing; Huang, Wei; An, Jing

    2017-09-01

    Dynamic monitoring the behavior of a program is widely used to discriminate between benign program and malware. It is usually based on the dynamic characteristics of a program, such as API call sequence or API call frequency to judge. The key innovation of this paper is to consider the full Native API sequence and use the support vector machine to detect the shellcode. We also use the Markov chain to extract and digitize Native API sequence features. Our experimental results show that the method proposed in this paper has high accuracy and low detection rate.

  1. Complete cDNA sequence and amino acid analysis of a bovine ribonuclease K6 gene.

    PubMed

    Pietrowski, D; Förster, M

    2000-01-01

    The complete cDNA sequence of a ribonuclease k6 gene of Bos Taurus has been determined. It codes for a protein with 154 amino acids and contains the invariant cysteine, histidine and lysine residues as well as the characteristic motifs specific to ribonuclease active sites. The deduced protein sequence is 27 residues longer than other known ribonucleases k6 and shows amino acids exchanges which could reflect a strain specificity or polymorphism within the bovine genome. Based on sequence similarity we have termed the identified gene bovine ribonuclease k6 b (brk6b).

  2. Evolutional dynamics of 45S and 5S ribosomal DNA in ancient allohexaploid Atropa belladonna.

    PubMed

    Volkov, Roman A; Panchuk, Irina I; Borisjuk, Nikolai V; Hosiawa-Baranska, Marta; Maluszynska, Jolanta; Hemleben, Vera

    2017-01-23

    Polyploid hybrids represent a rich natural resource to study molecular evolution of plant genes and genomes. Here, we applied a combination of karyological and molecular methods to investigate chromosomal structure, molecular organization and evolution of ribosomal DNA (rDNA) in nightshade, Atropa belladonna (fam. Solanaceae), one of the oldest known allohexaploids among flowering plants. Because of their abundance and specific molecular organization (evolutionarily conserved coding regions linked to variable intergenic spacers, IGS), 45S and 5S rDNA are widely used in plant taxonomic and evolutionary studies. Molecular cloning and nucleotide sequencing of A. belladonna 45S rDNA repeats revealed a general structure characteristic of other Solanaceae species, and a very high sequence similarity of two length variants, with the only difference in number of short IGS subrepeats. These results combined with the detection of three pairs of 45S rDNA loci on separate chromosomes, presumably inherited from both tetraploid and diploid ancestor species, example intensive sequence homogenization that led to substitution/elimination of rDNA repeats of one parent. Chromosome silver-staining revealed that only four out of six 45S rDNA sites are frequently transcriptionally active, demonstrating nucleolar dominance. For 5S rDNA, three size variants of repeats were detected, with the major class represented by repeats containing all functional IGS elements required for transcription, the intermediate size repeats containing partially deleted IGS sequences, and the short 5S repeats containing severe defects both in the IGS and coding sequences. While shorter variants demonstrate increased rate of based substitution, probably in their transition into pseudogenes, the functional 5S rDNA variants are nearly identical at the sequence level, pointing to their origin from a single parental species. Localization of the 5S rDNA genes on two chromosome pairs further supports uniparental inheritance from the tetraploid progenitor. The obtained molecular, cytogenetic and phylogenetic data demonstrate complex evolutionary dynamics of rDNA loci in allohexaploid species of Atropa belladonna. The high level of sequence unification revealed in 45S and 5S rDNA loci of this ancient hybrid species have been seemingly achieved by different molecular mechanisms.

  3. Noninvasive diagnosis of fetal aneuploidy by shotgun sequencing DNA from maternal blood

    PubMed Central

    Fan, H. Christina; Blumenfeld, Yair J.; Chitkara, Usha; Hudgins, Louanne; Quake, Stephen R.

    2008-01-01

    We directly sequenced cell-free DNA with high-throughput shotgun sequencing technology from plasma of pregnant women, obtaining, on average, 5 million sequence tags per patient sample. This enabled us to measure the over- and underrepresentation of chromosomes from an aneuploid fetus. The sequencing approach is polymorphism-independent and therefore universally applicable for the noninvasive detection of fetal aneuploidy. Using this method, we successfully identified all nine cases of trisomy 21 (Down syndrome), two cases of trisomy 18 (Edward syndrome), and one case of trisomy 13 (Patau syndrome) in a cohort of 18 normal and aneuploid pregnancies; trisomy was detected at gestational ages as early as the 14th week. Direct sequencing also allowed us to study the characteristics of cell-free plasma DNA, and we found evidence that this DNA is enriched for sequences from nucleosomes. PMID:18838674

  4. Complete genome sequence of duck Tembusu virus, isolated from Muscovy ducks in southern China.

    PubMed

    Zhu, Wanjun; Chen, Jidang; Wei, Chunya; Wang, Heng; Huang, Zhen; Zhang, Minze; Tang, Fengfeng; Xie, Jiexiong; Liang, Huanbin; Zhang, Guihong; Su, Shuo

    2012-12-01

    We report here the complete genomic sequence of the duck Tembusu virus (DTMUV) WJ-1 strain, isolated from Muscovy ducks. This is the first complete genome sequence of DTMUV reported in southern China. Compared with the other strains (TA, GH-2, YY5, and ZJ-407) that were previously found in eastern China, WJ-1 bears a few differences in the nucleotide and amino acid sequences. We found that there are 47 mutations of amino acids encoded by the whole open reading frame (ORF) among these five strains. The whole-genome sequence of DTMUV will help in understanding the epidemiology and molecular characteristics of duck Tembusu virus in southern China.

  5. A Teaching-Learning Sequence about Weather Map Reading

    ERIC Educational Resources Information Center

    Mandrikas, Achilleas; Stavrou, Dimitrios; Skordoulis, Constantine

    2017-01-01

    In this paper a teaching-learning sequence (TLS) introducing pre-service elementary teachers (PET) to weather map reading, with emphasis on wind assignment, is presented. The TLS includes activities about recognition of wind symbols, assignment of wind direction and wind speed on a weather map and identification of wind characteristics in a…

  6. Raman-based system for DNA sequencing-mapping and other separations

    DOEpatents

    Vo-Dinh, Tuan

    1994-01-01

    DNA sequencing and mapping are performed by using a Raman spectrometer with a surface enhanced Raman scattering (SERS) substrate to enhance the Raman signal. A SERS label is attached to a DNA fragment and then analyzed with the Raman spectrometer to identify the DNA fragment according to characteristics of the Raman spectrum generated.

  7. Draft Genome Sequence of Lactobacillus pobuzihii E100301T.

    PubMed

    Chiu, Chi-Ming; Chang, Chi-Huan; Pan, Shwu-Fen; Wu, Hui-Chung; Li, Shiao-Wen; Chang, Chuan-Hsiung; Lee, Yun-Shien; Chiang, Chih-Ming; Chen, Yi-Sheng

    2013-05-09

    Lactobacillus pobuzihii E100301(T) is a novel Lactobacillus species previously isolated from pobuzihi (fermented cummingcordia) in Taiwan. Phylogenetically, this strain is closest to Lactobacillus acidipiscis, but its phenotypic characteristics can be clearly distinguished from those of L. acidipiscis. We present the draft genome sequence of strain L. pobuzihii E100301(T).

  8. Genome sequence of Chinese porcine parvovirus strain PPV2010.

    PubMed

    Cui, Jin; Wang, Xin; Ren, Yudong; Cui, Shangjin; Li, Guangxing; Ren, Xiaofeng

    2012-02-01

    Porcine parvovirus (PPV) isolate PPV2010 has recently emerged in China. Herein, we analyze the complete genome sequence of PPV2010. Our results indicate that the genome of PPV2010 bears mixed characteristics of virulent PPV and vaccine strains. Importantly, PPV2010 has the potential to be a naturally attenuated candidate vaccine strain.

  9. AFTERSHOCK SEQUENCES AND CRUSTAL STRUCTURE IN THE REGION OF GREECE.

    DTIC Science & Technology

    the strain release characteristics and other properties of the aftershock and foreshock sequences (1) of all shocks of M 5.9 which have occurred in...relation between the water loading of two artificial lakes in the region of Greece and the earthquake activity in foreshocks or swarm of shocks triggered

  10. MHz-Rate NO PLIF Imaging in a Mach 10 Hypersonic Wind Tunnel

    NASA Technical Reports Server (NTRS)

    Jiang, N.; Webster, M.; Lempert, Walter R.; Miller, J. D.; Meyer, T. R.; Danehy, Paul M.

    2010-01-01

    NO PLIF imaging at repetition rates as high as 1 MHz is demonstrated in the NASA Langley 31 inch Mach 10 hypersonic wind tunnel. Approximately two hundred time correlated image sequences, of between ten and twenty individual frames, were obtained over eight days of wind tunnel testing spanning two entries in March and September of 2009. The majority of the image sequences were obtained from the boundary layer of a 20 flat plate model, in which transition was induced using a variety of cylindrical and triangular shaped protuberances. The high speed image sequences captured a variety of laminar and transitional flow phenomena, ranging from mostly laminar flow, typically at lower Reynolds number and/or in the near wall region of the model, to highly transitional flow in which the temporal evolution and progression of characteristic streak instabilities and/or corkscrew-shaped vortices could be clearly identified. A series of image sequences were also obtained from a 20 compression ramp at a 10 angle of attack in which the temporal dynamics of the characteristic separated flow was captured in a time correlated manner.

  11. UFO: a web server for ultra-fast functional profiling of whole genome protein sequences.

    PubMed

    Meinicke, Peter

    2009-09-02

    Functional profiling is a key technique to characterize and compare the functional potential of entire genomes. The estimation of profiles according to an assignment of sequences to functional categories is a computationally expensive task because it requires the comparison of all protein sequences from a genome with a usually large database of annotated sequences or sequence families. Based on machine learning techniques for Pfam domain detection, the UFO web server for ultra-fast functional profiling allows researchers to process large protein sequence collections instantaneously. Besides the frequencies of Pfam and GO categories, the user also obtains the sequence specific assignments to Pfam domain families. In addition, a comparison with existing genomes provides dissimilarity scores with respect to 821 reference proteomes. Considering the underlying UFO domain detection, the results on 206 test genomes indicate a high sensitivity of the approach. In comparison with current state-of-the-art HMMs, the runtime measurements show a considerable speed up in the range of four orders of magnitude. For an average size prokaryotic genome, the computation of a functional profile together with its comparison typically requires about 10 seconds of processing time. For the first time the UFO web server makes it possible to get a quick overview on the functional inventory of newly sequenced organisms. The genome scale comparison with a large number of precomputed profiles allows a first guess about functionally related organisms. The service is freely available and does not require user registration or specification of a valid email address.

  12. Source parameters of the 2014 Ms6.5 Ludian earthquake sequence and their implications on the seismogenic structure

    NASA Astrophysics Data System (ADS)

    Zheng, Y.

    2015-12-01

    On August 3, 2014, an Ms6.5 earthquake struck Ludian county, Zhaotong city in Yunnan province, China. Although this earthquake is not very big, it caused abnormal severe damages. Thus, study on the causes of the serious damages of this moderate strong earthquake may help us to evaluate seismic hazards for similar earthquakes. Besides the factors which directly relate to the damages, such as site effects, quality of buildings, seismogenic structures and the characteristics of the mainshock and the aftershocks may also responsible for the seismic hazards. Since focal mechanism solution and centroid depth provide key information of earthquake source properties and tectonic stress field, and the focal depth is one of the most important parameters which control the damages of earthquakes, obtaining precise FMSs and focal depths of the Ludian earthquake sequence may help us to determine the detailed geometric features of the rupture fault and the seismogenic environment. In this work we obtained the FMSs and centroid depths of the Ludian earthquake and its Ms>3.0 aftershocks by the revised CAP method, and further verified some focal depths using the depth phase method. Combining the FMSs of the mainshock and the strong aftershocks, as well as their spatial distributions, and the seismogenic environment of the source region, we can make the following characteristics of the Ludian earthquake sequence and its seismogenic structure: (1) The Ludian earthquake is a left-lateral strike slip earthquake, with magnitude of about Mw6.1. The FMS of nodal plane I is 75o/56o/180o for strike, dip and rake angles, and 165o/90o/34ofor the other nodal plane. (2) The Ludian earthquake is very shallow with the optimum centroid depth of ~3 km, which is consistent with the strong ground shaking and the surface rupture observed by field survey and strengthens the damages of the Ludian earthquake. (3) The Ludian Earthquake should occur on the NNW trend BXF. Because two later aftershocks occurred close to the fault zone of the ZLF, and their FMSs are similar with the characteristics of the ZLF, the shallower part of the ZLF may also rupture during the aftershock duration of the Ludian earthquake. Since the ZLF is much longer than the BXF, the seismic risk of the ZLF may be high and should be required more attention.

  13. Development of a universal control unit for functional electrical stimulation (FES).

    PubMed

    Brandell, B R

    1982-12-01

    In collaboration with the College of Engineering the author has developed a laboratory, or clinic, based, battery operated "universal" control system, designed to improve disabled gait in upper motor neuron disabilities, especially stroke, hemiplegia, and cerebral palsy, by applying several channels of FES (Functional Electrical Stimulation) to the lower limb muscles while the patient is walking. The timing of the FES pulses, which can be applied to as many as six of the patient's muscles, is determined by potentiometer controlled one-shot timers, which are triggered by any of three switches in the sole of either shoe. Combinations of inverters, flip flops, AND gates and OR gates in the externally connected logic circuits determine the sequence of delays and pulses applied to the patient's muscles. This paper describes and diagrams some of the logic circuits and as an example of the possible application of the concept of a "universal" control unit reports the modifications of gait induced in a hemiplegic, four year post-stroke, patient. The characteristics of this patient's gait with FES in comparison to its characteristics without FES are demonstrated with motion picture frames, EMG recordings and graphic tracings of her right knee and ankle joint positions. They include more symmetrical timing of her right and left stance and swing phases, increased dorsiflexion of her right ankle in the swing phase, followed by a more distinct heel strike, and improved flexion--extension sequences of the knee and ankle joints and an increased heel rise in the stance phase. The author concludes that the gait characteristics of some hemiplegic patients will improve as they become adapted over a period of weeks or months to a control logic, which lessens their functional limitations by the use of a properly timed and amplified sequence of FES pulses. He suggests that the FES control requirements for individual patients should be determined experimentally with a control system "universally" adaptable to a wide range of disabilities, and that these control parameters could then determine the design of portable units, which may be used on a long term basis. These units would include only the operational options needed to duplicate the gait corrections found to be practicable for each individual patient, by the testing procedure, through a universal logic unit as described in this paper.

  14. Science Opportunity Analyzer (SOA): Not Just Another Pretty Face

    NASA Technical Reports Server (NTRS)

    Polanskey, Carol A.; Streiiffert, Barbara; O'Reilly, Taifun

    2004-01-01

    This viewgraph presentation reviews the Science Opportunity Analyzer (SOA). For the first time at JPL, the Cassini mission to Saturn is using distributed science operations for sequence generation. This means that scientist at other institutions has more responsibility to build the spacecraft sequence. Tools are required to support the sequence development. JPL tools required a complete configuration behind a firewall, and the tools that the user community had developed did not interface with the JPL tools. Therefore the SOA was created to bridge the gap between the remote scientists and the JPL operations teams. The presentation reviews the development of the SOA, and what was required of the system. The presentation reviews the functions that the SOA performed.

  15. Detection of malignant hepatic tumors with ferumoxides-enhanced MRI: comparison of five gradient-recalled echo sequences with different TEs.

    PubMed

    Matsuo, Masayuki; Kanematsu, Masayuki; Itoh, Kyo; Murakami, Takamichi; Maetani, Yoji; Kondo, Hiroshi; Goshima, Satoshi; Kako, Nobuo; Hoshi, Hiroaki; Konishi, Junji; Moriyama, Noriyuki; Nakamura, Hironobu

    2004-01-01

    The purpose of our study was to compare the detectability of malignant hepatic tumors on ferumoxides-enhanced MRI using five gradient-recalled echo sequences at different TEs. Ferumoxides-enhanced MRIs obtained in 31 patients with 50 malignant hepatic tumors (33 hepatocellular carcinomas, 17 metastases) were reviewed retrospectively by three independent offsite radiologists. T1-weighted gradient-recalled echo images with TEs of 1.4 and 4.2 msec; T2*-weighted gradient-recalled echo images with TEs of 6, 8, and 10 msec; and T2-weighted fast spin-echo images of livers were randomly reviewed on a segment-by-segment basis. Observer performance was tested using the McNemar test and receiver operating characteristic analysis for the clustered data. Lesion-to-liver contrast-to-noise ratio was also assessed. Mean lesion-to-liver contrast-to-noise ratios were negative and lower with gradient-recalled echo at 1.4 msec than with the other sequences. Sensitivity was higher (p < 0.05) with gradient-recalled echo at 6, 8, and 10 msec and fast spin-echo sequences (75-83%) than with gradient-recalled echo sequences at 1.4 and 4.2 msec (46-48%), and was higher (p < 0.05) with gradient-recalled echo sequence at 8 msec (83%) than with gradient-recalled echo at 6 msec and fast spin-echo sequences (75-78%). Specificity was comparably high with all sequences (95-98%). The area under the receiver operating characteristic curve (A(z)) was greater (p < 0.05) with gradient-recalled echo at 6, 8, and 10 msec and fast spin-echo sequences (A(z) = 0.91-0.93) than with gradient-recalled echo sequences at 1.4 and 4.2 msec (A(z) = 0.82-0.85). In the detection of malignant hepatic tumors, gradient-recalled echo sequences at 8 msec showed the highest sensitivity and had an A(z) value and lesion-to-liver contrast-to-noise ratio comparable with values from gradient-recalled echo sequences at 6 and 10 msec and fast spin-echo sequences.

  16. Diversity of halophilic archaea from six hypersaline environments in Turkey.

    PubMed

    Ozcan, Birgul; Ozcengiz, Gulay; Coleri, Arzu; Cokmus, Cumhur

    2007-06-01

    The diversity of archaeal strains from six hypersaline environments in Turkey was analyzed by comparing their phenotypic characteristics and 16S rDNA sequences. Thirty-three isolates were characterized in terms of their phenotypic properties including morphological and biochemical characteristics, susceptibility to different antibiotics, and total lipid and plasmid contents, and finally compared by 16S rDNA gene sequences. The results showed that all isolates belong to the family Halobacteriaceae. Phylogenetic analyses using approximately 1,388 bp comparisions of 16S rDNA sequences demonstrated that all isolates clustered closely to species belonging to 9 genera, namely Halorubrum (8 isolates), Natrinema (5 isolates), Haloarcula (4 isolates), Natronococcus (4 isolates), Natrialba (4 isolates), Haloferax (3 isolates), Haloterrigena (3 isolates), Halalkalicoccus (1 isolate), and Halomicrobium (1 isolate). The results revealed a high diversity among the isolated halophilic strains and indicated that some of these strains constitute new taxa of extremely halophilic archaea.

  17. The nucleotide sequence of the putative transcription initiation site of a cloned ribosomal RNA gene of the mouse.

    PubMed Central

    Urano, Y; Kominami, R; Mishima, Y; Muramatsu, M

    1980-01-01

    Approximately one kilobase pairs surrounding and upstream the transcription initiation site of a cloned ribosomal DNA (rDNA) of the mouse were sequenced. The putative transcription initiation site was determined by two independent methods: one nuclease S1 protection and the other reverse transcriptase elongation mapping using isolated 45S ribosomal RNA precursor (45S RNA) and appropriate restriction fragments of rDNA. Both methods gave an identical result; 45S RNA had a structure starting from ACTCTTAG---. Characteristically, mouse rDNA had many T clusters (greater than or equal to 5) upstream the initiation site, the longest being 21 consecutive T's. A pentadecanucleotide, TGCCTCCCGAGTGCA, appeared twice within 260 nucleotides upstream the putative initiation site. No such characteristic sequences were found downstream this site. Little similarity was found in the upstream of the transcription initiation site between the mouse, Xenopus laevis and Saccharomyces cerevisiae rDNA. Images PMID:6162156

  18. Identification of optimum sequencing depth especially for de novo genome assembly of small genomes using next generation sequencing data.

    PubMed

    Desai, Aarti; Marwah, Veer Singh; Yadav, Akshay; Jha, Vineet; Dhaygude, Kishor; Bangar, Ujwala; Kulkarni, Vivek; Jere, Abhay

    2013-01-01

    Next Generation Sequencing (NGS) is a disruptive technology that has found widespread acceptance in the life sciences research community. The high throughput and low cost of sequencing has encouraged researchers to undertake ambitious genomic projects, especially in de novo genome sequencing. Currently, NGS systems generate sequence data as short reads and de novo genome assembly using these short reads is computationally very intensive. Due to lower cost of sequencing and higher throughput, NGS systems now provide the ability to sequence genomes at high depth. However, currently no report is available highlighting the impact of high sequence depth on genome assembly using real data sets and multiple assembly algorithms. Recently, some studies have evaluated the impact of sequence coverage, error rate and average read length on genome assembly using multiple assembly algorithms, however, these evaluations were performed using simulated datasets. One limitation of using simulated datasets is that variables such as error rates, read length and coverage which are known to impact genome assembly are carefully controlled. Hence, this study was undertaken to identify the minimum depth of sequencing required for de novo assembly for different sized genomes using graph based assembly algorithms and real datasets. Illumina reads for E.coli (4.6 MB) S.kudriavzevii (11.18 MB) and C.elegans (100 MB) were assembled using SOAPdenovo, Velvet, ABySS, Meraculous and IDBA-UD. Our analysis shows that 50X is the optimum read depth for assembling these genomes using all assemblers except Meraculous which requires 100X read depth. Moreover, our analysis shows that de novo assembly from 50X read data requires only 6-40 GB RAM depending on the genome size and assembly algorithm used. We believe that this information can be extremely valuable for researchers in designing experiments and multiplexing which will enable optimum utilization of sequencing as well as analysis resources.

  19. [Analysis of Conformational Features of Watson-Crick Duplex Fragments by Molecular Mechanics and Quantum Mechanics Methods].

    PubMed

    Poltev, V I; Anisimov, V M; Sanchez, C; Deriabina, A; Gonzalez, E; Garcia, D; Rivas, F; Polteva, N A

    2016-01-01

    It is generally accepted that the important characteristic features of the Watson-Crick duplex originate from the molecular structure of its subunits. However, it still remains to elucidate what properties of each subunit are responsible for the significant characteristic features of the DNA structure. The computations of desoxydinucleoside monophosphates complexes with Na-ions using density functional theory revealed a pivotal role of DNA conformational properties of single-chain minimal fragments in the development of unique features of the Watson-Crick duplex. We found that directionality of the sugar-phosphate backbone and the preferable ranges of its torsion angles, combined with the difference between purines and pyrimidines. in ring bases, define the dependence of three-dimensional structure of the Watson-Crick duplex on nucleotide base sequence. In this work, we extended these density functional theory computations to the minimal' fragments of DNA duplex, complementary desoxydinucleoside monophosphates complexes with Na-ions. Using several computational methods and various functionals, we performed a search for energy minima of BI-conformation for complementary desoxydinucleoside monophosphates complexes with different nucleoside sequences. Two sequences are optimized using ab initio method at the MP2/6-31++G** level of theory. The analysis of torsion angles, sugar ring puckering and mutual base positions of optimized structures demonstrates that the conformational characteristic features of complementary desoxydinucleoside monophosphates complexes with Na-ions remain within BI ranges and become closer to the corresponding characteristic features of the Watson-Crick duplex crystals. Qualitatively, the main characteristic features of each studied complementary desoxydinucleoside monophosphates complex remain invariant when different computational methods are used, although the quantitative values of some conformational parameters could vary lying within the limits typical for the corresponding family. We observe that popular functionals in density functional theory calculations lead to the overestimated distances between base pairs, while MP2 computations and the newer complex functionals produce the structures that have too close atom-atom contacts. A detailed study of some complementary desoxydinucleoside monophosphate complexes with Na-ions highlights the existence of several energy minima corresponding to BI-conformations, in other words, the complexity of the relief pattern of the potential energy surface of complementary desoxydinucleoside monophosphate complexes. This accounts for variability of conformational parameters of duplex fragments with the same base sequence. Popular molecular mechanics force fields AMBER and CHARMM reproduce most of the conformational characteristics of desoxydinucleoside monophosphates and their complementary complexes with Na-ions but fail to reproduce some details of the dependence of the Watson-Crick duplex conformation on the nucleotide sequence.

  20. Waterborne Transportation Lines of the United States : calendar year 2008. Volume 3 : vessel characteristics

    DOT National Transportation Integrated Search

    2009-11-16

    The Vessel Characteristics, Volume 3, is : one of three publications for the annual revision : of the WTLUS, which lists the vessel companies : in alphabetical sequence and describes each : vessel surveyed by indicating its name and : number, Coast G...

  1. "The devil's in the detail": Release of an expanded, enhanced and dynamically revised forensic STR Sequence Guide.

    PubMed

    Phillips, C; Gettings, K Butler; King, J L; Ballard, D; Bodner, M; Borsuk, L; Parson, W

    2018-05-01

    The STR sequence template file published in 2016 as part of the considerations from the DNA Commission of the International Society for Forensic Genetics on minimal STR sequence nomenclature requirements, has been comprehensively revised and audited using the latest GRCh38 genome assembly. The list of forensic STRs characterized was expanded by including supplementary autosomal, X- and Y-chromosome microsatellites in less common use for routine DNA profiling, but some likely to be adopted in future massively parallel sequencing (MPS) STR panels. We outline several aspects of sequence alignment and annotation that required care and attention to detail when comparing sequences to GRCh37 and GRCh38 assemblies, as well as the necessary matching of MPS-based allele descriptions to previously established repeat region structures described in initial sequencing studies of the less well known forensic STRs. The revised sequence guide is now available in a dynamically updated FTP format from the STRidER website with a date-stamped change log to allow users to explore their own MPS data with the most up-to-date forensic STR sequence information compiled in a simple guide. Copyright © 2018 Elsevier B.V. All rights reserved.

  2. Rational design of DNA sequences for nanotechnology, microarrays and molecular computers using Eulerian graphs.

    PubMed

    Pancoska, Petr; Moravek, Zdenek; Moll, Ute M

    2004-01-01

    Nucleic acids are molecules of choice for both established and emerging nanoscale technologies. These technologies benefit from large functional densities of 'DNA processing elements' that can be readily manufactured. To achieve the desired functionality, polynucleotide sequences are currently designed by a process that involves tedious and laborious filtering of potential candidates against a series of requirements and parameters. Here, we present a complete novel methodology for the rapid rational design of large sets of DNA sequences. This method allows for the direct implementation of very complex and detailed requirements for the generated sequences, thus avoiding 'brute force' filtering. At the same time, these sequences have narrow distributions of melting temperatures. The molecular part of the design process can be done without computer assistance, using an efficient 'human engineering' approach by drawing a single blueprint graph that represents all generated sequences. Moreover, the method eliminates the necessity for extensive thermodynamic calculations. Melting temperature can be calculated only once (or not at all). In addition, the isostability of the sequences is independent of the selection of a particular set of thermodynamic parameters. Applications are presented for DNA sequence designs for microarrays, universal microarray zip sequences and electron transfer experiments.

  3. Analysis of Pre-Analytic Factors Affecting the Success of Clinical Next-Generation Sequencing of Solid Organ Malignancies.

    PubMed

    Chen, Hui; Luthra, Rajyalakshmi; Goswami, Rashmi S; Singh, Rajesh R; Roy-Chowdhuri, Sinchita

    2015-08-28

    Application of next-generation sequencing (NGS) technology to routine clinical practice has enabled characterization of personalized cancer genomes to identify patients likely to have a response to targeted therapy. The proper selection of tumor sample for downstream NGS based mutational analysis is critical to generate accurate results and to guide therapeutic intervention. However, multiple pre-analytic factors come into play in determining the success of NGS testing. In this review, we discuss pre-analytic requirements for AmpliSeq PCR-based sequencing using Ion Torrent Personal Genome Machine (PGM) (Life Technologies), a NGS sequencing platform that is often used by clinical laboratories for sequencing solid tumors because of its low input DNA requirement from formalin fixed and paraffin embedded tissue. The success of NGS mutational analysis is affected not only by the input DNA quantity but also by several other factors, including the specimen type, the DNA quality, and the tumor cellularity. Here, we review tissue requirements for solid tumor NGS based mutational analysis, including procedure types, tissue types, tumor volume and fraction, decalcification, and treatment effects.

  4. Wld S protein requires Nmnat activity and a short N-terminal sequence to protect axons in mice.

    PubMed

    Conforti, Laura; Wilbrey, Anna; Morreale, Giacomo; Janeckova, Lucie; Beirowski, Bogdan; Adalbert, Robert; Mazzola, Francesca; Di Stefano, Michele; Hartley, Robert; Babetto, Elisabetta; Smith, Trevor; Gilley, Jonathan; Billington, Richard A; Genazzani, Armando A; Ribchester, Richard R; Magni, Giulio; Coleman, Michael

    2009-02-23

    The slow Wallerian degeneration (Wld(S)) protein protects injured axons from degeneration. This unusual chimeric protein fuses a 70-amino acid N-terminal sequence from the Ube4b multiubiquitination factor with the nicotinamide adenine dinucleotide-synthesizing enzyme nicotinamide mononucleotide adenylyl transferase 1. The requirement for these components and the mechanism of Wld(S)-mediated neuroprotection remain highly controversial. The Ube4b domain is necessary for the protective phenotype in mice, but precisely which sequence is essential and why are unclear. Binding to the AAA adenosine triphosphatase valosin-containing protein (VCP)/p97 is the only known biochemical property of the Ube4b domain. Using an in vivo approach, we show that removing the VCP-binding sequence abolishes axon protection. Replacing the Wld(S) VCP-binding domain with an alternative ataxin-3-derived VCP-binding sequence restores its protective function. Enzyme-dead Wld(S) is unable to delay Wallerian degeneration in mice. Thus, neither domain is effective without the function of the other. Wld(S) requires both of its components to protect axons from degeneration.

  5. Complete-proteome mapping of human influenza A adaptive mutations: implications for human transmissibility of zoonotic strains.

    PubMed

    Miotto, Olivo; Heiny, A T; Albrecht, Randy; García-Sastre, Adolfo; Tan, Tin Wee; August, J Thomas; Brusic, Vladimir

    2010-02-03

    There is widespread concern that H5N1 avian influenza A viruses will emerge as a pandemic threat, if they become capable of human-to-human (H2H) transmission. Avian strains lack this capability, which suggests that it requires important adaptive mutations. We performed a large-scale comparative analysis of proteins from avian and human strains, to produce a catalogue of mutations associated with H2H transmissibility, and to detect their presence in avian isolates. We constructed a dataset of influenza A protein sequences from 92,343 public database records. Human and avian sequence subsets were compared, using a method based on mutual information, to identify characteristic sites where human isolates present conserved mutations. The resulting catalogue comprises 68 characteristic sites in eight internal proteins. Subtype variability prevented the identification of adaptive mutations in the hemagglutinin and neuraminidase proteins. The high number of sites in the ribonucleoprotein complex suggests interdependence between mutations in multiple proteins. Characteristic sites are often clustered within known functional regions, suggesting their functional roles in cellular processes. By isolating and concatenating characteristic site residues, we defined adaptation signatures, which summarize the adaptive potential of specific isolates. Most adaptive mutations emerged within three decades after the 1918 pandemic, and have remained remarkably stable thereafter. Two lineages with stable internal protein constellations have circulated among humans without reassorting. On the contrary, H5N1 avian and swine viruses reassort frequently, causing both gains and losses of adaptive mutations. Human host adaptation appears to be complex and systemic, involving nearly all influenza proteins. Adaptation signatures suggest that the ability of H5N1 strains to infect humans is related to the presence of an unusually high number of adaptive mutations. However, these mutations appear unstable, suggesting low pandemic potential of H5N1 in its current form. In addition, adaptation signatures indicate that pandemic H1N1/09 strain possesses multiple human-transmissibility mutations, though not an unusually high number with respect to swine strains that infected humans in the past. Adaptation signatures provide a novel tool for identifying zoonotic strains with the potential to infect humans.

  6. A rapid and cost-effective method for sequencing pooled cDNA clones by using a combination of transposon insertion and Gateway technology.

    PubMed

    Morozumi, Takeya; Toki, Daisuke; Eguchi-Ogawa, Tomoko; Uenishi, Hirohide

    2011-09-01

    Large-scale cDNA-sequencing projects require an efficient strategy for mass sequencing. Here we describe a method for sequencing pooled cDNA clones using a combination of transposon insertion and Gateway technology. Our method reduces the number of shotgun clones that are unsuitable for reconstruction of cDNA sequences, and has the advantage of reducing the total costs of the sequencing project.

  7. A novel, privacy-preserving cryptographic approach for sharing sequencing data

    PubMed Central

    Cassa, Christopher A; Miller, Rachel A; Mandl, Kenneth D

    2013-01-01

    Objective DNA samples are often processed and sequenced in facilities external to the point of collection. These samples are routinely labeled with patient identifiers or pseudonyms, allowing for potential linkage to identity and private clinical information if intercepted during transmission. We present a cryptographic scheme to securely transmit externally generated sequence data which does not require any patient identifiers, public key infrastructure, or the transmission of passwords. Materials and methods This novel encryption scheme cryptographically protects participant sequence data using a shared secret key that is derived from a unique subset of an individual’s genetic sequence. This scheme requires access to a subset of an individual’s genetic sequence to acquire full access to the transmitted sequence data, which helps to prevent sample mismatch. Results We validate that the proposed encryption scheme is robust to sequencing errors, population uniqueness, and sibling disambiguation, and provides sufficient cryptographic key space. Discussion Access to a set of an individual’s genotypes and a mutually agreed cryptographic seed is needed to unlock the full sequence, which provides additional sample authentication and authorization security. We present modest fixed and marginal costs to implement this transmission architecture. Conclusions It is possible for genomics researchers who sequence participant samples externally to protect the transmission of sequence data using unique features of an individual’s genetic sequence. PMID:23125421

  8. A reassessment of IgM memory subsets in humans

    PubMed Central

    Bagnara, Davide; Squillario, Margherita; Kipling, David; Mora, Thierry; Walczak, Aleksandra M.; Da Silva, Lucie; Weller, Sandra; Dunn-Walters, Deborah K.; Weill, Jean-Claude; Reynaud, Claude-Agnès

    2015-01-01

    From paired blood and spleen samples from three adult donors we performed high-throughput V-h sequencing of human B-cell subsets defined by IgD and CD27 expression: IgD+CD27+ (“MZ”), IgD−CD27+(“memory”, including IgM (“IgM-only”), IgG and IgA) and IgD−CD27− cells (“double-negative”, including IgM, IgG and IgA). 91,294 unique sequences clustered in 42,670 clones, revealing major clonal expansions in each of these subsets. Among these clones, we further analyzed those shared sequences from different subsets or tissues for Vh-gene mutation, H-CDR3-length, and Vh/Jh usage, comparing these different characteristics with all sequences from their subset of origin, for which these parameters constitute a distinct signature. The IgM-only repertoire profile differed notably from that of MZ B cells by a higher mutation frequency, and lower Vh4 and higher Jh6 gene usage. Strikingly, IgM sequences from clones shared between the MZ and the memory IgG/IgA compartments showed a mutation and repertoire profile of IgM-only and not of MZ B cells. Similarly, all IgM clonal relationships (between MZ, IgM-only, and double-negative compartments) involved sequences with the characteristics of IgM-only B cells. Finally, clonal relationships between tissues suggested distinct recirculation characteristics between MZ and switched B cells. The “IgM-only” subset (including cells with its repertoire signature but higher IgD or lower CD27 expression levels) thus appear as the only subset showing precursor-product relationships with CD27+ switched memory B cells, indicating that they represent germinal center-derived IgM memory B cells, and that IgM memory and MZ B cells constitute two distinct entities. PMID:26355154

  9. A Reassessment of IgM Memory Subsets in Humans.

    PubMed

    Bagnara, Davide; Squillario, Margherita; Kipling, David; Mora, Thierry; Walczak, Aleksandra M; Da Silva, Lucie; Weller, Sandra; Dunn-Walters, Deborah K; Weill, Jean-Claude; Reynaud, Claude-Agnès

    2015-10-15

    From paired blood and spleen samples from three adult donors, we performed high-throughput VH sequencing of human B cell subsets defined by IgD and CD27 expression: IgD(+)CD27(+) ("marginal zone [MZ]"), IgD(-)CD27(+) ("memory," including IgM ["IgM-only"], IgG and IgA) and IgD(-)CD27(-) cells ("double-negative," including IgM, IgG, and IgA). A total of 91,294 unique sequences clustered in 42,670 clones, revealing major clonal expansions in each of these subsets. Among these clones, we further analyzed those shared sequences from different subsets or tissues for VH gene mutation, H-CDR3-length, and VH/JH usage, comparing these different characteristics with all sequences from their subset of origin for which these parameters constitute a distinct signature. The IgM-only repertoire profile differed notably from that of MZ B cells by a higher mutation frequency and lower VH4 and higher JH6 gene usage. Strikingly, IgM sequences from clones shared between the MZ and the memory IgG/IgA compartments showed a mutation and repertoire profile of IgM-only and not of MZ B cells. Similarly, all IgM clonal relationships (among MZ, IgM-only, and double-negative compartments) involved sequences with the characteristics of IgM-only B cells. Finally, clonal relationships between tissues suggested distinct recirculation characteristics between MZ and switched B cells. The "IgM-only" subset (including cells with its repertoire signature but higher IgD or lower CD27 expression levels) thus appear as the only subset showing precursor-product relationships with CD27(+) switched memory B cells, indicating that they represent germinal center-derived IgM memory B cells and that IgM memory and MZ B cells constitute two distinct entities. Copyright © 2015 by The American Association of Immunologists, Inc.

  10. Evolutionary Dynamics of Microsatellite Distribution in Plants: Insight from the Comparison of Sequenced Brassica, Arabidopsis and Other Angiosperm Species

    PubMed Central

    Shi, Jiaqin; Huang, Shunmou; Fu, Donghui; Yu, Jinyin; Wang, Xinfa; Hua, Wei; Liu, Shengyi; Liu, Guihua; Wang, Hanzhong

    2013-01-01

    Despite their ubiquity and functional importance, microsatellites have been largely ignored in comparative genomics, mostly due to the lack of genomic information. In the current study, microsatellite distribution was characterized and compared in the whole genomes and both the coding and non-coding DNA sequences of the sequenced Brassica, Arabidopsis and other angiosperm species to investigate their evolutionary dynamics in plants. The variation in the microsatellite frequencies of these angiosperm species was much smaller than those for their microsatellite numbers and genome sizes, suggesting that microsatellite frequency may be relatively stable in plants. The microsatellite frequencies of these angiosperm species were significantly negatively correlated with both their genome sizes and transposable elements contents. The pattern of microsatellite distribution may differ according to the different genomic regions (such as coding and non-coding sequences). The observed differences in many important microsatellite characteristics (especially the distribution with respect to motif length, type and repeat number) of these angiosperm species were generally accordant with their phylogenetic distance, which suggested that the evolutionary dynamics of microsatellite distribution may be generally consistent with plant divergence/evolution. Importantly, by comparing these microsatellite characteristics (especially the distribution with respect to motif type) the angiosperm species (aside from a few species) all clustered into two obviously different groups that were largely represented by monocots and dicots, suggesting a complex and generally dichotomous evolutionary pattern of microsatellite distribution in angiosperms. Polyploidy may lead to a slight increase in microsatellite frequency in the coding sequences and a significant decrease in microsatellite frequency in the whole genome/non-coding sequences, but have little effect on the microsatellite distribution with respect to motif length, type and repeat number. Interestingly, several microsatellite characteristics seemed to be constant in plant evolution, which can be well explained by the general biological rules. PMID:23555856

  11. Characterization and Pathogenicity of Alternaria vanuatuensis, a New Record from Allium Plants in Korea and China.

    PubMed

    Li, Mei Jia; Deng, Jian Xin; Paul, Narayan Chandra; Lee, Hyang Burm; Yu, Seung Hun

    2014-12-01

    Alternaria from different Allium plants was characterized by multilocus sequence analysis. Based on sequences of the β-tubulin (BT2b), the Alternaria allergen a1 (Alt a1), and the RNA polymerase II second largest subunit (RPB2) genes and phylogenetic data analysis, isolates were divided into two groups. The two groups were identical to representative isolates of A. porri (EGS48-147) and A. vanuatuensis (EGS45-018). The conidial characteristics and pathogenicity of A. vanuatuensis also well supported the molecular characteristics. This is the first record of A. vanuatuensis E. G. Simmons & C. F. Hill from Korea and China.

  12. Non-Genomic Origins of Proteins and Metabolism

    NASA Technical Reports Server (NTRS)

    Pohorille, Andrew

    2003-01-01

    It is proposed that evolution of inanimate matter to cells endowed with a nucleic acid- based coding of genetic information was preceded by an evolutionary phase, in which peptides not coded by nucleic acids were able to self-organize into networks capable of evolution towards increasing metabolic complexity. Recent findings that truly different, simple peptides (Keefe and Szostak, 2001) can perform the same function (such as ATP binding) provide experimental support for this mechanism of early protobiological evolution. The central concept underlying this mechanism is that the reproduction of cellular functions alone was sufficient for self-maintenance of protocells, and that self- replication of macromolecules was not required at this stage of evolution. The precise transfer of information between successive generations of the earliest protocells was unnecessary and, possibly, undesirable. The key requirement in the initial stage of protocellular evolution was an ability to rapidly explore a large number of protein sequences in order to discover a set of molecules capable of supporting self- maintenance and growth of protocells. Undoubtedly, the essential protocellular functions were carried out by molecules not nearly as efficient or as specific as contemporary proteins. Many, potentially unrelated sequences could have performed each of these functions at an evolutionarily acceptable level. As evolution progressed, however proteins must have performed their functions with increasing efficiency and specificity. This, in turn, put additional constraints on protein sequences and the fraction of proteins capable of performing their functions at the required level decreased. At some point, the likelihood of generating a sufficiently efficient set of proteins through a non-coded synthesis was so small that further evolution was not possible without storing information about the sequences of these proteins. Beyond this point, further evolution required coupling between proteins and informational polymers that is characteristic to all known forms of life. The emergence of such coupling must be postulated in any scenario of the origin of life, no matter whether it starts with RNA or proteins. To examine the evolutionary potential of non-genomic systems, a simple, computationally tractable model, which is still capable of capturing the essential features of the real system, has been studied computationally. Both constructive and destructive processes have been introduced into the model in a stochastic manner. Instead of assuming random reaction sets, only a suite of protobiologically plausible reactions has been considered. Peptides have been explicitly considered as protoenzymes and their catalytic efficiencies have been assigned on the basis of biochemical principles and experimental estimates. Simulations have been carried out using a novel approach (The Next Reaction Method) that is appropriate even for very low concentrations of reactants. Studies have focused on global autocatalytic processes and their diversity.

  13. Genome sequence of the white koji mold Aspergillus kawachii IFO 4308, used for brewing the Japanese distilled spirit shochu.

    PubMed

    Futagami, Taiki; Mori, Kazuki; Yamashita, Ayaka; Wada, Shotaro; Kajiwara, Yasuhiro; Takashita, Hideharu; Omori, Toshiro; Takegawa, Kaoru; Tashiro, Kosuke; Kuhara, Satoru; Goto, Masatoshi

    2011-11-01

    The filamentous fungus Aspergillus kawachii has traditionally been used for brewing the Japanese distilled spirit shochu. A. kawachii characteristically hyperproduces citric acid and a variety of polysaccharide glycoside hydrolases. Here the genome sequence of A. kawachii IFO 4308 was determined and annotated. Analysis of the sequence may provide insight into the properties of this fungus that make it superior for use in shochu production, leading to the further development of A. kawachii for industrial applications.

  14. Imaging different components of a tectonic tremor sequence in southwestern Japan using an automatic statistical detection and location method

    NASA Astrophysics Data System (ADS)

    Poiata, Natalia; Vilotte, Jean-Pierre; Bernard, Pascal; Satriano, Claudio; Obara, Kazushige

    2018-06-01

    In this study, we demonstrate the capability of an automatic network-based detection and location method to extract and analyse different components of tectonic tremor activity by analysing a 9-day energetic tectonic tremor sequence occurring at the downdip extension of the subducting slab in southwestern Japan. The applied method exploits the coherency of multiscale, frequency-selective characteristics of non-stationary signals recorded across the seismic network. Use of different characteristic functions, in the signal processing step of the method, allows to extract and locate the sources of short-duration impulsive signal transients associated with low-frequency earthquakes and of longer-duration energy transients during the tectonic tremor sequence. Frequency-dependent characteristic functions, based on higher-order statistics' properties of the seismic signals, are used for the detection and location of low-frequency earthquakes. This allows extracting a more complete (˜6.5 times more events) and time-resolved catalogue of low-frequency earthquakes than the routine catalogue provided by the Japan Meteorological Agency. As such, this catalogue allows resolving the space-time evolution of the low-frequency earthquakes activity in great detail, unravelling spatial and temporal clustering, modulation in response to tide, and different scales of space-time migration patterns. In the second part of the study, the detection and source location of longer-duration signal energy transients within the tectonic tremor sequence is performed using characteristic functions built from smoothed frequency-dependent energy envelopes. This leads to a catalogue of longer-duration energy sources during the tectonic tremor sequence, characterized by their durations and 3-D spatial likelihood maps of the energy-release source regions. The summary 3-D likelihood map for the 9-day tectonic tremor sequence, built from this catalogue, exhibits an along-strike spatial segmentation of the long-duration energy-release regions, matching the large-scale clustering features evidenced from the low-frequency earthquake's activity analysis. Further examination of the two catalogues showed that the extracted short-duration low-frequency earthquakes activity coincides in space, within about 10-15 km distance, with the longer-duration energy sources during the tectonic tremor sequence. This observation provides a potential constraint on the size of the longer-duration energy-radiating source region in relation with the clustering of low-frequency earthquakes activity during the analysed tectonic tremor sequence. We show that advanced statistical network-based methods offer new capabilities for automatic high-resolution detection, location and monitoring of different scale-components of tectonic tremor activity, enriching existing slow earthquakes catalogues. Systematic application of such methods to large continuous data sets will allow imaging the slow transient seismic energy-release activity at higher resolution, and therefore, provide new insights into the underlying multiscale mechanisms of slow earthquakes generation.

  15. Imaging different components of a tectonic tremor sequence in southwestern Japan using an automatic statistical detection and location method

    NASA Astrophysics Data System (ADS)

    Poiata, Natalia; Vilotte, Jean-Pierre; Bernard, Pascal; Satriano, Claudio; Obara, Kazushige

    2018-02-01

    In this study, we demonstrate the capability of an automatic network-based detection and location method to extract and analyse different components of tectonic tremor activity by analysing a 9-day energetic tectonic tremor sequence occurring at the down-dip extension of the subducting slab in southwestern Japan. The applied method exploits the coherency of multi-scale, frequency-selective characteristics of non-stationary signals recorded across the seismic network. Use of different characteristic functions, in the signal processing step of the method, allows to extract and locate the sources of short-duration impulsive signal transients associated with low-frequency earthquakes and of longer-duration energy transients during the tectonic tremor sequence. Frequency-dependent characteristic functions, based on higher-order statistics' properties of the seismic signals, are used for the detection and location of low-frequency earthquakes. This allows extracting a more complete (˜6.5 times more events) and time-resolved catalogue of low-frequency earthquakes than the routine catalogue provided by the Japan Meteorological Agency. As such, this catalogue allows resolving the space-time evolution of the low-frequency earthquakes activity in great detail, unravelling spatial and temporal clustering, modulation in response to tide, and different scales of space-time migration patterns. In the second part of the study, the detection and source location of longer-duration signal energy transients within the tectonic tremor sequence is performed using characteristic functions built from smoothed frequency-dependent energy envelopes. This leads to a catalogue of longer-duration energy sources during the tectonic tremor sequence, characterized by their durations and 3-D spatial likelihood maps of the energy-release source regions. The summary 3-D likelihood map for the 9-day tectonic tremor sequence, built from this catalogue, exhibits an along-strike spatial segmentation of the long-duration energy-release regions, matching the large-scale clustering features evidenced from the low-frequency earthquake's activity analysis. Further examination of the two catalogues showed that the extracted short-duration low-frequency earthquakes activity coincides in space, within about 10-15 km distance, with the longer-duration energy sources during the tectonic tremor sequence. This observation provides a potential constraint on the size of the longer-duration energy-radiating source region in relation with the clustering of low-frequency earthquakes activity during the analysed tectonic tremor sequence. We show that advanced statistical network-based methods offer new capabilities for automatic high-resolution detection, location and monitoring of different scale-components of tectonic tremor activity, enriching existing slow earthquakes catalogues. Systematic application of such methods to large continuous data sets will allow imaging the slow transient seismic energy-release activity at higher resolution, and therefore, provide new insights into the underlying multi-scale mechanisms of slow earthquakes generation.

  16. Effect of hot acid hydrolysis and hot chlorine dioxide stage on bleaching effluent biodegradability.

    PubMed

    Gomes, C M; Colodette, J L; Delantonio, N R N; Mounteer, A H; Silva, C M

    2007-01-01

    The hot acid hydrolysis followed by chlorine dioxide (A/D*) and hot chlorine dioxide (D*) technologies have proven very useful for bleaching of eucalyptus kraft pulp. Although the characteristics and biodegradability of effluents from conventional chlorine dioxide bleaching are well known, such information is not yet available for effluents derived from hot acid hydrolysis and hot chorine dioxide bleaching. This study discusses the characteristics and biodegradability of such effluents. Combined whole effluents from the complete sequences DEpD, D*EpD, A/D*EpD and ADEpD, and from the pre-bleaching sequences DEp, D*Ep, A/D*Ep and ADEp were characterized by quantifying their colour, AOX and organic load (BOD, COD, TOC). These effluents were also evaluated for their treatability by simulation of an activated sludge system. It was concluded that treatment in the laboratory sequencing batch reactor was efficient for removal of COD, BOD and TOC of all effluents. However, colour increased after biological treatment, with the greatest increase found for the effluent produced using the AD technology. Biological treatment was less efficient at removing AOX of effluents from the sequences with D*, A/D* and AD as the first stages, when compared to the reference D stage; there was evidence of the lower treatability of these organochlorine compounds from these sequences.

  17. QRS complex detection based on continuous density hidden Markov models using univariate observations

    NASA Astrophysics Data System (ADS)

    Sotelo, S.; Arenas, W.; Altuve, M.

    2018-04-01

    In the electrocardiogram (ECG), the detection of QRS complexes is a fundamental step in the ECG signal processing chain since it allows the determination of other characteristics waves of the ECG and provides information about heart rate variability. In this work, an automatic QRS complex detector based on continuous density hidden Markov models (HMM) is proposed. HMM were trained using univariate observation sequences taken either from QRS complexes or their derivatives. The detection approach is based on the log-likelihood comparison of the observation sequence with a fixed threshold. A sliding window was used to obtain the observation sequence to be evaluated by the model. The threshold was optimized by receiver operating characteristic curves. Sensitivity (Sen), specificity (Spc) and F1 score were used to evaluate the detection performance. The approach was validated using ECG recordings from the MIT-BIH Arrhythmia database. A 6-fold cross-validation shows that the best detection performance was achieved with 2 states HMM trained with QRS complexes sequences (Sen = 0.668, Spc = 0.360 and F1 = 0.309). We concluded that these univariate sequences provide enough information to characterize the QRS complex dynamics from HMM. Future works are directed to the use of multivariate observations to increase the detection performance.

  18. Saccharomyces cerevisiae SSB1 protein and its relationship to nucleolar RNA-binding proteins.

    PubMed

    Jong, A Y; Clark, M W; Gilbert, M; Oehm, A; Campbell, J L

    1987-08-01

    To better define the function of Saccharomyces cerevisiae SSB1, an abundant single-stranded nucleic acid-binding protein, we determined the nucleotide sequence of the SSB1 gene and compared it with those of other proteins of known function. The amino acid sequence contains 293 amino acid residues and has an Mr of 32,853. There are several stretches of sequence characteristic of other eucaryotic single-stranded nucleic acid-binding proteins. At the amino terminus, residues 39 to 54 are highly homologous to a peptide in calf thymus UP1 and UP2 and a human heterogeneous nuclear ribonucleoprotein. Residues 125 to 162 constitute a fivefold tandem repeat of the sequence RGGFRG, the composition of which suggests a nucleic acid-binding site. Near the C terminus, residues 233 to 245 are homologous to several RNA-binding proteins. Of 18 C-terminal residues, 10 are acidic, a characteristic of the procaryotic single-stranded DNA-binding proteins and eucaryotic DNA- and RNA-binding proteins. In addition, examination of the subcellular distribution of SSB1 by immunofluorescence microscopy indicated that SSB1 is a nuclear protein, predominantly located in the nucleolus. Sequence homologies and the nucleolar localization make it likely that SSB1 functions in RNA metabolism in vivo, although an additional role in DNA metabolism cannot be excluded.

  19. Airway and Feeding Outcomes of Mandibular Distraction, Tongue-Lip Adhesion, and Conservative Management in Pierre Robin Sequence: A Prospective Study.

    PubMed

    Khansa, Ibrahim; Hall, Courtney; Madhoun, Lauren L; Splaingard, Mark; Baylis, Adriane; Kirschner, Richard E; Pearson, Gregory D

    2017-04-01

    Pierre Robin sequence is characterized by mandibular retrognathia and glossoptosis resulting in airway obstruction and feeding difficulties. When conservative management fails, mandibular distraction osteogenesis or tongue-lip adhesion may be required to avoid tracheostomy. The authors' goal was to prospectively evaluate the airway and feeding outcomes of their comprehensive approach to Pierre Robin sequence, which includes conservative management, mandibular distraction osteogenesis, and tongue-lip adhesion. A longitudinal study of newborns with Pierre Robin sequence treated at a pediatric academic medical center between 2010 and 2015 was performed. Baseline feeding and respiratory data were collected. Patients underwent conservative management if they demonstrated sustainable weight gain without tube feeds, and if their airway was stable with positioning alone. Patients who required surgery underwent tongue-lip adhesion or mandibular distraction osteogenesis based on family and surgeon preference. Postoperative airway and feeding data were collected. Twenty-eight patients with Pierre Robin sequence were followed prospectively. Thirty-two percent had a syndrome. Ten underwent mandibular distraction osteogenesis, eight underwent tongue-lip adhesion, and 10 were treated conservatively. There were no differences in days to extubation or discharge, change in weight percentile, requirement for gastrostomy tube, or residual obstructive sleep apnea between the three groups. No patients required tracheostomy. The greatest reduction in apnea-hypopnea index occurred with mandibular distraction osteogenesis, followed by tongue-lip adhesion and conservative management. Careful selection of which patients with Pierre Robin sequence need surgery, and of the most appropriate surgical procedure for each patient, can minimize the need for postprocedure tracheostomy. A comprehensive approach to Pierre Robin sequence that includes conservative management, mandibular distraction osteogenesis, and tongue-lip adhesion can result in excellent airway and feeding outcomes. Therapeutic, II.

  20. Affordable hands-on DNA sequencing and genotyping: an exercise for teaching DNA analysis to undergraduates.

    PubMed

    Shah, Kushani; Thomas, Shelby; Stein, Arnold

    2013-01-01

    In this report, we describe a 5-week laboratory exercise for undergraduate biology and biochemistry students in which students learn to sequence DNA and to genotype their DNA for selected single nucleotide polymorphisms (SNPs). Students use miniaturized DNA sequencing gels that require approximately 8 min to run. The students perform G, A, T, C Sanger sequencing reactions. They prepare and run the gels, perform Southern blots (which require only 10 min), and detect sequencing ladders using a colorimetric detection system. Students enlarge their sequencing ladders from digital images of their small nylon membranes, and read the sequence manually. They compare their reads with the actual DNA sequence using BLAST2. After mastering the DNA sequencing system, students prepare their own DNA from a cheek swab, polymerase chain reaction-amplify a region of their DNA that encompasses a SNP of interest, and perform sequencing to determine their genotype at the SNP position. A family pedigree can also be constructed. The SNP chosen by the instructor was rs17822931, which is in the ABCC11 gene and is the determinant of human earwax type. Genotypes at the rs178229931 site vary in different ethnic populations. © 2013 by The International Union of Biochemistry and Molecular Biology.

  1. Characteristics of MHC class I genes in house sparrows Passer domesticus as revealed by long cDNA transcripts and amplicon sequencing.

    PubMed

    Karlsson, Maria; Westerdahl, Helena

    2013-08-01

    In birds the major histocompatibility complex (MHC) organization differs both among and within orders; chickens Gallus gallus of the order Galliformes have a simple arrangement, while many songbirds of the order Passeriformes have a more complex arrangement with larger numbers of MHC class I and II genes. Chicken MHC genes are found at two independent loci, classical MHC-B and non-classical MHC-Y, whereas non-classical MHC genes are yet to be verified in passerines. Here we characterize MHC class I transcripts (α1 to α3 domain) and perform amplicon sequencing using a next-generation sequencing technique on exon 3 from house sparrow Passer domesticus (a passerine) families. Then we use phylogenetic, selection, and segregation analyses to gain a better understanding of the MHC class I organization. Trees based on the α1 and α2 domain revealed a distinct cluster with short terminal branches for transcripts with a 6-bp deletion. Interestingly, this cluster was not seen in the tree based on the α3 domain. 21 exon 3 sequences were verified in a single individual and the average numbers within an individual were nine and five for sequences with and without a 6-bp deletion, respectively. All individuals had exon 3 sequences with and without a 6-bp deletion. The sequences with a 6-bp deletion have many characteristics in common with non-classical MHC, e.g., highly conserved amino acid positions were substituted compared with the other alleles, low nucleotide diversity and just a single site was subject to positive selection. However, these alleles also have characteristics that suggest they could be classical, e.g., complete linkage and absence of a distinct cluster in a tree based on the α3 domain. Thus, we cannot determine for certain whether or not the alleles with a 6-bp deletion are non-classical based on our present data. Further analyses on segregation patterns of these alleles in combination with dating the 6-bp deletion through MHC characterization across the genus Passer may solve this matter in the future.

  2. Autogen Version 2.0

    NASA Technical Reports Server (NTRS)

    Gladden, Roy

    2007-01-01

    Version 2.0 of the autogen software has been released. "Autogen" (automated sequence generation) signifies both a process and software used to implement the process of automated generation of sequences of commands in a standard format for uplink to spacecraft. Autogen requires fewer workers than are needed for older manual sequence-generation processes and reduces sequence-generation times from weeks to minutes.

  3. Characteristics of Viruses Derived from Nude Mice with Persistent Measles Virus Infection

    PubMed Central

    Hashimoto, Koichi; Watanabe, Masahiro; Ohara, Shinichiro; Sato, Masatoki; Kawasaki, Yukihiko; Hashimoto, Yuko; Hosoya, Mitsuaki

    2013-01-01

    Measles virus (MV) isolates from patients with subacute sclerosing panencephalitis (SSPE) differ from wild-type MV virologically. However, few animal models have reported viruses with characteristics of the SSPE virus. The MV Edmonston strain was inoculated into the subarachnoid space of nude mice. All nude mice displayed weight loss and required euthanasia, with a mean survival duration of 73.2 days. The viral load in the brain was 4- to 400-fold higher than the inoculated load, and brain infection was confirmed by immunostaining. Gene sequencing of the viruses revealed that amino acid mutations occurred more frequently in matrix proteins. The most common mutation was a uridine-to-cytosine transition. The virus exhibited lower free virus particle formation ability than the Edmonston strain. When nude mice were challenged with 2 × 102 PFU of the brain-derived virus, the mean survival duration was 34.7 days, which was significantly shorter than that of the mice challenged with 4 × 104 PFU of the Edmonston strain (P < 0.01). This study indicated that MV in a nude mouse model of persistent infection exhibited characteristics of the SSPE virus. This model may prove useful in elucidating the pathogenic mechanism of SSPE and developing potential therapeutics. PMID:23345518

  4. Characteristics of viruses derived from nude mice with persistent measles virus infection.

    PubMed

    Abe, Yusaku; Hashimoto, Koichi; Watanabe, Masahiro; Ohara, Shinichiro; Sato, Masatoki; Kawasaki, Yukihiko; Hashimoto, Yuko; Hosoya, Mitsuaki

    2013-04-01

    Measles virus (MV) isolates from patients with subacute sclerosing panencephalitis (SSPE) differ from wild-type MV virologically. However, few animal models have reported viruses with characteristics of the SSPE virus. The MV Edmonston strain was inoculated into the subarachnoid space of nude mice. All nude mice displayed weight loss and required euthanasia, with a mean survival duration of 73.2 days. The viral load in the brain was 4- to 400-fold higher than the inoculated load, and brain infection was confirmed by immunostaining. Gene sequencing of the viruses revealed that amino acid mutations occurred more frequently in matrix proteins. The most common mutation was a uridine-to-cytosine transition. The virus exhibited lower free virus particle formation ability than the Edmonston strain. When nude mice were challenged with 2 × 10(2) PFU of the brain-derived virus, the mean survival duration was 34.7 days, which was significantly shorter than that of the mice challenged with 4 × 10(4) PFU of the Edmonston strain (P < 0.01). This study indicated that MV in a nude mouse model of persistent infection exhibited characteristics of the SSPE virus. This model may prove useful in elucidating the pathogenic mechanism of SSPE and developing potential therapeutics.

  5. Teachers' Situation-Specific Mastery Experiences: Teacher, Student Group and Lesson Effects

    ERIC Educational Resources Information Center

    Malmberg, Lars-Erik; Hagger, Hazel; Webster, Sophie

    2014-01-01

    Following a model on the cyclical nature of teacher ("trait") self-efficacy and context-, task- and situation-specific ("state") "mastery experiences" (TSSME), we investigated the variability and effects of lesson characteristics (e.g. lesson sequence), student group characteristics (e.g. proportion of students…

  6. Rapid amplification of 5' complementary DNA ends (5' RACE).

    PubMed

    2005-08-01

    This method is used to extend partial cDNA clones by amplifying the 5' sequences of the corresponding mRNAs 1-3. The technique requires knowledge of only a small region of sequence within the partial cDNA clone. During PCR, the thermostable DNA polymerase is directed to the appropriate target RNA by a single primer derived from the region of known sequence; the second primer required for PCR is complementary to a general feature of the target-in the case of 5' RACE, to a homopolymeric tail added (via terminal transferase) to the 3' termini of cDNAs transcribed from a preparation of mRNA. This synthetic tail provides a primer-binding site upstream of the unknown 5' sequence of the target mRNA. The products of the amplification reaction are cloned into a plasmid vector for sequencing and subsequent manipulation.

  7. An Adaptive Defect Weighted Sampling Algorithm to Design Pseudoknotted RNA Secondary Structures

    PubMed Central

    Zandi, Kasra; Butler, Gregory; Kharma, Nawwaf

    2016-01-01

    Computational design of RNA sequences that fold into targeted secondary structures has many applications in biomedicine, nanotechnology and synthetic biology. An RNA molecule is made of different types of secondary structure elements and an important RNA element named pseudoknot plays a key role in stabilizing the functional form of the molecule. However, due to the computational complexities associated with characterizing pseudoknotted RNA structures, most of the existing RNA sequence designer algorithms generally ignore this important structural element and therefore limit their applications. In this paper we present a new algorithm to design RNA sequences for pseudoknotted secondary structures. We use NUPACK as the folding algorithm to compute the equilibrium characteristics of the pseudoknotted RNAs, and describe a new adaptive defect weighted sampling algorithm named Enzymer to design low ensemble defect RNA sequences for targeted secondary structures including pseudoknots. We used a biological data set of 201 pseudoknotted structures from the Pseudobase library to benchmark the performance of our algorithm. We compared the quality characteristics of the RNA sequences we designed by Enzymer with the results obtained from the state of the art MODENA and antaRNA. Our results show our method succeeds more frequently than MODENA and antaRNA do, and generates sequences that have lower ensemble defect, lower probability defect and higher thermostability. Finally by using Enzymer and by constraining the design to a naturally occurring and highly conserved Hammerhead motif, we designed 8 sequences for a pseudoknotted cis-acting Hammerhead ribozyme. Enzymer is available for download at https://bitbucket.org/casraz/enzymer. PMID:27499762

  8. A characteristic phenotypic retinal appearance in Norrie disease.

    PubMed

    Drenser, Kimberly A; Fecko, Alice; Dailey, Wendy; Trese, Michael T

    2007-02-01

    To describe a striking retinal finding that the authors have only seen in Norrie disease eyes and to determine if a particular genotype corresponds to this dramatic presentation. This is a retrospective, interventional case report of four patients seen in the clinic over a 1-year period. All patients had analysis of the Norrie gene by direct sequencing. All patients presented with a similar retinal appearance of dense stalk tissue, globular dystrophic retina, and peripheral avascular retina with pigmentary changes. Each patient was found to have a mutation in the Norrie gene affecting a cystine residue in the cystine knot domain. The mutations are predicted to disrupt the structure of the protein product, norrin, which is required for activation of the Wnt receptor:beta-catenin pathway. No other vitreoretinopathy that the authors have seen demonstrates this characteristic retinal presentation of severe retinal dysplasia. All four patients were found to have mutations in the Norrie gene which alter the cystine knot motif. Mutations affecting this domain appear to have devastating effects on retinal development and indicate phenotype correlates with mutations affecting the cystine knot domain.

  9. Modern representation of databases on the example of the Catalog of Solar Proton Events in the 23rd Cycle of Solar Activity

    NASA Astrophysics Data System (ADS)

    Ishkov, V. N.; Zabarinskaya, L. P.; Sergeeva, N. A.

    2017-11-01

    The development of studies of solar sources and their effects on the state of the near-Earth space required systematization of the corresponding information in the form of databases and catalogs for the entire time of observation of any geoeffective phenomenon that includes, if possible at the time of creation, all of the characteristics of the phenomena themselves and the sources of these phenomena on the Sun. A uniform presentation of information in the form of a series of similar catalogs that cover long time intervals is of particular importance. The large amount of information collected in such catalogs makes it necessary to use modern methods of its organization and presentation that allow a transition between individual parts of the catalog and a quick search for necessary events and their characteristics, which is implemented in the presented Catalog of Solar Proton Events in the 23rd Cycle of Solar Activity of the sequence of catalogs (six separate issues) that cover the period from 1970 to 2009 (20th-23rd solar cycles).

  10. On the Time Scale of Nocturnal Boundary Layer Cooling in Valleys and Basins and over Plains

    NASA Astrophysics Data System (ADS)

    de Wekker, Stephan F. J.; Whiteman, C. David

    2006-06-01

    Sequences of vertical temperature soundings over flat plains and in a variety of valleys and basins of different sizes and shapes were used to determine cooling-time-scale characteristics in the nocturnal stable boundary layer under clear, undisturbed weather conditions. An exponential function predicts the cumulative boundary layer cooling well. The fitting parameter or time constant in the exponential function characterizes the cooling of the valley atmosphere and is equal to the time required for the cumulative cooling to attain 63.2% of its total nighttime value. The exponential fit finds time constants varying between 3 and 8 h. Calculated time constants are smallest in basins, are largest over plains, and are intermediate in valleys. Time constants were also calculated from air temperature measurements made at various heights on the sidewalls of a small basin. The variation with height of the time constant exhibited a characteristic parabolic shape in which the smallest time constants occurred near the basin floor and on the upper sidewalls of the basin where cooling was governed by cold-air drainage and radiative heat loss, respectively.

  11. Overload characteristics of paper-polypropylene-paper cable

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ernst, A.

    1990-09-01

    The short-time rating of PPP pipe-type cable may be lower than the equivalent paper cable sized to carry the same normal load. The ratings depend on the relative conductor sizes and the maximum allowable conductor temperatures of the insulation. The insulation thermal resistivity may be a significant parameter for overload times of approximately one hour and should be verified for PPP insulation. The thermal capacitance temperature characteristic of PPP insulation is not known. However, the overload ratings are not very sensitive to this parameter. Overload ratings are given for maximum conductor temperatures from 105 C to 130 C. Use ofmore » ratings based on temperatures greater than 105 C would require testing to determine the extent of degradation of the insulation at these higher temperatures. PPP-insulated cable will be thermally stable over a wider range of operating conditions (voltage and current) compared with paper-insulated cable. The short-circuit ratings of PPP- and paper-insulated cable systems and the positive/negative and zero sequence impedances are compared. 21 refs., 22 figs., 5 tabs.« less

  12. Multi-Objective Optimization of Spacecraft Trajectories for Small-Body Coverage Missions

    NASA Technical Reports Server (NTRS)

    Hinckley, David, Jr.; Englander, Jacob; Hitt, Darren

    2017-01-01

    Visual coverage of surface elements of a small-body object requires multiple images to be taken that meet many requirements on their viewing angles, illumination angles, times of day, and combinations thereof. Designing trajectories capable of maximizing total possible coverage may not be useful since the image target sequence and the feasibility of said sequence given the rotation-rate limitations of the spacecraft are not taken into account. This work presents a means of optimizing, in a multi-objective manner, surface target sequences that account for such limitations.

  13. DNA viewed as an out-of-equilibrium structure

    NASA Astrophysics Data System (ADS)

    Provata, A.; Nicolis, C.; Nicolis, G.

    2014-05-01

    The complexity of the primary structure of human DNA is explored using methods from nonequilibrium statistical mechanics, dynamical systems theory, and information theory. A collection of statistical analyses is performed on the DNA data and the results are compared with sequences derived from different stochastic processes. The use of χ2 tests shows that DNA can not be described as a low order Markov chain of order up to r =6. Although detailed balance seems to hold at the level of a binary alphabet, it fails when all four base pairs are considered, suggesting spatial asymmetry and irreversibility. Furthermore, the block entropy does not increase linearly with the block size, reflecting the long-range nature of the correlations in the human genomic sequences. To probe locally the spatial structure of the chain, we study the exit distances from a specific symbol, the distribution of recurrence distances, and the Hurst exponent, all of which show power law tails and long-range characteristics. These results suggest that human DNA can be viewed as a nonequilibrium structure maintained in its state through interactions with a constantly changing environment. Based solely on the exit distance distribution accounting for the nonequilibrium statistics and using the Monte Carlo rejection sampling method, we construct a model DNA sequence. This method allows us to keep both long- and short-range statistical characteristics of the native DNA data. The model sequence presents the same characteristic exponents as the natural DNA but fails to capture spatial correlations and point-to-point details.

  14. Characterization of Cryptocaryon irritans, a parasite isolated from marine fishes in Taiwan.

    PubMed

    Yambot, Apolinario V; Song, Yen-Ling; Sung, Hung-Hung

    2003-03-31

    The ciliated protozoan parasite Cryptocaryon irritans infecting marine fishes in Taiwan is described. Developmental characteristics and sequences of the ribosomal DNA regions such as part of 18 S, the entire first internal transcribed spacer, and part of 5.8 S of various Taiwan isolates of C. irritans were investigated. A total of 5 isolates was obtained from different fish-host species and localities, the majority from cultured fish species. C. irritans from Taiwan is able to shift its developmental characteristics, i.e. from non-adherent to adherent tomonts, from individualistic to aggregate-forming tomonts, from infection of the gills only to infection of the gills and body. Thus, it is not possible to classify strains of C. irritans on the basis of these parameters. Premature tomonts that developed from dead fishes were able to produce theronts that could infect fish host. Isolates from Pingtung and the USA had identical nucleotide sequences while an isolate from Malaysia was identical to an Israel isolate. Percentage variation among pairs of Taiwan isolates showed a higher degree of variation than isolate sequences listed in GenBank. Sequence analysis revealed highly aberrant isolates in Taiwan, and a phylogenetic tree distinguished a marine and a low-salinity variant. C. irritans from marine fishes in Taiwan, therefore, display some characteristics not previously reported. Since manipulation of salinity in brackishwater ponds and marine cage sites is not feasible, there is a need to develop new strategies for the control and prevention of cryptocaryoniasis.

  15. Nucleotide and deduced amino acid sequence of the envelope gene of the Vasilchenko strain of TBE virus; comparison with other flaviviruses.

    PubMed

    Gritsun, T S; Frolova, T V; Pogodina, V V; Lashkevich, V A; Venugopal, K; Gould, E A

    1993-02-01

    A strain of tick-borne encephalitis virus known as Vasilchenko (Vs) exhibits relatively low virulence characteristics in monkeys, Syrian hamsters and humans. The gene encoding the envelope glycoprotein of this virus was cloned and sequenced. Alignment of the sequence with those of other known tick-borne flaviviruses and identification of the recognised amino acid genetic marker EHLPTA confirmed its identity as a member of the TBE complex. However, Vs virus was distinguishable from eastern and western tick-borne serotypes by the presence of the sequence AQQ at amino acid positions 232-234 and also by the presence of other specific amino acid substitutions which may be genetic markers for these viruses and could determine their pathogenetic characteristics. When compared with other tick-borne flaviviruses, Vs virus had 12 unique amino acid substitutions including an additional potential glycosylation site at position (315-317). The Vs virus strain shared closest nucleotide and amino acid homology (84.5% and 95.5% respectively) with western and far eastern strains of tick-borne encephalitis virus. Comparison with the far eastern serotype of tick-borne encephalitis virus, by cross-immunoelectrophoresis of Vs virions and PAGE analysis of the extracted virion proteins, revealed differences in surface charge and virus stability that may account for the different virulence characteristics of Vs virus. These results support and enlarge upon previous data obtained from molecular and serological analysis.

  16. DNA viewed as an out-of-equilibrium structure.

    PubMed

    Provata, A; Nicolis, C; Nicolis, G

    2014-05-01

    The complexity of the primary structure of human DNA is explored using methods from nonequilibrium statistical mechanics, dynamical systems theory, and information theory. A collection of statistical analyses is performed on the DNA data and the results are compared with sequences derived from different stochastic processes. The use of χ^{2} tests shows that DNA can not be described as a low order Markov chain of order up to r=6. Although detailed balance seems to hold at the level of a binary alphabet, it fails when all four base pairs are considered, suggesting spatial asymmetry and irreversibility. Furthermore, the block entropy does not increase linearly with the block size, reflecting the long-range nature of the correlations in the human genomic sequences. To probe locally the spatial structure of the chain, we study the exit distances from a specific symbol, the distribution of recurrence distances, and the Hurst exponent, all of which show power law tails and long-range characteristics. These results suggest that human DNA can be viewed as a nonequilibrium structure maintained in its state through interactions with a constantly changing environment. Based solely on the exit distance distribution accounting for the nonequilibrium statistics and using the Monte Carlo rejection sampling method, we construct a model DNA sequence. This method allows us to keep both long- and short-range statistical characteristics of the native DNA data. The model sequence presents the same characteristic exponents as the natural DNA but fails to capture spatial correlations and point-to-point details.

  17. The Nostoc punctiforme Genome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    John C. Meeks

    2001-12-31

    Nostoc punctiforme is a filamentous cyanobacterium with extensive phenotypic characteristics and a relatively large genome, approaching 10 Mb. The phenotypic characteristics include a photoautotrophic, diazotrophic mode of growth, but N. punctiforme is also facultatively heterotrophic; its vegetative cells have multiple development alternatives, including terminal differentiation into nitrogen-fixing heterocysts and transient differentiation into spore-like akinetes or motile filaments called hormogonia; and N. punctiforme has broad symbiotic competence with fungi and terrestrial plants, including bryophytes, gymnosperms and an angiosperm. The shotgun-sequencing phase of the N. punctiforme strain ATCC 29133 genome has been completed by the Joint Genome Institute. Annotation of an 8.9more » Mb database yielded 7432 open reading frames, 45% of which encode proteins with known or probable known function and 29% of which are unique to N. punctiforme. Comparative analysis of the sequence indicates a genome that is highly plastic and in a state of flux, with numerous insertion sequences and multilocus repeats, as well as genes encoding transposases and DNA modification enzymes. The sequence also reveals the presence of genes encoding putative proteins that collectively define almost all characteristics of cyanobacteria as a group. N. punctiforme has an extensive potential to sense and respond to environmental signals as reflected by the presence of more than 400 genes encoding sensor protein kinases, response regulators and other transcriptional factors. The signal transduction systems and any of the large number of unique genes may play essential roles in the cell differentiation and symbiotic interaction properties of N. punctiforme.« less

  18. Equally parsimonious pathways through an RNA sequence space are not equally likely

    NASA Technical Reports Server (NTRS)

    Lee, Y. H.; DSouza, L. M.; Fox, G. E.

    1997-01-01

    An experimental system for determining the potential ability of sequences resembling 5S ribosomal RNA (rRNA) to perform as functional 5S rRNAs in vivo in the Escherichia coli cellular environment was devised previously. Presumably, the only 5S rRNA sequences that would have been fixed by ancestral populations are ones that were functionally valid, and hence the actual historical paths taken through RNA sequence space during 5S rRNA evolution would have most likely utilized valid sequences. Herein, we examine the potential validity of all sequence intermediates along alternative equally parsimonious trajectories through RNA sequence space which connect two pairs of sequences that had previously been shown to behave as valid 5S rRNAs in E. coli. The first trajectory requires a total of four changes. The 14 sequence intermediates provide 24 apparently equally parsimonious paths by which the transition could occur. The second trajectory involves three changes, six intermediate sequences, and six potentially equally parsimonious paths. In total, only eight of the 20 sequence intermediates were found to be clearly invalid. As a consequence of the position of these invalid intermediates in the sequence space, seven of the 30 possible paths consisted of exclusively valid sequences. In several cases, the apparent validity/invalidity of the intermediate sequences could not be anticipated on the basis of current knowledge of the 5S rRNA structure. This suggests that the interdependencies in RNA sequence space may be more complex than currently appreciated. If ancestral sequences predicted by parsimony are to be regarded as actual historical sequences, then the present results would suggest that they should also satisfy a validity requirement and that, in at least limited cases, this conjecture can be tested experimentally.

  19. Stratigraphic framework and evolution of the Cretaceous continental sequences of the Bauru, Sanfranciscana, and Parecis basins, Brazil

    NASA Astrophysics Data System (ADS)

    Batezelli, Alessandro; Ladeira, Francisco Sergio Bernardes

    2016-01-01

    With the breakup of the supercontinent Gondwana, the South American Plate has undergone an intense process of tectonic restructuring that led to the genesis of the interior basins that encompassed continental sedimentary sequences. The Brazilian Bauru, Sanfranciscana and Parecis basins during Late Cretaceous have had their evolution linked to this process of structuring and therefore have very similar sedimentary characteristics. The purpose of this study is to establish a detailed understanding of alluvial sedimentary processes and architecture within a stratigraphic sequence framework using the concept of the stratigraphic base level or the ratio between the accommodation space and sediment supply. The integration of the stratigraphic and facies data contributed to defining the stratigraphic architecture of the Bauru, Sanfranciscana and Parecis Basins, supporting a model for continental sequences that depicts qualitative changes in the sedimentation rate (S) and accommodation space (A) that occurred during the Cretaceous. This study discusses the origin of the unconformity surfaces (K-0, K-1 and K-1A) that separate Sequences 1, 2A and 2B and the sedimentary characteristics of the Bauru, Sanfranciscana and Parecis Basins from the Aptian to the Maastrichtian, comparing the results with other Cretaceous Brazilian basins. The lower Cretaceous Sequence 1 (Caiuá and Areado groups) is interpreted as a low-accommodation systems tract compound by fluvial and aeolian systems. The upper Cretaceous lacustrine, braided river-dominated alluvial fan and aeolian systems display characteristics of the evolution from high-to low-accommodation systems tracts (Sequences 2A and 2B). Unconformity K-0 is related to the origin of the Bauru Basin itself in the Early Cretaceous. In Sanfranciscana and Parecis basins, the unconformity K-0 marks the contact between aeolian deposits from Lower Cretaceous and Upper Cretaceous alluvial systems (Sequences 1 and 2). Unconformity K-1, which was generated in the Late Cretaceous, is related to an increase of the A/S ratio, whereas Unconformity K-1A is the result of the decrease in the A/S ratio. Unconformity K-1A bound Sequence 2A (lacustrine and fluvial systems) and Sequence 2B (alluvial deposits) in Bauru Basin whereas in the Sanfranciscana and Parecis basins this unconformity marks the transition from alluvial system to aeolian system (Sequences 2A and 2B). Changes in depositional style in both basins correspond to two distinct tectonic moments occurring within the South American plate. The first associated with post-volcanic thermal subsidence of the Early Cretaceous (Serra Geral and Tapirapuã volcanismos), and the second moment associated with the uplift occurred in the Late Cretaceous (Alto Paranaíba, Vilhena and Serra Formosa Arcs).

  20. Identification of Delta5-fatty acid desaturase from the cellular slime mold dictyostelium discoideum.

    PubMed

    Saito, T; Ochiai, H

    1999-10-01

    cDNA fragments putatively encoding amino acid sequences characteristic of the fatty acid desaturase were obtained using expressed sequence tag (EST) information of the Dictyostelium cDNA project. Using this sequence, we have determined the cDNA sequence and genomic sequence of a desaturase. The cloned cDNA is 1489 nucleotides long and the deduced amino acid sequence comprised 464 amino acid residues containing an N-terminal cytochrome b5 domain. The whole sequence was 38.6% identical to the initially identified Delta5-desaturase of Mortierella alpina. We have confirmed its function as Delta5-desaturase by over expression mutation in D. discoideum and also the gain of function mutation in the yeast Saccharomyces cerevisiae. Analysis of the lipids from transformed D. discoideum and yeast demonstrated the accumulation of Delta5-desaturated products. This is the first report concering fatty acid desaturase in cellular slime molds.

  1. HIV-1 sequence variation between isolates from mother-infant transmission pairs

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wike, C.M.; Daniels, M.R.; Furtado, M.

    1991-12-31

    To examine the sequence diversity of human immunodeficiency virus type 1 (HIV-1) between known transmission sets, sequences from the V3 and V4-V5 region of the env gene from 4 mother-infant pairs were analyzed. The mean interpatient sequence variation between isolates from linked mother-infant pairs was comparable to the sequence diversity found between isolates from other close contacts. The mean intrapatient variation was significantly less in the infants` isolates then the isolates from both their mothers and other characterized intrapatient sequence sets. In addition, a distinct and characteristic difference in the glycosylation pattern preceding the V3 loop was found between eachmore » linked transmission pair. These findings indicate that selection of specific genotypic variants, which may play a role in some direct transmission sets, and the duration of infection are important factors in the degree of diversity seen between the sequence sets.« less

  2. Draft Genome Sequence of Lactobacillus pobuzihii E100301T

    PubMed Central

    Chiu, Chi-ming; Chang, Chi-huan; Pan, Shwu-fen; Wu, Hui-chung; Li, Shiao-wen; Chang, Chuan-hsiung; Lee, Yun-shien; Chiang, Chih-ming

    2013-01-01

    Lactobacillus pobuzihii E100301T is a novel Lactobacillus species previously isolated from pobuzihi (fermented cummingcordia) in Taiwan. Phylogenetically, this strain is closest to Lactobacillus acidipiscis, but its phenotypic characteristics can be clearly distinguished from those of L. acidipiscis. We present the draft genome sequence of strain L. pobuzihii E100301T. PMID:23661478

  3. Complete Genome Sequence of the Dairy Isolate Lactobacillus acidipiscis ACA-DC 1533

    PubMed Central

    Kazou, Maria; Alexandraki, Voula; Pot, Bruno; Tsakalidou, Effie

    2017-01-01

    ABSTRACT Lactobacillus acidipiscis is a Gram-positive lactic acid bacterium belonging to the Lactobacillus salivarius clade. Here, we present the first complete genome sequence of L. acidipiscis isolated from traditional Greek Kopanisti cheese. Strain ACA-DC 1533 may play a key role in the strong organoleptic characteristics of Kopanisti cheese. PMID:28126948

  4. Genome Sequence of Chinese Porcine Parvovirus Strain PPV2010

    PubMed Central

    Cui, Jin; Wang, Xin; Ren, Yudong; Cui, Shangjin; Li, Guangxing

    2012-01-01

    Porcine parvovirus (PPV) isolate PPV2010 has recently emerged in China. Herein, we analyze the complete genome sequence of PPV2010. Our results indicate that the genome of PPV2010 bears mixed characteristics of virulent PPV and vaccine strains. Importantly, PPV2010 has the potential to be a naturally attenuated candidate vaccine strain. PMID:22282333

  5. Raman-based system for DNA sequencing-mapping and other separations

    DOEpatents

    Vo-Dinh, T.

    1994-04-26

    DNA sequencing and mapping are performed by using a Raman spectrometer with a surface enhanced Raman scattering (SERS) substrate to enhance the Raman signal. A SERS label is attached to a DNA fragment and then analyzed with the Raman spectrometer to identify the DNA fragment according to characteristics of the Raman spectrum generated. 11 figures.

  6. Identification of genotyping-by-sequencing sequence tags associated with milling performance and end-use quality traits in hard red spring wheat (Triticum aestivum L.)

    USDA-ARS?s Scientific Manuscript database

    Wheat quality is defined by culinary end-uses and processing characteristics. Wheat breeders are interested to identify quantitative trait loci for grain, milling, and end-use quality traits because it is imperative to understand the genetic complexity underlying quantitatively inherited traits to ...

  7. Draft Genome Sequences of Two Aspergillus fumigatus Strains, Isolated from the International Space Station.

    PubMed

    Singh, Nitin Kumar; Blachowicz, Adriana; Checinska, Aleksandra; Wang, Clay; Venkateswaran, Kasthuri

    2016-07-14

    Draft genome sequences of Aspergillus fumigatus strains (ISSFT-021 and IF1SW-F4), opportunistic pathogens isolated from the International Space Station (ISS), were assembled to facilitate investigations of the nature of the virulence characteristics of the ISS strains to other clinical strains isolated on Earth. Copyright © 2016 Singh et al.

  8. Pulseq: A rapid and hardware-independent pulse sequence prototyping framework.

    PubMed

    Layton, Kelvin J; Kroboth, Stefan; Jia, Feng; Littin, Sebastian; Yu, Huijun; Leupold, Jochen; Nielsen, Jon-Fredrik; Stöcker, Tony; Zaitsev, Maxim

    2017-04-01

    Implementing new magnetic resonance experiments, or sequences, often involves extensive programming on vendor-specific platforms, which can be time consuming and costly. This situation is exacerbated when research sequences need to be implemented on several platforms simultaneously, for example, at different field strengths. This work presents an alternative programming environment that is hardware-independent, open-source, and promotes rapid sequence prototyping. A novel file format is described to efficiently store the hardware events and timing information required for an MR pulse sequence. Platform-dependent interpreter modules convert the file to appropriate instructions to run the sequence on MR hardware. Sequences can be designed in high-level languages, such as MATLAB, or with a graphical interface. Spin physics simulation tools are incorporated into the framework, allowing for comparison between real and virtual experiments. Minimal effort is required to implement relatively advanced sequences using the tools provided. Sequences are executed on three different MR platforms, demonstrating the flexibility of the approach. A high-level, flexible and hardware-independent approach to sequence programming is ideal for the rapid development of new sequences. The framework is currently not suitable for large patient studies or routine scanning although this would be possible with deeper integration into existing workflows. Magn Reson Med 77:1544-1552, 2017. © 2016 International Society for Magnetic Resonance in Medicine. © 2016 International Society for Magnetic Resonance in Medicine.

  9. AfterQC: automatic filtering, trimming, error removing and quality control for fastq data.

    PubMed

    Chen, Shifu; Huang, Tanxiao; Zhou, Yanqing; Han, Yue; Xu, Mingyan; Gu, Jia

    2017-03-14

    Some applications, especially those clinical applications requiring high accuracy of sequencing data, usually have to face the troubles caused by unavoidable sequencing errors. Several tools have been proposed to profile the sequencing quality, but few of them can quantify or correct the sequencing errors. This unmet requirement motivated us to develop AfterQC, a tool with functions to profile sequencing errors and correct most of them, plus highly automated quality control and data filtering features. Different from most tools, AfterQC analyses the overlapping of paired sequences for pair-end sequencing data. Based on overlapping analysis, AfterQC can detect and cut adapters, and furthermore it gives a novel function to correct wrong bases in the overlapping regions. Another new feature is to detect and visualise sequencing bubbles, which can be commonly found on the flowcell lanes and may raise sequencing errors. Besides normal per cycle quality and base content plotting, AfterQC also provides features like polyX (a long sub-sequence of a same base X) filtering, automatic trimming and K-MER based strand bias profiling. For each single or pair of FastQ files, AfterQC filters out bad reads, detects and eliminates sequencer's bubble effects, trims reads at front and tail, detects the sequencing errors and corrects part of them, and finally outputs clean data and generates HTML reports with interactive figures. AfterQC can run in batch mode with multiprocess support, it can run with a single FastQ file, a single pair of FastQ files (for pair-end sequencing), or a folder for all included FastQ files to be processed automatically. Based on overlapping analysis, AfterQC can estimate the sequencing error rate and profile the error transform distribution. The results of our error profiling tests show that the error distribution is highly platform dependent. Much more than just another new quality control (QC) tool, AfterQC is able to perform quality control, data filtering, error profiling and base correction automatically. Experimental results show that AfterQC can help to eliminate the sequencing errors for pair-end sequencing data to provide much cleaner outputs, and consequently help to reduce the false-positive variants, especially for the low-frequency somatic mutations. While providing rich configurable options, AfterQC can detect and set all the options automatically and require no argument in most cases.

  10. Comparison of single-use and reusable metal laryngoscope blades for orotracheal intubation during rapid sequence induction of anesthesia: a multicenter cluster randomized study.

    PubMed

    Amour, Julien; Le Manach, Yannick Le; Borel, Marie; Lenfant, François; Nicolas-Robin, Armelle; Carillion, Aude; Ripart, Jacques; Riou, Bruno; Langeron, Olivier

    2010-02-01

    Single-use metal laryngoscope blades are cheaper and carry a lower risk of infection than reusable metal blades. The authors compared single-use and reusable metal blades during rapid sequence induction of anesthesia in a multicenter cluster randomized trial. One thousand seventy-two adult patients undergoing general anesthesia under emergency conditions and requiring rapid sequence induction were randomly assigned on a weekly basis to either single-use or reusable metal blades (cluster randomization). After induction, a 60-s period was allowed to complete intubation. In the case of failed intubation, a second attempt was performed using the opposite type of blade. The primary endpoint was the rate of failed intubation, and the secondary endpoints were the incidence of complications (oxygen desaturation, lung aspiration, and/or oropharynx trauma) and the Cormack and Lehane score. Both groups were similar in their main characteristics, including the risk factors for difficult intubation. The rate of failed intubation was significantly decreased with single-use metal blades at the first attempt compared with reusable blades (2.8 vs. 5.4%, P < 0.05). In addition, the proportion of grades III and IV in Cormack and Lehane score were also significantly decreased with single-use metal blades (6 vs. 10%, P < 0.05). The global complication rate did not reach statistical significance, although the same trend was noted (6.8% vs. 11.5%, P = not significant). An investigator survey and a measure of illumination pointed that illumination might have been responsible for this result. The single-use metal blade was more efficient than a reusable metal blade in rapid sequence induction of anesthesia.

  11. tRNomics: analysis of tRNA genes from 50 genomes of Eukarya, Archaea, and Bacteria reveals anticodon-sparing strategies and domain-specific features.

    PubMed Central

    Marck, Christian; Grosjean, Henri

    2002-01-01

    From 50 genomes of the three domains of life (7 eukarya, 13 archaea, and 30 bacteria), we extracted, analyzed, and compared over 4,000 sequences corresponding to cytoplasmic, nonorganellar tRNAs. For each genome, the complete set of tRNAs required to read the 61 sense codons was identified, which permitted revelation of three major anticodon-sparing strategies. Other features and sequence peculiarities analyzed are the following: (1) fit to the standard cloverleaf structure, (2) characteristic consensus sequences for elongator and initiator tDNAs, (3) frequencies of bases at each sequence position, (4) type and frequencies of conserved 2D and 3D base pairs, (5) anticodon/tDNA usages and anticodon-sparing strategies, (6) identification of the tRNA-Ile with anticodon CAU reading AUA, (7) size of variable arm, (8) occurrence and location of introns, (9) occurrence of 3'-CCA and 5'-extra G encoded at the tDNA level, and (10) distribution of the tRNA genes in genomes and their mode of transcription. Among all tRNA isoacceptors, we found that initiator tDNA-iMet is the most conserved across the three domains, yet domain-specific signatures exist. Also, according to which tRNA feature is considered (5'-extra G encoded in tDNAs-His, AUA codon read by tRNA-Ile with anticodon CAU, presence of intron, absence of "two-out-of-three" reading mode and short V-arm in tDNA-Tyr) Archaea sequester either with Bacteria or Eukarya. No common features between Eukarya and Bacteria not shared with Archaea could be unveiled. Thus, from the tRNomic point of view, Archaea appears as an "intermediate domain" between Eukarya and Bacteria. PMID:12403461

  12. Complete chloroplast genome sequences of Praxelis (Eupatorium catarium Veldkamp), an important invasive species.

    PubMed

    Zhang, Ying; Li, Lei; Yan, Ting Liang; Liu, Qiang

    2014-10-01

    Praxelis (Eupatorium catarium Veldkamp) is a new hazardous invasive plant species that has caused serious economic losses and environmental damage in the Northern hemisphere tropical and subtropical regions. Although previous studies focused on detecting the biological characteristics of this plant to prevent its expansion, little effort has been made to understand the impact of Praxelis on the ecosystem in an evolutionary process. The genetic information of Praxelis is required for further phylogenetic identification and evolutionary studies. Here, we report the complete Praxelis chloroplast (cp) genome sequence. The Praxelis chloroplast genome is 151,410 bp in length including a small single-copy region (18,547 bp) and a large single-copy region (85,311 bp) separated by a pair of inverted repeats (IRs; 23,776 bp). The genome contains 85 unique and 18 duplicated genes in the IR region. The gene content and organization are similar to other Asteraceae tribe cp genomes. We also analyzed the whole cp genome sequence, repeat structure, codon usage, contraction of the IR and gene structure/organization features between native and invasive Asteraceae plants, in order to understand the evolution of organelle genomes between native and invasive Asteraceae. Comparative analysis identified the 14 markers containing greater than 2% parsimony-informative characters, indicating that they are potential informative markers for barcoding and phylogenetic analysis. Moreover, a sister relationship between Praxelis and seven other species in Asteraceae was found based on phylogenetic analysis of 28 protein-coding sequences. Complete cp genome information is useful for plant phylogenetic and evolutionary studies within this invasive species and also within the Asteraceae family. Copyright © 2014 Elsevier B.V. All rights reserved.

  13. A Dual Interaction Between the 5'- and 3'-Ends of the Melon Necrotic Spot Virus (MNSV) RNA Genome Is Required for Efficient Cap-Independent Translation.

    PubMed

    Miras, Manuel; Rodríguez-Hernández, Ana M; Romero-López, Cristina; Berzal-Herranz, Alfredo; Colchero, Jaime; Aranda, Miguel A; Truniger, Verónica

    2018-01-01

    In eukaryotes, the formation of a 5'-cap and 3'-poly(A) dependent protein-protein bridge is required for translation of its mRNAs. In contrast, several plant virus RNA genomes lack both of these mRNA features, but instead have a 3'-CITE (for cap-independent translation enhancer), a RNA element present in their 3'-untranslated region that recruits translation initiation factors and is able to control its cap-independent translation. For several 3'-CITEs, direct RNA-RNA long-distance interactions based on sequence complementarity between the 5'- and 3'-ends are required for efficient translation, as they bring the translation initiation factors bound to the 3'-CITE to the 5'-end. For the carmovirus melon necrotic spot virus (MNSV), a 3'-CITE has been identified, and the presence of its 5'-end in cis has been shown to be required for its activity. Here, we analyze the secondary structure of the 5'-end of the MNSV RNA genome and identify two highly conserved nucleotide sequence stretches that are complementary to the apical loop of its 3'-CITE. In in vivo cap-independent translation assays with mutant constructs, by disrupting and restoring sequence complementarity, we show that the interaction between the 3'-CITE and at least one complementary sequence in the 5'-end is essential for virus RNA translation, although efficient virus translation and multiplication requires both connections. The complementary sequence stretches are invariant in all MNSV isolates, suggesting that the dual 5'-3' RNA:RNA interactions are required for optimal MNSV cap-independent translation and multiplication.

  14. Sequence diagrams and the presentation of structural and evolutionary relationships among proteins.

    PubMed

    Thomas, B R

    1975-01-01

    Protein sequences mapped on two-dimensional diagrams show characteristic patterns that should be of value in visualising sequence information and in distinguishing simpler structures. A convenient map form for comparative purposes is the alpha-helix diagram with aminoacid distribution analogous to the surface of an alpha-helix oriented so that an alpha-helix structure corresponds on the diagram to a vertical band 3.6 residues wide. The sequence diagram for an alpha-keratin, high-sulphur protein suggests a new form of polypeptide helix based on a repeating unit of five which may be an important component of alpha-keratin fibres.

  15. Nanopore-CMOS Interfaces for DNA Sequencing

    PubMed Central

    Magierowski, Sebastian; Huang, Yiyun; Wang, Chengjie; Ghafar-Zadeh, Ebrahim

    2016-01-01

    DNA sequencers based on nanopore sensors present an opportunity for a significant break from the template-based incumbents of the last forty years. Key advantages ushered by nanopore technology include a simplified chemistry and the ability to interface to CMOS technology. The latter opportunity offers substantial promise for improvement in sequencing speed, size and cost. This paper reviews existing and emerging means of interfacing nanopores to CMOS technology with an emphasis on massively-arrayed structures. It presents this in the context of incumbent DNA sequencing techniques, reviews and quantifies nanopore characteristics and models and presents CMOS circuit methods for the amplification of low-current nanopore signals in such interfaces. PMID:27509529

  16. Nanopore-CMOS Interfaces for DNA Sequencing.

    PubMed

    Magierowski, Sebastian; Huang, Yiyun; Wang, Chengjie; Ghafar-Zadeh, Ebrahim

    2016-08-06

    DNA sequencers based on nanopore sensors present an opportunity for a significant break from the template-based incumbents of the last forty years. Key advantages ushered by nanopore technology include a simplified chemistry and the ability to interface to CMOS technology. The latter opportunity offers substantial promise for improvement in sequencing speed, size and cost. This paper reviews existing and emerging means of interfacing nanopores to CMOS technology with an emphasis on massively-arrayed structures. It presents this in the context of incumbent DNA sequencing techniques, reviews and quantifies nanopore characteristics and models and presents CMOS circuit methods for the amplification of low-current nanopore signals in such interfaces.

  17. LongISLND: in silico sequencing of lengthy and noisy datatypes

    PubMed Central

    Lau, Bayo; Mohiyuddin, Marghoob; Mu, John C.; Fang, Li Tai; Bani Asadi, Narges; Dallett, Carolina; Lam, Hugo Y. K.

    2016-01-01

    Summary: LongISLND is a software package designed to simulate sequencing data according to the characteristics of third generation, single-molecule sequencing technologies. The general software architecture is easily extendable, as demonstrated by the emulation of Pacific Biosciences (PacBio) multi-pass sequencing with P5 and P6 chemistries, producing data in FASTQ, H5, and the latest PacBio BAM format. We demonstrate its utility by downstream processing with consensus building and variant calling. Availability and Implementation: LongISLND is implemented in Java and available at http://bioinform.github.io/longislnd Contact: hugo.lam@roche.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27667791

  18. Large-scale DNA Barcode Library Generation for Biomolecule Identification in High-throughput Screens.

    PubMed

    Lyons, Eli; Sheridan, Paul; Tremmel, Georg; Miyano, Satoru; Sugano, Sumio

    2017-10-24

    High-throughput screens allow for the identification of specific biomolecules with characteristics of interest. In barcoded screens, DNA barcodes are linked to target biomolecules in a manner allowing for the target molecules making up a library to be identified by sequencing the DNA barcodes using Next Generation Sequencing. To be useful in experimental settings, the DNA barcodes in a library must satisfy certain constraints related to GC content, homopolymer length, Hamming distance, and blacklisted subsequences. Here we report a novel framework to quickly generate large-scale libraries of DNA barcodes for use in high-throughput screens. We show that our framework dramatically reduces the computation time required to generate large-scale DNA barcode libraries, compared with a naїve approach to DNA barcode library generation. As a proof of concept, we demonstrate that our framework is able to generate a library consisting of one million DNA barcodes for use in a fragment antibody phage display screening experiment. We also report generating a general purpose one billion DNA barcode library, the largest such library yet reported in literature. Our results demonstrate the value of our novel large-scale DNA barcode library generation framework for use in high-throughput screening applications.

  19. Antiwhirl PDC bits increased penetration rates in Alberta drilling. [Polycrystalline Diamond Compact

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bobrosky, D.; Osmak, G.

    1993-07-05

    The antiwhirl PDC bits and an inhibitive mud system contributed to the quicker drilling of the time-sensitive shales. The hole washouts in the intermediate section were dramatically reduced, resulting in better intermediate casing cement jobs. Also, the use of antirotation PDC-drillable cementing plugs eliminated the need to drill out plugs and float equipment with a steel tooth bit and then trip for the PDC bit. By using an antiwhirl PDC bit, at least one trip was eliminated in the intermediate section. Offset data indicated that two to six conventional bits would have been required to drill the intermediate hole interval.more » The PDC bit was rebuildable and therefore rerunnable even after being used on five wells. In each instance, the cost of replacing chipped cutters was less than the cost of a new insert roller cone bit. The paper describes the antiwhirl bits; the development of the bits; and their application in a clastic sequence, a carbonate sequence, and the Shekilie oil field; the improvement in the rate of penetration; the selection of bottom hole assemblies; washout problems; and drill-out characteristics.« less

  20. Multimodal biometric digital watermarking on immigrant visas for homeland security

    NASA Astrophysics Data System (ADS)

    Sasi, Sreela; Tamhane, Kirti C.; Rajappa, Mahesh B.

    2004-08-01

    Passengers with immigrant Visa's are a major concern to the International Airports due to the various fraud operations identified. To curb tampering of genuine Visa, the Visa's should contain human identification information. Biometric characteristic is a common and reliable way to authenticate the identity of an individual [1]. A Multimodal Biometric Human Identification System (MBHIS) that integrates iris code, DNA fingerprint, and the passport number on the Visa photograph using digital watermarking scheme is presented. Digital Watermarking technique is well suited for any system requiring high security [2]. Ophthalmologists [3], [4], [5] suggested that iris scan is an accurate and nonintrusive optical fingerprint. DNA sequence can be used as a genetic barcode [6], [7]. While issuing Visa at the US consulates, the DNA sequence isolated from saliva, the iris code and passport number shall be digitally watermarked in the Visa photograph. This information is also recorded in the 'immigrant database'. A 'forward watermarking phase' combines a 2-D DWT transformed digital photograph with the personal identification information. A 'detection phase' extracts the watermarked information from this VISA photograph at the port of entry, from which iris code can be used for identification and DNA biometric for authentication, if an anomaly arises.

  1. KungFQ: a simple and powerful approach to compress fastq files.

    PubMed

    Grassi, Elena; Di Gregorio, Federico; Molineris, Ivan

    2012-01-01

    Nowadays storing data derived from deep sequencing experiments has become pivotal and standard compression algorithms do not exploit in a satisfying manner their structure. A number of reference-based compression algorithms have been developed but they are less adequate when approaching new species without fully sequenced genomes or nongenomic data. We developed a tool that takes advantages of fastq characteristics and encodes them in a binary format optimized in order to be further compressed with standard tools (such as gzip or lzma). The algorithm is straightforward and does not need any external reference file, it scans the fastq only once and has a constant memory requirement. Moreover, we added the possibility to perform lossy compression, losing some of the original information (IDs and/or qualities) but resulting in smaller files; it is also possible to define a quality cutoff under which corresponding base calls are converted to N. We achieve 2.82 to 7.77 compression ratios on various fastq files without losing information and 5.37 to 8.77 losing IDs, which are often not used in common analysis pipelines. In this paper, we compare the algorithm performance with known tools, usually obtaining higher compression levels.

  2. Fingerprint multicast in secure video streaming.

    PubMed

    Zhao, H Vicky; Liu, K J Ray

    2006-01-01

    Digital fingerprinting is an emerging technology to protect multimedia content from illegal redistribution, where each distributed copy is labeled with unique identification information. In video streaming, huge amount of data have to be transmitted to a large number of users under stringent latency constraints, so the bandwidth-efficient distribution of uniquely fingerprinted copies is crucial. This paper investigates the secure multicast of anticollusion fingerprinted video in streaming applications and analyzes their performance. We first propose a general fingerprint multicast scheme that can be used with most spread spectrum embedding-based multimedia fingerprinting systems. To further improve the bandwidth efficiency, we explore the special structure of the fingerprint design and propose a joint fingerprint design and distribution scheme. From our simulations, the two proposed schemes can reduce the bandwidth requirement by 48% to 87%, depending on the number of users, the characteristics of video sequences, and the network and computation constraints. We also show that under the constraint that all colluders have the same probability of detection, the embedded fingerprints in the two schemes have approximately the same collusion resistance. Finally, we propose a fingerprint drift compensation scheme to improve the quality of the reconstructed sequences at the decoder's side without introducing extra communication overhead.

  3. The human genome: Some assembly required. Final report

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    NONE

    1994-12-31

    The Human Genome Project promises to be one of the most rewarding endeavors in modern biology. The cost and the ethical and social implications, however, have made this project the source of considerable debate both in the scientific community and in the public at large. The 1994 Graduate Student Symposium addresses the scientific merits of the project, the technical issues involved in accomplishing the task, as well as the medical and social issues which stem from the wealth of knowledge which the Human Genome Project will help create. To this end, speakers were brought together who represent the diverse areasmore » of expertise characteristic of this multidisciplinary project. The keynote speaker addresses the project`s motivations and goals in the larger context of biological and medical sciences. The first two sessions address relevant technical issues, data collection with a focus on high-throughput sequencing methods and data analysis with an emphasis on identification of coding sequences. The third session explores recent advances in the understanding of genetic diseases and possible routes to treatment. Finally, the last session addresses some of the ethical, social and legal issues which will undoubtedly arise from having a detailed knowledge of the human genome.« less

  4. A Method for the Control of Multigrasp Myoelectric Prosthetic Hands

    PubMed Central

    Dalley, Skyler Ashton; Varol, Huseyin Atakan; Goldfarb, Michael

    2012-01-01

    This paper presents the design and preliminary experimental validation of a multigrasp myoelectric controller. The described method enables direct and proportional control of multigrasp prosthetic hand motion among nine characteristic postures using two surface electromyography electrodes. To assess the efficacy of the control method, five nonamputee subjects utilized the multigrasp myoelectric controller to command the motion of a virtual prosthesis between random sequences of target hand postures in a series of experimental trials. For comparison, the same subjects also utilized a data glove, worn on their native hand, to command the motion of the virtual prosthesis for similar sequences of target postures during each trial. The time required to transition from posture to posture and the percentage of correctly completed transitions were evaluated to characterize the ability to control the virtual prosthesis using each method. The average overall transition times across all subjects were found to be 1.49 and 0.81 s for the multigrasp myoelectric controller and the native hand, respectively. The average transition completion rates for both were found to be the same (99.2%). Supplemental videos demonstrate the virtual prosthesis experiments, as well as a preliminary hardware implementation. PMID:22180515

  5. MatureP: prediction of secreted proteins with exclusive information from their mature regions.

    PubMed

    Orfanoudaki, Georgia; Markaki, Maria; Chatzi, Katerina; Tsamardinos, Ioannis; Economou, Anastassios

    2017-06-12

    More than a third of the cellular proteome is non-cytoplasmic. Most secretory proteins use the Sec system for export and are targeted to membranes using signal peptides and mature domains. To specifically analyze bacterial mature domain features, we developed MatureP, a classifier that predicts secretory sequences through features exclusively computed from their mature domains. MatureP was trained using Just Add Data Bio, an automated machine learning tool. Mature domains are predicted efficiently with ~92% success, as measured by the Area Under the Receiver Operating Characteristic Curve (AUC). Predictions were validated using experimental datasets of mutated secretory proteins. The features selected by MatureP reveal prominent differences in amino acid content between secreted and cytoplasmic proteins. Amino-terminal mature domain sequences have enhanced disorder, more hydroxyl and polar residues and less hydrophobics. Cytoplasmic proteins have prominent amino-terminal hydrophobic stretches and charged regions downstream. Presumably, secretory mature domains comprise a distinct protein class. They balance properties that promote the necessary flexibility required for the maintenance of non-folded states during targeting and secretion with the ability of post-secretion folding. These findings provide novel insight in protein trafficking, sorting and folding mechanisms and may benefit protein secretion biotechnology.

  6. Characterization of Reconstructed Ancestral Proteins Suggests a Change in Temperature of the Ancient Biosphere.

    PubMed

    Akanuma, Satoshi

    2017-08-06

    Understanding the evolution of ancestral life, and especially the ability of some organisms to flourish in the variable environments experienced in Earth's early biosphere, requires knowledge of the characteristics and the environment of these ancestral organisms. Information about early life and environmental conditions has been obtained from fossil records and geological surveys. Recent advances in phylogenetic analysis, and an increasing number of protein sequences available in public databases, have made it possible to infer ancestral protein sequences possessed by ancient organisms. However, the in silico studies that assess the ancestral base content of ribosomal RNAs, the frequency of each amino acid in ancestral proteins, and estimate the environmental temperatures of ancient organisms, show conflicting results. The characterization of ancestral proteins reconstructed in vitro suggests that ancient organisms had very thermally stable proteins, and therefore were thermophilic or hyperthermophilic. Experimental data supports the idea that only thermophilic ancestors survived the catastrophic increase in temperature of the biosphere that was likely associated with meteorite impacts during the early history of Earth. In addition, by expanding the timescale and including more ancestral proteins for reconstruction, it appears as though the Earth's surface temperature gradually decreased over time, from Archean to present.

  7. Effect of manufacturing process sequence on the corrosion resistance characteristics of coated metallic bipolar plates

    NASA Astrophysics Data System (ADS)

    Dur, Ender; Cora, Ömer Necati; Koç, Muammer

    2014-01-01

    Metallic bipolar plate (BPP) with high corrosion and low contact resistance, durability, strength, low cost, volume, and weight requirements is one of the critical parts of the PEMFC. This study is dedicated to understand the effect of the process sequence (manufacturing then coating vs. coating then manufacturing) on the corrosion resistance of coated metallic bipolar plates. To this goal, three different PVD coatings (titanium nitride (TiN), chromium nitride (CrN), zirconium nitride (ZrN)), with three thicknesses, (0.1, 0.5, 1 μm) were applied on BPPs made of 316L stainless steel alloy before and after two types of manufacturing (i.e., stamping or hydroforming). Corrosion test results indicated that ZrN coating exhibited the best corrosion protection while the performance of TiN coating was the lowest among the tested coatings and thicknesses. For most of the cases tested, in which coating was applied before manufacturing, occurrence of corrosion was found to be more profound than the case where coating was applied after manufacturing. Increasing the coating thickness was found to improve the corrosion resistance. It was also revealed that hydroformed BPPs performed slightly better than stamped BPPs in terms of the corrosion behavior.

  8. Eddy current correction in volume-localized MR spectroscopy

    NASA Technical Reports Server (NTRS)

    Lin, C.; Wendt, R. E. 3rd; Evans, H. J.; Rowe, R. M.; Hedrick, T. D.; LeBlanc, A. D.

    1994-01-01

    The quality of volume-localized magnetic resonance spectroscopy is affected by eddy currents caused by gradient switching. Eddy currents can be reduced with improved gradient systems; however, it has been suggested that the distortion due to eddy currents can be compensated for during postprocessing with a single-frequency reference signal. The authors propose modifying current techniques for acquiring the single-frequency reference signal by using relaxation weighting to reduce interference from components that cannot be eliminated by digital filtering alone. Additional sequences with T1 or T2 weighting for reference signal acquisition are shown to have the same eddy current characteristics as the original signal without relaxation weighting. The authors also studied a new eddy current correction method that does not require a single-frequency reference signal. This method uses two free induction decays (FIDs) collected from the same volume with two sequences with opposite gradients. Phase errors caused by eddy currents are opposite in these two FIDs and can be canceled completely by combining the FIDs. These methods were tested in a phantom. Eddy current distortions were corrected, allowing quantitative measurement of structures such as the -CH = CH- component, which is otherwise undetectable.

  9. Adaptation of the Haloarcula hispanica CRISPR-Cas system to a purified virus strictly requires a priming process

    PubMed Central

    Li, Ming; Wang, Rui; Zhao, Dahe; Xiang, Hua

    2014-01-01

    The clustered regularly interspaced short palindromic repeat (CRISPR)-Cas system mediates adaptive immunity against foreign nucleic acids in prokaryotes. However, efficient adaptation of a native CRISPR to purified viruses has only been observed for the type II-A system from a Streptococcus thermophilus industry strain, and rarely reported for laboratory strains. Here, we provide a second native system showing efficient adaptation. Infected by a newly isolated virus HHPV-2, Haloarcula hispanica type I-B CRISPR system acquired spacers discriminatively from viral sequences. Unexpectedly, in addition to Cas1, Cas2 and Cas4, this process also requires Cas3 and at least partial Cascade proteins, which are involved in interference and/or CRISPR RNA maturation. Intriguingly, a preexisting spacer partially matching a viral sequence is also required, and spacer acquisition from upstream and downstream sequences of its target sequence (i.e. priming protospacer) shows different strand bias. These evidences strongly indicate that adaptation in this system strictly requires a priming process. This requirement, if validated also true for other CRISPR systems as implied by our bioinformatic analysis, may help to explain failures to observe efficient adaptation to purified viruses in many laboratory strains, and the discrimination mechanism at the adaptation level that has confused scientists for years. PMID:24265226

  10. Targeted RNA-Sequencing with Competitive Multiplex-PCR Amplicon Libraries

    PubMed Central

    Blomquist, Thomas M.; Crawford, Erin L.; Lovett, Jennie L.; Yeo, Jiyoun; Stanoszek, Lauren M.; Levin, Albert; Li, Jia; Lu, Mei; Shi, Leming; Muldrew, Kenneth; Willey, James C.

    2013-01-01

    Whole transcriptome RNA-sequencing is a powerful tool, but is costly and yields complex data sets that limit its utility in molecular diagnostic testing. A targeted quantitative RNA-sequencing method that is reproducible and reduces the number of sequencing reads required to measure transcripts over the full range of expression would be better suited to diagnostic testing. Toward this goal, we developed a competitive multiplex PCR-based amplicon sequencing library preparation method that a) targets only the sequences of interest and b) controls for inter-target variation in PCR amplification during library preparation by measuring each transcript native template relative to a known number of synthetic competitive template internal standard copies. To determine the utility of this method, we intentionally selected PCR conditions that would cause transcript amplification products (amplicons) to converge toward equimolar concentrations (normalization) during library preparation. We then tested whether this approach would enable accurate and reproducible quantification of each transcript across multiple library preparations, and at the same time reduce (through normalization) total sequencing reads required for quantification of transcript targets across a large range of expression. We demonstrate excellent reproducibility (R2 = 0.997) with 97% accuracy to detect 2-fold change using External RNA Controls Consortium (ERCC) reference materials; high inter-day, inter-site and inter-library concordance (R2 = 0.97–0.99) using FDA Sequencing Quality Control (SEQC) reference materials; and cross-platform concordance with both TaqMan qPCR (R2 = 0.96) and whole transcriptome RNA-sequencing following “traditional” library preparation using Illumina NGS kits (R2 = 0.94). Using this method, sequencing reads required to accurately quantify more than 100 targeted transcripts expressed over a 107-fold range was reduced more than 10,000-fold, from 2.3×109 to 1.4×105 sequencing reads. These studies demonstrate that the competitive multiplex-PCR amplicon library preparation method presented here provides the quality control, reproducibility, and reduced sequencing reads necessary for development and implementation of targeted quantitative RNA-sequencing biomarkers in molecular diagnostic testing. PMID:24236095

  11. Genome-wide determination of on-target and off-target characteristics for RNA-guided DNA methylation by dCas9 methyltransferases

    PubMed Central

    Lin, Lin; Liu, Yong; Xu, Fengping; Huang, Jinrong; Daugaard, Tina Fuglsang; Petersen, Trine Skov; Hansen, Bettina; Ye, Lingfei; Zhou, Qing; Fang, Fang; Yang, Ling; Li, Shengting; Fløe, Lasse; Jensen, Kristopher Torp; Shrock, Ellen; Chen, Fang; Yang, Huanming; Wang, Jian; Liu, Xin; Xu, Xun; Bolund, Lars; Nielsen, Anders Lade; Luo, Yonglun

    2018-01-01

    Abstract Background Fusion of DNA methyltransferase domains to the nuclease-deficient clustered regularly interspaced short palindromic repeat (CRISPR) associated protein 9 (dCas9) has been used for epigenome editing, but the specificities of these dCas9 methyltransferases have not been fully investigated. Findings We generated CRISPR-guided DNA methyltransferases by fusing the catalytic domain of DNMT3A or DNMT3B to the C terminus of the dCas9 protein from Streptococcus pyogenes and validated its on-target and global off-target characteristics. Using targeted quantitative bisulfite pyrosequencing, we prove that dCas9-BFP-DNMT3A and dCas9-BFP-DNMT3B can efficiently methylate the CpG dinucleotides flanking its target sites at different genomic loci (uPA and TGFBR3) in human embryonic kidney cells (HEK293T). Furthermore, we conducted whole genome bisulfite sequencing (WGBS) to address the specificity of our dCas9 methyltransferases. WGBS revealed that although dCas9-BFP-DNMT3A and dCas9-BFP-DNMT3B did not cause global methylation changes, a substantial number (more than 1000) of the off-target differentially methylated regions (DMRs) were identified. The off-target DMRs, which were hypermethylated in cells expressing dCas9 methyltransferase and guide RNAs, were predominantly found in promoter regions, 5΄ untranslated regions, CpG islands, and DNase I hypersensitivity sites, whereas unexpected hypomethylated off-target DMRs were significantly enriched in repeated sequences. Through chromatin immunoprecipitation with massive parallel DNA sequencing analysis, we further revealed that these off-target DMRs were weakly correlated with dCas9 off-target binding sites. Using quantitative polymerase chain reaction, RNA sequencing, and fluorescence reporter cells, we also found that dCas9-BFP-DNMT3A and dCas9-BFP-DNMT3B can mediate transient inhibition of gene expression, which might be caused by dCas9-mediated de novo DNA methylation as well as interference with transcription. Conclusion Our results prove that dCas9 methyltransferases cause efficient RNA-guided methylation of specific endogenous CpGs. However, there is significant off-target methylation indicating that further improvements of the specificity of CRISPR-dCas9 based DNA methylation modifiers are required. PMID:29635374

  12. Genome-wide determination of on-target and off-target characteristics for RNA-guided DNA methylation by dCas9 methyltransferases.

    PubMed

    Lin, Lin; Liu, Yong; Xu, Fengping; Huang, Jinrong; Daugaard, Tina Fuglsang; Petersen, Trine Skov; Hansen, Bettina; Ye, Lingfei; Zhou, Qing; Fang, Fang; Yang, Ling; Li, Shengting; Fløe, Lasse; Jensen, Kristopher Torp; Shrock, Ellen; Chen, Fang; Yang, Huanming; Wang, Jian; Liu, Xin; Xu, Xun; Bolund, Lars; Nielsen, Anders Lade; Luo, Yonglun

    2018-03-01

    Fusion of DNA methyltransferase domains to the nuclease-deficient clustered regularly interspaced short palindromic repeat (CRISPR) associated protein 9 (dCas9) has been used for epigenome editing, but the specificities of these dCas9 methyltransferases have not been fully investigated. We generated CRISPR-guided DNA methyltransferases by fusing the catalytic domain of DNMT3A or DNMT3B to the C terminus of the dCas9 protein from Streptococcus pyogenes and validated its on-target and global off-target characteristics. Using targeted quantitative bisulfite pyrosequencing, we prove that dCas9-BFP-DNMT3A and dCas9-BFP-DNMT3B can efficiently methylate the CpG dinucleotides flanking its target sites at different genomic loci (uPA and TGFBR3) in human embryonic kidney cells (HEK293T). Furthermore, we conducted whole genome bisulfite sequencing (WGBS) to address the specificity of our dCas9 methyltransferases. WGBS revealed that although dCas9-BFP-DNMT3A and dCas9-BFP-DNMT3B did not cause global methylation changes, a substantial number (more than 1000) of the off-target differentially methylated regions (DMRs) were identified. The off-target DMRs, which were hypermethylated in cells expressing dCas9 methyltransferase and guide RNAs, were predominantly found in promoter regions, 5΄ untranslated regions, CpG islands, and DNase I hypersensitivity sites, whereas unexpected hypomethylated off-target DMRs were significantly enriched in repeated sequences. Through chromatin immunoprecipitation with massive parallel DNA sequencing analysis, we further revealed that these off-target DMRs were weakly correlated with dCas9 off-target binding sites. Using quantitative polymerase chain reaction, RNA sequencing, and fluorescence reporter cells, we also found that dCas9-BFP-DNMT3A and dCas9-BFP-DNMT3B can mediate transient inhibition of gene expression, which might be caused by dCas9-mediated de novo DNA methylation as well as interference with transcription. Our results prove that dCas9 methyltransferases cause efficient RNA-guided methylation of specific endogenous CpGs. However, there is significant off-target methylation indicating that further improvements of the specificity of CRISPR-dCas9 based DNA methylation modifiers are required.

  13. Possibility of Database Research as a Means of Pharmacovigilance in Japan Based on a Comparison with Sertraline Postmarketing Surveillance.

    PubMed

    Hirano, Yoko; Asami, Yuko; Kuribayashi, Kazuhiko; Kitazaki, Shigeru; Yamamoto, Yuji; Fujimoto, Yoko

    2018-05-01

    Many pharmacoepidemiologic studies using large-scale databases have recently been utilized to evaluate the safety and effectiveness of drugs in Western countries. In Japan, however, conventional methodology has been applied to postmarketing surveillance (PMS) to collect safety and effectiveness information on new drugs to meet regulatory requirements. Conventional PMS entails enormous costs and resources despite being an uncontrolled observational study method. This study is aimed at examining the possibility of database research as a more efficient pharmacovigilance approach by comparing a health care claims database and PMS with regard to the characteristics and safety profiles of sertraline-prescribed patients. The characteristics of sertraline-prescribed patients recorded in a large-scale Japanese health insurance claims database developed by MinaCare Co. Ltd. were scanned and compared with the PMS results. We also explored the possibility of detecting signals indicative of adverse reactions based on the claims database by using sequence symmetry analysis. Diabetes mellitus, hyperlipidemia, and hyperthyroidism served as exploratory events, and their detection criteria for the claims database were reported by the Pharmaceuticals and Medical Devices Agency in Japan. Most of the characteristics of sertraline-prescribed patients in the claims database did not differ markedly from those in the PMS. There was no tendency for higher risks of the exploratory events after exposure to sertraline, and this was consistent with sertraline's known safety profile. Our results support the concept of using database research as a cost-effective pharmacovigilance tool that is free of selection bias . Further investigation using database research is required to confirm our preliminary observations. Copyright © 2018. Published by Elsevier Inc.

  14. Graph pyramids for protein function prediction

    PubMed Central

    2015-01-01

    Background Uncovering the hidden organizational characteristics and regularities among biological sequences is the key issue for detailed understanding of an underlying biological phenomenon. Thus pattern recognition from nucleic acid sequences is an important affair for protein function prediction. As proteins from the same family exhibit similar characteristics, homology based approaches predict protein functions via protein classification. But conventional classification approaches mostly rely on the global features by considering only strong protein similarity matches. This leads to significant loss of prediction accuracy. Methods Here we construct the Protein-Protein Similarity (PPS) network, which captures the subtle properties of protein families. The proposed method considers the local as well as the global features, by examining the interactions among 'weakly interacting proteins' in the PPS network and by using hierarchical graph analysis via the graph pyramid. Different underlying properties of the protein families are uncovered by operating the proposed graph based features at various pyramid levels. Results Experimental results on benchmark data sets show that the proposed hierarchical voting algorithm using graph pyramid helps to improve computational efficiency as well the protein classification accuracy. Quantitatively, among 14,086 test sequences, on an average the proposed method misclassified only 21.1 sequences whereas baseline BLAST score based global feature matching method misclassified 362.9 sequences. With each correctly classified test sequence, the fast incremental learning ability of the proposed method further enhances the training model. Thus it has achieved more than 96% protein classification accuracy using only 20% per class training data. PMID:26044522

  15. Graph pyramids for protein function prediction.

    PubMed

    Sandhan, Tushar; Yoo, Youngjun; Choi, Jin; Kim, Sun

    2015-01-01

    Uncovering the hidden organizational characteristics and regularities among biological sequences is the key issue for detailed understanding of an underlying biological phenomenon. Thus pattern recognition from nucleic acid sequences is an important affair for protein function prediction. As proteins from the same family exhibit similar characteristics, homology based approaches predict protein functions via protein classification. But conventional classification approaches mostly rely on the global features by considering only strong protein similarity matches. This leads to significant loss of prediction accuracy. Here we construct the Protein-Protein Similarity (PPS) network, which captures the subtle properties of protein families. The proposed method considers the local as well as the global features, by examining the interactions among 'weakly interacting proteins' in the PPS network and by using hierarchical graph analysis via the graph pyramid. Different underlying properties of the protein families are uncovered by operating the proposed graph based features at various pyramid levels. Experimental results on benchmark data sets show that the proposed hierarchical voting algorithm using graph pyramid helps to improve computational efficiency as well the protein classification accuracy. Quantitatively, among 14,086 test sequences, on an average the proposed method misclassified only 21.1 sequences whereas baseline BLAST score based global feature matching method misclassified 362.9 sequences. With each correctly classified test sequence, the fast incremental learning ability of the proposed method further enhances the training model. Thus it has achieved more than 96% protein classification accuracy using only 20% per class training data.

  16. Genomic analyses of Clostridium perfringens isolates from five toxinotypes.

    PubMed

    Hassan, Karl A; Elbourne, Liam D H; Tetu, Sasha G; Melville, Stephen B; Rood, Julian I; Paulsen, Ian T

    2015-05-01

    Clostridium perfringens can be isolated from a range of environments, including soil, marine and fresh water sediments, and the gastrointestinal tracts of animals and humans. Some C. perfringens strains have attractive industrial applications, e.g., in the degradation of waste products or the production of useful chemicals. However, C. perfringens has been most studied as the causative agent of a range of enteric and soft tissue infections of varying severities in humans and animals. Host preference and disease type in C. perfringens are intimately linked to the production of key extracellular toxins and on this basis toxigenic C. perfringens strains have been classified into five toxinotypes (A-E). To date, twelve genome sequences have been generated for a diverse collection of C. perfringens isolates, including strains associated with human and animal infections, a human commensal strain, and a strain with potential industrial utility. Most of the sequenced strains are classified as toxinotype A. However, genome sequences of representative strains from each of the other four toxinotypes have also been determined. Analysis of this collection of sequences has highlighted a lack of features differentiating toxinotype A strains from the other isolates, indicating that the primary defining characteristic of toxinotype A strains is their lack of key plasmid-encoded extracellular toxin genes associated with toxinotype B to E strains. The representative B-E strains sequenced to date each harbour many unique genes. Additional genome sequences are needed to determine if these genes are characteristic of their respective toxinotypes. Copyright © 2014. Published by Elsevier Masson SAS.

  17. Tickling the retina: integration of subthreshold electrical pulses can activate retinal neurons

    NASA Astrophysics Data System (ADS)

    Sekhar, S.; Jalligampala, A.; Zrenner, E.; Rathbun, D. L.

    2016-08-01

    Objective. The field of retinal prosthetics has made major progress over the last decade, restoring visual percepts to people suffering from retinitis pigmentosa. The stimulation pulses used by present implants are suprathreshold, meaning individual pulses are designed to activate the retina. In this paper we explore subthreshold pulse sequences as an alternate stimulation paradigm. Subthreshold pulses have the potential to address important open problems such as fading of visual percepts when patients are stimulated at moderate pulse repetition rates and the difficulty in preferentially stimulating different retinal pathways. Approach. As a first step in addressing these issues we used Gaussian white noise electrical stimulation combined with spike-triggered averaging to interrogate whether a subthreshold sequence of pulses can be used to activate the mouse retina. Main results. We demonstrate that the retinal network can integrate multiple subthreshold electrical stimuli under an experimental paradigm immediately relevant to retinal prostheses. Furthermore, these characteristic stimulus sequences varied in their shape and integration window length across the population of retinal ganglion cells. Significance. Because the subthreshold sequences activate the retina at stimulation rates that would typically induce strong fading (25 Hz), such retinal ‘tickling’ has the potential to minimize the fading problem. Furthermore, the diversity found across the cell population in characteristic pulse sequences suggests that these sequences could be used to selectively address the different retinal pathways (e.g. ON versus OFF). Both of these outcomes may significantly improve visual perception in retinal implant patients.

  18. Tackle characteristics and injury in a cross section of rugby union football.

    PubMed

    McIntosh, Andrew S; Savage, Trevor N; McCrory, Paul; Fréchède, Bertrand O; Wolfe, Rory

    2010-05-01

    The tackle is the game event in rugby union most associated with injury. This study's main aims were to measure tackle characteristics from video using a qualitative protocol, to assess whether the characteristics differed by level of play, and to measure the associations between tackle characteristics and injury. A cohort study was undertaken. The cohort comprised male rugby players in the following levels: younger than 15 yr, 18 yr, and 20 yr, grade, and elite (Super 12 and Wallabies). All tackle events and technique characteristics were coded in 77 game halves using a standardized qualitative protocol. Game injuries and missed-game injuries were identified and correlated with tackle events. A total of 6618 tackle events, including 81 resulting in a game injury, were observed and coded in the 77 game halves fully analyzed (145 tackle events per hour). An increase in the proportion of active shoulder tackles was observed from younger than 15 yr (13%) to elite (31%). Younger players engaged in more passive tackles and tended to stay on their feet more than experienced players. Younger than 15 yr rugby players had a significantly lower risk of tackle game injury compared with elite players. No specific tackle technique was observed to be associated with a significantly increased risk of game injury. There was a greater risk of game injury associated with two or more tacklers involved in the tackle event, and the greatest risk was associated with simultaneous contact by tacklers, after adjusting for level of play. Tackle characteristics differed between levels of play. The number of tacklers and the sequence of tackler contact with the ball carrier require consideration from an injury prevention perspective.

  19. Evaluating imputation algorithms for low-depth genotyping-by-sequencing (GBS) data

    USDA-ARS?s Scientific Manuscript database

    Well-powered genomic studies require genome-wide marker coverage across many individuals. For non-model species with few genomic resources, high-throughput sequencing (HTS) methods, such as Genotyping-By-Sequencing (GBS), offer an inexpensive alternative to array-based genotyping. Although affordabl...

  20. Diff-seq: A high throughput sequencing-based mismatch detection assay for DNA variant enrichment and discovery

    PubMed Central

    Karas, Vlad O; Sinnott-Armstrong, Nicholas A; Varghese, Vici; Shafer, Robert W; Greenleaf, William J; Sherlock, Gavin

    2018-01-01

    Abstract Much of the within species genetic variation is in the form of single nucleotide polymorphisms (SNPs), typically detected by whole genome sequencing (WGS) or microarray-based technologies. However, WGS produces mostly uninformative reads that perfectly match the reference, while microarrays require genome-specific reagents. We have developed Diff-seq, a sequencing-based mismatch detection assay for SNP discovery without the requirement for specialized nucleic-acid reagents. Diff-seq leverages the Surveyor endonuclease to cleave mismatched DNA molecules that are generated after cross-annealing of a complex pool of DNA fragments. Sequencing libraries enriched for Surveyor-cleaved molecules result in increased coverage at the variant sites. Diff-seq detected all mismatches present in an initial test substrate, with specific enrichment dependent on the identity and context of the variation. Application to viral sequences resulted in increased observation of variant alleles in a biologically relevant context. Diff-Seq has the potential to increase the sensitivity and efficiency of high-throughput sequencing in the detection of variation. PMID:29361139

  1. Probe-Directed Degradation (PDD) for Flexible Removal of Unwanted cDNA Sequences from RNA-Seq Libraries.

    PubMed

    Archer, Stuart K; Shirokikh, Nikolay E; Preiss, Thomas

    2015-04-01

    Most applications for RNA-seq require the depletion of abundant transcripts to gain greater coverage of the underlying transcriptome. The sequences to be targeted for depletion depend on application and species and in many cases may not be supported by commercial depletion kits. This unit describes a method for generating RNA-seq libraries that incorporates probe-directed degradation (PDD), which can deplete any unwanted sequence set, with the low-bias split-adapter method of library generation (although many other library generation methods are in principle compatible). The overall strategy is suitable for applications requiring customized sequence depletion or where faithful representation of fragment ends and lack of sequence bias is paramount. We provide guidelines to rapidly design specific probes against the target sequence, and a detailed protocol for library generation using the split-adapter method including several strategies for streamlining the technique and reducing adapter dimer content. Copyright © 2015 John Wiley & Sons, Inc.

  2. Discrete sequence prediction and its applications

    NASA Technical Reports Server (NTRS)

    Laird, Philip

    1992-01-01

    Learning from experience to predict sequences of discrete symbols is a fundamental problem in machine learning with many applications. We apply sequence prediction using a simple and practical sequence-prediction algorithm, called TDAG. The TDAG algorithm is first tested by comparing its performance with some common data compression algorithms. Then it is adapted to the detailed requirements of dynamic program optimization, with excellent results.

  3. Effort in Multitasking: Local and Global Assessment of Effort.

    PubMed

    Kiesel, Andrea; Dignath, David

    2017-01-01

    When performing multiple tasks in succession, self-organization of task order might be superior compared to external-controlled task schedules, because self-organization allows optimizing processing modes and thus reduces switch costs, and it increases commitment to task goals. However, self-organization is an additional executive control process that is not required if task order is externally specified and as such it is considered as time-consuming and effortful. To compare self-organized and externally controlled task scheduling, we suggest assessing global subjective and objectives measures of effort in addition to local performance measures. In our new experimental approach, we combined characteristics of dual tasking settings and task switching settings and compared local and global measures of effort in a condition with free choice of task sequence and a condition with cued task sequence. In a multi-tasking environment, participants chose the task order while the task requirement of the not-yet-performed task remained the same. This task preview allowed participants to work on the previously non-chosen items in parallel and resulted in faster responses and fewer errors in task switch trials than in task repetition trials. The free-choice group profited more from this task preview than the cued group when considering local performance measures. Nevertheless, the free-choice group invested more effort than the cued group when considering global measures. Thus, self-organization in task scheduling seems to be effortful even in conditions in which it is beneficiary for task processing. In a second experiment, we reduced the possibility of task preview for the not-yet-performed tasks in order to hinder efficient self-organization. Here neither local nor global measures revealed substantial differences between the free-choice and a cued task sequence condition. Based on the results of both experiments, we suggest that global assessment of effort in addition to local performance measures might be a useful tool for multitasking research.

  4. A Comparison of the Effects of Temporary Hippocampal Lesions on Single and Dual Context Versions of the Olfactory Sequence Memory Task

    PubMed Central

    Sill, Orriana C.; Smith, David M.

    2012-01-01

    In recent years, many animal models of memory have focused on one or more of the various components of episodic memory. For example, the odor sequence memory task requires subjects to remember individual items and events (the odors) and the temporal aspects of the experience (the sequence of odor presentation). The well-known spatial context coding function of the hippocampus, as exemplified by place cell firing, may reflect the ‘where’ component of episodic memory. In the present study, we added a contextual component to the odor sequence memory task by training rats to choose the earlier odor in one context and the later odor in another context and we compared the effects of temporary hippocampal lesions on performance of the original single context task and the new dual context task. Temporary lesions significantly impaired the single context task, although performance remained significantly above chance levels. In contrast, performance dropped all the way to chance when temporary lesions were used in the dual context task. These results demonstrate that rats can learn a dual context version of the odor sequence learning task which requires the use of contextual information along with the requirement to remember the ‘what’ and ‘when’ components of the odor sequence. Moreover, the additional requirement of context-dependent expression of the ‘what-when’ memory made the task fully dependent on the hippocampus. Moreover, the addition of the contextual component made the task fully dependent on the hippocampus. PMID:22687149

  5. Approximate matching of regular expressions.

    PubMed

    Myers, E W; Miller, W

    1989-01-01

    Given a sequence A and regular expression R, the approximate regular expression matching problem is to find a sequence matching R whose optimal alignment with A is the highest scoring of all such sequences. This paper develops an algorithm to solve the problem in time O(MN), where M and N are the lengths of A and R. Thus, the time requirement is asymptotically no worse than for the simpler problem of aligning two fixed sequences. Our method is superior to an earlier algorithm by Wagner and Seiferas in several ways. First, it treats real-valued costs, in addition to integer costs, with no loss of asymptotic efficiency. Second, it requires only O(N) space to deliver just the score of the best alignment. Finally, its structure permits implementation techniques that make it extremely fast in practice. We extend the method to accommodate gap penalties, as required for typical applications in molecular biology, and further refine it to search for sub-strings of A that strongly align with a sequence in R, as required for typical data base searches. We also show how to deliver an optimal alignment between A and R in only O(N + log M) space using O(MN log M) time. Finally, an O(MN(M + N) + N2log N) time algorithm is presented for alignment scoring schemes where the cost of a gap is an arbitrary increasing function of its length.

  6. Satellite DNA in Plants: More than Just Rubbish.

    PubMed

    Garrido-Ramos, Manuel A

    2015-01-01

    For decades, satellite DNAs have been the hidden part of genomes. Initially considered as junk DNA, there is currently an increasing appreciation of the functional significance of satellite DNA repeats and of their sequences. Satellite DNA families accumulate in the heterochromatin in different parts of the eukaryotic chromosomes, mainly in pericentromeric and subtelomeric regions, but they also span the functional centromere. Tandem repeat sequences may spread from subtelomeric to interstitial loci, leading to the formation of chromosome-specific loci or to the accumulation in equilocal sites in different chromosomes. They also appear as the main components of the heterochromatin in the sex-specific region of sex chromosomes. Satellite DNA, required for chromosome organization, also plays a role in pairing and segregation. Some satellite repeats are transcribed and can participate in the formation and maintenance of heterochromatin structure and in the modulation of gene expression. In addition to the identification of the different satellite DNA families, their characteristics and location, we are interested in determining their impact on the genomes, by identifying the mechanisms leading to their appearance and amplification as well as in understanding how they change over time, the factors affecting these changes, and the influence exerted by the evolutionary history of the organisms. On the other hand, satellite DNA sequences are rapidly evolving sequences that may cause reproductive barriers between organisms and promote speciation. The accumulation of experimental data collected in recent years and the emergence of new approaches based on next-generation sequencing and high-throughput genome analysis are opening new perspectives that are changing our understanding of satellite DNA. This review examines recent data to provide a timely update on the overall information gathered about this part of the genome, focusing on the advances in the knowledge of its origin, its evolution, and its potential functional roles. © 2015 S. Karger AG, Basel.

  7. Different phylogenomic approaches to resolve the evolutionary relationships among model fish species.

    PubMed

    Negrisolo, Enrico; Kuhl, Heiner; Forcato, Claudio; Vitulo, Nicola; Reinhardt, Richard; Patarnello, Tomaso; Bargelloni, Luca

    2010-12-01

    Comparative genomics holds the promise to magnify the information obtained from individual genome sequencing projects, revealing common features conserved across genomes and identifying lineage-specific characteristics. To implement such a comparative approach, a robust phylogenetic framework is required to accurately reconstruct evolution at the genome level. Among vertebrate taxa, teleosts represent the second best characterized group, with high-quality draft genome sequences for five model species (Danio rerio, Gasterosteus aculeatus, Oryzias latipes, Takifugu rubripes, and Tetraodon nigroviridis), and several others are in the finishing lane. However, the relationships among the acanthomorph teleost model fishes remain an unresolved taxonomic issue. Here, a genomic region spanning over 1.2 million base pairs was sequenced in the teleost fish Dicentrarchus labrax. Together with genomic data available for the above fish models, the new sequence was used to identify unique orthologous genomic regions shared across all target taxa. Different strategies were applied to produce robust multiple gene and genomic alignments spanning from 11,802 to 186,474 amino acid/nucleotide positions. Ten data sets were analyzed according to Bayesian inference, maximum likelihood, maximum parsimony, and neighbor joining methods. Extensive analyses were performed to explore the influence of several factors (e.g., alignment methodology, substitution model, data set partitions, and long-branch attraction) on the tree topology. Although a general consensus was observed for a closer relationship between G. aculeatus (Gasterosteidae) and Di. labrax (Moronidae) with the atherinomorph O. latipes (Beloniformes) sister taxon of this clade, with the tetraodontiform group Ta. rubripes and Te. nigroviridis (Tetraodontiformes) representing a more distantly related taxon among acanthomorph model fish species, conflicting results were obtained between data sets and methods, especially with respect to the choice of alignment methodology applied to noncoding parts of the genomic region under study. This may limit the use of intergenic/noncoding sequences in phylogenomics until more robust alignment algorithms are developed.

  8. Comparison of allelic discrimination by dHPLC, HRM, and TaqMan in the detection of BRAF mutation V600E.

    PubMed

    Carbonell, Pablo; Turpin, María C; Torres-Moreno, Daniel; Molina-Martínez, Irene; García-Solano, José; Perez-Guillermo, Miguel; Conesa-Zamora, Pablo

    2011-09-01

    The V600E mutation in the BRAF oncogene is associated with colorectal carcinomas, with mismatch-repair deficiency and, recently, with nonresponse to epidermal growth factor receptor inhibitor therapy. The use of reliable techniques for its detection is important. The aim of our study was to compare the performance characteristics in V600E detection of denaturing high-performance liquid chromatography (dHPLC) and high-resolution melting (HRM) with TaqMan allelic discrimination as well as direct-sequencing methods in a series of 195 colorectal paraffin-embedded specimens up to the age of 15 years. The effectiveness for obtaining results on mutation status was best using TaqMan (96.9%), followed by dHPLC (93.3%), HRM (88.7%), and sequencing (88.2%). In general, TaqMan was best for analyzing older tissues, whereas sequencing was the least efficient. Heterozygotic V600E was detected in 11.6%, 9.9%, 11.6%, and 9.9% of tissues using TaqMan, dHPLC, HRM, and sequencing, respectively. Result concordances between dHPLC and TaqMan or sequencing were excellent (κ = 0.9411 and κ = 0.8988, respectively); for HRM, the concordances were good (κ = 0.7973 and κ = 0.7488, respectively). By using DNA dilutions from tumor tissue, a minimum of 10% of V600E harboring cancer content was required for the analysis by dHPLC and HRM. dHPLC could detect four non-V600E mutations, whereas HRM detected one. Our results indicate that dHPLC and HRM are techniques that can be reliably used for the detection of the BRAFV600E mutation in archival paraffin-embedded tissues. Copyright © 2011 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.

  9. High-resolution definition of the Vibrio cholerae essential gene set with hidden Markov model–based analyses of transposon-insertion sequencing data

    PubMed Central

    Chao, Michael C.; Pritchard, Justin R.; Zhang, Yanjia J.; Rubin, Eric J.; Livny, Jonathan; Davis, Brigid M.; Waldor, Matthew K.

    2013-01-01

    The coupling of high-density transposon mutagenesis to high-throughput DNA sequencing (transposon-insertion sequencing) enables simultaneous and genome-wide assessment of the contributions of individual loci to bacterial growth and survival. We have refined analysis of transposon-insertion sequencing data by normalizing for the effect of DNA replication on sequencing output and using a hidden Markov model (HMM)-based filter to exploit heretofore unappreciated information inherent in all transposon-insertion sequencing data sets. The HMM can smooth variations in read abundance and thereby reduce the effects of read noise, as well as permit fine scale mapping that is independent of genomic annotation and enable classification of loci into several functional categories (e.g. essential, domain essential or ‘sick’). We generated a high-resolution map of genomic loci (encompassing both intra- and intergenic sequences) that are required or beneficial for in vitro growth of the cholera pathogen, Vibrio cholerae. This work uncovered new metabolic and physiologic requirements for V. cholerae survival, and by combining transposon-insertion sequencing and transcriptomic data sets, we also identified several novel noncoding RNA species that contribute to V. cholerae growth. Our findings suggest that HMM-based approaches will enhance extraction of biological meaning from transposon-insertion sequencing genomic data. PMID:23901011

  10. A Novel Cylindrical Representation for Characterizing Intrinsic Properties of Protein Sequences.

    PubMed

    Yu, Jia-Feng; Dou, Xiang-Hua; Wang, Hong-Bo; Sun, Xiao; Zhao, Hui-Ying; Wang, Ji-Hua

    2015-06-22

    The composition and sequence order of amino acid residues are the two most important characteristics to describe a protein sequence. Graphical representations facilitate visualization of biological sequences and produce biologically useful numerical descriptors. In this paper, we propose a novel cylindrical representation by placing the 20 amino acid residue types in a circle and sequence positions along the z axis. This representation allows visualization of the composition and sequence order of amino acids at the same time. Ten numerical descriptors and one weighted numerical descriptor have been developed to quantitatively describe intrinsic properties of protein sequences on the basis of the cylindrical model. Their applications to similarity/dissimilarity analysis of nine ND5 proteins indicated that these numerical descriptors are more effective than several classical numerical matrices. Thus, the cylindrical representation obtained here provides a new useful tool for visualizing and charactering protein sequences. An online server is available at http://biophy.dzu.edu.cn:8080/CNumD/input.jsp .

  11. Analysis on the use of Multi-Sequence MRI Series for Segmentation of Abdominal Organs

    NASA Astrophysics Data System (ADS)

    Selver, M. A.; Selvi, E.; Kavur, E.; Dicle, O.

    2015-01-01

    Segmentation of abdominal organs from MRI data sets is a challenging task due to various limitations and artefacts. During the routine clinical practice, radiologists use multiple MR sequences in order to analyze different anatomical properties. These sequences have different characteristics in terms of acquisition parameters (such as contrast mechanisms and pulse sequence designs) and image properties (such as pixel spacing, slice thicknesses and dynamic range). For a complete understanding of the data, computational techniques should combine the information coming from these various MRI sequences. These sequences are not acquired in parallel but in a sequential manner (one after another). Therefore, patient movements and respiratory motions change the position and shape of the abdominal organs. In this study, the amount of these effects is measured using three different symmetric surface distance metrics performed to three dimensional data acquired from various MRI sequences. The results are compared to intra and inter observer differences and discussions on using multiple MRI sequences for segmentation and the necessities for registration are presented.

  12. Precise assignment of the heavy-strand promoter of mouse mitochondrial DNA: cognate start sites are not required for transcriptional initiation.

    PubMed Central

    Chang, D D; Clayton, D A

    1986-01-01

    Transcription of the heavy strand of mouse mitochondrial DNA starts from two closely spaced, distinct sites located in the displacement loop region of the genome. We report here an analysis of regulatory sequences required for faithful transcription from these two sites. Data obtained from in vitro assays demonstrated that a 51-base-pair region, encompassing nucleotides -40 to +11 of the downstream start site, contains sufficient information for accurate transcription from both start sites. Deletion of the 3' flanking sequences, including one or both start sites to -17, resulted in the initiation of transcription by the mitochondrial RNA polymerase from alternative sites within vector DNA sequences. This feature places the mouse heavy-strand promoter uniquely among other known mitochondrial promoters, all of which absolutely require cognate start sites for transcription. Comparison of the heavy-strand promoter with those of other vertebrate mitochondrial DNAs revealed a remarkably high rate of sequence divergence among species. Images PMID:3785226

  13. MIPS bacterial genomes functional annotation benchmark dataset.

    PubMed

    Tetko, Igor V; Brauner, Barbara; Dunger-Kaltenbach, Irmtraud; Frishman, Goar; Montrone, Corinna; Fobo, Gisela; Ruepp, Andreas; Antonov, Alexey V; Surmeli, Dimitrij; Mewes, Hans-Wernen

    2005-05-15

    Any development of new methods for automatic functional annotation of proteins according to their sequences requires high-quality data (as benchmark) as well as tedious preparatory work to generate sequence parameters required as input data for the machine learning methods. Different program settings and incompatible protocols make a comparison of the analyzed methods difficult. The MIPS Bacterial Functional Annotation Benchmark dataset (MIPS-BFAB) is a new, high-quality resource comprising four bacterial genomes manually annotated according to the MIPS functional catalogue (FunCat). These resources include precalculated sequence parameters, such as sequence similarity scores, InterPro domain composition and other parameters that could be used to develop and benchmark methods for functional annotation of bacterial protein sequences. These data are provided in XML format and can be used by scientists who are not necessarily experts in genome annotation. BFAB is available at http://mips.gsf.de/proj/bfab

  14. Integration of surface-active, periodically sequenced peptides into lipid-based microbubbles.

    PubMed

    Badami, Joseph V; Desir, Pierre; Tu, Raymond S

    2014-07-29

    The development of microbubbles toward functional, "theranostic" particles requires the incorporation of constituents with high binding specificity and therapeutic efficacy. Integrating peptides or proteins into the shell of lipid-based microbubbles can provide a means to access both receptor-ligand interactions and therapeutic properties. Simultaneously, peptides or proteins can define the characteristic monolayer mechanics of lipid bubbles and eliminate the need for post-bubble generation modification. The ability to engineer peptide sequences de novo that effectively partition into the bubble monolayer remains parametrically daunting. This work contributes to this effort using two simple amphipathic helical peptides that examine the role of local electrostatics and secondary structure. The two periodically sequenced peptides both have three positive charges, but peptide "K-2.5" spaces those charges 2.5 amino acids apart, while peptide "K-6.0" spaces the charges six amino acids apart. Size populations were determined for bubbles containing each peptide species using light scattering, and a quantitative method was developed to clearly define the fraction of peptides binding onto the microbubble monolayer. The impact of both the initial peptide concentration and the zwitterionic:anionic lipid ratio on peptide binding was also evaluated. Our results indicate that the lipid ratio affected only K-6.0 binding, which appears to be an outcome of the greater ensemble average α-helical population of the K-6.0. These findings provide further insights into the role of charge separation on peptide secondary structure, establishing a simple design metric for peptide binding onto microbubble systems.

  15. Evidence for thermal convection in the deep carbonate aquifer of the eastern sector of the Po Plain, Italy

    NASA Astrophysics Data System (ADS)

    Pasquale, V.; Chiozzi, P.; Verdoya, M.

    2013-05-01

    Temperatures recorded in wells as deep as 6 km drilled for hydrocarbon prospecting were used together with geological information to depict the thermal regime of the sedimentary sequence of the eastern sector of the Po Plain. After correction for drilling disturbance, temperature data were analyzed through an inversion technique based on a laterally constant thermal gradient model. The obtained thermal gradient is quite low within the deep carbonate unit (14 mK m- 1), while it is larger (53 mK m- 1) in the overlying impermeable formations. In the uppermost sedimentary layers, the thermal gradient is close to the regional average (21 mK m- 1). We argue that such a vertical change cannot be ascribed to thermal conductivity variation within the sedimentary sequence, but to deep groundwater flow. Since the hydrogeological characteristics (including litho-stratigraphic sequence and structural setting) hardly permit forced convection, we suggest that thermal convection might occur within the deep carbonate aquifer. The potential of this mechanism was evaluated by means of the Rayleigh number analysis. It turned out that permeability required for convection to occur must be larger than 3 10- 15 m2. The average over-heat ratio is 0.45. The lateral variation of hydrothermal regime was tested by using temperature data representing the aquifer thermal conditions. We found that thermal convection might be more developed and variable at the Ferrara High and its surroundings, where widespread fracturing may have increased permeability.

  16. Znrg, a novel gene expressed mainly in the developing notochord of zebrafish.

    PubMed

    Zhou, Yaping; Xu, Yan; Li, Jianzhen; Liu, Yao; Zhang, Zhe; Deng, Fengjiao

    2010-06-01

    The notochord, a defining characteristic of the chordate embryo is a critical midline structure required for axial skeletal formation in vertebrates, and acts as a signaling center throughout embryonic development. We utilized the digital differential display program of the National Center for Biotechnology Information, and identified a contig of expressed sequence tags (no. Dr. 83747) from the zebrafish ovary library in Genbank. Full-length cDNA of the identified gene was cloned by 5'- and 3'- RACE, and the resulting sequence was confirmed by polymerase chain reaction and sequencing. The cDNA clone contains 2,505 base pairs and encodes a novel protein of 707 amino acids that shares no significant homology with any known proteins. This gene was expressed in mature oocytes and at the one-cell stage, and persisted until the 5th day of development, as determined by RT-PCR. Transcripts were detected by whole-mount RNA in situ hybridization from the two-cell stage to 72 h of embryonic development. This gene was uniformly distributed from the cleavage stage up to the blastula stage. During early gastrulation, it was present in the dorsal region, and became restricted to the notochord and pectoral fin at 48 and 72 h of embryonic development. Based on its abundance in the notochord, we hypothesized that the novel gene may play an important role in notochord development in zebrafish; we named this gene, zebrafish notochord-related gene, or znrg.

  17. Soil development on a Pleistocene terrace sequence, Boise Valley, Idaho

    USGS Publications Warehouse

    Othberg, K.L.; McDaniel, P.A.; Fosberg, M.A.

    1997-01-01

    Study of a sequence of terraces in the western Snake River Plain of Idaho reveals a record of at least seven terraces, the ages of which span the Pleistocene. In the Boise Valley, the youngest terraces are less than -14,500 yr and the oldest terraces are -1.7 Ma. Within this sequence, several relationships exist between soil morphology and terrace chronology. On terraces older than -14,500 yr, argillic horizon development generally increases with terrace age with maximum development occurring in soils of the oldest terraces. CaCO3- and SiO2-cemented duripans are found in soils on terraces that are late middle Pleistocene and older. By virtue of their physical and chemical properties, duripans are very resistant to erosion, and therefore provide stable records of CaCO3 and SiO2 accumulation throughout multiple cycles of loess deposition onto the terrace treads, pedogenesis, and partial erosion. Mean duripan thickness increases with age to a maximum of 0.66 m on the oldest terraces. Our results suggest that a geomorphic surface age of approximately 130,000 yr is required to form the initial plugged horizon that is characteristic of a duripan. CaCO3 and SiO2 accumulation is most rapid in duripans occupying geomorphic surfaces with ages ranging from 130,000 to 300,000 yr. After this, apparent accumulation rates decrease and little additional accumulation of these cementing agents occurs with time.

  18. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rahayu, Suparni Setyowati, E-mail: suparnirahayu@yahoo.co.id; Department of Mechanical Engineering, State Polytechnic of Semarang, Semarang Indonesia; Purwanto,, E-mail: p.purwanto@che.undip.ac.id

    The small industry of tofu production process releases the waste water without being processed first, and the wastewater is directly discharged into water. In this study, Anaerobic Sequencing Batch Reactor in Pilot Scale for Treatment of Tofu Industry was developed through an anaerobic process to produce biogas as one kind of environmentally friendly renewable energy which can be developed into the countryside. The purpose of this study was to examine the fundamental characteristics of organic matter elimination of industrial wastewater with small tofu effective method and utilize anaerobic active sludge with Anaerobic Sequencing Bath Reactor (ASBR) to get rural biogasmore » as an energy source. The first factor is the amount of the active sludge concentration which functions as the decomposers of organic matter and controlling selectivity allowance to degrade organic matter. The second factor is that HRT is the average period required substrate to react with the bacteria in the Anaerobic Sequencing Bath Reactor (ASBR).The results of processing the waste of tofu production industry using ASBR reactor with active sludge additions as starter generates cumulative volume of 5814.4 mL at HRT 5 days so that in this study it is obtained the conversion 0.16 L of CH{sub 4}/g COD and produce biogas containing of CH{sub 4}: 81.23% and CO{sub 2}: 16.12%. The wastewater treatment of tofu production using ASBR reactor is able to produce renewable energy that has economic value as well as environmentally friendly by nature.« less

  19. 14 CFR 133.41 - Flight characteristics requirements.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... 14 Aeronautics and Space 3 2013-01-01 2013-01-01 false Flight characteristics requirements. 133.41... EXTERNAL-LOAD OPERATIONS Airworthiness Requirements § 133.41 Flight characteristics requirements. (a) The applicant must demonstrate to the Administrator, by performing the operational flight checks prescribed in...

  20. 14 CFR 133.41 - Flight characteristics requirements.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... 14 Aeronautics and Space 3 2012-01-01 2012-01-01 false Flight characteristics requirements. 133.41... EXTERNAL-LOAD OPERATIONS Airworthiness Requirements § 133.41 Flight characteristics requirements. (a) The applicant must demonstrate to the Administrator, by performing the operational flight checks prescribed in...

  1. 14 CFR 133.41 - Flight characteristics requirements.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... 14 Aeronautics and Space 3 2010-01-01 2010-01-01 false Flight characteristics requirements. 133.41... EXTERNAL-LOAD OPERATIONS Airworthiness Requirements § 133.41 Flight characteristics requirements. (a) The applicant must demonstrate to the Administrator, by performing the operational flight checks prescribed in...

  2. Optical Processing Techniques For Pseudorandom Sequence Prediction

    NASA Astrophysics Data System (ADS)

    Gustafson, Steven C.

    1983-11-01

    Pseudorandom sequences are series of apparently random numbers generated, for example, by linear or nonlinear feedback shift registers. An important application of these sequences is in spread spectrum communication systems, in which, for example, the transmitted carrier phase is digitally modulated rapidly and pseudorandomly and in which the information to be transmitted is incorporated as a slow modulation in the pseudorandom sequence. In this case the transmitted information can be extracted only by a receiver that uses for demodulation the same pseudorandom sequence used by the transmitter, and thus this type of communication system has a very high immunity to third-party interference. However, if a third party can predict in real time the probable future course of the transmitted pseudorandom sequence given past samples of this sequence, then interference immunity can be significantly reduced.. In this application effective pseudorandom sequence prediction techniques should be (1) applicable in real time to rapid (e.g., megahertz) sequence generation rates, (2) applicable to both linear and nonlinear pseudorandom sequence generation processes, and (3) applicable to error-prone past sequence samples of limited number and continuity. Certain optical processing techniques that may meet these requirements are discussed in this paper. In particular, techniques based on incoherent optical processors that perform general linear transforms or (more specifically) matrix-vector multiplications are considered. Computer simulation examples are presented which indicate that significant prediction accuracy can be obtained using these transforms for simple pseudorandom sequences. However, the useful prediction of more complex pseudorandom sequences will probably require the application of more sophisticated optical processing techniques.

  3. Simultaneous master-slave Omega pairs. [navigation system featuring low cost receiver

    NASA Technical Reports Server (NTRS)

    Burhans, R. W.

    1974-01-01

    Master-slave sequence ordering of the Omega system is suggested as a method of improving the pair geometry for low-cost receiver user benefit. The sequence change will not affect present sophisticated processor users other than require new labels for some pair combinations, but may require worldwide transmitter operators to slightly alter their long-range synchronizing techniques.

  4. Sound Sequence Discrimination Learning Motivated by Reward Requires Dopaminergic D2 Receptor Activation in the Rat Auditory Cortex

    ERIC Educational Resources Information Center

    Kudoh, Masaharu; Shibuki, Katsuei

    2006-01-01

    We have previously reported that sound sequence discrimination learning requires cholinergic inputs to the auditory cortex (AC) in rats. In that study, reward was used for motivating discrimination behavior in rats. Therefore, dopaminergic inputs mediating reward signals may have an important role in the learning. We tested the possibility in the…

  5. A definition of the domains Archaea, Bacteria and Eucarya in terms of small subunit ribosomal RNA characteristics

    NASA Technical Reports Server (NTRS)

    Winker, S.; Woese, C. R.

    1991-01-01

    The number of small subunit rRNA sequences is now great enough that the three domains Archaea, Bacteria and Eucarya (Woese et al., 1990) can be reliably defined in terms of their sequence "signatures". Approximately 50 homologous positions (or nucleotide pairs) in the small subunit rRNA characterize and distinguish among the three. In addition, the three can be recognized by a variety of nonhomologous rRNA characters, either individual positions and/or higher-order structural features. The Crenarchaeota and the Euryarchaeota, the two archaeal kingdoms, can also be defined and distinguished by their characteristic compositions at approximately fifteen positions in the small subunit rRNA molecule.

  6. The dependence of the tunneling characteristic on the electronic energy bands and the carrier’s states of Graphene superlattice

    NASA Astrophysics Data System (ADS)

    Yang, C. H.; Shen, G. Z.; Ao, Z. M.; Xu, Y. W.

    2016-09-01

    Using the transfer matrix method, the carrier tunneling properties in graphene superlattice generated by the Thue-Morse sequence and Kolakoski sequence are investigated. The positions and strength of the transmission can be modulated by the barrier structures, the incident energy and angle, the height and width of the potential. These carriers tunneling characteristic can be understood from the energy band structures in the corresponding superlattice systems and the carrier’s states in well/barriers. The transmission peaks above the critical incident angle rely on the carrier’s resonance in the well regions. The structural diversity can modulate the electronic and transport properties, thus expanding its applications.

  7. Characterization and Pathogenicity of Alternaria vanuatuensis, a New Record from Allium Plants in Korea and China

    PubMed Central

    Li, Mei Jia; Deng, Jian Xin; Paul, Narayan Chandra

    2014-01-01

    Alternaria from different Allium plants was characterized by multilocus sequence analysis. Based on sequences of the β-tubulin (BT2b), the Alternaria allergen a1 (Alt a1), and the RNA polymerase II second largest subunit (RPB2) genes and phylogenetic data analysis, isolates were divided into two groups. The two groups were identical to representative isolates of A. porri (EGS48-147) and A. vanuatuensis (EGS45-018). The conidial characteristics and pathogenicity of A. vanuatuensis also well supported the molecular characteristics. This is the first record of A. vanuatuensis E. G. Simmons & C. F. Hill from Korea and China. PMID:25606017

  8. Insights into the phylogeny of Northern Hemisphere Armillaria: Neighbor-net and Bayesian analyses of translation elongation factor 1-α gene sequences

    Treesearch

    Ned B. Klopfenstein; Jane E. Stewart; Yuko Ota; John W. Hanna; Bryce A. Richardson; Amy L. Ross-Davis; Ruben D. Elias-Roman; Kari Korhonen; Nenad Keca; Eugenia Iturritxa; Dionicio Alvarado-Rosales; Halvor Solheim; Nicholas J. Brazee; Piotr Lakomy; Michelle R. Cleary; Eri Hasegawa; Taisei Kikuchi; Fortunato Garza-Ocanas; Panaghiotis Tsopelas; Daniel Rigling; Simone Prospero; Tetyana Tsykun; Jean A. Berube; Franck O. P. Stefani; Saeideh Jafarpour; Vladimir Antonin; Michal Tomsovsky; Geral I. McDonald; Stephen Woodward; Mee-Sook Kim

    2017-01-01

    Armillaria possesses several intriguing characteristics that have inspired wide interest in understanding phylogenetic relationships within and among species of this genus. Nuclear ribosomal DNA sequence–based analyses of Armillaria provide only limited information for phylogenetic studies among widely divergent taxa. More recent studies have shown that translation...

  9. Oxygenation of the Root Zone and TCE Remediation: A Plant Model of Rhizosphere Dynamics

    DTIC Science & Technology

    2008-03-01

    Behavior Test .......................................................................................... 128 IV. Results and Analysis ...Circadian Rhythms and Diurnal Cycles. Just as humans have a rhythmic response to the environment, plants also have a periodic cycle governed by light...characteristics, fatty acid carbon lengths, G + C values, and 16S rRNA sequences. 16S RNA sequence analysis has identified eight genera of methanotrophs

  10. Complete Genome Sequence of the Dairy Isolate Lactobacillus acidipiscis ACA-DC 1533.

    PubMed

    Kazou, Maria; Alexandraki, Voula; Pot, Bruno; Tsakalidou, Effie; Papadimitriou, Konstantinos

    2017-01-26

    Lactobacillus acidipiscis is a Gram-positive lactic acid bacterium belonging to the Lactobacillus salivarius clade. Here, we present the first complete genome sequence of L. acidipiscis isolated from traditional Greek Kopanisti cheese. Strain ACA-DC 1533 may play a key role in the strong organoleptic characteristics of Kopanisti cheese. Copyright © 2017 Kazou et al.

  11. Genome Sequence of the Thermophile Bacillus coagulans Hammer, the Type Strain of the Species

    PubMed Central

    Su, Fei; Tao, Fei; Tang, Hongzhi

    2012-01-01

    Here we announce a 3.0-Mb assembly of the Bacillus coagulans Hammer strain, which is the type strain of the species within the genus Bacillus. Genomic analyses based on the sequence may provide insights into the phylogeny of the species and help to elucidate characteristics of the poorly studied strains of Bacillus coagulans. PMID:23105047

  12. Genome sequence of the thermophile Bacillus coagulans Hammer, the type strain of the species.

    PubMed

    Su, Fei; Tao, Fei; Tang, Hongzhi; Xu, Ping

    2012-11-01

    Here we announce a 3.0-Mb assembly of the Bacillus coagulans Hammer strain, which is the type strain of the species within the genus Bacillus. Genomic analyses based on the sequence may provide insights into the phylogeny of the species and help to elucidate characteristics of the poorly studied strains of Bacillus coagulans.

  13. Examining Potential Predictors for Completion of the Gardasil Vaccine Sequence Based on Data Gathered at Clinics of Johns Hopkins Medical Institutions

    ERIC Educational Resources Information Center

    Barat, Christopher E.; Wright, Courtney; Chou, Betty

    2011-01-01

    This paper presents categorical data that were gathered at two urban clinics and two suburban clinics of Johns Hopkins in an effort to identify characteristics of young female patients who successfully complete the three-injection sequence of the Gardasil quadrivalent human papillomavirus vaccine (HPV4). Available categorical correlates included…

  14. Rehearsal dynamics in elementary school children.

    PubMed

    Lehmann, Martin; Hasselhorn, Marcus

    2012-03-01

    Several studies on free recall suggest that processes responsible for recall are analogous to processes responsible for rehearsal. In children, the relationship between cumulative rehearsal and recall performance has been proven to be critical; however, the locus of the effect of rehearsal is not yet fully understood. To unfold the mechanisms that come into play in an overt rehearsal free recall task, we assessed rehearsal and recall sequences in children between 8 and 10 years of age. These sequences give information about the context in which items are repeated and rearranged throughout the list and subsequently recalled. Rehearsal sequences consisted mainly of items from neighboring list positions in their original temporal order. The same characteristics were true for recall sequences. Qualitatively, order effects during study and recall did not differ over age groups. However, in older children who were using cumulative rehearsal more intensively, successive rehearsal and recall of items in their original order was more pronounced. Therefore, we suggest that a main feature of item rehearsal with regard to facilitating recall is the strengthening of interitem associations based on the temporal order within a list and that this characteristic develops with age. Copyright © 2011 Elsevier Inc. All rights reserved.

  15. Sarcocystis spp. in domestic sheep in Kunming City, China: prevalence, morphology, and molecular characteristics.

    PubMed

    Hu, Jun-Jie; Huang, Si; Wen, Tao; Esch, Gerald W; Liang, Yu; Li, Hong-Liang

    2017-01-01

    Sheep (Ovis aries) are intermediate hosts for at least six named species of Sarcocystis: S. tenella, S. arieticanis, S. gigantea, S. medusiformis, S. mihoensis, and S. microps. Here, only two species, S. tenella and S. arieticanis, were found in 79 of 86 sheep (91.9%) in Kunming, China, based on their morphological characteristics. Four genetic markers, i.e., 18S rRNA gene, 28S rRNA gene, mitochondrial cox1 gene, and ITS-1 region, were sequenced and characterized for the two species of Sarcocystis. Sequences of the three former markers for S. tenella shared high identities with those of S. capracanis in goats, i.e., 99.0%, 98.3%, and 93.6%, respectively; the same three marker sequences of S. arieticanis shared high identities with those of S. hircicanis in goats, i.e., 98.5%, 96.5%, and 92.5%, respectively. No sequences in GenBank were found to significantly resemble the ITS-1 regions of S. tenella and S. arieticanis. Identities of the four genetic markers for S. tenella and S. arieticanis were 96.3%, 95.4%, 82.5%, and 66.2%, respectively. © J.-J. Hu et al., published by EDP Sciences, 2017.

  16. Normalization of Complete Genome Characteristics: Application to Evolution from Primitive Organisms to Homo sapiens.

    PubMed

    Sorimachi, Kenji; Okayasu, Teiji; Ohhira, Shuji

    2015-04-01

    Normalized nucleotide and amino acid contents of complete genome sequences can be visualized as radar charts. The shapes of these charts depict the characteristics of an organism's genome. The normalized values calculated from the genome sequence theoretically exclude experimental errors. Further, because normalization is independent of both target size and kind, this procedure is applicable not only to single genes but also to whole genomes, which consist of a huge number of different genes. In this review, we discuss the applications of the normalization of the nucleotide and predicted amino acid contents of complete genomes to the investigation of genome structure and to evolutionary research from primitive organisms to Homo sapiens. Some of the results could never have been obtained from the analysis of individual nucleotide or amino acid sequences but were revealed only after the normalization of nucleotide and amino acid contents was applied to genome research. The discovery that genome structure was homogeneous was obtained only after normalization methods were applied to the nucleotide or predicted amino acid contents of genome sequences. Normalization procedures are also applicable to evolutionary research. Thus, normalization of the contents of whole genomes is a useful procedure that can help to characterize organisms.

  17. Nucleotide Sequence of the blaRTG-2 (CARB-5) Gene and Phylogeny of a New Group of Carbenicillinases

    PubMed Central

    Choury, Daniele; Szajnert, Marie-France; Joly-Guillou, Marie-Laure; Azibi, Kemal; Delpech, Marc; Paul, Gérard

    2000-01-01

    We determined the nucleotide sequence of the bla gene for the Acinetobacter calcoaceticus β-lactamase previously described as CARB-5. Alignment of the deduced amino acid sequence with those of known β-lactamases revealed that CARB-5 possesses an RTG triad in box VII, as described for the Proteus mirabilis GN79 enzyme, instead of the RSG consensus characteristic of the other carbenicillinases. Phylogenetic studies showed that these RTG enzymes constitute a new, separate group, possibly ancestors of the carbenicillinase family. PMID:10722515

  18. Genome Sequence of the White Koji Mold Aspergillus kawachii IFO 4308, Used for Brewing the Japanese Distilled Spirit Shochu

    PubMed Central

    Futagami, Taiki; Mori, Kazuki; Yamashita, Ayaka; Wada, Shotaro; Kajiwara, Yasuhiro; Takashita, Hideharu; Omori, Toshiro; Takegawa, Kaoru; Tashiro, Kosuke; Kuhara, Satoru; Goto, Masatoshi

    2011-01-01

    The filamentous fungus Aspergillus kawachii has traditionally been used for brewing the Japanese distilled spirit shochu. A. kawachii characteristically hyperproduces citric acid and a variety of polysaccharide glycoside hydrolases. Here the genome sequence of A. kawachii IFO 4308 was determined and annotated. Analysis of the sequence may provide insight into the properties of this fungus that make it superior for use in shochu production, leading to the further development of A. kawachii for industrial applications. PMID:22045919

  19. Clinical usefulness of the definitions for defining characteristics of activity intolerance, excess fluid volume and decreased cardiac output in decompensated heart failure: a descriptive exploratory study.

    PubMed

    de Souza, Vanessa; Zeitoun, Sandra Salloum; Lopes, Camila Takao; de Oliveira, Ana Paula Dias; Lopes, Juliana de Lima; de Barros, Alba Lucia Bottura Leite

    2015-09-01

    To assess the clinical usefulness of the operational definitions for the defining characteristics of the NANDA International nursing diagnoses, activity intolerance, decreased cardiac output and excess fluid volume, and the concomitant presence of those diagnoses in patients with decompensated heart failure. Content validity of the operational definitions for the defining characteristics of activity intolerance, excess fluid volume and decreased cardiac output have been previously validated by experts. Their clinical usefulness requires clinical validation. This was a descriptive exploratory study. Two expert nurses independently assessed 25 patients with decompensated heart failure for the presence or absence of 29 defining characteristics. Interrater reliability was analysed using the Kappa coefficient as a measure of clinical usefulness. The Fisher's exact test was used to test the association of the defining characteristics of activity intolerance and excess fluid volume in the presence of decreased cardiac output, and the correlation between the three diagnoses. Assessments regarding the presence of all defining characteristics reached 100% agreement, except with anxiety. Five defining characteristics of excess fluid volume were significantly associated with the presence of decreased cardiac output. Concomitant presence of the three diagnoses occurred in 80% of the patients. However, there was no significant correlation between the three diagnoses. The operational definitions for the diagnoses had strong interrater reliability, therefore they were considered clinically useful. Only five defining characteristics were representative of the association between excess fluid volume and decreased cardiac output. Therefore, excess fluid volume is related to decreased cardiac output, although these diagnoses are not necessarily associated with activity intolerance. The operational definitions may favour early recognition of the sequence of responses to decompensation, guiding the choice of common interventions to improve or resolve excess fluid volume and decreased cardiac output. © 2015 John Wiley & Sons Ltd.

  20. Adaptive Learning Resources Sequencing in Educational Hypermedia Systems

    ERIC Educational Resources Information Center

    Karampiperis, Pythagoras; Sampson, Demetrios

    2005-01-01

    Adaptive learning resources selection and sequencing is recognized as among the most interesting research questions in adaptive educational hypermedia systems (AEHS). In order to adaptively select and sequence learning resources in AEHS, the definition of adaptation rules contained in the Adaptation Model, is required. Although, some efforts have…

  1. The Advanced Glaucoma Intervention Study (AGIS): 12. Baseline risk factors for sustained loss of visual field and visual acuity in patients with advanced glaucoma.

    PubMed

    2002-10-01

    To examine the relationships between baseline risk factors and sustained decrease of visual field (SDVF) and sustained decrease of visual acuity (SDVA). Cohort study of participants in the Advanced Glaucoma Intervention Study (AGIS). This multicenter study enrolled patients between 1988 and 1992 and followed them until 2001; 789 eyes of 591 patients with advanced glaucoma were randomly assigned to one of two surgical sequences, argon laser trabeculoplasty (ALT)-trabeculectomy-trabeculectomy (ATT) or trabeculectomy-ALT-trabeculectomy (TAT). This report is based on data from 747 eyes. Eyes were offered the next intervention in the sequence upon failure of the previous intervention. Failure was based on recurrent intraocular pressure elevation, visual field defect, and disk rim criteria. Study visits occurred every 6 months; potential follow-up ranged from 8 to 13 years. For each intervention sequence, Cox multiple regression analyses were used to examine the baseline characteristics for association with two vision outcomes: SDVF and SDVA. The magnitude of the association is measured by the hazard ratio (HR), where HR for binary variables is the relative change in the hazard (or risk) of the outcome in eyes with the factor divided by the hazard in eyes without the factor, and HR for continuous variables is the relative change in the hazard (or risk) of the outcome in eyes with a unit increase in the factor. Characteristics associated with increased SDVF risk in the ATT sequence are: less baseline visual field defect (hazard ratio [HR] = 0.86, P <.001, 95% CI = 0.82-0.90), male gender (HR = 2.23, P <.001, 1.54-3.23), and worse baseline visual acuity (HR = 0.96, P =.001, 0.94-0.98); in the TAT sequence: less baseline visual field defect (HR = 0.93, P =.001, 0.89-0.97) and diabetes (HR = 1.87, P =.007, 1.18-2.97). Characteristics associated with increased SDVA risk in both treatment sequences are better baseline acuity (ATT: HR = 1.05, P <.001, 1.02-1.09; TAT: HR = 1.06, P <.001, 1.03-1.08), older age (ATT: HR = 1.05, P =.001, 1.02-1.08; TAT: HR = 1.04, P =.002, 1.01-1.06), and less formal education (ATT: HR = 1.92, P =.001, 1.29-2.88; TAT: HR = 1.77, P =.002, 1.22-2.54). For SDVF, risk factors were better baseline visual field in both treatment sequences, male gender, and worse baseline visual acuity in the ATT sequence, and diabetes in the TAT sequence. For SDVA, risk factors in both treatment sequences were better baseline visual acuity, older age, and less formal education.

  2. Proposal of Vespertiliibacter pulmonis gen. nov., sp. nov. and two genomospecies as new members of the family Pasteurellaceae isolated from European bats.

    PubMed

    Mühldorfer, Kristin; Speck, Stephanie; Wibbelt, Gudrun

    2014-07-01

    Five bacterial strains isolated from bats of the family Vespertilionidae were characterized by phenotypic tests and multilocus sequence analysis (MLSA) using the 16S rRNA gene and four housekeeping genes (rpoA, rpoB, infB, recN). Phylogenetic analyses of individual and combined datasets indicated that the five strains represent a monophyletic cluster within the family Pasteurellaceae. Comparison of 16S rRNA gene sequences demonstrated a high degree of similarity (98.3-99.9%) among the group of bat-derived strains, while searches in nucleotide databases indicated less than 96% sequence similarity to known members of the Pasteurellaceae. The housekeeping genes rpoA, rpoB, infB and recN provided higher resolution compared with the 16S rRNA gene and subdivided the group according to the bat species from which the strains were isolated. Three strains derived from noctule bats shared 98.6-100% sequence similarity in all four genes investigated, whereas, based on rpoB, infB and recN gene sequences, 91.8-96% similarity was observed with and between the remaining two strains isolated from a serotine bat and a pipistrelle bat, respectively. Genome relatedness as deduced from recN gene sequences correlated well with the results of MLSA and indicated that the five strains represent a new genus. Based on these results, it is proposed to classify the five strains derived from bats within Vespertiliibacter pulmonis gen. nov., sp. nov. (the type species), Vespertiliibacter genomospecies 1 and Vespertiliibacter genomospecies 2. The genus can be distinguished phenotypically from recognized genera of the Pasteurellaceae by at least three characteristics. All strains are nutritionally fastidious and require a chemically defined supplement with NAD for growth. The DNA G+C content of strain E127/08(T) is 38.2 mol%. The type strain of Vespertiliibacter pulmonis gen. nov., sp. nov. is E127/08(T) ( = CCUG 64585(T) = DSM 27238(T)). The reference strains of Vespertiliibacter genomospecies 1 and 2 are E145/08 and E157/08, respectively. © 2014 IUMS.

  3. Transcriptional insulation of the human keratin 18 gene in transgenic mice.

    PubMed Central

    Neznanov, N; Thorey, I S; Ceceña, G; Oshima, R G

    1993-01-01

    Expression of the 10-kb human keratin 18 (K18) gene in transgenic mice results in efficient and appropriate tissue-specific expression in a variety of internal epithelial organs, including liver, lung, intestine, kidney, and the ependymal epithelium of brain, but not in spleen, heart, or skeletal muscle. Expression at the RNA level is directly proportional to the number of integrated K18 transgenes. These results indicate that the K18 gene is able to insulate itself both from the commonly observed cis-acting effects of the sites of integration and from the potential complications of duplicated copies of the gene arranged in head-to-tail fashion. To begin to identify the K18 gene sequences responsible for this property of transcriptional insulation, additional transgenic mouse lines containing deletions of either the 5' or 3' distal end of the K18 gene have been characterized. Deletion of 1.5 kb of the distal 5' flanking sequence has no effect upon either the tissue specificity or the copy number-dependent behavior of the transgene. In contrast, deletion of the 3.5-kb 3' flanking sequence of the gene results in the loss of the copy number-dependent behavior of the gene in liver and intestine. However, expression in kidney, lung, and brain remains efficient and copy number dependent in these transgenic mice. Furthermore, herpes simplex virus thymidine kinase gene expression is copy number dependent in transgenic mice when the gene is located between the distal 5'- and 3'-flanking sequences of the K18 gene. Each adult transgenic male expressed the thymidine kinase gene in testes and brain and proportionally to the number of integrated transgenes. We conclude that the characteristic of copy number-dependent expression of the K18 gene is tissue specific because the sequence requirements for transcriptional insulation in adult liver and intestine are different from those for lung and kidney. In addition, the behavior of the transgenic thymidine kinase gene in testes and brain suggests that the property of transcriptional insulation of the K18 gene may be conferred by the distal flanking sequences of the K18 gene and, additionally, may function for other genes. Images PMID:7681143

  4. Clustering evolving proteins into homologous families.

    PubMed

    Chan, Cheong Xin; Mahbob, Maisarah; Ragan, Mark A

    2013-04-08

    Clustering sequences into groups of putative homologs (families) is a critical first step in many areas of comparative biology and bioinformatics. The performance of clustering approaches in delineating biologically meaningful families depends strongly on characteristics of the data, including content bias and degree of divergence. New, highly scalable methods have recently been introduced to cluster the very large datasets being generated by next-generation sequencing technologies. However, there has been little systematic investigation of how characteristics of the data impact the performance of these approaches. Using clusters from a manually curated dataset as reference, we examined the performance of a widely used graph-based Markov clustering algorithm (MCL) and a greedy heuristic approach (UCLUST) in delineating protein families coded by three sets of bacterial genomes of different G+C content. Both MCL and UCLUST generated clusters that are comparable to the reference sets at specific parameter settings, although UCLUST tends to under-cluster compositionally biased sequences (G+C content 33% and 66%). Using simulated data, we sought to assess the individual effects of sequence divergence, rate heterogeneity, and underlying G+C content. Performance decreased with increasing sequence divergence, decreasing among-site rate variation, and increasing G+C bias. Two MCL-based methods recovered the simulated families more accurately than did UCLUST. MCL using local alignment distances is more robust across the investigated range of sequence features than are greedy heuristics using distances based on global alignment. Our results demonstrate that sequence divergence, rate heterogeneity and content bias can individually and in combination affect the accuracy with which MCL and UCLUST can recover homologous protein families. For application to data that are more divergent, and exhibit higher among-site rate variation and/or content bias, MCL may often be the better choice, especially if computational resources are not limiting.

  5. Bioinformatics analysis and genetic diversity of the poliovirus.

    PubMed

    Liu, Yanhan; Ma, Tengfei; Liu, Jianzhu; Zhao, Xiaona; Cheng, Ziqiang; Guo, Huijun; Wang, Shujing; Xu, Ruixue

    2014-12-01

    Poliomyelitis, a disease which can manifest as muscle paralysis, is caused by the poliovirus, which is a human enterovirus and member of the family Picornaviridae that usually transmits by the faecal-oral route. The viruses of the OPV (oral poliovirus attenuated-live vaccine) strains can mutate in the human intestine during replication and some of these mutations can lead to the recovery of serious neurovirulence. Informatics research of the poliovirus genome can be used to explain further the characteristics of this virus. In this study, sequences from 100 poliovirus isolates were acquired from GenBank. To determine the evolutionary relationship between the strains, we compared and analysed the sequences of the complete poliovirus genome and the VP1 region. The reconstructed phylogenetic trees for the complete sequences and the VP1 sequences were both divided into two branches, indicating that the genetic relationships of the whole poliovirus genome and the VP1 sequences are very similar. This branching indicates that the virulence and pathogenicity of poliomyelitis may be associated with the VP1 region. Sequence alignment of the VP1 region revealed numerous mutation sites in which mutation rates of >30 % were detected. In a group of strains recorded in the USA, mutation sites and mutation types were the same and this may be associated with their distribution in the evolutionary tree and their genetic relationship. In conclusion, the genetic evolutionary relationships of poliovirus isolate sequences are determined to a great extent by the VP1 protein, and poliovirus strains located on the same branch of the phylogenetic tree contain the same mutation spots and mutation types. Hence, the genetic characteristics of the VP1 region in the poliovirus genome should be analysed to identify the transmission route of poliovirus and provide the basis of viral immunity development. © 2014 The Authors.

  6. Megabase sequencing of human genome by ordered-shotgun-sequencing (OSS) strategy

    NASA Astrophysics Data System (ADS)

    Chen, Ellson Y.

    1997-05-01

    So far we have used OSS strategy to sequence over 2 megabases DNA in large-insert clones from regions of human X chromosomes with different characteristic levels of GC content. The method starts by randomly fragmenting a BAC, YAC or PAC to 8-12 kb pieces and subcloning those into lambda phage. Insert-ends of these clones are sequenced and overlapped to create a partial map. Complete sequencing is then done on a minimal tiling path of selected subclones, recursively focusing on those at the edges of contigs to facilitate mergers of clones across the entire target. To reduce manual labor, PCR processes have been adapted to prepare sequencing templates throughout the entire operation. The streamlined process can thus lend itself to further automation. The OSS approach is suitable for large- scale genomic sequencing, providing considerable flexibility in the choice of subclones or regions for more or less intensive sequencing. For example, subclones containing contaminating host cell DNA or cloning vector can be recognized and ignored with minimal sequencing effort; regions overlapping a neighboring clone already sequenced need not be redone; and segments containing tandem repeats or long repetitive sequences can be spotted early on and targeted for additional attention.

  7. Elucidating the triplicated ancestral genome structure of radish based on chromosome-level comparison with the Brassica genomes.

    PubMed

    Jeong, Young-Min; Kim, Namshin; Ahn, Byung Ohg; Oh, Mijin; Chung, Won-Hyong; Chung, Hee; Jeong, Seongmun; Lim, Ki-Byung; Hwang, Yoon-Jung; Kim, Goon-Bo; Baek, Seunghoon; Choi, Sang-Bong; Hyung, Dae-Jin; Lee, Seung-Won; Sohn, Seong-Han; Kwon, Soo-Jin; Jin, Mina; Seol, Young-Joo; Chae, Won Byoung; Choi, Keun Jin; Park, Beom-Seok; Yu, Hee-Ju; Mun, Jeong-Hwan

    2016-07-01

    This study presents a chromosome-scale draft genome sequence of radish that is assembled into nine chromosomal pseudomolecules. A comprehensive comparative genome analysis with the Brassica genomes provides genomic evidences on the evolution of the mesohexaploid radish genome. Radish (Raphanus sativus L.) is an agronomically important root vegetable crop and its origin and phylogenetic position in the tribe Brassiceae is controversial. Here we present a comprehensive analysis of the radish genome based on the chromosome sequences of R. sativus cv. WK10039. The radish genome was sequenced and assembled into 426.2 Mb spanning >98 % of the gene space, of which 344.0 Mb were integrated into nine chromosome pseudomolecules. Approximately 36 % of the genome was repetitive sequences and 46,514 protein-coding genes were predicted and annotated. Comparative mapping of the tPCK-like ancestral genome revealed that the radish genome has intermediate characteristics between the Brassica A/C and B genomes in the triplicated segments, suggesting an internal origin from the genus Brassica. The evolutionary characteristics shared between radish and other Brassica species provided genomic evidences that the current form of nine chromosomes in radish was rearranged from the chromosomes of hexaploid progenitor. Overall, this study provides a chromosome-scale draft genome sequence of radish as well as novel insight into evolution of the mesohexaploid genomes in the tribe Brassiceae.

  8. A dehydrin cognate protein from pea (Pisum sativum L.) with an atypical pattern of expression.

    PubMed

    Robertson, M; Chandler, P M

    1994-11-01

    Dehydrins are a family of proteins characterised by conserved amino acid motifs, and induced in plants by dehydration or treatment with ABA. An antiserum was raised against a synthetic oligopeptide based on the most highly conserved dehydrin amino acid motif, the lysine-rich (core sequence KIKEK-LPG). This antiserum detected a novel M(r) 40,000 polypeptide and enabled isolation of a corresponding cDNA clone, pPsB61 (B61). The deduced amino acid sequence contained two lysine-rich blocks, however the remainder of the sequenced differed markedly from other pea dehydrins. Surprisingly, the sequence contained a stretch of serine residues, a characteristic common to dehydrins from many plant species but which is missing in pea dehydrin. The expression patterns of B61 mRNA and polypeptide were distinctively different from those of the pea dehydrins during seed development, germination and in young seedlings exposed to dehydration stress or treated with ABA. In particular, dehydration stress led to slightly reduced levels of B61 RNA, and ABA application to young seedlings had no marked effect on its abundance. The M(r) 40,000 polypeptide is thus related to pea dehydrin by the presence of the most highly conserved amino acid sequence motifs, but lacks the characteristic expression pattern of dehydrin. By analogy with heat shock cognate proteins we refer to this protein as a dehydrin cognate.

  9. Saccharomyces cerevisiae SSB1 protein and its relationship to nucleolar RNA-binding proteins.

    PubMed Central

    Jong, A Y; Clark, M W; Gilbert, M; Oehm, A; Campbell, J L

    1987-01-01

    To better define the function of Saccharomyces cerevisiae SSB1, an abundant single-stranded nucleic acid-binding protein, we determined the nucleotide sequence of the SSB1 gene and compared it with those of other proteins of known function. The amino acid sequence contains 293 amino acid residues and has an Mr of 32,853. There are several stretches of sequence characteristic of other eucaryotic single-stranded nucleic acid-binding proteins. At the amino terminus, residues 39 to 54 are highly homologous to a peptide in calf thymus UP1 and UP2 and a human heterogeneous nuclear ribonucleoprotein. Residues 125 to 162 constitute a fivefold tandem repeat of the sequence RGGFRG, the composition of which suggests a nucleic acid-binding site. Near the C terminus, residues 233 to 245 are homologous to several RNA-binding proteins. Of 18 C-terminal residues, 10 are acidic, a characteristic of the procaryotic single-stranded DNA-binding proteins and eucaryotic DNA- and RNA-binding proteins. In addition, examination of the subcellular distribution of SSB1 by immunofluorescence microscopy indicated that SSB1 is a nuclear protein, predominantly located in the nucleolus. Sequence homologies and the nucleolar localization make it likely that SSB1 functions in RNA metabolism in vivo, although an additional role in DNA metabolism cannot be excluded. Images PMID:2823109

  10. Background sequence characteristics influence the occurrence and severity of disease-causing mtDNA mutations

    PubMed Central

    Wei, Wei; Hudson, Gavin

    2017-01-01

    Inherited mitochondrial DNA (mtDNA) mutations have emerged as a common cause of human disease, with mutations occurring multiple times in the world population. The clinical presentation of three pathogenic mtDNA mutations is strongly associated with a background mtDNA haplogroup, but it is not clear whether this is limited to a handful of examples or is a more general phenomenon. To address this, we determined the characteristics of 30,506 mtDNA sequences sampled globally. After performing several quality control steps, we ascribed an established pathogenicity score to the major alleles for each sequence. The mean pathogenicity score for known disease-causing mutations was significantly different between mtDNA macro-haplogroups. Several mutations were observed across all haplogroup backgrounds, whereas others were only observed on specific clades. In some instances this reflected a founder effect, but in others, the mutation recurred but only within the same phylogenetic cluster. Sequence diversity estimates showed that disease-causing mutations were more frequent on young sequences, and genomes with two or more disease-causing mutations were more common than expected by chance. These findings implicate the mtDNA background more generally in recurrent mutation events that have been purified through natural selection in older populations. This provides an explanation for the low frequency of mtDNA disease reported in specific ethnic groups. PMID:29253894

  11. Temporal variation of aftershocks by means of multifractal characterization of their inter-event time and cluster analysis

    NASA Astrophysics Data System (ADS)

    Figueroa-Soto, A.; Zuñiga, R.; Marquez-Ramirez, V.; Monterrubio-Velasco, M.

    2017-12-01

    . The inter-event time characteristics of seismic aftershock sequences can provide important information to discern stages in the aftershock generation process. In order to investigate whether separate dynamic stages can be identified, (1) aftershock series after selected earthquake mainshocks, which took place at similar tectonic regimes were analyzed. To this end we selected two well-defined aftershock sequences from New Zealand and one aftershock sequence for Mexico, we (2) analyzed the fractal behavior of the logarithm of inter-event times (also called waiting times) of aftershocks by means of Holdeŕs exponent, and (3) their magnitude and spatial location based on a methodology proposed by Zaliapin and Ben Zion [2011] which accounts for the clustering properties of the sequence. In general, more than two coherent process stages can be identified following the main rupture, evidencing a type of "cascade" process which precludes implying a single generalized power law even though the temporal rate and average fractal character appear to be unique (as in a single Omorís p value). We found that aftershock processes indeed show multi-fractal characteristics, which may be related to different stages in the process of diffusion, as seen in the temporary-spatial distribution of aftershocks. Our method provides a way of defining the onset of the return to seismic background activity and the end of the main aftershock sequence.

  12. Shark (Scyliorhinus torazame) metallothionein: cDNA cloning, genomic sequence, and expression analysis.

    PubMed

    Cho, Young Sun; Choi, Buyl Nim; Ha, En-Mi; Kim, Ki Hong; Kim, Sung Koo; Kim, Dong Soo; Nam, Yoon Kwon

    2005-01-01

    Novel metallothionein (MT) complementary DNA and genomic sequences were isolated from a cartilaginous shark species, Scyliorhinus torazame. The full-length open reading frame (ORF) of shark MT cDNA encoded 68 amino acids with a high cysteine content (29%). The genomic ORF sequence (932 bp) of shark MT isolated by polymerase chain reaction (PCR) comprised 3 exons with 2 interventing introns. Shark MT sequence shared many conserved features with other vertebrate MTs: overall amino acid identities of shark MT ranged from 47% to 57% with fish MTs, and 41% to 62% with mammalian MTs. However, in addition to these conserved characteristics, shark MT sequence exhibited some unique characteristics. It contained 4 extra amino acids (Lys-Ala-Gly-Arg) at the end of the beta-domain, which have not been reported in any other vertebrate MTs. The last amino acid residue at the C-terminus was Ser, which also has not been reported in fish and mammalian MTs. The MT messenger RNA levels in shark liver and kidney, assessed by semiquantitative reverse transcriptase PCR and RNA blot hybridization, were significantly affected by experimental exposures to heavy metals (cadmium, copper, and zinc). Generally, the transcriptional activation of shark MT gene was dependent on the dose (0-10 mg/kg body weight for injection and 0-20 microM for immersion) and duration (1-10 days); zinc was a more potent inducer than copper and cadmium.

  13. AgdbNet – antigen sequence database software for bacterial typing

    PubMed Central

    Jolley, Keith A; Maiden, Martin CJ

    2006-01-01

    Background Bacterial typing schemes based on the sequences of genes encoding surface antigens require databases that provide a uniform, curated, and widely accepted nomenclature of the variants identified. Due to the differences in typing schemes, imposed by the diversity of genes targeted, creating these databases has typically required the writing of one-off code to link the database to a web interface. Here we describe agdbNet, widely applicable web database software that facilitates simultaneous BLAST querying of multiple loci using either nucleotide or peptide sequences. Results Databases are described by XML files that are parsed by a Perl CGI script. Each database can have any number of loci, which may be defined by nucleotide and/or peptide sequences. The software is currently in use on at least five public databases for the typing of Neisseria meningitidis, Campylobacter jejuni and Streptococcus equi and can be set up to query internal isolate tables or suitably-configured external isolate databases, such as those used for multilocus sequence typing. The style of the resulting website can be fully configured by modifying stylesheets and through the use of customised header and footer files that surround the output of the script. Conclusion The software provides a rapid means of setting up customised Internet antigen sequence databases. The flexible configuration options enable typing schemes with differing requirements to be accommodated. PMID:16790057

  14. A comparison of the effects of temporary hippocampal lesions on single and dual context versions of the olfactory sequence memory task.

    PubMed

    Sill, Orriana C; Smith, David M

    2012-08-01

    In recent years, many animal models of memory have focused on one or more of the various components of episodic memory. For example, the odor sequence memory task requires subjects to remember individual items and events (the odors) and the temporal aspects of the experience (the sequence of odor presentation). The well-known spatial context coding function of the hippocampus, as exemplified by place cell firing, may reflect the "where" component of episodic memory. In the present study, we added a contextual component to the odor sequence memory task by training rats to choose the earlier odor in one context and the later odor in another context and we compared the effects of temporary hippocampal lesions on performance of the original single context task and the new dual context task. Temporary lesions significantly impaired the single context task, although performance remained significantly above chance levels. In contrast, performance dropped all the way to chance when temporary lesions were used in the dual context task. These results demonstrate that rats can learn a dual context version of the odor sequence learning task that requires the use of contextual information along with the requirement to remember the "what" and "when" components of the odor sequence. Moreover, the addition of the contextual component made the task fully dependent on the hippocampus.

  15. RetroTector online, a rational tool for analysis of retroviral elements in small and medium size vertebrate genomic sequences

    PubMed Central

    Sperber, Göran; Lövgren, Anders; Eriksson, Nils-Einar; Benachenhou, Farid; Blomberg, Jonas

    2009-01-01

    Background The rapid accumulation of genomic information in databases necessitates rapid and specific algorithms for extracting biologically meaningful information. More or less complete retroviral sequences, also called proviral or endogenous retroviral sequences; ERVs, constitutes at least 5% of vertebrate genomes. After infecting the host, these retroviruses have integrated in germ line cells, and have then been carried in genomes for at least several 100 million years. A better understanding of structure and function of these sequences can have profound biological and medical consequences. Methods RetroTector© (ReTe) is a platform-independent Java program for identification and characterization of proviral sequences in vertebrate genomes. The full ReTe requires a local installation with a MySQL database. Although not overly complicated, the installation may take some time. A "light" version of ReTe, (RetroTector online; ROL) which does not require specific installation procedures is provided, via the World Wide Web. Results ROL was implemented under the Batchelor web interface (A Lövgren et al). It allows both GenBank accession number, file and FASTA cut-and-paste admission of sequences (5 to 10 000 kilobases). Up to ten submissions can be done simultaneously, allowing batch analysis of <= 100 Megabases. Jobs are shown in an IP-number specific list. Results are text files, and can be viewed with the program, RetroTectorViewer.jar (at the same site), which has the full graphical capabilities of the basic ReTe program. A detailed analysis of any retroviral sequences found in the submitted sequence is graphically presented, exportable in standard formats. With the current server, a complete analysis of a 1 Megabase sequence is complete in 10 minutes. It is possible to mask nonretroviral repetitive sequences in the submitted sequence, using host genome specific "brooms", which increase specificity. Discussion Proviral sequences can be hard to recognize, especially if the integration occurred many million years ago. Precise delineation of LTR, gag, pro, pol and env can be difficult, requiring manual work. ROL is a way of simplifying these tasks. Conclusion ROL provides 1. annotation and presentation of known retroviral sequences, 2. detection of proviral chains in unknown genomic sequences, with up to 100 Mbase per submission. PMID:19534753

  16. RetroTector online, a rational tool for analysis of retroviral elements in small and medium size vertebrate genomic sequences.

    PubMed

    Sperber, Göran; Lövgren, Anders; Eriksson, Nils-Einar; Benachenhou, Farid; Blomberg, Jonas

    2009-06-16

    The rapid accumulation of genomic information in databases necessitates rapid and specific algorithms for extracting biologically meaningful information. More or less complete retroviral sequences, also called proviral or endogenous retroviral sequences; ERVs, constitutes at least 5% of vertebrate genomes. After infecting the host, these retroviruses have integrated in germ line cells, and have then been carried in genomes for at least several 100 million years. A better understanding of structure and function of these sequences can have profound biological and medical consequences. RetroTector (ReTe) is a platform-independent Java program for identification and characterization of proviral sequences in vertebrate genomes. The full ReTe requires a local installation with a MySQL database. Although not overly complicated, the installation may take some time. A "light" version of ReTe, (RetroTector online; ROL) which does not require specific installation procedures is provided, via the World Wide Web. ROL http://www.fysiologi.neuro.uu.se/jbgs/ was implemented under the Batchelor web interface (A Lövgren et al). It allows both GenBank accession number, file and FASTA cut-and-paste admission of sequences (5 to 10,000 kilobases). Up to ten submissions can be done simultaneously, allowing batch analysis of

  17. Archaebacterial rhodopsin sequences: Implications for evolution

    NASA Technical Reports Server (NTRS)

    Lanyi, J. K.

    1991-01-01

    It was proposed over 10 years ago that the archaebacteria represent a separate kingdom which diverged very early from the eubacteria and eukaryotes. It follows that investigations of archaebacterial characteristics might reveal features of early evolution. So far, two genes, one for bacteriorhodopsin and another for halorhodopsin, both from Halobacterium halobium, have been sequenced. We cloned and sequenced the gene coding for the polypeptide of another one of these rhodopsins, a halorhodopsin in Natronobacterium pharaonis. Peptide sequencing of cyanogen bromide fragments, and immuno-reactions of the protein and synthetic peptides derived from the C-terminal gene sequence, confirmed that the open reading frame was the structural gene for the pharaonis halorhodopsin polypeptide. The flanking DNA sequences of this gene, as well as those of other bacterial rhodopsins, were compared to previously proposed archaebacterial consensus sequences. In pairwise comparisons of the open reading frame with DNA sequences for bacterio-opsin and halo-opsin from Halobacterium halobium, silent divergences were calculated. These indicate very considerable evolutionary distance between each pair of genes, even in the dame organism. In spite of this, three protein sequences show extensive similarities, indicating strong selective pressures.

  18. Genomics dataset of unidentified disclosed isolates.

    PubMed

    Rekadwad, Bhagwan N

    2016-09-01

    Analysis of DNA sequences is necessary for higher hierarchical classification of the organisms. It gives clues about the characteristics of organisms and their taxonomic position. This dataset is chosen to find complexities in the unidentified DNA in the disclosed patents. A total of 17 unidentified DNA sequences were thoroughly analyzed. The quick response codes were generated. AT/GC content of the DNA sequences analysis was carried out. The QR is helpful for quick identification of isolates. AT/GC content is helpful for studying their stability at different temperatures. Additionally, a dataset on cleavage code and enzyme code studied under the restriction digestion study, which helpful for performing studies using short DNA sequences was reported. The dataset disclosed here is the new revelatory data for exploration of unique DNA sequences for evaluation, identification, comparison and analysis.

  19. Building Conceptual Understanding in Young Scientists.

    ERIC Educational Resources Information Center

    Hawley, Duncan

    2002-01-01

    Describes the use of a new pedagogic approach to geology used to create a sequence of investigative activities enabling students to speculate, hypothesize, observe, test, reason, and infer about the characteristics of rocks. The approach is framed by two questions: (1) What are the key characteristics of different rock groups?; and (2) How did the…

  20. Moisture removal characteristics of thin layer rough rice under sequenced infrared radiation heating and cooling

    USDA-ARS?s Scientific Manuscript database

    Rice drying with infrared (IR) radiation has been investigated during recent years and showed promising potential with improved quality and energy efficiency. The objective of this study was to further investigate the moisture removal characteristics of thin layer rough rice heated by IR and cooled ...

  1. Driving style recognition method using braking characteristics based on hidden Markov model

    PubMed Central

    Wu, Chaozhong; Lyu, Nengchao; Huang, Zhen

    2017-01-01

    Since the advantage of hidden Markov model in dealing with time series data and for the sake of identifying driving style, three driving style (aggressive, moderate and mild) are modeled reasonably through hidden Markov model based on driver braking characteristics to achieve efficient driving style. Firstly, braking impulse and the maximum braking unit area of vacuum booster within a certain time are collected from braking operation, and then general braking and emergency braking characteristics are extracted to code the braking characteristics. Secondly, the braking behavior observation sequence is used to describe the initial parameters of hidden Markov model, and the generation of the hidden Markov model for differentiating and an observation sequence which is trained and judged by the driving style is introduced. Thirdly, the maximum likelihood logarithm could be implied from the observable parameters. The recognition accuracy of algorithm is verified through experiments and two common pattern recognition algorithms. The results showed that the driving style discrimination based on hidden Markov model algorithm could realize effective discriminant of driving style. PMID:28837580

  2. Using Common Graphics Paradigms Implemented in a Java Applet to Represent Complex Scheduling Requirements

    NASA Technical Reports Server (NTRS)

    Jaap, John; Meyer, Patrick; Davis, Elizabeth

    1997-01-01

    The experiments planned for the International Space Station promise to be complex, lengthy and diverse. The scarcity of the space station resources will cause significant competition for resources between experiments. The scheduling job facing the Space Station mission planning software requires a concise and comprehensive description of the experiments' requirements (to ensure a valid schedule) and a good description of the experiments' flexibility (to effectively utilize available resources). In addition, the continuous operation of the station, the wide geographic dispersion of station users, and the budgetary pressure to reduce operations manpower make a low-cost solution mandatory. A graphical representation of the scheduling requirements for station payloads implemented via an Internet-based application promises to be an elegant solution that addresses all of these issues. The graphical representation of experiment requirements permits a station user to describe his experiment by defining "activities" and "sequences of activities". Activities define the resource requirements (with alternatives) and other quantitative constraints of tasks to be performed. Activities definitions use an "outline" graphics paradigm. Sequences define the time relationships between activities. Sequences may also define time relationships with activities of other payloads or space station systems. Sequences of activities are described by a "network" graphics paradigm. The bulk of this paper will describe the graphical approach to representing requirements and provide examples that show the ease and clarity with which complex requirements can be represented. A Java applet, to run in a web browser, is being developed to support the graphical representation of payload scheduling requirements. Implementing the entry and editing of requirements via the web solves the problems introduced by the geographic dispersion of users. Reducing manpower is accomplished by developing a concise representation which eliminates the misunderstanding possible with verbose representations and which captures the complete requirements and flexibility of the experiments.

  3. Integrated biostratigraphic and sequence stratigraphic framework for Upper Cretaceous strata of the eastern Gulf Coastal Plain, USA

    USGS Publications Warehouse

    Mancini, E.A.; Puckett, T.M.; Tew, B.H.

    1996-01-01

    Upper Cretaceous (Santonian-Maastrichtian stages) strata of the eastern US Gulf Coastal Plain represent a relatively complete section of marine to nonmarine mixed siliciclastic and carbonate sediments. This section includes three depositional sequences which display characteristic systems tracts and distinct physical defining surfaces. The marine lithofacies are rich in calcareous nannoplankton and planktonic foraminifera which can be used for biostratigraphic zonation. Integration of this zonation with the lithostratigraphy and sequence stratigraphy of these strata results in a framework that can be used for local and regional intrabasin correlation and potentially for global interbasin correlation. Only the synchronous maximum flooding surfaces of these depositional sequences, however, have chronostratigraphic significance. The sequence boundaries and initial flooding surfaces are diachronous, and their use for correlation can produce conflicting results. The availability of high resolution biostratigraphy is critical for global correlation of depositional sequences. ?? 1996 Academic Press Limited.

  4. Skilled memory in expert figure skaters.

    PubMed

    Deakin, J M; Allard, F

    1991-01-01

    The present studies extend skilled-memory theory to a domain involving the performance of motor sequences. Skilled figure skaters were better able than their less skilled counterparts to perform short skating sequences that were choreographed, rather than randomly constructed. Expert skaters encoded sequences for performance very differently from the way in which they encoded sequences that were verbally presented for verbal recall. Tasks interpolated between sequence and recall showed no significant influence on recall accuracy, implicating long-term memory in skating memory. There was little evidence for the use of retrieval structures when skaters learned the brief sequences used throughout these studies. Finally, expert skaters were able to judge the similarity of two skating elements faster than less skilled skaters, indicating a faster access to semantic memory for experts. The data indicate that skaters show many of the same skilled-memory characteristics as have been described in other skill domains involving memorization, such as digit span and memory for dinner orders.

  5. Genotype and Phenotype of Echinococcus granulosus Derived from Wild Sheep (Ovis orientalis) in Iran.

    PubMed

    Eslami, Ali; Meshgi, Behnam; Jalousian, Fatemeh; Rahmani, Shima; Salari, Mohammad Ali

    2016-02-01

    The aim of the present study is to determine the characteristics of genotype and phenotype of Echinococcus granulosus derived from wild sheep and to compare them with the strains of E. granulosus sensu stricto (sheep-dog) and E. granulosus camel strain (camel-dog) in Iran. In Khojir National Park, near Tehran, Iran, a fertile hydatid cyst was recently found in the liver of a dead wild sheep (Ovis orientalis). The number of protoscolices (n=6,000) proved enough for an experimental infection in a dog. The characteristics of large and small hooks of metacestode were statistically determined as the sensu stricto strain but not the camel strain (P=0.5). To determine E. granulosus genotype, 20 adult worms of this type were collected from the infected dog. The second internal transcribed spacer (ITS2) of the nuclear ribosomal DNA (rDNA) and cytochrome c oxidase 1 subunit (COX1) of the mitochondrial DNA were amplified from individual adult worm by PCR. Subsequently, the PCR product was sequenced by Sanger method. The lengths of ITS2 and COX1 sequences were 378 and 857 bp, respectively, for all the sequenced samples. The amplified DNA sequences from both ribosomal and mitochondrial genes were highly similar (99% and 98%, respectively) to that of the ovine strain in the GenBank database. The results of the present study indicate that the morpho-molecular features and characteristics of E. granulosus in the Iranian wild sheep are the same as those of the sheep-dog E. granulosus sensu stricto strain.

  6. Sequence Segmentation with changeptGUI.

    PubMed

    Tasker, Edward; Keith, Jonathan M

    2017-01-01

    Many biological sequences have a segmental structure that can provide valuable clues to their content, structure, and function. The program changept is a tool for investigating the segmental structure of a sequence, and can also be applied to multiple sequences in parallel to identify a common segmental structure, thus providing a method for integrating multiple data types to identify functional elements in genomes. In the previous edition of this book, a command line interface for changept is described. Here we present a graphical user interface for this package, called changeptGUI. This interface also includes tools for pre- and post-processing of data and results to facilitate investigation of the number and characteristics of segment classes.

  7. The genome sequence of pepper vein yellows virus (family Luteoviridae, genus Polerovirus).

    PubMed

    Murakami, Ritsuko; Nakashima, Nobuhiko; Hinomoto, Norihide; Kawano, Shinji; Toyosato, Tetsuya

    2011-05-01

    The complete genome of pepper vein yellows virus (PeVYV) was sequenced using random amplification of RNA samples isolated from vector insects (Aphis gossypii) that had been given access to PeVYV-infected plants. The PeVYV genome consisted of 6244 nucleotides and had a genomic organization characteristic of members of the genus Polerovirus. PeVYV had highest amino acid sequence identities in ORF0 to ORF3 (75.9 - 91.9%) with tobacco vein distorting polerovirus, with which it was only 25.1% identical in ORF5. These sequence comparisons and previously studied biological properties indicate that PeVYV is a distinctly different virus and belongs to a new species of the genus Polerovirus.

  8. LongISLND: in silico sequencing of lengthy and noisy datatypes.

    PubMed

    Lau, Bayo; Mohiyuddin, Marghoob; Mu, John C; Fang, Li Tai; Bani Asadi, Narges; Dallett, Carolina; Lam, Hugo Y K

    2016-12-15

    LongISLND is a software package designed to simulate sequencing data according to the characteristics of third generation, single-molecule sequencing technologies. The general software architecture is easily extendable, as demonstrated by the emulation of Pacific Biosciences (PacBio) multi-pass sequencing with P5 and P6 chemistries, producing data in FASTQ, H5, and the latest PacBio BAM format. We demonstrate its utility by downstream processing with consensus building and variant calling. LongISLND is implemented in Java and available at http://bioinform.github.io/longislnd CONTACT: hugo.lam@roche.comSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  9. LETTER TO THE EDITOR: Exhaustive search for low-autocorrelation binary sequences

    NASA Astrophysics Data System (ADS)

    Mertens, S.

    1996-09-01

    Binary sequences with low autocorrelations are important in communication engineering and in statistical mechanics as ground states of the Bernasconi model. Computer searches are the main tool in the construction of such sequences. Owing to the exponential size 0305-4470/29/18/005/img1 of the configuration space, exhaustive searches are limited to short sequences. We discuss an exhaustive search algorithm with run-time characteristic 0305-4470/29/18/005/img2 and apply it to compile a table of exact ground states of the Bernasconi model up to N = 48. The data suggest F > 9 for the optimal merit factor in the limit 0305-4470/29/18/005/img3.

  10. A Plasmodium falciparum copper-binding membrane protein with copper transport motifs

    PubMed Central

    2012-01-01

    Background Copper is an essential catalytic co-factor for metabolically important cellular enzymes, such as cytochrome-c oxidase. Eukaryotic cells acquire copper through a copper transport protein and distribute intracellular copper using molecular chaperones. The copper chelator, neocuproine, inhibits Plasmodium falciparum ring-to-trophozoite transition in vitro, indicating a copper requirement for malaria parasite development. How the malaria parasite acquires or secretes copper still remains to be fully elucidated. Methods PlasmoDB was searched for sequences corresponding to candidate P. falciparum copper-requiring proteins. The amino terminal domain of a putative P. falciparum copper transport protein was cloned and expressed as a maltose binding fusion protein. The copper binding ability of this protein was examined. Copper transport protein-specific anti-peptide antibodies were generated in chickens and used to establish native protein localization in P. falciparum parasites by immunofluorescence microscopy. Results Six P. falciparum copper-requiring protein orthologs and a candidate P. falciparum copper transport protein (PF14_0369), containing characteristic copper transport protein features, were identified in PlasmoDB. The recombinant amino terminal domain of the transport protein bound reduced copper in vitro and within Escherichia coli cells during recombinant expression. Immunolocalization studies tracked the copper binding protein translocating from the erythrocyte plasma membrane in early ring stage to a parasite membrane as the parasites developed to schizonts. The protein appears to be a PEXEL-negative membrane protein. Conclusion Plasmodium falciparum parasites express a native protein with copper transporter characteristics that binds copper in vitro. Localization of the protein to the erythrocyte and parasite plasma membranes could provide a mechanism for the delivery of novel anti-malarial compounds. PMID:23190769

  11. Multiplexing Short Primers for Viral Family PCR

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gardner, S N; Hiddessen, A L; Hara, C A

    We describe a Multiplex Primer Prediction (MPP) algorithm to build multiplex compatible primer sets for large, diverse, and unalignable sets of target sequences. The MPP algorithm is scalable to larger target sets than other available software, and it does not require a multiple sequence alignment. We applied it to questions in viral detection, and demonstrated that there are no universally conserved priming sequences among viruses and that it could require an unfeasibly large number of primers ({approx}3700 18-mers or {approx}2000 10-mers) to generate amplicons from all sequenced viruses. We then designed primer sets separately for each viral family, and formore » several diverse species such as foot-and-mouth disease virus, hemagglutinin and neuraminidase segments of influenza A virus, Norwalk virus, and HIV-1.« less

  12. A microfluidic device for preparing next generation DNA sequencing libraries and for automating other laboratory protocols that require one or more column chromatography steps.

    PubMed

    Tan, Swee Jin; Phan, Huan; Gerry, Benjamin Michael; Kuhn, Alexandre; Hong, Lewis Zuocheng; Min Ong, Yao; Poon, Polly Suk Yean; Unger, Marc Alexander; Jones, Robert C; Quake, Stephen R; Burkholder, William F

    2013-01-01

    Library preparation for next-generation DNA sequencing (NGS) remains a key bottleneck in the sequencing process which can be relieved through improved automation and miniaturization. We describe a microfluidic device for automating laboratory protocols that require one or more column chromatography steps and demonstrate its utility for preparing Next Generation sequencing libraries for the Illumina and Ion Torrent platforms. Sixteen different libraries can be generated simultaneously with significantly reduced reagent cost and hands-on time compared to manual library preparation. Using an appropriate column matrix and buffers, size selection can be performed on-chip following end-repair, dA tailing, and linker ligation, so that the libraries eluted from the chip are ready for sequencing. The core architecture of the device ensures uniform, reproducible column packing without user supervision and accommodates multiple routine protocol steps in any sequence, such as reagent mixing and incubation; column packing, loading, washing, elution, and regeneration; capture of eluted material for use as a substrate in a later step of the protocol; and removal of one column matrix so that two or more column matrices with different functional properties can be used in the same protocol. The microfluidic device is mounted on a plastic carrier so that reagents and products can be aliquoted and recovered using standard pipettors and liquid handling robots. The carrier-mounted device is operated using a benchtop controller that seals and operates the device with programmable temperature control, eliminating any requirement for the user to manually attach tubing or connectors. In addition to NGS library preparation, the device and controller are suitable for automating other time-consuming and error-prone laboratory protocols requiring column chromatography steps, such as chromatin immunoprecipitation.

  13. A Microfluidic Device for Preparing Next Generation DNA Sequencing Libraries and for Automating Other Laboratory Protocols That Require One or More Column Chromatography Steps

    PubMed Central

    Tan, Swee Jin; Phan, Huan; Gerry, Benjamin Michael; Kuhn, Alexandre; Hong, Lewis Zuocheng; Min Ong, Yao; Poon, Polly Suk Yean; Unger, Marc Alexander; Jones, Robert C.; Quake, Stephen R.; Burkholder, William F.

    2013-01-01

    Library preparation for next-generation DNA sequencing (NGS) remains a key bottleneck in the sequencing process which can be relieved through improved automation and miniaturization. We describe a microfluidic device for automating laboratory protocols that require one or more column chromatography steps and demonstrate its utility for preparing Next Generation sequencing libraries for the Illumina and Ion Torrent platforms. Sixteen different libraries can be generated simultaneously with significantly reduced reagent cost and hands-on time compared to manual library preparation. Using an appropriate column matrix and buffers, size selection can be performed on-chip following end-repair, dA tailing, and linker ligation, so that the libraries eluted from the chip are ready for sequencing. The core architecture of the device ensures uniform, reproducible column packing without user supervision and accommodates multiple routine protocol steps in any sequence, such as reagent mixing and incubation; column packing, loading, washing, elution, and regeneration; capture of eluted material for use as a substrate in a later step of the protocol; and removal of one column matrix so that two or more column matrices with different functional properties can be used in the same protocol. The microfluidic device is mounted on a plastic carrier so that reagents and products can be aliquoted and recovered using standard pipettors and liquid handling robots. The carrier-mounted device is operated using a benchtop controller that seals and operates the device with programmable temperature control, eliminating any requirement for the user to manually attach tubing or connectors. In addition to NGS library preparation, the device and controller are suitable for automating other time-consuming and error-prone laboratory protocols requiring column chromatography steps, such as chromatin immunoprecipitation. PMID:23894273

  14. Method and apparatus for automated assembly

    DOEpatents

    Jones, Rondall E.; Wilson, Randall H.; Calton, Terri L.

    1999-01-01

    A process and apparatus generates a sequence of steps for assembly or disassembly of a mechanical system. Each step in the sequence is geometrically feasible, i.e., the part motions required are physically possible. Each step in the sequence is also constraint feasible, i.e., the step satisfies user-definable constraints. Constraints allow process and other such limitations, not usually represented in models of the completed mechanical system, to affect the sequence.

  15. Measures of phylogenetic differentiation provide robust and complementary insights into microbial communities.

    PubMed

    Parks, Donovan H; Beiko, Robert G

    2013-01-01

    High-throughput sequencing techniques have made large-scale spatial and temporal surveys of microbial communities routine. Gaining insight into microbial diversity requires methods for effectively analyzing and visualizing these extensive data sets. Phylogenetic β-diversity measures address this challenge by allowing the relationship between large numbers of environmental samples to be explored using standard multivariate analysis techniques. Despite the success and widespread use of phylogenetic β-diversity measures, an extensive comparative analysis of these measures has not been performed. Here, we compare 39 measures of phylogenetic β diversity in order to establish the relative similarity of these measures along with key properties and performance characteristics. While many measures are highly correlated, those commonly used within microbial ecology were found to be distinct from those popular within classical ecology, and from the recently recommended Gower and Canberra measures. Many of the measures are surprisingly robust to different rootings of the gene tree, the choice of similarity threshold used to define operational taxonomic units, and the presence of outlying basal lineages. Measures differ considerably in their sensitivity to rare organisms, and the effectiveness of measures can vary substantially under alternative models of differentiation. Consequently, the depth of sequencing required to reveal underlying patterns of relationships between environmental samples depends on the selected measure. Our results demonstrate that using complementary measures of phylogenetic β diversity can further our understanding of how communities are phylogenetically differentiated. Open-source software implementing the phylogenetic β-diversity measures evaluated in this manuscript is available at http://kiwi.cs.dal.ca/Software/ExpressBetaDiversity.

  16. 47 CFR 2.1047 - Measurements required: Modulation characteristics.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... 47 Telecommunication 1 2010-10-01 2010-10-01 false Measurements required: Modulation characteristics. 2.1047 Section 2.1047 Telecommunication FEDERAL COMMUNICATIONS COMMISSION GENERAL FREQUENCY... Certification § 2.1047 Measurements required: Modulation characteristics. (a) Voice modulated communication...

  17. 47 CFR 2.1047 - Measurements required: Modulation characteristics.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... 47 Telecommunication 1 2011-10-01 2011-10-01 false Measurements required: Modulation characteristics. 2.1047 Section 2.1047 Telecommunication FEDERAL COMMUNICATIONS COMMISSION GENERAL FREQUENCY... Certification § 2.1047 Measurements required: Modulation characteristics. (a) Voice modulated communication...

  18. Sequence search on a supercomputer.

    PubMed

    Gotoh, O; Tagashira, Y

    1986-01-10

    A set of programs was developed for searching nucleic acid and protein sequence data bases for sequences similar to a given sequence. The programs, written in FORTRAN 77, were optimized for vector processing on a Hitachi S810-20 supercomputer. A search of a 500-residue protein sequence against the entire PIR data base Ver. 1.0 (1) (0.5 M residues) is carried out in a CPU time of 45 sec. About 4 min is required for an exhaustive search of a 1500-base nucleotide sequence against all mammalian sequences (1.2M bases) in Genbank Ver. 29.0. The CPU time is reduced to about a quarter with a faster version.

  19. Spliced leader RNA of trypanosomes: in vivo mutational analysis reveals extensive and distinct requirements for trans splicing and cap4 formation.

    PubMed Central

    Lücke, S; Xu, G L; Palfi, Z; Cross, M; Bellofatto, V; Bindereif, A

    1996-01-01

    In trypanosomes mRNAs are generated through trans splicing. The spliced leader (SL) RNA, which donates the 5'-terminal mini-exon to each of the protein coding exons, plays a central role in the trans splicing process. We have established in vivo assays to study in detail trans splicing, cap4 modification, and RNP assembly of the SL RNA in the trypanosomatid species Leptomonas seymouri. First, we found that extensive sequences within the mini-exon are required for SL RNA function in vivo, although a conserved length of 39 nt is not essential. In contrast, the intron sequence appears to be surprisingly tolerant to mutation; only the stem-loop II structure is indispensable. The asymmetry of the sequence requirements in the stem I region suggests that this domain may exist in different functional conformations. Second, distinct mini-exon sequences outside the modification site are important for efficient cap4 formation. Third, all SL RNA mutations tested allowed core RNP assembly, suggesting flexible requirements for core protein binding. In sum, the results of our mutational analysis provide evidence for a discrete domain structure of the SL RNA and help to explain the strong phylogenetic conservation of the mini-exon sequence and of the overall SL RNA secondary structure; they also suggest that there may be certain differences between trans splicing in nematodes and trypanosomes. This approach provides a basis for studying RNA-RNA interactions in the trans spliceosome. Images PMID:8861965

  20. Characteristic motifs for families of allergenic proteins

    PubMed Central

    Ivanciuc, Ovidiu; Garcia, Tzintzuni; Torres, Miguel; Schein, Catherine H.; Braun, Werner

    2008-01-01

    The identification of potential allergenic proteins is usually done by scanning a database of allergenic proteins and locating known allergens with a high sequence similarity. However, there is no universally accepted cut-off value for sequence similarity to indicate potential IgE cross-reactivity. Further, overall sequence similarity may be less important than discrete areas of similarity in proteins with homologous structure. To identify such areas, we first classified all allergens and their subdomains in the Structural Database of Allergenic Proteins (SDAP, http://fermi.utmb.edu/SDAP/) to their closest protein families as defined in Pfam, and identified conserved physicochemical property motifs characteristic of each group of sequences. Allergens populate only a small subset of all known Pfam families, as all allergenic proteins in SDAP could be grouped to only 130 (of 9318 total) Pfams, and 31 families contain more than four allergens. Conserved physicochemical property motifs for the aligned sequences of the most populated Pfam families were identified with the PCPMer program suite and catalogued in the webserver Motif-Mate (http://born.utmb.edu/motifmate/summary.php). We also determined specific motifs for allergenic members of a family that could distinguish them from non-allergenic ones. These allergen specific motifs should be most useful in database searches for potential allergens. We found that sequence motifs unique to the allergens in three families (seed storage proteins, Bet v 1, and tropomyosin) overlap with known IgE epitopes, thus providing evidence that our motif based approach can be used to assess the potential allergenicity of novel proteins. PMID:18951633

Top