substitution sequence process: Topics by Science.gov

Sample records for substitution sequence process

FRAGS: estimation of coding sequence substitution rates from fragmentary data

PubMed Central

Swart, Estienne C; Hide, Winston A; Seoighe, Cathal

2004-01-01

Background Rates of substitution in protein-coding sequences can provide important insights into evolutionary processes that are of biomedical and theoretical interest. Increased availability of coding sequence data has enabled researchers to estimate more accurately the coding sequence divergence of pairs of organisms. However the use of different data sources, alignment protocols and methods to estimate substitution rates leads to widely varying estimates of key parameters that define the coding sequence divergence of orthologous genes. Although complete genome sequence data are not available for all organisms, fragmentary sequence data can provide accurate estimates of substitution rates provided that an appropriate and consistent methodology is used and that differences in the estimates obtainable from different data sources are taken into account. Results We have developed FRAGS, an application framework that uses existing, freely available software components to construct in-frame alignments and estimate coding substitution rates from fragmentary sequence data. Coding sequence substitution estimates for human and chimpanzee sequences, generated by FRAGS, reveal that methodological differences can give rise to significantly different estimates of important substitution parameters. The estimated substitution rates were also used to infer upper-bounds on the amount of sequencing error in the datasets that we have analysed. Conclusion We have developed a system that performs robust estimation of substitution rates for orthologous sequences from a pair of organisms. Our system can be used when fragmentary genomic or transcript data is available from one of the organisms and the other is a completely sequenced genome within the Ensembl database. As well as estimating substitution statistics our system enables the user to manage and query alignment and substitution data. PMID:15005802
Overdispersion of the Molecular Clock: Temporal Variation of Gene-Specific Substitution Rates in Drosophila

PubMed Central

Hartl, Daniel L.

2008-01-01

Simple models of molecular evolution assume that sequences evolve by a Poisson process in which nucleotide or amino acid substitutions occur as rare independent events. In these models, the expected ratio of the variance to the mean of substitution counts equals 1, and substitution processes with a ratio greater than 1 are called overdispersed. Comparing the genomes of 10 closely related species of Drosophila, we extend earlier evidence for overdispersion in amino acid replacements as well as in four-fold synonymous substitutions. The observed deviation from the Poisson expectation can be described as a linear function of the rate at which substitutions occur on a phylogeny, which implies that deviations from the Poisson expectation arise from gene-specific temporal variation in substitution rates. Amino acid sequences show greater temporal variation in substitution rates than do four-fold synonymous sequences. Our findings provide a general phenomenological framework for understanding overdispersion in the molecular clock. Also, the presence of substantial variation in gene-specific substitution rates has broad implications for work in phylogeny reconstruction and evolutionary rate estimation. PMID:18480070
The Flushtration Count Illusion: Attribute substitution tricks our interpretation of a simple visual event sequence.

PubMed

Thomas, Cyril; Didierjean, André; Kuhn, Gustav

2018-04-17

When faced with a difficult question, people sometimes work out an answer to a related, easier question without realizing that a substitution has taken place (e.g., Kahneman, 2011, Thinking, fast and slow. New York, Farrar, Strauss, Giroux). In two experiments, we investigated whether this attribute substitution effect can also affect the interpretation of a simple visual event sequence. We used a magic trick called the 'Flushtration Count Illusion', which involves a technique used by magicians to give the illusion of having seen multiple cards with identical backs, when in fact only the back of one card (the bottom card) is repeatedly shown. In Experiment 1, we demonstrated that most participants are susceptible to the illusion, even if they have the visual and analytical reasoning capacity to correctly process the sequence. In Experiment 2, we demonstrated that participants construct a biased and simplified representation of the Flushtration Count by substituting some attributes of the event sequence. We discussed of the psychological processes underlying this attribute substitution effect. © 2018 The British Psychological Society.
SENCA: A Multilayered Codon Model to Study the Origins and Dynamics of Codon Usage

PubMed Central

Pouyet, Fanny; Bailly-Bechet, Marc; Mouchiroud, Dominique; Guéguen, Laurent

2016-01-01

Gene sequences are the target of evolution operating at different levels, including the nucleotide, codon, and amino acid levels. Disentangling the impact of those different levels on gene sequences requires developing a probabilistic model with three layers. Here we present SENCA (site evolution of nucleotides, codons, and amino acids), a codon substitution model that separately describes 1) nucleotide processes which apply on all sites of a sequence such as the mutational bias, 2) preferences between synonymous codons, and 3) preferences among amino acids. We argue that most synonymous substitutions are not neutral and that SENCA provides more accurate estimates of selection compared with more classical codon sequence models. We study the forces that drive the genomic content evolution, intraspecifically in the core genome of 21 prokaryotes and interspecifically for five Enterobacteria. We retrieve the existence of a universal mutational bias toward AT, and that taking into account selection on synonymous codon usage has consequences on the measurement of selection on nonsynonymous substitutions. We also confirm that codon usage bias is mostly driven by selection on preferred codons. We propose new summary statistics to measure the relative importance of the different evolutionary processes acting on sequences. PMID:27401173
A new molecular evolution model for limited insertion independent of substitution.

PubMed

Lèbre, Sophie; Michel, Christian J

2013-10-01

We recently introduced a new molecular evolution model called the IDIS model for Insertion Deletion Independent of Substitution [13,14]. In the IDIS model, the three independent processes of substitution, insertion and deletion of residues have constant rates. In order to control the genome expansion during evolution, we generalize here the IDIS model by introducing an insertion rate which decreases when the sequence grows and tends to 0 for a maximum sequence length nmax. This new model, called LIIS for Limited Insertion Independent of Substitution, defines a matrix differential equation satisfied by a vector P(t) describing the sequence content in each residue at evolution time t. An analytical solution is obtained for any diagonalizable substitution matrix M. Thus, the LIIS model gives an expression of the sequence content vector P(t) in each residue under evolution time t as a function of the eigenvalues and the eigenvectors of matrix M, the residue insertion rate vector R, the total insertion rate r, the initial and maximum sequence lengths n0 and nmax, respectively, and the sequence content vector P(t0) at initial time t0. The derivation of the analytical solution is much more technical, compared to the IDIS model, as it involves Gauss hypergeometric functions. Several propositions of the LIIS model are derived: proof that the IDIS model is a particular case of the LIIS model when the maximum sequence length nmax tends to infinity, fixed point, time scale, time step and time inversion. Using a relation between the sequence length l and the evolution time t, an expression of the LIIS model as a function of the sequence length l=n(t) is obtained. Formulas for 'insertion only', i.e. when the substitution rates are all equal to 0, are derived at evolution time t and sequence length l. Analytical solutions of the LIIS model are explicitly derived, as a function of either evolution time t or sequence length l, for two classical substitution matrices: the 3-parameter symmetric substitution matrix [12] (LIIS-SYM3) and the HKY asymmetric substitution matrix[9] (LIIS-HKY). An evaluation of the LIIS model (precisely, LIIS-HKY) based on four statistical analyses of the GC content in complete genomes of four prokaryotic taxonomic groups, namely Chlamydiae, Crenarchaeota, Spirochaetes and Thermotogae, shows the expected improvement from the theory of the LIIS model compared to the IDIS model. Copyright © 2013 Elsevier Inc. All rights reserved.
Phylogenetic Invariants for Metazoan Mitochondrial Genome Evolution.

PubMed

Sankoff; Blanchette

1998-01-01

The method of phylogenetic invariants was developed to apply to aligned sequence data generated, according to a stochastic substitution model, for N species related through an unknown phylogenetic tree. The invariants are functions of the probabilities of the observable N-tuples, which are identically zero, over all choices of branch length, for some trees. Evaluating the invariants associated with all possible trees, using observed N-tuple frequencies over all sequence positions, enables us to rapidly infer the generating tree. An aspect of evolution at the genomic level much studied recently is the rearrangements of gene order along the chromosome from one species to another. Instead of the substitutions responsible for sequence evolution, we examine the non-local processes responsible for genome rearrangements such as inversion of arbitrarily long segments of chromosomes. By treating the potential adjacency of each possible pair of genes as a position", an appropriate substitution" model can be recognized as governing the rearrangement process, and a probabilistically principled phylogenetic inference can be set up. We calculate the invariants for this process for N=5, and apply them to mitochondrial genome data from coelomate metazoans, showing how they resolve key aspects of branching order.
Synthesis and Late-Stage Functionalization of Complex Molecules through C–H Fluorination and Nucleophilic Aromatic Substitution

PubMed Central

2015-01-01

We report the late-stage functionalization of multisubstituted pyridines and diazines at the position α to nitrogen. By this process, a series of functional groups and substituents bound to the ring through nitrogen, oxygen, sulfur, or carbon are installed. This functionalization is accomplished by a combination of fluorination and nucleophilic aromatic substitution of the installed fluoride. A diverse array of functionalities can be installed because of the mild reaction conditions revealed for nucleophilic aromatic substitutions (SNAr) of the 2-fluoroheteroarenes. An evaluation of the rates for substitution versus the rates for competitive processes provides a framework for planning this functionalization sequence. This process is illustrated by the modification of a series of medicinally important compounds, as well as the increase in efficiency of synthesis of several existing pharmaceuticals. PMID:24918484
Substituted 1H-1,2,3-Triazol-4-yl-1H-pyrrolo[2,3-b]pyridines by De Novo One-Pot Ring Forming Coupling-Cyclization-Desilylation-CuAAC-Sequence.

PubMed

Müller, Thomas J J; Lessing, Timo; van Mark, Hauke

2018-05-04

Substituted 1H-1,2,3-triazol-4-yl-pyrrolo[2,3-b]pyridines are efficiently prepared by a one-pot coupling-cyclization-desilylation-CuAAC-sequence in the sense of a consecutive three-component fashion. The key feature of this novel de novo formation of azole and triazole anellation is the sequentially Pd/Cu-catalyzed process employing tri(iso-propyl)silylbutadiyne (TIPS-butadiyne) as a four-carbon building block. In addition, the sequence can be expanded in a four-component fashion also employing the in situ formation of the require azides. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Time Clustered Sampling Can Inflate the Inferred Substitution Rate in Foot-And-Mouth Disease Virus Analyses.

PubMed

Pedersen, Casper-Emil T; Frandsen, Peter; Wekesa, Sabenzia N; Heller, Rasmus; Sangula, Abraham K; Wadsworth, Jemma; Knowles, Nick J; Muwanika, Vincent B; Siegismund, Hans R

2015-01-01

With the emergence of analytical software for the inference of viral evolution, a number of studies have focused on estimating important parameters such as the substitution rate and the time to the most recent common ancestor (tMRCA) for rapidly evolving viruses. Coupled with an increasing abundance of sequence data sampled under widely different schemes, an effort to keep results consistent and comparable is needed. This study emphasizes commonly disregarded problems in the inference of evolutionary rates in viral sequence data when sampling is unevenly distributed on a temporal scale through a study of the foot-and-mouth (FMD) disease virus serotypes SAT 1 and SAT 2. Our study shows that clustered temporal sampling in phylogenetic analyses of FMD viruses will strongly bias the inferences of substitution rates and tMRCA because the inferred rates in such data sets reflect a rate closer to the mutation rate rather than the substitution rate. Estimating evolutionary parameters from viral sequences should be performed with due consideration of the differences in short-term and longer-term evolutionary processes occurring within sets of temporally sampled viruses, and studies should carefully consider how samples are combined.
A stochastic evolution model for residue Insertion-Deletion Independent from Substitution.

PubMed

Lèbre, Sophie; Michel, Christian J

2010-12-01

We develop here a new class of stochastic models of gene evolution based on residue Insertion-Deletion Independent from Substitution (IDIS). Indeed, in contrast to all existing evolution models, insertions and deletions are modeled here by a concept in population dynamics. Therefore, they are not only independent from each other, but also independent from the substitution process. After a separate stochastic analysis of the substitution and the insertion-deletion processes, we obtain a matrix differential equation combining these two processes defining the IDIS model. By deriving a general solution, we give an analytical expression of the residue occurrence probability at evolution time t as a function of a substitution rate matrix, an insertion rate vector, a deletion rate and an initial residue probability vector. Various mathematical properties of the IDIS model in relation with time t are derived: time scale, time step, time inversion and sequence length. Particular expressions of the nucleotide occurrence probability at time t are given for classical substitution rate matrices in various biological contexts: equal insertion rate, insertion-deletion only and substitution only. All these expressions can be directly used for biological evolutionary applications. The IDIS model shows a strongly different stochastic behavior from the classical substitution only model when compared on a gene dataset. Indeed, by considering three processes of residue insertion, deletion and substitution independently from each other, it allows a more realistic representation of gene evolution and opens new directions and applications in this research field. Copyright © 2010 Elsevier Ltd. All rights reserved.
A broad survey reveals substitution tolerance of residues ligating FeS clusters in [NiFe] hydrogenase

PubMed Central

2014-01-01

Background In order to understand the effects of FeS cluster attachment in [NiFe] hydrogenase, we undertook a study to substitute all 12 amino acid positions normally ligating the three FeS clusters in the hydrogenase small subunit. Using the hydrogenase from Alteromonas macleodii “deep ecotype” as a model, we substituted one of four amino acids (Asp, His, Asn, Gln) at each of the 12 ligating positions because these amino acids are alternative coordinating residues in otherwise conserved-cysteine positions found in a broad survey of NiFe hydrogenase sequences. We also hoped to discover an enzyme with elevated hydrogen evolution activity relative to a previously reported “G1” (H230C/P285C) improved enzyme in which the medial FeS cluster Pro and the distal FeS cluster His were each substituted for Cys. Results Among all the substitutions screened, aspartic acid substitutions were generally well-tolerated, and examination suggests that the observed deficiency in enzyme activity may be largely due to misprocessing of the small subunit of the enzyme. Alignment of hydrogenase sequences from sequence databases revealed many rare substitutions; the five substitutions present in databases that we tested all exhibited measurable hydrogen evolution activity. Select substitutions were purified and tested, supporting the results of the screening assay. Analysis of these results confirms the importance of small subunit processing. Normalizing activity to quantity of mature small subunit, indicative of total enzyme maturation, weakly suggests an improvement over the “G1” enzyme. Conclusions We have comprehensively screened 48 amino acid substitutions of the hydrogenase from A. macleodii “deep ecotype”, to understand non-canonical ligations of amino acids to FeS clusters and to improve hydrogen evolution activity of this class of hydrogenase. Our studies show that non-canonical ligations can be functional and also suggests a new limiting factor in the production of active enzyme. PMID:24934472
Identification of IBV QX vaccine markers : Should vaccine acceptance by authorities require similar identifications for all live IBV vaccines?

PubMed

Listorti, Valeria; Laconi, Andrea; Catelli, Elena; Cecchinato, Mattia; Lupini, Caterina; Naylor, Clive J

2017-10-09

IBV genotype QX causes sufficient disease in Europe for several commercial companies to have started developing live attenuated vaccines. Here, one of those vaccines (L1148) was fully consensus sequenced alongside its progenitor field strain (1148-A) to determine vaccine markers, thereby enabling detection on farms. Twenty-eight single nucleotide substitutions were associated with the 1148-A attenuation, of which any combination can identify vaccine L1148 in the field. Sixteen substitutions resulted in amino acid coding changes of which half were in spike. One change in the 1b gene altered the normally highly conserved final 5 nucleotides of the transcription regulatory sequence of the S gene, common to all IBV QX genes. No mutations can currently be associated with the attenuation process. Field vaccination strategies would greatly benefit by such comparative sequence data being mandatorily submitted to regulators prior to vaccine release following a successful registration process. Copyright © 2017. Published by Elsevier Ltd.
The tangled bank of amino acids

PubMed Central

Pollock, David D.

2016-01-01

Abstract The use of amino acid substitution matrices to model protein evolution has yielded important insights into both the evolutionary process and the properties of specific protein families. In order to make these models tractable, standard substitution matrices represent the average results of the evolutionary process rather than the underlying molecular biophysics and population genetics, treating proteins as a set of independently evolving sites rather than as an integrated biomolecular entity. With advances in computing and the increasing availability of sequence data, we now have an opportunity to move beyond current substitution matrices to more interpretable mechanistic models with greater fidelity to the evolutionary process of mutation and selection and the holistic nature of the selective constraints. As part of this endeavour, we consider how epistatic interactions induce spatial and temporal rate heterogeneity, and demonstrate how these generally ignored factors can reconcile standard substitution rate matrices and the underlying biology, allowing us to better understand the meaning of these substitution rates. Using computational simulations of protein evolution, we can demonstrate the importance of both spatial and temporal heterogeneity in modelling protein evolution. PMID:27028523
Evaluating the efficacy of a structure-derived amino acid substitution matrix in detecting protein homologs by BLAST and PSI-BLAST.

PubMed

Goonesekere, Nalin Cw

2009-01-01

The large numbers of protein sequences generated by whole genome sequencing projects require rapid and accurate methods of annotation. The detection of homology through computational sequence analysis is a powerful tool in determining the complex evolutionary and functional relationships that exist between proteins. Homology search algorithms employ amino acid substitution matrices to detect similarity between proteins sequences. The substitution matrices in common use today are constructed using sequences aligned without reference to protein structure. Here we present amino acid substitution matrices constructed from the alignment of a large number of protein domain structures from the structural classification of proteins (SCOP) database. We show that when incorporated into the homology search algorithms BLAST and PSI-blast, the structure-based substitution matrices enhance the efficacy of detecting remote homologs.
SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment

PubMed Central

Coan, Heather B.; Youker, Robert T.

2017-01-01

Understanding how proteins mutate is critical to solving a host of biological problems. Mutations occur when an amino acid is substituted for another in a protein sequence. The set of likelihoods for amino acid substitutions is stored in a matrix and input to alignment algorithms. The quality of the resulting alignment is used to assess the similarity of two or more sequences and can vary according to assumptions modeled by the substitution matrix. Substitution strategies with minor parameter variations are often grouped together in families. For example, the BLOSUM and PAM matrix families are commonly used because they provide a standard, predefined way of modeling substitutions. However, researchers often do not know if a given matrix family or any individual matrix within a family is the most suitable. Furthermore, predefined matrix families may inaccurately reflect a particular hypothesis that a researcher wishes to model or otherwise result in unsatisfactory alignments. In these cases, the ability to compare the effects of one or more custom matrices may be needed. This laborious process is often performed manually because the ability to simultaneously load multiple matrices and then compare their effects on alignments is not readily available in current software tools. This paper presents SubVis, an interactive R package for loading and applying multiple substitution matrices to pairwise alignments. Users can simultaneously explore alignments resulting from multiple predefined and custom substitution matrices. SubVis utilizes several of the alignment functions found in R, a common language among protein scientists. Functions are tied together with the Shiny platform which allows the modification of input parameters. Information regarding alignment quality and individual amino acid substitutions is displayed with the JavaScript language which provides interactive visualizations for revealing both high-level and low-level alignment information. PMID:28674656
Nonsynonymous substitution rate heterogeneity in the peptide-binding region among different HLA-DRB1 lineages in humans.

PubMed

Yasukochi, Yoshiki; Satta, Yoko

2014-05-02

An extraordinary diversity of amino acid sequences in the peptide-binding region (PBR) of human leukocyte antigen [HLA; human major histocompatibility complex (MHC)] molecules has been maintained by balancing selection. The process of accumulation of amino acid diversity in the PBR for six HLA genes (HLA-A, B, C, DRB1, DQB1, and DPB1) shows that the number of amino acid substitutions in the PBR among alleles does not linearly correlate with the divergence time of alleles at the six HLA loci. At these loci, some pairs of alleles show significantly less nonsynonymous substitutions at the PBR than expected from the divergence time. The same phenomenon was observed not only in the HLA but also in the rat MHC. To identify the cause for this, DRB1 sequences, a representative case of a typical nonlinear pattern of substitutions, were examined. When the amino acid substitutions in the PBR were placed with maximum parsimony on a maximum likelihood tree based on the non-PBR substitutions, heterogeneous rates of nonsynonymous substitutions in the PBR were observed on several branches. A computer simulation supported the hypothesis that allelic pairs with low PBR substitution rates were responsible for the stagnation of accumulation of PBR nonsynonymous substitutions. From these observations, we conclude that the nonsynonymous substitution rate at the PBR sites is not constant among the allelic lineages. The deceleration of the rate may be caused by the coexistence of certain pathogens for a substantially long time during HLA evolution. Copyright © 2014 Yasukochi and Satta.
SNAD: Sequence Name Annotation-based Designer.

PubMed

Sidorov, Igor A; Reshetov, Denis A; Gorbalenya, Alexander E

2009-08-14

A growing diversity of biological data is tagged with unique identifiers (UIDs) associated with polynucleotides and proteins to ensure efficient computer-mediated data storage, maintenance, and processing. These identifiers, which are not informative for most people, are often substituted by biologically meaningful names in various presentations to facilitate utilization and dissemination of sequence-based knowledge. This substitution is commonly done manually that may be a tedious exercise prone to mistakes and omissions. Here we introduce SNAD (Sequence Name Annotation-based Designer) that mediates automatic conversion of sequence UIDs (associated with multiple alignment or phylogenetic tree, or supplied as plain text list) into biologically meaningful names and acronyms. This conversion is directed by precompiled or user-defined templates that exploit wealth of annotation available in cognate entries of external databases. Using examples, we demonstrate how this tool can be used to generate names for practical purposes, particularly in virology. A tool for controllable annotation-based conversion of sequence UIDs into biologically meaningful names and acronyms has been developed and placed into service, fostering links between quality of sequence annotation, and efficiency of communication and knowledge dissemination among researchers.
Selenomethionine incorporation into amyloid sequences regulates fibrillogenesis and toxicity.

PubMed

Martínez, Javier; Lisa, Silvia; Sánchez, Rosa; Kowalczyk, Wioleta; Zurita, Esther; Teixidó, Meritxell; Giralt, Ernest; Andreu, David; Avila, Jesús; Gasset, María

2011-01-01

The capacity of a polypeptide chain to engage in an amyloid formation process and cause a conformational disease is contained in its sequence. Some of the sequences undergoing fibrillation contain critical methionine (Met) residues which in vivo can be synthetically substituted by selenomethionine (SeM) and alter their properties. Using peptide synthesis, biophysical techniques and cell viability determinations we have studied the effect of the substitution of methionine (Met) by selenomethionine (SeM) on the fibrillogenesis and toxic properties of Aβ40 and HuPrP(106-140). We have found that the effects display site-specificity and vary from inhibition of fibrillation and decreased toxicity ([SeM(35)]Aβ40, [SeM(129)]HuPrP(106-140) and [SeM(134)]HuPrP(106-140)), retarded assembly, modulation of polymer shape and retention of toxicity ([SeM(112)]HuPrP(106-140) to absence of effects ([SeM(109)]HuPrP(106-140)). This work provides direct evidence that the substitution of Met by SeM in proamyloid sequences has a major impact on their self-assembly and toxic properties, suggesting that the SeM pool can play a major role in dictating the allowance and efficiency of a polypeptide chain to undergo toxic polymerization.
The tangled bank of amino acids.

PubMed

Goldstein, Richard A; Pollock, David D

2016-07-01

The use of amino acid substitution matrices to model protein evolution has yielded important insights into both the evolutionary process and the properties of specific protein families. In order to make these models tractable, standard substitution matrices represent the average results of the evolutionary process rather than the underlying molecular biophysics and population genetics, treating proteins as a set of independently evolving sites rather than as an integrated biomolecular entity. With advances in computing and the increasing availability of sequence data, we now have an opportunity to move beyond current substitution matrices to more interpretable mechanistic models with greater fidelity to the evolutionary process of mutation and selection and the holistic nature of the selective constraints. As part of this endeavour, we consider how epistatic interactions induce spatial and temporal rate heterogeneity, and demonstrate how these generally ignored factors can reconcile standard substitution rate matrices and the underlying biology, allowing us to better understand the meaning of these substitution rates. Using computational simulations of protein evolution, we can demonstrate the importance of both spatial and temporal heterogeneity in modelling protein evolution. © 2016 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.
Identification of Low- and High-Impact Hemagglutinin Amino Acid Substitutions That Drive Antigenic Drift of Influenza A(H1N1) Viruses

PubMed Central

Harvey, William T.; Benton, Donald J.; Gregory, Victoria; Hall, James P. J.; Daniels, Rodney S.; Bedford, Trevor; Haydon, Daniel T.; Hay, Alan J.; McCauley, John W.; Reeve, Richard

2016-01-01

Determining phenotype from genetic data is a fundamental challenge. Identification of emerging antigenic variants among circulating influenza viruses is critical to the vaccine virus selection process, with vaccine effectiveness maximized when constituents are antigenically similar to circulating viruses. Hemagglutination inhibition (HI) assay data are commonly used to assess influenza antigenicity. Here, sequence and 3-D structural information of hemagglutinin (HA) glycoproteins were analyzed together with corresponding HI assay data for former seasonal influenza A(H1N1) virus isolates (1997–2009) and reference viruses. The models developed identify and quantify the impact of eighteen amino acid substitutions on the antigenicity of HA, two of which were responsible for major transitions in antigenic phenotype. We used reverse genetics to demonstrate the causal effect on antigenicity for a subset of these substitutions. Information on the impact of substitutions allowed us to predict antigenic phenotypes of emerging viruses directly from HA gene sequence data and accuracy was doubled by including all substitutions causing antigenic changes over a model incorporating only the substitutions with the largest impact. The ability to quantify the phenotypic impact of specific amino acid substitutions should help refine emerging techniques that predict the evolution of virus populations from one year to the next, leading to stronger theoretical foundations for selection of candidate vaccine viruses. These techniques have great potential to be extended to other antigenically variable pathogens. PMID:27057693

Convergent evolution of marine mammals is associated with distinct substitutions in common genes

PubMed Central

Zhou, Xuming; Seim, Inge; Gladyshev, Vadim N.

2015-01-01

Phenotypic convergence is thought to be driven by parallel substitutions coupled with natural selection at the sequence level. Multiple independent evolutionary transitions of mammals to an aquatic environment offer an opportunity to test this thesis. Here, whole genome alignment of coding sequences identified widespread parallel amino acid substitutions in marine mammals; however, the majority of these changes were not unique to these animals. Conversely, we report that candidate aquatic adaptation genes, identified by signatures of likelihood convergence and/or elevated ratio of nonsynonymous to synonymous nucleotide substitution rate, are characterized by very few parallel substitutions and exhibit distinct sequence changes in each group. Moreover, no significant positive correlation was found between likelihood convergence and positive selection in all three marine lineages. These results suggest that convergence in protein coding genes associated with aquatic lifestyle is mainly characterized by independent substitutions and relaxed negative selection. PMID:26549748
Amino acid "little Big Bang": representing amino acid substitution matrices as dot products of Euclidian vectors.

PubMed

Zimmermann, Karel; Gibrat, Jean-François

2010-01-04

Sequence comparisons make use of a one-letter representation for amino acids, the necessary quantitative information being supplied by the substitution matrices. This paper deals with the problem of finding a representation that provides a comprehensive description of amino acid intrinsic properties consistent with the substitution matrices. We present a Euclidian vector representation of the amino acids, obtained by the singular value decomposition of the substitution matrices. The substitution matrix entries correspond to the dot product of amino acid vectors. We apply this vector encoding to the study of the relative importance of various amino acid physicochemical properties upon the substitution matrices. We also characterize and compare the PAM and BLOSUM series substitution matrices. This vector encoding introduces a Euclidian metric in the amino acid space, consistent with substitution matrices. Such a numerical description of the amino acid is useful when intrinsic properties of amino acids are necessary, for instance, building sequence profiles or finding consensus sequences, using machine learning algorithms such as Support Vector Machine and Neural Networks algorithms.
Evaluation of second-generation sequencing of 19 dilated cardiomyopathy genes for clinical applications.

PubMed

Gowrisankar, Sivakumar; Lerner-Ellis, Jordan P; Cox, Stephanie; White, Emily T; Manion, Megan; LeVan, Kevin; Liu, Jonathan; Farwell, Lisa M; Iartchouk, Oleg; Rehm, Heidi L; Funke, Birgit H

2010-11-01

Medical sequencing for diseases with locus and allelic heterogeneities has been limited by the high cost and low throughput of traditional sequencing technologies. "Second-generation" sequencing (SGS) technologies allow the parallel processing of a large number of genes and, therefore, offer great promise for medical sequencing; however, their use in clinical laboratories is still in its infancy. Our laboratory offers clinical resequencing for dilated cardiomyopathy (DCM) using an array-based platform that interrogates 19 of more than 30 genes known to cause DCM. We explored both the feasibility and cost effectiveness of using PCR amplification followed by SGS technology for sequencing these 19 genes in a set of five samples enriched for known sequence alterations (109 unique substitutions and 27 insertions and deletions). While the analytical sensitivity for substitutions was comparable to that of the DCM array (98%), SGS technology performed better than the DCM array for insertions and deletions (90.6% versus 58%). Overall, SGS performed substantially better than did the current array-based testing platform; however, the operational cost and projected turnaround time do not meet our current standards. Therefore, efficient capture methods and/or sample pooling strategies that shorten the turnaround time and decrease reagent and labor costs are needed before implementing this platform into routine clinical applications.
Biosynthesis of small proteoglycan II (decorin) by chondrocytes and evidence for a procore protein.

PubMed

Sawhney, R S; Hering, T M; Sandell, L J

1991-05-15

We have studied the biosynthesis of cartilage dermatan sulfate proteoglycan II (DS-PGII) (decorin) using in vitro translation of mRNA to determine the size of the primary gene product and by radiolabeling the protein in the presence of tunicamycin to inhibit the addition of Asn-linked oligosaccharides. Pulse-chase experiments were performed to examine post-translational processing and secretion. Inhibitors of oligosaccharide processing were used to determine whether DS-PGII molecules containing partially processed oligosaccharides could become proteoglycans and be secreted. Cell-free translation of sucrose gradient-fractionated RNA and subsequent immunoprecipitation of the core protein confirmed that the functional translated mRNA is in the size range of the two mRNA species observed by hybridization of chondrocyte RNA with a bone PGII cloned probe and that the translation product is a single protein with an apparent molecular mass of 42 kDa. Digestion of the intact proteoglycan (average molecular mass = 103 kDa) with chondroitinase ABC or AC results in an approximately 48-49-kDa product. Chondrocytes treated with tunicamycin to inhibit Asn-linked oligosaccharide addition synthesize and secrete a glycosaminoglycan (GAG)-substituted proteoglycan (average molecular mass = 86 kDa), yielding a 42-kDa core protein after chondroitinase ABC digestion, showing that Asn-linked oligosaccharides are not required for the addition of GAG chains or secretion. Following a short pulse (10 min) of [3H]leucine, three glycosylated forms of the DS-PGII core protein were observed, one of which is likely to be the precursor form of PGII predicted by the implied protein sequence of both bovine and human cDNA clones. Following the apparent cleavage of the propeptide, GAG-substituted intracellular core protein is detectable. Susceptibility to endoglycosidase H indicates that approximately one-third of the secreted core protein contains exclusively complex-type Asn-linked oligosaccharides and approximately two-thirds contain high mannose as well as complex-type oligosaccharides. Secreted DS-PGII appears to be fully substituted with three Asn-linked oligosaccharide chains. Inhibitors of oligosaccharide processing, however, permitted secretion of GAG-substituted DS-PGII that was fully (three chains) or incompletely (one or two chains) substituted with partially processed Asn-linked carbohydrate chains. By comparison of chondrocyte DS-PGII with fibroblast DS-PGII, we conclude that the addition and processing of Asn-linked carbohydrate chains are directed by the amino acid sequence of the core protein. The results reported here also suggest that the addition of xylose, the initial step in GAG chain synthesis, occurs early in biosynthesis and is determined by the primary amino acid sequence of the core protein.(ABSTRACT TRUNCATED AT 400 WORDS)
Quantitation of base substitutions in eukaryotic 5S rRNA: selection for the maintenance of RNA secondary structure.

PubMed

Curtiss, W C; Vournakis, J N

1984-01-01

Eukaryotic 5S rRNA sequences from 34 diverse species were compared by the following method: (1) The sequences were aligned; (2) the positions of substitutions were located by comparison of all possible pairs of sequences; (3) the substitution sites were mapped to an assumed general base pairing model; and (4) the R-Y model of base stacking was used to study stacking pattern relationships in the structure. An analysis of the sequence and structure variability in each region of the molecule is presented. It was found that the degree of base substitution varies over a wide range, from absolute conservation to occurrence of over 90% of the possible observable substitutions. The substitutions are located primarily in stem regions of the 5S rRNA secondary structure. More than 88% of the substitutions in helical regions maintain base pairing. The disruptive substitutions are primarily located at the edges of helical regions, resulting in shortening of the helical regions and lengthening of the adjacent nonpaired regions. Base stacking patterns determined by the R-Y model are mapped onto the general secondary structure. Intrastrand and interstrand stacking could stabilize alternative coaxial structures and limit the conformational flexibility of nonpaired regions. Two short contiguous regions are 100% conserved in all species. This may reflect evolutionary constraints imposed at the DNA level by the requirement for binding of a 5S gene transcription initiation factor during gene expression.
Evidence for Widespread Reticulate Evolution within Human Duplicons

PubMed Central

Jackson, Michael S. ; Oliver, Karen ; Loveland, Jane ; Humphray, Sean ; Dunham, Ian ; Rocchi, Mariano ; Viggiano, Luigi ; Park, Jonathan P. ; Hurles, Matthew E. ; Santibanez-Koref, Mauro

2005-01-01

Approximately 5% of the human genome consists of segmental duplications that can cause genomic mutations and may play a role in gene innovation. Reticulate evolutionary processes, such as unequal crossing-over and gene conversion, are known to occur within specific duplicon families, but the broader contribution of these processes to the evolution of human duplications remains poorly characterized. Here, we use phylogenetic profiling to analyze multiple alignments of 24 human duplicon families that span >8 Mb of DNA. Our results indicate that none of them are evolving independently, with all alignments showing sharp discontinuities in phylogenetic signal consistent with reticulation. To analyze these results in more detail, we have developed a quartet method that estimates the relative contribution of nucleotide substitution and reticulate processes to sequence evolution. Our data indicate that most of the duplications show a highly significant excess of sites consistent with reticulate evolution, compared with the number expected by nucleotide substitution alone, with 15 of 30 alignments showing a >20-fold excess over that expected. Using permutation tests, we also show that at least 5% of the total sequence shares 100% sequence identity because of reticulation, a figure that includes 74 independent tracts of perfect identity >2 kb in length. Furthermore, analysis of a subset of alignments indicates that the density of reticulation events is as high as 1 every 4 kb. These results indicate that phylogenetic relationships within recently duplicated human DNA can be rapidly disrupted by reticulate evolution. This finding has important implications for efforts to finish the human genome sequence, complicates comparative sequence analysis of duplicon families, and could profoundly influence the tempo of gene-family evolution. PMID:16252241
RY-Coding and Non-Homogeneous Models Can Ameliorate the Maximum-Likelihood Inferences From Nucleotide Sequence Data with Parallel Compositional Heterogeneity.

PubMed

Ishikawa, Sohta A; Inagaki, Yuji; Hashimoto, Tetsuo

2012-01-01

In phylogenetic analyses of nucleotide sequences, 'homogeneous' substitution models, which assume the stationarity of base composition across a tree, are widely used, albeit individual sequences may bear distinctive base frequencies. In the worst-case scenario, a homogeneous model-based analysis can yield an artifactual union of two distantly related sequences that achieved similar base frequencies in parallel. Such potential difficulty can be countered by two approaches, 'RY-coding' and 'non-homogeneous' models. The former approach converts four bases into purine and pyrimidine to normalize base frequencies across a tree, while the heterogeneity in base frequency is explicitly incorporated in the latter approach. The two approaches have been applied to real-world sequence data; however, their basic properties have not been fully examined by pioneering simulation studies. Here, we assessed the performances of the maximum-likelihood analyses incorporating RY-coding and a non-homogeneous model (RY-coding and non-homogeneous analyses) on simulated data with parallel convergence to similar base composition. Both RY-coding and non-homogeneous analyses showed superior performances compared with homogeneous model-based analyses. Curiously, the performance of RY-coding analysis appeared to be significantly affected by a setting of the substitution process for sequence simulation relative to that of non-homogeneous analysis. The performance of a non-homogeneous analysis was also validated by analyzing a real-world sequence data set with significant base heterogeneity.
Risk management with substitution options: Valuing flexibility in small-scale energy systems

NASA Astrophysics Data System (ADS)

Knapp, Karl Eric

Several features of small-scale energy systems make them more easily adapted to a changing operating environment than large centralized designs. This flexibility is often manifested as the ability to substitute inputs. This research explores the value of this substitution flexibility and the marginal value of becoming a "little more flexible" in the context of real project investment in developing countries. The elasticity of substitution is proposed as a stylized measure of flexibility and a choice variable. A flexible alternative (elasticity > 0) can be thought of as holding a fixed-proportions "nflexible" asset plus a sequence of exchange options---the option to move to another feasible "recipe" each period. Substitutability derives value from following a contour of anticipated variations and from responding to new information. Substitutability value, a "cost savings option", increases with elasticity and price risk. However, the required premium to incrementally increase flexibility can in some cases decrease with an increase in risk. Variance is not always a measure of risk. Tools from stochastic dominance are newly applied to real options with convex payoffs to correct some misperceptions and clarify many common modeling situations that meet the criteria for increased variance to imply increased risk. The behavior of the cost savings option is explored subject to a stochastic input price process. At the point where costs are identical for all alternatives, the stochastic process for cost savings becomes deterministic, with savings directly proportional to elasticity of substitution and price variance. The option is also formulated as a derivative security via dynamic programming. The partial differential equation is solved for the special case of Cobb-Douglas (elasticity = 1) (also shown are linear (infinite elasticity), Leontief (elasticity = 0)). Risk aversion is insufficient to prefer a more flexible alternative with the same expected value. Intertemporal links convert the sequence of independent options to a single compound option and require an expansion of the flexibility concept. Additional options increase the value of the project but generally decrease flexibility value. The framework is applied to case study in India: an urban industry electricity strategy decision with reliability risk.
Sequence analysis of the GP, NP, VP40 and VP24 genes of Ebola virus isolated from deceased, surviving and asymptomatically infected individuals during the 1996 outbreak in Gabon: comparative studies and phylogenetic characterization.

PubMed

Leroy, Eric M; Baize, Sylvain; Mavoungou, Elie; Apetrei, Cristian

2002-01-01

The aims of this study were to determine if the clinical outcome of Ebola virus (EBOV) infection is associated with virus genetic structure and to document the genetic changes in the Gabon strains of EBOV by sequencing the GP, NP, VP40 and VP24 genes from deceased and surviving symptomatic and asymptomatic individuals. GP and NP sequences were identical in the three groups of patients and only one silent substitution occurred in the VP40 and VP24 genes in asymptomatic individuals. A strain from an asymptomatic individual had a reverse substitution to the Gabon-94 sequence, indicating that minor virus variants may cocirculate during an outbreak. These results suggest that the different clinical outcomes of EBOV infection do not result from virus mutations. Phylogenetic analysis confirmed that Gabon-96 belonged to the Zaire subtype of EBOV and revealed that synonymous substitution rates were higher than nonsynonymous substitution rates in the GP, VP40 and VP24 genes. In contrast, nonsynonymous substitutions predominated over synonymous substitutions in the NP gene of the two Gabon strains, pointing to divergent evolution of these strains and to selective pressures on this gene.
Recent African origin of modern humans revealed by complete sequences of hominoid mitochondrial DNAs.

PubMed Central

Horai, S; Hayasaka, K; Kondo, R; Tsugane, K; Takahata, N

1995-01-01

We analyzed the complete mitochondrial DNA (mtDNA) sequences of three humans (African, European, and Japanese), three African apes (common and pygmy chimpanzees, and gorilla), and one orangutan in an attempt to estimate most accurately the substitution rates and divergence times of hominoid mtDNAs. Nonsynonymous substitutions and substitutions in RNA genes have accumulated with an approximately clock-like regularity. From these substitutions and under the assumption that the orangutan and African apes diverged 13 million years ago, we obtained a divergence time for humans and chimpanzees of 4.9 million years. This divergence time permitted calibration of the synonymous substitution rate (3.89 x 10(-8)/site per year). To obtain the substitution rate in the displacement (D)-loop region, we compared the three human mtDNAs and measured the relative abundance of substitutions in the D-loop region and at synonymous sites. The estimated substitution rate in the D-loop region was 7.00 x 10(-8)/site per year. Using both synonymous and D-loop substitutions, we inferred the age of the last common ancestor of the human mtDNAs as 143,000 +/- 18,000 years. The shallow ancestry of human mtDNAs, together with the observation that the African sequence is the most diverged among humans, strongly supports the recent African origin of modern humans, Homo sapiens sapiens. PMID:7530363
Differentiation of highly virulent strains of Streptococcus suis serotype 2 according to glutamate dehydrogenase electrophoretic and sequence type.

PubMed

Kutz, Russell; Okwumabua, Ogi

2008-10-01

The glutamate dehydrogenase (GDH) enzymes of 19 Streptococcus suis serotype 2 strains, consisting of 18 swine isolates and 1 human clinical isolate from a geographically varied collection, were analyzed by activity staining on a nondenaturing gel. All seven (100%) of the highly virulent strains tested produced an electrophoretic type (ET) distinct from those of moderately virulent and nonvirulent strains. By PCR and nucleotide sequence determination, the gdh genes of the 19 strains and of 2 highly virulent strains involved in recent Chinese outbreaks yielded a 1,820-bp fragment containing an open reading frame of 1,344 nucleotides, which encodes a protein of 448 amino acid residues with a calculated molecular mass of approximately 49 kDa. The nucleotide sequences contained base pair differences, but most were silent. Cluster analysis of the deduced amino acid sequences separated the isolates into three groups. Group I (ETI) consisted of the seven highly virulent isolates and the two Chinese outbreak strains, containing Ala(299)-to-Ser, Glu(305)-to-Lys, and Glu(330)-to-Lys amino acid substitutions compared with groups II and III (ETII). Groups II and III consisted of moderately virulent and nonvirulent strains, which are separated from each other by Tyr(72)-to-Asp and Thr(296)-to-Ala substitutions. Gene exchange studies resulted in the change of ETI to ETII and vice versa. A spectrophotometric activity assay for GDH did not show significant differences between the groups. These results suggest that the GDH ETs and sequence types may serve as useful markers in predicting the pathogenic behavior of strains of this serotype and that the molecular basis for the observed differences in the ETs was amino acid substitutions and not deletion, insertion, or processing uniqueness.
Transition-metal-free one-pot synthesis of biaryls from Grignard reagents and substituted cyclohexanones.

PubMed

Zhou, Feng; Simon, Marc-Oliver; Li, Chao-Jun

2013-05-27

A new strategy for the construction of biaryls by a transition-metal-free process is presented. A sequence of a Grignard reaction, dehydration, and oxidative aromatization affords the desired products in a one-pot fashion. Copyright © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
In vivo gene correction with targeted sequence substitution through microhomology-mediated end joining.

PubMed

Shin, Jeong Hong; Jung, Soobin; Ramakrishna, Suresh; Kim, Hyongbum Henry; Lee, Junwon

2018-07-07

Genome editing technology using programmable nucleases has rapidly evolved in recent years. The primary mechanism to achieve precise integration of a transgene is mainly based on homology-directed repair (HDR). However, an HDR-based genome-editing approach is less efficient than non-homologous end-joining (NHEJ). Recently, a microhomology-mediated end-joining (MMEJ)-based transgene integration approach was developed, showing feasibility both in vitro and in vivo. We expanded this method to achieve targeted sequence substitution (TSS) of mutated sequences with normal sequences using double-guide RNAs (gRNAs), and a donor template flanking the microhomologies and target sequence of the gRNAs in vitro and in vivo. Our method could realize more efficient sequence substitution than the HDR-based method in vitro using a reporter cell line, and led to the survival of a hereditary tyrosinemia mouse model in vivo. The proposed MMEJ-based TSS approach could provide a novel therapeutic strategy, in addition to HDR, to achieve gene correction from a mutated sequence to a normal sequence. Copyright © 2018 Elsevier Inc. All rights reserved.
Twisted trees and inconsistency of tree estimation when gaps are treated as missing data - The impact of model mis-specification in distance corrections.

PubMed

McTavish, Emily Jane; Steel, Mike; Holder, Mark T

2015-12-01

Statistically consistent estimation of phylogenetic trees or gene trees is possible if pairwise sequence dissimilarities can be converted to a set of distances that are proportional to the true evolutionary distances. Susko et al. (2004) reported some strikingly broad results about the forms of inconsistency in tree estimation that can arise if corrected distances are not proportional to the true distances. They showed that if the corrected distance is a concave function of the true distance, then inconsistency due to long branch attraction will occur. If these functions are convex, then two "long branch repulsion" trees will be preferred over the true tree - though these two incorrect trees are expected to be tied as the preferred true. Here we extend their results, and demonstrate the existence of a tree shape (which we refer to as a "twisted Farris-zone" tree) for which a single incorrect tree topology will be guaranteed to be preferred if the corrected distance function is convex. We also report that the standard practice of treating gaps in sequence alignments as missing data is sufficient to produce non-linear corrected distance functions if the substitution process is not independent of the insertion/deletion process. Taken together, these results imply inconsistent tree inference under mild conditions. For example, if some positions in a sequence are constrained to be free of substitutions and insertion/deletion events while the remaining sites evolve with independent substitutions and insertion/deletion events, then the distances obtained by treating gaps as missing data can support an incorrect tree topology even given an unlimited amount of data. Copyright © 2015 Elsevier Inc. All rights reserved.
The Repeat Sequences and Elevated Substitution Rates of the Chloroplast accD Gene in Cupressophytes

PubMed Central

Li, Jia; Su, Yingjuan; Wang, Ting

2018-01-01

The plastid accD gene encodes a subunit of the acetyl-CoA carboxylase (ACCase) enzyme. The length of accD gene has been supposed to expand in Cryptomeria japonica, Taiwania cryptomerioides, Cephalotaxus, Taxus chinensis, and Podocarpus lambertii, and the main reason for this phenomenon was the existence of tandemly repeated sequences. However, it is still unknown whether the accD gene length in other cupressophytes has expanded. Here, in order to investigate how widespread this phenomenon was, 18 accD sequences and its surrounding regions of cupressophyte were sequenced and analyzed. Together with 39 GenBank sequence data, our taxon sampling covered all the extant gymnosperm orders. The repetitive elements and substitution rates of accD among 57 gymnosperm species were analyzed, the results show: (1) Reading frame length of accD gene in 18 cupressophytes species has also expanded. (2) Many repetitive elements were identified in accD gene of cupressophyte lineages. (3) The synonymous and non-synonymous substitution rates of accD were accelerated in cupressophytes. (4) accD was located in rearrangement endpoints. These results suggested that repetitive elements may mediate the chloroplast genome rearrangement and accelerated the substitution rates. PMID:29731764
Occurrence probability of structured motifs in random sequences.

PubMed

Robin, S; Daudin, J-J; Richard, H; Sagot, M-F; Schbath, S

2002-01-01

The problem of extracting from a set of nucleic acid sequences motifs which may have biological function is more and more important. In this paper, we are interested in particular motifs that may be implicated in the transcription process. These motifs, called structured motifs, are composed of two ordered parts separated by a variable distance and allowing for substitutions. In order to assess their statistical significance, we propose approximations of the probability of occurrences of such a structured motif in a given sequence. An application of our method to evaluate candidate promoters in E. coli and B. subtilis is presented. Simulations show the goodness of the approximations.
Identification by whole-genome resequencing of gene defect responsible for severe hypercholesterolemia

PubMed Central

Rios, Jonathan; Stein, Evan; Shendure, Jay; Hobbs, Helen H.; Cohen, Jonathan C.

2010-01-01

Whole-genome sequencing is a potentially powerful tool for the diagnosis of genetic diseases. Here, we used sequencing-by-ligation to sequence the genome of an 11-month-old breast-fed girl with xanthomas and very high plasma cholesterol levels (1023 mg/dl). Her parents had normal plasma cholesterol levels and reported no family history of hypercholesterolemia, suggesting either an autosomal recessive disorder or a de novo mutation. Known genetic causes of severe hypercholesterolemia were ruled out by sequencing the responsible genes (LDLRAP, LDLR, PCSK9, APOE and APOB), and sitosterolemia was ruled out by documenting a normal plasma sitosterol:cholesterol ratio. Sequencing revealed 3 797 207 deviations from the reference sequence, of which 9726 were nonsynonymous single-nucleotide substitutions. A total of 9027 of the nonsynonymous substitutions were present in dbSNP or in 21 additional individuals from whom complete exonic sequences were available. The 699 novel nonsynonymous substitutions were distributed among 604 genes, 23 of which were single-copy genes that each contained 2 nonsynonymous substitutions consistent with an autosomal recessive model. One gene, ABCG5, had two nonsense mutations (Q16X and R446X). This finding indicated that the infant has sitosterolemia. Thus, whole-genome sequencing led to the diagnosis of a known disease with an atypical presentation. Diagnosis was confirmed by the finding of severe sitosterolemia in a blood sample obtained after the infant had been weaned. These findings demonstrate that whole-genome (or exome) sequencing can be a valuable aid to diagnose genetic diseases, even in individual patients. PMID:20719861
Determining orientation and direction of DNA sequences

DOEpatents

Goodwin, Edwin H.; Meyne, Julianne

2000-01-01

Determining orientation and direction of DNA sequences. A method by which fluorescence in situ hybridization can be made strand specific is described. Cell cultures are grown in a medium containing a halogenated nucleotide. The analog is partially incorporated in one DNA strand of each chromatid. This substitution takes place in opposite strands of the two sister chromatids. After staining with the fluorescent DNA-binding dye Hoechst 33258, cells are exposed to long-wavelength ultraviolet light which results in numerous strand nicks. These nicks enable the substituted strand to be denatured and solubilized by heat, treatment with high or low pH aqueous solutions, or by immersing the strands in 2.times.SSC (0.3M NaCl+0.03M sodium citrate), to name three procedures. It is unnecessary to enzymatically digest the strands using Exo III or another exonuclease in order to excise and solubilize nucleotides starting at the sites of the nicks. The denaturing/solubilizing process removes most of the substituted strand while leaving the prereplication strand largely intact. Hybridization of a single-stranded probe of a tandem repeat arranged in a head-to-tail orientation will result in hybridization only to the chromatid with the complementary strand present.
Basecalling with LifeTrace

PubMed Central

Walther, Dirk; Bartha, Gábor; Morris, Macdonald

2001-01-01

A pivotal step in electrophoresis sequencing is the conversion of the raw, continuous chromatogram data into the actual sequence of discrete nucleotides, a process referred to as basecalling. We describe a novel algorithm for basecalling implemented in the program LifeTrace. Like Phred, currently the most widely used basecalling software program, LifeTrace takes processed trace data as input. It was designed to be tolerant to variable peak spacing by means of an improved peak-detection algorithm that emphasizes local chromatogram information over global properties. LifeTrace is shown to generate high-quality basecalls and reliable quality scores. It proved particularly effective when applied to MegaBACE capillary sequencing machines. In a benchmark test of 8372 dye-primer MegaBACE chromatograms, LifeTrace generated 17% fewer substitution errors, 16% fewer insertion/deletion errors, and 2.4% more aligned bases to the finished sequence than did Phred. For two sets totaling 6624 dye-terminator chromatograms, the performance improvement was 15% fewer substitution errors, 10% fewer insertion/deletion errors, and 2.1% more aligned bases. The processing time required by LifeTrace is comparable to that of Phred. The predicted quality scores were in line with observed quality scores, permitting direct use for quality clipping and in silico single nucleotide polymorphism (SNP) detection. Furthermore, we introduce a new type of quality score associated with every basecall: the gap-quality. It estimates the probability of a deletion error between the current and the following basecall. This additional quality score improves detection of single basepair deletions when used for locating potential basecalling errors during the alignment. We also describe a new protocol for benchmarking that we believe better discerns basecaller performance differences than methods previously published. PMID:11337481
Coestimation of recombination, substitution and molecular adaptation rates by approximate Bayesian computation.

PubMed

Lopes, J S; Arenas, M; Posada, D; Beaumont, M A

2014-03-01

The estimation of parameters in molecular evolution may be biased when some processes are not considered. For example, the estimation of selection at the molecular level using codon-substitution models can have an upward bias when recombination is ignored. Here we address the joint estimation of recombination, molecular adaptation and substitution rates from coding sequences using approximate Bayesian computation (ABC). We describe the implementation of a regression-based strategy for choosing subsets of summary statistics for coding data, and show that this approach can accurately infer recombination allowing for intracodon recombination breakpoints, molecular adaptation and codon substitution rates. We demonstrate that our ABC approach can outperform other analytical methods under a variety of evolutionary scenarios. We also show that although the choice of the codon-substitution model is important, our inferences are robust to a moderate degree of model misspecification. In addition, we demonstrate that our approach can accurately choose the evolutionary model that best fits the data, providing an alternative for when the use of full-likelihood methods is impracticable. Finally, we applied our ABC method to co-estimate recombination, substitution and molecular adaptation rates from 24 published human immunodeficiency virus 1 coding data sets.

Improved bioactivity of G-rich triplex-forming oligonucleotides containing modified guanine bases

PubMed Central

Rogers, Faye A; Lloyd, Janice A; Tiwari, Meetu Kaushik

2014-01-01

Triplex structures generated by sequence-specific triplex-forming oligonucleotides (TFOs) have proven to be promising tools for gene targeting strategies. In addition, triplex technology has been highly utilized to study the molecular mechanisms of DNA repair, recombination and mutagenesis. However, triplex formation utilizing guanine-rich oligonucleotides as third strands can be inhibited by potassium-induced self-association resulting in G-quadruplex formation. We report here that guanine-rich TFOs partially substituted with 8-aza-7-deaza-guanine (PPG) have improved target site binding in potassium compared with TFOs containing the natural guanine base. We designed PPG-substituted TFOs to bind to a polypurine sequence in the supFG1 reporter gene. The binding efficiency of PPG-substituted TFOs to the target sequence was analyzed using electrophoresis mobility gel shift assays. We have determined that in the presence of potassium, the non-substituted TFO, AG30 did not bind to its target sequence, however binding was observed with the PPG-substituted AG30 under conditions with up to 140 mM KCl. The PPG-TFOs were able to maintain their ability to induce genomic modifications as measured by an assay for gene-targeted mutagenesis. In addition, these compounds were capable of triplex-induced DNA double strand breaks, which resulted in activation of apoptosis. PMID:25483840
[The primary structure of a vaccine strain of tobacco mosaic virus V-69].

PubMed

Shiian, A N; Mil'shina, N V; Snegireva, P B; Pukhal'skiĭ, V A

1994-12-01

A random set of cDNA fragments were synthesized on genomic RNA of TMV vaccine strain V-69, using random primers and reverse transcriptase. Following synthesis of double-stranded cDNA, they were cloned into the pUC-19 plasmid; and 28 clones were sequenced (insert size 100-500 bp). High nucleotide sequence homology of V-69 (more than 95%) was shown only with tomato strain TMV-L [1]. Sequenced clones represent 54% of the genome (50% of the replicase gene, 98% of the transport protein gene, and 60% of the coat protein gene). In this genome region, 24 base substitutions were revealed, as compared to the wild-type TMV-L sequence. Six base substitutions resulted in changes in corresponding amino acid codons. No substitutions coincided with those discovered in the related TMV vaccine strain L11A [2], while two substitutions in the replicase gene were identical to those found in TMV strain Lta1 [3], which is capable of overcoming protection in tomatoes with the resistance gene Tm-1.
Production of Substitute Natural Gas from Coal

DOE Office of Scientific and Technical Information (OSTI.GOV)

Andrew Lucero

2009-01-31

The goal of this research program was to develop and demonstrate a novel gasification technology to produce substitute natural gas (SNG) from coal. The technology relies on a continuous sequential processing method that differs substantially from the historic methanation or hydro-gasification processing technologies. The thermo-chemistry relies on all the same reactions, but the processing sequences are different. The proposed concept is appropriate for western sub-bituminous coals, which tend to be composed of about half fixed carbon and about half volatile matter (dry ash-free basis). In the most general terms the process requires four steps (1) separating the fixed carbon frommore » the volatile matter (pyrolysis); (2) converting the volatile fraction into syngas (reforming); (3) reacting the syngas with heated carbon to make methane-rich fuel gas (methanation and hydro-gasification); and (4) generating process heat by combusting residual char (combustion). A key feature of this technology is that no oxygen plant is needed for char combustion.« less
Mitochondrial DNA sequence analysis of four Alzheimer`s and Parkinson`s disease patients

DOE Office of Scientific and Technical Information (OSTI.GOV)

Brown, M.D.; Shoffner, J.M.; Wallace, D.C.

1996-01-22

The mitochondrial DNA (mtDNA) sequence was determined on 3 patients with Alzheimer`s disease (AD) exhibiting AD plus Parkinson`s disease (PD) neuropathologic changes and one patient with PD. Patient mtDNA sequences were compared to the standard Cambridge sequence to identify base changes. In the first AD + PD patient, 2 of the 15 nucleotide substitutions may contribute to the neuropathology, a nucleotide pair (np) 4336 transition in the tRNA{sup Gln} gene found 7.4 times more frequently in patients than in controls, and a unique np 721 transition in the 12S rRNA gene which was not found in 70 other patients ormore » 905 controls. In the second AD + PD patient, 27 nucleotide substitutions were detected, including an np 3397 transition in the ND1 gene which converts a conserved methionine to a valine. In the third AD + PD patient, 2 polymorphic base substitutions frequently found at increased frequency in Leber`s hereditary optic neuropathy patients were observed, an np 4216 transition in ND1 and an np 13708 transition in the ND5 gene. For the PD patient, 2 novel variants were observed among 25 base substitutions, an np 1709 substitution in the 16S rRNA gene and an np 15851 missense mutation in the cytb gene. Further studies will be required to demonstrate a casual role for these base substitutions in neurodegenerative disease. 68 refs., 2 tabs.« less
Gene-Specific Substitution Profiles Describe the Types and Frequencies of Amino Acid Changes during Antibody Somatic Hypermutation.

PubMed

Sheng, Zizhang; Schramm, Chaim A; Kong, Rui; Mullikin, James C; Mascola, John R; Kwong, Peter D; Shapiro, Lawrence

2017-01-01

Somatic hypermutation (SHM) plays a critical role in the maturation of antibodies, optimizing recognition initiated by recombination of V(D)J genes. Previous studies have shown that the propensity to mutate is modulated by the context of surrounding nucleotides and that SHM machinery generates biased substitutions. To investigate the intrinsic mutation frequency and substitution bias of SHMs at the amino acid level, we analyzed functional human antibody repertoires and developed mGSSP (method for gene-specific substitution profile), a method to construct amino acid substitution profiles from next-generation sequencing-determined B cell transcripts. We demonstrated that these gene-specific substitution profiles (GSSPs) are unique to each V gene and highly consistent between donors. We also showed that the GSSPs constructed from functional antibody repertoires are highly similar to those constructed from antibody sequences amplified from non-productively rearranged passenger alleles, which do not undergo functional selection. This suggests the types and frequencies, or mutational space, of a majority of amino acid changes sampled by the SHM machinery to be well captured by GSSPs. We further observed the rates of mutational exchange between some amino acids to be both asymmetric and context dependent and to correlate weakly with their biochemical properties. GSSPs provide an improved, position-dependent alternative to standard substitution matrices, and can be utilized to developing software for accurately modeling the SHM process. GSSPs can also be used for predicting the amino acid mutational space available for antigen-driven selection and for understanding factors modulating the maturation pathways of antibody lineages in a gene-specific context. The mGSSP method can be used to build, compare, and plot GSSPs; we report the GSSPs constructed for 69 common human V genes (DOI: 10.6084/m9.figshare.3511083) and provide high-resolution logo plots for each (DOI: 10.6084/m9.figshare.3511085).
Translational control of Nrf2 within the open reading frame

DOE Office of Scientific and Technical Information (OSTI.GOV)

Perez-Leal, Oscar, E-mail: operez@temple.edu; Barrero, Carlos A.; Merali, Salim, E-mail: smerali@temple.edu

2013-07-19

Highlights: •Identification of a novel Nrf2 translational repression mechanism. •The repressor is within the 3′ portion of the Nrf2 ORF. •The translation of Nrf2 or eGFP is reduced by the regulatory element. •The translational repression can be reversed with synonymous codon substitutions. •The molecular mechanism requires the mRNA sequence, but not the encoded amino acids. -- Abstract: Nuclear Factor Erythroid 2-Related Factor 2 (Nrf2) is a transcription factor that is essential for the regulation of an effective antioxidant and detoxifying response. The regulation of its activity can occur at transcription, translation and post-translational levels. Evidence suggests that under environmental stressmore » conditions, new synthesis of Nrf2 is required – a process that is regulated by translational control and is not fully understood. Here we described the identification of a novel molecular process that under basal conditions strongly represses the translation of Nrf2 within the open reading frame (ORF). This mechanism is dependent on the mRNA sequence within the 3′ portion of the ORF of Nrf2 but not in the encoded amino acid sequence. The Nrf2 translational repression can be reversed with the use of synonymous codon substitutions. This discovery suggests an additional layer of control to explain the reason for the low Nrf2 concentration under quiescent state.« less
1,5-(H, RO, RS) shift/6π-electrocyclic ring closure tandem processes on N-[(α-heterosubstituted)-2-tolyl]ketenimines: a case study of relative migratory aptitudes and activating effects.

PubMed

Alajarín, Mateo; Bonillo, Baltasar; Orenes, Raúl-Angel; Ortín, María-Mar; Vidal, Angel

2012-12-28

A number of N-aryl ketenimines, substituted at the ortho position either with different non-cyclic acetalic functions (acetals, monothioacetals, dithioacetals) or with only one alkoxymethyl or (alkylthio)methyl group, have been prepared and submitted to thermal treatment in toluene solution. Under smooth heating the ketenimines bearing non-cyclic acetals converted into 3,4-dihydroquinolines following two competitive tandem sequences that involve the alternative 1,5 migration of a hydride or alkoxy group as the first mechanistic step, followed by subsequent 6π electrocyclic ring closure. The heterocumulenes bearing acyclic monothioacetal and dithioacetal functions converted via a unique consecutive process involving the selective migration of the alkanethiolate group. Ketenimines bearing only one ether or thioether group transformed exclusively by the tandem sequence initiated by a 1,5 hydride shift. All these transformations provided as final reaction products a variety of quinoline derivatives with a range of substitution patterns. From these experiments the following order of propensity to migration can be extracted: RS > RO > H. It was also possible to estimate the following order of relative activating activities: RO > RS > H.
The Embedding Problem for Markov Models of Nucleotide Substitution

PubMed Central

Verbyla, Klara L.; Yap, Von Bing; Pahwa, Anuj; Shao, Yunli; Huttley, Gavin A.

2013-01-01

Continuous-time Markov processes are often used to model the complex natural phenomenon of sequence evolution. To make the process of sequence evolution tractable, simplifying assumptions are often made about the sequence properties and the underlying process. The validity of one such assumption, time-homogeneity, has never been explored. Violations of this assumption can be found by identifying non-embeddability. A process is non-embeddable if it can not be embedded in a continuous time-homogeneous Markov process. In this study, non-embeddability was demonstrated to exist when modelling sequence evolution with Markov models. Evidence of non-embeddability was found primarily at the third codon position, possibly resulting from changes in mutation rate over time. Outgroup edges and those with a deeper time depth were found to have an increased probability of the underlying process being non-embeddable. Overall, low levels of non-embeddability were detected when examining individual edges of triads across a diverse set of alignments. Subsequent phylogenetic reconstruction analyses demonstrated that non-embeddability could impact on the correct prediction of phylogenies, but at extremely low levels. Despite the existence of non-embeddability, there is minimal evidence of violations of the local time homogeneity assumption and consequently the impact is likely to be minor. PMID:23935949
High-resolution characterization of a hepatocellular carcinoma genome.

PubMed

Totoki, Yasushi; Tatsuno, Kenji; Yamamoto, Shogo; Arai, Yasuhito; Hosoda, Fumie; Ishikawa, Shumpei; Tsutsumi, Shuichi; Sonoda, Kohtaro; Totsuka, Hirohiko; Shirakihara, Takuya; Sakamoto, Hiromi; Wang, Linghua; Ojima, Hidenori; Shimada, Kazuaki; Kosuge, Tomoo; Okusaka, Takuji; Kato, Kazuto; Kusuda, Jun; Yoshida, Teruhiko; Aburatani, Hiroyuki; Shibata, Tatsuhiro

2011-05-01

Hepatocellular carcinoma, one of the most common virus-associated cancers, is the third most frequent cause of cancer-related death worldwide. By massively parallel sequencing of a primary hepatitis C virus-positive hepatocellular carcinoma (36× coverage) and matched lymphocytes (>28× coverage) from the same individual, we identified more than 11,000 somatic substitutions of the tumor genome that showed predominance of T>C/A>G transition and a decrease of the T>C substitution on the transcribed strand, suggesting preferential DNA repair. Gene annotation enrichment analysis of 63 validated non-synonymous substitutions revealed enrichment of phosphoproteins. We further validated 22 chromosomal rearrangements, generating four fusion transcripts that had altered transcriptional regulation (BCORL1-ELF4) or promoter activity. Whole-exome sequencing at a higher sequence depth (>76× coverage) revealed a TSC1 nonsense substitution in a subpopulation of the tumor cells. This first high-resolution characterization of a virus-associated cancer genome identified previously uncharacterized mutation patterns, intra-chromosomal rearrangements and fusion genes, as well as genetic heterogeneity within the tumor.
FoxP2 in song-learning birds and vocal-learning mammals.

PubMed

Webb, D M; Zhang, J

2005-01-01

FoxP2 is the first identified gene that is specifically involved in speech and language development in humans. Population genetic studies of FoxP2 revealed a selective sweep in recent human history associated with two amino acid substitutions in exon 7. Avian song learning and human language acquisition share many behavioral and neurological similarities. To determine whether FoxP2 plays a similar role in song-learning birds, we sequenced exon 7 of FoxP2 in multiple song-learning and nonlearning birds. We show extreme conservation of FoxP2 sequences in birds, including unusually low rates of synonymous substitutions. However, no amino acid substitutions are shared between the song-learning birds and humans. Furthermore, sequences from vocal-learning whales, dolphins, and bats do not share the human-unique substitutions. While FoxP2 appears to be under strong functional constraints in mammals and birds, we find no evidence for its role during the evolution of vocal learning in nonhuman animals as in humans.
Using Phylogenetic Analysis to Detect Market Substitution of Atlantic Salmon for Pacific Salmon: An Introductory Biology Laboratory Experiment

ERIC Educational Resources Information Center

Cline, Erica; Gogarten, Jennifer

2012-01-01

We describe a laboratory exercise developed for the cell and molecular biology quarter of a year-long majors' undergraduate introductory biology sequence. In an analysis of salmon samples collected by students in their local stores and restaurants, DNA sequencing and phylogenetic analysis were used to detect market substitution of Atlantic salmon…
Deep Sequencing of Random Mutant Libraries Reveals the Active Site of the Narrow Specificity CphA Metallo-β-Lactamase is Fragile to Mutations.

PubMed

Sun, Zhizeng; Mehta, Shrenik C; Adamski, Carolyn J; Gibbs, Richard A; Palzkill, Timothy

2016-09-12

CphA is a Zn(2+)-dependent metallo-β-lactamase that efficiently hydrolyzes only carbapenem antibiotics. To understand the sequence requirements for CphA function, single codon random mutant libraries were constructed for residues in and near the active site and mutants were selected for E. coli growth on increasing concentrations of imipenem, a carbapenem antibiotic. At high concentrations of imipenem that select for phenotypically wild-type mutants, the active-site residues exhibit stringent sequence requirements in that nearly all residues in positions that contact zinc, the substrate, or the catalytic water do not tolerate amino acid substitutions. In addition, at high imipenem concentrations a number of residues that do not directly contact zinc or substrate are also essential and do not tolerate substitutions. Biochemical analysis confirmed that amino acid substitutions at essential positions decreased the stability or catalytic activity of the CphA enzyme. Therefore, the CphA active - site is fragile to substitutions, suggesting active-site residues are optimized for imipenem hydrolysis. These results also suggest that resistance to inhibitors targeted to the CphA active site would be slow to develop because of the strong sequence constraints on function.
MEGA-CC: computing core of molecular evolutionary genetics analysis program for automated and iterative data analysis.

PubMed

Kumar, Sudhir; Stecher, Glen; Peterson, Daniel; Tamura, Koichiro

2012-10-15

There is a growing need in the research community to apply the molecular evolutionary genetics analysis (MEGA) software tool for batch processing a large number of datasets and to integrate it into analysis workflows. Therefore, we now make available the computing core of the MEGA software as a stand-alone executable (MEGA-CC), along with an analysis prototyper (MEGA-Proto). MEGA-CC provides users with access to all the computational analyses available through MEGA's graphical user interface version. This includes methods for multiple sequence alignment, substitution model selection, evolutionary distance estimation, phylogeny inference, substitution rate and pattern estimation, tests of natural selection and ancestral sequence inference. Additionally, we have upgraded the source code for phylogenetic analysis using the maximum likelihood methods for parallel execution on multiple processors and cores. Here, we describe MEGA-CC and outline the steps for using MEGA-CC in tandem with MEGA-Proto for iterative and automated data analysis. http://www.megasoftware.net/.
Modifications to the Foot-and-Mouth Disease Virus 2A Peptide: Influence on Polyprotein Processing and Virus Replication.

PubMed

Kjær, Jonas; Belsham, Graham J

2018-04-15

Foot-and-mouth disease virus (FMDV) has a positive-sense single-stranded RNA (ssRNA) genome that includes a single, large open reading frame encoding a polyprotein. The cotranslational "cleavage" of this polyprotein at the 2A/2B junction is mediated by the 2A peptide (18 residues in length) using a nonproteolytic mechanism termed "ribosome skipping" or "StopGo." Multiple variants of the 2A polypeptide with this property among the picornaviruses share a conserved C-terminal motif [D(V/I)E(S/T)NPG↓P]. The impact of 2A modifications within this motif on FMDV protein synthesis, polyprotein processing, and virus viability were investigated. Amino acid substitutions are tolerated at residues E 14 , S 15 , and N 16 within the 2A sequences of infectious FMDVs despite their reported "cleavage" efficiencies at the 2A/2B junction of only ca. 30 to 50% compared to that of the wild type (wt). In contrast, no viruses containing substitutions at residue P 17 , G 18 , or P 19 , which displayed little or no "cleavage" activity in vitro , were rescued, but wt revertants were obtained. The 2A substitutions impaired the replication of an FMDV replicon. Using transient-expression assays, it was shown that certain amino acid substitutions at residues E 14 , S 15 , N 16 , and P 19 resulted in partial "cleavage" of a protease-free polyprotein, indicating that these specific residues are not essential for cotranslational "cleavage." Immunofluorescence studies, using full-length FMDV RNA transcripts encoding mutant 2A peptides, indicated that the 2A peptide remained attached to adjacent proteins, presumably 2B. These results show that efficient "cleavage" at the 2A/2B junction is required for optimal virus replication. However, maximal StopGo activity does not appear to be essential for the viability of FMDV. IMPORTANCE Foot-and-mouth disease virus (FMDV) causes one of the most economically important diseases of farm animals. Cotranslational "cleavage" of the FMDV polyprotein precursor at the 2A/2B junction, termed StopGo, is mediated by the short 2A peptide through a nonproteolytic mechanism which leads to release of the nascent protein and continued translation of the downstream sequence. Improved understanding of this process will not only give a better insight into how this peptide influences the FMDV replication cycle but may also assist the application of this sequence in biotechnology for the production of multiple proteins from a single mRNA. Our data show that single amino acid substitutions in the 2A peptide can have a major influence on viral protein synthesis, virus viability, and polyprotein processing. They also indicate that efficient "cleavage" at the 2A/2B junction is required for optimal virus replication. However, maximal StopGo activity is not essential for the viability of FMDV. Copyright © 2018 American Society for Microbiology.
FPGA-based protein sequence alignment : A review

NASA Astrophysics Data System (ADS)

Isa, Mohd. Nazrin Md.; Muhsen, Ku Noor Dhaniah Ku; Saiful Nurdin, Dayana; Ahmad, Muhammad Imran; Anuar Zainol Murad, Sohiful; Nizam Mohyar, Shaiful; Harun, Azizi; Hussin, Razaidi

2017-11-01

Sequence alignment have been optimized using several techniques in order to accelerate the computation time to obtain the optimal score by implementing DP-based algorithm into hardware such as FPGA-based platform. During hardware implementation, there will be performance challenges such as the frequent memory access and highly data dependent in computation process. Therefore, investigation in processing element (PE) configuration where involves more on memory access in load or access the data (substitution matrix, query sequence character) and the PE configuration time will be the main focus in this paper. There are various approaches to enhance the PE configuration performance that have been done in previous works such as by using serial configuration chain and parallel configuration chain i.e. the configuration data will be loaded into each PEs sequentially and simultaneously respectively. Some researchers have proven that the performance using parallel configuration chain has optimized both the configuration time and area.
Scope and mechanism in palladium-catalyzed isomerizations of highly substituted allylic, homoallylic, and alkenyl alcohols.

PubMed

Larionov, Evgeny; Lin, Luqing; Guénée, Laure; Mazet, Clément

2014-12-03

Herein we report the palladium-catalyzed isomerization of highly substituted allylic alcohols and alkenyl alcohols by means of a single catalytic system. The operationally simple reaction protocol is applicable to a broad range of substrates and displays a wide functional group tolerance, and the products are usually isolated in high chemical yield. Experimental and computational mechanistic investigations provide complementary and converging evidence for a chain-walking process consisting of repeated migratory insertion/β-H elimination sequences. Interestingly, the catalyst does not dissociate from the substrate in the isomerization of allylic alcohols, whereas it disengages during the isomerization of alkenyl alcohols when additional substituents are present on the alkyl chain.
Structure and Temporal Dynamics of Populations within Wheat Streak Mosaic Virus Isolates

PubMed Central

Hall, Jeffrey S.; French, Roy; Morris, T. Jack; Stenger, Drake C.

2001-01-01

Variation within the Type and Sidney 81 strains of wheat streak mosaic virus was assessed by single-strand conformation polymorphism (SSCP) analysis and confirmed by nucleotide sequencing. Limiting-dilution subisolates (LDSIs) of each strain were evaluated for polymorphism in the P1, P3, NIa, and CP cistrons. Different SSCP patterns among LDSIs of a strain were associated with single-nucleotide substitutions. Sidney 81 LDSI-S10 was used as founding inoculum to establish three lineages each in wheat, corn, and barley. The P1, HC-Pro, P3, CI, NIa, NIb, and CP cistrons of LDSI-S10 and each lineage at passages 1, 3, 6, and 9 were evaluated for polymorphism. By passage 9, each lineage differed in consensus sequence from LDSI-S10. The majority of substitutions occurred within NIa and CP, although at least one change occurred in each cistron except HC-Pro and P3. Most consensus sequence changes among lineages were independent, with substitutions accumulating over time. However, LDSI-S10 bore a variant nucleotide (G6016) in NIa that was restored to A6016 in eight of nine lineages by passage 6. This near-global reversion is most easily explained by selection. Examination of nonconsensus variation revealed a pool of unique substitutions (singletons) that remained constant in frequency during passage, regardless of the host species examined. These results suggest that mutations arising by viral polymerase error are generated at a constant rate but that most newly generated mutants are sequestered in virions and do not serve as replication templates. Thus, a substantial fraction of variation generated is static and has yet to be tested for relative fitness. In contrast, nonsingleton variation increased upon passage, suggesting that some mutants do serve as replication templates and may become established in a population. Replicated mutants may or may not rise to prominence to become the consensus sequence in a lineage, with the fate of any particular mutant subject to selection and stochastic processes such as genetic drift and population growth factors. PMID:11581391
Using information content and base frequencies to distinguish mutations from genetic polymorphisms in splice junction recognition sites.

PubMed

Rogan, P K; Schneider, T D

1995-01-01

Predicting the effects of nucleotide substitutions in human splice sites has been based on analysis of consensus sequences. We used a graphic representation of sequence conservation and base frequency, the sequence logo, to demonstrate that a change in a splice acceptor of hMSH2 (a gene associated with familial nonpolyposis colon cancer) probably does not reduce splicing efficiency. This confirms a population genetic study that suggested that this substitution is a genetic polymorphism. The information theory-based sequence logo is quantitative and more sensitive than the corresponding splice acceptor consensus sequence for detection of true mutations. Information analysis may potentially be used to distinguish polymorphisms from mutations in other types of transcriptional, translational, or protein-coding motifs.
Correlations of nucleotide substitution rates and base composition of mammalian coding sequences with protein structure.

PubMed

Chiusano, M L; D'Onofrio, G; Alvarez-Valin, F; Jabbari, K; Colonna, G; Bernardi, G

1999-09-30

We investigated the relationships between the nucleotide substitution rates and the predicted secondary structures in the three states representation (alpha-helix, beta-sheet, and coil). The analysis was carried out on 34 alignments, each of which comprised sequences belonging to at least four different mammalian orders. The rates of synonymous substitution were found to be significantly different in regions predicted to be alpha-helix, beta-sheet, or coil. Likewise, the nonsynonymous rates also differ, although expectedly at a lower extent, in the three types of secondary structure, suggesting that different selective constraints associated with the different structures are affecting in a similar way the synonymous and nonsynonymous rates. Moreover, the base composition of the third codon positions is different in coding sequence regions corresponding to different secondary structures of proteins.
Typing of canine parvovirus isolates using mini-sequencing based single nucleotide polymorphism analysis.

PubMed

Naidu, Hariprasad; Subramanian, B Mohana; Chinchkar, Shankar Ramchandra; Sriraman, Rajan; Rana, Samir Kumar; Srinivasan, V A

2012-05-01

The antigenic types of canine parvovirus (CPV) are defined based on differences in the amino acids of the major capsid protein VP2. Type specificity is conferred by a limited number of amino acid changes and in particular by few nucleotide substitutions. PCR based methods are not particularly suitable for typing circulating variants which differ in a few specific nucleotide substitutions. Assays for determining SNPs can detect efficiently nucleotide substitutions and can thus be adapted to identify CPV types. In the present study, CPV typing was performed by single nucleotide extension using the mini-sequencing technique. A mini-sequencing signature was established for all the four CPV types (CPV2, 2a, 2b and 2c) and feline panleukopenia virus. The CPV typing using the mini-sequencing reaction was performed for 13 CPV field isolates and the two vaccine strains available in our repository. All the isolates had been typed earlier by full-length sequencing of the VP2 gene. The typing results obtained from mini-sequencing matched completely with that of sequencing. Typing could be achieved with less than 100 copies of standard plasmid DNA constructs or ≤10¹ FAID₅₀ of virus by mini-sequencing technique. The technique was also efficient for detecting multiple types in mixed infections. Copyright © 2012 Elsevier B.V. All rights reserved.

Regioselective SN2 reactions for rapid syntheses of azido-inositols by one-pot sequence-specific nucleophilysis.

PubMed

Ravi, Arthi; Hassan, Syed Zahid; Vanikrishna, Ajithkumar N; Sureshan, Kana M

2017-04-04

Triflates of myo-inositol undergo facile solvolysis in DMSO and DMF yielding S N 2 products substituted with O-nucleophiles; DMF showed slower kinetics. Axial O-triflate undergoes faster substitution than equatorial O-triflate. By exploiting this difference in kinetics, solvent-tuning and sequence-controlled nucleophilysis, rapid synthesis of three azido-inositols of myo-configuration from myo-inositol itself has been achieved.
On the conservative nature of intragenic recombination

PubMed Central

Drummond, D. Allan; Silberg, Jonathan J.; Meyer, Michelle M.; Wilke, Claus O.; Arnold, Frances H.

2005-01-01

Intragenic recombination rapidly creates protein sequence diversity compared with random mutation, but little is known about the relative effects of recombination and mutation on protein function. Here, we compare recombination of the distantly related β-lactamases PSE-4 and TEM-1 to mutation of PSE-4. We show that, among β-lactamase variants containing the same number of amino acid substitutions, variants created by recombination retain function with a significantly higher probability than those generated by random mutagenesis. We present a simple model that accurately captures the differing effects of mutation and recombination in real and simulated proteins with only four parameters: (i) the amino acid sequence distance between parents, (ii) the number of substitutions, (iii) the average probability that random substitutions will preserve function, and (iv) the average probability that substitutions generated by recombination will preserve function. Our results expose a fundamental functional enrichment in regions of protein sequence space accessible by recombination and provide a framework for evaluating whether the relative rates of mutation and recombination observed in nature reflect the underlying imbalance in their effects on protein function. PMID:15809422
On the conservative nature of intragenic recombination.

PubMed

Drummond, D Allan; Silberg, Jonathan J; Meyer, Michelle M; Wilke, Claus O; Arnold, Frances H

2005-04-12

Intragenic recombination rapidly creates protein sequence diversity compared with random mutation, but little is known about the relative effects of recombination and mutation on protein function. Here, we compare recombination of the distantly related beta-lactamases PSE-4 and TEM-1 to mutation of PSE-4. We show that, among beta-lactamase variants containing the same number of amino acid substitutions, variants created by recombination retain function with a significantly higher probability than those generated by random mutagenesis. We present a simple model that accurately captures the differing effects of mutation and recombination in real and simulated proteins with only four parameters: (i) the amino acid sequence distance between parents, (ii) the number of substitutions, (iii) the average probability that random substitutions will preserve function, and (iv) the average probability that substitutions generated by recombination will preserve function. Our results expose a fundamental functional enrichment in regions of protein sequence space accessible by recombination and provide a framework for evaluating whether the relative rates of mutation and recombination observed in nature reflect the underlying imbalance in their effects on protein function.
Molecular Analysis of Dehalococcoides 16S Ribosomal DNA from Chloroethene-Contaminated Sites throughout North America and Europe

PubMed Central

Hendrickson, Edwin R.; Payne, Jo Ann; Young, Roslyn M.; Starr, Mark G.; Perry, Michael P.; Fahnestock, Stephen; Ellis, David E.; Ebersole, Richard C.

2002-01-01

The environmental distribution of Dehalococcoides group organisms and their association with chloroethene-contaminated sites were examined. Samples from 24 chloroethene-dechlorinating sites scattered throughout North America and Europe were tested for the presence of members of the Dehalococcoides group by using a PCR assay developed to detect Dehalococcoides 16S rRNA gene (rDNA) sequences. Sequences identified by sequence analysis as sequences of members of the Dehalococcoides group were detected at 21 sites. Full dechlorination of chloroethenes to ethene occurred at these sites. Dehalococcoides sequences were not detected in samples from three sites at which partial dechlorination of chloroethenes occurred, where dechlorination appeared to stop at 1,2-cis-dichloroethene. Phylogenetic analysis of the 16S rDNA amplicons confirmed that Dehalococcoides sequences formed a unique 16S rDNA group. These 16S rDNA sequences were divided into three subgroups based on specific base substitution patterns in variable regions 2 and 6 of the Dehalococcoides 16S rDNA sequence. Analyses also demonstrated that specific base substitution patterns were signature patterns. The specific base substitutions distinguished the three sequence subgroups phylogenetically. These results demonstrated that members of the Dehalococcoides group are widely distributed in nature and can be found in a variety of geological formations and in different climatic zones. Furthermore, the association of these organisms with full dechlorination of chloroethenes suggests that they are promising candidates for engineered bioremediation and may be important contributors to natural attenuation of chloroethenes. PMID:11823182
Analysis of Protein Thermostability Enhancing Factors in Industrially Important Thermus Bacteria Species

PubMed Central

Kumwenda, Benjamin; Litthauer, Derek; Bishop, Özlem Tastan; Reva, Oleg

2013-01-01

Elucidation of evolutionary factors that enhance protein thermostability is a critical problem and was the focus of this work on Thermus species. Pairs of orthologous sequences of T. scotoductus SA-01 and T. thermophilus HB27, with the largest negative minimum folding energy (MFE) as predicted by the UNAFold algorithm, were statistically analyzed. Favored substitutions of amino acids residues and their properties were determined. Substitutions were analyzed in modeled protein structures to determine their locations and contribution to energy differences using PyMOL and FoldX programs respectively. Dominant trends in amino acid substitutions consistent with differences in thermostability between orthologous sequences were observed. T. thermophilus thermophilic proteins showed an increase in non-polar, tiny, and charged amino acids. An abundance of alanine substituted by serine and threonine, as well as arginine substituted by glutamine and lysine was observed in T. thermophilus HB27. Structural comparison showed that stabilizing mutations occurred on surfaces and loops in protein structures. PMID:24023508
Molecular evolutionary rates predict both extinction and speciation in temperate angiosperm lineages

PubMed Central

2010-01-01

Background A positive relationship between diversification (i.e., speciation) and nucleotide substitution rates is commonly reported for angiosperm clades. However, the underlying cause of this relationship is often unknown because multiple intrinsic and extrinsic factors can affect the relationship, and these have confounded previous attempts infer causation. Determining which factor drives this oft-reported correlation can lend insight into the macroevolutionary process. Results Using a new database of 13 time-calibrated angiosperm phylogenies based on internal transcribed spacer (ITS) sequences, and controlling for extrinsic variables of life history and habitat, I evaluated several potential intrinsic causes of this correlation. Speciation rates (λ) and relative extinction rates (ε) were positively correlated with mean substitution rates, but were uncorrelated with substitution rate heterogeneity. It is unlikely that the positive diversification-substitution correlation is due to accelerated molecular evolution during speciation (e.g., via enhanced selection or drift), because punctuated increases in ITS rate (i.e., greater mean and variation in ITS rate for rapidly speciating clades) were not observed. Instead, fast molecular evolution likely increases speciation rate (via increased mutational variation as a substrate for selection and reproductive isolation) but also increases extinction (via mutational genetic load). Conclusions In general, these results predict that clades with higher background substitution rates may undergo successful diversification under new conditions while clades with lower substitution rates may experience decreased extinction during environmental stasis. PMID:20515493
Human Lineage-Specific Transcriptional Regulation through GA-Binding Protein Transcription Factor Alpha (GABPa)

PubMed Central

Perdomo-Sabogal, Alvaro; Nowick, Katja; Piccini, Ilaria; Sudbrak, Ralf; Lehrach, Hans; Yaspo, Marie-Laure; Warnatz, Hans-Jörg; Querfurth, Robert

2016-01-01

A substantial fraction of phenotypic differences between closely related species are likely caused by differences in gene regulation. While this has already been postulated over 30 years ago, only few examples of evolutionary changes in gene regulation have been verified. Here, we identified and investigated binding sites of the transcription factor GA-binding protein alpha (GABPa) aiming to discover cis-regulatory adaptations on the human lineage. By performing chromatin immunoprecipitation-sequencing experiments in a human cell line, we found 11,619 putative GABPa binding sites. Through sequence comparisons of the human GABPa binding regions with orthologous sequences from 34 mammals, we identified substitutions that have resulted in 224 putative human-specific GABPa binding sites. To experimentally assess the transcriptional impact of those substitutions, we selected four promoters for promoter-reporter gene assays using human and African green monkey cells. We compared the activities of wild-type promoters to mutated forms, where we have introduced one or more substitutions to mimic the ancestral state devoid of the GABPa consensus binding sequence. Similarly, we introduced the human-specific substitutions into chimpanzee and macaque promoter backgrounds. Our results demonstrate that the identified substitutions are functional, both in human and nonhuman promoters. In addition, we performed GABPa knock-down experiments and found 1,215 genes as strong candidates for primary targets. Further analyses of our data sets link GABPa to cognitive disorders, diabetes, KRAB zinc finger (KRAB-ZNF), and human-specific genes. Thus, we propose that differences in GABPa binding sites played important roles in the evolution of human-specific phenotypes. PMID:26814189
Mapping the neutralizing epitopes on the glycoprotein of infectious haematopoietic necrosis virus, a fish rhabdovirus

USGS Publications Warehouse

Huang, C.; Chien, M.S.; Landolt, M.L.; Batts, W.; Winton, J.

1996-01-01

Twelve neutralizing monoclonal antibodies (MAbs) against the fish rhabdovirus, infectious haematopoietic necrosis virus (IHNV), were used to select 20 MAb escape mutants. The nucleotide sequence of the entire glycoprotein (G) gene was determined for six mutants representing differing cross-neutralization patterns and each had a single nucleotide change leading to a single amino acid substitution within one of three regions of the protein. These data were used to design nested PCR primers to amplify portions of the G gene of the 14 remaining mutants. When the PCR products from these mutants were sequenced, they also had single nucleotide substitutions coding for amino acid substitutions at the same, or nearby, locations. Of the 20 mutants for which all or part of the glycoprotein gene was sequenced, two MAbs selected mutants with substitutions at amino acids 230-231 (antigenic site I) and the remaining MAbs selected mutants with substitutions at amino acids 272-276 (antigenic site II). Two MAbs that selected mutants mapping to amino acids 272-276, selected other mutants that mapped to amino acids 78-81, raising the possibility that this portion of the N terminus of the protein was part of a discontinuous epitope defining antigenic site II. CLUSTAL alignment of the glycoproteins of rabies virus, vesicular stomatitis virus and IHNV revealed similarities in the location of the neutralizing epitopes and a high degree of conservation among cysteine residues, indicating that the glycoproteins of three different genera of animal rhabdoviruses may share a similar three-dimensional structure in spite of extensive sequence divergence.
C. elegans whole-genome sequencing reveals mutational signatures related to carcinogens and DNA repair deficiency

PubMed Central

Meier, Bettina; Cooke, Susanna L.; Weiss, Joerg; Bailly, Aymeric P.; Alexandrov, Ludmil B.; Marshall, John; Raine, Keiran; Maddison, Mark; Anderson, Elizabeth; Stratton, Michael R.; Campbell, Peter J.

2014-01-01

Mutation is associated with developmental and hereditary disorders, aging, and cancer. While we understand some mutational processes operative in human disease, most remain mysterious. We used Caenorhabditis elegans whole-genome sequencing to model mutational signatures, analyzing 183 worm populations across 17 DNA repair-deficient backgrounds propagated for 20 generations or exposed to carcinogens. The baseline mutation rate in C. elegans was approximately one per genome per generation, not overtly altered across several DNA repair deficiencies over 20 generations. Telomere erosion led to complex chromosomal rearrangements initiated by breakage–fusion–bridge cycles and completed by simultaneously acquired, localized clusters of breakpoints. Aflatoxin B1 induced substitutions of guanines in a GpC context, as observed in aflatoxin-induced liver cancers. Mutational burden increased with impaired nucleotide excision repair. Cisplatin and mechlorethamine, DNA crosslinking agents, caused dose- and genotype-dependent signatures among indels, substitutions, and rearrangements. Strikingly, both agents induced clustered rearrangements resembling “chromoanasynthesis,” a replication-based mutational signature seen in constitutional genomic disorders, suggesting that interstrand crosslinks may play a pathogenic role in such events. Cisplatin mutagenicity was most pronounced in xpf-1 mutants, suggesting that this gene critically protects cells against platinum chemotherapy. Thus, experimental model systems combined with genome sequencing can recapture and mechanistically explain mutational signatures associated with human disease. PMID:25030888
Multiflavor streptavidin

DOEpatents

Reznik, Gabriel O.; Sano, Takeshi; Vajda, Sandor; Smith, Cassandra; Cantor, Charles

2002-01-01

Compounds and methods are described for producing streptavidin mutants with changed affinities. In particular, modifications to the sequence of the natural streptavidin gene is described to create amino acid substitutions resulting in greater affinity for biotin substitutes than for biotin.
[Polymorphisms of inhibin α gene exon 1 in buffalo (Bubalus bubalis), gayal (Bos frontalis) and yak (Bos grunniens)].

PubMed

Miao, Yong-Wang; Ha, Fu; Gao, Hua-Shan; Yuan, Feng; Li, Da-Lin; Yuan, Yue-Yun

2012-08-01

To elucidate the genetic characteristics of the bovine Inhibin α subunit (INHA) gene, the polymorphisms in exon 1 of INHA and its bilateral sequences were assayed using PCR with direct sequencing in buffalo, gayal and yak. A comparative analysis was conducted by pooled the results in this study with the published data of INHA on some mammals including some bovine species together. A synonymous substitution c.73C>A was identified in exon 1 of INHA for buffalo, which results in identical encoding product in river and swamp buffalo. In gayal, two non-synonymous but same property substitutions in exon 1 of INHA, viz. c.62 C>T and c.187 G>A, were detected, which lead to p. P21L, p. V63M changes in INHA, respectively. In yak, nucleotide substitution c.62C> T, c.129A>G were found in exon 1 of INHA, the former still causes p. P21L substitution and the latter is synonymous. For the sequence of the 5'-flanking region of INHA examined, no SNPs were found within the species, but a substitution, c. -6T>G, was found. The nucleotide in this site in gayal, yak and cattle was c. -6G, whereas in buffalo it was c. -6T. Meanwhile, a 6-bp deletion, namely c. 262+31_262+36delTCTGAC, was found in the intron of buffalo INHA gene. For this deletion, wild types (+/+) account for main part in river buffalo while mutant types (-/-) are predominant in swamp buffalo. This deletion was not found in gayal, yak and cattle, though these all have another deletion in the intron of INHA, c. 262+78_262+79delTG. The results of sequence alignment showed that the substitutions c. 43A and c. 67G in exon 1 of INHA are specific to buffalo, whereas the substitutions c. 173A and c. 255G are exclusive to gayal, yak and cattle, and c. 24C, c. 47G, c. 174T and c. 206T are specific to goat. Furthermore, there are few differences among gayal, yak and cattle, but there relatively great differences between buffalo, goat and other bovine species regarding the sequences of INHA exon 1.
Hannaella phyllophila sp. nov., a basidiomycetous yeast species associated with plants in Thailand and Taiwan.

PubMed

Surussawadee, Janjira; Jindamorakot, Sasitorn; Nakase, Takashi; Lee, Ching-Fu; Limtong, Savitree

2015-07-01

Five strains representing one novel anamorphic yeast species were isolated from plant leaves collected in Thailand (strains DMKU-SP186(T), ST-111 and ST-201) and Taiwan (strains FN20L02 and SM13L16). On the basis of morphological, biochemical, physiological and chemotaxonomic characteristics and sequence analysis of the D1/D2 region of the large subunit (LSU) rRNA gene and the internal transcribed spacer (ITS) region, they were assigned to a single novel species of the genus Hannaella. The sequences of the D1/D2 regions of the LSU rRNA genes of four of the strains (DMKU-SP186(T), ST-111, FN20L02 and SM13L16) were identical, while differing from strain ST-201 by 2 substitutions and 2 gaps. The nucleotide sequence of the ITS regions of the five strains differed from each other by between 0 and 3 nucleotide substitutions. The novel species was most closely related to Hannaella luteola, but showed 1.0-1.3% nucleotide substitutions (between 6 substitutions out of 568-606 nt and 8 substitutions, and 2 gaps out of 597 nt) in the D1/D2 region of the LSU rRNA gene and 1.4-2.0% nucleotide substitutions (6-9 substitutions out of 435 nt) in the ITS region. Ballistospores were produced by three of the strains on cornmeal agar at 15 and 20 °C after 4 weeks, while H. luteola did not produce ballistospores. The name Hannaella phyllophila sp. nov. is proposed. The type strain is DMKU-SP186(T) ( = BCC 69500(T) = NBRC 110428(T) = CBS 13921(T)).
Whole-Gene Positive Selection, Elevated Synonymous Substitution Rates, Duplication, and Indel Evolution of the Chloroplast clpP1 Gene

PubMed Central

Erixon, Per; Oxelman, Bengt

2008-01-01

Background Synonymous DNA substitution rates in the plant chloroplast genome are generally relatively slow and lineage dependent. Non-synonymous rates are usually even slower due to purifying selection acting on the genes. Positive selection is expected to speed up non-synonymous substitution rates, whereas synonymous rates are expected to be unaffected. Until recently, positive selection has seldom been observed in chloroplast genes, and large-scale structural rearrangements leading to gene duplications are hitherto supposed to be rare. Methodology/Principle Findings We found high substitution rates in the exons of the plastid clpP1 gene in Oenothera (the Evening Primrose family) and three separate lineages in the tribe Sileneae (Caryophyllaceae, the Carnation family). Introns have been lost in some of the lineages, but where present, the intron sequences have substitution rates similar to those found in other introns of their genomes. The elevated substitution rates of clpP1 are associated with statistically significant whole-gene positive selection in three branches of the phylogeny. In two of the lineages we found multiple copies of the gene. Neighboring genes present in the duplicated fragments do not show signs of elevated substitution rates or positive selection. Although non-synonymous substitutions account for most of the increase in substitution rates, synonymous rates are also markedly elevated in some lineages. Whereas plant clpP1 genes experiencing negative (purifying) selection are characterized by having very conserved lengths, genes under positive selection often have large insertions of more or less repetitive amino acid sequence motifs. Conclusions/Significance We found positive selection of the clpP1 gene in various plant lineages to correlated with repeated duplication of the clpP1 gene and surrounding regions, repetitive amino acid sequences, and increase in synonymous substitution rates. The present study sheds light on the controversial issue of whether negative or positive selection is to be expected after gene duplications by providing evidence for the latter alternative. The observed increase in synonymous substitution rates in some of the lineages indicates that the detection of positive selection may be obscured under such circumstances. Future studies are required to explore the functional significance of the large inserted repeated amino acid motifs, as well as the possibility that synonymous substitution rates may be affected by positive selection. PMID:18167545
Mitochondrial genome nucleotide substitution pattern between domesticated silkmoth, Bombyx mori, and its wild ancestors, Chinese Bombyx mandarina and Japanese Bombyx mandarina

PubMed Central

2010-01-01

Bombyx mori and Bombyx mandarina are morphologically and physiologically similar. In this study, we compared the nucleotide variations in the complete mitochondrial (mt) genomes between the domesticated silkmoth, B. mori, and its wild ancestors, Chinese B. mandarina (ChBm) and Japanese B. mandarina (JaBm). The sequence divergence and transition mutation ratio between B. mori and ChBm are significantly smaller than those observed between B. mori and JaBm. The preference of transition by DNA strands between B. mori and ChBm is consistent with that between B. mori and JaBm, however, the regional variation in nucleotide substitution rate shows a different feature. These results suggest that the ChBm mt genome is not undergoing the same evolutionary process as JaBm, providing evidence for selection on mtDNA. Moreover, investigation of the nucleotide sequence divergence in the A+T-rich region of Bombyx mt genomes also provides evidence for the assumption that the A+T-rich region might not be the fastest evolving region of the mtDNA of insects. PMID:21637625
Algorithm to find distant repeats in a single protein sequence

PubMed Central

Banerjee, Nirjhar; Sarani, Rangarajan; Ranjani, Chellamuthu Vasuki; Sowmiya, Govindaraj; Michael, Daliah; Balakrishnan, Narayanasamy; Sekar, Kanagaraj

2008-01-01

Distant repeats in protein sequence play an important role in various aspects of protein analysis. A keen analysis of the distant repeats would enable to establish a firm relation of the repeats with respect to their function and three-dimensional structure during the evolutionary process. Further, it enlightens the diversity of duplication during the evolution. To this end, an algorithm has been developed to find all distant repeats in a protein sequence. The scores from Point Accepted Mutation (PAM) matrix has been deployed for the identification of amino acid substitutions while detecting the distant repeats. Due to the biological importance of distant repeats, the proposed algorithm will be of importance to structural biologists, molecular biologists, biochemists and researchers involved in phylogenetic and evolutionary studies. PMID:19052663
Nonneutral mitochondrial DNA variation in humans and chimpanzees

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nachman, M.W.; Aquadro, C.F.; Brown, W.M.

1996-03-01

We sequenced the NADH dehydrogenase subunit 3 (ND3) gene from a sample of 61 humans, five common chimpanzees, and one gorilla to test whether patterns of mitochondrial DNA (mtDNA) variation are consistent with a neutral model of molecular evolution. Within humans and within chimpanzees, the ratio of replacement to silent nucleotide substitutions was higher than observed in comparisons between species, contrary to neutral expectations. To test the generality of this result, we reanalyzed published human RFLP data from the entire mitochondrial genome. Gains of restriction sites relative to a known human mtDNA sequence were used to infer unambiguous nucleotide substitutions.more » We also compared the complete mtDNA sequences of three humans. Both the RFLP data and the sequence data reveal a higher ratio of replacement to silent nucleotide substitutions within humans than is seen between species. This pattern is observed at most or all human mitochondrial genes and is inconsistent with a strictly neutral model. These data suggest that many mitochondrial protein polymorphisms are slightly deleterious, consistent with studies of human mitochondrial diseases. 59 refs., 2 figs., 8 tabs.« less
Changing genetic information through RNA editing

NASA Technical Reports Server (NTRS)

Maas, S.; Rich, A.

2000-01-01

RNA editing, the post-transcriptional alteration of a gene-encoded sequence, is a widespread phenomenon in eukaryotes. As a consequence of RNA editing, functionally distinct proteins can be produced from a single gene. The molecular mechanisms involved include single or multiple base insertions or deletions as well as base substitutions. In mammals, one type of substitutional RNA editing, characterized by site-specific base-modification, was shown to modulate important physiological processes. The underlying reaction mechanism of substitutional RNA editing involves hydrolytic deamination of cytosine or adenosine bases to uracil or inosine, respectively. Protein factors have been characterized that are able to induce RNA editing in vitro. A supergene family of RNA-dependent deaminases has emerged with the recent addition of adenosine deaminases specific for tRNA. Here we review the developments that have substantially increased our understanding of base-modification RNA editing over the past few years, with an emphasis on mechanistic differences, evolutionary aspects and the first insights into the regulation of editing activity.
The spatial distribution of fixed mutations within genes coding for proteins

NASA Technical Reports Server (NTRS)

Holmquist, R.; Goodman, M.; Conroy, T.; Czelusniak, J.

1983-01-01

An examination has been conducted of the extensive amino acid sequence data now available for five protein families - the alpha crystallin A chain, myoglobin, alpha and beta hemoglobin, and the cytochromes c - with the goal of estimating the true spatial distribution of base substitutions within genes that code for proteins. In every case the commonly used Poisson density failed to even approximate the experimental pattern of base substitution. For the 87 species of beta hemoglobin examined, for example, the probability that the observed results were from a Poisson process was the minuscule 10 to the -44th. Analogous results were obtained for the other functional families. All the data were reasonably, but not perfectly, described by the negative binomial density. In particular, most of the data were described by one of the very simple limiting forms of this density, the geometric density. The implications of this for evolutionary inference are discussed. It is evident that most estimates of total base substitutions between genes are badly in need of revision.
A noise resistant symmetric key cryptosystem based on S8 S-boxes and chaotic maps

NASA Astrophysics Data System (ADS)

Hussain, Iqtadar; Anees, Amir; Aslam, Muhammad; Ahmed, Rehan; Siddiqui, Nasir

2018-04-01

In this manuscript, we have proposed an encryption algorithm to encrypt any digital data. The proposed algorithm is primarily based on the substitution-permutation in which the substitution process is performed by the S 8 Substitution boxes. The proposed algorithm incorporates three different chaotic maps. We have analysed the behaviour of chaos by secure communication in great length, and accordingly, we have applied those chaotic sequences in the proposed encryption algorithm. The simulation and statistical results revealed that the proposed encryption scheme is secure against different attacks. Moreover, the encryption scheme can tolerate the channel noise as well; if the encrypted data is corrupted by the unauthenticated user or by the channel noise, the decryption can still be successfully done with some distortion. The overall results confirmed that the presented work has good cryptographic features, low computational complexity and resistant to the channel noise which makes it suitable for low profile mobile applications.
Diversification and the rate of molecular evolution: no evidence of a link in mammals.

PubMed

Goldie, Xavier; Lanfear, Robert; Bromham, Lindell

2011-10-04

Recent research has indicated a positive association between rates of molecular evolution and diversification in a number of taxa. However debate continues concerning the universality and cause of this relationship. Here, we present the first systematic investigation of this relationship within the mammals. We use phylogenetically independent sister-pair comparisons to test for a relationship between substitution rates and clade size at a number of taxonomic levels. Total, non-synonymous and synonymous substitution rates were estimated from mitochondrial and nuclear DNA sequences. We found no evidence for an association between clade size and substitution rates in mammals, for either the nuclear or the mitochondrial sequences. We found significant associations between body size and substitution rates, as previously reported. Our results present a contrast to previous research, which has reported significant positive associations between substitution rates and diversification for birds, angiosperms and reptiles. There are three possible reasons for the differences between the observed results in mammals versus other clades. First, there may be no link between substitution rates and diversification in mammals. Second, this link may exist, but may be much weaker in mammals than in other clades. Third, the link between substitution rates and diversification may exist in mammals, but may be confounded by other variables.

Substantial Regional Variation in Substitution Rates in the Human Genome: Importance of GC Content, Gene Density, and Telomere-Specific Effects

NASA Astrophysics Data System (ADS)

Arndt, Peter F.; Hwa, Terence; Petrov, Dmitri A.

2005-06-01

This study presents the first global, 1 Mbp level analysis of patterns of nucleotide substitutions along the human lineage. The study is based on the analysis of a large amount of repetitive elements deposited into the human genome since the mammalian radiation, yielding a number of results that would have been difficult to obtain using the more conventional comparative method of analysis. This analysis revealed substantial and consistent variability of rates of substitution, with the variability ranging up to 2-fold among different regions. The rates of substitutions of C or G nucleotides with A or T nucleotides vary much more sharply than the reverse rates suggesting that much of that variation is due to differences in mutation rates rather than in the probabilities of fixation of C/G vs. A/T nucleotides across the genome. For all types of substitution we observe substantially more hotspots than coldspots, with hotspots showing substantial clustering over tens of Mbp's. Our analysis revealed that GC-content of surrounding sequences is the best predictor of the rates of substitution. The pattern of substitution appears very different near telomeres compared to the rest of the genome and cannot be explained by the genome-wide correlations of the substitution rates with GC content or exon density. The telomere pattern of substitution is consistent with natural selection or biased gene conversion acting to increase the GC-content of the sequences that are within 10-15 Mbp away from the telomere.
Detection of Rickettsia helvetica and Candidatus R. tarasevichiae DNA in Ixodes persulcatus ticks collected in Northeastern European Russia (Komi Republic).

PubMed

Kartashov, Mikhail Yu; Glushkova, Ludmila I; Mikryukova, Tamara P; Korabelnikov, Igor V; Egorova, Yulia I; Tupota, Natalia L; Protopopova, Elena V; Konovalova, Svetlana N; Ternovoi, Vladimir A; Loktev, Valery B

2017-06-01

The number of tick-borne infections in the northern European regions of Russia has increased considerably in the last years. In the present study, 676 unfed adult Ixodes persulcatus ticks were collected in the Komi Republic from 2011 to 2013 to study tick-borne rickettsioses. Rickettsia spp. DNA was detected by PCR in 51 (7.6%) ticks. The nucleotide sequence analysis of gltA fragments (765bp) from 51 ticks indicated that 60.8% and 39.2% of the ticks were infected with Rickettsia helvetica and Candidatus R. tarasevichiae, respectively. The gltA fragments showed 100% identity with those of Candidatus R. tarasevichiae previously discovered in Siberia and China, whereas R. helvetica showed 99.9% sequence identity with European isolates. The ompB had 8 nucleotide substitutions, 6 of which resulted in amino acid substitutions. In the sca9 gene, 3 nucleotide substitutions were detected, and only one resulted in amino acid substitution. The smpA, ompW, and β-lactamase genes of R. helvetica also showed a high level of sequence identity. Copyright © 2017 Elsevier GmbH. All rights reserved.
Genetic characterization of the dihydrofolate reductase gene of Pneumocystis jirovecii isolates from Portugal.

PubMed

Costa, Marina C; Esteves, Francisco; Antunes, Francisco; Matos, Olga

2006-12-01

The aim of the present study was to evaluate the genetic variation of Pneumocystis jirovecii dihydrofolate reductase (DHFR) gene in an immunocompromised Portuguese population and to investigate the possible association between DHFR genotypes and P. jirovecii pneumonia (PcP) prophylaxis with co-trimoxazole. One hundred and thirty-eight P. jirovecii isolates were submitted to DHFR genetic characterization by PCR and sequencing. In the studied population, 72.7% of the patients presented sequences identical to the wild-type sequence of the P. jirovecii DHFR gene and 27.3% presented point substitutions. A total of nine substitution sites were identified; four synonymous substitutions at nucleotide positions 201, 272, 312 and 381 were detected in 31 patients. Five non-synonymous substitutions were observed, leading to the DHFR mutations Leu-13-->Ser, Asn-23-->Ser, Ser-31-->Phe, Met-52-->Leu and Ala-67-->Val. With the exception of the polymorphism at position 312 and the mutation at codon 52, all polymorphisms were reported in this study for the first time. Our results suggest that DHFR gene polymorphisms are frequent in the Portuguese immunocompromised population but do not seem to be associated with PcP prophylaxis failure (P = 0.748 and P = 0.730).
Higher-level phylogeny of paraneopteran insects inferred from mitochondrial genome sequences

PubMed Central

Li, Hu; Shao, Renfu; Song, Nan; Song, Fan; Jiang, Pei; Li, Zhihong; Cai, Wanzhi

2015-01-01

Mitochondrial (mt) genome data have been proven to be informative for animal phylogenetic studies but may also suffer from systematic errors, due to the effects of accelerated substitution rate and compositional heterogeneity. We analyzed the mt genomes of 25 insect species from the four paraneopteran orders, aiming to better understand how accelerated substitution rate and compositional heterogeneity affect the inferences of the higher-level phylogeny of this diverse group of hemimetabolous insects. We found substantial heterogeneity in base composition and contrasting rates in nucleotide substitution among these paraneopteran insects, which complicate the inference of higher-level phylogeny. The phylogenies inferred with concatenated sequences of mt genes using maximum likelihood and Bayesian methods and homogeneous models failed to recover Psocodea and Hemiptera as monophyletic groups but grouped, instead, the taxa that had accelerated substitution rates together, including Sternorrhyncha (a suborder of Hemiptera), Thysanoptera, Phthiraptera and Liposcelididae (a family of Psocoptera). Bayesian inference with nucleotide sequences and heterogeneous models (CAT and CAT + GTR), however, recovered Psocodea, Thysanoptera and Hemiptera each as a monophyletic group. Within Psocodea, Liposcelididae is more closely related to Phthiraptera than to other species of Psocoptera. Furthermore, Thysanoptera was recovered as the sister group to Hemiptera. PMID:25704094
Characterization of minimal sequences associated with self-similar interval exchange maps

NASA Astrophysics Data System (ADS)

Cobo, Milton; Gutiérrez-Romo, Rodolfo; Maass, Alejandro

2018-04-01

The construction of affine interval exchange maps (IEMs) with wandering intervals that are semi-conjugate to a given self-similar IEM is strongly related to the existence of the so-called minimal sequences associated with local potentials, which are certain elements of the substitution subshift arising from the given IEM. In this article, under the condition called unique representation property, we characterize such minimal sequences for potentials coming from non-real eigenvalues of the substitution matrix. We also give conditions on the slopes of the affine extensions of a self-similar IEM that determine whether it exhibits a wandering interval or not.
Single Amino Acid Substitutions at Specific Positions of the Heptad Repeat Sequence of Piscidin-1 Yielded Novel Analogs That Show Low Cytotoxicity and In Vitro and In Vivo Antiendotoxin Activity

PubMed Central

Kumar, Amit; Tripathi, Amit Kumar; Kathuria, Manoj; Shree, Sonal; Tripathi, Jitendra Kumar; Purshottam, R. K.; Ramachandran, Ravishankar; Mitra, Kalyan

2016-01-01

Piscidin-1 possesses significant antimicrobial and cytotoxic activities. To recognize the primary amino acid sequence(s) in piscidin-1 that could be important for its biological activity, a long heptad repeat sequence located in the region from amino acids 2 to 19 was identified. To comprehend the possible role of this motif, six analogs of piscidin-1 were designed by selectively replacing a single isoleucine residue at a d (5th) position or at an a (9th or 16th) position with either an alanine or a valine residue. Two more analogs, namely, I5F,F6A-piscidin-1 and V12I-piscidin-1, were designed for investigating the effect of interchanging an alanine residue at a d position with an adjacent phenylalanine residue and replacing a valine residue with an isoleucine residue at another d position of the heptad repeat of piscidin-1, respectively. Single alanine-substituted analogs exhibited significantly reduced cytotoxicity against mammalian cells compared with that of piscidin-1 but appreciably retained the antibacterial and antiendotoxin activities of piscidin-1. All the single valine-substituted piscidin-1 analogs and I5F,F6A-piscidin-1 showed cytotoxicity greater than that of the corresponding alanine-substituted analogs, antibacterial activity marginally greater than or similar to that of the corresponding alanine-substituted analogs, and also antiendotoxin activity superior to that of the corresponding alanine-substituted analogs. Interestingly, among these peptides, V12I-piscidin-1 showed the highest cytotoxicity and antibacterial and antiendotoxin activities. Lipopolysaccharide (12 mg/kg of body weight)-treated mice, further treated with I16A-piscidin-1, the piscidin-1 analog with the highest therapeutic index, at a single dose of 1 or 2 mg/kg of body weight, showed 80 and 100% survival, respectively. Structural and functional characterization of these peptides revealed the basis of their biological activity and demonstrated that nontoxic piscidin-1 analogs with significant antimicrobial and antiendotoxin activities can be designed by incorporating single alanine substitutions in the piscidin-1 heptad repeat. PMID:27067326
Differential sequence diversity at merozoite surface protein-1 locus of Plasmodium knowlesi from humans and macaques in Thailand.

PubMed

Putaporntip, Chaturong; Thongaree, Siriporn; Jongwutiwes, Somchai

2013-08-01

To determine the genetic diversity and potential transmission routes of Plasmodium knowlesi, we analyzed the complete nucleotide sequence of the gene encoding the merozoite surface protein-1 of this simian malaria (Pkmsp-1), an asexual blood-stage vaccine candidate, from naturally infected humans and macaques in Thailand. Analysis of Pkmsp-1 sequences from humans (n=12) and monkeys (n=12) reveals five conserved and four variable domains. Most nucleotide substitutions in conserved domains were dimorphic whereas three of four variable domains contained complex repeats with extensive sequence and size variation. Besides purifying selection in conserved domains, evidence of intragenic recombination scattering across Pkmsp-1 was detected. The number of haplotypes, haplotype diversity, nucleotide diversity and recombination sites of human-derived sequences exceeded that of monkey-derived sequences. Phylogenetic networks based on concatenated conserved sequences of Pkmsp-1 displayed a character pattern that could have arisen from sampling process or the presence of two independent routes of P. knowlesi transmission, i.e. from macaques to human and from human to humans in Thailand. Copyright © 2013 Elsevier B.V. All rights reserved.
Cytogenetic and molecular identification of a wheat-Leymus mollis alien multiple substitution line from octoploid Tritileymus x Triticum durum.

PubMed

Pang, Y H; Zhao, J X; Du, W L; Li, Y L; Wang, J; Wang, L M; Wu, J; Cheng, X N; Yang, Q H; Chen, X H

2014-05-23

Leymus mollis (Trin.) Pilger (NsNsXmXm, 2n = 28), a wild relative of common wheat, possesses many traits that are potentially valuable for wheat improvement. In order to exploit and utilize the useful genes of L. mollis, we developed a multiple alien substitution line, 10DM50, from the progenies of octoploid Tritileymus M842-16 x Triticum durum cv. D4286. Genomic in situ hybridization analysis of mitosis and meiosis (metaphase I), using labeled total DNA of Psathyrostachys huashanica as probe, showed that the substitution line 10DM50 was a cytogenetically stable alien substitution line with 36 chromosomes from wheat and three pairs of Ns genome chromosomes from L. mollis. Simple sequence repeat analysis showed that the chromosomes 3D, 6D, and 7D were absent in 10DM50. Expressed sequence tag-sequence tagged sites analysis showed that new chromatin from 3Ns, 6Ns, and 7Ns of L. mollis were detected in 10DM50. We deduced that the substitution line 10DM50 was a multiple alien substitution line with the 3D, 6D, and 7D chromosomes replaced by 3Ns, 6Ns, and 7Ns from L. mollis. 10DM50 showed high resistance to leaf rust and significantly improved spike length, spikes per plant, and kernels per spike, which are correlated with higher wheat yield. These results suggest that line 10DM50 could be used as intermediate material for transferring desirable traits from L. mollis into common wheat in breeding programs.
Stereoselective synthesis of novel highly substituted isochromanone and isoquinolinone-containing exocyclic tetrasubstituted alkenes.

PubMed

Arthuis, Martin; Pontikis, Renée; Florent, Jean-Claude

2009-03-06

An efficient synthetic route toward the synthesis of highly substituted arylethylidene-isoquinolinones/isochromanones is reported. The tandem carbopalladation/Suzuki-Miyaura coupling sequence stereoselectively provided various functionalized polycyclic compounds in moderate to excellent yields.
Comparative transcriptome analysis of cotton fiber development of Upland cotton (Gossypium hirsutum) and Chromosome Segment Substitution Lines from G. hirsutum × G. barbadense.

PubMed

Li, Peng-Tao; Wang, Mi; Lu, Quan-Wei; Ge, Qun; Rashid, Md Harun Or; Liu, Ai-Ying; Gong, Ju-Wu; Shang, Hai-Hong; Gong, Wan-Kui; Li, Jun-Wen; Song, Wei-Wu; Guo, Li-Xue; Su, Wei; Li, Shao-Qi; Guo, Xiao-Ping; Shi, Yu-Zhen; Yuan, You-Lu

2017-09-08

How to develop new cotton varieties possessing high yield traits of Upland cotton and superior fiber quality traits of Sea Island cotton remains a key task for cotton breeders and researchers. While multiple attempts bring in little significant progresses, the development of Chromosome Segment Substitution Lines (CSSLs) from Gossypium barbadense in G. hirsutum background provided ideal materials for aforementioned breeding purposes in upland cotton improvement. Based on the excellent fiber performance and relatively clear chromosome substitution segments information identified by Simple Sequence Repeat (SSR) markers, two CSSLs, MBI9915 and MBI9749, together with the recurrent parent CCRI36 were chosen to conduct transcriptome sequencing during the development stages of fiber elongation and Secondary Cell Wall (SCW) synthesis (from 10DPA and 28DPA), aiming at revealing the mechanism of fiber development and the potential contribution of chromosome substitution segments from Sea Island cotton to fiber development of Upland cotton. In total, 15 RNA-seq libraries were constructed and sequenced separately, generating 705.433 million clean reads with mean GC content of 45.13% and average Q30 of 90.26%. Through multiple comparisons between libraries, 1801 differentially expressed genes (DEGs) were identified, of which the 902 up-regulated DEGs were mainly involved in cell wall organization and response to oxidative stress and auxin, while the 898 down-regulated ones participated in translation, regulation of transcription, DNA-templated and cytoplasmic translation based on GO annotation and KEGG enrichment analysis. Subsequently, STEM software was performed to explicate the temporal expression pattern of DEGs. Two peroxidases and four flavonoid pathway-related genes were identified in the "oxidation-reduction process", which could play a role in fiber development and quality formation. Finally, the reliability of RNA-seq data was validated by quantitative real-time PCR of randomly selected 20 genes. The present report focuses on the similarities and differences of transcriptome profiles between the two CSSLs and the recurrent parent CCRI36 and provides novel insights into the molecular mechanism of fiber development, and into further exploration of the feasible contribution of G. barbadense substitution segments to fiber quality formation, which will lay solid foundation for simultaneously improving fiber yield and quality of upland cotton through CSSLs.
The role of proline-containing peptide triads in β-sheet formation: A kinetic study.

PubMed

Takor, Gaius A; Higashiya, Seiichiro; Sikirzhytski, Vitali K; Seeley, Jason P; Lednev, Igor K; Welch, John T

2015-06-01

The design of biomimetic materials through molecular self-assembly is a growing area of modern nanotechnology. With problems of protein folding, self-assembly, and sequence-structure relationships as essential in nanotechnology as in biology, the effect of the nucleation of β-hairpin formation by proline on the folding process has been investigated in model studies. Previously such studies were limited to investigations of the influence of proline on the formation of turns in short peptide sequences. The effect of proline-based triads on the folding of an 11-kDa amyloidogenic peptide GH6[(GA)3GY(GA)3GE]8 GAH6 (YE8) was investigated by selective substitution of the proline-substituted triads at the γ-turn sites. The folding and fibrillation of the singly proline-substituted polypeptides, e.g., GH6-[(GA)3GY(GA)3GE]7(GA)3GY(GA)3PD-GAH6 (8PD), and doubly proline-substituted polypeptides, e.g., GH6-[(GA)3GY(GA)3GE]3(GA)3GY(GA)3PD[(GA)3GY(GA)3GE]3(GA)3GY(GA)3PD-GAH6 (4,8PD), were directly monitored by circular dichroism and deep UV resonance Raman and fluorescence spectroscopies. These findings were used to identify the essential folding domains, i.e., the minimum number of β-strands necessary for stable folding. These experimental findings may be especially useful in the design and construction of peptidic materials for a wide range of applications as well as in understanding the mechanisms of folding critical to fibril formation. © 2015 Wiley Periodicals, Inc.
Conserved intergenic sequences revealed by CTAG-profiling in Salmonella: thermodynamic modeling for function prediction

NASA Astrophysics Data System (ADS)

Tang, Le; Zhu, Songling; Mastriani, Emilio; Fang, Xin; Zhou, Yu-Jie; Li, Yong-Guo; Johnston, Randal N.; Guo, Zheng; Liu, Gui-Rong; Liu, Shu-Lin

2017-03-01

Highly conserved short sequences help identify functional genomic regions and facilitate genomic annotation. We used Salmonella as the model to search the genome for evolutionarily conserved regions and focused on the tetranucleotide sequence CTAG for its potentially important functions. In Salmonella, CTAG is highly conserved across the lineages and large numbers of CTAG-containing short sequences fall in intergenic regions, strongly indicating their biological importance. Computer modeling demonstrated stable stem-loop structures in some of the CTAG-containing intergenic regions, and substitution of a nucleotide of the CTAG sequence would radically rearrange the free energy and disrupt the structure. The postulated degeneration of CTAG takes distinct patterns among Salmonella lineages and provides novel information about genomic divergence and evolution of these bacterial pathogens. Comparison of the vertically and horizontally transmitted genomic segments showed different CTAG distribution landscapes, with the genome amelioration process to remove CTAG taking place inward from both terminals of the horizontally acquired segment.
ArrayPitope: Automated Analysis of Amino Acid Substitutions for Peptide Microarray-Based Antibody Epitope Mapping.

PubMed

Hansen, Christian Skjødt; Østerbye, Thomas; Marcatili, Paolo; Lund, Ole; Buus, Søren; Nielsen, Morten

2017-01-01

Identification of epitopes targeted by antibodies (B cell epitopes) is of critical importance for the development of many diagnostic and therapeutic tools. For clinical usage, such epitopes must be extensively characterized in order to validate specificity and to document potential cross-reactivity. B cell epitopes are typically classified as either linear epitopes, i.e. short consecutive segments from the protein sequence or conformational epitopes adapted through native protein folding. Recent advances in high-density peptide microarrays enable high-throughput, high-resolution identification and characterization of linear B cell epitopes. Using exhaustive amino acid substitution analysis of peptides originating from target antigens, these microarrays can be used to address the specificity of polyclonal antibodies raised against such antigens containing hundreds of epitopes. However, the interpretation of the data provided in such large-scale screenings is far from trivial and in most cases it requires advanced computational and statistical skills. Here, we present an online application for automated identification of linear B cell epitopes, allowing the non-expert user to analyse peptide microarray data. The application takes as input quantitative peptide data of fully or partially substituted overlapping peptides from a given antigen sequence and identifies epitope residues (residues that are significantly affected by substitutions) and visualize the selectivity towards each residue by sequence logo plots. Demonstrating utility, the application was used to identify and address the antibody specificity of 18 linear epitope regions in Human Serum Albumin (HSA), using peptide microarray data consisting of fully substituted peptides spanning the entire sequence of HSA and incubated with polyclonal rabbit anti-HSA (and mouse anti-rabbit-Cy3). The application is made available at: www.cbs.dtu.dk/services/ArrayPitope.
Identification of single amino acid substitutions (SAAS) in neuraminidase from influenza a virus (H1N1) via mass spectrometry analysis coupled with de novo peptide sequencing.

PubMed

Peng, Qisheng; Wang, Zijian; Wu, Donglin; Li, Xiaoou; Liu, Xiaofeng; Sun, Wanchun; Liu, Ning

2016-08-01

Amino acid substitutions in the neuraminidase of the influenza virus are the main cause of the emergence of resistance to zanamivir or oseltamivir during seasonal influenza treatment; they are the result of non-synonymous mutations in the viral genome that can be successfully detected by polymer chain reaction (PCR)-based approaches. There is always an urgent need to detect variation in amino acid sequences directly at the protein level. Mass spectrometry coupled with de novo sequencing has been explored as an alternative and straightforward strategy for detecting amino acid substitutions, as well - this approach is the primary focus of the present study. Influenza virus (A/Puerto Rico/8/1934 H1N1) propagated in embryonated chicken eggs was purified by ultracentrifugation, followed by PNGase F treatment. The deglycosylated virion was lysed and separated by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE). The gel band corresponding to neuraminidase was picked up and subjected to liquid chromatography tandem mass spectrometry (LC-MS/MS) analysis. LC-MS/MS analyses, coupled with manual de novo sequencing, allowed the determination of three amino acid substitutions: R346K, S349 N, and S370I/L, in the neuraminidase from the influenza virus (A/Puerto Rico/8/1934 H1N1), which were located in three mutated peptides of the neuraminidase: YGNGVWIGK, TKNHSSR, and PNGWTETDI/LK, respectively. We found that the amino acid substitutions in the proteins of RNA viruses (including influenza A virus) resulting from non-synonymous gene mutations can indeed be directly analyzed via mass spectrometry, and that manual interpretation of the MS/MS data may be beneficial. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
ArrayPitope: Automated Analysis of Amino Acid Substitutions for Peptide Microarray-Based Antibody Epitope Mapping

PubMed Central

Hansen, Christian Skjødt; Østerbye, Thomas; Marcatili, Paolo; Lund, Ole; Buus, Søren

2017-01-01

Identification of epitopes targeted by antibodies (B cell epitopes) is of critical importance for the development of many diagnostic and therapeutic tools. For clinical usage, such epitopes must be extensively characterized in order to validate specificity and to document potential cross-reactivity. B cell epitopes are typically classified as either linear epitopes, i.e. short consecutive segments from the protein sequence or conformational epitopes adapted through native protein folding. Recent advances in high-density peptide microarrays enable high-throughput, high-resolution identification and characterization of linear B cell epitopes. Using exhaustive amino acid substitution analysis of peptides originating from target antigens, these microarrays can be used to address the specificity of polyclonal antibodies raised against such antigens containing hundreds of epitopes. However, the interpretation of the data provided in such large-scale screenings is far from trivial and in most cases it requires advanced computational and statistical skills. Here, we present an online application for automated identification of linear B cell epitopes, allowing the non-expert user to analyse peptide microarray data. The application takes as input quantitative peptide data of fully or partially substituted overlapping peptides from a given antigen sequence and identifies epitope residues (residues that are significantly affected by substitutions) and visualize the selectivity towards each residue by sequence logo plots. Demonstrating utility, the application was used to identify and address the antibody specificity of 18 linear epitope regions in Human Serum Albumin (HSA), using peptide microarray data consisting of fully substituted peptides spanning the entire sequence of HSA and incubated with polyclonal rabbit anti-HSA (and mouse anti-rabbit-Cy3). The application is made available at: www.cbs.dtu.dk/services/ArrayPitope. PMID:28095436
Hypermutation in shark immunoglobulin light chain genes results in contiguous substitutions.

PubMed

Lee, Susan S; Tranchina, Daniel; Ohta, Yuko; Flajnik, Martin F; Hsu, Ellen

2002-04-01

Among 631 substitutions present in 90 nurse shark immunoglobulin light chain somatic mutants, 338 constitute 2-4 bp stretches of adjacent changes. An absence of mutations in perinatal sequences and the bias for one mutating V gene in adults suggest that the diversification is antigen dependent. The substitutions shared no patterns, and the absence of donor sequences, including from family members, supports the idea that most changes arose from nontemplated mutation. The tandem mutations as a group are distinguished by consistently fewer transition changes and an A bias. We suggest this is one of several pathways of hypermutation diversifying shark antigen-receptor genes--point mutations, tandem mutations, and mutations with a G-C preference--that coevolved with or preceded gene rearrangement.
Comparative analysis of the prion protein gene sequences in African lion.

PubMed

Wu, Chang-De; Pang, Wan-Yong; Zhao, De-Ming

2006-10-01

The prion protein gene of African lion (Panthera Leo) was first cloned and polymorphisms screened. The results suggest that the prion protein gene of eight African lions is highly homogenous. The amino acid sequences of the prion protein (PrP) of all samples tested were identical. Four single nucleotide polymorphisms (C42T, C81A, C420T, T600C) in the prion protein gene (Prnp) of African lion were found, but no amino acid substitutions. Sequence analysis showed that the higher homology is observed to felis catus AF003087 (96.7%) and to sheep number M31313.1 (96.2%) Genbank accessed. With respect to all the mammalian prion protein sequences compared, the African lion prion protein sequence has three amino acid substitutions. The homology might in turn affect the potential intermolecular interactions critical for cross species transmission of prion disease.
Electronic hybridization detection in microarray format and DNA genotyping

NASA Astrophysics Data System (ADS)

Blin, Antoine; Cissé, Ismaïl; Bockelmann, Ulrich

2014-02-01

We describe an approach to substituting a fluorescence microarray with a surface made of an arrangement of electrolyte-gated field effect transistors. This was achieved using a dedicated blocking of non-specific interactions and comparing threshold voltage shifts of transistors exhibiting probe molecules of different base sequence. We apply the approach to detection of the 35delG mutation, which is related to non-syndromic deafness and is one of the most frequent mutations in humans. The process involves barcode sequences that are generated by Tas-PCR, a newly developed replication reaction using polymerase blocking. The barcodes are recognized by hybridization to surface attached probes and are directly detected by the semiconductor device.
Electronic hybridization detection in microarray format and DNA genotyping

PubMed Central

Blin, Antoine; Cissé, Ismaïl; Bockelmann, Ulrich

2014-01-01

We describe an approach to substituting a fluorescence microarray with a surface made of an arrangement of electrolyte-gated field effect transistors. This was achieved using a dedicated blocking of non-specific interactions and comparing threshold voltage shifts of transistors exhibiting probe molecules of different base sequence. We apply the approach to detection of the 35delG mutation, which is related to non-syndromic deafness and is one of the most frequent mutations in humans. The process involves barcode sequences that are generated by Tas-PCR, a newly developed replication reaction using polymerase blocking. The barcodes are recognized by hybridization to surface attached probes and are directly detected by the semiconductor device. PMID:24569823
When syntax meets action: Brain potential evidence of overlapping between language and motor sequencing.

PubMed

Casado, Pilar; Martín-Loeches, Manuel; León, Inmaculada; Hernández-Gutiérrez, David; Espuny, Javier; Muñoz, Francisco; Jiménez-Ortega, Laura; Fondevila, Sabela; de Vega, Manuel

2018-03-01

This study aims to extend the embodied cognition approach to syntactic processing. The hypothesis is that the brain resources to plan and perform motor sequences are also involved in syntactic processing. To test this hypothesis, Event-Related brain Potentials (ERPs) were recorded while participants read sentences with embedded relative clauses, judging for their acceptability (half of the sentences contained a subject-verb morphosyntactic disagreement). The sentences, previously divided into three segments, were self-administered segment-by-segment in two different sequential manners: linear or non-linear. Linear self-administration consisted of successively pressing three buttons with three consecutive fingers in the right hand, while non-linear self-administration implied the substitution of the finger in the middle position by the right foot. Our aim was to test whether syntactic processing could be affected by the manner the sentences were self-administered. Main results revealed that the ERPs LAN component vanished whereas the P600 component increased in response to incorrect verbs, for non-linear relative to linear self-administration. The LAN and P600 components reflect early and late syntactic processing, respectively. Our results convey evidence that language syntactic processing and performing non-linguistic motor sequences may share resources in the human brain. Copyright © 2017 Elsevier Ltd. All rights reserved.

Regio- and Stereoselective Cascades via Aldol Condensation and 1,3-Dipolar Cycloaddition for Construction of Functional Pyrrolizidine Derivatives.

PubMed

Mao, Zhuo-Ya; Liu, Yi-Wen; Han, Pan; Dong, Han-Qing; Si, Chang-Mei; Wei, Bang-Guo; Lin, Guo-Qiang

2018-02-16

An efficient and step-economical approach to access functionalized pyrrolizidine derivatives by a one-pot tandem sequence, including an aldol condensation and subsequent 1,3-dipolar cycloaddition process, has been developed, starting from acetone, aldehyde, and proline. A number of substituted aromatic aldehydes were amenable to this transformation, and the desired products, racemic 7a-7w and chiral 9a-9m, were obtained with excellent regioselectivities and outstanding diastereoselectivities. Moreover, in situ NMR studies revealed MgSO 4 could effectively promote the aldol condensation pathway in this tandem process.
Evolutionary Dynamics of the Gametologous CTNNB1 Gene on the Z and W Chromosomes of Snakes.

PubMed

Laopichienpong, Nararat; Muangmai, Narongrit; Chanhome, Lawan; Suntrarachun, Sunutcha; Twilprawat, Panupon; Peyachoknagul, Surin; Srikulnath, Kornsorn

2017-03-01

Snakes exhibit genotypic sex determination with female heterogamety (ZZ males and ZW females), and the state of sex chromosome differentiation also varies among lineages. To investigate the evolutionary history of homologous genes located in the nonrecombining region of differentiated sex chromosomes in snakes, partial sequences of the gametologous CTNNB1 gene were analyzed for 12 species belonging to henophid (Cylindrophiidae, Xenopeltidae, and Pythonidae) and caenophid snakes (Viperidae, Elapidae, and Colubridae). Nonsynonymous/synonymous substitution ratios (Ka/Ks) in coding sequences were low (Ka/Ks < 1) between CTNNB1Z and CTNNB1W, suggesting that these 2 genes may have similar functional properties. However, frequencies of intron sequence substitutions and insertion–deletions were higher in CTNNB1Z than CTNNB1W, suggesting that Z-linked sequences evolved faster than W-linked sequences. Molecular phylogeny based on both intron and exon sequences showed the presence of 2 major clades: 1) Z-linked sequences of Caenophidia and 2) W-linked sequences of Caenophidia clustered with Z-linked sequences of Henophidia, which suggests that the sequence divergence between CTNNB1Z and CTNNB1W in Caenophidia may have occurred by the cessation of recombination after the split from Henophidia.
The nucleotide sequence of HLA-B{sup *}2704 reveals a new amino acid substitution in exon 4 which is also present in HLA-B{sup *}2706

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rudwaleit, M.; Bowness, P.; Wordsworth, P.

1996-12-31

The HLA-B27 subtype HLA-B{sup *}2704 is virtually absent in Caucasians but common in Orientals, where it is associated with ankylosing spondylitis. The amino acid sequence of HLA-B{sup *}2704 has been established by peptide mapping and was shown to differ by two amino acids from HLA-B{sup *}2705, HLA-B{sup *}2704 is characterized by a serine for aspartic acid substitution at position 77 and glutamic acid for valine at position 152. To date, however, no nucleotide sequence confirming these changes at the DNA level has been published. 13 refs., 2 figs.
The cyc1-11 mutation in yeast reverts by recombination with a nonallelic gene: composite genes determining the iso-cytochromes c.

PubMed Central

Ernst, J F; Stewart, J W; Sherman, F

1981-01-01

DNA sequence analysis of a cloned fragment directly established that the cyc1-11 mutation of iso-1-cytochrome c in the yeast Saccharomyces cerevisiae is a two-base-pair substitution that changes the CCA proline codon at amino acid position 76 to a UAA nonsense codon. Analysis of 11 revertant proteins and one cloned revertant gene showed that reversion of the cyc1-11 mutation can occur in three ways: a single base-pair substitution, which produces a serine replacement at position 76; recombination with the nonallelic CYC7 gene of iso-2-cytochrome c, which causes replacement of a segment in the cyc1-11 gene by the corresponding segment of the CYC7 gene; and either a two-base-pair substitution or recombination with the CYC7 gene, which causes the formation of the normal iso-1-cytochrome c sequence. These results demonstrate the occurrence of low frequencies of recombination between nonallelic genes having extensive but not complete homology. The formation of composite genes that share sequences from nonallelic genes may be an evolutionary mechanism for producing protein diversities and for maintaining identical sequences at different loci. Images PMID:6273865
Suppression Analysis Reveals a Functional Difference between the Serines in Positions Two and Five in the Consensus Sequence of the C-Terminal Domain of Yeast RNA Polymerase II

PubMed Central

Yuryev, A.; Corden, J. L.

1996-01-01

The largest subunit of RNA polymerase II contains a repetitive C-terminal domain (CTD) consisting of tandem repeats of the consensus sequence Tyr(1)Ser(2)Pro(3)Thr(4) Ser(5)Pro(6) Ser(7). Substitution of nonphosphorylatable amino acids at positions two or five of the Saccharomyces cerevisiae CTD is lethal. We developed a selection ssytem for isolating suppressors of this lethal phenotype and cloned a gene, SCA1 (suppressor of CTD alanine), which complements recessive suppressors of lethal multiple-substitution mutations. A partial deletion of SCA1 (sca1Δ::hisG) suppresses alanine or glutamate substitutions at position two of the consensus CTD sequence, and a lethal CTD truncation mutation, but SCA1 deletion does not suppress alanine or glutamate substitutions at position five. SCA1 is identical to SRB9, a suppressor of a cold-sensitive CTD truncation mutation. Strains carrying dominant SRB mutations have the same suppression properties as a sca1Δ::hisG strain. These results reveal a functional difference between positions two and five of the consensus CTD heptapeptide repeat. The ability of SCA1 and SRB mutant alleles to suppress CTD truncation mutations suggest that substitutions at position two, but not at position five, cause a defect in RNA polymerase II function similar to that introduced by CTD truncation. PMID:8725217
Calculating Higher-Order Moments of Phylogenetic Stochastic Mapping Summaries in Linear Time.

PubMed

Dhar, Amrit; Minin, Vladimir N

2017-05-01

Stochastic mapping is a simulation-based method for probabilistically mapping substitution histories onto phylogenies according to continuous-time Markov models of evolution. This technique can be used to infer properties of the evolutionary process on the phylogeny and, unlike parsimony-based mapping, conditions on the observed data to randomly draw substitution mappings that do not necessarily require the minimum number of events on a tree. Most stochastic mapping applications simulate substitution mappings only to estimate the mean and/or variance of two commonly used mapping summaries: the number of particular types of substitutions (labeled substitution counts) and the time spent in a particular group of states (labeled dwelling times) on the tree. Fast, simulation-free algorithms for calculating the mean of stochastic mapping summaries exist. Importantly, these algorithms scale linearly in the number of tips/leaves of the phylogenetic tree. However, to our knowledge, no such algorithm exists for calculating higher-order moments of stochastic mapping summaries. We present one such simulation-free dynamic programming algorithm that calculates prior and posterior mapping variances and scales linearly in the number of phylogeny tips. Our procedure suggests a general framework that can be used to efficiently compute higher-order moments of stochastic mapping summaries without simulations. We demonstrate the usefulness of our algorithm by extending previously developed statistical tests for rate variation across sites and for detecting evolutionarily conserved regions in genomic sequences.
Calculating Higher-Order Moments of Phylogenetic Stochastic Mapping Summaries in Linear Time

PubMed Central

Dhar, Amrit

2017-01-01

Abstract Stochastic mapping is a simulation-based method for probabilistically mapping substitution histories onto phylogenies according to continuous-time Markov models of evolution. This technique can be used to infer properties of the evolutionary process on the phylogeny and, unlike parsimony-based mapping, conditions on the observed data to randomly draw substitution mappings that do not necessarily require the minimum number of events on a tree. Most stochastic mapping applications simulate substitution mappings only to estimate the mean and/or variance of two commonly used mapping summaries: the number of particular types of substitutions (labeled substitution counts) and the time spent in a particular group of states (labeled dwelling times) on the tree. Fast, simulation-free algorithms for calculating the mean of stochastic mapping summaries exist. Importantly, these algorithms scale linearly in the number of tips/leaves of the phylogenetic tree. However, to our knowledge, no such algorithm exists for calculating higher-order moments of stochastic mapping summaries. We present one such simulation-free dynamic programming algorithm that calculates prior and posterior mapping variances and scales linearly in the number of phylogeny tips. Our procedure suggests a general framework that can be used to efficiently compute higher-order moments of stochastic mapping summaries without simulations. We demonstrate the usefulness of our algorithm by extending previously developed statistical tests for rate variation across sites and for detecting evolutionarily conserved regions in genomic sequences. PMID:28177780
C. elegans whole-genome sequencing reveals mutational signatures related to carcinogens and DNA repair deficiency.

PubMed

Meier, Bettina; Cooke, Susanna L; Weiss, Joerg; Bailly, Aymeric P; Alexandrov, Ludmil B; Marshall, John; Raine, Keiran; Maddison, Mark; Anderson, Elizabeth; Stratton, Michael R; Gartner, Anton; Campbell, Peter J

2014-10-01

Mutation is associated with developmental and hereditary disorders, aging, and cancer. While we understand some mutational processes operative in human disease, most remain mysterious. We used Caenorhabditis elegans whole-genome sequencing to model mutational signatures, analyzing 183 worm populations across 17 DNA repair-deficient backgrounds propagated for 20 generations or exposed to carcinogens. The baseline mutation rate in C. elegans was approximately one per genome per generation, not overtly altered across several DNA repair deficiencies over 20 generations. Telomere erosion led to complex chromosomal rearrangements initiated by breakage-fusion-bridge cycles and completed by simultaneously acquired, localized clusters of breakpoints. Aflatoxin B1 induced substitutions of guanines in a GpC context, as observed in aflatoxin-induced liver cancers. Mutational burden increased with impaired nucleotide excision repair. Cisplatin and mechlorethamine, DNA crosslinking agents, caused dose- and genotype-dependent signatures among indels, substitutions, and rearrangements. Strikingly, both agents induced clustered rearrangements resembling "chromoanasynthesis," a replication-based mutational signature seen in constitutional genomic disorders, suggesting that interstrand crosslinks may play a pathogenic role in such events. Cisplatin mutagenicity was most pronounced in xpf-1 mutants, suggesting that this gene critically protects cells against platinum chemotherapy. Thus, experimental model systems combined with genome sequencing can recapture and mechanistically explain mutational signatures associated with human disease. © 2014 Meier et al.; Published by Cold Spring Harbor Laboratory Press.
[Genetic evidence for recombination and mutation in the emergence of human enterovirus 71].

PubMed

Liu, Ai-Ping; Tan, Hui; Xie, Qun; Chen, Bai-Tang; Liu, Xiao-Feng; Zhang, Yong

2014-09-01

We wished to understand the genetic recombination and phylogenetic characteristics of human en- terovirus A71 (EV-A71) and to explore its potential virulence-related sites. Full-length genomes of three EV-A71 strains isolated from patients in Chenzhou City (China) were sequenced and analyzed. Possible re- combination events and crossover sites were analyzed with Recombination Detection Program v4. 1. 6 by comparison with the complete genome sequences of 231 strains of EV-A71. Similarly, plot and bootscanning analyses were undertaken with SimPlot v3. 5. 1. Phylogenetic trees based on the sequences of VP1 regions were constructed with MEGA v5. 2 using the Kimura two-parameter model and neighbor-joining method. Results suggested that recombination events were detected among the three EV-A71 isolates from Chenzhou City. The common main parent sequence was from JF799986 isolated from samples in Guang- zhou City (China) in 2009, and the minor parent sequence was TW/70516/08. Intertypic recombination e- vents were found in the C4b strain (strain SHZH98 isolated in 1998) and C4a strain (Fuyang strain isola- ted in 2008) with the prototype strains of CVA4 and CVA14 in the 3D region. The chi-square test was used to screen-out potential virulence-related sites with nucleotide substitutions of different types of hand, foot, and mouth disease (HFMD) cases using SPSS v19.0. Results suggested that there were no significant nucleotide substitutions between death cases and severe-HFMD cases. Eighteen significant nucleotide substitutions were found between death/severe-HFMD cases and mild-HFMD cases, and all these 18 substitutions were distributed only in P2 and P3 regions. Intertypic recombination among the predominant circulating EV-A71 strains in the Chinese mainland and other EV-A strains probably dates before 1998, and intratypic recombination might have occurred frequently in the HFMD outbreak from 2008 to 2012. Substitutions in the non-capsid region may be correlated with the changes in virulence of EV-A71. These data suggest that researchers should pay more attention to the relationships between substitutions in the noncapsid region and the virulence of the virus.
Evaluation of Different Oligonucleotide Base Substitutions at CpG Binding sites in Multiplex Bisulfite-PCR sequencing.

PubMed

Lu, Jennifer; Ru, Kelin; Candiloro, Ida; Dobrovic, Alexander; Korbie, Darren; Trau, Matt

2017-03-22

Multiplex bisulfite-PCR sequencing is a convenient and scalable method for the quantitative determination of the methylation state of target DNA regions. A challenge of this application is the presence of CpGs in the same region where primers are being placed. A common solution to the presence of CpGs within a primer-binding region is to substitute a base degeneracy at the cytosine position. However, the efficacy of different substitutions and the extent to which bias towards methylated or unmethylated templates may occur has never been evaluated in bisulfite multiplex sequencing applications. In response, we examined the performance of four different primer substitutions at the cytosine position of CpG's contained within the PCR primers. In this study, deoxyinosine-, 5-nitroindole-, mixed-base primers and primers with an abasic site were evaluated across a series of methylated controls. Primers that contained mixed- or deoxyinosine- base modifications performed most robustly. Mixed-base primers were further selected to determine the conditions that induce bias towards methylated templates. This identified an optimized set of conditions where the methylated state of bisulfite DNA templates can be accurately assessed using mixed-base primers, and expands the scope of bisulfite resequencing assays when working with challenging templates.
West Nile virus (WNV) genome RNAs with up to three adjacent mutations that disrupt long distance 5'-3' cyclization sequence basepairs are viable

DOE Office of Scientific and Technical Information (OSTI.GOV)

Basu, Mausumi; Brinton, Margo A., E-mail: mbrinton@gsu.ed

2011-03-30

Mosquito-borne flavivirus genomes contain conserved 5' and 3' cyclization sequences (CYC) that facilitate long distance RNA-RNA interactions. In previous studies, flavivirus replicon RNA replication was completely inhibited by single or multiple mismatching CYC nt substitutions. In the present study, full-length WNV genomes with one, two or three mismatching CYC substitutions showed reduced replication efficiencies but were viable and generated revertants with increased replication efficiency. Several different three adjacent mismatching CYC substitution mutant RNAs were rescued by a second site mutation that created an additional basepair (nts 147-10913) on the internal genomic side of the 5'-3' CYC. The finding that full-lengthmore » genomes with up to three mismatching CYC mutations are viable and can be rescued by a single nt spontaneous mutation indicates that more than three adjacent CYC basepair substitutions would be required to increase the safety of vaccine genomes by creating mismatches in inter-genomic recombinants.« less
Training alignment parameters for arbitrary sequencers with LAST-TRAIN.

PubMed

Hamada, Michiaki; Ono, Yukiteru; Asai, Kiyoshi; Frith, Martin C

2017-03-15

LAST-TRAIN improves sequence alignment accuracy by inferring substitution and gap scores that fit the frequencies of substitutions, insertions, and deletions in a given dataset. We have applied it to mapping DNA reads from IonTorrent and PacBio RS, and we show that it reduces reference bias for Oxford Nanopore reads. the source code is freely available at http://last.cbrc.jp/. mhamada@waseda.jp or mcfrith@edu.k.u-tokyo.ac.jp. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Evolutionary inference via the Poisson Indel Process.

PubMed

Bouchard-Côté, Alexandre; Jordan, Michael I

2013-01-22

We address the problem of the joint statistical inference of phylogenetic trees and multiple sequence alignments from unaligned molecular sequences. This problem is generally formulated in terms of string-valued evolutionary processes along the branches of a phylogenetic tree. The classic evolutionary process, the TKF91 model [Thorne JL, Kishino H, Felsenstein J (1991) J Mol Evol 33(2):114-124] is a continuous-time Markov chain model composed of insertion, deletion, and substitution events. Unfortunately, this model gives rise to an intractable computational problem: The computation of the marginal likelihood under the TKF91 model is exponential in the number of taxa. In this work, we present a stochastic process, the Poisson Indel Process (PIP), in which the complexity of this computation is reduced to linear. The Poisson Indel Process is closely related to the TKF91 model, differing only in its treatment of insertions, but it has a global characterization as a Poisson process on the phylogeny. Standard results for Poisson processes allow key computations to be decoupled, which yields the favorable computational profile of inference under the PIP model. We present illustrative experiments in which Bayesian inference under the PIP model is compared with separate inference of phylogenies and alignments.
Evolutionary inference via the Poisson Indel Process

PubMed Central

Bouchard-Côté, Alexandre; Jordan, Michael I.

2013-01-01

We address the problem of the joint statistical inference of phylogenetic trees and multiple sequence alignments from unaligned molecular sequences. This problem is generally formulated in terms of string-valued evolutionary processes along the branches of a phylogenetic tree. The classic evolutionary process, the TKF91 model [Thorne JL, Kishino H, Felsenstein J (1991) J Mol Evol 33(2):114–124] is a continuous-time Markov chain model composed of insertion, deletion, and substitution events. Unfortunately, this model gives rise to an intractable computational problem: The computation of the marginal likelihood under the TKF91 model is exponential in the number of taxa. In this work, we present a stochastic process, the Poisson Indel Process (PIP), in which the complexity of this computation is reduced to linear. The Poisson Indel Process is closely related to the TKF91 model, differing only in its treatment of insertions, but it has a global characterization as a Poisson process on the phylogeny. Standard results for Poisson processes allow key computations to be decoupled, which yields the favorable computational profile of inference under the PIP model. We present illustrative experiments in which Bayesian inference under the PIP model is compared with separate inference of phylogenies and alignments. PMID:23275296
A KCNH2 branch point mutation causing aberrant splicing contributes to an explanation of genotype-negative long QT syndrome.

PubMed

Crotti, Lia; Lewandowska, Marzena A; Schwartz, Peter J; Insolia, Roberto; Pedrazzini, Matteo; Bussani, Erica; Dagradi, Federica; George, Alfred L; Pagani, Franco

2009-02-01

Genetic screening of long QT syndrome (LQTS) fails to identify disease-causing mutations in about 30% of patients. So far, molecular screening has focused mainly on coding sequence mutations or on substitutions at canonical splice sites. The purpose of this study was to explore the possibility that intronic variants not at canonical splice sites might affect splicing regulatory elements, lead to aberrant transcripts, and cause LQTS. Molecular screening was performed through DHPLC and sequence analysis. The role of the intronic mutation identified was assessed with a hybrid minigene splicing assay. A three-generation LQTS family was investigated. Molecular screening failed to identify an obvious disease-causing mutation in the coding sequences of the major LQTS genes but revealed an intronic A-to-G substitution in KCNH2 (IVS9-28A/G) cosegregating with the clinical phenotype in family members. In vitro analysis proved that the mutation disrupts the acceptor splice site definition by affecting the branch point (BP) sequence and promoting intron retention. We further demonstrated a tight functional relationship between the BP and the polypyrimidine tract, whose weakness is responsible for the pathological effect of the IVS9-28A/G mutation. We identified a novel BP mutation in KCNH2 that disrupts the intron 9 acceptor splice site definition and causes LQT2. The present finding demonstrates that intronic mutations affecting pre-mRNA processing may contribute to the failure of traditional molecular screening in identifying disease-causing mutations in LQTS subjects and offers a rationale strategy for the reduction of genotype-negative cases.
Influence of 5'-flanking sequence on 4.5SI RNA gene transcription by RNA polymerase III.

PubMed

Gogolevskaya, Irina K; Stasenko, Danil V; Tatosyan, Karina A; Kramerov, Dmitri A

2018-05-01

Short nuclear 4.5SI RNA can be found in three related rodent families. Its function remains unknown. The genes of 4.5SI RNA contain an internal promoter of RNA polymerase III composed of the boxes A and B. Here, the effect of the sequence immediately upstream of the mouse 4.5SI RNA gene on its transcription was studied. The gene with deletions and substitutions in the 5'-flanking sequence was used to transfect HeLa cells and its transcriptional activity was evaluated from the cellular level of 4.5SI RNA. Single-nucleotide substitutions in the region adjacent to the transcription start site (positions -2 to -8) decreased the expression activity of the gene down to 40%-60% of the control. The substitution of the conserved pentanucleotide AGAAT (positions -14 to -18) could either decrease (43%-56%) or increase (134%) the gene expression. A TATA-like box (TACATGA) was found at positions -24 to -30 of the 4.5SI RNA gene. Its replacement with a polylinker fragment of the vector did not decrease the transcription level, while its replacement with a GC-rich sequence almost completely (down to 2%-5%) suppressed the transcription of the 4.5SI RNA gene. The effect of plasmid sequences bordering the gene on its transcription by RNA polymerase III is discussed.
A New Challenge for Compression Algorithms: Genetic Sequences.

ERIC Educational Resources Information Center

Grumbach, Stephane; Tahi, Fariza

1994-01-01

Analyzes the properties of genetic sequences that cause the failure of classical algorithms used for data compression. A lossless algorithm, which compresses the information contained in DNA and RNA sequences by detecting regularities such as palindromes, is presented. This algorithm combines substitutional and statistical methods and appears to…
Favorable 2'-substitution in the loop region of a thrombin-binding DNA aptamer.

PubMed

Awachat, Ragini; Wagh, Atish A; Aher, Manisha; Fernandes, Moneesha; Kumar, Vaijayanti A

2018-06-01

Simple 2'-OMe-chemical modification in the loop region of the 15mer G-rich DNA sequence GGTTGGTGTGGTTGG is reported. The G-quadruplex structure of this thrombin-binding aptamer (TBA), is stabilized by single modifications (T → 2'-OMe-U), depending on the position of the modification. The structural stability also renders significantly increased inhibition of thrombin-induced fibrin polymerization, a process closely associated with blood-clotting. Copyright © 2018 Elsevier Ltd. All rights reserved.
Revised Mechanism and Improved Efficiency of the QuikChange Site-Directed Mutagenesis Method.

PubMed

Xia, Yongzhen; Xun, Luying

2017-01-01

Site-directed mutagenesis has been widely used for the substitution, addition or deletion of nucleotide residues in a defined DNA sequence. QuikChange™ site-directed mutagenesis and its related protocols have been widely used for this purpose because of convenience and efficiency. We have recently demonstrated that the mechanism of the QuikChange™ site-directed mutagenesis process is different from that being proposed. The new mechanism promotes the use of partially overlapping primers and commercial PCR enzymes for efficient PCR and mutagenesis.
A DNA Mini-Barcoding System for Authentication of Processed Fish Products.

PubMed

Shokralla, Shadi; Hellberg, Rosalee S; Handy, Sara M; King, Ian; Hajibabaei, Mehrdad

2015-10-30

Species substitution is a form of seafood fraud for the purpose of economic gain. DNA barcoding utilizes species-specific DNA sequence information for specimen identification. Previous work has established the usability of short DNA sequences-mini-barcodes-for identification of specimens harboring degraded DNA. This study aims at establishing a DNA mini-barcoding system for all fish species commonly used in processed fish products in North America. Six mini-barcode primer pairs targeting short (127-314 bp) fragments of the cytochrome c oxidase I (CO1) DNA barcode region were developed by examining over 8,000 DNA barcodes from species in the U.S. Food and Drug Administration (FDA) Seafood List. The mini-barcode primer pairs were then tested against 44 processed fish products representing a range of species and product types. Of the 44 products, 41 (93.2%) could be identified at the species or genus level. The greatest mini-barcoding success rate found with an individual primer pair was 88.6% compared to 20.5% success rate achieved by the full-length DNA barcode primers. Overall, this study presents a mini-barcoding system that can be used to identify a wide range of fish species in commercial products and may be utilized in high throughput DNA sequencing for authentication of heavily processed fish products.

Classification of rare missense substitutions, using risk surfaces, with genetic- and molecular-epidemiology applications.

PubMed

Tavtigian, Sean V; Byrnes, Graham B; Goldgar, David E; Thomas, Alun

2008-11-01

Many individually rare missense substitutions are encountered during deep resequencing of candidate susceptibility genes and clinical mutation screening of known susceptibility genes. BRCA1 and BRCA2 are among the most resequenced of all genes, and clinical mutation screening of these genes provides an extensive data set for analysis of rare missense substitutions. Align-GVGD is a mathematically simple missense substitution analysis algorithm, based on the Grantham difference, which has already contributed to classification of missense substitutions in BRCA1, BRCA2, and CHEK2. However, the distribution of genetic risk as a function of Align-GVGD's output variables Grantham variation (GV) and Grantham deviation (GD) has not been well characterized. Here, we used data from the Myriad Genetic Laboratories database of nearly 70,000 full-sequence tests plus two risk estimates, one approximating the odds ratio and the other reflecting strength of selection, to display the distribution of risk in the GV-GD plane as a series of surfaces. We abstracted contours from the surfaces and used the contours to define a sequence of missense substitution grades ordered from greatest risk to least risk. The grades were validated internally using a third, personal and family history-based, measure of risk. The Align-GVGD grades defined here are applicable to both the genetic epidemiology problem of classifying rare missense substitutions observed in known susceptibility genes and the molecular epidemiology problem of analyzing rare missense substitutions observed during case-control mutation screening studies of candidate susceptibility genes. (c) 2008 Wiley-Liss, Inc.
Recovery of West Nile Virus Envelope Protein Domain III Chimeras with Altered Antigenicity and Mouse Virulence

PubMed Central

McAuley, Alexander J.; Torres, Maricela; Plante, Jessica A.; Huang, Claire Y.-H.; Bente, Dennis A.

2016-01-01

ABSTRACT Flaviviruses are positive-sense, single-stranded RNA viruses responsible for millions of human infections annually. The envelope (E) protein of flaviviruses comprises three structural domains, of which domain III (EIII) represents a discrete subunit. The EIII gene sequence typically encodes epitopes recognized by virus-specific, potently neutralizing antibodies, and EIII is believed to play a major role in receptor binding. In order to assess potential interactions between EIII and the remainder of the E protein and to assess the effects of EIII sequence substitutions on the antigenicity, growth, and virulence of a representative flavivirus, chimeric viruses were generated using the West Nile virus (WNV) infectious clone, into which EIIIs from nine flaviviruses with various levels of genetic diversity from WNV were substituted. Of the constructs tested, chimeras containing EIIIs from Koutango virus (KOUV), Japanese encephalitis virus (JEV), St. Louis encephalitis virus (SLEV), and Bagaza virus (BAGV) were successfully recovered. Characterization of the chimeras in vitro and in vivo revealed differences in growth and virulence between the viruses, with in vivo pathogenesis often not being correlated with in vitro growth. Taken together, the data demonstrate that substitutions of EIII can allow the generation of viable chimeric viruses with significantly altered antigenicity and virulence. IMPORTANCE The envelope (E) glycoprotein is the major protein present on the surface of flavivirus virions and is responsible for mediating virus binding and entry into target cells. Several viable West Nile virus (WNV) variants with chimeric E proteins in which the putative receptor-binding domain (EIII) sequences of other mosquito-borne flaviviruses were substituted in place of the WNV EIII were recovered, although the substitution of several more divergent EIII sequences was not tolerated. The differences in virulence and tissue tropism observed with the chimeric viruses indicate a significant role for this sequence in determining the pathogenesis of the virus within the mammalian host. Our studies demonstrate that these chimeras are viable and suggest that such recombinant viruses may be useful for investigation of domain-specific antibody responses and the more extensive definition of the contributions of EIII to the tropism and pathogenesis of WNV or other flaviviruses. PMID:26912625
Recovery of West Nile Virus Envelope Protein Domain III Chimeras with Altered Antigenicity and Mouse Virulence.

PubMed

McAuley, Alexander J; Torres, Maricela; Plante, Jessica A; Huang, Claire Y-H; Bente, Dennis A; Beasley, David W C

2016-05-01

Flaviviruses are positive-sense, single-stranded RNA viruses responsible for millions of human infections annually. The envelope (E) protein of flaviviruses comprises three structural domains, of which domain III (EIII) represents a discrete subunit. The EIII gene sequence typically encodes epitopes recognized by virus-specific, potently neutralizing antibodies, and EIII is believed to play a major role in receptor binding. In order to assess potential interactions between EIII and the remainder of the E protein and to assess the effects of EIII sequence substitutions on the antigenicity, growth, and virulence of a representative flavivirus, chimeric viruses were generated using the West Nile virus (WNV) infectious clone, into which EIIIs from nine flaviviruses with various levels of genetic diversity from WNV were substituted. Of the constructs tested, chimeras containing EIIIs from Koutango virus (KOUV), Japanese encephalitis virus (JEV), St. Louis encephalitis virus (SLEV), and Bagaza virus (BAGV) were successfully recovered. Characterization of the chimeras in vitro and in vivo revealed differences in growth and virulence between the viruses, within vivo pathogenesis often not being correlated within vitro growth. Taken together, the data demonstrate that substitutions of EIII can allow the generation of viable chimeric viruses with significantly altered antigenicity and virulence. The envelope (E) glycoprotein is the major protein present on the surface of flavivirus virions and is responsible for mediating virus binding and entry into target cells. Several viable West Nile virus (WNV) variants with chimeric E proteins in which the putative receptor-binding domain (EIII) sequences of other mosquito-borne flaviviruses were substituted in place of the WNV EIII were recovered, although the substitution of several more divergent EIII sequences was not tolerated. The differences in virulence and tissue tropism observed with the chimeric viruses indicate a significant role for this sequence in determining the pathogenesis of the virus within the mammalian host. Our studies demonstrate that these chimeras are viable and suggest that such recombinant viruses may be useful for investigation of domain-specific antibody responses and the more extensive definition of the contributions of EIII to the tropism and pathogenesis of WNV or other flaviviruses. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Turn stability in beta-hairpin peptides: Investigation of peptides containing 3:5 type I G1 bulge turns.

PubMed

Blandl, Tamas; Cochran, Andrea G; Skelton, Nicholas J

2003-02-01

The turn-forming ability of a series of three-residue sequences was investigated by substituting them into a well-characterized beta-hairpin peptide. The starting scaffold, bhpW, is a disulfide-cyclized 10-residue peptide that folds into a stable beta-hairpin with two antiparallel strands connected by a two-residue reverse turn. Substitution of the central two residues with the three-residue test sequences leads to less stable hairpins, as judged by thiol-disulfide equilibrium measurements. However, analysis of NMR parameters indicated that each molecule retains a significant folded population, and that the type of turn adopted by the three-residue sequence is the same in all cases. The solution structure of a selected peptide with a PDG turn contained an antiparallel beta-hairpin with a 3:5 type I + G1 bulge turn. Analysis of the energetic contributions of individual turn residues in the series of peptides indicates that substitution effects have significant context dependence, limiting the predictive power of individual amino acid propensities for turn formation. The most stable and least stable sequences were also substituted into a more stable disulfide-cyclized scaffold and a linear beta-hairpin scaffold. The relative stabilities remained the same, suggesting that experimental measurements in the bhpW context are a useful way to evaluate turn stability for use in protein design projects. Moreover, these scaffolds are capable of displaying a diverse set of turns, which can be exploited for the mimicry of protein loops or for generating libraries of reverse turns.
Genome Sequencing and Analysis of the Tasmanian Devil and Its Transmissible Cancer

PubMed Central

Murchison, Elizabeth P.; Schulz-Trieglaff, Ole B.; Ning, Zemin; Alexandrov, Ludmil B.; Bauer, Markus J.; Fu, Beiyuan; Hims, Matthew; Ding, Zhihao; Ivakhno, Sergii; Stewart, Caitlin; Ng, Bee Ling; Wong, Wendy; Aken, Bronwen; White, Simon; Alsop, Amber; Becq, Jennifer; Bignell, Graham R.; Cheetham, R. Keira; Cheng, William; Connor, Thomas R.; Cox, Anthony J.; Feng, Zhi-Ping; Gu, Yong; Grocock, Russell J.; Harris, Simon R.; Khrebtukova, Irina; Kingsbury, Zoya; Kowarsky, Mark; Kreiss, Alexandre; Luo, Shujun; Marshall, John; McBride, David J.; Murray, Lisa; Pearse, Anne-Maree; Raine, Keiran; Rasolonjatovo, Isabelle; Shaw, Richard; Tedder, Philip; Tregidgo, Carolyn; Vilella, Albert J.; Wedge, David C.; Woods, Gregory M.; Gormley, Niall; Humphray, Sean; Schroth, Gary; Smith, Geoffrey; Hall, Kevin; Searle, Stephen M.J.; Carter, Nigel P.; Papenfuss, Anthony T.; Futreal, P. Andrew; Campbell, Peter J.; Yang, Fengtang; Bentley, David R.; Evers, Dirk J.; Stratton, Michael R.

2012-01-01

Summary The Tasmanian devil (Sarcophilus harrisii), the largest marsupial carnivore, is endangered due to a transmissible facial cancer spread by direct transfer of living cancer cells through biting. Here we describe the sequencing, assembly, and annotation of the Tasmanian devil genome and whole-genome sequences for two geographically distant subclones of the cancer. Genomic analysis suggests that the cancer first arose from a female Tasmanian devil and that the clone has subsequently genetically diverged during its spread across Tasmania. The devil cancer genome contains more than 17,000 somatic base substitution mutations and bears the imprint of a distinct mutational process. Genotyping of somatic mutations in 104 geographically and temporally distributed Tasmanian devil tumors reveals the pattern of evolution and spread of this parasitic clonal lineage, with evidence of a selective sweep in one geographical area and persistence of parallel lineages in other populations. PaperClip PMID:22341448
The Preparation and Reaction of Phenyl-Substituted Pyrylium and Pyridinium Salts.

ERIC Educational Resources Information Center

Awartani, Radi; And Others

1986-01-01

Describes this reaction sequence involving reactivity and synthesis of heterocycles: (1) synthesis of 2,4,6-triphenylpyrylium tetrafluoroborate, II; (2) its reaction with nucleophiles; (3) reaction of pyrylium salt II with a primary amine (benzylamine, p-methoxybenzylamine, and furfurylamine) to form the N-substituted-2,4,6-triphenylpyridinium…
Drought-induced gene expression in Atriplex canescens (salt bush): Transcriptional and post transcriptional response

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cairney, J.; Hays, D.; Stockand, J.D.

1991-05-01

The rangeland shrub Atriplex canescens (saltbush) is extremely drought-tolerant and is capable of growing at water potentials below {minus}40 bar. To discover the molecular basis of this tolerance, the authors have isolated a number of cDNA clones of drought-stress induced genes. Analysis of the nucleotide sequence and expression of these genes in different tissues and in response to different stresses reveals the diversity of the stress response. Members of a drought-induced, multi-gene family, have been sequenced. Although 95% homologous, non-conservative substitutions result in proteins of different tertiary structure. Additionally, the genes are expressed through a number of mature forms ofmore » mRNA which may arise by alternative RNA processing.« less
GASP: Gapped Ancestral Sequence Prediction for proteins

PubMed Central

Edwards, Richard J; Shields, Denis C

2004-01-01

Background The prediction of ancestral protein sequences from multiple sequence alignments is useful for many bioinformatics analyses. Predicting ancestral sequences is not a simple procedure and relies on accurate alignments and phylogenies. Several algorithms exist based on Maximum Parsimony or Maximum Likelihood methods but many current implementations are unable to process residues with gaps, which may represent insertion/deletion (indel) events or sequence fragments. Results Here we present a new algorithm, GASP (Gapped Ancestral Sequence Prediction), for predicting ancestral sequences from phylogenetic trees and the corresponding multiple sequence alignments. Alignments may be of any size and contain gaps. GASP first assigns the positions of gaps in the phylogeny before using a likelihood-based approach centred on amino acid substitution matrices to assign ancestral amino acids. Important outgroup information is used by first working down from the tips of the tree to the root, using descendant data only to assign probabilities, and then working back up from the root to the tips using descendant and outgroup data to make predictions. GASP was tested on a number of simulated datasets based on real phylogenies. Prediction accuracy for ungapped data was similar to three alternative algorithms tested, with GASP performing better in some cases and worse in others. Adding simple insertions and deletions to the simulated data did not have a detrimental effect on GASP accuracy. Conclusions GASP (Gapped Ancestral Sequence Prediction) will predict ancestral sequences from multiple protein alignments of any size. Although not as accurate in all cases as some of the more sophisticated maximum likelihood approaches, it can process a wide range of input phylogenies and will predict ancestral sequences for gapped and ungapped residues alike. PMID:15350199
Determinants of Base-Pair Substitution Patterns Revealed by Whole-Genome Sequencing of DNA Mismatch Repair Defective Escherichia coli.

PubMed

Foster, Patricia L; Niccum, Brittany A; Popodi, Ellen; Townes, Jesse P; Lee, Heewook; MohammedIsmail, Wazim; Tang, Haixu

2018-06-15

Mismatch repair (MMR) is a major contributor to replication fidelity, but its impact varies with sequence context and the nature of the mismatch. Mutation accumulation experiments followed by whole-genome sequencing of MMR-defective E. coli strains yielded ≈30,000 base-pair substitutions, revealing mutational patterns across the entire chromosome. The base-pair substitution spectrum was dominated by A:T > G:C transitions, which occurred predominantly at the center base of 5'N A C3'+5'G T N3' triplets. Surprisingly, growth on minimal medium or at low temperature attenuated these mutations. Mononucleotide runs were also hotspots for base-pair substitutions, and the rate at which these occurred increased with run length. Comparison with ≈2000 base-pair substitutions accumulated in MMR-proficient strains revealed that both kinds of hotspots appeared in the wild-type spectrum and so are likely to be sites of frequent replication errors. In MMR-defective strains transitions were strand biased, occurring twice as often when A and C rather than T and G were on the lagging-strand template. Loss of nucleotide diphosphate kinase increases the cellular concentration of dCTP, which resulted in increased rates of mutations due to misinsertion of C opposite A and T. In an mmr ndk double mutant strain, these mutations were more frequent when the template A and T were on the leading strand, suggesting that lagging-strand synthesis was more error-prone or less well corrected by proofreading than was leading strand synthesis. Copyright © 2018, Genetics.
Nonsynonymous substitution in abalone sperm fertilization genes exceeds substitution in introns and mitochondrial DNA

PubMed Central

Metz, Edward C.; Robles-Sikisaka, Refugio; Vacquier, Victor D.

1998-01-01

Strong positive Darwinian selection acts on two sperm fertilization proteins, lysin and 18-kDa protein, from abalone (Haliotis). To understand the phylogenetic context for this dramatic molecular evolution, we obtained sequences of mitochondrial cytochrome c oxidase subunit I (mtCOI), and genomic sequences of lysin, 18-kDa, and a G protein subunit. Based on mtDNA differentiation, four north Pacific abalone species diverged within the past 2 million years (Myr), and remaining north Pacific species diverged over a period of 4–20 Myr. Between-species nonsynonymous differences in lysin and 18-kDa exons exceed nucleotide differences in introns by 3.5- to 24-fold. Remarkably, in some comparisons nonsynonymous substitutions in lysin and 18-kDa genes exceed synonymous substitutions in mtCOI. Lysin and 18-kDa intron/exon segments were sequenced from multiple red abalone individuals collected over a 1,200-km range. Only two nucleotide changes and two sites of slippage variation were detected in a total of >29,000 nucleotides surveyed. However, polymorphism in mtCOI and a G protein intron was found in this species. This finding suggests that positive selection swept one lysin allele and one 18-kDa allele to fixation. Similarities between mtCOI and lysin gene trees indicate that rapid adaptive evolution of lysin has occurred consistently through the history of the group. Comparisons with mtCOI molecular clock calibrations suggest that nonsynonymous substitutions accumulate 2–50 times faster in lysin and 18-kDa genes than in rapidly evolving mammalian genes. PMID:9724763
Stochastic dynamics of adaptive trait and neutral marker driven by eco-evolutionary feedbacks.

PubMed

Billiard, Sylvain; Ferrière, Régis; Méléard, Sylvie; Tran, Viet Chi

2015-11-01

How the neutral diversity is affected by selection and adaptation is investigated in an eco-evolutionary framework. In our model, we study a finite population in continuous time, where each individual is characterized by a trait under selection and a completely linked neutral marker. Population dynamics are driven by births and deaths, mutations at birth, and competition between individuals. Trait values influence ecological processes (demographic events, competition), and competition generates selection on trait variation, thus closing the eco-evolutionary feedback loop. The demographic effects of the trait are also expected to influence the generation and maintenance of neutral variation. We consider a large population limit with rare mutation, under the assumption that the neutral marker mutates faster than the trait under selection. We prove the convergence of the stochastic individual-based process to a new measure-valued diffusive process with jumps that we call Substitution Fleming-Viot Process (SFVP). When restricted to the trait space this process is the Trait Substitution Sequence first introduced by Metz et al. (1996). During the invasion of a favorable mutation, a genetical bottleneck occurs and the marker associated with this favorable mutant is hitchhiked. By rigorously analysing the hitchhiking effect and how the neutral diversity is restored afterwards, we obtain the condition for a time-scale separation; under this condition, we show that the marker distribution is approximated by a Fleming-Viot distribution between two trait substitutions. We discuss the implications of the SFVP for our understanding of the dynamics of neutral variation under eco-evolutionary feedbacks and illustrate the main phenomena with simulations. Our results highlight the joint importance of mutations, ecological parameters, and trait values in the restoration of neutral diversity after a selective sweep.
Mining for class-specific motifs in protein sequence classification

PubMed Central

2013-01-01

Background In protein sequence classification, identification of the sequence motifs or n-grams that can precisely discriminate between classes is a more interesting scientific question than the classification itself. A number of classification methods aim at accurate classification but fail to explain which sequence features indeed contribute to the accuracy. We hypothesize that sequences in lower denominations (n-grams) can be used to explore the sequence landscape and to identify class-specific motifs that discriminate between classes during classification. Discriminative n-grams are short peptide sequences that are highly frequent in one class but are either minimally present or absent in other classes. In this study, we present a new substitution-based scoring function for identifying discriminative n-grams that are highly specific to a class. Results We present a scoring function based on discriminative n-grams that can effectively discriminate between classes. The scoring function, initially, harvests the entire set of 4- to 8-grams from the protein sequences of different classes in the dataset. Similar n-grams of the same size are combined to form new n-grams, where the similarity is defined by positive amino acid substitution scores in the BLOSUM62 matrix. Substitution has resulted in a large increase in the number of discriminatory n-grams harvested. Due to the unbalanced nature of the dataset, the frequencies of the n-grams are normalized using a dampening factor, which gives more weightage to the n-grams that appear in fewer classes and vice-versa. After the n-grams are normalized, the scoring function identifies discriminative 4- to 8-grams for each class that are frequent enough to be above a selection threshold. By mapping these discriminative n-grams back to the protein sequences, we obtained contiguous n-grams that represent short class-specific motifs in protein sequences. Our method fared well compared to an existing motif finding method known as Wordspy. We have validated our enriched set of class-specific motifs against the functionally important motifs obtained from the NLSdb, Prosite and ELM databases. We demonstrate that this method is very generic; thus can be widely applied to detect class-specific motifs in many protein sequence classification tasks. Conclusion The proposed scoring function and methodology is able to identify class-specific motifs using discriminative n-grams derived from the protein sequences. The implementation of amino acid substitution scores for similarity detection, and the dampening factor to normalize the unbalanced datasets have significant effect on the performance of the scoring function. Our multipronged validation tests demonstrate that this method can detect class-specific motifs from a wide variety of protein sequence classes with a potential application to detecting proteome-specific motifs of different organisms. PMID:23496846
The nearly neutral and selection theories of molecular evolution under the fisher geometrical framework: substitution rate, population size, and complexity.

PubMed

Razeto-Barry, Pablo; Díaz, Javier; Vásquez, Rodrigo A

2012-06-01

The general theories of molecular evolution depend on relatively arbitrary assumptions about the relative distribution and rate of advantageous, deleterious, neutral, and nearly neutral mutations. The Fisher geometrical model (FGM) has been used to make distributions of mutations biologically interpretable. We explored an FGM-based molecular model to represent molecular evolutionary processes typically studied by nearly neutral and selection models, but in which distributions and relative rates of mutations with different selection coefficients are a consequence of biologically interpretable parameters, such as the average size of the phenotypic effect of mutations and the number of traits (complexity) of organisms. A variant of the FGM-based model that we called the static regime (SR) represents evolution as a nearly neutral process in which substitution rates are determined by a dynamic substitution process in which the population's phenotype remains around a suboptimum equilibrium fitness produced by a balance between slightly deleterious and slightly advantageous compensatory substitutions. As in previous nearly neutral models, the SR predicts a negative relationship between molecular evolutionary rate and population size; however, SR does not have the unrealistic properties of previous nearly neutral models such as the narrow window of selection strengths in which they work. In addition, the SR suggests that compensatory mutations cannot explain the high rate of fixations driven by positive selection currently found in DNA sequences, contrary to what has been previously suggested. We also developed a generalization of SR in which the optimum phenotype can change stochastically due to environmental or physiological shifts, which we called the variable regime (VR). VR models evolution as an interplay between adaptive processes and nearly neutral steady-state processes. When strong environmental fluctuations are incorporated, the process becomes a selection model in which evolutionary rate does not depend on population size, but is critically dependent on the complexity of organisms and mutation size. For SR as well as VR we found that key parameters of molecular evolution are linked by biological factors, and we showed that they cannot be fixed independently by arbitrary criteria, as has usually been assumed in previous molecular evolutionary models.
The Nearly Neutral and Selection Theories of Molecular Evolution Under the Fisher Geometrical Framework: Substitution Rate, Population Size, and Complexity

PubMed Central

Razeto-Barry, Pablo; Díaz, Javier; Vásquez, Rodrigo A.

2012-01-01

The general theories of molecular evolution depend on relatively arbitrary assumptions about the relative distribution and rate of advantageous, deleterious, neutral, and nearly neutral mutations. The Fisher geometrical model (FGM) has been used to make distributions of mutations biologically interpretable. We explored an FGM-based molecular model to represent molecular evolutionary processes typically studied by nearly neutral and selection models, but in which distributions and relative rates of mutations with different selection coefficients are a consequence of biologically interpretable parameters, such as the average size of the phenotypic effect of mutations and the number of traits (complexity) of organisms. A variant of the FGM-based model that we called the static regime (SR) represents evolution as a nearly neutral process in which substitution rates are determined by a dynamic substitution process in which the population’s phenotype remains around a suboptimum equilibrium fitness produced by a balance between slightly deleterious and slightly advantageous compensatory substitutions. As in previous nearly neutral models, the SR predicts a negative relationship between molecular evolutionary rate and population size; however, SR does not have the unrealistic properties of previous nearly neutral models such as the narrow window of selection strengths in which they work. In addition, the SR suggests that compensatory mutations cannot explain the high rate of fixations driven by positive selection currently found in DNA sequences, contrary to what has been previously suggested. We also developed a generalization of SR in which the optimum phenotype can change stochastically due to environmental or physiological shifts, which we called the variable regime (VR). VR models evolution as an interplay between adaptive processes and nearly neutral steady-state processes. When strong environmental fluctuations are incorporated, the process becomes a selection model in which evolutionary rate does not depend on population size, but is critically dependent on the complexity of organisms and mutation size. For SR as well as VR we found that key parameters of molecular evolution are linked by biological factors, and we showed that they cannot be fixed independently by arbitrary criteria, as has usually been assumed in previous molecular evolutionary models. PMID:22426879
Diversity of transcripts and transcript processing forms in plastids of the dinoflagellate alga Karenia mikimotoi.

PubMed

Dorrell, Richard G; Hinksman, George A; Howe, Christopher J

2016-02-01

Plastids produce a vast diversity of transcripts. These include mature transcripts containing coding sequences, and their processing precursors, as well as transcripts that lack direct coding functions, such as antisense transcripts. Although plastid transcriptomes have been characterised for many plant species, less is known about the transcripts produced in other plastid lineages. We characterised the transcripts produced in the fucoxanthin-containing plastids of the dinoflagellate alga Karenia mikimotoi. This plastid lineage, acquired through tertiary endosymbiosis, utilises transcript processing pathways that are very different from those found in plants and green algae, including 3' poly(U) tail addition, and extensive substitutional editing of transcript sequences. We have sequenced the plastid transcriptome of K. mikimotoi, and have detected evidence for divergent evolution of fucoxanthin plastid genomes. We have additionally characterised polycistronic and monocistronic transcripts from two plastid loci, psbD-tRNA (Met)-ycf4 and rpl36-rps13-rps11. We find evidence for a range of transcripts produced from each locus that differ in terms of editing state, 5' end cleavage position, and poly(U) tail addition. Finally, we identify antisense transcripts in K. mikimotoi, which appear to undergo different processing events from the corresponding sense transcripts. Overall, our study provides insights into the diversity of transcripts and processing intermediates found in plastid lineages across the eukaryotes.
Amino Acid Substitution in Trichophyton rubrum Squalene Epoxidase Associated with Resistance to Terbinafine

PubMed Central

Osborne, Colin S.; Leitner, Ingrid; Favre, Bertrand; Ryder, Neil S.

2005-01-01

There has only been one clinically confirmed case of terbinafine resistance in dermatophytes, where six sequential Trichophyton rubrum isolates from the same patient were found to be resistant to terbinafine and cross-resistant to other squalene epoxidase (SE) inhibitors. Microsomal SE activity from these resistant isolates was insensitive to terbinafine, suggesting a target-based mechanism of resistance (B. Favre, M. Ghannoum, and N. S. Ryder, Med. Mycol. 42:525-529, 2004). In this study, we have characterized at the molecular level the cause of the resistant phenotype of these clinical isolates. Cloning and sequencing of the SE gene and cDNA from T. rubrum revealed the presence of an intron in the gene and an open reading frame encoding a protein of 489 residues, with an equivalent similarity (57%) to both yeast and mammalian SEs. The nucleotide sequences of SE from two terbinafine-susceptible strains were identical whereas those of terbinafine-resistant strains, serially isolated from the same patient, each contained the same single missense introducing the amino acid substitution L393F. Introduction of the corresponding substitution in the Candida albicans SE gene (L398F) and expression of this gene in Saccharomyces cerevisiae conferred a resistant phenotype to the transformants when compared to those expressing the wild-type sequence. Terbinafine resistance in these T. rubrum clinical isolates appears to be due to a single amino acid substitution in SE. PMID:15980358
Mechanisms of Exchange Reactions of Primary and Secondary Alkyl Iodides with Elementary Iodine

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bujake, John E.; Pratt, M. W. T.; Noyes, Richard M.

1961-04-01

Several primary and secondary alkyl iodides exchange thermally with I/ sup 131/ in hexachlorobutadiene between 130 and 200 deg . If the solutions are saturated with oxygen at one atmosphere, rates of exchange fit the kinetic expression k/sub b/STARI! STAl/sub 2/!1/2. Degassed solutions always exchange faster than oxygen saturated ones, but methyl, ethyl, and n-propyl iodides show the same kinetics as with oxygen. Exchange rates of degassed isopropyl and neopentyl iodides also show contributions from a k/sub a/STARI! term. Exchange in degassed ethylene dichloride is 3 to 4 times as fast as in degassed hexachlorobutadiene. Activation energies for k/sub b/more » are usually about 27 to 31 kcal/mole. Effects of substitution on alpha carbon are illustrated by the rate sequence methyl < ethyl < i-propyl = sec-butyl. Effects of substitution on beta carbon are illustrated by the rate sequence ethyl < npropyl>> neopentyl. Since the rates of exchange of methyl, ethyl, and i-propyl iodides vary in the opposite direction from the sequence for bimolecular nucleophilic substitution, the explanation proposed suggests that for nucleophilic substitution the effect of added methyl groups on an alpha carbon is a steric hindrance to solvation by solvent dipoles rather than a steric hindrance to the group attacking the carbon atom itself.« less
Structure and stability of the ankyrin domain of the Drosophila Notch receptor.

PubMed

Zweifel, Mark E; Leahy, Daniel J; Hughson, Frederick M; Barrick, Doug

2003-11-01

The Notch receptor contains a conserved ankyrin repeat domain that is required for Notch-mediated signal transduction. The ankyrin domain of Drosophila Notch contains six ankyrin sequence repeats previously identified as closely matching the ankyrin repeat consensus sequence, and a putative seventh C-terminal sequence repeat that exhibits lower similarity to the consensus sequence. To better understand the role of the Notch ankyrin domain in Notch-mediated signaling and to examine how structure is distributed among the seven ankyrin sequence repeats, we have determined the crystal structure of this domain to 2.0 angstroms resolution. The seventh, C-terminal, ankyrin sequence repeat adopts a regular ankyrin fold, but the first, N-terminal ankyrin repeat, which contains a 15-residue insertion, appears to be largely disordered. The structure reveals a substantial interface between ankyrin polypeptides, showing a high degree of shape and charge complementarity, which may be related to homotypic interactions suggested from indirect studies. However, the Notch ankyrin domain remains largely monomeric in solution, demonstrating that this interface alone is not sufficient to promote tight association. Using the structure, we have classified reported mutations within the Notch ankyrin domain that are known to disrupt signaling into those that affect buried residues and those restricted to surface residues. We show that the buried substitutions greatly decrease protein stability, whereas the surface substitutions have only a marginal affect on stability. The surface substitutions are thus likely to interfere with Notch signaling by disrupting specific Notch-effector interactions and map the sites of these interactions.
Sequence space and the ongoing expansion of the protein universe.

PubMed

Povolotskaya, Inna S; Kondrashov, Fyodor A

2010-06-17

The need to maintain the structural and functional integrity of an evolving protein severely restricts the repertoire of acceptable amino-acid substitutions. However, it is not known whether these restrictions impose a global limit on how far homologous protein sequences can diverge from each other. Here we explore the limits of protein evolution using sequence divergence data. We formulate a computational approach to study the rate of divergence of distant protein sequences and measure this rate for ancient proteins, those that were present in the last universal common ancestor. We show that ancient proteins are still diverging from each other, indicating an ongoing expansion of the protein sequence universe. The slow rate of this divergence is imposed by the sparseness of functional protein sequences in sequence space and the ruggedness of the protein fitness landscape: approximately 98 per cent of sites cannot accept an amino-acid substitution at any given moment but a vast majority of all sites may eventually be permitted to evolve when other, compensatory, changes occur. Thus, approximately 3.5 x 10(9) yr has not been enough to reach the limit of divergent evolution of proteins, and for most proteins the limit of sequence similarity imposed by common function may not exceed that of random sequences.
Positive selection and propeptide repeats promote rapid interspecific divergence of a gastropod sperm protein.

PubMed

Hellberg, M E; Moy, G W; Vacquier, V D

2000-03-01

Male-specific proteins have increasingly been reported as targets of positive selection and are of special interest because of the role they may play in the evolution of reproductive isolation. We report the rapid interspecific divergence of cDNA encoding a major acrosomal protein of unknown function (TMAP) of sperm from five species of teguline gastropods. A mitochondrial DNA clock (calibrated by congeneric species divided by the Isthmus of Panama) estimates that these five species diverged 2-10 MYA. Inferred amino acid sequences reveal a propeptide that has diverged rapidly between species. The mature protein has diverged faster still due to high nonsynonymous substitution rates (> 25 nonsynonymous substitutions per site per 10(9) years). cDNA encoding the mature protein (89-100 residues) shows evidence of positive selection (Dn/Ds > 1) for 4 of 10 pairwise species comparisons. cDNA and predicted secondary-structure comparisons suggest that TMAP is neither orthologous nor paralogous to abalone lysin, and thus marks a second, phylogenetically independent, protein subject to strong positive selection in free-spawning marine gastropods. In addition, an internal repeat in one species (Tegula aureotincta) produces a duplicated cleavage site which results in two alternatively processed mature proteins differing by nine amino acid residues. Such alternative processing may provide a mechanism for introducing novel amino acid sequence variation at the amino-termini of proteins. Highly divergent TMAP N-termini from two other tegulines (Tegula regina and Norrisia norrisii) may have originated by such a mechanism.

Sequence diversity and molecular evolutionary rates between buffalo and cattle.

PubMed

Moaeen-ud-Din, M; Bilal, G

2015-02-01

Identification of genes of importance regarding production traits in buffalo is impaired by a paucity of genomic resources. Choice to fill this gap is to exploit data available for cow. The cross-species application of comparative genomics tools is potential gear to investigate the buffalo genome. However, this is dependent on nucleotide sequences similarity. In this study, gene diversity between buffalo and cattle was determined using 86 gene orthologues. There was approximately 3% difference in all genes in terms of nucleotide diversity and 0.267 ± 0.134 in amino acids, indicating the possibility for successfully using cross-species strategies for genomic studies. There were significantly higher non-synonymous substitutions both in cattle and buffalo; however, there was similar difference in terms of dN- dS (4.414 versus 4.745) in buffalo and cattle, respectively. Higher rate of non-synonymous substitutions at similar level in buffalo and cattle indicated a similar positive selection pressure. Results for relative rate test were assessed with the chi-squared test. There was no significance difference on unique mutations between cattle and buffalo lineages at synonymous sites. However, there was a significance difference on unique mutations for non-synonymous sites, indicating ongoing mutagenic process that generates substitutional mutation at approximately the same rate at silent sites. Moreover, despite of common ancestry, our results indicate a different divergent time among genes of cattle and buffalo. This is the first demonstration that variable rates of molecular evolution may be present within the family Bovidae. © 2014 Blackwell Verlag GmbH.
Synthesis of a Fluorescent Acridone Using a Grignard Addition, Oxidation, and Nucleophilic Aromatic Substitution Reaction Sequence

ERIC Educational Resources Information Center

Goodrich, Samuel; Patel, Miloni; Woydziak, Zachary R.

2015-01-01

A three-pot synthesis oriented for an undergraduate organic chemistry laboratory was developed to construct a fluorescent acridone molecule. This laboratory experiment utilizes Grignard addition to an aldehyde, alcohol oxidation, and iterative nucleophilic aromatic substitution steps to produce the final product. Each of the intermediates and the…
Expansion of inverted repeat does not decrease substitution rates in Pelargonium plastid genomes.

PubMed

Weng, Mao-Lun; Ruhlman, Tracey A; Jansen, Robert K

2017-04-01

For species with minor inverted repeat (IR) boundary changes in the plastid genome (plastome), nucleotide substitution rates were previously shown to be lower in the IR than the single copy regions (SC). However, the impact of large-scale IR expansion/contraction on plastid nucleotide substitution rates among closely related species remains unclear. We included plastomes from 22 Pelargonium species, including eight newly sequenced genomes, and used both pairwise and model-based comparisons to investigate the impact of the IR on sequence evolution in plastids. Ten types of plastome organization with different inversions or IR boundary changes were identified in Pelargonium. Inclusion in the IR was not sufficient to explain the variation of nucleotide substitution rates. Instead, the rate heterogeneity in Pelargonium plastomes was a mixture of locus-specific, lineage-specific and IR-dependent effects. Our study of Pelargonium plastomes that vary in IR length and gene content demonstrates that the evolutionary consequences of retaining these repeats are more complicated than previously suggested. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
Molecular adaptation in the world's deepest-living animal: Insights from transcriptome sequencing of the hadal amphipod Hirondellea gigas.

PubMed

Lan, Yi; Sun, Jin; Tian, Renmao; Bartlett, Douglas H; Li, Runsheng; Wong, Yue Him; Zhang, Weipeng; Qiu, Jian-Wen; Xu, Ting; He, Li-Sheng; Tabata, Harry G; Qian, Pei-Yuan

2017-07-01

The Challenger Deep in the Mariana Trench is the deepest point in the oceans of our planet. Understanding how animals adapt to this harsh environment characterized by high hydrostatic pressure, food limitation, dark and cold is of great scientific interest. Of the animals dwelling in the Challenger Deep, amphipods have been captured using baited traps. In this study, we sequenced the transcriptome of the amphipod Hirondellea gigas collected at a depth of 10,929 m from the East Pond of the Challenger Deep. Assembly of these sequences resulted in 133,041 contigs and 22,046 translated proteins. Functional annotation of these contigs was made using the go and kegg databases. Comparison of these translated proteins with those of four shallow-water amphipods revealed 10,731 gene families, of which 5659 were single-copy orthologs. Base substitution analysis on these single-copy orthologs showed that 62 genes are positively selected in H. gigas, including genes related to β-alanine biosynthesis, energy metabolism and genetic information processing. For multiple-copy orthologous genes, gene family expansion analysis revealed that cold-inducible proteins (i.e., transcription factors II A and transcription elongation factor 1) as well as zinc finger domains are expanded in H. gigas. Overall, our results indicate that genetic adaptation to the hadal environment by H. gigas may be mediated by both gene family expansion and amino acid substitutions of specific proteins. © 2017 John Wiley & Sons Ltd.
A TATA binding protein mutant with increased affinity for DNA directs transcription from a reversed TATA sequence in vivo.

PubMed

Spencer, J Vaughn; Arndt, Karen M

2002-12-01

The TATA-binding protein (TBP) nucleates the assembly and determines the position of the preinitiation complex at RNA polymerase II-transcribed genes. We investigated the importance of two conserved residues on the DNA binding surface of Saccharomyces cerevisiae TBP to DNA binding and sequence discrimination. Because they define a significant break in the twofold symmetry of the TBP-TATA interface, Ala100 and Pro191 have been proposed to be key determinants of TBP binding orientation and transcription directionality. In contrast to previous predictions, we found that substitution of an alanine for Pro191 did not allow recognition of a reversed TATA box in vivo; however, the reciprocal change, Ala100 to proline, resulted in efficient utilization of this and other variant TATA sequences. In vitro assays demonstrated that TBP mutants with the A100P and P191A substitutions have increased and decreased affinity for DNA, respectively. The TATA binding defect of TBP with the P191A mutation could be intragenically suppressed by the A100P substitution. Our results suggest that Ala100 and Pro191 are important for DNA binding and sequence recognition by TBP, that the naturally occurring asymmetry of Ala100 and Pro191 is not essential for function, and that a single amino acid change in TBP can lead to elevated DNA binding affinity and recognition of a reversed TATA sequence.
Discriminative Prediction of A-To-I RNA Editing Events from DNA Sequence

PubMed Central

Sun, Jiangming; Singh, Pratibha; Bagge, Annika; Valtat, Bérengère; Vikman, Petter; Spégel, Peter; Mulder, Hindrik

2016-01-01

RNA editing is a post-transcriptional alteration of RNA sequences that, via insertions, deletions or base substitutions, can affect protein structure as well as RNA and protein expression. Recently, it has been suggested that RNA editing may be more frequent than previously thought. A great impediment, however, to a deeper understanding of this process is the paramount sequencing effort that needs to be undertaken to identify RNA editing events. Here, we describe an in silico approach, based on machine learning, that ameliorates this problem. Using 41 nucleotide long DNA sequences, we show that novel A-to-I RNA editing events can be predicted from known A-to-I RNA editing events intra- and interspecies. The validity of the proposed method was verified in an independent experimental dataset. Using our approach, 203 202 putative A-to-I RNA editing events were predicted in the whole human genome. Out of these, 9% were previously reported. The remaining sites require further validation, e.g., by targeted deep sequencing. In conclusion, the approach described here is a useful tool to identify potential A-to-I RNA editing events without the requirement of extensive RNA sequencing. PMID:27764195
Characterization of Urtica dioica agglutinin isolectins and the encoding gene family.

PubMed

Does, M P; Ng, D K; Dekker, H L; Peumans, W J; Houterman, P M; Van Damme, E J; Cornelissen, B J

1999-01-01

Urtica dioica agglutinin (UDA) has previously been found in roots and rhizomes of stinging nettles as a mixture of UDA-isolectins. Protein and cDNA sequencing have shown that mature UDA is composed of two hevein domains and is processed from a precursor protein. The precursor contains a signal peptide, two in-tandem hevein domains, a hinge region and a carboxyl-terminal chitinase domain. Genomic fragments encoding precursors for UDA-isolectins have been amplified by five independent polymerase chain reactions on genomic DNA from stinging nettle ecotype Weerselo. One amplified gene was completely sequenced. As compared to the published cDNA sequence, the genomic sequence contains, besides two basepair substitutions, two introns located at the same positions as in other plant chitinases. By partial sequence analysis of 40 amplified genes, 16 different genes were identified which encode seven putative UDA-isolectins. The deduced amino acid sequences share 78.9-98.9% identity. In extracts of roots and rhizomes of stinging nettle ecotype Weerselo six out of these seven isolectins were detected by mass spectrometry. One of them is an acidic form, which has not been identified before. Our results demonstrate that UDA is encoded by a large gene family.
Assessing the role of aromatic residues in the amyloid aggregation of human muscle acylphosphatase

PubMed Central

Bemporad, Francesco; Taddei, Niccolò; Stefani, Massimo; Chiti, Fabrizio

2006-01-01

Among the many parameters that have been proposed to promote amyloid fibril formation is the π-stacking of aromatic residues. We have studied the amyloid aggregation of several mutants of human muscle acylphosphatase in which an aromatic residue was substituted with a non-aromatic one. The aggregation rate was determined using the Thioflavin T test under conditions in which the variants populated initially an ensemble of partially unfolded conformations. Substitutions in aggregation-promoting fragments of the sequence result in a dramatically decreased aggregation rate of the protein, confirming the propensity of aromatic residues to promote this process. Nevertheless, a statistical analysis shows that the measured decrease of aggregation rate following mutation arises predominantly from a reduction of hydrophobicity and intrinsic β-sheet propensity. This suggests that aromatic residues favor aggregation because of these factors rather than for their aromaticity. PMID:16600970
Strongly hydrogen-bond acidic polymer and methods of making and using

DOEpatents

Grate, Jay W.; Kaganove, Steven N.

2000-01-01

The present invention is a sorbent polymer with the (AB)n sequence where the fluorinated interactive A segment is fluoroalkyl-substituted bisphenol and the oligosiloxane B segment is an oligodimethylsiloxane. More specifically, the fluoroalkyl-substituted bisphenol contains two allyl groups and the oligodimethylsiloxane has terminal Si--H groups. The sorbent polymer may be used as thin films on a variety of chemical sensors, or as a component of a thin film on a chemical sensor. Crosslinked sorbent polymers are processable into stable thin films on sensor devices. Sorbent polymers are also useful in sensor arrays, in surface acoustic wave sensors, and in cladding of optical fibers. Sensor arrays provide better selectivity than single sensors and permit identification and quantification of more than one species in a mixture. The sorbent polymer is synthesized by hydrosilylation polymerization which is achieved by catalyzed heating.
Discovery of melanocortin ligands via a double simultaneous substitution strategy based on the Ac-His-DPhe-Arg-Trp-NH2 template.

PubMed

Todorovic, Aleksandar; Lensing, Cody J; Holder, Jerry Ryan; Scott, Joseph W; Sorensen, Nicholas B; Haskell-Luevano, Carrie

2018-05-21

The melanocortin system regulates an array of diverse physiological functions including pigmentation, feeding behavior, energy homeostasis, cardiovascular regulation, sexual function, and steroidogenesis. Endogenous melanocortin agonist ligands all possess the minimal messaging tetrapeptide sequence His-Phe-Arg-Trp. Based on this endogenous sequence, the Ac-His1-DPhe2-Arg3-Trp4-NH 2 tetrapeptide has previously been shown to be a useful scaffold when utilizing traditional positional scanning approaches to modify activity at the various melanocortin receptors (MC1-5R). The study reported herein was undertaken to evaluate a double simultaneous substitution strategy as an approach to further diversify the Ac-His1-DPhe2-Arg3-Trp4-NH 2 tetrapeptide with concurrent introduction of natural and unnatural amino acids at positions 1, 2, or 4 as well as an octanoyl residue at the N-terminus. The designed library includes the following combinations: (A) double simultaneous substitution at capping group position (Ac) together with position 1, 2, or 4, (B) double simultaneous substitution at position 1 and 2, (C) double simultaneous substitution at position 1 and 4, and (D) double simultaneous substitution at position 2 and 4. Several lead ligands with unique pharmacologies were discovered in the current study including antagonists targeting the neuronal mMC3R with minimal agonist activity and ligands with selective profiles for the various melanocortin subtypes. The results suggest that the double simultaneous substitution strategy is a suitable approach in altering melanocortin receptor potency, selectivity, or converting agonists into antagonists and vice versa.
Molecular characterization of two high-level ceftriaxone-resistant Neisseria gonorrhoeae isolates detected in Catalonia, Spain.

PubMed

Cámara, Jordi; Serra, Judit; Ayats, Josefina; Bastida, Teresa; Carnicer-Pont, Dolors; Andreu, Antònia; Ardanuy, Carmen

2012-08-01

The aim of this study was to characterize the first two extended-spectrum cephalosporin-resistant and multidrug-resistant (MDR) Neisseria gonorrhoeae isolates collected from two sexually related patients (men who have sex with men) in Spain. Antimicrobial susceptibility was studied by Etest. Genes involved in quinolone, ceftriaxone and multidrug resistance were amplified by PCR and sequenced in both directions. The isolates were typed by N. gonorrhoeae multi-antigen sequence typing (NG-MAST). The two isolates had the same MDR profile, showing resistance to penicillin (MIC 0.094 mg/L; β-lactamase negative), ceftriaxone (MIC 1.5 mg/L), cefixime (MIC 1.5 mg/L), cefotaxime (MIC 1 mg/L), ciprofloxacin (MIC >32 mg/L) and tetracycline (MIC 1.5 mg/L). NG-MAST showed that both isolates belonged to sequence type (ST) 1407 (porB-908 and tbpB-110). Ciprofloxacin resistance was due to amino acid substitutions in GyrA (S91F and D95G) and ParC (S87R). An A deletion in the promoter of the MtrCDE efflux pump (mtrR) was detected. No changes were detected in the pilQ gene. The outer membrane protein PorB showed two substitutions at G120K and A121N. An L421P substitution was observed in the PBP1A (ponA) sequence. The sequence of PBP2 (penA) showed a mosaic structure related to genotype XXXIV with a single additional amino acid substitution (A501P). This genotype was identical to a recently described French isolate (F89). This is the first reported case of high-level extended-spectrum cephalosporin-resistant N. gonorrhoeae transmission. The molecular typing and MDR genotype suggest possible European spread of this strain, highlighting the need for surveillance and the importance of testing the susceptibility of N. gonorrhoeae to extended-spectrum cephalosporins.
Changing selective pressure during antigenic changes in human influenza H3.

PubMed

Blackburne, Benjamin P; Hay, Alan J; Goldstein, Richard A

2008-05-02

The rapid evolution of influenza viruses presents difficulties in maintaining the optimal efficiency of vaccines. Amino acid substitutions result in antigenic drift, a process whereby antisera raised in response to one virus have reduced effectiveness against future viruses. Interestingly, while amino acid substitutions occur at a relatively constant rate, the antigenic properties of H3 move in a discontinuous, step-wise manner. It is not clear why this punctuated evolution occurs, whether this represents simply the fact that some substitutions affect these properties more than others, or if this is indicative of a changing relationship between the virus and the host. In addition, the role of changing glycosylation of the haemagglutinin in these shifts in antigenic properties is unknown. We analysed the antigenic drift of HA1 from human influenza H3 using a model of sequence change that allows for variation in selective pressure at different locations in the sequence, as well as at different parts of the phylogenetic tree. We detect significant changes in selective pressure that occur preferentially during major changes in antigenic properties. Despite the large increase in glycosylation during the past 40 years, changes in glycosylation did not correlate either with changes in antigenic properties or with significantly more rapid changes in selective pressure. The locations that undergo changes in selective pressure are largely in places undergoing adaptive evolution, in antigenic locations, and in locations or near locations undergoing substitutions that characterise the change in antigenicity of the virus. Our results suggest that the relationship of the virus to the host changes with time, with the shifts in antigenic properties representing changes in this relationship. This suggests that the virus and host immune system are evolving different methods to counter each other. While we are able to characterise the rapid increase in glycosylation of the haemagglutinin during time in human influenza H3, an increase not present in influenza in birds, this increase seems unrelated to the observed changes in antigenic properties.
The use of additive and subtractive approaches to examine the nuclear localization sequence of the polyomavirus major capsid protein VP1

NASA Technical Reports Server (NTRS)

Chang, D.; Haynes, J. I. 2nd; Brady, J. N.; Consigli, R. A.; Spooner, B. S. (Principal Investigator)

1992-01-01

A nuclear localization signal (NLS) has been identified in the N-terminal (Ala1-Pro-Lys-Arg-Lys-Ser-Gly-Val-Ser-Lys-Cys11) amino acid sequence of the polyomavirus major capsid protein VP1. The importance of this amino acid sequence for nuclear transport of VP1 protein was demonstrated by a genetic "subtractive" study using the constructs pSG5VP1 (full-length VP1) and pSG5 delta 5'VP1 (truncated VP1, lacking amino acids Ala1-Cys11). These constructs were used to transfect COS-7 cells, and expression and intracellular localization of the VP1 protein was visualized by indirect immunofluorescence. These studies revealed that the full-length VP1 was expressed and localized in the nucleus, while the truncated VP1 protein was localized in the cytoplasm and not transported to the nucleus. These findings were substantiated by an "additive" approach using FITC-labeled conjugates of synthetic peptides homologous to the NLS of VP1 cross-linked to bovine serum albumin or immunoglobulin G. Both conjugates localized in the nucleus after microinjection into the cytoplasm of 3T6 cells. The importance of individual amino acids found in the basic sequence (Lys3-Arg-Lys5) of the NLS was also investigated. This was accomplished by synthesizing three additional peptides in which lysine-3 was substituted with threonine, arginine-4 was substituted with threonine, or lysine-5 was substituted with threonine. It was found that lysine-3 was crucial for nuclear transport, since substitution of this amino acid with threonine prevented nuclear localization of the microinjected, FITC-labeled conjugate.
Experimental rugged fitness landscape in protein sequence space.

PubMed

Hayashi, Yuuki; Aita, Takuyo; Toyota, Hitoshi; Husimi, Yuzuru; Urabe, Itaru; Yomo, Tetsuya

2006-12-20

The fitness landscape in sequence space determines the process of biomolecular evolution. To plot the fitness landscape of protein function, we carried out in vitro molecular evolution beginning with a defective fd phage carrying a random polypeptide of 139 amino acids in place of the g3p minor coat protein D2 domain, which is essential for phage infection. After 20 cycles of random substitution at sites 12-130 of the initial random polypeptide and selection for infectivity, the selected phage showed a 1.7x10(4)-fold increase in infectivity, defined as the number of infected cells per ml of phage suspension. Fitness was defined as the logarithm of infectivity, and we analyzed (1) the dependence of stationary fitness on library size, which increased gradually, and (2) the time course of changes in fitness in transitional phases, based on an original theory regarding the evolutionary dynamics in Kauffman's n-k fitness landscape model. In the landscape model, single mutations at single sites among n sites affect the contribution of k other sites to fitness. Based on the results of these analyses, k was estimated to be 18-24. According to the estimated parameters, the landscape was plotted as a smooth surface up to a relative fitness of 0.4 of the global peak, whereas the landscape had a highly rugged surface with many local peaks above this relative fitness value. Based on the landscapes of these two different surfaces, it appears possible for adaptive walks with only random substitutions to climb with relative ease up to the middle region of the fitness landscape from any primordial or random sequence, whereas an enormous range of sequence diversity is required to climb further up the rugged surface above the middle region.
Experimental Rugged Fitness Landscape in Protein Sequence Space

PubMed Central

Hayashi, Yuuki; Aita, Takuyo; Toyota, Hitoshi; Husimi, Yuzuru; Urabe, Itaru; Yomo, Tetsuya

2006-01-01

The fitness landscape in sequence space determines the process of biomolecular evolution. To plot the fitness landscape of protein function, we carried out in vitro molecular evolution beginning with a defective fd phage carrying a random polypeptide of 139 amino acids in place of the g3p minor coat protein D2 domain, which is essential for phage infection. After 20 cycles of random substitution at sites 12–130 of the initial random polypeptide and selection for infectivity, the selected phage showed a 1.7×104-fold increase in infectivity, defined as the number of infected cells per ml of phage suspension. Fitness was defined as the logarithm of infectivity, and we analyzed (1) the dependence of stationary fitness on library size, which increased gradually, and (2) the time course of changes in fitness in transitional phases, based on an original theory regarding the evolutionary dynamics in Kauffman's n-k fitness landscape model. In the landscape model, single mutations at single sites among n sites affect the contribution of k other sites to fitness. Based on the results of these analyses, k was estimated to be 18–24. According to the estimated parameters, the landscape was plotted as a smooth surface up to a relative fitness of 0.4 of the global peak, whereas the landscape had a highly rugged surface with many local peaks above this relative fitness value. Based on the landscapes of these two different surfaces, it appears possible for adaptive walks with only random substitutions to climb with relative ease up to the middle region of the fitness landscape from any primordial or random sequence, whereas an enormous range of sequence diversity is required to climb further up the rugged surface above the middle region. PMID:17183728
Double-strand break repair processes drive evolution of the mitochondrial genome in Arabidopsis.

PubMed

Davila, Jaime I; Arrieta-Montiel, Maria P; Wamboldt, Yashitola; Cao, Jun; Hagmann, Joerg; Shedge, Vikas; Xu, Ying-Zhi; Weigel, Detlef; Mackenzie, Sally A

2011-09-27

The mitochondrial genome of higher plants is unusually dynamic, with recombination and nonhomologous end-joining (NHEJ) activities producing variability in size and organization. Plant mitochondrial DNA also generally displays much lower nucleotide substitution rates than mammalian or yeast systems. Arabidopsis displays these features and expedites characterization of the mitochondrial recombination surveillance gene MSH1 (MutS 1 homolog), lending itself to detailed study of de novo mitochondrial genome activity. In the present study, we investigated the underlying basis for unusual plant features as they contribute to rapid mitochondrial genome evolution. We obtained evidence of double-strand break (DSB) repair, including NHEJ, sequence deletions and mitochondrial asymmetric recombination activity in Arabidopsis wild-type and msh1 mutants on the basis of data generated by Illumina deep sequencing and confirmed by DNA gel blot analysis. On a larger scale, with mitochondrial comparisons across 72 Arabidopsis ecotypes, similar evidence of DSB repair activity differentiated ecotypes. Forty-seven repeat pairs were active in DNA exchange in the msh1 mutant. Recombination sites showed asymmetrical DNA exchange within lengths of 50- to 556-bp sharing sequence identity as low as 85%. De novo asymmetrical recombination involved heteroduplex formation, gene conversion and mismatch repair activities. Substoichiometric shifting by asymmetrical exchange created the appearance of rapid sequence gain and loss in association with particular repeat classes. Extensive mitochondrial genomic variation within a single plant species derives largely from DSB activity and its repair. Observed gene conversion and mismatch repair activity contribute to the low nucleotide substitution rates seen in these genomes. On a phenotypic level, these patterns of rearrangement likely contribute to the reproductive versatility of higher plants.
Shifts in the evolutionary rate and intensity of purifying selection between two Brassica genomes revealed by analyses of orthologous transposons and relics of a whole genome triplication.

PubMed

Zhao, Meixia; Du, Jianchang; Lin, Feng; Tong, Chaobo; Yu, Jingyin; Huang, Shunmou; Wang, Xiaowu; Liu, Shengyi; Ma, Jianxin

2013-10-01

Recent sequencing of the Brassica rapa and Brassica oleracea genomes revealed extremely contrasting genomic features such as the abundance and distribution of transposable elements between the two genomes. However, whether and how these structural differentiations may have influenced the evolutionary rates of the two genomes since their split from a common ancestor are unknown. Here, we investigated and compared the rates of nucleotide substitution between two long terminal repeats (LTRs) of individual orthologous LTR-retrotransposons, the rates of synonymous and non-synonymous substitution among triplicated genes retained in both genomes from a shared whole genome triplication event, and the rates of genetic recombination estimated/deduced by the comparison of physical and genetic distances along chromosomes and ratios of solo LTRs to intact elements. Overall, LTR sequences and genic sequences showed more rapid nucleotide substitution in B. rapa than in B. oleracea. Synonymous substitution of triplicated genes retained from a shared whole genome triplication was detected at higher rates in B. rapa than in B. oleracea. Interestingly, non-synonymous substitution was observed at lower rates in the former than in the latter, indicating shifted densities of purifying selection between the two genomes. In addition to evolutionary asymmetry, orthologous genes differentially regulated and/or disrupted by transposable elements between the two genomes were also characterized. Our analyses suggest that local genomic and epigenomic features, such as recombination rates and chromatin dynamics reshaped by independent proliferation of transposable elements and elimination between the two genomes, are perhaps partially the causes and partially the outcomes of the observed inter-specific asymmetric evolution. © 2013 Purdue University The Plant Journal © 2013 John Wiley & Sons Ltd.
Acquisition of initial /s/-stop and stop-/s/sequences in Greek.

PubMed

Syrika, Asimina; Nicolaidis, Katerina; Edwards, Jan; Beckman, Mary E

2011-09-01

Previous work on children's acquisition of complex sequences points to a tendency for affricates to be acquired before clusters, but there is no clear evidence of a difference in order of acquisition between clusters with /s/ that violate the Sonority Sequencing Principle (SSP), such as /s/ followed by stop in onset position, and other clusters that obey the SSP. One problem with studies that have compared the acquisition of SSP-obeying and SSP-violating clusters is that the component sounds in the two types of sequences were different.This paper examines the acquisition of initial /s/-stop and stop-/s/ sequences by sixty Greek children aged 2 through 5 years. Results showed greater accuracy for the /s/-stop relative to the stop-/s/ sequences, but no difference in accuracy between /ts/, which is usually analyzed as an affricate in Greek, and the other stop-/s/ sequences. Moreover, errors for the /s/-stop sequences and /ts/ primarily involved stop substitutions, whereas errors for /ps/ and /ks/ were more variable and often involved fricative substitutions, a pattern which may have a perceptual explanation. Finally, /ts/ showed a distinct temporal pattern relative to the stop-/s/ clusters /ps/ and /ks/, similar to what has been reported for productions of Greek adults.
Acquisition of initial /s/-stop and stop-/s/ sequences in Greek

PubMed Central

Syrika, Asimina; Nicolaidis, Katerina; Edwards, Jan; Beckman, Mary E.

2010-01-01

Previous work on children’s acquisition of complex sequences points to a tendency for affricates to be acquired before clusters, but there is no clear evidence of a difference in order of acquisition between clusters with /s/ that violate the Sonority Sequencing Principle (SSP), such as /s/ followed by stop in onset position, and other clusters that obey the SSP. One problem with studies that have compared the acquisition of SSP-obeying and SSP-violating clusters is that the component sounds in the two types of sequences were different. This paper examines the acquisition of initial /s/-stop and stop-/s/ sequences by sixty Greek children aged 2 through 5 years. Results showed greater accuracy for the /s/-stop relative to the stop-/s/ sequences, but no difference in accuracy between /ts/, which is usually analyzed as an affricate in Greek, and the other stop-/s/ sequences. Moreover, errors for the /s/-stop sequences and /ts/ primarily involved stop substitutions, whereas errors for /ps/ and /ks/ were more variable and often involved fricative substitutions, a pattern which may have a perceptual explanation. Finally, /ts/ showed a distinct temporal pattern relative to the stop-/s/ clusters /ps/ and /ks/, similarly to what has been reported for productions of Greek adults. PMID:22070044
Differential Effects of the G118R, H51Y, and E138K Resistance Substitutions in Different Subtypes of HIV Integrase

PubMed Central

Quashie, Peter K.; Oliviera, Maureen; Veres, Tamar; Osman, Nathan; Han, Ying-Shan; Hassounah, Said; Lie, Yolanda; Huang, Wei; Mesplède, Thibault

2014-01-01

ABSTRACT Dolutegravir (DTG) is the latest antiretroviral (ARV) approved for the treatment of human immunodeficiency virus (HIV) infection. The G118R substitution, previously identified with MK-2048 and raltegravir, may represent the initial substitution in a dolutegravir resistance pathway. We have found that subtype C integrase proteins have a low enzymatic cost associated with the G118R substitution, mostly at the strand transfer step of integration, compared to either subtype B or recombinant CRF02_AG proteins. Subtype B and circulating recombinant form AG (CRF02_AG) clonal viruses encoding G118R-bearing integrases were severely restricted in their viral replication capacity, and G118R/E138K-bearing viruses had various levels of resistance to dolutegravir, raltegravir, and elvitegravir. In cell-free experiments, the impacts of the H51Y and E138K substitutions on resistance and enzyme efficiency, when present with G118R, were highly dependent on viral subtype. Sequence alignment and homology modeling showed that the subtype-specific effects of these mutations were likely due to differential amino acid residue networks in the different integrase proteins, caused by polymorphic residues, which significantly affect native protein activity, structure, or function and are important for drug-mediated inhibition of enzyme activity. This preemptive study will aid in the interpretation of resistance patterns in dolutegravir-treated patients. IMPORTANCE Recognized drug resistance mutations have never been reported for naive patients treated with dolutegravir. Additionally, in integrase inhibitor-experienced patients, only R263K and other previously known integrase resistance substitutions have been reported. Here we suggest that alternate resistance pathways may develop in non-B HIV-1 subtypes and explain how “minor” polymorphisms and substitutions in HIV integrase that are associated with these subtypes can influence resistance against dolutegravir. This work also highlights the importance of phenotyping versus genotyping when a strong inhibitor such as dolutegravir is being used. By characterizing the G118R substitution, this work also preemptively defines parameters for a potentially important pathway in some non-B HIV subtype viruses treated with dolutegravir and will aid in the inhibition of such a virus, if detected. The general inability of strand transfer-related substitutions to diminish 3′ processing indicates the importance of the 3′ processing step and highlights a therapeutic angle that needs to be better exploited. PMID:25552724

Differential effects of the G118R, H51Y, and E138K resistance substitutions in different subtypes of HIV integrase.

PubMed

Quashie, Peter K; Oliviera, Maureen; Veres, Tamar; Osman, Nathan; Han, Ying-Shan; Hassounah, Said; Lie, Yolanda; Huang, Wei; Mesplède, Thibault; Wainberg, Mark A

2015-03-01

Dolutegravir (DTG) is the latest antiretroviral (ARV) approved for the treatment of human immunodeficiency virus (HIV) infection. The G118R substitution, previously identified with MK-2048 and raltegravir, may represent the initial substitution in a dolutegravir resistance pathway. We have found that subtype C integrase proteins have a low enzymatic cost associated with the G118R substitution, mostly at the strand transfer step of integration, compared to either subtype B or recombinant CRF02_AG proteins. Subtype B and circulating recombinant form AG (CRF02_AG) clonal viruses encoding G118R-bearing integrases were severely restricted in their viral replication capacity, and G118R/E138K-bearing viruses had various levels of resistance to dolutegravir, raltegravir, and elvitegravir. In cell-free experiments, the impacts of the H51Y and E138K substitutions on resistance and enzyme efficiency, when present with G118R, were highly dependent on viral subtype. Sequence alignment and homology modeling showed that the subtype-specific effects of these mutations were likely due to differential amino acid residue networks in the different integrase proteins, caused by polymorphic residues, which significantly affect native protein activity, structure, or function and are important for drug-mediated inhibition of enzyme activity. This preemptive study will aid in the interpretation of resistance patterns in dolutegravir-treated patients. Recognized drug resistance mutations have never been reported for naive patients treated with dolutegravir. Additionally, in integrase inhibitor-experienced patients, only R263K and other previously known integrase resistance substitutions have been reported. Here we suggest that alternate resistance pathways may develop in non-B HIV-1 subtypes and explain how "minor" polymorphisms and substitutions in HIV integrase that are associated with these subtypes can influence resistance against dolutegravir. This work also highlights the importance of phenotyping versus genotyping when a strong inhibitor such as dolutegravir is being used. By characterizing the G118R substitution, this work also preemptively defines parameters for a potentially important pathway in some non-B HIV subtype viruses treated with dolutegravir and will aid in the inhibition of such a virus, if detected. The general inability of strand transfer-related substitutions to diminish 3' processing indicates the importance of the 3' processing step and highlights a therapeutic angle that needs to be better exploited. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Improved neural network based scene-adaptive nonuniformity correction method for infrared focal plane arrays.

PubMed

Lai, Rui; Yang, Yin-tang; Zhou, Duan; Li, Yue-jin

2008-08-20

An improved scene-adaptive nonuniformity correction (NUC) algorithm for infrared focal plane arrays (IRFPAs) is proposed. This method simultaneously estimates the infrared detectors' parameters and eliminates the nonuniformity causing fixed pattern noise (FPN) by using a neural network (NN) approach. In the learning process of neuron parameter estimation, the traditional LMS algorithm is substituted with the newly presented variable step size (VSS) normalized least-mean square (NLMS) based adaptive filtering algorithm, which yields faster convergence, smaller misadjustment, and lower computational cost. In addition, a new NN structure is designed to estimate the desired target value, which promotes the calibration precision considerably. The proposed NUC method reaches high correction performance, which is validated by the experimental results quantitatively tested with a simulative testing sequence and a real infrared image sequence.
Consistency of VDJ Rearrangement and Substitution Parameters Enables Accurate B Cell Receptor Sequence Annotation.

PubMed

Ralph, Duncan K; Matsen, Frederick A

2016-01-01

VDJ rearrangement and somatic hypermutation work together to produce antibody-coding B cell receptor (BCR) sequences for a remarkable diversity of antigens. It is now possible to sequence these BCRs in high throughput; analysis of these sequences is bringing new insight into how antibodies develop, in particular for broadly-neutralizing antibodies against HIV and influenza. A fundamental step in such sequence analysis is to annotate each base as coming from a specific one of the V, D, or J genes, or from an N-addition (a.k.a. non-templated insertion). Previous work has used simple parametric distributions to model transitions from state to state in a hidden Markov model (HMM) of VDJ recombination, and assumed that mutations occur via the same process across sites. However, codon frame and other effects have been observed to violate these parametric assumptions for such coding sequences, suggesting that a non-parametric approach to modeling the recombination process could be useful. In our paper, we find that indeed large modern data sets suggest a model using parameter-rich per-allele categorical distributions for HMM transition probabilities and per-allele-per-position mutation probabilities, and that using such a model for inference leads to significantly improved results. We present an accurate and efficient BCR sequence annotation software package using a novel HMM "factorization" strategy. This package, called partis (https://github.com/psathyrella/partis/), is built on a new general-purpose HMM compiler that can perform efficient inference given a simple text description of an HMM.
Base substitutions at scissile bond sites are sufficient to alter RNA-binding and cleavage activity of RNase III.

PubMed

Kim, Kyungsub; Sim, Se-Hoon; Jeon, Che Ok; Lee, Younghoon; Lee, Kangseok

2011-02-01

RNase III, a double-stranded RNA-specific endoribonuclease, degrades bdm mRNA via cleavage at specific sites. To better understand the mechanism of cleavage site selection by RNase III, we performed a genetic screen for sequences containing mutations at the bdm RNA cleavage sites that resulted in altered mRNA stability using a transcriptional bdm'-'cat fusion construct. While most of the isolated mutants showed the increased bdm'-'cat mRNA stability that resulted from the inability of RNase III to cleave the mutated sequences, one mutant sequence (wt-L) displayed in vivo RNA stability similar to that of the wild-type sequence. In vivo and in vitro analyses of the wt-L RNA substrate showed that it was cut only once on the RNA strand to the 5'-terminus by RNase III, while the binding constant of RNase III to this mutant substrate was moderately increased. A base substitution at the uncleaved RNase III cleavage site in wt-L mutant RNA found in another mutant lowered the RNA-binding affinity by 11-fold and abolished the hydrolysis of scissile bonds by RNase III. Our results show that base substitutions at sites forming the scissile bonds are sufficient to alter RNA cleavage as well as the binding activity of RNase III. © 2010 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
Molecular characterization of amino acid deletion in VP1 (1D) protein and novel amino acid substitutions in 3D polymerase protein of foot and mouth disease virus subtype A/Iran87.

PubMed

Esmaelizad, Majid; Jelokhani-Niaraki, Saber; Hashemnejad, Khadije; Kamalzadeh, Morteza; Lotfi, Mohsen

2011-12-01

The nucleotide sequence of the VP1 (1D) and partial 3D polymerase (3D(pol)) coding regions of the foot and mouth disease virus (FMDV) vaccine strain A/Iran87, a highly passaged isolate (~150 passages), was determined and aligned with previously published FMDV serotype A sequences. Overall analysis of the amino acid substitutions revealed that the partial 3D(pol) coding region contained four amino acid alterations. Amino acid sequence comparison of the VP1 coding region of the field isolates revealed deletions in the highly passaged Iranian isolate (A/Iran87). The prominent G-H loop of the FMDV VP1 protein contains the conserved arginine-glycine-aspartic acid (RGD) tripeptide, which is a well-known ligand for a specific cell surface integrin. Despite losing the RGD sequence of the VP1 protein and an Asp(26)→Glu substitution in a beta sheet located within a small groove of the 3D(pol) protein, the virus grew in BHK 21 suspension cell cultures. Since this strain has been used as a vaccine strain, it may be inferred that the RGD deletion has no critical role in virus attachment to the cell during the initiation of infection. It is probable that this FMDV subtype can utilize other pathways for cell attachment.
A single base substitution in the coding region for neurophysin II associated with familial central diabetes insipidus.

PubMed Central

Ito, M; Mori, Y; Oiso, Y; Saito, H

1991-01-01

To elucidate the molecular mechanism of familial central diabetes insipidus (FDI), we sequenced the arginine vasopressin-neurophysin II (AVP-NPII) gene in 2 patients belonging to a pedigree that is consistent with an autosomal dominant mode of inheritance. 10 patients with idiopathic central diabetes insipidus (IDI) and 5 normals were also studied. The AVP-NPII gene, locating on chromosome 20, consists of three exons that encode putative signal peptide, AVP, NPII, and glycoprotein. Using polymerase chain reaction, fragments including the promoter region and all coding regions were amplified from genomic DNA and subjected to direct sequencing. Sequences of 10 patients with IDI were identical with those of normals, while in 2 patients with FDI, a single base substitution was detected in one of two alleles of the AVP-NPII gene, indicating they were heterozygotes for this mutation. It was a G----A transition at nucleotide position 1859 in the second exon, resulting in a substitution of Gly for Ser at amino acid position 57 in the NPII moiety. It was speculated that the mutated AVP-NPII precursor or the mutated NPII molecule, through their conformational changes, might be responsible for AVP deficiency. Images PMID:1840604
Temporal patterns of damage and decay kinetics of DNA retrieved from plant herbarium specimens.

PubMed

Weiß, Clemens L; Schuenemann, Verena J; Devos, Jane; Shirsekar, Gautam; Reiter, Ella; Gould, Billie A; Stinchcombe, John R; Krause, Johannes; Burbano, Hernán A

2016-06-01

Herbaria archive a record of changes of worldwide plant biodiversity harbouring millions of specimens that contain DNA suitable for genome sequencing. To profit from this resource, it is fundamental to understand in detail the process of DNA degradation in herbarium specimens. We investigated patterns of DNA fragmentation and nucleotide misincorporation by analysing 86 herbarium samples spanning the last 300 years using Illumina shotgun sequencing. We found an exponential decay relationship between DNA fragmentation and time, and estimated a per nucleotide fragmentation rate of 1.66 × 10(-4) per year, which is six times faster than the rate estimated for ancient bones. Additionally, we found that strand breaks occur specially before purines, and that depurination-driven DNA breakage occurs constantly through time and can to a great extent explain decreasing fragment length over time. Similar to what has been found analysing ancient DNA from bones, we found a strong correlation between the deamination-driven accumulation of cytosine to thymine substitutions and time, which reinforces the importance of substitution patterns to authenticate the ancient/historical nature of DNA fragments. Accurate estimations of DNA degradation through time will allow informed decisions about laboratory and computational procedures to take advantage of the vast collection of worldwide herbarium specimens.
Evolutionary rates of mitochondrial genomes correspond to diversification rates and to contemporary species richness in birds and reptiles

PubMed Central

Eo, Soo Hyung; DeWoody, J. Andrew

2010-01-01

Rates of biological diversification should ultimately correspond to rates of genome evolution. Recent studies have compared diversification rates with phylogenetic branch lengths, but incomplete phylogenies hamper such analyses for many taxa. Herein, we use pairwise comparisons of confamilial sauropsid (bird and reptile) mitochondrial DNA (mtDNA) genome sequences to estimate substitution rates. These molecular evolutionary rates are considered in light of the age and species richness of each taxonomic family, using a random-walk speciation–extinction process to estimate rates of diversification. We find the molecular clock ticks at disparate rates in different families and at different genes. For example, evolutionary rates are relatively fast in snakes and lizards, intermediate in crocodilians and slow in turtles and birds. There was also rate variation across genes, where non-synonymous substitution rates were fastest at ATP8 and slowest at CO3. Family-by-gene interactions were significant, indicating that local clocks vary substantially among sauropsids. Most importantly, we find evidence that mitochondrial genome evolutionary rates are positively correlated with speciation rates and with contemporary species richness. Nuclear sequences are poorly represented among reptiles, but the correlation between rates of molecular evolution and species diversification also extends to 18 avian nuclear genes we tested. Thus, the nuclear data buttress our mtDNA findings. PMID:20610427
Dynamic Nucleotide Mutation Gradients and Control Region Usage in Squamate Reptile Mitochondrial Genomes

PubMed Central

Castoe, T.A.; Gu, W.; de Koning, A.P.J.; Daza, J.M.; Jiang, Z.J.; Parkinson, C.L.; Pollock, D.D.

2010-01-01

Gradients of nucleotide bias and substitution rates occur in vertebrate mitochondrial genomes due to the asymmetric nature of the replication process. The evolution of these gradients has previously been studied in detail in primates, but not in other vertebrate groups. From the primate study, the strengths of these gradients are known to evolve in ways that can substantially alter the substitution process, but it is unclear how rapidly they evolve over evolutionary time or how different they may be in different lineages or groups of vertebrates. Given the importance of mitochondrial genomes in phylogenetics and molecular evolutionary research, a better understanding of how asymmetric mitochondrial substitution gradients evolve would contribute key insights into how this gradient evolution may mislead evolutionary inferences, and how it may also be incorporated into new evolutionary models. Most snake mitochondrial genomes have an additional interesting feature, 2 nearly identical control regions, which vary among different species in the extent that they are used as origins of replication. Given the expanded sampling of complete snake genomes currently available, together with 2 additional snakes sequenced in this study, we reexamined gradient strength and CR usage in alethinophidian snakes as well as several lizards that possess dual CRs. Our results suggest that nucleotide substitution gradients (and corresponding nucleotide bias) and CR usage is highly labile over the ∼200 m.y. of squamate evolution, and demonstrates greater overall variability than previously shown in primates. The evidence for the existence of such gradients, and their ability to evolve rapidly and converge among unrelated species suggests that gradient dynamics could easily mislead phylogenetic and molecular evolutionary inferences, and argues strongly that these dynamics should be incorporated into phylogenetic models. PMID:20215734
Slow but not low: genomic comparisons reveal slower evolutionary rate and higher dN/dS in conifers compared to angiosperms.

PubMed

Buschiazzo, Emmanuel; Ritland, Carol; Bohlmann, Jörg; Ritland, Kermit

2012-01-20

Comparative genomics can inform us about the processes of mutation and selection across diverse taxa. Among seed plants, gymnosperms have been lacking in genomic comparisons. Recent EST and full-length cDNA collections for two conifers, Sitka spruce (Picea sitchensis) and loblolly pine (Pinus taeda), together with full genome sequences for two angiosperms, Arabidopsis thaliana and poplar (Populus trichocarpa), offer an opportunity to infer the evolutionary processes underlying thousands of orthologous protein-coding genes in gymnosperms compared with an angiosperm orthologue set. Based upon pairwise comparisons of 3,723 spruce and pine orthologues, we found an average synonymous genetic distance (dS) of 0.191, and an average dN/dS ratio of 0.314. Using a fossil-established divergence time of 140 million years between spruce and pine, we extrapolated a nucleotide substitution rate of 0.68 × 10(-9) synonymous substitutions per site per year. When compared to angiosperms, this indicates a dramatically slower rate of nucleotide substitution rates in conifers: on average 15-fold. Coincidentally, we found a three-fold higher dN/dS for the spruce-pine lineage compared to the poplar-Arabidopsis lineage. This joint occurrence of a slower evolutionary rate in conifers with higher dN/dS, and possibly positive selection, showcases the uniqueness of conifer genome evolution. Our results are in line with documented reduced nucleotide diversity, conservative genome evolution and low rates of diversification in conifers on the one hand and numerous examples of local adaptation in conifers on the other hand. We propose that reduced levels of nucleotide mutation in large and long-lived conifer trees, coupled with large effective population size, were the main factors leading to slow substitution rates but retention of beneficial mutations.
Phylogenetic mixtures and linear invariants for equal input models.

PubMed

Casanellas, Marta; Steel, Mike

2017-04-01

The reconstruction of phylogenetic trees from molecular sequence data relies on modelling site substitutions by a Markov process, or a mixture of such processes. In general, allowing mixed processes can result in different tree topologies becoming indistinguishable from the data, even for infinitely long sequences. However, when the underlying Markov process supports linear phylogenetic invariants, then provided these are sufficiently informative, the identifiability of the tree topology can be restored. In this paper, we investigate a class of processes that support linear invariants once the stationary distribution is fixed, the 'equal input model'. This model generalizes the 'Felsenstein 1981' model (and thereby the Jukes-Cantor model) from four states to an arbitrary number of states (finite or infinite), and it can also be described by a 'random cluster' process. We describe the structure and dimension of the vector spaces of phylogenetic mixtures and of linear invariants for any fixed phylogenetic tree (and for all trees-the so called 'model invariants'), on any number n of leaves. We also provide a precise description of the space of mixtures and linear invariants for the special case of [Formula: see text] leaves. By combining techniques from discrete random processes and (multi-) linear algebra, our results build on a classic result that was first established by James Lake (Mol Biol Evol 4:167-191, 1987).
Differentiation of Trypanosoma cruzi I subgroups through characterization of cytochrome b gene sequences.

PubMed

Spotorno O, Angel E; Córdova, Luis; Solari I, Aldo

2008-12-01

To identify and characterize chilean samples of Trypanosoma cruzi and their association with hosts, the first 516 bp of the mitochondrial cytochrome b gene were sequenced from eight biological samples, and phylogenetically compared with other known 20 American sequences. The molecular characterization of these 28 sequences in a maximum likelihood phylogram (-lnL = 1255.12, tree length = 180, consistency index = 0.79) allowed the robust identification (bootstrap % > 99) of three previously known discrete typing units (DTU): DTU IIb, IIa, and I. An apparently undescribed new sequence found in four new chilean samples was detected and designated as DTU Ib; they were separated by 24.7 differences, but robustly related (bootstrap % = 97 in 500 replicates) to those of DTU I by sharing 12 substitutions, among which four were nonsynonymous ones. Such new DTU Ib was also robust (bootstrap % = 100), and characterized by 10 unambiguous substitutions, with a single nonsynonymous G to T change at site 409. The fact that two of such new sequences were found in parasites from a chilean endemic caviomorph rodent, Octodon degus, and that they were closely related to the ancient DTU I suggested old origins and a long association to caviomorph hosts.
Nucleotide sequences of immunoglobulin eta genes of chimpanzee and orangutan: DNA molecular clock and hominoid evolution

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sakoyama, Y.; Hong, K.J.; Byun, S.M.

To determine the phylogenetic relationships among hominoids and the dates of their divergence, the complete nucleotide sequences of the constant region of the immunoglobulin eta-chain (C/sub eta1/) genes from chimpanzee and orangutan have been determined. These sequences were compared with the human eta-chain constant-region sequence. A molecular clock (silent molecular clock), measured by the degree of sequence divergence at the synonymous (silent) positions of protein-encoding regions, was introduced for the present study. From the comparison of nucleotide sequences of ..cap alpha../sub 1/-antitrypsin and ..beta..- and delta-globulin genes between humans and Old World monkeys, the silent molecular clock was calibrated: themore » mean evolutionary rate of silent substitution was determined to be 1.56 x 10/sup -9/ substitutions per site per year. Using the silent molecular clock, the mean divergence dates of chimpanzee and orangutan from the human lineage were estimated as 6.4 +/- 2.6 million years and 17.3 +/- 4.5 million years, respectively. It was also shown that the evolutionary rate of primate genes is considerably slower than those of other mammalian genes.« less
Analysis of conserved noncoding DNA in Drosophila reveals similar constraints in intergenic and intronic sequences.

PubMed

Bergman, C M; Kreitman, M

2001-08-01

Comparative genomic approaches to gene and cis-regulatory prediction are based on the principle that differential DNA sequence conservation reflects variation in functional constraint. Using this principle, we analyze noncoding sequence conservation in Drosophila for 40 loci with known or suspected cis-regulatory function encompassing >100 kb of DNA. We estimate the fraction of noncoding DNA conserved in both intergenic and intronic regions and describe the length distribution of ungapped conserved noncoding blocks. On average, 22%-26% of noncoding sequences surveyed are conserved in Drosophila, with median block length approximately 19 bp. We show that point substitution in conserved noncoding blocks exhibits transition bias as well as lineage effects in base composition, and occurs more than an order of magnitude more frequently than insertion/deletion (indel) substitution. Overall, patterns of noncoding DNA structure and evolution differ remarkably little between intergenic and intronic conserved blocks, suggesting that the effects of transcription per se contribute minimally to the constraints operating on these sequences. The results of this study have implications for the development of alignment and prediction algorithms specific to noncoding DNA, as well as for models of cis-regulatory DNA sequence evolution.
Molecular cloning and expression of the hyu genes from Microbacterium liquefaciens AJ 3912, responsible for the conversion of 5-substituted hydantoins to alpha-amino acids, in Escherichia coli.

PubMed

Suzuki, Shun'ichi; Takenaka, Yasuhiro; Onishi, Norimasa; Yokozeki, Kenzo

2005-08-01

A DNA fragment from Microbacterium liquefaciens AJ 3912, containing the genes responsible for the conversion of 5-substituted-hydantoins to alpha-amino acids, was cloned in Escherichia coli and sequenced. Seven open reading frames (hyuP, hyuA, hyuH, hyuC, ORF1, ORF2, and ORF3) were identified on the 7.5 kb fragment. The deduced amino acid sequence encoded by the hyuA gene included the N-terminal amino acid sequence of the hydantoin racemase from M. liquefaciens AJ 3912. The hyuA, hyuH, and hyuC genes were heterologously expressed in E. coli; their presence corresponded with the detection of hydantoin racemase, hydantoinase, and N-carbamoyl alpha-amino acid amido hydrolase enzymatic activities respectively. The deduced amino acid sequences of hyuP were similar to those of the allantoin (5-ureido-hydantoin) permease from Saccharomyces cerevisiae, suggesting that hyuP protein might function as a hydantoin transporter.
Evolutionary mechanisms involved in the virulence of infectious salmon anaemia virus (ISAV), a piscine orthomyxovirus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Markussen, Turhan; Jonassen, Christine Monceyron; Numanovic, Sanela

2008-05-10

Infectious salmon anaemia virus (ISAV) is an orthomyxovirus causing a multisystemic, emerging disease in Atlantic salmon. Here we present, for the first time, detailed sequence analyses of the full-genome sequence of a presumed avirulent isolate displaying a full-length hemagglutinin-esterase (HE) gene (HPR0), and compare this with full-genome sequences of 11 Norwegian ISAV isolates from clinically diseased fish. These analyses revealed the presence of a virulence marker right upstream of the putative cleavage site R{sub 267} in the fusion (F) protein, suggesting a Q{sub 266} {yields} L{sub 266} substitution to be a prerequisite for virulence. To gain virulence in isolates lackingmore » this substitution, a sequence insertion near the cleavage site seems to be required. This strongly suggests the involvement of a protease recognition pattern at the cleavage site of the fusion protein as a determinant of virulence, as seen in highly pathogenic influenza A virus H5 or H7 and the paramyxovirus Newcastle disease virus.« less
Landscape of somatic mutations in 560 breast cancer whole-genome sequences

DOE PAGES

Nik-Zainal, Serena; Davies, Helen; Staaf, Johan; ...

2016-05-02

Here, we analysed whole-genome sequences of 560 breast cancers to advance understanding of the driver mutations conferring clonal advantage and the mutational processes generating somatic mutations. We found that 93 protein-coding cancer genes carried probable driver mutations. Some non-coding regions exhibited high mutation frequencies, but most have distinctive structural features probably causing elevated mutation rates and do not contain driver mutations. Mutational signature analysis was extended to genome rearrangements and revealed twelve base substitution and six rearrangement signatures. Three rearrangement signatures, characterized by tandem duplications or deletions, appear associated with defective homologous-recombination-based DNA repair: one with deficient BRCA1 function, anothermore » with deficient BRCA1 or BRCA2 function, the cause of the third is unknown. This analysis of all classes of somatic mutation across exons, introns and intergenic regions highlights the repertoire of cancer genes and mutational processes operating, and progresses towards a comprehensive account of the somatic genetic basis of breast cancer.« less
Landscape of somatic mutations in 560 breast cancer whole-genome sequences

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nik-Zainal, Serena; Davies, Helen; Staaf, Johan

Here, we analysed whole-genome sequences of 560 breast cancers to advance understanding of the driver mutations conferring clonal advantage and the mutational processes generating somatic mutations. We found that 93 protein-coding cancer genes carried probable driver mutations. Some non-coding regions exhibited high mutation frequencies, but most have distinctive structural features probably causing elevated mutation rates and do not contain driver mutations. Mutational signature analysis was extended to genome rearrangements and revealed twelve base substitution and six rearrangement signatures. Three rearrangement signatures, characterized by tandem duplications or deletions, appear associated with defective homologous-recombination-based DNA repair: one with deficient BRCA1 function, anothermore » with deficient BRCA1 or BRCA2 function, the cause of the third is unknown. This analysis of all classes of somatic mutation across exons, introns and intergenic regions highlights the repertoire of cancer genes and mutational processes operating, and progresses towards a comprehensive account of the somatic genetic basis of breast cancer.« less
Backbone hydration determines the folding signature of amino acid residues.

PubMed

Bignucolo, Olivier; Leung, Hoi Tik Alvin; Grzesiek, Stephan; Bernèche, Simon

2015-04-08

The relation between the sequence of a protein and its three-dimensional structure remains largely unknown. A lasting dream is to elucidate the side-chain-dependent driving forces that govern the folding process. Different structural data suggest that aromatic amino acids play a particular role in the stabilization of protein structures. To better understand the underlying mechanism, we studied peptides of the sequence EGAAXAASS (X = Gly, Ile, Tyr, Trp) through comparison of molecular dynamics (MD) trajectories and NMR residual dipolar coupling (RDC) measurements. The RDC data for aromatic substitutions provide evidence for a kink in the peptide backbone. Analysis of the MD simulations shows that the formation of internal hydrogen bonds underlying a helical turn is key to reproduce the experimental RDC values. The simulations further reveal that the driving force leading to such helical-turn conformations arises from the lack of hydration of the peptide chain on either side of the bulky aromatic side chain, which can potentially act as a nucleation point initiating the folding process.
Landscape of somatic mutations in 560 breast cancer whole genome sequences

PubMed Central

Nik-Zainal, Serena; Davies, Helen; Staaf, Johan; Ramakrishna, Manasa; Glodzik, Dominik; Zou, Xueqing; Martincorena, Inigo; Alexandrov, Ludmil B.; Martin, Sancha; Wedge, David C.; Van Loo, Peter; Ju, Young Seok; Smid, Marcel; Brinkman, Arie B; Morganella, Sandro; Aure, Miriam R.; Lingjærde, Ole Christian; Langerød, Anita; Ringnér, Markus; Ahn, Sung-Min; Boyault, Sandrine; Brock, Jane E.; Broeks, Annegien; Butler, Adam; Desmedt, Christine; Dirix, Luc; Dronov, Serge; Fatima, Aquila; Foekens, John A.; Gerstung, Moritz; Hooijer, Gerrit KJ; Jang, Se Jin; Jones, David R.; Kim, Hyung-Yong; King, Tari A.; Krishnamurthy, Savitri; Lee, Hee Jin; Lee, Jeong-Yeon; Li, Yilong; McLaren, Stuart; Menzies, Andrew; Mustonen, Ville; O’Meara, Sarah; Pauporté, Iris; Pivot, Xavier; Purdie, Colin A.; Raine, Keiran; Ramakrishnan, Kamna; Rodríguez-González, F. Germán; Romieu, Gilles; Sieuwerts, Anieta M.; Simpson, Peter T; Shepherd, Rebecca; Stebbings, Lucy; Stefansson, Olafur A; Teague, Jon; Tommasi, Stefania; Treilleux, Isabelle; Van den Eynden, Gert G.; Vermeulen, Peter; Vincent-Salomon, Anne; Yates, Lucy; Caldas, Carlos; van’t Veer, Laura; Tutt, Andrew; Knappskog, Stian; Tan, Benita Kiat Tee; Jonkers, Jos; Borg, Åke; Ueno, Naoto T; Sotiriou, Christos; Viari, Alain; Futreal, P. Andrew; Campbell, Peter J; Span, Paul N.; Van Laere, Steven; Lakhani, Sunil R; Eyfjord, Jorunn E.; Thompson, Alastair M.; Birney, Ewan; Stunnenberg, Hendrik G; van de Vijver, Marc J; Martens, John W.M.; Børresen-Dale, Anne-Lise; Richardson, Andrea L.; Kong, Gu; Thomas, Gilles; Stratton, Michael R.

2016-01-01

We analysed whole genome sequences of 560 breast cancers to advance understanding of the driver mutations conferring clonal advantage and the mutational processes generating somatic mutations. 93 protein-coding cancer genes carried likely driver mutations. Some non-coding regions exhibited high mutation frequencies but most have distinctive structural features probably causing elevated mutation rates and do not harbour driver mutations. Mutational signature analysis was extended to genome rearrangements and revealed 12 base substitution and six rearrangement signatures. Three rearrangement signatures, characterised by tandem duplications or deletions, appear associated with defective homologous recombination based DNA repair: one with deficient BRCA1 function; another with deficient BRCA1 or BRCA2 function; the cause of the third is unknown. This analysis of all classes of somatic mutation across exons, introns and intergenic regions highlights the repertoire of cancer genes and mutational processes operative, and progresses towards a comprehensive account of the somatic genetic basis of breast cancer. PMID:27135926

A Stochastic Evolutionary Model for Protein Structure Alignment and Phylogeny

PubMed Central

Challis, Christopher J.; Schmidler, Scott C.

2012-01-01

We present a stochastic process model for the joint evolution of protein primary and tertiary structure, suitable for use in alignment and estimation of phylogeny. Indels arise from a classic Links model, and mutations follow a standard substitution matrix, whereas backbone atoms diffuse in three-dimensional space according to an Ornstein–Uhlenbeck process. The model allows for simultaneous estimation of evolutionary distances, indel rates, structural drift rates, and alignments, while fully accounting for uncertainty. The inclusion of structural information enables phylogenetic inference on time scales not previously attainable with sequence evolution models. The model also provides a tool for testing evolutionary hypotheses and improving our understanding of protein structural evolution. PMID:22723302
Ultrasensitive Genotypic Detection of Antiviral Resistance in Hepatitis B Virus Clinical Isolates▿ †

PubMed Central

Fang, Jie; Wichroski, Michael J.; Levine, Steven M.; Baldick, Carl J.; Mazzucco, Charles E.; Walsh, Ann W.; Kienzle, Bernadette K.; Rose, Ronald E.; Pokornowski, Kevin A.; Colonno, Richard J.; Tenney, Daniel J.

2009-01-01

Amino acid substitutions that confer reduced susceptibility to antivirals arise spontaneously through error-prone viral polymerases and are selected as a result of antiviral therapy. Resistance substitutions first emerge in a fraction of the circulating virus population, below the limit of detection by nucleotide sequencing of either the population or limited sets of cloned isolates. These variants can expand under drug pressure to dominate the circulating virus population. To enhance detection of these viruses in clinical samples, we established a highly sensitive quantitative, real-time allele-specific PCR assay for hepatitis B virus (HBV) DNA. Sensitivity was accomplished using a high-fidelity DNA polymerase and oligonucleotide primers containing locked nucleic acid bases. Quantitative measurement of resistant and wild-type variants was accomplished using sequence-matched standards. Detection methodology that was not reliant on hybridization probes, and assay modifications, minimized the effect of patient-specific sequence polymorphisms. The method was validated using samples from patients chronically infected with HBV through parallel sequencing of large numbers of cloned isolates. Viruses with resistance to lamivudine and other l-nucleoside analogs and entecavir, involving 17 different nucleotide substitutions, were reliably detected at levels at or below 0.1% of the total population. The method worked across HBV genotypes. Longitudinal analysis of patient samples showed earlier emergence of resistance on therapy than was seen with sequencing methodologies, including some cases of resistance that existed prior to treatment. In summary, we established and validated an ultrasensitive method for measuring resistant HBV variants in clinical specimens, which enabled earlier, quantitative measurement of resistance to therapy. PMID:19433559
Evolutionary distances in the twilight zone--a rational kernel approach.

PubMed

Schwarz, Roland F; Fletcher, William; Förster, Frank; Merget, Benjamin; Wolf, Matthias; Schultz, Jörg; Markowetz, Florian

2010-12-31

Phylogenetic tree reconstruction is traditionally based on multiple sequence alignments (MSAs) and heavily depends on the validity of this information bottleneck. With increasing sequence divergence, the quality of MSAs decays quickly. Alignment-free methods, on the other hand, are based on abstract string comparisons and avoid potential alignment problems. However, in general they are not biologically motivated and ignore our knowledge about the evolution of sequences. Thus, it is still a major open question how to define an evolutionary distance metric between divergent sequences that makes use of indel information and known substitution models without the need for a multiple alignment. Here we propose a new evolutionary distance metric to close this gap. It uses finite-state transducers to create a biologically motivated similarity score which models substitutions and indels, and does not depend on a multiple sequence alignment. The sequence similarity score is defined in analogy to pairwise alignments and additionally has the positive semi-definite property. We describe its derivation and show in simulation studies and real-world examples that it is more accurate in reconstructing phylogenies than competing methods. The result is a new and accurate way of determining evolutionary distances in and beyond the twilight zone of sequence alignments that is suitable for large datasets.
Transcription blockage by homopurine DNA sequences: role of sequence composition and single-strand breaks

PubMed Central

Belotserkovskii, Boris P.; Neil, Alexander J.; Saleh, Syed Shayon; Shin, Jane Hae Soo; Mirkin, Sergei M.; Hanawalt, Philip C.

2013-01-01

The ability of DNA to adopt non-canonical structures can affect transcription and has broad implications for genome functioning. We have recently reported that guanine-rich (G-rich) homopurine-homopyrimidine sequences cause significant blockage of transcription in vitro in a strictly orientation-dependent manner: when the G-rich strand serves as the non-template strand [Belotserkovskii et al. (2010) Mechanisms and implications of transcription blockage by guanine-rich DNA sequences., Proc. Natl Acad. Sci. USA, 107, 12816–12821]. We have now systematically studied the effect of the sequence composition and single-stranded breaks on this blockage. Although substitution of guanine by any other base reduced the blockage, cytosine and thymine reduced the blockage more significantly than adenine substitutions, affirming the importance of both G-richness and the homopurine-homopyrimidine character of the sequence for this effect. A single-strand break in the non-template strand adjacent to the G-rich stretch dramatically increased the blockage. Breaks in the non-template strand result in much weaker blockage signals extending downstream from the break even in the absence of the G-rich stretch. Our combined data support the notion that transcription blockage at homopurine-homopyrimidine sequences is caused by R-loop formation. PMID:23275544
MEICPS: substitution mutations to engineer intracellular protein stability.

PubMed

Reddy, B V; Ramesh, P; Tiwari, S

1998-01-01

In MEICPS, results from earlier analyses are utilized to suggest possible substitution point mutations to engineer intracellular stability using a given sequence or structure of the protein. From bvbreddy@ccmb.ap.nic.in. This program needs data from other software, PSA and SSTRUC, available from sali@tamika.rockefeller.edu and tom@cryst.bioc.cam.ac.uk, respectively. bvbreddy@ccmb.ap.nic.in
Identification of Novel Inherited Genetic Markers for Aggressive PCa in European and African Americans Using Whole Genome Sequencing

DTIC Science & Technology

2014-10-01

rs115393439 leads to the amino acid substitution from Threonine (Thr) to Proline (Pro); while rs61756080 results to the amino acid substitution from...gene encoding Krüppel-like factor 7 are associated with type 2 diabetes . Diabetologia. 2005 Jul;48(7):1315-22. Epub 2005 Jun 4. Karlsson R, Aly M
cis-β-Bromostyrene derivatives from cinnamic acids via a tandem substitutive bromination-decarboxylation sequence.

PubMed

Tang, Khanh G; Kent, Greggory T; Erden, Ihsan; Wu, Weiming

2017-10-04

cis -β-Bromostyrene derivatives were synthesized stereospecifically from cinnamic acids through β-lactone intermediates. The synthetic sequence did not require the purification of the β-lactone intermediates although they were found to be stable and readily purified in most cases.
Short template switch events explain mutation clusters in the human genome.

PubMed

Löytynoja, Ari; Goldman, Nick

2017-06-01

Resequencing efforts are uncovering the extent of genetic variation in humans and provide data to study the evolutionary processes shaping our genome. One recurring puzzle in both intra- and inter-species studies is the high frequency of complex mutations comprising multiple nearby base substitutions or insertion-deletions. We devised a generalized mutation model of template switching during replication that extends existing models of genome rearrangement and used this to study the role of template switch events in the origin of short mutation clusters. Applied to the human genome, our model detects thousands of template switch events during the evolution of human and chimp from their common ancestor and hundreds of events between two independently sequenced human genomes. Although many of these are consistent with a template switch mechanism previously proposed for bacteria, our model also identifies new types of mutations that create short inversions, some flanked by paired inverted repeats. The local template switch process can create numerous complex mutation patterns, including hairpin loop structures, and explains multinucleotide mutations and compensatory substitutions without invoking positive selection, speculative mechanisms, or implausible coincidence. Clustered sequence differences are challenging for current mapping and variant calling methods, and we show that many erroneous variant annotations exist in human reference data. Local template switch events may have been neglected as an explanation for complex mutations because of biases in commonly used analyses. Incorporation of our model into reference-based analysis pipelines and comparisons of de novo assembled genomes will lead to improved understanding of genome variation and evolution. © 2017 Löytynoja and Goldman; Published by Cold Spring Harbor Laboratory Press.
Statistical Linkage Analysis of Substitutions in Patient-Derived Sequences of Genotype 1a Hepatitis C Virus Nonstructural Protein 3 Exposes Targets for Immunogen Design

PubMed Central

Quadeer, Ahmed A.; Louie, Raymond H. Y.; Shekhar, Karthik; Chakraborty, Arup K.; Hsing, I-Ming

2014-01-01

ABSTRACT Chronic hepatitis C virus (HCV) infection is one of the leading causes of liver failure and liver cancer, affecting around 3% of the world's population. The extreme sequence variability of the virus resulting from error-prone replication has thwarted the discovery of a universal prophylactic vaccine. It is known that vigorous and multispecific cellular immune responses, involving both helper CD4+ and cytotoxic CD8+ T cells, are associated with the spontaneous clearance of acute HCV infection. Escape mutations in viral epitopes can, however, abrogate protective T-cell responses, leading to viral persistence and associated pathologies. Despite the propensity of the virus to mutate, there might still exist substitutions that incur a fitness cost. In this paper, we identify groups of coevolving residues within HCV nonstructural protein 3 (NS3) by analyzing diverse sequences of this protein using ideas from random matrix theory and associated methods. Our analyses indicate that one of these groups comprises a large percentage of residues for which HCV appears to resist multiple simultaneous substitutions. Targeting multiple residues in this group through vaccine-induced immune responses should either lead to viral recognition or elicit escape substitutions that compromise viral fitness. Our predictions are supported by published clinical data, which suggested that immune genotypes associated with spontaneous clearance of HCV preferentially recognized and targeted this vulnerable group of residues. Moreover, mapping the sites of this group onto the available protein structure provided insight into its functional significance. An epitope-based immunogen is proposed as an alternative to the NS3 epitopes in the peptide-based vaccine IC41. IMPORTANCE Despite much experimental work on HCV, a thorough statistical study of the HCV sequences for the purpose of immunogen design was missing in the literature. Such a study is vital to identify epistatic couplings among residues that can provide useful insights for designing a potent vaccine. In this work, ideas from random matrix theory were applied to characterize the statistics of substitutions within the diverse publicly available sequences of the genotype 1a HCV NS3 protein, leading to a group of sites for which HCV appears to resist simultaneous substitutions possibly due to deleterious effect on viral fitness. Our analysis leads to completely novel immunogen designs for HCV. In addition, the NS3 epitopes used in the recently proposed peptide-based vaccine IC41 were analyzed in the context of our framework. Our analysis predicts that alternative NS3 epitopes may be worth exploring as they might be more efficacious. PMID:24760894
Conservation of hot regions in protein-protein interaction in evolution.

PubMed

Hu, Jing; Li, Jiarui; Chen, Nansheng; Zhang, Xiaolong

2016-11-01

The hot regions of protein-protein interactions refer to the active area which formed by those most important residues to protein combination process. With the research development on protein interactions, lots of predicted hot regions can be discovered efficiently by intelligent computing methods, while performing biology experiments to verify each every prediction is hardly to be done due to the time-cost and the complexity of the experiment. This study based on the research of hot spot residue conservations, the proposed method is used to verify authenticity of predicted hot regions that using machine learning algorithm combined with protein's biological features and sequence conservation, though multiple sequence alignment, module substitute matrix and sequence similarity to create conservation scoring algorithm, and then using threshold module to verify the conservation tendency of hot regions in evolution. This research work gives an effective method to verify predicted hot regions in protein-protein interactions, which also provides a useful way to deeply investigate the functional activities of protein hot regions. Copyright © 2016. Published by Elsevier Inc.
CRISPR/Cas9 for genome editing: progress, implications and challenges.

PubMed

Zhang, Feng; Wen, Yan; Guo, Xiong

2014-09-15

Clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) protein 9 system provides a robust and multiplexable genome editing tool, enabling researchers to precisely manipulate specific genomic elements, and facilitating the elucidation of target gene function in biology and diseases. CRISPR/Cas9 comprises of a nonspecific Cas9 nuclease and a set of programmable sequence-specific CRISPR RNA (crRNA), which can guide Cas9 to cleave DNA and generate double-strand breaks at target sites. Subsequent cellular DNA repair process leads to desired insertions, deletions or substitutions at target sites. The specificity of CRISPR/Cas9-mediated DNA cleavage requires target sequences matching crRNA and a protospacer adjacent motif locating at downstream of target sequences. Here, we review the molecular mechanism, applications and challenges of CRISPR/Cas9-mediated genome editing and clinical therapeutic potential of CRISPR/Cas9 in future. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Distinct Escape Pathway by Hepatitis C Virus Genotype 1a from a Dominant CD8+ T Cell Response by Selection of Altered Epitope Processing.

PubMed

Walker, Andreas; Skibbe, Kathrin; Steinmann, Eike; Pfaender, Stephanie; Kuntzen, Thomas; Megger, Dominik A; Groten, Svenja; Sitek, Barbara; Lauer, Georg M; Kim, Arthur Y; Pietschmann, Thomas; Allen, Todd M; Timm, Joerg

2016-01-01

Antiviral CD8(+) T cells are a key component of the adaptive immune response against HCV, but their impact on viral control is influenced by preexisting viral variants in important target epitopes and the development of viral escape mutations. Immunodominant epitopes highly conserved across genotypes therefore are attractive for T cell based prophylactic vaccines. Here, we characterized the CD8(+) T cell response against the highly conserved HLA-B*51-restricted epitope IPFYGKAI1373-1380 located in the helicase domain of NS3 in people who inject drugs (PWID) exposed predominantly to HCV genotypes 1a and 3a. Despite this epitope being conserved in both genotypes, the corresponding CD8(+) T cell response was detected only in PWID infected with genotype 3a and HCV-RNA negative PWID, but not in PWID infected with genotype 1a. In genotype 3a, the detection of strong CD8(+) T cell responses was associated with epitope variants in the autologous virus consistent with immune escape. Analysis of viral sequences from multiple cohorts confirmed HLA-B*51-associated escape mutations inside the epitope in genotype 3a, but not in genotype 1a. Here, a distinct substitution in the N-terminal flanking region located 5 residues upstream of the epitope (S1368P; P = 0.00002) was selected in HLA-B*51-positive individuals. Functional assays revealed that the S1368P substitution impaired recognition of target cells presenting the endogenously processed epitope. The results highlight that, despite an epitope being highly conserved between two genotypes, there are major differences in the selected viral escape pathways and the corresponding T cell responses. HCV is able to evolutionary adapt to CD8(+) T cell immune pressure in multiple ways. Beyond selection of mutations inside targeted epitopes, this study demonstrates that HCV inhibits epitope processing by modification of the epitope flanking region under T cell immune pressure. Selection of a substitution five amino acids upstream of the epitope underlines that efficient antigen presentation strongly depends on its larger sequence context and that blocking of the multistep process of antigen processing by mutation is exploited also by HCV. The pathways to mutational escape of HCV are to some extent predictable but are distinct in different genotypes. Importantly, the selected escape pathway of HCV may have consequences for the destiny of antigen-specific CD8(+) T cells. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Designing robust watermark barcodes for multiplex long-read sequencing.

PubMed

Ezpeleta, Joaquín; Krsticevic, Flavia J; Bulacio, Pilar; Tapia, Elizabeth

2017-03-15

To attain acceptable sample misassignment rates, current approaches to multiplex single-molecule real-time sequencing require upstream quality improvement, which is obtained from multiple passes over the sequenced insert and significantly reduces the effective read length. In order to fully exploit the raw read length on multiplex applications, robust barcodes capable of dealing with the full single-pass error rates are needed. We present a method for designing sequencing barcodes that can withstand a large number of insertion, deletion and substitution errors and are suitable for use in multiplex single-molecule real-time sequencing. The manuscript focuses on the design of barcodes for full-length single-pass reads, impaired by challenging error rates in the order of 11%. The proposed barcodes can multiplex hundreds or thousands of samples while achieving sample misassignment probabilities as low as 10-7 under the above conditions, and are designed to be compatible with chemical constraints imposed by the sequencing process. Software tools for constructing watermark barcode sets and demultiplexing barcoded reads, together with example sets of barcodes and synthetic barcoded reads, are freely available at www.cifasis-conicet.gov.ar/ezpeleta/NS-watermark . ezpeleta@cifasis-conicet.gov.ar. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
A nucleotide substitution in one of the beta-tubulin genes of Trichoderma viride confers resistance to the antimitotic drug methyl benzimidazole-2-yl-carbamate.

PubMed

Goldman, G H; Temmerman, W; Jacobs, D; Contreras, R; Van Montagu, M; Herrera-Estrella, A

1993-07-01

We characterized a Trichoderma viride strain that is resistant to the antimitotic drug methyl benzimidazole-2-yl-carbamate (MBC). This species has two beta-tubulin genes (tub1 and tub2) and by reverse genetics we showed that a mutation in the tub2 gene confers MBC resistance in this strain. Comparison of the tub2 sequence of the mutant strain with that of the wild type revealed that a single amino acid substitution of tyrosine for histidine at a position 6 is responsible for the MBC tolerance. Furthermore, we showed that this gene can be used as a homologous dominant selectable marker in T. viride transformation. Both tubulin genes were completely sequenced. They differ by 48 residues and the degree of identity between their deduced amino acid sequences is 86.3%.
A history estimate and evolutionary analysis of rabies virus variants in China.

PubMed

Ming, Pinggang; Yan, Jiaxin; Rayner, Simon; Meng, Shengli; Xu, Gelin; Tang, Qing; Wu, Jie; Luo, Jing; Yang, Xiaoming

2010-03-01

To investigate the evolutionary dynamics of rabies virus (RABV) in China, we collected and sequenced 55 isolates sampled from 14 Chinese provinces over the last 40 years and performed a coalescent-based analysis of the G gene. This revealed that the RABV currently circulating in China is composed of three main groups. Bayesian coalescent analysis estimated the date of the most recent common ancestor for the current RABV Chinese strains to be 1412 (with a 95 % confidence interval of 1006-1736). The estimated mean substitution rate for the G gene sequences (3.961x10(-4) substitutions per site per year) was in accordance with previous reports for RABV.
Variants of glycoside hydrolases

DOEpatents

Teter, Sarah; Ward, Connie; Cherry, Joel; Jones, Aubrey; Harris, Paul; Yi, Jung

2013-02-26

The present invention relates to variants of a parent glycoside hydrolase, comprising a substitution at one or more positions corresponding to positions 21, 94, 157, 205, 206, 247, 337, 350, 373, 383, 438, 455, 467, and 486 of amino acids 1 to 513 of SEQ ID NO: 2, and optionally further comprising a substitution at one or more positions corresponding to positions 8, 22, 41, 49, 57, 113, 193, 196, 226, 227, 246, 251, 255, 259, 301, 356, 371, 411, and 462 of amino acids 1 to 513 of SEQ ID NO: 2 a substitution at one or more positions corresponding to positions 8, 22, 41, 49, 57, 113, 193, 196, 226, 227, 246, 251, 255, 259, 301, 356, 371, 411, and 462 of amino acids 1 to 513 of SEQ ID NO: 2, wherein the variants have glycoside hydrolase activity. The present invention also relates to nucleotide sequences encoding the variant glycoside hydrolases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.
Variants of glycoside hydrolases

DOEpatents

Teter, Sarah [Davis, CA; Ward, Connie [Hamilton, MT; Cherry, Joel [Davis, CA; Jones, Aubrey [Davis, CA; Harris, Paul [Carnation, WA; Yi, Jung [Sacramento, CA

2011-04-26

The present invention relates to variants of a parent glycoside hydrolase, comprising a substitution at one or more positions corresponding to positions 21, 94, 157, 205, 206, 247, 337, 350, 373, 383, 438, 455, 467, and 486 of amino acids 1 to 513 of SEQ ID NO: 2, and optionally further comprising a substitution at one or more positions corresponding to positions 8, 22, 41, 49, 57, 113, 193, 196, 226, 227, 246, 251, 255, 259, 301, 356, 371, 411, and 462 of amino acids 1 to 513 of SEQ ID NO: 2 a substitution at one or more positions corresponding to positions 8, 22, 41, 49, 57, 113, 193, 196, 226, 227, 246, 251, 255, 259, 301, 356, 371, 411, and 462 of amino acids 1 to 513 of SEQ ID NO: 2, wherein the variants have glycoside hydrolase activity. The present invention also relates to nucleotide sequences encoding the variant glycoside hydrolases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.
Polymorphism of follicle stimulating hormone beta (FSHβ) subunit gene and its association with litter traits in giant panda.

PubMed

Huang, Xiaoyu; Li, Desheng; Wang, Jiwen; Huang, Yan; Han, Chunchun; Zhang, Guiquan; Huang, Zhi; Wu, Honglin; Wei, Ming; Wang, Guosong; Hu, Haiping; Deng, Tao; He, Tao; Zhou, Yingming; Song, Shixian; Luo, Bo; Zhang, Heming

2013-11-01

The different SSCP patterns of the follicle stimulating hormone beta (FSHβ) gene amplified by three pairs of primers were sequenced. Comparisons among the three nucleotide sequences of three genotypes indicated that three base substitutions (A213T, A91G, and A89C) were detected in FSHβ gene, which A213T substitution led to one amino acids mutation (Lys > Met), and the other two substitutions were synonymous mutations. The AA, AB and BB genotypes patterns obtained by FSHβ primer1 had evident relation with the litter traits, but the SSCP genotypes patterns obtained by FSHβ primer2 and primer3 had no evident relation with the litter traits in giant panda. The giant panda with AA and AB genotype had the largest litter size and multiparity rate compared with the BB genotypes (P < 0.05). We speculated that the giant pandas with the A allele have better litter traits than those with the B allele.
Variants of glycoside hydrolases

DOEpatents

Teter, Sarah; Ward, Connie; Cherry, Joel; Jones, Aubrey; Harris, Paul; Yi, Jung

2017-07-11

The present invention relates to variants of a parent glycoside hydrolase, comprising a substitution at one or more positions corresponding to positions 21, 94, 157, 205, 206, 247, 337, 350, 373, 383, 438, 455, 467, and 486 of amino acids 1 to 513 of SEQ ID NO: 2, and optionally further comprising a substitution at one or more positions corresponding to positions 8, 22, 41, 49, 57, 113, 193, 196, 226, 227, 246, 251, 255, 259, 301, 356, 371, 411, and 462 of amino acids 1 to 513 of SEQ ID NO: 2 a substitution at one or more positions corresponding to positions 8, 22, 41, 49, 57, 113, 193, 196, 226, 227, 246, 251, 255, 259, 301, 356, 371, 411, and 462 of amino acids 1 to 513 of SEQ ID NO: 2, wherein the variants have glycoside hydrolase activity. The present invention also relates to nucleotide sequences encoding the variant glycoside hydrolases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.
Analysis of correlated mutations in HIV-1 protease using spectral clustering.

PubMed

Liu, Ying; Eyal, Eran; Bahar, Ivet

2008-05-15

The ability of human immunodeficiency virus-1 (HIV-1) protease to develop mutations that confer multi-drug resistance (MDR) has been a major obstacle in designing rational therapies against HIV. Resistance is usually imparted by a cooperative mechanism that can be elucidated by a covariance analysis of sequence data. Identification of such correlated substitutions of amino acids may be obscured by evolutionary noise. HIV-1 protease sequences from patients subjected to different specific treatments (set 1), and from untreated patients (set 2) were subjected to sequence covariance analysis by evaluating the mutual information (MI) between all residue pairs. Spectral clustering of the resulting covariance matrices disclosed two distinctive clusters of correlated residues: the first, observed in set 1 but absent in set 2, contained residues involved in MDR acquisition; and the second, included those residues differentiated in the various HIV-1 protease subtypes, shortly referred to as the phylogenetic cluster. The MDR cluster occupies sites close to the central symmetry axis of the enzyme, which overlap with the global hinge region identified from coarse-grained normal-mode analysis of the enzyme structure. The phylogenetic cluster, on the other hand, occupies solvent-exposed and highly mobile regions. This study demonstrates (i) the possibility of distinguishing between the correlated substitutions resulting from neutral mutations and those induced by MDR upon appropriate clustering analysis of sequence covariance data and (ii) a connection between global dynamics and functional substitution of amino acids.

Ultra-Deep Sequencing Analysis of the Hepatitis A Virus 5'-Untranslated Region among Cases of the Same Outbreak from a Single Source

PubMed Central

Wu, Shuang; Nakamoto, Shingo; Kanda, Tatsuo; Jiang, Xia; Nakamura, Masato; Miyamura, Tatsuo; Shirasawa, Hiroshi; Sugiura, Nobuyuki; Takahashi-Nakaguchi, Azusa; Gonoi, Tohru; Yokosuka, Osamu

2014-01-01

Hepatitis A virus (HAV) is a causative agent of acute viral hepatitis for which an effective vaccine has been developed. Here we describe ultra-deep pyrosequences (UDPSs) of HAV 5'-untranslated region (5'UTR) among cases of the same outbreak, which arose from a single source, associated with a revolving sushi bar. We determined the reference sequence from HAV-derived clone from an attendant by the Sanger method. Sixteen UDPSs from this outbreak and one from another sporadic case were compared with this reference. Nucleotide errors yielded a UDPS error rate of < 1%. This study confirmed that nucleotide substitutions of this region are transition mutations in outbreak cases, that insertion was observed only in non-severe cases, and that these nucleotide substitutions were different from those of the sporadic case. Analysis of UDPSs detected low-prevalence HAV variations in 5'UTR, but no specific mutations associated with severity in these outbreak cases. To our surprise, HAV strains in this outbreak conserved HAV IRES sequence even if we performed analysis of UDPSs. UDPS analysis of HAV 5'UTR gave us no association between the disease severity of hepatitis A and HAV 5'UTR substitutions. It might be more interesting to perform ultra-deep sequencing of full length HAV genome in order to reveal possible unknown genomic determinants associated with disease severity. Further studies will be needed. PMID:24396287
Zinc-binding Domain of the Bacteriophage T7 DNA Primase Modulates Binding to the DNA Template*

PubMed Central

Lee, Seung-Joo; Zhu, Bin; Akabayov, Barak; Richardson, Charles C.

2012-01-01

The zinc-binding domain (ZBD) of prokaryotic DNA primases has been postulated to be crucial for recognition of specific sequences in the single-stranded DNA template. To determine the molecular basis for this role in recognition, we carried out homolog-scanning mutagenesis of the zinc-binding domain of DNA primase of bacteriophage T7 using a bacterial homolog from Geobacillus stearothermophilus. The ability of T7 DNA primase to catalyze template-directed oligoribonucleotide synthesis is eliminated by substitution of any five-amino acid residue-long segment within the ZBD. The most significant defect occurs upon substitution of a region (Pro-16 to Cys-20) spanning two cysteines that coordinate the zinc ion. The role of this region in primase function was further investigated by generating a protein library composed of multiple amino acid substitutions for Pro-16, Asp-18, and Asn-19 followed by genetic screening for functional proteins. Examination of proteins selected from the screening reveals no change in sequence-specific recognition. However, the more positively charged residues in the region facilitate DNA binding, leading to more efficient oligoribonucleotide synthesis on short templates. The results suggest that the zinc-binding mode alone is not responsible for sequence recognition, but rather its interaction with the RNA polymerase domain is critical for DNA binding and for sequence recognition. Consequently, any alteration in the ZBD that disturbs its conformation leads to loss of DNA-dependent oligoribonucleotide synthesis. PMID:23024359
Amino acid substitutions in the thymidine kinase gene of induced acyclovir-resistant herpes simplex virus type 1

NASA Astrophysics Data System (ADS)

Hussin, Ainulkhir; Nor, Norefrina Shafinaz Md; Ibrahim, Nazlina

2013-11-01

Acyclovir (ACV) is an antiviral drug of choice in healthcare setting to treat infections caused by herpes viruses, including, but not limited to genital herpes, cold sores, shingles and chicken pox. Acyclovir resistance has emerged significantly due to extensive use and misuse of this antiviral in human, especially in immunocompromised patients. However, it remains unclear about the amino acid substitutions in thymidine (TK) gene, which specifically confer the resistance-associated mutation in herpes simplex virus. Hence, acyclovir-resistant HSV-1 was selected at high concentration (2.0 - 4.5 μg/mL), and the TK-gene was subjected to sequencing and genotypic characterization. Genotypic sequences comparison was done using HSV-1 17 (GenBank Accesion no. X14112) for resistance-associated mutation determination whereas HSV-1 KOS, HSV-1 473/08 and HSV clinical isolates sequences were used for polymorphism-associated mutation. The result showed that amino acid substitutions at the non-conserved region (UKM-1: Gln34Lys, UKM-2: Arg32Ser & UKM-5: Arg32Cys) and ATP-binding site (UKM-3: Tyr53End & UKM-4: Ile54Leu) of the TK-gene. These discoveries play an important role to extend another dimension to the evolution of acyclovir-resistant HSV-1 and suggest that selection at high ACV concentration induced ACV-resistant HSV-1 evolution. These findings also expand the knowledge on the type of mutations among acyclovir-resistant HSV-1. In conclusion, HSV-1 showed multiple strategies to exhibit acyclovir resistance, including amino acid substitutions in the TK gene.
Genome-wide signatures of convergent evolution in echolocating mammals

PubMed Central

Parker, Joe; Tsagkogeorga, Georgia; Cotton, James A.; Liu, Yuan; Provero, Paolo; Stupka, Elia; Rossiter, Stephen J.

2013-01-01

Evolution is typically thought to proceed through divergence of genes, proteins, and ultimately phenotypes1-3. However, similar traits might also evolve convergently in unrelated taxa due to similar selection pressures4,5. Adaptive phenotypic convergence is widespread in nature, and recent results from a handful of genes have suggested that this phenomenon is powerful enough to also drive recurrent evolution at the sequence level6-9. Where homoplasious substitutions do occur these have long been considered the result of neutral processes. However, recent studies have demonstrated that adaptive convergent sequence evolution can be detected in vertebrates using statistical methods that model parallel evolution9,10 although the extent to which sequence convergence between genera occurs across genomes is unknown. Here we analyse genomic sequence data in mammals that have independently evolved echolocation and show for the first time that convergence is not a rare process restricted to a handful of loci but is instead widespread, continuously distributed and commonly driven by natural selection acting on a small number of sites per locus. Systematic analyses of convergent sequence evolution in 805,053 amino acids within 2,326 orthologous coding gene sequences compared across 22 mammals (including four new bat genomes) revealed signatures consistent with convergence in nearly 200 loci. Strong and significant support for convergence among bats and the dolphin was seen in numerous genes linked to hearing or deafness, consistent with an involvement in echolocation. Surprisingly we also found convergence in many genes linked to vision: the convergent signal of many sensory genes was robustly correlated with the strength of natural selection. This first attempt to detect genome-wide convergent sequence evolution across divergent taxa reveals the phenomenon to be much more pervasive than previously recognised. PMID:24005325
The Complete Nucleotide Sequence of the Mitochondrial Genome of Bactrocera minax (Diptera: Tephritidae)

PubMed Central

Zhang, Bin; Nardi, Francesco; Hull-Sanders, Helen; Wan, Xuanwu; Liu, Yinghong

2014-01-01

The complete 16,043 bp mitochondrial genome (mitogenome) of Bactrocera minax (Diptera: Tephritidae) has been sequenced. The genome encodes 37 genes usually found in insect mitogenomes. The mitogenome information for B. minax was compared to the homologous sequences of Bactrocera oleae, Bactrocera tryoni, Bactrocera philippinensis, Bactrocera carambolae, Bactrocera papayae, Bactrocera dorsalis, Bactrocera correcta, Bactrocera cucurbitae and Ceratitis capitata. The analysis indicated the structure and organization are typical of, and similar to, the nine closely related species mentioned above, although it contains the lowest genome-wide A+T content (67.3%). Four short intergenic spacers with a high degree of conservation among the nine tephritid species mentioned above and B. minax were observed, which also have clear counterparts in the control regions (CRs). Correlation analysis among these ten tephritid species revealed close positive correlation between the A+T content of zero-fold degenerate sites (P0FD), the ratio of nucleotide substitution frequency at P0FD sites to all degenerate sites (zero-fold degenerate sites, two-fold degenerate sites and four-fold degenerate sites) and amino acid sequence distance (ASD) were found. Further, significant positive correlation was observed between the A+T content of four-fold degenerate sites (P4FD) and the ratio of nucleotide substitution frequency at P4FD sites to all degenerate sites; however, we found significant negative correlation between ASD and the A+T content of P4FD, and the ratio of nucleotide substitution frequency at P4FD sites to all degenerate sites. A higher nucleotide substitution frequency at non-synonymous sites compared to synonymous sites was observed in nad4, the first time that has been observed in an insect mitogenome. A poly(T) stretch at the 5′ end of the CR followed by a [TA(A)]n-like stretch was also found. In addition, a highly conserved G+A-rich sequence block was observed in front of the poly(T) stretch among the ten tephritid species and two tandem repeats were present in the CR. PMID:24964138
PHYSICO2: an UNIX based standalone procedure for computation of physicochemical, window-dependent and substitution based evolutionary properties of protein sequences along with automated block preparation tool, version 2.

PubMed

Banerjee, Shyamashree; Gupta, Parth Sarthi Sen; Nayek, Arnab; Das, Sunit; Sur, Vishma Pratap; Seth, Pratyay; Islam, Rifat Nawaz Ul; Bandyopadhyay, Amal K

2015-01-01

Automated genome sequencing procedure is enriching the sequence database very fast. To achieve a balance between the entry of sequences in the database and their analyses, efficient software is required. In this end PHYSICO2, compare to earlier PHYSICO and other public domain tools, is most efficient in that it i] extracts physicochemical, window-dependent and homologousposition-based-substitution (PWS) properties including positional and BLOCK-specific diversity and conservation, ii] provides users with optional-flexibility in setting relevant input-parameters, iii] helps users to prepare BLOCK-FASTA-file by the use of Automated Block Preparation Tool of the program, iv] performs fast, accurate and user-friendly analyses and v] redirects itemized outputs in excel format along with detailed methodology. The program package contains documentation describing application of methods. Overall the program acts as efficient PWS-analyzer and finds application in sequence-bioinformatics. PHYSICO2: is freely available at http://sourceforge.net/projects/physico2/ along with its documentation at https://sourceforge.net/projects/physico2/files/Documentation.pdf/download for all users.
PHYSICO2: an UNIX based standalone procedure for computation of physicochemical, window-dependent and substitution based evolutionary properties of protein sequences along with automated block preparation tool, version 2

PubMed Central

Banerjee, Shyamashree; Gupta, Parth Sarthi Sen; Nayek, Arnab; Das, Sunit; Sur, Vishma Pratap; Seth, Pratyay; Islam, Rifat Nawaz Ul; Bandyopadhyay, Amal K

2015-01-01

Automated genome sequencing procedure is enriching the sequence database very fast. To achieve a balance between the entry of sequences in the database and their analyses, efficient software is required. In this end PHYSICO2, compare to earlier PHYSICO and other public domain tools, is most efficient in that it i] extracts physicochemical, window-dependent and homologousposition-based-substitution (PWS) properties including positional and BLOCK-specific diversity and conservation, ii] provides users with optional-flexibility in setting relevant input-parameters, iii] helps users to prepare BLOCK-FASTA-file by the use of Automated Block Preparation Tool of the program, iv] performs fast, accurate and user-friendly analyses and v] redirects itemized outputs in excel format along with detailed methodology. The program package contains documentation describing application of methods. Overall the program acts as efficient PWS-analyzer and finds application in sequence-bioinformatics. Availability PHYSICO2: is freely available at http://sourceforge.net/projects/physico2/ along with its documentation at https://sourceforge.net/projects/physico2/files/Documentation.pdf/download for all users. PMID:26339154
Plastome Sequences of Lygodium japonicum and Marsilea crenata Reveal the Genome Organization Transformation from Basal Ferns to Core Leptosporangiates

PubMed Central

Gao, Lei; Wang, Bo; Wang, Zhi-Wei; Zhou, Yuan; Su, Ying-Juan; Wang, Ting

2013-01-01

Previous studies have shown that core leptosporangiates, the most species-rich group of extant ferns (monilophytes), have a distinct plastid genome (plastome) organization pattern from basal fern lineages. However, the details of genome structure transformation from ancestral ferns to core leptosporangiates remain unclear because of limited plastome data available. Here, we have determined the complete chloroplast genome sequences of Lygodium japonicum (Lygodiaceae), a member of schizaeoid ferns (Schizaeales), and Marsilea crenata (Marsileaceae), a representative of heterosporous ferns (Salviniales). The two species represent the sister and the basal lineages of core leptosporangiates, respectively, for which the plastome sequences are currently unavailable. Comparative genomic analysis of all sequenced fern plastomes reveals that the gene order of L. japonicum plastome occupies an intermediate position between that of basal ferns and core leptosporangiates. The two exons of the fern ndhB gene have a unique pattern of intragenic copy number variances. Specifically, the substitution rate heterogeneity between the two exons is congruent with their copy number changes, confirming the constraint role that inverted repeats may play on the substitution rate of chloroplast gene sequences. PMID:23821521
Isosteric And Non-Isosteric Base Pairs In RNA Motifs: Molecular Dynamics And Bioinformatics Study Of The Sarcin-Ricin Internal Loop

PubMed Central

Havrila, Marek; Réblová, Kamila; Zirbel, Craig L.; Leontis, Neocles B.; Šponer, Jiří

2013-01-01

The Sarcin-Ricin RNA motif (SR motif) is one of the most prominent recurrent RNA building blocks that occurs in many different RNA contexts and folds autonomously, i.e., in a context-independent manner. In this study, we combined bioinformatics analysis with explicit-solvent molecular dynamics (MD) simulations to better understand the relation between the RNA sequence and the evolutionary patterns of SR motif. SHAPE probing experiment was also performed to confirm fidelity of MD simulations. We identified 57 instances of the SR motif in a non-redundant subset of the RNA X-ray structure database and analyzed their basepairing, base-phosphate, and backbone-backbone interactions. We extracted sequences aligned to these instances from large ribosomal RNA alignments to determine frequency of occurrence for different sequence variants. We then used a simple scoring scheme based on isostericity to suggest 10 sequence variants with highly variable expected degree of compatibility with the SR motif 3D structure. We carried out MD simulations of SR motifs with these base substitutions. Non isosteric base substitutions led to unstable structures, but so did isosteric substitutions which were unable to make key base-phosphate interactions. MD technique explains why some potentially isosteric SR motifs are not realized during evolution. We also found that inability to form stable cWW geometry is an important factor in case of the first base pair of the flexible region of the SR motif. Comparison of structural, bioinformatics, SHAPE probing and MD simulation data reveals that explicit solvent MD simulations neatly reflect viability of different sequence variants of the SR motif. Thus, MD simulations can efficiently complement bioinformatics tools in studies of conservation patterns of RNA motifs and provide atomistic insight into the role of their different signature interactions. PMID:24144333
Aromatic residues engineered into the beta-turn nucleation site of ubiquitin lead to a complex folding landscape, non-native side-chain interactions, and kinetic traps.

PubMed

Rea, Anita M; Simpson, Emma R; Meldrum, Jill K; Williams, Huw E L; Searle, Mark S

2008-12-02

The fast folding of small proteins is likely to be the product of evolutionary pressures that balance the search for native-like contacts in the transition state with the minimum number of stable non-native interactions that could lead to partially folded states prone to aggregation and amyloid formation. We have investigated the effects of non-native interactions on the folding landscape of yeast ubiquitin by introducing aromatic substitutions into the beta-turn region of the N-terminal beta-hairpin, using both the native G-bulged type I turn sequence (TXTGK) as well as an engineered 2:2 XNGK type I' turn sequence. The N-terminal beta-hairpin is a recognized folding nucleation site in ubiquitin. The folding kinetics for wt-Ub (TLTGK) and the type I' turn mutant (TNGK) reveal only a weakly populated intermediate, however, substitution with X = Phe or Trp in either context results in a high propensity to form a stable compact intermediate where the initial U-->I collapse is visible as a distinct kinetic phase. The introduction of Trp into either of the two host turn sequences results in either complex multiphase kinetics with the possibility of parallel folding pathways, or formation of a highly compact I-state stabilized by non-native interactions that must unfold before refolding. Sequence substitutions with aromatic residues within a localized beta-turn capable of forming non-native hydrophobic contacts in both the native state and partially folded states has the undesirable consequence that folding is frustrated by the formation of stable compact intermediates that evolutionary pressures at the sequence level may have largely eliminated.
Program Criteria Specifications Document. Computer Program TWDA for Design and Analysis of Inverted-T Retaining Walls and Floodwalls.

DTIC Science & Technology

1981-02-01

or analysis IloduIls,* each pCr forming one specific step in the design or analysis process. These modules will be callable , in any logical sequence...tempt to 1)l 1cC Cind cut of I bar, hut Will slow the required steel area and bond r i u I rl- t t)s per I oot at Uitablt intervals across the base... bond strength) shall be as required in ACI 318-71 Chapter 12, except that computed shear V shall be multiplied by 2.0 and substituted for V u. Tn
Mutations of the phage lambda nutL region that prevent the action of Nun, a site-specific transcription termination factor.

PubMed Central

Baron, J; Weisberg, R A

1992-01-01

Phage HK022 encodes a protein, Nun, that promotes transcription termination within the pL and pR operons of its relative, phage lambda. The lambda sequences required for termination had previously been shown to overlap the nut sites, which are essential for transcription antitermination during normal lambda growth. To further specify the Nun target and to determine its relation to the nut sites, we constructed deletion and base substitution mutations of the lambda nutL region and measured Nun-dependent reduction of the expression of a downstream reporter gene. The shortest construct that retained full Nun responsiveness was a 42-bp segment that included both boxA and boxB, sequences that have been implicated in lambda antitermination. Deletion of boxA reduced Nun termination, and deletion of both sequences eliminated Nun termination. Base substitutions in boxA and the proximal portion of boxB impaired Nun termination, while base substitutions between boxA and boxB, in the distal portion of boxB, and immediately downstream from boxB had no appreciable effect. The termination defect of all of the base substitution mutations was relieved by increasing the level of Nun protein; in contrast, the deletions and a multiple-base substitution did not regain full Nun responsiveness at elevated Nun concentrations. We also asked if these mutant nut regions retained their ability to interact with N, the lambda-encoded antitermination protein. A qualitative assay showed that mutations within boxA or boxB reduced interaction, while mutations outside boxA and boxB did not. These data show that (i) the recognition sites for N and Nun overlap to a very considerable extent but are probably not identical and (ii) a high concentration of Nun promotes its interaction with mutant nut sites, a behavior also reported to be characteristic of N. PMID:1532174
Surface targeting of the dopamine transporter involves discrete epitopes in the distal C terminus but does not require canonical PDZ domain interactions.

PubMed

Bjerggaard, Christian; Fog, Jacob U; Hastrup, Hanne; Madsen, Kenneth; Loland, Claus J; Javitch, Jonathan A; Gether, Ulrik

2004-08-04

The human dopamine transporter (hDAT) contains a C-terminal type 2 PDZ (postsynaptic density 95/Discs large/zona occludens 1) domain-binding motif (LKV) known to interact with PDZ domain proteins such as PICK1 (protein interacting with C-kinase 1). As reported previously, we found that, after deletion of this motif, hDAT was retained in the endoplasmic reticulum (ER) of human embryonic kidney (HEK) 293 and Neuro2A cells, suggesting that PDZ domain interactions might be critical for hDAT targeting. Nonetheless, substitution of LKV with SLL, the type 1 PDZ-binding sequence from the beta2-adrenergic receptor, did not disrupt plasma membrane targeting. Moreover, the addition of an alanine to the hDAT C terminus (+Ala), resulting in an LKVA termination sequence, or substitution of LKV with alanines (3xAla_618-620) prevented neither plasma membrane targeting nor targeting into sprouting neurites of differentiated N2A cells. The inability of +Ala and 3xAla_618-620 to bind PDZ domains was confirmed by lack of colocalization with PICK1 in cotransfected HEK293 cells and by the inability of corresponding C-terminal fusion proteins to pull down purified PICK1. Thus, although residues in the hDAT C terminus are indispensable for proper targeting, PDZ domain interactions are not required. By progressive substitutions with beta2-adrenergic receptor sequence, and by triple-alanine substitutions in the hDAT C terminus, we examined the importance of epitopes preceding the LKV motif. Substitution of RHW(615-617) with alanines caused retention of the transporter in the ER despite preserved ability of this mutant to bind PICK1. We propose dual roles of the hDAT C terminus: a role independent of PDZ interactions for ER export and surface targeting, and a not fully clarified role involving PDZ interactions with proteins such as PICK1.
Development of PCR primers specific for the amplification and direct sequencing of gyrB genes from microbacteria, order Actinomycetales.

PubMed

Richert, Kathrin; Brambilla, Evelyne; Stackebrandt, Erko

2005-01-01

PCR primer sets were developed for the specific amplification and sequence analyses encoding the gyrase subunit B (gyrB) of members of the family Microbacteriaceae, class Actinobacteria. The family contains species highly related by 16S rRNA gene sequence analyses. In order to test if the gene sequence analysis of gyrB is appropriate to discriminate between closely related species, we evaluate the 16S rRNA gene phylogeny of its members. As the published universal primer set for gyrB failed to amplify the responding gene of the majority of the 80 type strains of the family, three new primer sets were identified that generated fragments with a composite sequence length of about 900 nt. However, the amplification of all three fragments was successful only in 25% of the 80 type strains. In this study, the substitution frequencies in genes encoding gyrase and 16S rDNA were compared for 10 strains of nine genera. The frequency of gyrB nucleotide substitution is significantly higher than that of the 16S rDNA, and no linear correlation exists between the similarities of both molecules among members of the Microbacteriaceae. The phylogenetic analyses using the gyrB sequences provide higher resolution than using 16S rDNA sequences and seem able to discriminate between closely related species.
Three closely related herpesviruses are associated with fibropapillomatosis in marine turtles

USGS Publications Warehouse

Quackenbush, S.L.; Work, Thierry M.; Balazs, George H.; Casey, Rufina N.; Rovnak, J.; Chaves, A.; duToit, L.; Baines, J.D.; Parrish, C.R.; Bowser, Paul R.; Casey, James W.

1998-01-01

Green turtle fibropapillomatosis is a neoplastic disease of increasingly significant threat to the survivability of this species. Degenerate PCR primers that target highly conserved regions of genes encoding herpesvirus DNA polymerases were used to amplify a DNA sequence from fibropapillomas and fibromas from Hawaiian and Florida green turtles. All of the tumors tested (n= 23) were found to harbor viral DNA, whereas no viral DNA was detected in skin biopsies from tumor-negative turtles. The tissue distribution of the green turtle herpesvirus appears to be generally limited to tumors where viral DNA was found to accumulate at approximately two to five copies per cell and is occasionally detected, only by PCR, in some tissues normally associated with tumor development. In addition, herpesviral DNA was detected in fibropapillomas from two loggerhead and four olive ridley turtles. Nucleotide sequencing of a 483-bp fragment of the turtle herpesvirus DNA polymerase gene determined that the Florida green turtle and loggerhead turtle sequences are identical and differ from the Hawaiian green turtle sequence by five nucleotide changes, which results in two amino acid substitutions. The olive ridley sequence differs from the Florida and Hawaiian green turtle sequences by 15 and 16 nucleotide changes, respectively, resulting in four amino acid substitutions, three of which are unique to the olive ridley sequence. Our data suggest that these closely related turtle herpesviruses are intimately involved in the genesis of fibropapillomatosis.
Detection and characterization of hepatitis A virus circulating in Egypt.

PubMed

Hamza, Hazem; Abd-Elshafy, Dina Nadeem; Fayed, Sayed A; Bahgat, Mahmoud Mohamed; El-Esnawy, Nagwa Abass; Abdel-Mobdy, Emam

2017-07-01

Hepatitis A virus (HAV) still poses a considerable problem worldwide. In the current study, hepatitis A virus was recovered from wastewater samples collected from three wastewater treatment plants over one year. Using RT-PCR, HAV was detected in 43 out of 68 samples (63.2%) representing both inlet and outlet. Eleven positive samples were subjected to sequencing targeting the VP1-2A junction region. Phylogenetic analysis revealed that all samples belonged to subgenotype IB with few substitutions at the amino acid level. The complete sequence of one isolate (HAV/Egy/BI-11/2015) showed that the similarity at the amino acid level was not reflected at the nucleotide level. However, the deduced amino acid sequence derived from the complete nucleotide sequence showed distinct substitutions in the 2B, 2C, and 3A regions. Recombination analysis revealed a recombination event between X75215 (subgenotype IA) and AF268396 (subgenotype IB) involving a portion of the 2B nonstructural protein coding region (nucleotides 3757-3868) assuming the herein characterized sequence an actual recombinant. Despite the role of recombination in picornaviruses evolution, its involvement in HAV evolution has rarely been reported, and this may be due to the limited available complete HAV sequences. To our knowledge, this represents the first characterized complete sequence of an Egyptian isolate and the described recombination event provides an important update on the circulating HAV strains in Egypt.
Complete genome analysis of dengue virus type 3 isolated from the 2013 dengue outbreak in Yunnan, China.

PubMed

Wang, Xiaodan; Ma, Dehong; Huang, Xinwei; Li, Lihua; Li, Duo; Zhao, Yujiao; Qiu, Lijuan; Pan, Yue; Chen, Junying; Xi, Juemin; Shan, Xiyun; Sun, Qiangming

2017-06-15

In the past few decades, dengue has spread rapidly and is an emerging disease in China. An unexpected dengue outbreak occurred in Xishuangbanna, Yunnan, China, resulting in 1331 patients in 2013. In order to obtain the complete genome information and perform mutation and evolutionary analysis of causative agent related to this largest outbreak of dengue fever. The viruses were isolated by cell culture and evaluated by genome sequence analysis. Phylogenetic trees were then constructed by Neighbor-Joining methods (MEGA6.0), followed by analysis of nucleotide mutation and amino acid substitution. The analysis of the diversity of secondary structure for E and NS1 protein were also performed. Then selection pressures acting on the coding sequences were estimated by PAML software. The complete genome sequences of two isolated strains (YNSW1, YNSW2) were 10,710 and 10,702 nucleotides in length, respectively. Phylogenetic analysis revealed both strain were classified as genotype II of DENV-3. The results indicated that both isolated strains of Xishuangbanna in 2013 and Laos 2013 stains (KF816161.1, KF816158.1, LC147061.1, LC147059.1, KF816162.1) were most similar to Bangladesh (AY496873.2) in 2002. After comparing with the DENV-3SS (H87) 62 amino acid substitutions were identified in translated regions, and 38 amino acid substitutions were identified in translated regions compared with DENV-3 genotype II stains Bangladesh (AY496873.2). 27(YNSW1) or 28(YNSW2) single nucleotide changes were observed in structural protein sequences with 7(YNSW1) or 8(YNSW2) non-synonymous mutations compared with AY496873.2. Of them, 4 non-synonymous mutations were identified in E protein sequences with (2 in the β-sheet, 2 in the coil). Meanwhile, 117(YNSW1) or 115 (YNSW2) single nucleotide changes were observed in non-structural protein sequences with 31(YNSW1) or 30 (YNSW2) non-synonymous mutations. Particularly, 14 single nucleotide changes were observed in NS1 sequences with 4/14 non-synonymous substitutions (4 in the coil). Selection pressure analysis revealed no positive selection in the amino acid sites of the genes encoding for structural and non-structural proteins. This study may help understand the intrinsic geographical relatedness of dengue virus 3 and contributes further to research on their infectivity, pathogenicity and vaccine development. Copyright © 2017 Elsevier B.V. All rights reserved.
Evidence for the role of basic amino acids in the coat protein arm region of Cucumber necrosis virus in particle assembly and selective encapsidation of viral RNA.

PubMed

Alam, Syed Benazir; Reade, Ron; Theilmann, Jane; Rochon, D'Ann

2017-12-01

Cucumber necrosis virus (CNV) is a T = 3 icosahedral virus with a (+)ssRNA genome. The N-terminal CNV coat protein arm contains a conserved, highly basic sequence ("KGRKPR"), which we postulate is involved in RNA encapsidation during virion assembly. Seven mutants were constructed by altering the CNV "KGRKPR" sequence; the four basic residues were mutated to alanine individually, in pairs, or in total. Virion accumulation and vRNA encapsidation were significantly reduced in mutants containing two or four substitutions and virion morphology was also affected, where both T = 1 and intermediate-sized particles were produced. Mutants with two or four substitutions encapsidated significantly greater levels of truncated RNA than that of WT, suggesting that basic residues in the "KGRKPR" sequence are important for encapsidation of full-length CNV RNA. Interestingly, "KGRKPR" mutants also encapsidated relatively higher levels of host RNA, suggesting that the "KGRKPR" sequence also contributes to selective encapsidation of CNV RNA. Crown Copyright © 2017. Published by Elsevier Inc. All rights reserved.
Systematic Error in Seed Plant Phylogenomics

PubMed Central

Zhong, Bojian; Deusch, Oliver; Goremykin, Vadim V.; Penny, David; Biggs, Patrick J.; Atherton, Robin A.; Nikiforova, Svetlana V.; Lockhart, Peter James

2011-01-01

Resolving the closest relatives of Gnetales has been an enigmatic problem in seed plant phylogeny. The problem is known to be difficult because of the extent of divergence between this diverse group of gymnosperms and their closest phylogenetic relatives. Here, we investigate the evolutionary properties of conifer chloroplast DNA sequences. To improve taxon sampling of Cupressophyta (non-Pinaceae conifers), we report sequences from three new chloroplast (cp) genomes of Southern Hemisphere conifers. We have applied a site pattern sorting criterion to study compositional heterogeneity, heterotachy, and the fit of conifer chloroplast genome sequences to a general time reversible + G substitution model. We show that non-time reversible properties of aligned sequence positions in the chloroplast genomes of Gnetales mislead phylogenetic reconstruction of these seed plants. When 2,250 of the most varied sites in our concatenated alignment are excluded, phylogenetic analyses favor a close evolutionary relationship between the Gnetales and Pinaceae—the Gnepine hypothesis. Our analytical protocol provides a useful approach for evaluating the robustness of phylogenomic inferences. Our findings highlight the importance of goodness of fit between substitution model and data for understanding seed plant phylogeny. PMID:22016337
Thermodynamic prediction of protein neutrality.

PubMed

Bloom, Jesse D; Silberg, Jonathan J; Wilke, Claus O; Drummond, D Allan; Adami, Christoph; Arnold, Frances H

2005-01-18

We present a simple theory that uses thermodynamic parameters to predict the probability that a protein retains the wild-type structure after one or more random amino acid substitutions. Our theory predicts that for large numbers of substitutions the probability that a protein retains its structure will decline exponentially with the number of substitutions, with the severity of this decline determined by properties of the structure. Our theory also predicts that a protein can gain extra robustness to the first few substitutions by increasing its thermodynamic stability. We validate our theory with simulations on lattice protein models and by showing that it quantitatively predicts previously published experimental measurements on subtilisin and our own measurements on variants of TEM1 beta-lactamase. Our work unifies observations about the clustering of functional proteins in sequence space, and provides a basis for interpreting the response of proteins to substitutions in protein engineering applications.

Thermodynamic prediction of protein neutrality

PubMed Central

Bloom, Jesse D.; Silberg, Jonathan J.; Wilke, Claus O.; Drummond, D. Allan; Adami, Christoph; Arnold, Frances H.

2005-01-01

We present a simple theory that uses thermodynamic parameters to predict the probability that a protein retains the wild-type structure after one or more random amino acid substitutions. Our theory predicts that for large numbers of substitutions the probability that a protein retains its structure will decline exponentially with the number of substitutions, with the severity of this decline determined by properties of the structure. Our theory also predicts that a protein can gain extra robustness to the first few substitutions by increasing its thermodynamic stability. We validate our theory with simulations on lattice protein models and by showing that it quantitatively predicts previously published experimental measurements on subtilisin and our own measurements on variants of TEM1 β-lactamase. Our work unifies observations about the clustering of functional proteins in sequence space, and provides a basis for interpreting the response of proteins to substitutions in protein engineering applications. PMID:15644440
The Impact of Environmental and Endogenous Damage on Somatic Mutation Load in Human Skin Fibroblasts

PubMed Central

Saini, Natalie; Chan, Kin; Grimm, Sara A.; Dai, Shuangshuang; Fargo, David C.; Kaufmann, William K.; Taylor, Jack A.; Lee, Eunjung; Cortes-Ciriano, Isidro; Park, Peter J.; Schurman, Shepherd H.; Malc, Ewa P.; Mieczkowski, Piotr A.

2016-01-01

Accumulation of somatic changes, due to environmental and endogenous lesions, in the human genome is associated with aging and cancer. Understanding the impacts of these processes on mutagenesis is fundamental to understanding the etiology, and improving the prognosis and prevention of cancers and other genetic diseases. Previous methods relying on either the generation of induced pluripotent stem cells, or sequencing of single-cell genomes were inherently error-prone and did not allow independent validation of the mutations. In the current study we eliminated these potential sources of error by high coverage genome sequencing of single-cell derived clonal fibroblast lineages, obtained after minimal propagation in culture, prepared from skin biopsies of two healthy adult humans. We report here accurate measurement of genome-wide magnitude and spectra of mutations accrued in skin fibroblasts of healthy adult humans. We found that every cell contains at least one chromosomal rearrangement and 600–13,000 base substitutions. The spectra and correlation of base substitutions with epigenomic features resemble many cancers. Moreover, because biopsies were taken from body parts differing by sun exposure, we can delineate the precise contributions of environmental and endogenous factors to the accrual of genetic changes within the same individual. We show here that UV-induced and endogenous DNA damage can have a comparable impact on the somatic mutation loads in skin fibroblasts. Trial Registration ClinicalTrials.gov NCT01087307 PMID:27788131
A combined mechanistic and computational study of the gold(I)-catalyzed formation of substituted indenes.

PubMed

Nun, Pierrick; Gaillard, Sylvain; Poater, Albert; Cavallo, Luigi; Nolan, Steven P

2011-01-07

Substituted indenes can be prepared after a sequence [1,3] O-acyl shift-hydroarylation-[1,3] O-acyl shift. Each step is catalyzed by a cationic NHC-Gold(I) species generated in situ after reaction between [(IPr)AuOH] and HBF(4)·OEt(2). This interesting silver-free way is fully supported by a computational study justifying the formation of each intermediate.
Ethynyl and substituted ethynyl-terminated polysulfones

NASA Technical Reports Server (NTRS)

Hergenrother, P. M. (Inventor)

1984-01-01

Ethynyl and substituted ethynyl-terminated polysulfones and a process for preparing the same are disclosed. These polysulfones are thermally cured to induce cross-linking and chain extension, producing a polymer system with improved solvent resistance and use temperature. Also disclosed are substituted 4-ethynylbenzoyl chlorides as precursors to the substituted ethynyl-terminated polysulfones and a process for preparing the same.
Molecular evolution of the leptin exon 3 in some species of the family Canidae.

PubMed

Chmurzynska, Agata; Zajac, Magdalena; Switonski, Marek

2003-01-01

The structure of the leptin gene seems to be well conserved. The polymorphism of this gene in four species belonging to the Canidae family (the dog (Canis familiaris)--16 different breeds, the Chinese racoon dog (Nyctereutes procyonoides procyonoides), the red fox (Vulpes vulpes) and the arctic fox (Alopex lagopus)) were studied with the use of single strand conformation polymorphism (SSCP), restriction fragment length polymorphism (RFLP) and DNA sequencing techniques. For exon 2, all species presented the same SSCP pattern, while in exon 3 some differences were found. DNA sequencing of exon 3 revealed the presence of six nucleotide substitutions, differentiating the studied species. Three of them cause amino acid substitutions as well. For all dog breeds studied, SSCP patterns were identical.
The topography of mutational processes in breast cancer genomes.

PubMed

Morganella, Sandro; Alexandrov, Ludmil B; Glodzik, Dominik; Zou, Xueqing; Davies, Helen; Staaf, Johan; Sieuwerts, Anieta M; Brinkman, Arie B; Martin, Sancha; Ramakrishna, Manasa; Butler, Adam; Kim, Hyung-Yong; Borg, Åke; Sotiriou, Christos; Futreal, P Andrew; Campbell, Peter J; Span, Paul N; Van Laere, Steven; Lakhani, Sunil R; Eyfjord, Jorunn E; Thompson, Alastair M; Stunnenberg, Hendrik G; van de Vijver, Marc J; Martens, John W M; Børresen-Dale, Anne-Lise; Richardson, Andrea L; Kong, Gu; Thomas, Gilles; Sale, Julian; Rada, Cristina; Stratton, Michael R; Birney, Ewan; Nik-Zainal, Serena

2016-05-02

Somatic mutations in human cancers show unevenness in genomic distribution that correlate with aspects of genome structure and function. These mutations are, however, generated by multiple mutational processes operating through the cellular lineage between the fertilized egg and the cancer cell, each composed of specific DNA damage and repair components and leaving its own characteristic mutational signature on the genome. Using somatic mutation catalogues from 560 breast cancer whole-genome sequences, here we show that each of 12 base substitution, 2 insertion/deletion (indel) and 6 rearrangement mutational signatures present in breast tissue, exhibit distinct relationships with genomic features relating to transcription, DNA replication and chromatin organization. This signature-based approach permits visualization of the genomic distribution of mutational processes associated with APOBEC enzymes, mismatch repair deficiency and homologous recombinational repair deficiency, as well as mutational processes of unknown aetiology. Furthermore, it highlights mechanistic insights including a putative replication-dependent mechanism of APOBEC-related mutagenesis.
SUBSTITUTION OF CADMIUM CYANIDE ELECTROPLATING WITH ZINC CHLORIDE ELECTROPLATING

EPA Science Inventory

The study evaluated the zinc chloride electroplating process as a substitute for cadmium cyanide electroplating in the manufacture of industrial connectors and fittings at Aeroquip Corporation. The process substitution eliminates certain wastes, specifically cadmium and cyanide, ...
Maximum parsimony, substitution model, and probability phylogenetic trees.

PubMed

Weng, J F; Thomas, D A; Mareels, I

2011-01-01

The problem of inferring phylogenies (phylogenetic trees) is one of the main problems in computational biology. There are three main methods for inferring phylogenies-Maximum Parsimony (MP), Distance Matrix (DM) and Maximum Likelihood (ML), of which the MP method is the most well-studied and popular method. In the MP method the optimization criterion is the number of substitutions of the nucleotides computed by the differences in the investigated nucleotide sequences. However, the MP method is often criticized as it only counts the substitutions observable at the current time and all the unobservable substitutions that really occur in the evolutionary history are omitted. In order to take into account the unobservable substitutions, some substitution models have been established and they are now widely used in the DM and ML methods but these substitution models cannot be used within the classical MP method. Recently the authors proposed a probability representation model for phylogenetic trees and the reconstructed trees in this model are called probability phylogenetic trees. One of the advantages of the probability representation model is that it can include a substitution model to infer phylogenetic trees based on the MP principle. In this paper we explain how to use a substitution model in the reconstruction of probability phylogenetic trees and show the advantage of this approach with examples.
Integrating mRNA and Protein Sequencing Enables the Detection and Quantitative Profiling of Natural Protein Sequence Variants of Populus trichocarpa.

PubMed

Abraham, Paul E; Wang, Xiaojing; Ranjan, Priya; Nookaew, Intawat; Zhang, Bing; Tuskan, Gerald A; Hettich, Robert L

2015-12-04

Next-generation sequencing has transformed the ability to link genotypes to phenotypes and facilitates the dissection of genetic contribution to complex traits. However, it is challenging to link genetic variants with the perturbed functional effects on proteins encoded by such genes. Here we show how RNA sequencing can be exploited to construct genotype-specific protein sequence databases to assess natural variation in proteins, providing information about the molecular toolbox driving cellular processes. For this study, we used two natural genotypes selected from a recent genome-wide association study of Populus trichocarpa, an obligate outcrosser with tremendous phenotypic variation across the natural population. This strategy allowed us to comprehensively catalogue proteins containing single amino acid polymorphisms (SAAPs), as well as insertions and deletions. We profiled the frequency of 128 types of naturally occurring amino acid substitutions, including both expected (neutral) and unexpected (non-neutral) SAAPs, with a subset occurring in regions of the genome having strong polymorphism patterns consistent with recent positive and/or divergent selection. By zeroing in on the molecular signatures of these important regions that might have previously been uncharacterized, we now provide a high-resolution molecular inventory that should improve accessibility and subsequent identification of natural protein variants in future genotype-to-phenotype studies.
Using Biomimetic Polymers in Place of Noncollagenous Proteins to Achieve Functional Remineralization of Dentin Tissues

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chien, Yung-Ching; Tao, Jinhui; Saeki, Kuniko

In calcified tissues such as bones and teeth, mineralization is regulated by an extracellular matrix, which includes non-collagenous proteins (NCP). This natural process has been adapted or mimicked to restore tissues following physical damage or demineralization by using polyanionic acids in place of NCPs, but the remineralized tissues fail to fully recover their mechanical properties. Here we show that pre-treatment with certain amphiphilic peptoids, a class of peptide-like polymers consisting of N-substituted glycines that have defined monomer sequences, enhances ordering and mineralization of collagen and induces functional remineralization of dentin lesions in vitro. In the vicinity of dentin tubules, themore » newly formed apatite nano-crystals are co-aligned with the c-axis parallel to the tubular periphery and recovery of tissue ultrastructure is accompanied by development of high mechanical strength. The observed effects are highly sequence-dependent with alternating polar and non-polar groups leading to positive outcomes while diblock sequences have no effect. The observations suggest aromatic groups interact with the collagen while the hydrophilic side chains bind the mineralizing constituents and highlight the potential of synthetic sequence-defined biomimetic polymers to serve as NCP mimics in tissue remineralization.« less
Differentiated evolutionary conservatism and lack of polymorphism of crucial sex determination genes (SRY and SOX9) in four species of the family Canidae.

PubMed

Nowacka-Woszuk, Joanna; Switonski, Marek

2009-01-01

The sex determination process is under the control of several genes of which two (SRY and SOX9), encoding transcription factors, play a crucial role. It is well-known that mutations at these genes may cause the development of an intersexual phenotype. The aim of this study was to conduct a comparative analysis of the coding sequence and 5'-flanking regions of both genes in four species of the family Canidae (the dog, red fox, arctic fox and Chinese raccoon dog). Similarity of the coding sequence of the SOX9 gene among the studied species was higher (99.7-99.9%) than in the case of the SRY gene (96.7-97.3%). Only single nucleotide changes were found in the compared coding sequences, whereas in the 5'-flanking region of both genes nucleotide substitutions, as well as insertions and deletions were observed. None of the changes detected in the 5'-flanking region occurred within the potential consensus sequences for transcription factors. No polymorphism was found for either of these genes in any of the analyzed species.
Chaotic Image Encryption Algorithm Based on Bit Permutation and Dynamic DNA Encoding.

PubMed

Zhang, Xuncai; Han, Feng; Niu, Ying

2017-01-01

With the help of the fact that chaos is sensitive to initial conditions and pseudorandomness, combined with the spatial configurations in the DNA molecule's inherent and unique information processing ability, a novel image encryption algorithm based on bit permutation and dynamic DNA encoding is proposed here. The algorithm first uses Keccak to calculate the hash value for a given DNA sequence as the initial value of a chaotic map; second, it uses a chaotic sequence to scramble the image pixel locations, and the butterfly network is used to implement the bit permutation. Then, the image is coded into a DNA matrix dynamic, and an algebraic operation is performed with the DNA sequence to realize the substitution of the pixels, which further improves the security of the encryption. Finally, the confusion and diffusion properties of the algorithm are further enhanced by the operation of the DNA sequence and the ciphertext feedback. The results of the experiment and security analysis show that the algorithm not only has a large key space and strong sensitivity to the key but can also effectively resist attack operations such as statistical analysis and exhaustive analysis.
Chaotic Image Encryption Algorithm Based on Bit Permutation and Dynamic DNA Encoding

PubMed Central

2017-01-01

With the help of the fact that chaos is sensitive to initial conditions and pseudorandomness, combined with the spatial configurations in the DNA molecule's inherent and unique information processing ability, a novel image encryption algorithm based on bit permutation and dynamic DNA encoding is proposed here. The algorithm first uses Keccak to calculate the hash value for a given DNA sequence as the initial value of a chaotic map; second, it uses a chaotic sequence to scramble the image pixel locations, and the butterfly network is used to implement the bit permutation. Then, the image is coded into a DNA matrix dynamic, and an algebraic operation is performed with the DNA sequence to realize the substitution of the pixels, which further improves the security of the encryption. Finally, the confusion and diffusion properties of the algorithm are further enhanced by the operation of the DNA sequence and the ciphertext feedback. The results of the experiment and security analysis show that the algorithm not only has a large key space and strong sensitivity to the key but can also effectively resist attack operations such as statistical analysis and exhaustive analysis. PMID:28912802
Molecular Characterization and Comparative Sequence Analysis of Defense-Related Gene, Oryza rufipogon Receptor-Like Protein Kinase 1

PubMed Central

Law, Yee-Song; Gudimella, Ranganath; Song, Beng-Kah; Ratnam, Wickneswari; Harikrishna, Jennifer Ann

2012-01-01

Many of the plant leucine rich repeat receptor-like kinases (LRR-RLKs) have been found to regulate signaling during plant defense processes. In this study, we selected and sequenced an LRR-RLK gene, designated as Oryza rufipogon receptor-like protein kinase 1 (OrufRPK1), located within yield QTL yld1.1 from the wild rice Oryza rufipogon (accession IRGC105491). A 2055 bp coding region and two exons were identified. Southern blotting determined OrufRPK1 to be a single copy gene. Sequence comparison with cultivated rice orthologs (OsI219RPK1, OsI9311RPK1 and OsJNipponRPK1, respectively derived from O. sativa ssp. indica cv. MR219, O. sativa ssp. indica cv. 9311 and O. sativa ssp. japonica cv. Nipponbare) revealed the presence of 12 single nucleotide polymorphisms (SNPs) with five non-synonymous substitutions, and 23 insertion/deletion sites. The biological role of the OrufRPK1 as a defense related LRR-RLK is proposed on the basis of cDNA sequence characterization, domain subfamily classification, structural prediction of extra cellular domains, cluster analysis and comparative gene expression. PMID:22942769
Sequence Characterization of the MC1R Gene in Yak (Poephagus grunniens) Breeds with Different Coat Colors

PubMed Central

Chen, Shi-Yi; Huang, Yi; Zhu, Qing; Fontanesi, Luca; Yao, Yong-Gang; Liu, Yi-Ping

2009-01-01

Melanocortin 1 receptor (MC1R) gene plays a key role in determining coat color in several species, including the cattle. However, up to now there is no report regarding the MC1R gene and the potential association of its mutations with coat colors in yak (Poephagus grunniens). In this study, we sequenced the encoding region of the MC1R gene in three yak breeds with completely white (Tianzhu breed) or black coat color (Jiulong and Maiwa breeds). The predicted coding region of the yak MC1R gene resulted of 954 bp, the same to that of the wild-type cattle sequence, with >99% identity. None of the mutation events reported in cattle was found. Comparing the yak obtained sequences, five nucleotide substitutions were detected, which defined three haplotypes (EY1, EY2, and EY3). Of the five mutations, two, characterizing the EY1 haplotype, were nonsynonymous substitutions (c.340C>A and c.871G>A) causing amino acid changes located in the first extracellular loop (p.Q114K) and in the seventh transmembrane region (p.A291T). In silico prediction might indicate a functional effect of the latter substitution. However, all three haplotypes were present in the three yak breeds with relatively consistent frequency distribution, despite of their distinguished coat colors, which suggested that there was no across-breed association between haplotypes or genotypes and black/white phenotypes, at least in the investigated breeds. Other genes may be involved in affecting coat color in the analyzed yaks. PMID:19584942
Multiple and substitute addictions involving prescription drugs misuse among 12th graders: gateway theory revisited with Market Basket Analysis.

PubMed

Jayawardene, Wasantha Parakrama; YoussefAgha, Ahmed Hassan

2014-01-01

This study aimed to identify the sequential patterns of drug use initiation, which included prescription drugs misuse (PDM), among 12th-grade students in Indiana. The study also tested the suitability of the data mining method Market Basket Analysis (MBA) to detect common drug use initiation sequences in large-scale surveys. Data from 2007 to 2009 Annual Surveys of Alcohol, Tobacco, and Other Drug Use by Indiana Children and Adolescents were used for this study. A close-ended, self-administered questionnaire was used to ask adolescents about the use of 21 substance categories and the age of first use. "Support%" and "confidence%" statistics of Market Basket Analysis detected multiple and substitute addictions, respectively. The lifetime prevalence of using any addictive substance was 73.3%, and it has been decreasing during past few years. Although the lifetime prevalence of PDM was 19.2%, it has been increasing. Males and whites were more likely to use drugs and engage in multiple addictions. Market Basket Analysis identified common drug use initiation sequences that involved 11 drugs. High levels of support existed for associations among alcohol, cigarettes, and marijuana, whereas associations that included prescription drugs had medium levels of support. Market Basket Analysis is useful for the detection of common substance use initiation sequences in large-scale surveys. Before initiation of prescription drugs, physicians should consider the adolescents' risk of addiction. Prevention programs should address multiple addictions, substitute addictions, common sequences in drug use initiation, sex and racial differences in PDM, and normative beliefs of parents and adolescents in relation to PDM.
THE SMALL ACID SOLUBLE PROTEINS (SASP α and SASP β) OF BACILLUS WEIHENSTEPHANENSIS AND B. MYCOIDES GROUP 2 ARE THE MOST DISTINCT AMONG THE B. CEREUS GROUP

PubMed Central

Callahan, Courtney; Fox, Karen; Fox, Alvin

2009-01-01

The Bacillus cereus group includes Bacillus anthracis, Bacillus cereus, Bacillus thuringiensis, Bacillus mycoides and Bacillus weihenstephanensis. The small acid-soluble spore protein (SASP) β has been previously demonstrated to be among the biomarkers differentiating B. anthracis and B. cereus; SASP β of B. cereus most commonly exhibits one or two amino acid substitutions when compared to B. anthracis. SASP α is conserved in sequence among these two species. Neither SASP α nor β for B. thuringiensis, B. mycoides and B. weihenstephanensis have been previously characterized as taxonomic discriminators. In the current work molecular weight (MW) variation of these SASPs were determined by matrix assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI TOF MS) for representative strains of the 5 species within the B. cereus group. The measured MWs also correlate with calculated MWs of translated amino acid sequences generated from whole genome sequencing projects. SASP α and β demonstrated consistent MW among B. cereus, B. thuringiensis, and B. mycoides strains (group 1). However B. mycoides (group 2) and B. weihenstephanensis SASP α and β were quite distinct making them unique among the B. cereus group. Limited sequence changes were observed in SASP α (at most 3 substitutions and 2 deletions) indicating it is a more conserved protein than SASP β (up to 6 substitutions and a deletion). Another even more conserved SASP, SASP α-β type, was described here for the first time. PMID:19616612
Substitute CT generation from a single ultra short time echo MRI sequence: preliminary study

NASA Astrophysics Data System (ADS)

Ghose, Soumya; Dowling, Jason A.; Rai, Robba; Liney, Gary P.

2017-04-01

In MR guided radiation therapy planning both MR and CT images for a patient are acquired and co-registered to obtain a tissue specific HU map. Generation of the HU map directly from the MRI would eliminate the CT acquisition and may improve radiation therapy planning. In this preliminary study of substitute CT (sCT) generation, two porcine leg phantoms were scanned using a 3D ultrashort echo time (PETRA) sequence and co-registered to corresponding CT images to build tissue specific regression models. The model was created from one co-registered CT-PETRA pair to generate the sCT for the other PETRA image. An expectation maximization based clustering was performed on the co-registered PETRA image to identify the soft tissues, dense bone and air class membership probabilities. A tissue specific non linear regression model was built from one registered CT-PETRA pair dataset to predict the sCT of the second PETRA image in a two-fold cross validation schema. A complete substitute CT is generated in 3 min. The mean absolute HU error for air was 0.3 HU, bone was 95 HU, fat was 30 HU and for muscle it was 10 HU. The mean surface reconstruction error for the bone was 1.3 mm. The PETRA sequence enabled a low mean absolute surface distance for the bone and a low HU error for other classes. The sCT generated from a single PETRA sequence shows promise for the generation of fast sCT for MRI based radiation therapy planning.
Fine-scale genetic mapping of a hybrid sterility factor between Drosophila simulans and D. mauritiana: the varied and elusive functions of "speciation genes".

PubMed

Araripe, Luciana O; Montenegro, Horácio; Lemos, Bernardo; Hartl, Daniel L

2010-12-14

Hybrid male sterility (HMS) is a usual outcome of hybridization between closely related animal species. It arises because interactions between alleles that are functional within one species may be disrupted in hybrids. The identification of genes leading to hybrid sterility is of great interest for understanding the evolutionary process of speciation. In the current work we used marked P-element insertions as dominant markers to efficiently locate one genetic factor causing a severe reduction in fertility in hybrid males of Drosophila simulans and D. mauritiana. Our mapping effort identified a region of 9 kb on chromosome 3, containing three complete and one partial coding sequences. Within this region, two annotated genes are suggested as candidates for the HMS factor, based on the comparative molecular characterization and public-source information. Gene Taf1 is partially contained in the region, but yet shows high polymorphism with four fixed non-synonymous substitutions between the two species. Its molecular functions involve sequence-specific DNA binding and transcription factor activity. Gene agt is a small, intronless gene, whose molecular function is annotated as methylated-DNA-protein-cysteine S-methyltransferase activity. High polymorphism and one fixed non-synonymous substitution suggest this is a fast evolving gene. The gene trees of both genes perfectly separate D. simulans and D. mauritiana into monophyletic groups. Analysis of gene expression using microarray revealed trends that were similar to those previously found in comparisons between whole-genome hybrids and parental species. The identification following confirmation of the HMS candidate gene will add another case study leading to understanding the evolutionary process of hybrid incompatibility.
Ampicillin-Resistant Non-β-Lactamase-Producing Haemophilus influenzae in Spain: Recent Emergence of Clonal Isolates with Increased Resistance to Cefotaxime and Cefixime▿

PubMed Central

García-Cobos, Silvia; Campos, José; Lázaro, Edurne; Román, Federico; Cercenado, Emilia; García-Rey, César; Pérez-Vázquez, María; Oteo, Jesús; de Abajo, Francisco

2007-01-01

The sequence of the ftsI gene encoding the transpeptidase domain of penicillin-binding protein 3 (PBP 3) was determined for 354 nonconsecutive Haemophilus influenzae isolates from Spain; 17.8% of them were ampicillin susceptible, 56% were β-lactamase nonproducing ampicillin resistant (BLNAR), 15.8% were β-lactamase producers and ampicillin resistant, and 10.4% displayed both resistance mechanisms. The ftsI gene sequences had 28 different mutation patterns and amino acid substitutions at 23 positions. Some 93.2% of the BLNAR strains had amino acid substitutions at the Lys-Thr-Gly (KTG) motif, the two most common being Asn526 to Lys (83.9%) and Arg517 to His (9.3%). Amino acid substitutions at positions 377, 385, and 389, which conferred cefotaxime and cefixime MICs 10 to 60 times higher than those of susceptible strains, were found for the first time in Europe. In 72 isolates for which the repressor acrR gene of the AcrAB efflux pump was sequenced, numerous amino acid substitutions were found. Eight isolates with ampicillin MICs of 0.25 to 2 μg/ml showed changes that predicted the early termination of the acrR reading frame. Pulsed-field gel electrophoresis analysis demonstrated that most BLNAR strains were genetically diverse, although clonal dissemination was detected in a group of isolates presenting with increased resistance to cefotaxime and cefixime. Background antibiotic use at the community level revealed a marked trend toward increased amoxicillin-clavulanic acid consumption. BLNAR H. influenzae strains have arisen by vertical and horizontal spread and have evolved to adapt rapidly to the increased selective pressures posed by the use of oral penicillins and cephalosporins. PMID:17470649

Evolution of flavone synthase I from parsley flavanone 3beta-hydroxylase by site-directed mutagenesis.

PubMed

Gebhardt, Yvonne Helen; Witte, Simone; Steuber, Holger; Matern, Ulrich; Martens, Stefan

2007-07-01

Flavanone 3beta-hydroxylase (FHT) and flavone synthase I (FNS I) are 2-oxoglutarate-dependent dioxygenases with 80% sequence identity, which catalyze distinct reactions in flavonoid biosynthesis. However, FNS I has been reported exclusively from a few Apiaceae species, whereas FHTs are more abundant. Domain-swapping experiments joining the N terminus of parsley (Petroselinum crispum) FHT with the C terminus of parsley FNS I and vice versa revealed that the C-terminal portion is not essential for FNS I activity. Sequence alignments identified 26 amino acid substitutions conserved in FHT versus FNS I genes. Homology modeling, based on the related anthocyanidin synthase structure, assigned seven of these amino acids (FHT/FNS I, M106T, I115T, V116I, I131F, D195E, V200I, L215V, and K216R) to the active site. Accordingly, FHT was modified by site-directed mutagenesis, creating mutants encoding from one to seven substitutions, which were expressed in yeast (Saccharomyces cerevisiae) for FNS I and FHT assays. The exchange I131F in combination with either M106T and D195E or L215V and K216R replacements was sufficient to confer some FNS I side activity. Introduction of all seven FNS I substitutions into the FHT sequence, however, caused a nearly complete change in enzyme activity from FHT to FNS I. Both FHT and FNS I were proposed to initially withdraw the beta-face-configured hydrogen from carbon-3 of the naringenin substrate. Our results suggest that the 7-fold substitution affects the orientation of the substrate in the active-site pocket such that this is followed by syn-elimination of hydrogen from carbon-2 (FNS I reaction) rather than the rebound hydroxylation of carbon-3 (FHT reaction).
Neural/Bayes network predictor for inheritable cardiac disease pathogenicity and phenotype.

PubMed

Burghardt, Thomas P; Ajtai, Katalin

2018-04-11

The cardiac muscle sarcomere contains multiple proteins contributing to contraction energy transduction and its regulation during a heartbeat. Inheritable heart disease mutants affect most of them but none more frequently than the ventricular myosin motor and cardiac myosin binding protein c (mybpc3). These co-localizing proteins have mybpc3 playing a regulatory role to the energy transducing motor. Residue substitution and functional domain assignment of each mutation in the protein sequence decides, under the direction of a sensible disease model, phenotype and pathogenicity. The unknown model mechanism is decided here using a method combing neural and Bayes networks. Missense single nucleotide polymorphisms (SNPs) are clues for the disease mechanism summarized in an extensive database collecting mutant sequence location and residue substitution as independent variables that imply the dependent disease phenotype and pathogenicity characteristics in 4 dimensional data points (4ddps). The SNP database contains entries with the majority having one or both dependent data entries unfulfilled. A neural network relating causes (mutant residue location and substitution) and effects (phenotype and pathogenicity) is trained, validated, and optimized using fulfilled 4ddps. It then predicts unfulfilled 4ddps providing the implicit disease model. A discrete Bayes network interprets fulfilled and predicted 4ddps with conditional probabilities for phenotype and pathogenicity given mutation location and residue substitution thus relating the neural network implicit model to explicit features of the motor and mybpc3 sequence and structural domains. Neural/Bayes network forecasting automates disease mechanism modeling by leveraging the world wide human missense SNP database that is in place and expanding. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Evolution of major histocompatibility complex class I and class II genes in the brown bear

PubMed Central

2012-01-01

Background Major histocompatibility complex (MHC) proteins constitute an essential component of the vertebrate immune response, and are coded by the most polymorphic of the vertebrate genes. Here, we investigated sequence variation and evolution of MHC class I and class II DRB, DQA and DQB genes in the brown bear Ursus arctos to characterise the level of polymorphism, estimate the strength of positive selection acting on them, and assess the extent of gene orthology and trans-species polymorphism in Ursidae. Results We found 37 MHC class I, 16 MHC class II DRB, four DQB and two DQA alleles. We confirmed the expression of several loci: three MHC class I, two DRB, two DQB and one DQA. MHC class I also contained two clusters of non-expressed sequences. MHC class I and DRB allele frequencies differed between northern and southern populations of the Scandinavian brown bear. The rate of nonsynonymous substitutions (dN) exceeded the rate of synonymous substitutions (dS) at putative antigen binding sites of DRB and DQB loci and, marginally significantly, at MHC class I loci. Models of codon evolution supported positive selection at DRB and MHC class I loci. Both MHC class I and MHC class II sequences showed orthology to gene clusters found in the giant panda Ailuropoda melanoleuca. Conclusions Historical positive selection has acted on MHC class I, class II DRB and DQB, but not on the DQA locus. The signal of historical positive selection on the DRB locus was particularly strong, which may be a general feature of caniforms. The presence of MHC class I pseudogenes may indicate faster gene turnover in this class through the birth-and-death process. South–north population structure at MHC loci probably reflects origin of the populations from separate glacial refugia. PMID:23031405
Evolution of major histocompatibility complex class I and class II genes in the brown bear.

PubMed

Kuduk, Katarzyna; Babik, Wiesław; Bojarska, Katarzyna; Sliwińska, Ewa B; Kindberg, Jonas; Taberlet, Pierre; Swenson, Jon E; Radwan, Jacek

2012-10-02

Major histocompatibility complex (MHC) proteins constitute an essential component of the vertebrate immune response, and are coded by the most polymorphic of the vertebrate genes. Here, we investigated sequence variation and evolution of MHC class I and class II DRB, DQA and DQB genes in the brown bear Ursus arctos to characterise the level of polymorphism, estimate the strength of positive selection acting on them, and assess the extent of gene orthology and trans-species polymorphism in Ursidae. We found 37 MHC class I, 16 MHC class II DRB, four DQB and two DQA alleles. We confirmed the expression of several loci: three MHC class I, two DRB, two DQB and one DQA. MHC class I also contained two clusters of non-expressed sequences. MHC class I and DRB allele frequencies differed between northern and southern populations of the Scandinavian brown bear. The rate of nonsynonymous substitutions (dN) exceeded the rate of synonymous substitutions (dS) at putative antigen binding sites of DRB and DQB loci and, marginally significantly, at MHC class I loci. Models of codon evolution supported positive selection at DRB and MHC class I loci. Both MHC class I and MHC class II sequences showed orthology to gene clusters found in the giant panda Ailuropoda melanoleuca. Historical positive selection has acted on MHC class I, class II DRB and DQB, but not on the DQA locus. The signal of historical positive selection on the DRB locus was particularly strong, which may be a general feature of caniforms. The presence of MHC class I pseudogenes may indicate faster gene turnover in this class through the birth-and-death process. South-north population structure at MHC loci probably reflects origin of the populations from separate glacial refugia.
Genetic differentiation between fake abalone and genuine Haliotis species using the forensically informative nucleotide sequencing (FINS) method.

PubMed

Ha, Wai Y; Reid, David G; Kam, Wan L; Lau, Yuk Y; Sham, Wing C; Tam, Silvia Y K; Sin, Della W M; Mok, Chuen S

2011-05-25

Abalones ( Haliotis species) are a popular delicacy and commonly preserved in dried form either whole or in slices or small pieces for consumption in Asian countries. Driven by the huge profit from trading abalones, dishonest traders may substitute other molluscan species for processed abalone, of which the morphological characteristics are frequently lost in the processed form. For protection of consumer rights and law enforcement against fraud, there is a need for an effective methodology to differentiate between fake and genuine abalone. This paper describes a method (validated according to the international forensic guidelines provided by SWGDAM) for the identification of fake abalone species using forensically informative nucleotide sequence (FINS) analysis. A study of the local market revealed that many claimed "abalone slice" samples on sale are not genuine. The fake abalone samples were found to be either volutids of the genus Cymbium (93%) or the muricid Concholepas concholepas (7%). This is the first report of Cymbium species being used for the preparation and sale as "abalone" in dried sliced form in Hong Kong.
Thermal and acid tolerant beta xylosidases, arabinofuranosidases, genes encoding, related organisms, and methods

DOEpatents

Thompson, David N; Thompson, Vicki S; Schaller, Kastli D; Apel, William A; Reed, David W; Lacey, Jeffrey A

2013-04-30

Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius and variations thereof are provided. Further provided are methods of at least partially degrading xylotriose, xylobiose, and/or arabinofuranose-substituted xylan using isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius and variations thereof.
Unlocking hidden genomic sequence

PubMed Central

Keith, Jonathan M.; Cochran, Duncan A. E.; Lala, Gita H.; Adams, Peter; Bryant, Darryn; Mitchelson, Keith R.

2004-01-01

Despite the success of conventional Sanger sequencing, significant regions of many genomes still present major obstacles to sequencing. Here we propose a novel approach with the potential to alleviate a wide range of sequencing difficulties. The technique involves extracting target DNA sequence from variants generated by introduction of random mutations. The introduction of mutations does not destroy original sequence information, but distributes it amongst multiple variants. Some of these variants lack problematic features of the target and are more amenable to conventional sequencing. The technique has been successfully demonstrated with mutation levels up to an average 18% base substitution and has been used to read previously intractable poly(A), AT-rich and GC-rich motifs. PMID:14973330
New small-molecule inhibitor class targeting human immunodeficiency virus type 1 virion maturation.

PubMed

Blair, Wade S; Cao, Joan; Fok-Seang, Juin; Griffin, Paul; Isaacson, Jason; Jackson, R Lynn; Murray, Edward; Patick, Amy K; Peng, Qinghai; Perros, Manos; Pickford, Chris; Wu, Hua; Butler, Scott L

2009-12-01

A new small-molecule inhibitor class that targets virion maturation was identified from a human immunodeficiency virus type 1 (HIV-1) antiviral screen. PF-46396, a representative molecule, exhibits antiviral activity against HIV-1 laboratory strains and clinical isolates in T-cell lines and peripheral blood mononuclear cells (PBMCs). PF-46396 specifically inhibits the processing of capsid (CA)/spacer peptide 1 (SP1) (p25), resulting in the accumulation of CA/SP1 (p25) precursor proteins and blocked maturation of the viral core particle. Viral variants resistant to PF-46396 contain a single amino acid substitution in HIV-1 CA sequences (CAI201V), distal to the CA/SP1 cleavage site in the primary structure, which we demonstrate is sufficient to confer significant resistance to PF-46396 and 3-O-(3',3'-dimethylsuccinyl) betulinic acid (DSB), a previously described maturation inhibitor. Conversely, a single amino substitution in SP1 (SP1A1V), which was previously associated with DSB in vitro resistance, was sufficient to confer resistance to DSB and PF-46396. Further, the CAI201V substitution restored CA/SP1 processing in HIV-1-infected cells treated with PF-46396 or DSB. Our results demonstrate that PF-46396 acts through a mechanism that is similar to DSB to inhibit the maturation of HIV-1 virions. To our knowledge, PF-46396 represents the first small-molecule HIV-1 maturation inhibitor that is distinct in chemical class from betulinic acid-derived maturation inhibitors (e.g., DSB), demonstrating that molecules of diverse chemical classes can inhibit this mechanism.
An Optimal Seed Based Compression Algorithm for DNA Sequences

PubMed Central

Gopalakrishnan, Gopakumar; Karunakaran, Muralikrishnan

2016-01-01

This paper proposes a seed based lossless compression algorithm to compress a DNA sequence which uses a substitution method that is similar to the LempelZiv compression scheme. The proposed method exploits the repetition structures that are inherent in DNA sequences by creating an offline dictionary which contains all such repeats along with the details of mismatches. By ensuring that only promising mismatches are allowed, the method achieves a compression ratio that is at par or better than the existing lossless DNA sequence compression algorithms. PMID:27555868
Apolipoprotein A-I mutant proteins having cysteine substitutions and polynucleotides encoding same

DOEpatents

Oda, Michael N [Benicia, CA; Forte, Trudy M [Berkeley, CA

2007-05-29

Functional Apolipoprotein A-I mutant proteins, having one or more cysteine substitutions and polynucleotides encoding same, can be used to modulate paraoxonase's arylesterase activity. These ApoA-I mutant proteins can be used as therapeutic agents to combat cardiovascular disease, atherosclerosis, acute phase response and other inflammatory related diseases. The invention also includes modifications and optimizations of the ApoA-I nucleotide sequence for purposes of increasing protein expression and optimization.
Sequential allylic substitution/Pauson-Khand reaction: a strategy to bicyclic fused cyclopentenones from MBH-acetates of acetylenic aldehydes.

PubMed

Raji Reddy, Chada; Kumaraswamy, Paridala; Singarapu, Kiran K

2014-09-05

An efficient approach for the construction of novel bicyclic fused cyclopentenones starting from Morita-Baylis-Hillman (MBH) acetates of acetylenic aldehydes with flexible scaffold diversity has been achieved using a two-step reaction sequence involving allylic substitution and the Pauson-Khand reaction. This strategy provided a facile access to various bicyclic cyclopentenones fused with either a carbocyclic or a heterocyclic ring system in good yield.
Sequence-structural features and evolutionary relationships of family GH57 α-amylases and their putative α-amylase-like homologues.

PubMed

Janeček, Stefan; Blesák, Karol

2011-08-01

The glycoside hydrolase family 57 (GH57) contains α-amylase and a few other amylolytic specificities. It counts ~400 members from Archaea (1/4) and Bacteria (3/4), mostly of extremophilic prokaryotes. Only 17 GH57 enzymes have been biochemically characterized. The main goal of the present bioinformatics study was to analyze sequences having the clear GH57 α-amylase features. Of the 107 GH57 sequences, 59 were evaluated as α-amylases (containing both GH57 catalytic residues), whereas 48 were assigned as GH57 α-amylase-like proteins (having a substitution in one or both catalytic residues). Forty-eight of 59 α-amylases were from Archaea, but 42 of 48 α-amylase-like proteins were of bacterial origin. The catalytic residues were substituted in most cases in Bacteroides and Prevotella by serine (instead of catalytic nucleophile glutamate) and glutamate (instead of proton donor aspartate). The GH57 α-amylase specificity has thus been evolved and kept enzymatically active mainly in Archaea.
Plasmids encoding therapeutic agents

DOEpatents

Keener, William K [Idaho Falls, ID

2007-08-07

Plasmids encoding anti-HIV and anti-anthrax therapeutic agents are disclosed. Plasmid pWKK-500 encodes a fusion protein containing DP178 as a targeting moiety, the ricin A chain, an HIV protease cleavable linker, and a truncated ricin B chain. N-terminal extensions of the fusion protein include the maltose binding protein and a Factor Xa protease site. C-terminal extensions include a hydrophobic linker, an L domain motif peptide, a KDEL ER retention signal, another Factor Xa protease site, an out-of-frame buforin II coding sequence, the lacZ.alpha. peptide, and a polyhistidine tag. More than twenty derivatives of plasmid pWKK-500 are described. Plasmids pWKK-700 and pWKK-800 are similar to pWKK-500 wherein the DP178-encoding sequence is substituted by RANTES- and SDF-1-encoding sequences, respectively. Plasmid pWKK-900 is similar to pWKK-500 wherein the HIV protease cleavable linker is substituted by a lethal factor (LF) peptide-cleavable linker.
Evolution of DMY, a newly emergent male sex-determination gene of medaka fish.

PubMed

Zhang, Jianzhi

2004-04-01

The Japanese medaka fish Oryzias latipes has an XX/XY sex-determination system. The Y-linked sex-determination gene DMY is a duplicate of the autosomal gene DMRT1, which encodes a DM-domain-containing transcriptional factor. DMY appears to have originated recently within Oryzias, allowing a detailed evolutionary study of the initial steps that led to the new gene and new sex-determination system. Here I analyze the publicly available DMRT1 and DMY gene sequences of Oryzias species and report the following findings. First, the synonymous substitution rate in DMY is 1.73 times that in DMRT1, consistent with the male-driven evolution hypothesis. Second, the ratio of the rate of nonsynonymous nucleotide substitution (d(N)) to that of synonymous substitution (d(S)) is significantly higher in DMY than in DMRT1. Third, in DMRT1, the d(N)/d(S) ratio for the DM domain is lower than that for non-DM regions, as expected from the functional importance of the DM domain. But in DMY, the opposite is observed and the DM domain is likely under positive Darwinian selection. Fourth, only one characteristic amino acid distinguishes all DMY sequences from all DMRT1 sequences, suggesting that a single amino acid change may be largely responsible for the establishment of DMY as the male sex-determination gene in medaka fish.
[A novel M142T mutation in the B glycosyltransferase gene associated with B3 variant in Chinese].

PubMed

Xu, Xian-guo; Hong, Xiao-zhen; Liu, Ying; Zhu, Fa-ming; Lv, Hang-jun; Yan, Li-xing

2009-06-01

To investigate the molecular genetic basis of the B3 variant of ABO blood group system with mixed-field hemagglutination in Chinese. Serological techniques were performed to characterize the erythrocyte phenotype of two discrepant samples. A sequential agglutination method and 13 short tandem repeat (STR) loci were tested to exclude the possibility of exogenous or endogenous DNA chimera. Mutations in exons 6 and 7, including partial intron of the ABO gene, were screened by polymerase chain reaction and DNA sequencing. Haplotypes of the two individuals were also analyzed by sequencing. A mixed-field hemagglutination of RBCs with anti-B and anti-AB antibodies was detected in the two unrelated individuals. Exogenous ABO-incompatible RBC transfusion and endogenous genetic chimera were excluded by sequential agglutination method and STR. The ABO phenotypes of the two individuals were classified as A1B3 according to the ABO subgroup definition. The sequence region from intron 5 to 3'-UTR of the B allele was identical to that of ABO*B101 allele, except for a T to C substitution at nucleotide position 425 in exon 7. This substitution resulted in an amino acid change of M142T in the B glycosyltransferase. A novel B allele with 425T>C substitution resulting in B3 subgroup was identified in two Chinese individuals.
Differences in glycosyltransferase family 61 accompany variation in seed coat mucilage composition in Plantago spp.

PubMed

Phan, Jana L; Tucker, Matthew R; Khor, Shi Fang; Shirley, Neil; Lahnstein, Jelle; Beahan, Cherie; Bacic, Antony; Burton, Rachel A

2016-12-01

Xylans are the most abundant non-cellulosic polysaccharide found in plant cell walls. A diverse range of xylan structures influence tissue function during growth and development. Despite the abundance of xylans in nature, details of the genes and biochemical pathways controlling their biosynthesis are lacking. In this study we have utilized natural variation within the Plantago genus to examine variation in heteroxylan composition and structure in seed coat mucilage. Compositional assays were combined with analysis of the glycosyltransferase family 61 (GT61) family during seed coat development, with the aim of identifying GT61 sequences participating in xylan backbone substitution. The results reveal natural variation in heteroxylan content and structure, particularly in P. ovata and P. cunninghamii, species which show a similar amount of heteroxylan but different backbone substitution profiles. Analysis of the GT61 family identified specific sequences co-expressed with IRREGULAR XYLEM 10 genes, which encode putative xylan synthases, revealing a close temporal association between xylan synthesis and substitution. Moreover, in P. ovata, several abundant GT61 sequences appear to lack orthologues in P. cunninghamii. Our results indicate that natural variation in Plantago species can be exploited to reveal novel details of seed coat development and polysaccharide biosynthetic pathways. © The Author 2016. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Primary structure of rat cardiac beta-adrenergic and muscarinic cholinergic receptors obtained by automated DNA sequence analysis: further evidence for a multigene family.

PubMed

Gocayne, J; Robinson, D A; FitzGerald, M G; Chung, F Z; Kerlavage, A R; Lentes, K U; Lai, J; Wang, C D; Fraser, C M; Venter, J C

1987-12-01

Two cDNA clones, lambda RHM-MF and lambda RHB-DAR, encoding the muscarinic cholinergic receptor and the beta-adrenergic receptor, respectively, have been isolated from a rat heart cDNA library. The cDNA clones were characterized by restriction mapping and automated DNA sequence analysis utilizing fluorescent dye primers. The rat heart muscarinic receptor consists of 466 amino acids and has a calculated molecular weight of 51,543. The rat heart beta-adrenergic receptor consists of 418 amino acids and has a calculated molecular weight of 46,890. The two cardiac receptors have substantial amino acid homology (27.2% identity, 50.6% with favored substitutions). The rat cardiac beta receptor has 88.0% homology (92.5% with favored substitutions) with the human brain beta receptor and the rat cardiac muscarinic receptor has 94.6% homology (97.6% with favored substitutions) with the porcine cardiac muscarinic receptor. The muscarinic cholinergic and beta-adrenergic receptors appear to be as conserved as hemoglobin and cytochrome c but less conserved than histones and are clearly members of a multigene family. These data support our hypothesis, based upon biochemical and immunological evidence, that suggests considerable structural homology and evolutionary conservation between adrenergic and muscarinic cholinergic receptors. To our knowledge, this is the first report utilizing automated DNA sequence analysis to determine the structure of a gene.
Reconstruction of the ancestral plastid genome in Geraniaceae reveals a correlation between genome rearrangements, repeats, and nucleotide substitution rates.

PubMed

Weng, Mao-Lun; Blazier, John C; Govindu, Madhumita; Jansen, Robert K

2014-03-01

Geraniaceae plastid genomes are highly rearranged, and each of the four genera already sequenced in the family has a distinct genome organization. This study reports plastid genome sequences of six additional species, Francoa sonchifolia, Melianthus villosus, and Viviania marifolia from Geraniales, and Pelargonium alternans, California macrophylla, and Hypseocharis bilobata from Geraniaceae. These genome sequences, combined with previously published species, provide sufficient taxon sampling to reconstruct the ancestral plastid genome organization of Geraniaceae and the rearrangements unique to each genus. The ancestral plastid genome of Geraniaceae has a 4 kb inversion and a reduced, Pelargonium-like small single copy region. Our ancestral genome reconstruction suggests that a few minor rearrangements occurred in the stem branch of Geraniaceae followed by independent rearrangements in each genus. The genomic comparison demonstrates that a series of inverted repeat boundary shifts and inversions played a major role in shaping genome organization in the family. The distribution of repeats is strongly associated with breakpoints in the rearranged genomes, and the proportion and the number of large repeats (>20 bp and >60 bp) are significantly correlated with the degree of genome rearrangements. Increases in the degree of plastid genome rearrangements are correlated with the acceleration in nonsynonymous substitution rates (dN) but not with synonymous substitution rates (dS). Possible mechanisms that might contribute to this correlation, including DNA repair system and selection, are discussed.
Role of promoter DNA sequence variations on the binding of EGR1 transcription factor.

PubMed

Mikles, David C; Schuchardt, Brett J; Bhat, Vikas; McDonald, Caleb B; Farooq, Amjad

2014-05-01

In response to a wide variety of stimuli such as growth factors and hormones, EGR1 transcription factor is rapidly induced and immediately exerts downstream effects central to the maintenance of cellular homeostasis. Herein, our biophysical analysis reveals that DNA sequence variations within the target gene promoters tightly modulate the energetics of binding of EGR1 and that nucleotide substitutions at certain positions are much more detrimental to EGR1-DNA interaction than others. Importantly, the reduction in binding affinity poorly correlates with the loss of enthalpy and gain of entropy-a trend indicative of a complex interplay between underlying thermodynamic factors due to the differential role of water solvent upon nucleotide substitution. We also provide a rationale for the physical basis of the effect of nucleotide substitutions on the EGR1-DNA interaction at atomic level. Taken together, our study bears important implications on understanding the molecular determinants of a key protein-DNA interaction at the cross-roads of human health and disease. Copyright © 2014 Elsevier Inc. All rights reserved.
Hepatitis E virus and fulminant hepatitis--a virus or host-specific pathology?

PubMed

Smith, Donald B; Simmonds, Peter

2015-04-01

Fulminant hepatitis is a rare outcome of infection with hepatitis E virus. Several recent reports suggest that virus variation is an important determinant of disease progression. To critically examine the evidence that virus-specific factors underlie the development of fulminant hepatitis following hepatitis E virus infection. Published sequence information of hepatitis E virus isolates from patients with and without fulminant hepatitis was collected and analysed using statistical tests to identify associations between virus polymorphisms and disease outcome. Fulminant hepatitis has been reported following infection with all four hepatitis E virus genotypes that infect humans comprising multiple phylogenetic lineages within genotypes 1, 3 and 4. Analysis of virus sequences from individuals infected by a common source did not detect any common substitutions associated with progression to fulminant hepatitis. Re-analysis of previously reported associations between virus substitutions and fulminant hepatitis suggests that these were probably the result of sampling biases. Host-specific factors rather than virus genotype, variants or specific substitutions appear to be responsible for the development of fulminant hepatitis. © 2014 The Authors. Liver International Published by John Wiley & Sons Ltd.

[Studying of molecular mechanisms of rubella virus attenuation evidence from Russian strain C-77].

PubMed

Dmitriev, G V; Borisova, T K; Faĭzuloev, E B; Zabiiaka, Iu I; Desiatskova, R G; Zverev, V V

2012-01-01

Live attenuated rubella vaccine is used for vaccination. Temperature-sensitive (ts) phenotype was proved for almost all rubella vaccine strains, and the acquisition of the ts phenotype during cold adaptation was strongly correlated with the attenuation of the wild-type viruses. Nevertheless, the molecular mechanisms of the attenuation have been insufficiently understood for rubella virus. Study ofthese mechanisms, identifying genotypic markers of attenuation, which together with the sequence analyses could be used for genetic stability control of vaccine strains, is still of current interest. In this work, we determined nearly complete genome sequences of attenuated (ca) and the wildtype progenitor (wt) of the rubella virus strain C-77 isolated in Russia. Possible genetic determinants of attenuation were detected. Thus, 13 nucleotide differences leading to 6 amino acid substitutions were found. Four amino acid substitutions were found to be almost unique. Special consideration should be given to Tyr1042Cys substitution in the protease domain of C-77 strain, because it most probably plays the crucial role in acquisition of ts-phenotype.
SeSaM-Tv-II generates a protein sequence space that is unobtainable by epPCR.

PubMed

Mundhada, Hemanshu; Marienhagen, Jan; Scacioc, Andreea; Schenk, Alexander; Roccatano, Danilo; Schwaneberg, Ulrich

2011-07-04

Generating high-quality mutant libraries in which each amino acid is equally targeted and substituted in a chemically diverse manner is crucial to obtain improved variants in small mutant libraries. The sequence saturation mutagenesis method (SeSaM-Tv(+) ) offers the opportunity to generate such high-quality mutant libraries by introducing consecutive mutations and by enriching transversions. In this study, automated gel electrophoresis, real-time quantitative PCR, and a phosphorimager quantification system were developed and employed to optimize each step of previously reported SeSaM-Tv(+) method. Advancements of the SeSaM-Tv(+) protocol and the use of a novel DNA polymerase quadrupled the number of transversions, by doubling the fraction of consecutive mutations (from 16.7 to 37.1 %). About 33 % of all amino acid substitutions observed in a model library are rarely introduced by epPCR methods, and around 10 % of all clones carried amino acid substitutions that are unobtainable by epPCR. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Neuraminidase inhibitor susceptibility and neuraminidase enzyme kinetics of human influenza A and B viruses circulating in Thailand in 2010-2015.

PubMed

Tewawong, Nipaporn; Marathe, Bindumadhav M; Poovorawan, Yong; Vongpunsawad, Sompong; Webby, Richard J; Govorkova, Elena A

2018-01-01

Amino acid substitutions within or near the active site of the viral neuraminidase (NA) may affect influenza virus fitness. In influenza A(H3N2) and B viruses circulating in Thailand between 2010 and 2015, we identified several NA substitutions that were previously reported to be associated with reduced inhibition by NA inhibitors (NAIs). To study the effect of these substitutions on the enzymatic properties of NA and on virus characteristics, we generated recombinant influenza viruses possessing either a wild type (WT) NA or an NA with a single I222V, S331G, or S331R substitution [in influenza A(H3N2) viruses] or a single D342S, A395T, A395V, or A395D NA substitution (in influenza B viruses). We generated recombinant (7:1) influenza A and B viruses on the genetic background of A/Puerto Rico/8/1934 (A/PR/8, H1N1) or B/Yamanashi/166/1998 (B/YAM) viruses, respectively. In contrast to the expected phenotypes, all the recombinant influenza A(H3N2) and B viruses carrying putative NA resistance substitutions were susceptible to NAIs. The Km and Vmax for the NAs of A/PR8-S331G and A/PR8-S331R viruses were higher than for the NA of WT virus, and the corresponding values for the B/YAM-D342S virus were lower than for the NA of WT virus. Although there was initial variation in the kinetics of influenza A and B viruses' replication in MDCK cells, their titers were comparable to each other and to WT viruses at later time points. All introduced substitutions were stable except for B/YAM-D342S and B/YAM-A395V which reverted to WT sequences after three passages. Our data suggest that inferring susceptibility to NAIs based on sequence information alone should be cautioned. The impact of NA substitution on NAI resistance, viral growth, and enzymatic properties is viral context dependent and should be empirically determined.
PyEvolve: a toolkit for statistical modelling of molecular evolution.

PubMed

Butterfield, Andrew; Vedagiri, Vivek; Lang, Edward; Lawrence, Cath; Wakefield, Matthew J; Isaev, Alexander; Huttley, Gavin A

2004-01-05

Examining the distribution of variation has proven an extremely profitable technique in the effort to identify sequences of biological significance. Most approaches in the field, however, evaluate only the conserved portions of sequences - ignoring the biological significance of sequence differences. A suite of sophisticated likelihood based statistical models from the field of molecular evolution provides the basis for extracting the information from the full distribution of sequence variation. The number of different problems to which phylogeny-based maximum likelihood calculations can be applied is extensive. Available software packages that can perform likelihood calculations suffer from a lack of flexibility and scalability, or employ error-prone approaches to model parameterisation. Here we describe the implementation of PyEvolve, a toolkit for the application of existing, and development of new, statistical methods for molecular evolution. We present the object architecture and design schema of PyEvolve, which includes an adaptable multi-level parallelisation schema. The approach for defining new methods is illustrated by implementing a novel dinucleotide model of substitution that includes a parameter for mutation of methylated CpG's, which required 8 lines of standard Python code to define. Benchmarking was performed using either a dinucleotide or codon substitution model applied to an alignment of BRCA1 sequences from 20 mammals, or a 10 species subset. Up to five-fold parallel performance gains over serial were recorded. Compared to leading alternative software, PyEvolve exhibited significantly better real world performance for parameter rich models with a large data set, reducing the time required for optimisation from approximately 10 days to approximately 6 hours. PyEvolve provides flexible functionality that can be used either for statistical modelling of molecular evolution, or the development of new methods in the field. The toolkit can be used interactively or by writing and executing scripts. The toolkit uses efficient processes for specifying the parameterisation of statistical models, and implements numerous optimisations that make highly parameter rich likelihood functions solvable within hours on multi-cpu hardware. PyEvolve can be readily adapted in response to changing computational demands and hardware configurations to maximise performance. PyEvolve is released under the GPL and can be downloaded from http://cbis.anu.edu.au/software.
Slow but not low: genomic comparisons reveal slower evolutionary rate and higher dN/dS in conifers compared to angiosperms

PubMed Central

2012-01-01

Background Comparative genomics can inform us about the processes of mutation and selection across diverse taxa. Among seed plants, gymnosperms have been lacking in genomic comparisons. Recent EST and full-length cDNA collections for two conifers, Sitka spruce (Picea sitchensis) and loblolly pine (Pinus taeda), together with full genome sequences for two angiosperms, Arabidopsis thaliana and poplar (Populus trichocarpa), offer an opportunity to infer the evolutionary processes underlying thousands of orthologous protein-coding genes in gymnosperms compared with an angiosperm orthologue set. Results Based upon pairwise comparisons of 3,723 spruce and pine orthologues, we found an average synonymous genetic distance (dS) of 0.191, and an average dN/dS ratio of 0.314. Using a fossil-established divergence time of 140 million years between spruce and pine, we extrapolated a nucleotide substitution rate of 0.68 × 10-9 synonymous substitutions per site per year. When compared to angiosperms, this indicates a dramatically slower rate of nucleotide substitution rates in conifers: on average 15-fold. Coincidentally, we found a three-fold higher dN/dS for the spruce-pine lineage compared to the poplar-Arabidopsis lineage. This joint occurrence of a slower evolutionary rate in conifers with higher dN/dS, and possibly positive selection, showcases the uniqueness of conifer genome evolution. Conclusions Our results are in line with documented reduced nucleotide diversity, conservative genome evolution and low rates of diversification in conifers on the one hand and numerous examples of local adaptation in conifers on the other hand. We propose that reduced levels of nucleotide mutation in large and long-lived conifer trees, coupled with large effective population size, were the main factors leading to slow substitution rates but retention of beneficial mutations. PMID:22264329
Developmentally distinct MYB genes encode functionally equivalent proteins in Arabidopsis.

PubMed

Lee, M M; Schiefelbein, J

2001-05-01

The duplication and divergence of developmental control genes is thought to have driven morphological diversification during the evolution of multicellular organisms. To examine the molecular basis of this process, we analyzed the functional relationship between two paralogous MYB transcription factor genes, WEREWOLF (WER) and GLABROUS1 (GL1), in Arabidopsis. The WER and GL1 genes specify distinct cell types and exhibit non-overlapping expression patterns during Arabidopsis development. Nevertheless, reciprocal complementation experiments with a series of gene fusions showed that WER and GL1 encode functionally equivalent proteins, and their unique roles in plant development are entirely due to differences in their cis-regulatory sequences. Similar experiments with a distantly related MYB gene (MYB2) showed that its product cannot functionally substitute for WER or GL1. Furthermore, an analysis of the WER and GL1 proteins shows that conserved sequences correspond to specific functional domains. These results provide new insights into the evolution of the MYB gene family in Arabidopsis, and, more generally, they demonstrate that novel developmental gene function may arise solely by the modification of cis-regulatory sequences.
L-Asparaginase from Streptomyces griseus NIOT-VKMA29: optimization of process variables using factorial designs and molecular characterization of L-asparaginase gene

NASA Astrophysics Data System (ADS)

Meena, Balakrishnan; Anburajan, Lawrance; Sathish, Thadikamala; Vijaya Raghavan, Rangamaran; Dharani, Gopal; Valsalan Vinithkumar, Nambali; Kirubagaran, Ramalingam

2015-07-01

Marine actinobacteria are known to be a rich source for novel metabolites with diverse biological activities. In this study, a potential extracellular L-asparaginase was characterised from the Streptomyces griseus NIOT-VKMA29. Box-Behnken based optimization was used to determine the culture medium components to enhance the L-asparaginase production. pH, starch, yeast extract and L-asparagine has a direct correlation for enzyme production with a maximum yield of 56.78 IU mL-1. A verification experiment was performed to validate the experiment and more than 99% validity was established. L-Asparaginase biosynthesis gene (ansA) from Streptomyces griseus NIOT-VKMA29 was heterologously expressed in Escherichia coli M15 and the enzyme production was increased threefold (123 IU mL-1) over the native strain. The ansA gene sequences reported in this study encloses several base substitutions with that of reported sequences in GenBank, resulting in altered amino acid sequences of the translated protein.
Regions of conservation and divergence in the 3' untranslated sequences of genomic RNA from Ross River virus isolates.

PubMed

Faragher, S G; Dalgarno, L

1986-07-20

The 3' untranslated (UT) sequences of the genomic RNAs of five geographic variants of the alphavirus Ross River virus (RRV) were determined and compared with the 3' UT sequence of RRV T48, the prototype strain. Part of the 3' UT region of Getah virus, a close serological relative of RRV, was also sequenced. The RRV 3' UT region varies markedly in length between variants. Large deletions or insertions, sequence rearrangements and single nucleotide substitutions are observed. A sequence tract of 49 to 58 nucleotides, which is repeated as four blocks in the RRV T48 3' UT region, occurs only once in the 3' UT region of one RRV strain (NB5092), indicating that the existence of repeat sequence blocks is not essential for RRV replication. However, the precise sequence of the 3' proximal copy of the repeat block and its position relative to the poly(A) tail were identical in all RRV isolates examined, suggesting that it has an important role in RRV replication. Nucleotide substitutions between RRV variants are distributed non-randomly along the length of the 3' UT region. The sequence of 120 to 130 nucleotides adjacent to the poly(A) tail is strongly conserved. Getah virus RNA contains three repeat sequence blocks in the 3' UT region. These are similar in sequence to those in RRV RNA but differ in their arrangement. Homology between the RRV and Getah 3' UT sequences is greatest in the 3' proximal repeat sequence block that shows three differences in 49 nucleotides. The 3' proximal repeat in Getah RNA occurs at the same position, relative to the poly(A) tail, as in all RRV variants. The RRV and Getah virus 3' UT sequences show extensive homology in the region between the 3' proximal repeat and the poly(A) tail but, apart from the repeat blocks themselves, they show no significant homology elsewhere.
Investigation of the protein osteocalcin of Camelops hesternus: Sequence, structure and phylogenetic implications

NASA Astrophysics Data System (ADS)

Humpula, James F.; Ostrom, Peggy H.; Gandhi, Hasand; Strahler, John R.; Walker, Angela K.; Stafford, Thomas W.; Smith, James J.; Voorhies, Michael R.; George Corner, R.; Andrews, Phillip C.

2007-12-01

Ancient DNA sequences offer an extraordinary opportunity to unravel the evolutionary history of ancient organisms. Protein sequences offer another reservoir of genetic information that has recently become tractable through the application of mass spectrometric techniques. The extent to which ancient protein sequences resolve phylogenetic relationships, however, has not been explored. We determined the osteocalcin amino acid sequence from the bone of an extinct Camelid (21 ka, Camelops hesternus) excavated from Isleta Cave, New Mexico and three bones of extant camelids: bactrian camel ( Camelus bactrianus); dromedary camel ( Camelus dromedarius) and guanaco ( Llama guanacoe) for a diagenetic and phylogenetic assessment. There was no difference in sequence among the four taxa. Structural attributes observed in both modern and ancient osteocalcin include a post-translation modification, Hyp 9, deamidation of Gln 35 and Gln 39, and oxidation of Met 36. Carbamylation of the N-terminus in ancient osteocalcin may result in blockage and explain previous difficulties in sequencing ancient proteins via Edman degradation. A phylogenetic analysis using osteocalcin sequences of 25 vertebrate taxa was conducted to explore osteocalcin protein evolution and the utility of osteocalcin sequences for delineating phylogenetic relationships. The maximum likelihood tree closely reflected generally recognized taxonomic relationships. For example, maximum likelihood analysis recovered rodents, birds and, within hominins, the Homo-Pan-Gorilla trichotomy. Within Artiodactyla, character state analysis showed that a substitution of Pro 4 for His 4 defines the Capra-Ovis clade within Artiodactyla. Homoplasy in our analysis indicated that osteocalcin evolution is not a perfect indicator of species evolution. Limited sequence availability prevented assigning functional significance to sequence changes. Our preliminary analysis of osteocalcin evolution represents an initial step towards a complete character analysis aimed at determining the evolutionary history of this functionally significant protein. We emphasize that ancient protein sequencing and phylogenetic analyses using amino acid sequences must pay close attention to post-translational modifications, amino acid substitutions due to diagenetic alteration and the impacts of isobaric amino acids on mass shifts and sequence alignments.
12 CFR Appendix C to Part 229 - Model Availability Policy Disclosures, Clauses, and Notices; Model Substitute Check Policy...

Code of Federal Regulations, 2014 CFR

2014-01-01

... processing regions)]. If you make the deposit in person to one of our employees, funds from the following... in different states or check processing regions)]. If you make the deposit in person to one of our...] Substitute Checks and Your Rights What Is a Substitute Check? To make check processing faster, federal law...
Molecular evolution of the leptin exon 3 in some species of the family Canidae

PubMed Central

Chmurzynska, Agata; Zajac, Magdalena; Switonski, Marek

2003-01-01

The structure of the leptin gene seems to be well conserved. The polymorphism of this gene in four species belonging to the Canidae family (the dog (Canis familiaris) – 16 different breeds, the Chinese racoon dog (Nyctereutes procyonoides procyonoides), the red fox (Vulpes vulpes) and the arctic fox (Alopex lagopus)) were studied with the use of single strand conformation polymorphism (SSCP), restriction fragment length polymorphism (RFLP) and DNA sequencing techniques. For exon 2, all species presented the same SSCP pattern, while in exon 3 some differences were found. DNA sequencing of exon 3 revealed the presence of six nucleotide substitutions, differentiating the studied species. Three of them cause amino acid substitutions as well. For all dog breeds studied, SSCP patterns were identical. PMID:12939206
The National Shipbuilding Research Program: Implementation of Past NSRP Research Through Education and Training

DTIC Science & Technology

1999-01-05

used in each chapter to define the techniques of waste minimization are: improved operation management , material substitution, process substitution...1994 – Reduce Quantity & Toxicity of Waste • Improved Operation Management • Material & Process Substitution • Recycling • Treatment Advantages
Analysis of the whole mitochondrial genome: translation of the Ion Torrent Personal Genome Machine system to the diagnostic bench?

PubMed

Seneca, Sara; Vancampenhout, Kim; Van Coster, Rudy; Smet, Joél; Lissens, Willy; Vanlander, Arnaud; De Paepe, Boel; Jonckheere, An; Stouffs, Katrien; De Meirleir, Linda

2015-01-01

Next-generation sequencing (NGS), an innovative sequencing technology that enables the successful analysis of numerous gene sequences in a massive parallel sequencing approach, has revolutionized the field of molecular biology. Although NGS was introduced in a rather recent past, the technology has already demonstrated its potential and effectiveness in many research projects, and is now on the verge of being introduced into the diagnostic setting of routine laboratories to delineate the molecular basis of genetic disease in undiagnosed patient samples. We tested a benchtop device on retrospective genomic DNA (gDNA) samples of controls and patients with a clinical suspicion of a mitochondrial DNA disorder. This Ion Torrent Personal Genome Machine platform is a high-throughput sequencer with a fast turnaround time and reasonable running costs. We challenged the chemistry and technology with the analysis and processing of a mutational spectrum composed of samples with single-nucleotide substitutions, indels (insertions and deletions) and large single or multiple deletions, occasionally in heteroplasmy. The output data were compared with previously obtained conventional dideoxy sequencing results and the mitochondrial revised Cambridge Reference Sequence (rCRS). We were able to identify the majority of all nucleotide alterations, but three false-negative results were also encountered in the data set. At the same time, the poor performance of the PGM instrument in regions associated with homopolymeric stretches generated many false-positive miscalls demanding additional manual curation of the data.
Determining geographical spread pattern of MERS-CoV by distance method using Kimura model

NASA Astrophysics Data System (ADS)

Amiroch, Siti; Rohmatullah, Arif

2017-03-01

MERS-CoV or generally called as Middle East Respiratory Syndrome Coronavirus, a respiratory disease syndrome caused by a corona virus that attacks the respiratory tract ranging from mild to severe acute indication of fever, cough and shortness of breath. The cases happened relate to the countries in the Arabian Peninsula (Middle East) and there were 356 deaths have been reported due to the spread of the epidemic MERS. The data used in the case of MERS are the data DNA sequences taken from Genbank, the online database of the United States that stores the results of molecular biological experiments from all over the world (http://www.ncbi.nlm.nih.gov). In this case, bioinformatics plays an important role of reading sequences of DNA and genetic information by using the main device in the form of software that is supported by the availability of the Internet, while the analysis there in made and proven with mathematical methods. In similar research conducted by molecular biologists and physicians, the process of DNA sequencing is done with software that is already available like BLAST. In order to determine the MERS geographical distribution patterns in the Arabian Peninsula is done with program Clustal W, Bayesian, Phylip, etc. In this study, the writer use the Matlab simulation for all processes starting sequence alignment, counting the number of transitions and transversion substitutions for each sequence and its location up to the process of forming a phylogenetic tree that figures out the pattern of spread of the epidemic MERS. Mathematical analysis performed on a decline in the formula is to find Kimura evolutionary models and the process of forming a phylogenetic tree (the pattern of the epidemic MERS distribution) with neighbor joining algorithm. Finally it was obtained the pattern of geographical spread with 6 groups epidemic of MERS which ultimately turns out that all the MERS viruses that were spread in the Arabian Peninsula everything are almost the same as the virus sequence found in al-Hasa.
Real-time, portable genome sequencing for Ebola surveillance.

PubMed

Quick, Joshua; Loman, Nicholas J; Duraffour, Sophie; Simpson, Jared T; Severi, Ettore; Cowley, Lauren; Bore, Joseph Akoi; Koundouno, Raymond; Dudas, Gytis; Mikhail, Amy; Ouédraogo, Nobila; Afrough, Babak; Bah, Amadou; Baum, Jonathan Hj; Becker-Ziaja, Beate; Boettcher, Jan-Peter; Cabeza-Cabrerizo, Mar; Camino-Sanchez, Alvaro; Carter, Lisa L; Doerrbecker, Juiliane; Enkirch, Theresa; Dorival, Isabel Graciela García; Hetzelt, Nicole; Hinzmann, Julia; Holm, Tobias; Kafetzopoulou, Liana Eleni; Koropogui, Michel; Kosgey, Abigail; Kuisma, Eeva; Logue, Christopher H; Mazzarelli, Antonio; Meisel, Sarah; Mertens, Marc; Michel, Janine; Ngabo, Didier; Nitzsche, Katja; Pallash, Elisa; Patrono, Livia Victoria; Portmann, Jasmine; Repits, Johanna Gabriella; Rickett, Natasha Yasmin; Sachse, Andrea; Singethan, Katrin; Vitoriano, Inês; Yemanaberhan, Rahel L; Zekeng, Elsa G; Trina, Racine; Bello, Alexander; Sall, Amadou Alpha; Faye, Ousmane; Faye, Oumar; Magassouba, N'Faly; Williams, Cecelia V; Amburgey, Victoria; Winona, Linda; Davis, Emily; Gerlach, Jon; Washington, Franck; Monteil, Vanessa; Jourdain, Marine; Bererd, Marion; Camara, Alimou; Somlare, Hermann; Camara, Abdoulaye; Gerard, Marianne; Bado, Guillaume; Baillet, Bernard; Delaune, Déborah; Nebie, Koumpingnin Yacouba; Diarra, Abdoulaye; Savane, Yacouba; Pallawo, Raymond Bernard; Gutierrez, Giovanna Jaramillo; Milhano, Natacha; Roger, Isabelle; Williams, Christopher J; Yattara, Facinet; Lewandowski, Kuiama; Taylor, Jamie; Rachwal, Philip; Turner, Daniel; Pollakis, Georgios; Hiscox, Julian A; Matthews, David A; O'Shea, Matthew K; Johnston, Andrew McD; Wilson, Duncan; Hutley, Emma; Smit, Erasmus; Di Caro, Antonino; Woelfel, Roman; Stoecker, Kilian; Fleischmann, Erna; Gabriel, Martin; Weller, Simon A; Koivogui, Lamine; Diallo, Boubacar; Keita, Sakoba; Rambaut, Andrew; Formenty, Pierre; Gunther, Stephan; Carroll, Miles W

2016-02-11

The Ebola virus disease epidemic in West Africa is the largest on record, responsible for over 28,599 cases and more than 11,299 deaths. Genome sequencing in viral outbreaks is desirable to characterize the infectious agent and determine its evolutionary rate. Genome sequencing also allows the identification of signatures of host adaptation, identification and monitoring of diagnostic targets, and characterization of responses to vaccines and treatments. The Ebola virus (EBOV) genome substitution rate in the Makona strain has been estimated at between 0.87 × 10(-3) and 1.42 × 10(-3) mutations per site per year. This is equivalent to 16-27 mutations in each genome, meaning that sequences diverge rapidly enough to identify distinct sub-lineages during a prolonged epidemic. Genome sequencing provides a high-resolution view of pathogen evolution and is increasingly sought after for outbreak surveillance. Sequence data may be used to guide control measures, but only if the results are generated quickly enough to inform interventions. Genomic surveillance during the epidemic has been sporadic owing to a lack of local sequencing capacity coupled with practical difficulties transporting samples to remote sequencing facilities. To address this problem, here we devise a genomic surveillance system that utilizes a novel nanopore DNA sequencing instrument. In April 2015 this system was transported in standard airline luggage to Guinea and used for real-time genomic surveillance of the ongoing epidemic. We present sequence data and analysis of 142 EBOV samples collected during the period March to October 2015. We were able to generate results less than 24 h after receiving an Ebola-positive sample, with the sequencing process taking as little as 15-60 min. We show that real-time genomic surveillance is possible in resource-limited settings and can be established rapidly to monitor outbreaks.
Efficient Synthesis of γ-Lactams by a Tandem Reductive Amination/Lactamization Sequence

PubMed Central

Nöth, Julica; Frankowski, Kevin J.; Neuenswander, Benjamin; Aubé, Jeffrey; Reiser, Oliver

2009-01-01

A three-component method for synthesizing highly-substituted γ-lactams from readily available maleimides, aldehydes and amines is described. A new reductive amination/intramolecular lactamization sequence provides a straightforward route to the lactam products in a single manipulation. The general utility of this method is demonstrated by the parallel synthesis of a γ-lactam library. PMID:18338857
Efficient synthesis of gamma-lactams by a tandem reductive amination/lactamization sequence.

PubMed

Nöth, Julica; Frankowski, Kevin J; Neuenswander, Benjamin; Aubé, Jeffrey; Reiser, Oliver

2008-01-01

A three-component method for the synthesis of highly substituted gamma-lactams from readily available maleimides, aldehydes, and amines is described. A new reductive amination/intramolecular lactamization sequence provides a straightforward route to the lactam products in a single manipulation. The general utility of this method is demonstrated by the parallel synthesis of a gamma-lactam library.
From N-triisopropylsilylpyrrole to an optically active C-4 substituted pyroglutamic acid: total synthesis of penmacric acid.

PubMed

Berini, Christophe; Pelloux-Léon, Nadia; Minassian, Frédéric; Denis, Jean-Noël

2009-11-07

The stereoselective synthesis of penmacric acid, an optically active C-4 substituted pyroglutamic acid, has been efficiently achieved through an unusual 11-step sequence starting from simple N-triisopropylsilylpyrrole. The key-steps are the initial addition of the pyrrole nucleus onto a chiral nitrone and the obtention of the pyroglutamic acid moiety by reductive hydrogenation of the pyrrole followed by oxidation of the corresponding pyrrolidine into pyrrolidinone.
A glycine-to-glutamate substitution abolishes alanine:glyoxylate aminotransferase catalytic activity in a subset of patients with primary hyperoxaluria type 1.

PubMed

Purdue, P E; Lumb, M J; Allsop, J; Minatogawa, Y; Danpure, C J

1992-05-01

We have synthesized and sequenced alanine:glyoxylate aminotransferase (AGT; HGMW-approved symbol for the gene--AGXT) cDNA from the liver of a primary hyperoxaluria type 1 (PH1) patient who had normal levels of hepatic peroxisomal immunoreactive AGT protein, but no AGT catalytic activity. This revealed the presence of a single point mutation (G----A at cDNA nucleotide 367), which is predicted to cause a glycine-to-glutamate substitution at residue 82 of the AGT protein. This mutation is located in exon 2 of the AGT gene and leads to the loss of an AvaI restriction site. Exon 2-specific PCR followed by AvaI digestion showed that this patient was homozygous for this mutation. In addition, three other PH1 patients, one related to and two unrelated to, but with enzymological phenotype similar to that of the first patient, were also shown to be homozygous for the mutation. However, one other phenotypically similar PH1 patient was shown to lack this mutation. The mechanism by which the glycine-to-glutamate substitution at residue 82 causes loss of catalytic activity remains to be resolved. However, the protein sequence in this region is highly conserved between different mammals, and the substitution at residue 82 is predicted to cause significant local structural alterations.
Mammalian genome projects reveal new growth hormone (GH) sequences. Characterization of the GH-encoding genes of armadillo (Dasypus novemcinctus), hedgehog (Erinaceus europaeus), bat (Myotis lucifugus), hyrax (Procavia capensis), shrew (Sorex araneus), ground squirrel (Spermophilus tridecemlineatus), elephant (Loxodonta africana), cat (Felis catus) and opossum (Monodelphis domestica).

PubMed

Wallis, Michael

2008-01-15

Mammalian growth hormone (GH) sequences have been shown previously to display episodic evolution: the sequence is generally strongly conserved but on at least two occasions during mammalian evolution (on lineages leading to higher primates and ruminants) bursts of rapid evolution occurred. However, the number of mammalian orders studied previously has been relatively limited, and the availability of sequence data via mammalian genome projects provides the potential for extending the range of GH gene sequences examined. Complete or nearly complete GH gene sequences for six mammalian species for which no data were previously available have been extracted from the genome databases-Dasypus novemcinctus (nine-banded armadillo), Erinaceus europaeus (western European hedgehog), Myotis lucifugus (little brown bat), Procavia capensis (cape rock hyrax), Sorex araneus (European shrew), Spermophilus tridecemlineatus (13-lined ground squirrel). In addition incomplete data for several other species have been extended. Examination of the data in detail and comparison with previously available sequences has allowed assessment of the reliability of deduced sequences. Several of the new sequences differ substantially from the consensus sequence previously determined for eutherian GHs, indicating greater variability than previously recognised, and confirming the episodic pattern of evolution. The episodic pattern is not seen for signal sequences, 5' upstream sequence or synonymous substitutions-it is specific to the mature protein sequence, suggesting that it relates to the hormonal function. The substitutions accumulated during the course of GH evolution have occurred mainly on the side of the hormone facing away from the receptor, in a non-random fashion, and it is suggested that this may reflect interaction of the receptor-bound hormone with other proteins or small ligands.

In search of efficient 5-endo-dig cyclization of a carbon-centered radical: 40 years from a prediction to another success for the Baldwin rules.

PubMed

Alabugin, Igor V; Timokhin, Vitaliy I; Abrams, Jason N; Manoharan, Mariappan; Abrams, Rachel; Ghiviriga, Ion

2008-08-20

Despite being predicted to be stereoelectronically favorable by the Baldwin rules, efficient formation of a C-C bond through a 5-endo-dig radical cyclization remained unknown for more than 40 years. This work reports a remarkable increase in the efficiency of this process upon beta-Ts substitution, which led to the development of an expedient approach to densely functionalized cyclic 1,3-dienes. Good qualitative agreement between the increased efficiency and stereoselectivity for the 5-endo-dig cyclization of Ts-substituted vinyl radicals and the results of density functional theory analysis further confirms the utility of computational methods in the design of new radical processes. Although reactions of Br atoms generated through photochemical Ts-Br bond homolysis lead to the formation of cyclic dibromide side products, the yields of target bromosulfones in the photochemically induced reactions can be increased by recycling the dibromide byproduct into the target bromosulfones through a sequence of addition/elimination reactions at the exocyclic double bond. Discovery of a relatively efficient radical 5-endo-dig closure, accompanied by a C-C bond formation, provides further support to stereoelectronic considerations at the heart of the Baldwin rules and fills one of the last remaining gaps in the arsenal of radical cyclizations.
The influence of specific neighboring bases on substitution bias in noncoding regions of the plant chloroplast genome.

PubMed

Morton, B R; Oberholzer, V M; Clegg, M T

1997-09-01

Substitutions occurring in noncoding sequences of the plant chloroplast genome violate the independence of sites that is assumed by substitution models in molecular evolution. The probability that a substitution at a site is a transversion, as opposed to a transition, increases significantly with increasing A + T content of the two adjacent nucleotides. In the present study, this dependency of substitutions on local context is examined further in a number of noncoding regions from the chloroplast genome of members of the grass family (Poaceae). Two features were examined; the influence of specific neighboring bases, as opposed to the general A + T content, on transversion proportion and an influence on substitutions by nucleotides other than the two immediately adjacent to the site of substitution. In both cases, a significant effect was found. In the case of specific nucleotides, transversion proportion is significantly higher at sites with a pyrimidine immediately 5' on either strand. Substitutions at sites of the type YNR, where N is the site of substitution, have the highest rate of transversion. This specific effect is secondary to the A + T content effect such that, in terms of proportion of substitutions that are transversions, the nucleotides are ranked T > A > C > G as to their effect when they are immediately 5' to the site of substitution. In the case of nucleotides other than the immediate neighbors, a significant influence on substitution dynamics is observed in the case where the two neighboring bases are both A and/or T. Thus, substitutions are primarily, but not exclusively, influenced by the composition of the two nucleotides that are immediately adjacent. These results indicate that the pattern of molecular evolution of the plant chloroplast genome is extremely complex as a result of a variety of inter-site dependencies.
The human immunodeficiency virus type 1 long terminal repeat specifies two different transcription complexes, only one of which is regulated by Tat.

PubMed Central

Lu, X; Welsh, T M; Peterlin, B M

1993-01-01

The human immunodeficiency virus type 1 long terminal repeat sets up two different transcription complexes, which have been called processive and nonprocessive complexes. By mutating and substituting cis-acting sequences, we mapped elements of the human immunodeficiency virus long terminal repeat that are responsible for creating each transcription complex. Whereas processive complexes are efficiently assembled by upstream promoter elements in the absence of the TATA box, nonprocessive complexes absolutely require the TATA box. Moreover, the TATA box alone can set up these nonprocessive complexes, and nonprocessive but not processive complexes are trans activated by Tat. Finally, a strong DNA-binding site between the TATA box and trans-activation-responsive region interferes with either the assembly or movement of these nonprocessive complexes and diminishes the effects of Tat. Thus, Tat affects a critical step in the formation of elongation-competent transcription complexes. Images PMID:8445708
Color differences among feral pigeons (Columba livia) are not attributable to sequence variation in the coding region of the melanocortin-1 receptor gene (MC1R)

PubMed Central

2013-01-01

Background Genetic variation at the melanocortin-1 receptor (MC1R) gene is correlated with melanin color variation in many birds. Feral pigeons (Columba livia) show two major melanin-based colorations: a red coloration due to pheomelanic pigment and a black coloration due to eumelanic pigment. Furthermore, within each color type, feral pigeons display continuous variation in the amount of melanin pigment present in the feathers, with individuals varying from pure white to a full dark melanic color. Coloration is highly heritable and it has been suggested that it is under natural or sexual selection, or both. Our objective was to investigate whether MC1R allelic variants are associated with plumage color in feral pigeons. Findings We sequenced 888 bp of the coding sequence of MC1R among pigeons varying both in the type, eumelanin or pheomelanin, and the amount of melanin in their feathers. We detected 10 non-synonymous substitutions and 2 synonymous substitution but none of them were associated with a plumage type. It remains possible that non-synonymous substitutions that influence coloration are present in the short MC1R fragment that we did not sequence but this seems unlikely because we analyzed the entire functionally important region of the gene. Conclusions Our results show that color differences among feral pigeons are probably not attributable to amino acid variation at the MC1R locus. Therefore, variation in regulatory regions of MC1R or variation in other genes may be responsible for the color polymorphism of feral pigeons. PMID:23915680
Correlation of Local Effects of DNA Sequence and Position of Beta-Alanine Inserts with Polyamide-DNA Complex Binding Affinities and Kinetics

PubMed Central

Wang, Shuo; Nanjunda, Rupesh; Aston, Karl; Bashkin, James K.; Wilson, W. David

2012-01-01

In order to better understand the effects of β-alanine (β) substitution and the number of heterocycles on DNA binding affinity and selectivity, the interactions of an eight-ring hairpin polyamide (PA) and two β derivatives as well as a six-heterocycle analog have been investigated with their cognate DNA sequence, 5′-TGGCTT-3′. Binding selectivity and the effects of β have been investigated with the cognate and five mutant DNAs. A set of powerful and complementary methods have been employed for both energetic and structural evaluations: UV-melting, biosensor-surface plasmon resonance, isothermal titration calorimetry, circular dichroism and a DNA ligation ladder global structure assay. The reduced number of heterocycles in the six-ring PA weakens the binding affinity; however, the smaller PA aggregates significantly less than the larger PAs, and allows us to obtain the binding thermodynamics. The PA-DNA binding enthalpy is large and negative with a large negative ΔCp, and is the primary driving component of the Gibbs free energy. The complete SPR binding results clearly show that β substitutions can substantially weaken the binding affinity of hairpin PAs in a position-dependent manner. More importantly, the changes in PA binding to the mutant DNAs further confirm the position-dependent effects on PA-DNA interaction affinity. Comparison of mutant DNA sequences also shows a different effect in recognition of T•A versus A•T base pairs. The effects of DNA mutations on binding of a single PA as well as the effects of the position of β substitution on binding tell a clear and very important story about sequence dependent binding of PAs to DNA. PMID:23167504
Full genome sequences and molecular characterization of tick-borne encephalitis virus strains isolated from human patients.

PubMed

Formanová, Petra; Černý, Jiří; Bolfíková, Barbora Černá; Valdés, James J; Kozlova, Irina; Dzhioev, Yuri; Růžek, Daniel

2015-02-01

Tick-borne encephalitis virus (TBEV) causes tick-borne encephalitis (TBE), one of the most important human neuroinfections across Eurasia. Up to date, only three full genome sequences of human European TBEV isolates are available, mostly due to difficulties with isolation of the virus from human patients. Here we present full genome characterization of an additional five low-passage TBEV strains isolated from human patients with severe forms of TBE. These strains were isolated in 1953 within Central Bohemia in the former Czechoslovakia, and belong to the historically oldest human TBEV isolates in Europe. We demonstrate here that all analyzed isolates are distantly phylogenetically related, indicating that the emergence of TBE in Central Europe was not caused by one predominant strain, but rather a pool of distantly related TBEV strains. Nucleotide identity between individual sequenced TBEV strains ranged from 97.5% to 99.6% and all strains shared large deletions in the 3' non-coding region, which has been recently suggested to be an important determinant of virulence. The number of unique amino acid substitutions varied from 3 to 9 in individual isolates, but no characteristic amino acid substitution typical exclusively for all human TBEV isolates was identified when compared to the isolates from ticks. We did, however, correlate that the exploration of the TBEV envelope glycoprotein by specific antibodies were in close proximity to these unique amino acid substitutions. Taken together, we report here the largest number of patient-derived European TBEV full genome sequences to date and provide a platform for further studies on evolution of TBEV since the first emergence of human TBE in Europe. Copyright © 2014 Elsevier GmbH. All rights reserved.
Parameters of proteome evolution from histograms of amino-acid sequence identities of paralogous proteins

PubMed Central

Axelsen, Jacob Bock; Yan, Koon-Kiu; Maslov, Sergei

2007-01-01

Background The evolution of the full repertoire of proteins encoded in a given genome is mostly driven by gene duplications, deletions, and sequence modifications of existing proteins. Indirect information about relative rates and other intrinsic parameters of these three basic processes is contained in the proteome-wide distribution of sequence identities of pairs of paralogous proteins. Results We introduce a simple mathematical framework based on a stochastic birth-and-death model that allows one to extract some of this information and apply it to the set of all pairs of paralogous proteins in H. pylori, E. coli, S. cerevisiae, C. elegans, D. melanogaster, and H. sapiens. It was found that the histogram of sequence identities p generated by an all-to-all alignment of all protein sequences encoded in a genome is well fitted with a power-law form ~ p-γ with the value of the exponent γ around 4 for the majority of organisms used in this study. This implies that the intra-protein variability of substitution rates is best described by the Gamma-distribution with the exponent α ≈ 0.33. Different features of the shape of such histograms allow us to quantify the ratio between the genome-wide average deletion/duplication rates and the amino-acid substitution rate. Conclusion We separately measure the short-term ("raw") duplication and deletion rates rdup∗, rdel∗ which include gene copies that will be removed soon after the duplication event and their dramatically reduced long-term counterparts rdup, rdel. High deletion rate among recently duplicated proteins is consistent with a scenario in which they didn't have enough time to significantly change their functional roles and thus are to a large degree disposable. Systematic trends of each of the four duplication/deletion rates with the total number of genes in the genome were analyzed. All but the deletion rate of recent duplicates rdel∗ were shown to systematically increase with Ngenes. Abnormally flat shapes of sequence identity histograms observed for yeast and human are consistent with lineages leading to these organisms undergoing one or more whole-genome duplications. This interpretation is corroborated by our analysis of the genome of Paramecium tetraurelia where the p-4 profile of the histogram is gradually restored by the successive removal of paralogs generated in its four known whole-genome duplication events. PMID:18039386
Genetic and evolutionary characterization of RABVs from China using the phosphoprotein gene.

PubMed

Wang, Lihua; Wu, Hui; Tao, Xiaoyan; Li, Hao; Rayner, Simon; Liang, Guodong; Tang, Qing

2013-01-07

While the function of the phosphoprotein (P) gene of the rabies virus (RABV) has been well studied in laboratory adapted RABVs, the genetic diversity and evolution characteristics of the P gene of street RABVs remain unclear. The objective of the present study was to investigate the mutation and evolution of P genes in Chinese street RABVs. The P gene of 77 RABVs from brain samples of dogs and wild animals collected in eight Chinese provinces through 2003 to 2008 were sequenced. The open reading frame (ORF) of the P genes was 894 nucleotides (nt) in length, with 85-99% (80-89%) amino acid (nucleotide) identity compared with the laboratory RABVs and vaccine strains. Phylogenetic analysis based on the P gene revealed that Chinese RABVs strains could be divided into two distinct clades, and several RABV variants were found to co circulating in the same province. Two conserved (CD1, 2) and two variable (VD1, 2) domains were identified by comparing the deduced primary sequences of the encoded P proteins. Two sequence motifs, one believed to confer binding to the cytoplasmic dynein light chain LC8 and a lysine-rich sequence were conserved throughout the Chinese RABVs. In contrast, the isolates exhibited lower conservation of one phosphate acceptor and one internal translation initiation site identified in the P protein of the rabies challenge virus standard (CVS) strain. Bayesian coalescent analysis showed that the P gene in Chinese RABVs have a substitution rate (3.305x10(-4) substitutions per site per year) and evolution history (592 years ago) similar to values for the glycoprotein (G) and nucleoprotein (N) reported previously. Several substitutions were found in the P gene of Chinese RABVs strains compared to the laboratory adapted and vaccine strains, whether these variations could affect the biological characteristics of Chinese RABVs need to be further investigated. The substitution rate and evolution history of P gene is similar to G and N gene, combine the topology of phylogenetic tree based on the P gene is similar to the G and N gene trees, indicate that the P, G and N genes are equally valid for examining the phylogenetics of RABVs.
Molecular Analysis of Glucose-6-Phosphate Dehydrogenase Gene Mutations in Bangladeshi Individuals.

PubMed

Sarker, Suprovath Kumar; Islam, Md Tarikul; Eckhoff, Grace; Hossain, Mohammad Amir; Qadri, Syeda Kashfi; Muraduzzaman, A K M; Bhuyan, Golam Sarower; Shahidullah, Mohammod; Mannan, Mohammad Abdul; Tahura, Sarabon; Hussain, Manzoor; Akhter, Shahida; Nahar, Nazmun; Shirin, Tahmina; Qadri, Firdausi; Mannoor, Kaiissar

2016-01-01

Glucose-6-phosphate dehydrogenase (G6PD) deficiency is a common X-linked human enzyme defect of red blood cells (RBCs). Individuals with this gene defect appear normal until exposed to oxidative stress which induces hemolysis. Consumption of certain foods such as fava beans, legumes; infection with bacteria or virus; and use of certain drugs such as primaquine, sulfa drugs etc. may result in lysis of RBCs in G6PD deficient individuals. The genetic defect that causes G6PD deficiency has been identified mostly as single base missense mutations. One hundred and sixty G6PD gene mutations, which lead to amino acid substitutions, have been described worldwide. The purpose of this study was to detect G6PD gene mutations in hospital-based settings in the local population of Dhaka city, Bangladesh. Qualitative fluorescent spot test and quantitative enzyme activity measurement using RANDOX G6PDH kit were performed for analysis of blood specimens and detection of G6PD-deficient participants. For G6PD-deficient samples, PCR was done with six sets of primers specific for G6PD gene. Automated Sanger sequencing of the PCR products was performed to identify the mutations in the gene. Based on fluorescence spot test and quantitative enzyme assay followed by G6PD gene sequencing, 12 specimens (11 males and one female) among 121 clinically suspected patient-specimens were found to be deficient, suggesting a frequency of 9.9% G6PD deficiency. Sequencing of the G6PD-deficient samples revealed c.C131G substitution (exon-3: Ala44Gly) in six samples, c.G487A substitution (exon-6:Gly163Ser) in five samples and c.G949A substitution (exon-9: Glu317Lys) of coding sequence in one sample. These mutations either affect NADP binding or disrupt protein structure. From the study it appears that Ala44Gly and Gly163Ser are the most common G6PD mutations in Dhaka, Bangladesh. This is the first study of G6PD mutations in Bangladesh.
Molecular Analysis of Glucose-6-Phosphate Dehydrogenase Gene Mutations in Bangladeshi Individuals

PubMed Central

Sarker, Suprovath Kumar; Hossain, Mohammad Amir; Qadri, Syeda Kashfi; Muraduzzaman, A. K. M.; Bhuyan, Golam Sarower; Shahidullah, Mohammod; Mannan, Mohammad Abdul; Tahura, Sarabon; Hussain, Manzoor; Akhter, Shahida; Nahar, Nazmun; Shirin, Tahmina; Qadri, Firdausi; Mannoor, Kaiissar

2016-01-01

Glucose-6-phosphate dehydrogenase (G6PD) deficiency is a common X-linked human enzyme defect of red blood cells (RBCs). Individuals with this gene defect appear normal until exposed to oxidative stress which induces hemolysis. Consumption of certain foods such as fava beans, legumes; infection with bacteria or virus; and use of certain drugs such as primaquine, sulfa drugs etc. may result in lysis of RBCs in G6PD deficient individuals. The genetic defect that causes G6PD deficiency has been identified mostly as single base missense mutations. One hundred and sixty G6PD gene mutations, which lead to amino acid substitutions, have been described worldwide. The purpose of this study was to detect G6PD gene mutations in hospital-based settings in the local population of Dhaka city, Bangladesh. Qualitative fluorescent spot test and quantitative enzyme activity measurement using RANDOX G6PDH kit were performed for analysis of blood specimens and detection of G6PD-deficient participants. For G6PD-deficient samples, PCR was done with six sets of primers specific for G6PD gene. Automated Sanger sequencing of the PCR products was performed to identify the mutations in the gene. Based on fluorescence spot test and quantitative enzyme assay followed by G6PD gene sequencing, 12 specimens (11 males and one female) among 121 clinically suspected patient-specimens were found to be deficient, suggesting a frequency of 9.9% G6PD deficiency. Sequencing of the G6PD-deficient samples revealed c.C131G substitution (exon-3: Ala44Gly) in six samples, c.G487A substitution (exon-6:Gly163Ser) in five samples and c.G949A substitution (exon-9: Glu317Lys) of coding sequence in one sample. These mutations either affect NADP binding or disrupt protein structure. From the study it appears that Ala44Gly and Gly163Ser are the most common G6PD mutations in Dhaka, Bangladesh. This is the first study of G6PD mutations in Bangladesh. PMID:27880809
Concise, stereodivergent and highly stereoselective synthesis of cis- and trans-2-substituted 3-hydroxypiperidines – development of a phosphite-driven cyclodehydration

PubMed Central

Westphal, Julia C

2014-01-01

Summary A concise (5 to 6 steps), stereodivergent, highly diastereoselective (dr up to >19:1 for both stereoisomers) and scalable synthesis (up to 14 g) of cis- and trans-2-substituted 3-piperidinols, a core motif in numerous bioactive compounds, is presented. This sequence allowed an efficient synthesis of the NK-1 inhibitor L-733,060 in 8 steps. Additionally, a cyclodehydration-realizing simple triethylphosphite as a substitute for triphenylphosphine is developed. Here the stoichiometric oxidized P(V)-byproduct (triethylphosphate) is easily removed during the work up through saponification overcoming separation difficulties usually associated to triphenylphosphine oxide. PMID:24605158
Domino reactions initiated by intramolecular hydride transfers from tri(di)arylmethane fragments to ketenimine and carbodiimide functions.

PubMed

Alajarin, Mateo; Bonillo, Baltasar; Ortin, Maria-Mar; Sanchez-Andrada, Pilar; Vidal, Angel; Orenes, Raul-Angel

2010-10-21

The ability of triarylmethane and diarylmethane fragments to behave as hydride donors participating in thermal [1,5]-H shift/6π-ERC tandem processes involving ketenimine and carbodiimide functions is disclosed. C-Alkyl-C-phenyl ketenimines N-substituted by a triarylmethane substructure convert into a variety of 3,3,4,4-tetrasubstituted-3,4-dihydroquinolines, as structurally related carbodiimides transform into 3,4,4-trisubstituted-3,4-dihydroquinazolines via transient ortho-azaxylylenes. The first step of these one-pot conversions, the [1,5]-H shift, is considered to be a hydride migration on the basis of the known hydricity of the tri(di)arylmethane fragment and the electrophilicity of the central heterocumulenic carbon atom, whereas the final electrocyclization involves the formation of a sterically congested C-C or C-N bond. In the cases of C,C-diphenyl substituted triarylmethane-ketenimines the usual 6π-ERC becomes prohibited by the presence of two phenyl rings at each end of the azatrienic system. This situation opens new reaction channels: (a) following the initial hydride shift, the tandem sequence continues with an alternative electrocyclization mode to give 9,10-dihydroacridines, (b) the full sequence is initiated by a rare 1,5 migration of an electron-rich aryl group, followed by a 6π-ERC which leads to 2-aryl-3,4-dihydroquinolines, or (c) a different [1,5]-H shift/6π-ERC sequence involving the initial migration of a hydrogen atom from a methyl group at the ortho position to the nitrogen atom of the ketenimine function. Diarylmethane-ketenimines bearing a methyl group at the benzylic carbon atom experience a tandem double [1,5]-H shift, the first one being the usual benzylic hydride transfer whereas the second one involves the methyl group at the initial benzylic carbon atom, the reaction products being 2-aminostyrenes. Diarylmethane-ketenimines lacking such a methyl group convert into 3,4-dihydroquinolines by the habitual tandem [1,5]-H shift/6π-ERC processes.
Three novel ascomycetous yeast species of the Kazachstania clade, Kazachstania saulgeensis sp. nov., Kazachstaniaserrabonitensis sp. nov. and Kazachstania australis sp. nov. Reassignment of Candida humilis to Kazachstania humilis f.a. comb. nov. and Candida pseudohumilis to Kazachstania pseudohumilis f.a. comb. nov.

PubMed

Jacques, Noémie; Sarilar, Véronique; Urien, Charlotte; Lopes, Mariana R; Morais, Camila G; Uetanabaro, Ana Paula T; Tinsley, Colin R; Rosa, Carlos A; Sicard, Delphine; Casaregola, Serge

2016-12-01

Five ascosporogenous yeast strains related to the genus Kazachstania were isolated. Two strains (CLIB 1764T and CLIB 1780) were isolated from French sourdoughs; three others (UFMG-CM-Y273T, UFMG-CM-Y451 and UFMG-CM-Y452) were from rotting wood in Brazil. The sequences of the French and Brazilian strains differed by one and three substitutions, respectively, in the D1/D2 large subunit (LSU) rRNA gene and the internal transcribed spacer (ITS). The D1/D2 LSU rRNA sequence of these strains differed by 0.5 and 0.7 % from Kazachstania exigua, but their ITS sequences diverged by 8.1 and 8.3 %, respectively, from that of the closest described species Kazachstania barnettii. Analysis of protein coding sequences of RPB1, RPB2 and EF-1α distinguished the French from the Brazilian strains, with respectively 3.3, 6 and 11.7 % substitutions. Two novel species are described to accommodate these newly isolated strains: Kazachstania saulgeensis sp. nov. (type strain CLIB 1764T=CBS 14374T) and Kazachstania serrabonitensis sp. nov. (type strain UFMG-CM-Y273T=CLIB 1783T=CBS 14236T). Further analysis of culture collections revealed a strain previously assigned to the K. exigua species, but having 3.8 % difference (22 substitutions and 2 indels) in its ITS with respect to K. exigua. Hence, we describe a new taxon, Kazachstania australis sp. nov. (type strain CLIB 162T=CBS 2141T), to accommodate this strain. Finally, Candida humilis and Candida pseudohumilis are reassigned to the genus Kazachstania as new combinations. On the basis of sequence analysis, we also propose that Candida milleri and Kazachstania humilis comb. nov. are conspecific.
Diversity of interferon inducible Mx gene in horses and association of variations with susceptibility vis-à-vis resistance against equine influenza infection.

PubMed

Manuja, Balvinder K; Manuja, Anju; Dahiya, Rajni; Singh, Sandeep; Sharma, R C; Gahlot, S K

2014-10-01

Equine influenza (EI) is primarily an infection of the upper respiratory tract and is one of the major infectious respiratory diseases of economic importance in equines. Re-emergence of the disease, species jumping by H3N8 virus in canines and possible threat of human pandemic due to the unpredictable nature of the virus have necessitated research on devising strategies for preventing the disease. The myxovirus resistance protein (Mx) has been reported to confer resistance to Orthomyxo virus infection by modifying cellular functions needed along the viral replication pathway. Polymorphisms and differential antiviral activities of Mx gene have been reported in pigs and chicken. Here we report the diversity of Mx gene, its expression in response to stimulation with interferon (IFN) α/β and their association with EI resistance and susceptibility in Marwari horses. Blood samples were collected from horses declared positive for equine influenza and in contact animals with a history of no clinical signs. Mx gene was amplified by reverse transcription from total RNA isolated from peripheral blood mononuclear cells (PBMCs) stimulated with IFN α/β using gene specific primers. The amplified gene products from representative samples were cloned and sequenced. Nucleotide sequences and deduced amino acid sequences were analyzed. Out of a total 24 amino acids substitutions sorting intolerant from tolerant (SIFT) analysis predicted 13 substitutions with functional consequences. Five substitutions (V67A, W123L, E346Y, N347Y, S689N) were observed only in resistant animals. Evolutionary distances based on nucleotide sequences with in equines ranged between 0.3-2.0% and 20-24% with other species. On phylogenetic analysis all equine sequences clustered together while other species formed separate clades. Copyright © 2014 Elsevier B.V. All rights reserved.
Comparative mitogenomic analysis of mirid bugs (Hemiptera: Miridae) and evaluation of potential DNA barcoding markers.

PubMed

Wang, Juan; Zhang, Li; Zhang, Qi-Lin; Zhou, Min-Qiang; Wang, Xiao-Tong; Yang, Xing-Zhuo; Yuan, Ming-Long

2017-01-01

The family Miridae is one of the most species-rich families of insects. To better understand the diversity and evolution of mirids, we determined the mitogenome of Lygus pratenszs and re-sequenced the mitogenomes of four mirids (i.e., Apolygus lucorum , Adelphocoris suturalis , Ade. fasciaticollis and Ade. lineolatus ). We performed a comparative analysis for 15 mitogenomic sequences representing 11 species of five genera within Miridae and evaluated the potential of these mitochondrial genes as molecular markers. Our results showed that the general mitogenomic features (gene content, gene arrangement, base composition and codon usage) were well conserved among these mirids. Four protein-coding genes (PCGs) ( cox1 , cox3 , nad1 and nad3 ) had no length variability, where nad5 showed the largest size variation; no intraspecific length variation was found in PCGs. Two PCGs ( nad4 and nad5 ) showed relatively high substitution rates at the nucleotide and amino acid levels, where cox1 had the lowest substitution rate. The Ka/Ks values for all PCGs were far lower than 1 (<0.59), but the Ka/Ks values of cox1 -barcode sequences were always larger than 1 (1.34 -15.20), indicating that the 658 bp sequences of cox1 may be not the appropriate marker due to positive selection or selection relaxation. Phylogenetic analyses based on two concatenated mitogenomic datasets consistently supported the relationship of Nesidiocoris + ( Trigonotylus + ( Adelphocoris + ( Apolygus + Lygus ))), as revealed by nad4 , nad5 , rrnL and the combined 22 transfer RNA genes (tRNAs), respectively. Taken sequence length, substitution rate and phylogenetic signal together, the individual genes ( nad4 , nad5 and rrnL ) and the combined 22 tRNAs could been used as potential molecular markers for Miridae at various taxonomic levels. Our results suggest that it is essential to evaluate and select suitable markers for different taxa groups when performing phylogenetic, population genetic and species identification studies.
Cleavage site and Ectodomain of HA2 sub-unit sequence of three equine influenza virus isolated in Morocco

PubMed Central

2014-01-01

Background The equine influenza (EI) is an infectious and contagious disease of the upper respiratory tract of horses. Two outbreaks were notified in Morocco during 1997 and 2004 respectively in Nador and Essaouira. The aims of the present study concern the amino acids sequences comparison with reference strain A/equine/Miami/1963(H3N8) of the HA2 subunit including the cleavage site of three equine influenza viruses (H3N8) isolated in Morocco: A/equine/Nador/1/1997(H3N8), A/equine/Essaouira/2/2004 (H3N8) and A/equine/Essaouira/3/2004 (H3N8). Results The obtained results demonstrated that the substitutions were located at Ectodomain (ED) and transmembrane domain (TD), and they have only one arginine in cleavage site (HA1-PEKQI-R329-GI-HA2). In the Ectodomain, the mutation N/154 2 /T deleted the NGT glycosylation site at position 154 for both strains A/equine/Essaouira/2/2004(H3N8) and A/equine/Essaouira/3/2004(H3N8). Except for mutation D/1602/Y of the A/equine/Nador/1/1997(H3N8) strain, the other mutations were involved in non conserved sites. While the transmembrane domain (TM) of the strain A/equine/Essaouira/3/2004(H3N8) exhibits a substitution at residue C/199 2 /F. For the A/equine/Nador/1/1997(H3N8) strain the HA2 shows a mutation at residue M/207 2 /L. Three Moroccan strains reveals a common substitution at the residue E/211 2 /Q located between transmembrane domain TM and the cytoplasmic domain (CD). Conclusion The given nature virulence of three Moroccan strains, the identified and reported mutations certainly played a permissive role of infection viral process. PMID:25016480
The surface glycoprotein of feline leukemia virus isolate FeLV-945 is a determinant of altered pathogenesis in the presence or absence of the unique viral long terminal repeat.

PubMed

Bolin, Lisa L; Ahmad, Shamim; Lobelle-Rich, Patricia A; Ooms, Tara G; Alvarez-Hernandez, Xavier; Didier, Peter J; Levy, Laura S

2013-10-01

Feline leukemia virus (FeLV) is a naturally transmitted gammaretrovirus that infects domestic cats. FeLV-945, the predominant isolate associated with non-T-cell disease in a natural cohort, is a member of FeLV subgroup A but differs in sequence from the FeLV-A prototype, FeLV-A/61E, in the surface glycoprotein (SU) and long terminal repeat (LTR). Substitution of the FeLV-945 LTR into FeLV-A/61E resulted in pathogenesis indistinguishable from that of FeLV-A/61E, namely, thymic lymphoma of T-cell origin. In contrast, substitution of both FeLV-945 LTR and SU into FeLV-A/61E resulted in multicentric lymphoma of non-T-cell origin. These results implicated the FeLV-945 SU as a determinant of pathogenic spectrum. The present study was undertaken to test the hypothesis that FeLV-945 SU can act in the absence of other unique sequence elements of FeLV-945 to determine the disease spectrum. Substitution of FeLV-A/61E SU with that of FeLV-945 altered the clinical presentation and resulted in tumors that demonstrated expression of CD45R in the presence or absence of CD3. Despite the evident expression of CD45R, a typical B-cell marker, T-cell receptor beta (TCRβ) gene rearrangement indicated a T-cell origin. Tumor cells were detectable in bone marrow and blood at earlier times during the disease process, and the predominant SU genes from proviruses integrated in tumor DNA carried markers of genetic recombination. The findings demonstrate that FeLV-945 SU alters pathogenesis, although incompletely, in the absence of FeLV-945 LTR. Evidence demonstrates that FeLV-945 SU and LTR are required together to fully recapitulate the distinctive non-T-cell disease outcome seen in the natural cohort.
Rapid Multistep Synthesis of 1,2,4-Oxadiazoles in a Single Continuous Microreactor Sequence

PubMed Central

Grant, Daniel; Dahl, Russell; Cosford, Nicholas D. P.

2009-01-01

A general method for the synthesis of bis-substituted 1,2,4-oxadiazoles from readily available arylnitriles and activated carbonyls in a single continuous microreactor sequence is described. The synthesis incorporates three sequential microreactors to produce 1,2,4-oxadiazoles in ~30 min in quantities (40–80 mg) sufficient for full characterization and rapid library supply. PMID:18687005
Development and evaluation of 200 novel SNP assays for population genetic studies of westslope cutthroat trout and genetic identification of related taxa

Treesearch

N. R. Campbell; S. J. Amish; V. L. Prichard; K. M. McKelvey; M. K. Young; M. K. Schwartz; J. C. Garza; G. Luikart; S. R. Narum

2012-01-01

DNA sequence data were collected and screened for single nucleotide polymorphisms (SNPs) in westslope cutthroat trout (Oncorhynchus clarki lewisi) and also for substitutions that could be used to genetically discriminate rainbow trout (O. mykiss) and cutthroat trout, as well as several cutthroat trout subspecies. In total, 260 expressed sequence tag-derived loci were...
Sequence polymorphism in an insect RNA virus field population: A snapshot from a single point in space and time reveals stochastic differences among and within individual hosts

DOE Office of Scientific and Technical Information (OSTI.GOV)

Stenger, Drake C., E-mail: drake.stenger@ars.usda.

Population structure of Homalodisca coagulata Virus-1 (HoCV-1) among and within field-collected insects sampled from a single point in space and time was examined. Polymorphism in complete consensus sequences among single-insect isolates was dominated by synonymous substitutions. The mutant spectrum of the C2 helicase region within each single-insect isolate was unique and dominated by nonsynonymous singletons. Bootstrapping was used to correct the within-isolate nonsynonymous:synonymous arithmetic ratio (N:S) for RT-PCR error, yielding an N:S value ~one log-unit greater than that of consensus sequences. Probability of all possible single-base substitutions for the C2 region predicted N:S values within 95% confidence limits of themore » corrected within-isolate N:S when the only constraint imposed was viral polymerase error bias for transitions over transversions. These results indicate that bottlenecks coupled with strong negative/purifying selection drive consensus sequences toward neutral sequence space, and that most polymorphism within single-insect isolates is composed of newly-minted mutations sampled prior to selection. -- Highlights: •Sampling protocol minimized differential selection/history among isolates. •Polymorphism among consensus sequences dominated by negative/purifying selection. •Within-isolate N:S ratio corrected for RT-PCR error by bootstrapping. •Within-isolate mutant spectrum dominated by new mutations yet to undergo selection.« less

Characterization of a native hammerhead ribozyme derived from schistosomes

PubMed Central

OSBORNE, EDITH M.; SCHAAK, JANELL E.; DEROSE, VICTORIA J.

2005-01-01

A recent re-examination of the role of the helices surrounding the conserved core of the hammerhead ribozyme has identified putative loop–loop interactions between stems I and II in native hammerhead sequences. These extended hammerhead sequences are more active at low concentrations of divalent cations than are minimal hammerheads. The loop–loop interactions are proposed to stabilize a more active conformation of the conserved core. Here, a kinetic and thermodynamic characterization of an extended hammerhead sequence derived from Schistosoma mansoni is performed. Biphasic kinetics are observed, suggesting the presence of at least two conformers, one cleaving with a fast rate and the other with a slow rate. Replacing loop II with a poly(U) sequence designed to eliminate the interaction between the two loops results in greatly diminished activity, suggesting that the loop–loop interactions do aid in forming a more active conformation. Previous studies with minimal hammerheads have shown deleterious effects of Rp-phosphorothioate substitutions at the cleavage site and 5′ to A9, both of which could be rescued with Cd2+. Here, phosphorothioate modifications at the cleavage site and 5′ to A9 were made in the schistosome-derived sequence. In Mg2+, both phosphorothioate substitutions decreased the overall fraction cleaved without significantly affecting the observed rate of cleavage. The addition of Cd2+ rescued cleavage in both cases, suggesting that these are still putative metal binding sites in this native sequence. PMID:15659358
The complete mitochondrial genome of dhole Cuon alpinus: phylogenetic analysis and dating evolutionary divergence within Canidae.

PubMed

Zhang, Honghai; Chen, Lei

2011-03-01

The dhole (Cuon alpinus) is the only existent species in the genus Cuon (Carnivora: Canidae). In the present study, the complete mitochondrial genome of the dhole was sequenced. The total length is 16672 base pairs which is the shortest in Canidae. Sequence analysis revealed that most mitochondrial genomic functional regions were highly consistent among canid animals except the CSB domain of the control region. The difference in length among the Canidae mitochondrial genome sequences is mainly due to the number of short segments of tandem repeated in the CSB domain. Phylogenetic analysis was progressed based on the concatenated data set of 14 mitochondrial genes of 8 canid animals by using maximum parsimony (MP), maximum likelihood (ML) and Bayesian (BI) inference methods. The genera Vulpes and Nyctereutes formed a sister group and split first within Canidae, followed by that in the Cuon. The divergence in the genus Canis was the latest. The divarication of domestic dogs after that of the Canis lupus laniger is completely supported by all the three topologies. Pairwise sequence divergence data of different mitochondrial genes among canid animals were also determined. Except for the synonymous substitutions in protein-coding genes, the control region exhibits the highest sequence divergences. The synonymous rates are approximately two to six times higher than those of the non-synonymous sites except for a slightly higher rate in the non-synonymous substitution between Cuon alpinus and Vulpes vulpes. 16S rRNA genes have a slightly faster sequence divergence than 12S rRNA and tRNA genes. Based on nucleotide substitutions of tRNA genes and rRNA genes, the times since divergence between dhole and other canid animals, and between domestic dogs and three subspecies of wolves were evaluated. The result indicates that Vulpes and Nyctereutes have a close phylogenetic relationship and the divergence of Nyctereutes is a little earlier. The Tibetan wolf may be an archaic pedigree within wolf subspecies. The genetic distance between wolves and domestic dogs is less than that among different subspecies of wolves. The domestication of dogs was about 1.56-1.92 million years ago or even earlier.
Detecting Coevolution in and among Protein Domains

PubMed Central

Yeang, Chen-Hsiang; Haussler, David

2007-01-01

Correlated changes of nucleic or amino acids have provided strong information about the structures and interactions of molecules. Despite the rich literature in coevolutionary sequence analysis, previous methods often have to trade off between generality, simplicity, phylogenetic information, and specific knowledge about interactions. Furthermore, despite the evidence of coevolution in selected protein families, a comprehensive screening of coevolution among all protein domains is still lacking. We propose an augmented continuous-time Markov process model for sequence coevolution. The model can handle different types of interactions, incorporate phylogenetic information and sequence substitution, has only one extra free parameter, and requires no knowledge about interaction rules. We employ this model to large-scale screenings on the entire protein domain database (Pfam). Strikingly, with 0.1 trillion tests executed, the majority of the inferred coevolving protein domains are functionally related, and the coevolving amino acid residues are spatially coupled. Moreover, many of the coevolving positions are located at functionally important sites of proteins/protein complexes, such as the subunit linkers of superoxide dismutase, the tRNA binding sites of ribosomes, the DNA binding region of RNA polymerase, and the active and ligand binding sites of various enzymes. The results suggest sequence coevolution manifests structural and functional constraints of proteins. The intricate relations between sequence coevolution and various selective constraints are worth pursuing at a deeper level. PMID:17983264
Palladium-Copper Catalyzed Alkyne Activation as an Entry to Multicomponent Syntheses of Heterocycles

NASA Astrophysics Data System (ADS)

Müller, Thomas J. J.

Alkynones and chalcones are of paramount importance in heterocyclic chemistry as three-carbon building blocks. In a very efficient manner, they can be easily generated by palladium-copper catalyzed reactions: ynones are formed from acid chlorides and terminal alkynes, and chalcones are synthesized in the sense of a coupling-isomerization (CI) sequence from (hetero)aryl halides and propargyl alcohols. Mild reaction conditions now open entries to sequential and consecutive transformations to heterocycles, such as furans, 3-halo furans, pyrroles, pyrazoles, substituted and annelated pyridines, annelated thiopyranones, pyridimines, meridianins, benzoheteroazepines and tetrahydro-β-carbolines, by consecutive coupling-cyclocondensation or CI-cyclocondensation sequences, as new diversity oriented routes to heterocycles. Domino reactions based upon the coupling-isomerization reaction (CIR) have been probed in the synthesis of antiparasital 2-substituted quinoline derivatives and highly luminescent spiro-benzofuranones and spiro-indolones.
X-Linked Glomerulopathy Due to COL4A5 Founder Variant.

PubMed

Barua, Moumita; John, Rohan; Stella, Lorenzo; Li, Weili; Roslin, Nicole M; Sharif, Bedra; Hack, Saidah; Lajoie-Starkell, Ginette; Schwaderer, Andrew L; Becknell, Brian; Wuttke, Matthias; Köttgen, Anna; Cattran, Daniel; Paterson, Andrew D; Pei, York

2018-03-01

Alport syndrome is a rare hereditary disorder caused by rare variants in 1 of 3 genes encoding for type IV collagen. Rare variants in COL4A5 on chromosome Xq22 cause X-linked Alport syndrome, which accounts for ∼80% of the cases. Alport syndrome has a variable clinical presentation, including progressive kidney failure, hearing loss, and ocular defects. Exome sequencing performed in 2 affected related males with an undefined X-linked glomerulopathy characterized by global and segmental glomerulosclerosis, mesangial hypercellularity, and vague basement membrane immune complex deposition revealed a COL4A5 sequence variant, a substitution of a thymine by a guanine at nucleotide 665 (c.T665G; rs281874761) of the coding DNA predicted to lead to a cysteine to phenylalanine substitution at amino acid 222, which was not seen in databases cataloguing natural human genetic variation, including dbSNP138, 1000 Genomes Project release version 01-11-2004, Exome Sequencing Project 21-06-2014, or ExAC 01-11-2014. Review of the literature identified 2 additional families with the same COL4A5 variant leading to similar atypical histopathologic features, suggesting a unique pathologic mechanism initiated by this specific rare variant. Homology modeling suggests that the substitution alters the structural and dynamic properties of the type IV collagen trimer. Genetic analysis comparing members of the 3 families indicated a distant relationship with a shared haplotype, implying a founder effect. Crown Copyright © 2017. Published by Elsevier Inc. All rights reserved.
Primary structure of rat cardiac beta-adrenergic and muscarinic cholinergic receptors obtained by automated DNA sequence analysis: further evidence for a multigene family.

PubMed Central

Gocayne, J; Robinson, D A; FitzGerald, M G; Chung, F Z; Kerlavage, A R; Lentes, K U; Lai, J; Wang, C D; Fraser, C M; Venter, J C

1987-01-01

Two cDNA clones, lambda RHM-MF and lambda RHB-DAR, encoding the muscarinic cholinergic receptor and the beta-adrenergic receptor, respectively, have been isolated from a rat heart cDNA library. The cDNA clones were characterized by restriction mapping and automated DNA sequence analysis utilizing fluorescent dye primers. The rat heart muscarinic receptor consists of 466 amino acids and has a calculated molecular weight of 51,543. The rat heart beta-adrenergic receptor consists of 418 amino acids and has a calculated molecular weight of 46,890. The two cardiac receptors have substantial amino acid homology (27.2% identity, 50.6% with favored substitutions). The rat cardiac beta receptor has 88.0% homology (92.5% with favored substitutions) with the human brain beta receptor and the rat cardiac muscarinic receptor has 94.6% homology (97.6% with favored substitutions) with the porcine cardiac muscarinic receptor. The muscarinic cholinergic and beta-adrenergic receptors appear to be as conserved as hemoglobin and cytochrome c but less conserved than histones and are clearly members of a multigene family. These data support our hypothesis, based upon biochemical and immunological evidence, that suggests considerable structural homology and evolutionary conservation between adrenergic and muscarinic cholinergic receptors. To our knowledge, this is the first report utilizing automated DNA sequence analysis to determine the structure of a gene. Images PMID:2825184
Enantioselective functionalization of allylic C-H bonds following a strategy of functionalization and diversification.

PubMed

Sharma, Ankit; Hartwig, John F

2013-11-27

We report the enantioselective functionalization of allylic C-H bonds in terminal alkenes by a strategy involving the installation of a temporary functional group at the terminal carbon atom by C-H bond functionalization, followed by the catalytic diversification of this intermediate with a broad scope of reagents. The method consists of a one-pot sequence of palladium-catalyzed allylic C-H bond oxidation under neutral conditions to form linear allyl benzoates, followed by iridium-catalyzed allylic substitution. This overall transformation forms a variety of chiral products containing a new C-N, C-O, C-S, or C-C bond at the allylic position in good yield with a high branched-to-linear selectivity and excellent enantioselectivity (ee ≤97%). The broad scope of the overall process results from separating the oxidation and functionalization steps; by doing so, the scope of nucleophile encompasses those sensitive to direct oxidative functionalization. The high enantioselectivity of the overall process is achieved by developing an allylic oxidation that occurs without acid to form the linear isomer with high selectivity. These allylic functionalization processes are amenable to an iterative sequence leading to (1,n)-functionalized products with catalyst-controlled diastereo- and enantioselectivity. The utility of the method in the synthesis of biologically active molecules has been demonstrated.
[Study of the functional role of mutation in the guinea pig-adapted Ebola virus genome on a Drosophila melanogaster model].

PubMed

Shelemba-Chepurnova, A A; Omel'ianchuk, L V; Chepurnov, A A

2011-01-01

Ebola virus virulence in guinea pigs, which appears through virus adaptation to this animal host, correlates with substitutions in the gene encoding vp24 protein. In particular, the substitution His-->Tyr186 was found when obtaining strain 8 ms. An attempt was made to clarify the functional role of this substitution in a transgenic fruit fly model. Using the drosophila transformation technique provided transgenic strains that contained genomic insertions of wild-type Ebola virus vp24 gene and the mutant gene with the His-->Tyr substitution at the above position. Thus, the drosophila strains carrying the sequences encoding for the vp24 proteins of Ebola virus Zaire and 8 ms in pUAST vector were obtained. This makes it possible to study the expression of transgenic constructs in various D. melanogaster organs and tissues.
Ring[bond]chain tautomerism of 2-Aryl-substituted cis- and trans-decahydroquinazolines.

PubMed

Lázár, László; Göblyös, Anikó; Martinek, Tamás A; Fülöp, Ferenc

2002-07-12

In CDCl(3) at 300 K, 2-aryl-substituted cis- and trans-3-isopropyldecahydroquinazolines and trans-3-phenyldecahydroquinazolines proved to be three-component (r(1)[bond]o[bond]r(2)) ring[bond]chain tautomeric mixtures, whereas only ring-closed tautomers could be detected for the 3-methyl-substituted analogues. The proportions of the ring-chain tautomeric forms at equilibrium were strongly influenced by the N-substitutents and the cis-trans ring junction and could be described by the equation log K(X) = rho sigma(+) + log K(X=H). These are the first examples among 2-aryl-1,3-N,N-heterocycles of a three-component ring-chain tautomeric equilibrium characterized by a Hammett-type equation. The stabilities of the ring-closed forms of cis- and trans-2-aryldecahydroquinazolines and the corresponding 3,1-benzoxazines were found to increase in the following sequence of the heteroatom at position 3: NPh < N-i-Pr < O < NMe.
Strain-Tuning Atomic Substitution in Two-Dimensional Atomic Crystals.

PubMed

Li, Honglai; Liu, Hongjun; Zhou, Linwei; Wu, Xueping; Pan, Yuhao; Ji, Wei; Zheng, Biyuan; Zhang, Qinglin; Zhuang, Xiujuan; Zhu, Xiaoli; Wang, Xiao; Duan, Xiangfeng; Pan, Anlian

2018-05-22

Atomic substitution offers an important route to achieve compositionally engineered two-dimensional nanostructures and their heterostructures. Despite the recent research progress, the fundamental understanding of the reaction mechanism has still remained unclear. Here, we reveal the atomic substitution mechanism of two-dimensional atomic layered materials. We found that the atomic substitution process depends on the varying lattice constant (strain) in monolayer crystals, dominated by two strain-tuning (self-promoted and self-limited) mechanisms using density functional theory calculations. These mechanisms were experimentally confirmed by the controllable realization of a graded substitution ratio in the monolayers by controlling the substitution temperature and time and further theoretically verified by kinetic Monte Carlo simulations. The strain-tuning atomic substitution processes are of general importance to other two-dimensional layered materials, which offers an interesting route for tailoring electronic and optical properties of these materials.
Inter-individual and intragenomic variations in the ITS region of Clonorchis sinensis (Trematoda: Opisthorchiidae) from Russia and Vietnam.

PubMed

Tatonova, Yulia V; Chelomina, Galina N; Nguyen, Hung Manh

2017-11-01

Here we examined the intraspecific genetic variability of Clonorchis sinensis from Russia and Vietnam using nuclear DNA sequences (the 5.8S gene and two internal transcribed spacers of the ribosomal cluster). Despite the low level of variability in the ITS1 region, this marker has revealed some features of C. sinensis across multiple geographic regions. The genetic diversity levels for the Russian and Vietnamese populations were similar (0.1 and 0.09%, respectively) but were significantly lower than the C. sinensis from China (0.31%). About half of the sequences of the Chinese (53%) and Korean (47%) populations and about a tenth of the Vietnamese (12%) and Russian (8%) sequences included a 5bp insertion. No sequences with nucleotide substitutions both upstream and downstream of the 5bp insertion were found within the whole data set. The population of northern China had both sequence variants (with substitutions either upstream or downstream of the insertion), while only one of these variants was presented at the other localities. The Vietnamese population had a higher frequency of intragenomic polymorphism than the Russian population (69% vs. 46% and 23% vs. 3% at the 114bp and 339bp positions, respectively). These data are discussed in connection with parasite origin and adaptation, and also its invasive capacity and drug-resistance. Copyright © 2017 Elsevier B.V. All rights reserved.
Classification of European Mtdnas from an Analysis of Three European Populations

PubMed Central

Torroni, A.; Huoponen, K.; Francalacci, P.; Petrozzi, M.; Morelli, L.; Scozzari, R.; Obinu, D.; Savontaus, M. L.; Wallace, D. C.

1996-01-01

Mitochondrial DNA (mtDNA) sequence variation was examined in Finns, Swedes and Tuscans by PCR amplification and restriction analysis. About 99% of the mtDNAs were subsumed within 10 mtDNA haplogroups (H, I, J, K, M, T, U, V, W, and X) suggesting that the identified haplogroups could encompass virtually all European mtDNAs. Because both hypervariable segments of the mtDNA control region were previously sequenced in the Tuscan samples, the mtDNA haplogroups and control region sequences could be compared. Using a combination of haplogroup-specific restriction site changes and control region nucleotide substitutions, the distribution of the haplogroups was surveyed through the published restriction site polymorphism and control region sequence data of Caucasoids. This supported the conclusion that most haplogroups observed in Europe are Caucasoid-specific, and that at least some of them occur at varying frequencies in different Caucasoid populations. The classification of almost all European mtDNA variation in a number of well defined haplogroups could provide additional insights about the origin and relationships of Caucasoid populations and the process of human colonization of Europe, and is valuable for the definition of the role played by mtDNA backgrounds in the expression of pathological mtDNA mutations PMID:8978068
Array-Based Rational Design of Short Peptide Probe-Derived from an Anti-TNT Monoclonal Antibody.

PubMed

Okochi, Mina; Muto, Masaki; Yanai, Kentaro; Tanaka, Masayoshi; Onodera, Takeshi; Wang, Jin; Ueda, Hiroshi; Toko, Kiyoshi

2017-10-09

Complementarity-determining regions (CDRs) are sites on the variable chains of antibodies responsible for binding to specific antigens. In this study, a short peptide probe for recognition of 2,4,6-trinitrotoluene (TNT), was identified by testing sequences derived from the CDRs of an anti-TNT monoclonal antibody. The major TNT-binding site in this antibody was identified in the heavy chain CDR3 by antigen docking simulation and confirmed by an immunoassay using a spot-synthesis based peptide array comprising amino acid sequences of six CDRs in the variable region. A peptide derived from heavy chain CDR3 (RGYSSFIYWF) bound to TNT with a dissociation constant of 1.3 μM measured by surface plasmon resonance. Substitution of selected amino acids with basic residues increased TNT binding while substitution with acidic amino acids decreased affinity, an isoleucine to arginine change showed the greatest improvement of 1.8-fold. The ability to create simple peptide binders of volatile organic compounds from sequence information provided by the immune system in the creation of an immune response will be beneficial for sensor developments in the future.
Molecular characterization of infectious bursal disease viruses from Pakistan.

PubMed

Shabbir, Muhammad Zubair; Ali, Muhammad; Abbas, Muhammad; Chaudhry, Umer Naveed; Zia-Ur-Rehman; Munir, Muhammad

2016-07-01

Since the first report of infectious bursal disease in Pakistan in 1987, outbreaks have been common even in vaccinated flocks. Despite appropriate administration of vaccines, concerns arise if the circulating strains are different from the ones used in the vaccine. Here, we sequenced the hypervariable region (HVR) of the VP2 gene of circulating strains of infectious bursal disease virus (IBDV) originating from outbreaks (n = 4) in broiler flocks in Pakistan. Nucleotide sequencing followed by phylogeny and deduced amino acid sequence analysis showed the circulating strains to be very virulent (vv) and identified characteristic residues at position 222 (A), 242 (I), 256 (I), 294 (I) and 299 (S). In addition, a substitution at positions 221 (Q→H) was found to be exclusive to Pakistani strains in our analysis, although a larger dataset is required to confirm this finding. Compared to vaccine strains that are commonly used in Pakistan, substitution mutations were found at key amino acid positions in VP2 that may be responsible for potential changes in neutralization epitopes and vaccine failure.
Structural analysis of HLA-B40 epitopes.

PubMed

Kawaguchi, G; Kato, N; Kashiwase, K; Karaki, S; Kohsaka, T; Akaza, T; Kano, K; Takiguchi, M

1993-03-01

Two genes encoding HLA-B60 or HLA-B61 were cloned from Japanese and the exons of their genes were sequenced. One silent mutation was observed at the exon 1 between HLA-B60 (B*40012) and B*40011. Seven nucleotide substitutions were seen at the exon 3 between HLA-B61 (B*4006) and B*4002. Three substitutions at codon 95, CTC in B*4002 to TGG in B*4006, changed Leu in B*4002 to Trp in B*4006, while two substitutions at codon 97, AGC in B*4002 and ACG in B*4006, changed Ser in B*4002 to Thr in B*4006. Since B*4002 shares the epitope of alloantibodies specific for HLA-B61, two HLA-B61 subtypes are discriminated by two amino acid substitutions at residues 95 and 97. B*40012 and B*4006 differ by four amino acid substitutions on the beta sheet and five amino acid substitutions on the alpha 2 helix. Since the residues at the beta sheet seem hardly to affect the binding of alloantibody, it is suspected that the residues on the alpha 2 helix provide epitopes for alloantibodies that discriminate allospecificity between HLA-B60 and HLA-B61.
The topography of mutational processes in breast cancer genomes

DOE PAGES

Morganella, Sandro; Alexandrov, Ludmil B.; Glodzik, Dominik; ...

2016-01-01

Somatic mutations in human cancers show unevenness in genomic distribution that correlate with aspects of genome structure and function. These mutations are, however, generated by multiple mutational processes operating through the cellular lineage between the fertilized egg and the cancer cell, each composed of specific DNA damage and repair components and leaving its own characteristic mutational signature on the genome. Using somatic mutation catalogues from 560 breast cancer whole-genome sequences, here we show that each of 12 base substitution, 2 insertion/deletion (indel) and 6 rearrangement mutational signatures present in breast tissue, exhibit distinct relationships with genomic features relating to transcription,more » DNA replication and chromatin organization. This signature-based approach permits visualization of the genomic distribution of mutational processes associated with APOBEC enzymes, mismatch repair deficiency and homologous recombinational repair deficiency, as well as mutational processes of unknown aetiology. Lastly, it highlights mechanistic insights including a putative replication-dependent mechanism of APOBEC-related mutagenesis.« less
Fat substitutes in processing of sausages using piramutaba waste.

PubMed

de Fátima Henriques Lourenço, Lúcia; Dos Santos Galvão, Giane Célia; da Conceição Amaral Ribeiro, Suezilde; de Fátima Amaral Ribeiro, Carmelita; Park, Kil Jin

2014-07-01

The aim of this study was to evaluate fat substitute in processing of sausages prepared with surimi of waste from piramutaba filleting. The formulation ingredients were mixed with the fat substitutes added according to a fractional planning 2(4-1), where the independent variables, manioc starch (Ms), hydrogenated soy fat (F), texturized soybean protein (Tsp) and carrageenan (Cg) were evaluated on the responses of pH, texture (Tx), raw batter stability (RBS) and water holding capacity (WHC) of the sausage. Fat substitutes were evaluated in 11 formulations and the results showed that the greatest effects on the responses were found to Ms, F and Cg, being eliminated from the formulation Tsp. To find the best formulation for processing piramutaba sausage was made a complete factorial planning of 2(3) to evaluate the concentrations of fat substitutes in an enlarged range. The optimum condition found for fat substitutes in the sausages formulation were carrageenan (0.51%), manioc starch (1.45%) and fat (1.2%).
Human genetics: measuring the raw material of evolution.

PubMed

Armour, John A L

2009-09-15

By direct sequencing of two Y chromosomes inherited from the same paternal ancestor, a landmark study has derived a good direct estimate for the rate of base substitution mutations on the human Y chromosome.
Rapid sequence evolution of street rabies glycoprotein is related to the highly heterogeneous nature of the viral population.

PubMed

Benmansour, A; Brahimi, M; Tuffereau, C; Coulon, P; Lafay, F; Flamand, A

1992-03-01

The sequence of the glycoprotein gene of a street rabies virus was determined directly using fragments of a rabid dog brain after PCR amplification. Compared with that of the prototype strain CVS, this sequence displayed 10% divergence in overall amino acid composition. However only 6% divergence was noted in the ectodomain suggesting that structural constraints are exerted on this portion of the glycoprotein. A human strain isolated on cell culture from the saliva of a patient with clinical rabies had only five amino acid differences with the canine isolate, an indication of their close relatedness. These differences could have originated during transmission from dog to dog, or from dog to man, or during isolation on cell culture; they are nonetheless indicative of a genetic evolution of street rabies virus. This evolution was further evidenced by the selection of cell-adapted variants which displayed new amino acid substitutions in the glycoprotein. One of them concerned antigenic site III where arginine at position 333 was replaced by glutamine. As expected this substitution conferred resistance to a site IIIa monoclonal antibody (MAb), but surprisingly did not abolish neurovirulence for adult mice. However, a decrease in the neurovirulence of the cell-adapted variant in the presence of a site IIIa specific MAb was noted, suggesting that neurovirulence was due to a subpopulation neutralizable by the MAb. Simultaneous presence of both the parental and variant sequences was indeed evidenced in the brain of a mouse inoculated with the cell-adapted variant; during multiplication in the mouse brain, the frequency of the parental sequence rose from less than 10% to nearly 50%, indicating the selective advantage conferred by arginine 333 in nervous tissue. Altogether these results were suggestive of an intrinsic heterogeneity of street rabies virus. This heterogeneity was further demonstrated by the sequencing of molecular clones of the glycoprotein gene, which revealed that only one-third of the viral genomes present in the brain of a rabid dog had the consensus sequence. Two-thirds of the clones analyzed displayed from one to three amino acid substitutions. Such heterogeneous populations have been referred to as quasispecies, a concept which implies heterogeneous populations kept together in a dynamic equilibrium. This equilibrium could be rapidly displaced, giving the virus the capacity to adapt easily to new environmental conditions.
Candida kantuleensis sp. nov., a d-xylose-fermenting yeast species isolated from peat in a tropical peat swamp forest.

PubMed

Nitiyon, Sukanya; Khunnamwong, Pannida; Lertwattanasakul, Noppon; Limtong, Savitree

2018-05-24

Three strains (DMKU-XE11 T , DMKU-XE15 and DMKU-XE20) representing a single novel anamorphic and d-xylose-fermenting yeast species were obtained from three peat samples collected from Khan Thulee peat swamp forest in Surat Thani province, Thailand. The strains differed from each other by one to two nucleotide substitutions in the sequences of the D1/D2 region of the large subunit (LSU) rRNA gene and zero to one nucleotide substitution in the internal transcribed spacer (ITS) region. Phylogenetic analysis based on the combined sequences of the ITS and the D1/D2 regions showed that the three strains represented a single Candida species that was distinct from the other related species in the Lodderomyces/Candida albicans clade. The three strains form a subclade with the other Candida species including Candida sanyaensis, Candida tropicalis and Candida sojae. C. sanyaensis was the most closely related species, with 2.1-2.4 % nucleotide substitutions in the D1/D2 region of the LSU rRNA gene, and 3.8-4.0 % nucleotide substitutions in the ITS region. The three strains (DMKU-XE11 T , DMKU-XE15 and DMKU-XE20) were assigned as a single novel species, which was named Candida kantuleensis sp. nov. The type strain is DMKU-XE11 T (=CBS 15219 T =TBRC 7764 T ). The MycoBank number for C. kantuleensis sp. nov. is MB 824179.

A novel amino acid substitution Trp574Arg in acetolactate synthase (ALS) confers broad resistance to ALS-inhibiting herbicides in crabgrass (Digitaria sanguinalis).

PubMed

Li, Jian; Li, Mei; Gao, Xingxiang; Fang, Feng

2017-12-01

Crabgrass (Digitaria sanguinalis) is an annual monocotyledonous weed. In recent years, field applications of nicosulfuron have been ineffective in controlling crabgrass populations in Shandong Province, China. To investigate the mechanisms of resistance to nicosulfuron in crabgrass populations, the acetolactate synthase (ALS) gene fragment covering known resistance-confering mutation sites was amplified and sequenced. Dose-response experiments suggested that the resistant population SD13 (R) was highly resistant to nicosulfuron (resistance index R/S = 43.7) compared with the sensitive population SD22 (S). ALS gene sequencing revealed a Trp574Arg substitution in the SD13 population, and no other known resistance-conferring mutations were found. In vitro ALS enzyme assays further confirmed that the SD13 population was resistant to all tested ALS-inhibiting herbicides. The resistance pattern experiments revealed that, compared with SD22, the SD13 population exhibited broad-spectrum resistance to nicosulfuron (43.7-fold), imazethapyr (11.4-fold) and flumetsulam (16.1-fold); however, it did not develop resistance to atrazine, mesotrione and topramezone. This study demonstrated that Trp574Arg substitution was the main reason for crabgrass resistance to ALS-inhibiting herbicides. To our knowledge, this is the first report of Trp574Arg substitution in a weed species, and is the first report of target-site mechanisms of herbicide resistance for crabgrass. © 2017 Society of Chemical Industry. © 2017 Society of Chemical Industry.
Constraining the timing of the Great Oxidation Event within the Rubisco phylogenetic tree.

PubMed

Kacar, B; Hanson-Smith, V; Adam, Z R; Boekelheide, N

2017-09-01

Ribulose 1,5-bisphosphate (RuBP) carboxylase/oxygenase (RuBisCO, or Rubisco) catalyzes a key reaction by which inorganic carbon is converted into organic carbon in the metabolism of many aerobic and anaerobic organisms. Across the broader Rubisco protein family, homologs exhibit diverse biochemical characteristics and metabolic functions, but the evolutionary origins of this diversity are unclear. Evidence of the timing of Rubisco family emergence and diversification of its different forms has been obscured by a meager paleontological record of early Earth biota, their subcellular physiology and metabolic components. Here, we use computational models to reconstruct a Rubisco family phylogenetic tree, ancestral amino acid sequences at branching points on the tree, and protein structures for several key ancestors. Analysis of historic substitutions with respect to their structural locations shows that there were distinct periods of amino acid substitution enrichment above background levels near and within its oxygen-sensitive active site and subunit interfaces over the divergence between Form III (associated with anoxia) and Form I (associated with oxia) groups in its evolutionary history. One possible interpretation is that these periods of substitutional enrichment are coincident with oxidative stress exerted by the rise of oxygenic photosynthesis in the Precambrian era. Our interpretation implies that the periods of Rubisco substitutional enrichment inferred near the transition from anaerobic Form III to aerobic Form I ancestral sequences predate the acquisition of Rubisco by fully derived cyanobacterial (i.e., dual photosystem-bearing, oxygen-evolving) clades. The partitioning of extant lineages at high clade levels within our Rubisco phylogeny indicates that horizontal transfer of Rubisco is a relatively infrequent event. Therefore, it is possible that the mutational enrichment periods between the Form III and Form I common ancestral sequences correspond to the adaptation of key oxygen-sensitive components of Rubisco prior to, or coincident with, the Great Oxidation Event. © 2017 The Authors. Geobiology Published by John Wiley & Sons Ltd.
[Molecular evolution of the tick-borne encephalitis and Powassan viruses].

PubMed

Subbotina, E L; Loktev, V B

2012-01-01

The problem of emerging viruses, their genetic diversity and viral evolution in nature are attracting more attention. The phylogenetic analysis and evaluationary rate estimation were made for pathogenic flaviviruses such as tick-borne encephalitis virus (TBEV) and Powassan (PV) circulated in natural foci in Russia. 47 nucleotide sequences of encoded protein E of the TBEV and 17 sequences of NS5 genome region of the PV have been used. It was found that the rate of accumulation of nucleotide substitutions for E genome region of TBEV was approximately 1.4 x 10(-4) and 5.4 x 10(-5) substitutions per site per year for NS5 genome region of PV. The ratio of non-synonymous nucleotide substitutions to synonymous substitution (dN/dS) for viral sequences were estimated of 0.049 for TBEV and 0.098 for PV. Maximum value dN/dS was 0.201-0.220 for sub-cluster of Russian and Canadian strains of PV and the minimum - 0.024 for cluster of Russian and Chinese strains of Far Eastern genotype TBEV. Evaluation of time intervals of evolutionary events associated with these viruses showed that European subtype TBEV are diverged from all-TBEV ancestor within approximately 2750 years and the Siberian and Far Eastern subtypes are emerged about 2250 years ago. The PV was introduced into natural foci of the Primorsky Krai of Russia only about 70 years ago and PV is a very close to Canadian strains of PV. Evolutionary picture for PV in North America is similar to evolution of Siberian and Far Eastern subtypes TBEV in Asia. The divergence time for main genetic groups of TBEV and PV are correlated with historical periods of warming and cooling. These allow to propose a hypothesis that climate changes were essential to the evolution of the flaviviruses in the past millenniums.
Predominance of influenza A(H3N2) viruses during the 2016/2017 season in Bulgaria.

PubMed

Korsun, Neli; Angelova, Svetla; Trifonova, Ivelina; Tzotcheva, Iren; Mileva, Sirma; Voleva, Silvia; Georgieva, Irina; Perenovska, Penka

2018-02-01

Influenza viruses are characterised by high variability, which makes them able to cause annual epidemics. The aim of this study is to determine the antigenic and genetic characteristics of influenza viruses circulating in Bulgaria during the 2016/2017 season. The detection and typing/subtyping of influenza viruses were performed using real time RT-PCR. Results of antigenic characterisation, phylogenetic and amino acid sequence analyses of representative influenza strains are presented herein. The 2016/2017 season was characterised by an early start, an exclusive dominance of A(H3N2) viruses accounting for 93 % of total influenza virus detections, and a low circulation of A(H1N1)pdm09 (4.2 %) and type B (2.5 %) viruses. The analysed A(H3N2) viruses belonged to subclades 3C.2a (52 %) and 3C.2a1 (48 %); all studied A(H1N1)pdm09 and B/Victoria-lineage viruses belonged to subclades 6B.1 and 1A, respectively. The amino acid sequence analysis of 56 A(H3N2) isolates revealed the presence of substitutions in 18 positions in haemagglutinin (HA) as compared to the A/Hong Kong/4801/2014 vaccine virus, seven of which occurred in four antigenic sites, together with changes in 23 positions in neuraminidase (NA), and a number of substitutions in internal proteins PB2, PB1, PB1-F2, PA, NP and NS1. Despite the many amino acid substitutions, A(H3N2) viruses remained antigenically similar to the vaccine strain. Substitutions in HA and NA sequences of A(H1N1)pdm09 and B/Victoria-lineage strains were also identified, including in antigenic sites. The results of this study confirm the genetic variability of circulating influenza viruses, particularly A(H3N2), and the need for continued antigenic and molecular surveillance.
Evolution of Flavone Synthase I from Parsley Flavanone 3β-Hydroxylase by Site-Directed Mutagenesis1[W][OA

PubMed Central

Gebhardt, Yvonne Helen; Witte, Simone; Steuber, Holger; Matern, Ulrich; Martens, Stefan

2007-01-01

Flavanone 3β-hydroxylase (FHT) and flavone synthase I (FNS I) are 2-oxoglutarate-dependent dioxygenases with 80% sequence identity, which catalyze distinct reactions in flavonoid biosynthesis. However, FNS I has been reported exclusively from a few Apiaceae species, whereas FHTs are more abundant. Domain-swapping experiments joining the N terminus of parsley (Petroselinum crispum) FHT with the C terminus of parsley FNS I and vice versa revealed that the C-terminal portion is not essential for FNS I activity. Sequence alignments identified 26 amino acid substitutions conserved in FHT versus FNS I genes. Homology modeling, based on the related anthocyanidin synthase structure, assigned seven of these amino acids (FHT/FNS I, M106T, I115T, V116I, I131F, D195E, V200I, L215V, and K216R) to the active site. Accordingly, FHT was modified by site-directed mutagenesis, creating mutants encoding from one to seven substitutions, which were expressed in yeast (Saccharomyces cerevisiae) for FNS I and FHT assays. The exchange I131F in combination with either M106T and D195E or L215V and K216R replacements was sufficient to confer some FNS I side activity. Introduction of all seven FNS I substitutions into the FHT sequence, however, caused a nearly complete change in enzyme activity from FHT to FNS I. Both FHT and FNS I were proposed to initially withdraw the β-face-configured hydrogen from carbon-3 of the naringenin substrate. Our results suggest that the 7-fold substitution affects the orientation of the substrate in the active-site pocket such that this is followed by syn-elimination of hydrogen from carbon-2 (FNS I reaction) rather than the rebound hydroxylation of carbon-3 (FHT reaction). PMID:17535823
Genetic characterization of the hemagglutinin genes of wild-type measles virus circulating in china, 1993-2009.

PubMed

Xu, Songtao; Zhang, Yan; Zhu, Zhen; Liu, Chunyu; Mao, Naiying; Ji, Yixin; Wang, Huiling; Jiang, Xiaohong; Li, Chongshan; Tang, Wei; Feng, Daxing; Wang, Changyin; Zheng, Lei; Lei, Yue; Ling, Hua; Zhao, Chunfang; Ma, Yan; He, Jilan; Wang, Yan; Li, Ping; Guan, Ronghui; Zhou, Shujie; Zhou, Jianhui; Wang, Shuang; Zhang, Hong; Zheng, Huanying; Liu, Leng; Ma, Hemuti; Guan, Jing; Lu, Peishan; Feng, Yan; Zhang, Yanjun; Zhou, Shunde; Xiong, Ying; Ba, Zhuoma; Chen, Hui; Yang, Xiuhui; Bo, Fang; Ma, Yujie; Liang, Yong; Lei, Yake; Gu, Suyi; Liu, Wei; Chen, Meng; Featherstone, David; Jee, Youngmee; Bellini, William J; Rota, Paul A; Xu, Wenbo

2013-01-01

China experienced several large measles outbreaks in the past two decades, and a series of enhanced control measures were implemented to achieve the goal of measles elimination. Molecular epidemiologic surveillance of wild-type measles viruses (MeV) provides valuable information about the viral transmission patterns. Since 1993, virologic surveillnace has confirmed that a single endemic genotype H1 viruses have been predominantly circulating in China. A component of molecular surveillance is to monitor the genetic characteristics of the hemagglutinin (H) gene of MeV, the major target for virus neutralizing antibodies. Analysis of the sequences of the complete H gene from 56 representative wild-type MeV strains circulating in China during 1993-2009 showed that the H gene sequences were clustered into 2 groups, cluster 1 and cluster 2. Cluster1 strains were the most frequently detected cluster and had a widespread distribution in China after 2000. The predicted amino acid sequences of the H protein were relatively conserved at most of the functionally significant amino acid positions. However, most of the genotype H1 cluster1 viruses had an amino acid substitution (Ser240Asn), which removed a predicted N-linked glycosylation site. In addition, the substitution of Pro397Leu in the hemagglutinin noose epitope (HNE) was identified in 23 of 56 strains. The evolutionary rate of the H gene of the genotype H1 viruses was estimated to be approximately 0.76×10(-3) substitutions per site per year, and the ratio of dN to dS (dN/dS) was <1 indicating the absence of selective pressure. Although H genes of the genotype H1 strains were conserved and not subjected to selective pressure, several amino acid substitutions were observed in functionally important positions. Therefore the antigenic and genetic properties of H genes of wild-type MeVs should be monitored as part of routine molecular surveillance for measles in China.
Fibonacci chain polynomials: Identities from self-similarity

NASA Technical Reports Server (NTRS)

Lang, Wolfdieter

1995-01-01

Fibonacci chains are special diatomic, harmonic chains with uniform nearest neighbor interaction and two kinds of atoms (mass-ratio r) arranged according to the self-similar binary Fibonacci sequence ABAABABA..., which is obtained by repeated substitution of A yields AB and B yields A. The implications of the self-similarity of this sequence for the associated orthogonal polynomial systems which govern these Fibonacci chains with fixed mass-ratio r are studied.
Complete Genome Sequence of the Circulatory Foot-and-Mouth Disease Virus Serotype Asia1 in Bangladesh

PubMed Central

Ali, M. Rahmat; Alam, A. S. M. Rubayet Ul; Amin, M. Al; Ullah, Huzzat; Siddique, Mohammad Anwar; Momtaz, Samina; Sultana, Munawar

2017-01-01

ABSTRACT The complete genome sequence of foot-and-mouth disease virus (FMDV) serotype Asia1 isolated from Bangladesh is reported here. Genome analysis revealed amino acid substitutions in the VP1 antigenic region and deletions in both the 5′ and 3′ untranslated regions (UTRs) compared to the genome of the existing vaccine strain (GenBank accession no. AY304994). PMID:29074654
Sample substitution can be an acceptable data-collection strategy: the case of the Belgian Health Interview Survey.

PubMed

Demarest, Stefaan; Molenberghs, Geert; Van der Heyden, Johan; Gisle, Lydia; Van Oyen, Herman; de Waleffe, Sandrine; Van Hal, Guido

2017-11-01

Substitution of non-participating households is used in the Belgian Health Interview Survey (BHIS) as a method to obtain the predefined net sample size. Yet, possible effects of applying substitution on response rates and health estimates remain uncertain. In this article, the process of substitution with its impact on response rates and health estimates is assessed. The response rates (RR)-both at household and individual level-according to the sampling criteria were calculated for each stage of the substitution process, together with the individual accrual rate (AR). Unweighted and weighted health estimates were calculated before and after applying substitution. Of the 10,468 members of 4878 initial households, 5904 members (RRind: 56.4%) of 2707 households (RRhh: 55.5%) participated. For the three successive (matched) substitutes, the RR dropped to 45%. The composition of the net sample resembles the one of the initial samples. Applying substitution did not produce any important distorting effects on the estimates. Applying substitution leads to an increase in non-participation, but does not impact the estimations.
Complement component 3: characterization and association with mastitis resistance in Egyptian water buffalo and cattle.

PubMed

El-Halawany, Nermin; Abd-El-Monsif, Shawky A; Al-Tohamy Ahmed, F M; Hegazy, Lamees; Abdel-Shafy, Hamdy; Abdel-Latif, Magdy A; Ghazi, Yasser A; Neuhoff, Christiane; Salilew-Wondim, Dessie; Schellander, Karl

2017-03-01

Mastitis is an infectious disease of the mammary gland that leads to reduced milk production and change in milk composition. Complement component C3 plays a major role as a central molecule of the complement cascade involving in killing of microorganisms, either directly or in cooperation with phagocytic cells. C3 cDNA were isolated, from Egyptian buffalo and cattle, sequenced and characterized. The C3 cDNA sequences of buffalo and cattle consist of 5025 and 5019 bp, respectively. Buffalo and cattle C3 cDNAs share 99% of sequence identity with each other. The 4986 bp open reading frame in buffalo encodes a putative protein of 1661 amino acids-as in cattle-and includes all the functional domains. Further, analysis of the C3 cDNA sequences detected six novel single-nucleotide polymorphisms (SNPs) in buffalo and three novel SNPs in cattle. The association analysis of the detected SNPs with milk somatic cell score as an indicator of mastitis revealed that the most significant association in buffalo was found in the C>A substitution (ss: 1752816097) in exon 27, whereas in cattle it was in the C>T substitution (ss: 1752816085) in exon 12. Our findings provide preliminary information about the contribution of C3 polymorphisms to mastitis resistance in buffalo and cattle.
Chloroplast and nuclear gene sequences indicate late Pennsylvanian time for the last common ancestor of extant seed plants.

PubMed Central

Savard, L; Li, P; Strauss, S H; Chase, M W; Michaud, M; Bousquet, J

1994-01-01

We have estimated the time for the last common ancestor of extant seed plants by using molecular clocks constructed from the sequences of the chloroplastic gene coding for the large subunit of ribulose-1,5-bisphosphate carboxylase/oxygenase (rbcL) and the nuclear gene coding for the small subunit of rRNA (Rrn18). Phylogenetic analyses of nucleotide sequences indicated that the earliest divergence of extant seed plants is likely represented by a split between conifer-cycad and angiosperm lineages. Relative-rate tests were used to assess homogeneity of substitution rates among lineages, and annual angiosperms were found to evolve at a faster rate than other taxa for rbcL and, thus, these sequences were excluded from construction of molecular clocks. Five distinct molecular clocks were calibrated using substitution rates for the two genes and four divergence times based on fossil and published molecular clock estimates. The five estimated times for the last common ancestor of extant seed plants were in agreement with one another, with an average of 285 million years and a range of 275-290 million years. This implies a substantially more recent ancestor of all extant seed plants than suggested by some theories of plant evolution. PMID:8197201
First detection of multiple knockdown resistance (kdr)-like mutations in voltage-gated sodium channel using three new genotyping methods in Anopheles sinensis from Guangxi Province, China.

PubMed

Tan, Wei L; Li, Chun X; Wang, Zhong M; Liu, Mei D; Dong, Yan D; Feng, Xiang Y; Wu, Zhi M; Guo, Xiao X; Xing, Dan; Zhang, Ying M; Wang, Zhong C; Zhao, Tong Y

2012-09-01

To investigate knockdown resistance (kdr)-like mutations associated with pyrethroid resistance in Anopheles sinensis (Wiedemann, 1828), from Guangxi province, southwest China, a segment of a sodium channel gene was sequenced and genotyped using three new genotyping assays. Direct sequencing revealed the presence of TTG-to-TCG and TG-to-TTT mutations at allele position L1014, which led to L1014S and L1014F substitutions in a few individual and two novel substitutions of N1013S and L1014W in two DNA templates. A low frequency of the kdr allele mostly in the heterozygous state of L1014S and L1014F was observed in this mosquito population. In this study, the genotyping of An. sinensis using three polymerase chain reaction-based methods generated consistent results, which agreed with the results of DNA sequencing. In total, 52 mosquitoes were genotyped using a direct sequencing assay. The number of mosquitoes and their genotypes were as follows: L/L = 24, L/S = 19, L/F = 8, and F/W = 1. The allelic frequency of L1014, 1014S, and 1014F were 72, 18, and 9%, respectively.
DNA Barcoding analysis of seafood accuracy in Washington, D.C. restaurants

PubMed Central

Stern, David B.; Castro Nallar, Eduardo; Rathod, Jason

2017-01-01

In Washington D.C., recent legislation authorizes citizens to test if products are properly represented and, if they are not, to bring a lawsuit for the benefit of the general public. Recent studies revealing the widespread phenomenon of seafood substitution across the United States make it a fertile area for consumer protection testing. DNA barcoding provides an accurate and cost-effective way to perform these tests, especially when tissue alone is available making species identification based on morphology impossible. In this study, we sequenced the 5′ barcoding region of the Cytochrome Oxidase I gene for 12 samples of vertebrate and invertebrate food items across six restaurants in Washington, D.C. and used multiple analytical methods to make identifications. These samples included several ambiguous menu listings, sequences with little genetic variation among closely related species and one sequence with no available reference sequence. Despite these challenges, we were able to make identifications for all samples and found that 33% were potentially mislabeled. While we found a high degree of mislabeling, the errors involved closely related species and we did not identify egregious substitutions as have been found in other cities. This study highlights the efficacy of DNA barcoding and robust analyses in identifying seafood items for consumer protection. PMID:28462038
The Relationship between the Structure of the Tick-Borne Encephalitis Virus Strains and Their Pathogenic Properties

PubMed Central

Belikov, Sergei I.; Kondratov, Ilya G.; Potapova, Ulyana V.; Leonova, Galina N.

2014-01-01

Tick-borne encephalitis virus (TBEV) is transmitted to vertebrates by taiga or forest ticks through bites, inducing disease of variable severity. The reasons underlying these differences in the severity of the disease are unknown. In order to identify genetic factors affecting the pathogenicity of virus strains, we have sequenced and compared the complete genomes of 34 Far-Eastern subtype (FE) TBEV strains isolated from patients with different disease severity (Primorye, the Russian Far East). We analyzed the complete genomes of 11 human pathogenic strains isolated from the brains of dead patients with the encephalitic form of the disease (Efd), 4 strains from the blood of patients with the febrile form of TBE (Ffd), and 19 strains from patients with the subclinical form of TBE (Sfd). On the phylogenetic tree, pathogenic Efd strains formed two clusters containing the prototype strains, Senzhang and Sofjin, respectively. Sfd strains formed a third separate cluster, including the Oshima strain. The strains that caused the febrile form of the disease did not form a separate cluster. In the viral proteins, we found 198 positions with at least one amino acid residue substitution, of which only 17 amino acid residue substitutions were correlated with the variable pathogenicity of these strains in humans and they authentically differed between the groups. We considered the role of each amino acid substitution and assumed that the deletion of 111 amino acids in the capsid protein in combination with the amino acid substitutions R16K and S45F in the NS3 protease may affect the budding process of viral particles. These changes may be the major reason for the diminished pathogenicity of TBEV strains. We recommend Sfd strains for testing as attenuation vaccine candidates. PMID:24740396
The degree of attenuation of tick-borne encephalitis virus depends on the cumulative effects of point mutations.

PubMed

Gritsun, T S; Desai, A; Gould, E A

2001-07-01

An infectious clone (pGGVs) of the tick-borne encephalitis complex virus Vasilchenko (Vs) was constructed previously. Virus recovered from pGGVs produced slightly smaller plaques than the Vs parental virus. Sequence analysis demonstrated five nucleotide differences between the original Vs virus and pGGVs; four of these mutations resulted in amino acid substitutions, while the fifth mutation was located in the 3' untranslated region (3'UTR). Two mutations were located in conserved regions and three mutations were located in variable regions of the virus genome. Reverse substitutions from the conserved regions of the genome, R(496)-->H in the envelope (E) gene and C(10884)-->T in the 3'UTR, were introduced both separately and together into the infectious clone and their biological effect on virus phenotype was evaluated. The engineered viruses with R(496) in the E protein produced plaques of smaller size than viruses with H(496) at this position. This mutation also affected the growth and neuroinvasiveness of the virus. In contrast, the consequence of a T(10884)-->C substitution within the 3'UTR was noticeable only in cytotoxicity and neuroinvasiveness tests. However, all virus mutants engineered by modification of the infectious clone, including one with two wild-type mutations, H(496) and T(10884), showed reduced neuroinvasiveness in comparison with the Vs parental virus. Therefore, although the H(496)-->R and T(10884)-->C substitutions clearly reduce virus virulence, the other mutations within the variable regions of the capsid (I(45)-->F) and the NS5 (T(2688)-->A and M(3385)-->I) genes also contribute to the process of attenuation. In terms of developing flavivirus vaccines, the impact of accumulating apparently minor mutations should be assessed in detail.
Electrotactile and vibrotactile displays for sensory substitution systems

NASA Technical Reports Server (NTRS)

Kaczmarek, Kurt A.; Webster, John G.; Bach-Y-rita, Paul; Tompkins, Willis J.

1991-01-01

Sensory substitution systems provide their users with environmental information through a human sensory channel (eye, ear, or skin) different from that normally used or with the information processed in some useful way. The authors review the methods used to present visual, auditory, and modified tactile information to the skin and discuss present and potential future applications of sensory substitution, including tactile vision substitution (TVS), tactile auditory substitution, and remote tactile sensing or feedback (teletouch). The relevant sensory physiology of the skin, including the mechanisms of normal touch and the mechanisms and sensations associated with electrical stimulation of the skin using surface electrodes (electrotactile, or electrocutaneous, stimulation), is reviewed. The information-processing ability of the tactile sense and its relevance to sensory substitution is briefly summarized. The limitations of current tactile display technologies are discussed.
Hiding message into DNA sequence through DNA coding and chaotic maps.

PubMed

Liu, Guoyan; Liu, Hongjun; Kadir, Abdurahman

2014-09-01

The paper proposes an improved reversible substitution method to hide data into deoxyribonucleic acid (DNA) sequence, and four measures have been taken to enhance the robustness and enlarge the hiding capacity, such as encode the secret message by DNA coding, encrypt it by pseudo-random sequence, generate the relative hiding locations by piecewise linear chaotic map, and embed the encoded and encrypted message into a randomly selected DNA sequence using the complementary rule. The key space and the hiding capacity are analyzed. Experimental results indicate that the proposed method has a better performance compared with the competing methods with respect to robustness and capacity.
Variance to mean ratio, R(t), for poisson processes on phylogenetic trees.

PubMed

Goldman, N

1994-09-01

The ratio of expected variance to mean, R(t), of numbers of DNA base substitutions for contemporary sequences related by a "star" phylogeny is widely seen as a measure of the adherence of the sequences' evolution to a Poisson process with a molecular clock, as predicted by the "neutral theory" of molecular evolution under certain conditions. A number of estimators of R(t) have been proposed, all predicted to have mean 1 and distributions based on the chi 2. Various genes have previously been analyzed and found to have values of R(t) far in excess of 1, calling into question important aspects of the neutral theory. In this paper, I use Monte Carlo simulation to show that the previously suggested means and distributions of estimators of R(t) are highly inaccurate. The analysis is applied to star phylogenies and to general phylogenetic trees, and well-known gene sequences are reanalyzed. For star phylogenies the results show that Kimura's estimators ("The Neutral Theory of Molecular Evolution," Cambridge Univ. Press, Cambridge, 1983) are unsatisfactory for statistical testing of R(t), but confirm the accuracy of Bulmer's correction factor (Genetics 123: 615-619, 1989). For all three nonstar phylogenies studied, attained values of all three estimators of R(t), although larger than 1, are within their true confidence limits under simple Poisson process models. This shows that lineage effects can be responsible for high estimates of R(t), restoring some limited confidence in the molecular clock and showing that the distinction between lineage and molecular clock effects is vital.(ABSTRACT TRUNCATED AT 250 WORDS)
The Doubting System 1: Evidence for automatic substitution sensitivity.

PubMed

Johnson, Eric D; Tubau, Elisabet; De Neys, Wim

2016-02-01

A long prevailing view of human reasoning suggests severe limits on our ability to adhere to simple logical or mathematical prescriptions. A key position assumes these failures arise from insufficient monitoring of rapidly produced intuitions. These faulty intuitions are thought to arise from a proposed substitution process, by which reasoners unknowingly interpret more difficult questions as easier ones. Recent work, however, suggests that reasoners are not blind to this substitution process, but in fact detect that their erroneous responses are not warranted. Using the popular bat-and-ball problem, we investigated whether this substitution sensitivity arises out of an automatic System 1 process or whether it depends on the operation of an executive resource demanding System 2 process. Results showed that accuracy on the bat-and-ball problem clearly declined under cognitive load. However, both reduced response confidence and increased response latencies indicated that biased reasoners remained sensitive to their faulty responses under load. Results suggest that a crucial substitution monitoring process is not only successfully engaged, but that it automatically operates as an autonomous System 1 process. By signaling its doubt along with a biased intuition, it appears System 1 is "smarter" than traditionally assumed.
Substitution rate and natural selection in parvovirus B19

PubMed Central

Stamenković, Gorana G.; Ćirković, Valentina S.; Šiljić, Marina M.; Blagojević, Jelena V.; Knežević, Aleksandra M.; Joksić, Ivana D.; Stanojević, Maja P.

2016-01-01

The aim of this study was to estimate substitution rate and imprints of natural selection on parvovirus B19 genotype 1. Studied datasets included 137 near complete coding B19 genomes (positions 665 to 4851) for phylogenetic and substitution rate analysis and 146 and 214 partial genomes for selection analyses in open reading frames ORF1 and ORF2, respectively, collected 1973–2012 and including 9 newly sequenced isolates from Serbia. Phylogenetic clustering assigned majority of studied isolates to G1A. Nucleotide substitution rate for total coding DNA was 1.03 (0.6–1.27) x 10−4 substitutions/site/year, with higher values for analyzed genome partitions. In spite of the highest evolutionary rate, VP2 codons were found to be under purifying selection with rare episodic positive selection, whereas codons under diversifying selection were found in the unique part of VP1, known to contain B19 immune epitopes important in persistent infection. Analyses of overlapping gene regions identified nucleotide positions under opposite selective pressure in different ORFs, suggesting complex evolutionary mechanisms of nucleotide changes in B19 viral genomes. PMID:27775080

Somatic diversification in the heavy chain variable region genes expressed by human autoantibodies bearing a lupus-associated nephritogenic anti-DNA idiotype

DOE Office of Scientific and Technical Information (OSTI.GOV)

Demaison, C.; Chastagner, P.; Theze, J.

1994-01-18

Monoclonal anti-DNA antibodies bearing a lupus nephritis-associated idiotype were derived from five patients with systemic lupus erythematosus (SLE). Genes encoding their heavy (H)-chain variable (V[sub H]) regions were cloned and sequenced. When compared with their closest V[sub h] germ-line gene relatives, these sequences exhibit a number of silent (S) and replacement (R) substitutions. The ratios of R/S mutations were much higher in the complementarity-determining regions (CDRs) of the antibodies than in the framework regions. Molecular amplification of genomic V[sub H] genes and Southern hybridization with somatic CDR2-specific oligonucleotide probes showed that the configuration of the V[sub H] genes corresponding tomore » V[sub H] sequences in the nephritogenic antibodies is not present in the patient's own germ-line DNA, implying that the B-cell clones underwent somatic mutation in vivo. These findings, together with the characteristics of the diversity and junctional gene elements utilized to form the antibody, indicate that these autoantibodies have been driven through somatic selection processes reminiscent of those that govern antibody responses triggered by exogenous stimuli.« less
Homology between DNA polymerases of poxviruses, herpesviruses, and adenoviruses: nucleotide sequence of the vaccinia virus DNA polymerase gene.

PubMed Central

Earl, P L; Jones, E V; Moss, B

1986-01-01

A 5400-base-pair segment of the vaccinia virus genome was sequenced and an open reading frame of 938 codons was found precisely where the DNA polymerase had been mapped by transfer of a phosphonoacetate-resistance marker. A single nucleotide substitution changing glycine at position 347 to aspartic acid accounts for the drug resistance of the mutant vaccinia virus. The 5' end of the DNA polymerase mRNA was located 80 base pairs before the methionine codon initiating the open reading frame. Correspondence between the predicted Mr 108,577 polypeptide and the 110,000 purified enzyme indicates that little or no proteolytic processing occurs. Extensive homology, extending over 435 amino acids, was found upon comparing the DNA polymerase of vaccinia virus and DNA polymerase of Epstein-Barr virus. A highly conserved sequence of 14 amino acids in the carboxyl-terminal regions of the above DNA polymerases is also present at a similar location in adenovirus DNA polymerase. This structure, which is predicted to form a turn flanked by beta-pleated sheets, may form part of an essential binding or catalytic site that accounts for its presence in DNA polymerases of poxviruses, herpesviruses, and adenoviruses. Images PMID:3012524
Effects of nucleoside analog incorporation on DNA binding to the DNA binding domain of the GATA-1 erythroid transcription factor.

PubMed

Foti, M; Omichinski, J G; Stahl, S; Maloney, D; West, J; Schweitzer, B I

1999-02-05

We investigate here the effects of the incorporation of the nucleoside analogs araC (1-beta-D-arabinofuranosylcytosine) and ganciclovir (9-[(1,3-dihydroxy-2-propoxy)methyl] guanine) into the DNA binding recognition sequence for the GATA-1 erythroid transcription factor. A 10-fold decrease in binding affinity was observed for the ganciclovir-substituted DNA complex in comparison to an unmodified DNA of the same sequence composition. AraC substitution did not result in any changes in binding affinity. 1H-15N HSQC and NOESY NMR experiments revealed a number of chemical shift changes in both DNA and protein in the ganciclovir-modified DNA-protein complex when compared to the unmodified DNA-protein complex. These changes in chemical shift and binding affinity suggest a change in the binding mode of the complex when ganciclovir is incorporated into the GATA DNA binding site.
Generalized Majority Logic Criterion to Analyze the Statistical Strength of S-Boxes

NASA Astrophysics Data System (ADS)

Hussain, Iqtadar; Shah, Tariq; Gondal, Muhammad Asif; Mahmood, Hasan

2012-05-01

The majority logic criterion is applicable in the evaluation process of substitution boxes used in the advanced encryption standard (AES). The performance of modified or advanced substitution boxes is predicted by processing the results of statistical analysis by the majority logic criteria. In this paper, we use the majority logic criteria to analyze some popular and prevailing substitution boxes used in encryption processes. In particular, the majority logic criterion is applied to AES, affine power affine (APA), Gray, Lui J, residue prime, S8 AES, Skipjack, and Xyi substitution boxes. The majority logic criterion is further extended into a generalized majority logic criterion which has a broader spectrum of analyzing the effectiveness of substitution boxes in image encryption applications. The integral components of the statistical analyses used for the generalized majority logic criterion are derived from results of entropy analysis, contrast analysis, correlation analysis, homogeneity analysis, energy analysis, and mean of absolute deviation (MAD) analysis.
Within-Host Variations of Human Papillomavirus Reveal APOBEC Signature Mutagenesis in the Viral Genome.

PubMed

Hirose, Yusuke; Onuki, Mamiko; Tenjimbayashi, Yuri; Mori, Seiichiro; Ishii, Yoshiyuki; Takeuchi, Takamasa; Tasaka, Nobutaka; Satoh, Toyomi; Morisada, Tohru; Iwata, Takashi; Miyamoto, Shingo; Matsumoto, Koji; Sekizawa, Akihiko; Kukimoto, Iwao

2018-06-15

Persistent infection with oncogenic human papillomaviruses (HPVs) causes cervical cancer, accompanied by the accumulation of somatic mutations into the host genome. There are concomitant genetic changes in the HPV genome during viral infection; however, their relevance to cervical carcinogenesis is poorly understood. Here, we explored within-host genetic diversity of HPV by performing deep-sequencing analyses of viral whole-genome sequences in clinical specimens. The whole genomes of HPV types 16, 52, and 58 were amplified by type-specific PCR from total cellular DNA of cervical exfoliated cells collected from patients with cervical intraepithelial neoplasia (CIN) and invasive cervical cancer (ICC) and were deep sequenced. After constructing a reference viral genome sequence for each specimen, nucleotide positions showing changes with >0.5% frequencies compared to the reference sequence were determined for individual samples. In total, 1,052 positions of nucleotide variations were detected in HPV genomes from 151 samples (CIN1, n = 56; CIN2/3, n = 68; ICC, n = 27), with various numbers per sample. Overall, C-to-T and C-to-A substitutions were the dominant changes observed across all histological grades. While C-to-T transitions were predominantly detected in CIN1, their prevalence was decreased in CIN2/3 and fell below that of C-to-A transversions in ICC. Analysis of the trinucleotide context encompassing substituted bases revealed that TpCpN, a preferred target sequence for cellular APOBEC cytosine deaminases, was a primary site for C-to-T substitutions in the HPV genome. These results strongly imply that the APOBEC proteins are drivers of HPV genome mutation, particularly in CIN1 lesions. IMPORTANCE HPVs exhibit surprisingly high levels of genetic diversity, including a large repertoire of minor genomic variants in each viral genotype. Here, by conducting deep-sequencing analyses, we show for the first time a comprehensive snapshot of the within-host genetic diversity of high-risk HPVs during cervical carcinogenesis. Quasispecies harboring minor nucleotide variations in viral whole-genome sequences were extensively observed across different grades of CIN and cervical cancer. Among the within-host variations, C-to-T transitions, a characteristic change mediated by cellular APOBEC cytosine deaminases, were predominantly detected throughout the whole viral genome, most strikingly in low-grade CIN lesions. The results strongly suggest that within-host variations of the HPV genome are primarily generated through the interaction with host cell DNA-editing enzymes and that such within-host variability is an evolutionary source of the genetic diversity of HPVs. Copyright © 2018 American Society for Microbiology.
Molecular Evolution of a Type 1 Wild-Vaccine Poliovirus Recombinant during Widespread Circulation in China

PubMed Central

Liu, Hong-Mei; Zheng, Du-Ping; Zhang, Li-Bi; Oberste, M. Steven; Pallansch, Mark A.; Kew, Olen M.

2000-01-01

Type 1 wild-vaccine recombinant polioviruses were isolated from poliomyelitis patients in China from 1991 to 1993. We compared the sequences of 34 recombinant isolates over the 1,353-nucleotide (nt) genomic interval (nt 2480 to 3832) encoding the major capsid protein, VP1, and the protease, 2A. All recombinants had a 367-nt block of sequence (nt 3271 to 3637) derived from the Sabin 1 oral poliovirus vaccine strain spanning the 3′-terminal sequences of VP1 (115 nt) and the 5′ half of 2A (252 nt). The remaining VP1 sequences were closely (up to 99.5%) related to those of a major genotype of wild type 1 poliovirus endemic to China up to 1994. In contrast, the non-vaccine-derived sequences at the 3′ half of 2A were more distantly related (<90% nucleotide sequence match) to those of other contemporary wild polioviruses from China. The vaccine-derived sequences of the earliest (April 1991) isolates completely matched those of Sabin 1. Later isolates diverged from the early isolates primarily by accumulation of synonymous base substitutions (at a rate of ∼3.7 × 10−2 substitutions per synonymous site per year) over the entire VP1-2A interval. Distinct evolutionary lineages were found in different Chinese provinces. From the combined epidemiologic and evolutionary analyses, we propose that the recombinant virus arose during mixed infection of a single individual in northern China in early 1991 and that its progeny spread by multiple independent chains of transmission into some of the most populous areas of China within a year of the initiating infection. PMID:11070012
Implications of the plastid genome sequence of typha (typhaceae, poales) for understanding genome evolution in poaceae.

PubMed

Guisinger, Mary M; Chumley, Timothy W; Kuehl, Jennifer V; Boore, Jeffrey L; Jansen, Robert K

2010-02-01

Plastid genomes of the grasses (Poaceae) are unusual in their organization and rates of sequence evolution. There has been a recent surge in the availability of grass plastid genome sequences, but a comprehensive comparative analysis of genome evolution has not been performed that includes any related families in the Poales. We report on the plastid genome of Typha latifolia, the first non-grass Poales sequenced to date, and we present comparisons of genome organization and sequence evolution within Poales. Our results confirm that grass plastid genomes exhibit acceleration in both genomic rearrangements and nucleotide substitutions. Poaceae have multiple structural rearrangements, including three inversions, three genes losses (accD, ycf1, ycf2), intron losses in two genes (clpP, rpoC1), and expansion of the inverted repeat (IR) into both large and small single-copy regions. These rearrangements are restricted to the Poaceae, and IR expansion into the small single-copy region correlates with the phylogeny of the family. Comparisons of 73 protein-coding genes for 47 angiosperms including nine Poaceae genera confirm that the branch leading to Poaceae has significantly accelerated rates of change relative to other monocots and angiosperms. Furthermore, rates of sequence evolution within grasses are lower, indicating a deceleration during diversification of the family. Overall there is a strong correlation between accelerated rates of genomic rearrangements and nucleotide substitutions in Poaceae, a phenomenon that has been noted recently throughout angiosperms. The cause of the correlation is unknown, but faulty DNA repair has been suggested in other systems including bacterial and animal mitochondrial genomes.
Rapid rate of control-region evolution in Pacific butterflyfishes (Chaetodontidae).

PubMed

McMillan, W O; Palumbi, S R

1997-11-01

Sequence differences in the tRNA-proline (tRNApro) end of the mitochondrial control-region of three species of Pacific butterflyfishes accumulated 33-43 times more rapidly than did changes within the mitochondrial cytochrome b gene (cytb). Rapid evolution in this region was accompanied by strong transition/transversion bias and large variation in the probability of a DNA substitution among sites. These substitution constraints placed an absolute ceiling on the magnitude of sequence divergence that could be detected between individuals. This divergence "ceiling" was reached rapidly and led to a decay in the relative rate of control-region/cytb b evolution. A high rate of evolution in this section of the control-region of butterflyfishes stands in marked contrast to the patterns reported in some other fish lineages. Although the mechanism underlying rate variation remains unclear, all taxa with rapid evolution in the 5'-end of the control-region showed extreme transition biases. By contrast, in taxa with slower control-region evolution, transitions accumulated at nearly the same rate as transversions. More information is needed to understand the relationship between nucleotide bias and the rate of evolution in the 5'-end of the control-region. Despite strong constraints on sequence change, phylogenetic information was preserved in the group of recently differentiated species and supported the clustering of sequences into three major mtDNA groupings. Within these groups, very similar control-region sequences were widely distributed across the Pacific Ocean and were shared between recognized species, indicating a lack of mitochondrial sequence monophyly among species.
Amino acid and nucleotide recurrence in aligned sequences: synonymous substitution patterns in association with global and local base compositions.

PubMed

Nishizawa, M; Nishizawa, K

2000-10-01

The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the 'between gene' GC content heterogeneity, which is linked to 'isochores', is a principal factor associated with the bias in substitution patterns in human, 'within gene' heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed.
Amino acid and nucleotide recurrence in aligned sequences: synonymous substitution patterns in association with global and local base compositions

PubMed Central

Nishizawa, Manami; Nishizawa, Kazuhisa

2000-01-01

The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the ‘between gene’ GC content heterogeneity, which is linked to ‘isochores’, is a principal factor associated with the bias in substitution patterns in human, ‘within gene’ heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed. PMID:11000273
A Rapid Method to Test for Chloroplast DNA Involvement in Atrazine Resistance

PubMed Central

McNally, Sheila; Bettini, Priscilla; Sevignac, Mireille; Darmency, Henry; Gasquez, Jacques; Dron, Michel

1987-01-01

A point mutation in the chloroplast psbA gene at codon 264 resulting in an animo acid substitution (ser-gly) manifests itself as atrazine resistance in all recognized weed species studied to date. The single base substitution overlaps a highly conserved Mae1 restriction site which is present in susceptible but not in resistant plants. This restriction enzyme, recently commercialized, has been used to show that it is now possible to discriminate rapidly between the two biotypes without the need for DNA sequencing. Images Fig. 1 PMID:16665229
Heterobimetallic Pd-Sn catalysis: a Suzuki, tandem ring-closing sequence toward indeno[2,1-b]thiophenes and indeno[2,1-b]indoles.

PubMed

Das, Debjit; Pratihar, Sanjay; Roy, Sujit

2012-09-21

Indeno[2,1-b]thiophene and indeno[1,2-b]indole motifs have been obtained in moderate to good yields from easily available substituted boronic acids, 2-bromo aryl/vinyl aldehydes, and nucleophiles such as arenes/heteroarenes and others using a catalytic combination of bimetallic "Pd-Sn" and AgPF(6). This formal three-component coupling involves a Suzuki reaction followed by nucleophile assisted tandem ring closure. The sequential synthesis of substituted heterocycle-fused indenes, benzofluorene, and fluorenes was also accomplished.
Synthesis of isochromene-type scaffolds via single-flask Diels-Alder-[4 + 2]-annulation sequence of a silyl-substituted diene with menadione.

PubMed

Lee, Jihoon; Panek, James S

2014-06-20

A sequential Diels-Alder reaction/silicon-directed [4 + 2]-annulation was developed to assemble hydroisochromene-type ring systems from menadione 2. In the first step, a Diels-Alder of the 1-silyl-substituted butadiene 1 with 2 furnished an intermediate cyclic allylsilane. Subsequently, TMSOTf promoted a [4 + 2]-annulation through trapping of an oxonium, generated by condensation between an aldehyde and the TBS protected alcohol resulted in the formation of a cis-fused hydroisochromene 13.
Genetic diversity and evolutionary dynamics of Ebola virus in Sierra Leone.

PubMed

Tong, Yi-Gang; Shi, Wei-Feng; Liu, Di; Qian, Jun; Liang, Long; Bo, Xiao-Chen; Liu, Jun; Ren, Hong-Guang; Fan, Hang; Ni, Ming; Sun, Yang; Jin, Yuan; Teng, Yue; Li, Zhen; Kargbo, David; Dafae, Foday; Kanu, Alex; Chen, Cheng-Chao; Lan, Zhi-Heng; Jiang, Hui; Luo, Yang; Lu, Hui-Jun; Zhang, Xiao-Guang; Yang, Fan; Hu, Yi; Cao, Yu-Xi; Deng, Yong-Qiang; Su, Hao-Xiang; Sun, Yu; Liu, Wen-Sen; Wang, Zhuang; Wang, Cheng-Yu; Bu, Zhao-Yang; Guo, Zhen-Dong; Zhang, Liu-Bo; Nie, Wei-Min; Bai, Chang-Qing; Sun, Chun-Hua; An, Xiao-Ping; Xu, Pei-Song; Zhang, Xiang-Li-Lan; Huang, Yong; Mi, Zhi-Qiang; Yu, Dong; Yao, Hong-Wu; Feng, Yong; Xia, Zhi-Ping; Zheng, Xue-Xing; Yang, Song-Tao; Lu, Bing; Jiang, Jia-Fu; Kargbo, Brima; He, Fu-Chu; Gao, George F; Cao, Wu-Chun

2015-08-06

A novel Ebola virus (EBOV) first identified in March 2014 has infected more than 25,000 people in West Africa, resulting in more than 10,000 deaths. Preliminary analyses of genome sequences of 81 EBOV collected from March to June 2014 from Guinea and Sierra Leone suggest that the 2014 EBOV originated from an independent transmission event from its natural reservoir followed by sustained human-to-human infections. It has been reported that the EBOV genome variation might have an effect on the efficacy of sequence-based virus detection and candidate therapeutics. However, only limited viral information has been available since July 2014, when the outbreak entered a rapid growth phase. Here we describe 175 full-length EBOV genome sequences from five severely stricken districts in Sierra Leone from 28 September to 11 November 2014. We found that the 2014 EBOV has become more phylogenetically and genetically diverse from July to November 2014, characterized by the emergence of multiple novel lineages. The substitution rate for the 2014 EBOV was estimated to be 1.23 × 10(-3) substitutions per site per year (95% highest posterior density interval, 1.04 × 10(-3) to 1.41 × 10(-3) substitutions per site per year), approximating to that observed between previous EBOV outbreaks. The sharp increase in genetic diversity of the 2014 EBOV warrants extensive EBOV surveillance in Sierra Leone, Guinea and Liberia to better understand the viral evolution and transmission dynamics of the ongoing outbreak. These data will facilitate the international efforts to develop vaccines and therapeutics.
A novel amino acid substitution in a voltage-gated sodium channel is associated with knockdown resistance to permethrin in Aedes aegypti.

PubMed

Chang, Cheng; Shen, Wen-Kai; Wang, Tzu-Ting; Lin, Ying-Hsi; Hsu, Err-Lieh; Dai, Shu-Mei

2009-04-01

To identify pertinent mutations associated with knockdown resistance to permethrin, the entire coding sequence of the voltage-gated sodium channel gene Aa-para was sequenced and analyzed from a Per-R strain with 190-fold resistance to permethrin and two susceptible strains of Aedes aegypti. The longest transcript, a 6441bp open reading frame, encodes 2147 amino acid residues with an estimated molecular mass of 241kDa. A total of 33 exons were found in the Aa-para gene over 293kb of genomic DNA. Three previously unreported optional exons were identified. The first two exons, m and n, were located within the intracellular domain I/II, and the third, f', was found within the II/III linkers. The two mutually exclusive exons, d and l, were the only alternative exons in all the cDNA clones sequenced in this study. The most distinct finding was a novel amino acid substitution mutation, D1794Y, located within the extracellular linker between IVS5 and IVS6, which is concurrent with the known V1023G mutation in Aa-para of the Per-R strain. The high frequency and coexistence of the two mutations in the Per-R strain suggest that they might exert a synergistic effect to provide the knockdown resistance to permethrin. Furthermore, both cDNA and genomic DNA data from the same individual mosquitoes have demonstrated that RNA editing was not involved in amino acid substitutions of the Per-R strain.
Unrealistic phylogenetic trees may improve phylogenetic footprinting.

PubMed

Nettling, Martin; Treutler, Hendrik; Cerquides, Jesus; Grosse, Ivo

2017-06-01

The computational investigation of DNA binding motifs from binding sites is one of the classic tasks in bioinformatics and a prerequisite for understanding gene regulation as a whole. Due to the development of sequencing technologies and the increasing number of available genomes, approaches based on phylogenetic footprinting become increasingly attractive. Phylogenetic footprinting requires phylogenetic trees with attached substitution probabilities for quantifying the evolution of binding sites, but these trees and substitution probabilities are typically not known and cannot be estimated easily. Here, we investigate the influence of phylogenetic trees with different substitution probabilities on the classification performance of phylogenetic footprinting using synthetic and real data. For synthetic data we find that the classification performance is highest when the substitution probability used for phylogenetic footprinting is similar to that used for data generation. For real data, however, we typically find that the classification performance of phylogenetic footprinting surprisingly increases with increasing substitution probabilities and is often highest for unrealistically high substitution probabilities close to one. This finding suggests that choosing realistic model assumptions might not always yield optimal predictions in general and that choosing unrealistically high substitution probabilities close to one might actually improve the classification performance of phylogenetic footprinting. The proposed PF is implemented in JAVA and can be downloaded from https://github.com/mgledi/PhyFoo. : martin.nettling@informatik.uni-halle.de. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.
Tracking the Molecular Evolution of Calcium Permeability in a Nicotinic Acetylcholine Receptor

PubMed Central

Lipovsek, Marcela; Fierro, Angélica; Pérez, Edwin G.; Boffi, Juan C.; Millar, Neil S.; Fuchs, Paul A.; Katz, Eleonora; Elgoyhen, Ana Belén

2014-01-01

Nicotinic acetylcholine receptors are a family of ligand-gated nonselective cationic channels that participate in fundamental physiological processes at both the central and the peripheral nervous system. The extent of calcium entry through ligand-gated ion channels defines their distinct functions. The α9α10 nicotinic cholinergic receptor, expressed in cochlear hair cells, is a peculiar member of the family as it shows differences in the extent of calcium permeability across species. In particular, mammalian α9α10 receptors are among the ligand-gated ion channels which exhibit the highest calcium selectivity. This acquired differential property provides the unique opportunity of studying how protein function was shaped along evolutionary history, by tracking its evolutionary record and experimentally defining the amino acid changes involved. We have applied a molecular evolution approach of ancestral sequence reconstruction, together with molecular dynamics simulations and an evolutionary-based mutagenesis strategy, in order to trace the molecular events that yielded a high calcium permeable nicotinic α9α10 mammalian receptor. Only three specific amino acid substitutions in the α9 subunit were directly involved. These are located at the extracellular vestibule and at the exit of the channel pore and not at the transmembrane region 2 of the protein as previously thought. Moreover, we show that these three critical substitutions only increase calcium permeability in the context of the mammalian but not the avian receptor, stressing the relevance of overall protein structure on defining functional properties. These results highlight the importance of tracking evolutionarily acquired changes in protein sequence underlying fundamental functional properties of ligand-gated ion channels. PMID:25193338
Remarkable alkaline stability of an engineered protein A as immunoglobulin affinity ligand: C domain having only one amino acid substitution

PubMed Central

Minakuchi, Kazunobu; Murata, Dai; Okubo, Yuji; Nakano, Yoshiyuki; Yoshida, Shinichi

2013-01-01

Protein A affinity chromatography is the standard purification process for the capture of therapeutic antibodies. The individual IgG-binding domains of protein A (E, D, A, B, C) have highly homologous amino acid sequences. From a previous report, it has been assumed that the C domain has superior resistance to alkaline conditions compared to the other domains. We investigated several properties of the C domain as an IgG-Fc capture ligand. Based on cleavage site analysis of a recombinant protein A using a protein sequencer, the C domain was found to be the only domain to have neither of the potential alkaline cleavage sites. Circular dichroism (CD) analysis also indicated that the C domain has good physicochemical stability. Additionally, we evaluated the amino acid substitutions at the Gly-29 position of the C domain, as the Z domain (an artificial B domain) acquired alkaline resistance through a G29A mutation. The G29A mutation proved to increase the alkaline resistance of the C domain, based on BIACORE analysis, although the improvement was significantly smaller than that observed for the B domain. Interestingly, a number of other amino acid mutations at the same position increased alkaline resistance more than did the G29A mutation. This result supports the notion that even a single mutation on the originally alkali-stable C domain would improve its alkaline stability. An engineered protein A based on this C domain is expected to show remarkable performance as an affinity ligand for immunoglobulin. PMID:23868198
Sequence swapping does not result in conformation swapping for the beta4/beta5 and beta8/beta9 beta-hairpin turns in human acidic fibroblast growth factor.

PubMed

Kim, Jaewon; Lee, Jihun; Brych, Stephen R; Logan, Timothy M; Blaber, Michael

2005-02-01

The beta-turn is the most common type of nonrepetitive structure in globular proteins, comprising ~25% of all residues; however, a detailed understanding of effects of specific residues upon beta-turn stability and conformation is lacking. Human acidic fibroblast growth factor (FGF-1) is a member of the beta-trefoil superfold and contains a total of five beta-hairpin structures (antiparallel beta-sheets connected by a reverse turn). beta-Turns related by the characteristic threefold structural symmetry of this superfold exhibit different primary structures, and in some cases, different secondary structures. As such, they represent a useful system with which to study the role that turn sequences play in determining structure, stability, and folding of the protein. Two turns related by the threefold structural symmetry, the beta4/beta5 and beta8/beta9 turns, were subjected to both sequence-swapping and poly-glycine substitution mutations, and the effects upon stability, folding, and structure were investigated. In the wild-type protein these turns are of identical length, but exhibit different conformations. These conformations were observed to be retained during sequence-swapping and glycine substitution mutagenesis. The results indicate that the beta-turn structure at these positions is not determined by the turn sequence. Structural analysis suggests that residues flanking the turn are a primary structural determinant of the conformation within the turn.
Evolution of multi-drug resistant HCV clones from pre-existing resistant-associated variants during direct-acting antiviral therapy determined by third-generation sequencing

NASA Astrophysics Data System (ADS)

Takeda, Haruhiko; Ueda, Yoshihide; Inuzuka, Tadashi; Yamashita, Yukitaka; Osaki, Yukio; Nasu, Akihiro; Umeda, Makoto; Takemura, Ryo; Seno, Hiroshi; Sekine, Akihiro; Marusawa, Hiroyuki

2017-03-01

Resistance-associated variant (RAV) is one of the most significant clinical challenges in treating HCV-infected patients with direct-acting antivirals (DAAs). We investigated the viral dynamics in patients receiving DAAs using third-generation sequencing technology. Among 283 patients with genotype-1b HCV receiving daclatasvir + asunaprevir (DCV/ASV), 32 (11.3%) failed to achieve sustained virological response (SVR). Conventional ultra-deep sequencing of HCV genome was performed in 104 patients (32 non-SVR, 72 SVR), and detected representative RAVs in all non-SVR patients at baseline, including Y93H in 28 (87.5%). Long contiguous sequences spanning NS3 to NS5A regions of each viral clone in 12 sera from 6 representative non-SVR patients were determined by third-generation sequencing, and showed the concurrent presence of several synonymous mutations linked to resistance-associated substitutions in a subpopulation of pre-existing RAVs and dominant isolates at treatment failure. Phylogenetic analyses revealed close genetic distances between pre-existing RAVs and dominant RAVs at treatment failure. In addition, multiple drug-resistant mutations developed on pre-existing RAVs after DCV/ASV in all non-SVR cases. In conclusion, multi-drug resistant viral clones at treatment failure certainly originated from a subpopulation of pre-existing RAVs in HCV-infected patients. Those RAVs were selected for and became dominant with the acquisition of multiple resistance-associated substitutions under DAA treatment pressure.

Variant of TREM2 Associated with the Risk of Alzheimer’s Disease

PubMed Central

Jonsson, Thorlakur; Stefansson, Hreinn; Steinberg, Stacy; Jonsdottir, Ingileif; Jonsson, Palmi V.; Snaedal, Jon; Bjornsson, Sigurbjorn; Huttenlocher, Johanna; Levey, Allan I.; Lah, James J.; Rujescu, Dan; Hampel, Harald; Giegling, Ina; Andreassen, Ole A.; Engedal, Knut; Ulstein, Ingun; Djurovic, Srdjan; Ibrahim-Verbaas, Carla; Hofman, Albert; Ikram, M. Arfan; van Duijn, Cornelia M; Thorsteinsdottir, Unnur; Kong, Augustine; Stefansson, Kari

2013-01-01

BACKGROUND Sequence variants, including the ε4 allele of apolipoprotein E, have been associated with the risk of the common late-onset form of Alzheimer’s disease. Few rare variants affecting the risk of late-onset Alzheimer’s disease have been found. METHODS We obtained the genome sequences of 2261 Icelanders and identified sequence variants that were likely to affect protein function. We imputed these variants into the genomes of patients with Alzheimer’s disease and control participants and then tested for an association with Alzheimer’s disease. We performed replication tests using case–control series from the United States, Norway, the Netherlands, and Germany. We also tested for a genetic association with cognitive function in a population of unaffected elderly persons. RESULTS A rare missense mutation (rs75932628-T) in the gene encoding the triggering receptor expressed on myeloid cells 2 (TREM2), which was predicted to result in an R47H substitution, was found to confer a significant risk of Alzheimer’s disease in Iceland (odds ratio, 2.92; 95% confidence interval [CI], 2.09 to 4.09; P = 3.42×10−10). The mutation had a frequency of 0.46% in controls 85 years of age or older. We observed the association in additional sample sets (odds ratio, 2.90; 95% CI, 2.16 to 3.91; P = 2.1×10−12 in combined discovery and replication samples). We also found that carriers of rs75932628-T between the ages of 80 and 100 years without Alzheimer’s disease had poorer cognitive function than noncarriers (P = 0.003). CONCLUSIONS Our findings strongly implicate variant TREM2 in the pathogenesis of Alzheimer’s disease. Given the reported antiinflammatory role of TREM2 in the brain, the R47H substitution may lead to an increased predisposition to Alzheimer’s disease through impaired containment of inflammatory processes. (Funded by the National Institute on Aging and others.) PMID:23150908
Unfolding thermodynamics of intramolecular G-quadruplexes: base sequence contributions of the loops.

PubMed

Olsen, Chris M; Lee, Hui-Ting; Marky, Luis A

2009-03-05

G-quadruplexes are a highly studied DNA motif with a potential role in a variety of cellular processes and more recently are considered novel targets for drug therapy in aging and anticancer research. In this work, we have investigated the thermodynamic contributions of the loops on the stable formation of G-quadruplexes. Specifically, we use a combination of UV, circular dichroism (CD) and fluorescence spectroscopies, and differential scanning calorimetry (DSC) to determine thermodynamic profiles, including the differential binding of ions and water, for the unfolding of the thrombin aptamer: d(GGT2GGTGTGGT2GG) that is referred to as G2. The sequences in italics, TGT and T2, are known to form loops. Other sequences examined contained base substitutions in the TGT loop (TAT, TCT, TTT, TAPT, and UUU), in the T2 loops (T4, U2), or in both loops (UGU and U2, UUU and U2). The CD spectra of all molecules show a positive band centered at 292 nm, which corresponds to the "chair" conformation. The UV and DSC melting curves of each G-quadruplex show monophasic transitions with transition temperatures (T(M)s) that remained constant with increasing strand concentration, confirming their intramolecular formation. These G-quadruplexes unfold with T(M)s in the range from 43.2 to 56.5 degrees C and endothermic enthalpies from 22.9 to 37.2 kcal/mol. Subtracting the contribution of a G-quartet stack from each experimental profile indicated that the presence of the loops stabilize each G-quadruplex by favorable enthalpy contributions, larger differential binding of K+ ions (0.1-0.6 mol K+/ mol), and a variable uptake/release of water molecules (-6 to 8 mol H2O/mol). The thermodynamic contributions for these specific base substitutions are discussed in terms of loop stacking (base-base stacking within the loops) and their hydration effects.
Analysis of evolutionary rate of HIV-1 subtype B using blood donor samples in Japan.

PubMed

Shinohara, Naoya; Matsumoto, Chieko; Matsubayashi, Keiji; Nagai, Tadashi; Satake, Masahiro

2018-06-01

There are few reports on HIV-1 intra-host evolutionary rate in asymptomatic treatment-naïve patients. Here, the HIV-1 intra-host evolutionary rate was estimated based on HIV-1 RNA sequences from plasma samples of blood donors in Japan. Blood donors were assumed to have received no treatment for and have no symptoms of HIV-1 infection because they were healthy, and declared no risky behaviors of HIV-1 infection on a self-reported questionnaire or interview followed by donation. HIV-1 RNA was obtained from 85 plasma samples from 36 blood donors who donated blood multiple times and were HIV-1-positive. The C2V3C3 region which encodes for a part of the envelope protein, and the V3 loop in the C2V3C3 region were analyzed by RT-PCR and direct sequencing, and the sequences were compared. The nucleotide substitution rate was calculated by linear regression. All HIV-1 samples analyzed were classified as subtype B. The mean nucleotide substitution rate in C2V3C3 was calculated to be 6.2 × 10 -3 -1.8 × 10 -2 /site/year (V3: 4.5 × 10 -3 -2.3 × 10 -2 /site/year). The mean non-synonymous substitution rate in C2V3C3 was calculated to be 5.2 × 10 -3 -1.7 × 10 -2 /site/year (V3: 4.5 × 10 -3 -2.1 × 10 -2 /site/year). The mean synonymous substitution rate in C2V3C3 was calculated to be 1.1 × 10 -4 -2.3 × 10 -3 /site/year (V3: 2.9 × 10 -3 /site/year). Among HIV-1 subtype B RNA-positive blood donors in Japan, the nucleotide substitution rate in C2V3C3 was estimated to be higher than that of reported cases using HIV-1 samples mainly obtained from AIDS patients. Compared to AIDS patients, immune responses against HIV-1 are probably more effective in HIV-1 RNA-positive blood donors. Consequently, immune pressure presumably promotes mutation of the virus genome.
Ethynyl and substituted ethynyl-terminated polysulfones

NASA Technical Reports Server (NTRS)

Hergenrother, P. M. (Inventor)

1986-01-01

Ethynyl and substituted ethynyl-terminated polysulfones and their synthesis are disclosed. These polysulfones are thermally cured to induce cross-linking and chain extension, producing a polymer system with improved solvent resistance and use temperatures. Also disclosed are substituted 4-ethynylbenzoyl chlorides as precursors to the substituted ethynyl-terminated polysulfones and a process for preparing the same.
Statistical Physics of Complex Substitutive Systems

NASA Astrophysics Data System (ADS)

Jin, Qing

Diffusion processes are central to human interactions. Despite extensive studies that span multiple disciplines, our knowledge is limited to spreading processes in non-substitutive systems. Yet, a considerable number of ideas, products, and behaviors spread by substitution; to adopt a new one, agents must give up an existing one. This captures the spread of scientific constructs--forcing scientists to choose, for example, a deterministic or probabilistic worldview, as well as the adoption of durable items, such as mobile phones, cars, or homes. In this dissertation, I develop a statistical physics framework to describe, quantify, and understand substitutive systems. By empirically exploring three collected high-resolution datasets pertaining to such systems, I build a mechanistic model describing substitutions, which not only analytically predicts the universal macroscopic phenomenon discovered in the collected datasets, but also accurately captures the trajectories of individual items in a complex substitutive system, demonstrating a high degree of regularity and universality in substitutive systems. I also discuss the origins and insights of the parameters in the substitution model and possible generalization form of the mathematical framework. The systematical study of substitutive systems presented in this dissertation could potentially guide the understanding and prediction of all spreading phenomena driven by substitutions, from electric cars to scientific paradigms, and from renewable energy to new healthy habits.
Object individuation is invariant to attentional diffusion: Changes in the size of the attended region do not interact with object-substitution masking.

PubMed

Goodhew, Stephanie C; Edwards, Mark

2016-12-01

When the human brain is confronted with complex and dynamic visual scenes, two pivotal processes are at play: visual attention (the process of selecting certain aspects of the scene for privileged processing) and object individuation (determining what information belongs to a continuing object over time versus what represents two or more distinct objects). Here we examined whether these processes are independent or whether they interact. Object-substitution masking (OSM) has been used as a tool to examine such questions, however, there is controversy surrounding whether OSM reflects object individuation versus substitution processes. The object-individuation account is agnostic regarding the role of attention, whereas object-substitution theory stipulates a pivotal role for attention. There have been attempts to investigate the role of attention in OSM, but they have been subject to alternative explanations. Here, therefore, we manipulated the size of the attended region, a pure and uncontaminated attentional manipulation, and examined the impact on OSM. Across three experiments, there was no interaction. This refutes the object-substitution theory of OSM. This, in turn, tell us that object-individuation is invariant the distribution of attention. Copyright © 2016 Elsevier B.V. All rights reserved.
A minimal peptide scaffold for beta-turn display: optimizing a strand position in disulfide-cyclized beta-hairpins.

PubMed

Cochran, A G; Tong, R T; Starovasnik, M A; Park, E J; McDowell, R S; Theaker, J E; Skelton, N J

2001-01-31

Phage display of peptide libraries has become a powerful tool for the evolution of novel ligands that bind virtually any protein target. However, the rules governing conformational preferences in natural peptides are poorly understood, and consequently, structure-activity relationships in these molecules can be difficult to define. In an effort to simplify this process, we have investigated the structural stability of 10-residue, disulfide-constrained beta-hairpins and assessed their suitability as scaffolds for beta-turn display. Using disulfide formation as a probe, relative free energies of folding were measured for 19 peptides that differ at a one strand position. A tryptophan substitution promotes folding to a remarkable degree. NMR analysis confirms that the measured energies correlate well with the degree of beta-hairpin structure in the disulfide-cyclized peptides. Reexamination of a subset of the strand substitutions in peptides with different turn sequences reveals linear free energy relationships, indicating that turns and strand-strand interactions make independent, additive contributions to hairpin stability. Significantly, the tryptophan strand substitution is highly stabilizing with all turns tested, and peptides that display model turns or the less stable C'-C' ' turn of CD4 on this tryptophan "stem" are highly structured beta-hairpins in water. Thus, we have developed a small, structured beta-turn scaffold, containing only natural L-amino acids, that may be used to display peptide libraries of limited conformational diversity on phage.
Integrating mRNA and protein sequencing enables the detection and quantitative profiling of natural protein sequence variants of Populus trichocarpa

DOE Office of Scientific and Technical Information (OSTI.GOV)

Abraham, Paul E.; Wang, Xiaojing; Ranjan, Priya

The availability of next-generation sequencing technologies has rapidly transformed our ability to link genotypes to phenotypes, and as such, promises to facilitate the dissection of genetic contribution to complex traits. Although discoveries of genetic associations will further our understanding of biology, once candidate variants have been identified, investigators are faced with the challenge of characterizing the functional effects on proteins encoded by such genes. Here we show how next-generation RNA sequencing data can be exploited to construct genotype-specific protein sequence databases, which provide a clearer picture of the molecular toolbox underlying cellular and organismal processes and their variation in amore » natural population. For this study, we used two individual genotypes (DENA-17-3 and VNDL-27-4) from a recent genome wide association (GWA) study of Populus trichocarpa, an obligate outcrosser that exhibits tremendous phenotypic variation across the natural population. This strategy allowed us to comprehensively catalogue proteins containing single amino acid polymorphisms (SAAPs) and insertions and deletions (INDELS). Based on large-scale identification of SAAPs, we profiled the frequency of 128 types of naturally occurring amino acid substitutions, with a subset of SAAPs occurring in regions of the genome having strong polymorphism patterns consistent with recent positive and/or divergent selection. In addition, we were able to explore the diploid landscape of Populus at the proteome-level, allowing the characterization of heterozygous variants.« less
Integrating mRNA and protein sequencing enables the detection and quantitative profiling of natural protein sequence variants of Populus trichocarpa

DOE PAGES

Abraham, Paul E.; Wang, Xiaojing; Ranjan, Priya; ...

2015-10-20

The availability of next-generation sequencing technologies has rapidly transformed our ability to link genotypes to phenotypes, and as such, promises to facilitate the dissection of genetic contribution to complex traits. Although discoveries of genetic associations will further our understanding of biology, once candidate variants have been identified, investigators are faced with the challenge of characterizing the functional effects on proteins encoded by such genes. Here we show how next-generation RNA sequencing data can be exploited to construct genotype-specific protein sequence databases, which provide a clearer picture of the molecular toolbox underlying cellular and organismal processes and their variation in amore » natural population. For this study, we used two individual genotypes (DENA-17-3 and VNDL-27-4) from a recent genome wide association (GWA) study of Populus trichocarpa, an obligate outcrosser that exhibits tremendous phenotypic variation across the natural population. This strategy allowed us to comprehensively catalogue proteins containing single amino acid polymorphisms (SAAPs) and insertions and deletions (INDELS). Based on large-scale identification of SAAPs, we profiled the frequency of 128 types of naturally occurring amino acid substitutions, with a subset of SAAPs occurring in regions of the genome having strong polymorphism patterns consistent with recent positive and/or divergent selection. In addition, we were able to explore the diploid landscape of Populus at the proteome-level, allowing the characterization of heterozygous variants.« less
Identification and expression analysis of cDNA encoding insulin-like growth factor 2 in horses

PubMed Central

KIKUCHI, Kohta; SASAKI, Keisuke; AKIZAWA, Hiroki; TSUKAHARA, Hayato; BAI, Hanako; TAKAHASHI, Masashi; NAMBO, Yasuo; HATA, Hiroshi; KAWAHARA, Manabu

2017-01-01

Insulin-like growth factor 2 (IGF2) is responsible for a broad range of physiological processes during fetal development and adulthood, but genomic analyses of IGF2 containing the 5ʹ- and 3ʹ-untranslated regions (UTRs) in equines have been limited. In this study, we characterized the IGF2 mRNA containing the UTRs, and determined its expression pattern in the fetal tissues of horses. The complete equine IGF2 mRNA sequence harboring another exon approximately 2.8 kb upstream from the canonical transcription start site was identified as a new transcript variant. As this upstream exon did not contain the start codon, the amino acid sequence was identical to the canonical variant. Analysis of the deduced amino acid sequence revealed that the protein possessed two major domains, IlGF and IGF2_C, and analysis of IGF2 sequence polymorphism in fetal tissues of Hokkaido native horse and Thoroughbreds revealed a single nucleotide polymorphism (T to C transition) at position 398 in Thoroughbreds, which caused an amino acid substitution at position 133 in the IGF2 sequence. Furthermore, the expression pattern of the IGF2 mRNA in the fetal tissues of horses was determined for the first time, and was found to be consistent with those of other species. Taken together, these results suggested that the transcriptional and translational products of the IGF2 gene have conserved functions in the fetal development of mammals, including horses. PMID:29151450
Whole genome characterization of human influenza A(H1N1)pdm09 viruses isolated from Kenya during the 2009 pandemic.

PubMed

Gachara, George; Symekher, Samuel; Otieno, Michael; Magana, Japheth; Opot, Benjamin; Bulimo, Wallace

2016-06-01

An influenza pandemic caused by a novel influenza virus A(H1N1)pdm09 spread worldwide in 2009 and is estimated to have caused between 151,700 and 575,400 deaths globally. While whole genome data on new virus enables a deeper insight in the pathogenesis, epidemiology, and drug sensitivities of the circulating viruses, there are relatively limited complete genetic sequences available for this virus from African countries. We describe herein the full genome analysis of influenza A(H1N1)pdm09 viruses isolated in Kenya between June 2009 and August 2010. A total of 40 influenza A(H1N1)pdm09 viruses isolated during the pandemic were selected. The segments from each isolate were amplified and directly sequenced. The resulting sequences of individual gene segments were concatenated and used for subsequent analysis. These were used to infer phylogenetic relationships and also to reconstruct the time of most recent ancestor, time of introduction into the country, rates of substitution and to estimate a time-resolved phylogeny. The Kenyan complete genome sequences clustered with globally distributed clade 2 and clade 7 sequences but local clade 2 viruses did not circulate beyond the introductory foci while clade 7 viruses disseminated country wide. The time of the most recent common ancestor was estimated between April and June 2009, and distinct clusters circulated during the pandemic. The complete genome had an estimated rate of nucleotide substitution of 4.9×10(-3) substitutions/site/year and greater diversity in surface expressed proteins was observed. We show that two clades of influenza A(H1N1)pdm09 virus were introduced into Kenya from the UK and the pandemic was sustained as a result of importations. Several closely related but distinct clusters co-circulated locally during the peak pandemic phase but only one cluster dominated in the late phase of the pandemic suggesting that it possessed greater adaptability. Copyright © 2016 Elsevier B.V. All rights reserved.
Global genetic diversity of the Plasmodium vivax transmission-blocking vaccine candidate Pvs48/45.

PubMed

Vallejo, Andres F; Martinez, Nora L; Tobon, Alejandra; Alger, Jackeline; Lacerda, Marcus V; Kajava, Andrey V; Arévalo-Herrera, Myriam; Herrera, Sócrates

2016-04-12

Plasmodium vivax 48/45 protein is expressed on the surface of gametocytes/gametes and plays a key role in gamete fusion during fertilization. This protein was recently expressed in Escherichia coli host as a recombinant product that was highly immunogenic in mice and monkeys and induced antibodies with high transmission-blocking activity, suggesting its potential as a P. vivax transmission-blocking vaccine candidate. To determine sequence polymorphism of natural parasite isolates and its potential influence on the protein structure, all pvs48/45 sequences reported in databases from around the world as well as those from low-transmission settings of Latin America were compared. Plasmodium vivax parasite isolates from malaria-endemic regions of Colombia, Brazil and Honduras (n = 60) were used to sequence the Pvs48/45 gene, and compared to those previously reported to GenBank and PlasmoDB (n = 222). Pvs48/45 gene haplotypes were analysed to determine the functional significance of genetic variation in protein structure and vaccine potential. Nine non-synonymous substitutions (E35K, Y196H, H211N, K250N, D335Y, E353Q, A376T, K390T, K418R) and three synonymous substitutions (I73, T149, C156) that define seven different haplotypes were found among the 282 isolates from nine countries when compared with the Sal I reference sequence. Nucleotide diversity (π) was 0.00173 for worldwide samples (range 0.00033-0.00216), resulting in relatively high diversity in Myanmar and Colombia, and low diversity in Mexico, Peru and South Korea. The two most frequent substitutions (E353Q: 41.9 %, K250N: 39.5 %) were predicted to be located in antigenic regions without affecting putative B cell epitopes or the tertiary protein structure. There is limited sequence polymorphism in pvs48/45 with noted geographical clustering among Asian and American isolates. The low genetic diversity of the protein does not influence the predicted antigenicity or protein structure and, therefore, supports its further development as transmission-blocking vaccine candidate.
Antibiotic Susceptibility and Sequence Type Distribution of Ureaplasma Species Isolated from Genital Samples in Switzerland.

PubMed

Schneider, Sarah C; Tinguely, Regula; Droz, Sara; Hilty, Markus; Donà, Valentina; Bodmer, Thomas; Endimiani, Andrea

2015-10-01

Antibiotic resistance in Ureaplasma urealyticum/Ureaplasma parvum and Mycoplasma hominis is an issue of increasing importance. However, data regarding the susceptibility and, more importantly, the clonality of these organisms are limited. We analyzed 140 genital samples obtained in Bern, Switzerland, in 2014. Identification and antimicrobial susceptibility tests were performed by using the Mycoplasma IST 2 kit and sequencing of 16S rRNA genes. MICs for ciprofloxacin and azithromycin were obtained in broth microdilution assays. Clonality was analyzed with PCR-based subtyping and multilocus sequence typing (MLST), whereas quinolone resistance and macrolide resistance were studied by sequencing gyrA, gyrB, parC, and parE genes, as well as 23S rRNA genes and genes encoding L4/L22 ribosomal proteins. A total of 103 samples were confirmed as positive for U. urealyticum/U. parvum, whereas 21 were positive for both U. urealyticum/U. parvum and M. hominis. According to the IST 2 kit, the rates of nonsusceptibility were highest for ciprofloxacin (19.4%) and ofloxacin (9.7%), whereas low rates were observed for clarithromycin (4.9%), erythromycin (1.9%), and azithromycin (1%). However, inconsistent results between microdilution and IST 2 kit assays were recorded. Various sequence types (STs) observed previously in China (ST1, ST2, ST4, ST9, ST22, and ST47), as well as eight novel lineages, were detected. Only some quinolone-resistant isolates had amino acid substitutions in ParC (Ser83Leu in U. parvum of serovar 6) and ParE (Val417Thr in U. parvum of serovar 1 and the novel Thr417Val substitution in U. urealyticum). Isolates with mutations in 23S rRNA or substitutions in L4/L22 were not detected. This is the first study analyzing the susceptibility of U. urealyticum/U. parvum isolates in Switzerland and the clonality outside China. Resistance rates were low compared to those in other countries. We hypothesize that some hyperepidemic STs spread worldwide via sexual intercourse. Large combined microbiological and clinical studies should address this important issue. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Antibiotic Susceptibility and Sequence Type Distribution of Ureaplasma Species Isolated from Genital Samples in Switzerland

PubMed Central

Schneider, Sarah C.; Tinguely, Regula; Droz, Sara; Hilty, Markus; Donà, Valentina; Bodmer, Thomas

2015-01-01

Antibiotic resistance in Ureaplasma urealyticum/Ureaplasma parvum and Mycoplasma hominis is an issue of increasing importance. However, data regarding the susceptibility and, more importantly, the clonality of these organisms are limited. We analyzed 140 genital samples obtained in Bern, Switzerland, in 2014. Identification and antimicrobial susceptibility tests were performed by using the Mycoplasma IST 2 kit and sequencing of 16S rRNA genes. MICs for ciprofloxacin and azithromycin were obtained in broth microdilution assays. Clonality was analyzed with PCR-based subtyping and multilocus sequence typing (MLST), whereas quinolone resistance and macrolide resistance were studied by sequencing gyrA, gyrB, parC, and parE genes, as well as 23S rRNA genes and genes encoding L4/L22 ribosomal proteins. A total of 103 samples were confirmed as positive for U. urealyticum/U. parvum, whereas 21 were positive for both U. urealyticum/U. parvum and M. hominis. According to the IST 2 kit, the rates of nonsusceptibility were highest for ciprofloxacin (19.4%) and ofloxacin (9.7%), whereas low rates were observed for clarithromycin (4.9%), erythromycin (1.9%), and azithromycin (1%). However, inconsistent results between microdilution and IST 2 kit assays were recorded. Various sequence types (STs) observed previously in China (ST1, ST2, ST4, ST9, ST22, and ST47), as well as eight novel lineages, were detected. Only some quinolone-resistant isolates had amino acid substitutions in ParC (Ser83Leu in U. parvum of serovar 6) and ParE (Val417Thr in U. parvum of serovar 1 and the novel Thr417Val substitution in U. urealyticum). Isolates with mutations in 23S rRNA or substitutions in L4/L22 were not detected. This is the first study analyzing the susceptibility of U. urealyticum/U. parvum isolates in Switzerland and the clonality outside China. Resistance rates were low compared to those in other countries. We hypothesize that some hyperepidemic STs spread worldwide via sexual intercourse. Large combined microbiological and clinical studies should address this important issue. PMID:26195516
Epistatic interactions influence terrestrial–marine functional shifts in cetacean rhodopsin

PubMed Central

2017-01-01

Like many aquatic vertebrates, whales have blue-shifting spectral tuning substitutions in the dim-light visual pigment, rhodopsin, that are thought to increase photosensitivity in underwater environments. We have discovered that known spectral tuning substitutions also have surprising epistatic effects on another function of rhodopsin, the kinetic rates associated with light-activated intermediates. By using absorbance spectroscopy and fluorescence-based retinal release assays on heterologously expressed rhodopsin, we assessed both spectral and kinetic differences between cetaceans (killer whale) and terrestrial outgroups (hippo, bovine). Mutation experiments revealed that killer whale rhodopsin is unusually resilient to pleiotropic effects on retinal release from key blue-shifting substitutions (D83N and A292S), largely due to a surprisingly specific epistatic interaction between D83N and the background residue, S299. Ancestral sequence reconstruction indicated that S299 is an ancestral residue that predates the evolution of blue-shifting substitutions at the origins of Cetacea. Based on these results, we hypothesize that intramolecular epistasis helped to conserve rhodopsin's kinetic properties while enabling blue-shifting spectral tuning substitutions as cetaceans adapted to aquatic environments. Trade-offs between different aspects of molecular function are rarely considered in protein evolution, but in cetacean and other vertebrate rhodopsins, may underlie multiple evolutionary scenarios for the selection of specific amino acid substitutions. PMID:28250185
Influence of partial replacement of NaCl with KCl on profiles of volatile compounds in dry-cured bacon during processing.

PubMed

Wu, Haizhou; Zhuang, Hong; Zhang, Yingyang; Tang, Jing; Yu, Xiang; Long, Men; Wang, Jiamei; Zhang, Jianhao

2015-04-01

This study investigated the influence of partial substitution of NaCl with KCl on the formation of volatile compounds in bacons during processing using a purge and trap dynamic headspace GC/MS system. Three substitutions were 0% KCl (I), 40% KCl (II), and 70% KCl (III). The profiles of the volatile compounds significantly changed during processing, particularly during the drying/ripening. At the end of process, the bacons from substitution III formed significantly higher levels of lipid-derived volatiles, such as straight chain aldehydes, hydrocarbons than bacons from substitution I and II, whereas the latter formed higher levels of volatiles from amino acid degradation such as 3-methylbutanal. There were very few differences in volatile formation between 0% and 40% KCl application. These results suggest that K(+) substitution of Na(+) by more than 40% may significantly change profiles of volatiles in finished dry-cured bacons and therefore would result in changes in the product aroma and/or flavour. Copyright © 2014 Elsevier Ltd. All rights reserved.
Evolution of bacterial-like phosphoprotein phosphatases in photosynthetic eukaryotes features ancestral mitochondrial or archaeal origin and possible lateral gene transfer.

PubMed

Uhrig, R Glen; Kerk, David; Moorhead, Greg B

2013-12-01

Protein phosphorylation is a reversible regulatory process catalyzed by the opposing reactions of protein kinases and phosphatases, which are central to the proper functioning of the cell. Dysfunction of members in either the protein kinase or phosphatase family can have wide-ranging deleterious effects in both metazoans and plants alike. Previously, three bacterial-like phosphoprotein phosphatase classes were uncovered in eukaryotes and named according to the bacterial sequences with which they have the greatest similarity: Shewanella-like (SLP), Rhizobiales-like (RLPH), and ApaH-like (ALPH) phosphatases. Utilizing the wealth of data resulting from recently sequenced complete eukaryotic genomes, we conducted database searching by hidden Markov models, multiple sequence alignment, and phylogenetic tree inference with Bayesian and maximum likelihood methods to elucidate the pattern of evolution of eukaryotic bacterial-like phosphoprotein phosphatase sequences, which are predominantly distributed in photosynthetic eukaryotes. We uncovered a pattern of ancestral mitochondrial (SLP and RLPH) or archaeal (ALPH) gene entry into eukaryotes, supplemented by possible instances of lateral gene transfer between bacteria and eukaryotes. In addition to the previously known green algal and plant SLP1 and SLP2 protein forms, a more ancestral third form (SLP3) was found in green algae. Data from in silico subcellular localization predictions revealed class-specific differences in plants likely to result in distinct functions, and for SLP sequences, distinctive and possibly functionally significant differences between plants and nonphotosynthetic eukaryotes. Conserved carboxyl-terminal sequence motifs with class-specific patterns of residue substitutions, most prominent in photosynthetic organisms, raise the possibility of complex interactions with regulatory proteins.
Landscape of Insertion Polymorphisms in the Human Genome

PubMed Central

Onozawa, Masahiro; Goldberg, Liat; Aplan, Peter D.

2015-01-01

Nucleotide substitutions, small (<50 bp) insertions or deletions (indels), and large (>50 bp) deletions are well-known causes of genetic variation within the human genome. We recently reported a previously unrecognized form of polymorphic insertions, termed templated sequence insertion polymorphism (TSIP), in which the inserted sequence was templated from a distant genomic region, and was inserted in the genome through reverse transcription of an RNA intermediate. TSIPs can be grouped into two classes based on nucleotide sequence features at the insertion junctions; class 1 TSIPs show target site duplication, polyadenylation, and preference for insertion at a 5′-TTTT/A-3′ sequence, suggesting a LINE-1 based insertion mechanism, whereas class 2 TSIPs show features consistent with repair of a DNA double strand break by nonhomologous end joining. To gain a more complete picture of TSIPs throughout the human population, we evaluated whole-genome sequence from 52 individuals, and identified 171 TSIPs. Most individuals had 25–30 TSIPs, and common (present in >20% of individuals) TSIPs were found in individuals throughout the world, whereas rare TSIPs tended to cluster in specific geographic regions. The number of rare TSIPs was greater than the number of common TSIPs, suggesting that TSIP generation is an ongoing process. Intriguingly, mitochondrial sequences were a frequent template for class 2 insertions, used more commonly than any nuclear chromosome. Similar to single nucleotide polymorphisms and indels, we suspect that these TSIPs may be important for the generation of human diversity and genetic diseases, and can be useful in tracking historical migration of populations. PMID:25745018
Evolutionary growth process of highly conserved sequences in vertebrate genomes.

PubMed

Ishibashi, Minaka; Noda, Akiko Ogura; Sakate, Ryuichi; Imanishi, Tadashi

2012-08-01

Genome sequence comparison between evolutionarily distant species revealed ultraconserved elements (UCEs) among mammals under strong purifying selection. Most of them were also conserved among vertebrates. Because they tend to be located in the flanking regions of developmental genes, they would have fundamental roles in creating vertebrate body plans. However, the evolutionary origin and selection mechanism of these UCEs remain unclear. Here we report that UCEs arose in primitive vertebrates, and gradually grew in vertebrate evolution. We searched for UCEs in two teleost fishes, Tetraodon nigroviridis and Oryzias latipes, and found 554 UCEs with 100% identity over 100 bps. Comparison of teleost and mammalian UCEs revealed 43 pairs of common, jawed-vertebrate UCEs (jUCE) with high sequence identities, ranging from 83.1% to 99.2%. Ten of them retain lower similarities to the Petromyzon marinus genome, and the substitution rates of four non-exonic jUCEs were reduced after the teleost-mammal divergence, suggesting that robust conservation had been acquired in the jawed vertebrate lineage. Our results indicate that prototypical UCEs originated before the divergence of jawed and jawless vertebrates and have been frozen as perfect conserved sequences in the jawed vertebrate lineage. In addition, our comparative sequence analyses of UCEs and neighboring regions resulted in a discovery of lineage-specific conserved sequences. They were added progressively to prototypical UCEs, suggesting step-wise acquisition of novel regulatory roles. Our results indicate that conserved non-coding elements (CNEs) consist of blocks with distinct evolutionary history, each having been frozen since different evolutionary era along the vertebrate lineage. Copyright © 2012 Elsevier B.V. All rights reserved.
A novel HLA-B allele, B*5214, detected in a Taiwanese volunteer bone marrow donor using a sequence-based typing method.

PubMed

Chen, M J; Chu, C C; Shyr, M H; Lin, C L; Lin, P Y; Yang, K L

2010-02-01

HLA-B*5214, a novel rare allele of HLA-B*52 variant, was found in a Taiwanese volunteer bone marrow donor by sequence-based typing method. The sequence of B*5214 is identical to that of B*520101 in exon 2 but differs from B*520101 in exon 3 at nucleotide positions 419 A-->T and 435 A-->G. Alteration of these two nucleotides resulted an amino acid substitution at amino acid residue 116 Y-->F ( TAC-->TTC) and a silent exchange at residue 121 K-->K (AAA-->AAG).

Nucleotide variability in the 5-enolpyruvylshikimate-3-phosphate synthase gene from Eleusine indica (L.) Gaertn.

PubMed

Chong, J L; Wickneswari, R; Ismail, B S; Salmijah, S

2008-02-01

This study reports the results of the partial DNA sequence analysis of the 5-enolpyruvyl-shikimate-3-phosphate synthase (EPSPS) gene in glyphosate-resistant (R) and glyphosate-susceptible (S) biotypes of Eleusine indica (L.) Gaertn from Peninsular Malaysia. Sequencing results revealed point mutation at nucleotide position 875 in the R biotypes of Bidor, Chaah and Temerloh. In the Chaah R population, substitution of cytosine (C) to adenine (A) resulted in the change of threonine (Thr106) to proline (Pro106) and from C to thymidine (T) in the Bidor R population, leading to serine (Ser106) from Pro106. As for the Temerloh R, C was substituted by T resulting in the change of Pro106 to Ser106. A new mutation previously undetected in the Temerloh R was revealed with C being substituted with A, resulting in the change of Pro106 to Thr106 indicating multiple founding events rather than to the spread of a single resistant allele. There was no point mutation recorded at nucleotide position 875 previously demonstrated to play a pivotal role in conferring glyphosate resistance to E. indica for the Lenggeng, Kuala Selangor, Melaka R populations. Thus, there may be another resistance mechanism yet undiscovered in the resistant Lenggeng, Kuala Selangor and Melaka populations.
Primary structure of Lep d I, the main Lepidoglyphus destructor allergen.

PubMed

Varela, J; Ventas, P; Carreira, J; Barbas, J A; Gimenez-Gallego, G; Polo, F

1994-10-01

The most relevant allergen of the storage mite Lepidoglyphus destructor (Lep d I) has been characterized. Lep d I is a monomer protein of 13273 Da. The primary structure of Lep d I was determined by N-terminal Edman degradation and partially confirmed by cDNA sequencing. Sequence polymorphism was observed at six positions, with non-conservative substitutions in three of them. No potential N-glycosylation site was revealed by peptide sequencing. The 125-residue sequence of Lep d I shows approximately 40% identity (including the six cysteines) with the overlapping regions of group II allergens from the genus Dermatophagoides, which, however, do not share common allergenic epitopes with Lep d I.
Reconstructing the origin and transmission dynamics of the 1967–68 foot-and-mouth disease epidemic in the United Kingdom☆

PubMed Central

Wright, Caroline F.; Knowles, Nick J.; Di Nardo, Antonello; Paton, David J.; Haydon, Daniel T.; King, Donald P.

2013-01-01

A large epidemic of foot-and-mouth disease (FMD) occurred in the United Kingdom (UK) over a seven month period in Northwest England from late 1967 to the summer of 1968. This was preceded by a number of smaller FMD outbreaks in the country, two in 1967, in Hampshire and Warwickshire and one in Northumberland during 1966. The causative agent of all four events was identified as FMD virus (FMDV) serotype O and the source of the large epidemic was attributed to infected bone marrow in lamb products imported from Argentina. However, the diagnostic tools available at the time were unable to entirely rule out connections with the earlier UK FMD outbreaks, as well as other potential sources from Europe. The aim of this study was to apply molecular sequencing to investigate the likely source of this epidemic using VP1 region and full genome (FG) sequences determined directly from clinical epithelium samples (n = 13) or cell culture isolates (n = 6), from this and contemporary outbreaks in the UK, Europe and South America. Analysis of the VP1 sequences provided evidence for at least three separate incursions of FMDV into the UK including one independent introduction that was responsible for the main 1967/68 epidemic. Analysis of FG sequences from the main 1967/68 outbreak (n = 10) revealed nucleotide substitutions at 94 genomic sites providing evidence for the linear accumulation of nucleotide substitutions (rate = 2.42 × 10−5 nt substitutions/site/day). However, there were five samples where this linear relationship was absent, indicating evolutional dormancy of the virus, presumably outside a host. These results help define the evolutionary dynamics of FMDV during an epidemic and contribute to the knowledge and understanding from which to base future outbreak control strategies. PMID:24035793
Genetic Diversity of Hepatitis A Virus in China: VP3-VP1-2A Genes and Evidence of Quasispecies Distribution in the Isolates

PubMed Central

Cao, Jingyuan; Zhou, Wenting; Yi, Yao; Jia, Zhiyuan; Bi, Shengli

2013-01-01

Hepatitis A virus (HAV) is the most common cause of infectious hepatitis throughout the world, spread largely by the fecal-oral route. To characterize the genetic diversity of the virus circulating in China where HAV in endemic, we selected the outbreak cases with identical sequences in VP1-2A junction region and compiled a panel of 42 isolates. The VP3-VP1-2A regions of the HAV capsid-coding genes were further sequenced and analyzed. The quasispecies distribution was evaluated by cloning the VP3 and VP1-2A genes in three clinical samples. Phylogenetic analysis demonstrated that the same genotyping results could be obtained whether using the complete VP3, VP1, or partial VP1-2A genes for analysis in this study, although some differences did exist. Most isolates clustered in sub-genotype IA, and fewer in sub-genotype IB. No amino acid mutations were found at the published neutralizing epitope sites, however, several unique amino acid substitutions in the VP3 or VP1 region were identified, with two amino acid variants closely located to the immunodominant site. Quasispecies analysis showed the mutation frequencies were in the range of 7.22x10-4 -2.33x10-3 substitutions per nucleotide for VP3, VP1, or VP1-2A. When compared with the consensus sequences, mutated nucleotide sites represented the minority of all the analyzed sequences sites. HAV replicated as a complex distribution of closely genetically related variants referred to as quasispecies, and were under negative selection. The results indicate that diverse HAV strains and quasispecies inside the viral populations are presented in China, with unique amino acid substitutions detected close to the immunodominant site, and that the possibility of antigenic escaping mutants cannot be ruled out and needs to be further analyzed. PMID:24069343
Natural Selection and Adaptive Evolution of Leptin in the Ochotona Family Driven by the Cold Environmental Stress

PubMed Central

Yang, Jie; Wang, Zhen Long; Zhao, Xin Quan; Wang, De Peng; Qi, De Lin; Xu, Bao Hong; Ren, Yong Hong; Tian, Hui Fang

2008-01-01

Background Environmental stress can accelerate the evolutionary rate of specific stress-response proteins and create new functions specialized for different environments, enhancing an organism's fitness to stressful environments. Pikas (order Lagomorpha), endemic, non-hibernating mammals in the modern Holarctic Region, live in cold regions at either high altitudes or high latitudes and have a maximum distribution of species diversification confined to the Qinghai-Tibet Plateau. Variations in energy metabolism are remarkable for them living in cold environments. Leptin, an adipocyte-derived hormone, plays important roles in energy homeostasis. Methodology/Principal Findings To examine the extent of leptin variations within the Ochotona family, we cloned the entire coding sequence of pika leptin from 6 species in two regions (Qinghai-Tibet Plateau and Inner Mongolia steppe in China) and the leptin sequences of plateau pikas (O. curzonia) from different altitudes on Qinghai-Tibet Plateau. We carried out both DNA and amino acid sequence analyses in molecular evolution and compared modeled spatial structures. Our results show that positive selection (PS) acts on pika leptin, while nine PS sites located within the functionally significant segment 85-119 of leptin and one unique motif appeared only in pika lineages-the ATP synthase α and β subunit signature site. To reveal the environmental factors affecting sequence evolution of pika leptin, relative rate test was performed in pikas from different altitudes. Stepwise multiple regression shows that temperature is significantly and negatively correlated with the rates of non-synonymous substitution (Ka) and amino acid substitution (Aa), whereas altitude does not significantly affect synonymous substitution (Ks), Ka and Aa. Conclusions/Significance Our findings support the viewpoint that adaptive evolution may occur in pika leptin, which may play important roles in pikas' ecological adaptation to extreme environmental stress. We speculate that cold, and probably not hypoxia, may be the primary environmental factor for driving adaptive evolution of pika leptin. PMID:18213380
Big and slow: phylogenetic estimates of molecular evolution in baleen whales (suborder mysticeti).

PubMed

Jackson, J A; Baker, C S; Vant, M; Steel, D J; Medrano-González, L; Palumbi, S R

2009-11-01

Baleen whales are the largest animals that have ever lived. To develop an improved estimation of substitution rate for nuclear and mitochondrial DNA for this taxon, we implemented a relaxed-clock phylogenetic approach using three fossil calibration dates: the divergence between odontocetes and mysticetes approximately 34 million years ago (Ma), between the balaenids and balaenopterids approximately 28 Ma, and the time to most recent common ancestor within the Balaenopteridae approximately 12 Ma. We examined seven mitochondrial genomes, a large number of mitochondrial control region sequences (219 haplotypes for 465 bp) and nine nuclear introns representing five species of whales, within which multiple species-specific alleles were sequenced to account for within-species diversity (1-15 for each locus). The total data set represents >1.65 Mbp of mitogenome and nuclear genomic sequence. The estimated substitution rate for the humpback whale control region (3.9%/million years, My) was higher than previous estimates for baleen whales but slow relative to other mammal species with similar generation times (e.g., human-chimp mean rate > 20%/My). The mitogenomic third codon position rate was also slow relative to other mammals (mean estimate 1%/My compared with a mammalian average of 9.8%/My for the cytochrome b gene). The mean nuclear genomic substitution rate (0.05%/My) was substantially slower than average synonymous estimates for other mammals (0.21-0.37%/My across a range of studies). The nuclear and mitogenome rate estimates for baleen whales were thus roughly consistent with an 8- to 10-fold slowing due to a combination of large body size and long generation times. Surprisingly, despite the large data set of nuclear intron sequences, there was only weak and conflicting support for alternate hypotheses about the phylogeny of balaenopterid whales, suggesting that interspecies introgressions or a rapid radiation has obscured species relationships in the nuclear genome.
Understanding the evolution and spread of chikungunya virus in the Americas using complete genome sequences.

PubMed

Sahadeo, N S D; Allicock, O M; De Salazar, P M; Auguste, A J; Widen, S; Olowokure, B; Gutierrez, C; Valadere, A M; Polson-Edwards, K; Weaver, S C; Carrington, C V F

2017-01-01

Local transmission of chikungunya virus (CHIKV) was first detected in the Americas in December 2013, after which it spread rapidly throughout the Caribbean islands and American mainland, causing a major chikungunya fever epidemic. Previous phylogenetic analysis of CHIKV from a limited number of countries in the Americas suggests that an Asian genotype strain was responsible, except in Brazil where both Asian and East/Central/South African (ECSA) lineage strains were detected. In this study, we sequenced thirty-three complete CHIKV genomes from viruses isolated in 2014 from fourteen Caribbean islands, the Bahamas and two mainland countries in the Americas. Phylogenetic analyses confirmed that they all belonged to the Asian genotype and clustered together with other Caribbean and mainland sequences isolated during the American outbreak, forming an 'Asian/American' lineage defined by two amino acid substitutions, E2 V368A and 6K L20M, and divided into two well-supported clades. This lineage is estimated to be evolving at a mean rate of 5 × 10 -4 substitutions per site per year (95% higher probability density, 2.9-7.9 × 10 -4 ) and to have arisen from an ancestor introduced to the Caribbean (most likely from Oceania) in about March 2013, 9 months prior to the first report of CHIKV in the Americas. Estimation of evolutionary rates for individual gene regions and selection analyses indicate that (in contrast to the Indian Ocean Lineage that emerged from the ECSA genotype followed by adaptive evolution and with a significantly higher substitution rate) the evolutionary dynamics of the Asian/American lineage are very similar to the rest of the Asian genotype and natural selection does not appear to have played a major role in its emergence. However, several codon sites with evidence of positive selection were identified within the non-structural regions of Asian genotype sequences outside of the Asian/American lineage.
Formulations in Psychotherapy: Admission Interviews and the Conversational Construction of Diagnosis.

PubMed

Bonnin, Juan Eduardo

2017-09-01

In this article, we contribute to understanding the interactional aspects of making clinical diagnosis in mental health care. We observe that therapists, during the "problem presentation" sequence in clinical encounters, often use a specific form of diagnostic formulations to elicit more diagnostically relevant information. By doing so, they often substitute one type of verb with another, following a diagnostic hypothesis. Specifically, in interviews that arrive at a diagnosis of neurosis, therapists formulate with behavioral verbal processes; in interviews that arrive at a diagnosis of psychosis, they do so with material ones. Such formulations often prove useful to define clinical diagnoses. They can, however, also be dangerous in that they may favor the therapist's agenda over the patient's. Our analysis helps therapists not only better understand the diagnostic process but also reflect upon their own use of diagnostic formulations and become aware of the clinical effects of their interactional performance.
The role of skin substitutes in the treatment of burn injuries.

PubMed

Shakespeare, Peter G

2005-01-01

Extensive burn wounds are difficult to manage and repair. Several engineered skin substitutes have been developed to aid in this process. These substitutes are designed with particular objectives in mind which dictate the circumstances under which they can, and should, be employed to promote healing or prepare the burn wound for final closure with autograft. This article discusses some of the rationale behind the use of skin substitutes and reviews some of the substitutes in use at the present time. Current perspectives suggest that skin substitute use is still in its infancy and that there is some way to go before their role in clinical practice becomes clear. Nevertheless the prospect of being able to supply new wound repair components and to influence the healing process to modify outcome and improve the quality of the healed burn wound will ensure a continuing high degree of interest in these potentially useful and beneficial medical devices.
Plastid–Nuclear Interaction and Accelerated Coevolution in Plastid Ribosomal Genes in Geraniaceae

PubMed Central

Weng, Mao-Lun; Ruhlman, Tracey A.; Jansen, Robert K.

2016-01-01

Plastids and mitochondria have many protein complexes that include subunits encoded by organelle and nuclear genomes. In animal cells, compensatory evolution between mitochondrial and nuclear-encoded subunits was identified and the high mitochondrial mutation rates were hypothesized to drive compensatory evolution in nuclear genomes. In plant cells, compensatory evolution between plastid and nucleus has rarely been investigated in a phylogenetic framework. To investigate plastid–nuclear coevolution, we focused on plastid ribosomal protein genes that are encoded by plastid and nuclear genomes from 27 Geraniales species. Substitution rates were compared for five sets of genes representing plastid- and nuclear-encoded ribosomal subunit proteins targeted to the cytosol or the plastid as well as nonribosomal protein controls. We found that nonsynonymous substitution rates (dN) and the ratios of nonsynonymous to synonymous substitution rates (ω) were accelerated in both plastid- (CpRP) and nuclear-encoded subunits (NuCpRP) of the plastid ribosome relative to control sequences. Our analyses revealed strong signals of cytonuclear coevolution between plastid- and nuclear-encoded subunits, in which nonsynonymous substitutions in CpRP and NuCpRP tend to occur along the same branches in the Geraniaceae phylogeny. This coevolution pattern cannot be explained by physical interaction between amino acid residues. The forces driving accelerated coevolution varied with cellular compartment of the sequence. Increased ω in CpRP was mainly due to intensified positive selection whereas increased ω in NuCpRP was caused by relaxed purifying selection. In addition, the many indels identified in plastid rRNA genes in Geraniaceae may have contributed to changes in plastid subunits. PMID:27190001
Purification and characterization of insulin and the C-peptide of proinsulin from Przewalski's horse, zebra, rhino, and tapir (Perissodactyla).

PubMed

Henry, J S; Lance, V A; Conlon, J M

1993-02-01

Within the order Perissodactyla, the primary structure of insulin has been strongly conserved. Insulin from Przewalski's horse and the mountain zebra (suborder Hippomorpha) is the same as that from the domestic horse and differs from insulin from the white rhinoceros and mountain tapir (suborder Ceratomorpha) by a single substitution (Gly-->Ser) at position 9 in the A-chain. A second molecular form of Przewalski's horse insulin isolated in this study was shown to represent the gamma-ethyl ester of the Glu17 residue of the A-chain. This component was probably formed during the extraction of the pancreas with acidified ethanol. The amino acid sequence of the C-peptide of proinsulin has been less well conserved. Zebra C-peptide comprises 31 amino acid residues and differs from Przewalski's horse and domestic horse C-peptide by one substitution (Gln30-->Pro). Rhino C-peptide was isolated only in a truncated form corresponding to residues (1-23) of intact C-peptide. Its amino acid sequence contains three substitutions compared with the corresponding region of horse C-peptide. It is postulated that the substitution (Pro23-->Thr) renders rhino C-peptide more liable to proteolytic cleavage by a chymotrypsin-like enzyme than horse C-peptide. C-peptide could not be identified in the extract of tapir pancreas, suggesting that proteolytic degradation may have been more extensive than in the rhino. In contrast to the ox and pig (order Artiodactyla), there was no evidence for the expression of more than one proinsulin gene in the species of Perissodactyla examined.
Development of a strategy and computational application to select candidate protein analogues with reduced HLA binding and immunogenicity.

PubMed

Dhanda, Sandeep Kumar; Grifoni, Alba; Pham, John; Vaughan, Kerrie; Sidney, John; Peters, Bjoern; Sette, Alessandro

2018-01-01

Unwanted immune responses against protein therapeutics can reduce efficacy or lead to adverse reactions. T-cell responses are key in the development of such responses, and are directed against immunodominant regions within the protein sequence, often associated with binding to several allelic variants of HLA class II molecules (promiscuous binders). Herein, we report a novel computational strategy to predict 'de-immunized' peptides, based on previous studies of erythropoietin protein immunogenicity. This algorithm (or method) first predicts promiscuous binding regions within the target protein sequence and then identifies residue substitutions predicted to reduce HLA binding. Further, this method anticipates the effect of any given substitution on flanking peptides, thereby circumventing the creation of nascent HLA-binding regions. As a proof-of-principle, the algorithm was applied to Vatreptacog α, an engineered Factor VII molecule associated with unintended immunogenicity. The algorithm correctly predicted the two immunogenic peptides containing the engineered residues. As a further validation, we selected and evaluated the immunogenicity of seven substitutions predicted to simultaneously reduce HLA binding for both peptides, five control substitutions with no predicted reduction in HLA-binding capacity, and additional flanking region controls. In vitro immunogenicity was detected in 21·4% of the cultures of peptides predicted to have reduced HLA binding and 11·4% of the flanking regions, compared with 46% for the cultures of the peptides predicted to be immunogenic. This method has been implemented as an interactive application, freely available online at http://tools.iedb.org/deimmunization/. © 2017 John Wiley & Sons Ltd.
How the dual process model of human cognition can inform efforts to de‐implement ineffective and harmful clinical practices: A preliminary model of unlearning and substitution

PubMed Central

Rose, Adam J.; Hartmann, Christine W.; van Bodegom‐Vos, Leti; Graham, Ian D.; Wood, Suzanne J.; Majerczyk, Barbara R.; Good, Chester B.; Pogach, Leonard M.; Ball, Sherry L.; Au, David H.; Aron, David C.

2018-01-01

Abstract Rationale and objectives One way to understand medical overuse at the clinician level is in terms of clinical decision‐making processes that are normally adaptive but become maladaptive. In psychology, dual process models of cognition propose 2 decision‐making processes. Reflective cognition is a conscious process of evaluating options based on some combination of utility, risk, capabilities, and/or social influences. Automatic cognition is a largely unconscious process occurring in response to environmental or emotive cues based on previously learned, ingrained heuristics. De‐implementation strategies directed at clinicians may be conceptualized as corresponding to cognition: (1) a process of unlearning based on reflective cognition and (2) a process of substitution based on automatic cognition. Results We define unlearning as a process in which clinicians consciously change their knowledge, beliefs, and intentions about an ineffective practice and alter their behaviour accordingly. Unlearning has been described as “the questioning of established knowledge, habits, beliefs and assumptions as a prerequisite to identifying inappropriate or obsolete knowledge underpinning and/or embedded in existing practices and routines.” We hypothesize that as an unintended consequence of unlearning strategies clinicians may experience “reactance,” ie, feel their professional prerogative is being violated and, consequently, increase their commitment to the ineffective practice. We define substitution as replacing the ineffective practice with one or more alternatives. A substitute is a specific alternative action or decision that either precludes the ineffective practice or makes it less likely to occur. Both approaches may work independently, eg, a substitute could displace an ineffective practice without changing clinicians' knowledge, and unlearning could occur even if no alternative exists. For some clinical practice, unlearning and substitution strategies may be most effectively used together. Conclusions By taking into account the dual process model of cognition, we may be able to design de‐implementation strategies matched to clinicians' decision‐making processes and avoid unintended consequence. PMID:29314508
Detecting novel SNPs and breed-specific haplotypes at calpastatin gene in Iranian fat- and thin-tailed sheep breeds and their effects on protein structure.

PubMed

Aali, Mohsen; Moradi-Shahrbabak, Mohammad; Moradi-Shahrbabak, Hosein; Sadeghi, Mostafa

2014-03-01

Calpastatin has been introduced as a potential candidate gene for growth and meat quality traits. In this study, genetic variability was investigated in the exon 6 and its intron boundaries of ovine CAST gene by PCR-SSCP analysis and DNA sequencing. Also a protein sequence and structural analysis were performed to predict the possible impact of amino acid substitutions on physicochemical properties and structure of the CAST protein. A total of 487 animals belonging to four ancient Iranian sheep breeds with different fat metabolisms, Lori-Bakhtiari and Chall (fat-tailed), Zel-Atabay cross-bred (medium fat-tailed) and Zel (thin-tailed), were analyzed. Eight unique SSCP patterns, representing eight different sequences or haplotypes, CAST-1, CAST-2 and CAST-6 to CAST-11, were identified. Haplotypes CAST-1 and CAST-2 were most common with frequency of 0.365 and 0.295. The novel haplotype CAST-8 had considerable frequency in Iranian sheep breeds (0.129). All the consensus sequences showed 98-99%, 94-98%, 92-93% and 82-83% similarity to the published ovine, caprine, bovine and porcine CAST locus sequences, respectively. Sequence analysis revealed four SNPs in intron 5 (C24T, G62A, G65T and T69-) and three SNPs in exon 6 (c.197A>T, c.282G>T and c.296C>G). All three SNPs in exon 6 were missense mutations which would result in p.Gln 66 Leu, p.Glu 94 Asp and p.Pro 99 Arg substitutions, respectively, in CAST protein. All three amino acid substitutions affected the physicochemical properties of ovine CAST protein including hydrophobicity, amphiphilicity and net charge and subsequently might influence its structure and effect on the activity of Ca2+ channels; hence, they might regulate calpain activity and afterwards meat tenderness and growth rate. The Lori-Bakhtiari population showed the highest heterozygosity in the ovine CAST locus (0.802). Frequency difference of haplotypes CAST-10 and CAST-8 between Lori-Bakhtiari (fat-tailed) and Zel (thin-tailed) breeds was highly significant (P<0.001), indicating that these two haplotypes might be breed-specific haplotypes that distinguish between fat-tailed and thin-tailed sheep breeds. Copyright © 2013 Elsevier B.V. All rights reserved.
Sentence Combining: A Sequence for Instruction.

ERIC Educational Resources Information Center

Lawlor, Joseph

1983-01-01

Classifies various syntactic structures normally included in sentence-combining instruction into five categories: coordinates, adverbials, restrictive noun modifiers, noun substitutes, and free modifiers. Within each category, structures are further divided into three levels to provide teachers with guidelines for planning instruction. (RH)
Muscle MRI of classic infantile pompe patients: Fatty substitution and edema-like changes.

PubMed

Pichiecchio, Anna; Rossi, Marta; Cinnante, Claudia; Colafati, Giovanna Stefania; De Icco, Roberto; Parini, Rossella; Menni, Francesca; Furlan, Francesca; Burlina, Alberto; Sacchini, Michele; Donati, Maria Alice; Fecarotta, Simona; Casa, Roberto Della; Deodato, Federica; Taurisano, Roberta; Di Rocco, Maja

2017-06-01

The aim of this study was to evaluate the muscle MRI pattern of 9 patients (median age: 6.5 ± 2.74 years) affected by classic infantile-onset Pompe disease who were treated with enzyme replacement therapy. We performed and qualitatively scored T1-weighted (T1-w) sequences of the facial, shoulder girdle, paravertebral, and lower limb muscles and short-tau inversion recovery (STIR) sequences of the lower limbs using the Mercuri and Morrow scales, respectively. On T1-w images, mild (grade 1) or moderate (grade 2) involvement was found in the tongue in 6 of 6 patients and in the adductor magnus muscle in 6 of 9. STIR hyperintensity was detected in all areas examined and was categorized as limited to mild in 5 of 8 patients. On T1-w sequences, mild/moderate adipose substitution in the adductor magnus and tongue muscles was documented. STIR edema-like alterations of thigh and calf muscles are novel findings. Correlations with biopsy findings and clinical parameters are needed to fully understand these findings. Muscle Nerve 55: 841-848, 2017. © 2016 Wiley Periodicals, Inc.
High levels of MHC class II allelic diversity in lake trout from Lake Superior

USGS Publications Warehouse

Dorschner, M.O.; Duris, T.; Bronte, C.R.; Burnham-Curtis, M. K.; Phillips, R.B.

2000-01-01

Sequence variation in a 216 bp portion of the major histocompatibility complex (MHC) II B1 domain was examined in 74 individual lake trout (Salvelinus namaycush) from different locations in Lake Superior. Forty-three alleles were obtained which encoded 71-72 amino acids of the mature protein. These sequences were compared with previous data obtained from five Pacific salmon species and Atlantic salmon using the same primers. Although all of the lake trout alleles clustered together in the neighbor-joining analysis of amino acid sequences, one amino acid allelic lineage was shared with Atlantic salmon (Salmo salar), a species in another genus which probably diverged from Salvelinus more than 10-20 million years ago. As shown previously in other salmonids, the level of nonsynonymous nucleotide substitution (d(N)) exceeded the level of synonymous substitution (d(S)). The level of nucleotide diversity at the MHC class II B1 locus was considerably higher in lake trout than in the Pacific salmon (genus Oncorhynchus). These results are consistent with the hypothesis that lake trout colonized Lake Superior from more than one refuge following the Wisconsin glaciation. Recent population bottlenecks may have reduced nucleotide diversity in Pacific salmon populations.
Cytotoxic T lymphocytes and CD4 epitope mutations in the pre-core/core region of hepatitis B virus in chronic hepatitis B carriers in Northeast Iran.

PubMed

Zhand, Sareh; Tabarraei, Alijan; Nazari, Amineh; Moradi, Abdolvahab

2017-07-01

Hepatitis B virus (HBV) is vulnerable to many various mutations. Those within epitopes recognized by sensitized T cells may influence the re-emergence of the virus. This study was designed to investigate the mutation in immune epitope regions of HBV pre-core/core among chronic HBV patients of Golestan province, Northeast Iran. In 120 chronic HBV carriers, HBV DNA was extracted from blood plasma samples and PCR was done using specific primers. Direct sequencing and alignment of the pre-core/core region were applied using reference sequence from Gene Bank database (Accession Number AB033559). The study showed 27 inferred amino acid substitutions, 9 of which (33.3%) were in CD4 and 2 (7.4%) in cytotoxic T lymphocytes' (CTL) epitopes and 16 other mutations (59.2%) were observed in other regions. CTL escape mutations were not commonly observed in pre-core/core sequences of chronic HBV carriers in the locale of study. It can be concluded that most of the inferred amino acid substitutions occur in different immune epitopes other than CTL and CD4.
L1-mediated retrotransposition of murine B1 and B2 SINEs recapitulated in cultured cells.

PubMed

Dewannieux, Marie; Heidmann, Thierry

2005-06-03

SINEs are short interspersed nucleotide elements with transpositional activity, present at a high copy number (up to a million) in mammalian genomes. They are 80-400 bp long, non-coding sequences which derive either from the 7SL RNA (e.g. human Alus, murine B1s) or tRNA (e.g. murine B2s) polymerase III-driven genes. We have previously demonstrated that Alus very efficiently divert the enzymatic machinery of the autonomous L1 LINE (long interspersed nucleotide element) retrotransposons to transpose at a high rate. Here we show, using an ex vivo assay for transposition, that both B1 and B2 SINEs can be mobilized by murine LINEs, with the hallmarks of a bona fide retrotransposition process, including target site duplications of varying lengths and integrations into A-rich sequences. Despite different phylogenetic origins, transposition of the tRNA-derived B2 sequences is as efficient as that of the human Alus, whereas that of B1s is 20-100-fold lower despite a similar high copy number of these elements in the mouse genome. We provide evidence, via an appropriate nucleotide substitution within the B1 sequence in a domain essential for its intracellular targeting, that the current B1 SINEs are not optimal for transposition, a feature most probably selected for the host sake in the course of evolution.
Structural basis of DNA target recognition by the B3 domain of Arabidopsis epigenome reader VAL1

PubMed Central

Sasnauskas, Giedrius; Kauneckaitė, Kotryna; Siksnys, Virginijus

2018-01-01

Abstract Arabidopsis thaliana requires a prolonged period of cold exposure during winter to initiate flowering in a process termed vernalization. Exposure to cold induces epigenetic silencing of the FLOWERING LOCUS C (FLC) gene by Polycomb group (PcG) proteins. A key role in this epigenetic switch is played by transcriptional repressors VAL1 and VAL2, which specifically recognize Sph/RY DNA sequences within FLC via B3 DNA binding domains, and mediate recruitment of PcG silencing machinery. To understand the structural mechanism of site-specific DNA recognition by VAL1, we have solved the crystal structure of VAL1 B3 domain (VAL1-B3) bound to a 12 bp oligoduplex containing the canonical Sph/RY DNA sequence 5′-CATGCA-3′/5′-TGCATG-3′. We find that VAL1-B3 makes H-bonds and van der Waals contacts to DNA bases of all six positions of the canonical Sph/RY element. In agreement with the structure, in vitro DNA binding studies show that VAL1-B3 does not tolerate substitutions at any position of the 5′-TGCATG-3′ sequence. The VAL1-B3–DNA structure presented here provides a structural model for understanding the specificity of plant B3 domains interacting with the Sph/RY and other DNA sequences. PMID:29660015

HIV-1 adaptation to antigen processing results in population-level immune evasion and affects subtype diversification.

PubMed

Tenzer, Stefan; Crawford, Hayley; Pymm, Phillip; Gifford, Robert; Sreenu, Vattipally B; Weimershaus, Mirjana; de Oliveira, Tulio; Burgevin, Anne; Gerstoft, Jan; Akkad, Nadja; Lunn, Daniel; Fugger, Lars; Bell, John; Schild, Hansjörg; van Endert, Peter; Iversen, Astrid K N

2014-04-24

The recent HIV-1 vaccine failures highlight the need to better understand virus-host interactions. One key question is why CD8(+) T cell responses to two HIV-Gag regions are uniquely associated with delayed disease progression only in patients expressing a few rare HLA class I variants when these regions encode epitopes presented by ~30 more common HLA variants. By combining epitope processing and computational analyses of the two HIV subtypes responsible for ~60% of worldwide infections, we identified a hitherto unrecognized adaptation to the antigen-processing machinery through substitutions at subtype-specific motifs. Multiple HLA variants presenting epitopes situated next to a given subtype-specific motif drive selection at this subtype-specific position, and epitope abundances correlate inversely with the HLA frequency distribution in affected populations. This adaptation reflects the sum of intrapatient adaptations, is predictable, facilitates viral subtype diversification, and increases global HIV diversity. Because low epitope abundance is associated with infrequent and weak T cell responses, this most likely results in both population-level immune evasion and inadequate responses in most people vaccinated with natural HIV-1 sequence constructs. Our results suggest that artificial sequence modifications at subtype-specific positions in vitro could refocus and reverse the poor immunogenicity of HIV proteins. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Rational Protein Engineering Guided by Deep Mutational Scanning

PubMed Central

Shin, HyeonSeok; Cho, Byung-Kwan

2015-01-01

Sequence–function relationship in a protein is commonly determined by the three-dimensional protein structure followed by various biochemical experiments. However, with the explosive increase in the number of genome sequences, facilitated by recent advances in sequencing technology, the gap between protein sequences available and three-dimensional structures is rapidly widening. A recently developed method termed deep mutational scanning explores the functional phenotype of thousands of mutants via massive sequencing. Coupled with a highly efficient screening system, this approach assesses the phenotypic changes made by the substitution of each amino acid sequence that constitutes a protein. Such an informational resource provides the functional role of each amino acid sequence, thereby providing sufficient rationale for selecting target residues for protein engineering. Here, we discuss the current applications of deep mutational scanning and consider experimental design. PMID:26404267
Contribution of single amino acid and codon substitutions to the production and secretion of a lipase by Bacillus subtilis.

PubMed

Skoczinski, Pia; Volkenborn, Kristina; Fulton, Alexander; Bhadauriya, Anuseema; Nutschel, Christina; Gohlke, Holger; Knapp, Andreas; Jaeger, Karl-Erich

2017-09-25

Bacillus subtilis produces and secretes proteins in amounts of up to 20 g/l under optimal conditions. However, protein production can be challenging if transcription and cotranslational secretion are negatively affected, or the target protein is degraded by extracellular proteases. This study aims at elucidating the influence of a target protein on its own production by a systematic mutational analysis of the homologous B. subtilis model protein lipase A (LipA). We have covered the full natural diversity of single amino acid substitutions at 155 positions of LipA by site saturation mutagenesis excluding only highly conserved residues and qualitatively and quantitatively screened about 30,000 clones for extracellular LipA production. Identified variants with beneficial effects on production were sequenced and analyzed regarding B. subtilis growth behavior, extracellular lipase activity and amount as well as changes in lipase transcript levels. In total, 26 LipA variants were identified showing an up to twofold increase in either amount or activity of extracellular lipase. These variants harbor single amino acid or codon substitutions that did not substantially affect B. subtilis growth. Subsequent exemplary combination of beneficial single amino acid substitutions revealed an additive effect solely at the level of extracellular lipase amount; however, lipase amount and activity could not be increased simultaneously. Single amino acid and codon substitutions can affect LipA secretion and production by B. subtilis. Several codon-related effects were observed that either enhance lipA transcription or promote a more efficient folding of LipA. Single amino acid substitutions could improve LipA production by increasing its secretion or stability in the culture supernatant. Our findings indicate that optimization of the expression system is not sufficient for efficient protein production in B. subtilis. The sequence of the target protein should also be considered as an optimization target for successful protein production. Our results further suggest that variants with improved properties might be identified much faster and easier if mutagenesis is prioritized towards elements that contribute to enzymatic activity or structural integrity.
Predicting protein subnuclear location with optimized evidence-theoretic K-nearest classifier and pseudo amino acid composition.

PubMed

Shen, Hong-Bin; Chou, Kuo-Chen

2005-11-25

The nucleus is the brain of eukaryotic cells that guides the life processes of the cell by issuing key instructions. For in-depth understanding of the biochemical process of the nucleus, the knowledge of localization of nuclear proteins is very important. With the avalanche of protein sequences generated in the post-genomic era, it is highly desired to develop an automated method for fast annotating the subnuclear locations for numerous newly found nuclear protein sequences so as to be able to timely utilize them for basic research and drug discovery. In view of this, a novel approach is developed for predicting the protein subnuclear location. It is featured by introducing a powerful classifier, the optimized evidence-theoretic K-nearest classifier, and using the pseudo amino acid composition [K.C. Chou, PROTEINS: Structure, Function, and Genetics, 43 (2001) 246], which can incorporate a considerable amount of sequence-order effects, to represent protein samples. As a demonstration, identifications were performed for 370 nuclear proteins among the following 9 subnuclear locations: (1) Cajal body, (2) chromatin, (3) heterochromatin, (4) nuclear diffuse, (5) nuclear pore, (6) nuclear speckle, (7) nucleolus, (8) PcG body, and (9) PML body. The overall success rates thus obtained by both the re-substitution test and jackknife cross-validation test are significantly higher than those by existing classifiers on the same working dataset. It is anticipated that the powerful approach may also become a useful high throughput vehicle to bridge the huge gap occurring in the post-genomic era between the number of gene sequences in databases and the number of gene products that have been functionally characterized. The OET-KNN classifier will be available at www.pami.sjtu.edu.cn/people/hbshen.
Molecular characterization of subgenotype A1 (subgroup Aa) of hepatitis B virus.

PubMed

Kramvis, Anna; Kew, Michael C

2007-07-01

Subgenotypes of hepatitis B virus (HBV) were first recognized after a unique segment of genotype A was identified when sequencing the preS2/S region of southern African HBV isolates. Originally named subgroup A', subsequently called subgroup Aa (for Africa) or subgenotype A1, this subgenotype is found in South Africa, Malawi, Uganda, Tanzania, Somalia, Yemen, India, Nepal, the Philippines and Brazil. The relatively higher mean nucleotide divergence of subgenotype A1 suggests that it has been endemic and has a long evolutionary history in the populations where it prevails. Distinctive sequence characteristics could account for the high hepatitis B e-antigen (HBeAg) negativity and low HBV DNA levels in carriers of this subgenotype. Substitutions or mutations can reduce HBeAg expression at three levels: (i) 1762T1764A atthe transcriptional level; (ii) substitutions at nt 1809-1812 at the translational level; and (iii) 1862T at the post-translational level. Co-existence of 1762T1764A and nt 1809-1812 mutations reduces HBeAg expression in an additive manner. In addition, subgenotype A1 has unique sequence alterations in the transcriptional regulatory elements and the polymerase coding region. The distinct sequence characteristics of subgenotype A1 may contribute to the 4.5-fold increased risk of heptocellular carcinoma in HBV carriers infected with genotype A, which is entirely attributable to subgenotype A1.
MultiGeMS: detection of SNVs from multiple samples using model selection on high-throughput sequencing data.

PubMed

Murillo, Gabriel H; You, Na; Su, Xiaoquan; Cui, Wei; Reilly, Muredach P; Li, Mingyao; Ning, Kang; Cui, Xinping

2016-05-15

Single nucleotide variant (SNV) detection procedures are being utilized as never before to analyze the recent abundance of high-throughput DNA sequencing data, both on single and multiple sample datasets. Building on previously published work with the single sample SNV caller genotype model selection (GeMS), a multiple sample version of GeMS (MultiGeMS) is introduced. Unlike other popular multiple sample SNV callers, the MultiGeMS statistical model accounts for enzymatic substitution sequencing errors. It also addresses the multiple testing problem endemic to multiple sample SNV calling and utilizes high performance computing (HPC) techniques. A simulation study demonstrates that MultiGeMS ranks highest in precision among a selection of popular multiple sample SNV callers, while showing exceptional recall in calling common SNVs. Further, both simulation studies and real data analyses indicate that MultiGeMS is robust to low-quality data. We also demonstrate that accounting for enzymatic substitution sequencing errors not only improves SNV call precision at low mapping quality regions, but also improves recall at reference allele-dominated sites with high mapping quality. The MultiGeMS package can be downloaded from https://github.com/cui-lab/multigems xinping.cui@ucr.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Exploring the active site binding specificity of kallikrein-related peptidase 5 (KLK5) guides the design of new peptide substrates and inhibitors.

PubMed

de Veer, Simon J; Swedberg, Joakim E; Brattsand, Maria; Clements, Judith A; Harris, Jonathan M

2016-12-01

Kallikrein-related peptidase 5 (KLK5) is a promising therapeutic target in several skin diseases, including Netherton syndrome, and is emerging as a potential target in various cancers. In this study, we used a sparse matrix library of 125 individually synthesized peptide substrates to characterize the binding specificity of KLK5. The sequences most favored by KLK5 were GRSR, YRSR and GRNR, and we identified sequence-specific interactions involving the peptide N-terminus by analyzing kinetic constants (kcat and KM) and performing molecular dynamics simulations. KLK5 inhibitors were subsequently engineered by substituting substrate sequences into the binding loop (P1, P2 and P4 residues) of sunflower trypsin inhibitor-1 (SFTI-1). These inhibitors were effective against KLK5 but showed limited selectivity, and performing a further substitution at P2' led to the design of a new variant that displayed improved activity against KLK5 (Ki=4.2±0.2 nm), weak activity against KLK7 and 12-fold selectivity over KLK14. Collectively, these findings provide new insight into the design of highly favored binding sequences for KLK5 and reveal several opportunities for modulating inhibitor selectivity over closely related proteases that will be useful for future studies aiming to develop therapeutic molecules targeting KLK5.
Theileria parva antigens recognized by CD8+ T cells show varying degrees of diversity in buffalo-derived infected cell lines.

PubMed

Sitt, Tatjana; Pelle, Roger; Chepkwony, Maurine; Morrison, W Ivan; Toye, Philip

2018-05-06

The extent of sequence diversity among the genes encoding 10 antigens (Tp1-10) known to be recognized by CD8+ T lymphocytes from cattle immune to Theileria parva was analysed. The sequences were derived from parasites in 23 buffalo-derived cell lines, three cattle-derived isolates and one cloned cell line obtained from a buffalo-derived stabilate. The results revealed substantial variation among the antigens through sequence diversity. The greatest nucleotide and amino acid diversity were observed in Tp1, Tp2 and Tp9. Tp5 and Tp7 showed the least amount of allelic diversity, and Tp5, Tp6 and Tp7 had the lowest levels of protein diversity. Tp6 was the most conserved protein; only a single non-synonymous substitution was found in all obtained sequences. The ratio of non-synonymous: synonymous substitutions varied from 0.84 (Tp1) to 0.04 (Tp6). Apart from Tp2 and Tp9, we observed no variation in the other defined CD8+ T cell epitopes (Tp4, 5, 7 and 8), indicating that epitope variation is not a universal feature of T. parva antigens. In addition to providing markers that can be used to examine the diversity in T. parva populations, the results highlight the potential for using conserved antigens to develop vaccines that provide broad protection against T. parva.
Real-time, portable genome sequencing for Ebola surveillance

PubMed Central

Bore, Joseph Akoi; Koundouno, Raymond; Dudas, Gytis; Mikhail, Amy; Ouédraogo, Nobila; Afrough, Babak; Bah, Amadou; Baum, Jonathan HJ; Becker-Ziaja, Beate; Boettcher, Jan-Peter; Cabeza-Cabrerizo, Mar; Camino-Sanchez, Alvaro; Carter, Lisa L.; Doerrbecker, Juiliane; Enkirch, Theresa; Dorival, Isabel Graciela García; Hetzelt, Nicole; Hinzmann, Julia; Holm, Tobias; Kafetzopoulou, Liana Eleni; Koropogui, Michel; Kosgey, Abigail; Kuisma, Eeva; Logue, Christopher H; Mazzarelli, Antonio; Meisel, Sarah; Mertens, Marc; Michel, Janine; Ngabo, Didier; Nitzsche, Katja; Pallash, Elisa; Patrono, Livia Victoria; Portmann, Jasmine; Repits, Johanna Gabriella; Rickett, Natasha Yasmin; Sachse, Andrea; Singethan, Katrin; Vitoriano, Inês; Yemanaberhan, Rahel L; Zekeng, Elsa G; Trina, Racine; Bello, Alexander; Sall, Amadou Alpha; Faye, Ousmane; Faye, Oumar; Magassouba, N’Faly; Williams, Cecelia V.; Amburgey, Victoria; Winona, Linda; Davis, Emily; Gerlach, Jon; Washington, Franck; Monteil, Vanessa; Jourdain, Marine; Bererd, Marion; Camara, Alimou; Somlare, Hermann; Camara, Abdoulaye; Gerard, Marianne; Bado, Guillaume; Baillet, Bernard; Delaune, Déborah; Nebie, Koumpingnin Yacouba; Diarra, Abdoulaye; Savane, Yacouba; Pallawo, Raymond Bernard; Gutierrez, Giovanna Jaramillo; Milhano, Natacha; Roger, Isabelle; Williams, Christopher J; Yattara, Facinet; Lewandowski, Kuiama; Taylor, Jamie; Rachwal, Philip; Turner, Daniel; Pollakis, Georgios; Hiscox, Julian A.; Matthews, David A.; O’Shea, Matthew K.; Johnston, Andrew McD; Wilson, Duncan; Hutley, Emma; Smit, Erasmus; Di Caro, Antonino; Woelfel, Roman; Stoecker, Kilian; Fleischmann, Erna; Gabriel, Martin; Weller, Simon A.; Koivogui, Lamine; Diallo, Boubacar; Keita, Sakoba; Rambaut, Andrew; Formenty, Pierre; Gunther, Stephan; Carroll, Miles W.

2016-01-01

The Ebola virus disease (EVD) epidemic in West Africa is the largest on record, responsible for >28,599 cases and >11,299 deaths 1. Genome sequencing in viral outbreaks is desirable in order to characterize the infectious agent to determine its evolutionary rate, signatures of host adaptation, identification and monitoring of diagnostic targets and responses to vaccines and treatments. The Ebola virus genome (EBOV) substitution rate in the Makona strain has been estimated at between 0.87 × 10−3 to 1.42 × 10−3 mutations per site per year. This is equivalent to 16 to 27 mutations in each genome, meaning that sequences diverge rapidly enough to identify distinct sub-lineages during a prolonged epidemic 2-7. Genome sequencing provides a high-resolution view of pathogen evolution and is increasingly sought-after for outbreak surveillance. Sequence data may be used to guide control measures, but only if the results are generated quickly enough to inform interventions 8. Genomic surveillance during the epidemic has been sporadic due to a lack of local sequencing capacity coupled with practical difficulties transporting samples to remote sequencing facilities 9. In order to address this problem, we devised a genomic surveillance system that utilizes a novel nanopore DNA sequencing instrument. In April 2015 this system was transported in standard airline luggage to Guinea and used for real-time genomic surveillance of the ongoing epidemic. Here we present sequence data and analysis of 142 Ebola virus (EBOV) samples collected during the period March to October 2015. We were able to generate results in less than 24 hours after receiving an Ebola positive sample, with the sequencing process taking as little as 15-60 minutes. We show that real-time genomic surveillance is possible in resource-limited settings and can be established rapidly to monitor outbreaks. PMID:26840485
Rates of molecular evolution in tree ferns are associated with body size, environmental temperature, and biological productivity.

PubMed

Barrera-Redondo, Josué; Ramírez-Barahona, Santiago; Eguiarte, Luis E

2018-05-01

Variation in rates of molecular evolution (heterotachy) is a common phenomenon among plants. Although multiple theoretical models have been proposed, fundamental questions remain regarding the combined effects of ecological and morphological traits on rate heterogeneity. Here, we used tree ferns to explore the correlation between rates of molecular evolution in chloroplast DNA sequences and several morphological and environmental factors within a Bayesian framework. We revealed direct and indirect effects of body size, biological productivity, and temperature on substitution rates, where smaller tree ferns living in warmer and less productive environments tend to have faster rates of molecular evolution. In addition, we found that variation in the ratio of nonsynonymous to synonymous substitution rates (dN/dS) in the chloroplast rbcL gene was significantly correlated with ecological and morphological variables. Heterotachy in tree ferns may be influenced by effective population size associated with variation in body size and productivity. Macroevolutionary hypotheses should go beyond explaining heterotachy in terms of mutation rates and instead, should integrate population-level factors to better understand the processes affecting the tempo of evolution at the molecular level. © 2018 The Author(s). Evolution © 2018 The Society for the Study of Evolution.
Homologation chemistry with nucleophilic α-substituted organometallic reagents: chemocontrol, new concepts and (solved) challenges.

PubMed

Castoldi, Laura; Monticelli, Serena; Senatore, Raffaele; Ielo, Laura; Pace, Vittorio

2018-05-31

The transfer of a reactive nucleophilic CH2X unit into a preformed bond enables the introduction of a fragment featuring the exact and desired degree of functionalization through a single synthetic operation. The instability of metallated α-organometallic species often poses serious questions regarding the practicability of using this conceptually intuitive and simple approach for forming C-C or C-heteroatom bonds. A deep understanding of processes regulating the formation of these nucleophiles is a precious source of inspiration not only for successfully applying theoretically feasible transformations (i.e. determining how to employ a given reagent), but also for designing new reactions which ultimately lead to the introduction of molecular complexity via short experimental sequences.
Evolution and dispersal of St. Louis encephalitis virus in the Americas.

PubMed

Auguste, Albert J; Pybus, Oliver G; Carrington, Christine V F

2009-07-01

Using a Bayesian coalescent approach on a dataset of 73 envelope gene sequences we estimated substitution rates and dates of divergence for St. Louis encephalitis virus (SLEV) in the Americas. We found significant rate heterogeneity among lineages, such that "relaxed" molecular clock models were much better supported than a strict molecular clock. The mean substitution rate estimated for all SLEV was 4.1x10(-4)substitutions/site/year (95% HPD 2.5-5.7)-higher than previous estimates that relied on the less well-suited strict clock. Mean substitution rates for individual lineages varied from 3.7x10(-4) to 7.2x10(-4)substitutions/site/year. For the first time we also assessed the magnitude and direction of viral gene flow within the Americas. The overall direction of gene flow during the period represented by the phylogeny is from South to North, and the region between 15 degrees N and 30 degrees N latitude appears to be the major source of virus for the rest of North America, which is consistent with migratory birds returning to their northern breeding grounds having acquired infection while wintering in the region of the Gulf of Mexico.
Parametric and non-parametric masking of randomness in sequence alignments can be improved and leads to better resolved trees.

PubMed

Kück, Patrick; Meusemann, Karen; Dambach, Johannes; Thormann, Birthe; von Reumont, Björn M; Wägele, Johann W; Misof, Bernhard

2010-03-31

Methods of alignment masking, which refers to the technique of excluding alignment blocks prior to tree reconstructions, have been successful in improving the signal-to-noise ratio in sequence alignments. However, the lack of formally well defined methods to identify randomness in sequence alignments has prevented a routine application of alignment masking. In this study, we compared the effects on tree reconstructions of the most commonly used profiling method (GBLOCKS) which uses a predefined set of rules in combination with alignment masking, with a new profiling approach (ALISCORE) based on Monte Carlo resampling within a sliding window, using different data sets and alignment methods. While the GBLOCKS approach excludes variable sections above a certain threshold which choice is left arbitrary, the ALISCORE algorithm is free of a priori rating of parameter space and therefore more objective. ALISCORE was successfully extended to amino acids using a proportional model and empirical substitution matrices to score randomness in multiple sequence alignments. A complex bootstrap resampling leads to an even distribution of scores of randomly similar sequences to assess randomness of the observed sequence similarity. Testing performance on real data, both masking methods, GBLOCKS and ALISCORE, helped to improve tree resolution. The sliding window approach was less sensitive to different alignments of identical data sets and performed equally well on all data sets. Concurrently, ALISCORE is capable of dealing with different substitution patterns and heterogeneous base composition. ALISCORE and the most relaxed GBLOCKS gap parameter setting performed best on all data sets. Correspondingly, Neighbor-Net analyses showed the most decrease in conflict. Alignment masking improves signal-to-noise ratio in multiple sequence alignments prior to phylogenetic reconstruction. Given the robust performance of alignment profiling, alignment masking should routinely be used to improve tree reconstructions. Parametric methods of alignment profiling can be easily extended to more complex likelihood based models of sequence evolution which opens the possibility of further improvements.
Whole-genome sequencing and analyses identify high genetic heterogeneity, diversity and endemicity of rotavirus genotype P[6] strains circulating in Africa.

PubMed

Nyaga, Martin M; Tan, Yi; Seheri, Mapaseka L; Halpin, Rebecca A; Akopov, Asmik; Stucker, Karla M; Fedorova, Nadia B; Shrivastava, Susmita; Duncan Steele, A; Mwenda, Jason M; Pickett, Brett E; Das, Suman R; Jeffrey Mphahlele, M

2018-05-18

Rotavirus A (RVA) exhibits a wide genotype diversity globally. Little is known about the genetic composition of genotype P[6] from Africa. This study investigated possible evolutionary mechanisms leading to genetic diversity of genotype P[6] VP4 sequences. Phylogenetic analyses on 167 P[6] VP4 full-length sequences were conducted, which included six porcine-origin sequences. Of the 167 sequences, 57 were newly acquired through whole genome sequencing as part of this study. The other 110 sequences were all publicly-available global P[6] VP4 full-length sequences downloaded from GenBank. The strength of association between the phenotypic features and the phylogeny was also determined. A number of reassortment and mixed infections of RVA genotype P[6] strains were observed in this study. Phylogenetic analyses demostrated the extensive genetic diversity that exists among human P[6] strains, porcine-like strains, their concomitant clades/subclades and estimated that P[6] VP4 gene has a higher substitution rate with the mean of 1.05E-3 substitutions/site/year. Further, the phylogenetic analyses indicated that genotype P[6] strains were endemic in Africa, characterised by an extensive genetic diversity and long-time local evolution of the viruses. This was also supported by phylogeographic clustering and G-genotype clustering of the P[6] strains when Bayesian Tip-association Significance testing (BaTS) was applied, clearly supporting that the viruses evolved locally in Africa instead of spatial mixing among different regions. Overall, the results demonstrated that multiple mechanisms such as reassortment events, various mutations and possibly interspecies transmission account for the enormous diversity of genotype P[6] strains in Africa. These findings highlight the need for continued global surveillance of rotavirus diversity. Copyright © 2018 Elsevier B.V. All rights reserved.
Complete chloroplast DNA sequence from a Korean endemic genus, Megaleranthis saniculifolia, and its evolutionary implications.

PubMed

Kim, Young-Kyu; Park, Chong-wook; Kim, Ki-Joong

2009-03-31

The chloroplast DNA sequences of Megaleranthis saniculifolia, an endemic and monotypic endangered plant species, were completed in this study (GenBank FJ597983). The genome is 159,924 bp in length. It harbors a pair of IR regions consisting of 26,608 bp each. The lengths of the LSC and SSC regions are 88,326 bp and 18,382 bp, respectively. The structural organizations, gene and intron contents, gene orders, AT contents, codon usages, and transcription units of the Megaleranthis chloroplast genome are similar to those of typical land plant cp DNAs. However, the detailed features of Megaleranthis chloroplast genomes are substantially different from that of Ranunculus, which belongs to the same family, the Ranunculaceae. First, the Megaleranthis cp DNA was 4,797 bp longer than that of Ranunculus due to an expanded IR region into the SSC region and duplicated sequence elements in several spacer regions of the Megaleranthis cp genome. Second, the chloroplast genomes of Megaleranthis and Ranunculus evidence 5.6% sequence divergence in the coding regions, 8.9% sequence divergence in the intron regions, and 18.7% sequence divergence in the intergenic spacer regions, respectively. In both the coding and noncoding regions, average nucleotide substitution rates differed markedly, depending on the genome position. Our data strongly implicate the positional effects of the evolutionary modes of chloroplast genes. The genes evidencing higher levels of base substitutions also have higher incidences of indel mutations and low Ka/Ks ratios. A total of 54 simple sequence repeat loci were identified from the Megaleranthis cp genome. The existence of rich cp SSR loci in the Megaleranthis cp genome provides a rare opportunity to study the population genetic structures of this endangered species. Our phylogenetic trees based on the two independent markers, the nuclear ITS and chloroplast matK sequences, strongly support the inclusion of the Megaleranthis to the Trollius. Therefore, our molecular trees support Ohwi's original treatment of Megaleranthis saniculiforia to Trollius chosenensis Ohwi.
Two-Year Assessment of Entecavir Resistance in Lamivudine-Refractory Hepatitis B Virus Patients Reveals Different Clinical Outcomes Depending on the Resistance Substitutions Present▿

PubMed Central

Tenney, Daniel J.; Rose, Ronald E.; Baldick, Carl J.; Levine, Steven M.; Pokornowski, Kevin A.; Walsh, Ann W.; Fang, Jie; Yu, Cheng-Fang; Zhang, Sharon; Mazzucco, Charles E.; Eggers, Betsy; Hsu, Mayla; Plym, Mary Jane; Poundstone, Patricia; Yang, Joanna; Colonno, Richard J.

2007-01-01

Entecavir (ETV) is a deoxyguanosine analog approved for use for the treatment of chronic infection with wild-type and lamivudine-resistant (LVDr) hepatitis B virus (HBV). In LVD-refractory patients, 1.0 mg ETV suppressed HBV DNA levels to below the level of detection by PCR (<300 copies/ml) in 21% and 34% of patients by Weeks 48 and 96, respectively. Prior studies showed that virologic rebound due to ETV resistance (ETVr) required preexisting LVDr HBV reverse transcriptase substitutions M204V and L180M plus additional changes at T184, S202, or M250. To monitor for resistance, available isolates from 192 ETV-treated patients were sequenced, with phenotyping performed for all isolates with all emerging substitutions, in addition to isolates from all patients experiencing virologic rebounds. The T184, S202, or M250 substitution was found in LVDr HBV at baseline in 6% of patients and emerged in isolates from another 11/187 (6%) and 12/151 (8%) ETV-treated patients by Weeks 48 and 96, respectively. However, use of a more sensitive PCR assay detected many of the emerging changes at baseline, suggesting that they originated during LVD therapy. Only a subset of the changes in ETVr isolates altered their susceptibilities, and virtually all isolates were significantly replication impaired in vitro. Consequently, only 2/187 (1%) patients experienced ETVr rebounds in year 1, with an additional 14/151 (9%) patients experiencing ETVr rebounds in year 2. Isolates from all 16 patients with rebounds were LVDr and harbored the T184 and/or S202 change. Seventeen other novel substitutions emerged during ETV therapy, but none reduced the susceptibility to ETV or resulted in a rebound. In summary, ETV was effective in LVD-refractory patients, with resistant sequences arising from a subset of patients harboring preexisting LVDr/ETVr variants and with approximately half of the patients experiencing a virologic rebound. PMID:17178796
Two-year assessment of entecavir resistance in Lamivudine-refractory hepatitis B virus patients reveals different clinical outcomes depending on the resistance substitutions present.

PubMed

Tenney, Daniel J; Rose, Ronald E; Baldick, Carl J; Levine, Steven M; Pokornowski, Kevin A; Walsh, Ann W; Fang, Jie; Yu, Cheng-Fang; Zhang, Sharon; Mazzucco, Charles E; Eggers, Betsy; Hsu, Mayla; Plym, Mary Jane; Poundstone, Patricia; Yang, Joanna; Colonno, Richard J

2007-03-01

Entecavir (ETV) is a deoxyguanosine analog approved for use for the treatment of chronic infection with wild-type and lamivudine-resistant (LVDr) hepatitis B virus (HBV). In LVD-refractory patients, 1.0 mg ETV suppressed HBV DNA levels to below the level of detection by PCR (<300 copies/ml) in 21% and 34% of patients by Weeks 48 and 96, respectively. Prior studies showed that virologic rebound due to ETV resistance (ETVr) required preexisting LVDr HBV reverse transcriptase substitutions M204V and L180M plus additional changes at T184, S202, or M250. To monitor for resistance, available isolates from 192 ETV-treated patients were sequenced, with phenotyping performed for all isolates with all emerging substitutions, in addition to isolates from all patients experiencing virologic rebounds. The T184, S202, or M250 substitution was found in LVDr HBV at baseline in 6% of patients and emerged in isolates from another 11/187 (6%) and 12/151 (8%) ETV-treated patients by Weeks 48 and 96, respectively. However, use of a more sensitive PCR assay detected many of the emerging changes at baseline, suggesting that they originated during LVD therapy. Only a subset of the changes in ETVr isolates altered their susceptibilities, and virtually all isolates were significantly replication impaired in vitro. Consequently, only 2/187 (1%) patients experienced ETVr rebounds in year 1, with an additional 14/151 (9%) patients experiencing ETVr rebounds in year 2. Isolates from all 16 patients with rebounds were LVDr and harbored the T184 and/or S202 change. Seventeen other novel substitutions emerged during ETV therapy, but none reduced the susceptibility to ETV or resulted in a rebound. In summary, ETV was effective in LVD-refractory patients, with resistant sequences arising from a subset of patients harboring preexisting LVDr/ETVr variants and with approximately half of the patients experiencing a virologic rebound.
LenVarDB: database of length-variant protein domains.

PubMed

Mutt, Eshita; Mathew, Oommen K; Sowdhamini, Ramanathan

2014-01-01

Protein domains are functionally and structurally independent modules, which add to the functional variety of proteins. This array of functional diversity has been enabled by evolutionary changes, such as amino acid substitutions or insertions or deletions, occurring in these protein domains. Length variations (indels) can introduce changes at structural, functional and interaction levels. LenVarDB (freely available at http://caps.ncbs.res.in/lenvardb/) traces these length variations, starting from structure-based sequence alignments in our Protein Alignments organized as Structural Superfamilies (PASS2) database, across 731 structural classification of proteins (SCOP)-based protein domain superfamilies connected to 2 730 625 sequence homologues. Alignment of sequence homologues corresponding to a structural domain is available, starting from a structure-based sequence alignment of the superfamily. Orientation of the length-variant (indel) regions in protein domains can be visualized by mapping them on the structure and on the alignment. Knowledge about location of length variations within protein domains and their visual representation will be useful in predicting changes within structurally or functionally relevant sites, which may ultimately regulate protein function. Non-technical summary: Evolutionary changes bring about natural changes to proteins that may be found in many organisms. Such changes could be reflected as amino acid substitutions or insertions-deletions (indels) in protein sequences. LenVarDB is a database that provides an early overview of observed length variations that were set among 731 protein families and after examining >2 million sequences. Indels are followed up to observe if they are close to the active site such that they can affect the activity of proteins. Inclusion of such information can aid the design of bioengineering experiments.
Analysis of the mitochondrial genome of cheetahs (Acinonyx jubatus) with neurodegenerative disease.

PubMed

Burger, Pamela A; Steinborn, Ralf; Walzer, Christian; Petit, Thierry; Mueller, Mathias; Schwarzenberger, Franz

2004-08-18

The complete mitochondrial genome of Acinonyx jubatus was sequenced and mitochondrial DNA (mtDNA) regions were screened for polymorphisms as candidates for the cause of a neurodegenerative demyelinating disease affecting captive cheetahs. The mtDNA reference sequences were established on the basis of the complete sequences of two diseased and two nondiseased animals as well as partial sequences of 26 further individuals. The A. jubatus mitochondrial genome is 17,047-bp long and shows a high sequence similarity (91%) to the domestic cat. Based on single nucleotide polymorphisms (SNPs) in the control region (CR) and pedigree information, the 18 myelopathic and 12 non-myelopathic cheetahs included in this study were classified into haplotypes I, II and III. In view of the phenotypic comparability of the neurodegenerative disease observed in cheetahs and human mtDNA-associated diseases, specific coding regions including the tRNAs leucine UUR, lysine, serine UCN, and partial complex I and V sequences were screened. We identified a heteroplasmic and a homoplasmic SNP at codon 507 in the subunit 5 (MTND5) of complex I. The heteroplasmic haplotype I-specific valine to methionine substitution represents a nonconservative amino acid change and was found in 11 myelopathic and eight non-myelopathic cheetahs with levels ranging from 29% to 79%. The homoplasmic conservative amino acid substitution valine to alanine was identified in two myelopathic animals of haplotype II. In addition, a synonymous SNP in the codon 76 of the MTND4L gene was found in the single haplotype III animal. The amino acid exchanges in the MTND5 gene were not associated with the occurrence of neurodegenerative disease in captive cheetahs.
Improved detection of genetic markers of antimicrobial resistance by hybridization probe-based melting curve analysis using primers to mask proximal mutations: examples include the influenza H275Y substitution.

PubMed

Whiley, David M; Jacob, Kevin; Nakos, Jennifer; Bletchly, Cheryl; Nimmo, Graeme R; Nissen, Michael D; Sloots, Theo P

2012-06-01

Numerous real-time PCR assays have been described for detection of the influenza A H275Y alteration. However, the performance of these methods can be undermined by sequence variation in the regions flanking the codon of interest. This is a problem encountered more broadly in microbial diagnostics. In this study, we developed a modification of hybridization probe-based melting curve analysis, whereby primers are used to mask proximal mutations in the sequence targets of hybridization probes, so as to limit the potential for sequence variation to interfere with typing. The approach was applied to the H275Y alteration of the influenza A (H1N1) 2009 strain, as well as a Neisseria gonorrhoeae mutation associated with antimicrobial resistance. Assay performances were assessed using influenza A and N. gonorrhoeae strains characterized by DNA sequencing. The modified hybridization probe-based approach proved successful in limiting the effects of proximal mutations, with the results of melting curve analyses being 100% consistent with the results of DNA sequencing for all influenza A and N. gonorrhoeae strains tested. Notably, these included influenza A and N. gonorrhoeae strains exhibiting additional mutations in hybridization probe targets. Of particular interest was that the H275Y assay correctly typed influenza A strains harbouring a T822C nucleotide substitution, previously shown to interfere with H275Y typing methods. Overall our modified hybridization probe-based approach provides a simple means of circumventing problems caused by sequence variation, and offers improved detection of the influenza A H275Y alteration and potentially other resistance mechanisms.

A genome sequence resource for the aye-aye (Daubentonia madagascariensis), a nocturnal lemur from Madagascar.

PubMed

Perry, George H; Reeves, Darryl; Melsted, Páll; Ratan, Aakrosh; Miller, Webb; Michelini, Katelyn; Louis, Edward E; Pritchard, Jonathan K; Mason, Christopher E; Gilad, Yoav

2012-01-01

We present a high-coverage draft genome assembly of the aye-aye (Daubentonia madagascariensis), a highly unusual nocturnal primate from Madagascar. Our assembly totals ~3.0 billion bp (3.0 Gb), roughly the size of the human genome, comprised of ~2.6 million scaffolds (N50 scaffold size = 13,597 bp) based on short paired-end sequencing reads. We compared the aye-aye genome sequence data with four other published primate genomes (human, chimpanzee, orangutan, and rhesus macaque) as well as with the mouse and dog genomes as nonprimate outgroups. Unexpectedly, we observed strong evidence for a relatively slow substitution rate in the aye-aye lineage compared with these and other primates. In fact, the aye-aye branch length is estimated to be ~10% shorter than that of the human lineage, which is known for its low substitution rate. This finding may be explained, in part, by the protracted aye-aye life-history pattern, including late weaning and age of first reproduction relative to other lemurs. Additionally, the availability of this draft lemur genome sequence allowed us to polarize nucleotide and protein sequence changes to the ancestral primate lineage-a critical period in primate evolution, for which the relevant fossil record is sparse. Finally, we identified 293,800 high-confidence single nucleotide polymorphisms in the donor individual for our aye-aye genome sequence, a captive-born individual from two wild-born parents. The resulting heterozygosity estimate of 0.051% is the lowest of any primate studied to date, which is understandable considering the aye-aye's extensive home-range size and relatively low population densities. Yet this level of genetic diversity also suggests that conservation efforts benefiting this unusual species should be prioritized, especially in the face of the accelerating degradation and fragmentation of Madagascar's forests.
Genetic variation in potential Giardia vaccine candidates cyst wall protein 2 and α1-giardin.

PubMed

Radunovic, Matej; Klotz, Christian; Saghaug, Christina Skår; Brattbakk, Hans-Richard; Aebischer, Toni; Langeland, Nina; Hanevik, Kurt

2017-08-01

Giardia is a prevalent intestinal parasitic infection. The trophozoite structural protein a1-giardin (a1-g) and the cyst protein cyst wall protein 2 (CWP2) have shown promise as Giardia vaccine antigen candidates in murine models. The present study assesses the genetic diversity of a1-g and CWP2 between and within assemblages A and B in human clinical isolates. a1-g and CWP2 sequences were acquired from 15 Norwegian isolates by PCR amplification and 20 sequences from German cultured isolates by whole genome sequencing. Sequences were aligned to reference genomes from assemblage A2 and B to identify genetic variance. Genetic diversity was found between assemblage A and B reference sequences for both a1-g (90.8% nucleotide identity) and CWP2 (82.5% nucleotide identity). However, for a1-g, this translated into only 3 amino acid (aa) substitutions, while for CWP2 there were 41 aa substitutions, and also one aa deletion. Genetic diversity within assemblage B was larger; nucleotide identity 92.0% for a1-g and 94.3% for CWP2, than within assemblage A (nucleotide identity 99.0% for a1-g and 99.7% for CWP2). For CWP2, the diversity on both nucleotide and protein level was higher in the C-terminal end. Predicted antigenic epitopes were not affected for a1-g, but partially for CWP2. Despite genetic diversity in a1-g, we found aa sequence, characteristics, and antigenicity to be well preserved. CWP2 showed more aa variance and potential antigenic differences. Several CWP2 antigens might be necessary in a future Giardia vaccine to provide cross protection against both Giardia assemblages infecting humans.
Substitute decision-making for adults with intellectual disabilities living in residential care: learning through experience.

PubMed

Dunn, Michael C; Clare, Isabel C H; Holland, Anthony J

2008-03-01

In the UK, current policies and services for people with mental disorders, including those with intellectual disabilities (ID), presume that these men and women can, do, and should, make decisions for themselves. The new Mental Capacity Act (England and Wales) 2005 (MCA) sets this presumption into statute, and codifies how decisions relating to health and welfare should be made for those adults judged unable to make one or more such decisions autonomously. The MCA uses a procedural checklist to guide this process of substitute decision-making. The personal experiences of providing direct support to seven men and women with ID living in residential care, however, showed that substitute decision-making took two forms, depending on the type of decision to be made. The first process, 'strategic substitute decision-making', paralleled the MCA's legal and ethical framework, whilst the second process, 'relational substitute decision-making', was markedly different from these statutory procedures. In this setting, 'relational substitute decision-making' underpinned everyday personal and social interventions connected with residents' daily living, and was situated within a framework of interpersonal and interdependent care relationships. The implications of these findings for residential services and the implementation of the MCA are discussed.
MetaPIGA v2.0: maximum likelihood large phylogeny estimation using the metapopulation genetic algorithm and other stochastic heuristics.

PubMed

Helaers, Raphaël; Milinkovitch, Michel C

2010-07-15

The development, in the last decade, of stochastic heuristics implemented in robust application softwares has made large phylogeny inference a key step in most comparative studies involving molecular sequences. Still, the choice of a phylogeny inference software is often dictated by a combination of parameters not related to the raw performance of the implemented algorithm(s) but rather by practical issues such as ergonomics and/or the availability of specific functionalities. Here, we present MetaPIGA v2.0, a robust implementation of several stochastic heuristics for large phylogeny inference (under maximum likelihood), including a Simulated Annealing algorithm, a classical Genetic Algorithm, and the Metapopulation Genetic Algorithm (metaGA) together with complex substitution models, discrete Gamma rate heterogeneity, and the possibility to partition data. MetaPIGA v2.0 also implements the Likelihood Ratio Test, the Akaike Information Criterion, and the Bayesian Information Criterion for automated selection of substitution models that best fit the data. Heuristics and substitution models are highly customizable through manual batch files and command line processing. However, MetaPIGA v2.0 also offers an extensive graphical user interface for parameters setting, generating and running batch files, following run progress, and manipulating result trees. MetaPIGA v2.0 uses standard formats for data sets and trees, is platform independent, runs in 32 and 64-bits systems, and takes advantage of multiprocessor and multicore computers. The metaGA resolves the major problem inherent to classical Genetic Algorithms by maintaining high inter-population variation even under strong intra-population selection. Implementation of the metaGA together with additional stochastic heuristics into a single software will allow rigorous optimization of each heuristic as well as a meaningful comparison of performances among these algorithms. MetaPIGA v2.0 gives access both to high customization for the phylogeneticist, as well as to an ergonomic interface and functionalities assisting the non-specialist for sound inference of large phylogenetic trees using nucleotide sequences. MetaPIGA v2.0 and its extensive user-manual are freely available to academics at http://www.metapiga.org.
MetaPIGA v2.0: maximum likelihood large phylogeny estimation using the metapopulation genetic algorithm and other stochastic heuristics

PubMed Central

2010-01-01

Background The development, in the last decade, of stochastic heuristics implemented in robust application softwares has made large phylogeny inference a key step in most comparative studies involving molecular sequences. Still, the choice of a phylogeny inference software is often dictated by a combination of parameters not related to the raw performance of the implemented algorithm(s) but rather by practical issues such as ergonomics and/or the availability of specific functionalities. Results Here, we present MetaPIGA v2.0, a robust implementation of several stochastic heuristics for large phylogeny inference (under maximum likelihood), including a Simulated Annealing algorithm, a classical Genetic Algorithm, and the Metapopulation Genetic Algorithm (metaGA) together with complex substitution models, discrete Gamma rate heterogeneity, and the possibility to partition data. MetaPIGA v2.0 also implements the Likelihood Ratio Test, the Akaike Information Criterion, and the Bayesian Information Criterion for automated selection of substitution models that best fit the data. Heuristics and substitution models are highly customizable through manual batch files and command line processing. However, MetaPIGA v2.0 also offers an extensive graphical user interface for parameters setting, generating and running batch files, following run progress, and manipulating result trees. MetaPIGA v2.0 uses standard formats for data sets and trees, is platform independent, runs in 32 and 64-bits systems, and takes advantage of multiprocessor and multicore computers. Conclusions The metaGA resolves the major problem inherent to classical Genetic Algorithms by maintaining high inter-population variation even under strong intra-population selection. Implementation of the metaGA together with additional stochastic heuristics into a single software will allow rigorous optimization of each heuristic as well as a meaningful comparison of performances among these algorithms. MetaPIGA v2.0 gives access both to high customization for the phylogeneticist, as well as to an ergonomic interface and functionalities assisting the non-specialist for sound inference of large phylogenetic trees using nucleotide sequences. MetaPIGA v2.0 and its extensive user-manual are freely available to academics at http://www.metapiga.org. PMID:20633263
Mutational Dynamics of Aroid Chloroplast Genomes

PubMed Central

Ahmed, Ibrar; Biggs, Patrick J.; Matthews, Peter J.; Collins, Lesley J.; Hendy, Michael D.; Lockhart, Peter J.

2012-01-01

A characteristic feature of eukaryote and prokaryote genomes is the co-occurrence of nucleotide substitution and insertion/deletion (indel) mutations. Although similar observations have also been made for chloroplast DNA, genome-wide associations have not been reported. We determined the chloroplast genome sequences for two morphotypes of taro (Colocasia esculenta; family Araceae) and compared these with four publicly available aroid chloroplast genomes. Here, we report the extent of genome-wide association between direct and inverted repeats, indels, and substitutions in these aroid chloroplast genomes. We suggest that alternative but not mutually exclusive hypotheses explain the mutational dynamics of chloroplast genome evolution. PMID:23204304
Structure of genes for Hsp30 from the white-rot fungus Coriolus versicolor and the increase of their expression by heat shock and exposure to a hazardous chemical.

PubMed

Iimura, Yosuke; Tatsumi, Kenji

2002-07-01

We isolated and analysed two genomic DNAs that encode the heat-shock protein Hsp30 from Coriolus versicolor. The amino acid sequences substitute only three amino acid substitutions. The promoter regions contain the consensus heat-shock element, a xenobiotic-response element, a stress-response element, and a metal-response element. The levels of mRNAs for Hsp30 increased markedly after exposure of C. versicolor to pentachlorophenol and levels were higher than those after heat shock.
Calibration of Multiple In Silico Tools for Predicting Pathogenicity of Mismatch Repair Gene Missense Substitutions

PubMed Central

Thompson, Bryony A.; Greenblatt, Marc S.; Vallee, Maxime P.; Herkert, Johanna C.; Tessereau, Chloe; Young, Erin L.; Adzhubey, Ivan A.; Li, Biao; Bell, Russell; Feng, Bingjian; Mooney, Sean D.; Radivojac, Predrag; Sunyaev, Shamil R.; Frebourg, Thierry; Hofstra, Robert M.W.; Sijmons, Rolf H.; Boucher, Ken; Thomas, Alun; Goldgar, David E.; Spurdle, Amanda B.; Tavtigian, Sean V.

2015-01-01

Classification of rare missense substitutions observed during genetic testing for patient management is a considerable problem in clinical genetics. The Bayesian integrated evaluation of unclassified variants is a solution originally developed for BRCA1/2. Here, we take a step toward an analogous system for the mismatch repair (MMR) genes (MLH1, MSH2, MSH6, and PMS2) that confer colon cancer susceptibility in Lynch syndrome by calibrating in silico tools to estimate prior probabilities of pathogenicity for MMR gene missense substitutions. A qualitative five-class classification system was developed and applied to 143 MMR missense variants. This identified 74 missense substitutions suitable for calibration. These substitutions were scored using six different in silico tools (Align-Grantham Variation Grantham Deviation, multivariate analysis of protein polymorphisms [MAPP], Mut-Pred, PolyPhen-2.1, Sorting Intolerant From Tolerant, and Xvar), using curated MMR multiple sequence alignments where possible. The output from each tool was calibrated by regression against the classifications of the 74 missense substitutions; these calibrated outputs are interpretable as prior probabilities of pathogenicity. MAPP was the most accurate tool and MAPP + PolyPhen-2.1 provided the best-combined model (R2 = 0.62 and area under receiver operating characteristic = 0.93). The MAPP + PolyPhen-2.1 output is sufficiently predictive to feed as a continuous variable into the quantitative Bayesian integrated evaluation for clinical classification of MMR gene missense substitutions. PMID:22949387
Genetic evidence for contribution of human dispersal to the genetic diversity of EBA-175 in Plasmodium falciparum.

PubMed

Yasukochi, Yoshiki; Naka, Izumi; Patarapotikul, Jintana; Hananantachai, Hathairad; Ohashi, Jun

2015-08-01

The 175-kDa erythrocyte binding antigen (EBA-175) of Plasmodium falciparum plays a crucial role in merozoite invasion into human erythrocytes. EBA-175 is believed to have been under diversifying selection; however, there have been no studies investigating the effect of dispersal of humans out of Africa on the genetic variation of EBA-175 in P. falciparum. The PCR-direct sequencing was performed for a part of the eba-175 gene (regions II and III) using DNA samples obtained from Thai patients infected with P. falciparum. The divergence times for the P. falciparum eba-175 alleles were estimated assuming that P. falciparum/Plasmodium reichenowi divergence occurred 6 million years ago (MYA). To examine the possibility of diversifying selection, nonsynonymous and synonymous substitution rates for Plasmodium species were also estimated. A total of 32 eba-175 alleles were identified from 131 Thai P. falciparum isolates. Their estimated divergence time was 0.13-0.14 MYA, before the exodus of humans from Africa. A phylogenetic tree for a large sequence dataset of P. falciparum eba-175 alleles from across the world showed the presence of a basal Asian-specific cluster for all P. falciparum sequences. A markedly more nonsynonymous substitutions than synonymous substitutions in region II in P. falciparum was also detected, but not within Plasmodium species parasitizing African apes, suggesting that diversifying selection has acted specifically on P. falciparum eba-175. Plasmodium falciparum eba-175 genetic diversity appeared to increase following the exodus of Asian ancestors from Africa. Diversifying selection may have played an important role in the diversification of eba-175 allelic lineages. The present results suggest that the dispersals of humans out of Africa influenced significantly the molecular evolution of P. falciparum EBA-175.
Mutations that alter a repeated ACCA element located at the 5' end of the Potato virus X genome affect RNA accumulation.

PubMed

Park, Mi-Ri; Kwon, Sun-Jung; Choi, Hong-Soo; Hemenway, Cynthia L; Kim, Kook-Hyung

2008-08-15

The repeated ACCA or AC-rich sequence and structural (SL1) elements in the 5' non-translated region (NTR) of the Potato virus X (PVX) RNA play vital roles in the PVX life cycle by controlling translation, RNA replication, movement, and assembly. It has already been shown that the repeated ACCA or AC-rich sequence affect both gRNA and sgRNA accumulation, while not affecting minus-strand RNA accumulation, and are also required for host protein binding. The functional significance of the repeated ACCA sequence elements in the 5' NTR region was investigated by analyzing the effects of deletion and site-directed mutations on PVX replication in Nicotiana benthamiana plants and NT1 protoplasts. Substitution (ACCA into AAAA or UUUU) mutations introduced in the first (nt 10-13) element in the 5' NTR of the PVX RNA significantly affected viral replication, while mutations introduced in the second (nt 17-20) and third (nt 20-23) elements did not. The fourth (nt 29-32) ACCA element weakly affected virus replication, whereas mutations in the fifth (nt 38-41) significantly reduced virus replication due to the structure disruption of SL1 by AAAA and/or UUUU substitutions. Further characterization of the first ACCA element indicated that duplication of ACCA at nt 10-13 (nt 10-17, ACCAACCA) caused severe symptom development as compared to that of wild type, while deletion of the single element (nt 10-13), DeltaACCA) or tripling of this element caused reduced symptom development. Single- and double-nucleotide substitutions introduced into the first ACCA element revealed the importance of CC located at nt positions 11 and 12. Altogether, these results indicate that the first ACCA element is important for PVX replication.
Genetic diversity of Grapevine virus A in Washington and California vineyards.

PubMed

Alabi, Olufemi J; Al Rwahnih, Maher; Mekuria, Tefera A; Naidu, Rayapati A

2014-05-01

Grapevine virus A (GVA; genus Vitivirus, family Betaflexiviridae) has been implicated with the Kober stem grooving disorder of the rugose wood disease complex. In this study, 26 isolates of GVA recovered from wine grape (Vitis vinifera) cultivars from California and Washington were analyzed for their genetic diversity. An analysis of a portion of the RNA-dependent RNA polymerase (RdRp) and complete coat protein (CP) sequences revealed intra- and inter-isolate sequence diversity. Our results indicated that both RdRp and CP are under strong negative selection based on the normalized values for the ratio of nonsynonymous substitutions per nonsynonymous site to synonymous substitutions per synonymous site. A global phylogenetic analysis of CP sequences revealed segregation of virus isolates into four major clades with no geographic clustering. In contrast, the RdRp-based phylogenetic tree indicated segregation of GVA isolates from California and Washington into six clades, independent of geographic origin or cultivar. Phylogenetic network coupled with recombination analyses showed putative recombination events in both RdRp and CP sequence data sets, with more of these events located in the CP sequence. The preponderance of divergent variants of GVA co-replicating within individual grapevines could increase viral genotypic complexity with implications for phylogenetic analysis and evolutionary history of the virus. The knowledge of genetic diversity of GVA generated in this study will provide a foundation for elucidating the epidemiological characteristics of virus populations at different scales and implementing appropriate management strategies for minimizing the spread of genetic variants of the virus by vectors and via planting materials supplied to nurseries and grape growers.
Genetic Characterization of the Hemagglutinin Genes of Wild-Type Measles Virus Circulating in China, 1993–2009

PubMed Central

Zhu, Zhen; Liu, Chunyu; Mao, Naiying; Ji, Yixin; Wang, Huiling; Jiang, Xiaohong; Li, Chongshan; Tang, Wei; Feng, Daxing; Wang, Changyin; Zheng, Lei; Lei, Yue; Ling, Hua; Zhao, Chunfang; Ma, Yan; He, Jilan; Wang, Yan; Li, Ping; Guan, Ronghui; Zhou, Shujie; Zhou, Jianhui; Wang, Shuang; Zhang, Hong; Zheng, Huanying; Liu, Leng; Ma, Hemuti; Guan, Jing; Lu, Peishan; Feng, Yan; Zhang, Yanjun; Zhou, Shunde; Xiong, Ying; Ba, Zhuoma; Chen, Hui; Yang, Xiuhui; Bo, Fang; Ma, Yujie; Liang, Yong; Lei, Yake; Gu, Suyi; Liu, Wei; Chen, Meng; Featherstone, David; Jee, Youngmee; Bellini, William J.; Rota, Paul A.; Xu, Wenbo

2013-01-01

Background China experienced several large measles outbreaks in the past two decades, and a series of enhanced control measures were implemented to achieve the goal of measles elimination. Molecular epidemiologic surveillance of wild-type measles viruses (MeV) provides valuable information about the viral transmission patterns. Since 1993, virologic surveillnace has confirmed that a single endemic genotype H1 viruses have been predominantly circulating in China. A component of molecular surveillance is to monitor the genetic characteristics of the hemagglutinin (H) gene of MeV, the major target for virus neutralizing antibodies. Principal Findings Analysis of the sequences of the complete H gene from 56 representative wild-type MeV strains circulating in China during 1993–2009 showed that the H gene sequences were clustered into 2 groups, cluster 1 and cluster 2. Cluster1 strains were the most frequently detected cluster and had a widespread distribution in China after 2000. The predicted amino acid sequences of the H protein were relatively conserved at most of the functionally significant amino acid positions. However, most of the genotype H1 cluster1 viruses had an amino acid substitution (Ser240Asn), which removed a predicted N-linked glycosylation site. In addition, the substitution of Pro397Leu in the hemagglutinin noose epitope (HNE) was identified in 23 of 56 strains. The evolutionary rate of the H gene of the genotype H1 viruses was estimated to be approximately 0.76×10−3 substitutions per site per year, and the ratio of dN to dS (dN/dS) was <1 indicating the absence of selective pressure. Conclusions Although H genes of the genotype H1 strains were conserved and not subjected to selective pressure, several amino acid substitutions were observed in functionally important positions. Therefore the antigenic and genetic properties of H genes of wild-type MeVs should be monitored as part of routine molecular surveillance for measles in China. PMID:24073194
Evolution of I-SceI Homing Endonucleases with Increased DNA Recognition Site Specificity

DOE Office of Scientific and Technical Information (OSTI.GOV)

Joshi, Rakesh; Ho, Kwok Ki; Tenney, Kristen

2013-09-18

Elucidating how homing endonucleases undergo changes in recognition site specificity will facilitate efforts to engineer proteins for gene therapy applications. I-SceI is a monomeric homing endonuclease that recognizes and cleaves within an 18-bp target. It tolerates limited degeneracy in its target sequence, including substitution of a C:G{sub +4} base pair for the wild-type A:T{sub +4} base pair. Libraries encoding randomized amino acids at I-SceI residue positions that contact or are proximal to A:T{sub +4} were used in conjunction with a bacterial one-hybrid system to select I-SceI derivatives that bind to recognition sites containing either the A:T{sub +4} or the C:G{submore » +4} base pairs. As expected, isolates encoding wild-type residues at the randomized positions were selected using either target sequence. All I-SceI proteins isolated using the C:G{sub +4} recognition site included small side-chain substitutions at G100 and either contained (K86R/G100T, K86R/G100S and K86R/G100C) or lacked (G100A, G100T) a K86R substitution. Interestingly, the binding affinities of the selected variants for the wild-type A:T{sub +4} target are 4- to 11-fold lower than that of wild-type I-SceI, whereas those for the C:G{sub +4} target are similar. The increased specificity of the mutant proteins is also evident in binding experiments in vivo. These differences in binding affinities account for the observed -36-fold difference in target preference between the K86R/G100T and wild-type proteins in DNA cleavage assays. An X-ray crystal structure of the K86R/G100T mutant protein bound to a DNA duplex containing the C:G{sub +4} substitution suggests how sequence specificity of a homing enzyme can increase. This biochemical and structural analysis defines one pathway by which site specificity is augmented for a homing endonuclease.« less
Functional Characterization of Adaptive Mutations during the West African Ebola Virus Outbreak.

PubMed

Dietzel, Erik; Schudt, Gordian; Krähling, Verena; Matrosovich, Mikhail; Becker, Stephan

2017-01-15

The Ebola virus (EBOV) outbreak in West Africa started in December 2013, claimed more than 11,000 lives, threatened to destabilize a whole region, and showed how easily health crises can turn into humanitarian disasters. EBOV genomic sequences of the West African outbreak revealed nonsynonymous mutations, which induced considerable public attention, but their role in virus spread and disease remains obscure. In this study, we investigated the functional significance of three nonsynonymous mutations that emerged early during the West African EBOV outbreak. Almost 90% of more than 1,000 EBOV genomes sequenced during the outbreak carried the signature of three mutations: a D759G substitution in the active center of the L polymerase, an A82V substitution in the receptor binding domain of surface glycoprotein GP, and an R111C substitution in the self-assembly domain of RNA-encapsidating nucleoprotein NP. Using a newly developed virus-like particle system and reverse genetics, we found that the mutations have an impact on the functions of the respective viral proteins and on the growth of recombinant EBOVs. The mutation in L increased viral transcription and replication, whereas the mutation in NP decreased viral transcription and replication. The mutation in the receptor binding domain of the glycoprotein GP improved the efficiency of GP-mediated viral entry into target cells. Recombinant EBOVs with combinations of the three mutations showed a growth advantage over the prototype isolate Makona C7 lacking the mutations. This study showed that virus variants with improved fitness emerged early during the West African EBOV outbreak. The dimension of the Ebola virus outbreak in West Africa was unprecedented. Amino acid substitutions in the viral L polymerase, surface glycoprotein GP, and nucleocapsid protein NP emerged, were fixed early in the outbreak, and were found in almost 90% of the sequences. Here we showed that these mutations affected the functional activity of viral proteins and improved viral growth in cell culture. Our results demonstrate emergence of adaptive changes in the Ebola virus genome during virus circulation in humans and prompt further studies on the potential role of these changes in virus transmissibility and pathogenicity. Copyright © 2017 American Society for Microbiology.
Variants of beta-glucosidase

DOEpatents

Fidantsef, Ana; Lamsa, Michael; Gorre-Clancy, Brian

2015-07-14

The present invention relates to variants of a parent beta-glucosidase, comprising a substitution at one or more positions corresponding to positions 142, 183, 266, and 703 of amino acids 1 to 842 of SEQ ID NO: 2 or corresponding to positions 142, 183, 266, and 705 of amino acids 1 to 844 of SEQ ID NO: 70, wherein the variant has beta-glucosidase activity. The present invention also relates to nucleotide sequences encoding the variant beta-glucosidases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.
Peptoid architectures: elaboration, actuation, and application.

PubMed

Yoo, Barney; Kirshenbaum, Kent

2008-12-01

Peptoids are peptidomimetic oligomers composed of N-substituted glycine units. Their convenient synthesis enables strict control over the sequence of highly diverse monomers and is capable of generating extensive compound libraries. Recent studies are beginning to explore the relationship between peptoid sequence, structure and function. We describe new approaches to direct the conformation of the peptoid backbone, leading to secondary structures such as helices, loops, and turns. These advances are enabling the discovery of bioactive peptoids and will establish modules for the design and assembly of protein mimetics.
Variants of beta-glucosidases

DOEpatents

Fidantsef, Ana; Lamsa, Michael; Gorre-Clancy, Brian

2014-10-07

The present invention relates to variants of a parent beta-glucosidase, comprising a substitution at one or more positions corresponding to positions 142, 183, 266, and 703 of amino acids 1 to 842 of SEQ ID NO: 2 or corresponding to positions 142, 183, 266, and 705 of amino acids 1 to 844 of SEQ ID NO: 70, wherein the variant has beta-glucosidase activity. The present invention also relates to nucleotide sequences encoding the variant beta-glucosidases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.
Variants of beta-glucosidase

DOEpatents

Fidantsef, Ana [Davis, CA; Lamsa, Michael [Davis, CA; Gorre-Clancy, Brian [Elk Grove, CA

2009-12-29

The present invention relates to variants of a parent beta-glucosidase, comprising a substitution at one or more positions corresponding to positions 142, 183, 266, and 703 of amino acids 1 to 842 of SEQ ID NO: 2 or corresponding to positions 142, 183, 266, and 705 of amino acids 1 to 844 of SEQ ID NO: 70, wherein the variant has beta-glucosidase activity. The present invention also relates to nucleotide sequences encoding the variant beta-glucosidases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.
77 FR 42554 - Proposed Information Collection (Authorization To Substitute a Claim of a Deceased Claimant...

Federal Register 2010, 2011, 2012, 2013, 2014

2012-07-19

... needed to allow claimants to request substitution for a claimant, who passed away, prior to VA processing... away, prior to VA processing a claim to completion. This is only allowed when a claimant dies while a...
Routine HLA-B genotyping with PCR-sequence-specific oligonucleotides detects a B*52 variant (B*5206).

PubMed

Hoelsch, K; Lenggeler, I; Pfannes, W; Knabe, H; Klein, H-G; Woelpl, A

2005-05-01

A new human leukocyte antigen (HLA)-B allele was found during routine typing of samples for a German unrelated bone marrow donor registry, the "Aktion Knochenmarkspende Bayern". After first interpretation of data of two independent low-resolution sequence-specific oligonucleotide typing tests, a B*51 variant was suggested. Further analysis via sequence-based typing identified the sequence as new B*52 allele. This new allele officially assigned as B*5206 differs from HLA-B*520102 by one nucleotide exchange in exon 2. The mutation is located at nucleotide position 274, at which a cytosine is substituted by a thymine leading to an amino acid change at protein position 67 from serine (TCC) to phenylalanine (TTC).

Pop Goes the Poster! Tips for Effective Poster Design.

ERIC Educational Resources Information Center

Helmken, Charles M.

1979-01-01

Posters' power in academic advertising to persuade and inform is successful when message and graphics are combined. Design principles and techniques are identified: symbolism, symbiosis, substitution, sequence, scale, silhouette, script, spectrum, and simplicity. Designers should avoid: typographic fog, color confusion, excessive elements, and…
Candida sanyaensis sp. nov., an ascomycetous yeast species isolated from soil.

PubMed

Hui, Feng-Li; Niu, Qiu-Hong; Ke, Tao; Li, Ying-Xia; Lee, Ching-Fu

2013-01-01

Strains representing a novel ascomycetous yeast species, Candida sanyaensis, were isolated from soil samples collected on Hainan Island and Taiwan Island in China. Analysis of the D1/D2 domains of the large subunit (LUS) rRNA gene and internal transcribed spacer (ITS) regions of these strains showed that this species was related to Candida tropicalis and Candida sojae, their closest relatives. C. sanyaensis differed by three substitutions and one gap from C. tropicalis, and by four substitutions and one gap from C. sojae, in the D1/D2 domain sequences. However, the ITS sequences of C. sanyaensis were quite divergent from the latter two species, showing that it is a genetically separate species. The novel strains were also found to have very similar PCR-fingerprinting profiles which were quite distinct from those of C. tropicalis and C. sojae strains. The type strain of C. sanyaensis is HN-26(T) (= CICC 1979(T) = CBS 12637(T)).
Sulfur-doped Graphene Nanoribbons with a Sequence of Distinct Band Gaps

NASA Astrophysics Data System (ADS)

Du, Shi-Xuan; Zhang, Yan-Fang; Zhang, Yi; Berger, Reinhard; Feng, Xinliang; Mullen, Klaus; Lin, Xiao; Zhang, Yu-Yang; Pantelides, Sokrates T.; Gao, Hong-Jun

Unlike free-standing graphene, graphene nanoribbons (GNRs) can possess semiconducting band gap. However, achieving such control has been a major challenge in the fabrication of GNRs. Chevron-type GNRs were recently achieved by surface-assisted polymerization of pristine or N-substituted oligophenylene monomers. By mixing two different monomers, GNR heterojunctions can in principle be fabricated. Here we report fabrication and characterization of chevron-type GNRs by using sulfur-substituted oligophenylene monomers to achieve GNRs and related heterostructures for the first time. Importantly, our first-principles calculations show that the band gaps of GNRs can be tailored by different S configurations in cyclodehydrogenated isomers through debromination and intramolecular cyclodehydrogenation. This feature should open up new avenues to create multiple GNR heterojunctions by engineering the sulfur configurations. These predictions have been confirmed by Scanning Tunneling Microscopy (STM) and Scanning Tunneling Spectroscopy (STS). The unusual sequence of intraribbon heterojunctions may be useful for nanoscale optoelectronic applications based on quantum dots
Synonymous Mutations at the Beginning of the Influenza A Virus Hemagglutinin Gene Impact Experimental Fitness.

PubMed

Canale, Aneth S; Venev, Sergey V; Whitfield, Troy W; Caffrey, Daniel R; Marasco, Wayne A; Schiffer, Celia A; Kowalik, Timothy F; Jensen, Jeffrey D; Finberg, Robert W; Zeldovich, Konstantin B; Wang, Jennifer P; Bolon, Daniel N A

2018-04-13

The fitness effects of synonymous mutations can provide insights into biological and evolutionary mechanisms. We analyzed the experimental fitness effects of all single-nucleotide mutations, including synonymous substitutions, at the beginning of the influenza A virus hemagglutinin (HA) gene. Many synonymous substitutions were deleterious both in bulk competition and for individually isolated clones. Investigating protein and RNA levels of a subset of individually expressed HA variants revealed that multiple biochemical properties contribute to the observed experimental fitness effects. Our results indicate that a structural element in the HA segment viral RNA may influence fitness. Examination of naturally evolved sequences in human hosts indicates a preference for the unfolded state of this structural element compared to that found in swine hosts. Our overall results reveal that synonymous mutations may have greater fitness consequences than indicated by simple models of sequence conservation, and we discuss the implications of this finding for commonly used evolutionary tests and analyses. Copyright © 2018. Published by Elsevier Ltd.
Novel human CRYGD rare variant in a Brazilian family with congenital cataract

PubMed Central

Giordano, Gabriel Gorgone; Tavares, Anderson; da Silva, Márcio José; de Vasconcellos, José Paulo Cabral; Arieta, Carlos Eduardo Leite; de Melo, Mônica Barbosa

2011-01-01

Purpose To describe a novel polymorphism in the γD-crystallin (CRYGD) gene in a Brazilian family with congenital cataract. Methods A Brazilian four-generation family was analyzed. The proband had bilateral lamellar cataract and the phenotypes were classified by slit lamp examination. Genomic DNA was extracted from peripheral blood and coding regions and intron/exon boundaries of the αA-crystallin (CRYAA), γC-crystallin (CRYGC), and CRYGD genes were amplified by polymerase chain reaction and directly sequenced. Results Sequencing of the coding regions of CRYGD showed the presence of a heterozygous A→G transversion at c.401 position, which results in the substitution of a tyrosine to a cysteine (Y134C). The polymorphism was identified in three individuals, two affected and one unaffected. Conclusions A novel rare variant in CRYGD (Y134C) was detected in a Brazilian family with congenital cataract. Because there is no segregation between the substitution and the phenotypes in this family, other genetic alterations are likely to be present. PMID:21866214
Fatal canine distemper virus infection of giant pandas in China

PubMed Central

Feng, Na; Yu, Yicong; Wang, Tiecheng; Wilker, Peter; Wang, Jianzhong; Li, Yuanguo; Sun, Zhe; Gao, Yuwei; Xia, Xianzhu

2016-01-01

We report an outbreak of canine distemper virus (CDV) infection among endangered giant pandas (Ailuropoda melanoleuca). Five of six CDV infected giant pandas died. The surviving giant panda was previously vaccinated against CDV. Genomic sequencing of CDV isolated from one of the infected pandas (giant panda/SX/2014) suggests it belongs to the Asia-1 cluster. The hemagglutinin protein of the isolated virus and virus sequenced from lung samples originating from deceased giant pandas all possessed the substitutions V26M, T213A, K281R, S300N, P340Q, and Y549H. The presence of the Y549H substitution is notable as it is found at the signaling lymphocytic activation molecule (SLAM) receptor-binding site and has been implicated in the emergence of highly pathogenic CDV and host switching. These findings demonstrate that giant pandas are susceptible to CDV and suggest that surveillance and vaccination among all captive giant pandas are warranted to support conservation efforts for this endangered species. PMID:27310722
Fatal canine distemper virus infection of giant pandas in China.

PubMed

Feng, Na; Yu, Yicong; Wang, Tiecheng; Wilker, Peter; Wang, Jianzhong; Li, Yuanguo; Sun, Zhe; Gao, Yuwei; Xia, Xianzhu

2016-06-16

We report an outbreak of canine distemper virus (CDV) infection among endangered giant pandas (Ailuropoda melanoleuca). Five of six CDV infected giant pandas died. The surviving giant panda was previously vaccinated against CDV. Genomic sequencing of CDV isolated from one of the infected pandas (giant panda/SX/2014) suggests it belongs to the Asia-1 cluster. The hemagglutinin protein of the isolated virus and virus sequenced from lung samples originating from deceased giant pandas all possessed the substitutions V26M, T213A, K281R, S300N, P340Q, and Y549H. The presence of the Y549H substitution is notable as it is found at the signaling lymphocytic activation molecule (SLAM) receptor-binding site and has been implicated in the emergence of highly pathogenic CDV and host switching. These findings demonstrate that giant pandas are susceptible to CDV and suggest that surveillance and vaccination among all captive giant pandas are warranted to support conservation efforts for this endangered species.
Studies of acrylamide level in coffee and coffee substitutes: influence of raw material and manufacturing conditions.

PubMed

Mojska, Hanna; Gielecińska, Iwona

2013-01-01

Many animal studies have shown that acrylamide is both neurotoxic and carcinogenic. The first reports of acrylamide actually having been found in foodstuffs were published in 2002 by the Swedish National Food Agency in conjunction with scientists from the University of Stockholm. It has since been demonstrated that acrylamide arises in foodstuffs by the Maillard reaction, ie. between free asparagine and reducing sugars at temperatures >120 degrees C. Coffee in fact, forms one of the principal dietary sources of acrylamide, where it is normally drunk in large quantities throughout many countries worldwide that includes Poland. Thus, it constitutes a major dietary component in a wide range of population groups, mainly ranging from late adolescents to the elderly. To determine the acrylamide level in commercial samples of roasted and instant coffee and in coffee substitutes by LC-MS/MS method. The influence of coffee species and colour intensity of coffee on acrylamide level was also detailed. A total of 42 samples of coffee were analysed which included 28 that were ground roasted coffee, 11 instant coffees and 3 coffee substitutes (grain coffee). Analytical separation of acrylamide from coffee was performed by liquid chromatography followed by tandem mass spectrometry (LC-MS/MS). To evaluate the colour intensity of ground roasted coffee and instant coffee we used method of arranging (sequence). The highest mean acrylamide concentrations were found in coffee substitutes (818 pg/kg) followed by instant coffee (358 microg/kg) and then roasted coffee (179 microg/kg). One single cup of coffee (160 ml) delivered on average from 0.45 microg acrylamide in roasted coffee to 3.21 microg in coffee substitutes. There were no significant differences in acrylamide level between the coffee species ie. Arabica vs Robusta or a mixture thereof. The various methods of coffee manufacture also showed no differences in acrylamide (ie. freeze-dried coffee vs agglomerated coffee). A significant negative correlation was observed between acrylamide levels and the intensity of colour in roasted coffee; this was not the case however for instant coffee. It was demonstrated that roasting process had the most significant effect on acrylamide levels in natural coffee, however there were no relationships found with coffee species. Due to the high acrylamide levels demonstrated in coffee substitutes, recommended amounts should be defined and manufacturers should be obliged to reduce such levels in these products.
40 CFR Appendix B to Subpart G of... - Substitutes Subject to Use Restrictions and Unacceptable Substitutes

Code of Federal Regulations, 2013 CFR

2013-07-01

... inches) and right-hand thread direction for CO2 refrigerant service containers.3 Manufacturers should... Failure Mode and Effect Analysis in Manufacturing and Assembly Process [Process FMEA] on the MVAC as... submitted to demonstrate it can be used safely in this end-use. CFC-11, CFC-12, R-502 Industrial Process...
40 CFR Appendix B to Subpart G of... - Substitutes Subject to Use Restrictions and Unacceptable Substitutes

Code of Federal Regulations, 2014 CFR

2014-07-01

... inches) and right-hand thread direction for CO2 refrigerant service containers.3 Manufacturers should... Failure Mode and Effect Analysis in Manufacturing and Assembly Process [Process FMEA] on the MVAC as... submitted to demonstrate it can be used safely in this end-use. CFC-11, CFC-12, R-502 Industrial Process...
Rare, evolutionarily unlikely missense substitutions in CHEK2 contribute to breast cancer susceptibility: results from a breast cancer family registry case-control mutation-screening study.

PubMed

Le Calvez-Kelm, Florence; Lesueur, Fabienne; Damiola, Francesca; Vallée, Maxime; Voegele, Catherine; Babikyan, Davit; Durand, Geoffroy; Forey, Nathalie; McKay-Chopin, Sandrine; Robinot, Nivonirina; Nguyen-Dumont, Tù; Thomas, Alun; Byrnes, Graham B; Hopper, John L; Southey, Melissa C; Andrulis, Irene L; John, Esther M; Tavtigian, Sean V

2011-01-18

Both protein-truncating variants and some missense substitutions in CHEK2 confer increased risk of breast cancer. However, no large-scale study has used full open reading frame mutation screening to assess the contribution of rare missense substitutions in CHEK2 to breast cancer risk. This absence has been due in part to a lack of validated statistical methods for summarizing risk attributable to large numbers of individually rare missense substitutions. Previously, we adapted an in silico assessment of missense substitutions used for analysis of unclassified missense substitutions in BRCA1 and BRCA2 to the problem of assessing candidate genes using rare missense substitution data observed in case-control mutation-screening studies. The method involves stratifying rare missense substitutions observed in cases and/or controls into a series of grades ordered a priori from least to most likely to be evolutionarily deleterious, followed by a logistic regression test for trends to compare the frequency distributions of the graded missense substitutions in cases versus controls. Here we used this approach to analyze CHEK2 mutation-screening data from a population-based series of 1,303 female breast cancer patients and 1,109 unaffected female controls. We found evidence of risk associated with rare, evolutionarily unlikely CHEK2 missense substitutions. Additional findings were that (1) the risk estimate for the most severe grade of CHEK2 missense substitutions (denoted C65) is approximately equivalent to that of CHEK2 protein-truncating variants; (2) the population attributable fraction and the familial relative risk explained by the pool of rare missense substitutions were similar to those explained by the pool of protein-truncating variants; and (3) post hoc power calculations implied that scaling up case-control mutation screening to examine entire biochemical pathways would require roughly 2,000 cases and controls to achieve acceptable statistical power. This study shows that CHEK2 harbors many rare sequence variants that confer increased risk of breast cancer and that a substantial proportion of these are missense substitutions. The study validates our analytic approach to rare missense substitutions and provides a method to combine data from protein-truncating variants and rare missense substitutions into a one degree of freedom per gene test.
Spectroscopic and DFT Study of RhIII Chloro Complex Transformation in Alkaline Solutions.

PubMed

Vasilchenko, Danila B; Berdyugin, Semen N; Korenev, Sergey V; O'Kennedy, Sean; Gerber, Wilhelmus J

2017-09-05

The hydrolysis of [RhCl 6 ] 3- in NaOH-water solutions was studied by spectrophotometric methods. The reaction proceeds via successive substitution of chloride with hydroxide to quantitatively form [Rh(OH) 6 ] 3- . Ligand substitution kinetics was studied in an aqueous 0.434-1.085 M NaOH matrix in the temperature range 5.5-15.3 °C. Transformation of [RhCl 6 ] 3- into [RhCl 5 (OH)] 3- was found to be the rate-determining step with activation parameters of ΔH † = 105 ± 4 kJ mol -1 and ΔS † = 59 ± 10 J K -1 mol -1 . The coordinated hydroxo ligand(s) induces rapid ligand substitution to form [Rh(OH) 6 ] 3- . By simulating ligand substitution as a dissociative mechanism, using density functional theory (DFT), we can now explain the relatively fast and slow kinetics of chloride substitution in basic and acidic matrices, respectively. Moreover, the DFT calculated activation energies corroborated experimental data that the kinetic stereochemical sequence of [RhCl 6 ] 3- hydrolysis in an acidic solution proceeds as [RhCl 6 ] 3- → [RhCl 5 (H 2 O)] 2- → cis-[RhCl 4 (H 2 O) 2 ] - . However, DFT calculations predict in a basic solution the trans route of substitution [RhCl 6 ] 3- → [RhCl 5 (OH)] 3- → trans-[RhCl 4 (OH) 2 ] 3- is kinetically favored.
The antigenic surface of staphylococcal nuclease. II. Analysis of the N-1 epitope by site-directed mutagenesis.

PubMed

Smith, A M; Benjamin, D C

1991-02-15

Previous studies in our laboratory on the production and isolation of a panel of mAb to staphylococcal nuclease allowed us to define a series of eight overlapping epitopes. Using site-directed mutagenesis of the nuclease coding sequences we were able to map the nonoverlapping epitopes recognized by two members of this panel. In the study reported here, we report the generation and analysis of a number of single amino acid substitutions for seven surface residues predicted to lie within one of these two epitopes. Immunochemical analysis showed that one or more substitutions at each of these seven positions had a major effect on mAb binding, whereas other substitutions had none. Based on the nature of these substitutions and the chemical and physical properties of the variant molecules, we believe that any structural effects induced by these substitutions are local and do not result in long-range structural alterations that indirectly influence antibody reactivity. Therefore, we conclude that disruption of mAb binding can be directly attributed to changes in amino acid side chains and that not only are all seven of the residues studied part of the epitope but all seven make contact with the antibody combining site. These studies demonstrate the advantages of using site-directed mutagenesis to study antigen structure and emphasize the importance of constructing the examining multiple substitutions for any given amino acid.
Homogeneity of the 16S rDNA sequence among geographically disparate isolates of Taylorella equigenitalis

PubMed Central

Matsuda, M; Tazumi, A; Kagawa, S; Sekizuka, T; Murayama, O; Moore, JE; Millar, BC

2006-01-01

Background At present, six accessible sequences of 16S rDNA from Taylorella equigenitalis (T. equigenitalis) are available, whose sequence differences occur at a few nucleotide positions. Thus it is important to determine these sequences from additional strains in other countries, if possible, in order to clarify any anomalies regarding 16S rDNA sequence heterogeneity. Here, we clone and sequence the approximate full-length 16S rDNA from additional strains of T. equigenitalis isolated in Japan, Australia and France and compare these sequences to the existing published sequences. Results Clarification of any anomalies regarding 16S rDNA sequence heterogeneity of T. equigenitalis was carried out. When cloning, sequencing and comparison of the approximate full-length 16S rDNA from 17 strains of T. equigenitalis isolated in Japan, Australia and France, nucleotide sequence differences were demonstrated at the six loci in the 1,469 nucleotide sequence. Moreover, 12 polymorphic sites occurred among 23 sequences of the 16S rDNA, including the six reference sequences. Conclusion High sequence similarity (99.5% or more) was observed throughout, except from nucleotide positions 138 to 501 where substitutions and deletions were noted. PMID:16398935
Mammalian prions: tolerance to sequence changes-how far?

PubMed

Salamat, Muhammad Khalid; Munoz-Montesino, Carola; Moudjou, Mohammed; Rezaei, Human; Laude, Hubert; Béringue, Vincent; Dron, Michel

2013-01-01

Upon prion infection, abnormal prion protein (PrP (Sc) ) self-perpetuate by conformational conversion of α-helix-rich PrP (C) into β sheet enriched form, leading to formation and deposition of PrP (Sc) aggregates in affected brains. However the process remains poorly understood at the molecular level and the regions of PrP critical for conversion are still debated. Minimal amino acid substitutions can impair prion replication at many places in PrP. Conversely, we recently showed that bona fide prions could be generated after introduction of eight and up to 16 additional amino acids in the H2-H3 inter-helix loop of PrP. Prion replication also accommodated the insertions of an octapeptide at different places in the last turns of H2. This reverse genetic approach reveals an unexpected tolerance of prions to substantial sequence changes in the protease-resistant part which is associated with infectivity. It also demonstrates that conversion does not require the presence of a specific sequence in the middle of the H2-H3 area. We discuss the implications of our findings according to different structural models proposed for PrP (Sc) and questioned the postulated existence of an N- or C-terminal prion domain in the protease-resistant region.
Comparative Analyses of DNA Methylation and Sequence Evolution Using Nasonia Genomes

PubMed Central

Park, Jungsun; Peng, Zuogang; Zeng, Jia; Elango, Navin; Park, Taesung; Wheeler, Dave; Werren, John H.; Yi, Soojin V.

2011-01-01

The functional and evolutionary significance of DNA methylation in insect genomes remains to be resolved. Nasonia is well situated for comparative analyses of DNA methylation and genome evolution, since the genomes of a moderately distant outgroup species as well as closely related sibling species are available. Using direct sequencing of bisulfite-converted DNA, we uncovered a substantial level of DNA methylation in 17 of 18 Nasonia vitripennis genes and a strong correlation between methylation level and CpG depletion. Notably, in the sex-determining locus transformer, the exon that is alternatively spliced between the sexes is heavily methylated in both males and females, whereas other exons are only sparsely methylated. Orthologous genes of the honeybee and Nasonia show highly similar relative levels of CpG depletion, despite ∼190 My divergence. Densely and sparsely methylated genes in these species also exhibit similar functional enrichments. We found that the degree of CpG depletion is negatively correlated with substitution rates between closely related Nasonia species for synonymous, nonsynonymous, and intron sites. This suggests that mutation rates increase with decreasing levels of germ line methylation. Thus, DNA methylation is prevalent in the Nasonia genome, may participate in regulatory processes such as sex determination and alternative splicing, and is correlated with several aspects of genome and sequence evolution. PMID:21693438
REDO: RNA Editing Detection in Plant Organelles Based on Variant Calling Results.

PubMed

Wu, Shuangyang; Liu, Wanfei; Aljohi, Hasan Awad; Alromaih, Sarah A; Alanazi, Ibrahim O; Lin, Qiang; Yu, Jun; Hu, Songnian

2018-05-01

RNA editing is a post-transcriptional or cotranscriptional process that changes the sequence of the precursor transcript by substitutions, insertions, or deletions. Almost all of the land plants undergo RNA editing in organelles (plastids and mitochondria). Although several software tools have been developed to identify RNA editing events, there has been a great challenge to distinguish true RNA editing events from genome variation, sequencing errors, and other factors. Here we introduce REDO, a comprehensive application tool for identifying RNA editing events in plant organelles based on variant call format files from RNA-sequencing data. REDO is a suite of Perl scripts that illustrate a bunch of attributes of RNA editing events in figures and tables. REDO can also detect RNA editing events in multiple samples simultaneously and identify the significant differential proportion of RNA editing loci. Comparing with similar tools, such as REDItools, REDO runs faster with higher accuracy, and more specificity at the cost of slightly lower sensitivity. Moreover, REDO annotates each RNA editing site in RNAs, whereas REDItools reports only possible RNA editing sites in genome, which need additional steps to obtain RNA editing profiles for RNAs. Overall, REDO can identify potential RNA editing sites easily and provide several functions such as detailed annotations, statistics, figures, and significantly differential proportion of RNA editing sites among different samples.
Genetic Diversity in Oxytocin Ligands and Receptors in New World Monkeys

PubMed Central

Ren, Dongren; Lu, Guoqing; Moriyama, Hideaki; Mustoe, Aaryn C.; Harrison, Emily B.; French, Jeffrey A.

2015-01-01

Oxytocin (OXT) is an important neurohypophyseal hormone that influences wide spectrum of reproductive and social processes. Eutherian mammals possess a highly conserved sequence of OXT (Cys-Tyr-Ile-Gln-Asn-Cys-Pro-Leu-Gly). However, in this study, we sequenced the coding region for OXT in 22 species covering all New World monkeys (NWM) genera and clades, and characterize five OXT variants, including consensus mammalian Leu8-OXT, major variant Pro8-OXT, and three previously unreported variants: Ala8-OXT, Thr8-OXT, and Phe2-OXT. Pro8-OXT shows clear structural and physicochemical differences from Leu8-OXT. We report multiple predicted amino acid substitutions in the G protein-coupled OXT receptor (OXTR), especially in the critical N-terminus, which is crucial for OXT recognition and binding. Genera with same Pro8-OXT tend to cluster together on a phylogenetic tree based on OXTR sequence, and we demonstrate significant coevolution between OXT and OXTR. NWM species are characterized by high incidence of social monogamy, and we document an association between OXTR phylogeny and social monogamy. Our results demonstrate remarkable genetic diversity in the NWM OXT/OXTR system, which can provide a foundation for molecular, pharmacological, and behavioral studies of the role of OXT signaling in regulating complex social phenotypes. PMID:25938568
Application of Mean of Absolute Deviation Method for the Selection of Best Nonlinear Component Based on Video Encryption

NASA Astrophysics Data System (ADS)

Anees, Amir; Khan, Waqar Ahmad; Gondal, Muhammad Asif; Hussain, Iqtadar

2013-07-01

The aim of this work is to make use of the mean of absolute deviation (MAD) method for the evaluation process of substitution boxes used in the advanced encryption standard. In this paper, we use the MAD technique to analyze some popular and prevailing substitution boxes used in encryption processes. In particular, MAD is applied to advanced encryption standard (AES), affine power affine (APA), Gray, Lui J., Residue Prime, S8 AES, SKIPJACK, and Xyi substitution boxes.
Modulation of DNA-Polyamide Interaction by β-alanine Substitutions: A Study of Positional Effects on Binding Affinity, Kinetics and Thermodynamics

PubMed Central

Wang, Shuo; Aston, Karl; Koeller, Kevin J.; Harris, G. Davis; Rath, Nigam P.

2014-01-01

Hairpin polyamides (PAs) are an important class of sequence-specific DNA minor groove binders, and frequently employ a flexible motif, β-alanine (β), to reduce the molecular rigidity to maintain the DNA recognition register. To better understand the diverse effects β can have on DNA-PA binding affinity, selectivity, and especially kinetics, which have rarely been reported, we have initiated a detailed study for an eight-heterocyclic hairpin PA and its β derivatives with their cognate and mutant sequences. With these derivatives, all internal pyrroles of the parent PA are systematically substituted with single or double βs. A set of complementary experiments have been conducted to evaluate the molecular interactions in detail: UV-melting, biosensor-surface plasmon resonance, circular dichroism and isothermal titration calorimetry. The β substitutions generally weaken the binding affinities of these PAs with cognate DNA, and have large and diverse influences on PA binding kinetics in a position- and number-dependent manner. The DNA base mutations have also shown positional effects on binding of a single PA. Besides the β substitutions, the monocationic Dp group [3-(dimethylamino) propylamine] in parent PA has been modified into a dicationic Ta group (3, 3'-Diamino-N-methyldipropylamine) to minimize the frequently observed PA aggregation with ITC experiments. The results clearly show that the Ta modification not only maintains the DNA binding mode and affinity of PA, but also significantly reduces PA aggregation and allows the complete thermodynamic signature of eight-ring hairpin PA to be determined for the first time. This combined set of results significantly extends our understanding of the energetic basis of specific DNA recognition by PAs. PMID:25141096

Plastid-Nuclear Interaction and Accelerated Coevolution in Plastid Ribosomal Genes in Geraniaceae.

PubMed

Weng, Mao-Lun; Ruhlman, Tracey A; Jansen, Robert K

2016-06-27

Plastids and mitochondria have many protein complexes that include subunits encoded by organelle and nuclear genomes. In animal cells, compensatory evolution between mitochondrial and nuclear-encoded subunits was identified and the high mitochondrial mutation rates were hypothesized to drive compensatory evolution in nuclear genomes. In plant cells, compensatory evolution between plastid and nucleus has rarely been investigated in a phylogenetic framework. To investigate plastid-nuclear coevolution, we focused on plastid ribosomal protein genes that are encoded by plastid and nuclear genomes from 27 Geraniales species. Substitution rates were compared for five sets of genes representing plastid- and nuclear-encoded ribosomal subunit proteins targeted to the cytosol or the plastid as well as nonribosomal protein controls. We found that nonsynonymous substitution rates (dN) and the ratios of nonsynonymous to synonymous substitution rates (ω) were accelerated in both plastid- (CpRP) and nuclear-encoded subunits (NuCpRP) of the plastid ribosome relative to control sequences. Our analyses revealed strong signals of cytonuclear coevolution between plastid- and nuclear-encoded subunits, in which nonsynonymous substitutions in CpRP and NuCpRP tend to occur along the same branches in the Geraniaceae phylogeny. This coevolution pattern cannot be explained by physical interaction between amino acid residues. The forces driving accelerated coevolution varied with cellular compartment of the sequence. Increased ω in CpRP was mainly due to intensified positive selection whereas increased ω in NuCpRP was caused by relaxed purifying selection. In addition, the many indels identified in plastid rRNA genes in Geraniaceae may have contributed to changes in plastid subunits. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Gene Augmentation Therapy for a Missense Substitution in the cGMP-Binding Domain of Ovine CNGA3 Gene Restores Vision in Day-Blind Sheep.

PubMed

Gootwine, Elisha; Abu-Siam, Mazen; Obolensky, Alexey; Rosov, Alex; Honig, Hen; Nitzan, Tali; Shirak, Andrey; Ezra-Elia, Raaya; Yamin, Esther; Banin, Eyal; Averbukh, Edward; Hauswirth, William W; Ofri, Ron; Seroussi, Eyal

2017-03-01

Applying CNGA3 gene augmentation therapy to cure a novel causative mutation underlying achromatopsia (ACHM) in sheep. Impaired vision that spontaneously appeared in newborn lambs was characterized by behavioral, electroretinographic (ERG), and histologic techniques. Deep-sequencing reads of an affected lamb and an unaffected lamb were compared within conserved genomic regions orthologous to human genes involved in similar visual impairment. Observed nonsynonymous amino acid substitutions were classified by their deleteriousness score. The putative causative mutation was assessed by producing compound CNGA3 heterozygotes and applying gene augmentation therapy using the orthologous human cDNA. Behavioral assessment revealed day blindness, and subsequent ERG examination showed attenuated photopic responses. Histologic and immunohistochemical examination of affected sheep eyes did not reveal degeneration, and cone photoreceptors expressing CNGA3 were present. Bioinformatics and sequencing analyses suggested a c.1618G>A, p.Gly540Ser substitution in the GMP-binding domain of CNGA3 as the causative mutation. This was confirmed by genetic concordance test and by genetic complementation experiment: All five compound CNGA3 heterozygotes, carrying both p.Arg236* and p.Gly540Ser mutations in CNGA3, were day-blind. Furthermore, subretinal delivery of the intact human CNGA3 gene using an adeno-associated viral vector (AAV) restored photopic vision in two affected p.Gly540Ser homozygous rams. The c.1618G>A, p.Gly540Ser substitution in CNGA3 was identified as the causative mutation for a novel form of ACHM in Awassi sheep. Gene augmentation therapy restored vision in the affected sheep. This novel mutation provides a large-animal model that is valid for most human CNGA3 ACHM patients; the majority of them carry missense rather than premature-termination mutations.
Virus variants with differences in the P1 protein coexist in a Plum pox virus population and display particular host-dependent pathogenicity features.

PubMed

Maliogka, Varvara I; Salvador, Beatriz; Carbonell, Alberto; Sáenz, Pilar; León, David San; Oliveros, Juan Carlos; Delgadillo, Ma Otilia; García, Juan Antonio; Simón-Mateo, Carmen

2012-10-01

Subisolates segregated from an M-type Plum pox virus (PPV) isolate, PPV-PS, differ widely in pathogenicity despite their high degree of sequence similarity. A single amino acid substitution, K109E, in the helper component proteinase (HCPro) protein of PPV caused a significant enhancement of symptom severity in herbaceous hosts, and notably modified virus infectivity in peach seedlings. The presence of this substitution in certain subisolates that induced mild symptoms in herbaceous hosts and did not infect peach seedlings suggested the existence of uncharacterized attenuating factors in these subisolates. In this study, we show that two amino acid changes in the P1 protein are specifically associated with the mild pathogenicity exhibited by some PS subisolates. Site-directed mutagenesis studies demonstrated that both substitutions, W29R and V139E, but especially W29R, resulted in lower levels of virus accumulation and symptom severity in a woody host, Prunus persica. Furthermore, when W29R and V139E mutations were expressed concomitantly, PPV infectivity was completely abolished in this host. In contrast, the V139E substitution, but not W29R, was found to be responsible for symptom attenuation in herbaceous hosts. Deep sequencing analysis demonstrated that the W29R and V139E heterogeneities already existed in the original PPV-PS isolate before its segregation in different subisolates by local lesion cloning. These results highlight the potential complexity of potyviral populations and the relevance of the P1 protein of potyviruses in pathogenesis and viral adaptation to the host. © 2012 THE AUTHORS. MOLECULAR PLANT PATHOLOGY © 2012 BSPP AND BLACKWELL PUBLISHING LTD.
Characterization of a splicing mutation in group A xeroderma pigmentosum

DOE Office of Scientific and Technical Information (OSTI.GOV)

Satokata, Ichiro; Tanaka, Kiyoji; Miura, Naoyuki

1990-12-01

The molecular basis of group A xeroderma pigmentosum (WP) was investigated by comparison of the nucleotide sequences of multiple clones of the XP group A complementing gene (XPAC) from a patient with group A XP with that of a normal gene. The clones showed a G {r arrow} C substitution at the 3{prime} splice acceptor site of intron 3, which altered the obligatory AG acceptor dinucleotide to AC. Nucleotide sequencing of cDNAs amplified by the polymerase chain reaction revealed that this single base substitution abolishes the canonical 3{prime} splice site, thus creating two abnormally spliced mRNA forms. The larger formmore » is identical with normal mRNA except for a dinucleotide deletion at the 5{prime} end of exon 4. This deletion results in a frameshift with premature translation termination in exon 4. The smaller form has a deletion of the entire exon 3 and the dinucleotide at the 5{prime} end of exon 4. The result of a transfection study provided additional evidence that this single base substitution is the disease-causing mutation. This single base substitution creates a new cleavage site for the restriction nuclease AlwNI. Analysis of AlwNI restriction fragment length polymorphism showed a high frequency of this mutation in Japanese patients with group A XP: 16 of 21 unrelated Japanese patients were homozygous and 4 were heterozygous for this mutation. However, 11 Caucasians and 2 Blacks with group A XP did not have this mutant allele. The polymorphic AlwNI restriction fragments are concluded to be useful for diagnosis of group A XP in Japanese subjects, including prenatal cases and carriers.« less
The sequence of the CA-SP1 junction accounts for the differential sensitivity of HIV-1 and SIV to the small molecule maturation inhibitor 3-O-{3',3'-dimethylsuccinyl}-betulinic acid.

PubMed

Zhou, Jing; Chen, Chin Ho; Aiken, Christopher

2004-06-29

Despite the effectiveness of currently available antiretroviral therapies in the treatment of HIV-1 infection, a continuing need exists for novel compounds that can be used in combination with existing drugs to slow the emergence of drug-resistant viruses. We previously reported that the small molecule 3-O-{3',3'-dimethylsuccinyl}-betulinic acid (DSB) specifically inhibits HIV-1 replication by delaying the processing of the CA-SP1 junction in Pr55Gag. By contrast, SIVmac239 replicates efficiently in the presence of high concentrations of DSB. To determine whether sequence differences in the CA-SP1 junction can fully account for the differential sensitivity of HIV-1 and SIV to DSB, we engineered mutations in this region of two viruses and tested their sensitivity to DSB in replication assays using activated human primary CD4+ T cells. Substitution of the P2 and P1 residues of HIV-1 by the corresponding amino acids of SIV resulted in strong resistance to DSB, but the mutant virus replicated with reduced efficiency. Conversely, replication of an SIV mutant containing three amino acid substitutions in the CA-SP1 cleavage site was highly sensitive to DSB, and the mutations resulted in delayed cleavage of the CA-SP1 junction in the presence of the drug. These results demonstrate that the CA-SP1 junction in Pr55Gag represents the primary viral target of DSB. They further suggest that the therapeutic application of DSB will be accompanied by emergence of mutant viruses that are highly resistant to the drug but which exhibit reduced fitness relative to wild type HIV-1.
The major origin of seedless grapes is associated with a missense mutation in the MADS-box gene VviAGL11.

PubMed

Royo, Carolina; Torres-Pérez, Rafael; Mauri, Nuria; Diestro, Nieves; Cabezas, José Antonio; Marchal, Cécile; Lacombe, Thierry; Ibáñez, Javier; Tornel, Manuel; Carreño, Juan; Martínez-Zapater, José M; Carbonell-Bejerano, Pablo

2018-05-31

Seedlessness is greatly prized by consumers of fresh grapes. While stenospermocarpic seed abortion determined by the SEED DEVELOPMENT INHIBITOR (SDI) locus is the usual source of seedlessness in commercial grapevine (Vitis vinifera) cultivars, the underlying sdi mutation remains unknown. Here, we undertook an integrative approach to identify the causal mutation. Quantitative genetics and fine mapping in two 'Crimson Seedless' (CS)-derived F1 mapping populations confirmed the major effect of the SDI locus and delimited the sdi mutation to a 323-kb region on chromosome 18. RNA-seq comparing seed traces of seedless and seeds of seeded F1 individuals identified processes triggered during sdi-determined seed abortion, including activation of salicylic acid-dependent defenses. The RNA-seq dataset was investigated for candidate genes and, while no evidence for causal cis-acting regulatory mutations was detected, deleterious nucleotide changes in coding sequences of the seedless haplotype were predicted in two genes within the sdi fine mapping interval. Targeted re-sequencing of the two genes in a collection of 124 grapevine cultivars showed that only the point variation causing the Arg197Leu substitution in the seed morphogenesis regulator gene AGAMOUS-LIKE 11 (VviAGL11) was fully linked with stenospermocarpy. The concurrent post-zygotic variation identified for this missense polymorphism and seedlessness phenotype in seeded somatic variants of the original stenospermocarpic cultivar supports a causal effect. We postulate that seed abortion caused by this amino acid substitution in VviAGL11 is the major cause of seedlessness in cultivated grapevine. This information can be exploited to boost seedless grape breeding. {copyright, serif} 2018 American Society of Plant Biologists. All rights reserved.
CD8+ T cell recognition of an endogenously processed epitope is regulated primarily by residues within the epitope

PubMed Central

1992-01-01

Cytotoxic T lymphocytes (CTL) recognize short antigenic peptides associated with cell surface class I major histocompatibility complex (MHC) molecules. This association presumably occurs between newly synthesized class I MHC molecules and peptide fragments in a pre-Golgi compartment. Little is known about the factors that regulate the formation of these antigenic peptide fragments within the cell. To examine the role of residues within a core epitope and in the flanking sequences for the generation and presentation of the newly synthesized peptide fragment recognized by CD8+ CTL, we have mutagenized the coding sequence for the CTL epitope spanning residues 202-221 in the influenza A/Japan/57 hemagglutinin (HA). In this study over 60 substitution mutations in the epitope were tested for their effects on target cell sensitization using a cytoplasmic viral expression system. The HA202- 221 site contains two overlapping subsites defined by CTL clones 11-1 and 40-2. Mutations in HA residues 204-213 or residues 210-219 often abolished target cell lysis by CTL clones 11-1 and 40-2, respectively. Although residues outside the core epitope did not usually affect the ability to be lysed by CTL clones, substitution of a Gly residue for Val-214 abolished lysis by clone 11-1. These data suggest that residues within a site that affect MHC binding and T cell receptor recognition appear to play the predominant role in dictating the formation of the antigenic complex recognized by CD8+ CTL, and therefore the antigenicity of the protein antigen presented to CD8+ T cells. Most alterations in residues flanking the endogenously expressed epitope do not appreciably affect the generation and recognition of the site. PMID:1383384
Cognitive performance in adolescents with Delayed Sleep-Wake Phase Disorder: Treatment effects and a comparison with good sleepers.

PubMed

Richardson, C; Micic, G; Cain, N; Bartel, K; Maddock, B; Gradisar, M

2018-06-01

The present study aimed to investigate whether Australian adolescents with Delayed Sleep-Wake Phase Disorder have impaired cognitive performance and whether chronobiological treatment for Delayed Sleep-Wake Phase Disorder improves adolescents' sleep, daytime functioning and cognitive performance. Adolescents with Delayed Sleep-Wake Phase Disorder (mean = 15.68 ± 2.1 y, 62% f) reported significantly later sleep timing (d = 1.03-1.45), less total sleep time (d = 0.82) and greater daytime sleepiness (d = 2.66), fatigue (d = 0.63) and impairment (d = 2.41), compared to good sleeping adolescents (mean = 15.9 ± 2.4 y, 75% f). However, there were no significant between-group differences (all p > 0.05) in performance on the Operation Span (ηp 2  = 0.043), Digit Span (forwards: ηp 2  = 0.002, backwards: ηp 2  = 0.003), Letter Number Sequencing (ηp 2  < 0.001) (working memory) and Digit-Symbol Substitution Tasks (ηp 2  = 0.010) (processing speed). Adolescents with Delayed Sleep-Wake Phase Disorder went on to receive 3 weeks of light therapy. At 3 months post-treatment, adolescents with Delayed Sleep-Wake Phase Disorder reported significantly advanced sleep timing (d = 0.56-0.65), greater total sleep time (d = 0.52) and improved daytime sleepiness (d = 1.33), fatigue (d = 0.84) and impairment (d = 0.78). Performance on the Operation Span (d = 0.46), Letter Number Sequencing (d = 0.45) and Digit-Symbol Substitution tasks (d = 0.57) also significantly improved. Copyright © 2018. Published by Elsevier Ltd.
Multiple primary tumors of the upper aerodigestive tract: is there a role for constitutional mutations in the p53 gene?

PubMed

Gallo, O; Sardi, I; Pepe, G; Franchi, A; Attanasio, M; Giusti, B; Bocciolini, C; Abbate, R

1999-07-19

Head-and-neck cancer (HNC) patients have a high risk of developing second primary tumors of the upper aerodigestive tract, the main cause of death. Although the roles of tobacco and diet in multiple head-and-neck carcinogenesis have been thoroughly investigated, little is known about individual genetic susceptibility factors involved in this process. Genomic instability, reflecting the propensity and the susceptibility of the genome to acquire multiple alterations, could be considered a driving force behind multiple carcinogenesis. Mutation of the p53 tumor-suppressor gene has been proposed to play an important role in this process. Therefore, we evaluated the incidence of inherited p53 germ-line alteration(s) in a population of 24 consecutive HNC patients and their first-degree relatives affected by multiple malignancies as well as the occurrence of p53 somatic acquired mutation(s) in 16 cancers, including first and second primaries from 5 HNCs of the same group. Mutations in exons 4-11 of the p53 gene were investigated using SSCP-PCR analysis and DNA sequencing. Analysis was extended to the peripheral blood and cancer biopsies available from first-degree relatives of cancer-prone families with p53 germ-line mutations. p53 germ-line mutations were identified in the peripheral blood and corresponding cancers of 3 HNC patients who had multiple malignancies. The only missense mutation detected was mapped in exon 6; it is a GTG to GAG substitution with an amino acid change from Val to Glu at codon 197. The remaining 2 p53 germ-line mutations were single-nucleotide substitutions without amino acid change in exon 6 (codon 213, CGA to CGG) and in exon 8 (codon 295, CCT to CCC), respectively. These mutations were found in HNC patients with a family history of cancer. Abnormal expression of wild-type p53 protein in normal and pathological tissues from patients with the same sense single-nucleotide substitutions was detected by immuno-histochemistry.
How the dual process model of human cognition can inform efforts to de-implement ineffective and harmful clinical practices: A preliminary model of unlearning and substitution.

PubMed

Helfrich, Christian D; Rose, Adam J; Hartmann, Christine W; van Bodegom-Vos, Leti; Graham, Ian D; Wood, Suzanne J; Majerczyk, Barbara R; Good, Chester B; Pogach, Leonard M; Ball, Sherry L; Au, David H; Aron, David C

2018-02-01

One way to understand medical overuse at the clinician level is in terms of clinical decision-making processes that are normally adaptive but become maladaptive. In psychology, dual process models of cognition propose 2 decision-making processes. Reflective cognition is a conscious process of evaluating options based on some combination of utility, risk, capabilities, and/or social influences. Automatic cognition is a largely unconscious process occurring in response to environmental or emotive cues based on previously learned, ingrained heuristics. De-implementation strategies directed at clinicians may be conceptualized as corresponding to cognition: (1) a process of unlearning based on reflective cognition and (2) a process of substitution based on automatic cognition. We define unlearning as a process in which clinicians consciously change their knowledge, beliefs, and intentions about an ineffective practice and alter their behaviour accordingly. Unlearning has been described as "the questioning of established knowledge, habits, beliefs and assumptions as a prerequisite to identifying inappropriate or obsolete knowledge underpinning and/or embedded in existing practices and routines." We hypothesize that as an unintended consequence of unlearning strategies clinicians may experience "reactance," ie, feel their professional prerogative is being violated and, consequently, increase their commitment to the ineffective practice. We define substitution as replacing the ineffective practice with one or more alternatives. A substitute is a specific alternative action or decision that either precludes the ineffective practice or makes it less likely to occur. Both approaches may work independently, eg, a substitute could displace an ineffective practice without changing clinicians' knowledge, and unlearning could occur even if no alternative exists. For some clinical practice, unlearning and substitution strategies may be most effectively used together. By taking into account the dual process model of cognition, we may be able to design de-implementation strategies matched to clinicians' decision-making processes and avoid unintended consequence. © 2018 The Authors. Journal of Evaluation in Clinical Practice published by John Wiley & Sons, Ltd.
Cloning and characterization of the gene encoding IMP dehydrogenase from Arabidopsis thaliana.

PubMed

Collart, F R; Osipiuk, J; Trent, J; Olsen, G J; Huberman, E

1996-10-03

We have cloned and characterized the gene encoding inosine monophosphate dehydrogenase (IMPDH) from Arabidopsis thaliana (At). The transcription unit of the At gene spans approximately 1900 bp and specifies a protein of 503 amino acids with a calculated relative molecular mass (M(r)) of 54,190. The gene is comprised of a minimum of four introns and five exons with all donor and acceptor splice sequences conforming to previously proposed consensus sequences. The deduced IMPDH amino-acid sequence from At shows a remarkable similarity to other eukaryotic IMPDH sequences, with a 48% identity to human Type II enzyme. Allowing for conservative substitutions, the enzyme is 69% similar to human Type II IMPDH. The putative active-site sequence of At IMPDH conforms to the IMP dehydrogenase/guanosine monophosphate reductase motif and contains an essential active-site cysteine residue.
WEB-server for search of a periodicity in amino acid and nucleotide sequences

NASA Astrophysics Data System (ADS)

E Frenkel, F.; Skryabin, K. G.; Korotkov, E. V.

2017-12-01

A new web server (http://victoria.biengi.ac.ru/splinter/login.php) was designed and developed to search for periodicity in nucleotide and amino acid sequences. The web server operation is based upon a new mathematical method of searching for multiple alignments, which is founded on the position weight matrices optimization, as well as on implementation of the two-dimensional dynamic programming. This approach allows the construction of multiple alignments of the indistinctly similar amino acid and nucleotide sequences that accumulated more than 1.5 substitutions per a single amino acid or a nucleotide without performing the sequences paired comparisons. The article examines the principles of the web server operation and two examples of studying amino acid and nucleotide sequences, as well as information that could be obtained using the web server.
High stability of yellow fever 17D-204 vaccine: a 12-year restrospective analysis of large-scale production.

PubMed

Barban, V; Girerd, Y; Aguirre, M; Gulia, S; Pétiard, F; Riou, P; Barrere, B; Lang, J

2007-04-12

We have retrospectively analyzed 12 bulk lots of yellow fever vaccine Stamaril, produced between 1990 and 2002 and prepared from the same seed lot that has been in continuous use since 1990. All vaccine batches displayed identical genome sequence. Only four nucleotide substitutions were observed, compared to previously published sequence, with no incidence at amino-acid level. Fine analysis of viral plaque size distribution was used as an additional marker for genetic stability and demonstrated a remarkable homogeneity of the viral population. The total virus load, measured by qRT-PCR, was also homogeneous pointing out reproducibility of the vaccine production process. Mice inoculated intracerebrally with the different bulks exhibited a similar average survival time, and ratio between in vitro potency and mouse LD(50) titers remained constant from batch-to-batch. Taken together, these data demonstrate the genetic stability of the strain at mass production level over a period of 12 years and reinforce the generally admitted idea of the safety of YF17D-based vaccines.
A versatile transfection assay system to evaluate the biological effects of diverse industrial chemicals.

PubMed

Koizumi, Shinji; Ohno, Shotaro; Otsuka, Fuminori

2012-01-01

Gene expression processes are now recognized as important targets of the toxic effects exerted by industrial chemicals. The transient transfection assay is a powerful tool to evaluate such effects. Thus, we developed a versatile assay system by constructing a basic reporter plasmid in which the regulatory DNA sequence to be studied can easily be substituted. To verify the performance of this system, reporter plasmids carrying any of the three distinct regulatory sequences, estrogen responsive element (ERE), glucocorticoid responsive element (GRE) and xenobiotic responsive element (XRE) were constructed. After transfection of human cells, these plasmids successfully expressed the relevant reporter genes in response to specific inducers, β-estradiol, dexamethasone and 3-methylcholanthrene, respectively. Several industrial chemicals were assayed using these reporter plasmids, and the ability of p-dimethylaminoazobenzene to elevate GRE- and XRE-mediated transcription was detected. α-Naphthylamine and o-tolidine were also observed to increase the XRE-mediated response. The transfection assay system established here will be useful to evaluate the effects of a wide variety of industrial chemicals.
The rate and character of spontaneous mutation in an RNA virus.

PubMed Central

Malpica, José M; Fraile, Aurora; Moreno, Ignacio; Obies, Clara I; Drake, John W; García-Arenal, Fernando

2002-01-01

Estimates of spontaneous mutation rates for RNA viruses are few and uncertain, most notably due to their dependence on tiny mutation reporter sequences that may not well represent the whole genome. We report here an estimate of the spontaneous mutation rate of tobacco mosaic virus using an 804-base cognate mutational target, the viral MP gene that encodes the movement protein (MP). Selection against newly arising mutants was countered by providing MP function from a transgene. The estimated genomic mutation rate was on the lower side of the range previously estimated for lytic animal riboviruses. We also present the first unbiased riboviral mutational spectrum. The proportion of base substitutions is the same as that in a retrovirus but is lower than that in most DNA-based organisms. Although the MP mutant frequency was 0.02-0.05, 35% of the sequenced mutants contained two or more mutations. Therefore, the mutation process in populations of TMV and perhaps of riboviruses generally differs profoundly from that in populations of DNA-based microbes and may be strongly influenced by a subpopulation of mutator polymerases. PMID:12524327
Oligosaccharides from land plants and algae: production and applications in therapeutics and biotechnology.

PubMed

Courtois, Josiane

2009-06-01

Since the past decades, oligosaccharides are considered for their potential biological activities. To exploit them, it was essential to obtain pure molecules in large amounts. Several strategies were developed to produce specific sugar sequences with specific substitution patterns from land plants and algae polysaccharides. Then, pure oligosaccharides were analyzed for their potential biological activities and relations between oligomers structure and function were tackled. First they can be health beneficial molecules when they are added to the diet to enhance the growth of probiotic bacteria, in that case, oligomers that resist to the digestive process are used as specific substrate for the growth of health beneficial bacteria. In other cases, oligomers have to interact with receptors on cells. In this instance, a specific conformation is needed to allow the sugar sequence to establish specific linkages with the receptor. So, to be adapted to the receptor, the oligosaccharides have to present specific groups to the receptor, there, the polymerization degree of oligosaccharides as well as the flexibility of the glycosidic linkages has to be considered.
Investigation of occult hepatitis B virus infection in anti-hbc positive patients from a liver clinic.

PubMed

Martinez, Maria Carmela; Kok, Chee Choy; Baleriola, Cristina; Robertson, Peter; Rawlinson, William D

2015-01-01

Occult hepatitis B infection (OBI) is manifested by presence of very low levels (<200IU/mL) of Hepatitis B viral DNA (HBV DNA) in the blood and the liver while exhibiting undetectable HBV surface antigen (HBsAg). The molecular mechanisms underlying this occurrence are still not completely understood. This study investigated the prevalence of OBI in a high-risk Australian population and compared the HBV S gene sequences of our cohort with reference sequences. Serum from HBV DNA positive, HBsAg negative, and hepatitis B core antibody (anti-HBc) positive patients (study cohort) were obtained from samples tested at SEALS Serology Laboratory using the Abbott Architect, as part of screening and diagnostic testing. From a total of 228,108 samples reviewed, 1,451 patients were tested for all three OBI markers. Only 10 patients (0.69%) out of the 1,451 patients were found to fit the selection criteria for OBI. Sequence analysis of the HBV S gene from 5 suspected OBI infected patients showed increased sequence variability in the 'a' epitope of the major hydrophilic region compared to reference sequences. In addition, a total of eight consistent nucleotide substitutions resulting in seven amino acid changes were observed, and three patients had truncated S gene sequence. These mutations appeared to be stable and may result in alterations in HBsAg conformation. These may negatively impact the affinity of hepatitis B surface antibody (anti-HBs) and may explain the false negative results in serological HBV diagnosis. These changes may also enable the virus to persist in the liver by evading immune surveillance. Further studies on a bigger cohort are required to determine whether these amino acid variations have been acquired in the process of immune escape and serve as markers of OBI.
Mechanism of hydrodenitrogenation on phosphides and sulfides.

PubMed

Oyama, S Ted; Lee, Yong-Kul

2005-02-17

The mechanism of hydrodenitrogenation (HDN) of 2-methylpiperidine was studied over a silica-supported nickel phosphide catalyst (Ni2P/SiO2, Ni/P = 1/2) and a commercial Ni-Mo-S/Al2O3 catalyst in a three-phase trickle-bed reactor operated at 3.1 MPa and 450-600 K. Analysis of the product distribution as a function of contact time indicated that the reaction proceeded in both cases predominantly by a substitution mechanism, with a smaller contribution of an elimination mechanism. Fourier transform infrared spectroscopy (FTIR) of the 2-methylpiperidine indicated that at reaction conditions a piperidinium ion intermediate was formed on both the sulfide and the phosphide. It is concluded that the mechanism of HDN on nickel phosphide is very similar to that on sulfides. The mechanism on the nickel phosphide was also probed by comparing the reactivity of piperidine and several of its derivatives in the presence of 3000 ppm S. The relative elimination rates depended on the structure of the molecules, and followed the sequence: 4-methylpiperidine approximately piperidine > 3-methylpiperidine > 2,6-dimethylpiperidine > 2-methylpiperidine. [Chemical structure: see text] This order of reactivity was not dependent on the number of alpha-H or beta-H atoms in the molecules, ruling out their reaction through a single, simple mechanism. It is likely that the unhindered piperidine molecules reacted by an S(N)2 substitution process and the more hindered 2,6-dimethylpiperidine reacted by an E2 elimination process.
On the Overdispersed Molecular Clock

PubMed Central

Takahata, Naoyuki

1987-01-01

Rates of molecular evolution at some loci are more irregular than described by simple Poisson processes. Three situations under which molecular evolution would not follow simple Poisson processes are reevaluated from the viewpoint of the neutrality hypothesis: (i) concomitant or multiple substitutions in a gene, (ii) fluctuating substitution rates in time caused by coupled effects of deleterious mutations and bottlenecks, and (iii) changes in the degree of selective constraints against a gene (neutral space) caused by successive substitutions. The common underlying assumption that these causes are lineage nonspecific excludes the case where mutation rates themselves change systematically among lineages or taxonomic groups, and severely limits the extent of variation in the number of substitutions among lineages. Even under this stringent condition, however, the third hypothesis, the fluctuating neutral space model, can generate fairly large variation. This is described by a time-dependent renewal process, which does not exhibit any episodic nature of molecular evolution. It is argued that the observed elevated variances in the number of nucleotide or amino acid substitutions do not immediately call for positive Darwinian selection in molecular evolution. PMID:3596230
The complete chloroplast genome sequence of Aster spathulifolius (Asteraceae); genomic features and relationship with Asteraceae.

PubMed

Choi, Kyoung Su; Park, SeonJoo

2015-11-10

Aster spathulifolius, a member of the Asteraceae family, is distributed along the coast of Japan and Korea. This plant is used for medicinal and ornamental purposes. The complete chloroplast (cp) genome of A. sphathulifolius consists of 149,473 bp that include a pair of inverted repeats of 24,751 bp separated by a large single copy region of 81,998 bp and a small single copy region of 17,973 bp. The chloroplast genome contains 78 coding genes, four rRNA genes and 29 tRNA genes. When compared to other cpDNA sequences of Asteraceae, A. spathulifolius showed the closest relationship with Jacobaea vulgaris, and its atpB gene was found to be a pseudogene, unlike J. vulgaris. Furthermore, evaluation of the gene compositions of J. vulgaris, Helianthus annuus, Guizotia abyssinica and A. spathulifolius revealed that 13.6-kb showed inversion from ndhF to rps15, unlike Lactuca of Asteraceae. Comparison of the synonymous (Ks) and nonsynonymous (Ka) substitution rates with J. vulgaris revealed that synonymous genes related to a small subunit of the ribosome showed the highest value (0.1558), while nonsynonymous rates of genes related to ATP synthase genes were highest (0.0118). These findings revealed that substitution has occurred at similar rates in most genes, and the substitution rates suggested that most genes is a purified selection. Copyright © 2015 Elsevier B.V. All rights reserved.

Weak Negative and Positive Selection and the Drift Load at Splice Sites

PubMed Central

Denisov, Stepan V.; Bazykin, Georgii A.; Sutormin, Roman; Favorov, Alexander V.; Mironov, Andrey A.; Gelfand, Mikhail S.; Kondrashov, Alexey S.

2014-01-01

Splice sites (SSs) are short sequences that are crucial for proper mRNA splicing in eukaryotic cells, and therefore can be expected to be shaped by strong selection. Nevertheless, in mammals and in other intron-rich organisms, many of the SSs often involve nonconsensus (Nc), rather than consensus (Cn), nucleotides, and beyond the two critical nucleotides, the SSs are not perfectly conserved between species. Here, we compare the SS sequences between primates, and between Drosophila fruit flies, to reveal the pattern of selection acting at SSs. Cn-to-Nc substitutions are less frequent, and Nc-to-Cn substitutions are more frequent, than neutrally expected, indicating, respectively, negative and positive selection. This selection is relatively weak (1 < |4Nes| < 4), and has a similar efficiency in primates and in Drosophila. Within some nucleotide positions, the positive selection in favor of Nc-to-Cn substitutions is weaker than the negative selection maintaining already established Cn nucleotides; this difference is due to site-specific negative selection favoring current Nc nucleotides. In general, however, the strength of negative selection protecting the Cn alleles is similar in magnitude to the strength of positive selection favoring replacement of Nc alleles, as expected under the simple nearly neutral turnover. In summary, although a fraction of the Nc nucleotides within SSs is maintained by selection, the abundance of deleterious nucleotides in this class suggests a substantial genome-wide drift load. PMID:24966225
A high proportion of ADA point mutations associated with a specific alanine-to-valine substitution.

PubMed

Markert, M L; Norby-Slycord, C; Ward, F E

1989-09-01

In 15%-20% of children with severe combined immunodeficiency (SCID), the underlying defect is adenosine deaminase (ADA) deficiency. The overall goal of our research has been to identify the precise molecular defects in patients with ADA-deficient SCID. In this study, we focused on a patient whom we found to have normal sized ADA mRNA by Northern analysis and an intact ADA structural gene by Southern analysis. By cloning and sequencing this patient's ADA cDNA, we found a C-to-T point mutation in exon 11. This resulted in the amino acid substitution of a valine for an alanine at position 329 of the ADA protein. Sequence analysis revealed that this mutation created a new BalI restriction site. Using Southern analyses, we were able to directly screen individuals to determine the frequency of this mutation. By combining data on eight families followed at our institution with data on five other families reported in the literature, we established that five of 13 patients (seven of 22 alleles) with known or suspected point mutations have this defect. This mutation was found to be associated with three different ADA haplotypes. This argues against a founder effect and suggests that the mutation is very old. In summary, a conservative amino acid substitution is found in a high proportion of patients with ADA deficiency; this can easily be detected by Southern analysis.
Identification of an alternative knockdown resistance (kdr)-like mutation, M918L, and a novel mutation, V1010A, in the Thrips tabaci voltage-gated sodium channel gene.

PubMed

Wu, Meixiang; Gotoh, Hiroki; Waters, Timothy; Walsh, Douglas B; Lavine, Laura Corley

2014-06-01

Knockdown resistance (kdr) has been identified as a main mechanism against pyrethroid insecticides in many arthropod pests including in the onion thrips, Thrips tabaci. To characterize and identify pyrethroid-resistance in onion thrips in Washington state, we conducted insecticide bioassays and sequenced a region of the voltage gated sodium channel gene from several different T. tabaci populations. Field collected Thrips tabaci were found to have large variations in resistance to the pyrethroid insecticide lambda-cyhalothrin. We identified two single nucleotide substitutions in our analysis of a partial sequence of the T. tabaci voltage-gated sodium channel gene. One mutation resulted in the non-synonymous substitution of methionine with leucine (M918L), which is well known to be responsible for super knockdown resistance in some pest species. Another non-synonymous substitution, a valine (GTT) to alanine (GCT) replacement at amino acid 1010 (V1010A) was identified in our study and was associated with lambda-cyhalothrin resistance. We have characterized a known kdr mutation and identified a novel mutation in the voltage-gated sodium channel gene of Thrips tabaci associated with resistance to lambda-cyhalothrin. This gene region and these mutations are expected to be useful in the development of a diagnostic test to detect kdr resistance in many onion thrips populations. © 2013 Society of Chemical Industry.
Exploration of the effect of sequence variations located inside the binding pocket of HIV-1 and HIV-2 proteases.

PubMed

Triki, Dhoha; Billot, Telli; Visseaux, Benoit; Descamps, Diane; Flatters, Delphine; Camproux, Anne-Claude; Regad, Leslie

2018-04-10

HIV-2 protease (PR2) is naturally resistant to most FDA (Food and Drug Administration)-approved HIV-1 protease inhibitors (PIs), a major antiretroviral class. In this study, we compared the PR1 and PR2 binding pockets extracted from structures complexed with 12 ligands. The comparison of PR1 and PR2 pocket properties showed that bound PR2 pockets were more hydrophobic with more oxygen atoms and fewer nitrogen atoms than PR1 pockets. The structural comparison of PR1 and PR2 pockets highlighted structural changes induced by their sequence variations and that were consistent with these property changes. Specifically, substitutions at residues 31, 46, and 82 induced structural changes in their main-chain atoms that could affect PI binding in PR2. In addition, the modelling of PR1 mutant structures containing V32I and L76M substitutions revealed a cooperative mechanism leading to structural deformation of flap-residue 45 that could modify PR2 flexibility. Our results suggest that substitutions in the PR1 and PR2 pockets can modify PI binding and flap flexibility, which could underlie PR2 resistance against PIs. These results provide new insights concerning the structural changes induced by PR1 and PR2 pocket variation changes, improving the understanding of the atomic mechanism of PR2 resistance to PIs.
Amino acid substitutions in the VanS sensor of the VanA-type vancomycin-resistant Enterococcus strains result in high-level vancomycin resistance and low-level teicoplanin resistance.

PubMed

Hashimoto, Y; Tanimoto, K; Ozawa, Y; Murata, T; Ike, Y

2000-04-15

The vancomycin-resistant enterococci GV1, GV2 and GV3, which were isolated from droppings from broiler farms in Japan have been characterized as VanA-type VRE, which express high-level vancomycin resistance (256 or 512 microg ml(-1), MIC) and low-level teicoplanin resistance (1 or 2 microg ml(-1), MIC). The vancomycin resistances were encoded on plasmids. The vancomycin resistance conjugative plasmid pMG2 was isolated from the GV2 strain. The VanA determinant of pMG2 showed the same genetic organization as that of the VanA genes encoded on the representative transposon Tn1546, which comprises vanRSHAXYZ. The nucleotide sequences of all the genes, except the gene related to the vanS gene on Tn1546, were completely identical to the genes encoded on Tn1546. Three amino acid substitutions in the N-terminal region of the deduced VanS were detected in the nucleotide sequence of vanS encoded on pMG2. There were also three amino acid substitutions in the vanS gene of the GV1 and GV3 strains in the same positions as in the vanS gene of pMG2. Vancomycin induced the increased teicoplanin resistance in these strains.
A single amino acid substitution in the Bombyx-specific mucin-like membrane protein causes resistance to Bombyx mori densovirus.

PubMed

Ito, Katsuhiko; Kidokoro, Kurako; Katsuma, Susumu; Sezutsu, Hideki; Uchino, Keiro; Kobayashi, Isao; Tamura, Toshiki; Yamamoto, Kimiko; Mita, Kazuei; Shimada, Toru; Kadono-Okuda, Keiko

2018-05-09

Bombyx mori densovirus type 1 (BmDV) is a pathogen that causes flacherie disease in the silkworm. The absolute nonsusceptibility to BmDV among certain silkworm strains is determined independently by two genes, nsd-1 and Nid-1. However, neither of these genes has been molecularly identified to date. Here, we isolated the nsd-1 gene by positional cloning and characterized the properties of its product, NSD-1. Sequence and biochemical analyses revealed that this gene encodes a Bombyx-specific mucin-like glycoprotein with a single transmembrane domain. The NSD-1 protein was specifically expressed in the larval midgut epithelium, the known infection site of BmDV. Sequence analysis of the nsd-1 gene from 13 resistant and 12 susceptible strains suggested that a specific arginine residue in the extracellular tail of the NSD-1 protein was common among susceptible strains. Germline transformation of the susceptible-type nsd-1 (with a single nucleotide substitution) conferred partial susceptibility to resistant larvae, indicating that the + nsd-1 gene is required for the susceptibility of B. mori larvae to BmDV and the susceptibility is solely a result of the substitution of a single amino acid with arginine. Taken together, our results provide striking evidence that a novel membrane-bound mucin-like protein functions as a cell-surface receptor for a densovirus.
Processing waste fats into a fuel oil substitute

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pudel, F.; Lengenfeld, P.

1993-12-31

Waste fats have a high energy potential. They also contain impurities. For example, fats used for deep-frying contain high contents of solids, water, and chlorides. The process described in this paper removes the impurities by simple processing such as screening, washing, separating, drying, and filtering. The final quality of processed fat allows its use as a fuel oil substitute, and also as a raw material for chemical production.
The Teaching of Protein Synthesis--A Microcomputer Based Method.

ERIC Educational Resources Information Center

Goodridge, Frank

1983-01-01

Describes two computer programs (BASIC for 32K Commodore PET) for teaching protein synthesis. The first is an interactive test of base-pairing knowledge, and the second generates random DNA nucleotide sequences, with instructions for substitution, insertion, and deletion printed out for each student. (JN)
Antiviral Activity and Resistance Analysis of NS3/4A Protease Inhibitor Grazoprevir and NS5A Inhibitor Elbasvir in Hepatitis C Virus GT4 Replicons.

PubMed

Asante-Appiah, Ernest; Curry, Stephanie; McMonagle, Patricia; Ingravallo, Paul; Chase, Robert; Nickle, David; Qiu, Ping; Howe, Anita; Lahser, Frederick C

2017-07-01

Although genotype 4 (GT4)-infected patients represent a minor overall percentage of the global hepatitis C virus (HCV)-infected population, the high prevalence of the genotype in specific geographic regions coupled with substantial sequence diversity makes it an important genotype to study for antiviral drug discovery and development. We evaluated two direct-acting antiviral agents-grazoprevir, an HCV NS3/4A protease inhibitor, and elbasvir, an HCV NS5A inhibitor-in GT4 replicons prior to clinical studies in this genotype. Following a bioinformatics analysis of available GT4 sequences, a set of replicons bearing representative GT4 clinical isolates was generated. For grazoprevir, the 50% effective concentration (EC 50 ) against the replicon bearing the reference GT4a (ED43) NS3 protease and NS4A was 0.7 nM. The median EC 50 for grazoprevir against chimeric replicons encoding NS3/4A sequences from GT4 clinical isolates was 0.2 nM (range, 0.11 to 0.33 nM; n = 5). The difficulty in establishing replicons bearing NS3/4A resistance-associated substitutions was substantially overcome with the identification of a G162R adaptive substitution in NS3. Single NS3 substitutions D168A/V identified from de novo resistance selection studies reduced grazoprevir antiviral activity by 137- and 47-fold, respectively, in the background of the G162R replicon. For elbasvir, the EC 50 against the replicon bearing the reference full-length GT4a (ED43) NS5A gene was 0.0002 nM. The median EC 50 for elbasvir against chimeric replicons bearing clinical isolates from GT4 was 0.0007 nM (range, 0.0002 to 34 nM; n = 14). De novo resistance selection studies in GT4 demonstrated a high propensity to suppress the emergence of amino acid substitutions that confer high-potency reductions to elbasvir. Phenotypic characterization of the NS5A amino acid substitutions identified (L30F, L30S, M31V, and Y93H) indicated that they conferred 15-, 4-, 2.5-, and 7.5-fold potency losses, respectively, to elbasvir. The activity profiles of grazoprevir and elbasvir supported the testing of the direct-acting antivirals in clinical studies. Copyright © 2017 American Society for Microbiology.
Feature-based classification of amino acid substitutions outside conserved functional protein domains.

PubMed

Gemovic, Branislava; Perovic, Vladimir; Glisic, Sanja; Veljkovic, Nevena

2013-01-01

There are more than 500 amino acid substitutions in each human genome, and bioinformatics tools irreplaceably contribute to determination of their functional effects. We have developed feature-based algorithm for the detection of mutations outside conserved functional domains (CFDs) and compared its classification efficacy with the most commonly used phylogeny-based tools, PolyPhen-2 and SIFT. The new algorithm is based on the informational spectrum method (ISM), a feature-based technique, and statistical analysis. Our dataset contained neutral polymorphisms and mutations associated with myeloid malignancies from epigenetic regulators ASXL1, DNMT3A, EZH2, and TET2. PolyPhen-2 and SIFT had significantly lower accuracies in predicting the effects of amino acid substitutions outside CFDs than expected, with especially low sensitivity. On the other hand, only ISM algorithm showed statistically significant classification of these sequences. It outperformed PolyPhen-2 and SIFT by 15% and 13%, respectively. These results suggest that feature-based methods, like ISM, are more suitable for the classification of amino acid substitutions outside CFDs than phylogeny-based tools.
The current status of REH theory. [Random Evolutionary Hits in biological molecular evolution

NASA Technical Reports Server (NTRS)

Holmquist, R.; Jukes, T. H.

1981-01-01

A response is made to the evaluation of Fitch (1980) of REH (random evolutionary hits) theory for the evolutionary divergence of proteins and nucleic acids. Correct calculations for the beta hemoglobin mRNAs of the human, mouse and rabbit in the absence and presence of selective constraints are summarized, and it is shown that the alternative evolutionary analysis of Fitch underestimates the total fixed mutations. It is further shown that the model used by Fitch to test for the completeness of the count of total base substitutions is in fact a variant of REH theory. Considerations of the variance inherent in evolutionary estimations are also presented which show the REH model to produce no more variance than other evolutionary models. In the reply, it is argued that, despite the objections raised, REH theory applied to proteins gives inaccurate estimates of total gene substitutions. It is further contended that REH theory developed for nucleic sequences suffers from problems relating to the frequency of nucleotide substitutions, the identity of the codons accepting silent and amino acid-changing substitutions, and estimate uncertainties.
Phylogenetic incongruence and the evolutionary origins of cardenolide-resistant forms of Na+,K+-ATPase in Danaus butterflies

PubMed Central

Aardema, Matthew L.; Andolfatto, Peter

2016-01-01

Many distantly-related insect species are specialized feeders of cardenolide-containing host plants such as milkweed (Asclepias spp.). Studies have revealed frequent, parallel substitution of a functionally important amino acid substitution (N122H) in the alpha subunit of Na+,K+-ATPase (N122H) in many of these species. This substitution facilitates the ability of these insects to feed on their toxic hosts. Among milkweed butterflies of the genus Danaus, the previously established phylogeny for this group suggests that N122H arose independently and fixed in two distinct lineages. We re-evaluate this conclusion by examining Danaus phylogenetic relationships using >400 orthologous gene sequences assembled from transcriptome data. Our results indicate that the three Danaus species known to harbor the N122H substitution are more closely related than previously thought, consistent with a single, common origin for N122H. However, we also find evidence of both incomplete lineage sorting and post-speciation genetic exchange among these butterfly species, raising the possibility of collateral evolution of cardenolide-insensitivity in this species group. PMID:27405795
Design and synthesis of some new pyrazolyl-pyrazolines as potential anti-inflammatory, analgesic and antibacterial agents.

PubMed

Viveka, Shivapura; Dinesha; Shama, Prasanna; Nagaraja, Gundibasappa Karikannar; Ballav, Shuvankar; Kerkar, Savita

2015-08-28

In the present study, an efficient synthesis of some new substituted pyrazoline derivatives linked to a substituted pyrazole scaffold was performed by a multistep reaction sequences and compounds were screened for their anti-inflammatory, analgesic and antibacterial activities. The preliminary results revealed that the N-acylated (5e, 5h) and nitro substituted N-phenyl (6f) pyrazolyl-pyrazolines derivatives exhibited a very promising anti-inflammatory activity whereas 5h, 6f were interesting analgesic agents. The compounds with halo substituted phenyl group at C-3 of the pyrazoline ring (4a, 5g, 5h, 6a and 6b) were found to be active against clinical bacterial pathogens with MIC in the range of 0.2-0.4 mg/mL. Compound containing N-propionyl pyrazolyl-pyrazoline (5h) could be identified as the most active member within this study with a dual anti-inflammatory and antibacterial profile. Taken together, this study has led to the development of promising compounds. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
Mapping a nucleolar targeting sequence of an RNA binding nucleolar protein, Nop25

DOE Office of Scientific and Technical Information (OSTI.GOV)

Fujiwara, Takashi; Suzuki, Shunji; Kanno, Motoko

2006-06-10

Nop25 is a putative RNA binding nucleolar protein associated with rRNA transcription. The present study was undertaken to determine the mechanism of Nop25 localization in the nucleolus. Deletion experiments of Nop25 amino acid sequence showed Nop25 to contain a nuclear targeting sequence in the N-terminal and a nucleolar targeting sequence in the C-terminal. By expressing derivative peptides from the C-terminal as GFP-fusion proteins in the cells, a lysine and arginine residue-enriched peptide (KRKHPRRAQDSTKKPPSATRTSKTQRRRR) allowed a GFP-fusion protein to be transported and fully retained in the nucleolus. When the peptide was fused with cMyc epitope and expressed in the cells, amore » cMyc epitope was then detected in the nucleolus. Nop25 did not localize in the nucleolus by deletion of the peptide from Nop25. Furthermore, deletion of a subdomain (KRKHPRRAQ) in the peptide or amino acid substitution of lysine and arginine residues in the subdomain resulted in the loss of Nop25 nucleolar localization. These results suggest that the lysine and arginine residue-enriched peptide is the most prominent nucleolar targeting sequence of Nop25 and that the long stretch of basic residues might play an important role in the nucleolar localization of Nop25. Although Nop25 contained putative SUMOylation, phosphorylation and glycosylation sites, the amino acid substitution in these sites had no effect on the nucleolar localization, thus suggesting that these post-translational modifications did not contribute to the localization of Nop25 in the nucleolus. The treatment of the cells, which expressed a GFP-fusion protein with a nucleolar targeting sequence of Nop25, with RNase A resulted in a complete dislocation of the protein from the nucleolus. These data suggested that the nucleolar targeting sequence might therefore play an important role in the binding of Nop25 to RNA molecules and that the RNA binding of Nop25 might be essential for the nucleolar localization of Nop25.« less
Heuristics for multiobjective multiple sequence alignment.

PubMed

Abbasi, Maryam; Paquete, Luís; Pereira, Francisco B

2016-07-15

Aligning multiple sequences arises in many tasks in Bioinformatics. However, the alignments produced by the current software packages are highly dependent on the parameters setting, such as the relative importance of opening gaps with respect to the increase of similarity. Choosing only one parameter setting may provide an undesirable bias in further steps of the analysis and give too simplistic interpretations. In this work, we reformulate multiple sequence alignment from a multiobjective point of view. The goal is to generate several sequence alignments that represent a trade-off between maximizing the substitution score and minimizing the number of indels/gaps in the sum-of-pairs score function. This trade-off gives to the practitioner further information about the similarity of the sequences, from which she could analyse and choose the most plausible alignment. We introduce several heuristic approaches, based on local search procedures, that compute a set of sequence alignments, which are representative of the trade-off between the two objectives (substitution score and indels). Several algorithm design options are discussed and analysed, with particular emphasis on the influence of the starting alignment and neighborhood search definitions on the overall performance. A perturbation technique is proposed to improve the local search, which provides a wide range of high-quality alignments. The proposed approach is tested experimentally on a wide range of instances. We performed several experiments with sequences obtained from the benchmark database BAliBASE 3.0. To evaluate the quality of the results, we calculate the hypervolume indicator of the set of score vectors returned by the algorithms. The results obtained allow us to identify reasonably good choices of parameters for our approach. Further, we compared our method in terms of correctly aligned pairs ratio and columns correctly aligned ratio with respect to reference alignments. Experimental results show that our approaches can obtain better results than TCoffee and Clustal Omega in terms of the first ratio.
Regulation of the Production of Infectious Genotype 1a Hepatitis C Virus by NS5A Domain III▿

PubMed Central

Kim, Seungtaek; Welsch, Christoph; Yi, MinKyung; Lemon, Stanley M.

2011-01-01

Although hepatitis C virus (HCV) assembly remains incompletely understood, recent studies with the genotype 2a JFH-1 strain suggest that it is dependent upon the phosphorylation of Ser residues near the C terminus of NS5A, a multifunctional nonstructural protein. Since genotype 1 viruses account for most HCV disease yet differ substantially in sequence from that of JFH-1, we studied the role of NS5A in the production of the H77S virus. While less efficient than JFH-1, genotype 1a H77S RNA produces infectious virus when transfected into permissive Huh-7 cells. The exchange of complete NS5A sequences between these viruses was highly detrimental to replication, while exchanges of the C-terminal domain III sequence (46% amino acid sequence identity) were well tolerated, with little effect on RNA synthesis. Surprisingly, the placement of the H77S domain III sequence into JFH-1 resulted in increased virus yields; conversely, H77S yields were reduced by the introduction of domain III from JFH-1. These changes in infectious virus yield correlated well with changes in the abundance of NS5A in RNA-transfected cells but not with RNA replication or core protein expression levels. Alanine replacement mutagenesis of selected Ser and Thr residues in the C-terminal domain III sequence revealed no single residue to be essential for infectious H77S virus production. However, virus production was eliminated by Ala substitutions at multiple residues and could be restored by phosphomimetic Asp substitutions at these sites. Thus, despite low overall sequence homology, the production of infectious virus is regulated similarly in JFH-1 and H77S viruses by a conserved function associated with a C-terminal Ser/Thr cluster in domain III of NS5A. PMID:21525356
A new mechanism to render clinical isolates of Escherichia coli non-susceptible to imipenem: substitutions in the PBP2 penicillin-binding domain.

PubMed

Aissa, Nejla; Mayer, Noémie; Bert, Fréderic; Labia, Roger; Lozniewski, Alain; Nicolas-Chanoine, Marie-Hélène

2016-01-01

So far, two types of mechanism are known to be involved in carbapenem non-susceptibility of Escherichia coli clinical isolates: reduced outer membrane permeability associated with production of ESBLs and/or overproduction of class C β-lactamases; and production of carbapenemases. Non-susceptibility to only imipenem observed in two clinical isolates suggested a new mechanism, described in the present study. The ST was determined for the two isolates of E. coli (strains LSNy and VSBj), and their chromosomal region encoding the penicillin-binding domain of PBP2 was amplified, sequenced and then used for recombination experiments in E. coli K12 C600. Antibiotic MICs were determined using the Etest method. Strains LSNy and VSBj, which displayed ST23 and ST345, respectively, showed amino acid substitutions in their PBP2 penicillin-binding domain. Substitution Ala388Ser located in motif 2 (SXD) was common to the two strains. Two additional substitutions (Ala488Thr and Leu573Val) located outside the two other motifs were identified in strain LSNy, whereas another one (Thr331Pro) located in motif 1 was identified in strain VSBj. Recombination experiments to reproduce non-susceptibility to imipenem in E. coli K12 C600 were not successful when only the common substitution was transferred, whereas recombination with DNA fragments including either the three substitutions (strain LSNy) or the two substitutions (strain VSBj) were successful. Substitution of amino acids in the penicillin-binding domain of PBP2 is a new mechanism by which E. coli clinical isolates specifically resist imipenem. © The Author 2015. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
12 CFR Appendix C to Part 229 - Model Availability Policy Disclosures, Clauses, and Notices; Model Substitute Check Policy...

Code of Federal Regulations, 2012 CFR

2012-01-01

... in different states or check processing regions)]. If you make the deposit in person to one of our... processing regions)]. If you make the deposit in person to one of our employees, funds from the following... Your Rights What Is a Substitute Check? To make check processing faster, federal law permits banks to...
12 CFR Appendix C to Part 229 - Model Availability Policy Disclosures, Clauses, and Notices; Model Substitute Check Policy...

Code of Federal Regulations, 2011 CFR

2011-01-01

... in different states or check processing regions)]. If you make the deposit in person to one of our... processing regions)]. If you make the deposit in person to one of our employees, funds from the following... Your Rights What Is a Substitute Check? To make check processing faster, federal law permits banks to...
12 CFR Appendix C to Part 229 - Model Availability Policy Disclosures, Clauses, and Notices; Model Substitute Check Policy...

Code of Federal Regulations, 2013 CFR

2013-01-01

... in different states or check processing regions)]. If you make the deposit in person to one of our... processing regions)]. If you make the deposit in person to one of our employees, funds from the following... Your Rights What Is a Substitute Check? To make check processing faster, federal law permits banks to...

Some links on this page may take you to non-federal websites. Their policies may differ from this site.