acid sequence designated: Topics by Science.gov

Sample records for acid sequence designated

EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2014-02-25

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-05-16

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2008-04-01

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2010-10-12

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVIII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-05-23

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl8, and the corresponding EGVIII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVIII, recombinant EGVIII proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2010-10-05

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-06-06

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2009-05-05

The present invention provides an endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2013-07-16

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2012-02-14

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2015-04-14

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
BGL7 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2013-01-29

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL6 .beta.-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2012-10-02

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL5 .beta.-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-02-28

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
BGL5 .beta.-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2008-03-18

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same

DOE Office of Scientific and Technical Information (OSTI.GOV)

Dunn-Coleman, Nigel; Ward, Michael

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2014-03-04

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL7 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2015-04-14

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL7 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2014-03-25

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2015-08-11

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.

BGL3 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2007-09-25

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2008-04-01

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL4 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2011-12-06

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
BGL4 .beta.-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-05-16

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2011-06-14

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Ward, Michael [San Francisco, CA

2009-09-01

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2012-10-30

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL4 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2008-01-22

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
Trichoderma .beta.-glucosidase

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-01-03

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
.beta.-glucosidase 5 (BGL5) compositions

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2010-06-01

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
Two-level QSAR network (2L-QSAR) for peptide inhibitor design based on amino acid properties and sequence positions.

PubMed

Du, Q S; Ma, Y; Xie, N Z; Huang, R B

2014-01-01

In the design of peptide inhibitors the huge possible variety of the peptide sequences is of high concern. In collaboration with the fast accumulation of the peptide experimental data and database, a statistical method is suggested for peptide inhibitor design. In the two-level peptide prediction network (2L-QSAR) one level is the physicochemical properties of amino acids and the other level is the peptide sequence position. The activity contributions of amino acids are the functions of physicochemical properties and the sequence positions. In the prediction equation two weight coefficient sets {ak} and {bl} are assigned to the physicochemical properties and to the sequence positions, respectively. After the two coefficient sets are optimized based on the experimental data of known peptide inhibitors using the iterative double least square (IDLS) procedure, the coefficients are used to evaluate the bioactivities of new designed peptide inhibitors. The two-level prediction network can be applied to the peptide inhibitor design that may aim for different target proteins, or different positions of a protein. A notable advantage of the two-level statistical algorithm is that there is no need for host protein structural information. It may also provide useful insight into the amino acid properties and the roles of sequence positions.
Cell culture compositions

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yiao, Jian

2014-03-18

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6 (SEQ ID NO:1 encodes the full length endoglucanase; SEQ ID NO:4 encodes the mature form), and the corresponding endoglucanase VI amino acid sequence ("EGVI"; SEQ ID NO:3 is the signal sequence; SEQ ID NO:2 is the mature sequence). The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
Quantitative thermodynamic predication of interactions between nucleic acid and non-nucleic acid species using Microsoft excel.

PubMed

Zou, Jiaqi; Li, Na

2013-09-01

Proper design of nucleic acid sequences is crucial for many applications. We have previously established a thermodynamics-based quantitative model to help design aptamer-based nucleic acid probes by predicting equilibrium concentrations of all interacting species. To facilitate customization of this thermodynamic model for different applications, here we present a generic and easy-to-use platform to implement the algorithm of the model with Microsoft(®) Excel formulas and VBA (Visual Basic for Applications) macros. Two Excel spreadsheets have been developed: one for the applications involving only nucleic acid species, the other for the applications involving both nucleic acid and non-nucleic acid species. The spreadsheets take the nucleic acid sequences and the initial concentrations of all species as input, guide the user to retrieve the necessary thermodynamic constants, and finally calculate equilibrium concentrations for all species in various bound and unbound conformations. The validity of both spreadsheets has been verified by comparing the modeling results with the experimental results on nucleic acid sequences reported in the literature. This Excel-based platform described here will allow biomedical researchers to rationalize the sequence design of nucleic acid probes using the thermodynamics-based modeling even without relevant theoretical and computational skills. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Design of nucleic acid strands with long low-barrier folding pathways.

PubMed

Condon, Anne; Kirkpatrick, Bonnie; Maňuch, Ján

2017-01-01

A major goal of natural computing is to design biomolecules, such as nucleic acid sequences, that can be used to perform computations. We design sequences of nucleic acids that are "guaranteed" to have long folding pathways relative to their length. This particular sequences with high probability follow low-barrier folding pathways that visit a large number of distinct structures. Long folding pathways are interesting, because they demonstrate that natural computing can potentially support long and complex computations. Formally, we provide the first scalable designs of molecules whose low-barrier folding pathways, with respect to a simple, stacked pair energy model, grow superlinearly with the molecule length, but for which all significantly shorter alternative folding pathways have an energy barrier that is [Formula: see text] times that of the low-barrier pathway for any [Formula: see text] and a sufficiently long sequence.
CODEHOP (COnsensus-DEgenerate Hybrid Oligonucleotide Primer) PCR primer design

PubMed Central

Rose, Timothy M.; Henikoff, Jorja G.; Henikoff, Steven

2003-01-01

We have developed a new primer design strategy for PCR amplification of distantly related gene sequences based on consensus-degenerate hybrid oligonucleotide primers (CODEHOPs). An interactive program has been written to design CODEHOP PCR primers from conserved blocks of amino acids within multiply-aligned protein sequences. Each CODEHOP consists of a pool of related primers containing all possible nucleotide sequences encoding 3–4 highly conserved amino acids within a 3′ degenerate core. A longer 5′ non-degenerate clamp region contains the most probable nucleotide predicted for each flanking codon. CODEHOPs are used in PCR amplification to isolate distantly related sequences encoding the conserved amino acid sequence. The primer design software and the CODEHOP PCR strategy have been utilized for the identification and characterization of new gene orthologs and paralogs in different plant, animal and bacterial species. In addition, this approach has been successful in identifying new pathogen species. The CODEHOP designer (http://blocks.fhcrc.org/codehop.html) is linked to BlockMaker and the Multiple Alignment Processor within the Blocks Database World Wide Web (http://blocks.fhcrc.org). PMID:12824413
Pyrin gene and mutants thereof, which cause familial Mediterranean fever

DOEpatents

Kastner, Daniel L [Bethesda, MD; Aksentijevichh, Ivona [Bethesda, MD; Centola, Michael [Tacoma Park, MD; Deng, Zuoming [Gaithersburg, MD; Sood, Ramen [Rockville, MD; Collins, Francis S [Rockville, MD; Blake, Trevor [Laytonsville, MD; Liu, P Paul [Ellicott City, MD; Fischel-Ghodsian, Nathan [Los Angeles, CA; Gumucio, Deborah L [Ann Arbor, MI; Richards, Robert I [North Adelaide, AU; Ricke, Darrell O [San Diego, CA; Doggett, Norman A [Santa Cruz, NM; Pras, Mordechai [Tel-Hashomer, IL

2003-09-30

The invention provides the nucleic acid sequence encoding the protein associated with familial Mediterranean fever (FMF). The cDNA sequence is designated as MEFV. The invention is also directed towards fragments of the DNA sequence, as well as the corresponding sequence for the RNA transcript and fragments thereof. Another aspect of the invention provides the amino acid sequence for a protein (pyrin) associated with FMF. The invention is directed towards both the full length amino acid sequence, fusion proteins containing the amino acid sequence and fragments thereof. The invention is also directed towards mutants of the nucleic acid and amino acid sequences associated with FMF. In particular, the invention discloses three missense mutations, clustered in within about 40 to 50 amino acids, in the highly conserved rfp (B30.2) domain at the C-terminal of the protein. These mutants include M6801, M694V, K695R, and V726A. Additionally, the invention includes methods for diagnosing a patient at risk for having FMF and kits therefor.
Automated design evolution of stereochemically randomized protein foldamers

NASA Astrophysics Data System (ADS)

Ranbhor, Ranjit; Kumar, Anil; Patel, Kirti; Ramakrishnan, Vibin; Durani, Susheel

2018-05-01

Diversification of chain stereochemistry opens up the possibilities of an ‘in principle’ increase in the design space of proteins. This huge increase in the sequence and consequent structural variation is aimed at the generation of smart materials. To diversify protein structure stereochemically, we introduced L- and D-α-amino acids as the design alphabet. With a sequence design algorithm, we explored the usage of specific variables such as chirality and the sequence of this alphabet in independent steps. With molecular dynamics, we folded stereochemically diverse homopolypeptides and evaluated their ‘fitness’ for possible design as protein-like foldamers. We propose a fitness function to prune the most optimal fold among 1000 structures simulated with an automated repetitive simulated annealing molecular dynamics (AR-SAMD) approach. The highly scored poly-leucine fold with sequence lengths of 24 and 30 amino acids were later sequence-optimized using a Dead End Elimination cum Monte Carlo based optimization tool. This paper demonstrates a novel approach for the de novo design of protein-like foldamers.
Automated design of degenerate codon libraries.

PubMed

Mena, Marco A; Daugherty, Patrick S

2005-12-01

Degenerate codon libraries are frequently used in protein engineering and evolution studies but are often limited to targeting a small number of positions to adequately limit the search space. To mitigate this, codon degeneracy can be limited using heuristics or previous knowledge of the targeted positions. To automate design of libraries given a set of amino acid sequences, an algorithm (LibDesign) was developed that generates a set of possible degenerate codon libraries, their resulting size, and their score relative to a user-defined scoring function. A gene library of a specified size can then be constructed that is representative of the given amino acid distribution or that includes specific sequences or combinations thereof. LibDesign provides a new tool for automated design of high-quality protein libraries that more effectively harness existing sequence-structure information derived from multiple sequence alignment or computational protein design data.
Using a color-coded ambigraphic nucleic acid notation to visualize conserved palindromic motifs within and across genomes

PubMed Central

2014-01-01

Background Ambiscript is a graphically-designed nucleic acid notation that uses symbol symmetries to support sequence complementation, highlight biologically-relevant palindromes, and facilitate the analysis of consensus sequences. Although the original Ambiscript notation was designed to easily represent consensus sequences for multiple sequence alignments, the notation’s black-on-white ambiguity characters are unable to reflect the statistical distribution of nucleotides found at each position. We now propose a color-augmented ambigraphic notation to encode the frequency of positional polymorphisms in these consensus sequences. Results We have implemented this color-coding approach by creating an Adobe Flash® application ( http://www.ambiscript.org) that shades and colors modified Ambiscript characters according to the prevalence of the encoded nucleotide at each position in the alignment. The resulting graphic helps viewers perceive biologically-relevant patterns in multiple sequence alignments by uniquely combining color, shading, and character symmetries to highlight palindromes and inverted repeats in conserved DNA motifs. Conclusion Juxtaposing an intuitive color scheme over the deliberate character symmetries of an ambigraphic nucleic acid notation yields a highly-functional nucleic acid notation that maximizes information content and successfully embodies key principles of graphic excellence put forth by the statistician and graphic design theorist, Edward Tufte. PMID:24447494
Molecular and Cellular Mechanisms for the Interaction between Gold Nanoparticles and Neuroimmune Cells Based on Size, Shape, and Charge

DTIC Science & Technology

2014-04-25

IgG secretion. 2.3 Designing of Synthetic peptide The immunogenic peptides against the foot and mouth disease virus ( FMDV ) were designed and...synthesized based on viral protein 1 of type O FMDV . The amino acid sequence for pFMDV is NGSSKYGDTSTNNVRGDLQVLAQKAERTLC. An extra cysteine was added...peptides were synthesized based on the amino acid sequence of the VP1 coat protein of the FMDV (table 1). The peptide pFMDVD (19 amino acids in length

DNA tetrominoes: the construction of DNA nanostructures using self-organised heterogeneous deoxyribonucleic acids shapes.

PubMed

Ong, Hui San; Rahim, Mohd Syafiq; Firdaus-Raih, Mohd; Ramlan, Effirul Ikhwan

2015-01-01

The unique programmability of nucleic acids offers alternative in constructing excitable and functional nanostructures. This work introduces an autonomous protocol to construct DNA Tetris shapes (L-Shape, B-Shape, T-Shape and I-Shape) using modular DNA blocks. The protocol exploits the rich number of sequence combinations available from the nucleic acid alphabets, thus allowing for diversity to be applied in designing various DNA nanostructures. Instead of a deterministic set of sequences corresponding to a particular design, the protocol promotes a large pool of DNA shapes that can assemble to conform to any desired structures. By utilising evolutionary programming in the design stage, DNA blocks are subjected to processes such as sequence insertion, deletion and base shifting in order to enrich the diversity of the resulting shapes based on a set of cascading filters. The optimisation algorithm allows mutation to be exerted indefinitely on the candidate sequences until these sequences complied with all the four fitness criteria. Generated candidates from the protocol are in agreement with the filter cascades and thermodynamic simulation. Further validation using gel electrophoresis indicated the formation of the designed shapes. Thus, supporting the plausibility of constructing DNA nanostructures in a more hierarchical, modular, and interchangeable manner.
Design and preparation of beta-sheet forming repetitive and block-copolymerized polypeptides.

PubMed

Higashiya, Seiichiro; Topilina, Natalya I; Ngo, Silvana C; Zagorevskii, Dmitri; Welch, John T

2007-05-01

The design and rapid construction of libraries of genes coding beta-sheet forming repetitive and block-copolymerized polypeptides bearing various C- and N-terminal sequences are described. The design was based on the assembly of DNA cassettes coding for the (GA)3GX amino acid sequence where the (GAGAGA) sequences would constitute the beta-strand units of a larger beta-sheet assembly. The edges of this beta-sheet would be functionalized by the turn-inducing amino acids (GX). The polypeptides were expressed in Escherichia coli using conventional vectors and were purified by Ni-nitriloacetic acid (NTA) chromatography. The correlation of polymer structure with molecular weight was investigated by gel electrophoresis and mass spectrometry. The monomer sequences and post-translational chemical modifications were found to influence the mobility of the polypeptides over the full range of polypeptide molecular weights while the electrophoretic mobility of lower molecular weight polypeptides was more susceptible to C- and N-termini polypeptide modifications.
WebLogo

DOE Office of Scientific and Technical Information (OSTI.GOV)

Crooks, Gavin E.

WebLogo is a web based application designed to make the generation of sequence logos as easy and painless as possible. Sequesnce logos are a graphical representation of an amino acid or nucleic acid multiple sequence alignment developed by Tom Schneider and Mike Stephens. Each logo consists of stacks of symbols, one stack for each position in the sequence. The overall height of the stack indicates the sequence conservation at that position, while the height of symbols within the stack indicates the relative frequency of each amino or nucleic acid at that position. In general, a sequence logo provides a richermore » and more precise description of, for example, a binding site, than would a consensus sequence.« less
WEB-server for search of a periodicity in amino acid and nucleotide sequences

NASA Astrophysics Data System (ADS)

E Frenkel, F.; Skryabin, K. G.; Korotkov, E. V.

2017-12-01

A new web server (http://victoria.biengi.ac.ru/splinter/login.php) was designed and developed to search for periodicity in nucleotide and amino acid sequences. The web server operation is based upon a new mathematical method of searching for multiple alignments, which is founded on the position weight matrices optimization, as well as on implementation of the two-dimensional dynamic programming. This approach allows the construction of multiple alignments of the indistinctly similar amino acid and nucleotide sequences that accumulated more than 1.5 substitutions per a single amino acid or a nucleotide without performing the sequences paired comparisons. The article examines the principles of the web server operation and two examples of studying amino acid and nucleotide sequences, as well as information that could be obtained using the web server.
Protein location prediction using atomic composition and global features of the amino acid sequence

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cherian, Betsy Sheena, E-mail: betsy.skb@gmail.com; Nair, Achuthsankar S.

2010-01-22

Subcellular location of protein is constructive information in determining its function, screening for drug candidates, vaccine design, annotation of gene products and in selecting relevant proteins for further studies. Computational prediction of subcellular localization deals with predicting the location of a protein from its amino acid sequence. For a computational localization prediction method to be more accurate, it should exploit all possible relevant biological features that contribute to the subcellular localization. In this work, we extracted the biological features from the full length protein sequence to incorporate more biological information. A new biological feature, distribution of atomic composition is effectivelymore » used with, multiple physiochemical properties, amino acid composition, three part amino acid composition, and sequence similarity for predicting the subcellular location of the protein. Support Vector Machines are designed for four modules and prediction is made by a weighted voting system. Our system makes prediction with an accuracy of 100, 82.47, 88.81 for self-consistency test, jackknife test and independent data test respectively. Our results provide evidence that the prediction based on the biological features derived from the full length amino acid sequence gives better accuracy than those derived from N-terminal alone. Considering the features as a distribution within the entire sequence will bring out underlying property distribution to a greater detail to enhance the prediction accuracy.« less
Terminal sequence importance of de novo proteins from binary-patterned library: stable artificial proteins with 11- or 12-amino acid alphabet.

PubMed

Okura, Hiromichi; Takahashi, Tsuyoshi; Mihara, Hisakazu

2012-06-01

Successful approaches of de novo protein design suggest a great potential to create novel structural folds and to understand natural rules of protein folding. For these purposes, smaller and simpler de novo proteins have been developed. Here, we constructed smaller proteins by removing the terminal sequences from stable de novo vTAJ proteins and compared stabilities between mutant and original proteins. vTAJ proteins were screened from an α3β3 binary-patterned library which was designed with polar/ nonpolar periodicities of α-helix and β-sheet. vTAJ proteins have the additional terminal sequences due to the method of constructing the genetically repeated library sequences. By removing the parts of the sequences, we successfully obtained the stable smaller de novo protein mutants with fewer amino acid alphabets than the originals. However, these mutants showed the differences on ANS binding properties and stabilities against denaturant and pH change. The terminal sequences, which were designed just as flexible linkers not as secondary structure units, sufficiently affected these physicochemical details. This study showed implications for adjusting protein stabilities by designing N- and C-terminal sequences.
Epitaxial Nucleation on Rationally Designed Peptide Functionalized Interface

DTIC Science & Technology

2011-07-19

of 17 amino acid peptides. In this report, we focus on the findings from several variants of these sequences, including the role of charge...separation and histidine-gold coordination. We find that these 17 amino acid peptide sequences behave robustly, where periodicity appears to dominate the...26,27 Secondary structure propensity refers to the intrinsic inclination of individual amino acids to a given secondary structure, where side-group
A reduced amino acid alphabet for understanding and designing protein adaptation to mutation.

PubMed

Etchebest, C; Benros, C; Bornot, A; Camproux, A-C; de Brevern, A G

2007-11-01

Protein sequence world is considerably larger than structure world. In consequence, numerous non-related sequences may adopt similar 3D folds and different kinds of amino acids may thus be found in similar 3D structures. By grouping together the 20 amino acids into a smaller number of representative residues with similar features, sequence world simplification may be achieved. This clustering hence defines a reduced amino acid alphabet (reduced AAA). Numerous works have shown that protein 3D structures are composed of a limited number of building blocks, defining a structural alphabet. We previously identified such an alphabet composed of 16 representative structural motifs (5-residues length) called Protein Blocks (PBs). This alphabet permits to translate the structure (3D) in sequence of PBs (1D). Based on these two concepts, reduced AAA and PBs, we analyzed the distributions of the different kinds of amino acids and their equivalences in the structural context. Different reduced sets were considered. Recurrent amino acid associations were found in all the local structures while other were specific of some local structures (PBs) (e.g Cysteine, Histidine, Threonine and Serine for the alpha-helix Ncap). Some similar associations are found in other reduced AAAs, e.g Ile with Val, or hydrophobic aromatic residues Trp with Phe and Tyr. We put into evidence interesting alternative associations. This highlights the dependence on the information considered (sequence or structure). This approach, equivalent to a substitution matrix, could be useful for designing protein sequence with different features (for instance adaptation to environment) while preserving mainly the 3D fold.
Molecular beacon sequence design algorithm.

PubMed

Monroe, W Todd; Haselton, Frederick R

2003-01-01

A method based on Web-based tools is presented to design optimally functioning molecular beacons. Molecular beacons, fluorogenic hybridization probes, are a powerful tool for the rapid and specific detection of a particular nucleic acid sequence. However, their synthesis costs can be considerable. Since molecular beacon performance is based on its sequence, it is imperative to rationally design an optimal sequence before synthesis. The algorithm presented here uses simple Microsoft Excel formulas and macros to rank candidate sequences. This analysis is carried out using mfold structural predictions along with other free Web-based tools. For smaller laboratories where molecular beacons are not the focus of research, the public domain algorithm described here may be usefully employed to aid in molecular beacon design.
RosettaAntibodyDesign (RAbD): A general framework for computational antibody design

PubMed Central

Adolf-Bryfogle, Jared; Kalyuzhniy, Oleks; Kubitz, Michael; Hu, Xiaozhen; Adachi, Yumiko; Schief, William R.

2018-01-01

A structural-bioinformatics-based computational methodology and framework have been developed for the design of antibodies to targets of interest. RosettaAntibodyDesign (RAbD) samples the diverse sequence, structure, and binding space of an antibody to an antigen in highly customizable protocols for the design of antibodies in a broad range of applications. The program samples antibody sequences and structures by grafting structures from a widely accepted set of the canonical clusters of CDRs (North et al., J. Mol. Biol., 406:228–256, 2011). It then performs sequence design according to amino acid sequence profiles of each cluster, and samples CDR backbones using a flexible-backbone design protocol incorporating cluster-based CDR constraints. Starting from an existing experimental or computationally modeled antigen-antibody structure, RAbD can be used to redesign a single CDR or multiple CDRs with loops of different length, conformation, and sequence. We rigorously benchmarked RAbD on a set of 60 diverse antibody–antigen complexes, using two design strategies—optimizing total Rosetta energy and optimizing interface energy alone. We utilized two novel metrics for measuring success in computational protein design. The design risk ratio (DRR) is equal to the frequency of recovery of native CDR lengths and clusters divided by the frequency of sampling of those features during the Monte Carlo design procedure. Ratios greater than 1.0 indicate that the design process is picking out the native more frequently than expected from their sampled rate. We achieved DRRs for the non-H3 CDRs of between 2.4 and 4.0. The antigen risk ratio (ARR) is the ratio of frequencies of the native amino acid types, CDR lengths, and clusters in the output decoys for simulations performed in the presence and absence of the antigen. For CDRs, we achieved cluster ARRs as high as 2.5 for L1 and 1.5 for H2. For sequence design simulations without CDR grafting, the overall recovery for the native amino acid types for residues that contact the antigen in the native structures was 72% in simulations performed in the presence of the antigen and 48% in simulations performed without the antigen, for an ARR of 1.5. For the non-contacting residues, the ARR was 1.08. This shows that the sequence profiles are able to maintain the amino acid types of these conserved, buried sites, while recovery of the exposed, contacting residues requires the presence of the antigen-antibody interface. We tested RAbD experimentally on both a lambda and kappa antibody–antigen complex, successfully improving their affinities 10 to 50 fold by replacing individual CDRs of the native antibody with new CDR lengths and clusters. PMID:29702641
RosettaAntibodyDesign (RAbD): A general framework for computational antibody design.

PubMed

Adolf-Bryfogle, Jared; Kalyuzhniy, Oleks; Kubitz, Michael; Weitzner, Brian D; Hu, Xiaozhen; Adachi, Yumiko; Schief, William R; Dunbrack, Roland L

2018-04-01

A structural-bioinformatics-based computational methodology and framework have been developed for the design of antibodies to targets of interest. RosettaAntibodyDesign (RAbD) samples the diverse sequence, structure, and binding space of an antibody to an antigen in highly customizable protocols for the design of antibodies in a broad range of applications. The program samples antibody sequences and structures by grafting structures from a widely accepted set of the canonical clusters of CDRs (North et al., J. Mol. Biol., 406:228-256, 2011). It then performs sequence design according to amino acid sequence profiles of each cluster, and samples CDR backbones using a flexible-backbone design protocol incorporating cluster-based CDR constraints. Starting from an existing experimental or computationally modeled antigen-antibody structure, RAbD can be used to redesign a single CDR or multiple CDRs with loops of different length, conformation, and sequence. We rigorously benchmarked RAbD on a set of 60 diverse antibody-antigen complexes, using two design strategies-optimizing total Rosetta energy and optimizing interface energy alone. We utilized two novel metrics for measuring success in computational protein design. The design risk ratio (DRR) is equal to the frequency of recovery of native CDR lengths and clusters divided by the frequency of sampling of those features during the Monte Carlo design procedure. Ratios greater than 1.0 indicate that the design process is picking out the native more frequently than expected from their sampled rate. We achieved DRRs for the non-H3 CDRs of between 2.4 and 4.0. The antigen risk ratio (ARR) is the ratio of frequencies of the native amino acid types, CDR lengths, and clusters in the output decoys for simulations performed in the presence and absence of the antigen. For CDRs, we achieved cluster ARRs as high as 2.5 for L1 and 1.5 for H2. For sequence design simulations without CDR grafting, the overall recovery for the native amino acid types for residues that contact the antigen in the native structures was 72% in simulations performed in the presence of the antigen and 48% in simulations performed without the antigen, for an ARR of 1.5. For the non-contacting residues, the ARR was 1.08. This shows that the sequence profiles are able to maintain the amino acid types of these conserved, buried sites, while recovery of the exposed, contacting residues requires the presence of the antigen-antibody interface. We tested RAbD experimentally on both a lambda and kappa antibody-antigen complex, successfully improving their affinities 10 to 50 fold by replacing individual CDRs of the native antibody with new CDR lengths and clusters.
Peptide array-based interaction assay of solid-bound peptides and anchorage-dependant cells and its effectiveness in cell-adhesive peptide design.

PubMed

Kato, Ryuji; Kaga, Chiaki; Kunimatsu, Mitoshi; Kobayashi, Takeshi; Honda, Hiroyuki

2006-06-01

Peptide array, the designable peptide library covalently synthesized on cellulose support, was applied to assay peptide-cell interaction, between solid-bound peptides and anchorage-dependant cells, to study objective peptide design. As a model case, cell-adhesive peptides that could enhance cell growth as tissue engineering scaffold material, was studied. On the peptide array, the relative cell-adhesion ratio of NIH/3T3 cells was 2.5-fold higher on the RGDS (Arg-Gly-Asp-Ser) peptide spot as compared to the spot with no peptide, thus indicating integrin-mediated peptide-cell interaction. Such strong cell adhesion mediated by the RGDS peptide was easily disrupted by single residue substitution on the peptide array, thus indicating that the sequence recognition accuracy of cells was strictly conserved in our optimized scheme. The observed cellular morphological extension with active actin stress-fiber on the RGD motif-containing peptide supported our strategy that peptide array-based interaction assay of solid-bound peptide and anchorage-dependant cells (PIASPAC) could provide quantitative data on biological peptide-cell interaction. The analysis of 180 peptides obtained from fibronectin type III domain (no. 1447-1629) yielded 18 novel cell-adhesive peptides without the RGD motif. Taken together with the novel candidates, representative rules of ineffective amino acid usage were obtained from non-effective candidate sequences for the effective designing of cell-adhesive peptides. On comparing the amino acid usage of the top 20 and last 20 peptides from the 180 peptides, the following four brief design rules were indicated: (i) Arg or Lys of positively charged amino acids (except His) could enhance cell adhesion, (ii) small hydrophilic amino acids are favored in cell-adhesion peptides, (iii) negatively charged amino acids and small amino acids (except Gly) could reduce cell adhesion, and (iv) Cys and Met could be excluded from the sequence combination since they have less influence on the peptide design. Such rules that are indicative of the nature of the functional peptide sequence can be obtained only by the mass comparison analysis of PIASPAC using peptide array. By following such indicative rules, numerous amino acid combinations can be effectively screened for further examination of novel peptide design.
The shikimate pathway: review of amino acid sequence, function and three-dimensional structures of the enzymes.

PubMed

Mir, Rafia; Jallu, Shais; Singh, T P

2015-06-01

The aromatic compounds such as aromatic amino acids, vitamin K and ubiquinone are important prerequisites for the metabolism of an organism. All organisms can synthesize these aromatic metabolites through shikimate pathway, except for mammals which are dependent on their diet for these compounds. The pathway converts phosphoenolpyruvate and erythrose 4-phosphate to chorismate through seven enzymatically catalyzed steps and chorismate serves as a precursor for the synthesis of variety of aromatic compounds. These enzymes have shown to play a vital role for the viability of microorganisms and thus are suggested to present attractive molecular targets for the design of novel antimicrobial drugs. This review focuses on the seven enzymes of the shikimate pathway, highlighting their primary sequences, functions and three-dimensional structures. The understanding of their active site amino acid maps, functions and three-dimensional structures will provide a framework on which the rational design of antimicrobial drugs would be based. Comparing the full length amino acid sequences and the X-ray crystal structures of these enzymes from bacteria, fungi and plant sources would contribute in designing a specific drug and/or in developing broad-spectrum compounds with efficacy against a variety of pathogens.
Sequence signatures of allosteric proteins towards rational design.

PubMed

Namboodiri, Saritha; Verma, Chandra; Dhar, Pawan K; Giuliani, Alessandro; Nair, Achuthsankar S

2010-12-01

Allostery is the phenomenon of changes in the structure and activity of proteins that appear as a consequence of ligand binding at sites other than the active site. Studying mechanistic basis of allostery leading to protein design with predetermined functional endpoints is an important unmet need of synthetic biology. Here, we screened the amino acid sequence landscape in search of sequence-signatures of allostery using Recurrence Quantitative Analysis (RQA) method. A characteristic vector, comprised of 10 features extracted from RQA was defined for amino acid sequences. Using Principal Component Analysis, four factors were found to be important determinants of allosteric behavior. Our sequence-based predictor method shows 82.6% accuracy, 85.7% sensitivity and 77.9% specificity with the current dataset. Further, we show that Laminarity-Mean-hydrophobicity representing repeated hydrophobic patches is the most crucial indicator of allostery. To our best knowledge this is the first report that describes sequence determinants of allostery based on hydrophobicity. As an outcome of these findings, we plan to explore possibility of inducing allostery in proteins.
Guaranteed Discrete Energy Optimization on Large Protein Design Problems.

PubMed

Simoncini, David; Allouche, David; de Givry, Simon; Delmas, Céline; Barbe, Sophie; Schiex, Thomas

2015-12-08

In Computational Protein Design (CPD), assuming a rigid backbone and amino-acid rotamer library, the problem of finding a sequence with an optimal conformation is NP-hard. In this paper, using Dunbrack's rotamer library and Talaris2014 decomposable energy function, we use an exact deterministic method combining branch and bound, arc consistency, and tree-decomposition to provenly identify the global minimum energy sequence-conformation on full-redesign problems, defining search spaces of size up to 10(234). This is achieved on a single core of a standard computing server, requiring a maximum of 66GB RAM. A variant of the algorithm is able to exhaustively enumerate all sequence-conformations within an energy threshold of the optimum. These proven optimal solutions are then used to evaluate the frequencies and amplitudes, in energy and sequence, at which an existing CPD-dedicated simulated annealing implementation may miss the optimum on these full redesign problems. The probability of finding an optimum drops close to 0 very quickly. In the worst case, despite 1,000 repeats, the annealing algorithm remained more than 1 Rosetta unit away from the optimum, leading to design sequences that could differ from the optimal sequence by more than 30% of their amino acids.
Protein Design Using Unnatural Amino Acids

NASA Astrophysics Data System (ADS)

Bilgiçer, Basar; Kumar, Krishna

2003-11-01

With the increasing availability of whole organism genome sequences, understanding protein structure and function is of capital importance. Recent developments in the methodology of incorporation of unnatural amino acids into proteins allow the exploration of proteins at a very detailed level. Furthermore, de novo design of novel protein structures and function is feasible with unprecedented sophistication. Using examples from the literature, this article describes the available methods for unnatural amino acid incorporation and highlights some recent applications including the design of hyperstable protein folds.
Coarse-grained sequences for protein folding and design.

PubMed

Brown, Scott; Fawzi, Nicolas J; Head-Gordon, Teresa

2003-09-16

We present the results of sequence design on our off-lattice minimalist model in which no specification of native-state tertiary contacts is needed. We start with a sequence that adopts a target topology and build on it through sequence mutation to produce new sequences that comprise distinct members within a target fold class. In this work, we use the alpha/beta ubiquitin fold class and design two new sequences that, when characterized through folding simulations, reproduce the differences in folding mechanism seen experimentally for proteins L and G. The primary implication of this work is that patterning of hydrophobic and hydrophilic residues is the physical origin for the success of relative contact-order descriptions of folding, and that these physics-based potentials provide a predictive connection between free energy landscapes and amino acid sequence (the original protein folding problem). We present results of the sequence mapping from a 20- to the three-letter code for determining a sequence that folds into the WW domain topology to illustrate future extensions to protein design.
Coarse-grained sequences for protein folding and design

PubMed Central

Brown, Scott; Fawzi, Nicolas J.; Head-Gordon, Teresa

2003-01-01

We present the results of sequence design on our off-lattice minimalist model in which no specification of native-state tertiary contacts is needed. We start with a sequence that adopts a target topology and build on it through sequence mutation to produce new sequences that comprise distinct members within a target fold class. In this work, we use the α/β ubiquitin fold class and design two new sequences that, when characterized through folding simulations, reproduce the differences in folding mechanism seen experimentally for proteins L and G. The primary implication of this work is that patterning of hydrophobic and hydrophilic residues is the physical origin for the success of relative contact-order descriptions of folding, and that these physics-based potentials provide a predictive connection between free energy landscapes and amino acid sequence (the original protein folding problem). We present results of the sequence mapping from a 20- to the three-letter code for determining a sequence that folds into the WW domain topology to illustrate future extensions to protein design. PMID:12963815
Spreadsheet macros for coloring sequence alignments.

PubMed

Haygood, M G

1993-12-01

This article describes a set of Microsoft Excel macros designed to color amino acid and nucleotide sequence alignments for review and preparation of visual aids. The colored alignments can then be modified to emphasize features of interest. Procedures for importing and coloring sequences are described. The macro file adds a new menu to the menu bar containing sequence-related commands to enable users unfamiliar with Excel to use the macros more readily. The macros were designed for use with Macintosh computers but will also run with the DOS version of Excel.
Computational design of enzyme-ligand binding using a combined energy function and deterministic sequence optimization algorithm.

PubMed

Tian, Ye; Huang, Xiaoqiang; Zhu, Yushan

2015-08-01

Enzyme amino-acid sequences at ligand-binding interfaces are evolutionarily optimized for reactions, and the natural conformation of an enzyme-ligand complex must have a low free energy relative to alternative conformations in native-like or non-native sequences. Based on this assumption, a combined energy function was developed for enzyme design and then evaluated by recapitulating native enzyme sequences at ligand-binding interfaces for 10 enzyme-ligand complexes. In this energy function, the electrostatic interaction between polar or charged atoms at buried interfaces is described by an explicitly orientation-dependent hydrogen-bonding potential and a pairwise-decomposable generalized Born model based on the general side chain in the protein design framework. The energy function is augmented with a pairwise surface-area based hydrophobic contribution for nonpolar atom burial. Using this function, on average, 78% of the amino acids at ligand-binding sites were predicted correctly in the minimum-energy sequences, whereas 84% were predicted correctly in the most-similar sequences, which were selected from the top 20 sequences for each enzyme-ligand complex. Hydrogen bonds at the enzyme-ligand binding interfaces in the 10 complexes were usually recovered with the correct geometries. The binding energies calculated using the combined energy function helped to discriminate the active sequences from a pool of alternative sequences that were generated by repeatedly solving a series of mixed-integer linear programming problems for sequence selection with increasing integer cuts.

Silver ions-mediated conformational switch: facile design of structure-controllable nucleic acid probes.

PubMed

Wang, Yongxiang; Li, Jishan; Wang, Hao; Jin, Jianyu; Liu, Jinhua; Wang, Kemin; Tan, Weihong; Yang, Ronghua

2010-08-01

Conformationally constraint nucleic acid probes were usually designed by forming an intramolecular duplex based on Watson-Crick hydrogen bonds. The disadvantages of these approaches are the inflexibility and instability in complex environment of the Watson-Crick-based duplex. We report that this hydrogen bonding pattern can be replaced by metal-ligation between specific metal ions and the natural bases. To demonstrate the feasibility of this principle, two linear oligonucleotides and silver ions were examined as models for DNA hybridization assay and adenosine triphosphate detection. The both nucleic acids contain target binding sequences in the middle and cytosine (C)-rich sequences at the lateral portions. The strong interaction between Ag(+) ions and cytosines forms stable C-Ag(+)-C structures, which promises the oligonucleotides to form conformationally constraint formations. In the presence of its target, interaction between the loop sequences and the target unfolds the C-Ag(+)-C structures, and the corresponding probes unfolding can be detected by a change in their fluorescence emission. We discuss the thermodynamic and kinetic opportunities that are provided by using Ag(+) ion complexes instead of traditional Watson-Crick-based duplex. In particular, the intrinsic feature of the metal-ligation motif facilitates the design of functional nucleic acids probes by independently varying the concentration of Ag(+) ions in the medium.
Identification and characterization of Theileria ovis surface protein (ToSp) resembled TaSp in Theileria annulata.

PubMed

Shayan, P; Jafari, S; Fattahi, R; Ebrahimzade, E; Amininia, N; Changizi, E

2016-05-01

Ovine theileriosis is an important hemoprotozoal disease of sheep and goats in tropical and subtropical regions which caused high economic loses in the livestock industry. Theileria annulata surface protein (TaSp) was used previously as a tool for serological analysis in livestock. Since the amino acid sequences of TaSp is, at least, in part very conserved in T. annulata, Theileria lestoquardi and Theileria china I and II, it is very important to determine the amino acid sequence of this protein in Theileria ovis as well, to avoid false interpretation of serological data based on this protein in small animal. In the present study, the nucleotide sequence and amino acid sequence of T. ovis surface protein (ToSp) were determined. The comparison of the nucleotide sequence of ToSp showed 96, 96, 99, and 86 % homology to the corresponding nucleotide sequence of TaSp genes by T. annulata, T. China I, T. China II and T. lestoquardi, previously registered in GenBank under accession nos. AJ316260.1, AY274329.1, DQ120058.1, and EF092924.1 respectively. The amino acid sequence analysis showed 95, 81, 98 and 70 % homology to the corresponding amino acid sequence of T. annulata, T chinaI, T china II and T. lestoquardi, registered in GenBank under accession nos. CAC87478.1, AAP36993.1, AAZ30365.1 and AAP36999.11, respectively. Interestingly, in contrast to the C terminus, a significant difference in amino acid sequence in the N teminus of the ToSp protein could be determined compared to the other known corresponding TaSp sequences, which make this region attractive for designing of a suitable tool for serological diagnosis.
Rapid Threat Organism Recognition Pipeline

DOE Office of Scientific and Technical Information (OSTI.GOV)

Williams, Kelly P.; Solberg, Owen D.; Schoeniger, Joseph S.

2013-05-07

The RAPTOR computational pipeline identifies microbial nucleic acid sequences present in sequence data from clinical samples. It takes as input raw short-read genomic sequence data (in particular, the type generated by the Illumina sequencing platforms) and outputs taxonomic evaluation of detected microbes in various human-readable formats. This software was designed to assist in the diagnosis or characterization of infectious disease, by detecting pathogen sequences in nucleic acid sequence data from clinical samples. It has also been applied in the detection of algal pathogens, when algal biofuel ponds became unproductive. RAPTOR first trims and filters genomic sequence reads based on qualitymore » and related considerations, then performs a quick alignment to the human (or other host) genome to filter out host sequences, then performs a deeper search against microbial genomes. Alignment to a protein sequence database is optional. Alignment results are summarized and placed in a taxonomic framework using the Lowest Common Ancestor algorithm.« less
Capturing the genetic makeup of the active microbiome in situ.

PubMed

Singer, Esther; Wagner, Michael; Woyke, Tanja

2017-09-01

More than any other technology, nucleic acid sequencing has enabled microbial ecology studies to be complemented with the data volumes necessary to capture the extent of microbial diversity and dynamics in a wide range of environments. In order to truly understand and predict environmental processes, however, the distinction between active, inactive and dead microbial cells is critical. Also, experimental designs need to be sensitive toward varying population complexity and activity, and temporal as well as spatial scales of process rates. There are a number of approaches, including single-cell techniques, which were designed to study in situ microbial activity and that have been successively coupled to nucleic acid sequencing. The exciting new discoveries regarding in situ microbial activity provide evidence that future microbial ecology studies will indispensably rely on techniques that specifically capture members of the microbiome active in the environment. Herein, we review those currently used activity-based approaches that can be directly linked to shotgun nucleic acid sequencing, evaluate their relevance to ecology studies, and discuss future directions.
Capturing the genetic makeup of the active microbiome in situ

PubMed Central

Singer, Esther; Wagner, Michael; Woyke, Tanja

2017-01-01

More than any other technology, nucleic acid sequencing has enabled microbial ecology studies to be complemented with the data volumes necessary to capture the extent of microbial diversity and dynamics in a wide range of environments. In order to truly understand and predict environmental processes, however, the distinction between active, inactive and dead microbial cells is critical. Also, experimental designs need to be sensitive toward varying population complexity and activity, and temporal as well as spatial scales of process rates. There are a number of approaches, including single-cell techniques, which were designed to study in situ microbial activity and that have been successively coupled to nucleic acid sequencing. The exciting new discoveries regarding in situ microbial activity provide evidence that future microbial ecology studies will indispensably rely on techniques that specifically capture members of the microbiome active in the environment. Herein, we review those currently used activity-based approaches that can be directly linked to shotgun nucleic acid sequencing, evaluate their relevance to ecology studies, and discuss future directions. PMID:28574490
Advances in Understanding Stimulus Responsive Phase Behavior of Intrinsically Disordered Protein Polymers.

PubMed

Ruff, Kiersten M; Roberts, Stefan; Chilkoti, Ashutosh; Pappu, Rohit V

2018-06-24

Proteins and synthetic polymers can undergo phase transitions in response to changes to intensive solution parameters such as temperature, proton chemical potentials (pH), and hydrostatic pressure. For proteins and protein-based polymers, the information required for stimulus responsive phase transitions is encoded in their amino acid sequence. Here, we review some of the key physical principles that govern the phase transitions of archetypal intrinsically disordered protein polymers (IDPPs). These are disordered proteins with highly repetitive amino acid sequences. Advances in recombinant technologies have enabled the design and synthesis of protein sequences of a variety of sequence complexities and lengths. We summarize insights that have been gleaned from the design and characterization of IDPPs that undergo thermo-responsive phase transitions and build on these insights to present a general framework for IDPPs with pH and pressure responsive phase behavior. In doing so, we connect the stimulus responsive phase behavior of IDPPs with repetitive sequences to the coil-to-globule transitions that these sequences undergo at the single chain level in response to changes in stimuli. The proposed framework and ongoing studies of stimulus responsive phase behavior of designed IDPPs have direct implications in bioengineering, where designing sequences with bespoke material properties broadens the spectrum of applications, and in biology and medicine for understanding the sequence-specific driving forces for the formation of protein-based membraneless organelles as well as biological matrices that act as scaffolds for cells and mediators of cell-to-cell communication. Copyright © 2018. Published by Elsevier Ltd.
Increasing Sequence Diversity with Flexible Backbone Protein Design: The Complete Redesign of a Protein Hydrophobic Core

DOE Office of Scientific and Technical Information (OSTI.GOV)

Murphy, Grant S.; Mills, Jeffrey L.; Miley, Michael J.

2015-10-15

Protein design tests our understanding of protein stability and structure. Successful design methods should allow the exploration of sequence space not found in nature. However, when redesigning naturally occurring protein structures, most fixed backbone design algorithms return amino acid sequences that share strong sequence identity with wild-type sequences, especially in the protein core. This behavior places a restriction on functional space that can be explored and is not consistent with observations from nature, where sequences of low identity have similar structures. Here, we allow backbone flexibility during design to mutate every position in the core (38 residues) of a four-helixmore » bundle protein. Only small perturbations to the backbone, 12 {angstrom}, were needed to entirely mutate the core. The redesigned protein, DRNN, is exceptionally stable (melting point >140C). An NMR and X-ray crystal structure show that the side chains and backbone were accurately modeled (all-atom RMSD = 1.3 {angstrom}).« less
Structure-based conformational preferences of amino acids

PubMed Central

Koehl, Patrice; Levitt, Michael

1999-01-01

Proteins can be very tolerant to amino acid substitution, even within their core. Understanding the factors responsible for this behavior is of critical importance for protein engineering and design. Mutations in proteins have been quantified in terms of the changes in stability they induce. For example, guest residues in specific secondary structures have been used as probes of conformational preferences of amino acids, yielding propensity scales. Predicting these amino acid propensities would be a good test of any new potential energy functions used to mimic protein stability. We have recently developed a protein design procedure that optimizes whole sequences for a given target conformation based on the knowledge of the template backbone and on a semiempirical potential energy function. This energy function is purely physical, including steric interactions based on a Lennard-Jones potential, electrostatics based on a Coulomb potential, and hydrophobicity in the form of an environment free energy based on accessible surface area and interatomic contact areas. Sequences designed by this procedure for 10 different proteins were analyzed to extract conformational preferences for amino acids. The resulting structure-based propensity scales show significant agreements with experimental propensity scale values, both for α-helices and β-sheets. These results indicate that amino acid conformational preferences are a natural consequence of the potential energy we use. This confirms the accuracy of our potential and indicates that such preferences should not be added as a design criterion. PMID:10535955
Neutrality and evolvability of designed protein sequences

NASA Astrophysics Data System (ADS)

Bhattacherjee, Arnab; Biswas, Parbati

2010-07-01

The effect of foldability on protein’s evolvability is analyzed by a two-prong approach consisting of a self-consistent mean-field theory and Monte Carlo simulations. Theory and simulation models representing protein sequences with binary patterning of amino acid residues compatible with a particular foldability criteria are used. This generalized foldability criterion is derived using the high temperature cumulant expansion approximating the free energy of folding. The effect of cumulative point mutations on these designed proteins is studied under neutral condition. The robustness, protein’s ability to tolerate random point mutations is determined with a selective pressure of stability (ΔΔG) for the theory designed sequences, which are found to be more robust than that of Monte Carlo and mean-field-biased Monte Carlo generated sequences. The results show that this foldability criterion selects viable protein sequences more effectively compared to the Monte Carlo method, which has a marked effect on how the selective pressure shapes the evolutionary sequence space. These observations may impact de novo sequence design and its applications in protein engineering.
Combining Rosetta with molecular dynamics (MD): A benchmark of the MD-based ensemble protein design.

PubMed

Ludwiczak, Jan; Jarmula, Adam; Dunin-Horkawicz, Stanislaw

2018-07-01

Computational protein design is a set of procedures for computing amino acid sequences that will fold into a specified structure. Rosetta Design, a commonly used software for protein design, allows for the effective identification of sequences compatible with a given backbone structure, while molecular dynamics (MD) simulations can thoroughly sample near-native conformations. We benchmarked a procedure in which Rosetta design is started on MD-derived structural ensembles and showed that such a combined approach generates 20-30% more diverse sequences than currently available methods with only a slight increase in computation time. Importantly, the increase in diversity is achieved without a loss in the quality of the designed sequences assessed by their resemblance to natural sequences. We demonstrate that the MD-based procedure is also applicable to de novo design tasks started from backbone structures without any sequence information. In addition, we implemented a protocol that can be used to assess the stability of designed models and to select the best candidates for experimental validation. In sum our results demonstrate that the MD ensemble-based flexible backbone design can be a viable method for protein design, especially for tasks that require a large pool of diverse sequences. Copyright © 2018 Elsevier Inc. All rights reserved.
Genomic Sequence of the WHO International Standard for Hepatitis A Virus RNA.

PubMed

Jenkins, Adrian; Minhas, Rehan; Morris, Clare; Berry, Neil

2018-05-10

The World Health Organization (WHO) international standard for hepatitis A virus (HAV) RNA nucleic acid assays was characterized by complete genome sequencing. The entire coding sequence and noncoding regions were assigned HAV genotype IB. This information will aid the design, development, and evaluation of HAV RNA amplification assays. Copyright © 2018 Jenkins et al.
Rational design of DNA sequences for nanotechnology, microarrays and molecular computers using Eulerian graphs.

PubMed

Pancoska, Petr; Moravek, Zdenek; Moll, Ute M

2004-01-01

Nucleic acids are molecules of choice for both established and emerging nanoscale technologies. These technologies benefit from large functional densities of 'DNA processing elements' that can be readily manufactured. To achieve the desired functionality, polynucleotide sequences are currently designed by a process that involves tedious and laborious filtering of potential candidates against a series of requirements and parameters. Here, we present a complete novel methodology for the rapid rational design of large sets of DNA sequences. This method allows for the direct implementation of very complex and detailed requirements for the generated sequences, thus avoiding 'brute force' filtering. At the same time, these sequences have narrow distributions of melting temperatures. The molecular part of the design process can be done without computer assistance, using an efficient 'human engineering' approach by drawing a single blueprint graph that represents all generated sequences. Moreover, the method eliminates the necessity for extensive thermodynamic calculations. Melting temperature can be calculated only once (or not at all). In addition, the isostability of the sequences is independent of the selection of a particular set of thermodynamic parameters. Applications are presented for DNA sequence designs for microarrays, universal microarray zip sequences and electron transfer experiments.
Rational Protein Engineering Guided by Deep Mutational Scanning

PubMed Central

Shin, HyeonSeok; Cho, Byung-Kwan

2015-01-01

Sequence–function relationship in a protein is commonly determined by the three-dimensional protein structure followed by various biochemical experiments. However, with the explosive increase in the number of genome sequences, facilitated by recent advances in sequencing technology, the gap between protein sequences available and three-dimensional structures is rapidly widening. A recently developed method termed deep mutational scanning explores the functional phenotype of thousands of mutants via massive sequencing. Coupled with a highly efficient screening system, this approach assesses the phenotypic changes made by the substitution of each amino acid sequence that constitutes a protein. Such an informational resource provides the functional role of each amino acid sequence, thereby providing sufficient rationale for selecting target residues for protein engineering. Here, we discuss the current applications of deep mutational scanning and consider experimental design. PMID:26404267
Design of nucleic acid sequences for DNA computing based on a thermodynamic approach

PubMed Central

Tanaka, Fumiaki; Kameda, Atsushi; Yamamoto, Masahito; Ohuchi, Azuma

2005-01-01

We have developed an algorithm for designing multiple sequences of nucleic acids that have a uniform melting temperature between the sequence and its complement and that do not hybridize non-specifically with each other based on the minimum free energy (ΔGmin). Sequences that satisfy these constraints can be utilized in computations, various engineering applications such as microarrays, and nano-fabrications. Our algorithm is a random generate-and-test algorithm: it generates a candidate sequence randomly and tests whether the sequence satisfies the constraints. The novelty of our algorithm is that the filtering method uses a greedy search to calculate ΔGmin. This effectively excludes inappropriate sequences before ΔGmin is calculated, thereby reducing computation time drastically when compared with an algorithm without the filtering. Experimental results in silico showed the superiority of the greedy search over the traditional approach based on the hamming distance. In addition, experimental results in vitro demonstrated that the experimental free energy (ΔGexp) of 126 sequences correlated well with ΔGmin (|R| = 0.90) than with the hamming distance (|R| = 0.80). These results validate the rationality of a thermodynamic approach. We implemented our algorithm in a graphic user interface-based program written in Java. PMID:15701762
Molecular Design of Performance Proteins With Repetitive Sequences

NASA Astrophysics Data System (ADS)

Vendrely, Charlotte; Ackerschott, Christian; Römer, Lin; Scheibel, Thomas

Most performance proteins responsible for the mechanical stability of cells and organisms reveal highly repetitive sequences. Mimicking such performance proteins is of high interest for the design of nanostructured biomaterials. In this article, flagelliform silk is exemplary introduced to describe a general principle for designing genes of repetitive performance proteins for recombinant expression in Escherichia coli . In the first step, repeating amino acid sequence motifs are reversely transcripted into DNA cassettes, which can in a second step be seamlessly ligated, yielding a designed gene. Recombinant expression thereof leads to proteins mimicking the natural ones. The recombinant proteins can be assembled into nanostructured materials in a controlled manner, allowing their use in several applications.
Methods for determining the genetic affinity of microorganisms and viruses

NASA Technical Reports Server (NTRS)

Fox, George E. (Inventor); Willson, III, Richard C. (Inventor); Zhang, Zhengdong (Inventor)

2012-01-01

Selecting which sub-sequences in a database of nucleic acid such as 16S rRNA are highly characteristic of particular groupings of bacteria, microorganisms, fungi, etc. on a substantially phylogenetic tree. Also applicable to viruses comprising viral genomic RNA or DNA. A catalogue of highly characteristic sequences identified by this method is assembled to establish the genetic identity of an unknown organism. The characteristic sequences are used to design nucleic acid hybridization probes that include the characteristic sequence or its complement, or are derived from one or more characteristic sequences. A plurality of these characteristic sequences is used in hybridization to determine the phylogenetic tree position of the organism(s) in a sample. Those target organisms represented in the original sequence database and sufficient characteristic sequences can identify to the species or subspecies level. Oligonucleotide arrays of many probes are especially preferred. A hybridization signal can comprise fluorescence, chemiluminescence, or isotopic labeling, etc.; or sequences in a sample can be detected by direct means, e.g. mass spectrometry. The method's characteristic sequences can also be used to design specific PCR primers. The method uniquely identifies the phylogenetic affinity of an unknown organism without requiring prior knowledge of what is present in the sample. Even if the organism has not been previously encountered, the method still provides useful information about which phylogenetic tree bifurcation nodes encompass the organism.
Sequence and structural implications of a bovine corneal keratan sulfate proteoglycan core protein. Protein 37B represents bovine lumican and proteins 37A and 25 are unique

NASA Technical Reports Server (NTRS)

Funderburgh, J. L.; Funderburgh, M. L.; Brown, S. J.; Vergnes, J. P.; Hassell, J. R.; Mann, M. M.; Conrad, G. W.; Spooner, B. S. (Principal Investigator)

1993-01-01

Amino acid sequence from tryptic peptides of three different bovine corneal keratan sulfate proteoglycan (KSPG) core proteins (designated 37A, 37B, and 25) showed similarities to the sequence of a chicken KSPG core protein lumican. Bovine lumican cDNA was isolated from a bovine corneal expression library by screening with chicken lumican cDNA. The bovine cDNA codes for a 342-amino acid protein, M(r) 38,712, containing amino acid sequences identified in the 37B KSPG core protein. The bovine lumican is 68% identical to chicken lumican, with an 83% identity excluding the N-terminal 40 amino acids. Location of 6 cysteine and 4 consensus N-glycosylation sites in the bovine sequence were identical to those in chicken lumican. Bovine lumican had about 50% identity to bovine fibromodulin and 20% identity to bovine decorin and biglycan. About two-thirds of the lumican protein consists of a series of 10 amino acid leucine-rich repeats that occur in regions of calculated high beta-hydrophobic moment, suggesting that the leucine-rich repeats contribute to beta-sheet formation in these proteins. Sequences obtained from 37A and 25 core proteins were absent in bovine lumican, thus predicting a unique primary structure and separate mRNA for each of the three bovine KSPG core proteins.
Position-dependent effects of locked nucleic acid (LNA) on DNA sequencing and PCR primers

PubMed Central

Levin, Joshua D.; Fiala, Dean; Samala, Meinrado F.; Kahn, Jason D.; Peterson, Raymond J.

2006-01-01

Genomes are becoming heavily annotated with important features. Analysis of these features often employs oligonucleotides that hybridize at defined locations. When the defined location lies in a poor sequence context, traditional design strategies may fail. Locked Nucleic Acid (LNA) can enhance oligonucleotide affinity and specificity. Though LNA has been used in many applications, formal design rules are still being defined. To further this effort we have investigated the effect of LNA on the performance of sequencing and PCR primers in AT-rich regions, where short primers yield poor sequencing reads or PCR yields. LNA was used in three positional patterns: near the 5′ end (LNA-5′), near the 3′ end (LNA-3′) and distributed throughout (LNA-Even). Quantitative measures of sequencing read length (Phred Q30 count) and real-time PCR signal (cycle threshold, CT) were characterized using two-way ANOVA. LNA-5′ increased the average Phred Q30 score by 60% and it was never observed to decrease performance. LNA-5′ generated cycle thresholds in quantitative PCR that were comparable to high-yielding conventional primers. In contrast, LNA-3′ and LNA-Even did not improve read lengths or CT. ANOVA demonstrated the statistical significance of these results and identified significant interaction between the positional design rule and primer sequence. PMID:17071964
Rational design of new materials using recombinant structural proteins: Current state and future challenges.

PubMed

Sutherland, Tara D; Huson, Mickey G; Rapson, Trevor D

2018-01-01

Sequence-definable polymers are seen as a prerequisite for design of future materials, with many polymer scientists regarding such polymers as the holy grail of polymer science. Recombinant proteins are sequence-defined polymers. Proteins are dictated by DNA templates and therefore the sequence of amino acids in a protein is defined, and molecular biology provides tools that allow redesign of the DNA as required. Despite this advantage, proteins are underrepresented in materials science. In this publication we investigate the advantages and limitations of using proteins as templates for rational design of new materials. Crown Copyright © 2017. Published by Elsevier Inc. All rights reserved.
Characterization of tannase protein sequences of bacteria and fungi: an in silico study.

PubMed

Banerjee, Amrita; Jana, Arijit; Pati, Bikash R; Mondal, Keshab C; Das Mohapatra, Pradeep K

2012-04-01

The tannase protein sequences of 149 bacteria and 36 fungi were retrieved from NCBI database. Among them only 77 bacterial and 31 fungal tannase sequences were taken which have different amino acid compositions. These sequences were analysed for different physical and chemical properties, superfamily search, multiple sequence alignment, phylogenetic tree construction and motif finding to find out the functional motif and the evolutionary relationship among them. The superfamily search for these tannase exposed the occurrence of proline iminopeptidase-like, biotin biosynthesis protein BioH, O-acetyltransferase, carboxylesterase/thioesterase 1, carbon-carbon bond hydrolase, haloperoxidase, prolyl oligopeptidase, C-terminal domain and mycobacterial antigens families and alpha/beta hydrolase superfamily. Some bacterial and fungal sequence showed similarity with different families individually. The multiple sequence alignment of these tannase protein sequences showed conserved regions at different stretches with maximum homology from amino acid residues 389-469 and 482-523 which could be used for designing degenerate primers or probes specific for tannase producing bacterial and fungal species. Phylogenetic tree showed two different clusters; one has only bacteria and another have both fungi and bacteria showing some relationship between these different genera. Although in second cluster near about all fungal species were found together in a corner which indicates the sequence level similarity among fungal genera. The distributions of fourteen motifs analysis revealed Motif 1 with a signature amino acid sequence of 29 amino acids, i.e. GCSTGGREALKQAQRWPHDYDGIIANNPA, was uniformly observed in 83.3 % of studied tannase sequences representing its participation with the structure and enzymatic function.

Uncovering the design rules for peptide synthesis of metal nanoparticles.

PubMed

Tan, Yen Nee; Lee, Jim Yang; Wang, Daniel I C

2010-04-28

Peptides are multifunctional reagents (reducing and capping agents) that can be used for the synthesis of biocompatible metal nanoparticles under relatively mild conditions. However, the progress in peptide synthesis of metal nanoparticles has been slow due to the lack of peptide design rules. It is difficult to establish sequence-reactivity relationships from peptides isolated from biological sources (e.g., biomineralizing organisms) or selected by combinatorial display libraries because of their widely varying compositions and structures. The abundance of random and inactive amino acid sequences in the peptides also increases the difficulty in knowledge extraction. In this study, a "bottom-up" approach was used to formulate a set of rudimentary rules for the size- and shape-controlled peptide synthesis of gold nanoparticles from the properties of the 20 natural alpha-amino acids for AuCl(4)(-) reduction and binding to Au(0). It was discovered that the reduction capability of a peptide depends on the presence of certain reducing amino acid residues, whose activity may be regulated by neighboring residues with different Au(0) binding strengths. Another finding is the effect of peptide net charge on the nucleation and growth of the Au nanoparticles. On the basis of these understandings, several multifunctional peptides were designed to synthesize gold nanoparticles in different morphologies (nanospheres and nanoplates) and with sizes tunable by the strategic placement of selected amino acid residues in the peptide sequence. The methodology presented here and the findings are useful for establishing the scientific basis for the rational design of peptides for the synthesis of metal nanostructures.
Capturing the genetic makeup of the active microbiome in situ

DOE PAGES

Singer, Esther; Wagner, Michael; Woyke, Tanja

2017-06-02

More than any other technology, nucleic acid sequencing has enabled microbial ecology studies to be complemented with the data volumes necessary to capture the extent of microbial diversity and dynamics in a wide range of environments. In order to truly understand and predict environmental processes, however, the distinction between active, inactive and dead microbial cells is critical. Also, experimental designs need to be sensitive toward varying population complexity and activity, and temporal as well as spatial scales of process rates. There are a number of approaches, including single-cell techniques, which were designed to study in situ microbial activity and thatmore » have been successively coupled to nucleic acid sequencing. The exciting new discoveries regarding in situ microbial activity provide evidence that future microbial ecology studies will indispensably rely on techniques that specifically capture members of the microbiome active in the environment. Herein, we review those currently used activity-based approaches that can be directly linked to shotgun nucleic acid sequencing, evaluate their relevance to ecology studies, and discuss future directions.« less
Capturing the genetic makeup of the active microbiome in situ

DOE Office of Scientific and Technical Information (OSTI.GOV)

Singer, Esther; Wagner, Michael; Woyke, Tanja

More than any other technology, nucleic acid sequencing has enabled microbial ecology studies to be complemented with the data volumes necessary to capture the extent of microbial diversity and dynamics in a wide range of environments. In order to truly understand and predict environmental processes, however, the distinction between active, inactive and dead microbial cells is critical. Also, experimental designs need to be sensitive toward varying population complexity and activity, and temporal as well as spatial scales of process rates. There are a number of approaches, including single-cell techniques, which were designed to study in situ microbial activity and thatmore » have been successively coupled to nucleic acid sequencing. The exciting new discoveries regarding in situ microbial activity provide evidence that future microbial ecology studies will indispensably rely on techniques that specifically capture members of the microbiome active in the environment. Herein, we review those currently used activity-based approaches that can be directly linked to shotgun nucleic acid sequencing, evaluate their relevance to ecology studies, and discuss future directions.« less
Effects of the amino acid sequence on thermal conduction through β-sheet crystals of natural silk protein.

PubMed

Zhang, Lin; Bai, Zhitong; Ban, Heng; Liu, Ling

2015-11-21

Recent experiments have discovered very different thermal conductivities between the spider silk and the silkworm silk. Decoding the molecular mechanisms underpinning the distinct thermal properties may guide the rational design of synthetic silk materials and other biomaterials for multifunctionality and tunable properties. However, such an understanding is lacking, mainly due to the complex structure and phonon physics associated with the silk materials. Here, using non-equilibrium molecular dynamics, we demonstrate that the amino acid sequence plays a key role in the thermal conduction process through β-sheets, essential building blocks of natural silks and a variety of other biomaterials. Three representative β-sheet types, i.e. poly-A, poly-(GA), and poly-G, are shown to have distinct structural features and phonon dynamics leading to different thermal conductivities. A fundamental understanding of the sequence effects may stimulate the design and engineering of polymers and biopolymers for desired thermal properties.
Fine tangled pili expressed by Haemophilus ducreyi are a novel class of pili.

PubMed Central

Brentjens, R J; Ketterer, M; Apicella, M A; Spinola, S M

1996-01-01

Haemophilus ducreyi synthesizes fine, tangled pili composed predominantly of a protein whose apparent molecular weight is 24,000 (24K). A hybridoma, 2D8, produced a monoclonal antibody (MAb) that bound to a 24K protein in H. ducreyi strains isolated from diverse geographic locations. A lambda gt11 H. ducreyi library was screened with MAb 2D8. A 3.5-kb chromosomal insert from one reactive plaque was amplified and ligated into the pCRII vector. The recombinant plasmid, designated pHD24, expressed a 24K protein in Escherichia coli INV alpha F that bound MAb 2D8. The coding sequence of the 24K gene was localized by exonuclease III digestion. The insert contained a 570-bp open reading frame, designated ftpA (fine, tangled pili). Translation of ftpA predicted a polypeptide with a molecular weight of 21.1K. The predicted N-terminal amino acid sequence of the polypeptide encoded by ftpA was identical to the N-terminal amino acid sequence of purified pilin and lacked a cleavable signal sequence. Primer extension analysis of ftpA confirmed the lack of a leader peptide. The predicted amino acid sequence lacked homology to known pilin sequences but shared homology with the sequences of E. coli Dps and Treponema pallidum antigen TpF1 or 4D, proteins which associate to form ordered rings. An isogenic pilin mutant, H. ducreyi 35000ftpA::mTn3(Cm), was constructed by shuttle mutagenesis and did not contain pili when examined by electron microscopy. We conclude that H. ducreyi synthesizes fine, tangled pili that are composed of a unique major subunit, which may be exported by a signal sequence independent mechanism. PMID:8550517
Implication of the cause of differences in 3D structures of proteins with high sequence identity based on analyses of amino acid sequences and 3D structures.

PubMed

Matsuoka, Masanari; Sugita, Masatake; Kikuchi, Takeshi

2014-09-18

Proteins that share a high sequence homology while exhibiting drastically different 3D structures are investigated in this study. Recently, artificial proteins related to the sequences of the GA and IgG binding GB domains of human serum albumin have been designed. These artificial proteins, referred to as GA and GB, share 98% amino acid sequence identity but exhibit different 3D structures, namely, a 3α bundle versus a 4β + α structure. Discriminating between their 3D structures based on their amino acid sequences is a very difficult problem. In the present work, in addition to using bioinformatics techniques, an analysis based on inter-residue average distance statistics is used to address this problem. It was hard to distinguish which structure a given sequence would take only with the results of ordinary analyses like BLAST and conservation analyses. However, in addition to these analyses, with the analysis based on the inter-residue average distance statistics and our sequence tendency analysis, we could infer which part would play an important role in its structural formation. The results suggest possible determinants of the different 3D structures for sequences with high sequence identity. The possibility of discriminating between the 3D structures based on the given sequences is also discussed.
PH dependent adhesive peptides

DOEpatents

Tomich, John; Iwamoto, Takeo; Shen, Xinchun; Sun, Xiuzhi Susan

2010-06-29

A novel peptide adhesive motif is described that requires no receptor or cross-links to achieve maximal adhesive strength. Several peptides with different degrees of adhesive strength have been designed and synthesized using solid phase chemistries. All peptides contain a common hydrophobic core sequence flanked by positively or negatively charged amino acids sequences.
Identification and characterization of a NBS–LRR class resistance gene analog in Pistacia atlantica subsp. Kurdica

PubMed Central

Bahramnejad, Bahman

2014-01-01

P. atlantica subsp. Kurdica, with the local name of Baneh, is a wild medicinal plant which grows in Kurdistan, Iran. The identification of resistance gene analogs holds great promise for the development of resistant cultivars. A PCR approach with degenerate primers designed according to conserved NBS-LRR (nucleotide binding site-leucine rich repeat) regions of known disease-resistance (R) genes was used to amplify and clone homologous sequences from P. atlantica subsp. Kurdica. A DNA fragment of the expected 500-bp size was amplified. The nucleotide sequence of this amplicon was obtained through sequencing and the predicted amino acid sequence compared to the amino acid sequences of known R-genes revealed significant sequence similarity. Alignment of the deduced amino acid sequence of P. atlantica subsp. Kurdica resistance gene analog (RGA) showed strong identity, ranging from 68% to 77%, to the non-toll interleukin receptor (non-TIR) R-gene subfamily from other plants. A P-loop motif (GMMGGEGKTT), a conserved and hydrophobic motif GLPLAL, a kinase-2a motif (LLVLDDV), when replaced by IAVFDDI in PAKRGA1 and a kinase-3a (FGPGSRIII) were presented in all RGA. A phylogenetic tree, based on the deduced amino-acid sequences of PAKRGA1 and RGAs from different species indicated that they were separated in two clusters, PAKRGA1 being on cluster II. The isolated NBS analogs can be eventually used as guidelines to isolate numerous R-genes in Pistachio. PMID:27843981
Isolation, Cloning, and Expression of an Acid Phosphatase Containing Phosphotyrosyl Phosphatase Activity from Prevotella intermedia

PubMed Central

Chen, Xiaochi; Ansai, Toshihiro; Awano, Shuji; Iida, Toshiya; Barik, Sailen; Takehara, Tadamichi

1999-01-01

A novel acid phosphatase containing phosphotyrosyl phosphatase (PTPase) activity, designated PiACP, from Prevotella intermedia ATCC 25611, an anaerobe implicated in progressive periodontal disease, has been purified and characterized. PiACP, a monomer with an apparent molecular mass of 30 kDa, did not require divalent metal cations for activity and was sensitive to orthovanadate but highly resistant to okadaic acid. The enzyme exhibited substantial activity against tyrosine phosphate-containing peptides derived from the epidermal growth factor receptor. On the basis of N-terminal and internal amino acid sequences of purified PiACP, the gene coding for PiACP was isolated and sequenced. The PiACP gene consisted of 792 bp and coded for a basic protein with an Mr of 29,164. The deduced amino acid sequence exhibited striking similarity (25 to 64%) to those of members of class A bacterial acid phosphatases, including PhoC of Morganella morganii, and involved a conserved phosphatase sequence motif that is shared among several lipid phosphatases and the mammalian glucose-6-phosphatases. The highly conservative motif HCXAGXXR in the active domain of PTPase was not found in PiACP. Mutagenesis of recombinant PiACP showed that His-170 and His-209 were essential for activity. Thus, the class A bacterial acid phosphatases including PiACP may function as atypical PTPases, the biological functions of which remain to be determined. PMID:10559178
Computational Tools and Algorithms for Designing Customized Synthetic Genes

PubMed Central

Gould, Nathan; Hendy, Oliver; Papamichail, Dimitris

2014-01-01

Advances in DNA synthesis have enabled the construction of artificial genes, gene circuits, and genomes of bacterial scale. Freedom in de novo design of synthetic constructs provides significant power in studying the impact of mutations in sequence features, and verifying hypotheses on the functional information that is encoded in nucleic and amino acids. To aid this goal, a large number of software tools of variable sophistication have been implemented, enabling the design of synthetic genes for sequence optimization based on rationally defined properties. The first generation of tools dealt predominantly with singular objectives such as codon usage optimization and unique restriction site incorporation. Recent years have seen the emergence of sequence design tools that aim to evolve sequences toward combinations of objectives. The design of optimal protein-coding sequences adhering to multiple objectives is computationally hard, and most tools rely on heuristics to sample the vast sequence design space. In this review, we study some of the algorithmic issues behind gene optimization and the approaches that different tools have adopted to redesign genes and optimize desired coding features. We utilize test cases to demonstrate the efficiency of each approach, as well as identify their strengths and limitations. PMID:25340050
Synthetic oligonucleotide probes deduced from amino acid sequence data. Theoretical and practical considerations.

PubMed

Lathe, R

1985-05-05

Synthetic probes deduced from amino acid sequence data are widely used to detect cognate coding sequences in libraries of cloned DNA segments. The redundancy of the genetic code dictates that a choice must be made between (1) a mixture of probes reflecting all codon combinations, and (2) a single longer "optimal" probe. The second strategy is examined in detail. The frequency of sequences matching a given probe by chance alone can be determined and also the frequency of sequences closely resembling the probe and contributing to the hybridization background. Gene banks cannot be treated as random associations of the four nucleotides, and probe sequences deduced from amino acid sequence data occur more often than predicted by chance alone. Probe lengths must be increased to confer the necessary specificity. Examination of hybrids formed between unique homologous probes and their cognate targets reveals that short stretches of perfect homology occurring by chance make a significant contribution to the hybridization background. Statistical methods for improving homology are examined, taking human coding sequences as an example, and considerations of codon utilization and dinucleotide frequencies yield an overall homology of greater than 82%. Recommendations for probe design and hybridization are presented, and the choice between using multiple probes reflecting all codon possibilities and a unique optimal probe is discussed.
Designing pH induced fold switch in proteins

NASA Astrophysics Data System (ADS)

Baruah, Anupaul; Biswas, Parbati

2015-05-01

This work investigates the computational design of a pH induced protein fold switch based on a self-consistent mean-field approach by identifying the ensemble averaged characteristics of sequences that encode a fold switch. The primary challenge to balance the alternative sets of interactions present in both target structures is overcome by simultaneously optimizing two foldability criteria corresponding to two target structures. The change in pH is modeled by altering the residual charge on the amino acids. The energy landscape of the fold switch protein is found to be double funneled. The fold switch sequences stabilize the interactions of the sites with similar relative surface accessibility in both target structures. Fold switch sequences have low sequence complexity and hence lower sequence entropy. The pH induced fold switch is mediated by attractive electrostatic interactions rather than hydrophobic-hydrophobic contacts. This study may provide valuable insights to the design of fold switch proteins.
Nucleic acid sequence detection using multiplexed oligonucleotide PCR

DOEpatents

Nolan, John P [Santa Fe, NM; White, P Scott [Los Alamos, NM

2006-12-26

Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.
Consolidation of glycosyl hydrolase family 30 : a dual domain 4/7 hydrolase family consisting of two structurally distinct groups

Treesearch

Franz J. St John; Javier M. Gonzalez; Edwin Pozharski

2010-01-01

In this work glycosyl hydrolase (GH) family 30 (GH30) is analyzed and shown to consist of its currently classified member sequences as well as several homologous sequence groups currently assigned within family GH5. A large scale amino acid sequence alignment and a phylogenetic tree were generated and GH30 groups and subgroups were designated. A partial rearrangement...
Fast computational methods for predicting protein structure from primary amino acid sequence

DOEpatents

Agarwal, Pratul Kumar [Knoxville, TN

2011-07-19

The present invention provides a method utilizing primary amino acid sequence of a protein, energy minimization, molecular dynamics and protein vibrational modes to predict three-dimensional structure of a protein. The present invention also determines possible intermediates in the protein folding pathway. The present invention has important applications to the design of novel drugs as well as protein engineering. The present invention predicts the three-dimensional structure of a protein independent of size of the protein, overcoming a significant limitation in the prior art.
Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1998-03-24

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.
Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1998-01-01

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.
Method for identifying and quantifying nucleic acid sequence aberrations

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1998-01-01

A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.
Method for identifying and quantifying nucleic acid sequence aberrations

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1998-07-21

A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.
A frequency-based linguistic approach to protein decoding and design: Simple concepts, diverse applications, and the SCS Package

PubMed Central

Motomura, Kenta; Nakamura, Morikazu; Otaki, Joji M.

2013-01-01

Protein structure and function information is coded in amino acid sequences. However, the relationship between primary sequences and three-dimensional structures and functions remains enigmatic. Our approach to this fundamental biochemistry problem is based on the frequencies of short constituent sequences (SCSs) or words. A protein amino acid sequence is considered analogous to an English sentence, where SCSs are equivalent to words. Availability scores, which are defined as real SCS frequencies in the non-redundant amino acid database relative to their probabilistically expected frequencies, demonstrate the biological usage bias of SCSs. As a result, this frequency-based linguistic approach is expected to have diverse applications, such as secondary structure specifications by structure-specific SCSs and immunological adjuvants with rare or non-existent SCSs. Linguistic similarities (e.g., wide ranges of scale-free distributions) and dissimilarities (e.g., behaviors of low-rank samples) between proteins and the natural English language have been revealed in the rank-frequency relationships of SCSs or words. We have developed a web server, the SCS Package, which contains five applications for analyzing protein sequences based on the linguistic concept. These tools have the potential to assist researchers in deciphering structurally and functionally important protein sites, species-specific sequences, and functional relationships between SCSs. The SCS Package also provides researchers with a tool to construct amino acid sequences de novo based on the idiomatic usage of SCSs. PMID:24688703

A frequency-based linguistic approach to protein decoding and design: Simple concepts, diverse applications, and the SCS Package.

PubMed

Motomura, Kenta; Nakamura, Morikazu; Otaki, Joji M

2013-01-01

Protein structure and function information is coded in amino acid sequences. However, the relationship between primary sequences and three-dimensional structures and functions remains enigmatic. Our approach to this fundamental biochemistry problem is based on the frequencies of short constituent sequences (SCSs) or words. A protein amino acid sequence is considered analogous to an English sentence, where SCSs are equivalent to words. Availability scores, which are defined as real SCS frequencies in the non-redundant amino acid database relative to their probabilistically expected frequencies, demonstrate the biological usage bias of SCSs. As a result, this frequency-based linguistic approach is expected to have diverse applications, such as secondary structure specifications by structure-specific SCSs and immunological adjuvants with rare or non-existent SCSs. Linguistic similarities (e.g., wide ranges of scale-free distributions) and dissimilarities (e.g., behaviors of low-rank samples) between proteins and the natural English language have been revealed in the rank-frequency relationships of SCSs or words. We have developed a web server, the SCS Package, which contains five applications for analyzing protein sequences based on the linguistic concept. These tools have the potential to assist researchers in deciphering structurally and functionally important protein sites, species-specific sequences, and functional relationships between SCSs. The SCS Package also provides researchers with a tool to construct amino acid sequences de novo based on the idiomatic usage of SCSs.
Gene Composer: database software for protein construct design, codon engineering, and gene synthesis

PubMed Central

Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

2009-01-01

Background To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. Results An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. Conclusion We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease error correction in combination with PIPE cloning. In a sister manuscript we present data on how Gene Composer designed genes and protein constructs can result in improved protein production for structural studies. PMID:19383142
Gene composer: database software for protein construct design, codon engineering, and gene synthesis.

PubMed

Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

2009-04-21

To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease error correction in combination with PIPE cloning. In a sister manuscript we present data on how Gene Composer designed genes and protein constructs can result in improved protein production for structural studies.
Efficient production of artificially designed gelatins with a Bacillus brevis system.

PubMed

Kajino, T; Takahashi, H; Hirai, M; Yamada, Y

2000-01-01

Artificially designed gelatins comprising tandemly repeated 30-amino-acid peptide units derived from human alphaI collagen were successfully produced with a Bacillus brevis system. The DNA encoding the peptide unit was synthesized by taking into consideration the codon usage of the host cells, but no clones having a tandemly repeated gene were obtained through the above-mentioned strategy. Minirepeat genes could be selected in vivo from a mixture of every possible sequence encoding an artificial gelatin by randomly ligating the mixed sequence unit and transforming it into Escherichia coli. Larger repeat genes constructed by connecting minirepeat genes obtained by in vivo selection were also stable in the expression host cells. Gelatins derived from the eight-unit and six-unit repeat genes were extracellularly produced at the level of 0.5 g/liter and easily purified by ammonium sulfate fractionation and anion-exchange chromatography. The purified artificial gelatins had the predicted N-terminal sequences and amino acid compositions and a solgel property similar to that of the native gelatin. These results suggest that the selection of a repeat unit sequence stable in an expression host is a shortcut for the efficient production of repetitive proteins and that it can conveniently be achieved by the in vivo selection method. This study revealed the possible industrial application of artificially designed repetitive proteins.
Generate Optimized Genetic Rhythm for Enzyme Expression in Non-native systems

DOE Office of Scientific and Technical Information (OSTI.GOV)

2016-11-03

Most amino acids are represented by more than one codon, resulting in redundancy in the genetic code. Silent codon substitutions that do not alter the amino acid sequence still have an effect on protein expression. We have developed an algorithm, GoGREEN, to enhance the expression of foreign proteins in a host organism. GoGREEN selects codons according to frequency patterns seen in the gene of interest using the codon usage table from the host organism. GoGREEN is also designed to accommodate gaps in the sequence.This software takes for input (1) the aligned protein sequences for genes the user wishes to express,more » (2) the codon usage table for the host organism, (3) and the DNA sequence for the target protein found in the host organism. The program will select codons based on codon usage patterns for the target DNA sequence. The program will also select codons for “gaps” found in the aligned protein sequences using the codon usage table from the host organism.« less
Rhodotorula svalbardensis sp. nov., a novel yeast species isolated from cryoconite holes of Ny-Ålesund, Arctic.

PubMed

Singh, Purnima; Singh, Shiv M; Tsuji, Masaharu; Prasad, Gandham S; Hoshino, Tamotsu

2014-02-01

A psychrophilic yeast species was isolated from glacier cryoconite holes of Svalbard. Nucleotide sequences of the strains were studied using D1/D2 domain, ITS region and partial sequences of mitochondrial cytochrome b gene. The strains belonged to a clade of psychrophilic yeasts, but showed marked differences from related species in the D1/D2 domain and biochemical characters. Effects of temperature, salt and media on growth of the cultures were also studied. Screening of the cultures for amylase, cellulase, protease, lipase, urease and catalase activities was carried out. The strains expressed high amylase and lipase activities. Freeze tolerance ability of the isolates indicated the formation of unique hexagonal ice crystal structures due to presence of 'antifreeze proteins' (AFPs). FAME analysis of cultures showed a unique trend of increase in unsaturated fatty acids with decrease in temperature. The major fatty acids recorded were oleic acid, linoleic acid, linolenic acid, palmitic acid, stearic acid, myristic acid and pentadecanoic acid. Based on sequence data and, physiological and morphological properties of the strains, we propose a novel species, Rhodotorula svalbardensis and designate strains MLB-I (CCP-II) and CRY-YB-1 (CBS 12863, JCM 19699, JCM 19700, MTCC 10952) as its type strains (Etymology: sval.bar.den'sis. N.L. fem. adj. svalbardensis pertaining to Svalbard). Copyright © 2014 Elsevier Inc. All rights reserved.
From a marine neuropeptide to antimicrobial pseudopeptides containing aza-β(3)-amino acids: structure and activity

PubMed Central

Laurencin, Mathieu; Legrand, Baptiste; Duval, Emilie; Henry, Joël; Baudy-Floc'H, Michèle; Zatylny-Gaudin, Céline; Bondon, Arnaud

2012-01-01

Incorporation of aza-β3-amino acids into endogenous neuropeptide from mollusks (ALSGDAFLRF-NH2) with weak antimicrobial activities allows us to design new AMPs sequences. We find that, depending on the nature of the substitution, these could result either in inactive pseudopeptides or in a drastic enhancement of the antimicrobial activity without high cytotoxicity resulted. Structural studies perform by NMR and circular dichroism on the pseudopeptides show the impact of aza-β3-amino acids on the peptide structures. We obtain the first three-dimensional structures of pseudopeptides containing aza-β3-amino acids in aqueous micellar SDS and demonstrate that hydrazino turn can be formed in aqueous solution. Overall, these results demonstrate the ability to modulate AMPs activities through structural modifications induced by the nature and the position of these amino acid analogs in the peptide sequences. PMID:22320306
Species-specific identification of commercial probiotic strains.

PubMed

Yeung, P S M; Sanders, M E; Kitts, C L; Cano, R; Tong, P S

2002-05-01

Products containing probiotic bacteria are gaining popularity, increasing the importance of their accurate speciation. Unfortunately, studies have suggested that improper labeling of probiotic species is common in commercial products. Species identification of a bank of commercial probiotic strains was attempted using partial 16S rDNA sequencing, carbohydrate fermentation analysis, and cellular fatty acid methyl ester analysis. Results from partial 16S rDNA sequencing indicated discrepancies between species designations for 26 out of 58 strains tested, including two ATCC Lactobacillus strains. When considering only the commercial strains obtained directly from the manufacturers, 14 of 29 strains carried species designations different from those obtained by partial 16S rDNA sequencing. Strains from six commercial products were species not listed on the label. The discrepancies mainly occurred in Lactobacillus acidophilus and Lactobacillus casei groups. Carbohydrate fermentation analysis was not sensitive enough to identify species within the L. acidophilus group. Fatty acid methyl ester analysis was found to be variable and inaccurate and is not recommended to identify probiotic lactobacilli.
Key Aspects of Nucleic Acid Library Design for in Vitro Selection

PubMed Central

Vorobyeva, Maria A.; Davydova, Anna S.; Vorobjev, Pavel E.; Pyshnyi, Dmitrii V.; Venyaminova, Alya G.

2018-01-01

Nucleic acid aptamers capable of selectively recognizing their target molecules have nowadays been established as powerful and tunable tools for biospecific applications, be it therapeutics, drug delivery systems or biosensors. It is now generally acknowledged that in vitro selection enables one to generate aptamers to almost any target of interest. However, the success of selection and the affinity of the resulting aptamers depend to a large extent on the nature and design of an initial random nucleic acid library. In this review, we summarize and discuss the most important features of the design of nucleic acid libraries for in vitro selection such as the nature of the library (DNA, RNA or modified nucleotides), the length of a randomized region and the presence of fixed sequences. We also compare and contrast different randomization strategies and consider computer methods of library design and some other aspects. PMID:29401748
Method for isolating chromosomal DNA in preparation for hybridization in suspension

DOEpatents

Lucas, Joe N.

2000-01-01

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. Chromosomal DNA in a sample containing cell debris is prepared for hybridization in suspension by treating the mixture with RNase. The treated DNA can also be fixed prior to hybridization.
ProForma: A Standard Proteoform Notation

DOE Office of Scientific and Technical Information (OSTI.GOV)

LeDuc, Richard D.; Schwämmle, Veit; Shortreed, Michael R.

The Consortium for Top-Down Proteomics (CTDP) proposes a standardized notation, ProForma, for writing the sequence of fully characterized proteoforms. ProForma provides a means to communicate any proteoform by writing the amino acid sequence using standard one-letter notation and specifying modifications or unidentified mass shifts within brackets following certain amino acids. The notation is unambiguous, human readable, and can easily be parsed and written by bioinformatic tools. This system uses seven rules and supports a wide range of possible use cases, ensuring compatibility and reproducibility of proteoform annotations. Standardizing proteoform sequences will simplify storage, comparison, and reanalysis of proteomic studies, andmore » the Consortium welcomes input and contributions from the research community on the continued design and maintenance of this standard.« less
Protein binding hot spots prediction from sequence only by a new ensemble learning method.

PubMed

Hu, Shan-Shan; Chen, Peng; Wang, Bing; Li, Jinyan

2017-10-01

Hot spots are interfacial core areas of binding proteins, which have been applied as targets in drug design. Experimental methods are costly in both time and expense to locate hot spot areas. Recently, in-silicon computational methods have been widely used for hot spot prediction through sequence or structure characterization. As the structural information of proteins is not always solved, and thus hot spot identification from amino acid sequences only is more useful for real-life applications. This work proposes a new sequence-based model that combines physicochemical features with the relative accessible surface area of amino acid sequences for hot spot prediction. The model consists of 83 classifiers involving the IBk (Instance-based k means) algorithm, where instances are encoded by important properties extracted from a total of 544 properties in the AAindex1 (Amino Acid Index) database. Then top-performance classifiers are selected to form an ensemble by a majority voting technique. The ensemble classifier outperforms the state-of-the-art computational methods, yielding an F1 score of 0.80 on the benchmark binding interface database (BID) test set. http://www2.ahu.edu.cn/pchen/web/HotspotEC.htm .
A computational framework to empower probabilistic protein design

PubMed Central

Fromer, Menachem; Yanover, Chen

2008-01-01

Motivation: The task of engineering a protein to perform a target biological function is known as protein design. A commonly used paradigm casts this functional design problem as a structural one, assuming a fixed backbone. In probabilistic protein design, positional amino acid probabilities are used to create a random library of sequences to be simultaneously screened for biological activity. Clearly, certain choices of probability distributions will be more successful in yielding functional sequences. However, since the number of sequences is exponential in protein length, computational optimization of the distribution is difficult. Results: In this paper, we develop a computational framework for probabilistic protein design following the structural paradigm. We formulate the distribution of sequences for a structure using the Boltzmann distribution over their free energies. The corresponding probabilistic graphical model is constructed, and we apply belief propagation (BP) to calculate marginal amino acid probabilities. We test this method on a large structural dataset and demonstrate the superiority of BP over previous methods. Nevertheless, since the results obtained by BP are far from optimal, we thoroughly assess the paradigm using high-quality experimental data. We demonstrate that, for small scale sub-problems, BP attains identical results to those produced by exact inference on the paradigmatic model. However, quantitative analysis shows that the distributions predicted significantly differ from the experimental data. These findings, along with the excellent performance we observed using BP on the smaller problems, suggest potential shortcomings of the paradigm. We conclude with a discussion of how it may be improved in the future. Contact: fromer@cs.huji.ac.il PMID:18586717
Use of conserved key amino acid positions to morph protein folds.

PubMed

Reddy, Boojala V B; Li, Wilfred W; Bourne, Philip E

2002-07-15

By using three-dimensional (3D) structure alignments and a previously published method to determine Conserved Key Amino Acid Positions (CKAAPs) we propose a theoretical method to design mutations that can be used to morph the protein folds. The original Paracelsus challenge, met by several groups, called for the engineering of a stable but different structure by modifying less than 50% of the amino acid residues. We have used the sequences from the Protein Data Bank (PDB) identifiers 1ROP, and 2CRO, which were previously used in the Paracelsus challenge by those groups, and suggest mutation to CKAAPs to morph the protein fold. The total number of mutations suggested is less than 40% of the starting sequence theoretically improving the challenge results. From secondary structure prediction experiments of the proposed mutant sequence structures, we observe that each of the suggested mutant protein sequences likely folds to a different, non-native potentially stable target structure. These results are an early indicator that analyses using structure alignments leading to CKAAPs of a given structure are of value in protein engineering experiments. Copyright 2002 Wiley Periodicals, Inc.
A retrotransposable element from the mosquito Anopheles gambiae .

PubMed Central

Besansky, N J

1990-01-01

A family of middle repetitive elements from the African malaria vector Anopheles gambiae is described. Approximately 100 copies of the element, designated T1Ag, are dispersed in the genome. Full-length elements are 4.6 kilobase pairs in length, but truncation of the 5' end is common. Nucleotide sequences of one full-length, two 5'-truncated, and two 5' ends of T1Ag elements were determined and aligned to define a consensus sequence. Sequence analysis revealed two long, overlapping open reading frames followed by a polyadenylation signal, AATAAA, and a tail consisting of tandem repetitions of the motif TGAAA. No direct or inverted long terminal repeats (LTRs) were detected. The first open reading frame, 442 amino acids in length, includes a domain resembling that of nucleic acid-binding proteins. The second open reading frame, 975 amino acids long, resembles the reverse transcriptases of a category of retrotransposable elements without LTRs, variously termed class II retrotransposons, class III elements or non-LTR retrotransposons. Similarity at the sequence and structural levels places T1Ag in this category. Images PMID:1689457
How to Tackle the Challenge of siRNA Delivery with Sequence-Defined Oligoamino Amides.

PubMed

Reinhard, Sören; Wagner, Ernst

2017-01-01

RNA interference (RNAi) as a mechanism of gene regulation provides exciting opportunities for medical applications. Synthetic small interfering RNA (siRNA) triggers the knockdown of complementary mRNA sequences in a catalytic fashion and has to be delivered into the cytosol of the targeted cells. The design of adequate carrier systems to overcome multiple extracellular and intracellular roadblocks within the delivery process has utmost importance. Cationic polymers form polyplexes through electrostatic interaction with negatively charged nucleic acids and present a promising class of carriers. Issues of polycations regarding toxicity, heterogeneity, and polydispersity can be overcome by solid-phase-assisted synthesis of sequence-defined cationic oligomers. These medium-sized highly versatile nucleic acid carriers display low cytotoxicity and can be modified and tailored in multiple ways to meet specific requirements of nucleic acid binding, polyplex size, shielding, targeting, and intracellular release of the cargo. In this way, sequence-defined cationic oligomers can mimic the dynamic and bioresponsive behavior of viruses. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Isolation, cDNA cloning and gene expression of an antibacterial protein from larvae of the coconut rhinoceros beetle, Oryctes rhinoceros.

PubMed

Yang, J; Yamamoto, M; Ishibashi, J; Taniai, K; Yamakawa, M

1998-08-01

An antibacterial protein, designated rhinocerosin, was purified to homogeneity from larvae of the coconut rhinoceros beetle, Oryctes rhinoceros immunized with Escherichia coli. Based on the amino acid sequence of the N-terminal region, a degenerate primer was synthesized and reverse-transcriptase PCR was performed to clone rhinocerosin cDNA. As a result, a 279-bp fragment was obtained. The complete nucleotide sequence was determined by sequencing the extended rhinocerosin cDNA clone by 5' rapid amplification of cDNA ends. The deduced amino acid sequence of the mature portion of rhinocerosin was composed of 72 amino acids without cystein residues and was shown to be rich in glycine (11.1%) and proline (11.1%) residues. Comparison of the deduced amino acid sequence of rhinocerosin with those of other antibacterial proteins indicated that it has 77.8% and 44.6% identity with holotricin 2 and coleoptrecin, respectively. Rhinocerosin had strong antibacterial activity against E. coli, Streptococcus pyogenes, Staphylococcus aureus but not against Pseudomonas aeruginosa. Results of reverse-transcriptase PCR analysis of gene expression in different tissues indicated that the rhinocerosin gene is strongly expressed in the fat body and the Malpighian tubule, and weakly expressed in hemocytes and midgut. In addition, gene expression was inducible by bacteria in the fat body, the Malpighian tubule and hemocyte but constitutive expression was observed in the midgut.
Sequence repeats and protein structure

NASA Astrophysics Data System (ADS)

Hoang, Trinh X.; Trovato, Antonio; Seno, Flavio; Banavar, Jayanth R.; Maritan, Amos

2012-11-01

Repeats are frequently found in known protein sequences. The level of sequence conservation in tandem repeats correlates with their propensities to be intrinsically disordered. We employ a coarse-grained model of a protein with a two-letter amino acid alphabet, hydrophobic (H) and polar (P), to examine the sequence-structure relationship in the realm of repeated sequences. A fraction of repeated sequences comprises a distinct class of bad folders, whose folding temperatures are much lower than those of random sequences. Imperfection in sequence repetition improves the folding properties of the bad folders while deteriorating those of the good folders. Our results may explain why nature has utilized repeated sequences for their versatility and especially to design functional proteins that are intrinsically unstructured at physiological temperatures.
Array-Based Rational Design of Short Peptide Probe-Derived from an Anti-TNT Monoclonal Antibody.

PubMed

Okochi, Mina; Muto, Masaki; Yanai, Kentaro; Tanaka, Masayoshi; Onodera, Takeshi; Wang, Jin; Ueda, Hiroshi; Toko, Kiyoshi

2017-10-09

Complementarity-determining regions (CDRs) are sites on the variable chains of antibodies responsible for binding to specific antigens. In this study, a short peptide probe for recognition of 2,4,6-trinitrotoluene (TNT), was identified by testing sequences derived from the CDRs of an anti-TNT monoclonal antibody. The major TNT-binding site in this antibody was identified in the heavy chain CDR3 by antigen docking simulation and confirmed by an immunoassay using a spot-synthesis based peptide array comprising amino acid sequences of six CDRs in the variable region. A peptide derived from heavy chain CDR3 (RGYSSFIYWF) bound to TNT with a dissociation constant of 1.3 μM measured by surface plasmon resonance. Substitution of selected amino acids with basic residues increased TNT binding while substitution with acidic amino acids decreased affinity, an isoleucine to arginine change showed the greatest improvement of 1.8-fold. The ability to create simple peptide binders of volatile organic compounds from sequence information provided by the immune system in the creation of an immune response will be beneficial for sensor developments in the future.
Pseudopropionibacterium sp. nov., a novel red-pigmented species isolated from human gingival sulcus.

PubMed

Saito, Masanori; Shinozaki-Kuwahara, Noriko; Tsudukibashi, Osamu; Hashizume-Takizawa, Tomomi; Kobayashi, Ryoki; Kurita-Ochiai, Tomoko

2018-04-24

Strain SK-1 T is a novel Gram stain-positive, pleomorphic, rod-shaped, non-spore forming, and non-motile organism, designated SK-1 T , isolated from human gingival sulcus that produces acetic acid, propionic acid, lactic acid, and succinic acid as end products of glucose fermentation. Strain SK-1 T had the closest relatedness to Pseudopropionibacterium (Propionibacterium) propionicum with sequence homologies of the 16S rRNA and RNA polymerase β subunit (rpoB) genes of 96.6% and 93.1%, respectively. The genomic DNA G + C content of the isolate was 61.8 mol%. Based on the sequence data of the 16S rRNA and housekeeping (rpoB) genes, we propose a novel taxon, Pseudopropionibacterium rubrum sp. nov. (type strain SK-1 T = JCM 31317T= DSM 100122T). The 16S rRNA and rpoB gene sequences of strain SK-1 T were deposited to the DNA Data Bank of Japan under the accession numbers LC002971 and LC102236, respectively. © 2018 The Societies and John Wiley & Sons Australia, Ltd.

Guiding principles for peptide nanotechnology through directed discovery.

PubMed

Lampel, A; Ulijn, R V; Tuttle, T

2018-05-21

Life's diverse molecular functions are largely based on only a small number of highly conserved building blocks - the twenty canonical amino acids. These building blocks are chemically simple, but when they are organized in three-dimensional structures of tremendous complexity, new properties emerge. This review explores recent efforts in the directed discovery of functional nanoscale systems and materials based on these same amino acids, but that are not guided by copying or editing biological systems. The review summarises insights obtained using three complementary approaches of searching the sequence space to explore sequence-structure relationships for assembly, reactivity and complexation, namely: (i) strategic editing of short peptide sequences; (ii) computational approaches to predicting and comparing assembly behaviours; (iii) dynamic peptide libraries that explore the free energy landscape. These approaches give rise to guiding principles on controlling order/disorder, complexation and reactivity by peptide sequence design.
Bioinformatic Analysis of the Contribution of Primer Sequences to Aptamer Structures

PubMed Central

Ellington, Andrew D.

2009-01-01

Aptamers are nucleic acid molecules selected in vitro to bind a particular ligand. While numerous experimental studies have examined the sequences, structures, and functions of individual aptamers, considerably fewer studies have applied bioinformatics approaches to try to infer more general principles from these individual studies. We have used a large Aptamer Database to parse the contributions of both random and constant regions to the secondary structures of more than 2000 aptamers. We find that the constant, primer-binding regions do not, in general, contribute significantly to aptamer structures. These results suggest that (a) binding function is not contributed to nor constrained by constant regions; (b) in consequence, the landscape of functional binding sequences is sparse but robust, favoring scenarios for short, functional nucleic acid sequences near origins; and (c) many pool designs for the selection of aptamers are likely to prove robust. PMID:18594898
Design of Cyclic Peptide Based Glucose Receptors and Their Application in Glucose Sensing.

PubMed

Li, Chao; Chen, Xin; Zhang, Fuyuan; He, Xingxing; Fang, Guozhen; Liu, Jifeng; Wang, Shuo

2017-10-03

Glucose assay is of great scientific significance in clinical diagnostics and bioprocess monitoring, and to design a new glucose receptor is necessary for the development of more sensitive, selective, and robust glucose detection techniques. Herein, a series of cyclic peptide (CP) glucose receptors were designed to mimic the binding sites of glucose binding protein (GBP), and CPs' sequence contained amino acid sites Asp, Asn, His, Asp, and Arg, which constituted the first layer interactions of GBP. The properties of these CPs used as a glucose receptor or substitute for the GBP were studied by using a quartz crystal microbalance (QCM) technique. It was found that CPs can form a self-assembled monolayer at the Au quartz electrode surface, and the monolayer's properties were characterized by using cyclic voltammetry, electrochemical impedance spectroscopy, and atomic force microscopy. The CPs' binding affinity to saccharide (i.e., galactose, fructose, lactose, sucrose, and maltose) was investigated, and the CPs' sensitivity and selectivity toward glucose were found to be dependent upon the configuration,i.e., the amino acids sequence of the CPs. The cyclic unit with a cyclo[-CNDNHCRDNDC-] sequence gave the highest selectivity and sensitivity for glucose sensing. This work suggests that a synthetic peptide bearing a particular functional sequence could be applied for developing a new generation of glucose receptors and would find huge application in biological, life science, and clinical diagnostics fields.
A de novo redesign of the WW domain

PubMed Central

Kraemer-Pecore, Christina M.; Lecomte, Juliette T.J.; Desjarlais, John R.

2003-01-01

We have used a sequence prediction algorithm and a novel sampling method to design protein sequences for the WW domain, a small β-sheet motif. The procedure, referred to as SPANS, designs sequences to be compatible with an ensemble of closely related polypeptide backbones, mimicking the inherent flexibility of proteins. Two designed sequences (termed SPANS-WW1 and SPANS-WW2), using only naturally occurring l-amino acids, were selected for study and the corresponding polypeptides were prepared in Escherichia coli. Circular dichroism data suggested that both purified polypeptides adopted secondary structure features related to those of the target without the aid of disulfide bridges or bound cofactors. The structure exhibited by SPANS-WW2 melted cooperatively by raising the temperature of the solution. Further analysis of this polypeptide by proton nuclear magnetic resonance spectroscopy demonstrated that at 5°C, it folds into a structure closely resembling a natural WW domain. This achievement constitutes one of a small number of successful de novo protein designs through fully automated computational methods and highlights the feasibility of including backbone flexibility in the design strategy. PMID:14500877
A de novo redesign of the WW domain.

PubMed

Kraemer-Pecore, Christina M; Lecomte, Juliette T J; Desjarlais, John R

2003-10-01

We have used a sequence prediction algorithm and a novel sampling method to design protein sequences for the WW domain, a small beta-sheet motif. The procedure, referred to as SPANS, designs sequences to be compatible with an ensemble of closely related polypeptide backbones, mimicking the inherent flexibility of proteins. Two designed sequences (termed SPANS-WW1 and SPANS-WW2), using only naturally occurring L-amino acids, were selected for study and the corresponding polypeptides were prepared in Escherichia coli. Circular dichroism data suggested that both purified polypeptides adopted secondary structure features related to those of the target without the aid of disulfide bridges or bound cofactors. The structure exhibited by SPANS-WW2 melted cooperatively by raising the temperature of the solution. Further analysis of this polypeptide by proton nuclear magnetic resonance spectroscopy demonstrated that at 5 degrees C, it folds into a structure closely resembling a natural WW domain. This achievement constitutes one of a small number of successful de novo protein designs through fully automated computational methods and highlights the feasibility of including backbone flexibility in the design strategy.
Sequence-specific unusual (1-->2)-type helical turns in alpha/beta-hybrid peptides.

PubMed

Prabhakaran, Panchami; Kale, Sangram S; Puranik, Vedavati G; Rajamohanan, P R; Chetina, Olga; Howard, Judith A K; Hofmann, Hans-Jörg; Sanjayan, Gangadhar J

2008-12-31

This article describes novel conformationally ordered alpha/beta-hybrid peptides consisting of repeating l-proline-anthranilic acid building blocks. These oligomers adopt a compact, right-handed helical architecture determined by the intrinsic conformational preferences of the individual amino acid residues. The striking feature of these oligomers is their ability to display an unusual periodic pseudo beta-turn network of nine-membered hydrogen-bonded rings formed in the forward direction of the sequence by 1-->2 amino acid interactions both in solid-state and in solution. Conformational investigations of several of these oligomers by single-crystal X-ray diffraction, solution-state NMR, and ab initio MO theory suggest that the characteristic steric and dihedral angle restraints exerted by proline are essential for stabilizing the unusual pseudo beta-turn network found in these oligomers. Replacing proline by the conformationally flexible analogue alanine (Ala) or by the conformationally more constrained alpha-amino isobutyric acid (Aib) had an adverse effect on the stabilization of this structural architecture. These findings increase the potential to design novel secondary structure elements profiting from the steric and dihedral angle constraints of the amino acid constituents and help to augment the conformational space available for synthetic oligomer design with diverse backbone structures.
ORENZA: a web resource for studying ORphan ENZyme activities

PubMed Central

Lespinet, Olivier; Labedan, Bernard

2006-01-01

Background Despite the current availability of several hundreds of thousands of amino acid sequences, more than 36% of the enzyme activities (EC numbers) defined by the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB) are not associated with any amino acid sequence in major public databases. This wide gap separating knowledge of biochemical function and sequence information is found for nearly all classes of enzymes. Thus, there is an urgent need to explore these sequence-less EC numbers, in order to progressively close this gap. Description We designed ORENZA, a PostgreSQL database of ORphan ENZyme Activities, to collate information about the EC numbers defined by the NC-IUBMB with specific emphasis on orphan enzyme activities. Complete lists of all EC numbers and of orphan EC numbers are available and will be periodically updated. ORENZA allows one to browse the complete list of EC numbers or the subset associated with orphan enzymes or to query a specific EC number, an enzyme name or a species name for those interested in particular organisms. It is possible to search ORENZA for the different biochemical properties of the defined enzymes, the metabolic pathways in which they participate, the taxonomic data of the organisms whose genomes encode them, and many other features. The association of an enzyme activity with an amino acid sequence is clearly underlined, making it easy to identify at once the orphan enzyme activities. Interactive publishing of suggestions by the community would provide expert evidence for re-annotation of orphan EC numbers in public databases. Conclusion ORENZA is a Web resource designed to progressively bridge the unwanted gap between function (enzyme activities) and sequence (dataset present in public databases). ORENZA should increase interactions between communities of biochemists and of genomicists. This is expected to reduce the number of orphan enzyme activities by allocating gene sequences to the relevant enzymes. PMID:17026747
Nucleic acid arrays and methods of synthesis

DOEpatents

Sabanayagam, Chandran R.; Sano, Takeshi; Misasi, John; Hatch, Anson; Cantor, Charles

2001-01-01

The present invention generally relates to high density nucleic acid arrays and methods of synthesizing nucleic acid sequences on a solid surface. Specifically, the present invention contemplates the use of stabilized nucleic acid primer sequences immobilized on solid surfaces, and circular nucleic acid sequence templates combined with the use of isothermal rolling circle amplification to thereby increase nucleic acid sequence concentrations in a sample or on an array of nucleic acid sequences.
Workshop on the Design and Processing of Materials by Biomimicking Held in Seattle, Washington on 2-4 April 1991

DTIC Science & Technology

1991-11-01

These materials feature repeating stem-turn elements composed of 1-strands and amino acids predicted to participate in 1-hairpin formation. The...highly ordered crystalline material. Presently we are studying: (i) the effects of amino acid sequence on 1-turn formation, (ii) the influence of stem...length and amino acid composition on chain folding and materials properties, and (iii) the potential for biological incorporation of unnatural amino
Deep sequencing in library selection projects: what insight does it bring?

PubMed

Glanville, J; D'Angelo, S; Khan, T A; Reddy, S T; Naranjo, L; Ferrara, F; Bradbury, A R M

2015-08-01

High throughput sequencing is poised to change all aspects of the way antibodies and other binders are discovered and engineered. Millions of available sequence reads provide an unprecedented sampling depth able to guide the design and construction of effective, high quality naïve libraries containing tens of billions of unique molecules. Furthermore, during selections, high throughput sequencing enables quantitative tracing of enriched clones and position-specific guidance to amino acid variation under positive selection during antibody engineering. Successful application of the technologies relies on specific PCR reagent design, correct sequencing platform selection, and effective use of computational tools and statistical measures to remove error, identify antibodies, estimate diversity, and extract signatures of selection from the clone down to individual structural positions. Here we review these considerations and discuss some of the remaining challenges to the widespread adoption of the technology. Copyright © 2015 Elsevier Ltd. All rights reserved.
Deep sequencing in library selection projects: what insight does it bring?

PubMed Central

Glanville, J; D’Angelo, S; Khan, T.A.; Reddy, S. T.; Naranjo, L.; Ferrara, F.; Bradbury, A.R.M.

2015-01-01

High throughput sequencing is poised to change all aspects of the way antibodies and other binders are discovered and engineered. Millions of available sequence reads provide an unprecedented sampling depth able to guide the design and construction of effective, high quality naïve libraries containing tens of billions of unique molecules. Furthermore, during selections, high throughput sequencing enables quantitative tracing of enriched clones and position-specific guidance to amino acid variation under positive selection during antibody engineering. Successful application of the technologies relies on specific PCR reagent design, correct sequencing platform selection, and effective use of computational tools and statistical measures to remove error, identify antibodies, estimate diversity, and extract signatures of selection from the clone down to individual structural positions. Here we review these considerations and discuss some of the remaining challenges to the widespread adoption of the technology. PMID:26451649
Molecular cloning, structural analysis, and expression in Escherichia coli of a chitinase gene from Enterobacter agglomerans.

PubMed Central

Chernin, L S; De la Fuente, L; Sobolev, V; Haran, S; Vorgias, C E; Oppenheim, A B; Chet, I

1997-01-01

The gene chiA, which codes for endochitinase, was cloned from a soilborne Enterobacter agglomerans. Its complete sequence was determined, and the deduced amino acid sequence of the enzyme designated Chia_Entag yielded an open reading frame coding for 562 amino acids of a 61-kDa precursor protein with a putative leader peptide at its N terminus. The nucleotide and polypeptide sequences of Chia_Entag showed 86.8 and 87.7% identity with the corresponding gene and enzyme, Chia_Serma, of Serratia marcescens, respectively. Homology modeling of Chia_Entag's three-dimensional structure demonstrated that most amino acid substitutions are at solvent-accessible sites. Escherichia coli JM109 carrying the E. agglomerans chiA gene produced and secreted Chia_Entag. The antifungal activity of the secreted endochitinase was demonstrated in vitro by inhibition of Fusarium oxysporum spore germination. The transformed strain inhibited Rhizoctonia solani growth on plates and the root rot disease caused by this fungus in cotton seedlings under greenhouse conditions. PMID:9055404
Cloning and characterization of the nagA gene that encodes beta-n-acetylglucosaminidase from Aspergillus nidulans and its expression in Aspergillus oryzae.

PubMed

Kim, Sunhwa; Matsuo, Ichiro; Ajisaka, Katsumi; Nakajima, Harushi; Kitamoto, Katsuhiko

2002-10-01

We isolated a beta-N-acetylglucosaminidase encoding gene and its cDNA from the filamentous fungus Aspergillus nidulans, and designated it nagA. The nagA gene contained no intron and encoded a polypeptide of 603 amino acids with a putative 19-amino acid signal sequence. The deduced amino acid sequence was very similar to the sequence of Candida albicans Hex1 and Trichoderma harzianum Nag1. Yeast cells containing the nagA cDNA under the control of the GAL1 promoter expressed beta-N-acetylglucosaminidase activity. The chromosomal nagA gene of A. nidulans was disrupted by replacement with the argB marker gene. The disruptant strains expressed low levels of beta-N-acetylglucosaminidase activity and showed poor growth on a medium containing chitobiose as a carbon source. Aspergillus oryzae strain carrying the nagA gene under the control of the improved glaA promoter produced large amounts of beta-N-acetylglucosaminidase in a wheat bran solid culture.
Roles of JnRAP2.6-like from the transition zone of black walnut in hormone signaling

Treesearch

Zhonglian Huang; Peng Zhao; Jose Medina; Richard Meilan; Keith Woeste

2013-01-01

An EST sequence, designated JnRAP2-like, was isolated from tissue at the heartwood/sapwood transition zone (TZ) in black walnut (Juglans nigra L). The deduced amino acid sequence of JnRAP2-like protein consists of a single AP2- containing domain with significant similarity to conserved AP2/ERF DNA-binding domains in other...
Characterization of the novel antifungal protein PgAFP and the encoding gene of Penicillium chrysogenum.

PubMed

Rodríguez-Martín, Andrea; Acosta, Raquel; Liddell, Susan; Núñez, Félix; Benito, M José; Asensio, Miguel A

2010-04-01

The strain RP42C from Penicillium chrysogenum produces a small protein PgAFP that inhibits the growth of some toxigenic molds. The molecular mass of the protein determined by electrospray ionization mass spectrometry (ESI-MS) was 6 494Da. PgAFP showed a cationic character with an estimated pI value of 9.22. Upon chemical and enzymatic treatments of PgAFP, no evidence for N- or O-glycosylations was obtained. Five partial sequences of PgAFP were obtained by Edman degradation and by ESI-MS/MS after trypsin and chymotrypsin digestions. Using degenerate primers from these peptide sequences, a segment of 70bp was amplified by PCR from pgafp gene. 5'- and 3'-ends of pgafp were obtained by RACE-PCR with gene-specific primers designed from the 70bp segment. The complete pgafp sequence of 404bp was obtained using primers designed from 5'- and 3'-ends. Comparison of genomic and cDNA sequences revealed a 279bp coding region interrupted by two introns of 63 and 62bp. The precursor of the antifungal protein consists of 92 amino acids and appears to be processed to the mature 58 amino acids PgAFP. The deduced amino acid sequence of the mature protein shares 79% identity to the antifungal protein Anafp from Aspergillus niger. PgAFP is a new protein that belongs to the group of small, cysteine-rich, and basic proteins with antifungal activity produced by ascomycetes. Given that P. chrysogenum is regarded as safe mold commonly found in foods, PgAFP may be useful to prevent growth of toxigenic molds in food and agricultural products. Copyright (c) 2009 Elsevier Inc. All rights reserved.
Controlling the Surface Chemistry of Graphite by Engineered Self-Assembled Peptides

PubMed Central

Khatayevich, Dmitriy; So, Christopher R.; Hayamizu, Yuhei; Gresswell, Carolyn; Sarikaya, Mehmet

2012-01-01

The systematic control over surface chemistry is a long-standing challenge in biomedical and nanotechnological applications for graphitic materials. As a novel approach, we utilize graphite-binding dodecapeptides that self-assemble into dense domains to form monolayer thick long-range ordered films on graphite. Specifically, the peptides are rationally designed through their amino acid sequences to predictably display hydrophilic and hydrophobic characteristics while maintaining their self-assembly capabilities on the solid substrate. The peptides are observed to maintain a high tolerance for sequence modification, allowing the control over surface chemistry via their amino acid sequence. Furthermore, through a single step co-assembly of two different designed peptides, we predictably and precisely tune the wettability of the resulting functionalized graphite surfaces from 44 to 83 degrees. The modular molecular structures and predictable behavior of short peptides demonstrated here give rise to a novel platform for functionalizing graphitic materials that offers numerous advantages, including non-invasive modification of the substrate, bio-compatible processing in an aqueous environment, and simple fusion with other functional biological molecules. PMID:22428620
Viewing multiple sequence alignments with the JavaScript Sequence Alignment Viewer (JSAV)

PubMed Central

Martin, Andrew C. R.

2014-01-01

The JavaScript Sequence Alignment Viewer (JSAV) is designed as a simple-to-use JavaScript component for displaying sequence alignments on web pages. The display of sequences is highly configurable with options to allow alternative coloring schemes, sorting of sequences and ’dotifying’ repeated amino acids. An option is also available to submit selected sequences to another web site, or to other JavaScript code. JSAV is implemented purely in JavaScript making use of the JQuery and JQuery-UI libraries. It does not use any HTML5-specific options to help with browser compatibility. The code is documented using JSDOC and is available from http://www.bioinf.org.uk/software/jsav/. PMID:25653836
Viewing multiple sequence alignments with the JavaScript Sequence Alignment Viewer (JSAV).

PubMed

Martin, Andrew C R

2014-01-01

The JavaScript Sequence Alignment Viewer (JSAV) is designed as a simple-to-use JavaScript component for displaying sequence alignments on web pages. The display of sequences is highly configurable with options to allow alternative coloring schemes, sorting of sequences and 'dotifying' repeated amino acids. An option is also available to submit selected sequences to another web site, or to other JavaScript code. JSAV is implemented purely in JavaScript making use of the JQuery and JQuery-UI libraries. It does not use any HTML5-specific options to help with browser compatibility. The code is documented using JSDOC and is available from http://www.bioinf.org.uk/software/jsav/.
Designer proton-channel transgenic algae for photobiological hydrogen production

DOEpatents

Lee, James Weifu [Knoxville, TN

2011-04-26

A designer proton-channel transgenic alga for photobiological hydrogen production that is specifically designed for production of molecular hydrogen (H.sub.2) through photosynthetic water splitting. The designer transgenic alga includes proton-conductive channels that are expressed to produce such uncoupler proteins in an amount sufficient to increase the algal H.sub.2 productivity. In one embodiment the designer proton-channel transgene is a nucleic acid construct (300) including a PCR forward primer (302), an externally inducible promoter (304), a transit targeting sequence (306), a designer proton-channel encoding sequence (308), a transcription and translation terminator (310), and a PCR reverse primer (312). In various embodiments, the designer proton-channel transgenic algae are used with a gas-separation system (500) and a gas-products-separation and utilization system (600) for photobiological H.sub.2 production.
Mapping the neutralizing epitopes on the glycoprotein of infectious haematopoietic necrosis virus, a fish rhabdovirus

USGS Publications Warehouse

Huang, C.; Chien, M.S.; Landolt, M.L.; Batts, W.; Winton, J.

1996-01-01

Twelve neutralizing monoclonal antibodies (MAbs) against the fish rhabdovirus, infectious haematopoietic necrosis virus (IHNV), were used to select 20 MAb escape mutants. The nucleotide sequence of the entire glycoprotein (G) gene was determined for six mutants representing differing cross-neutralization patterns and each had a single nucleotide change leading to a single amino acid substitution within one of three regions of the protein. These data were used to design nested PCR primers to amplify portions of the G gene of the 14 remaining mutants. When the PCR products from these mutants were sequenced, they also had single nucleotide substitutions coding for amino acid substitutions at the same, or nearby, locations. Of the 20 mutants for which all or part of the glycoprotein gene was sequenced, two MAbs selected mutants with substitutions at amino acids 230-231 (antigenic site I) and the remaining MAbs selected mutants with substitutions at amino acids 272-276 (antigenic site II). Two MAbs that selected mutants mapping to amino acids 272-276, selected other mutants that mapped to amino acids 78-81, raising the possibility that this portion of the N terminus of the protein was part of a discontinuous epitope defining antigenic site II. CLUSTAL alignment of the glycoproteins of rabies virus, vesicular stomatitis virus and IHNV revealed similarities in the location of the neutralizing epitopes and a high degree of conservation among cysteine residues, indicating that the glycoproteins of three different genera of animal rhabdoviruses may share a similar three-dimensional structure in spite of extensive sequence divergence.

Self-assembled bionanostructures: proteins following the lead of DNA nanostructures

PubMed Central

2014-01-01

Natural polymers are able to self-assemble into versatile nanostructures based on the information encoded into their primary structure. The structural richness of biopolymer-based nanostructures depends on the information content of building blocks and the available biological machinery to assemble and decode polymers with a defined sequence. Natural polypeptides comprise 20 amino acids with very different properties in comparison to only 4 structurally similar nucleotides, building elements of nucleic acids. Nevertheless the ease of synthesizing polynucleotides with selected sequence and the ability to encode the nanostructural assembly based on the two specific nucleotide pairs underlay the development of techniques to self-assemble almost any selected three-dimensional nanostructure from polynucleotides. Despite more complex design rules, peptides were successfully used to assemble symmetric nanostructures, such as fibrils and spheres. While earlier designed protein-based nanostructures used linked natural oligomerizing domains, recent design of new oligomerizing interaction surfaces and introduction of the platform for topologically designed protein fold may enable polypeptide-based design to follow the track of DNA nanostructures. The advantages of protein-based nanostructures, such as the functional versatility and cost effective and sustainable production methods provide strong incentive for further development in this direction. PMID:24491139
Structure/Function Analyses of Human Serum Paraoxonase (HuPON1) Mutants Designed from a DFPase-Like Homology Model

DTIC Science & Technology

2004-08-23

purified HuPON1 Substitution of amino acid residues in the HuPONI enzyme was accomplished by PCR-based site-directed Two methods were utilized to...including organophosphates and lactones, and exhibits anti-atherogenic properties. A few amino acids have been shown to be essential for the enzyme’s...not been assigned to those residues. Based on scquence-structure alignment studies, we have folded the amino acid sequence of HuPON I onto the sixfold
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

Code of Federal Regulations, 2011 CFR

2011-07-01

... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data shall...
Isolation and in silico analysis of a novel H+-pyrophosphatase gene orthologue from the halophytic grass Leptochloa fusca

NASA Astrophysics Data System (ADS)

Rauf, Muhammad; Saeed, Nasir A.; Habib, Imran; Ahmed, Moddassir; Shahzad, Khurram; Mansoor, Shahid; Ali, Rashid

2017-02-01

Structure prediction can provide information about function and active sites of protein which helps to design new functional proteins. H+-pyrophosphatase is transmembrane protein involved in establishing proton motive force for active transport of Na+ across membrane by Na+/H+ antiporters. A full length novel H+-pyrophosphatase gene was isolated from halophytic grass Leptochloa fusca using RT-PCR and RACE method. Full length LfVP1 gene sequence of 2292 nucleotides encodes protein of 764 amino acids. DNA and protein sequences were used for characterization using bioinformatics tools. Various important potential sites were predicted by PROSITE webserver. Primary structural analysis showed LfVP1 as stable protein and Grand average hydropathy (GRAVY) indicated that LfVP1 protein has good hydrosolubility. Secondary structure analysis showed that LfVP1 protein sequence contains significant proportion of alpha helix and random coil. Protein membrane topology suggested the presence of 14 transmembrane domains and presence of catalytic domain in TM3. Three dimensional structure from LfVP1 protein sequence also indicated the presence of 14 transmembrane domains and hydrophobicity surface model showed amino acid hydrophobicity. Ramachandran plot showed that 98% amino acid residues were predicted in the favored region.
Improved Modeling of Side-Chain–Base Interactions and Plasticity in Protein–DNA Interface Design

PubMed Central

Thyme, Summer B.; Baker, David; Bradley, Philip

2012-01-01

Combinatorial sequence optimization for protein design requires libraries of discrete side-chain conformations. The discreteness of these libraries is problematic, particularly for long, polar side chains, since favorable interactions can be missed. Previously, an approach to loop remodeling where protein backbone movement is directed by side-chain rotamers predicted to form interactions previously observed in native complexes (termed “motifs”) was described. Here, we show how such motif libraries can be incorporated into combinatorial sequence optimization protocols and improve native complex recapitulation. Guided by the motif rotamer searches, we made improvements to the underlying energy function, increasing recapitulation of native interactions. To further test the methods, we carried out a comprehensive experimental scan of amino acid preferences in the I-AniI protein–DNA interface and found that many positions tolerated multiple amino acids. This sequence plasticity is not observed in the computational results because of the fixed-backbone approximation of the model. We improved modeling of this diversity by introducing DNA flexibility and reducing the convergence of the simulated annealing algorithm that drives the design process. In addition to serving as a benchmark, this extensive experimental data set provides insight into the types of interactions essential to maintain the function of this potential gene therapy reagent. PMID:22426128
Improved modeling of side-chain--base interactions and plasticity in protein--DNA interface design.

PubMed

Thyme, Summer B; Baker, David; Bradley, Philip

2012-06-08

Combinatorial sequence optimization for protein design requires libraries of discrete side-chain conformations. The discreteness of these libraries is problematic, particularly for long, polar side chains, since favorable interactions can be missed. Previously, an approach to loop remodeling where protein backbone movement is directed by side-chain rotamers predicted to form interactions previously observed in native complexes (termed "motifs") was described. Here, we show how such motif libraries can be incorporated into combinatorial sequence optimization protocols and improve native complex recapitulation. Guided by the motif rotamer searches, we made improvements to the underlying energy function, increasing recapitulation of native interactions. To further test the methods, we carried out a comprehensive experimental scan of amino acid preferences in the I-AniI protein-DNA interface and found that many positions tolerated multiple amino acids. This sequence plasticity is not observed in the computational results because of the fixed-backbone approximation of the model. We improved modeling of this diversity by introducing DNA flexibility and reducing the convergence of the simulated annealing algorithm that drives the design process. In addition to serving as a benchmark, this extensive experimental data set provides insight into the types of interactions essential to maintain the function of this potential gene therapy reagent. Published by Elsevier Ltd.
Structure of a designed, right-handed coiled-coil tetramer containing all biological amino acids

PubMed Central

Sales, Mark; Plecs, Joseph J.; Holton, James M.; Alber, Tom

2007-01-01

The previous design of an unprecedented family of two-, three-, and four-helical, right-handed coiled coils utilized nonbiological amino acids to efficiently pack spaces in the oligomer cores. Here we show that a stable, right-handed parallel tetrameric coiled coil, called RH4B, can be designed entirely using biological amino acids. The X-ray crystal structure of RH4B was determined to 1.1 Å resolution using a designed metal binding site to coordinate a single Yb2+ ion per 33-amino acid polypeptide chain. The resulting experimental phases were particularly accurate, and the experimental electron density map provided an especially clear, unbiased view of the molecule. The RH4B structure closely matched the design, with equivalent core rotamers and an overall root-mean-square deviation for the N-terminal repeat of the tetramer of 0.24 Å. The clarity and resolution of the electron density map, however, revealed alternate rotamers and structural differences between the three sequence repeats in the molecule. These results suggest that the RH4B structure populates an unanticipated variety of structures. PMID:17766380
Structure of a designed, right-handed coiled-coil tetramer containing all biological amino acids.

PubMed

Sales, Mark; Plecs, Joseph J; Holton, James M; Alber, Tom

2007-10-01

The previous design of an unprecedented family of two-, three-, and four-helical, right-handed coiled coils utilized nonbiological amino acids to efficiently pack spaces in the oligomer cores. Here we show that a stable, right-handed parallel tetrameric coiled coil, called RH4B, can be designed entirely using biological amino acids. The X-ray crystal structure of RH4B was determined to 1.1 Angstrom resolution using a designed metal binding site to coordinate a single Yb(2+) ion per 33-amino acid polypeptide chain. The resulting experimental phases were particularly accurate, and the experimental electron density map provided an especially clear, unbiased view of the molecule. The RH4B structure closely matched the design, with equivalent core rotamers and an overall root-mean-square deviation for the N-terminal repeat of the tetramer of 0.24 Angstrom. The clarity and resolution of the electron density map, however, revealed alternate rotamers and structural differences between the three sequence repeats in the molecule. These results suggest that the RH4B structure populates an unanticipated variety of structures.
Functional genomics of lactic acid bacteria: from food to health

PubMed Central

2014-01-01

Genome analysis using next generation sequencing technologies has revolutionized the characterization of lactic acid bacteria and complete genomes of all major groups are now available. Comparative genomics has provided new insights into the natural and laboratory evolution of lactic acid bacteria and their environmental interactions. Moreover, functional genomics approaches have been used to understand the response of lactic acid bacteria to their environment. The results have been instrumental in understanding the adaptation of lactic acid bacteria in artisanal and industrial food fermentations as well as their interactions with the human host. Collectively, this has led to a detailed analysis of genes involved in colonization, persistence, interaction and signaling towards to the human host and its health. Finally, massive parallel genome re-sequencing has provided new opportunities in applied genomics, specifically in the characterization of novel non-GMO strains that have potential to be used in the food industry. Here, we provide an overview of the state of the art of these functional genomics approaches and their impact in understanding, applying and designing lactic acid bacteria for food and health. PMID:25186768
Functional genomics of lactic acid bacteria: from food to health.

PubMed

Douillard, François P; de Vos, Willem M

2014-08-29

Genome analysis using next generation sequencing technologies has revolutionized the characterization of lactic acid bacteria and complete genomes of all major groups are now available. Comparative genomics has provided new insights into the natural and laboratory evolution of lactic acid bacteria and their environmental interactions. Moreover, functional genomics approaches have been used to understand the response of lactic acid bacteria to their environment. The results have been instrumental in understanding the adaptation of lactic acid bacteria in artisanal and industrial food fermentations as well as their interactions with the human host. Collectively, this has led to a detailed analysis of genes involved in colonization, persistence, interaction and signaling towards to the human host and its health. Finally, massive parallel genome re-sequencing has provided new opportunities in applied genomics, specifically in the characterization of novel non-GMO strains that have potential to be used in the food industry. Here, we provide an overview of the state of the art of these functional genomics approaches and their impact in understanding, applying and designing lactic acid bacteria for food and health.
CHARACTERIZATION AND NUCLEOTIDE SEQUENCE DETERMINATION OF A REPEAT ELEMENT ISOLATED FROM A 2,4,5,-T DEGRADING STRAIN OF PSEUDOMONAS CEPACIA

EPA Science Inventory

Pseudomonas cepacia strain AC1100, capable of growth on 2,4,5-trichlorophenoxyacetic acid (2,4,5-T), was mutated to the 2,4,5-T− strain PT88 by a ColE1 :: Tn5 chromosomal insertion. Using cloned DNA from the region flanking the insertion, a 1477-bp sequence (designated RS1100) wa...
Discriminating between stabilizing and destabilizing protein design mutations via recombination and simulation.

PubMed

Johnson, Lucas B; Gintner, Lucas P; Park, Sehoo; Snow, Christopher D

2015-08-01

Accuracy of current computational protein design (CPD) methods is limited by inherent approximations in energy potentials and sampling. These limitations are often used to qualitatively explain design failures; however, relatively few studies provide specific examples or quantitative details that can be used to improve future CPD methods. Expanding the design method to include a library of sequences provides data that is well suited for discriminating between stabilizing and destabilizing design elements. Using thermophilic endoglucanase E1 from Acidothermus cellulolyticus as a model enzyme, we computationally designed a sequence with 60 mutations. The design sequence was rationally divided into structural blocks and recombined with the wild-type sequence. Resulting chimeras were assessed for activity and thermostability. Surprisingly, unlike previous chimera libraries, regression analysis based on one- and two-body effects was not sufficient for predicting chimera stability. Analysis of molecular dynamics simulations proved helpful in distinguishing stabilizing and destabilizing mutations. Reverting to the wild-type amino acid at destabilized sites partially regained design stability, and introducing predicted stabilizing mutations in wild-type E1 significantly enhanced thermostability. The ability to isolate stabilizing and destabilizing elements in computational design offers an opportunity to interpret previous design failures and improve future CPD methods. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
PinaColada: peptide-inhibitor ant colony ad-hoc design algorithm.

PubMed

Zaidman, Daniel; Wolfson, Haim J

2016-08-01

Design of protein-protein interaction (PPI) inhibitors is a major challenge in Structural Bioinformatics. Peptides, especially short ones (5-15 amino acid long), are natural candidates for inhibition of protein-protein complexes due to several attractive features such as high structural compatibility with the protein binding site (mimicking the surface of one of the proteins), small size and the ability to form strong hotspot binding connections with the protein surface. Efficient rational peptide design is still a major challenge in computer aided drug design, due to the huge space of possible sequences, which is exponential in the length of the peptide, and the high flexibility of peptide conformations. In this article we present PinaColada, a novel computational method for the design of peptide inhibitors for protein-protein interactions. We employ a version of the ant colony optimization heuristic, which is used to explore the exponential space ([Formula: see text]) of length n peptide sequences, in combination with our fast robotics motivated PepCrawler algorithm, which explores the conformational space for each candidate sequence. PinaColada is being run in parallel, on a DELL PowerEdge 2.8 GHZ computer with 20 cores and 256 GB memory, and takes up to 24 h to design a peptide of 5-15 amino acids length. An online server available at: http://bioinfo3d.cs.tau.ac.il/PinaColada/. danielza@post.tau.ac.il; wolfson@tau.ac.il. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Solid phase sequencing of double-stranded nucleic acids

DOEpatents

Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.

2002-01-01

This invention relates to methods for detecting and sequencing of target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probe comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.
Metamorphic Proteins: Emergence of Dual Protein Folds from One Primary Sequence.

PubMed

Lella, Muralikrishna; Mahalakshmi, Radhakrishnan

2017-06-20

Every amino acid exhibits a different propensity for distinct structural conformations. Hence, decoding how the primary amino acid sequence undergoes the transition to a defined secondary structure and its final three-dimensional fold is presently considered predictable with reasonable certainty. However, protein sequences that defy the first principles of secondary structure prediction (they attain two different folds) have recently been discovered. Such proteins, aptly named metamorphic proteins, decrease the conformational constraint by increasing flexibility in the secondary structure and thereby result in efficient functionality. In this review, we discuss the major factors driving the conformational switch related both to protein sequence and to structure using illustrative examples. We discuss the concept of an evolutionary transition in sequence and structure, the functional impact of the tertiary fold, and the pressure of intrinsic and external factors that give rise to metamorphic proteins. We mainly focus on the major components of protein architecture, namely, the α-helix and β-sheet segments, which are involved in conformational switching within the same or highly similar sequences. These chameleonic sequences are widespread in both cytosolic and membrane proteins, and these folds are equally important for protein structure and function. We discuss the implications of metamorphic proteins and chameleonic peptide sequences in de novo peptide design.
Treatability of cheese whey for single-cell protein production in nonsterile systems: Part II. The application of aerobic sequencing batch reactor (aerobic SBR) to produce high biomass of Dioszegia sp. TISTR 5792.

PubMed

Monkoondee, Sarawut; Kuntiya, Ampin; Chaiyaso, Thanongsak; Leksawasdi, Noppol; Techapun, Charin; Kawee-Ai, Arthitaya; Seesuriyachan, Phisit

2016-07-03

This study aimed to investigate the efficiency of an aerobic sequencing batch reactor (aerobic SBR) in a nonsterile system using the application of an experimental design via central composite design (CCD). The acidic whey obtained from lactic acid fermentation by immobilized Lactobacillus plantarum sp. TISTR 2265 was fed into the bioreactor of the aerobic SBR in an appropriate ratio between acidic whey and cheese whey to produce an acidic environment below 4.5 and then was used to support the growth of Dioszegia sp. TISTR 5792 by inhibiting bacterial contamination. At the optimal condition for a high yield of biomass production, the system was run with a hydraulic retention time (HRT) of 4 days, a solid retention time (SRT) of 8.22 days, and an acidic whey concentration of 80% feeding. The chemical oxygen demand (COD) decreased from 25,230 mg/L to 6,928 mg/L, which represented a COD removal of 72.15%. The yield of biomass production and lactose utilization by Dioszegia sp. TISTR 5792 were 13.14 g/L and 33.36%, respectively, with a long run of up to 180 cycles and the pH values of effluent were rose up to 8.32 without any pH adjustment.
Miniaturized isothermal nucleic acid amplification, a review.

PubMed

Asiello, Peter J; Baeumner, Antje J

2011-04-21

Micro-Total Analysis Systems (µTAS) for use in on-site rapid detection of DNA or RNA are increasingly being developed. Here, amplification of the target sequence is key to increasing sensitivity, enabling single-cell and few-copy nucleic acid detection. The several advantages to miniaturizing amplification reactions and coupling them with sample preparation and detection on the same chip are well known and include fewer manual steps, preventing contamination, and significantly reducing the volume of expensive reagents. To-date, the majority of miniaturized systems for nucleic acid analysis have used the polymerase chain reaction (PCR) for amplification and those systems are covered in previous reviews. This review provides a thorough overview of miniaturized analysis systems using alternatives to PCR, specifically isothermal amplification reactions. With no need for thermal cycling, isothermal microsystems can be designed to be simple and low-energy consuming and therefore may outperform PCR in portable, battery-operated detection systems in the future. The main isothermal methods as miniaturized systems reviewed here include nucleic acid sequence-based amplification (NASBA), loop-mediated isothermal amplification (LAMP), helicase-dependent amplification (HDA), rolling circle amplification (RCA), and strand displacement amplification (SDA). Also, important design criteria for the miniaturized devices are discussed. Finally, the potential of miniaturization of some new isothermal methods such as the exponential amplification reaction (EXPAR), isothermal and chimeric primer-initiated amplification of nucleic acids (ICANs), signal-mediated amplification of RNA technology (SMART) and others is presented.
Direct Calculation of Protein Fitness Landscapes through Computational Protein Design

PubMed Central

Au, Loretta; Green, David F.

2016-01-01

Naturally selected amino-acid sequences or experimentally derived ones are often the basis for understanding how protein three-dimensional conformation and function are determined by primary structure. Such sequences for a protein family comprise only a small fraction of all possible variants, however, representing the fitness landscape with limited scope. Explicitly sampling and characterizing alternative, unexplored protein sequences would directly identify fundamental reasons for sequence robustness (or variability), and we demonstrate that computational methods offer an efficient mechanism toward this end, on a large scale. The dead-end elimination and A∗ search algorithms were used here to find all low-energy single mutant variants, and corresponding structures of a G-protein heterotrimer, to measure changes in structural stability and binding interactions to define a protein fitness landscape. We established consistency between these algorithms with known biophysical and evolutionary trends for amino-acid substitutions, and could thus recapitulate known protein side-chain interactions and predict novel ones. PMID:26745411
Molecular characterization of chikungunya virus from Andhra Pradesh, India & phylogenetic relationship with Central African isolates.

PubMed

M Naresh Kumar, C V; Anthony Johnson, A M; R Sai Gopal, D V

2007-12-01

Chikungunya virus has caused numerous large outbreaks in India. Suspected blood samples from the epidemic were collected and characterized for the identification of the responsible causative from Rayalaseema region of Andhra Pradesh. RT-PCR was used for screening of suspected blood samples. Primers were designed to amplify partial E1 gene and the amplified fragment was cloned and sequenced. The sequence was analyzed and compared with other geographical isolates to find the phylogenetic relationship. The sequence was submitted to the Gen bank DNA database (accession DQ888620). Comparative nucleotide homology analysis of the AP Ra-CTR isolate with the other isolates revealed 94.7+/-3.6 per cent of homology of CHIKAPRa-CTR with other isolates of Chikungunya virus at nucleotide level and 96.8+/-3.2 per cent of homology at amino acid level. The current epidemic was caused by the Central African genotype of CHIKV, grouped in Central Africa cluster in phylogenetic trees generated based on nucleotide and amino acid sequences.
Detection of arc genes related with the ethyl carbamate precursors in wine lactic acid bacteria.

PubMed

Araque, Isabel; Gil, Joana; Carreté, Ramon; Bordons, Albert; Reguant, Cristina

2009-03-11

Trace amounts of the carcinogen ethyl carbamate can appear in wine by the reaction of ethanol with compounds such as citrulline and carbamyl phosphate, which are produced from arginine degradation by some wine lactic acid bacteria (LAB). In this work, the presence of arc genes for the arginine-deiminase pathway was studied in several strains of different species of LAB. Their ability to degrade arginine was also studied. To detect the presence of arc genes, degenerate primers were designed from the alignment of protein sequences in already sequenced LAB. The usefulness of these degenerate primers has been proven by sequencing some of the amplified PCR fragments and searching for homologies with published sequences of the same species and related ones. Correlation was found between the presence of genes and the ability to degrade arginine. Degrading strains included all heterofermentative lactobacilli, Oenococcus oeni , Pediococcus pentosaceus , and some strains of Leuconostoc mesenteroides and Lactobacillus plantarum .

Screening and Identification of Peptides Specifically Targeted to Gastric Cancer Cells from a Phage Display Peptide Library

PubMed

Sahin, Deniz; Taflan, Sevket Onur; Yartas, Gizem; Ashktorab, Hassan; Smoot, Duane T

2018-04-25

Background: Gastric cancer is the second most common cancer among the malign cancer types. Inefficiency of traditional techniques both in diagnosis and therapy of the disease makes the development of alternative and novel techniques indispensable. As an alternative to traditional methods, tumor specific targeting small peptides can be used to increase the efficiency of the treatment and reduce the side effects related to traditional techniques. The aim of this study is screening and identification of individual peptides specifically targeted to human gastric cancer cells using a phage-displayed peptide library and designing specific peptide sequences by using experimentally-eluted peptide sequences. Methods: Here, MKN-45 human gastric cancer cells and HFE-145 human normal gastric epithelial cells were used as the target and control cells, respectively. 5 rounds of biopannning with a phage display 12-peptide library were applied following subtraction biopanning with HFE-145 control cells. The selected phage clones were established by enzyme-linked immunosorbent assay and immunofluorescence detection. We first obtain random phage clones after five biopanning rounds, determine the binding levels of each individual clone. Then, we analyze the frequencies of each amino acid in best binding clones to determine positively overexpressed amino acids for designing novel peptide sequences. Results: DE532 (VETSQYFRGTLS) phage clone was screened positive, showing specific binding on MKN-45 gastric cancer cells. DE-Obs (HNDLFPSWYHNY) peptide, which was designed by using amino acid frequencies of experimentally selected peptides in the 5th round of biopanning, showed specific binding in MKN-45 cells. Conclusion: Selection and characterization of individual clones may give us specifically binding peptides, but more importantly, data extracted from eluted phage clones may be used to design theoretical peptides with better binding properties than even experimentally selected ones. Both peptides, experimental and designed, may be potential candidates to be developed as useful diagnostic or therapeutic ligand molecules in gastric cancer research. Creative Commons Attribution License
Solid phase sequencing of biopolymers

DOEpatents

Cantor, Charles; Koster, Hubert

2010-09-28

This invention relates to methods for detecting and sequencing target nucleic acid sequences, to mass modified nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include DNA or RNA in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated molecular weight analysis and identification of the target sequence.
A multigene family related to chitin synthase genes of yeast in the opportunistic pathogen Aspergillus fumigatus.

PubMed

Mellado, E; Aufauvre-Brown, A; Specht, C A; Robbins, P W; Holden, D W

1995-02-06

Two approaches were used to isolate fragments of chitin synthase genes from the opportunistic human pathogen Aspergillus fumigatus. Firstly, regions of amino acid conservation in chitin synthases of Saccharomyces cerevisiae were used to design degenerate primers for amplification of portions of related genes, and secondly, a segment of the S. cerevisiae CSD2 gene was used to screen an A. fumigatus lambda genomic DNA library. the polymerase chain reaction (PCR)-based approach led to the identification of five different genes, designated chsA, chsB, chsC, chsD and chsE. chsA, chsB, and chsC fall into Classes I, II and III of the 'zymogen type' chitin synthases, respectively. The chsD fragment has approximately 35% amino acid sequence identity to both the zymogen type genes and the non-zymogen type CSD2 gene. chsF appears to be a homologue of CSD2, being 80% identical to CSD2 over 100 amino acids. An unexpected finding was the isolation by heterologous hybridization of another gene (chsE), which also has strong sequence similarity (54% identity at the amino acid level over the same region as chsF) to CSD2. Reverse transcriptase-PCR was used to show that each gene is expressed during hyphal growth in submerged cultures.
Consensus-Degenerate Hybrid Oligonucleotide Primers for Amplification of Priming Glycosyltransferase Genes of the Exopolysaccharide Locus in Strains of the Lactobacillus casei Group

PubMed Central

Provencher, Cathy; LaPointe, Gisèle; Sirois, Stéphane; Van Calsteren, Marie-Rose; Roy, Denis

2003-01-01

A primer design strategy named CODEHOP (consensus-degenerate hybrid oligonucleotide primer) for amplification of distantly related sequences was used to detect the priming glycosyltransferase (GT) gene in strains of the Lactobacillus casei group. Each hybrid primer consisted of a short 3′ degenerate core based on four highly conserved amino acids and a longer 5′ consensus clamp region based on six sequences of the priming GT gene products from exopolysaccharide (EPS)-producing bacteria. The hybrid primers were used to detect the priming GT gene of 44 commercial isolates and reference strains of Lactobacillus rhamnosus, L. casei, Lactobacillus zeae, and Streptococcus thermophilus. The priming GT gene was detected in the genome of both non-EPS-producing (EPS−) and EPS-producing (EPS+) strains of L. rhamnosus. The sequences of the cloned PCR products were similar to those of the priming GT gene of various gram-negative and gram-positive EPS+ bacteria. Specific primers designed from the L. rhamnosus RW-9595M GT gene were used to sequence the end of the priming GT gene in selected EPS+ strains of L. rhamnosus. Phylogenetic analysis revealed that Lactobacillus spp. form a distinctive group apart from other lactic acid bacteria for which GT genes have been characterized to date. Moreover, the sequences show a divergence existing among strains of L. rhamnosus with respect to the terminal region of the priming GT gene. Thus, the PCR approach with consensus-degenerate hybrid primers designed with CODEHOP is a practical approach for the detection of similar genes containing conserved motifs in different bacterial genomes. PMID:12788729
A modular DNA signal translator for the controlled release of a protein by an aptamer.

PubMed

Beyer, Stefan; Simmel, Friedrich C

2006-01-01

Owing to the intimate linkage of sequence and structure in nucleic acids, DNA is an extremely attractive molecule for the development of molecular devices, in particular when a combination of information processing and chemomechanical tasks is desired. Many of the previously demonstrated devices are driven by hybridization between DNA 'effector' strands and specific recognition sequences on the device. For applications it is of great interest to link several of such molecular devices together within artificial reaction cascades. Often it will not be possible to choose DNA sequences freely, e.g. when functional nucleic acids such as aptamers are used. In such cases translation of an arbitrary 'input' sequence into a desired effector sequence may be required. Here we demonstrate a molecular 'translator' for information encoded in DNA and show how it can be used to control the release of a protein by an aptamer using an arbitrarily chosen DNA input strand. The function of the translator is based on branch migration and the action of the endonuclease FokI. The modular design of the translator facilitates the adaptation of the device to various input or output sequences.
A modular DNA signal translator for the controlled release of a protein by an aptamer

PubMed Central

Beyer, Stefan; Simmel, Friedrich C.

2006-01-01

Owing to the intimate linkage of sequence and structure in nucleic acids, DNA is an extremely attractive molecule for the development of molecular devices, in particular when a combination of information processing and chemomechanical tasks is desired. Many of the previously demonstrated devices are driven by hybridization between DNA ‘effector’ strands and specific recognition sequences on the device. For applications it is of great interest to link several of such molecular devices together within artificial reaction cascades. Often it will not be possible to choose DNA sequences freely, e.g. when functional nucleic acids such as aptamers are used. In such cases translation of an arbitrary ‘input’ sequence into a desired effector sequence may be required. Here we demonstrate a molecular ‘translator’ for information encoded in DNA and show how it can be used to control the release of a protein by an aptamer using an arbitrarily chosen DNA input strand. The function of the translator is based on branch migration and the action of the endonuclease FokI. The modular design of the translator facilitates the adaptation of the device to various input or output sequences. PMID:16547201
Femtomolar Ln(III) affinity in peptide-based ligands containing unnatural chelating amino acids.

PubMed

Niedźwiecka, Agnieszka; Cisnetti, Federico; Lebrun, Colette; Delangle, Pascale

2012-05-07

The incorporation of unnatural chelating amino acids in short peptide sequences leads to lanthanide-binding peptides with a higher stability than sequences built exclusively from natural residues. In particular, the hexadentate peptide P(22), which incorporates two unnatural amino acids Ada(2) with aminodiacetate chelating arms, showed picomolar affinity for Tb(3+). To design peptides with higher denticity, expected to show higher affinity for Ln(3+), we synthesized the novel unnatural amino acid Ed3a(2) which carries an ethylenediamine triacetate side-chain and affords a pentadentate coordination site. The synthesis of the derivative Fmoc-Ed3a(2)(tBu)(3)-OH, with appropriate protecting groups for direct use in the solid phase peptide synthesis (Fmoc strategy), is described. The two high denticity peptides P(HD2) (Ac-Trp-Ed3a(2)-Pro-Gly-Ada(2)-Gly-NH(2)) and P(HD5) (Ac-Trp-Ada(2)-Pro-Gly-Ed3a(2)-Gly-NH(2)) led to octadentate Tb(3+) complexes with femtomolar stability in water. The position of the high denticity amino acid Ed3a(2) in the hexapeptide sequence appears to be critical for the control of the metal complex speciation. Whereas P(HD5) promotes the formation of polymetallic species in excess of Ln(3+), P(HD2) forms exclusively the mononuclear complex. The octadentate coordination of Tb(3+) by both P(HD) leads to total dehydration of the metal ion in the mononuclear complexes with long luminescence lifetimes (>2 ms). Hence, we demonstrated that unnatural amino acids carrying polyaminocarboxylate side-chains are interesting building blocks to design high affinity Ln-binding peptides. In particular the novel peptide P(HD2) forms a unique octadentate Tb(3+) complex with femtomolar stability in water and an improvement of the luminescence properties with respect to the trisaquo TbP(22) complex by a factor of 4.
Bacterial expression of self-assembling peptide hydrogelators

NASA Astrophysics Data System (ADS)

Sonmez, Cem

For tissue regeneration and drug delivery applications, various architectures are explored to serve as biomaterial tools. Via de novo design, functional peptide hydrogel materials have been developed as scaffolds for biomedical applications. The objective of this study is to investigate bacterial expression as an alternative method to chemical synthesis for the recombinant production of self-assembling peptides that can form rigid hydrogels under physiological conditions. The Schneider and Pochan Labs have designed and characterized a 20 amino acid beta-hairpin forming amphiphilic peptide containing a D-residue in its turn region (MAX1). As a result, this peptide must be prepared chemically. Peptide engineering, using the sequence of MAX1 as a template, afforded a small family of peptides for expression (EX peptides) that have different turn sequences consisting of natural amino acids and amenable to bacterial expression. Each sequence was initially chemically synthesized to quickly assess the material properties of its corresponding gel. One model peptide EX1, was chosen to start the bacterial expression studies. DNA constructs facilitating the expression of EX1 were designed in such that the peptide could be expressed with different fusion partners and subsequently cleaved by enzymatic or chemical means to afford the free peptide. Optimization studies were performed to increase the yield of pure peptide that ultimately allowed 50 mg of pure peptide to be harvested from one liter of culture, providing an alternate means to produce this hydrogel-forming peptide. Recombinant production of other self-assembling hairpins with different turn sequences was also successful using this optimized protocol. The studies demonstrate that new beta-hairpin self-assembling peptides that are amenable to bacterial production and form rigid hydrogels at physiological conditions can be designed and produced by fermentation in good yield at significantly reduced cost when compared to chemical synthesis.
Bioinformatics analysis and detection of gelatinase encoded gene in Lysinibacillussphaericus

NASA Astrophysics Data System (ADS)

Repin, Rul Aisyah Mat; Mutalib, Sahilah Abdul; Shahimi, Safiyyah; Khalid, Rozida Mohd.; Ayob, Mohd. Khan; Bakar, Mohd. Faizal Abu; Isa, Mohd Noor Mat

2016-11-01

In this study, we performed bioinformatics analysis toward genome sequence of Lysinibacillussphaericus (L. sphaericus) to determine gene encoded for gelatinase. L. sphaericus was isolated from soil and gelatinase species-specific bacterium to porcine and bovine gelatin. This bacterium offers the possibility of enzymes production which is specific to both species of meat, respectively. The main focus of this research is to identify the gelatinase encoded gene within the bacteria of L. Sphaericus using bioinformatics analysis of partially sequence genome. From the research study, three candidate gene were identified which was, gelatinase candidate gene 1 (P1), NODE_71_length_93919_cov_158.931839_21 which containing 1563 base pair (bp) in size with 520 amino acids sequence; Secondly, gelatinase candidate gene 2 (P2), NODE_23_length_52851_cov_190.061386_17 which containing 1776 bp in size with 591 amino acids sequence; and Thirdly, gelatinase candidate gene 3 (P3), NODE_106_length_32943_cov_169.147919_8 containing 1701 bp in size with 566 amino acids sequence. Three pairs of oligonucleotide primers were designed and namely as, F1, R1, F2, R2, F3 and R3 were targeted short sequences of cDNA by PCR. The amplicons were reliably results in 1563 bp in size for candidate gene P1 and 1701 bp in size for candidate gene P3. Therefore, the results of bioinformatics analysis of L. Sphaericus resulting in gene encoded gelatinase were identified.
Assignment of fatty acid-beta-oxidizing syntrophic bacteria to Syntrophomonadaceae fam. nov. on the basis of 16S rRNA sequence analyses

NASA Technical Reports Server (NTRS)

Zhao, H.; Yang, D.; Woese, C. R.; Bryant, M. P.

1993-01-01

After enrichment from Chinese rural anaerobic digestor sludge, anaerobic, sporing and nonsporing, saturated fatty acid-beta-oxidizing syntrophic bacteria were isolated as cocultures with H2- and formate-utilizing Methanospirillum hungatei or Desulfovibrio sp. strain G-11. The syntrophs degraded C4 to C8 saturated fatty acids, including isobutyrate and 2-methylbutyrate. They were adapted to grow on crotonate and were isolated as pure cultures. The crotonate-grown pure cultures alone did not grow on butyrate in either the presence or the absence of some common electron acceptors. However, when they were reconstituted with M. hungatei, growth on butyrate again occurred. In contrast, crotonate-grown Clostridium kluyveri and Clostridium sticklandii, as well as Clostridium sporogenes, failed to grow on butyrate when these organisms were cocultured with M. hungatei. The crotonate-grown pure subcultures of the syntrophs described above were subjected to 16S rRNA sequence analysis. Several previously documented fatty acid-beta-oxidizing syntrophs grown in pure cultures with crotonate were also subjected to comparative sequence analyses. The sequence analyses revealed that the new sporing and nonsporing isolates and other syntrophs that we sequenced, which had either gram-negative or gram-positive cell wall ultrastructure, all belonged to the phylogenetically gram-positive phylum. They were not closely related to any of the previously known subdivisions in the gram-positive phylum with which they were compared, but were closely related to each other, forming a new subdivision in the phylum. We recommend that this group be designated Syntrophomonadaceae fam. nov.; a description is given.
Development of a rapid and simple immunochromatographic assay to identify Vibrio parahaemolyticus.

PubMed

Sakata, Junko; Kawatsu, Kentaro; Iwasaki, Tadashi; Kumeda, Yuko

2015-09-01

To rapidly and simply determine whether or not bacterial colonies growing on agar were Vibrio parahaemolyticus, we developed an immunochromatographic assay (VP-ICA) using two different monoclonal antibodies (designated mAb-VP34 and mAb-VP109) against the delta subunit of V. parahaemolyticus-F0F1 ATP synthase. The epitopes recognized by mAb-VP34 and mAb-VP109 were mapped to sequences of eight ((47)LLTSSFSA(54)) and six amino acid residues ((16)FDFAVD(21)), respectively. An amino acid sequence similarity search of the NCBI database using BLASTP showed that both epitopic amino acid sequences were present together only in V. parahaemolyticus. When 124 V. parahaemolyticus strains and 94 strains of 27 other Vibrio species or 35 non-Vibrio species were tested using the VP-ICA, the VP-ICA identified V. parahaemolyticus with 100% accuracy. The VP-ICA rapidly and simply identified the pathogen directly from a single agar colony within 30 min, indicating that VP-ICA will greatly reduce labor and time required to identify V. parahaemolyticus compared with conventional biochemical tests. Copyright © 2015. Published by Elsevier B.V.
His-426 of the Pseudomonas aeruginosa exotoxin A is required for ADP-ribosylation of elongation factor II.

PubMed Central

Wozniak, D J; Hsu, L Y; Galloway, D R

1988-01-01

Exotoxin A (ETA) is recognized as the most toxic product associated with the opportunistic pathogen Pseudomonas aeruginosa. Identification of the amino acids in the polypeptide sequence that are required for toxin activity is critical for vaccine development. By defining the nucleotide sequence of the structural gene of a mutant that encodes an enzymatically inactive ETA (CRM 66), we identified an essential amino acid (His-426), which is involved in the ADP-ribosyltransferase activity associated with functional ETA. A monoclonal antibody that inhibits ETA enzymatic activity in vitro fails to react with ETA variants that have a His 426----Tyr substitution. Several mono-ADP-ribosylating toxins, including diphtheria and pertussis toxins, within the primary amino acid sequences carry a histidine residue that is conserved in spacing and in location with respect to other critical residues. Analysis of the three-dimensional structure of ETA revealed that His-426 is not associated with the proposed NAD+ binding site. These findings should be useful for the design and construction of toxin vaccines. Images PMID:3143111
Osteoblast-specific factor 2: cloning of a putative bone adhesion protein with homology with the insect protein fasciclin I.

PubMed Central

Takeshita, S; Kikuno, R; Tezuka, K; Amann, E

1993-01-01

A cDNA library prepared from the mouse osteoblastic cell line MC3T3-E1 was screened for the presence of specifically expressed genes by employing a combined subtraction hybridization/differential screening approach. A cDNA was identified and sequenced which encodes a protein designated osteoblast-specific factor 2 (OSF-2) comprising 811 amino acids. OSF-2 has a typical signal sequence, followed by a cysteine-rich domain, a fourfold repeated domain and a C-terminal domain. The protein lacks a typical transmembrane region. The fourfold repeated domain of OSF-2 shows homology with the insect protein fasciclin I. RNA analyses revealed that OSF-2 is expressed in bone and to a lesser extent in lung, but not in other tissues. Mouse OSF-2 cDNA was subsequently used as a probe to clone the human counterpart. Mouse and human OSF-2 show a high amino acid sequence conservation except for the signal sequence and two regions in the C-terminal domain in which 'in-frame' insertions or deletions are observed, implying alternative splicing events. On the basis of the amino acid sequence homology with fasciclin I, we suggest that OSF-2 functions as a homophilic adhesion molecule in bone formation. Images Figure 3 Figure 4 Figure 5 Figure 6 PMID:8363580
Hydrophobic and electrostatic interactions between cell penetrating peptides and plasmid DNA are important for stable non-covalent complexation and intracellular delivery.

PubMed

Upadhya, Archana; Sangave, Preeti C

2016-10-01

Cell penetrating peptides are useful tools for intracellular delivery of nucleic acids. Delivery of plasmid DNA, a large nucleic acid, poses a challenge for peptide mediated transport. The paper investigates and compares efficacy of five novel peptide designs for complexation of plasmid DNA and subsequent delivery into cells. The peptides were designed to contain reported DNA condensing agents and basic cell penetrating sequences, octa-arginine (R 8 ) and CHK 6 HC coupled to cell penetration accelerating peptides such as Bax inhibitory mutant peptide (KLPVM) and a peptide derived from the Kaposi fibroblast growth factor (kFGF) membrane translocating sequence. A tryptophan rich peptide, an analogue of Pep-3, flanked with CH 3 on either ends was also a part of the study. The peptides were analysed for plasmid DNA complexation, protection of peptide-plasmid DNA complexes against DNase I, serum components and competitive ligands by simple agarose gel electrophoresis techniques. Hemolysis of rat red blood corpuscles (RBCs) in the presence of the peptides was used as a measure of peptide cytotoxicity. Plasmid DNA delivery through the designed peptides was evaluated in two cell lines, human cervical cancer cell line (HeLa) and (NIH/3 T3) mouse embryonic fibroblasts via expression of the secreted alkaline phosphatase (SEAP) reporter gene. The importance of hydrophobic sequences in addition to cationic sequences in peptides for non-covalent plasmid DNA complexation and delivery has been illustrated. An alternative to the employment of fatty acid moieties for enhanced gene transfer has been proposed. Comparison of peptides for plasmid DNA complexation and delivery of peptide-plasmid DNA complexes to cells estimated by expression of a reporter gene, SEAP. Copyright © 2016 European Peptide Society and John Wiley & Sons, Ltd. Copyright © 2016 European Peptide Society and John Wiley & Sons, Ltd.
Statistical distribution of amino acid sequences: a proof of Darwinian evolution.

PubMed

Eitner, Krystian; Koch, Uwe; Gaweda, Tomasz; Marciniak, Jedrzej

2010-12-01

The article presents results of the listing of the quantity of amino acids, dipeptides and tripeptides for all proteins available in the UNIPROT-TREMBL database and the listing for selected species and enzymes. UNIPROT-TREMBL contains protein sequences associated with computationally generated annotations and large-scale functional characterization. Due to the distinct metabolic pathways of amino acid syntheses and their physicochemical properties, the quantities of subpeptides in proteins vary. We have proved that the distribution of amino acids, dipeptides and tripeptides is statistical which confirms that the evolutionary biodiversity development model is subject to the theory of independent events. It seems interesting that certain short peptide combinations occur relatively rarely or even not at all. First, it confirms the Darwinian theory of evolution and second, it opens up opportunities for designing pharmaceuticals among rarely represented short peptide combinations. Furthermore, an innovative approach to the mass analysis of bioinformatic data is presented. eitner@amu.edu.pl Supplementary data are available at Bioinformatics online.
Statistical theory for protein combinatorial libraries. Packing interactions, backbone flexibility, and the sequence variability of a main-chain structure.

PubMed

Kono, H; Saven, J G

2001-02-23

Combinatorial experiments provide new ways to probe the determinants of protein folding and to identify novel folding amino acid sequences. These types of experiments, however, are complicated both by enormous conformational complexity and by large numbers of possible sequences. Therefore, a quantitative computational theory would be helpful in designing and interpreting these types of experiment. Here, we present and apply a statistically based, computational approach for identifying the properties of sequences compatible with a given main-chain structure. Protein side-chain conformations are included in an atom-based fashion. Calculations are performed for a variety of similar backbone structures to identify sequence properties that are robust with respect to minor changes in main-chain structure. Rather than specific sequences, the method yields the likelihood of each of the amino acids at preselected positions in a given protein structure. The theory may be used to quantify the characteristics of sequence space for a chosen structure without explicitly tabulating sequences. To account for hydrophobic effects, we introduce an environmental energy that it is consistent with other simple hydrophobicity scales and show that it is effective for side-chain modeling. We apply the method to calculate the identity probabilities of selected positions of the immunoglobulin light chain-binding domain of protein L, for which many variant folding sequences are available. The calculations compare favorably with the experimentally observed identity probabilities.
Specificity determinants for the abscisic acid response element.

PubMed

Sarkar, Aditya Kumar; Lahiri, Ansuman

2013-01-01

Abscisic acid (ABA) response elements (ABREs) are a group of cis-acting DNA elements that have been identified from promoter analysis of many ABA-regulated genes in plants. We are interested in understanding the mechanism of binding specificity between ABREs and a class of bZIP transcription factors known as ABRE binding factors (ABFs). In this work, we have modeled the homodimeric structure of the bZIP domain of ABRE binding factor 1 from Arabidopsis thaliana (AtABF1) and studied its interaction with ACGT core motif-containing ABRE sequences. We have also examined the variation in the stability of the protein-DNA complex upon mutating ABRE sequences using the protein design algorithm FoldX. The high throughput free energy calculations successfully predicted the ability of ABF1 to bind to alternative core motifs like GCGT or AAGT and also rationalized the role of the flanking sequences in determining the specificity of the protein-DNA interaction.
Purification, characterization and molecular cloning of chymotrypsin inhibitor peptides from the venom of Burmese Daboia russelii siamensis.

PubMed

Guo, Chun-Teng; McClean, Stephen; Shaw, Chris; Rao, Ping-Fan; Ye, Ming-Yu; Bjourson, Anthony J

2013-05-01

One novel Kunitz BPTI-like peptide designated as BBPTI-1, with chymotrypsin inhibitory activity was identified from the venom of Burmese Daboia russelii siamensis. It was purified by three steps of chromatography including gel filtration, cation exchange and reversed phase. A partial N-terminal sequence of BBPTI-1, HDRPKFCYLPADPGECLAHMRSF was obtained by automated Edman degradation and a Ki value of 4.77nM determined. Cloning of BBPTI-1 including the open reading frame and 3' untranslated region was achieved from cDNA libraries derived from lyophilized venom using a 3' RACE strategy. In addition a cDNA sequence, designated as BBPTI-5, was also obtained. Alignment of cDNA sequences showed that BBPTI-5 exhibited an identical sequence to BBPTI-1 cDNA except for an eight nucleotide deletion in the open reading frame. Gene variations that represented deletions in the BBPTI-5 cDNA resulted in a novel protease inhibitor analog. Amino acid sequence alignment revealed that deduced peptides derived from cloning of their respective precursor cDNAs from libraries showed high similarity and homology with other Kunitz BPTI proteinase inhibitors. BBPTI-1 and BBPTI-5 consist of 60 and 66 amino acid residues respectively, including six conserved cysteine residues. As these peptides have been reported to have influence on the processes of coagulation, fibrinolysis and inflammation, their potential application in biomedical contexts warrants further investigation. Copyright © 2013 Elsevier Inc. All rights reserved.
Microsatellite analysis in the genome of Acanthaceae: An in silico approach.

PubMed

Kaliswamy, Priyadharsini; Vellingiri, Srividhya; Nathan, Bharathi; Selvaraj, Saravanakumar

2015-01-01

Acanthaceae is one of the advanced and specialized families with conventionally used medicinal plants. Simple sequence repeats (SSRs) play a major role as molecular markers for genome analysis and plant breeding. The microsatellites existing in the complete genome sequences would help to attain a direct role in the genome organization, recombination, gene regulation, quantitative genetic variation, and evolution of genes. The current study reports the frequency of microsatellites and appropriate markers for the Acanthaceae family genome sequences. The whole nucleotide sequences of Acanthaceae species were obtained from National Center for Biotechnology Information database and screened for the presence of SSRs. SSR Locator tool was used to predict the microsatellites and inbuilt Primer3 module was used for primer designing. Totally 110 repeats from 108 sequences of Acanthaceae family plant genomes were identified, and the occurrence of dinucleotide repeats was found to be abundant in the genome sequences. The essential amino acid isoleucine was found rich in all the sequences. We also designed the SSR-based primers/markers for 59 sequences of this family that contains microsatellite repeats in their genome. The identified microsatellites and primers might be useful for breeding and genetic studies of plants that belong to Acanthaceae family in the future.
Detection of nucleic acid sequences by invader-directed cleavage

DOEpatents

Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

1999-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based by charge.

37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

Code of Federal Regulations, 2011 CFR

2011-07-01

... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

Code of Federal Regulations, 2013 CFR

2013-07-01

... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

Code of Federal Regulations, 2012 CFR

2012-07-01

... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

Code of Federal Regulations, 2010 CFR

2010-07-01

... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

Code of Federal Regulations, 2014 CFR

2014-07-01

... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
Nucleotide sequence analysis establishes the role of endogenous murine leukemia virus DNA segments in formation of recombinant mink cell focus-forming murine leukemia viruses.

PubMed Central

Khan, A S

1984-01-01

The sequence of 363 nucleotides near the 3' end of the pol gene and 564 nucleotides from the 5' terminus of the env gene in an endogenous murine leukemia viral (MuLV) DNA segment, cloned from AKR/J mouse DNA and designated as A-12, was obtained. For comparison, the nucleotide sequence in an analogous portion of AKR mink cell focus-forming (MCF) 247 MuLV provirus was also determined. Sequence features unique to MCF247 MuLV DNA in the 3' pol and 5' env regions were identified by comparison with nucleotide sequences in analogous regions of NFS -Th-1 xenotropic and AKR ecotropic MuLV proviruses. These included (i) an insertion of 12 base pairs encoding four amino acids located 60 base pairs from the 3' terminus of the pol gene and immediately preceding the env gene, (ii) the deletion of 12 base pairs (encoding four amino acids) and the insertion of 3 base pairs (encoding one amino acid) in the 5' portion of the env gene, and (iii) single base substitutions resulting in 2 MCF247 -specific amino acids in the 3' pol and 23 in the 5' env regions. Nucleotide sequence comparison involving the 3' pol and 5' env regions of AKR MCF247 , NFS xenotropic, and AKR ecotropic MuLV proviruses with the cloned endogenous MuLV DNA indicated that MCF247 proviral DNA sequences were conserved in the cloned endogenous MuLV proviral segment. In fact, total nucleotide sequence identity existed between the endogenous MuLV DNA and the MCF247 MuLV provirus in the 3' portion of the pol gene. In the 5' env region, only 4 of 564 nucleotides were different, resulting in three amino acid changes between AKR MCF247 MuLV DNA and the endogenous MuLV DNA present in clone A-12. In addition, nucleotide sequence comparison indicated that Moloney-and Friend-MCF MuLVs were also highly related in the 3' pol and 5' env regions to the cloned endogenous MuLV DNA. These results establish the role of endogenous MuLV DNA segments in generation of recombinant MCF viruses. PMID:6328017
What can we learn about lyssavirus genomes using 454 sequencing?

PubMed

Höper, Dirk; Finke, Stefan; Freuling, Conrad M; Hoffmann, Bernd; Beer, Martin

2012-01-01

The main task of the individual project number four"Whole genome sequencing, virus-host adaptation, and molecular epidemiological analyses of lyssaviruses "within the network" Lyssaviruses--a potential re-emerging public health threat" is to provide high quality complete genome sequences from lyssaviruses. These sequences are analysed in-depth with regard to the diversity of the viral populations as to both quasi-species and so-called defective interfering RNAs. Moreover, the sequence data will facilitate further epidemiological analyses, will provide insight into the evolution of lyssaviruses and will be the basis for the design of novel nucleic acid based diagnostics. The first results presented here indicate that not only high quality full-length lyssavirus genome sequences can be generated, but indeed efficient analysis of the viral population gets feasible.
The alpha-fetoprotein third domain receptor binding fragment: in search of scavenger and associated receptor targets.

PubMed

Mizejewski, G J

2015-01-01

Recent studies have demonstrated that the carboxyterminal third domain of alpha-fetoprotein (AFP-CD) binds with various ligands and receptors. Reports within the last decade have established that AFP-CD contains a large fragment of amino acids that interact with several different receptor types. Using computer software specifically designed to identify protein-to-protein interaction at amino acid sequence docking sites, the computer searches identified several types of scavenger-associated receptors and their amino acid sequence locations on the AFP-CD polypeptide chain. The scavenger receptors (SRs) identified were CD36, CD163, Stabilin, SSC5D, SRB1 and SREC; the SR-associated receptors included the mannose, low-density lipoprotein receptors, the asialoglycoprotein receptor, and the receptor for advanced glycation endproducts (RAGE). Interestingly, some SR interaction sites were localized on the AFP-derived Growth Inhibitory Peptide (GIP) segment at amino acids #480-500. Following the detection studies, a structural subdomain analysis of both the receptor and the AFP-CD revealed the presence of epidermal growth factor (EGF) repeats, extracellular matrix-like protein regions, amino acid-rich motifs and dimerization subdomains. For the first time, it was reported that EGF-like sequence repeats were identified on each of the three domains of AFP. Thereafter, the localization of receptors on specific cell types were reviewed and their functions were discussed.
Design and construction of 2A peptide-linked multicistronic vectors.

PubMed

Szymczak-Workman, Andrea L; Vignali, Kate M; Vignali, Dario A A

2012-02-01

The need for reliable, multicistronic vectors for multigene delivery is at the forefront of biomedical technology. This article describes the design and construction of 2A peptide-linked multicistronic vectors, which can be used to express multiple proteins from a single open reading frame (ORF). The small 2A peptide sequences, when cloned between genes, allow for efficient, stoichiometric production of discrete protein products within a single vector through a novel "cleavage" event within the 2A peptide sequence. Expression of more than two genes using conventional approaches has several limitations, most notably imbalanced protein expression and large size. The use of 2A peptide sequences alleviates these concerns. They are small (18-22 amino acids) and have divergent amino-terminal sequences, which minimizes the chance for homologous recombination and allows for multiple, different 2A peptide sequences to be used within a single vector. Importantly, separation of genes placed between 2A peptide sequences is nearly 100%, which allows for stoichiometric and concordant expression of the genes, regardless of the order of placement within the vector.
Dynamic peptide libraries for the discovery of supramolecular nanomaterials

NASA Astrophysics Data System (ADS)

Pappas, Charalampos G.; Shafi, Ramim; Sasselli, Ivan R.; Siccardi, Henry; Wang, Tong; Narang, Vishal; Abzalimov, Rinat; Wijerathne, Nadeesha; Ulijn, Rein V.

2016-11-01

Sequence-specific polymers, such as oligonucleotides and peptides, can be used as building blocks for functional supramolecular nanomaterials. The design and selection of suitable self-assembling sequences is, however, challenging because of the vast combinatorial space available. Here we report a methodology that allows the peptide sequence space to be searched for self-assembling structures. In this approach, unprotected homo- and heterodipeptides (including aromatic, aliphatic, polar and charged amino acids) are subjected to continuous enzymatic condensation, hydrolysis and sequence exchange to create a dynamic combinatorial peptide library. The free-energy change associated with the assembly process itself gives rise to selective amplification of self-assembling candidates. By changing the environmental conditions during the selection process, different sequences and consequent nanoscale morphologies are selected.
Dynamic peptide libraries for the discovery of supramolecular nanomaterials.

PubMed

Pappas, Charalampos G; Shafi, Ramim; Sasselli, Ivan R; Siccardi, Henry; Wang, Tong; Narang, Vishal; Abzalimov, Rinat; Wijerathne, Nadeesha; Ulijn, Rein V

2016-11-01

Sequence-specific polymers, such as oligonucleotides and peptides, can be used as building blocks for functional supramolecular nanomaterials. The design and selection of suitable self-assembling sequences is, however, challenging because of the vast combinatorial space available. Here we report a methodology that allows the peptide sequence space to be searched for self-assembling structures. In this approach, unprotected homo- and heterodipeptides (including aromatic, aliphatic, polar and charged amino acids) are subjected to continuous enzymatic condensation, hydrolysis and sequence exchange to create a dynamic combinatorial peptide library. The free-energy change associated with the assembly process itself gives rise to selective amplification of self-assembling candidates. By changing the environmental conditions during the selection process, different sequences and consequent nanoscale morphologies are selected.
Properties and cDNA cloning of antihemorrhagic factors in sera of Chinese and Japanese mamushi (Gloydius blomhoffi).

PubMed

Aoki, Narumi; Tsutsumi, Kadzuyo; Deshimaru, Masanobu; Terada, Shigeyuki

2008-02-01

An antihemorrhagic protein has been isolated from the serum of Chinese mamushi (Gloydius blomhoffi brevicaudus) by using a combination of ethanol precipitation and a reverse-phase high-performance liquid chromatography (HPLC) on a C8 column. This protein-designated Chinese mamushi serum factor (cMSF)-suppressed mamushi venom-induced hemorrhage in a dose-dependent manner. It had no effect on trypsin, chymotrypsin, thermolysin, and papain but inhibited the proteinase activities of several snake venom metalloproteinases (SVMPs) including hemorrhagic enzymes isolated from the venoms of mamushi and habu (Trimeresurus flavoviridis). A similar protein (Japanese MSF, jMSF) with antihemorrhagic activity has also been purified from the sera of Japanese mamushi (G. blomhoffi). The N-terminal 70 and 51 residues of the intact cMSF and jMSF were directly analyzed; a similarity between the sequences of two MSFs to that of antihemorrhagic protein (HSF) from habu serum was noticed. To obtain the complete amino acid sequences of MSFs, cDNAs encoding these proteins were cloned from the liver mRNA of Chinese and Japanese vipers based on their N-terminal amino acid sequences. The mature forms of both MSFs consisted of 305 amino acids with a 19-residue signal sequence, and a unique 17-residue deletion was detected in their His-rich domains.
A novel cry2Ab gene from the indigenous isolate Bacillus thuringiensis subsp. kurstaki.

PubMed

Sevim, Ali; Eryüzlü, Emine; Demirbağ, Zihni; Demir, Ismail

2012-01-01

A novel cry2Ab gene was cloned and sequenced from the indigenous isolate of Bacillus thuringiensis subsp. kurstaki. This gene was designated as cry2Ab25 and its sequence revealed an open reading frame of 1,902 bp encoding a 633 aa protein with calculated molecular mass of 70 kDa and pI value of 8.98. The amino acid sequence of the Cry2Ab25 protein was compared with previously known Cry2Ab toxins, and the phylogenetic relationships among them were determined. The deduced amino acid sequence of the Cry2Ab25 protein showed 99% homology to the known Cry2Ab proteins, except for Cry2Ab10 and Cry2Ab12 with 97% homology, and a variation in one amino acid residue in comparison with all known Cry2Ab proteins. The cry2Ab25 gene was expressed in Escherichia coli BL21(DE3) cells. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) revealed that the Cry2Ab25 protein is about 70 kDa. The toxin expressed in BL21(DE3) exhibited high toxicity against Malacosoma neustria and Rhagoletis cerasi with 73% and 75% mortality after 5 days of treatment, respectively.
Polymerase Spiral Reaction (PSR): A novel isothermal nucleic acid amplification method.

PubMed

Liu, Wei; Dong, Derong; Yang, Zhan; Zou, Dayang; Chen, Zeliang; Yuan, Jing; Huang, Liuyu

2015-07-29

In this study, we report a novel isothermal nucleic acid amplification method only requires one pair of primers and one enzyme, termed Polymerase Spiral Reaction (PSR) with high specificity, efficiency, and rapidity under isothermal condition. The recombinant plasmid of blaNDM-1 was imported to Escherichia coli BL21, and selected as the microbial target. PSR method employs a Bst DNA polymerase and a pair of primers designed targeting the blaNDM-1 gene sequence. The forward and reverse Tab primer sequences are reverse to each other at their 5' end (Nr and N), whereas their 3' end sequences are complementary to their respective target nucleic acid sequences. The PSR method was performed at a constant temperature 61 °C-65 °C, yielding a complicated spiral structure. PSR assay was monitored continuously in a real-time turbidimeter instrument or visually detected with the aid of a fluorescent dye (SYBR Greenı), and could be finished within 1 h with a high accumulation of 10(9) copies of the target and a fine sensitivity of 6 CFU per reaction. Clinical evaluation was also conducted using PSR, showing high specificity of this method. The PSR technique provides a convenient and cost-effective alternative for clinical screening, on-site diagnosis and primary quarantine purposes.
A peptidomimetic with a chiral switch is an inhibitor of epidermal growth factor receptor heterodimerization

PubMed Central

Kanthala, Shanthi P.; Liu, Yong-Yu; Singh, Sitanshu; Sable, Rushikesh; Pallerla, Sandeep; Jois, Seetharama D.

2017-01-01

Among different types of EGFR dimers, EGFR-HER2 and HER2-HER3 are well known in different types of cancers. Targeting dimerization of EGFR will have a significant impact on cancer therapies. A symmetric peptidomimetic was designed to inhibit the protein-protein interaction of EGFR. The peptidomimetic (Cyclo(1,10)PpR (R) Anapa-FDDF-(R)-Anapa)R, compound 18) was shown to exhibit antiproliferative activity with an IC50 of 194 nM in HER2-expressing breast cancer cell lines and 18 nM in lung cancer cell lines. The peptidomimetic has a Pro-Pro sequence in the structure to stabilize the β-turn and a β-amino acid, amino napthyl propionic acid. To investigate the effect of the chirality of β-amino acid on the structure of the peptide and its antiproliferative activity, diastereoisomers of compound 18 were designed and synthesized. Structure-activity relationships of these compounds indicated that there is a chiral switch at β-amino acid in the designed compound. The peptidomimetic with R configuration at β-amino acid and with a L-Pro-D-Pro sequence was the most active compound (18). Using enzyme complement fragmentation assay and proximity ligation assay, we show that compound 18 inhibits HER2:HER3 and EGFR:HER2 dimerization. Surface plasmon resonance studies suggested that compound 18 binds to the HER2 extracellular domain and in particular to domain IV. The anticancer activity of compound 18 was evaluated using a xenograft model of breast cancer in mice; compound 18 suppressed the tumor growth in mice compared to control. Compound 18 was also shown to have a synergistic effect with erlotinib on EGFR mutated lung cancer cell lines. PMID:29088782
77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

Federal Register 2010, 2011, 2012, 2013, 2014

2012-10-29

... DEPARTMENT OF COMMERCE Patent and Trademark Office Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request... Patent applications that contain nucleotide and/or amino acid sequence disclosures must include a copy of...
Fluorescent probes for nucleic Acid visualization in fixed and live cells.

PubMed

Boutorine, Alexandre S; Novopashina, Darya S; Krasheninina, Olga A; Nozeret, Karine; Venyaminova, Alya G

2013-12-11

This review analyses the literature concerning non-fluorescent and fluorescent probes for nucleic acid imaging in fixed and living cells from the point of view of their suitability for imaging intracellular native RNA and DNA. Attention is mainly paid to fluorescent probes for fluorescence microscopy imaging. Requirements for the target-binding part and the fluorophore making up the probe are formulated. In the case of native double-stranded DNA, structure-specific and sequence-specific probes are discussed. Among the latest, three classes of dsDNA-targeting molecules are described: (i) sequence-specific peptides and proteins; (ii) triplex-forming oligonucleotides and (iii) polyamide oligo(N-methylpyrrole/N-methylimidazole) minor groove binders. Polyamides seem to be the most promising targeting agents for fluorescent probe design, however, some technical problems remain to be solved, such as the relatively low sequence specificity and the high background fluorescence inside the cells. Several examples of fluorescent probe applications for DNA imaging in fixed and living cells are cited. In the case of intracellular RNA, only modified oligonucleotides can provide such sequence-specific imaging. Several approaches for designing fluorescent probes are considered: linear fluorescent probes based on modified oligonucleotide analogs, molecular beacons, binary fluorescent probes and template-directed reactions with fluorescence probe formation, FRET donor-acceptor pairs, pyrene excimers, aptamers and others. The suitability of all these methods for living cell applications is discussed.
Some properties and cDNA cloning of proteinaceous toxins from two species of lionfish (Pterois antennata and Pterois volitans).

PubMed

Kiriake, Aya; Shiomi, Kazuo

2011-11-01

Lionfish, members of the genera Pterois, Parapterois and Dendrochirus, are well known to be venomous, having venomous glandular tissues in dorsal, pelvic and anal spines. The lionfish toxins have been shown to cross-react with the stonefish toxins by neutralization tests using the commercial stonefish antivenom, although their chemical properties including structures have been little characterized. In this study, an antiserum against neoverrucotoxin, the stonefish Synanceia verrucosa toxin, was first raised in a guinea pig and used in immunoblotting and inhibition immunoblotting to confirm that two species of Pterois lionfish (P. antennata and P. volitans) contain a 75kDa protein (corresponding to the toxin subunit) cross-reacting with neoverrucotoxin. Then, the amino acid sequences of the P. antennata and P. volitans toxins were successfully determined by cDNA cloning using primers designed from the highly conserved sequences of the stonefish toxins. Notably, either α-subunits (699 amino acid residues) or β-subunits (698 amino acid residues) of the P. antennata and P. volitans toxins share as high as 99% sequence identity with each other. Furthermore, both α- and β-subunits of the lionfish toxins exhibit high sequence identity (70-80% identity) with each other and also with the β-subunits of the stonefish toxins. As reported for the stonefish toxins, the lionfish toxins also contain a B30.2/SPRY domain (comprising nearly 200 amino acid residues) in the C-terminal region of each subunit. Copyright © 2011 Elsevier Ltd. All rights reserved.
Cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor L.; Brow, Mary Ann D.; Dahlberg, James E.

2007-12-11

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Invasive cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.

1999-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.

Invasive cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.

2002-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow; Mary Ann D.; Dahlberg, James E.

2010-11-09

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.

2000-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Nucleic acid detection assays

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann; Dahlberg, James E.

2005-04-05

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Geodermatophilus sabuli sp. nov., a γ-radiation-resistant actinobacterium isolated from desert limestone.

PubMed

Hezbri, Karima; Ghodhbane-Gtari, Faten; Montero-Calasanz, Maria del Carmen; Sghaier, Haïtham; Rohde, Manfred; Schumann, Peter; Klenk, Hans-Peter; Gtari, Maher

2015-10-01

A novel γ-radiation-resistant and Gram-staining-positive actinobacterium designated BMG 8133T was isolated from a limestone collected in the Sahara desert of Tunisia. The strain produced dry, pale-pink colonies with an optimum growth at 35–40 °C and pH 6.5–8.0. Chemotaxonomic and molecular characteristics of the isolate matched those described for members of the genus Geodermatophilus. The peptidoglycan contained meso-diaminopimelic acid as diagnostic diamino acid. The main polar lipids were phosphatidylcholine, diphosphatidylglycerol, phosphatidylinositol, phosphatidylethanolamine and one unspecified glycolipid. MK-9(H4) was the dominant menaquinone. Galactose and glucose were detected as diagnostic sugars. The major cellular fatty acids were branched-chain saturated acids iso-C16 : 0 and iso-C15 : 0. The DNA G+C content of the novel strain was 74.5 %. The 16S rRNA gene sequence showed highest sequence identity with Geodermatophilus ruber (98.3 %). Based on phenotypic results and 16S rRNA gene sequence analysis, strain BMG 8133T is proposed to represent a novel species, Geodermatophilus sabuli sp. nov. The type strain is BMG 8133T ( = DSM 46844T = CECT 8820T).
Endo-β-1,3-Glucanase GLU1, from the Fruiting Body of Lentinula edodes, Belongs to a New Glycoside Hydrolase Family ▿ †

PubMed Central

Sakamoto, Yuichi; Nakade, Keiko; Konno, Naotake

2011-01-01

The cell wall of the fruiting body of the mushroom Lentinula edodes is degraded after harvesting by enzymes such as β-1,3-glucanase. In this study, a novel endo-type β-1,3-glucanase, GLU1, was purified from L. edodes fruiting bodies after harvesting. The gene encoding it, glu1, was isolated by rapid amplification of cDNA ends (RACE)-PCR using primers designed from the N-terminal amino acid sequence of GLU1. The putative amino acid sequence of the mature protein contained 247 amino acid residues with a molecular mass of 26 kDa and a pI of 3.87, and recombinant GLU1 expressed in Pichia pastoris exhibited β-1,3-glucanase activity. GLU1 catalyzed depolymerization of glucans composed of β-1,3-linked main chains, and reaction product analysis by thin-layer chromatography (TLC) clearly indicated that the enzyme had an endolytic mode. However, the amino acid sequence of GLU1 showed no significant similarity to known glycoside hydrolases. GLU1 has similarity to several hypothetical proteins in fungi, and GLU1 and highly similar proteins should be classified as a novel glycoside hydrolase family (GH128). PMID:21965406
MUFOLD-SS: New deep inception-inside-inception networks for protein secondary structure prediction.

PubMed

Fang, Chao; Shang, Yi; Xu, Dong

2018-05-01

Protein secondary structure prediction can provide important information for protein 3D structure prediction and protein functions. Deep learning offers a new opportunity to significantly improve prediction accuracy. In this article, a new deep neural network architecture, named the Deep inception-inside-inception (Deep3I) network, is proposed for protein secondary structure prediction and implemented as a software tool MUFOLD-SS. The input to MUFOLD-SS is a carefully designed feature matrix corresponding to the primary amino acid sequence of a protein, which consists of a rich set of information derived from individual amino acid, as well as the context of the protein sequence. Specifically, the feature matrix is a composition of physio-chemical properties of amino acids, PSI-BLAST profile, and HHBlits profile. MUFOLD-SS is composed of a sequence of nested inception modules and maps the input matrix to either eight states or three states of secondary structures. The architecture of MUFOLD-SS enables effective processing of local and global interactions between amino acids in making accurate prediction. In extensive experiments on multiple datasets, MUFOLD-SS outperformed the best existing methods and other deep neural networks significantly. MUFold-SS can be downloaded from http://dslsrv8.cs.missouri.edu/~cf797/MUFoldSS/download.html. © 2018 Wiley Periodicals, Inc.
Intact Protein Analysis at 21 Tesla and X-Ray Crystallography Define Structural Differences in Single Amino Acid Variants of Human Mitochondrial Branched-Chain Amino Acid Aminotransferase 2 (BCAT2)

NASA Astrophysics Data System (ADS)

Anderson, Lissa C.; Håkansson, Maria; Walse, Björn; Nilsson, Carol L.

2017-09-01

Structural technologies are an essential component in the design of precision therapeutics. Precision medicine entails the development of therapeutics directed toward a designated target protein, with the goal to deliver the right drug to the right patient at the right time. In the field of oncology, protein structural variants are often associated with oncogenic potential. In a previous proteogenomic screen of patient-derived glioblastoma (GBM) tumor materials, we identified a sequence variant of human mitochondrial branched-chain amino acid aminotransferase 2 as a putative factor of resistance of GBM to standard-of-care-treatments. The enzyme generates glutamate, which is neurotoxic. To elucidate structural coordinates that may confer altered substrate binding or activity of the variant BCAT2 T186R, a 45 kDa protein, we applied combined ETD and CID top-down mass spectrometry in a LC-FT-ICR MS at 21 T, and X-Ray crystallography in the study of both the variant and non-variant intact proteins. The combined ETD/CID fragmentation pattern allowed for not only extensive sequence coverage but also confident localization of the amino acid variant to its position in the sequence. The crystallographic experiments confirmed the hypothesis generated by in silico structural homology modeling, that the Lys59 side-chain of BCAT2 may repulse the Arg186 in the variant protein (PDB code: 5MPR), leading to destabilization of the protein dimer and altered enzyme kinetics. Taken together, the MS and novel 3D structural data give us reason to further pursue BCAT2 T186R as a precision drug target in GBM. [Figure not available: see fulltext.
CYP98A6 from Lithospermum erythrorhizon encodes 4-coumaroyl-4'-hydroxyphenyllactic acid 3-hydroxylase involved in rosmarinic acid biosynthesis.

PubMed

Matsuno, Michiyo; Nagatsu, Akito; Ogihara, Yukio; Ellis, Brian E; Mizukami, Hajime

2002-03-13

Rosmarinic acid is the dominant hydroxycinnamic acid ester accumulated in Boraginaceae and Lamiaceae plants. A cytochrome P450 cDNA was isolated by differential display from cultured cells of Lithospermum erythrorhizon, and the gene product was designated CYP98A6 based on the deduced amino acid sequence. After expression in yeast, the P450 was shown to catalyze the 3-hydroxylation of 4-coumaroyl-4'-hydroxyphenyllactic acid, one of the final two steps leading to rosmarinic acid. The expression level of CYP98A6 is dramatically increased by addition of yeast extract or methyl jasmonate to L. erythrorhizon cells, and its expression pattern reflected the elicitor-induced change in rosmarinic acid production, indicating that CYP98A6 plays an important role in regulation of rosmarinic acid biosynthesis.
Method for nucleic acid hybridization using single-stranded DNA binding protein

DOEpatents

Tabor, Stanley; Richardson, Charles C.

1996-01-01

Method of nucleic acid hybridization for detecting the presence of a specific nucleic acid sequence in a population of different nucleic acid sequences using a nucleic acid probe. The nucleic acid probe hybridizes with the specific nucleic acid sequence but not with other nucleic acid sequences in the population. The method includes contacting a sample (potentially including the nucleic acid sequence) with the nucleic acid probe under hybridizing conditions in the presence of a single-stranded DNA binding protein provided in an amount which stimulates renaturation of a dilute solution (i.e., one in which the t.sub.1/2 of renaturation is longer than 3 weeks) of single-stranded DNA greater than 500 fold (i.e., to a t.sub.1/2 less than 60 min, preferably less than 5 min, and most preferably about 1 min.) in the absence of nucleotide triphosphates.
Molecular characterization of a novel orthomyxovirus from rainbow and steelhead trout (Oncorhynchus mykiss)

USGS Publications Warehouse

Batts, William N.; LaPatra, Scott E.; Katona, Ryan; Leis, Eric; Fei Fan Ng, Terry; Bruieuc, Marine S.O.; Breyta, Rachel; Purcell, Maureen; Waltzek, Thomas B.; Delwart, Eric; Winton, James

2017-01-01

A novel virus, rainbow trout orthomyxovirus (RbtOV), was isolated in 1997 and again in 2000 from commercially-reared rainbow trout (Oncorhynchus mykiss) in Idaho, USA. The virus grew optimally in the CHSE-214 cell line at 15°C producing a diffuse cytopathic effect; however, juvenile rainbow trout exposed to cell culture-grown virus showed no mortality or gross pathology. Electron microscopy of preparations from infected cell cultures revealed the presence of typical orthomyxovirus particles. The complete genome of RbtOV is comprised of eight linear segments of single-stranded, negative-sense RNA having highly conserved 5′ and 3′-terminal nucleotide sequences. Another virus isolated in 2014 from steelhead trout (also O. mykiss) in Wisconsin, USA, and designated SttOV was found to have eight genome segments with high amino acid sequence identities (89–99%) to the corresponding genes of RbtOV, suggesting these new viruses are isolates of the same virus species and may be more widespread than currently realized. The new isolates had the same genome segment order and the closest pairwise amino acid sequence identities of 16–42% with Infectious salmon anemia virus (ISAV), the type species and currently only member of the genus Isavirus in the family Orthomyxoviridae. However, pairwise comparisons of the predicted amino acid sequences of the 10 RbtOV and SttOV proteins with orthologs from representatives of the established orthomyxoviral genera and a phylogenetic analysis using the PB1 protein showed that while RbtOV and SttOV clustered most closely with ISAV, they diverged sufficiently to merit consideration as representatives of a novel genus. A set of PCR primers was designed using conserved regions of the PB1 gene to produce amplicons that may be sequenced for identification of similar fish orthomyxoviruses in the future.
Sequence quality analysis tool for HIV type 1 protease and reverse transcriptase.

PubMed

Delong, Allison K; Wu, Mingham; Bennett, Diane; Parkin, Neil; Wu, Zhijin; Hogan, Joseph W; Kantor, Rami

2012-08-01

Access to antiretroviral therapy is increasing globally and drug resistance evolution is anticipated. Currently, protease (PR) and reverse transcriptase (RT) sequence generation is increasing, including the use of in-house sequencing assays, and quality assessment prior to sequence analysis is essential. We created a computational HIV PR/RT Sequence Quality Analysis Tool (SQUAT) that runs in the R statistical environment. Sequence quality thresholds are calculated from a large dataset (46,802 PR and 44,432 RT sequences) from the published literature ( http://hivdb.Stanford.edu ). Nucleic acid sequences are read into SQUAT, identified, aligned, and translated. Nucleic acid sequences are flagged if with >five 1-2-base insertions; >one 3-base insertion; >one deletion; >six PR or >18 RT ambiguous bases; >three consecutive PR or >four RT nucleic acid mutations; >zero stop codons; >three PR or >six RT ambiguous amino acids; >three consecutive PR or >four RT amino acid mutations; >zero unique amino acids; or <0.5% or >15% genetic distance from another submitted sequence. Thresholds are user modifiable. SQUAT output includes a summary report with detailed comments for troubleshooting of flagged sequences, histograms of pairwise genetic distances, neighbor joining phylogenetic trees, and aligned nucleic and amino acid sequences. SQUAT is a stand-alone, free, web-independent tool to ensure use of high-quality HIV PR/RT sequences in interpretation and reporting of drug resistance, while increasing awareness and expertise and facilitating troubleshooting of potentially problematic sequences.
Shotgun Protein Sequencing with Meta-contig Assembly*

PubMed Central

Guthals, Adrian; Clauser, Karl R.; Bandeira, Nuno

2012-01-01

Full-length de novo sequencing from tandem mass (MS/MS) spectra of unknown proteins such as antibodies or proteins from organisms with unsequenced genomes remains a challenging open problem. Conventional algorithms designed to individually sequence each MS/MS spectrum are limited by incomplete peptide fragmentation or low signal to noise ratios and tend to result in short de novo sequences at low sequencing accuracy. Our shotgun protein sequencing (SPS) approach was developed to ameliorate these limitations by first finding groups of unidentified spectra from the same peptides (contigs) and then deriving a consensus de novo sequence for each assembled set of spectra (contig sequences). But whereas SPS enables much more accurate reconstruction of de novo sequences longer than can be recovered from individual MS/MS spectra, it still requires error-tolerant matching to homologous proteins to group smaller contig sequences into full-length protein sequences, thus limiting its effectiveness on sequences from poorly annotated proteins. Using low and high resolution CID and high resolution HCD MS/MS spectra, we address this limitation with a Meta-SPS algorithm designed to overlap and further assemble SPS contigs into Meta-SPS de novo contig sequences extending as long as 100 amino acids at over 97% accuracy without requiring any knowledge of homologous protein sequences. We demonstrate Meta-SPS using distinct MS/MS data sets obtained with separate enzymatic digestions and discuss how the remaining de novo sequencing limitations relate to MS/MS acquisition settings. PMID:22798278
Shotgun protein sequencing with meta-contig assembly.

PubMed

Guthals, Adrian; Clauser, Karl R; Bandeira, Nuno

2012-10-01

Full-length de novo sequencing from tandem mass (MS/MS) spectra of unknown proteins such as antibodies or proteins from organisms with unsequenced genomes remains a challenging open problem. Conventional algorithms designed to individually sequence each MS/MS spectrum are limited by incomplete peptide fragmentation or low signal to noise ratios and tend to result in short de novo sequences at low sequencing accuracy. Our shotgun protein sequencing (SPS) approach was developed to ameliorate these limitations by first finding groups of unidentified spectra from the same peptides (contigs) and then deriving a consensus de novo sequence for each assembled set of spectra (contig sequences). But whereas SPS enables much more accurate reconstruction of de novo sequences longer than can be recovered from individual MS/MS spectra, it still requires error-tolerant matching to homologous proteins to group smaller contig sequences into full-length protein sequences, thus limiting its effectiveness on sequences from poorly annotated proteins. Using low and high resolution CID and high resolution HCD MS/MS spectra, we address this limitation with a Meta-SPS algorithm designed to overlap and further assemble SPS contigs into Meta-SPS de novo contig sequences extending as long as 100 amino acids at over 97% accuracy without requiring any knowledge of homologous protein sequences. We demonstrate Meta-SPS using distinct MS/MS data sets obtained with separate enzymatic digestions and discuss how the remaining de novo sequencing limitations relate to MS/MS acquisition settings.
Glutamate cysteine ligase (GCL) in the freshwater bivalve Unio tumidus: impact of storage conditions and seasons on activity and identification of partial coding sequence of the catalytic subunit.

PubMed

Coffinet, Stéphanie; Cossu-Leguille, Carole; Rodius, François; Vasseur, Paule

2008-09-01

Glutamate cysteine ligase (GCL; EC 6.3.2.2) is the first enzyme involved in the synthesis of glutathione. A HPLC method with fluorimetric detection was used to measure GCL activity in the gills and the digestive gland of the freshwater bivalve, Unio tumidus. Storage conditions were optimized in order to prevent decrease of GCL activity and consisted in freezing the cytosolic fraction in the presence of protease (1 mM phenylmethylsulfonic fluoric acid) and gamma-glutamyltranspeptidase (1 mM L-serine borate mixture and 0.5 mM acivicin) inhibitors. Seasonal variations of activity in the digestive gland and to a lesser extent in the gills were found with activity increasing in spring compared to winter. No sex differences were revealed. The GCL coding sequence was identified using degenerated primers designed in the highly conserved regions of the catalytic subunit of GCL. The partial sequence identified encoded for 121 amino acids. The comparison of the identified partial coding sequence of U. tumidus with those available from vertebrates and invertebrates indicated that GCL sequence was highly conserved.
Partial nucleotide sequences, and routine typing by polymerase chain reaction-restriction fragment length polymorphism, of the brown trout (Salmo trutta) lactate dehydrogenase, LDH-C1*90 and *100 alleles.

PubMed

McMeel, O M; Hoey, E M; Ferguson, A

2001-01-01

The cDNA nucleotide sequences of the lactate dehydrogenase alleles LDH-C1*90 and *100 of brown trout (Salmo trutta) were found to differ at position 308 where an A is present in the *100 allele but a G is present in the *90 allele. This base substitution results in an amino acid change from aspartic acid at position 82 in the LDH-C1 100 allozyme to a glycine in the 90 allozyme. Since aspartic acid has a net negative charge whilst glycine is uncharged, this is consistent with the electrophoretic observation that the LDH-C1 100 allozyme has a more anodal mobility relative to the LDH-C1 90 allozyme. Based on alignment of the cDNA sequence with the mouse genomic sequence, a local primer set was designed, incorporating the variable position, and was found to give very good amplification with brown trout genomic DNA. Sequencing of this fragment confirmed the difference in both homozygous and heterozygous individuals. Digestion of the polymerase chain reaction products with BslI, a restriction enzyme specific for the site difference, gave one, two and three fragments for the two homozygotes and the heterozygote, respectively, following electrophoretic separation. This provides a DNA-based means of routine screening of the highly informative LDH-C1* polymorphism in brown trout population genetic studies. Primer sets presented could be used to sequence cDNA of other LDH* genes of brown trout and other species.
Characterization of gonadotrophin-releasing hormone precursor cDNA in the Old World mole-rat Cryptomys hottentotus pretoriae: high degree of identity with the New World guinea pig sequence.

PubMed

Kalamatianos, T; du Toit, L; Hrabovszky, E; Kalló, I; Marsh, P J; Bennett, N C; Coen, C W

2005-05-01

Regulation of pituitary gonadotrophins by the decapeptide gonadotrophin-releasing hormone 1 (GnRH1) is crucial for the development and maintenance of reproductive functions. A common amino acid sequence for this decapeptide, designated as 'mammalian' GnRH, has been identified in all mammals thus far investigated with the exception of the guinea pig, in which there are two amino acid substitutions. Among hystricognath rodents, the members of the family Bathyergidae regulate reproduction in response to diverse cues. Thus, highveld mole-rats (Cryptomys hottentotus pretoriae) are social bathyergids in which breeding is restricted to a particular season in the dominant female, but continuously suppressed in subordinate colony members. Elucidation of reproductive control in these animals will be facilitated by characterization of their GnRH1 gene. A partial sequence of GnRH1 precursor cDNA was isolated and characterized. Comparative analysis revealed the highest degree of identity (86%) to guinea pig GnRH1 precursor mRNA. Nevertheless, the deduced amino acid sequence of the mole-rat decapeptide is identical to the 'mammalian' sequence rather than that of guinea pigs. Successful detection of GnRH1-synthesizing neurones using either a guinea pig GnRH1 riboprobe or an antibody against the 'mammalian' decapeptide is consistent with the guinea pig-like sequence for the precursor and the classic 'mammalian' form for the decapeptide. The high degree of identity in the GnRH1 precursor sequence between this Old World mole-rat and the New World guinea pig is consistent with the theory that caviomorphs and phiomorphs originated from a common ancestral line in the Palaeocene to mid Eocene, some 63-45 million years ago.
Identification of Delta5-fatty acid desaturase from the cellular slime mold dictyostelium discoideum.

PubMed

Saito, T; Ochiai, H

1999-10-01

cDNA fragments putatively encoding amino acid sequences characteristic of the fatty acid desaturase were obtained using expressed sequence tag (EST) information of the Dictyostelium cDNA project. Using this sequence, we have determined the cDNA sequence and genomic sequence of a desaturase. The cloned cDNA is 1489 nucleotides long and the deduced amino acid sequence comprised 464 amino acid residues containing an N-terminal cytochrome b5 domain. The whole sequence was 38.6% identical to the initially identified Delta5-desaturase of Mortierella alpina. We have confirmed its function as Delta5-desaturase by over expression mutation in D. discoideum and also the gain of function mutation in the yeast Saccharomyces cerevisiae. Analysis of the lipids from transformed D. discoideum and yeast demonstrated the accumulation of Delta5-desaturated products. This is the first report concering fatty acid desaturase in cellular slime molds.
Whole-genome characterization of a Peruvian alpaca rotavirus isolate expressing a novel VP4 genotype.

PubMed

Rojas, Miguel; Gonçalves, Jorge Luiz S; Dias, Helver G; Manchego, Alberto; Pezo, Danilo; Santos, Norma

2016-11-30

The SA44 isolate of Rotavirus A (RVA) was identified from a neonatal Peruvian alpaca presenting with diarrhea, and the full-length genome sequence of the isolate (designated RVA/Alpaca-tc/PER/SA44/2014/G3P[40]) was determined. Phylogenetic analyses showed that the isolate possessed the genotype constellation G3-P[40]-I8-R3-C3-M3-A9-N3-T3-E3-H6, which differs considerably from those of RVA strains isolated from other species of the order Artiodactyla. Overall, the genetic constellation of the SA44 strain was quite similar to those of RVA strains isolated from a bat in Asia (MSLH14 and MYAS33). Nonetheless, phylogenetic analyses of each genome segment identified a distinct combination of genes. Several sequences were closely related to corresponding gene sequences in RVA strains from other species, including human (VP1, VP2, NSP1, and NSP2), simian (VP3 and NSP5), bat (VP6 and NSP4), and equine (NSP3). The VP7 gene sequence was closely related to RVA strains from a Peruvian alpaca (K'ayra/3368-10; 99.0% nucleotide and 99.7% amino acid identity) and from humans (RCH272; 95% nucleotide and 99.0% amino acid identity). The nucleotide sequence of the VP4 gene was distantly related to other VP4 sequences and was designated as the reference strain for the new P[40] genotype. This unique genetic makeup suggests that the SA44 strain emerged from multiple reassortment events between bat-, equine-, and human-like RVA strains. Copyright © 2016 Elsevier B.V. All rights reserved.
Promoter sequence of 3-phosphoglycerate kinase gene 1 of lactic acid-producing fungus rhizopus oryzae and a method of expressing a gene of interest in fungal species

DOEpatents

Gao, Johnway [Richland, WA; Skeen, Rodney S [Pendleton, OR

2002-10-15

The present invention provides the promoter clone discovery of phosphoglycerate kinase gene 1 of a lactic acid-producing filamentous fungal strain, Rhizopus oryzae. The isolated promoter can constitutively regulate gene expression under various carbohydrate conditions. In addition, the present invention also provides a design of an integration vector for the transformation of a foreign gene in Rhizopus oryzae.

Promoter sequence of 3-phosphoglycerate kinase gene 2 of lactic acid-producing fungus rhizopus oryzae and a method of expressing a gene of interest in fungal species

DOEpatents

Gao, Johnway [Richland, WA; Skeen, Rodney S [Pendleton, OR

2003-03-04

The present invention provides the promoter clone discovery of phosphoglycerate kinase gene 2 of a lactic acid-producing filamentous fungal strain, Rhizopus oryzae. The isolated promoter can constitutively regulate gene expression under various carbohydrate conditions. In addition, the present invention also provides a design of an integration vector for the transformation of a foreign gene in Rhizopus oryzae.
MRPrimer: a MapReduce-based method for the thorough design of valid and ranked primers for PCR.

PubMed

Kim, Hyerin; Kang, NaNa; Chon, Kang-Wook; Kim, Seonho; Lee, NaHye; Koo, JaeHyung; Kim, Min-Soo

2015-11-16

Primer design is a fundamental technique that is widely used for polymerase chain reaction (PCR). Although many methods have been proposed for primer design, they require a great deal of manual effort to generate feasible and valid primers, including homology tests on off-target sequences using BLAST-like tools. That approach is inconvenient for many target sequences of quantitative PCR (qPCR) due to considering the same stringent and allele-invariant constraints. To address this issue, we propose an entirely new method called MRPrimer that can design all feasible and valid primer pairs existing in a DNA database at once, while simultaneously checking a multitude of filtering constraints and validating primer specificity. Furthermore, MRPrimer suggests the best primer pair for each target sequence, based on a ranking method. Through qPCR analysis using 343 primer pairs and the corresponding sequencing and comparative analyses, we showed that the primer pairs designed by MRPrimer are very stable and effective for qPCR. In addition, MRPrimer is computationally efficient and scalable and therefore useful for quickly constructing an entire collection of feasible and valid primers for frequently updated databases like RefSeq. Furthermore, we suggest that MRPrimer can be utilized conveniently for experiments requiring primer design, especially real-time qPCR. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
DNA–DNA kissing complexes as a new tool for the assembly of DNA nanostructures

PubMed Central

Barth, Anna; Kobbe, Daniela; Focke, Manfred

2016-01-01

Kissing-loop annealing of nucleic acids occurs in nature in several viruses and in prokaryotic replication, among other circumstances. Nucleobases of two nucleic acid strands (loops) interact with each other, although the two strands cannot wrap around each other completely because of the adjacent double-stranded regions (stems). In this study, we exploited DNA kissing-loop interaction for nanotechnological application. We functionalized the vertices of DNA tetrahedrons with DNA stem-loop sequences. The complementary loop sequence design allowed the hybridization of different tetrahedrons via kissing-loop interaction, which might be further exploited for nanotechnology applications like cargo transport and logical elements. Importantly, we were able to manipulate the stability of those kissing-loop complexes based on the choice and concentration of cations, the temperature and the number of complementary loops per tetrahedron either at the same or at different vertices. Moreover, variations in loop sequences allowed the characterization of necessary sequences within the loop as well as additional stability control of the kissing complexes. Therefore, the properties of the presented nanostructures make them an important tool for DNA nanotechnology. PMID:26773051
A new ALF from Litopenaeus vannamei and its SNPs related to WSSV resistance

NASA Astrophysics Data System (ADS)

Liu, Jingwen; Yu, Yang; Li, Fuhua; Zhang, Xiaojun; Xiang, Jianhai

2014-11-01

Anti-lipopolysaccharide factors (ALFs) are basic components of the crustacean immune system that defend against a range of pathogens. The cDNA sequence of a new ALF, designated nLvALF2, with an open reading frame encoding 132 amino acids was cloned. Its deduced amino acid sequence contained the conserved functional domain of ALFs, the LPS binding domain (LBD). Its genomic sequence consisted of three exons and four introns. nLvALF2 was mainly expressed in the Oka organ and gills of shrimps. The transcriptional level of nLvALF2 increased significantly after white spot syndrome virus (WSSV) infection, suggesting its important roles in protecting shrimps from WSSV. Single nucleotide polymorphisms (SNPs) were found in the genomic sequence of nLvALF2, of which 38 were analyzed for associations with the susceptibility/resistance of shrimps to WSSV. The loci g.2422 A>G, g.2466 T>C, and g.2529 G>A were significantly associated with the resistance to WSSV ( P<0.05). These SNP loci could be developed as markers for selection of WSSV-resistant varieties of Litopenaeus vannamei.
Molecular Cloning and Sequence Analysis of a Phenylalanine Ammonia-Lyase Gene from Dendrobium

PubMed Central

Cai, Yongping; Lin, Yi

2013-01-01

In this study, a phenylalanine ammonia-lyase (PAL) gene was cloned from Dendrobium candidum using homology cloning and RACE. The full-length sequence and catalytic active sites that appear in PAL proteins of Arabidopsis thaliana and Nicotiana tabacum are also found: PAL cDNA of D. candidum (designated Dc-PAL1, GenBank No. JQ765748) has 2,458 bps and contains a complete open reading frame (ORF) of 2,142 bps, which encodes 713 amino acid residues. The amino acid sequence of DcPAL1 has more than 80% sequence identity with the PAL genes of other plants, as indicated by multiple alignments. The dominant sites and catalytic active sites, which are similar to that showing in PAL proteins of Arabidopsis thaliana and Nicotiana tabacum, are also found in DcPAL1. Phylogenetic tree analysis revealed that DcPAL is more closely related to PALs from orchidaceae plants than to those of other plants. The differential expression patterns of PAL in protocorm-like body, leaf, stem, and root, suggest that the PAL gene performs multiple physiological functions in Dendrobium candidum. PMID:23638048
Composition for nucleic acid sequencing

DOEpatents

Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

2008-08-26

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Method for sequencing nucleic acid molecules

DOEpatents

Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

2006-06-06

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Method for sequencing nucleic acid molecules

DOEpatents

Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

2006-05-30

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Dipeptide Sequence Determination: Analyzing Phenylthiohydantoin Amino Acids by HPLC

NASA Astrophysics Data System (ADS)

Barton, Janice S.; Tang, Chung-Fei; Reed, Steven S.

2000-02-01

Amino acid composition and sequence determination, important techniques for characterizing peptides and proteins, are essential for predicting conformation and studying sequence alignment. This experiment presents improved, fundamental methods of sequence analysis for an upper-division biochemistry laboratory. Working in pairs, students use the Edman reagent to prepare phenylthiohydantoin derivatives of amino acids for determination of the sequence of an unknown dipeptide. With a single HPLC technique, students identify both the N-terminal amino acid and the composition of the dipeptide. This method yields good precision of retention times and allows use of a broad range of amino acids as components of the dipeptide. Students learn fundamental principles and techniques of sequence analysis and HPLC.
[Completed sequences analysis on the Chinese attenuated yellow fever 17D vaccine strain and the WHO standard yellow fever 17D vaccine strain].

PubMed

Li, Jing; Yu, Yong-Xin; Dong, Guan-Mu

2009-04-01

To compare the molecular characteristics of the Chinese attenuated yellow fever 17D vaccine strain and the WHO reference yellow fever 17D vaccine strain. The primers were designed according to the published nucleotide sequences of YFV 17D strains in GenBank. Total RNA of was extracted by the Trizol and reverse transcripted. The each fragments of the YFV genome were amplified by PCR and sequenced subsequently. The fragments of the 5' and 3' end of the two strains were cloned into the pGEM T-easy vector and then sequenced. The nucleotide acid and amino acid sequences of the homology to both strains were 99% with each other. No obvious nulceotide changes were found in the sequences of the entire genome of each 17D strains. Moreover, there was no obvious changes in the E protein genes. But the E173 of YF17D Tiantan, associted with the virulence, had mutantions. And the two live attenuated yellow fever 17D vaccine strains fell to the same lineage by the phylogenetic analysis. The results indicated that the two attenuated yellow fever 17D vaccine viruses accumulates mutations at a very low frequency and the genomes were relative stable.
Sequence similarity is more relevant than species specificity in probabilistic backtranslation.

PubMed

Ferro, Alfredo; Giugno, Rosalba; Pigola, Giuseppe; Pulvirenti, Alfredo; Di Pietro, Cinzia; Purrello, Michele; Ragusa, Marco

2007-02-21

Backtranslation is the process of decoding a sequence of amino acids into the corresponding codons. All synthetic gene design systems include a backtranslation module. The degeneracy of the genetic code makes backtranslation potentially ambiguous since most amino acids are encoded by multiple codons. The common approach to overcome this difficulty is based on imitation of codon usage within the target species. This paper describes EasyBack, a new parameter-free, fully-automated software for backtranslation using Hidden Markov Models. EasyBack is not based on imitation of codon usage within the target species, but instead uses a sequence-similarity criterion. The model is trained with a set of proteins with known cDNA coding sequences, constructed from the input protein by querying the NCBI databases with BLAST. Unlike existing software, the proposed method allows the quality of prediction to be estimated. When tested on a group of proteins that show different degrees of sequence conservation, EasyBack outperforms other published methods in terms of precision. The prediction quality of a protein backtranslation methis markedly increased by replacing the criterion of most used codon in the same species with a Hidden Markov Model trained with a set of most similar sequences from all species. Moreover, the proposed method allows the quality of prediction to be estimated probabilistically.
A newly constructed primer pair for the PCR amplification, cloning and sequencing of the flagellin (flaA) gene from isolatesof urease-negative Campylobacter lari.

PubMed

Sekizuka, Tsuyoshi; Yokoi, Taeko; Murayama, Ohoshi; Millar, B Cherie; Moore, Johne; Matsuda, Motoo

2005-08-01

A newly constructed primer pair (lari-Af/lari-Ar) designed to generate a product of the flagellin (flaA) gene for urease-negative Campylobacter lari produced a PCR amplicon of about 1700 bp for 16 isolates from 7 seagulls, 5 humans, 3 food animals and one mussel in Japan and Northern Ireland. Nucleotide sequencing and alignments of the flaA amplicons from these isolates demonstrated that the deduced amino acid sequences of the possible open reading frame were 564-572 amino acid residues in length with calculated molecular weights of 58,804 to 59,463. The deduced amino acid sequence similarity analysis strongly suggested that the ORF of the flaA from the 16 isolates showed 70-75% sequence similarities to those of Campylobacter jejuni isolates. The approximate Mr of the flagellin purified from some of the isolates of urease-negative C. lari was estimated to range from 59.6 to 61.8 kDa. Thus, flagellin from the isolates of urease-negative C. lari was shown for the first time to have a molecular size similar to those of C. jejuni and Campylobacter coli isolates, but to be different from the shorter flaA and smaller flagellin of urease-positive thermophilic Campylobacter (UPTC) isolates. Flagellins from C. lari spp., consisting of the two representative taxa of urease-negative C. lari and UPTC, thus show genotypic and phenotypic diversity.
Isolation and characterization of full-length putative alcohol dehydrogenase genes from polygonum minus

NASA Astrophysics Data System (ADS)

Hamid, Nur Athirah Abd; Ismail, Ismanizan

2013-11-01

Polygonum minus, locally named as Kesum is an aromatic herb which is high in secondary metabolite content. Alcohol dehydrogenase is an important enzyme that catalyzes the reversible oxidation of alcohol and aldehyde with the presence of NAD(P)(H) as co-factor. The main focus of this research is to identify the gene of ADH. The total RNA was extracted from leaves of P. minus which was treated with 150 μM Jasmonic acid. Full-length cDNA sequence of ADH was isolated via rapid amplification cDNA end (RACE). Subsequently, in silico analysis was conducted on the full-length cDNA sequence and PCR was done on genomic DNA to determine the exon and intron organization. Two sequences of ADH, designated as PmADH1 and PmADH2 were successfully isolated. Both sequences have ORF of 801 bp which encode 266 aa residues. Nucleotide sequence comparison of PmADH1 and PmADH2 indicated that both sequences are highly similar at the ORF region but divergent in the 3' untranslated regions (UTR). The amino acid is differ at the 107 residue; PmADH1 contains Gly (G) residue while PmADH2 contains Cys (C) residue. The intron-exon organization pattern of both sequences are also same, with 3 introns and 4 exons. Based on in silico analysis, both sequences contain "classical" short chain alcohol dehydrogenases/reductases ((c) SDRs) conserved domain. The results suggest that both sequences are the members of short chain alcohol dehydrogenase family.
Kit for detecting nucleic acid sequences using competitive hybridization probes

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

2001-01-01

A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the target sequence.
Tn5401, a new class II transposable element from Bacillus thuringiensis.

PubMed Central

Baum, J A

1994-01-01

A new class II (Tn3-like) transposable element, designated Tn5401, was recovered from a sporulation-deficient variant of Bacillus thuringiensis subsp. morrisoni EG2158 following its insertion into a recombinant plasmid. Sequence analysis of the insert revealed a 4,837-bp transposon with two large open reading frames, in the same orientation, encoding proteins of 36 kDa (306 residues) and 116 kDa (1,005 residues) and 53-bp terminal inverted repeats. The deduced amino acid sequence for the 36-kDa protein shows 24% sequence identity with the TnpI recombinase of the B. thuringiensis transposon Tn4430, a member of the phage integrase family of site-specific recombinases. The deduced amino acid sequence for the 116-kDa protein shows 42% sequence identity with the transposase of Tn3 but only 28% identity with the TnpA transposase of Tn4430. Two small open reading frames of unknown function, designated orf1 (85 residues) and orf2 (74 residues), were also identified. Southern blot analysis indicated that Tn5401, in contrast to Tn4430, is not commonly found among different subspecies of B. thuringiensis and is not typically associated with known insecticidal crystal protein genes. Transposition was studied with B. thuringiensis by using plasmid pEG922, a temperature-sensitive shuttle vector containing Tn5401. Tn5401 transposed to both chromosomal and plasmid target sites but displayed an apparent preference for plasmid sites. Transposition was replicative and resulted in the generation of a 5-bp duplication at the target site. Transcriptional start sites within Tn5401 were mapped by primer extension analysis. Two promoters, designated PL and PR, direct the transcription of orf1-orf2 and tnpI-tnpA, respectively, and are negatively regulated by TnpI. Sequence comparison of the promoter regions of Tn5401 and Tn4430 suggests that the conserved sequence element ATGTCCRCTAAY mediates TnpI binding and cointegrate resolution. The same element is contained within the 53-bp terminal inverted repeats, thus accounting for their unusual lengths and suggesting an additional role for TnpI in regulating Tn5401 transposition. Images PMID:7514590
Effect of sequence and stereochemistry reversal on p53 peptide mimicry.

PubMed

Atzori, Alessio; Baker, Audrey E; Chiu, Mark; Bryce, Richard A; Bonnet, Pascal

2013-01-01

Peptidomimetics effective in modulating protein-protein interactions and resistant to proteolysis have potential in therapeutic applications. An appealing yet underperforming peptidomimetic strategy is to employ D-amino acids and reversed sequences to mimic a lead peptide conformation, either separately or as the combined retro-inverso peptide. In this work, we examine the conformations of inverse, reverse and retro-inverso peptides of p53(15-29) using implicit solvent molecular dynamics simulation and circular dichroism spectroscopy. In order to obtain converged ensembles for the peptides, we find enhanced sampling is required via the replica exchange molecular dynamics method. From these replica exchange simulations, the D-peptide analogues of p53(15-29) result in a predominantly left-handed helical conformation. When the parent sequence is reversed sequence as either the L-peptide and D-peptide, these peptides display a greater helical propensity, feature reflected by NMR and CD studies in TFE/water solvent. The simulations also indicate that, while approximately similar orientations of the side-chains are possible by the peptide analogues, their ability to mimic the parent peptide is severely compromised by backbone orientation (for D-amino acids) and side-chain orientation (for reversed sequences). A retro-inverso peptide is disadvantaged as a mimic in both aspects, and further chemical modification is required to enable this concept to be used fruitfully in peptidomimetic design. The replica exchange molecular simulation approach adopted here, with its ability to provide detailed conformational insights into modified peptides, has potential as a tool to guide structure-based design of new improved peptidomimetics.
An Evolution-Based Approach to De Novo Protein Design and Case Study on Mycobacterium tuberculosis

PubMed Central

Brender, Jeffrey R.; Czajka, Jeff; Marsh, David; Gray, Felicia; Cierpicki, Tomasz; Zhang, Yang

2013-01-01

Computational protein design is a reverse procedure of protein folding and structure prediction, where constructing structures from evolutionarily related proteins has been demonstrated to be the most reliable method for protein 3-dimensional structure prediction. Following this spirit, we developed a novel method to design new protein sequences based on evolutionarily related protein families. For a given target structure, a set of proteins having similar fold are identified from the PDB library by structural alignments. A structural profile is then constructed from the protein templates and used to guide the conformational search of amino acid sequence space, where physicochemical packing is accommodated by single-sequence based solvation, torsion angle, and secondary structure predictions. The method was tested on a computational folding experiment based on a large set of 87 protein structures covering different fold classes, which showed that the evolution-based design significantly enhances the foldability and biological functionality of the designed sequences compared to the traditional physics-based force field methods. Without using homologous proteins, the designed sequences can be folded with an average root-mean-square-deviation of 2.1 Å to the target. As a case study, the method is extended to redesign all 243 structurally resolved proteins in the pathogenic bacteria Mycobacterium tuberculosis, which is the second leading cause of death from infectious disease. On a smaller scale, five sequences were randomly selected from the design pool and subjected to experimental validation. The results showed that all the designed proteins are soluble with distinct secondary structure and three have well ordered tertiary structure, as demonstrated by circular dichroism and NMR spectroscopy. Together, these results demonstrate a new avenue in computational protein design that uses knowledge of evolutionary conservation from protein structural families to engineer new protein molecules of improved fold stability and biological functionality. PMID:24204234
Chip-based sequencing nucleic acids

DOEpatents

Beer, Neil Reginald

2014-08-26

A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.
"De-novo" amino acid sequence elucidation of protein G'e by combined "top-down" and "bottom-up" mass spectrometry.

PubMed

Yefremova, Yelena; Al-Majdoub, Mahmoud; Opuni, Kwabena F M; Koy, Cornelia; Cui, Weidong; Yan, Yuetian; Gross, Michael L; Glocker, Michael O

2015-03-01

Mass spectrometric de-novo sequencing was applied to review the amino acid sequence of a commercially available recombinant protein G´ with great scientific and economic importance. Substantial deviations to the published amino acid sequence (Uniprot Q54181) were found by the presence of 46 additional amino acids at the N-terminus, including a so-called "His-tag" as well as an N-terminal partial α-N-gluconoylation and α-N-phosphogluconoylation, respectively. The unexpected amino acid sequence of the commercial protein G' comprised 241 amino acids and resulted in a molecular mass of 25,998.9 ± 0.2 Da for the unmodified protein. Due to the higher mass that is caused by its extended amino acid sequence compared with the original protein G' (185 amino acids), we named this protein "protein G'e." By means of mass spectrometric peptide mapping, the suggested amino acid sequence, as well as the N-terminal partial α-N-gluconoylations, was confirmed with 100% sequence coverage. After the protein G'e sequence was determined, we were able to determine the expression vector pET-28b from Novagen with the Xho I restriction enzyme cleavage site as the best option that was used for cloning and expressing the recombinant protein G'e in E. coli. A dissociation constant (K(d)) value of 9.4 nM for protein G'e was determined thermophoretically, showing that the N-terminal flanking sequence extension did not cause significant changes in the binding affinity to immunoglobulins.
Microsatellite analysis in the genome of Acanthaceae: An in silico approach

PubMed Central

Kaliswamy, Priyadharsini; Vellingiri, Srividhya; Nathan, Bharathi; Selvaraj, Saravanakumar

2015-01-01

Background: Acanthaceae is one of the advanced and specialized families with conventionally used medicinal plants. Simple sequence repeats (SSRs) play a major role as molecular markers for genome analysis and plant breeding. The microsatellites existing in the complete genome sequences would help to attain a direct role in the genome organization, recombination, gene regulation, quantitative genetic variation, and evolution of genes. Objective: The current study reports the frequency of microsatellites and appropriate markers for the Acanthaceae family genome sequences. Materials and Methods: The whole nucleotide sequences of Acanthaceae species were obtained from National Center for Biotechnology Information database and screened for the presence of SSRs. SSR Locator tool was used to predict the microsatellites and inbuilt Primer3 module was used for primer designing. Results: Totally 110 repeats from 108 sequences of Acanthaceae family plant genomes were identified, and the occurrence of dinucleotide repeats was found to be abundant in the genome sequences. The essential amino acid isoleucine was found rich in all the sequences. We also designed the SSR-based primers/markers for 59 sequences of this family that contains microsatellite repeats in their genome. Conclusion: The identified microsatellites and primers might be useful for breeding and genetic studies of plants that belong to Acanthaceae family in the future. PMID:25709226

Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

PubMed Central

Thomsen, Martin Christen Frølund; Nielsen, Morten

2012-01-01

Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed). PMID:22638583
Detection of the High-Level Aminoglycoside Resistance Gene aph(2")-Ib in Enterococcus faecium

PubMed Central

Kao, Susan J.; You, Il; Clewell, Don B.; Donabedian, Susan M.; Zervos, Marcus J.; Petrin, Joanne; Shaw, Karen J.; Chow, Joseph W.

2000-01-01

A new high-level gentamicin resistance gene, designated aph(2")-Ib, was cloned from Enterococcus faecium SF11770. The deduced amino acid sequence of the 897-bp open reading frame of aph(2")-Ib shares homology with the aminoglycoside-modifying enzymes AAC(6′)-APH(2"), APH(2")-Ic, and APH(2")-Id. The observed phosphotransferase activity is designated APH(2")-Ib. PMID:10991878
Isolation and characterization of NBS-LRR- resistance gene candidates in turmeric (Curcuma longa cv. surama).

PubMed

Joshi, R K; Mohanty, S; Subudhi, E; Nayak, S

2010-09-08

Turmeric (Curcuma longa), an important asexually reproducing spice crop of the family Zingiberaceae is highly susceptible to bacterial and fungal pathogens. The identification of resistance gene analogs holds great promise for development of resistant turmeric cultivars. Degenerate primers designed based on known resistance genes (R-genes) were used in combinations to elucidate resistance gene analogs from Curcuma longa cultivar surama. The three primers resulted in amplicons with expected sizes of 450-600 bp. The nucleotide sequence of these amplicons was obtained through sequencing; their predicted amino acid sequences compared to each other and to the amino acid sequences of known R-genes revealed significant sequence similarity. The finding of conserved domains, viz., kinase-1a, kinase-2 and hydrophobic motif, provided evidence that the sequences belong to the NBS-LRR class gene family. The presence of tryptophan as the last residue of kinase-2 motif further qualified them to be in the non-TIR-NBS-LRR subfamily of resistance genes. A cluster analysis based on the neighbor-joining method was carried out using Curcuma NBS analogs together with several resistance gene analogs and known R-genes, which classified them into two distinct subclasses, corresponding to clades N3 and N4 of non-TIR-NBS sequences described in plants. The NBS analogs that we isolated can be used as guidelines to eventually isolate numerous R-genes in turmeric.
Genome analysis and identification of gelatinase encoded gene in Enterobacter aerogenes

NASA Astrophysics Data System (ADS)

Shahimi, Safiyyah; Mutalib, Sahilah Abdul; Khalid, Rozida Abdul; Repin, Rul Aisyah Mat; Lamri, Mohd Fadly; Bakar, Mohd Faizal Abu; Isa, Mohd Noor Mat

2016-11-01

In this study, bioinformatic analysis towards genome sequence of E. aerogenes was done to determine gene encoded for gelatinase. Enterobacter aerogenes was isolated from hot spring water and gelatinase species-specific bacterium to porcine and fish gelatin. This bacterium offers the possibility of enzymes production which is specific to both species gelatine, respectively. Enterobacter aerogenes was partially genome sequenced resulting in 5.0 mega basepair (Mbp) total size of sequence. From pre-process pipeline, 87.6 Mbp of total reads, 68.8 Mbp of total high quality reads and 78.58 percent of high quality percentage was determined. Genome assembly produced 120 contigs with 67.5% of contigs over 1 kilo base pair (kbp), 124856 bp of N50 contig length and 55.17 % of GC base content percentage. About 4705 protein gene was identified from protein prediction analysis. Two candidate genes selected have highest similarity identity percentage against gelatinase enzyme available in Swiss-Prot and NCBI online database. They were NODE_9_length_26866_cov_148.013245_12 containing 1029 base pair (bp) sequence with 342 amino acid sequence and NODE_24_length_155103_cov_177.082458_62 which containing 717 bp sequence with 238 amino acid sequence, respectively. Thus, two paired of primers (forward and reverse) were designed, based on the open reading frame (ORF) of selected genes. Genome analysis of E. aerogenes resulting genes encoded gelatinase were identified.
A novel endo-beta-1,3-glucanase, BGN13.1, involved in the mycoparasitism of Trichoderma harzianum.

PubMed Central

de la Cruz, J; Pintor-Toro, J A; Benítez, T; Llobell, A; Romero, L C

1995-01-01

The mycoparasitic fungus Trichoderma harzianum CECT 2413 produces at least three extracellular beta-1,3-glucanases. The most basic of these extracellular enzymes, named BGN13.1, was expressed when either fungal cell wall polymers or autoclaved mycelia from different fungi were used as the carbon source. BGN13.1 was purified to electrophoretic homogeneity and was biochemically characterized. The enzyme was specific for beta-1,3 linkages and has an endolytic mode of action. A synthetic oligonucleotide primer based on the sequence of an internal peptide was designed to clone the cDNA corresponding to BGN13.1. The deduced amino acid sequence predicted a molecular mass of 78 kDa for the mature protein. Analysis of the amino acid sequence indicates that the enzyme contains three regions, one N-terminal leader sequence; another, nondefined sequence; and one cysteine-rich C-terminal sequence. Sequence comparison shows that this beta-1,3-glucanase, first described for filamentous fungi, belongs to a family different from that of its previously described bacterial, yeast, and plant counterparts. Enzymatic-activity, protein, and mRNA data indicated that bgn13.1 is repressed by glucose and induced by either fungal cell wall polymers or autoclaved yeast cells and mycelia. Finally, experimental evidence showed that the enzyme hydrolyzes yeast and fungal cell walls. PMID:7592488
Purification, characterization and sequence analysis of Omp50,a new porin isolated from Campylobacter jejuni.

PubMed Central

Bolla, J M; Dé, E; Dorez, A; Pagès, J M

2000-01-01

A novel pore-forming protein identified in Campylobacter was purified by ion-exchange chromatography and named Omp50 according to both its molecular mass and its outer membrane localization. We observed a pore-forming ability of Omp50 after re-incorporation into artificial membranes. The protein induced cation-selective channels with major conductance values of 50-60 pS in 1 M NaCl. N-terminal sequencing allowed us to identify the predicted coding sequence Cj1170c from the Campylobacter jejuni genome database as the corresponding gene in the NCTC 11168 genome sequence. The gene, designated omp50, consists of a 1425 bp open reading frame encoding a deduced 453-amino acid protein with a calculated pI of 5.81 and a molecular mass of 51169.2 Da. The protein possessed a 20-amino acid leader sequence. No significant similarity was found between Omp50 and porin protein sequences already determined. Moreover, the protein showed only weak sequence identity with the major outer-membrane protein (MOMP) of Campylobacter, correlating with the absence of antigenic cross-reactivity between these two proteins. Omp50 is expressed in C. jejuni and Campylobacter lari but not in Campylobacter coli. The gene, however, was detected in all three species by PCR. According to its conformation and functional properties, the protein would belong to the family of outer-membrane monomeric porins. PMID:11104668
RNAiFold 2.0: a web server and software to design custom and Rfam-based RNA molecules.

PubMed

Garcia-Martin, Juan Antonio; Dotu, Ivan; Clote, Peter

2015-07-01

Several algorithms for RNA inverse folding have been used to design synthetic riboswitches, ribozymes and thermoswitches, whose activity has been experimentally validated. The RNAiFold software is unique among approaches for inverse folding in that (exhaustive) constraint programming is used instead of heuristic methods. For that reason, RNAiFold can generate all sequences that fold into the target structure or determine that there is no solution. RNAiFold 2.0 is a complete overhaul of RNAiFold 1.0, rewritten from the now defunct COMET language to C++. The new code properly extends the capabilities of its predecessor by providing a user-friendly pipeline to design synthetic constructs having the functionality of given Rfam families. In addition, the new software supports amino acid constraints, even for proteins translated in different reading frames from overlapping coding sequences; moreover, structure compatibility/incompatibility constraints have been expanded. With these features, RNAiFold 2.0 allows the user to design single RNA molecules as well as hybridization complexes of two RNA molecules. the web server, source code and linux binaries are publicly accessible at http://bioinformatics.bc.edu/clotelab/RNAiFold2.0. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Bacterial collagen-like proteins that form triple-helical structures

PubMed Central

Yu, Zhuoxin; An, Bo; Ramshaw, John A.M.; Brodsky, Barbara

2014-01-01

A large number of collagen-like proteins have been identified in bacteria during the past ten years, principally from analysis of genome databases. These bacterial collagens share the distinctive Gly-Xaa-Yaa repeating amino acid sequence of animal collagens which underlies their unique triple-helical structure. A number of the bacterial collagens have been expressed in E. coli, and they all adopt a triple-helix conformation. Unlike animal collagens, these bacterial proteins do not contain the post-translationally modified amino acid, hydroxyproline, which is known to stabilize the triple-helix structure and may promote self-assembly. Despite the absence of collagen hydroxylation, the triple-helix structures of the bacterial collagens studied exhibit a high thermal stability of 35–39 °C, close to that seen for mammalian collagens. These bacterial collagens are readily produced in large quantities by recombinant methods, either in the original amino acid sequence or in genetically manipulated sequences. This new family of recombinant, easy to modify collagens could provide a novel system for investigating structural and functional motifs in animal collagens and could also form the basis of new biomedical materials with designed structural properties and functions. PMID:24434612
Cloning of the Escherichia coli endo-1,4-D-glucanase gene and identification of its product.

PubMed

Park, Y W; Yun, H D

1999-03-01

A plasmid (pYP17) containing a genomic DNA insert from Escherichia coli K-12 that confers the ability to hydrolyze carboxymethylcellulose (CMC) was isolated from a genomic library constructed in the cosmid vector pLAFR3 in E. coli DH5alpha. A small 1.65-kb fragment, designated bcsC (pYP300), was sequenced and found to contain an ORF of 1,104 bp encoding a protein of 368 amino acid residues, with a calculated molecular weight of 41,700 Da. BcsC carries a typical prokaryotic signal peptide of 21 amino acid residues. The predicted amino acid sequence of the BcsC protein is similar to that of CelY of Erwinia chrysanthemi, CMCase of Cellulomonas uda, EngX of Acetobacter xylinum, and CelC of Agrobacterium tumefaciens. Based on these sequence similarities, we propose that the bcsC gene is a member of glycosyl hydrolase family 8. The apparent molecular mass of the protein, when expressed in E. coli, is approximately 40 kDa, and the CMCase activity is found mainly in the extracellular space. The enzyme is optimally active at pH 7 and a temperature of 40 degrees C.
Comparative characterization of random-sequence proteins consisting of 5, 12, and 20 kinds of amino acids

PubMed Central

Tanaka, Junko; Doi, Nobuhide; Takashima, Hideaki; Yanagawa, Hiroshi

2010-01-01

Screening of functional proteins from a random-sequence library has been used to evolve novel proteins in the field of evolutionary protein engineering. However, random-sequence proteins consisting of the 20 natural amino acids tend to aggregate, and the occurrence rate of functional proteins in a random-sequence library is low. From the viewpoint of the origin of life, it has been proposed that primordial proteins consisted of a limited set of amino acids that could have been abundantly formed early during chemical evolution. We have previously found that members of a random-sequence protein library constructed with five primitive amino acids show high solubility (Doi et al., Protein Eng Des Sel 2005;18:279–284). Although such a library is expected to be appropriate for finding functional proteins, the functionality may be limited, because they have no positively charged amino acid. Here, we constructed three libraries of 120-amino acid, random-sequence proteins using alphabets of 5, 12, and 20 amino acids by preselection using mRNA display (to eliminate sequences containing stop codons and frameshifts) and characterized and compared the structural properties of random-sequence proteins arbitrarily chosen from these libraries. We found that random-sequence proteins constructed with the 12-member alphabet (including five primitive amino acids and positively charged amino acids) have higher solubility than those constructed with the 20-member alphabet, though other biophysical properties are very similar in the two libraries. Thus, a library of moderate complexity constructed from 12 amino acids may be a more appropriate resource for functional screening than one constructed from 20 amino acids. PMID:20162614
Optimized smith waterman processor design for breast cancer early diagnosis

NASA Astrophysics Data System (ADS)

Nurdin, D. S.; Isa, M. N.; Ismail, R. C.; Ahmad, M. I.

2017-09-01

This paper presents an optimized design of Processing Element (PE) of Systolic Array (SA) which implements affine gap penalty Smith Waterman (SW) algorithm on the Xilinx Virtex-6 XC6VLX75T Field Programmable Gate Array (FPGA) for Deoxyribonucleic Acid (DNA) sequence alignment. The PE optimization aims to reduce PE logic resources to increase number of PEs in FPGA for higher degree of parallelism during alignment matrix computations. This is useful for aligning long DNA-based disease sequence such as Breast Cancer (BC) for early diagnosis. The optimized PE architecture has the smallest PE area with 15 slices in a PE and 776 PEs implemented in the Virtex - 6 FPGA.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Reiser, Steven E.; Somerville, Chris R.

The present invention relates to bacterial enzymes, in particular to an acyl-CoA reductase and a gene encoding an acyl-CoA reductase, the amino acid and nucleic acid sequences corresponding to the reductase polypeptide and gene, respectively, and to methods of obtaining such enzymes, amino acid sequences and nucleic acid sequences. The invention also relates to the use of such sequences to provide transgenic host cells capable of producing fatty alcohols and fatty aldehydes.
Development of designed site-directed pseudopeptide-peptido-mimetic immunogens as novel minimal subunit-vaccine candidates for malaria.

PubMed

Lozano, José Manuel; Lesmes, Liliana P; Carreño, Luisa F; Gallego, Gina M; Patarroyo, Manuel Elkin

2010-12-06

Synthetic vaccines constitute the most promising tools for controlling and preventing infectious diseases. When synthetic immunogens are designed from the pathogen native sequences, these are normally poorly immunogenic and do not induce protection, as demonstrated in our research. After attempting many synthetic strategies for improving the immunogenicity properties of these sequences, the approach consisting of identifying high binding motifs present in those, and then performing specific changes on amino-acids belonging to such motifs, has proven to be a workable strategy. In addition, other strategies consisting of chemically introducing non-natural constraints to the backbone topology of the molecule and modifying the α-carbon asymmetry are becoming valuable tools to be considered in this pursuit. Non-natural structural constraints to the peptide backbone can be achieved by introducing peptide bond isosters such as reduced amides, partially retro or retro-inverso modifications or even including urea motifs. The second can be obtained by strategically replacing L-amino-acids with their enantiomeric forms for obtaining both structurally site-directed designed immunogens as potential vaccine candidates and their Ig structural molecular images, both having immuno-therapeutic effects for preventing and controlling malaria.
PNA-COMBO-FISH: From combinatorial probe design in silico to vitality compatible, specific labelling of gene targets in cell nuclei.

PubMed

Müller, Patrick; Rößler, Jens; Schwarz-Finsterle, Jutta; Schmitt, Eberhard; Hausmann, Michael

2016-07-01

Recently, advantages concerning targeting specificity of PCR constructed oligonucleotide FISH probes in contrast to established FISH probes, e.g. BAC clones, have been demonstrated. These techniques, however, are still using labelling protocols with DNA denaturing steps applying harsh heat treatment with or without further denaturing chemical agents. COMBO-FISH (COMBinatorial Oligonucleotide FISH) allows the design of specific oligonucleotide probe combinations in silico. Thus, being independent from primer libraries or PCR laboratory conditions, the probe sequences extracted by computer sequence data base search can also be synthesized as single stranded PNA-probes (Peptide Nucleic Acid probes) or TINA-DNA (Twisted Intercalating Nucleic Acids). Gene targets can be specifically labelled with at least about 20 probes obtaining visibly background free specimens. By using appropriately designed triplex forming oligonucleotides, the denaturing procedures can completely be omitted. These results reveal a significant step towards oligonucleotide-FISH maintaining the 3d-nanostructure and even the viability of the cell target. The method is demonstrated with the detection of Her2/neu and GRB7 genes, which are indicators in breast cancer diagnosis and therapy. Copyright © 2016. Published by Elsevier Inc.
Novel rod-shaped viruses isolated from garlic, Allium sativum, possessing a unique genome organization.

PubMed

Sumi, S; Tsuneyoshi, T; Furutani, H

1993-09-01

Rod-shaped flexuous viruses were partially purified from garlic plants (Allium sativum) showing typical mosaic symptoms. The genome was shown to be composed of RNA with a poly(A) tail of an estimated size of 10 kb as shown by denaturing agarose gel electrophoresis. We constructed cDNA libraries and screened four independent clones, which were designated GV-A, GV-B, GV-C and GV-D, using Northern and Southern blot hybridization. Nucleotide sequence determination of the cDNAs, two of which correspond to nearly one-third of the virus genomic RNA, shows that all of these viruses possess an identical genomic structure and that also at least four proteins are encoded in the viral cDNA, their M(r)s being estimated to be 15K, 27K, 40K and 11K. The 15K open reading frame (ORF) encodes the core-like sequence of a zinc finger protein preceded by a cluster of basic amino acid residues. The 27K ORF probably encodes the viral coat protein (CP), based on both the existence of some conserved sequences observed in many other rod-shaped or flexuous virus CPs and an overall amino acid sequence similarity to potexvirus and carlavirus CPs. The 11K ORF shows significant amino acid sequence similarities to the corresponding 12K proteins of the potexviruses and carlaviruses. On the other hand, the 40K ORF product does not resemble any other plant virus gene products reported so far. The genomic organization in the 3' region of the garlic viruses resembles, but clearly differs from, that of carlaviruses. Phylogenetic analysis based upon the amino acid sequence of the viral capsid protein also indicates that the garlic viruses have a unique and distinct domain different from those of the potexvirus and carlavirus groups. The results suggest that the garlic viruses described here belong to an unclassified and new virus group closely related to the carlaviruses.
Cloning and sequence analysis of a full-length cDNA of SmPP1cb encoding turbot protein phosphatase 1 beta catalytic subunit

NASA Astrophysics Data System (ADS)

Qi, Fei; Guo, Huarong; Wang, Jian

2008-02-01

Reversible protein phosphorylation, catalyzed by protein kinases and phosphatases, is an important and versatile mechanism by which eukaryotic cells regulate almost all the signaling processes. Protein phosphatase 1 (PP1) is the first and well-characterized member of the protein serine/threonine phosphatase family. In the present study, a full-length cDNA encoding the beta isoform of the catalytic subunit of protein phosphatase 1(PP1cb), was for the first time isolated and sequenced from the skin tissue of flatfish turbot Scophthalmus maximus, designated SmPP1cb, by the rapid amplification of cDNA ends (RACE) technique. The cDNA sequence of SmPP1cb we obtained contains a 984 bp open reading frame (ORF), flanked by a complete 39 bp 5' untranslated region and 462 bp 3' untranslated region. The ORF encodes a putative 327 amino acid protein, and the N-terminal section of this protein is highly acidic, Met-Ala-Glu-Gly-Glu-Leu-Asp-Val-Asp, a common feature for PP1 catalytic subunit but absent in protein phosphatase 2B (PP2B). And its calculated molecular mass is 37 193 Da and pI 5.8. Sequence analysis indicated that, SmPP1cb is extremely conserved in both amino acid and nucleotide acid levels compared with the PP1cb of other vertebrates and invertebrates, and its Kozak motif contained in the 5'UTR around ATG start codon is GXXAXXGXX ATGG, which is different from mammalian in two positions A-6 and G-3, indicating the possibility of different initiation of translation in turbot, and also the 3'UTR of SmPP1cb is highly diverse in the sequence similarity and length compared with other animals, especially zebrafish. The cloning and sequencing of SmPP1cb gene lays a good foundation for the future work on the biological functions of PP1 in the flatfish turbot.
Biosynthesis of riboflavin: an unusual riboflavin synthase of Methanobacterium thermoautotrophicum.

PubMed Central

Eberhardt, S; Korn, S; Lottspeich, F; Bacher, A

1997-01-01

Riboflavin synthase was purified by a factor of about 1,500 from cell extract of Methanobacterium thermoautotrophicum. The enzyme had a specific activity of about 2,700 nmol mg(-1) h(-1) at 65 degrees C, which is relatively low compared to those of riboflavin synthases of eubacteria and yeast. Amino acid sequences obtained after proteolytic cleavage had no similarity with known riboflavin synthases. The gene coding for riboflavin synthase (designated ribC) was subsequently cloned by marker rescue with a ribC mutant of Escherichia coli. The ribC gene of M. thermoautotrophicum specifies a protein of 153 amino acid residues. The predicted amino acid sequence agrees with the information gleaned from Edman degradation of the isolated protein and shows 67% identity with the sequence predicted for the unannotated reading frame MJ1184 of Methanococcus jannaschii. The ribC gene is adjacent to a cluster of four genes with similarity to the genes cbiMNQO of Salmonella typhimurium, which form part of the cob operon (this operon contains most of the genes involved in the biosynthesis of vitamin B12). The amino acid sequence predicted by the ribC gene of M. thermoautotrophicum shows no similarity whatsoever to the sequences of riboflavin synthases of eubacteria and yeast. Most notably, the M. thermoautotrophicum protein does not show the internal sequence homology characteristic of eubacterial and yeast riboflavin synthases. The protein of M. thermoautotrophicum can be expressed efficiently in a recombinant E. coli strain. The specific activity of the purified, recombinant protein is 1,900 nmol mg(-1) h(-1) at 65 degrees C. In contrast to riboflavin synthases from eubacteria and fungi, the methanobacterial enzyme has an absolute requirement for magnesium ions. The 5' phosphate of 6,7-dimethyl-8-ribityllumazine does not act as a substrate. The findings suggest that riboflavin synthase has evolved independently in eubacteria and methanobacteria. PMID:9139911
Precursors of vertebrate peptide antibiotics dermaseptin b and adenoregulin have extensive sequence identities with precursors of opioid peptides dermorphin, dermenkephalin, and deltorphins.

PubMed

Amiche, M; Ducancel, F; Mor, A; Boulain, J C; Menez, A; Nicolas, P

1994-07-08

The dermaseptins are a family of broad spectrum antimicrobial peptides, 27-34 amino acids long, involved in the defense of the naked skin of frogs against microbial invasion. They are the first vertebrate peptides to show lethal effects against the filamentous fungi responsible for severe opportunistic infections accompanying immunodeficiency syndrome and the use of immunosuppressive agents. A cDNA library was constructed from skin poly(A+) RNA of the arboreal frog Phyllomedusa bicolor and screened with an oligonucleotide probe complementary to the COOH terminus of dermaseptin b. Several clones contained a full-length DNA copy of a 443-nucleotide mRNA that encoded a 78-residue dermaseptin b precursor protein. The deduced precursor contained a putative signal sequence at the NH2 terminus, a 20-residue spacer sequence extremely rich (60%) in glutamic and aspartic acids, and a single copy of a dermaseptin b progenitor sequence at the COOH terminus. One clone contained a complete copy of adenoregulin, a 33-residue peptide reported to enhance the binding of agonists to the A1 adenosine receptor. The mRNAs encoding adenoregulin and dermaseptin b were very similar: 70 and 75% nucleotide identities between the 5'- and 3'-untranslated regions, respectively; 91% amino acid identity between the signal peptides; 82% identity between the acidic spacer sequences; and 38% identity between adenoregulin and dermaseptin b. Because adenoregulin and dermaseptin b have similar precursor designs and antimicrobial spectra, adenoregulin should be considered as a new member of the dermaseptin family and alternatively named dermaseptin b II. Preprodermaseptin b and preproadenoregulin have considerable sequence identities to the precursors encoding the opioid heptapeptides dermorphin, dermenkephalin, and deltorphins. This similarity extended into the 5'-untranslated regions of the mRNAs. These findings suggest that the genes encoding the four preproproteins are all members of the same family despite the fact that they encode end products having very different biological activities. These genes might contain a homologous export exon comprising the 5'-untranslated region, the 22-residue signal peptide, the 20-24-residue acidic spacer, and the basic pair Lys-Arg.
Defining Electron Bifurcation in the Electron-Transferring Flavoprotein Family.

PubMed

Garcia Costas, Amaya M; Poudel, Saroj; Miller, Anne-Frances; Schut, Gerrit J; Ledbetter, Rhesa N; Fixen, Kathryn R; Seefeldt, Lance C; Adams, Michael W W; Harwood, Caroline S; Boyd, Eric S; Peters, John W

2017-11-01

Electron bifurcation is the coupling of exergonic and endergonic redox reactions to simultaneously generate (or utilize) low- and high-potential electrons. It is the third recognized form of energy conservation in biology and was recently described for select electron-transferring flavoproteins (Etfs). Etfs are flavin-containing heterodimers best known for donating electrons derived from fatty acid and amino acid oxidation to an electron transfer respiratory chain via Etf-quinone oxidoreductase. Canonical examples contain a flavin adenine dinucleotide (FAD) that is involved in electron transfer, as well as a non-redox-active AMP. However, Etfs demonstrated to bifurcate electrons contain a second FAD in place of the AMP. To expand our understanding of the functional variety and metabolic significance of Etfs and to identify amino acid sequence motifs that potentially enable electron bifurcation, we compiled 1,314 Etf protein sequences from genome sequence databases and subjected them to informatic and structural analyses. Etfs were identified in diverse archaea and bacteria, and they clustered into five distinct well-supported groups, based on their amino acid sequences. Gene neighborhood analyses indicated that these Etf group designations largely correspond to putative differences in functionality. Etfs with the demonstrated ability to bifurcate were found to form one group, suggesting that distinct conserved amino acid sequence motifs enable this capability. Indeed, structural modeling and sequence alignments revealed that identifying residues occur in the NADH- and FAD-binding regions of bifurcating Etfs. Collectively, a new classification scheme for Etf proteins that delineates putative bifurcating versus nonbifurcating members is presented and suggests that Etf-mediated bifurcation is associated with surprisingly diverse enzymes. IMPORTANCE Electron bifurcation has recently been recognized as an electron transfer mechanism used by microorganisms to maximize energy conservation. Bifurcating enzymes couple thermodynamically unfavorable reactions with thermodynamically favorable reactions in an overall spontaneous process. Here we show that the electron-transferring flavoprotein (Etf) enzyme family exhibits far greater diversity than previously recognized, and we provide a phylogenetic analysis that clearly delineates bifurcating versus nonbifurcating members of this family. Structural modeling of proteins within these groups reveals key differences between the bifurcating and nonbifurcating Etfs. Copyright © 2017 American Society for Microbiology.
Defining Electron Bifurcation in the Electron-Transferring Flavoprotein Family

PubMed Central

Garcia Costas, Amaya M.; Poudel, Saroj; Miller, Anne-Frances; Schut, Gerrit J.; Ledbetter, Rhesa N.; Seefeldt, Lance C.; Adams, Michael W. W.

2017-01-01

ABSTRACT Electron bifurcation is the coupling of exergonic and endergonic redox reactions to simultaneously generate (or utilize) low- and high-potential electrons. It is the third recognized form of energy conservation in biology and was recently described for select electron-transferring flavoproteins (Etfs). Etfs are flavin-containing heterodimers best known for donating electrons derived from fatty acid and amino acid oxidation to an electron transfer respiratory chain via Etf-quinone oxidoreductase. Canonical examples contain a flavin adenine dinucleotide (FAD) that is involved in electron transfer, as well as a non-redox-active AMP. However, Etfs demonstrated to bifurcate electrons contain a second FAD in place of the AMP. To expand our understanding of the functional variety and metabolic significance of Etfs and to identify amino acid sequence motifs that potentially enable electron bifurcation, we compiled 1,314 Etf protein sequences from genome sequence databases and subjected them to informatic and structural analyses. Etfs were identified in diverse archaea and bacteria, and they clustered into five distinct well-supported groups, based on their amino acid sequences. Gene neighborhood analyses indicated that these Etf group designations largely correspond to putative differences in functionality. Etfs with the demonstrated ability to bifurcate were found to form one group, suggesting that distinct conserved amino acid sequence motifs enable this capability. Indeed, structural modeling and sequence alignments revealed that identifying residues occur in the NADH- and FAD-binding regions of bifurcating Etfs. Collectively, a new classification scheme for Etf proteins that delineates putative bifurcating versus nonbifurcating members is presented and suggests that Etf-mediated bifurcation is associated with surprisingly diverse enzymes. IMPORTANCE Electron bifurcation has recently been recognized as an electron transfer mechanism used by microorganisms to maximize energy conservation. Bifurcating enzymes couple thermodynamically unfavorable reactions with thermodynamically favorable reactions in an overall spontaneous process. Here we show that the electron-transferring flavoprotein (Etf) enzyme family exhibits far greater diversity than previously recognized, and we provide a phylogenetic analysis that clearly delineates bifurcating versus nonbifurcating members of this family. Structural modeling of proteins within these groups reveals key differences between the bifurcating and nonbifurcating Etfs. PMID:28808132

Isolation of acetic, propionic and butyric acid-forming bacteria from biogas plants.

PubMed

Cibis, Katharina Gabriela; Gneipel, Armin; König, Helmut

2016-02-20

In this study, acetic, propionic and butyric acid-forming bacteria were isolated from thermophilic and mesophilic biogas plants (BGP) located in Germany. The fermenters were fed with maize silage and cattle or swine manure. Furthermore, pressurized laboratory fermenters digesting maize silage were sampled. Enrichment cultures for the isolation of acid-forming bacteria were grown in minimal medium supplemented with one of the following carbon sources: Na(+)-dl-lactate, succinate, ethanol, glycerol, glucose or a mixture of amino acids. These substrates could be converted by the isolates to acetic, propionic or butyric acid. In total, 49 isolates were obtained, which belonged to the phyla Firmicutes, Tenericutes or Thermotogae. According to 16S rRNA gene sequences, most isolates were related to Clostridium sporosphaeroides, Defluviitoga tunisiensis and Dendrosporobacter quercicolus. Acetic, propionic or butyric acid were produced in cultures of isolates affiliated to Bacillus thermoamylovorans, Clostridium aminovalericum, Clostridium cochlearium/Clostridium tetani, C. sporosphaeroides, D. quercicolus, Proteiniborus ethanoligenes, Selenomonas bovis and Tepidanaerobacter sp. Isolates related to Thermoanaerobacterium thermosaccharolyticum produced acetic, butyric and lactic acid, and isolates related to D. tunisiensis formed acetic acid. Specific primer sets targeting 16S rRNA gene sequences were designed and used for real-time quantitative PCR (qPCR). The isolates were physiologically characterized and their role in BGP discussed. Copyright © 2016 Elsevier B.V. All rights reserved.
HomoSAR: bridging comparative protein modeling with quantitative structural activity relationship to design new peptides.

PubMed

Borkar, Mahesh R; Pissurlenkar, Raghuvir R S; Coutinho, Evans C

2013-11-15

Peptides play significant roles in the biological world. To optimize activity for a specific therapeutic target, peptide library synthesis is inevitable; which is a time consuming and expensive. Computational approaches provide a promising way to simply elucidate the structural basis in the design of new peptides. Earlier, we proposed a novel methodology termed HomoSAR to gain insight into the structure activity relationships underlying peptides. Based on an integrated approach, HomoSAR uses the principles of homology modeling in conjunction with the quantitative structural activity relationship formalism to predict and design new peptide sequences with the optimum activity. In the present study, we establish that the HomoSAR methodology can be universally applied to all classes of peptides irrespective of sequence length by studying HomoSAR on three peptide datasets viz., angiotensin-converting enzyme inhibitory peptides, CAMEL-s antibiotic peptides, and hAmphiphysin-1 SH3 domain binding peptides, using a set of descriptors related to the hydrophobic, steric, and electronic properties of the 20 natural amino acids. Models generated for all three datasets have statistically significant correlation coefficients (r(2)) and predictive r2 (r(pred)2) and cross validated coefficient ( q(LOO)2). The daintiness of this technique lies in its simplicity and ability to extract all the information contained in the peptides to elucidate the underlying structure activity relationships. The difficulties of correlating both sequence diversity and variation in length of the peptides with their biological activity can be addressed. The study has been able to identify the preferred or detrimental nature of amino acids at specific positions in the peptide sequences. Copyright © 2013 Wiley Periodicals, Inc.
Application of Locked Nucleic Acid (LNA) Primer and PCR Clamping by LNA Oligonucleotide to Enhance the Amplification of Internal Transcribed Spacer (ITS) Regions in Investigating the Community Structures of Plant-Associated Fungi.

PubMed

Ikenaga, Makoto; Tabuchi, Masakazu; Kawauchi, Tomohiro; Sakai, Masao

2016-09-29

The simultaneous extraction of host plant DNA severely limits investigations of the community structures of plant-associated fungi due to the similar homologies of sequences in primer-annealing positions between fungi and host plants. Although fungal-specific primers have been designed, plant DNA continues to be excessively amplified by PCR, resulting in the underestimation of community structures. In order to overcome this limitation, locked nucleic acid (LNA) primers and PCR clamping by LNA oligonucleotides have been applied to enhance the amplification of fungal internal transcribed spacer (ITS) regions. LNA primers were designed by converting DNA into LNA, which is specific to fungi, at the forward primer side. LNA oligonucleotides, the sequences of which are complementary to the host plants, were designed by overlapping a few bases with the annealing position of the reverse primer. Plant-specific DNA was then converted into LNA at the shifted position from the 3' end of the primer-binding position. PCR using the LNA technique enhanced the amplification of fungal ITS regions, whereas those of the host plants were more likely to be amplified without the LNA technique. A denaturing gradient gel electrophoresis (DGGE) analysis displayed patterns that reached an acceptable level for investigating the community structures of plant-associated fungi using the LNA technique. The sequences of the bands detected using the LNA technique were mostly affiliated with known isolates. However, some sequences showed low similarities, indicating the potential to identify novel fungi. Thus, the application of the LNA technique is considered effective for widening the scope of community analyses of plant-associated fungi.
Application of Locked Nucleic Acid (LNA) Primer and PCR Clamping by LNA Oligonucleotide to Enhance the Amplification of Internal Transcribed Spacer (ITS) Regions in Investigating the Community Structures of Plant–Associated Fungi

PubMed Central

Ikenaga, Makoto; Tabuchi, Masakazu; Kawauchi, Tomohiro; Sakai, Masao

2016-01-01

The simultaneous extraction of host plant DNA severely limits investigations of the community structures of plant–associated fungi due to the similar homologies of sequences in primer–annealing positions between fungi and host plants. Although fungal-specific primers have been designed, plant DNA continues to be excessively amplified by PCR, resulting in the underestimation of community structures. In order to overcome this limitation, locked nucleic acid (LNA) primers and PCR clamping by LNA oligonucleotides have been applied to enhance the amplification of fungal internal transcribed spacer (ITS) regions. LNA primers were designed by converting DNA into LNA, which is specific to fungi, at the forward primer side. LNA oligonucleotides, the sequences of which are complementary to the host plants, were designed by overlapping a few bases with the annealing position of the reverse primer. Plant-specific DNA was then converted into LNA at the shifted position from the 3′ end of the primer–binding position. PCR using the LNA technique enhanced the amplification of fungal ITS regions, whereas those of the host plants were more likely to be amplified without the LNA technique. A denaturing gradient gel electrophoresis (DGGE) analysis displayed patterns that reached an acceptable level for investigating the community structures of plant–associated fungi using the LNA technique. The sequences of the bands detected using the LNA technique were mostly affiliated with known isolates. However, some sequences showed low similarities, indicating the potential to identify novel fungi. Thus, the application of the LNA technique is considered effective for widening the scope of community analyses of plant–associated fungi. PMID:27600711
Vanillin formation from ferulic acid in Vanilla planifolia is catalysed by a single enzyme.

PubMed

Gallage, Nethaji J; Hansen, Esben H; Kannangara, Rubini; Olsen, Carl Erik; Motawia, Mohammed Saddik; Jørgensen, Kirsten; Holme, Inger; Hebelstrup, Kim; Grisoni, Michel; Møller, Birger Lindberg

2014-06-19

Vanillin is a popular and valuable flavour compound. It is the key constituent of the natural vanilla flavour obtained from cured vanilla pods. Here we show that a single hydratase/lyase type enzyme designated vanillin synthase (VpVAN) catalyses direct conversion of ferulic acid and its glucoside into vanillin and its glucoside, respectively. The enzyme shows high sequence similarity to cysteine proteinases and is specific to the substitution pattern at the aromatic ring and does not metabolize caffeic acid and p-coumaric acid as demonstrated by coupled transcription/translation assays. VpVAN localizes to the inner part of the vanilla pod and high transcript levels are found in single cells located a few cell layers from the inner epidermis. Transient expression of VpVAN in tobacco and stable expression in barley in combination with the action of endogenous alcohol dehydrogenases and UDP-glucosyltransferases result in vanillyl alcohol glucoside formation from endogenous ferulic acid. A gene encoding an enzyme showing 71% sequence identity to VpVAN was identified in another vanillin-producing plant species Glechoma hederacea and was also shown to be a vanillin synthase as demonstrated by transient expression in tobacco.
Vanillin formation from ferulic acid in Vanilla planifolia is catalysed by a single enzyme

PubMed Central

Gallage, Nethaji J.; Hansen, Esben H.; Kannangara, Rubini; Olsen, Carl Erik; Motawia, Mohammed Saddik; Jørgensen, Kirsten; Holme, Inger; Hebelstrup, Kim; Grisoni, Michel; Møller, Birger Lindberg

2014-01-01

Vanillin is a popular and valuable flavour compound. It is the key constituent of the natural vanilla flavour obtained from cured vanilla pods. Here we show that a single hydratase/lyase type enzyme designated vanillin synthase (VpVAN) catalyses direct conversion of ferulic acid and its glucoside into vanillin and its glucoside, respectively. The enzyme shows high sequence similarity to cysteine proteinases and is specific to the substitution pattern at the aromatic ring and does not metabolize caffeic acid and p-coumaric acid as demonstrated by coupled transcription/translation assays. VpVAN localizes to the inner part of the vanilla pod and high transcript levels are found in single cells located a few cell layers from the inner epidermis. Transient expression of VpVAN in tobacco and stable expression in barley in combination with the action of endogenous alcohol dehydrogenases and UDP-glucosyltransferases result in vanillyl alcohol glucoside formation from endogenous ferulic acid. A gene encoding an enzyme showing 71% sequence identity to VpVAN was identified in another vanillin-producing plant species Glechoma hederacea and was also shown to be a vanillin synthase as demonstrated by transient expression in tobacco. PMID:24941968
Genome sequence of a distinct watermelon mosaic virus identified from ginseng (Panax ginseng) transcriptome.

PubMed

Park, D; Kim, H; Hahn, Y

Watermelon mosaic virus (WMV) is a member of the genus Potyvirus, which is the largest genus of plant viruses. WMV is a significant pathogen of crop plants, including Cucurbitaceae species. A WMV strain, designated as WMV-Pg, was identified in transcriptome data collected from ginseng (Panax ginseng) root. WMV-Pg showed 84% nucleotide sequence identity and 91% amino acid sequence identity with its closest related virus, WMV-Fr. A phylogenetic analysis of WMV-Pg with other WMVs and soybean mosaic viruses (SMVs) indicated that WMV-Pg is a distinct subtype of the WMV/SMV group of the genus Potyvirus in the family Potyviridae.
Candidate new rotavirus species in Schreiber's bats, Serbia.

PubMed

Bányai, Krisztián; Kemenesi, Gábor; Budinski, Ivana; Földes, Fanni; Zana, Brigitta; Marton, Szilvia; Varga-Kugler, Renáta; Oldal, Miklós; Kurucz, Kornélia; Jakab, Ferenc

2017-03-01

The genus Rotavirus comprises eight species designated A to H and one tentative species, Rotavirus I. In a virus metagenomic analysis of Schreiber's bats sampled in Serbia in 2014 we obtained sequences likely representing novel rotavirus species. Whole genome sequencing and phylogenetic analysis classified the representative strain into a tentative tenth rotavirus species, we provisionally called Rotavirus J. The novel virus shared a maximum of 50% amino acid sequence identity within the VP6 gene to currently known members of the genus. This study extends our understanding of the genetic diversity of rotaviruses in bats. Copyright © 2016 Elsevier B.V. All rights reserved.
Folding and Stabilization of Native-Sequence-Reversed Proteins

PubMed Central

Zhang, Yuanzhao; Weber, Jeffrey K; Zhou, Ruhong

2016-01-01

Though the problem of sequence-reversed protein folding is largely unexplored, one might speculate that reversed native protein sequences should be significantly more foldable than purely random heteropolymer sequences. In this article, we investigate how the reverse-sequences of native proteins might fold by examining a series of small proteins of increasing structural complexity (α-helix, β-hairpin, α-helix bundle, and α/β-protein). Employing a tandem protein structure prediction algorithmic and molecular dynamics simulation approach, we find that the ability of reverse sequences to adopt native-like folds is strongly influenced by protein size and the flexibility of the native hydrophobic core. For β-hairpins with reverse-sequences that fail to fold, we employ a simple mutational strategy for guiding stable hairpin formation that involves the insertion of amino acids into the β-turn region. This systematic look at reverse sequence duality sheds new light on the problem of protein sequence-structure mapping and may serve to inspire new protein design and protein structure prediction protocols. PMID:27113844
Folding and Stabilization of Native-Sequence-Reversed Proteins

NASA Astrophysics Data System (ADS)

Zhang, Yuanzhao; Weber, Jeffrey K.; Zhou, Ruhong

2016-04-01

Though the problem of sequence-reversed protein folding is largely unexplored, one might speculate that reversed native protein sequences should be significantly more foldable than purely random heteropolymer sequences. In this article, we investigate how the reverse-sequences of native proteins might fold by examining a series of small proteins of increasing structural complexity (α-helix, β-hairpin, α-helix bundle, and α/β-protein). Employing a tandem protein structure prediction algorithmic and molecular dynamics simulation approach, we find that the ability of reverse sequences to adopt native-like folds is strongly influenced by protein size and the flexibility of the native hydrophobic core. For β-hairpins with reverse-sequences that fail to fold, we employ a simple mutational strategy for guiding stable hairpin formation that involves the insertion of amino acids into the β-turn region. This systematic look at reverse sequence duality sheds new light on the problem of protein sequence-structure mapping and may serve to inspire new protein design and protein structure prediction protocols.
Testing the limits of rational design by engineering pH sensitivity into membrane-active peptides.

PubMed

Wiedman, Gregory; Wimley, William C; Hristova, Kalina

2015-04-01

In this work, we sought to rationally design membrane-active peptides that are triggered by low pH to form macromolecular-sized pores in lipid bilayers. Such peptides could have broad utility in biotechnology and in nanomedicine as cancer therapeutics or drug delivery vehicles that promote release of macromolecules from endosomes. Our approach to rational design was to combine the properties of a pH-independent peptide, MelP5, which forms large pores allowing passage of macromolecules, with the properties of two pH-dependent membrane-active peptides, pHlip and GALA. We created two hybrid sequences, MelP5_Δ4 and MelP5_Δ6, by using the distribution of acidic residues on pHlip and GALA as a guide to insert acidic amino acids into the amphipathic helix of MelP5. We show that the new peptides bind to lipid bilayers and acquire secondary structure in a pH-dependent manner. The peptides also destabilize bilayers in a pH-dependent manner, such that lipid vesicles release the small molecules ANTS/DPX at low pH only. Thus, we were successful in designing pH-triggered pore-forming peptides. However, no macromolecular release was observed under any conditions. Therefore, we abolished the unique macromolecular poration properties of MelP5 by introducing pH sensitivity into its sequence. We conclude that the properties of pHlip, GALA, and MelP5 are additive, but only partially so. We propose that this lack of additivity is a limitation in the rational design of novel membrane-active peptides, and that high-throughput approaches to discovery will be critical for continued progress in the field. Copyright © 2015 Elsevier B.V. All rights reserved.
Designing probe from E6 genome region of human Papillomavirus 16 for sensing applications.

PubMed

Parmin, Nor Azizah; Hashim, Uda; Gopinath, Subash C B

2018-02-01

Human Papillomavirus (HPV) is a standout amongst the most commonly reported over 100 types, among them genotypes 16, 18, 31 and 45 are the high-risk HPV. Herein, we designed the oligonucleotide probe for the detection of predominant HPV type 16 for the sensing applications. Conserved amino acid sequences within E6 region of the open reading frame in the HPV genome was used as the basis to design oligonucleotide probe to detect cervical cancer. Analyses of E6 amino acid sequences from the high-risk HPVs were done to check the percentage of similarity and consensus regions that cause different cancers, including cervical cancer. Basic local alignment search tools (BLAST) have given extra statistical parameters, for example, desire values (E-values) and score bits. The probe, 'GGG GTC GGT GGA CCG GTC GAT GTA' was designed with 66.7% GC content. This oligonucleotide probe is designed with the length of 24 mer, GC percent is between 40 and 70, and the melting point (Tm) is above 50°C. The probe needed an acceptable length between 22 and 31 mer. The choice of region is identified here can be used as a probe, has implications for HPV detection techniques in biosensor especially for clinical determination of cervical cancer. Copyright © 2017 Elsevier B.V. All rights reserved.
Biosynthesis of Essential Polyunsaturated Fatty Acids in Wheat Triggered by Expression of Artificial Gene

PubMed Central

Mihálik, Daniel; Klčová, Lenka; Ondreičková, Katarína; Hudcovicová, Martina; Gubišová, Marcela; Klempová, Tatiana; Čertík, Milan; Pauk, János; Kraic, Ján

2015-01-01

The artificial gene D6D encoding the enzyme ∆6desaturase was designed and synthesized using the sequence of the same gene from the fungus Thamnidium elegans. The original start codon was replaced by the signal sequence derived from the wheat gene for high-molecular-weight glutenin subunit and the codon usage was completely changed for optimal expression in wheat. Synthesized artificial D6D gene was delivered into plants of the spring wheat line CY-45 and the gene itself, as well as transcribed D6D mRNA were confirmed in plants of T0 and T1 generations. The desired product of the wheat genetic modification by artificial D6D gene was the γ-linolenic acid. Its presence was confirmed in mature grains of transgenic wheat plants in the amount 0.04%–0.32% (v/v) of the total amount of fatty acids. Both newly synthesized γ-linolenic acid and stearidonic acid have been detected also in leaves, stems, roots, awns, paleas, rachillas, and immature grains of the T1 generation as well as in immature and mature grains of the T2 generation. Contents of γ-linolenic acid and stearidonic acid varied in range 0%–1.40% (v/v) and 0%–1.53% (v/v) from the total amount of fatty acids, respectively. This approach has opened the pathway of desaturation of fatty acids and production of essential polyunsaturated fatty acids in wheat. PMID:26694368
The domestication of the probiotic bacterium Lactobacillus acidophilus

PubMed Central

Bull, Matthew J.; Jolley, Keith A.; Bray, James E.; Aerts, Maarten; Vandamme, Peter; Maiden, Martin C. J.; Marchesi, Julian R.; Mahenthiralingam, Eshwar

2014-01-01

Lactobacillus acidophilus is a Gram-positive lactic acid bacterium that has had widespread historical use in the dairy industry and more recently as a probiotic. Although L. acidophilus has been designated as safe for human consumption, increasing commercial regulation and clinical demands for probiotic validation has resulted in a need to understand its genetic diversity. By drawing on large, well-characterised collections of lactic acid bacteria, we examined L. acidophilus isolates spanning 92 years and including multiple strains in current commercial use. Analysis of the whole genome sequence data set (34 isolate genomes) demonstrated L. acidophilus was a low diversity, monophyletic species with commercial isolates essentially identical at the sequence level. Our results indicate that commercial use has domesticated L. acidophilus with genetically stable, invariant strains being consumed globally by the human population. PMID:25425319
The domestication of the probiotic bacterium Lactobacillus acidophilus.

PubMed

Bull, Matthew J; Jolley, Keith A; Bray, James E; Aerts, Maarten; Vandamme, Peter; Maiden, Martin C J; Marchesi, Julian R; Mahenthiralingam, Eshwar

2014-11-26

Lactobacillus acidophilus is a Gram-positive lactic acid bacterium that has had widespread historical use in the dairy industry and more recently as a probiotic. Although L. acidophilus has been designated as safe for human consumption, increasing commercial regulation and clinical demands for probiotic validation has resulted in a need to understand its genetic diversity. By drawing on large, well-characterised collections of lactic acid bacteria, we examined L. acidophilus isolates spanning 92 years and including multiple strains in current commercial use. Analysis of the whole genome sequence data set (34 isolate genomes) demonstrated L. acidophilus was a low diversity, monophyletic species with commercial isolates essentially identical at the sequence level. Our results indicate that commercial use has domesticated L. acidophilus with genetically stable, invariant strains being consumed globally by the human population.
Methods and compositions for efficient nucleic acid sequencing

DOEpatents

Drmanac, Radoje

2006-07-04

Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.
Methods and compositions for efficient nucleic acid sequencing

DOEpatents

Drmanac, Radoje

2002-01-01

Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.
Single-Stranded γPNAs for In Vivo Site-Specific Genome Editing via Watson-Crick Recognition

PubMed Central

Bahal, Raman; Quijano, Elias; McNeer, Nicole Ali; Liu, Yanfeng; Bhunia, Dinesh C.; López-Giráldez, Francesco; Fields, Rachel J.; Saltzman, W. Mark; Ly, Danith H.; Glazer, Peter M.

2014-01-01

Triplex-forming peptide nucleic acids (PNAs) facilitate gene editing by stimulating recombination of donor DNAs within genomic DNA via site-specific formation of altered helical structures that further stimulate DNA repair. However, PNAs designed for triplex formation are sequence restricted to homopurine sites. Herein we describe a novel strategy where next generation single-stranded gamma PNAs (γPNAs) containing miniPEG substitutions at the gamma position can target genomic DNA in mouse bone marrow at mixed-sequence sites to induce targeted gene editing. In addition to enhanced binding, γPNAs confer increased solubility and improved formulation into poly(lactic-co-glycolic acid) (PLGA) nanoparticles for efficient intracellular delivery. Single-stranded γPNAs induce targeted gene editing at frequencies of 0.8% in mouse bone marrow cells treated ex vivo and 0.1% in vivo via IV injection, without detectable toxicity. These results suggest that γPNAs may provide a new tool for induced gene editing based on Watson-Crick recognition without sequence restriction. PMID:25174576
Single-stranded γPNAs for in vivo site-specific genome editing via Watson-Crick recognition.

PubMed

Bahal, Raman; Quijano, Elias; McNeer, Nicole A; Liu, Yanfeng; Bhunia, Dinesh C; Lopez-Giraldez, Francesco; Fields, Rachel J; Saltzman, William M; Ly, Danith H; Glazer, Peter M

2014-01-01

Triplex-forming peptide nucleic acids (PNAs) facilitate gene editing by stimulating recombination of donor DNAs within genomic DNA via site-specific formation of altered helical structures that further stimulate DNA repair. However, PNAs designed for triplex formation are sequence restricted to homopurine sites. Herein we describe a novel strategy where next generation single-stranded gamma PNAs (γPNAs) containing miniPEG substitutions at the gamma position can target genomic DNA in mouse bone marrow at mixed-sequence sites to induce targeted gene editing. In addition to enhanced binding, γPNAs confer increased solubility and improved formulation into poly(lactic-co-glycolic acid) (PLGA) nanoparticles for efficient intracellular delivery. Single-stranded γPNAs induce targeted gene editing at frequencies of 0.8% in mouse bone marrow cells treated ex vivo and 0.1% in vivo via IV injection, without detectable toxicity. These results suggest that γPNAs may provide a new tool for induced gene editing based on Watson-Crick recognition without sequence restriction.
Characterization of a chitinolytic enzyme from Serratia sp. KCK isolated from kimchi juice.

PubMed

Kim, Hyun-Soo; Timmis, Kenneth N; Golyshin, Peter N

2007-07-01

The novel chitinolytic bacterium Serratia sp. KCK, which was isolated from kimchi juice, produced chitinase A. The gene coding for the chitinolytic enzyme was cloned on the basis of sequencing of internal peptides, homology search, and design of degenerated primers. The cloned open reading frame of chiA encodes for deduced polypeptide of 563 amino acid residues with a calculated molecular mass of 61 kDa and appears to correspond to a molecular mass of about 57 kDa, which excluded the signal sequence. The deduced amino acid sequence showed high similarity to those of bacterial chitinases classified as family 18 of glycosyl hydrolases. The chitinase A is an exochitinase and exhibits a greater pH range (5.0-10.0), thermostability with a temperature optimum of 40 degrees C, and substrate range other than Serratia chitinases thus far described. These results suggested that Serratia sp. KCK chitinase A can be used for biotechnological applications with good potential.

Hybridization and sequencing of nucleic acids using base pair mismatches

DOEpatents

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

2001-01-01

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Human jagged polypeptide, encoding nucleic acids and methods of use

DOEpatents

Li, Linheng; Hood, Leroy

2000-01-01

The present invention provides an isolated polypeptide exhibiting substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the polypeptide does not have the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. The invention further provides an isolated nucleic acid molecule containing a nucleotide sequence encoding substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the nucleotide sequence does not encode the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. Also provided herein is a method of inhibiting differentiation of hematopoietic progenitor cells by contacting the progenitor cells with an isolated JAGGED polypeptide, or active fragment thereof. The invention additionally provides a method of diagnosing Alagille Syndrome in an individual. The method consists of detecting an Alagille Syndrome disease-associated mutation linked to a JAGGED locus.
A diverse family of serine proteinase genes expressed in cotton boll weevil (Anthonomus grandis): implications for the design of pest-resistant transgenic cotton plants.

PubMed

Oliveira-Neto, Osmundo B; Batista, João A N; Rigden, Daniel J; Fragoso, Rodrigo R; Silva, Rodrigo O; Gomes, Eliane A; Franco, Octávio L; Dias, Simoni C; Cordeiro, Célia M T; Monnerat, Rose G; Grossi-De-Sá, Maria F

2004-09-01

Fourteen different cDNA fragments encoding serine proteinases were isolated by reverse transcription-PCR from cotton boll weevil (Anthonomus grandis) larvae. A large diversity between the sequences was observed, with a mean pairwise identity of 22% in the amino acid sequence. The cDNAs encompassed 11 trypsin-like sequences classifiable into three families and three chymotrypsin-like sequences belonging to a single family. Using a combination of 5' and 3' RACE, the full-length sequence was obtained for five of the cDNAs, named Agser2, Agser5, Agser6, Agser10 and Agser21. The encoded proteins included amino acid sequence motifs of serine proteinase active sites, conserved cysteine residues, and both zymogen activation and signal peptides. Southern blotting analysis suggested that one or two copies of these serine proteinase genes exist in the A. grandis genome. Northern blotting analysis of Agser2 and Agser5 showed that for both genes, expression is induced upon feeding and is concentrated in the gut of larvae and adult insects. Reverse northern analysis of the 14 cDNA fragments showed that only two trypsin-like and two chymotrypsin-like were expressed at detectable levels. Under the effect of the serine proteinase inhibitors soybean Kunitz trypsin inhibitor and black-eyed pea trypsin/chymotrypsin inhibitor, expression of one of the trypsin-like sequences was upregulated while expression of the two chymotrypsin-like sequences was downregulated. Copyright 2004 Elsevier Ltd.
Polypeptide having or assisting in carbohydrate material degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2016-02-16

The invention relates to a polypeptide which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof

DOE Office of Scientific and Technical Information (OSTI.GOV)

Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well asmore » the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.« less
Polypeptide having swollenin activity and uses thereof

DOEpatents

Schoonneveld-Bergmans, Margot Elizabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica D; Damveld, Robbertus Antonius

2015-11-04

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel; Damveld, Robbertus Antonius

2015-09-01

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having cellobiohydrolase activity and uses thereof

DOEpatents

Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

2015-09-15

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having acetyl xylan esterase activity and uses thereof

DOEpatents

Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2015-10-20

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having carbohydrate degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica Diana; Damveld, Robbertus Antonius

2015-08-18

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Cloning a Chymotrypsin-Like 1 (CTRL-1) Protease cDNA from the Jellyfish Nemopilema nomurai

PubMed Central

Heo, Yunwi; Kwon, Young Chul; Bae, Seong Kyeong; Hwang, Duhyeon; Yang, Hye Ryeon; Choudhary, Indu; Lee, Hyunkyoung; Yum, Seungshic; Shin, Kyoungsoon; Yoon, Won Duk; Kang, Changkeun; Kim, Euikyung

2016-01-01

An enzyme in a nematocyst extract of the Nemopilema nomurai jellyfish, caught off the coast of the Republic of Korea, catalyzed the cleavage of chymotrypsin substrate in an amidolytic kinetic assay, and this activity was inhibited by the serine protease inhibitor, phenylmethanesulfonyl fluoride. We isolated the full-length cDNA sequence of this enzyme, which contains 850 nucleotides, with an open reading frame of 801 encoding 266 amino acids. A blast analysis of the deduced amino acid sequence showed 41% identity with human chymotrypsin-like (CTRL) and the CTRL-1 precursor. Therefore, we designated this enzyme N. nomurai CTRL-1. The primary structure of N. nomurai CTRL-1 includes a leader peptide and a highly conserved catalytic triad of His69, Asp117, and Ser216. The disulfide bonds of chymotrypsin and the substrate-binding sites are highly conserved compared with the CTRLs of other species, including mammalian species. Nemopilema nomurai CTRL-1 is evolutionarily more closely related to Actinopterygii than to Scyphozoan (Aurelia aurita) or Hydrozoan (Hydra vulgaris). The N. nomurai CTRL1 was amplified from the genomic DNA with PCR using specific primers designed based on the full-length cDNA, and then sequenced. The N. nomurai CTRL1 gene contains 2434 nucleotides and four distinct exons. The 5′ donor splice (GT) and 3′ acceptor splice sequences (AG) are wholly conserved. This is the first report of the CTRL1 gene and cDNA structures in the jellyfish N. nomurai. PMID:27399771
Cloning a Chymotrypsin-Like 1 (CTRL-1) Protease cDNA from the Jellyfish Nemopilema nomurai.

PubMed

Heo, Yunwi; Kwon, Young Chul; Bae, Seong Kyeong; Hwang, Duhyeon; Yang, Hye Ryeon; Choudhary, Indu; Lee, Hyunkyoung; Yum, Seungshic; Shin, Kyoungsoon; Yoon, Won Duk; Kang, Changkeun; Kim, Euikyung

2016-07-05

An enzyme in a nematocyst extract of the Nemopilema nomurai jellyfish, caught off the coast of the Republic of Korea, catalyzed the cleavage of chymotrypsin substrate in an amidolytic kinetic assay, and this activity was inhibited by the serine protease inhibitor, phenylmethanesulfonyl fluoride. We isolated the full-length cDNA sequence of this enzyme, which contains 850 nucleotides, with an open reading frame of 801 encoding 266 amino acids. A blast analysis of the deduced amino acid sequence showed 41% identity with human chymotrypsin-like (CTRL) and the CTRL-1 precursor. Therefore, we designated this enzyme N. nomurai CTRL-1. The primary structure of N. nomurai CTRL-1 includes a leader peptide and a highly conserved catalytic triad of His(69), Asp(117), and Ser(216). The disulfide bonds of chymotrypsin and the substrate-binding sites are highly conserved compared with the CTRLs of other species, including mammalian species. Nemopilema nomurai CTRL-1 is evolutionarily more closely related to Actinopterygii than to Scyphozoan (Aurelia aurita) or Hydrozoan (Hydra vulgaris). The N. nomurai CTRL1 was amplified from the genomic DNA with PCR using specific primers designed based on the full-length cDNA, and then sequenced. The N. nomurai CTRL1 gene contains 2434 nucleotides and four distinct exons. The 5' donor splice (GT) and 3' acceptor splice sequences (AG) are wholly conserved. This is the first report of the CTRL1 gene and cDNA structures in the jellyfish N. nomurai.
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

Code of Federal Regulations, 2010 CFR

2010-07-01

... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...
37 CFR 5.31-5.33 - [Reserved

Code of Federal Regulations, 2011 CFR

2011-07-01

... from abandonment 1.135 Amino Acid Sequences. (See Nucleotide and/or Amino Acid Sequences) Appeal to... Appeals and Interference 41.47 Of rejection of an application 1.104(a) Nucleotide and/or Amino Acid...) Symbols for nucleotide and/or amino acid sequence data 1.822 T Tables in patent applications 1.58 Terminal...
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

Code of Federal Regulations, 2011 CFR

2011-07-01

... 37 Patents, Trademarks, and Copyrights 1 2011-07-01 2011-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...
Redesigning Channel-Forming Peptides: Amino Acid Substitutions that Enhance Rates of Supramolecular Self-Assembly and Raise Ion Transport Activity

PubMed Central

Shank, Lalida P.; Broughman, James R.; Takeguchi, Wade; Cook, Gabriel; Robbins, Ashley S.; Hahn, Lindsey; Radke, Gary; Iwamoto, Takeo; Schultz, Bruce D.; Tomich, John M.

2006-01-01

Three series of 22-residue peptides derived from the transmembrane M2 segment of the glycine receptor α1-subunit (M2GlyR) have been designed, synthesized, and tested to determine the plasticity of a channel-forming sequence and to define whether channel pores with enhanced conductive properties could be created. Sixteen sequences were examined for aqueous solubility, solution-association tendency, secondary structure, and half-maximal concentration for supramolecular assembly, channel activity, and ion transport properties across epithelial monolayers. All peptides interact strongly with membranes: associating with, inserting across, and assembling to form homooligomeric bundles when in micromolar concentrations. Single and double amino acid replacements involving arginine and/or aromatic amino acids within the final five C-terminal residues of the peptide cause dramatic effects on the concentration dependence, yielding a range of K1/2 values from 36 ± 5 to 390 ± 220 μM for transport activity. New water/lipid interfacial boundaries were established for the transmembrane segment using charged or aromatic amino acids, thus limiting the peptides' ability to move perpendicularly to the plane of the bilayer. Formation of discrete water/lipid interfacial boundaries appears to be necessary for efficient supramolecular assembly and high anion transport activity. A peptide sequence is identified that may show efficacy in channel replacement therapy for channelopathies such as cystic fibrosis. PMID:16387776
Sequence heuristics to encode phase behaviour in intrinsically disordered protein polymers

PubMed Central

Quiroz, Felipe García; Chilkoti, Ashutosh

2015-01-01

Proteins and synthetic polymers that undergo aqueous phase transitions mediate self-assembly in nature and in man-made material systems. Yet little is known about how the phase behaviour of a protein is encoded in its amino acid sequence. Here, by synthesizing intrinsically disordered, repeat proteins to test motifs that we hypothesized would encode phase behaviour, we show that the proteins can be designed to exhibit tunable lower or upper critical solution temperature (LCST and UCST, respectively) transitions in physiological solutions. We also show that mutation of key residues at the repeat level abolishes phase behaviour or encodes an orthogonal transition. Furthermore, we provide heuristics to identify, at the proteome level, proteins that might exhibit phase behaviour and to design novel protein polymers consisting of biologically active peptide repeats that exhibit LCST or UCST transitions. These findings set the foundation for the prediction and encoding of phase behaviour at the sequence level. PMID:26390327
Single Amino Acid Substitutions at Specific Positions of the Heptad Repeat Sequence of Piscidin-1 Yielded Novel Analogs That Show Low Cytotoxicity and In Vitro and In Vivo Antiendotoxin Activity

PubMed Central

Kumar, Amit; Tripathi, Amit Kumar; Kathuria, Manoj; Shree, Sonal; Tripathi, Jitendra Kumar; Purshottam, R. K.; Ramachandran, Ravishankar; Mitra, Kalyan

2016-01-01

Piscidin-1 possesses significant antimicrobial and cytotoxic activities. To recognize the primary amino acid sequence(s) in piscidin-1 that could be important for its biological activity, a long heptad repeat sequence located in the region from amino acids 2 to 19 was identified. To comprehend the possible role of this motif, six analogs of piscidin-1 were designed by selectively replacing a single isoleucine residue at a d (5th) position or at an a (9th or 16th) position with either an alanine or a valine residue. Two more analogs, namely, I5F,F6A-piscidin-1 and V12I-piscidin-1, were designed for investigating the effect of interchanging an alanine residue at a d position with an adjacent phenylalanine residue and replacing a valine residue with an isoleucine residue at another d position of the heptad repeat of piscidin-1, respectively. Single alanine-substituted analogs exhibited significantly reduced cytotoxicity against mammalian cells compared with that of piscidin-1 but appreciably retained the antibacterial and antiendotoxin activities of piscidin-1. All the single valine-substituted piscidin-1 analogs and I5F,F6A-piscidin-1 showed cytotoxicity greater than that of the corresponding alanine-substituted analogs, antibacterial activity marginally greater than or similar to that of the corresponding alanine-substituted analogs, and also antiendotoxin activity superior to that of the corresponding alanine-substituted analogs. Interestingly, among these peptides, V12I-piscidin-1 showed the highest cytotoxicity and antibacterial and antiendotoxin activities. Lipopolysaccharide (12 mg/kg of body weight)-treated mice, further treated with I16A-piscidin-1, the piscidin-1 analog with the highest therapeutic index, at a single dose of 1 or 2 mg/kg of body weight, showed 80 and 100% survival, respectively. Structural and functional characterization of these peptides revealed the basis of their biological activity and demonstrated that nontoxic piscidin-1 analogs with significant antimicrobial and antiendotoxin activities can be designed by incorporating single alanine substitutions in the piscidin-1 heptad repeat. PMID:27067326
Gene encoding a novel extracellular metalloprotease in Bacillus subtilis.

PubMed Central

Sloma, A; Rudolph, C F; Rufo, G A; Sullivan, B J; Theriault, K A; Ally, D; Pero, J

1990-01-01

The gene for a novel extracellular metalloprotease was cloned, and its nucleotide sequence was determined. The gene (mpr) encodes a primary product of 313 amino acids that has little similarity to other known Bacillus proteases. The amino acid sequence of the mature protease was preceded by a signal sequence of approximately 34 amino acids and a pro sequence of 58 amino acids. Four cysteine residues were found in the deduced amino acid sequence of the mature protein, indicating the possible presence of disulfide bonds. The mpr gene mapped in the cysA-aroI region of the chromosome and was not required for growth or sporulation. Images FIG. 2 FIG. 7 PMID:2105291
Cloning and characterization of the SERK1 gene in triploid Pingyi Tiancha [Malus hupehensis (Pamp.) Rehd. var. pingyiensis Jiang] and a tetraploid hybrid strain.

PubMed

Zhang, L J; Dong, W X; Guo, S M; Wang, Y X; Wang, A D; Lu, X J

2015-11-19

This study aims to explore the roles of somatic embryogenesis receptor-like kinase (SERK) in Malus hupehensis (Pingyi Tiancha). The full-length sequences of SERK1 in triploid Pingyi Tiancha (3n) and a tetraploid hybrid strain 33# (4n) were cloned, sequenced, and designated as MhSERK1 and MhdSERK1, respectively. Multiple alignments of amino acid sequences were conducted to identify similarity between MhSERK1 and MhdSERK1 and SERK sequences in other species, and a neighbor-joining phylogenetic tree was constructed to elucidate their phylogenetic relations. Expression levels of MhSERK1 and MhdSERK1 in different tissues and developmental stages were investigated using quantitative real-time PCR. The coding sequence lengths of MhSERK1 and MhdSERK1 were 1899 bp (encoding 632 amino acids) and 1881 bp (encoding 626 amino acids), respectively. Sequence analysis demonstrated that MhSERK1 and MhdSERK1 display high similarity to SERKs in other species, with a conserved intron/exon structure that is unique to members of the SERK family. Additionally, the phylogenetic tree showed that MhSERK1 and MhdSERK1 clustered with orange CitSERK (93%). Furthermore, MhSERK1 and MhdSERK1 were mainly expressed in the reproductive organs, in particular the ovary. Their expression levels were highest in young flowers and they differed among different tissues and organs. Our results suggest that MhSERK1 and MhdSERK1 are related to plant reproduction, and that MhSERK1 is related to apomixis in triploid Pingyi Tiancha.

Statistical analysis of native contact formation in the folding of designed model proteins

NASA Astrophysics Data System (ADS)

Tiana, Guido; Broglia, Ricardo A.

2001-02-01

The time evolution of the formation probability of native bonds has been studied for designed sequences which fold fast into the native conformation. From this analysis a clear hierarchy of bonds emerge: (a) local, fast forming highly stable native bonds built by some of the most strongly interacting amino acids of the protein; (b) nonlocal bonds formed late in the folding process, in coincidence with the folding nucleus, and involving essentially the same strongly interacting amino acids already participating in the fast bonds; (c) the rest of the native bonds whose behavior is subordinated, to a large extent, to that of the strong local and nonlocal native contacts.
Characterization of shark complement factor I gene(s): genomic analysis of a novel shark-specific sequence.

PubMed

Shin, Dong-Ho; Webb, Barbara M; Nakao, Miki; Smith, Sylvia L

2009-07-01

Complement factor I is a crucial regulator of mammalian complement activity. Very little is known of complement regulators in non-mammalian species. We isolated and sequenced four highly similar complement factor I cDNAs from the liver of the nurse shark (Ginglymostoma cirratum), designated as GcIf-1, GcIf-2, GcIf-3 and GcIf-4 (previously referred to as nsFI-a, -b, -c and -d) which encode 689, 673, 673 and 657 amino acid residues, respectively. They share 95% (
Characterization of shark complement factor I gene(s): genomic analysis of a novel shark-specific sequence

PubMed Central

Shin, Dong-Ho; Webb, Barbara M.; Nakao, Miki; Smith, Sylvia L.

2009-01-01

Complement factor I is a crucial regulator of mammalian complement activity. Very little is known of complement regulators in non-mammalian species. We isolated and sequenced four highly similar complement factor I cDNAs from the liver of the nurse shark (Ginglymostoma cirratum), designated as GcIf-1, GcIf-2, GcIf-3 and GcIf-4 (previously referred to as nsFI-a, -b, -c and –d) which encode 689, 673, 673 and 657 amino acid residues, respectively. They share 95% (≤) amino acid identities with each other, 35.4 ~ 39.6% and 62.8 ~ 65.9% with factor I of mammals and banded houndshark (Triakis scyllium), respectively. The modular structure of the GcIf is similar to that of mammals with one notable exception, the presence of a novel shark-specific sequence between the leader peptide (LP) and the factor I membrane attack complex (FIMAC) domain. The cDNA sequences differ only in the size and composition of the shark-specific region (SSR). Sequence analysis of each SSR has identified within the region two novel short sequences (SS1 and SS2) and three repeat sequences (RS1, 2 and 3). Genomic analysis has revealed the existence of three introns between the leader peptide and the FIMAC domain, tentatively designated intron 1, intron 2, and intron 3 which span 4067, 2293 and 2082 bp, respectively. Southern blot analysis suggests the presence of a single gene copy for each cDNA type. Phylogenetic analysis suggests that complement factor I of cartilaginous fish diverged prior to the emergence of mammals. All four GcIf cDNA species are expressed in four different tissues and the liver is the main tissue in which expression level of all four is high. This suggests that the expression of GcIf isotypes is tissue-dependent. PMID:19423168
Thermophilic cellobiohydrolase

DOEpatents

Sapra, Rajat; Park, Joshua I.; Datta, Supratim; Simmons, Blake A.

2017-04-18

The present invention provides for a composition comprising a polypeptide comprising a first amino acid sequence having at least 70% identity with the amino acid sequence of Csac GH5 wherein said first amino acid sequence has a thermostable or thermophilic cellobiohydrolase (CBH) or exoglucanase activity.
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, M.S.

1998-08-18

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device. 27 figs.
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.; Wang, Chunwei; Jevons, Luis C.; Bernhart, Derek H.; Lipshutz, Robert J.

2004-05-11

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.

1998-08-18

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.

2003-08-19

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Logic gates and antisense DNA devices operating on a translator nucleic Acid scaffold.

PubMed

Shlyahovsky, Bella; Li, Yang; Lioubashevski, Oleg; Elbaz, Johann; Willner, Itamar

2009-07-28

A series of logic gates, "AND", "OR", and "XOR", are designed using a DNA scaffold that includes four "footholds" on which the logic operations are activated. Two of the footholds represent input-recognition strands, and these are blocked by complementary nucleic acids, whereas the other two footholds are blocked by nucleic acids that include the horseradish peroxidase (HRP)-mimicking DNAzyme sequence. The logic gates are activated by either nucleic acid inputs that hybridize to the respective "footholds", or by low-molecular-weight inputs (adenosine monophosphate or cocaine) that yield the respective aptamer-substrate complexes. This results in the respective translocation of the blocking nucleic acids to the footholds carrying the HRP-mimicking DNAzyme sequence, and the concomitant release of the respective DNAzyme. The released product-strands then self-assemble into the hemin/G-quadruplex-HRP-mimicking DNAzyme that biocatalyzes the formation of a colored product and provides an output signal for the different logic gates. The principle of the logic operation is, then, implemented as a possible paradigm for future nanomedicine. The nucleic acid inputs that bind to the blocked footholds result in the translocation of the blocking nucleic acids to the respective footholds carrying the antithrombin aptamer. The released aptamer inhibits, then, the hydrolytic activity of thrombin. The system demonstrates the regulation of a biocatalytic reaction by a translator system activated on a DNA scaffold.
Introduction on Using the FastPCR Software and the Related Java Web Tools for PCR and Oligonucleotide Assembly and Analysis.

PubMed

Kalendar, Ruslan; Tselykh, Timofey V; Khassenov, Bekbolat; Ramanculov, Erlan M

2017-01-01

This chapter introduces the FastPCR software as an integrated tool environment for PCR primer and probe design, which predicts properties of oligonucleotides based on experimental studies of the PCR efficiency. The software provides comprehensive facilities for designing primers for most PCR applications and their combinations. These include the standard PCR as well as the multiplex, long-distance, inverse, real-time, group-specific, unique, overlap extension PCR for multi-fragments assembling cloning and loop-mediated isothermal amplification (LAMP). It also contains a built-in program to design oligonucleotide sets both for long sequence assembly by ligase chain reaction and for design of amplicons that tile across a region(s) of interest. The software calculates the melting temperature for the standard and degenerate oligonucleotides including locked nucleic acid (LNA) and other modifications. It also provides analyses for a set of primers with the prediction of oligonucleotide properties, dimer and G/C-quadruplex detection, linguistic complexity as well as a primer dilution and resuspension calculator. The program consists of various bioinformatical tools for analysis of sequences with the GC or AT skew, CG% and GA% content, and the purine-pyrimidine skew. It also analyzes the linguistic sequence complexity and performs generation of random DNA sequence as well as restriction endonucleases analysis. The program allows to find or create restriction enzyme recognition sites for coding sequences and supports the clustering of sequences. It performs efficient and complete detection of various repeat types with visual display. The FastPCR software allows the sequence file batch processing that is essential for automation. The program is available for download at http://primerdigital.com/fastpcr.html , and its online version is located at http://primerdigital.com/tools/pcr.html .
Molecular characterization of long direct repeat (LDR) sequences expressing a stable mRNA encoding for a 35-amino-acid cell-killing peptide and a cis-encoded small antisense RNA in Escherichia coli.

PubMed

Kawano, Mitsuoki; Oshima, Taku; Kasai, Hiroaki; Mori, Hirotada

2002-07-01

Genome sequence analyses of Escherichia coli K-12 revealed four copies of long repetitive elements. These sequences are designated as long direct repeat (LDR) sequences. Three of the repeats (LDR-A, -B, -C), each approximately 500 bp in length, are located as tandem repeats at 27.4 min on the genetic map. Another copy (LDR-D), 450 bp in length and nearly identical to LDR-A, -B and -C, is located at 79.7 min, a position that is directly opposite the position of LDR-A, -B and -C. In this study, we demonstrate that LDR-D encodes a 35-amino-acid peptide, LdrD, the overexpression of which causes rapid cell killing and nucleoid condensation of the host cell. Northern blot and primer extension analysis showed constitutive transcription of a stable mRNA (approximately 370 nucleotides) encoding LdrD and an unstable cis-encoded antisense RNA (approximately 60 nucleotides), which functions as a trans-acting regulator of ldrD translation. We propose that LDR encodes a toxin-antitoxin module. LDR-homologous sequences are not pre-sent on any known plasmids but are conserved in Salmonella and other enterobacterial species.
Labeled nucleotide phosphate (NP) probes

DOEpatents

Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

2009-02-03

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Enriching peptide libraries for binding affinity and specificity through computationally directed library design

PubMed Central

Foight, Glenna Wink; Chen, T. Scott; Richman, Daniel; Keating, Amy E.

2017-01-01

Peptide reagents with high affinity or specificity for their target protein interaction partner are of utility for many important applications. Optimization of peptide binding by screening large libraries is a proven and powerful approach. Libraries designed to be enriched in peptide sequences that are predicted to have desired affinity or specificity characteristics are more likely to yield success than random mutagenesis. We present a library optimization method in which the choice of amino acids to encode at each peptide position can be guided by available experimental data or structure-based predictions. We discuss how to use analysis of predicted library performance to inform rounds of library design. Finally, we include protocols for more complex library design procedures that consider the chemical diversity of the amino acids at each peptide position and optimize a library score based on a user-specified input model. PMID:28236241
Enriching Peptide Libraries for Binding Affinity and Specificity Through Computationally Directed Library Design.

PubMed

Foight, Glenna Wink; Chen, T Scott; Richman, Daniel; Keating, Amy E

2017-01-01

Peptide reagents with high affinity or specificity for their target protein interaction partner are of utility for many important applications. Optimization of peptide binding by screening large libraries is a proven and powerful approach. Libraries designed to be enriched in peptide sequences that are predicted to have desired affinity or specificity characteristics are more likely to yield success than random mutagenesis. We present a library optimization method in which the choice of amino acids to encode at each peptide position can be guided by available experimental data or structure-based predictions. We discuss how to use analysis of predicted library performance to inform rounds of library design. Finally, we include protocols for more complex library design procedures that consider the chemical diversity of the amino acids at each peptide position and optimize a library score based on a user-specified input model.
Use of a Designed Peptide Array To Infer Dissociation Trends for Nontryptic Peptides in Quadrupole Ion Trap and Quadrupole Time-of-Flight Mass Spectrometry

DOE PAGES

Gaucher, Sara P.; Morrow, Jeffrey A.; Faulon, Jean-Loup M.

2007-09-14

Observed peptide gas-phase fragmentation patterns are a complex function of many variables. In order to systematically probe this phenomenon, an array of 40 peptides was synthesized for study. The array of sequences was designed to hold certain variables (peptide length) constant and randomize or balance others (peptide amino acid distribution and position). A high-quality tandem mass spectrometry (MS/MS) data set was acquired for each peptide for all observed charge states on multiple MS instruments, quadrupole-time-of-flight and quadrupole ion trap. The data were analyzed as a function of total charge state and number of mobile protons. Previously known dissociation trends weremore » observed, validating our approach. In addition, the general influence of basic amino acids on dissociation could be determined because, in contrast to the more widely studied tryptic peptides, the amino acids H, K, and R were positionally distributed. Interestingly, our results suggest that cleavage at all basic amino acids is suppressed when a mobile proton is available. Cleavage at H becomes favored only under conditions where a partially mobile proton is present, a caveat to the previously reported trend of enhanced cleavage at H. In conclusion, all acquired data were used as a benchmark to determine how well these sequences would have been identified in a database search using a common algorithm, Mascot.« less
Biosynthesis of Lipoic Acid in Arabidopsis: Cloning and Characterization of the cDNA for Lipoic Acid Synthase1

PubMed Central

Yasuno, Rie; Wada, Hajime

1998-01-01

Lipoic acid is a coenzyme that is essential for the activity of enzyme complexes such as those of pyruvate dehydrogenase and glycine decarboxylase. We report here the isolation and characterization of LIP1 cDNA for lipoic acid synthase of Arabidopsis. The Arabidopsis LIP1 cDNA was isolated using an expressed sequence tag homologous to the lipoic acid synthase of Escherichia coli. This cDNA was shown to code for Arabidopsis lipoic acid synthase by its ability to complement a lipA mutant of E. coli defective in lipoic acid synthase. DNA-sequence analysis of the LIP1 cDNA revealed an open reading frame predicting a protein of 374 amino acids. Comparisons of the deduced amino acid sequence with those of E. coli and yeast lipoic acid synthase homologs showed a high degree of sequence similarity and the presence of a leader sequence presumably required for import into the mitochondria. Southern-hybridization analysis suggested that LIP1 is a single-copy gene in Arabidopsis. Western analysis with an antibody against lipoic acid synthase demonstrated that this enzyme is located in the mitochondrial compartment in Arabidopsis cells as a 43-kD polypeptide. PMID:9808738
Initial cloning and sequencing of hydHG, an operon homologous to ntrBC and regulating the labile hydrogenase activity in Escherichia coli K-12.

PubMed Central

Stoker, K; Reijnders, W N; Oltmann, L F; Stouthamer, A H

1989-01-01

To isolate genes from Escherichia coli which regulate the labile hydrogenase activity, a plasmid library was used to transform hydL mutants lacking the labile hydrogenase. A single type of gene, designated hydG, was isolated. This gene also partially restored the hydrogenase activity in hydF mutants (which are defective in all hydrogenase isoenzymes), although the low hydrogenase 1 and 2 levels were not induced. Therefore, hydG apparently regulates, specifically, the labile hydrogenase activity. Restoration of this latter activity in hydF mutants was accompanied by a proportional increase of the H2 uptake activity, suggesting a functional relationship. H2:fumarate oxidoreductase activity was not restored in complemented hydL mutants. These latter strains may therefore lack, in addition to the labile hydrogenase, a second component (provisionally designated component R), possibly an electron carrier coupling H2 oxidation to the anerobic respiratory chain. Sequence analysis showed an open reading frame of 1,314 base pairs for hydG. It was preceded by a ribosome-binding site but apparently lacked a promoter. Minicell experiments revealed a single polypeptide of approximately 50 kilodaltons. Comparison of the predicted amino acid sequence with a protein sequence data base revealed strong homology to NtrC from Klebsiella pneumoniae, a DNA-binding transcriptional activator. The 411 base pairs upstream from pHG40 contained a second open reading frame overlapping hydG by four bases. The deduced amino acid sequence showed considerable homology with the C-terminal part of NtrB. This sequence was therefore assumed to be part of a second gene, encoding the NtrB-like component, and was designated hydH. The labile hydrogenase activity in E. coli is apparently regulated by a multicomponent system analogous to the NtrB-NtrC system. This conclusion is in agreement with the results of Birkmann et al. (A. Birkmann, R. G. Sawers, and A. Böck, Mol. Gen. Genet. 210:535-542, 1987), who demonstrated ntrA dependence for the labile hydrogenase activity. Images PMID:2666400
RECOMBINANT ANDROGEN RECEPTOR (AR) BINDING ACROSS VERTEBRATE SPECIES: COMPARISON OF BINDING OF ENVIRONMENTAL COMPOUNDS TO HUMAN, RAINBOW TROUT AND FATHEAD MINNOW AR.

EPA Science Inventory

In vitro screening assays designed to identify androgen mimics or antagonists typically use mammalian (rat, human) androgen receptors (AR). Although the amino acid sequences of receptors from nonmammalian vertebrates are not identical to the mammalian receptors, it is uncertain ...
Fermentation of Corn Fiber Hydrolysate to Lactic Acid by the Moderate Thermophile Bacillus coagulans

USDA-ARS?s Scientific Manuscript database

Composted manure from a dairy farm in Texas was examined for thermophilic microorganisms by enrichment in xylose broth medium. Forty randomly picked isolates were identified as strains of Bacillus coagulans by sequence analysis of rRNA genes. One strain, designated as MXL-9, could convert mixed su...
Identification of a novel gene cluster participating in menaquinone (vitamin K2) biosynthesis. Cloning and sequence determination of the 2-heptaprenyl-1,4-naphthoquinone methyltransferase gene of Bacillus stearothermophilus.

PubMed

Koike-Takeshita, A; Koyama, T; Ogura, K

1997-05-09

We recently described the isolation and sequence analysis of a DNA region containing the genes of Bacillus stearothermophilus heptaprenyl diphosphate synthase, which catalyzes the synthesis of the prenyl side chain of menaquinone-7 of this bacterium. Sequence analyses revealed the presence of three open reading frames (ORFs), designated as ORF-1, ORF-2, and ORF-3, and the structural genes of the heptaprenyl diphosphate synthase were proved to consist of ORF-1 (heps-1) and ORF-3 (heps-2) (Koike-Takeshita, A., Koyama, T., Obata, S., and Ogura, K. (1995) J. Biol. Chem. 270, 18396-18400). The predicted amino acid sequence of ORF-2 (234 amino acids) contains a methyltransferase consensus sequence and shows a 22% identity with UbiG of Escherichia coli, which catalyzes S-adenosyl-L-methionine-dependent methylation of 2-octaprenyl-3-methyl-5-hydroxy-6-methoxy-1,4-benzoquinone. These pieces of information led us to identify the ORF-2 gene product. The cell-free homogenate of the transformant of E. coli with an expression vector of ORF-2 catalyzed the incorporation of S-adenosyl-L-methionine into menaquinone-8, indicating that ORF-2 encodes 2-heptaprenyl-1,4-naphthoquinone methyltransferase, which participates in the terminal step of the menaquinone biosynthesis. Thus it is concluded that the ORF-1, ORF-2, and ORF-3 genes, designated heps-1, menG, and heps-2, respectively, form another cluster involved in menaquinone biosynthesis in addition to the cluster of menB, menC, menD, and menE already identified in the Bacillus subtilis and E. coli chromosomes.

Gordonia caeni sp. nov., isolated from sludge of a sewage disposal plant.

PubMed

Srinivasan, Sathiyaraj; Park, Giho; Yang, Hyejin; Hwang, Supyong; Bae, Yoonjung; Jung, Yong-An; Kim, Myung Kyum; Lee, Myungjin

2012-11-01

A Gram-stain-positive, strictly aerobic, short-rod-shaped, non-motile strain (designated MJ32(T)) was isolated from a sludge sample of the Daejeon sewage disposal plant in South Korea. A polyphasic approach was applied to study the taxonomic position of strain MJ32(T). Strain MJ32(T) showed highest 16S rRNA gene sequence similarity to Gordonia hirsuta DSM 44140(T) (98.1%) and Gordonia hydrophobica DSM 44015(T) (97.0%); levels of sequence similarity to the type strains of other recognized Gordonia species were less than 97.0%. Phylogenetic analysis based on 16S rRNA gene sequences showed that strain MJ32(T) belonged to the clade formed by members of the genus Gordonia in the family Gordoniaceae. The G+C content of the genomic DNA of strain MJ32(T) was 69.2 mol%. Chemotaxonomically, strain MJ32(T) showed features typical of the genus Gordonia. The predominant respiratory quinone was MK-9(H(2)), the mycolic acids present had C(56)-C(60) carbon atoms, and the major fatty acids were C(16:0) (34.6%), tuberculostearic acid (21.8%), C(16:1)ω7c (19.5%) and C(18:1)ω9c (12.7%). The peptidoglycan type was based on meso-2,6-diaminopimelic acid as the diagnostic diamino acid with glycolated sugars. On the basis of phylogenetic inference, fatty acid profile and other phenotypic properties, strain MJ32(T) is considered to represent a novel species of the genus Gordonia, for which the name Gordonia caeni sp. nov. is proposed. The type strain is MJ32(T) (=KCTC 19771(T)=JCM 16923(T)).
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.

1999-10-26

A computer system (1) for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area (814) and sample sequences in another area (816) on a display device (3).
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.

2001-06-05

A computer system (1) for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area (814) and sample sequences in another area (816) on a display device (3).
Carbohydrate degrading polypeptide and uses thereof

DOEpatents

Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

2015-10-20

The invention relates to a polypeptide having carbohydrate material degrading activity which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 4, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional protein and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Epitopes of human testis-specific lactate dehydrogenase deduced from a cDNA sequence

DOE Office of Scientific and Technical Information (OSTI.GOV)

Millan, J.L.; Driscoll, C.E.; LeVan, K.M.

The sequence and structure of human testis-specific L-lactate dehydrogenase (LDHC/sub 4/, LDHX; (L)-lactate:NAD/sup +/ oxidoreductase, EC 1.1.1.27) has been derived from analysis of a complementary DNA (cDNA) clone comprising the complete protein coding region of the enzyme. From the deduced amino acid sequence, human LDHC/sub 4/ is as different from rodent LDHC/sub 4/ (73% homology) as it is from human LDHA/sub 4/ (76% homology) and porcine LDHB/sub 4/ (68% homology). Subunit homologies are consistent with the conclusion that the LDHC gene arose by at least two independent duplication events. Furthermore, the lower degree of homology between mouse and human LDHC/submore » 4/ and the appearance of this isozyme late in evolution suggests a higher rate of mutation in the mammalian LDHC genes than in the LDHA and -B genes. Comparison of exposed amino acid residues of discrete anti-genic determinants of mouse and human LDHC/sub 4/ reveals significant differences. Knowledge of the human LDHC/sub 4/ sequence will help design human-specific peptides useful in the development of a contraceptive vaccine.« less
Neisseria arctica sp. nov. isolated from nonviable eggs of greater white-fronted geese (Anser albifrons) in Arctic Alaska

USGS Publications Warehouse

Hansen, Cristina M.; Himschoot, Elizabeth; Hare, Rebekah F.; Meixell, Brandt W.; Van Hemert, Caroline R.; Hueffer, Karsten

2017-01-01

During the summers of 2013 and 2014, isolates of a novel Gram-negative coccus in the Neisseria genus were obtained from the contents of nonviable greater white-fronted goose (Anser albifrons) eggs on the Arctic Coastal Plain of Alaska. We used a polyphasic approach to determine whether these isolates represent a novel species. 16S rRNA gene sequences, 23S rRNA gene sequences, and chaperonin 60 gene sequences suggested that these Alaskan isolates are members of a distinct species that is most closely related to Neisseria canis, N. animaloris, and N. shayeganii. Analysis of the rplF gene additionally showed that our isolates are unique and most closely related to N. weaveri. Average nucleotide identity of the whole genome sequence of our type strain was between 71.5% and 74.6% compared to close relatives, further supporting designation as a novel species. Fatty acid methyl ester analysis showed a predominance of C14:0, C16:0, and C16:1ω7c fatty acids. Finally, biochemical characteristics distinguished our isolates from other Neisseria species. The name Neisseria arctica (type strain KH1503T = ATCC TSD-57T = DSM 103136T) is proposed.
[Sequence analysis of LEAFY homologous gene from Dendrobium moniliforme and application for identification of medicinal Dendrobium].

PubMed

Xing, Wen-Rui; Hou, Bei-Wei; Guan, Jing-Jiao; Luo, Jing; Ding, Xiao-Yu

2013-04-01

The LEAFY (LFY) homologous gene of Dendrobium moniliforme (L.) Sw. was cloned by new primers which were designed based on the conservative region of known sequences of orchid LEAFY gene. Partial LFY homologous gene was cloned by common PCR, then we got the complete LFY homologous gene Den LFY by Tail-PCR. The complete sequence of DenLFY gene was 3 575 bp which contained three exons and two introns. Using BLAST method, comparison analysis among the exon of LFY homologous gene indicted that the DenLFY gene had high identity with orchids LFY homologous, including the related fragment of PhalLFY (84%) in Phalaenopsis hybrid cultivar, LFY homologous gene in Oncidium (90%) and in other orchid (over 80%). Using MP analysis, Dendrobium is found to be the sister to Oncidium and Phalaenopsis. Homologous analysis demonstrated that the C-terminal amino acids were highly conserved. When the exons and introns were separately considered, exons and the sequence of amino acid were good markers for the function research of DenLFY gene. The second intron can be used in authentication research of Dendrobium based on the length polymorphism between Dendrobium moniliforme and Dendrobium officinale.
Automated Sanger Analysis Pipeline (ASAP): A Tool for Rapidly Analyzing Sanger Sequencing Data with Minimum User Interference.

PubMed

Singh, Aditya; Bhatia, Prateek

2016-12-01

Sanger sequencing platforms, such as applied biosystems instruments, generate chromatogram files. Generally, for 1 region of a sequence, we use both forward and reverse primers to sequence that area, in that way, we have 2 sequences that need to be aligned and a consensus generated before mutation detection studies. This work is cumbersome and takes time, especially if the gene is large with many exons. Hence, we devised a rapid automated command system to filter, build, and align consensus sequences and also optionally extract exonic regions, translate them in all frames, and perform an amino acid alignment starting from raw sequence data within a very short time. In full capabilities of Automated Mutation Analysis Pipeline (ASAP), it is able to read "*.ab1" chromatogram files through command line interface, convert it to the FASTQ format, trim the low-quality regions, reverse-complement the reverse sequence, create a consensus sequence, extract the exonic regions using a reference exonic sequence, translate the sequence in all frames, and align the nucleic acid and amino acid sequences to reference nucleic acid and amino acid sequences, respectively. All files are created and can be used for further analysis. ASAP is available as Python 3.x executable at https://github.com/aditya-88/ASAP. The version described in this paper is 0.28.
Nucleic acid analysis using terminal-phosphate-labeled nucleotides

DOEpatents

Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

2008-04-22

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Can natural proteins designed with 'inverted' peptide sequences adopt native-like protein folds?

PubMed

Sridhar, Settu; Guruprasad, Kunchur

2014-01-01

We have carried out a systematic computational analysis on a representative dataset of proteins of known three-dimensional structure, in order to evaluate whether it would possible to 'swap' certain short peptide sequences in naturally occurring proteins with their corresponding 'inverted' peptides and generate 'artificial' proteins that are predicted to retain native-like protein fold. The analysis of 3,967 representative proteins from the Protein Data Bank revealed 102,677 unique identical inverted peptide sequence pairs that vary in sequence length between 5-12 and 18 amino acid residues. Our analysis illustrates with examples that such 'artificial' proteins may be generated by identifying peptides with 'similar structural environment' and by using comparative protein modeling and validation studies. Our analysis suggests that natural proteins may be tolerant to accommodating such peptides.
Formation of conjugated delta8,delta10-double bonds by delta12-oleic-acid desaturase-related enzymes: biosynthetic origin of calendic acid.

PubMed

Cahoon, E B; Ripp, K G; Hall, S E; Kinney, A J

2001-01-26

Divergent forms of the plant Delta(12)-oleic-acid desaturase (FAD2) have previously been shown to catalyze the formation of acetylenic bonds, epoxy groups, and conjugated Delta(11),Delta(13)-double bonds by modification of an existing Delta(12)-double bond in C(18) fatty acids. Here, we report a class of FAD2-related enzymes that modifies a Delta(9)-double bond to produce the conjugated trans-Delta(8),trans-Delta(10)-double bonds found in calendic acid (18:3Delta(8trans,10trans,12cis)), the major component of the seed oil of Calendula officinalis. Using an expressed sequence tag approach, cDNAs for two closely related FAD2-like enzymes, designated CoFADX-1 and CoFADX-2, were identified from a C. officinalis developing seed cDNA library. The deduced amino acid sequences of these polypeptides share 40-50% identity with those of other FAD2 and FAD2-related enzymes. Expression of either CoFADX-1 or CoFADX-2 in somatic soybean embryos resulted in the production of calendic acid. In embryos expressing CoFADX-2, calendic acid accumulated to as high as 22% (w/w) of the total fatty acids. In addition, expression of CoFADX-1 and CoFADX-2 in Saccharomyces cerevisiae was accompanied by calendic acid accumulation when induced cells were supplied exogenous linoleic acid (18:2Delta(9cis,12cis)). These results are thus consistent with a route of calendic acid synthesis involving modification of the Delta(9)-double bond of linoleic acid. Regiospecificity for Delta(9)-double bonds is unprecedented among FAD2-related enzymes and further expands the functional diversity found in this family of enzymes.
Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

DOEpatents

Studier, F. William

1995-04-18

Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.
Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

DOEpatents

Studier, F.W.

1995-04-18

Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.
Variability of the protein sequences of lcrV between epidemic and atypical rhamnose-positive strains of Yersinia pestis.

PubMed

Anisimov, Andrey P; Panfertsev, Evgeniy A; Svetoch, Tat'yana E; Dentovskaya, Svetlana V

2007-01-01

Sequencing of lcrV genes and comparison of the deduced amino acid sequences from ten Y. pestis strains belonging mostly to the group of atypical rhamnose-positive isolates (non-pestis subspecies or pestoides group) showed that the LcrV proteins analyzed could be classified into five sequence types. This classification was based on major amino acid polymorphisms among LcrV proteins in the four "hot points" of the protein sequences. Some additional minor polymorphisms were found throughout these sequence types. The "hot points" corresponded to amino acids 18 (Lys --> Asn), 72 (Lys --> Arg), 273 (Cys --> Ser), and 324-326 (Ser-Gly-Lys --> Arg) in the LcrV sequence of the reference Y. pestis strain CO92. One possible explanation for polymorphism in amino acid sequences of LcrV among different strains is that strain-specific variation resulted from adaptation of the plague pathogen to different rodent and lagomorph hosts.
A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses

USDA-ARS?s Scientific Manuscript database

Background: Next-generation sequencing (NGS) allows ultra-deep sequencing of nucleic acids. The use of sequence-independent amplification of viral nucleic acids without utilization of target-specific primers provides advantages over traditional sequencing methods and allows detection of unsuspected ...
Evidence of Divergent Amino Acid Usage in Comparative Analyses of R5- and X4-Associated HIV-1 Vpr Sequences

PubMed Central

Antell, Gregory C.; Zhong, Wen; Kercher, Katherine; Passic, Shendra; Williams, Jean; Liu, Yucheng; James, Tony; Jacobson, Jeffrey M.; Szep, Zsofia

2017-01-01

Vpr is an HIV-1 accessory protein that plays numerous roles during viral replication, and some of which are cell type dependent. To test the hypothesis that HIV-1 tropism extends beyond the envelope into the vpr gene, studies were performed to identify the associations between coreceptor usage and Vpr variation in HIV-1-infected patients. Colinear HIV-1 Env-V3 and Vpr amino acid sequences were obtained from the LANL HIV-1 sequence database and from well-suppressed patients in the Drexel/Temple Medicine CNS AIDS Research and Eradication Study (CARES) Cohort. Genotypic classification of Env-V3 sequences as X4 (CXCR4-utilizing) or R5 (CCR5-utilizing) was used to group colinear Vpr sequences. To reveal the sequences associated with a specific coreceptor usage genotype, Vpr amino acid sequences were assessed for amino acid diversity and Jensen-Shannon divergence between the two groups. Five amino acid alphabets were used to comprehensively examine the impact of amino acid substitutions involving side chains with similar physiochemical properties. Positions 36, 37, 41, 89, and 96 of Vpr were characterized by statistically significant divergence across multiple alphabets when X4 and R5 sequence groups were compared. In addition, consensus amino acid switches were found at positions 37 and 41 in comparisons of the R5 and X4 sequence populations. These results suggest an evolutionary link between Vpr and gp120 in HIV-1-infected patients. PMID:28620613
Methods of diagnosing alagille syndrome

DOEpatents

Li, Linheng; Hood, Leroy; Krantz, Ian D.; Spinner, Nancy B.

2004-03-09

The present invention provides an isolated polypeptide exhibiting substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the polypeptide does not have the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. The invention further provides an isolated nucleic acid molecule containing a nucleotide sequence encoding substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the nucleotide sequence does not encode the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. Also provided herein is a method of inhibiting differentiation of hematopoietic progenitor cells by contacting the progenitor cells with an isolated JAGGED polypeptide, or active fragment thereof. The invention additionally provides a method of diagnosing Alagille Syndrome in an individual. The method consists of detecting an Alagille Syndrome disease-associated mutation linked to a JAGGED locus.
Complete amino acid sequence of bovine colostrum low-Mr cysteine proteinase inhibitor.

PubMed

Hirado, M; Tsunasawa, S; Sakiyama, F; Niinobe, M; Fujii, S

1985-07-01

The complete amino acid sequence of bovine colostrum cysteine proteinase inhibitor was determined by sequencing native inhibitor and peptides obtained by cyanogen bromide degradation, Achromobacter lysylendopeptidase digestion and partial acid hydrolysis of reduced and S-carboxymethylated protein. Achromobacter peptidase digestion was successfully used to isolate two disulfide-containing peptides. The inhibitor consists of 112 amino acids with an Mr of 12787. Two disulfide bonds were established between Cys 66 and Cys 77 and between Cys 90 and Cys 110. A high degree of homology in the sequence was found between the colostrum inhibitor and human gamma-trace, human salivary acidic protein and chicken egg-white cystatin.
Submesoscale characteristics and transcription of a fatty acid elongase gene from a freshwater green microalgae, Myrmecia incisa Reisigl

NASA Astrophysics Data System (ADS)

Yu, Shuiyan; Liu, Shicheng; Li, Chunyang; Zhou, Zhigang

2011-01-01

Myrmecia incisa is a green coccoid freshwater microalgae, which is rich in arachidonic acid (ArA, C20: 4ω-6, δ5, 8, 11, 14), a long chain polyunsaturated fatty acid (PUFA), especially under nitrogen starvation stress. A cDNA library of M. incisa was constructed with λ phage vectors and a 545 nt expressed sequence tag (EST) was screened from this library as a putative elongase gene due to its 56% and 49% identity to Marchantia polymorpha L. and Ostreococcus tauri Courties et Chrétiennot-Dinet, respectively. Based upon this EST sequence, an elongase gene designated MiFAE was isolated from M. incisa via 5'/3' rapid amplification of cDNA ends (RACE). The cDNA sequence was 1 331 bp long and included a 33 bp 5'-untranslated region (UTR) and a 431 bp 3'-UTR with a typical poly-A tail. The 867 bp ORF encoded a predicted protein of 288 amino acids. This protein was characterized by a conserved histidine-rich box and a MYxYY motif that was present in other members of the elongase family. The genomic DNA sequence of MiFAE was found to be interrupted by three introns with splicing sites of Introns I (81 bp), II (81 bp), and III (67 bp) that conformed to the GT-AG rule. Quantitative real-time PCR showed that the transcription level of MiFAE in this microalga under nitrogen starvation was higher than that under normal condition. Prior to the ArA content accumulation, the transcription of MiFAE was enhanced, suggesting that it was possibly responsible for the ArA accumulation in this microalga cultured under nitrogen starvation conditions.
Molecular cloning of the heat shock protein 20 gene from Paphia textile and its expression in response to heat shock

NASA Astrophysics Data System (ADS)

Li, Jiakai; Wu, Xiangwei; Tan, Jing; Zhao, Ruixiang; Deng, Lingwei; Liu, Xiande

2015-07-01

P. textile is an important aquaculture species in China and is mainly distributed in Fujian, Guangdong, and Guangxi Provinces. In this study, an HSP20 cDNA designated PtHSP20 was cloned from P. textile. The full-length cDNA of PtHSP20 is 1 090 bp long and contains a 5' untranslated region (UTR) of 93 bp, a 3' UTR of 475 bp, and an open reading frame (ORF) of 522 bp. The PtHSP20 cDNA encodes 173 amino acid residues and has a molecular mass of 20.22 kDa and an isoelectric point of 6.2. Its predicted amino acid sequence shows that PtHSP20 contains a typical α-crystallin domain (residues 77-171) and three polyadenylation signal-sequences at the C-terminus. According to an amino acid sequence alignment, PtHSP20 shows moderate homology to other mollusk sHSPs. PtHSP20 mRNA was present in all of the test tissues including the heart, digestive gland, adductor muscle, gonad, gill, and mantle, with the highest concentration found in the gonad. Under the stress of high temperature, the expression of PtHSP20 mRNA was down-regulated in all of the tissues except the adductor muscle and gonad.

Alignment-Annotator web server: rendering and annotating sequence alignments.

PubMed

Gille, Christoph; Fähling, Michael; Weyand, Birgit; Wieland, Thomas; Gille, Andreas

2014-07-01

Alignment-Annotator is a novel web service designed to generate interactive views of annotated nucleotide and amino acid sequence alignments (i) de novo and (ii) embedded in other software. All computations are performed at server side. Interactivity is implemented in HTML5, a language native to web browsers. The alignment is initially displayed using default settings and can be modified with the graphical user interfaces. For example, individual sequences can be reordered or deleted using drag and drop, amino acid color code schemes can be applied and annotations can be added. Annotations can be made manually or imported (BioDAS servers, the UniProt, the Catalytic Site Atlas and the PDB). Some edits take immediate effect while others require server interaction and may take a few seconds to execute. The final alignment document can be downloaded as a zip-archive containing the HTML files. Because of the use of HTML the resulting interactive alignment can be viewed on any platform including Windows, Mac OS X, Linux, Android and iOS in any standard web browser. Importantly, no plugins nor Java are required and therefore Alignment-Anotator represents the first interactive browser-based alignment visualization. http://www.bioinformatics.org/strap/aa/ and http://strap.charite.de/aa/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Molecular cloning of an inducible serine esterase gene from human cytotoxic lymphocytes.

PubMed Central

Trapani, J A; Klein, J L; White, P C; Dupont, B

1988-01-01

A cDNA clone encoding a human serine esterase gene was isolated from a library constructed from poly(A)+ RNA of allogeneically stimulated, interleukin 2-expanded peripheral blood mononuclear cells. The clone, designated HSE26.1, represents a full-length copy of a 0.9-kilobase mRNA present in human cytotoxic cells but absent from a wide variety of noncytotoxic cell lines. Clone HSE26.1 contains an 892-base-pair sequence, including a single 741-base-pair open reading frame encoding a putative 247-residue polypeptide. The first 20 amino acids of the polypeptide form a leader sequence. The mature protein is predicted to have an unglycosylated Mr of approximately equal to 26,000 and contains a single potential site for N-linked glycosylation. The nucleotide and predicted amino acid sequences of clone HSE26.1 are homologous with all murine and human serine esterases cloned thus far but are most similar to mouse granzyme B (70% nucleotide and 68% amino acid identity). HSE26.1 protein is expressed weakly in unstimulated peripheral blood mononuclear cells but is strongly induced within 6-hr incubation in medium containing phytohemagglutinin. The data suggest that the protein encoded by HSE26.1 plays a role in cell-mediated cytotoxicity. Images PMID:3261871
Detection and isolation of nucleic acid sequences using competitive hybridization probes

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1997-01-01

A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided.
Detection and isolation of nucleic acid sequences using competitive hybridization probes

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1997-04-01

A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided. 7 figs.
Role of two alpha-L-arabinofuranosidases in arabinoxylan degradation and characteristics of the encoding genes from shochu koji molds, Aspergillus kawachii and Aspergillus awamori.

PubMed

Koseki, Takuya; Okuda, Masaki; Sudoh, Shigetoshi; Kizaki, Yasuzo; Iwano, Kimio; Aramaki, Isao; Matsuzawa, Hiroshi

2003-01-01

Two different alpha-L-arabinofuranosidases from Aspergillus kawachii were purified and characterized. The two enzymes acted synergically with xylanase in the degradation of arabinoxylan and resulted in an increase in the amount of ferulic acid release by feruloyl esterase. Both enzymes were acidophilic and acid stable enzymes which had an optimum pH of 4.0 and were stable at pH 3.0-7.0. The general properties of the enzymes including pH optima and pH stability were similar to those of Aspergillus awamori. These results suggest that the alpha-L-arabinofuranosidases contribute to an increase in cereal utilization and formation of aroma in shochu brewing. Two different genes encoding alpha-L-arabinofuranosidases from A. kawachii, designated as AkabfA and AkabjB, and those from A. awamori, designated as AwabfA and AwabjB, were also cloned and characterized. The difference between the sequences of AkabfA and AwabfA was only one nucleotide, resulting in an amino acid difference in the sequence, and the enzymes were assigned to family 51 of glycoside hydrolases. On the other hand, the differences between the sequences of AkabjB and AwabjB and between their encoding proteins were two nucleotides and one amino acid residue, respectively, and the enzymes were assigned to family 54 of glycoside hydrolases. On comparison of the abfA and abjB genes among A. kawachii, A. awamori, and A. niger, the relationship between the two genes for A. kawachii and A. awamori was much closer than those between A. niger and the others. Northern analyses showed that transcription of AkabfB was greater than that of AkabfA in the presence of L-arabitol and L-arabinose, and that transcriptions of both genes were not induced in the presence of sucrose and glucose.
Pyviko: an automated Python tool to design gene knockouts in complex viruses with overlapping genes.

PubMed

Taylor, Louis J; Strebel, Klaus

2017-01-07

Gene knockouts are a common tool used to study gene function in various organisms. However, designing gene knockouts is complicated in viruses, which frequently contain sequences that code for multiple overlapping genes. Designing mutants that can be traced by the creation of new or elimination of existing restriction sites further compounds the difficulty in experimental design of knockouts of overlapping genes. While software is available to rapidly identify restriction sites in a given nucleotide sequence, no existing software addresses experimental design of mutations involving multiple overlapping amino acid sequences in generating gene knockouts. Pyviko performed well on a test set of over 240,000 gene pairs collected from viral genomes deposited in the National Center for Biotechnology Information Nucleotide database, identifying a point mutation which added a premature stop codon within the first 20 codons of the target gene in 93.2% of all tested gene-overprinted gene pairs. This shows that Pyviko can be used successfully in a wide variety of contexts to facilitate the molecular cloning and study of viral overprinted genes. Pyviko is an extensible and intuitive Python tool for designing knockouts of overlapping genes. Freely available as both a Python package and a web-based interface ( http://louiejtaylor.github.io/pyViKO/ ), Pyviko simplifies the experimental design of gene knockouts in complex viruses with overlapping genes.
Distinct profiling of antimicrobial peptide families

PubMed Central

Khamis, Abdullah M.; Essack, Magbubah; Gao, Xin; Bajic, Vladimir B.

2015-01-01

Motivation: The increased prevalence of multi-drug resistant (MDR) pathogens heightens the need to design new antimicrobial agents. Antimicrobial peptides (AMPs) exhibit broad-spectrum potent activity against MDR pathogens and kills rapidly, thus giving rise to AMPs being recognized as a potential substitute for conventional antibiotics. Designing new AMPs using current in-silico approaches is, however, challenging due to the absence of suitable models, large number of design parameters, testing cycles, production time and cost. To date, AMPs have merely been categorized into families according to their primary sequences, structures and functions. The ability to computationally determine the properties that discriminate AMP families from each other could help in exploring the key characteristics of these families and facilitate the in-silico design of synthetic AMPs. Results: Here we studied 14 AMP families and sub-families. We selected a specific description of AMP amino acid sequence and identified compositional and physicochemical properties of amino acids that accurately distinguish each AMP family from all other AMPs with an average sensitivity, specificity and precision of 92.88%, 99.86% and 95.96%, respectively. Many of our identified discriminative properties have been shown to be compositional or functional characteristics of the corresponding AMP family in literature. We suggest that these properties could serve as guides for in-silico methods in design of novel synthetic AMPs. The methodology we developed is generic and has a potential to be applied for characterization of any protein family. Contact: vladimir.bajic@kaust.edu.sa Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25388148
The reactivities of human erythrocyte autoantibodies anti-Pr2, anti-Gd, Fl and Sa with gangliosides in a chromatogram binding assay.

PubMed Central

Uemura, K; Roelcke, D; Nagai, Y; Feizi, T

1984-01-01

The thin layer chromatogram binding assay was used to study the reaction of several natural-monoclonal autoantibodies which recognize sialic acid-dependent antigens of human erythrocytes. Immunostaining of gangliosides derived from human and bovine erythrocytes was achieved with four autoantibodies designated anti-Pr2, anti-Gd, Sa and Fl, each of which has a different haemagglutination pattern with untreated and proteinase-treated erythrocytes and with cells of I and i antigen types. From the chromatogram binding patterns of anti-Pr2 with gangliosides of the neolacto and the ganglio series, it is deduced that this antibody reacts best with N-acetylneuraminic acid when it is alpha 2-3- or alpha 2-6-linked to a terminal Gal(beta 1-4)Glc/GlcNAc GlcNAc sequence and to a lesser extent when it is alpha 2-3-linked to a terminal Gal(beta 1-3)GalNAc sequence or to an internal galactose and when it is alpha 2-8-linked to another, internal N-acetylneuraminic acid residue. The other three antibodies differ from anti-Pr2 in their lack of reaction with glycolipids of the ganglio series. They react with the NeuAc(alpha 2-3)Gal(beta 1-4)Glc/GlcNAc sequence as found in GM3 and in glycolipids of the neolacto series, but show a preference for the latter, longer sequences. Thus all four antibodies react with sialylated oligosaccharides containing i type (linear) and I type (branched) neolacto backbones. Fl antibody differs from the other three in its stronger reaction with branched neolacto sequences in accordance with its stronger agglutination of erythrocytes of I rather than i type. The four antibodies show a specificity for N-acetyl- rather than N-glycolyl-neuraminic acid. Images Fig. 1. Fig. 2. Fig. 3. Fig. 4. PMID:6204642
Detection of nucleic acids by multiple sequential invasive cleavages

DOEpatents

Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann D.

1999-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Nucleic acid detection kits

DOEpatents

Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann; Kwiatkowski, Robert W.; Vavra, Stephanie H.

2005-03-29

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of nucleic acid from various viruses in a sample.
Detection of nucleic acids by multiple sequential invasive cleavages 02

DOEpatents

Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann D.

2002-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Detection of nucleic acids by multiple sequential invasive cleavages

DOEpatents

Hall, Jeff G; Lyamichev, Victor I; Mast, Andrea L; Brow, Mary Ann D

2012-10-16

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
GC-rich coding sequences reduce transposon-like, small RNA-mediated transgene silencing.

PubMed

Sidorenko, Lyudmila V; Lee, Tzuu-Fen; Woosley, Aaron; Moskal, William A; Bevan, Scott A; Merlo, P Ann Owens; Walsh, Terence A; Wang, Xiujuan; Weaver, Staci; Glancy, Todd P; Wang, PoHao; Yang, Xiaozeng; Sriram, Shreedharan; Meyers, Blake C

2017-11-01

The molecular basis of transgene susceptibility to silencing is poorly characterized in plants; thus, we evaluated several transgene design parameters as means to reduce heritable transgene silencing. Analyses of Arabidopsis plants with transgenes encoding a microalgal polyunsaturated fatty acid (PUFA) synthase revealed that small RNA (sRNA)-mediated silencing, combined with the use of repetitive regulatory elements, led to aggressive transposon-like silencing of canola-biased PUFA synthase transgenes. Diversifying regulatory sequences and using native microalgal coding sequences (CDSs) with higher GC content improved transgene expression and resulted in a remarkable trans-generational stability via reduced accumulation of sRNAs and DNA methylation. Further experiments in maize with transgenes individually expressing three crystal (Cry) proteins from Bacillus thuringiensis (Bt) tested the impact of CDS recoding using different codon bias tables. Transgenes with higher GC content exhibited increased transcript and protein accumulation. These results demonstrate that the sequence composition of transgene CDSs can directly impact silencing, providing design strategies for increasing transgene expression levels and reducing risks of heritable loss of transgene expression.
G-Quadruplex Induction by the Hairpin Pyrrole-Imidazole Polyamide Dimer.

PubMed

Obata, Shunsuke; Asamitsu, Sefan; Hashiya, Kaori; Bando, Toshikazu; Sugiyama, Hiroshi

2018-02-06

The G-quadruplex (G4) is one type of higher-order structure of nucleic acids and is thought to play important roles in various biological events such as regulation of transcription and inhibition of DNA replication. Pyrrole-imidazole polyamides (PIPs) are programmable small molecules that can sequence-specifically bind with high affinity to the minor groove of double-stranded DNA (dsDNA). Herein, we designed head-to-head hairpin PIP dimers and their target dsDNA in a model G4-forming sequence. Using an electrophoresis mobility shift assay and transcription arrest assay, we found that PIP dimers could induce the structural change to G4 DNA from dsDNA through the recognition by one PIP dimer molecule of two duplex-binding sites flanking both ends of the G4-forming sequence. This induction ability was dependent on linker length. This is the first study to induce G4 formation using PIPs, which are known to be dsDNA binders. The results reported here suggest that selective G4 induction in native sequences may be achieved with PIP dimers by applying the same design strategy.
The design of strain-specific polymerase chain reactions for discrimination of the racoon rabies virus strain from indigenous rabies viruses of Ontario.

PubMed

Nadin-Davis, S A; Huang, W; Wandeler, A I

1996-03-01

Since its recognition as a discrete epizootic in Florida in the early 1950s, the raccoon strain of rabies virus (RV) has spread over almost the entire eastern seaboard of the US and now threatens to enter the southernmost regions of Canada. To characterise this RV strain in more detail, nucleotide sequencing of the N and G genes, encoding the nucleoprotein and glycoprotein, respectively, of representative isolates has been undertaken. This sequence information generated a conserved restriction map of the N gene, thereby permitting unequivocal identification of this strain by molecular techniques. Comparisons of the predicted nucleoprotein and glycoprotein products with those of other RV strains identified a number of amino acid sequence variations conserved only in the raccoon strain. This information was used to design strain-specific primers targeted to the N gene sequences encoding these residues. The incorporation of these primers into a multiplex polymerase chain reaction (PCR) protocol permitted easy and rapid discrimination between the raccoon RV strain and indigenous Ontario RVs.
Sequence Diversity Diagram for comparative analysis of multiple sequence alignments.

PubMed

Sakai, Ryo; Aerts, Jan

2014-01-01

The sequence logo is a graphical representation of a set of aligned sequences, commonly used to depict conservation of amino acid or nucleotide sequences. Although it effectively communicates the amount of information present at every position, this visual representation falls short when the domain task is to compare between two or more sets of aligned sequences. We present a new visual presentation called a Sequence Diversity Diagram and validate our design choices with a case study. Our software was developed using the open-source program called Processing. It loads multiple sequence alignment FASTA files and a configuration file, which can be modified as needed to change the visualization. The redesigned figure improves on the visual comparison of two or more sets, and it additionally encodes information on sequential position conservation. In our case study of the adenylate kinase lid domain, the Sequence Diversity Diagram reveals unexpected patterns and new insights, for example the identification of subgroups within the protein subfamily. Our future work will integrate this visual encoding into interactive visualization tools to support higher level data exploration tasks.
Extraction Behaviors of Heavy Rare Earths with Organophosphoric Extractants: The Contribution of Extractant Dimer Dissociation, Acid Ionization, and Complexation. A Quantum Chemistry Study.

PubMed

Jing, Yu; Chen, Ji; Chen, Li; Su, Wenrou; Liu, Yu; Li, Deqian

2017-03-30

Heavy rare earths (HREs), namely Ho 3+ , Er 3+ , Tm 3+ , Yb 3+ and Lu 3+ , are rarer and more exceptional than light rare earths, due to the stronger extraction capacity for 100 000 extractions. Therefore, their incomplete stripping and high acidity of stripping become problems for HRE separation by organophosphoric extractants. However, the theories of extractant structure-performance relationship and molecular design method of novel HRE extractants are still not perfect. Beyond the coordination chemistry of the HRE-extracted complex, the extractant dimer dissociation, acid ionization, and complexation behaviors can be crucial to HRE extraction and reactivity of ionic species for understanding and further improving the extraction performance. To address the above issues, three primary fundamental processes, including extractant dimer dissociation, acid ionization, and HRE complexation, were identified and investigated systematically. The intrinsic extraction performances of HRE cations with four acidic organophosphoric extractants (P507, P204, P227 and Cyanex 272) were studied by using relativistic energy-consistent 4f core pseudopotentials, combined with density functional theory and a solvation model. Four acidic organophosphoric extractants have been qualified quantitatively from microscopic structures to chemical properties. It has been found that the Gibbs free energy changes of the overall extraction process (sequence: P204 > P227 > P507 > Cyanex 272) and their differences as a function of HREs (sequence: Ho/Er > Er/Tm > Tm/Yb > Yb/Lu) are in good agreement with the experimental maximum extraction capacities and separation factors. These results could provide an important approach to evaluate HRE extractants by the comprehensive consideration of dimer dissociation, acid ionization, and complexation processes. This paper also demonstrates the importance of the P-O bond, the P-C bond, isomer substituent, and solvation effects on the structure-performance relationship that can be used to guide molecular designs of HRE extraction in future.
Mining of Microbial Genomes for the Novel Sources of Nitrilases.

PubMed

Sharma, Nikhil; Thakur, Neerja; Raj, Tilak; Savitri; Bhalla, Tek Chand

2017-01-01

Next-generation DNA sequencing (NGS) has made it feasible to sequence large number of microbial genomes and advancements in computational biology have opened enormous opportunities to mine genome sequence data for novel genes and enzymes or their sources. In the present communication in silico mining of microbial genomes has been carried out to find novel sources of nitrilases. The sequences selected were analyzed for homology and considered for designing motifs. The manually designed motifs based on amino acid sequences of nitrilases were used to screen 2000 microbial genomes (translated to proteomes). This resulted in identification of one hundred thirty-eight putative/hypothetical sequences which could potentially code for nitrilase activity. In vitro validation of nine predicted sources of nitrilases was done for nitrile/cyanide hydrolyzing activity. Out of nine predicted nitrilases, Gluconacetobacter diazotrophicus , Sphingopyxis alaskensis , Saccharomonospora viridis , and Shimwellia blattae were specific for aliphatic nitriles, whereas nitrilases from Geodermatophilus obscurus , Nocardiopsis dassonvillei , Runella slithyformis , and Streptomyces albus possessed activity for aromatic nitriles. Flavobacterium indicum was specific towards potassium cyanide (KCN) which revealed the presence of nitrilase homolog, that is, cyanide dihydratase with no activity for either aliphatic, aromatic, or aryl nitriles. The present study reports the novel sources of nitrilases and cyanide dihydratase which were not reported hitherto by in silico or in vitro studies.
Limited number of immunoglobulin VH regions expressed in the mutant rabbit "Alicia".

PubMed

DiPietro, L A; Short, J A; Zhai, S K; Kelus, A S; Meier, D; Knight, K L

1990-06-01

A unique feature of rabbit Ig is the presence of VH region allotypic specificities. In normal rabbits, more than 80% of circulating immunoglobulin molecules bear the VHa allotypic specificities, al, a2 or a3; the remaining 10% to 20% of immunoglobulin molecules lack VHa allotypic specificities and are designated VHa-. A mutant rabbit designated Alicia, in contrast, has predominantly serum immunoglobulin molecules that lack the VHa allotypic specificities (Kelus and Weiss, Proc. Natl. Acad. Sci. USA 1986. 83: 4883). To study the nature and molecular complexity of VHa- molecules, we cloned and determined the nucleotide sequence of seven cDNA prepared from splenic RNA of an Alicia rabbit. Six of the clones appeared to encode VHa- molecules; the framework regions encoded by these clones were remarkably similar to each other, each having an unusual insertion of four amino acids at position 10. This insertion of four amino acids has been seen in only 2 of 54 sequenced rabbit VH genes. The similarity of the sequences of the six VHa- clones to each other and their dissimilarity to most other VH genes leads us to suggest that the VHa- molecules in Alicia rabbits are derived predominantly from one or a small number of very similar VH genes. Such preferential utilization of a small number of VH genes may explain the allelic inheritance of VH allotypes.
KAS IV: a 3-ketoacyl-ACP synthase from Cuphea sp. is a medium chain specific condensing enzyme.

PubMed

Dehesh, K; Edwards, P; Fillatti, J; Slabaugh, M; Byrne, J

1998-08-01

cDNA clones encoding a novel 3-ketoacyl-ACP synthase (KAS) have been isolated from Cuphea. The amino acid sequence of this enzyme is different from the previously characterized classes of KASs, designated KAS I and III, and similar to those designated as KAS II. To define the acyl chain specificity of this enzyme, we generated transgenic Brassica plants over-expressing the cDNA encoded protein in a seed specific manner. Expression of this enzyme in transgenic Brassica seeds which normally do not produce medium chain fatty acids does not result in any detectable modification of the fatty acid profile. However, co-expression of the Cuphea KAS with medium chain specific thioesterases, capable of production of either 12:0 or 8:0/10:0 fatty acids in seed oil, strongly enhances the levels of these medium chain fatty acids as compared with seed oil of plants expressing the thioesterases alone. By contrast, co-expression of the Cuphea KAS along with an 18:0/18.1-ACP thioesterase does not result in any detectable modification of the fatty acids. These data indicate that the Cuphea KAS reported here has a different acyl-chain specificity to the previously characterized KAS I, II and III. Therefore, we designate this enzyme KAS IV, a medium chain specific condensing enzyme.

Agaricicola taiwanensis gen. nov., sp. nov., an alphaproteobacterium isolated from the edible mushroom Agaricus blazei.

PubMed

Chu, Jiunn-Nan; Arun, A B; Chen, Wen-Ming; Chou, Jui-Hsing; Shen, Fo-Ting; Rekha, P D; Kämpfer, P; Young, Li-Sen; Lin, Shih-Yao; Young, Chiu-Chung

2010-09-01

A Gram-negative, beige-pigmented, aerobic, motile, club-shaped bacterium, designated strain CC-SBABM117(T), was isolated from the stipe of the edible mushroom Agaricus blazei Murrill. 16S rRNA gene sequence analysis demonstrated that the strain shared <93 % similarity with the type strains of species in the genera Pannonibacter, Methylopila, Nesiotobacter and Stappia. The organism was unable to produce acid from carbohydrates, but utilized a number of organic acids and amino acids. Ubiquinone 10 (Q-10) was the major respiratory quinone and C(18 : 1) ω 7c, C(19 : 0) cyclo ω 8c, C(16 : 0) and C(18 : 0) were the predominant fatty acids. The predominant polar lipids were diphosphatidylglycerol, phosphatidylcholine, phosphatidylglycerol and phosphatidylethanolamine. The DNA G+C content of strain CC-SBABM117(T) was 62.7 mol%. On the basis of 16S rRNA gene sequence analysis and chemotaxonomic and physiological data, strain CC-SBABM117(T) is considered to represent a novel species of a new genus, for which the name Agaricicola taiwanensis gen. nov., sp. nov. is proposed. The type strain of Agaricicola taiwanensis is CC-SBABM117(T) (=BCRC 17964(T) =CCM 7684(T)).
Design and Evaluation of a Lactobacillus manihotivorans Species-Specific rRNA-Targeted Hybridization Probe and Its Application to the Study of Sour Cassava Fermentation

PubMed Central

Ampe, Frédéric

2000-01-01

Based on 16S rRNA sequence comparison, we have designed a 20-mer oligonucleotide that targets a region specific to the species Lactobacillus manihotivorans recently isolated from sour cassava fermentation. The probe recognized the rRNA obtained from all the L. manihotivorans strains tested but did not recognize 56 strains of microorganisms from culture collections or directly isolated from sour cassava, including 29 species of lactic acid bacteria. This probe was then successfully used in quantitative RNA blots and demonstrated the importance of L. manihotivorans in the fermentation of sour cassava starch, which could represent up to 20% of total lactic acid bacteria. PMID:10788405
Bioequivalence evaluation of two brands of amoxicillin/clavulanic acid 250/125 mg combination tablets in healthy human volunteers: use of replicate design approach.

PubMed

Idkaidek, Nasir M; Al-Ghazawi, Ahmad; Najib, Naji M

2004-12-01

The purpose of this study was to apply a replicate design approach to a bioequivalence study of amoxicillin/clavulanic acid combination following a 250/125 mg oral dose to 23 subjects, and to compare the analysis of individual bioequivalence with average bioequivalence. This was conducted as a 2-treatment 2-sequence 4-period crossover study. Average bioequivalence was shown, while the results from the individual bioequivalence approach had no success in showing bioequivalence. In conclusion, the individual bioequivalence approach is a strong statistical tool to test for intra-subject variances and also subject-by-formulation interaction variance compared with the average bioequivalence approach. copyright (c) 2004 John Wiley & Sons, Ltd.
How Many Protein Sequences Fold to a Given Structure? A Coevolutionary Analysis.

PubMed

Tian, Pengfei; Best, Robert B

2017-10-17

Quantifying the relationship between protein sequence and structure is key to understanding the protein universe. A fundamental measure of this relationship is the total number of amino acid sequences that can fold to a target protein structure, known as the "sequence capacity," which has been suggested as a proxy for how designable a given protein fold is. Although sequence capacity has been extensively studied using lattice models and theory, numerical estimates for real protein structures are currently lacking. In this work, we have quantitatively estimated the sequence capacity of 10 proteins with a variety of different structures using a statistical model based on residue-residue co-evolution to capture the variation of sequences from the same protein family. Remarkably, we find that even for the smallest protein folds, such as the WW domain, the number of foldable sequences is extremely large, exceeding the Avogadro constant. In agreement with earlier theoretical work, the calculated sequence capacity is positively correlated with the size of the protein, or better, the density of contacts. This allows the absolute sequence capacity of a given protein to be approximately predicted from its structure. On the other hand, the relative sequence capacity, i.e., normalized by the total number of possible sequences, is an extremely tiny number and is strongly anti-correlated with the protein length. Thus, although there may be more foldable sequences for larger proteins, it will be much harder to find them. Lastly, we have correlated the evolutionary age of proteins in the CATH database with their sequence capacity as predicted by our model. The results suggest a trade-off between the opposing requirements of high designability and the likelihood of a novel fold emerging by chance. Published by Elsevier Inc.
Identification of Biomolecular Building Blocks by Recognition Tunneling: Stride towards Nanopore Sequencing of Biomolecules

NASA Astrophysics Data System (ADS)

Sen, Suman

DNA, RNA and Protein are three pivotal biomolecules in human and other organisms, playing decisive roles in functionality, appearance, diseases development and other physiological phenomena. Hence, sequencing of these biomolecules acquires the prime interest in the scientific community. Single molecular identification of their building blocks can be done by a technique called Recognition Tunneling (RT) based on Scanning Tunneling Microscope (STM). A single layer of specially designed recognition molecule is attached to the STM electrodes, which trap the targeted molecules (DNA nucleoside monophosphates, RNA nucleoside monophosphates or amino acids) inside the STM nanogap. Depending on their different binding interactions with the recognition molecules, the analyte molecules generate stochastic signal trains accommodating their "electronic fingerprints". Signal features are used to detect the molecules using a machine learning algorithm and different molecules can be identified with significantly high accuracy. This, in turn, paves the way for rapid, economical nanopore sequencing platform, overcoming the drawbacks of Next Generation Sequencing (NGS) techniques. To read DNA nucleotides with high accuracy in an STM tunnel junction a series of nitrogen-based heterocycles were designed and examined to check their capabilities to interact with naturally occurring DNA nucleotides by hydrogen bonding in the tunnel junction. These recognition molecules are Benzimidazole, Imidazole, Triazole and Pyrrole. Benzimidazole proved to be best among them showing DNA nucleotide classification accuracy close to 99%. Also, Imidazole reader can read an abasic monophosphate (AP), a product from depurination or depyrimidination that occurs 10,000 times per human cell per day. In another study, I have investigated a new universal reader, 1-(2-mercaptoethyl)pyrene (Pyrene reader) based on stacking interactions, which should be more specific to the canonical DNA nucleosides. In addition, Pyrene reader showed higher DNA base-calling accuracy compare to Imidazole reader, the workhorse in our previous projects. In my other projects, various amino acids and RNA nucleoside monophosphates were also classified with significantly high accuracy using RT. Twenty naturally occurring amino acids and various RNA nucleosides (four canonical and two modified) were successfully identified. Thus, we envision nanopore sequencing biomolecules using Recognition Tunneling (RT) that should provide comprehensive betterment over current technologies in terms of time, chemical and instrumental cost and capability of de novo sequencing.
Characterisation and cloning of a Na(+)-dependent broad-specificity neutral amino acid transporter from NBL-1 cells: a novel member of the ASC/B(0) transporter family.

PubMed

Pollard, Matthew; Meredith, David; McGivan, John D

2002-04-12

Na(+)-dependent neutral amino acid transport into the bovine renal epithelial cell line NBL-1 is catalysed by a broad-specificity transporter originally termed System B(0). This transporter is shown to differ in specificity from the B(0) transporter cloned from JAR cells [J. Biol. Chem. 271 (1996) 18657] in that it interacts much more strongly with phenylalanine. Using probes designed to conserved transmembrane regions of the ASC/B(0) transporter family we have isolated a cDNA encoding the NBL-1 cell System B(0) transporter. When expressed in Xenopus oocytes the clone catalysed Na(+)-dependent alanine uptake which was inhibited by glutamine, leucine and phenylalanine. However, the clone did not catalyse Na(+)-dependent phenylalanine transport, again as in NBL-1 cells. The clone encoded a protein of 539 amino acids; the predicted transmembrane domains were almost identical in sequence to those of the other members of the B(0)/ASC transporter family. Comparison of the sequences of NBL-1 and JAR cell transporters showed some differences near the N-terminus, C-terminus and in the loop between helices 3 and 4. The NBL-1 B(0) transporter is not the same as the renal brush border membrane transporter since it does not transport phenylalanine. Differences in specificity in this protein family arise from relatively small differences in amino acid sequence.
Putative Porin of Bradyrhizobium sp. (Lupinus) Bacteroids Induced by Glyphosate▿

PubMed Central

de María, Nuria; Guevara, Ángeles; Serra, M. Teresa; García-Luque, Isabel; González-Sama, Alfonso; de Lacoba, Mario García; de Felipe, M. Rosario; Fernández-Pascual, Mercedes

2007-01-01

Application of glyphosate (N-[phosphonomethyl] glycine) to Bradyrhizobium sp. (Lupinus)-nodulated lupin plants caused modifications in the protein pattern of bacteroids. The most significant change was the presence of a 44-kDa polypeptide in bacteroids from plants treated with the higher doses of glyphosate employed (5 and 10 mM). The polypeptide has been characterized by the amino acid sequencing of its N terminus and the isolation and nucleic acid sequencing of its encoding gene. It is putatively encoded by a single gene, and the protein has been identified as a putative porin. Protein modeling revealed the existence of several domains sharing similarity to different porins, such as a transmembrane beta-barrel. The protein has been designated BLpp, for Bradyrhizobium sp. (Lupinus) putative porin, and would be the first porin described in Bradyrhizobium sp. (Lupinus). In addition, a putative conserved domain of porins has been identified which consists of 87 amino acids, located in the BLpp sequence 30 amino acids downstream of the N-terminal region. In bacteroids, mRNA of the BLpp gene shows a basal constitutive expression that increases under glyphosate treatment, and the expression of the gene is seemingly regulated at the transcriptional level. By contrast, in free-living bacteria glyphosate treatment leads to an inhibition of BLpp mRNA accumulation, indicating a different effect of glyphosate on BLpp gene expression in bacteroids and free-living bacteria. The possible role of BLpp in a metabolite interchange between Bradyrhizobium and lupin is discussed. PMID:17557843
Pseudoclavibacter caeni sp. nov., isolated from sludge of a sewage disposal plant.

PubMed

Srinivasan, Sathiyaraj; Kim, Hyun Sook; Kim, Myung Kyum; Lee, Myungjin

2012-04-01

A Gram-positive, strictly aerobic, rod-shaped, non-motile bacterial strain, designated MJ28T, was isolated from a sludge sample from the Daejeon sewage disposal plant in South Korea. A polyphasic approach was applied to study the taxonomic position of strain MJ28T. Strain MJ28T showed highest 16S rRNA gene sequence similarity to Pseudoclavibacter soli KP02T (95.2 %). Levels of 16S rRNA gene sequence similarity to the type strains of other Pseudoclavibacter species were less than 94.0 %. Phylogenetic analysis based on 16S rRNA gene sequences showed that strain MJ28T belonged to the clade formed by members of the genus Pseudoclavibacter in the family Microbacteriaceae. The G+C content of the genomic DNA of strain MJ28T was 65.8 mol%. The chemotaxonomic characteristics of strain MJ28T showed features typical of the genus Pseudoclavibacter, with MK-9 as the predominant respiratory quinone, 2,4-diaminobutryic acid as the diamino acid in the peptidoglycan, and anteiso-C17:0 (44.6 %), anteiso-C15:0 (35.7 %) and C16:0 (9.5 %) as the major fatty acids. On the basis of phylogenetic inference, fatty acid profile and other phenotypic properties, strain MJ28T is considered to represent a novel species of the genus Pseudoclavibacter, for which the name Pseudoclavibacter caeni sp. nov. is proposed. The type strain is MJ28T (=KCTC 19773T=JCM 16921T).
[Cloning, expression and transcriptional analysis of biotin carboxyl carrier protein gene (accA) from Amycolatopsis mediterranei U32 ].

PubMed

Lu, Jie; Yao, Yufeng; Jiang, Weihong; Jiao, Ruishen

2003-02-01

Acetyl CoA carboxylase (EC 6.4.1.2, ACC) catalyzes the ATP-dependent carboxylation of acetyl CoA to yield malonyl CoA, which is the first committed step in fatty acid synthesis. A pair of degenerate PCR primers were designed according to the conserved amino acid sequence of AccA from M. tuberculosis and S. coelicolor. The product of the PCR amplification, a DNA fragment of 250bp was used as a probe for screening the U32 genomic cosmid library and its gene, accA, coding the biotinylated protein subunit of acetyl CoA carboxylase, was successfully cloned from U32. The accA ORF encodes a 598-amino-acid protein with the calculated molecular mass of 63.7kD, with 70.1% of G + C content. A typical Streptomyces RBS sequence, AGGAGG, was found at the - 6 position upstream of the start codon GTG. Analysis of the deduced amino acid sequence showed the presence of biotin-binding site and putative ATP-bicarbonate interaction region, which suggested the U32 AccA may act as a biotin carboxylase as well as a biotin carrier protein. Gene accA was then cloned into the pET28 (b) vector and expressed solubly in E. coli BL21 (DE3) by 0.1 mmol/L IPTG induction. Western blot confirmed the covalent binding of biotin with AccA. Northern blot analyzed transcriptional regulation of accA by 5 different nitrogen sources.
Pharmacokinetic properties of tandem d-peptides designed for treatment of Alzheimer's disease.

PubMed

Leithold, Leonie H E; Jiang, Nan; Post, Julia; Niemietz, Nicole; Schartmann, Elena; Ziehm, Tamar; Kutzsche, Janine; Shah, N Jon; Breitkreutz, Jörg; Langen, Karl-Josef; Willuweit, Antje; Willbold, Dieter

2016-06-30

Peptides are more and more considered for the development of drug candidates. However, they frequently exhibit severe disadvantages such as instability and unfavourable pharmacokinetic properties. Many peptides are rapidly cleared from the organism and oral bioavailabilities as well as in vivo half-lives often remain low. In contrast, some peptides consisting solely of d-enantiomeric amino acid residues were shown to combine promising therapeutic properties with high proteolytic stability and enhanced pharmacokinetic parameters. Recently, we have shown that D3 and RD2 have highly advantageous pharmacokinetic properties. Especially D3 has already proven promising properties suitable for treatment of Alzheimer's disease. Here, we analyse the pharmacokinetic profiles of D3D3 and RD2D3, which are head-to-tail tandem d-peptides built of D3 and its derivative RD2. Both D3D3 and RD2D3 show proteolytic stability in mouse plasma and organ homogenates for at least 24h and in murine and human liver microsomes for 4h. Notwithstanding their high affinity to plasma proteins, both peptides are taken up into the brain following i.v. as well as i.p. administration. Although both peptides contain identical d-amino acid residues, they are arranged in a different sequence order and the peptides show differences in pharmacokinetic properties. After i.p. administration RD2D3 exhibits lower plasma clearance and higher bioavailability than D3D3. We therefore concluded that the amino acid sequence of RD2 leads to more favourable pharmacokinetic properties within the tandem peptide, which underlines the importance of particular sequence motifs, even in short peptides, for the design of further therapeutic d-peptides. Copyright © 2016 Elsevier B.V. All rights reserved.
Molecular cloning of rat sperm galactosyl receptor, a C-type lectin with in vitro egg binding activity.

PubMed

Rivkin, E; Tres, L L; Kaplan-Kraicer, R; Shalgi, R; Kierszenbaum, A L

2000-07-01

Rat sperm galactosyl receptor is a member of the C-type animal lectin family showing preferential binding to N-acetylgalactosamine compared to galactose. Binding is mediated by a Ca(2+)-dependent carbohydrate-recognition domain (CRD) identical to that of the minor variant of rat hepatic lectin receptor 2/3 (RHL-2/3). The molecular organization of the genomic DNA, cDNA, and derived amino acid sequence of rat testis galactosyl receptor have been determined and in vitro fertilization studies were conducted to ascertain its role. We have determined that the rat testis galactosyl receptor gene generates two mRNA species: one species, designated liver-type, is identical to RHL-2/3; the other, designated testis-type, contains one unspliced intron (86 nt) which alters the reading frame and changes the amino acid sequence of the carboxyl terminus. As a result, the CRD (glutamine-proline-aspartic acid/QPD) and flanked Ca(2+)-binding amino acid sequences were not present in the testis-type protein. Northern and Southern blots demonstrated presence of transcripts with unspliced intron in rat sperm but not liver. Similarly, antibody, raised against a synthetic 12-amino acid peptide (p12) encoded by the unspliced intron, recognized in immunoblots a 54 kDa receptor protein in protein extracts from testis but not from liver. Immunofluorescence and immunogold electron microscopy studies demonstrated that both protein species localized on the plasma membrane surface of the head and tail of rat sperm. Furthermore, capacitated rat sperm preincubated with polyclonal antisera to RHL-2/3 or to the CRD of the liver-type galactosyl receptor showed a statistically significant decrease in the in vitro fertilization rate. We conclude that rat sperm galactosyl receptor may play a role in egg binding and that an undetermined molecular mechanism operates to generate two proteins with identical intracellular amino terminal domain but only one of them displays a CRD and associated Ca(2+)-binding sites at the carboxyl terminal extracellular domain. Copyright 2000 Wiley-Liss, Inc.
Complete nucleotide and derived amino acid sequence of cDNA encoding the mitochondrial uncoupling protein of rat brown adipose tissue: lack of a mitochondrial targeting presequence.

PubMed Central

Ridley, R G; Patel, H V; Gerber, G E; Morton, R C; Freeman, K B

1986-01-01

A cDNA clone spanning the entire amino acid sequence of the nuclear-encoded uncoupling protein of rat brown adipose tissue mitochondria has been isolated and sequenced. With the exception of the N-terminal methionine the deduced N-terminus of the newly synthesized uncoupling protein is identical to the N-terminal 30 amino acids of the native uncoupling protein as determined by protein sequencing. This proves that the protein contains no N-terminal mitochondrial targeting prepiece and that a targeting region must reside within the amino acid sequence of the mature protein. Images PMID:3012461
CLONING AND IN VITRO EXPRESSION AND CHARACTERIZATION OF THE ANDROGEN RECEPTOR AND ISOLATION OF ESTROGEN RECEPTOR α FROM THE FATHEAD MINNOW (PIMEPHALES PROMELAS)

EPA Science Inventory

In vitro screening assays designed to identify hormone mimics or antagonists typically use mammalian (rat, human) estrogen (ER) and androgen receptors (AR). Although we know that the amino acid sequences of steroid receptors in nonmammalian vertebrates are not identical to the ma...
Method of increasing conversion of a fatty acid to its corresponding dicarboxylic acid

DOEpatents

Craft, David L.; Wilson, C. Ron; Eirich, Dudley; Zhang, Yeyan

2004-09-14

A nucleic acid sequence including a CYP promoter operably linked to nucleic acid encoding a heterologous protein is provided to increase transcription of the nucleic acid. Expression vectors and host cells containing the nucleic acid sequence are also provided. The methods and compositions described herein are especially useful in the production of polycarboxylic acids by yeast cells.
Automated sequence analysis and editing software for HIV drug resistance testing.

PubMed

Struck, Daniel; Wallis, Carole L; Denisov, Gennady; Lambert, Christine; Servais, Jean-Yves; Viana, Raquel V; Letsoalo, Esrom; Bronze, Michelle; Aitken, Sue C; Schuurman, Rob; Stevens, Wendy; Schmit, Jean Claude; Rinke de Wit, Tobias; Perez Bercoff, Danielle

2012-05-01

Access to antiretroviral treatment in resource-limited-settings is inevitably paralleled by the emergence of HIV drug resistance. Monitoring treatment efficacy and HIV drugs resistance testing are therefore of increasing importance in resource-limited settings. Yet low-cost technologies and procedures suited to the particular context and constraints of such settings are still lacking. The ART-A (Affordable Resistance Testing for Africa) consortium brought together public and private partners to address this issue. To develop an automated sequence analysis and editing software to support high throughput automated sequencing. The ART-A Software was designed to automatically process and edit ABI chromatograms or FASTA files from HIV-1 isolates. The ART-A Software performs the basecalling, assigns quality values, aligns query sequences against a set reference, infers a consensus sequence, identifies the HIV type and subtype, translates the nucleotide sequence to amino acids and reports insertions/deletions, premature stop codons, ambiguities and mixed calls. The results can be automatically exported to Excel to identify mutations. Automated analysis was compared to manual analysis using a panel of 1624 PR-RT sequences generated in 3 different laboratories. Discrepancies between manual and automated sequence analysis were 0.69% at the nucleotide level and 0.57% at the amino acid level (668,047 AA analyzed), and discordances at major resistance mutations were recorded in 62 cases (4.83% of differences, 0.04% of all AA) for PR and 171 (6.18% of differences, 0.03% of all AA) cases for RT. The ART-A Software is a time-sparing tool for pre-analyzing HIV and viral quasispecies sequences in high throughput laboratories and highlighting positions requiring attention. Copyright © 2012 Elsevier B.V. All rights reserved.
Nucleotide sequence of the gene for the Mr 32,000 thylakoid membrane protein from Spinacia oleracea and Nicotiana debneyi predicts a totally conserved primary translation product of Mr 38,950

PubMed Central

Zurawski, Gerard; Bohnert, Hans J.; Whitfeld, Paul R.; Bottomley, Warwick

1982-01-01

The gene for the so-called Mr 32,000 rapidly labeled photosystem II thylakoid membrane protein (here designated psbA) of spinach (Spinacia oleracea) chloroplasts is located on the chloroplast DNA in the large single-copy region immediately adjacent to one of the inverted repeat sequences. In this paper we show that the size of the mRNA for this protein is ≈ 1.25 kilobases and that the direction of transcription is towards the inverted repeat unit. The nucleotide sequence of the gene and its flanking regions is presented. The only large open reading frame in the sequence codes for a protein of Mr 38,950. The nucleotide sequence of psbA from Nicotiana debneyi also has been determined, and comparison of the sequences from the two species shows them to be highly conserved (>95% homology) throughout the entire reading frame. Conservation of the amino acid sequence is absolute, there being no changes in a total of 353 residues. This leads us to conclude that the primary translation product of psbA must be a protein of Mr 38,950. The protein is characterized by the complete absence of lysine residues and is relatively rich in hydrophobic amino acids, which tend to be clustered. Transcription of spinach psbA starts about 86 base pairs before the first ATG codon. Immediately upstream from this point there is a sequence typical of that found in E. coli promoters. An almost identical sequence occurs in the equivalent region of N. debneyi DNA. Images PMID:16593262
On the Role of Aggregation Prone Regions in Protein Evolution, Stability, and Enzymatic Catalysis: Insights from Diverse Analyses

PubMed Central

Buck, Patrick M.; Kumar, Sandeep; Singh, Satish K.

2013-01-01

The various roles that aggregation prone regions (APRs) are capable of playing in proteins are investigated here via comprehensive analyses of multiple non-redundant datasets containing randomly generated amino acid sequences, monomeric proteins, intrinsically disordered proteins (IDPs) and catalytic residues. Results from this study indicate that the aggregation propensities of monomeric protein sequences have been minimized compared to random sequences with uniform and natural amino acid compositions, as observed by a lower average aggregation propensity and fewer APRs that are shorter in length and more often punctuated by gate-keeper residues. However, evidence for evolutionary selective pressure to disrupt these sequence regions among homologous proteins is inconsistent. APRs are less conserved than average sequence identity among closely related homologues (≥80% sequence identity with a parent) but APRs are more conserved than average sequence identity among homologues that have at least 50% sequence identity with a parent. Structural analyses of APRs indicate that APRs are three times more likely to contain ordered versus disordered residues and that APRs frequently contribute more towards stabilizing proteins than equal length segments from the same protein. Catalytic residues and APRs were also found to be in structural contact significantly more often than expected by random chance. Our findings suggest that proteins have evolved by optimizing their risk of aggregation for cellular environments by both minimizing aggregation prone regions and by conserving those that are important for folding and function. In many cases, these sequence optimizations are insufficient to develop recombinant proteins into commercial products. Rational design strategies aimed at improving protein solubility for biotechnological purposes should carefully evaluate the contributions made by candidate APRs, targeted for disruption, towards protein structure and activity. PMID:24146608
A putative carbohydrate-binding domain of the lactose-binding Cytisus sessilifolius anti-H(O) lectin has a similar amino acid sequence to that of the L-fucose-binding Ulex europaeus anti-H(O) lectin.

PubMed

Konami, Y; Yamamoto, K; Osawa, T; Irimura, T

1995-04-01

The complete amino acid sequence of a lactose-binding Cytisus sessilifolius anti-H(O) lectin II (CSA-II) was determined using a protein sequencer. After digestion of CSA-II with endoproteinase Lys-C or Asp-N, the resulting peptides were purified by reversed-phase high performance liquid chromatography (HPLC) and then subjected to sequence analysis. Comparison of the complete amino acid sequence of CSA-II with the sequences of other leguminous seed lectins revealed regions of extensive homology. The amino acid sequence of a putative carbohydrate-binding domain of CSA-II was found to be similar to those of several anti-H(O) leguminous lectins, especially to that of the L-fucose-binding Ulex europaeus lectin I (UEA-I).
A natural mutation-led truncation in one of the two aluminum-activated malate transporter-like genes at the Ma locus is associated with low fruit acidity in apple.

PubMed

Bai, Yang; Dougherty, Laura; Li, Mingjun; Fazio, Gennaro; Cheng, Lailiang; Xu, Kenong

2012-08-01

Acidity levels greatly affect the taste and flavor of fruit, and consequently its market value. In mature apple fruit, malic acid is the predominant organic acid. Several studies have confirmed that the major quantitative trait locus Ma largely controls the variation of fruit acidity levels. The Ma locus has recently been defined in a region of 150 kb that contains 44 predicted genes on chromosome 16 in the Golden Delicious genome. In this study, we identified two aluminum-activated malate transporter-like genes, designated Ma1 and Ma2, as strong candidates of Ma by narrowing down the Ma locus to 65-82 kb containing 12-19 predicted genes depending on the haplotypes. The Ma haplotypes were determined by sequencing two bacterial artificial chromosome clones from G.41 (an apple rootstock of genotype Mama) that cover the two distinct haplotypes at the Ma locus. Gene expression profiling in 18 apple germplasm accessions suggested that Ma1 is the major determinant at the Ma locus controlling fruit acidity as Ma1 is expressed at a much higher level than Ma2 and the Ma1 expression is significantly correlated with fruit titratable acidity (R (2) = 0.4543, P = 0.0021). In the coding sequences of low acidity alleles of Ma1 and Ma2, sequence variations at the amino acid level between Golden Delicious and G.41 were not detected. But the alleles for high acidity vary considerably between the two genotypes. The low acidity allele of Ma1, Ma1-1455A, is mainly characterized by a mutation at base 1455 in the open reading frame. The mutation leads to a premature stop codon that truncates the carboxyl terminus of Ma1-1455A by 84 amino acids compared with Ma1-1455G. A survey of 29 apple germplasm accessions using marker CAPS(1455) that targets the SNP(1455) in Ma1 showed that the CAPS(1455A) allele was associated completely with high pH and highly with low titratable acidity, suggesting that the natural mutation-led truncation is most likely responsible for the abolished function of Ma for low pH or high acidity in apple.
Blocking the RecA activity and SOS-response in bacteria with a short α-helical peptide.

PubMed

Yakimov, Alexander; Pobegalov, Georgii; Bakhlanova, Irina; Khodorkovskii, Mikhail; Petukhov, Michael; Baitin, Dmitry

2017-09-19

The RecX protein, a very active natural RecA protein inhibitor, can completely disassemble RecA filaments at nanomolar concentrations that are two to three orders of magnitude lower than that of RecA protein. Based on the structure of RecX protein complex with the presynaptic RecA filament, we designed a short first in class α-helical peptide that both inhibits RecA protein activities in vitro and blocks the bacterial SOS-response in vivo. The peptide was designed using SEQOPT, a novel method for global sequence optimization of protein α-helices. SEQOPT produces artificial peptide sequences containing only 20 natural amino acids with the maximum possible conformational stability at a given pH, ionic strength, temperature, peptide solubility. It also accounts for restrictions due to known amino acid residues involved in stabilization of protein complexes under consideration. The results indicate that a few key intermolecular interactions inside the RecA protein presynaptic complex are enough to reproduce the main features of the RecX protein mechanism of action. Since the SOS-response provides a major mechanism of bacterial adaptation to antibiotics, these results open new ways for the development of antibiotic co-therapy that would not cause bacterial resistance. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

Primary structure of prostaglandin G/H synthase from sheep vesicular gland determined from the complementary DNA sequence.

PubMed Central

DeWitt, D L; Smith, W L

1988-01-01

Prostaglandin G/H synthase (8,11,14-icosatrienoate, hydrogen-donor:oxygen oxidoreductase, EC 1.14.99.1) catalyzes the first step in the formation of prostaglandins and thromboxanes, the conversion of arachidonic acid to prostaglandin endoperoxides G and H. This enzyme is the site of action of nonsteroidal anti-inflammatory drugs. We have isolated a 2.7-kilobase complementary DNA (cDNA) encompassing the entire coding region of prostaglandin G/H synthase from sheep vesicular glands. This cDNA, cloned from a lambda gt 10 library prepared from poly(A)+ RNA of vesicular glands, hybridizes with a single 2.75-kilobase mRNA species. The cDNA clone was selected using oligonucleotide probes modeled from amino acid sequences of tryptic peptides prepared from the purified enzyme. The full-length cDNA encodes a protein of 600 amino acids, including a signal sequence of 24 amino acids. Identification of the cDNA as coding for prostaglandin G/H synthase is based on comparison of amino acid sequences of seven peptides comprising 103 amino acids with the amino acid sequence deduced from the nucleotide sequence of the cDNA. The molecular weight of the unglycosylated enzyme lacking the signal peptide is 65,621. The synthase is a glycoprotein, and there are three potential sites for N-glycosylation, two of them in the amino-terminal half of the molecule. The serine reported to be acetylated by aspirin is at position 530, near the carboxyl terminus. There is no significant similarity between the sequence of the synthase and that of any other protein in amino acid or nucleotide sequence libraries, and a heme binding site(s) is not apparent from the amino acid sequence. The availability of a full-length cDNA clone coding for prostaglandin G/H synthase should facilitate studies of the regulation of expression of this enzyme and the structural features important for catalysis and for interaction with anti-inflammatory drugs. Images PMID:3125548
PubDNA Finder: a web database linking full-text articles to sequences of nucleic acids.

PubMed

García-Remesal, Miguel; Cuevas, Alejandro; Pérez-Rey, David; Martín, Luis; Anguita, Alberto; de la Iglesia, Diana; de la Calle, Guillermo; Crespo, José; Maojo, Víctor

2010-11-01

PubDNA Finder is an online repository that we have created to link PubMed Central manuscripts to the sequences of nucleic acids appearing in them. It extends the search capabilities provided by PubMed Central by enabling researchers to perform advanced searches involving sequences of nucleic acids. This includes, among other features (i) searching for papers mentioning one or more specific sequences of nucleic acids and (ii) retrieving the genetic sequences appearing in different articles. These additional query capabilities are provided by a searchable index that we created by using the full text of the 176 672 papers available at PubMed Central at the time of writing and the sequences of nucleic acids appearing in them. To automatically extract the genetic sequences occurring in each paper, we used an original method we have developed. The database is updated monthly by automatically connecting to the PubMed Central FTP site to retrieve and index new manuscripts. Users can query the database via the web interface provided. PubDNA Finder can be freely accessed at http://servet.dia.fi.upm.es:8080/pubdnafinder
Designing Anticancer Peptides by Constructive Machine Learning.

PubMed

Grisoni, Francesca; Neuhaus, Claudia S; Gabernet, Gisela; Müller, Alex T; Hiss, Jan A; Schneider, Gisbert

2018-04-21

Constructive (generative) machine learning enables the automated generation of novel chemical structures without the need for explicit molecular design rules. This study presents the experimental application of such a deep machine learning model to design membranolytic anticancer peptides (ACPs) de novo. A recurrent neural network with long short-term memory cells was trained on α-helical cationic amphipathic peptide sequences and then fine-tuned with 26 known ACPs by transfer learning. This optimized model was used to generate unique and novel amino acid sequences. Twelve of the peptides were synthesized and tested for their activity on MCF7 human breast adenocarcinoma cells and selectivity against human erythrocytes. Ten of these peptides were active against cancer cells. Six of the active peptides killed MCF7 cancer cells without affecting human erythrocytes with at least threefold selectivity. These results advocate constructive machine learning for the automated design of peptides with desired biological activities. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Bioaugmentation with Clostridium tyrobutyricum to improve butyric acid production through direct rice straw bioconversion.

PubMed

Chi, Xue; Li, Jianzheng; Wang, Xin; Zhang, Yafei; Leu, Shao-Yuan; Wang, Ying

2018-05-02

One-pot bioconversion is an economically attractive biorefinery strategy to reduce enzyme consumption. Direct conversion of lignocellulosic biomass for butyric acid production is still challenging because of competition among microorganisms. In a consolidated hydrolysis/fermentation bioprocessing (CBP) the microbial structure may eventually prefer the production of caproic acid rather than butyric acid production. This paper presents a new bioaugmentation approach for high butyric acid production from rice straw. By dosing 0.03 g/L of Clostridium tyrobutyricum ATCC 25755 in the CBP, an increase of 226% higher butyric acid was yielded. The selectivity and concentration also increased to 60.7% and 18.05 g/L, respectively. DNA-sequencing confirmed the shift of bacterial community in the augmented CBP. Butyric acid producer was enriched in the bioaugmented bacterial community and the bacteria related to long chain acids production was degenerated. The findings may be useful in future research and process design to enhance productivity of desired bio-products. Copyright © 2018 Elsevier Ltd. All rights reserved.
Improving a natural enzyme activity through incorporation of unnatural amino acids.

PubMed

Ugwumba, Isaac N; Ozawa, Kiyoshi; Xu, Zhi-Qiang; Ely, Fernanda; Foo, Jee-Loon; Herlt, Anthony J; Coppin, Chris; Brown, Sue; Taylor, Matthew C; Ollis, David L; Mander, Lewis N; Schenk, Gerhard; Dixon, Nicholas E; Otting, Gottfried; Oakeshott, John G; Jackson, Colin J

2011-01-19

The bacterial phosphotriesterases catalyze hydrolysis of the pesticide paraoxon with very fast turnover rates and are thought to be near to their evolutionary limit for this activity. To test whether the naturally evolved turnover rate could be improved through the incorporation of unnatural amino acids and to probe the role of peripheral active site residues in nonchemical steps of the catalytic cycle (substrate binding and product release), we replaced the naturally occurring tyrosine amino acid at position 309 with unnatural L-(7-hydroxycoumarin-4-yl)ethylglycine (Hco) and L-(7-methylcoumarin-4-yl)ethylglycine amino acids, as well as leucine, phenylalanine, and tryptophan. Kinetic analysis suggests that the 7-hydroxyl group of Hco, particularly in its deprotonated state, contributes to an increase in the rate-limiting product release step of substrate turnover as a result of its electrostatic repulsion of the negatively charged 4-nitrophenolate product of paraoxon hydrolysis. The 8-11-fold improvement of this already highly efficient catalyst through a single rationally designed mutation using an unnatural amino acid stands in contrast to the difficulty in improving this native activity through screening hundreds of thousands of mutants with natural amino acids. These results demonstrate that designer amino acids provide easy access to new and valuable sequence and functional space for the engineering and evolution of existing enzyme functions.
BONSAI Garden: Parallel knowledge discovery system for amino acid sequences

DOE Office of Scientific and Technical Information (OSTI.GOV)

Shoudai, T.; Miyano, S.; Shinohara, A.

1995-12-31

We have developed a machine discovery system BON-SAI which receives positive and negative examples as inputs and produces as a hypothesis a pair of a decision tree over regular patterns and an alphabet indexing. This system has succeeded in discovering reasonable knowledge on transmembrane domain sequences and signal peptide sequences by computer experiments. However, when several kinds of sequences axe mixed in the data, it does not seem reasonable for a single BONSAI system to find a hypothesis of a reasonably small size with high accuracy. For this purpose, we have designed a system BONSAI Garden, in which several BONSAI`smore » and a program called Gardener run over a network in parallel, to partition the data into some number of classes together with hypotheses explaining these classes accurately.« less
Identification of a new genotype H wild-type mumps virus strain and its molecular relatedness to other virulent and attenuated strains.

PubMed

Amexis, Georgios; Rubin, Steven; Chatterjee, Nando; Carbone, Kathryn; Chumakov, Kostantin

2003-06-01

A single clinical isolate of mumps virus designated 88-1961 was obtained from a patient hospitalized with a clinical history of upper respiratory tract infection, parotitis, severe headache, fever and lymphadenopathy. We have sequenced the full-length genome of 88-1961 and compared it against all available full-length sequences of mumps virus. Based upon its nucleotide sequence of the SH gene 88-1961 was identified as a genotype H mumps strain. The overall extent of nucleotide and amino acid differences between each individual gene and protein of 88-1961 and the full-length mumps samples showed that the missense to silent ratios were unevenly distributed. Upon evaluation of the consensus sequence of 88-1961, four positions were found to be clearly heterogeneous at the nucleotide level (NP 315C/T, NP 318C/T, F 271A/C, and HN 855C/T). Sequence analysis revealed that the amino acid sequences for the NP, M, and the L protein were the most conserved, whereas the SH protein exhibited the highest variability among the compared mumps genotypes A, B, and G. No identifying molecular patterns in the non-coding (intergenic) or coding regions of 88-1961 were found when we compared it against relatively virulent (Urabe AM9 B, Glouc1/UK96, 87-1004 and 87-1005) and non-virulent mumps strains (Jeryl Lynn and all Urabe Am9 A substrains). Copyright 2003 Wiley-Liss, Inc.
DNATagger, colors for codons.

PubMed

Scherer, N M; Basso, D M

2008-09-16

DNATagger is a web-based tool for coloring and editing DNA, RNA and protein sequences and alignments. It is dedicated to the visualization of protein coding sequences and also protein sequence alignments to facilitate the comprehension of evolutionary processes in sequence analysis. The distinctive feature of DNATagger is the use of codons as informative units for coloring DNA and RNA sequences. The codons are colored according to their corresponding amino acids. It is the first program that colors codons in DNA sequences without being affected by "out-of-frame" gaps of alignments. It can handle single gaps and gaps inside the triplets. The program also provides the possibility to edit the alignments and change color patterns and translation tables. DNATagger is a JavaScript application, following the W3C guidelines, designed to work on standards-compliant web browsers. It therefore requires no installation and is platform independent. The web-based DNATagger is available as free and open source software at http://www.inf.ufrgs.br/~dmbasso/dnatagger/.
Statistical theory of combinatorial libraries of folding proteins: energetic discrimination of a target structure.

PubMed

Zou, J; Saven, J G

2000-02-11

A self-consistent theory is presented that can be used to estimate the number and composition of sequences satisfying a predetermined set of constraints. The theory is formulated so as to examine the features of sequences having a particular value of Delta=E(f)-(u), where E(f) is the energy of sequences when in a target structure and (u) is an average energy of non-target structures. The theory yields the probabilities w(i)(alpha) that each position i in the sequence is occupied by a particular monomer type alpha. The theory is applied to a simple lattice model of proteins. Excellent agreement is observed between the theory and the results of exact enumerations. The theory provides a quantitative framework for the design and interpretation of combinatorial experiments involving proteins, where a library of amino acid sequences is searched for sequences that fold to a desired structure. Copyright 2000 Academic Press.
Registry in a tube: multiplexed pools of retrievable parts for genetic design space exploration.

PubMed

Woodruff, Lauren B A; Gorochowski, Thomas E; Roehner, Nicholas; Mikkelsen, Tarjei S; Densmore, Douglas; Gordon, D Benjamin; Nicol, Robert; Voigt, Christopher A

2017-02-17

Genetic designs can consist of dozens of genes and hundreds of genetic parts. After evaluating a design, it is desirable to implement changes without the cost and burden of starting the construction process from scratch. Here, we report a two-step process where a large design space is divided into deep pools of composite parts, from which individuals are retrieved and assembled to build a final construct. The pools are built via multiplexed assembly and sequenced using next-generation sequencing. Each pool consists of ∼20 Mb of up to 5000 unique and sequence-verified composite parts that are barcoded for retrieval by PCR. This approach is applied to a 16-gene nitrogen fixation pathway, which is broken into pools containing a total of 55 848 composite parts (71.0 Mb). The pools encompass an enormous design space (1043 possible 23 kb constructs), from which an algorithm-guided 192-member 4.5 Mb library is built. Next, all 1030 possible genetic circuits based on 10 repressors (NOR/NOT gates) are encoded in pools where each repressor is fused to all permutations of input promoters. These demonstrate that multiplexing can be applied to encompass entire design spaces from which individuals can be accessed and evaluated. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Influence of Length and Amino Acid Composition on Dimer Formation of Immunoglobulin based Chimera.

PubMed

Manoj, Patidar; Naveen, Yadav; Dalai, Sarat Kumar

2017-10-18

The dimeric immunoglobulin (Ig) chimeras used for drug targeting and delivery are preferred biologics over their monomeric forms. Designing these Ig chimeras involves critical selection of a suitable Ig base that ensures dimer formation. In the present study, we systematically analyzed several factors that influence the formation of dimeric chimera. We designed and predicted 608 cytokine-Ig chimeras where we tested the contributions of (1) different domains of Ig constant heavy chain, (2) length of partner proteins, (3) amino acid (AA) composition and (4) position of cysteine in the formation of homodimer. The sequences of various Ig and cytokines were procured from Uniprot database, fused and submitted to COTH (CO-THreader) server for the prediction of dimer formation. Contributions of different domains of Ig constant heavy chain, length of chimeric proteins, AA composition and position of cysteine were tested to the homodimer formation of 608 cytokine-Ig chimeras. Various in silico approaches were adopted for validating the in silico findings. Experimentally we also validated our approach by expressing in CHO cells the chimeric design of shorter cytokine with Ig domain and analyzing the protein by SDS-PAGE. Our results advocate that while the CH1 region and the Hinge region of Ig heavy chain are critical, the length of partner proteins also crucially influences homodimer formation of the Ig-based chimera. We also report that the CH1 domain of Ig is not required for dimer formation of Ig based chimera in the presence of larger partner proteins. For shorter partner proteins fused to CH2-CH3, however, careful selection of partner sequence is critical, particularly the hydrophobic AA composition, cysteine content & their positions, disulphide bond formation property, and the linker sequences. We validated our in silico observation by various bioinformatics tools and checked the ability of chimeras to bind with the receptors of native protein by docking studies. As a proof of concept, we have expressed the chimeric proteins in CHO cells and found that our design favors the synthesis of dimeric proteins. Our structural prediction study suggests that extra amino acids in the range of 15-20 added to the CH2 domain of Ig is a critical requirement to make homodimer. This information from our study will have implication in designing efficacious homodimeric chimera. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Nucleotide sequence analysis of the gene encoding the Deinococcus radiodurans surface protein, derived amino acid sequence, and complementary protein chemical studies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Peters, J.; Peters, M.; Lottspeich, F.

1987-11-01

The complete nucleotide sequence of the gene encoding the surface (hexagonally packed intermediate (HPI))-layer polypeptide of Deinococcus radiodurans Sark was determined and found to encode a polypeptide of 1036 amino acids. Amino acid sequence analysis of about 30% of the residues revealed that the mature polypeptide consists of at least 978 amino acids. The N terminus was blocked to Edman degradation. The results of proteolytic modification of the HPI layer in situ and M/sub r/ estimations of the HPI polypeptide expressed in Escherichia coli indicated that there is a leader sequence. The N-terminal region contained a very high percentage (29%)more » of threonine and serine, including a cluster of nine consecutive serine or threonine residues, whereas a stretch near the C terminus was extremely rich in aromatic amino acids (29%). The protein contained at least two disulfide bridges, as well as tightly bound reducing sugars and fatty acids.« less
Characterization and prediction of residues determining protein functional specificity.

PubMed

Capra, John A; Singh, Mona

2008-07-01

Within a homologous protein family, proteins may be grouped into subtypes that share specific functions that are not common to the entire family. Often, the amino acids present in a small number of sequence positions determine each protein's particular functional specificity. Knowledge of these specificity determining positions (SDPs) aids in protein function prediction, drug design and experimental analysis. A number of sequence-based computational methods have been introduced for identifying SDPs; however, their further development and evaluation have been hindered by the limited number of known experimentally determined SDPs. We combine several bioinformatics resources to automate a process, typically undertaken manually, to build a dataset of SDPs. The resulting large dataset, which consists of SDPs in enzymes, enables us to characterize SDPs in terms of their physicochemical and evolutionary properties. It also facilitates the large-scale evaluation of sequence-based SDP prediction methods. We present a simple sequence-based SDP prediction method, GroupSim, and show that, surprisingly, it is competitive with a representative set of current methods. We also describe ConsWin, a heuristic that considers sequence conservation of neighboring amino acids, and demonstrate that it improves the performance of all methods tested on our large dataset of enzyme SDPs. Datasets and GroupSim code are available online at http://compbio.cs.princeton.edu/specificity/. Supplementary data are available at Bioinformatics online.
Artificial mismatch hybridization

DOEpatents

Guo, Zhen; Smith, Lloyd M.

1998-01-01

An improved nucleic acid hybridization process is provided which employs a modified oligonucleotide and improves the ability to discriminate a control nucleic acid target from a variant nucleic acid target containing a sequence variation. The modified probe contains at least one artificial mismatch relative to the control nucleic acid target in addition to any mismatch(es) arising from the sequence variation. The invention has direct and advantageous application to numerous existing hybridization methods, including, applications that employ, for example, the Polymerase Chain Reaction, allele-specific nucleic acid sequencing methods, and diagnostic hybridization methods.
Continuously tunable nucleic acid hybridization probes.

PubMed

Wu, Lucia R; Wang, Juexiao Sherry; Fang, John Z; Evans, Emily R; Pinto, Alessandro; Pekker, Irena; Boykin, Richard; Ngouenet, Celine; Webster, Philippa J; Beechem, Joseph; Zhang, David Yu

2015-12-01

In silico-designed nucleic acid probes and primers often do not achieve favorable specificity and sensitivity tradeoffs on the first try, and iterative empirical sequence-based optimization is needed, particularly in multiplexed assays. We present a novel, on-the-fly method of tuning probe affinity and selectivity by adjusting the stoichiometry of auxiliary species, which allows for independent and decoupled adjustment of the hybridization yield for different probes in multiplexed assays. Using this method, we achieved near-continuous tuning of probe effective free energy. To demonstrate our approach, we enforced uniform capture efficiency of 31 DNA molecules (GC content, 0-100%), maximized the signal difference for 11 pairs of single-nucleotide variants and performed tunable hybrid capture of mRNA from total RNA. Using the Nanostring nCounter platform, we applied stoichiometric tuning to simultaneously adjust yields for a 24-plex assay, and we show multiplexed quantitation of RNA sequences and variants from formalin-fixed, paraffin-embedded samples.
Discovery of novel antimicrobial peptides with unusual cysteine motifs in dandelion Taraxacum officinale Wigg. flowers.

PubMed

Astafieva, A A; Rogozhin, E A; Odintsova, T I; Khadeeva, N V; Grishin, E V; Egorov, Ts A

2012-08-01

Three novel antimicrobial peptides designated ToAMP1, ToAMP2 and ToAMP3 were purified from Taraxacum officinale flowers. Their amino acid sequences were determined. The peptides are cationic and cysteine-rich and consist of 38, 44 and 42 amino acid residues for ToAMP1, ToAMP2 and ToAMP3, respectively. Importantly, according to cysteine motifs, the peptides are representatives of two novel previously unknown families of plant antimicrobial peptides. ToAMP1 and ToAMP2 share high sequence identity and belong to 6-Cys-containing antimicrobial peptides, while ToAMP3 is a member of a distinct 8-Cys family. The peptides were shown to display high antimicrobial activity both against fungal and bacterial pathogens, and therefore represent new promising molecules for biotechnological and medicinal applications. Crown Copyright © 2012. Published by Elsevier Inc. All rights reserved.
Detection and isolation of nucleic acid sequences using a bifunctional hybridization probe

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

2000-01-01

A method for detecting and isolating a target sequence in a sample of nucleic acids is provided using a bifunctional hybridization probe capable of hybridizing to the target sequence that includes a detectable marker and a first complexing agent capable of forming a binding pair with a second complexing agent. A kit is also provided for detecting a target sequence in a sample of nucleic acids using a bifunctional hybridization probe according to this method.
Sequence-specific label-free nucleic acid biosensor for the detection of the hepatitis C virus genotype 1a using a disposable pencil graphite electrode.

PubMed

Donmez, Soner; Arslan, Fatma; Arslan, Halit

2016-05-01

In this paper, we demonstrate a simple, sensitive, inexpensive, disposable and label-free electrochemical nucleic acid biosensor for the detection of the hepatitis C virus genotype 1a (HCV1a). The nucleic acid biosensor was designed with the amino-linked inosine-substituted 20-mer probes, which were immobilized onto a disposable pencil graphite electrode (PGE) by covalent linking. The proposed nucleic acid biosensor was linear in the range of 0.05 and 0.75 μM, exhibiting a limit of detection of 54.9 nM. The single-stranded synthetic PCR product analogs of HCV1a were also detected with satisfactory results under optimal conditions, showing the potential application of this biosensor.
Programming mRNA decay to modulate synthetic circuit resource allocation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Venturelli, Ophelia S.; Tei, Mika; Bauer, Stefan

Synthetic circuits embedded in host cells compete with cellular processes for limited intracellular resources. Here we show how funnelling of cellular resources, after global transcriptome degradation by the sequence-dependent endoribonuclease MazF, to a synthetic circuit can increase production. Target genes are protected from MazF activity by recoding the gene sequence to eliminate recognition sites, while preserving the amino acid sequence. The expression of a protected fluorescent reporter and flux of a high-value metabolite are significantly enhanced using this genome-scale control strategy. Proteomics measurements discover a host factor in need of protection to improve resource redistribution activity. A computational model demonstratesmore » that the MazF mRNA-decay feedback loop enables proportional control of MazF in an optimal operating regime. Transcriptional profiling of MazF-induced cells elucidates the dynamic shifts in transcript abundance and discovers regulatory design elements. Altogether, our results suggest that manipulation of cellular resource allocation is a key control parameter for synthetic circuit design.« less
Programming mRNA decay to modulate synthetic circuit resource allocation

DOE PAGES

Venturelli, Ophelia S.; Tei, Mika; Bauer, Stefan; ...

2017-04-26

Synthetic circuits embedded in host cells compete with cellular processes for limited intracellular resources. Here we show how funnelling of cellular resources, after global transcriptome degradation by the sequence-dependent endoribonuclease MazF, to a synthetic circuit can increase production. Target genes are protected from MazF activity by recoding the gene sequence to eliminate recognition sites, while preserving the amino acid sequence. The expression of a protected fluorescent reporter and flux of a high-value metabolite are significantly enhanced using this genome-scale control strategy. Proteomics measurements discover a host factor in need of protection to improve resource redistribution activity. A computational model demonstratesmore » that the MazF mRNA-decay feedback loop enables proportional control of MazF in an optimal operating regime. Transcriptional profiling of MazF-induced cells elucidates the dynamic shifts in transcript abundance and discovers regulatory design elements. Altogether, our results suggest that manipulation of cellular resource allocation is a key control parameter for synthetic circuit design.« less

Correlating low-similarity peptide sequences and allergenic epitopes.

PubMed

Kanduc, D

2008-01-01

Although a high number of allergenic peptide epitopes has been experimentally identified and defined, the molecular basis and the precise mechanisms underlying peptide allergenicity are unknown. This issue was analyzed exploring the relationship between peptide allergenicity and sequence similarity to the human proteome. The structured analysis of the data reported in literature put into evidence that the most part of IgE-binding epitopes are (or harbor) pentapeptide unit(s) with no/low similarity to the human proteome, this way suggesting that no or low sequence similarity to the host proteome might represent a minimum common denominator identifying allergenic peptides. The present literature analysis might be of relevance in devising and designing short amino acid modules to be used for blocking pathogenic IgE.
Droplet Microfluidic Device Fabrication and Use for Isothermal Amplification and Detection of MicroRNA.

PubMed

Giuffrida, Maria Chiara; D'Agata, Roberta; Spoto, Giuseppe

2017-01-01

Droplet microfluidics combined with the isothermal circular strand displacement polymerization (ICSDP) represents a powerful new technique to detect both single-stranded DNA and microRNA sequences. The method here described helps in overcoming some drawbacks of the lately introduced droplet polymerase chain reaction (PCR) amplification when implemented in microfluidic devices. The method also allows the detection of nanoliter droplets of nucleic acids sequences solutions, with a particular attention to microRNA sequences that are detected at the picomolar level. The integration of the ICSDP amplification protocol in droplet microfluidic devices reduces the time of analysis and the amount of sample required. In addition, there is also the possibility to design parallel analyses to be integrated in portable devices.
Molecular cloning and nucleotide sequence of CYP6BF1 from the diamondback moth, Plutella xylostella

PubMed Central

Li, Hongshan; Dai, Huaguo; Wei, Hui

2005-01-01

A novel cDNA clong encoding a cytochrome P450 was screened from the insecticide-susceptible strain of Plutella xylostella (L.) (Lepidoptera:Yponomeutidae). The nucleotide sequence of the clone, designated CYP6BF1, was determined. This is the first full-length sequence of the CYP6 family from Plutella xylostella (L.). The cDNA is 1661bp in length and contains an open reading frame from base pairs 26 to 1570, encoding a protein of 514 amino acid residues. It is similar to the other insect P450s in gene family 6, including CYP6AE1 from Depressaria pastinacella, (46%). The GenBank accession number is AY971374. PMID:17119627
Viral morphogenesis is the dominant source of sequence censorship in M13 combinatorial peptide phage display.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rodi, D. J.; Soares, A. S.; Makowski, L.

Novel statistical methods have been developed and used to quantitate and annotate the sequence diversity within combinatorial peptide libraries on the basis of small numbers (1-200) of sequences selected at random from commercially available M13 p3-based phage display libraries. These libraries behave statistically as though they correspond to populations containing roughly 4.0{+-}1.6% of the random dodecapeptides and 7.9{+-}2.6% of the random constrained heptapeptides that are theoretically possible within the phage populations. Analysis of amino acid residue occurrence patterns shows no demonstrable influence on sequence censorship by Escherichia coli tRNA isoacceptor profiles or either overall codon or Class II codon usagemore » patterns, suggesting no metabolic constraints on recombinant p3 synthesis. There is an overall depression in the occurrence of cysteine, arginine and glycine residues and an overabundance of proline, threonine and histidine residues. The majority of position-dependent amino acid sequence bias is clustered at three positions within the inserted peptides of the dodecapeptide library, +1, +3 and +12 downstream from the signal peptidase cleavage site. Conformational tendency measures of the peptides indicate a significant preference for inserts favoring a {beta}-turn conformation. The observed protein sequence limitations can primarily be attributed to genetic codon degeneracy and signal peptidase cleavage preferences. These data suggest that for applications in which maximal sequence diversity is essential, such as epitope mapping or novel receptor identification, combinatorial peptide libraries should be constructed using codon-corrected trinucleotide cassettes within vector-host systems designed to minimize morphogenesis-related censorship.« less
PreSSAPro: a software for the prediction of secondary structure by amino acid properties.

PubMed

Costantini, Susan; Colonna, Giovanni; Facchiano, Angelo M

2007-10-01

PreSSAPro is a software, available to the scientific community as a free web service designed to provide predictions of secondary structures starting from the amino acid sequence of a given protein. Predictions are based on our recently published work on the amino acid propensities for secondary structures in either large but not homogeneous protein data sets, as well as in smaller but homogeneous data sets corresponding to protein structural classes, i.e. all-alpha, all-beta, or alpha-beta proteins. Predictions result improved by the use of propensities evaluated for the right protein class. PreSSAPro predicts the secondary structure according to the right protein class, if known, or gives a multiple prediction with reference to the different structural classes. The comparison of these predictions represents a novel tool to evaluate what sequence regions can assume different secondary structures depending on the structural class assignment, in the perspective of identifying proteins able to fold in different conformations. The service is available at the URL http://bioinformatica.isa.cnr.it/PRESSAPRO/.
Identification of immunoglobulins using Chou's pseudo amino acid composition with feature selection technique.

PubMed

Tang, Hua; Chen, Wei; Lin, Hao

2016-04-01

Immunoglobulins, also called antibodies, are a group of cell surface proteins which are produced by the immune system in response to the presence of a foreign substance (called antigen). They play key roles in many medical, diagnostic and biotechnological applications. Correct identification of immunoglobulins is crucial to the comprehension of humoral immune function. With the avalanche of protein sequences identified in postgenomic age, it is highly desirable to develop computational methods to timely identify immunoglobulins. In view of this, we designed a predictor called "IGPred" by formulating protein sequences with the pseudo amino acid composition into which nine physiochemical properties of amino acids were incorporated. Jackknife cross-validated results showed that 96.3% of immunoglobulins and 97.5% of non-immunoglobulins can be correctly predicted, indicating that IGPred holds very high potential to become a useful tool for antibody analysis. For the convenience of most experimental scientists, a web-server for IGPred was established at http://lin.uestc.edu.cn/server/IGPred. We believe that the web-server will become a powerful tool to study immunoglobulins and to guide related experimental validations.
A chondroitin sulfate chain attached to the bone dentin matrix protein 1 NH2-terminal fragment.

PubMed

Qin, Chunlin; Huang, Bingzhen; Wygant, James N; McIntyre, Bradley W; McDonald, Charles H; Cook, Richard G; Butler, William T

2006-03-24

Dentin matrix protein 1 (DMP1) is an acidic noncollagenous protein shown by gene ablations to be critical for the proper mineralization of bone and dentin. In the extracellular matrix of these tissues DMP1 is present as fragments representing the NH2-terminal (37 kDa) and COOH-terminal (57 kDa) portions of the cDNA-deduced amino acid sequence. During our separation of bone noncollagenous proteins, we observed a high molecular weight, DMP1-related component (designated DMP1-PG). We purified DMP1-PG with a monoclonal anti-DMP1 antibody affinity column. Amino acid analysis and Edman degradation of tryptic peptides proved that the core protein for DMP1-PG is the 37-kDa fragment of DMP1. Chondroitinase treatments demonstrated that the slower migration rate of DMP1-PG is due to the presence of glycosaminoglycan. Quantitative disaccharide analysis indicated that the glycosaminoglycan is made predominantly of chondroitin 4-sulfate. Further analysis on tryptic peptides led us to conclude that a single glycosaminoglycan chain is linked to the core protein via Ser74, located in the Ser74-Gly75 dipeptide, an amino acid sequence specific for the attachment of glycosaminoglycans. Our findings show that in addition to its existence as a phosphoprotein, the NH2-terminal fragment from DMP1 occurs as a proteoglycan. Amino acid sequence alignment analysis showed that the Ser74-Gly75 dipeptide and its flanking regions are highly conserved among a wide range of species from caiman to the Homo sapiens, indicating that this glycosaminoglycan attachment domain has survived an extremely long period of evolution pressure, suggesting that the glycosaminoglycan may be critical for the basic biological functions of DMP1.
Molecular Characterization of a Novel N-Acetyltransferase from Chryseobacterium sp.

PubMed Central

Yoshida, Kenji; Tanaka, Kosei; Yoshida, Ken-ichi

2014-01-01

N-Acetyltransferase from Chryseobacterium sp. strain 5-3B is an acetyl coenzyme A (acetyl-CoA)-dependent enzyme that catalyzes the enantioselective transfer of an acetyl group from acetyl-CoA to the amino group of l-2-phenylglycine to produce (2S)-2-acetylamino-2-phenylacetic acid. We purified the enzyme from strain 5-3B and deduced the N-terminal amino acid sequence. The gene, designated natA, was cloned with two other hypothetical protein genes; the three genes probably form a 2.5-kb operon. The deduced amino acid sequence of NatA showed high levels of identity to sequences of putative N-acetyltransferases of Chryseobacterium spp. but not to other known arylamine and arylalkylamine N-acetyltransferases. Phylogenetic analysis indicated that NatA forms a distinct lineage from known N-acetyltransferases. We heterologously expressed recombinant NatA (rNatA) in Escherichia coli and purified it. rNatA showed high activity for l-2-phenylglycine and its chloro- and hydroxyl-derivatives. The Km and Vmax values for l-2-phenylglycine were 0.145 ± 0.026 mM and 43.6 ± 2.39 μmol · min−1 · mg protein−1, respectively. The enzyme showed low activity for 5-aminosalicylic acid and 5-hydroxytryptamine, which are reported as good substrates of a known arylamine N-acetyltransferase and an arylalkylamine N-acetyltransferase. rNatA had a comparatively broad acyl donor specificity, transferring acyl groups to l-2-phenylglycine and producing the corresponding 2-acetylamino-2-phenylacetic acids (relative activity with acetyl donors acetyl-CoA, propanoyl-CoA, butanoyl-CoA, pentanoyl-CoA, and hexanoyl-CoA, 100:108:122:10:<1). PMID:24375143
Mesonia hippocampi sp. nov., isolated from the brood pouch of a diseased Barbour's Seahorse (Hippocampus barbouri).

PubMed

Kolberg, Judy; Busse, Hans-Jürgen; Wilke, Thomas; Schubert, Patrick; Kämpfer, Peter; Glaeser, Stefanie P

2015-07-01

An orange-pigmented, Gram-staining-negative, rod-shaped bacterium, designated 96_Hippo_TS_3/13(T) was isolated from the brood pouch of a diseased seahorse male of the species Hippocampus barbouri from the animal facility of the University of Giessen, Germany. Phylogenetic analyses based on the nearly full-length 16S rRNA gene sequence placed strain 96_Hippo_TS_3/13(T) into the monophyletic cluster of the genus Mesonia within the family Flavobacteriaceae. However, the strain shared only 92.2-93.8% sequence similarity to type strains of species of the genus Mesonia, with highest sequence similarity to the type strain of Mesonia aquimarina. Cellular fatty acid analysis showed a Mesonia-typical fatty acid profile including several branched and hydroxyl fatty acids with highest amounts of iso-C15 : 0 (40.9%) followed by iso-C17 : 0 3-OH (14.8%). In the polyamine pattern, sym-homospermidine was predominant. The diagnostic diamino acid of the peptidoglycan was meso-diaminopimelic acid. The quinone system contained exclusively menaquinone MK-6. The only identified compound in the polar lipid profile was phosphatidylethanolamine present in major amounts. Additionally, major amounts of an unidentified aminolipid and two unidentified lipids not containing a phosphate group, an amino group or a sugar residue were detected. The genomic G+C content of strain 96_Hippo_TS_3/13(T) was 30 mol%. Based on genotypic, chemotaxonomic and physiological characterizations we propose a novel species of the genus Mesonia, Mesonia hippocampi sp. nov., with strain 96_Hippo_TS_3/13(T) ( = CIP 110839T = LMG 28572(T) = CCM 8557(T)) as the type strain. An emended description of the genus Mesonia is also provided.
The respective roles of polar/nonpolar binary patterns and amino acid composition in protein regular secondary structures explored exhaustively using hydrophobic cluster analysis.

PubMed

Rebehmed, Joseph; Quintus, Flavien; Mornon, Jean-Paul; Callebaut, Isabelle

2016-05-01

Several studies have highlighted the leading role of the sequence periodicity of polar and nonpolar amino acids (binary patterns) in the formation of regular secondary structures (RSS). However, these were based on the analysis of only a few simple cases, with no direct mean to correlate binary patterns with the limits of RSS. Here, HCA-derived hydrophobic clusters (HC) which are conditioned binary patterns whose positions fit well those of RSS, were considered. All the HC types, defined by unique binary patterns, which were commonly observed in three-dimensional (3D) structures of globular domains, were analyzed. The 180 HC types with preferences for either α-helices or β-strands distinctly contain basic binary units typical of these RSS. Therefore a general trend supporting the "binary pattern preference" assumption was observed. HC for which observed RSS are in disagreement with their expected behavior (discordant HC) were also examined. They were separated in HC types with moderate preferences for RSS, having "weak" binary patterns and versatile RSS and HC types with high preferences for RSS, having "strong" binary patterns and then displaying nonpolar amino acids at the protein surface. It was shown that in both cases, discordant HC could be distinguished from concordant ones by well-differentiated amino acid compositions. The obtained results could, thus, help to complement the currently available methods for the accurate prediction of secondary structures in proteins from the only information of a single amino acid sequence. This can be especially useful for characterizing orphan sequences and for assisting protein engineering and design. © 2016 Wiley Periodicals, Inc.
Optical resolution of phenylthiohydantoin-amino acids by capillary electrophoresis and identification of the phenylthiohydantoin-D-amino acid residue of [D-Ala2]-methionine enkephalin.

PubMed

Kurosu, Y; Murayama, K; Shindo, N; Shisa, Y; Ishioka, N

1996-11-01

This is an initial report to propose a protein sequence analysis system with DL differentiation using capillary electrophoresis (CE). This system consists of a protein sequencer and a CE system. After fractionation of phenyl-thiohydantoin (PTH)-amino acids using a protein sequencer, optical resolution for each PTH-amino acid is performed by CE using some chiral selectors such as digitonin, beta-escin and others. As a model peptide, [D-Ala2]-methionine enkephalin (L-Tyr-D-Ala-Gly-L-Phe-L-Met), was used and the sequence with DL differentiation was determined, with the exception of the fourth amino acid, L-Phe, using our proposed system.
Tomitella biformata gen. nov., sp. nov., a new member of the suborder Corynebacterineae isolated from a permafrost ice wedge.

PubMed

Katayama, Taiki; Kato, Tomoko; Tanaka, Michiko; Douglas, Thomas A; Brouchkov, Anatoli; Abe, Ayumi; Sone, Teruo; Fukuda, Masami; Asano, Kozo

2010-12-01

Gram-reaction-positive, aerobic, non-spore-forming, irregular rod-shaped bacteria, designated AHU1821(T) and AHU1820, were isolated from an ice wedge in the Fox permafrost tunnel, Alaska. The strains were psychrophilic, growing at -5 to 27°C. Phylogenetic analysis of the 16S rRNA and gyrB gene sequences indicated that the ice-wedge isolates formed a clade distinct from other mycolic-acid-containing bacteria within the suborder Corynebacterineae. The cell wall of strains AHU1821(T) and AHU1820 contained meso-diaminopimelic acid, arabinose and galactose, indicating chemotype IV. The muramic acids in the peptidoglycan were glycolated. The predominant menaquinone was MK-9(H(2)). The polar lipids consisted of diphosphatidylglycerol, phosphatidylethanolamine, phosphatidylinositol, phosphatidylinositol mannosides and an unidentified glycolipid. The major fatty acids were hexadecenoic acid (C(16 : 1)), hexadecanoic acid (C(16 : 0)), octadecenoic acid (C(18 : 1)) and tetradecanoic acid (C(14 : 0)). Tuberculostearic acid was present in relatively small amounts (1 %). Strains AHU1821(T) and AHU1820 contained mycolic acids with 42-52 carbons. The DNA G+C content of the two strains was 69.3-71.6 mol% (T(m)). 16S rRNA, rpoB and recA gene sequences were identical between strains AHU1821(T) and AHU1820 and those of the gyrB gene showed 99.9 % similarity. Based on phylogenetic and phenotypic evidence, strains AHU1821(T) and AHU1820 represent a single novel species of a novel genus, for which the name Tomitella biformata gen. nov., sp. nov. is proposed. The type strain of Tomitella biformata is AHU1821(T) (=DSM 45403(T) =NBRC 106253(T)).
A phylogenetic analysis using full-length viral genomes of South American dengue serotype 3 in consecutive Venezuelan outbreaks reveals novel NS5 mutation

PubMed Central

Schmidt, DJ; Pickett, BE; Camacho, D; Comach, G; Xhaja, K; Lennon, NJ; Rizzolo, K; de Bosch, N; Becerra, A; Nogueira, ML; Mondini, A; da Silva, EV; Vasconcelos, PF; Muñoz-Jordán, JL; Santiago, GA; Ocazionez, R; Gehrke, L; Lefkowitz, EJ; Birren, BW; Henn, MR; Bosch, I

2013-01-01

Dengue virus currently causes 50-100 million infections annually. Comprehensive knowledge about the evolution of Dengue in response to selection pressure is currently unavailable, but would greatly enhance vaccine design efforts. In the current study, we sequenced 187 new dengue virus serotype 3(DENV-3) genotype III whole genomes isolated from Asia and the Americas. We analyzed them together with previously-sequenced isolates to gain a more detailed understanding of the evolutionary adaptations existing in this prevalent American serotype. In order to analyze the phylogenetic dynamics of DENV-3 during outbreak periods; we incorporated datasets of 48 and 11 sequences spanning two major outbreaks in Venezuela during 2001 and 2007-2008 respectively. Our phylogenetic analysis of newly sequenced viruses shows that subsets of genomes cluster primarily by geographic location, and secondarily by time of virus isolation. DENV-3 genotype III sequences from Asia are significantly divergent from those from the Americas due to their geographical separation and subsequent speciation. We measured amino acid variation for the E protein by calculating the Shannon entropy at each position between Asian and American genomes. We found a cluster of 7 amino acid substitutions having high variability within E protein domain III, which has previously been implicated in serotype-specific neutralization escape mutants. No novel mutations were found in the E protein of sequences isolated during either Venezuelan outbreak. Shannon entropy analysis of the NS5 polymerase mature protein revealed that a G374E mutation, in a region that contributes to interferon resistance in other flaviviruses by interfering with JAK-STAT signaling was present in both the Asian and American sequences from the 2007-2008 Venezuelan outbreak, but was absent in the sequences from the 2001 Venezuelan outbreak. In addition to E, several NS5 amino acid changes were unique to the 2007-2008 epidemic in Venezuela and may give additional insight into the adaptive response of DENV-3 at the population level. PMID:21964598
Identification and Analysis of Novel Amino-Acid Sequence Repeats in Bacillus anthracis str. Ames Proteome Using Computational Tools

PubMed Central

Hemalatha, G. R.; Rao, D. Satyanarayana; Guruprasad, L.

2007-01-01

We have identified four repeats and ten domains that are novel in proteins encoded by the Bacillus anthracis str. Ames proteome using automated in silico methods. A “repeat” corresponds to a region comprising less than 55-amino-acid residues that occur more than once in the protein sequence and sometimes present in tandem. A “domain” corresponds to a conserved region with greater than 55-amino-acid residues and may be present as single or multiple copies in the protein sequence. These correspond to (1) 57-amino-acid-residue PxV domain, (2) 122-amino-acid-residue FxF domain, (3) 111-amino-acid-residue YEFF domain, (4) 109-amino-acid-residue IMxxH domain, (5) 103-amino-acid-residue VxxT domain, (6) 84-amino-acid-residue ExW domain, (7) 104-amino-acid-residue NTGFIG domain, (8) 36-amino-acid-residue NxGK repeat, (9) 95-amino-acid-residue VYV domain, (10) 75-amino-acid-residue KEWE domain, (11) 59-amino-acid-residue AFL domain, (12) 53-amino-acid-residue RIDVK repeat, (13) (a) 41-amino-acid-residue AGQF repeat and (b) 42-amino-acid-residue GSAL repeat. A repeat or domain type is characterized by specific conserved sequence motifs. We discuss the presence of these repeats and domains in proteins from other genomes and their probable secondary structure. PMID:17538688
Comprehensive computational design of ordered peptide macrocycles

PubMed Central

Hosseinzadeh, Parisa; Bhardwaj, Gaurav; Mulligan, Vikram Khipple; Shortridge, Matthew D.; Craven, Timothy W.; Pardo-Avila, Fátima; Rettie, Stephen A.; Kim, David E.; Silva, Daniel-Adriano; Ibrahim, Yehia M.; Webb, Ian K.; Cort, John R.; Adkins, Joshua N.; Varani, Gabriele; Baker, David

2018-01-01

Mixed-chirality peptide macrocycles such as cyclosporine are among the most potent therapeutics identified to date, but there is currently no way to systematically search the structural space spanned by such compounds. Natural proteins do not provide a useful guide: Peptide macrocycles lack regular secondary structures and hydrophobic cores, and can contain local structures not accessible with L-amino acids. Here, we enumerate the stable structures that can be adopted by macrocyclic peptides composed of L- and D-amino acids by near-exhaustive backbone sampling followed by sequence design and energy landscape calculations. We identify more than 200 designs predicted to fold into single stable structures, many times more than the number of currently available unbound peptide macrocycle structures. Nuclear magnetic resonance structures of 9 of 12 designed 7- to 10-residue macrocycles, and three 11- to 14-residue bicyclic designs, are close to the computational models. Our results provide a nearly complete coverage of the rich space of structures possible for short peptide macrocycles and vastly increase the available starting scaffolds for both rational drug design and library selection methods. PMID:29242347
Discovery of Escherichia coli CRISPR sequences in an undergraduate laboratory.

PubMed

Militello, Kevin T; Lazatin, Justine C

2017-05-01

Clustered regularly interspaced short palindromic repeats (CRISPRs) represent a novel type of adaptive immune system found in eubacteria and archaebacteria. CRISPRs have recently generated a lot of attention due to their unique ability to catalog foreign nucleic acids, their ability to destroy foreign nucleic acids in a mechanism that shares some similarity to RNA interference, and the ability to utilize reconstituted CRISPR systems for genome editing in numerous organisms. In order to introduce CRISPR biology into an undergraduate upper-level laboratory, a five-week set of exercises was designed to allow students to examine the CRISPR status of uncharacterized Escherichia coli strains and to allow the discovery of new repeats and spacers. Students started the project by isolating genomic DNA from E. coli and amplifying the iap CRISPR locus using the polymerase chain reaction (PCR). The PCR products were analyzed by Sanger DNA sequencing, and the sequences were examined for the presence of CRISPR repeat sequences. The regions between the repeats, the spacers, were extracted and analyzed with BLASTN searches. Overall, CRISPR loci were sequenced from several previously uncharacterized E. coli strains and one E. coli K-12 strain. Sanger DNA sequencing resulted in the discovery of 36 spacer sequences and their corresponding surrounding repeat sequences. Five of the spacers were homologous to foreign (non-E. coli) DNA. Assessment of the laboratory indicates that improvements were made in the ability of students to answer questions relating to the structure and function of CRISPRs. Future directions of the laboratory are presented and discussed. © 2016 by The International Union of Biochemistry and Molecular Biology, 45(3):262-269, 2017. © 2016 The International Union of Biochemistry and Molecular Biology.
37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

Code of Federal Regulations, 2010 CFR

2010-07-01

... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Form and format for... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... Code for Information Interchange (ASCII) text. No other formats shall be allowed. (3) The computer...
A motif detection and classification method for peptide sequences using genetic programming.

PubMed

Tomita, Yasuyuki; Kato, Ryuji; Okochi, Mina; Honda, Hiroyuki

2008-08-01

An exploration of common rules (property motifs) in amino acid sequences has been required for the design of novel sequences and elucidation of the interactions between molecules controlled by the structural or physical environment. In the present study, we developed a new method to search property motifs that are common in peptide sequence data. Our method comprises the following two characteristics: (i) the automatic determination of the position and length of common property motifs by calculating the physicochemical similarity of amino acids, and (ii) the quick and effective exploration of motif candidates that discriminates the positives and negatives by the introduction of genetic programming (GP). Our method was evaluated by two types of model data sets. First, the intentionally buried property motifs were searched in the artificially derived peptide data containing intentionally buried property motifs. As a result, the expected property motifs were correctly extracted by our algorithm. Second, the peptide data that interact with MHC class II molecules were analyzed as one of the models of biologically active peptides with buried motifs in various lengths. Twofold MHC class II binding peptides were identified with the rule using our method, compared to the existing scoring matrix method. In conclusion, our GP based motif searching approach enabled to obtain knowledge of functional aspects of the peptides without any prior knowledge.
Method of identity analyte-binding peptides

DOEpatents

Kauvar, Lawrence M.

1990-01-01

A method for affinity chromatography or adsorption of a designated analyte utilizes a paralog as the affinity partner. The immobilized paralog can be used in purification or analysis of the analyte; the paralog can also be used as a substitute for antibody in an immunoassay. The paralog is identified by screening candidate peptide sequences of 4-20 amino acids for specific affinity to the analyte.
Design of a targeted peptide nucleic acid prodrug to inhibit hepatic human microsomal triglyceride transfer protein expression in hepatocytes.

PubMed

Biessen, Erik A L; Sliedregt-Bol, Karen; 'T Hoen, Peter A Chr; Prince, Perry; Van der Bilt, Erica; Valentijn, A Rob P M; Meeuwenoord, Nico J; Princen, Hans; Bijsterbosch, Martin K; Van der Marel, Gijs A; Van Boom, Jacques H; Van Berkel, Theo J C

2002-01-01

In this study, we present the design and synthesis of an antisense peptide nucleic acid (asPNA) prodrug, which displays an improved biodistribution profile and an equally improved capacity to reduce the levels of target mRNA. The prodrug, K(GalNAc)(2)-asPNA, comprised of a 14-mer sequence complementary to the human microsomal triglyceride transfer protein (huMTP) gene, conjugated to a high-affinity tag for the hepatic asialoglycoprotein receptor (K(GalNAc)(2)). The prodrug was avidly bound and rapidly internalized by HepG2s. After iv injection into mice, K(GalNAc)(2)-asPNA accumulated in the parenchymal liver cells to a much greater extent than nonconjugated PNA (46% +/- 1% vs 3.1% +/- 0.5% of the injected dose, respectively). The prodrug was able to reduce MTP mRNA levels in HepG2 cells by 35-40% (P < 0.02) at 100 nM in an asialoglycoprotein receptor- and sequence-dependent fashion. In conclusion, hepatocyte-targeted PNA prodrugs combine a greatly improved tropism with an enhanced local intracellular availability and activity, making them attractive therapeutics to lower the expression level of hepatic target genes such as MTP.

Application of 2D graphic representation of protein sequence based on Huffman tree method.

PubMed

Qi, Zhao-Hui; Feng, Jun; Qi, Xiao-Qin; Li, Ling

2012-05-01

Based on Huffman tree method, we propose a new 2D graphic representation of protein sequence. This representation can completely avoid loss of information in the transfer of data from a protein sequence to its graphic representation. The method consists of two parts. One is about the 0-1 codes of 20 amino acids by Huffman tree with amino acid frequency. The amino acid frequency is defined as the statistical number of an amino acid in the analyzed protein sequences. The other is about the 2D graphic representation of protein sequence based on the 0-1 codes. Then the applications of the method on ten ND5 genes and seven Escherichia coli strains are presented in detail. The results show that the proposed model may provide us with some new sights to understand the evolution patterns determined from protein sequences and complete genomes. Copyright © 2012 Elsevier Ltd. All rights reserved.
Opsin cDNA sequences of a UV and green rhodopsin of the satyrine butterfly Bicyclus anynana.

PubMed

Vanhoutte, K J A; Eggen, B J L; Janssen, J J M; Stavenga, D G

2002-11-01

The cDNAs of an ultraviolet (UV) and long-wavelength (LW) (green) absorbing rhodopsin of the bush brown Bicyclus anynana were partially identified. The UV sequence, encoding 377 amino acids, is 76-79% identical to the UV sequences of the papilionids Papilio glaucus and Papilio xuthus and the moth Manduca sexta. A dendrogram derived from aligning the amino acid sequences reveals an equidistant position of Bicyclus between Papilio and Manduca. The sequence of the green opsin cDNA fragment, which encodes 242 amino acids, represents six of the seven transmembrane regions. At the amino acid level, this fragment is more than 80% identical to the corresponding LW opsin sequences of Dryas, Heliconius, Papilio (rhodopsin 2) and Manduca. Whereas three LW absorbing rhodopsins were identified in the papilionid butterflies, only one green opsin was found in B. anynana.
Complete amino acid sequence of ananain and a comparison with stem bromelain and other plant cysteine proteases.

PubMed Central

Lee, K L; Albee, K L; Bernasconi, R J; Edmunds, T

1997-01-01

The amino acid sequences of ananain (EC3.4.22.31) and stem bromelain (3.4.22.32), two cysteine proteases from pineapple stem, are similar yet ananain and stem bromelain possess distinct specificities towards synthetic peptide substrates and different reactivities towards the cysteine protease inhibitors E-64 and chicken egg white cystatin. We present here the complete amino acid sequence of ananain and compare it with the reported sequences of pineapple stem bromelain, papain and chymopapain from papaya and actinidin from kiwifruit. Ananain is comprised of 216 residues with a theoretical mass of 23464 Da. This primary structure includes a sequence insert between residues 170 and 174 not present in stem bromelain or papain and a hydrophobic series of amino acids adjacent to His-157. It is possible that these sequence differences contribute to the different substrate and inhibitor specificities exhibited by ananain and stem bromelain. PMID:9355753
Contribution of Tryptophan Residues to the Combining Site of a Monoclonal Anti Dinitrophenyl Spin-Label Antibody

DTIC Science & Technology

1987-01-01

identified in the difference spectra, implying that: there are five to seven tryptophans within 17 A of the spin-label hapten. Amino acid sequences...of the heavy, and light chains were obtained by a combination of amino acid and DNA sequencing. A molecular model’ was constructed from the sequence...Clore & acids yields detailed information about the amino acid com- Gronenborn, 1982, 1983). This technique should also identify position of the combining
Effect of Backbone Design on Hybridization Thermodynamics of Oligo-nucleic Acids: A Coarse-Grained Molecular Dynamics Simulation Study

NASA Astrophysics Data System (ADS)

Ghobadi, Ahmadreza F.; Jayaraman, Arthi

DNA hybridization is the basis of various bio-nano technologies, such as DNA origami and assembly of DNA-functionalized nanoparticles. A hybridized double stranded (ds) DNA is formed when complementary nucleobases on hybridizing strands exhibit specific and directional hydrogen bonds through canonical Watson-Crick base-pairing interactions. In recent years, the need for cheaper alternatives and significant synthetic advances have driven design of DNA mimics with new backbone chemistries. However, a fundamental understanding of how these backbone modifications in the oligo-nucleic acids impact the hybridization and melting behavior of the duplex is still lacking. In this talk, we present our recent findings on impact of varying backbone chemistry on hybridization of oligo-nucleic acid duplexes. We use coarse-grained molecular dynamics simulations to isolate the effect of strand flexibility, electrostatic interactions and nucleobase spacing on the melting curves for duplexes with various strand sequences and concentrations. Since conjugation of oligo-nucleic acids with polymers serve as building blocks for thermo-responsive polymer networks and gels, we also present the effect of such conjugation on hybridization thermodynamics and polymer conformation.
A GntR-type transcriptional repressor controls sialic acid utilization in Bifidobacterium breve UCC2003.

PubMed

Egan, Muireann; O'Connell Motherway, Mary; van Sinderen, Douwe

2015-02-01

Bifidobacterium breve strains are numerically prevalent among the gut microbiota of healthy, breast-fed infants. The metabolism of sialic acid, a ubiquitous monosaccharide in the infant and adult gut, by B. breve UCC2003 is dependent on a large gene cluster, designated the nan/nag cluster. This study describes the transcriptional regulation of the nan/nag cluster and thus sialic acid metabolism in B. breve UCC2003. Insertion mutagenesis and transcriptome analysis revealed that the nan/nag cluster is regulated by a GntR family transcriptional repressor, designated NanR. Crude cell extract of Escherichia coli EC101 in which the nanR gene had been cloned and overexpressed was shown to bind to two promoter regions within this cluster, each of which containing an imperfect inverted repeat that is believed to act as the NanR operator sequence. Formation of the DNA-NanR complex is prevented in the presence of sialic acid, which we had previously shown to induce transcription of this gene cluster. © FEMS 2014. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Concurrent Automated Sequencing of the Glycan and Peptide Portions of O-Linked Glycopeptide Anions by Ultraviolet Photodissociation Mass Spectrometry

PubMed Central

Madsen, James A.; Ko, Byoung Joon; Xu, Hua; Iwashkiw, Jeremy A.; Robotham, Scott A.; Shaw, Jared B.; Feldman, Mario F.; Brodbelt, Jennifer S.

2013-01-01

O -glycopeptides are often acidic owing to the frequent occurrence of acidic saccharides in the glycan, rendering traditional proteomic workflows that rely on positive mode tandem mass spectrometry (MS/MS) less effective. In this report, we demonstrate the utility of negative mode ultraviolet photodissociation (UVPD) MS for the characterization of acidic O-linked glycopeptide anions. This method was evaluated for a series of singly- and multiply-deprotonated glycopeptides from the model glycoprotein kappa casein, resulting in production of both peptide and glycan product ions that afforded 100% sequence coverage of the peptide and glycan moieties from a single MS/MS event. The most abundant and frequent peptide sequence ions were a/x-type products, which, importantly, were found to retain the labile glycan modifications. The glycan-specific ions mainly arose from glycosidic bond cleavages (B, Y, C, and Z ions) in addition to some less common cross-ring cleavages. Based on the UVPD fragmentation patterns, an automated database searching strategy (based on the MassMatrix algorithm) was designed that is specific for the analysis of glycopeptide anions by UVPD. This algorithm was used to identify glycopeptides from mixtures of glycosylated and non-glycosylated peptides, sequence both glycan and peptide moieties simultaneously, and pinpoint the correct site(s) of glycosylation. This methodology was applied to uncover novel site-specificity of the O-linked glycosylated OmpA/MotB from the “superbug” A. baumannii to help aid in the elucidation of the functional role that protein glycosylation plays in pathogenesis. PMID:24006841
Cloning and characterization of the gene encoding the endopolygalacturonase-inhibiting protein (PGIP) of Phaseolus vulgaris L.

PubMed

Toubart, P; Desiderio, A; Salvi, G; Cervone, F; Daroda, L; De Lorenzo, G

1992-05-01

Polygalacturonase-inhibiting protein (PGIP) is a cell wall protein purified from hypocotyls of true bean (Phaseolus vulgaris L.). PGIP inhibits fungal endopolygalacturonases and is considered to be an important factor for plant resistance to phytopathogenic fungi (Albersheim and Anderson, 1971; Cervone et al., 1987). The amino acid sequences of the N-terminus and one internal tryptic peptide of the PGIP purified from P. vulgaris cv. Pinto were used to design redundant oligonucleotides that were successfully utilized as primers in a polymerase chain reaction (PCR) with total DNA of P. vulgaris as a template. A DNA band of 758 bp (a specific PCR amplification product of part of the gene coding for PGIP) was isolated and cloned. By using the 758-bp DNA as a hybridization probe, a lambda clone containing the PGIP gene was isolated from a genomic library of P. vulgaris cv. Saxa. The coding and immediate flanking regions of the PGIP gene, contained on a subcloned 3.3 kb SalI-SalI DNA fragment, were sequenced. A single, continuous ORF of 1026 nt (342 amino acids) was present in the genomic clone. The nucleotide and deduced amino acid sequences of the PGIP gene showed no significant similarity with any known databank sequence. Northern blotting analysis of poly(A)+ RNAs, isolated from various tissues of bean seedlings or from suspension-cultured bean cells, were also performed using the cloned PCR-generated DNA as a probe. A 1.2 kb transcript was detected in suspension-cultured cells and, to a lesser extent, in leaves, hypocotyls, and flowers.(ABSTRACT TRUNCATED AT 250 WORDS)
An α‐Helix‐Mimicking 12,13‐Helix: Designed α/β/γ‐Foldamers as Selective Inhibitors of Protein–Protein Interactions

PubMed Central

Grison, Claire M.; Miles, Jennifer A.; Robin, Sylvie

2016-01-01

Abstract A major current challenge in bioorganic chemistry is the identification of effective mimics of protein secondary structures that act as inhibitors of protein–protein interactions (PPIs). In this work, trans‐2‐aminocyclobutanecarboxylic acid (tACBC) was used as the key β‐amino acid component in the design of α/β/γ‐peptides to structurally mimic a native α‐helix. Suitably functionalized α/β/γ‐peptides assume an α‐helix‐mimicking 12,13‐helix conformation in solution, exhibit enhanced proteolytic stability in comparison to the wild‐type α‐peptide parent sequence from which they are derived, and act as selective inhibitors of the p53/hDM2 interaction. PMID:27467859
An α-Helix-Mimicking 12,13-Helix: Designed α/β/γ-Foldamers as Selective Inhibitors of Protein-Protein Interactions.

PubMed

Grison, Claire M; Miles, Jennifer A; Robin, Sylvie; Wilson, Andrew J; Aitken, David J

2016-09-05

A major current challenge in bioorganic chemistry is the identification of effective mimics of protein secondary structures that act as inhibitors of protein-protein interactions (PPIs). In this work, trans-2-aminocyclobutanecarboxylic acid (tACBC) was used as the key β-amino acid component in the design of α/β/γ-peptides to structurally mimic a native α-helix. Suitably functionalized α/β/γ-peptides assume an α-helix-mimicking 12,13-helix conformation in solution, exhibit enhanced proteolytic stability in comparison to the wild-type α-peptide parent sequence from which they are derived, and act as selective inhibitors of the p53/hDM2 interaction. © 2016 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA.
Nucleotide sequence of the phosphoglycerate kinase gene from the extreme thermophile Thermus thermophilus. Comparison of the deduced amino acid sequence with that of the mesophilic yeast phosphoglycerate kinase.

PubMed Central

Bowen, D; Littlechild, J A; Fothergill, J E; Watson, H C; Hall, L

1988-01-01

Using oligonucleotide probes derived from amino acid sequencing information, the structural gene for phosphoglycerate kinase from the extreme thermophile, Thermus thermophilus, was cloned in Escherichia coli and its complete nucleotide sequence determined. The gene consists of an open reading frame corresponding to a protein of 390 amino acid residues (calculated Mr 41,791) with an extreme bias for G or C (93.1%) in the codon third base position. Comparison of the deduced amino acid sequence with that of the corresponding mesophilic yeast enzyme indicated a number of significant differences. These are discussed in terms of the unusual codon bias and their possible role in enhanced protein thermal stability. Images Fig. 1. PMID:3052437
Sequence of a cDNA encoding pancreatic preprosomatostatin-22.

PubMed Central

Magazin, M; Minth, C D; Funckes, C L; Deschenes, R; Tavianini, M A; Dixon, J E

1982-01-01

We report the nucleotide sequence of a precursor to somatostatin that upon proteolytic processing may give rise to a hormone of 22 amino acids. The nucleotide sequence of a cDNA from the channel catfish (Ictalurus punctatus) encodes a precursor to somatostatin that is 105 amino acids (Mr, 11,500). The cDNA coding for somatostatin-22 consists of 36 nucleotides in the 5' untranslated region, 315 nucleotides that code for the precursor to somatostatin-22, 269 nucleotides at the 3' untranslated region, and a variable length of poly(A). The putative preprohormone contains a sequence of hydrophobic amino acids at the amino terminus that has the properties of a "signal" peptide. A connecting sequence of approximately 57 amino acids is followed by a single Arg-Arg sequence, which immediately precedes the hormone. Somatostatin-22 is homologous to somatostatin-14 in 7 of the 14 amino acids, including the Phe-Trp-Lys sequence. Hybridization selection of mRNA, followed by its translation in a wheat germ cell-free system, resulted in the synthesis of a single polypeptide having a molecular weight of approximately 10,000 as estimated on Na-DodSO4/polyacrylamide gels. Images PMID:6127673
The Apollo program and amino acids. [precursors significance in molecular evolution

NASA Technical Reports Server (NTRS)

Fox, S. W.

1973-01-01

Apollo lunar sample analyses designed to detect the presence of organic compounds are reviewed, and the results are discussed from the viewpoint of relevance to laboratory experiments on the synthesis of amino acids and to theoretical models of cosmochemical processes resulting in the formation of organic compounds. Glycine, alanine, glutamic acid, aspartic acid, serine, and threonine have been found repeatedly in the hydrolyzates of hot aqueous extracts of lunar dust. These compounds represent an early step in the sequence of events leading to the rise of living material and were probably deposited by the solar wind. The results of the Apollo program so far suggest that the pathway from cosmic organic matter to life as it evolved on earth could have been pursued on the moon to the stage of amino acid precursors and then may have been terminated for lack of sufficient water.
Insights into the diversity of eukaryotes in acid mine drainage biofilm communities.

PubMed

Baker, Brett J; Tyson, Gene W; Goosherst, Lindsey; Banfield, Jillian F

2009-04-01

Microscopic eukaryotes are known to have important ecosystem functions, but their diversity in most environments remains vastly unexplored. Here we analyzed an 18S rRNA gene library from a subsurface iron- and sulfur-oxidizing microbial community growing in highly acidic (pH < 0.9) runoff within the Richmond Mine at Iron Mountain (northern California). Phylogenetic analysis revealed that the majority (68%) of the sequences belonged to fungi. Protists falling into the deeply branching lineage named the acidophilic protist clade (APC) and the class Heterolobosea were also present. The APC group represents kingdom-level novelty, with <76% sequence similarity to 18S rRNA gene sequences of organisms from other environments. Fluorescently labeled oligonucleotide rRNA probes were designed to target each of these groups in biofilm samples, enabling abundance and morphological characterization. Results revealed that the populations vary significantly with the habitat and no group is ubiquitous. Surprisingly, many of the eukaryotic lineages (with the exception of the APC) are closely related to neutrophiles, suggesting that they recently adapted to this extreme environment. Molecular analyses presented here confirm that the number of eukaryotic species associated with the acid mine drainage (AMD) communities is low. This finding is consistent with previous results showing a limited diversity of archaea, bacteria, and viruses in AMD environments and suggests that the environmental pressures and interplay between the members of these communities limit species diversity at all trophic levels.
Persistence of evolutionary memory: primordial six-transmembrane helical domain mu opiate receptors selectively linked to endogenous morphine signaling.

PubMed

Kream, Richard M; Sheehan, Melinda; Cadet, Patrick; Mantione, Kirk J; Zhu, Wei; Casares, Federico; Stefano, George B

2007-12-01

Biochemical, molecular and pharmacological evidence for two unique six-transmembrane helical (TMH) domain opiate receptors expressed from the micro opioid receptor (MOR) gene have been shown. Designated micro3 and micro4 receptors, both protein species are Class A rhodopsin-like members of the superfamily of G-protein coupled receptors but are selectively tailored to mediate the cellular regulatory effects of endogenous morphine and related morphinan alkaloids via stimulation of nitric oxide (NO) production and release. Both micro3 and micro4 receptors lack an amino acid sequence of approximately 90 amino acids that constitute the extracellular N-terminal and TMH1 domains and part of the first intracellular loop of the micro1 receptor, but retain the empirically defined ligand binding pocket distributed across conserved TMH2, TMH3, and TMH7 domains of the micro1 sequence. Additionally, the receptor proteins are terminated by unique intracellular C-terminal amino acid sequences that serve as putative coupling or docking domains required for constitutive NO synthase activation. Because the recognition profile of micro3 and micro4 receptors is restricted to rigid benzylisoquinoline alkaloids typified by morphine and its extended family of chemical congeners, it is hypothesized that conformational stabilization provided by interaction of extended extracellular N-terminal protein domains and the extracellular loops is required for binding of endogenous opioid peptides as well as synthetic flexible opiate alkaloids.
pH responsive micelle self-assembled from a new amphiphilic peptide as anti-tumor drug carrier.

PubMed

Liang, Ju; Wu, Wen-Lan; Xu, Xiao-Ding; Zhuo, Ren-Xi; Zhang, Xian-Zheng

2014-02-01

An acid-responsive amphiphilic peptide that contains KKGRGDS sequence in hydrophilic head and VVVVVV sequence in hydrophobic tail was designed and prepared. In neutral or basic medium, this amphiphilic peptide can self-assemble into micelles through hydrogen bonding and hydrophobic interactions. If changing the solution pH to an acidic environment, the electrostatic repulsion interaction among the ionized lysine (K) residues will prevent the self-assembly of the amphiphilic peptide, leading to the dissociation of micelles. The anti-tumor drug of doxorubicin (DOX) was chosen and loaded into the self-assembled micelles of the amphiphilic peptide to investigate the influence of external pH change on the drug release behavior. As expected, the micelles show a sustained DOX release in neutral medium (pH 7.0) but fast release behavior in acidic medium (pH 5.0). When incubating these DOX-loaded micelles with HeLa and COS7 cells, due to the over-expression of integrins on cancer cells, the micelles can efficiently use the tumor-targeting function of RGD sequence to deliver the drug into HeLa cells. Combined with the low cytotoxicity of the amphiphilic peptide against both HeLa and COS7 cells, the amphiphilic peptide reported in this work may be promising in clinical application for targeted drug delivery. Copyright © 2013 Elsevier B.V. All rights reserved.
Phylogenetic Relationship of Necoclí Virus to Other South American Hantaviruses (Bunyaviridae: Hantavirus).

PubMed

Montoya-Ruiz, Carolina; Cajimat, Maria N B; Milazzo, Mary Louise; Diaz, Francisco J; Rodas, Juan David; Valbuena, Gustavo; Fulhorst, Charles F

2015-07-01

The results of a previous study suggested that Cherrie's cane rat (Zygodontomys cherriei) is the principal host of Necoclí virus (family Bunyaviridae, genus Hantavirus) in Colombia. Bayesian analyses of complete nucleocapsid protein gene sequences and complete glycoprotein precursor gene sequences in this study confirmed that Necoclí virus is phylogenetically closely related to Maporal virus, which is principally associated with the delicate pygmy rice rat (Oligoryzomys delicatus) in western Venezuela. In pairwise comparisons, nonidentities between the complete amino acid sequence of the nucleocapsid protein of Necoclí virus and the complete amino acid sequences of the nucleocapsid proteins of other hantaviruses were ≥8.7%. Likewise, nonidentities between the complete amino acid sequence of the glycoprotein precursor of Necoclí virus and the complete amino acid sequences of the glycoprotein precursors of other hantaviruses were ≥11.7%. Collectively, the unique association of Necoclí virus with Z. cherriei in Colombia, results of the Bayesian analyses of complete nucleocapsid protein gene sequences and complete glycoprotein precursor gene sequences, and results of the pairwise comparisons of amino acid sequences strongly support the notion that Necoclí virus represents a novel species in the genus Hantavirus. Further work is needed to determine whether Calabazo virus (a hantavirus associated with Z. brevicauda cherriei in Panama) and Necoclí virus are conspecific.
Brain cDNA clone for human cholinesterase

DOE Office of Scientific and Technical Information (OSTI.GOV)

McTiernan, C.; Adkins, S.; Chatonnet, A.

1987-10-01

A cDNA library from human basal ganglia was screened with oligonucleotide probes corresponding to portions of the amino acid sequence of human serum cholinesterase. Five overlapping clones, representing 2.4 kilobases, were isolated. The sequenced cDNA contained 207 base pairs of coding sequence 5' to the amino terminus of the mature protein in which there were four ATG translation start sites in the same reading frame as the protein. Only the ATG coding for Met-(-28) lay within a favorable consensus sequence for functional initiators. There were 1722 base pairs of coding sequence corresponding to the protein found circulating in human serum.more » The amino acid sequence deduced from the cDNA exactly matched the 574 amino acid sequence of human serum cholinesterase, as previously determined by Edman degradation. Therefore, our clones represented cholinesterase rather than acetylcholinesterase. It was concluded that the amino acid sequences of cholinesterase from two different tissues, human brain and human serum, were identical. Hybridization of genomic DNA blots suggested that a single gene, or very few genes coded for cholinesterase.« less
Compositions and methods for improved protein production

DOEpatents

Bodie, Elizabeth A [San Carlos, CA; Kim, Steve [San Francisco, CA

2012-07-10

The present invention relates to the identification of novel nucleic acid sequences, designated herein as 7p, 8k, 7E, 9G, 8Q and 203, in a host cell which effect protein production. The present invention also provides host cells having a mutation or deletion of part or all of the gene encoding 7p, 8k, 7E, 9G, 8Q and 203, which are presented in FIG. 1, and are SEQ ID NOS.: 1-6, respectively. The present invention also provides host cells further comprising a nucleic acid encoding a desired heterologous protein such as an enzyme.
Compositions and methods for improved protein production

DOEpatents

Bodie, Elizabeth A.; Kim, Steve Sungjin

2014-06-03

The present invention relates to the identification of novel nucleic acid sequences, designated herein as 7p, 8k, 7E, 9G, 8Q and 203, in a host cell which effect protein production. The present invention also provides host cells having a mutation or deletion of part or all of the gene encoding 7p, 8k, 7E, 9G, 8Q and 203, which are presented in FIG. 1, and are SEQ ID NOS.: 1-6, respectively. The present invention also provides host cells further comprising a nucleic acid encoding a desired heterologous protein such as an enzyme.

Molecular identification of the ompL1 gene within Leptospira interrogans standard serovars.

PubMed

Dezhbord, Mehrangiz; Esmaelizad, Majid; Khaki, Pejvak; Fotohi, Fariba; Zarehparvar Moghaddam, Athena

2014-06-11

Leptospirosis, caused by infection with pathogenic Leptospira species, is one of the most prevalent zoonotic diseases in the world. Current leptospiral vaccines are mainly multivalent dead whole-cell mixtures made of several local dominant serovars. Therefore, design and construction of an efficient recombinant vaccine for leptospirosis control is very important. OmpL1 is an immunogenic porin protein that could be of special significance in vaccination and serodiagnosis for leptospirosis. Three strains belonging to pathogenic L. interrogans were analyzed. The specific primers for proliferation of the ompL1 gene were designed. The amplified gene was cloned. In order to investigate the ompL1 nucleotide sequence and homological analysis of this gene, ompL1 genes cloned from standard vaccinal Leptospira serovars prevalent in Iran were sequenced and cloned. PCR amplification of the ompL1 gene using the designed primers resulted in a 963 bp ompL1 gene product. The PCR based on the ompL1 gene detected all pathogenic reference serovars of Leptospira spp. tested. Based on alignment and phylogenetic analysis, although the ompL1 nucleotide sequence was slightly different within three vaccinal serovars (100%-85% identity), amino acid alignment of the OmpL1 proteins revealed that there would be inconsiderable difference among them. The ompL1 gene of the three isolates was well conserved, differing only by a total of 6 bp and the proteins by 2 amino acids. The cloned gene could be further used for expression and recombinant OmpL1 as an efficient and conserved antigen, and may be a useful vaccine candidate against leptospirosis in our region.
Cloning and expression of cDNA coding for bouganin.

PubMed

den Hartog, Marcel T; Lubelli, Chiara; Boon, Louis; Heerkens, Sijmie; Ortiz Buijsse, Antonio P; de Boer, Mark; Stirpe, Fiorenzo

2002-03-01

Bouganin is a ribosome-inactivating protein that recently was isolated from Bougainvillea spectabilis Willd. In this work, the cloning and expression of the cDNA encoding for bouganin is described. From the cDNA, the amino-acid sequence was deduced, which correlated with the primary sequence data obtained by amino-acid sequencing on the native protein. Bouganin is synthesized as a pro-peptide consisting of 305 amino acids, the first 26 of which act as a leader signal while the 29 C-terminal amino acids are cleaved during processing of the molecule. The mature protein consists of 250 amino acids. Using the cDNA sequence encoding the mature protein of 250 amino acids, a recombinant protein was expressed, purified and characterized. The recombinant molecule had similar activity in a cell-free protein synthesis assay and had comparable toxicity on living cells as compared to the isolated native bouganin.
Method for altering antibody light chain interactions

DOEpatents

Stevens, Fred J.; Stevens, Priscilla Wilkins; Raffen, Rosemarie; Schiffer, Marianne

2002-01-01

A method for recombinant antibody subunit dimerization including modifying at least one codon of a nucleic acid sequence to replace an amino acid occurring naturally in the antibody with a charged amino acid at a position in the interface segment of the light polypeptide variable region, the charged amino acid having a first polarity; and modifying at least one codon of the nucleic acid sequence to replace an amino acid occurring naturally in the antibody with a charged amino acid at a position in an interface segment of the heavy polypeptide variable region corresponding to a position in the light polypeptide variable region, the charged amino acid having a second polarity opposite the first polarity. Nucleic acid sequences which code for novel light chain proteins, the latter of which are used in conjunction with the inventive method, are also provided.
An Aspergillus oryzae acetyl xylan esterase: molecular cloning and characteristics of recombinant enzyme expressed in Pichia pastoris.

PubMed

Koseki, Takuya; Miwa, Yozo; Akao, Takeshi; Akita, Osamu; Hashizume, Katsumi

2006-02-10

We screened 20,000 clones of an expressed sequence tag (EST) library from Aspergillus oryzae (http://www.nrib.go.jp/ken/EST/db/index.html) and obtained one cDNA clone encoding a protein with similarity to fungal acetyl xylan esterase. We also cloned the corresponding gene, designated as Aoaxe, from the genomic DNA. The deduced amino acid sequence consisted of a putative signal peptide of 31-amino acids and a mature protein of 276-amino acids. We engineered Aoaxe for heterologous expression in P. pastoris. Recombinant AoAXE (rAoAXE) was secreted by the aid of fused alpha-factor secretion signal peptide and accumulated as an active enzyme in the culture medium to a final level of 190 mg/l after 5 days. Purified rAoAXEA before and after treatment with endoglycosidase H migrated by SDS-PAGE with a molecular mass of 31 and 30 kDa, respectively. Purified rAoAXE displayed the greatest hydrolytic activity toward alpha-naphthylacetate (C2), lower activity toward alpha-naphthylpropionate (C3) and no detectable activity toward acyl-chain substrates containing four or more carbon atoms. The recombinant enzyme catalyzed the release of acetic acid from birchwood xylan. No activity was detectable using methyl esters of ferulic, caffeic or sinapic acids. rAoAXE was thermolabile in comparison to other AXEs from Aspergillus.
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

Code of Federal Regulations, 2013 CFR

2013-07-01

... in WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3. This incorporation by reference was... ST.25 (1998), Appendix 2, Tables 1 and 3, shall be listed in a given sequence as “n” or “Xaa... acids. (1) The amino acids in a protein or peptide sequence shall be listed using the three-letter...
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

Code of Federal Regulations, 2010 CFR

2010-07-01

... in WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3. This incorporation by reference was... ST.25 (1998), Appendix 2, Tables 1 and 3, shall be listed in a given sequence as “n” or “Xaa... acids. (1) The amino acids in a protein or peptide sequence shall be listed using the three-letter...
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

Code of Federal Regulations, 2012 CFR

2012-07-01

... in WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3. This incorporation by reference was... ST.25 (1998), Appendix 2, Tables 1 and 3, shall be listed in a given sequence as “n” or “Xaa... acids. (1) The amino acids in a protein or peptide sequence shall be listed using the three-letter...
S1 of distinct IBV population expressed from recombinant adenovirus confers protection against challenge.

PubMed

Toro, H; Zhang, J F; Gallardo, R A; van Santen, V L; van Ginkel, F W; Joiner, K S; Breedlove, C

2014-06-01

Protective properties of three distinct infectious bronchitis virus (IBV) Ark Delmarva poultry industry (ArkDPI) S1 proteins encoded from replication-defective recombinant adenovirus vectors were investigated. Using a suboptimal dose of each recombinant virus, we demonstrated that IBV S1 amino acid sequences showing > or = 95.8% amino acid identity to the S1 of the challenge strain differed in their ability at conferring protection. Indeed, the S1 sequence of the IBV population previously designated C4 (AdIBVS1.C4), which protected the most poorly, differs from the S1 sequence of population C2 (AdIBVS1.C2), which provided the highest protection, only at amino acid position 56. The fact that a change in one amino acid in this region significantly altered the induction of a protective immune response against this protein provides evidence that the first portion of S1 displays relevant immunoprotective epitopes. Use of an optimal dose of AdIBVS1.C2 effectively protected chickens from clinical signs and significantly reduced viral load after IBV Ark virulent challenge. Moreover, increased numbers of both IgA and IgG IBV-specific antibody secreting lymphocytes were detected in the spleen after challenge. The increased response detected for both IgA and IgG lymphocytes after challenge might be explained by vaccine-induced B memory cells. The fact that a single vaccination with Ad/IBVS1.C2 provides protection against IBV challenge is promising, because Ad-vectored vaccines can be mass delivered by in ovo inoculation using automated in ovo injectors.
Unifying bacteria from decaying wood with various ubiquitous Gibbsiella species as G. acetica sp. nov. based on nucleotide sequence similarities and their acetic acid secretion.

PubMed

Geider, Klaus; Gernold, Marina; Jock, Susanne; Wensing, Annette; Völksch, Beate; Gross, Jürgen; Spiteller, Dieter

2015-12-01

Bacteria were isolated from necrotic apple and pear tree tissue and from dead wood in Germany and Austria as well as from pear tree exudate in China. They were selected for growth at 37 °C, screened for levan production and then characterized as Gram-negative, facultatively anaerobic rods. Nucleotide sequences from 16S rRNA genes, the housekeeping genes dnaJ, gyrB, recA and rpoB alignments, BLAST searches and phenotypic data confirmed by MALDI-TOF analysis showed that these bacteria belong to the genus Gibbsiella and resembled strains isolated from diseased oaks in Britain and Spain. Gibbsiella-specific PCR primers were designed from the proline isomerase and the levansucrase genes. Acid secretion was investigated by screening for halo formation on calcium carbonate agar and the compound identified by NMR as acetic acid. Its production by Gibbsiella spp. strains was also determined in culture supernatants by GC/MS analysis after derivatization with pentafluorobenzyl bromide. Some strains were differentiated by the PFGE patterns of SpeI digests and by sequence analyses of the lsc and the ppiD genes, and the Chinese Gibbsiella strain was most divergent. The newly investigated bacteria as well as Gibbsiella querinecans, Gibbsiella dentisursi and Gibbsiella papilionis, isolated in Britain, Spain, Korea and Japan, are taxonomically related Enterobacteriaceae, tolerate and secrete acetic acid. We therefore propose to unify them in the species Gibbsiella acetica sp. nov. Copyright © 2015. Published by Elsevier GmbH.
Mining and gene ontology based annotation of SSR markers from expressed sequence tags of Humulus lupulus

PubMed Central

Singh, Swati; Gupta, Sanchita; Mani, Ashutosh; Chaturvedi, Anoop

2012-01-01

Humulus lupulus is commonly known as hops, a member of the family moraceae. Currently many projects are underway leading to the accumulation of voluminous genomic and expressed sequence tag sequences in public databases. The genetically characterized domains in these databases are limited due to non-availability of reliable molecular markers. The large data of EST sequences are available in hops. The simple sequence repeat markers extracted from EST data are used as molecular markers for genetic characterization, in the present study. 25,495 EST sequences were examined and assembled to get full-length sequences. Maximum frequency distribution was shown by mononucleotide SSR motifs i.e. 60.44% in contig and 62.16% in singleton where as minimum frequency are observed for hexanucleotide SSR in contig (0.09%) and pentanucleotide SSR in singletons (0.12%). Maximum trinucleotide motifs code for Glutamic acid (GAA) while AT/TA were the most frequent repeat of dinucleotide SSRs. Flanking primer pairs were designed in-silico for the SSR containing sequences. Functional categorization of SSRs containing sequences was done through gene ontology terms like biological process, cellular component and molecular function. PMID:22368382
Use of CYP52A2A promoter to increase gene expression in yeast

DOEpatents

Craft, David L.; Wilson, C. Ron; Eirich, Dudley; Zhang, Yeyan

2004-01-06

A nucleic acid sequence including a CYP promoter operably linked to nucleic acid encoding a heterologous protein is provided to increase transcription of the nucleic acid. Expression vectors and host cells containing the nucleic acid sequence are also provided. The methods and compositions described herein are especially useful in the production of polycarboxylic acids by yeast cells.
Method of Identifying a Base in a Nucleic Acid

DOEpatents

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

1999-01-01

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Identifying a base in a nucleic acid

DOEpatents

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

2005-02-08

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Directing folding pathways for multi-component DNA origami nanostructures with complex topology

NASA Astrophysics Data System (ADS)

Marras, A. E.; Zhou, L.; Kolliopoulos, V.; Su, H.-J.; Castro, C. E.

2016-05-01

Molecular self-assembly has become a well-established technique to design complex nanostructures and hierarchical mesoscale assemblies. The typical approach is to design binding complementarity into nucleotide or amino acid sequences to achieve the desired final geometry. However, with an increasing interest in dynamic nanodevices, the need to design structures with motion has necessitated the development of multi-component structures. While this has been achieved through hierarchical assembly of similar structural units, here we focus on the assembly of topologically complex structures, specifically with concentric components, where post-folding assembly is not feasible. We exploit the ability to direct folding pathways to program the sequence of assembly and present a novel approach of designing the strand topology of intermediate folding states to program the topology of the final structure, in this case a DNA origami slider structure that functions much like a piston-cylinder assembly in an engine. The ability to program the sequence and control orientation and topology of multi-component DNA origami nanostructures provides a foundation for a new class of structures with internal and external moving parts and complex scaffold topology. Furthermore, this work provides critical insight to guide the design of intermediate states along a DNA origami folding pathway and to further understand the details of DNA origami self-assembly to more broadly control folding states and landscapes.
Nucleotide sequencing analysis of a LEU gene of Candida maltosa which complements leuB mutation of Escherichia coli and leu2 mutation of Saccharomyces cerevisiae.

PubMed

Takagi, M; Kobayashi, N; Sugimoto, M; Fujii, T; Watari, J; Yano, K

1987-01-01

The expression of a LEU gene from Candida maltosa (designated as C-LEU2) isolated previously (Kawamura et al. 1983) was shown to be regulated, when transferred into Saccharomyces cerevisiae, by leucine and threonine in the medium, as in the case of LEU2 gene of S. cerevisiae. The coding region together with the regulatory region was subcloned and the nucleotide sequence was determined. When the sequence of the coding region was compared with that of LEU2, the homology was 72% for base pairs and 76% for deduced amino acids. Comparison of the regulatory region of C-LEU2 with those of LEU1 and LEU2 suggested a few short consensus sequences which are involved in regulation of gene expression by leucine and threonine in the medium.
Cloning and sequence analysis of sucrose phosphate synthase gene from varieties of Pennisetum species.

PubMed

Li, H C; Lu, H B; Yang, F Y; Liu, S J; Bai, C J; Zhang, Y W

2015-03-31

Sucrose phosphate synthase (SPS) is an enzyme used by higher plants for sucrose synthesis. In this study, three primer sets were designed on the basis of known SPS sequences from maize (GenBank: NM_001112224.1) and sugarcane (GenBank: JN584485.1), and five novel SPS genes were identified by RT-PCR from the genomes of Pennisetum spp (the hybrid P. americanum x P. purpureum, P. purpureum Schum., P. purpureum Schum. cv. Red, P. purpureum Schum. cv. Taiwan, and P. purpureum Schum. cv. Mott). The cloned sequences showed 99.9% identity and 80-88% similarity to the SPS sequences of other plants. The SPS gene of hybrid Pennisetum had one nucleotide and four amino acid polymorphisms compared to the other four germplasms, and cluster analysis was performed to assess genetic diversity in this species. Additional characterization of the SPS gene product can potentially allow Pennisetum to be exploited as a biofuel source.
A New Primer to Amplify pmoA Gene From NC10 Bacteria in the Sediments of Dongchang Lake and Dongping Lake.

PubMed

Wang, Shenghui; Liu, Yanjun; Liu, Guofu; Huang, Yaru; Zhou, Yu

2017-08-01

Nitrite-dependent anaerobic methane oxidation (n-damo) is catalyzed by the NC10 phylum bacterium "Candidatus Methylomirabilis oxyfera" (M. oxyfera). Generally, the pmoA gene is applied as a functional marker to test and identify NC10-like bacteria. However, it is difficult to detect the NC10 bacteria from sediments of freshwater lake (Dongchang Lake and Dongping Lake) with the previous pmoA gene primer sets. In this work, a new primer cmo208 was designed and used to amplify pmoA gene of NC10-like bacteria. A newly nested PCR approach was performed using the new primer cmo208 and the previous primers cmo182, cmo682, and cmo568 to detect the NC10 bacteria. The obtained pmoA gene sequences exhibited 85-92% nucleotide identity and 95-97% amino acid sequence identity to pmoA gene of M. oxyfera. The obtained diversity of pmoA gene sequences coincided well with the diversity of 16S rRNA sequences. These results indicated that the newly designed pmoA primer cmo208 could give one more option to detect NC10 bacteria from different environmental samples.
Streptococcal phosphoenolpyruvate-sugar phosphotransferase system: amino acid sequence and site of ATP-dependent phosphorylation of HPr

DOE Office of Scientific and Technical Information (OSTI.GOV)

Deutscher, J.; Pevec, B.; Beyreuther, K.

1986-10-21

The amino acid sequence of histidine-containing protein (HPr) from Streptococcus faecalis has been determined by direct Edman degradation of intact HPr and by amino acid sequence analysis of tryptic peptides, V8 proteolyptic peptides, thermolytic peptides, and cyanogen bromide cleavage products. HPr from S. faecalis was found to contain 89 amino acid residues, corresponding to a molecular weight of 9438. The amino acid sequence of HPr from S. faecalis shows extended homology to the primary structure of HPr proteins from other bacteria. Besides the phosphoenolpyruvate-dependent phosphorylation of a histidyl residue in HPr, catalyzed by enzyme I of the bacterial phosphotransferase system,more » HPr was also found to be phosphorylated at a seryl residue in an ATP-dependent protein kinase catalyzed reaction. The site of ATP-dependent phosphorylation in HPr of S faecalis has now been determined. (/sup 32/P)P-Ser-HPr was digested with three different proteases, and in each case, a single labeled peptide was isolated. Following digestion with subtilisin, they obtained a peptide with the sequence -(P)Ser-Ile-Met-. Using chymotrypsin, they isolated a peptide with the sequence -Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-Gly-Val-Met-. The longest labeled peptide was obtained with V8 staphylococcal protease. According to amino acid analysis, this peptide contained 36 out of the 89 amino acid residues of HPr. The following sequence of 12 amino acid residues of the V8 peptide was determined: -Tyr-Lys-Gly-Lys-Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-. Thus, the site of ATP-dependent phosphorylation was determined to be Ser-46 within the primary structure of HPr.« less
Synthesis and Pharmacology of α/β(3)-Peptides Based on the Melanocortin Agonist Ac-His-dPhe-Arg-Trp-NH2 Sequence.

PubMed

Singh, Anamika; Tala, Srinivasa R; Flores, Viktor; Freeman, Katie; Haskell-Luevano, Carrie

2015-05-14

The melanocortin-3 and -4 receptors are expressed in the brain and play key roles in regulating feeding behavior, metabolism, and energy homeostasis. In the present study, incorporation of β(3)-amino acids into a melanocortin tetrapeptide template was investigated. Four linear α/β(3)-hybrid tetrapeptides were designed with the modifications at the Phe, Arg, and Trp residues in the agonist sequence Ac-His-dPhe-Arg-Trp-NH2. The most potent mouse melanocortin-4 receptor (mMC4R) agonist, Ac-His-dPhe-Arg-β(3)hTrp-NH2 (8) showed 35-fold selectivity versus the mMC3R. The study presented here has identified a new template with heterogeneous backbone for designing potent and selective melanocortin receptor ligands.
Synthesis and Pharmacology of α/β3-Peptides Based on the Melanocortin Agonist Ac-His-dPhe-Arg-Trp-NH2 Sequence

PubMed Central

2015-01-01

The melanocortin-3 and -4 receptors are expressed in the brain and play key roles in regulating feeding behavior, metabolism, and energy homeostasis. In the present study, incorporation of β3-amino acids into a melanocortin tetrapeptide template was investigated. Four linear α/β3-hybrid tetrapeptides were designed with the modifications at the Phe, Arg, and Trp residues in the agonist sequence Ac-His-dPhe-Arg-Trp-NH2. The most potent mouse melanocortin-4 receptor (mMC4R) agonist, Ac-His-dPhe-Arg-β3hTrp-NH2 (8) showed 35-fold selectivity versus the mMC3R. The study presented here has identified a new template with heterogeneous backbone for designing potent and selective melanocortin receptor ligands. PMID:26005535

Design of multi-phase dynamic chemical networks

NASA Astrophysics Data System (ADS)

Chen, Chenrui; Tan, Junjun; Hsieh, Ming-Chien; Pan, Ting; Goodwin, Jay T.; Mehta, Anil K.; Grover, Martha A.; Lynn, David G.

2017-08-01

Template-directed polymerization reactions enable the accurate storage and processing of nature's biopolymer information. This mutualistic relationship of nucleic acids and proteins, a network known as life's central dogma, is now marvellously complex, and the progressive steps necessary for creating the initial sequence and chain-length-specific polymer templates are lost to time. Here we design and construct dynamic polymerization networks that exploit metastable prion cross-β phases. Mixed-phase environments have been used for constructing synthetic polymers, but these dynamic phases emerge naturally from the growing peptide oligomers and create environments suitable both to nucleate assembly and select for ordered templates. The resulting templates direct the amplification of a phase containing only chain-length-specific peptide-like oligomers. Such multi-phase biopolymer dynamics reveal pathways for the emergence, self-selection and amplification of chain-length- and possibly sequence-specific biopolymers.
Nucleic acid aptamers: research tools in disease diagnostics and therapeutics.

PubMed

Santosh, Baby; Yadava, Pramod K

2014-01-01

Aptamers are short sequences of nucleic acid (DNA or RNA) or peptide molecules which adopt a conformation and bind cognate ligands with high affinity and specificity in a manner akin to antibody-antigen interactions. It has been globally acknowledged that aptamers promise a plethora of diagnostic and therapeutic applications. Although use of nucleic acid aptamers as targeted therapeutics or mediators of targeted drug delivery is a relatively new avenue of research, one aptamer-based drug "Macugen" is FDA approved and a series of aptamer-based drugs are in clinical pipelines. The present review discusses the aspects of design, unique properties, applications, and development of different aptamers to aid in cancer diagnosis, prevention, and/or treatment under defined conditions.
The practical and pedagogical advantages of an ambigraphic nucleic acid notation.

PubMed

Rozak, David A

2006-01-01

The universally applied IUPAC notation for nucleic acids was adopted primarily to facilitate the mental association of G, A, T, C, and the related ambiguity characters with the bases they represent. However it is possible to create a notation that offers greater support for the basic manipulations and analyses to which genetic sequences frequently are subjected. By designing a nucleic acid notation around ambigrams, it is possible to simplify the frequently applied process of reverse complementation and aid the visualization of palindromes. The ambigraphic notation presented here also uses common orthographic features such as stems and loops to highlight guanine and cytosine rich regions, support the derivation of ambiguity characters, and aid educators in teaching the fundamentals of molecular genetics.
Methods and compositions for regulating gene expression in plant cells

NASA Technical Reports Server (NTRS)

Dai, Shunhong (Inventor); Beachy, Roger N. (Inventor); Luis, Maria Isabel Ordiz (Inventor)

2010-01-01

Novel chimeric plant promoter sequences are provided, together with plant gene expression cassettes comprising such sequences. In certain preferred embodiments, the chimeric plant promoters comprise the BoxII cis element and/or derivatives thereof. In addition, novel transcription factors are provided, together with nucleic acid sequences encoding such transcription factors and plant gene expression cassettes comprising such nucleic acid sequences. In certain preferred embodiments, the novel transcription factors comprise the acidic domain, or fragments thereof, of the RF2a transcription factor. Methods for using the chimeric plant promoter sequences and novel transcription factors in regulating the expression of at least one gene of interest are provided, together with transgenic plants comprising such chimeric plant promoter sequences and novel transcription factors.
The complete amino acid sequence of human skeletal-muscle fructose-bisphosphate aldolase.

PubMed Central

Freemont, P S; Dunbar, B; Fothergill-Gilmore, L A

1988-01-01

The complete amino acid sequence of human skeletal-muscle fructose-bisphosphate aldolase, comprising 363 residues, was determined. The sequence was deduced by automated sequencing of CNBr-cleavage, o-iodosobenzoic acid-cleavage, trypsin-digest and staphylococcal-proteinase-digest fragments. Comparison of the sequence with other class I aldolase sequences shows that the mammalian muscle isoenzyme is one of the most highly conserved enzymes known, with only about 2% of the residues changing per 100 million years. Non-mammalian aldolases appear to be evolving at the same rate as other glycolytic enzymes, with about 4% of the residues changing per 100 million years. Secondary-structure predictions are analysed in an accompanying paper [Sawyer, Fothergill-Gilmore & Freemont (1988) Biochem. J. 249, 789-793]. PMID:3355497
Isolation and characterization of a novel tannase from a metagenomic library.

PubMed

Yao, Jian; Fan, Xin Jiong; Lu, Yi; Liu, Yu Huan

2011-04-27

A novel gene (designated as tan410) encoding tannase was isolated from a cotton field metagenomic library by functional screening. Sequence analysis revealed that tan410 encoded a protein of 521 amino acids. SDS-PAGE and gel filtration chromatography analysis of purified tannase suggested that Tan410 was a monomeric enzyme with a molecular mass of 55 kDa. The optimum temperature and pH of Tan410 were 30 °C and 6.4. The activity was enhanced by addition of Ca(2+), Mg(2+) and Cd(2+). In addition, Tan410 was stable in the presence of 4 M NaCl. Chlorogenic acid, rosmarinic acid, ethyl ferulate, tannic acid, epicatechin gallate and epigallocathchin gallate were efficiently hydrolyzed by recombinant tannase. All of these excellent properties make Tan410 an interesting enzyme for biotechnological application.
Cloning and sequencing of the allophycocyanin genes from Spirulina maxima (Cyanophyta)

NASA Astrophysics Data System (ADS)

Qin, Song; Hiroyuki, Kojima; Yoshikazu, Kawata; Shin-Ichi, Yano; Zeng, Cheng-Kui

1998-03-01

The genes coding for the α-and β-subunit of allophycocyanin ( apcA and apcB) from the cyanophyte Spirulina maxima were cloned and sequenced. The results revealed 44.4% of nucleotide sequence similarity and 30.4% of similarity of deduced amino acid sequence between them. The amino acid sequence identities between S. maxima and S. platensis are 99.4% for α subunit and 100% for β subunit.
Immunoglobulin from Antarctic fish species of Rajidae family.

PubMed

Coscia, Maria Rosaria; Cocca, Ennio; Giacomelli, Stefano; Cuccaro, Fausta; Oreste, Umberto

2012-03-01

Immunoglobulins (Ig) of Chondroichthyes have been extensively studied in sharks; in contrast, in skates investigations on Ig remain scarce and fragmentary despite the high occurrence of skates in all of the major oceans of the world. To focus on Rajidae Igμ, the most abundant heavy chain isotype, we have chosen the Antarctic species Bathyraja eatonii, Bathyraja albomaculata, Bathyraja brachyurops, and Amblyraja georgiana which live at high latitudes in the Southern Ocean, and at very low temperatures. We prepared mRNA from the spleen of individuals of each species and performed RT-PCR experiments using two oligonucleotides designed on the alignment of various elasmobranch Igμ heavy chain sequences available in GenBank. The PCR products, about 1400-nt long, were cloned and sequenced. Nucleotide sequence identities calculated for the constant region domains ranged from 88.5% to 97.5% between species, and from 91.1% to 99.7% within species. In a distance tree, including also Raja erinacea sequences, two major branches were obtained, one containing Arhynchobatinae sequences, the other one Rajinae sequences. Four presumptive D gene segments were identified in the region of the VH/D/JH recombination; two different D segments were often found in the same sequence. Moreover, 5-15 genomic fragments of different lengths, carrying the gene locus encoding Igμ chain were revealed by Southern blotting analysis. B. eatonii amino acid sequences were analyzed for the positional diversity by Shannon entropy analysis, showing CH4 as the most conserved domain, and CH3 as the most variable one. B. eatonii CDR3 region length varied between 11 and 15 amino acid residues; the mean length (13.4 aa) was greater than that of Leucoraja eglanteria sequences (7.7 aa). An alignment of representative sequences of Antarctic species and R. erinacea showed that more cysteine residues not involved in the intradomain disulfide bridges were present in Antarctic species. Copyright Â© 2011 Elsevier B.V. All rights reserved.
Identification of genes from pattern formation, tyrosine kinase, and potassium channel families by DNA amplification

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kamb, A.; Weir, M.; Rudy, B.

1989-06-01

The study of gene family members has been aided by the isolation of related genes on the basis of DNA homology. The authors have adapted the polymerase chain reaction to screen animal genomes very rapidly and reliably for likely gene family members. Using conserved amino acid sequences to design degenerate oligonucleotide primers, they have shown that the genome of the nematode Caenorhabditis elegans contains sequences homologous to many Drosophila genes involved in pattern formation, including the segment polarity gene wingless (vertebrate int-1), and homeobox sequences characteristic of the Antennapedia, engrailed, and paired families. In addition, they have used this methodmore » to show that C. elegans contains at least five different sequences homologous to genes in the tyrosine kinase family. Lastly, they have isolated six potassium channel sequences from humans, a result that validates the utility of the method with large genomes and suggests that human potassium channel gene diversity may be extensive.« less
Finding similar nucleotide sequences using network BLAST searches.

PubMed

Ladunga, Istvan

2009-06-01

The Basic Local Alignment Search Tool (BLAST) is a keystone of bioinformatics due to its performance and user-friendliness. Beginner and intermediate users will learn how to design and submit blastn and Megablast searches on the Web pages at the National Center for Biotechnology Information. We map nucleic acid sequences to genomes, find identical or similar mRNA, expressed sequence tag, and noncoding RNA sequences, and run Megablast searches, which are much faster than blastn. Understanding results is assisted by taxonomy reports, genomic views, and multiple alignments. We interpret expected frequency thresholds, biological significance, and statistical significance. Weak hits provide no evidence, but hints for further analyses. We find genes that may code for homologous proteins by translated BLAST. We reduce false positives by filtering out low-complexity regions. Parsed BLAST results can be integrated into analysis pipelines. Links in the output connect to Entrez, PUBMED, structural, sequence, interaction, and expression databases. This facilitates integration with a wide spectrum of biological knowledge.
Genetic analysis of duck circovirus in Pekin ducks from South Korea.

PubMed

Cha, S-Y; Kang, M; Cho, J-G; Jang, H-K

2013-11-01

The genetic organization of the 24 duck circovirus (DuCV) strains detected in commercial Pekin ducks from South Korea between 2011 and 2012 is described in this study. Multiple sequence alignment and phylogenetic analyses were performed on the 24 viral genome sequences as well as on 45 genome sequences available from the GenBank database. Phylogenetic analyses based on the genomic and open reading frame 2/cap sequences demonstrated that all DuCV strains belonged to genotype 1 and were designated in a subcluster under genotype 1. Analysis of the capsid protein amino acid sequences of the 24 Korean DuCV strains showed 10 substitutions compared with that of other genotype 1 strains. Our analysis showed that genotype 1 is predominant and circulating in South Korea. These present results serve as incentive to add more data to the DuCV database and provide insight to conduct further intensive study on the geographic relationships among these virus strains.
Use of linalool synthase in genetic engineering of scent production

DOEpatents

Pichersky, E.

1998-12-15

A purified S-linalool synthase polypeptide from Clarkia breweri is disclosed as is the recombinant polypeptide and nucleic acid sequences encoding the polypeptide. Also disclosed are antibodies immunoreactive with the purified peptide and with recombinant versions of the polypeptide. Methods of using the nucleic acid sequences, as well as methods of enhancing the smell and the flavor of plants expressing the nucleic acid sequences are also disclosed. 5 figs.
Use of linalool synthase in genetic engineering of scent production

DOEpatents

Pichersky, Eran

1998-01-01

A purified S-linalool synthase polypeptide from Clarkia breweri is disclosed as is the recombinant polypeptide and nucleic acid sequences encoding the polypeptide. Also disclosed are antibodies immunoreactive with the purified peptide and with recombinant versions of the polypeptide. Methods of using the nucleic acid sequences, as well as methods of enhancing the smell and the flavor of plants expressing the nucleic acid sequences are also disclosed.
Probe kit for identifying a base in a nucleic acid

DOEpatents

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

2001-01-01

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Crotoxin: Structural Studies, Mechanism of Action and Cloning of its Gene

DTIC Science & Technology

1988-03-01

thirteen amino acids being acidic . Sequencing of the three peptides present in the acidic subunit, two of which are blocked by pyroglutamate ...the sequence determination of both the basic and acidic subunits of crotoxin- The acidic * subunit peptides were d!Tfficult, .sfi~n~e two of-ftflý...fluorescence spectroscopy. Results indicate a large conformational change occurs upon) ccmplex formation between the acidic and basic subunits of all four
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

Code of Federal Regulations, 2014 CFR

2014-07-01

... base or modified or unusual amino acid may be presented in a given sequence as the corresponding unmodified base or amino acid if the modified base or modified or unusual amino acid is one of those listed... the Feature section. Otherwise, each occurrence of a base or amino acid not appearing in WIPO Standard...
Characterization and Nucleotide Sequence of CARB-6, a New Carbenicillin-Hydrolyzing β-Lactamase from Vibrio cholerae

PubMed Central

Choury, Danièle; Aubert, Gérald; Szajnert, Marie-France; Azibi, Kemal; Delpech, Marc; Paul, Gérard

1999-01-01

A clinical strain of Vibrio cholerae non-O1 non-O139 isolated in France produced a new β-lactamase with a pI of 5.35. The purified enzyme, with a molecular mass of 33,000 Da, was characterized. Its kinetic constants show it to be a carbenicillin-hydrolyzing enzyme comparable to the five previously reported CARB β-lactamases and to SAR-1, another carbenicillin-hydrolyzing β-lactamase that has a pI of 4.9 and that is produced by a V. cholerae strain from Tanzania. This β-lactamase is designated CARB-6, and the gene for CARB-6 could not be transferred to Escherichia coli K-12 by conjugation. The nucleotide sequence of the structural gene was determined by direct sequencing of PCR-generated fragments from plasmid DNA with four pairs of primers covering the whole sequence of the reference CARB-3 gene. The gene encodes a 288-amino-acid protein that shares 94% homology with the CARB-1, CARB-2, and CARB-3 enzymes, 93% homology with the Proteus mirabilis N29 enzyme, and 86.5% homology with the CARB-4 enzyme. The sequence of CARB-6 differs from those of CARB-3, CARB-2, CARB-1, N29, and CARB-4 at 15, 16, 17, 19, and 37 amino acid positions, respectively. All these mutations are located in the C-terminal region of the sequence and at the surface of the molecule, according to the crystal structure of the Staphylococcus aureus PC-1 β-lactamase. PMID:9925522
Insights into the sequence parameters for halophilic adaptation.

PubMed

Nath, Abhigyan

2016-03-01

The sequence parameters for halophilic adaptation are still not fully understood. To understand the molecular basis of protein hypersaline adaptation, a detailed analysis is carried out, and investigated the likely association of protein sequence attributes to halophilic adaptation. A two-stage strategy is implemented, where in the first stage a supervised machine learning classifier is build, giving an overall accuracy of 86 % on stratified tenfold cross validation and 90 % on blind testing set, which are better than the previously reported results. The second stage consists of statistical analysis of sequence features and possible extraction of halophilic molecular signatures. The results of this study showed that, halophilic proteins are characterized by lower average charge, lower K content, and lower S content. A statistically significant preference/avoidance list of sequence parameters is also reported giving insights into the molecular basis of halophilic adaptation. D, Q, E, H, P, T, V are significantly preferred while N, C, I, K, M, F, S are significantly avoided. Among amino acid physicochemical groups, small, polar, charged, acidic and hydrophilic groups are preferred over other groups. The halophilic proteins also showed a preference for higher average flexibility, higher average polarity and avoidance for higher average positive charge, average bulkiness and average hydrophobicity. Some interesting trends observed in dipeptide counts are also reported. Further a systematic statistical comparison is undertaken for gaining insights into the sequence feature distribution in different residue structural states. The current analysis may facilitate the understanding of the mechanism of halophilic adaptation clearer, which can be further used for rational design of halophilic proteins.
Role of CadC and CadD in the 2,4-dichlorophenoxyacetic acid oxygenase system of Sphingomonas agrestis 58-1.

PubMed

Kijima, Kumiko; Mita, Hajime; Kawakami, Mitsuyasu; Amada, Kei

2018-02-02

In the present study, we confirm that 2,4-dichlorophenoxyacetic acid (2,4-D) oxygenase from Sphingomonas agrestis 58-1 belongs to the family of Rieske non-heme iron aromatic ring-hydroxylating oxygenases, which comprise a core enzyme (oxygenase), ferredoxin, and oxidoreductase. It has previously been shown that cadAB genes are necessary for the conversion of 2,4-D to 2,4-dichlorophenol; however, the respective roles of ferredoxin and oxidoreductase in the 2,4-D oxygenase system from S. agrestis 58-1 remain unknown. Using nucleotide sequence analysis of the plasmid pCADAB1 from Sphingomonas sp. ERG5, which degrades 4-chloro-2-methylphenoxyacetic acid and 2,4-D, Nielsen et al. identified orf95, upstream of cadA, and orf98, downstream of cadB, which were predicted and designated as cadD (oxidoreductase) and cadC (ferredoxin), respectively (Nielsen et al., PLoS One, 8, 1-9, 2013). These designations were the result of sequence analysis; therefore, we constructed an expression system of CadABC and CadABCD in Escherichia coli and assayed their enzyme activities. Our findings indicate that CadC is essential for the activity of 2,4-D oxygenase and CadD promotes CadABC activity in recombinant E. coli cells. Copyright © 2018 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Cooperativity and specificity of association of a designed transmembrane peptide.

PubMed Central

Gratkowski, Holly; Dai, Qing-Hong; Wand, A Joshua; DeGrado, William F; Lear, James D

2002-01-01

Thermodynamics studies aimed at quantitatively characterizing free energy effects of amino acid substitutions are not restricted to two state systems, but do require knowing the number of states involved in the equilibrium under consideration. Using analytical ultracentrifugation and NMR methods, we show here that a membrane-soluble peptide, MS1, designed by modifying the sequence of the water-soluble coiled-coil GCN4-P1, exhibits a reversible monomer-dimer-trimer association in detergent micelles with a greater degree of cooperativity in C14-betaine than in dodecyl phosphocholine detergents. PMID:12202385

RNAblueprint: flexible multiple target nucleic acid sequence design.

PubMed

Hammer, Stefan; Tschiatschek, Birgit; Flamm, Christoph; Hofacker, Ivo L; Findeiß, Sven

2017-09-15

Realizing the value of synthetic biology in biotechnology and medicine requires the design of molecules with specialized functions. Due to its close structure to function relationship, and the availability of good structure prediction methods and energy models, RNA is perfectly suited to be synthetically engineered with predefined properties. However, currently available RNA design tools cannot be easily adapted to accommodate new design specifications. Furthermore, complicated sampling and optimization methods are often developed to suit a specific RNA design goal, adding to their inflexibility. We developed a C ++ library implementing a graph coloring approach to stochastically sample sequences compatible with structural and sequence constraints from the typically very large solution space. The approach allows to specify and explore the solution space in a well defined way. Our library also guarantees uniform sampling, which makes optimization runs performant by not only avoiding re-evaluation of already found solutions, but also by raising the probability of finding better solutions for long optimization runs. We show that our software can be combined with any other software package to allow diverse RNA design applications. Scripting interfaces allow the easy adaption of existing code to accommodate new scenarios, making the whole design process very flexible. We implemented example design approaches written in Python to demonstrate these advantages. RNAblueprint , Python implementations and benchmark datasets are available at github: https://github.com/ViennaRNA . s.hammer@univie.ac.at, ivo@tbi.univie.ac.at or sven@tbi.univie.ac.at. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
Mouse Vk gene classification by nucleic acid sequence similarity.

PubMed

Strohal, R; Helmberg, A; Kroemer, G; Kofler, R

1989-01-01

Analyses of immunoglobulin (Ig) variable (V) region gene usage in the immune response, estimates of V gene germline complexity, and other nucleic acid hybridization-based studies depend on the extent to which such genes are related (i.e., sequence similarity) and their organization in gene families. While mouse Igh heavy chain V region (VH) gene families are relatively well-established, a corresponding systematic classification of Igk light chain V region (Vk) genes has not been reported. The present analysis, in the course of which we reviewed the known extent of the Vk germline gene repertoire and Vk gene usage in a variety of responses to foreign and self antigens, provides a classification of mouse Vk genes in gene families composed of members with greater than 80% overall nucleic acid sequence similarity. This classification differed in several aspects from that of VH genes: only some Vk gene families were as clearly separated (by greater than 25% sequence dissimilarity) as typical VH gene families; most Vk gene families were closely related and, in several instances, members from different families were very similar (greater than 80%) over large sequence portions; frequently, classification by nucleic acid sequence similarity diverged from existing classifications based on amino-terminal protein sequence similarity. Our data have implications for Vk gene analyses by nucleic acid hybridization and describe potentially important differences in sequence organization between VH and Vk genes.
Method of identity analyte-binding peptides

DOEpatents

Kauvar, L.M.

1990-10-16

A method for affinity chromatography or adsorption of a designated analyte utilizes a paralog as the affinity partner. The immobilized paralog can be used in purification or analysis of the analyte; the paralog can also be used as a substitute for antibody in an immunoassay. The paralog is identified by screening candidate peptide sequences of 4--20 amino acids for specific affinity to the analyte. 5 figs.
Adaptive molecular evolution of the two-pore channel 1 gene TPC1 in the karst-adapted genus Primulina (Gesneriaceae)

PubMed Central

Tao, Junjie; Feng, Chao; Ai, Bin; Kang, Ming

2016-01-01

Background and Aims Limestone karst areas possess high floral diversity and endemism. The genus Primulina, which contributes to the unique calcicole flora, has high species richness and exhibit specific soil-based habitat associations that are mainly distributed on calcareous karst soils. The adaptive molecular evolutionary mechanism of the genus to karst calcium-rich environments is still not well understood. The Ca2+-permeable channel TPC1 was used in this study to test whether its gene is involved in the local adaptation of Primulina to karst high-calcium soil environments. Methods Specific amplification and sequencing primers were designed and used to amplify the full-length coding sequences of TPC1 from cDNA of 76 Primulina species. The sequence alignment without recombination and the corresponding reconstructed phylogeny tree were used in molecular evolutionary analyses at the nucleic acid level and amino acid level, respectively. Finally, the identified sites under positive selection were labelled on the predicted secondary structure of TPC1. Key Results Seventy-six full-length coding sequences of Primulina TPC1 were obtained. The length of the sequences varied between 2220 and 2286 bp and the insertion/deletion was located at the 5′ end of the sequences. No signal of substitution saturation was detected in the sequences, while significant recombination breakpoints were detected. The molecular evolutionary analyses showed that TPC1 was dominated by purifying selection and the selective pressures were not significantly different among species lineages. However, significant signals of positive selection were detected at both TPC1 codon level and amino acid level, and five sites under positive selective pressure were identified by at least three different methods. Conclusions The Ca2+-permeable channel TPC1 may be involved in the local adaptation of Primulina to karst Ca2+-rich environments. Different species lineages suffered similar selective pressure associated with calcium in karst environments, and episodic diversifying selection at a few sites may play a major role in the molecular evolution of Primulina TPC1. PMID:27582362
``Sequence space soup'' of proteins and copolymers

NASA Astrophysics Data System (ADS)

Chan, Hue Sun; Dill, Ken A.

1991-09-01

To study the protein folding problem, we use exhaustive computer enumeration to explore ``sequence space soup,'' an imaginary solution containing the ``native'' conformations (i.e., of lowest free energy) under folding conditions, of every possible copolymer sequence. The model is of short self-avoiding chains of hydrophobic (H) and polar (P) monomers configured on the two-dimensional square lattice. By exhaustive enumeration, we identify all native structures for every possible sequence. We find that random sequences of H/P copolymers will bear striking resemblance to known proteins: Most sequences under folding conditions will be approximately as compact as known proteins, will have considerable amounts of secondary structure, and it is most probable that an arbitrary sequence will fold to a number of lowest free energy conformations that is of order one. In these respects, this simple model shows that proteinlike behavior should arise simply in copolymers in which one monomer type is highly solvent averse. It suggests that the structures and uniquenesses of native proteins are not consequences of having 20 different monomer types, or of unique properties of amino acid monomers with regard to special packing or interactions, and thus that simple copolymers might be designable to collapse to proteinlike structures and properties. A good strategy for designing a sequence to have a minimum possible number of native states is to strategically insert many P monomers. Thus known proteins may be marginally stable due to a balance: More H residues stabilize the desired native state, but more P residues prevent simultaneous stabilization of undesired native states.
Complementary DNA cloning and molecular evolution of opine dehydrogenases in some marine invertebrates.

PubMed

Kimura, Tomohiro; Nakano, Toshiki; Yamaguchi, Toshiyasu; Sato, Minoru; Ogawa, Tomohisa; Muramoto, Koji; Yokoyama, Takehiko; Kan-No, Nobuhiro; Nagahisa, Eizou; Janssen, Frank; Grieshaber, Manfred K

2004-01-01

The complete complementary DNA sequences of genes presumably coding for opine dehydrogenases from Arabella iricolor (sandworm), Haliotis discus hannai (abalone), and Patinopecten yessoensis (scallop) were determined, and partial cDNA sequences were derived for Meretrix lusoria (Japanese hard clam) and Spisula sachalinensis (Sakhalin surf clam). The primers ODH-9F and ODH-11R proved useful for amplifying the sequences for opine dehydrogenases from the 4 mollusk species investigated in this study. The sequence of the sandworm was obtained using primers constructed from the amino acid sequence of tauropine dehydrogenase, the main opine dehydrogenase in A. iricolor. The complete cDNA sequence of A. iricolor, H. discus hannai, and P. yessoensis encode 397, 400, and 405 amino acids, respectively. All sequences were aligned and compared with published databank sequences of Loligo opalescens, Loligo vulgaris (squid), Sepia officinalis (cuttlefish), and Pecten maximus (scallop). As expected, a high level of homology was observed for the cDNA from closely related species, such as for cephalopods or scallops, whereas cDNA from the other species showed lower-level homologies. A similar trend was observed when the deduced amino acid sequences were compared. Furthermore, alignment of these sequences revealed some structural motifs that are possibly related to the binding sites of the substrates. The phylogenetic trees derived from the nucleotide and amino acid sequences were consistent with the classification of species resulting from classical taxonomic analyses.
C9/12 Ribbon-Like Structures in Hybrid Peptides Alternating α- and Thiazole-Based γ-Amino Acids.

PubMed

Bonnel, Clément; Legrand, Baptiste; Simon, Matthieu; Martinez, Jean; Bantignies, Jean-Louis; Kang, Young Kee; Wenger, Emmanuel; Hoh, Francois; Masurier, Nicolas; Maillard, Ludovic T

2017-12-11

According to their restricted conformational freedom, heterocyclic γ-amino acids are usually considered to be related to Z-vinylogous γ-amino acids. In this context, oligomers alternating α-amino acids and thiazole-based γ-amino acids (ATCs) were expected to fold into a canonical 12-helical shape as described for α/γ-hybrid peptides composed of cis-α/β-unsaturated γ-amino acids. However, through a combination of X-ray crystallography, NMR spectroscopy, FTIR experiments, and DFT calculations, it was determined that the folding behavior of ATC-containing hybrid peptides is much more complex. The homochiral α/(S)-ATC sequences were unable to adopt a stable conformation, whereas the heterochiral α/(R)-ATC peptides displayed novel ribbon structures stabilized by unusual C 9/12 -bifurcated hydrogen bonds. These ribbon structures could be considered as a succession of pre-organized γ/α dipeptides and may provide the basis for designing original α-helix mimics. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Characterisation and In Silico Analysis of Interleukin-4 cDNA of Nilgai (Boselaphus tragocamelus) and Indian Buffalo (Bubalus bubalis)

PubMed Central

Saini, M.; Palai, T. K.; Das, D. K.; Hatle, K. M.; Gupta, P. K.

2013-01-01

Interleukin-4 (IL-4) produced from Th2 cells modulates both innate and adaptive immune responses. It is a common belief that wild animals possess better immunity against diseases than domestic and laboratory animals; however, the immune system of wild animals is not fully explored yet. Therefore, a comparative study was designed to explore the wildlife immunity through characterisation of IL-4 cDNA of nilgai, a wild ruminant, and Indian buffalo, a domestic ruminant. Total RNA was extracted from peripheral blood mononuclear cells of nilgai and Indian buffalo and reverse transcribed into cDNA. Respective cDNA was further cloned and sequenced. Sequences were analysed in silico and compared with their homologues available at GenBank. The deduced 135 amino acid protein of nilgai IL-4 is 95.6% similar to that of Indian buffalo. N-linked glycosylation sequence, leader sequence, Cysteine residues in the signal peptide region, and 3′ UTR of IL-4 were found to be conserved across species. Six nonsynonymous nucleotide substitutions were found in Indian buffalo compared to nilgai amino acid sequence. Tertiary structure of this protein in both species was modeled, and it was found that this protein falls under 4-helical cytokines superfamily and short chain cytokine family. Phylogenetic analysis revealed a single cluster of ruminants including both nilgai and Indian buffalo that was placed distinct from other nonruminant mammals. PMID:24348167
Methods for making nucleotide probes for sequencing and synthesis

DOEpatents

Church, George M; Zhang, Kun; Chou, Joseph

2014-07-08

Compositions and methods for making a plurality of probes for analyzing a plurality of nucleic acid samples are provided. Compositions and methods for analyzing a plurality of nucleic acid samples to obtain sequence information in each nucleic acid sample are also provided.
Sequence Polishing Library (SPL) v10.0

DOE Office of Scientific and Technical Information (OSTI.GOV)

Oberortner, Ernst

The Sequence Polishing Library (SPL) is a suite of software tools in order to automate "Design for Synthesis and Assembly" workflows. Specifically: The SPL "Converter" tool converts files among the following sequence data exchange formats: CSV, FASTA, GenBank, and Synthetic Biology Open Language (SBOL); The SPL "Juggler" tool optimizes the codon usages of DNA coding sequences according to an optimization strategy, a user-specific codon usage table and genetic code. In addition, the SPL "Juggler" can translate amino acid sequences into DNA sequences.:The SPL "Polisher" verifies NA sequences against DNA synthesis constraints, such as GC content, repeating k-mers, and restriction sites.more » In case of violations, the "Polisher" reports the violations in a comprehensive manner. The "Polisher" tool can also modify the violating regions according to an optimization strategy, a user-specific codon usage table and genetic code;The SPL "Partitioner" decomposes large DNA sequences into smaller building blocks with partial overlaps that enable an efficient assembly. The "Partitioner" enables the user to configure the characteristics of the overlaps, which are mostly determined by the utilized assembly protocol, such as length, GC content, or melting temperature.« less
Soil amino acid composition across a boreal forest successional sequence

Treesearch

Nancy R. Werdin-Pfisterer; Knut Kielland; Richard D. Boone

2009-01-01

Soil amino acids are important sources of organic nitrogen for plant nutrition, yet few studies have examined which amino acids are most prevalent in the soil. In this study, we examined the composition, concentration, and seasonal patterns of soil amino acids across a primary successional sequence encompassing a natural gradient of plant productivity and soil...
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

Code of Federal Regulations, 2014 CFR

2014-07-01

...” means those amino acids other than “Xaa” and those nucleotide bases other than “n”defined in accordance... 37 Patents, Trademarks, and Copyrights 1 2014-07-01 2014-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences...
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

Code of Federal Regulations, 2013 CFR

2013-07-01

...” means those amino acids other than “Xaa” and those nucleotide bases other than “n”defined in accordance... 37 Patents, Trademarks, and Copyrights 1 2013-07-01 2013-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences...
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

Code of Federal Regulations, 2012 CFR

2012-07-01

...” means those amino acids other than “Xaa” and those nucleotide bases other than “n”defined in accordance... 37 Patents, Trademarks, and Copyrights 1 2012-07-01 2012-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences...
Amino-terminal sequence of glycoprotein D of herpes simplex virus types 1 and 2

DOE Office of Scientific and Technical Information (OSTI.GOV)

Eisenberg, R.J.; Long, D.; Hogue-Angeletti, R.

1984-01-01

Glycoprotein D (gD) of herpes simplex virus is a structural component of the virion envelope which stimulates production of high titers of herpes simplex virus type-common neutralizing antibody. The authors caried out automated N-terminal amino acid sequencing studies on radiolabeled preparations of gD-1 (gD of herpes simplex virus type 1) and gD-2 (gD of herpes simplex virus type 2). Although some differences were noted, particularly in the methionine and alanine profiles for gD-1 and gD-2, the amino acid sequence of a number of the first 30 residues of the amino terminus of gD-1 and gD-2 appears to be quite similar.more » For both proteins, the first residue is a lysine. When we compared out sequence data for gD-1 with those predicted by nucleic acid sequencing, the two sequences could be aligned (with one exception) starting at residue 26 (lysine) of the predicted sequence. Thus, the first 25 amino acids of the predicted sequence are absent from the polypeptides isolated from infected cells.« less
Cloning and sequencing of a gene encoding a novel extracellular neutral proteinase from Streptomyces sp. strain C5 and expression of the gene in Streptomyces lividans 1326.

PubMed Central

Lampel, J S; Aphale, J S; Lampel, K A; Strohl, W R

1992-01-01

The gene encoding a novel milk protein-hydrolyzing proteinase was cloned on a 6.56-kb SstI fragment from Streptomyces sp. strain C5 genomic DNA into Streptomyces lividans 1326 by using the plasmid vector pIJ702. The gene encoding the small neutral proteinase (snpA) was located within a 2.6-kb BamHI-SstI restriction fragment that was partially sequenced. The molecular mass of the deduced amino acid sequence of the mature protein was determined to be 15,740, which corresponds very closely with the relative molecular mass of the purified protein (15,500) determined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The N-terminal amino acid sequence of the purified neutral proteinase was determined, and the DNA encoding this sequence was found to be located within the sequenced DNA. The deduced amino acid sequence contains a conserved zinc binding site, although secondary ligand binding and active sites typical of thermolysinlike metalloproteinases are absent. The combination of its small size, deduced amino acid sequence, and substrate and inhibition profile indicate that snpA encodes a novel neutral proteinase. Images PMID:1569011
Molecular cloning and characterization of a cDNA encoding the gibberellin biosynthetic enzyme ent-kaurene synthase B from pumpkin (Cucurbita maxima L.).

PubMed

Yamaguchi, S; Saito, T; Abe, H; Yamane, H; Murofushi, N; Kamiya, Y

1996-08-01

The first committed step in the formation of diterpenoids leading to gibberellin (GA) biosynthesis is the conversion of geranylgeranyl diphosphate (GGDP) to ent-kaurene. ent-Kaurene synthase A (KSA) catalyzes the conversion of GGDP to copalyl diphosphate (CDP), which is subsequently converted to ent-kaurene by ent-kaurene synthase B (KSB). A full-length KSB cDNA was isolated from developing cotyledons in immature seeds of pumpkin (Cucurbita maxima L.). Degenerate oligonucleotide primers were designed from the amino acid sequences obtained from the purified protein to amplify a cDNA fragment, which was used for library screening. The isolated full-length cDNA was expressed in Escherichia coli as a fusion protein, which demonstrated the KSB activity to cyclize [3H]CDP to [3H]ent-kaurene. The KSB transcript was most abundant in growing tissues, but was detected in every organ in pumpkin seedlings. The deduced amino acid sequence shares significant homology with other terpene cyclases, including the conserved DDXXD motif, a putative divalent metal ion-diphosphate complex binding site. A putative transit peptide sequence that may target the translated product into the plastids is present in the N-terminal region.
Cytotoxic T lymphocytes and CD4 epitope mutations in the pre-core/core region of hepatitis B virus in chronic hepatitis B carriers in Northeast Iran.

PubMed

Zhand, Sareh; Tabarraei, Alijan; Nazari, Amineh; Moradi, Abdolvahab

2017-07-01

Hepatitis B virus (HBV) is vulnerable to many various mutations. Those within epitopes recognized by sensitized T cells may influence the re-emergence of the virus. This study was designed to investigate the mutation in immune epitope regions of HBV pre-core/core among chronic HBV patients of Golestan province, Northeast Iran. In 120 chronic HBV carriers, HBV DNA was extracted from blood plasma samples and PCR was done using specific primers. Direct sequencing and alignment of the pre-core/core region were applied using reference sequence from Gene Bank database (Accession Number AB033559). The study showed 27 inferred amino acid substitutions, 9 of which (33.3%) were in CD4 and 2 (7.4%) in cytotoxic T lymphocytes' (CTL) epitopes and 16 other mutations (59.2%) were observed in other regions. CTL escape mutations were not commonly observed in pre-core/core sequences of chronic HBV carriers in the locale of study. It can be concluded that most of the inferred amino acid substitutions occur in different immune epitopes other than CTL and CD4.
Characterization, Genome Sequence, and Analysis of Escherichia Phage CICC 80001, a Bacteriophage Infecting an Efficient L-Aspartic Acid Producing Escherichia coli.

PubMed

Xu, Youqiang; Ma, Yuyue; Yao, Su; Jiang, Zengyan; Pei, Jiangsen; Cheng, Chi

2016-03-01

Escherichia phage CICC 80001 was isolated from the bacteriophage contaminated medium of an Escherichia coli strain HY-05C (CICC 11022S) which could produce L-aspartic acid. The phage had a head diameter of 45-50 nm and a tail of about 10 nm. The one-step growth curve showed a latent period of 10 min and a rise period of about 20 min. The average burst size was about 198 phage particles per infected cell. Tests were conducted on the plaques, multiplicity of infection, and host range. The genome of CICC 80001 was sequenced with a length of 38,810 bp, and annotated. The key proteins leading to host-cell lysis were phylogenetically analyzed. One protein belonged to class II holin, and the other two belonged to the endopeptidase family and N-acetylmuramoyl-L-alanine amidase family, respectively. The genome showed the sequence identity of 82.7% with that of Enterobacteria phage T7, and carried ten unique open reading frames. The bacteriophage resistant E. coli strain designated CICC 11021S was breeding and its L-aspartase activity was 84.4% of that of CICC 11022S.
Chirality- and sequence-selective successive self-sorting via specific homo- and complementary-duplex formations

PubMed Central

Makiguchi, Wataru; Tanabe, Junki; Yamada, Hidekazu; Iida, Hiroki; Taura, Daisuke; Ousaka, Naoki; Yashima, Eiji

2015-01-01

Self-recognition and self-discrimination within complex mixtures are of fundamental importance in biological systems, which entirely rely on the preprogrammed monomer sequences and homochirality of biological macromolecules. Here we report artificial chirality- and sequence-selective successive self-sorting of chiral dimeric strands bearing carboxylic acid or amidine groups joined by chiral amide linkers with different sequences through homo- and complementary-duplex formations. A mixture of carboxylic acid dimers linked by racemic-1,2-cyclohexane bis-amides with different amide sequences (NHCO or CONH) self-associate to form homoduplexes in a completely sequence-selective way, the structures of which are different from each other depending on the linker amide sequences. The further addition of an enantiopure amide-linked amidine dimer to a mixture of the racemic carboxylic acid dimers resulted in the formation of a single optically pure complementary duplex with a 100% diastereoselectivity and complete sequence specificity stabilized by the amidinium–carboxylate salt bridges, leading to the perfect chirality- and sequence-selective duplex formation. PMID:26051291

Some links on this page may take you to non-federal websites. Their policies may differ from this site.