abrf edman sequencing: Topics by Science.gov

Sample records for abrf edman sequencing

N-Terminal Amino Acid Sequence Determination of Proteins by N-Terminal Dimethyl Labeling: Pitfalls and Advantages When Compared with Edman Degradation Sequence Analysis.

PubMed

Chang, Elizabeth; Pourmal, Sergei; Zhou, Chun; Kumar, Rupesh; Teplova, Marianna; Pavletich, Nikola P; Marians, Kenneth J; Erdjument-Bromage, Hediye

2016-07-01

In recent history, alternative approaches to Edman sequencing have been investigated, and to this end, the Association of Biomolecular Resource Facilities (ABRF) Protein Sequencing Research Group (PSRG) initiated studies in 2014 and 2015, looking into bottom-up and top-down N-terminal (Nt) dimethyl derivatization of standard quantities of intact proteins with the aim to determine Nt sequence information. We have expanded this initiative and used low picomole amounts of myoglobin to determine the efficiency of Nt-dimethylation. Application of this approach on protein domains, generated by limited proteolysis of overexpressed proteins, confirms that it is a universal labeling technique and is very sensitive when compared with Edman sequencing. Finally, we compared Edman sequencing and Nt-dimethylation of the same polypeptide fragments; results confirm that there is agreement in the identity of the Nt amino acid sequence between these 2 methods.
Evaluation of material properties and compression characteristics of Assam Bora rice flours as a directly compressible vehicle in tablet formulation.

PubMed

Ahmad, Mohammad Zaki; Akhter, Sohail; Dhiman, Ishita; Sharma, Poonam; Verma, Reena

2013-02-01

The mechanical properties and compaction characteristics of different varieties of Assam Bora rice flours (ABRFs) were evaluated and compared with those of official Starch 1500®. The material properties and compression characteristics of Assam Bora rice flours were studied by Heckel and Kawakita analysis. The influences of physical and geometrical properties of ABRFs were evaluated with regard to their compression properties. The mechanical properties, such as toughness and Young's modulus of ABRFs were also compared with that of Starch 1500®. The novel ABRFs reflect better physical characteristics such as higher bulk and tap densities, less porosity, better powder packing ability, large surface area, and improved flowability. ABRFs were the least sensitive material to magnesium stearate, and blending time did not affect its compactibility. Their onset of plastic deformation and strain rate sensitivity as compared to that of Starch 1500® demonstrate its potential use as a directly compressible vehicle for tablet. The experimental ABRFs showed superior properties to official Starch 1500® in many cases and could serve as suitable alternatives for particular purposes.
A proteomic analysis of leaf sheaths from rice.

PubMed

Shen, Shihua; Matsubae, Masami; Takao, Toshifumi; Tanaka, Naoki; Komatsu, Setsuko

2002-10-01

The proteins extracted from the leaf sheaths of rice seedlings were separated by 2-D PAGE, and analyzed by Edman sequencing and mass spectrometry, followed by database searching. Image analysis revealed 352 protein spots on 2-D PAGE after staining with Coomassie Brilliant Blue. The amino acid sequences of 44 of 84 proteins were determined; for 31 of these proteins, a clear function could be assigned, whereas for 12 proteins, no function could be assigned. Forty proteins did not yield amino acid sequence information, because they were N-terminally blocked, or the obtained sequences were too short and/or did not give unambiguous results. Fifty-nine proteins were analyzed by mass spectrometry; all of these proteins were identified by matching to the protein database. The amino acid sequences of 19 of 27 proteins analyzed by mass spectrometry were similar to the results of Edman sequencing. These results suggest that 2-D PAGE combined with Edman sequencing and mass spectrometry analysis can be effectively used to identify plant proteins.
THE ABRF MARG MICROARRAY SURVEY 2005: TAKING THE PULSE ON THE MICROARRAY FIELD

EPA Science Inventory

Over the past several years microarray technology has evolved into a critical component of any discovery based program. Since 1999, the Association of Biomolecular Resource Facilities (ABRF) Microarray Research Group (MARG) has conducted biennial surveys designed to generate a pr...
THE ABRF-MARG MICROARRAY SURVEY 2004: TAKING THE PULSE OF THE MICROARRAY FIELD

EPA Science Inventory

Over the past several years, the field of microarrays has grown and evolved drastically. In its continued efforts to track this evolution, the ABRF-MARG has once again conducted a survey of international microarray facilities and individual microarray users. The goal of the surve...
P2P proteomics -- data sharing for enhanced protein identification

PubMed Central

2012-01-01

Background In order to tackle the important and challenging problem in proteomics of identifying known and new protein sequences using high-throughput methods, we propose a data-sharing platform that uses fully distributed P2P technologies to share specifications of peer-interaction protocols and service components. By using such a platform, information to be searched is no longer centralised in a few repositories but gathered from experiments in peer proteomics laboratories, which can subsequently be searched by fellow researchers. Methods The system distributively runs a data-sharing protocol specified in the Lightweight Communication Calculus underlying the system through which researchers interact via message passing. For this, researchers interact with the system through particular components that link to database querying systems based on BLAST and/or OMSSA and GUI-based visualisation environments. We have tested the proposed platform with data drawn from preexisting MS/MS data reservoirs from the 2006 ABRF (Association of Biomolecular Resource Facilities) test sample, which was extensively tested during the ABRF Proteomics Standards Research Group 2006 worldwide survey. In particular we have taken the data available from a subset of proteomics laboratories of Spain's National Institute for Proteomics, ProteoRed, a network for the coordination, integration and development of the Spanish proteomics facilities. Results and Discussion We performed queries against nine databases including seven ProteoRed proteomics laboratories, the NCBI Swiss-Prot database and the local database of the CSIC/UAB Proteomics Laboratory. A detailed analysis of the results indicated the presence of a protein that was supported by other NCBI matches and highly scored matches in several proteomics labs. The analysis clearly indicated that the protein was a relatively high concentrated contaminant that could be present in the ABRF sample. This fact is evident from the information that could be derived from the proposed P2P proteomics system, however it is not straightforward to arrive to the same conclusion by conventional means as it is difficult to discard organic contamination of samples. The actual presence of this contaminant was only stated after the ABRF study of all the identifications reported by the laboratories. PMID:22293032
Unraveling the sequence and structure of the protein osteocalcin from a 42 ka fossil horse

NASA Astrophysics Data System (ADS)

Ostrom, Peggy H.; Gandhi, Hasand; Strahler, John R.; Walker, Angela K.; Andrews, Philip C.; Leykam, Joseph; Stafford, Thomas W.; Kelly, Robert L.; Walker, Danny N.; Buckley, Mike; Humpula, James

2006-04-01

We report the first complete amino acid sequence and evidence of secondary structure for osteocalcin from a temperate fossil. The osteocalcin derives from a 42 ka equid bone excavated from Juniper Cave, Wyoming. Results were determined by matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-MS) and Edman sequencing with independent confirmation of the sequence in two laboratories. The ancient sequence was compared to that of three modern taxa: horse ( Equus caballus), zebra ( Equus grevyi), and donkey ( Equus asinus). Although there was no difference in sequence among modern taxa, MALDI-MS and Edman sequencing show that residues 48 and 49 of our modern horse are Thr, Ala rather than Pro, Val as previously reported (Carstanjen B., Wattiez, R., Armory, H., Lepage, O.M., Remy, B., 2002. Isolation and characterization of equine osteocalcin. Ann. Med. Vet.146(1), 31-38). MALDI-MS and Edman sequencing data indicate that the osteocalcin sequence of the 42 ka fossil is similar to that of modern horse. Previously inaccessible structural attributes for ancient osteocalcin were observed. Glu 39 rather than Gln 39 is consistent with deamidation, a process known to occur during fossilization and aging. Two post-translational modifications were documented: Hyp 9 and a disulfide bridge. The latter suggests at least partial retention of secondary structure. As has been done for ancient DNA research, we recommend standards for preparation and criteria for authenticating results of ancient protein sequencing.
Venom characterization of the Amazonian scorpion Tityus metuendus.

PubMed

Batista, C V F; Martins, J G; Restano-Cassulini, R; Coronas, F I V; Zamudio, F Z; Procópio, R; Possani, L D

2018-03-01

The soluble venom from the scorpion Tityus metuendus was characterized by various methods. In vivo experiments with mice showed that it is lethal. Extended electrophysiological recordings using seven sub-types of human voltage gated sodium channels (hNav1.1 to 1.7) showed that it contains both α- and β-scorpion toxin types. Fingerprint analysis by mass spectrometry identified over 200 distinct molecular mass components. At least 60 sub-fractions were recovered from HPLC separation. Five purified peptides were sequenced by Edman degradation, and their complete primary structures were determined. Additionally, three other peptides have had their N-terminal amino acid sequences determined by Edman degradation and reported. Mass spectrometry analysis of tryptic digestion of the soluble venom permitted the identification of the amino acid sequence of 111 different peptides. Search for similarities of the sequences found indicated that they probably are: sodium and potassium channel toxins, metalloproteinases, hyaluronidases, endothelin and angiotensin-converting enzymes, bradykinin-potentiating peptide, hypothetical proteins, allergens, other enzymes, other proteins and peptides. Copyright © 2018 Elsevier Ltd. All rights reserved.
Interlaboratory Study on Differential Analysis of Protein Glycosylation by Mass Spectrometry: The ABRF Glycoprotein Research Multi-Institutional Study 2012*

PubMed Central

Leymarie, Nancy; Griffin, Paula J.; Jonscher, Karen; Kolarich, Daniel; Orlando, Ron; McComb, Mark; Zaia, Joseph; Aguilan, Jennifer; Alley, William R.; Altmann, Friederich; Ball, Lauren E.; Basumallick, Lipika; Bazemore-Walker, Carthene R.; Behnken, Henning; Blank, Michael A.; Brown, Kristy J.; Bunz, Svenja-Catharina; Cairo, Christopher W.; Cipollo, John F.; Daneshfar, Rambod; Desaire, Heather; Drake, Richard R.; Go, Eden P.; Goldman, Radoslav; Gruber, Clemens; Halim, Adnan; Hathout, Yetrib; Hensbergen, Paul J.; Horn, David M.; Hurum, Deanna; Jabs, Wolfgang; Larson, Göran; Ly, Mellisa; Mann, Benjamin F.; Marx, Kristina; Mechref, Yehia; Meyer, Bernd; Möginger, Uwe; Neusüβ, Christian; Nilsson, Jonas; Novotny, Milos V.; Nyalwidhe, Julius O.; Packer, Nicolle H.; Pompach, Petr; Reiz, Bela; Resemann, Anja; Rohrer, Jeffrey S.; Ruthenbeck, Alexandra; Sanda, Miloslav; Schulz, Jan Mirco; Schweiger-Hufnagel, Ulrike; Sihlbom, Carina; Song, Ehwang; Staples, Gregory O.; Suckau, Detlev; Tang, Haixu; Thaysen-Andersen, Morten; Viner, Rosa I.; An, Yanming; Valmu, Leena; Wada, Yoshinao; Watson, Megan; Windwarder, Markus; Whittal, Randy; Wuhrer, Manfred; Zhu, Yiying; Zou, Chunxia

2013-01-01

One of the principal goals of glycoprotein research is to correlate glycan structure and function. Such correlation is necessary in order for one to understand the mechanisms whereby glycoprotein structure elaborates the functions of myriad proteins. The accurate comparison of glycoforms and quantification of glycosites are essential steps in this direction. Mass spectrometry has emerged as a powerful analytical technique in the field of glycoprotein characterization. Its sensitivity, high dynamic range, and mass accuracy provide both quantitative and sequence/structural information. As part of the 2012 ABRF Glycoprotein Research Group study, we explored the use of mass spectrometry and ancillary methodologies to characterize the glycoforms of two sources of human prostate specific antigen (PSA). PSA is used as a tumor marker for prostate cancer, with increasing blood levels used to distinguish between normal and cancer states. The glycans on PSA are believed to be biantennary N-linked, and it has been observed that prostate cancer tissues and cell lines contain more antennae than their benign counterparts. Thus, the ability to quantify differences in glycosylation associated with cancer has the potential to positively impact the use of PSA as a biomarker. We studied standard peptide-based proteomics/glycomics methodologies, including LC-MS/MS for peptide/glycopeptide sequencing and label-free approaches for differential quantification. We performed an interlaboratory study to determine the ability of different laboratories to correctly characterize the differences between glycoforms from two different sources using mass spectrometry methods. We used clustering analysis and ancillary statistical data treatment on the data sets submitted by participating laboratories to obtain a consensus of the glycoforms and abundances. The results demonstrate the relative strengths and weaknesses of top-down glycoproteomics, bottom-up glycoproteomics, and glycomics methods. PMID:23764502
Comparative mRNA and MicroRNA Profiling during Acute Myocardial Infarction Induced by Coronary Occlusion and Ablation Radio-Frequency Currents

PubMed Central

Santana, Eduardo T.; Feliciano, Regiane dos Santos; Serra, Andrey J.; Brigidio, Eduardo; Antonio, Ednei L.; Tucci, Paulo J. F.; Nathanson, Lubov; Morris, Mariana; Silva, José A.

2016-01-01

The ligation of the left anterior descending coronary artery is the most commonly used experimental model to induce myocardial infarction (MI) in rodents. A high mortality in the acute phase and the heterogeneity of the size of the MI obtained are drawbacks recognized in this model. In an attempt to solve the problem, our group recently developed a new MI experimental model which is based on application of myocardial ablation radio-frequency currents (AB-RF) that yielded MI with homogeneous sizes and significantly reduce acute mortality. In addition, cardiac structural, and functional changes aroused by AB-RF were similar to those seen in animals with MI induced by coronary artery ligation. Herein, we compared mRNA expression of genes that govern post-MI milieu in occlusion and ablation models. We analyzed 48 mRNAs expressions of nine different signal transduction pathways (cell survival and metabolism signs, matrix extracellular, cell cycle, oxidative stress, apoptosis, calcium signaling, hypertrophy markers, angiogenesis, and inflammation) in rat left ventricle 1 week after MI generated by both coronary occlusion and AB-RF. Furthermore, high-throughput miRNA analysis was also assessed in both MI procedures. Interestingly, mRNA expression levels and miRNA expressions showed strong similarities between both models after MI, with few specificities in each model, activating similar signal transduction pathways. To our knowledge, this is the first comparison of genomic alterations of mRNA and miRNA contents after two different MI procedures and identifies key signaling regulators modulating the pathophysiology of these two models that might culminate in heart failure. Furthermore, these analyses may contribute with the current knowledge concerning transcriptional and post-transcriptional changes of AB-RF protocol, arising as an alternative and effective MI method that reproduces most changes seem in coronary occlusion. PMID:27932994
Isoforms of a cuticular protein from larvae of the meal beetle, Tenebrio molitor, studied by mass spectrometry in combination with Edman degradation and two-dimensional polyacrylamide gel electrophoresis.

PubMed Central

Haebel, S.; Jensen, C.; Andersen, S. O.; Roepstorff, P.

1995-01-01

Simultaneous sequencing, using a combination of mass spectrometry and Edman degradation, of three approximately 15-kDa variants of a cuticular protein extracted from the meal beetle Tenebrio molitor larva is demonstrated. The information obtained by matrix-assisted laser desorption ionization mass spectrometry (MALDI MS) time-course monitoring of enzymatic digests was found essential to identify the differences among the three variants and for alignment of the peptides in the sequence. To determine whether each individual insect larva contains all three protein variants, proteins extracted from single animals were separated by two-dimensional gel electrophoresis, electroeluted from the gel spots, and analyzed by MALDI MS. Molecular weights of the proteins present in each sample could be obtained, and mass spectrometric mapping of the peptides after digestion with trypsin gave additional information. The protein isoforms were found to be allelic variants. PMID:7795523
Isoforms of a cuticular protein from larvae of the meal beetle, Tenebrio molitor, studied by mass spectrometry in combination with Edman degradation and two-dimensional polyacrylamide gel electrophoresis.

PubMed

Haebel, S; Jensen, C; Andersen, S O; Roepstorff, P

1995-03-01

Simultaneous sequencing, using a combination of mass spectrometry and Edman degradation, of three approximately 15-kDa variants of a cuticular protein extracted from the meal beetle Tenebrio molitor larva is demonstrated. The information obtained by matrix-assisted laser desorption ionization mass spectrometry (MALDI MS) time-course monitoring of enzymatic digests was found essential to identify the differences among the three variants and for alignment of the peptides in the sequence. To determine whether each individual insect larva contains all three protein variants, proteins extracted from single animals were separated by two-dimensional gel electrophoresis, electroeluted from the gel spots, and analyzed by MALDI MS. Molecular weights of the proteins present in each sample could be obtained, and mass spectrometric mapping of the peptides after digestion with trypsin gave additional information. The protein isoforms were found to be allelic variants.
Complete covalent structure of statherin, a tyrosine-rich acidic peptide which inhibits calcium phosphate precipitation from human parotid saliva.

PubMed

Schlesinger, D H; Hay, D I

1977-03-10

The complete amino acid sequence of human salivary statherin, a peptide which strongly inhibits precipitation from supersaturated calcium phosphate solutions, and therefore stabilizes supersaturated saliva, has been determined. The NH2-terminal half of this Mr=5380 (43 amino acids) polypeptide was determined by automated Edman degradations (liquid phase) on native statherin. The peptide was digested separately with trypsin, chymotrypsin, and Staphylococcus aureus protease, and the resulting peptides were purified by gel filtration. Manual Edman degradations on purified peptide fragments yielded peptides that completed the amino acid sequence through the penultimate COOH-terminal residue. These analyses, together with carboxypeptidase digestion of native statherin and of peptide fragments of statherin, established the complete sequence of the molecule. The 2 serine residues (positions 2 and 3) in statherin were identified as phosphoserine. The amino acid sequence of human salivary statherin is striking in a number of ways. The NH2-terminal one-third is highly polar and includes three polar dipeptides: H2PO3-Ser-Ser-H2PO3-Arg-Arg-, and Glu-Glu-. The COOH-terminal two-thirds of the molecule is hydrophobic, containing several repeating dipeptides: four of -Gn-Pro-, three of -Tyr-Gln-, two of -Gly-Tyr-, two of-Gln-Tyr-, and two of the tetrapeptide sequence -Pro-Tyr-Gln-Pro-. Unusual cleavage sites in the statherin sequence obtained with chymotrypsin and S. aureus protease were also noted.
Identification of Tumor Suppressor Genes by Genetic and Epigenetic Genome-Scanning

DTIC Science & Technology

2008-04-01

SECURITY CLASSIFICATION OF: 17. LIMITATION OF ABSTRACT 18. NUMBER OF PAGES 19a. NAME OF RESPONSIBLE PERSON USAMRMC a. REPORT U b. ABSTRACT U...oncogene-related sequences in human neuroblastomas. Cell 35: 359-67; 1983. 3. Capon, D. J.; Seeburg, P. H.; McGrath, J. P.; Hayflick , J. S.; Edman
Characterization, production, and purification of leucocin H, a two-peptide bacteriocin from Leuconostoc MF215B.

PubMed

Blom, H; Katla, T; Holck, A; Sletten, K; Axelsson, L; Holo, H

1999-07-01

Leuconostoc MF215B was found to produce a two-peptide bacteriocin referred to as leucocin H. The two peptides were termed leucocin Halpha and leucocin Hbeta. When acting together, they inhibit, among others, Listeria monocytogenes, Bacillus cereus, and Clostridium perfringens. Production of leucocin H in growth medium takes place at temperatures down to 6 degrees C and at pH below 7. The highest activity of leucocin H in growth medium was demonstrated in the late exponential growth phase. The bacteriocin was purified by precipitation with ammonium sulfate, ion-exchange (SP Sepharose) and reverse phase chromatography. Upon purification, specific activity increased 10(5)-fold, and the final specific activity was 2 x 10(7) BU/OD280. Amino acid composition analyses of leucocin Halpha and leucocin Hbeta indicated that both peptides consisted of around 40 amino acid residues. Their N-termini were blocked for Edman degradation, and the methionin residues of leucocin Hbeta did not respond to Cyanogen Bromide (CNBr) cleavage. Absorbance at 280 nm indicated the presence of tryptophan residues and tryptophan-fracturing opened for partial sequencing by Edman degradation. From leucocin Halpha, the sequence of 20 amino acids was obtained; from leucocin Hbeta the sequence of 28 amino acid residues was obtained. No sequence homology to other known bacteriocins could be demonstrated. It also appeared that the two peptides themselves shared little or no sequence homology. The presence of soy oil did not affect the activity of leucocin H in agar.
Dipeptide Sequence Determination: Analyzing Phenylthiohydantoin Amino Acids by HPLC

NASA Astrophysics Data System (ADS)

Barton, Janice S.; Tang, Chung-Fei; Reed, Steven S.

2000-02-01

Amino acid composition and sequence determination, important techniques for characterizing peptides and proteins, are essential for predicting conformation and studying sequence alignment. This experiment presents improved, fundamental methods of sequence analysis for an upper-division biochemistry laboratory. Working in pairs, students use the Edman reagent to prepare phenylthiohydantoin derivatives of amino acids for determination of the sequence of an unknown dipeptide. With a single HPLC technique, students identify both the N-terminal amino acid and the composition of the dipeptide. This method yields good precision of retention times and allows use of a broad range of amino acids as components of the dipeptide. Students learn fundamental principles and techniques of sequence analysis and HPLC.
Isolation, purification and functional characterization of alpha-BnIA from Conus bandanus venom.

PubMed

Nguyen, Bao; Le Caer, Jean-Pierre; Aráoz, Romulo; Thai, Robert; Lamthanh, Hung; Benoit, Evelyne; Molgó, Jordi

2014-12-01

We report the isolation and characterization by proteomic approach of a native conopeptide, named BnIA, from the crude venom of Conus bandanus, a molluscivorous cone snail species, collected in the South central coast of Vietnam. Its primary sequence was determined by matrix-assisted laser desorption/ionization time-of-flight tandem mass spectrometry using collision-induced dissociation and confirmed by Edman's degradation of the pure native fraction. BnIA was present in high amounts in the crude venom and the complete sequence of the 16 amino acid peptide was the following GCCSHPACSVNNPDIC*, with C-terminal amidation deduced from Edman's degradation and theoretical monoisotopic mass calculation. Sequence alignment revealed that its -C1C2X4C3X7C4- pattern belongs to the A-superfamily of conopeptides. The cysteine connectivity of BnIA was 1-3/2-4 as determined by partial-reduction technique, like other α4/7-conotoxins, reported previously on other Conus species. Additionally, we found that native α-BnIA shared the same sequence alignment as Mr1.1, from the closely related molluscivorous Conus marmoreus venom, in specimens collected in the same coastal region of Vietnam. Functional studies revealed that native α-BnIA inhibited acetylcholine-evoked currents reversibly in oocytes expressing the human α7 nicotinic acetylcholine receptors, and blocked nerve-evoked skeletal muscle contractions in isolated mouse neuromuscular preparations, but with ∼200-times less potency. Copyright © 2014 Elsevier Ltd. All rights reserved.
Comparison of Comparative Genomic Hybridization Technologies across Microarray Platforms

EPA Science Inventory

In the 2007 Association of Biomolecular Resource Facilities (ABRF) Microarray Research Group (MARG) project, we analyzed HL-60 DNA with five platforms: Agilent, Affymetrix 500K, Affymetrix U133 Plus 2.0, Illumina, and RPCI 19K BAC arrays. Copy number variation (CNV) was analyzed ...
2008 Microarray Research Group (MARG Survey): Sensing the State of Microarray Technology

EPA Science Inventory

Over the past several years, the field of microarrays has grown and evolved drastically. In its continued efforts to track this evolution and transformation, the ABRF-MARG has once again conducted a survey of international microarray facilities and individual microarray users. Th...
Primary structure of Lep d I, the main Lepidoglyphus destructor allergen.

PubMed

Varela, J; Ventas, P; Carreira, J; Barbas, J A; Gimenez-Gallego, G; Polo, F

1994-10-01

The most relevant allergen of the storage mite Lepidoglyphus destructor (Lep d I) has been characterized. Lep d I is a monomer protein of 13273 Da. The primary structure of Lep d I was determined by N-terminal Edman degradation and partially confirmed by cDNA sequencing. Sequence polymorphism was observed at six positions, with non-conservative substitutions in three of them. No potential N-glycosylation site was revealed by peptide sequencing. The 125-residue sequence of Lep d I shows approximately 40% identity (including the six cysteines) with the overlapping regions of group II allergens from the genus Dermatophagoides, which, however, do not share common allergenic epitopes with Lep d I.

Characterization and analysis of posttranslational modifications of the human large cytoplasmic ribosomal subunit proteins by mass spectrometry and Edman sequencing.

PubMed

Odintsova, Tatyana I; Müller, Eva-Christina; Ivanov, Anton V; Egorov, Tsezi A; Bienert, Ralf; Vladimirov, Serguei N; Kostka, Susanne; Otto, Albrecht; Wittmann-Liebold, Brigitte; Karpova, Galina G

2003-04-01

The 60S ribosomal proteins were isolated from ribosomes of human placenta and separated by reversed phase HPLC. The fractions obtained were subjected to trypsin and Glu-C digestion and analyzed by mass fingerprinting (MALDI-TOF), MS/MS (ESI), and Edman sequencing. Forty-six large subunit proteins were found, 22 of which showed masses in accordance with the SwissProt database (June 2002) masses (proteins L6, L7, L9, L13, L15, L17, L18, L21, L22, L24, L26, L27, L30, L32, L34, L35, L36, L37, L37A, L38, L39, L41). Eleven (proteins L7, L10A, L11, L12, L13A, L23, L23A, L27A, L28, L29, and P0) resulted in mass changes that are consistent with N-terminal loss of methionine, acetylation, internal methylation, or hydroxylation. A loss of methionine without acetylation was found for protein L8 and L17. For nine proteins (L3, L4, L5, L7A, L10, L14, L19, L31, and L40), the molecular masses could not be determined. Proteins P1 and protein L3-like were not identified by the methods applied.
Cloning and characterization of 2S albumin, Car i 1, a major allergen in pecan.

PubMed

Sharma, Girdhari M; Irsigler, Andre; Dhanarajan, Pushparani; Ayuso, Rosalia; Bardina, Luda; Sampson, Hugh A; Roux, Kenneth H; Sathe, Shridhar K

2011-04-27

Although pecans are associated with IgE-mediated food allergies, the allergens responsible remain to be identified and characterized. The 2S albumin gene was amplified from the pecan cDNA library. Dot-blots were used to screen the recombinant protein with pecan allergic patients' serum. The affinity purified native protein was analyzed by Edman sequencing and mass spectrometry/mass spectrometry (MS/MS) analysis. Cross-reactivity with walnut was determined by inhibition enzyme-linked immunosorbent assay (ELISA). Sequential epitopes were determined by probing the overlapping peptides with three different patients' serum pool. The 3-dimensional homology model was generated, and the locations of the pecan epitopes were compared with those of known sequential epitopes on other allergenic tree nut homologues. Of 28 patients tested by dot-blot, 22 (79%) bound to 2S albumin, designated as Car i 1. Edman sequencing and the MS/MS sequencing of native 2S albumin confirmed the identity of recombinant (r) Car i 1. Both pecan and walnut protein extracts inhibited the IgE-binding to rCar i 1. Sequential epitope mapping indicated weak, moderate, and strong reactivity against 12, 7, and 5 peptides, respectively. Of the 11 peptides recognized by all serum pools, 5 peptides were strongly reactive and located in 3 discrete regions of the Car i 1 (amino acids 43-57, 67-78, and 106-120). Three-dimensional modeling revealed IgE-reactive epitopes to be solvent accessible and share significant homology with other tree nuts providing a possible basis for previously observed cross-reactivity.
A cardioactive peptide from the southern armyworm, Spodoptera eridania.

PubMed

Furuya, K; Hackett, M; Cirelli, M A; Schegg, K M; Wang, H; Shabanowitz, J; Hunt, D F; Schooley, D A

1999-01-01

A cardioactive peptide was isolated from extracts of whole heads of the southern armyworm, Spodoptera eridania. This peptide has the sequence ENFAVGCTPGYQRTADGRCKPTF (Mr = 2516.8), determined from both Edman sequencing and tandem mass spectrometry in combination with off-line micropreparative capillary liquid chromatography. This peptide, termed Spoer-CAP23, has excitatory effects on a semi-isolated heart from larval Manduca sexta, causing an inotropic effect at low concentrations of peptide and chronotropic and inotropic effects at high doses. The threshold concentration for stimulatory effects of the synthetic peptide on the semi-isolated heart was about 1 nM, suggesting a physiological role as a neuropeptide.
Protein Sequencing with Tandem Mass Spectrometry

NASA Astrophysics Data System (ADS)

Ziady, Assem G.; Kinter, Michael

The recent introduction of electrospray ionization techniques that are suitable for peptides and whole proteins has allowed for the design of mass spectrometric protocols that provide accurate sequence information for proteins. The advantages gained by these approaches over traditional Edman Degradation sequencing include faster analysis and femtomole, sometimes attomole, sensitivity. The ability to efficiently identify proteins has allowed investigators to conduct studies on their differential expression or modification in response to various treatments or disease states. In this chapter, we discuss the use of electrospray tandem mass spectrometry, a technique whereby protein-derived peptides are subjected to fragmentation in the gas phase, revealing sequence information for the protein. This powerful technique has been instrumental for the study of proteins and markers associated with various disorders, including heart disease, cancer, and cystic fibrosis. We use the study of protein expression in cystic fibrosis as an example.
Enterocin T, a novel class IIa bacteriocin produced by Enterococcus sp. 812.

PubMed

Chen, Yi-Sheng; Yu, Chi-Rong; Ji, Si-Hua; Liou, Min-Shiuan; Leong, Kun-Hon; Pan, Shwu-Fen; Wu, Hui-Chung; Lin, Yu-Hsuan; Yu, Bi; Yanagida, Fujitoshi

2013-09-01

Enterococcus sp. 812, isolated from fresh broccoli, was previously found to produce a bacteriocin active against a number of Gram-positive bacteria, including Listeria monocytogenes. Bacteriocin activity decreased slightly after autoclaving (121 °C for 15 min), but was inactivated by protease K. Mass spectrometry analysis revealed the bacteriocin mass to be approximately 4,521.34 Da. N-terminal amino acid sequencing yielded a partial sequence, NH2-ATYYGNGVYXDKKKXWVEWGQA, by Edman degradation, which contained the consensus class IIa bacteriocin motif YGNGV in the N-terminal region. The obtained partial sequence showed high homology with some enterococcal bacteriocins; however, no identical peptide or protein was found. This peptide was therefore considered to be a novel bacteriocin produced by Enterococcus sp. 812 and was termed enterocin T.
Determination and reoxidation of the disulfide bridges of a squash-type trypsin inhibitor from Sechium edule seeds.

PubMed

Faça, Vitor M; Pereira, Sandra R; Laure, Hélen J; Greene, Lewis J

2004-07-01

The determination of the disulfide pairings of SETI-II, a trypsin inhibitor isolated from Sechium edule, is described herein. The inhibitor contains 31 amino acid residues per mol, 6 of which are cysteine. Forty-five nmol (160 microg) of SETI-II was hydrolyzed with 20 microg thermolysin for 48 hr at 45 degrees C, and peptides were separated by reverse phase high performance liquid chromatography (RP-HPLC). The major products were identified by amino acid composition, Edman degradation, and on the basis of the sequence of the inhibitor. The disulfide bridge pairings and (yields) are: Cys1-Cys4 (79%), Cys2-Cys5 (21%) and Cys3-Cys6 (43%). When the reduced inhibitor was reoxidized with glutathione reduced form (GSH)/glutathione oxidized form (GSSG) at pH 8.5 for 3 hr, full activity was recovered. These data show that disulfide bridge pairing and oxidation can be determined at nanomole levels and that sensitive and quantitative Edman degradation can eliminate the final time- and material-consuming step of disulfide determinations by eliminating the need to purify and cleave each peptide containing a disulfide bridge.
Peptidomic approach identifies cruzioseptins, a new family of potent antimicrobial peptides in the splendid leaf frog, Cruziohyla calcarifer.

PubMed

Proaño-Bolaños, Carolina; Zhou, Mei; Wang, Lei; Coloma, Luis A; Chen, Tianbao; Shaw, Chris

2016-09-02

Phyllomedusine frogs are an extraordinary source of biologically active peptides. At least 8 families of antimicrobial peptides have been reported in this frog clade, the dermaseptins being the most diverse. By a peptidomic approach, integrating molecular cloning, Edman degradation sequencing and tandem mass spectrometry, a new family of antimicrobial peptides has been identified in Cruziohyla calcarifer. These 15 novel antimicrobial peptides of 20-32 residues in length are named cruzioseptins. They are characterized by having a unique shared N-terminal sequence GFLD- and the sequence motifs -VALGAVSK- or -GKAAL(N/G/S) (V/A)V- in the middle of the peptide. Cruzioseptins have a broad spectrum of antimicrobial activity and low haemolytic effect. The most potent cruzioseptin was CZS-1 that had a MIC of 3.77μM against the Gram positive bacterium, Staphylococcus aureus and the yeast Candida albicans. In contrast, CZS-1 was 3-fold less potent against the Gram negative bacterium, Escherichia coli (MIC 15.11μM). CZS-1 reached 100% haemolysis at 120.87μM. Skin secretions from unexplored species such as C. calcarifer continue to demonstrate the enormous molecular diversity hidden in the amphibian skin. Some of these novel peptides may provide lead structures for the development of a new class of antibiotics and antifungals of therapeutic use. Through the combination of molecular cloning, Edman degradation sequencing, tandem mass spectrometry and MALDI-TOF MS we have identified a new family of 15 antimicrobial peptides in the skin secretion of Cruziohyla calcarifer. The novel family is named "Cruzioseptins" and contains cationic amphipathic peptides of 20-32 residues. They have a broad range of antimicrobial activity that also includes effective antifungals with low haemolytic activity. Therefore, C. calcarifer has proven to be a rich source of novel peptides, which could become leading structures for the development of novel antibiotics and antifungals of clinical application. Copyright © 2016 Elsevier B.V. All rights reserved.
Mass Spectrometry Data Collection in Parallel at Multiple Core Facilities Operating TripleTOF 5600 and Orbitrap Elite/Velos Pro/Q Exactive Mass Spectrometers

PubMed Central

Jones, K.; Kim, K.; Patel, B.; Kelsen, S.; Braverman, A.; Swinton, D.; Gafken, P.; Jones, L.; Lane, W.; Neveu, J.; Leung, H.; Shaffer, S.; Leszyk, J.; Stanley, B.; Fox, T.; Stanley, A.; Yeung, Anthony

2013-01-01

Proteomic research can benefit from simultaneous access to multiple cutting-edge mass spectrometers. 18 core facilities responded to our investigators seeking service through the ABRF Discussion Forum. Five of the facilities selected completed four plasma proteomics experiments as routine fee-for-service. Each biological experiment entailed an iTRAQ 4-plex proteome comparison of immunodepleted plasma provided as 30 labeled-peptide fractions. Identical samples were analyzed by two AB SCIEX TripleTOF 5600 and three Thermo Orbitrap (Elite/Velos Pro/Q Exactive) instruments. 480 LC-MS/MS runs delivered >250 GB of data over two months. We compare herein routine service analyses of three peptide fractions of different peptide abundance. Data files from each instrument were studied to develop optimal analysis parameters to compare with default parameters in Mascot Distiller 2.4, ProteinPilot 4.5 beta, AB Sciex MS Data Converter 1.3 beta, and Proteome Discover 1.3. Peak-picking for TripleTOFs was best by ProteinPilot 4.5 beta while Mascot Distiller and Proteome Discoverer were comparable for the Orbitraps. We compared protein identification and quantitation in SwissProt 2012_07 database by Mascot Server 2.4.01 versus ProteinPilot. By all search methods, more proteins, up to two fold, were identified using the Q Exactive than others. Q Exactive excelled also at the number of unique significant peptide ion sequences. However, software-dependent impact on subsequent interpretation, due to peptide modifications, can be critical. These findings may have special implications for iTRAQ plasma proteomics. For the low abundance peptide ions, the slope of the dynamic range drop-off in the plasma proteome is uniquely sharp compared with cell lysates. Our study provides data for testable improvements in the operation of these mass spectrometers. More importantly, we have demonstrated a new affordable expedient workflow for investigators to perform proteomic experiments through the ABRF infrastructure. (We acknowledge John Cottrell for optimizing the peak-picking parameters for Mascot Distiller).
Partial amino acid sequence of the branched chain amino acid aminotransferase (TmB) of E. coli JA199 pDU11

DOE Office of Scientific and Technical Information (OSTI.GOV)

Feild, M.J.; Armstrong, F.B.

1987-05-01

E. coli JA199 pDU11 harbors a multicopy plasmid containing the ilv GEDAY gene cluster of S. typhimurium. TmB, gene product of ilv E, was purified, crystallized, and subjected to Edman degradation using a gas phase sequencer. The intact protein yielded an amino terminal 31 residue sequence. Both carboxymethylated apoenzyme and (/sup 3/H)-NaBH-reduced holoenzyme were then subjected to digestion by trypsin. The digests were fractionated using reversed phase HPLC, and the peptides isolated were sequenced. The borohydride-treated holoenzyme was used to isolate the cofactor-binding peptide. The peptide is 27 residues long and a comparison with known sequences of other aminotransferases revealedmore » limited homology. Peptides accounting for 211 of 288 predicted residues have been sequenced, including 9 residues of the carboxyl terminus. Comparison of peptides with the inferred amino acid sequence of the E. coli K-12 enzyme has helped determine the sequence of the amino terminal 59 residues; only two differences between the sequences are noted in this region.« less
[Hemoglobins, XXXII. Analysis of the primary structure of the monomeric hemoglobin CTT VIIA (erythrocruorin) or Chironomus thummi thummi, Diptera (author's transl)].

PubMed

Kleinschmidt, T; Braunitzer, G

1980-01-01

The dimeric hemoglobin CTT VIIA (erythrocruorin) was isolated from the hemolymph of the larva from Chironomus thummi thummi and purified by preparative polyacrylamide gel electrophoresis. Peptides obtained by limited tryptical digestion were sequenced by automatic Edman degradation. For the elucidation of the sequence in the C-terminal region of the chain, additional cleavages with proteinase of Staphylococcus aureus and chymotrypsin were necessary. CTT VIIA is compared with human beta-chains and other hemoglobins of Chironomus. The amino acid residues in the pocket are especially discussed. Most of them are invariant in all Chironomus hemoglobins, independent of the size of the heme pocket, which is normal in some components and enlarged in others.
Brain cDNA clone for human cholinesterase

DOE Office of Scientific and Technical Information (OSTI.GOV)

McTiernan, C.; Adkins, S.; Chatonnet, A.

1987-10-01

A cDNA library from human basal ganglia was screened with oligonucleotide probes corresponding to portions of the amino acid sequence of human serum cholinesterase. Five overlapping clones, representing 2.4 kilobases, were isolated. The sequenced cDNA contained 207 base pairs of coding sequence 5' to the amino terminus of the mature protein in which there were four ATG translation start sites in the same reading frame as the protein. Only the ATG coding for Met-(-28) lay within a favorable consensus sequence for functional initiators. There were 1722 base pairs of coding sequence corresponding to the protein found circulating in human serum.more » The amino acid sequence deduced from the cDNA exactly matched the 574 amino acid sequence of human serum cholinesterase, as previously determined by Edman degradation. Therefore, our clones represented cholinesterase rather than acetylcholinesterase. It was concluded that the amino acid sequences of cholinesterase from two different tissues, human brain and human serum, were identical. Hybridization of genomic DNA blots suggested that a single gene, or very few genes coded for cholinesterase.« less
Nucleotide sequence analysis of the gene encoding the Deinococcus radiodurans surface protein, derived amino acid sequence, and complementary protein chemical studies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Peters, J.; Peters, M.; Lottspeich, F.

1987-11-01

The complete nucleotide sequence of the gene encoding the surface (hexagonally packed intermediate (HPI))-layer polypeptide of Deinococcus radiodurans Sark was determined and found to encode a polypeptide of 1036 amino acids. Amino acid sequence analysis of about 30% of the residues revealed that the mature polypeptide consists of at least 978 amino acids. The N terminus was blocked to Edman degradation. The results of proteolytic modification of the HPI layer in situ and M/sub r/ estimations of the HPI polypeptide expressed in Escherichia coli indicated that there is a leader sequence. The N-terminal region contained a very high percentage (29%)more » of threonine and serine, including a cluster of nine consecutive serine or threonine residues, whereas a stretch near the C terminus was extremely rich in aromatic amino acids (29%). The protein contained at least two disulfide bridges, as well as tightly bound reducing sugars and fatty acids.« less
Article Watch, April 2010

PubMed Central

Slaughter, Clive

2010-01-01

This column highlights recently published articles that are of interest to the readership of this publication. We encourage ABRF members to forward information about articles they feel are important and useful to Clive Slaughter, MCG-UGA Medical Partnership, 279 William St., Athens, GA 30607-1777, USA. Tel.: (706) 369-5945: Fax: (706) 369-5936; E-mail: cslaughter@mail.mcg.edu; or to any member of the editorial board. Article summaries reflect the reviewer's opinions and not necessarily those of the association.
Article Watch: April 2018

PubMed Central

Slaughter, Clive A.

2018-01-01

This column highlights recently published articles that are of interest to the readership of this publication. We encourage ABRF members to forward information on articles they feel are important and useful to Clive Slaughter, MCG-UGA Medical Partnership, 1425 Prince Ave., Athens, GA 30606, USA. Tel: (706) 713-2216; Fax: (706) 713-2221; E-mail: cslaught@uga.edu, or to any member of the editorial board. Article summaries reflect the reviewer’s opinions and not necessarily those of the association. PMID:29463959
ABRF-PRG07: advanced quantitative proteomics study.

PubMed

Falick, Arnold M; Lane, William S; Lilley, Kathryn S; MacCoss, Michael J; Phinney, Brett S; Sherman, Nicholas E; Weintraub, Susan T; Witkowska, H Ewa; Yates, Nathan A

2011-04-01

A major challenge for core facilities is determining quantitative protein differences across complex biological samples. Although there are numerous techniques in the literature for relative and absolute protein quantification, the majority is nonroutine and can be challenging to carry out effectively. There are few studies comparing these technologies in terms of their reproducibility, accuracy, and precision, and no studies to date deal with performance across multiple laboratories with varied levels of expertise. Here, we describe an Association of Biomolecular Resource Facilities (ABRF) Proteomics Research Group (PRG) study based on samples composed of a complex protein mixture into which 12 known proteins were added at varying but defined ratios. All of the proteins were present at the same concentration in each of three tubes that were provided. The primary goal of this study was to allow each laboratory to evaluate its capabilities and approaches with regard to: detection and identification of proteins spiked into samples that also contain complex mixtures of background proteins and determination of relative quantities of the spiked proteins. The results returned by 43 participants were compiled by the PRG, which also collected information about the strategies used to assess overall performance and as an aid to development of optimized protocols for the methodologies used. The most accurate results were generally reported by the most experienced laboratories. Among laboratories that used the same technique, values that were closer to the expected ratio were obtained by more experienced groups.
Two new bradykinin-related peptides from the venom of the social wasp Protopolybia exigua (Saussure).

PubMed

Mendes, Maria Anita; Palma, Mario Sergio

2006-11-01

Two bradykinin-related peptides (Protopolybiakinin-I and Protopolybiakinin-II) were isolated from the venom of the social wasp Protopolybia exigua by RP-HPLC, and sequenced by Edman degradation method. Peptide sequences of Protopolybiakinin-I and Protopolybiakinin-II were DKNKKPIRVGGRRPPGFTR-OH and DKNKKPIWMAGFPGFTPIR-OH, respectively. Synthetic peptides with identical sequences to the bradykinin-related peptides and their biological functions were characterized. Protopolybiakinin-I caused less potent constriction of the isolated rat ileum muscles than bradykinin (BK). In addition, it caused degranulation of mast cells which was seven times more potent than BK. This peptide causes algesic effects due to the direct activation of B(2)-receptors. Protopolybiakinin-II is not an agonist of rat ileum muscle and had no algesic effects. However, Protopolybiakinin-II was found to be 10 times more potent as a mast cell degranulator than BK. The amino acid sequence of Protopolybiakinin-I is the longest among the known wasp kinins.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Kem, W.; Dunn, B.; Parten, B.

A 5000 dalton polypeptide neurotoxin (Sh-NI) purified by G50 Sephadex, P-cellulose, and SP-Sephadex chromatography was homogeneous by isoelectric focusing. Sh-NI was highly toxic to crayfish (LD/sub 50/ 0.6 ..mu..g/kg) but without effect upon mice at 15,000 ..mu..g/kg (i.p. injection). The reduced, /sup 3/H-carboxymethylated toxin and its fragments were subjected to automatic Edman degradation and the resulting PTH-amino acids were identified by HPLC, back hydrolysis, and scintillation counting. Peptides resulting from proteolytic (clostripain, staphylococcal protease) and chemical (tryptophan) cleavage were sequenced. The sequence is: AACKCDDEGPDIRTAPLTGTVDLGSCNAGWEKCASYYTIIADCCRKKK. This sequence differs considerably from the homologous Anemonia and Anthopleura toxins; many of the identical residuesmore » (6 half-cystines, G9, P10, R13, G19, G29, W30) are probably critical for folding rather than receptor recognition. However, the Sh-NI sequence closely resembles Radioanthus macrodactylus neurotoxin III and r. paumotensis II. The authors propose that Sh-NI and related Radioanthus toxins act upon a different site on the sodium channel.« less
Proprotein convertases generate a highly functional heterodimeric form of thymic stromal lymphopoietin in humans.

PubMed

Poposki, Julie A; Klingler, Aiko I; Stevens, Whitney W; Peters, Anju T; Hulse, Kathryn E; Grammer, Leslie C; Schleimer, Robert P; Welch, Kevin C; Smith, Stephanie S; Sidle, Douglas M; Conley, David B; Tan, Bruce K; Kern, Robert C; Kato, Atsushi

2017-05-01

Thymic stromal lymphopoietin (TSLP) is known to be elevated and truncated in nasal polyps (NPs) of patients with chronic rhinosinusitis and might play a significant role in type 2 inflammation in this disease. However, neither the structure nor the role of the truncated products of TSLP has been studied. We sought to investigate the mechanisms of truncation of TSLP in NPs and the function of the truncated products. We incubated recombinant human TSLP with NP extracts, and determined the protein sequence of the truncated forms of TSLP using Edman protein sequencing and matrix-assisted laser desorption/ionization-time of flight mass spectrometry. We investigated the functional activity of truncated TSLP using a PBMC-based bioassay. Edman sequencing and mass spectrometry results indicated that NP extracts generated 2 major truncated products, TSLP (residues 29-124) and TSLP (131-159). Interestingly, these 2 products remained linked with disulfide bonds and presented as a dimerized form, TSLP (29-124 + 131-159). We identified that members of the proprotein convertase were rate-limiting enzymes in the truncation of TSLP between residues 130 and 131 and generated a heterodimeric unstable metabolite TSLP (29-130 + 131-159). Carboxypeptidase N immediately digested 6 amino acids from the C terminus of the longer subunit of TSLP to generate a stable dimerized form, TSLP (29-124 + 131-159), in NPs. These truncations were homeostatic but primate-specific events. A metabolite TSLP (29-130 + 131-159) strongly activated myeloid dendritic cells and group 2 innate lymphoid cells compared with mature TSLP. Posttranslational modifications control the functional activity of TSLP in humans and overproduction of TSLP may be a key trigger for the amplification of type 2 inflammation in diseases. Copyright © 2016 American Academy of Allergy, Asthma & Immunology. Published by Elsevier Inc. All rights reserved.
Shotgun protein sequencing: assembly of peptide tandem mass spectra from mixtures of modified proteins.

PubMed

Bandeira, Nuno; Clauser, Karl R; Pevzner, Pavel A

2007-07-01

Despite significant advances in the identification of known proteins, the analysis of unknown proteins by MS/MS still remains a challenging open problem. Although Klaus Biemann recognized the potential of MS/MS for sequencing of unknown proteins in the 1980s, low throughput Edman degradation followed by cloning still remains the main method to sequence unknown proteins. The automated interpretation of MS/MS spectra has been limited by a focus on individual spectra and has not capitalized on the information contained in spectra of overlapping peptides. Indeed the powerful shotgun DNA sequencing strategies have not been extended to automated protein sequencing. We demonstrate, for the first time, the feasibility of automated shotgun protein sequencing of protein mixtures by utilizing MS/MS spectra of overlapping and possibly modified peptides generated via multiple proteases of different specificities. We validate this approach by generating highly accurate de novo reconstructions of multiple regions of various proteins in western diamondback rattlesnake venom. We further argue that shotgun protein sequencing has the potential to overcome the limitations of current protein sequencing approaches and thus catalyze the otherwise impractical applications of proteomics methodologies in studies of unknown proteins.
Isolation and Characterization of Acetylated Derivative of Recombinant Insulin Lispro Produced in Escherichia coli.

PubMed

Szewczak, Joanna; Bierczyńska-Krzysik, Anna; Piejko, Marcin; Mak, Paweł; Stadnik, Dorota

2015-07-01

Insulin lispro is a rapid-acting insulin analogue produced by recombinant DNA technology. As a biosynthetic drug, the protein undergoes strict monitoring aiming for detection and characterization of impurities. The goal of this study was to isolate and identify a derivative of insulin lispro formed during biosynthesis. For this purpose, ion exchange chromatography in combination with endoproteinase Glu-C digestion, MALDI-TOF/TOF mass spectrometry and Edman sequencing were employed. Ion exchange chromatography analysis of related proteins in development batches of recombinant insulin lispro revealed the existence of unknown derivative in excess of the assumed limit. Its molecular mass was 42 Da higher than the theoretical mass of Lys(B31) insulin lispro--one of the expected process-related intermediates. Endoproteinase Glu-C cleavage enabled indication of the modified peptide. Tandem mass spectrometry (MS/MS) allowed to explore the location and type of the modification. The 42 amu shift was present in the mass of y-type ions, while b-type ions were in agreement with theoretical values. It suggested that the modification is present on B31 lysine. Further inquiry revealed the presence of two diagnostic ions for lysine acetylation at m/z 143.1 and 126.1. In addition, the peptide was isolated and sequenced by Edman degradation. Standards of phenylthiohydantoin derivatives of N-ε-acetyl-L-lysine and N-ε-trimethyl-L-lysine, not available commercially, were synthesized in the laboratory. The retention time of the modified residue confirmed its identity as N-ε-acetyl-L-lysine. The derivative of insulin lispro formed during biosynthesis of the drug was identified to be N-ε-acetyl-L-lysine (B31) insulin lispro.

Enterocin 96, a Novel Class II Bacteriocin Produced by Enterococcus faecalis WHE 96, Isolated from Munster Cheese▿

PubMed Central

Izquierdo, Esther; Wagner, Camille; Marchioni, Eric; Aoude-Werner, Dalal; Ennahar, Saïd

2009-01-01

Enterococcus faecalis WHE 96, a strain isolated from soft cheese based on its anti-Listeria activity, produced a 5,494-Da bacteriocin that was purified to homogeneity by ultrafiltration and cation-exchange and reversed-phase chromatographies. The amino acid sequence of this bacteriocin, named enterocin 96, was determined by Edman degradation, and its structural gene was sequenced, revealing a double-glycine leader peptide. After a comparison with other bacteriocins, it was shown that enterocin 96 was a new class II bacteriocin that showed very little similarity with known structures. Enterocin 96 was indeed a new bacteriocin belonging to class II bacteriocins. The activity spectrum of enterocin 96 covered a wide range of bacteria, with strong activity against most gram-positive strains but very little or no activity against gram-negative strains. PMID:19411428
Enterocin 96, a novel class II bacteriocin produced by Enterococcus faecalis WHE 96, isolated from Munster cheese.

PubMed

Izquierdo, Esther; Wagner, Camille; Marchioni, Eric; Aoude-Werner, Dalal; Ennahar, Saïd

2009-07-01

Enterococcus faecalis WHE 96, a strain isolated from soft cheese based on its anti-Listeria activity, produced a 5,494-Da bacteriocin that was purified to homogeneity by ultrafiltration and cation-exchange and reversed-phase chromatographies. The amino acid sequence of this bacteriocin, named enterocin 96, was determined by Edman degradation, and its structural gene was sequenced, revealing a double-glycine leader peptide. After a comparison with other bacteriocins, it was shown that enterocin 96 was a new class II bacteriocin that showed very little similarity with known structures. Enterocin 96 was indeed a new bacteriocin belonging to class II bacteriocins. The activity spectrum of enterocin 96 covered a wide range of bacteria, with strong activity against most gram-positive strains but very little or no activity against gram-negative strains.
An improved procedure, involving mass spectrometry, for N-terminal amino acid sequence determination of proteins which are N alpha-blocked.

PubMed Central

Rose, K; Kocher, H P; Blumberg, B M; Kolakofsky, D

1984-01-01

A modification to a previously described procedure [Gray & del Valle (1970) Biochemistry 9, 2134-2137; Rose, Simona & Offord (1983) Biochem. J. 215, 261-272] for mass-spectral identification of the N-terminal regions of proteins is shown to be useful in cases where the N-terminus is blocked. Three proteins were studied: vesicular-stomatitis-virus N protein, Sendai-virus NP protein, and a rabbit immunoglobulin lambda-light chain. These proteins, found to be blocked at the N-terminus with either the acetyl group or a pyroglutamic acid residue, had all failed to yield to attempted Edman degradation, in one case even after attempted enzymic removal of the pyroglutamic acid residue. The N-terminal regions of all three proteins were sequenced by using the new procedure. PMID:6421284
Isolation and purification of two antioxidant peptides from alcalase hydrolysate of Arca subcrenata.

PubMed

Li, Ting-Fei; Ye, Bin; Song, Li-Yan; Yu, Rong-Min

2014-07-01

To investigate the constituents with antioxidant activities from alcalase hydrolysate of Arca subcrenata. The consecutive chromatographic methods were employed,including ion-exchange chromatography, gel filtration chromatography, and reverse phase high-performance liquid chromatography (RP-HPLC). The amino acid sequences of the purified antioxidant peptides were determined by automated Edman degradation. Under the guidance of the assay of scavenging free radicals, two peptides with antioxidant activities, termed as A-Bg1 and A-Bh, were isolated and purified from the alcalase hydrolysate of Arca subcrenata. Constituents from the hydrolysate of Arca subcrenata might be a new potential resource of antioxidants.
UNIT 10.7 Electroblotting from Polyacrylamide Gels

PubMed Central

Goldman, Aaron; Speicher, David W.

2015-01-01

Transferring proteins from polyacrylamide gels onto retentive membranes is now primarily used for immunoblotting. A second application that was quite common up to about a decade ago was electroblotting of proteins for N-terminal and internal sequencing using Edman chemistry. This unit contains procedures for electroblotting proteins from polyacrylamide gels onto a variety of membranes, including polyvinylidene difluoride (PVDF) and nitrocellulose. In addition to the commonly used tank or wet transfer system, protocols are provided for electroblotting using semidry and dry systems. This unit also describes procedures for eluting proteins from membranes using detergents or acidic extraction with organic solvents for specialized applications. PMID:26521711
Electroblotting from Polyacrylamide Gels.

PubMed

Goldman, Aaron; Ursitti, Jeanine A; Mozdzanowski, Jacek; Speicher, David W

2015-11-02

Transferring proteins from polyacrylamide gels onto retentive membranes is now primarily used for immunoblotting. A second application that was quite common up to about a decade ago was electroblotting of proteins for N-terminal and internal sequencing using Edman chemistry. This unit contains procedures for electroblotting proteins from polyacrylamide gels onto a variety of membranes, including polyvinylidene difluoride (PVDF) and nitrocellulose. In addition to the commonly used tank or wet transfer system, protocols are provided for electroblotting using semidry and dry systems. This unit also describes procedures for eluting proteins from membranes using detergents or acidic extraction with organic solvents for specialized applications. Copyright © 2015 John Wiley & Sons, Inc.
Low molecular weight squash trypsin inhibitors from Sechium edule seeds.

PubMed

Laure, Hélen J; Faça, Vítor M; Izumi, Clarice; Padovan, Júlio C; Greene, Lewis J

2006-02-01

Nine chromatographic components containing trypsin inhibitor activity were isolated from Sechium edule seeds by acetone fractionation, gel filtration, affinity chromatography and RP-HPLC in an overall yield of 46% of activity and 0.05% of protein. The components obtained with highest yield of total activity and highest specific activity were sequenced by Edman degradation and their molecular masses determined by mass spectrometry. The inhibitors contained 31, 32 and 27 residues per molecule and their sequences were: SETI-IIa, EDRKCPKILMRCKRDSDCLAKCTCQESGYCG; SETI-IIb, EEDRKCPKILMRCKRDSDCLAKCTCQESGYCG and SETI-V, CPRILMKCKLDTDCFPTCTCRPSGFCG. SETI-IIa and SETI-IIb, which differed by an amino-terminal E in the IIb form, were not separable under the conditions employed. The sequences are consistent with consensus sequences obtained from 37 other inhibitors: CPriI1meCk_DSDCla_C_C_G_CG, where capital letters are invariant amino acid residues and lower case letters are the most preserved in this position. SETI-II and SETI-V form complexes with trypsin with a 1:1 stoichiometry and have dissociation constants of 5.4x10(-11)M and 1.1x10(-9)M, respectively.
Interlaboratory studies and initiatives developing standards for proteomics

PubMed Central

Ivanov, Alexander R.; Colangelo, Christopher M.; Dufresne, Craig P.; Friedman, David B.; Lilley, Kathryn S.; Mechtler, Karl; Phinney, Brett S.; Rose, Kristie L.; Rudnick, Paul A.; Searle, Brian C.; Shaffer, Scott A.; Weintraub, Susan T.

2013-01-01

Proteomics is a rapidly transforming interdisciplinary field of research that embraces a diverse set of analytical approaches to tackle problems in fundamental and applied biology. This view-point article highlights the benefits of interlaboratory studies and standardization initiatives to enable investigators to address many of the challenges found in proteomics research. Among these initiatives, we discuss our efforts on a comprehensive performance standard for characterizing PTMs by MS that was recently developed by the Association of Biomolecular Resource Facilities (ABRF) Proteomics Standards Research Group (sPRG). PMID:23319436
Biomonitoring of carcinogenic substances: enzymatic digestion of globin for detecting alkylated amino acids

NASA Astrophysics Data System (ADS)

Bader, Michael; Rauscher, Dankwart; Geibel, Kurt; Angerer, Juergen

1993-03-01

We report the application of proteases for the total hydrolysis of globin with subsequent determination of amino acids. Optimization of the proteolysis was made with respect to enzyme concentration, time of incubation and type of protease. Ethylene oxide modified globin was used to compare the results of the analysis of the N-terminal amino acid valine after enzymatic cleavage to those obtained from the widely used modified Edman procedure. It is shown that the cleavage is of good reproducibility and yields more alkylated amino acid than the Edman procedure.
The removal of pyroglutamic acid from monoclonal antibodies without denaturation of the protein chains.

PubMed

Werner, William E; Wu, Sylvia; Mulkerrin, Michael

2005-07-01

Typically, the removal of pyroglutamate from the protein chains of immunoglobulins with the enzyme pyroglutamate aminopeptidase requires the use of chaotropic and reducing agents, quite often with limited success. This article describes a series of optimization experiments using elevated temperatures and detergents to denature and stabilize the heavy chains of immunoglobulins such that the pyroglutamate at the amino terminal was accessible to enzymatic removal using the thermostable protease isolated from Pyrococcus furiosus. The detergent polysorbate 20 (Tween 20) was used successfully to facilitate the removal of pyroglutamate residues. A one-step digestion was developed using elevated temperatures and polysorbate 20, rather than chaotropic and reducing agents, with sample cleanup and preparation for Edman sequencing performed using a commercial cartridge containing the PVDF membrane. All of the immunoglobulins digested with this method yielded heavy chain sequence, but the extent of deblocking was immunglobulin dependent (typically>50%).
Frontoxins, three-finger toxins from Micrurus frontalis venom, decrease miniature endplate potential amplitude at frog neuromuscular junction.

PubMed

Moreira, K G; Prates, M V; Andrade, F A C; Silva, L P; Beirão, P S L; Kushmerick, C; Naves, L A; Bloch, C

2010-08-01

Neurotoxicity is a major symptom of envenomation caused by Brazilian coral snake Micrurus frontalis. Due to the small amount of material that can be collected, no neurotoxin has been fully sequenced from this venom. In this work we report six new three-finger like toxins isolated from the venom of the coral snake M. frontalis which we named Frontoxin (FTx) I-VI. Toxins were purified using multiple steps of RP-HPLC. Molecular masses were determined by MALDI-TOF and ESI ion-trap mass spectrometry. The complete amino acid sequence of FTx II, III, IV and V were determined by sequencing of overlapping proteolytic fragments by Edman degradation and by de novo sequencing. The amino acid sequences of FTx I, II, III and VI predict 4 conserved disulphide bonds and structural similarity to previously reported short-chain alpha-neurotoxins. FTx IV and V each contained 10 conserved cysteines and share high similarity with long-chain alpha-neurotoxins. At the frog neuromuscular junction FTx II, III and IV reduced miniature endplate potential amplitudes in a time-and concentration-dependent manner suggesting Frontoxins block nicotinic acetylcholine receptors. Copyright 2010 Elsevier Ltd. All rights reserved.
Isolation, characterization, and primary structure of rubredoxin from the photosynthetic bacterium, Heliobacillus mobilis

NASA Technical Reports Server (NTRS)

Lee, W. Y.; Brune, D. C.; LoBrutto, R.; Blankenship, R. E.

1995-01-01

Rubredoxin is a small nonheme iron protein that serves as an electron carrier in bacterial systems. Rubredoxin has now been isolated and characterized from the strictly anaerobic phototroph, Heliobacillus mobilis. THe molecular mass (5671.3 Da from the amino acid sequence) was confirmed and partial formylation of the N-terminal methionyl residue was established by matrix-assisted laser desorption mass spectroscopy. The complete 52-amino-acid sequence was determined by a combination of N-terminal sequencing by Edman degradation and C-terminal sequencing by a novel method using carboxypeptidase treatment in conjunction with amino acid analysis and laser desorption time of flight mass spectrometry. The molar absorption coefficient of Hc. mobilis rubredoxin at 490 nm is 6.9 mM-1 cm-1 and the midpoint redox potential at pH 8.0 is -46 mV. The EPR spectrum of the oxidized form shows resonances at g = 9.66 and 4.30 due to a high-spin ferric iron. The amino acid sequence is homologous to those of rubredoxins from other species, in particular, the gram-positive bacteria, and the phototrophic green sulfur bacteria, and the evolutionary implications of this are discussed.
De novo protein sequencing by combining top-down and bottom-up tandem mass spectra.

PubMed

Liu, Xiaowen; Dekker, Lennard J M; Wu, Si; Vanduijn, Martijn M; Luider, Theo M; Tolić, Nikola; Kou, Qiang; Dvorkin, Mikhail; Alexandrova, Sonya; Vyatkina, Kira; Paša-Tolić, Ljiljana; Pevzner, Pavel A

2014-07-03

There are two approaches for de novo protein sequencing: Edman degradation and mass spectrometry (MS). Existing MS-based methods characterize a novel protein by assembling tandem mass spectra of overlapping peptides generated from multiple proteolytic digestions of the protein. Because each tandem mass spectrum covers only a short peptide of the target protein, the key to high coverage protein sequencing is to find spectral pairs from overlapping peptides in order to assemble tandem mass spectra to long ones. However, overlapping regions of peptides may be too short to be confidently identified. High-resolution mass spectrometers have become accessible to many laboratories. These mass spectrometers are capable of analyzing molecules of large mass values, boosting the development of top-down MS. Top-down tandem mass spectra cover whole proteins. However, top-down tandem mass spectra, even combined, rarely provide full ion fragmentation coverage of a protein. We propose an algorithm, TBNovo, for de novo protein sequencing by combining top-down and bottom-up MS. In TBNovo, a top-down tandem mass spectrum is utilized as a scaffold, and bottom-up tandem mass spectra are aligned to the scaffold to increase sequence coverage. Experiments on data sets of two proteins showed that TBNovo achieved high sequence coverage and high sequence accuracy.
The primary structure of the hemoglobin of spectacled bear (Tremarctos ornatus, Carnivora).

PubMed

Hofmann, O; Braunitzer, G

1987-08-01

The complete primary structure of the alpha- and beta-chains of the hemoglobin of Spectacled Bear (Tremarctos ornatus) is presented. Following cleavage of the heme-protein link and chain separation by RP-HPLC, their amino-acid sequences were determined by Edman degradation in liquid- and gas-phase sequenators. The hemoglobin of Spectacled Bear displays only five amino-acid exchanges to that of Polar Bear (Ursus maritimus, Ursinae) and Asiatic Black Bear (Ursus tibetanus, Ursinae) whereas 8 and 12 replacements, respectively, to Giant Panda (Ailuropoda melanoleuca) and Lesser Panda (Ailurus fulgens) can be found. This clearly demonstrates that the Spectacled Bear, the most aberrant bear of the Ursidae, is somewhat intermediate between Pandas and Ursinae.
Isolation and determination of the primary structure of a lectin protein from the serum of the American alligator (Alligator mississippiensis).

PubMed

Darville, Lancia N F; Merchant, Mark E; Maccha, Venkata; Siddavarapu, Vivekananda Reddy; Hasan, Azeem; Murray, Kermit K

2012-02-01

Mass spectrometry in conjunction with de novo sequencing was used to determine the amino acid sequence of a 35kDa lectin protein isolated from the serum of the American alligator that exhibits binding to mannose. The protein N-terminal sequence was determined using Edman degradation and enzymatic digestion with different proteases was used to generate peptide fragments for analysis by liquid chromatography tandem mass spectrometry (LC MS/MS). Separate analysis of the protein digests with multiple enzymes enhanced the protein sequence coverage. De novo sequencing was accomplished using MASCOT Distiller and PEAKS software and the sequences were searched against the NCBI database using MASCOT and BLAST to identify homologous peptides. MS analysis of the intact protein indicated that it is present primarily as monomer and dimer in vitro. The isolated 35kDa protein was ~98% sequenced and found to have 313 amino acids and nine cysteine residues and was identified as an alligator lectin. The alligator lectin sequence was aligned with other lectin sequences using DIALIGN and ClustalW software and was found to exhibit 58% and 59% similarity to both human and mouse intelectin-1. The alligator lectin exhibited strong binding affinities toward mannan and mannose as compared to other tested carbohydrates. Copyright © 2011 Elsevier Inc. All rights reserved.
Membrane-associated precursor to poliovirus VPg identified by immunoprecipitation with antibodies directed against a synthetic heptapeptide

DOE Office of Scientific and Technical Information (OSTI.GOV)

Semelr, B.L.; Anderson, C.W.; Hanecak, R.

1982-02-01

A synthetic heptapeptide corresponding to the C-terminal sequence of the poliovirus genome protein (VPg) has been linked to bovine serum albumin and used to raise antibodies in rabbits. These antibodies precipitate not only VPg but also at least two more virus-specific polypeptides. The smaller polypeptide, denoted P3-9 (12,000 daltons), has been mapped by Edman degradation and by fragmentation with cyanogen bromide and determined to be the N-terminal cleavage product of polypeptide P3-1b, a precursor to the RNA polymerase. P3-9 contains the sequence of the basic protein VPg (22 amino acids) at its C terminus. As predicted by the known RNAmore » sequence of poliovirus, P3-9 also contains a hydrophobic region of 22 amino acids preceding VPg, an observation suggesting that P3-9 may be membrane-associated. This was confirmed by fractionation of infected cells in the presence or absence of detergent. We speculate that P3-9 may be the donor of VPg to RNA chains in the membrane-bound RNA replication complex.« less
Organization of the hao gene cluster of Nitrosomonas europaea: genes for two tetraheme c cytochromes.

PubMed

Bergmann, D J; Arciero, D M; Hooper, A B

1994-06-01

The organization of genes for three proteins involved in ammonia oxidation in Nitrosomonas europaea has been investigated. The amino acid sequence of the N-terminal region and four heme-containing peptides produced by proteolysis of the tetraheme cytochrome c554 of N. europaea were determined by Edman degradation. The gene (cycA) encoding this cytochrome is present in three copies per genome (H. McTavish, F. LaQuier, D. Arciero, M. Logan, G. Mundfrom, J.A. Fuchs, and A. B. Hooper, J. Bacteriol. 175:2445-2447, 1993). Three clones, representing at least two copies of cycA, were isolated and sequenced by the dideoxy-chain termination procedure. In both copies, the sequences of 211 amino acids derived from the gene sequence are identical and include all amino acids predicted by the proteolytic peptides. In two copies, the cycA open reading frame (ORF) is followed closely (three bases in one copy) by a second ORF predicted to encode a 28-kDa tetraheme c cytochrome not previously characterized but similar to the nirT gene product of Pseudomonas stutzeri. In one copy of the cycA gene cluster, the second ORF is absent.
Enterocin TW21, a novel bacteriocin from dochi-isolated Enterococcus faecium D081821.

PubMed

Chang, S-Y; Chen, Y-S; Pan, S-F; Lee, Y-S; Chang, C-H; Chang, C-H; Yu, B; Wu, H-C

2013-09-01

Purification and characterization of a novel bacteriocin produced by strain Enterococcus faecium D081821. Enterococcus faecium D081821, isolated from the traditional Taiwanese fermented food dochi (fermented black beans), was previously found to produce a bacteriocin against Listeria monocytogenes and some Gram-positive bacteria. This bacteriocin, termed enterocin TW21, was purified from culture supernatant by ammonium sulfate precipitation, Sep-Pak C18 cartridge, ion-exchange and gel filtration chromatography. Mass spectrometry analysis showed the mass of the peptide to be approximately 5300·6 Da. The N-terminal amino acid sequencing yielded a partial sequence NH2 -ATYYGNGVYxNTQK by Edman degradation, and it contains the consensus class IIa bacteriocin motif YGNGV in the N-terminal region. The open reading frame (ORF) encoding the bacteriocin was identified from the draft genome sequence of Enterococcus faecium D081821, and sequence analysis of this peptide indicated that enterocin TW21 is a novel bacteriocin. Enterococcus faecium D081821 produced a bacteriocin named enterocin TW21, the molecular weight and amino acid sequence both revealed it to be a novel bacteriocin. A new member of class IIa bacteriocin was identified. This bacteriocin shows great inhibitory ability against L. monocytogenes and could be applied as a natural food preservative. © 2013 The Society for Applied Microbiology.
The tick plasma lectin, Dorin M, is a fibrinogen-related molecule.

PubMed

Rego, Ryan O M; Kovár, Vojtĕch; Kopácek, Petr; Weise, Christoph; Man, Petr; Sauman, Ivo; Grubhoffer, Libor

2006-04-01

A lectin, named Dorin M, previously isolated and characterized from the hemolymph plasma of the soft tick, Ornithodoros moubata, was cloned and sequenced. The immunofluorescence using confocal microscopy revealed that Dorin M is produced in the tick hemocytes. A tryptic cleavage of Dorin M was performed and the resulting peptide fragments were sequenced by Edman degradation and/or mass spectrometry. Two of three internal peptide sequences displayed a significant similarity to the family of fibrinogen-related molecules. Degenerate primers were designed and used for PCR with hemocyte cDNA as a template. The sequence of the whole Dorin M cDNA was completed by the method of RACE. The tissue-specific expression investigated by RT-PCR revealed that Dorin M, in addition to hemocytes, is significantly expressed in salivary glands. The derived amino-acid sequence clearly shows that Dorin M has a fibrinogen-like domain, and exhibited the most significant similarity with tachylectins 5A and 5B from a horseshoe crab, Tachypleus tridentatus. In addition, other protein and binding characteristics suggest that Dorin M is closely related to tachylectins-5. Since these lectins have been reported to function as non-self recognizing molecules, we believe that Dorin M may play a similar role in an innate immunity of the tick and, possibly, also in pathogen transmission by this vector.
Isolation and characterization of a new bacteriocin, termed enterocin M, produced by environmental isolate Enterococcus faecium AL41.

PubMed

Mareková, Mária; Lauková, Andrea; Skaugen, Morten; Nes, Ingolf

2007-08-01

The new bacteriocin, termed enterocin M, produced by Enterococcus faecium AL 41 showed a wide spectrum of inhibitory activity against the indicator organisms from different sources. It was purified by (NH4)2SO4 precipitation, cation-exchange chromatography and reverse phase chromatography (FPLC). The purified peptide was sequenced by N-terminal amino acid Edman degradation and a mass spectrometry analysis was performed. By combining the data obtained from amino acid sequence (39 N-terminal amino acid residues was determined) and the molecular weight (determined to be 4628 Da) it was concluded that the purified enterocin M is a new bacteriocin, which is very similar to enterocin P. However, its molecular weight is different from enterocin P (4701.25). Of the first 39 N-terminal residues of enterocin M, valine was found in position 20 and a lysine in position 35, while enterocin P has tryptophane residues in these positions.

The amino acid sequences of carboxypeptidases I and II from Aspergillus niger and their stability in the presence of divalent cations.

PubMed

Svendsen, I; Dal Degan, F

1998-09-08

The amino acid sequences of serine carboxypeptidase I (CPD-I) and II (CPD-II), respectively, from Aspergillus niger have been determined by conventional Edman degradation of the reduced and vinylpyridinated enzymes and peptides hereof generated by cleavage with cyanogen bromide, iodobenzoic acid, glutamic acid cleaving enzyme, AspN-endoproteinase and EndoLysC proteinase. CPD-I consists of a single peptide chain of 471 amino acid residues, three disulfide bridges and nine N-glycosylated asparaginyl residues, while CPD-II consists of a single peptide chain of 481 amino acid residues, has three disulfide bridges, one free cysteinyl residue and nine glycosylated asparaginyl residues. The enzymes are closely related to carboxypeptidase S3 from Penicillium janthinellum. Both Ca2+ and Mg2+ stabilize CPD-I as well as CPD-II, at basic pH values, Ca2+ being most effective, while the divalent ions have no effect on the activity of the two enzymes.
Isolation and identification of a cardioactive peptide from Tenebrio molitor and Spodoptera eridania.

PubMed

Furuya, K; Liao, S; Reynolds, S E; Ota, R B; Hackett, M; Schooley, D A

1993-12-01

We isolated several cardioactive peptides from extracts of whole heads of the mealworm, Tenebrio molitor, and the southern armyworm, Spodoptera eridania, using a semi-isolated heart of Manduca sexta for bioassay. We have now isolated from each species the peptide with the strongest effect on rate of contraction of the heart. The peptides were identified using micro Edman sequencing and mass spectrometric methods. This cardioactive peptide has the same primary structure from both species: Pro-Phe-Cys-Asn-Ala-Phe-Thr-Gly-Cys-NH2, a cyclic nonapeptide which is identical to crustacean cardioactive peptide (CCAP) originally isolated from the shore crab, Carcinus maenas, and subsequently isolated from Locusta migratoria and Manduca sexta. This is additional evidence that CCAP has widespread occurrence in arthropoda.
Proteomic analysis of the venom from the fish eating coral snake Micrurus surinamensis: novel toxins, their function and phylogeny.

PubMed

Olamendi-Portugal, Timoteo; Batista, Cesar V F; Restano-Cassulini, Rita; Pando, Victoria; Villa-Hernandez, Oscar; Zavaleta-Martínez-Vargas, Alfonso; Salas-Arruz, Maria C; Rodríguez de la Vega, Ricardo C; Becerril, Baltazar; Possani, Lourival D

2008-05-01

The protein composition of the soluble venom from the South American fish-eating coral snake Micrurus surinamensis surinamensis, here abbreviated M. surinamensis, was separated by RP-HPLC and 2-DE, and their components were analyzed by automatic Edman degradation, MALDI-TOF and ESI-MS/MS. Approximately 100 different molecules were identified. Sixty-two components possess molecular masses between 6 and 8 kDa, are basically charged molecules, among which are cytotoxins and neurotoxins lethal to fish (Brachidanios rerio). Six new toxins (abbreviated Ms1-Ms5 and Ms11) were fully sequenced. Amino acid sequences similar to the enzymes phospholipase A2 and amino acid oxidase were identified. Over 20 additional peptides were identified by sequencing minor components of the HPLC separation and from 2-DE gels. A functional assessment of the physiological activity of the six toxins was also performed by patch clamp using muscular nicotinic acetylcholine receptor assays. Variable degrees of blockade were observed, most of them reversible. The structural and functional data obtained were used for phylogenetic analysis, providing information on some evolutionary aspects of the venom components of this snake. This contribution increases by a factor of two the total number of alpha-neurotoxins sequenced from the Micrurus genus in currently available literature.
Identification of a major continuous epitope of human alpha crystallin

NASA Technical Reports Server (NTRS)

Takemoto, L.; Emmons, T.; Spooner, B. S. (Principal Investigator)

1992-01-01

Human lens proteins were digested with trypsin or V8 protease, and the resulting peptides resolved on a C18 reverse phase column. Fractions from this column were probed with polyclonal antiserum made against the whole alpha crystallin molecule. Peptides in the seropositive fraction were purified to homogeneity, then characterized by mass spectral analysis and partial Edman degradation. The tryptic and V8 digests contained only one seropositive peptide that was derived from the C-terminal region of the alpha-A molecule. To determine the exact boundaries of the epitope, various size analogues of this region were synthesized and probed with anti-alpha serum. Together, these studies demonstrate that the major continuous epitope of the alpha-A chain includes the sequence KPTSAPS, corresponding to residues 166-172 of the human alpha-A crystallin chain.
Systemic AA amyloidosis in the red fox (Vulpes vulpes).

PubMed

Rising, Anna; Cederlund, Ella; Palmberg, Carina; Uhlhorn, Henrik; Gaunitz, Stefan; Nordling, Kerstin; Ågren, Erik; Ihse, Elisabet; Westermark, Gunilla T; Tjernberg, Lars; Jörnvall, Hans; Johansson, Jan; Westermark, Per

2017-11-01

Amyloid A (AA) amyloidosis occurs spontaneously in many mammals and birds, but the prevalence varies considerably among different species, and even among subgroups of the same species. The Blue fox and the Gray fox seem to be resistant to the development of AA amyloidosis, while Island foxes have a high prevalence of the disease. Herein, we report on the identification of AA amyloidosis in the Red fox (Vulpes vulpes). Edman degradation and tandem MS analysis of proteolyzed amyloid protein revealed that the amyloid partly was composed of full-length SAA. Its amino acid sequence was determined and found to consist of 111 amino acid residues. Based on inter-species sequence comparisons we found four residue exchanges (Ser31, Lys63, Leu71, Lys72) between the Red and Blue fox SAAs. Lys63 seems unique to the Red fox SAA. We found no obvious explanation to how these exchanges might correlate with the reported differences in SAA amyloidogenicity. Furthermore, in contrast to fibrils from many other mammalian species, the isolated amyloid fibrils from Red fox did not seed AA amyloidosis in a mouse model. © 2017 The Protein Society.
A Novel Factor Xa-Inhibiting Peptide from Centipedes Venom.

PubMed

Kong, Yi; Shao, Yu; Chen, Hao; Ming, Xin; Wang, Jin-Bin; Li, Zhi-Yu; Wei, Ji-Fu

2013-01-01

Centipedes have been used as traditional medicine for thousands of years in China. Centipede venoms consist of many biochemical peptides and proteins. Factor Xa (FXa) is a serine endopeptidase that plays the key role in blood coagulation, and has been used as a new target for anti-thrombotic drug development. A novel FXa inhibitor, a natural peptide with the sequence of Thr-Asn-Gly-Tyr-Thr (TNGYT), was isolated from the venom of Scolopendra subspinipes mutilans using a combination of size-exclusion and reverse-phase chromatography. The molecular weight of the TNGYT peptide was 554.3 Da measured by electrospray ionization mass spectrometry. The amino acid sequence of TNGYT was determined by Edman degradation. TNGYT inhibited the activity of FXa in a dose-dependent manner with an IC 50 value of 41.14 mg/ml. It prolonged the partial thromboplastin time and prothrombin time in both in vitro and ex vivo assays. It also significantly prolonged whole blood clotting time and bleeding time in mice. This is the first report that an FXa inhibiting peptide was isolated from centipedes venom.
The primary structure of rat liver ribosomal protein L37. Homology with yeast and bacterial ribosomal proteins.

PubMed

Lin, A; McNally, J; Wool, I G

1983-09-10

The covalent structure of the rat liver 60 S ribosomal subunit protein L37 was determined. Twenty-four tryptic peptides were purified and the sequence of each was established; they accounted for all 111 residues of L37. The sequence of the first 30 residues of L37, obtained previously by automated Edman degradation of the intact protein, provided the alignment of the first 9 tryptic peptides. Three peptides (CN1, CN2, and CN3) were produced by cleavage of protein L37 with cyanogen bromide. The sequence of CN1 (65 residues) was established from the sequence of secondary peptides resulting from cleavage with trypsin and chymotrypsin. The sequence of CN1 in turn served to order tryptic peptides 1 through 14. The sequence of CN2 (15 residues) was determined entirely by a micromanual procedure and allowed the alignment of tryptic peptides 14 through 18. The sequence of the NH2-terminal 28 amino acids of CN3 (31 residues) was determined; in addition the complete sequences of the secondary tryptic and chymotryptic peptides were done. The sequence of CN3 provided the order of tryptic peptides 18 through 24. Thus the sequence of the three cyanogen bromide peptides also accounted for the 111 residues of protein L37. The carboxyl-terminal amino acids were identified after carboxypeptidase A treatment. There is a disulfide bridge between half-cystinyl residues at positions 40 and 69. Rat liver ribosomal protein L37 is homologous with yeast YP55 and with Escherichia coli L34. Moreover, there is a segment of 17 residues in rat L37 that occurs, albeit with modifications, in yeast YP55 and in E. coli S4, L20, and L34.
Mass fingerprinting of the venom and transcriptome of venom gland of scorpion Centruroides tecomanus.

PubMed

Valdez-Velázquez, Laura L; Quintero-Hernández, Verónica; Romero-Gutiérrez, Maria Teresa; Coronas, Fredy I V; Possani, Lourival D

2013-01-01

Centruroides tecomanus is a Mexican scorpion endemic of the State of Colima, that causes human fatalities. This communication describes a proteome analysis obtained from milked venom and a transcriptome analysis from a cDNA library constructed from two pairs of venom glands of this scorpion. High perfomance liquid chromatography separation of soluble venom produced 80 fractions, from which at least 104 individual components were identified by mass spectrometry analysis, showing to contain molecular masses from 259 to 44,392 Da. Most of these components are within the expected molecular masses for Na(+)- and K(+)-channel specific toxic peptides, supporting the clinical findings of intoxication, when humans are stung by this scorpion. From the cDNA library 162 clones were randomly chosen, from which 130 sequences of good quality were identified and were clustered in 28 contigs containing, each, two or more expressed sequence tags (EST) and 49 singlets with only one EST. Deduced amino acid sequence analysis from 53% of the total ESTs showed that 81% (24 sequences) are similar to known toxic peptides that affect Na(+)-channel activity, and 19% (7 unique sequences) are similar to K(+)-channel especific toxins. Out of the 31 sequences, at least 8 peptides were confirmed by direct Edman degradation, using components isolated directly from the venom. The remaining 19%, 4%, 4%, 15% and 5% of the ESTs correspond respectively to proteins involved in cellular processes, antimicrobial peptides, venom components, proteins without defined function and sequences without similarity in databases. Among the cloned genes are those similar to metalloproteinases.
Leptoglycin: a new Glycine/Leucine-rich antimicrobial peptide isolated from the skin secretion of the South American frog Leptodactylus pentadactylus (Leptodactylidae).

PubMed

Sousa, Juliana C; Berto, Raquel F; Gois, Elicélia A; Fontenele-Cardi, Nauíla C; Honório, José E R; Konno, Katsuhiro; Richardson, Michael; Rocha, Marcos F G; Camargo, Antônio A C M; Pimenta, Daniel C; Cardi, Bruno A; Carvalho, Krishnamurti M

2009-07-01

Antimicrobial peptides are components of innate immunity that is the first-line defense against invading pathogens for a wide range of organisms. Here, we describe the isolation, biological characterization and amino acid sequencing of a novel neutral Glycine/Leucine-rich antimicrobial peptide from skin secretion of Leptodactylus pentadactylus named leptoglycin. The amino acid sequence of the peptide purified by RP-HPLC (C(18) column) was deduced by mass spectrometric de novo sequencing and confirmed by Edman degradation: GLLGGLLGPLLGGGGGGGGGLL. Leptoglycin was able to inhibit the growth of Gram-negative bacteria Pseudomonas aeruginosa, Escherichia coli and Citrobacter freundii with minimal inhibitory concentrations (MICs) of 8 microM, 50 microM, and 75 microM respectively, but it did not show antimicrobial activity against Gram-positive bacteria (Staphylococcus aureus, Micrococcus luteus and Enterococcus faecalis), yeasts (Candida albicans and Candida tropicalis) and dermatophytes fungi (Microsporum canis and Trichophyton rubrum). No hemolytic activity was observed at the 2-200 microM range concentration. The amino acid sequence of leptoglycin with high level of glycine (59.1%) and leucine (36.4%) containing an unusual central proline suggests the existence of a new class of Gly/Leu-rich antimicrobial peptides. Taken together, these results suggest that this natural antimicrobial peptide could be a tool to develop new antibiotics.
Purification, characterization and molecular cloning of chymotrypsin inhibitor peptides from the venom of Burmese Daboia russelii siamensis.

PubMed

Guo, Chun-Teng; McClean, Stephen; Shaw, Chris; Rao, Ping-Fan; Ye, Ming-Yu; Bjourson, Anthony J

2013-05-01

One novel Kunitz BPTI-like peptide designated as BBPTI-1, with chymotrypsin inhibitory activity was identified from the venom of Burmese Daboia russelii siamensis. It was purified by three steps of chromatography including gel filtration, cation exchange and reversed phase. A partial N-terminal sequence of BBPTI-1, HDRPKFCYLPADPGECLAHMRSF was obtained by automated Edman degradation and a Ki value of 4.77nM determined. Cloning of BBPTI-1 including the open reading frame and 3' untranslated region was achieved from cDNA libraries derived from lyophilized venom using a 3' RACE strategy. In addition a cDNA sequence, designated as BBPTI-5, was also obtained. Alignment of cDNA sequences showed that BBPTI-5 exhibited an identical sequence to BBPTI-1 cDNA except for an eight nucleotide deletion in the open reading frame. Gene variations that represented deletions in the BBPTI-5 cDNA resulted in a novel protease inhibitor analog. Amino acid sequence alignment revealed that deduced peptides derived from cloning of their respective precursor cDNAs from libraries showed high similarity and homology with other Kunitz BPTI proteinase inhibitors. BBPTI-1 and BBPTI-5 consist of 60 and 66 amino acid residues respectively, including six conserved cysteine residues. As these peptides have been reported to have influence on the processes of coagulation, fibrinolysis and inflammation, their potential application in biomedical contexts warrants further investigation. Copyright © 2013 Elsevier Inc. All rights reserved.
Investigation of the protein osteocalcin of Camelops hesternus: Sequence, structure and phylogenetic implications

NASA Astrophysics Data System (ADS)

Humpula, James F.; Ostrom, Peggy H.; Gandhi, Hasand; Strahler, John R.; Walker, Angela K.; Stafford, Thomas W.; Smith, James J.; Voorhies, Michael R.; George Corner, R.; Andrews, Phillip C.

2007-12-01

Ancient DNA sequences offer an extraordinary opportunity to unravel the evolutionary history of ancient organisms. Protein sequences offer another reservoir of genetic information that has recently become tractable through the application of mass spectrometric techniques. The extent to which ancient protein sequences resolve phylogenetic relationships, however, has not been explored. We determined the osteocalcin amino acid sequence from the bone of an extinct Camelid (21 ka, Camelops hesternus) excavated from Isleta Cave, New Mexico and three bones of extant camelids: bactrian camel ( Camelus bactrianus); dromedary camel ( Camelus dromedarius) and guanaco ( Llama guanacoe) for a diagenetic and phylogenetic assessment. There was no difference in sequence among the four taxa. Structural attributes observed in both modern and ancient osteocalcin include a post-translation modification, Hyp 9, deamidation of Gln 35 and Gln 39, and oxidation of Met 36. Carbamylation of the N-terminus in ancient osteocalcin may result in blockage and explain previous difficulties in sequencing ancient proteins via Edman degradation. A phylogenetic analysis using osteocalcin sequences of 25 vertebrate taxa was conducted to explore osteocalcin protein evolution and the utility of osteocalcin sequences for delineating phylogenetic relationships. The maximum likelihood tree closely reflected generally recognized taxonomic relationships. For example, maximum likelihood analysis recovered rodents, birds and, within hominins, the Homo-Pan-Gorilla trichotomy. Within Artiodactyla, character state analysis showed that a substitution of Pro 4 for His 4 defines the Capra-Ovis clade within Artiodactyla. Homoplasy in our analysis indicated that osteocalcin evolution is not a perfect indicator of species evolution. Limited sequence availability prevented assigning functional significance to sequence changes. Our preliminary analysis of osteocalcin evolution represents an initial step towards a complete character analysis aimed at determining the evolutionary history of this functionally significant protein. We emphasize that ancient protein sequencing and phylogenetic analyses using amino acid sequences must pay close attention to post-translational modifications, amino acid substitutions due to diagenetic alteration and the impacts of isobaric amino acids on mass shifts and sequence alignments.
Detection of Proteins on Blot Membranes

PubMed Central

Goldman, Aaron; Harper, Sandra; Speicher, David W.

2017-01-01

Staining of blot membranes enables the visualization of bound proteins. Proteins are usually transferred to blot membranes by electroblotting, by direct spotting of protein solutions, or by contact blots. Staining allows the efficiency of transfer to the membrane to be monitored. This unit describes protocols for staining proteins after electroblotting from polyacrylamide gels to blot membranes such as polyvinylidene difluoride (PVDF), nitrocellulose, or nylon membranes. The same methods can be used if proteins are directly spotted, either manually or using robotics. Protocols are included for seven general protein stains (amido black, Coomassie blue, Ponceau S, colloidal gold, colloidal silver, India ink, and MemCode) and three fluorescent protein stains (fluorescamine, IAEDANS, and SYPRO Ruby). Also included is an in-depth discussion of the different blot membrane types and the compatibility of different protein stains with downstream applications, such as immunoblotting or N-terminal Edman sequencing. PMID:27801518
Detection of Proteins on Blot Membranes.

PubMed

Goldman, Aaron; Harper, Sandra; Speicher, David W

2016-11-01

Staining of blot membranes enables the visualization of bound proteins. Proteins are usually transferred to blot membranes by electroblotting, by direct spotting of protein solutions, or by contact blots. Staining allows the efficiency of transfer to the membrane to be monitored. This unit describes protocols for staining proteins after electroblotting from polyacrylamide gels to blot membranes such as polyvinylidene difluoride (PVDF), nitrocellulose, or nylon membranes. The same methods can be used if proteins are directly spotted, either manually or using robotics. Protocols are included for seven general protein stains (amido black, Coomassie blue, Ponceau S, colloidal gold, colloidal silver, India ink, and MemCode) and three fluorescent protein stains (fluorescamine, IAEDANS, and SYPRO Ruby). Also included is an in-depth discussion of the different blot membrane types and the compatibility of different protein stains with downstream applications, such as immunoblotting or N-terminal Edman sequencing. © 2016 by John Wiley & Sons, Inc. Copyright © 2016 John Wiley & Sons, Inc.
Frog secretions and hunting magic in the upper Amazon: identification of a peptide that interacts with an adenosine receptor.

PubMed Central

Daly, J W; Caceres, J; Moni, R W; Gusovsky, F; Moos, M; Seamon, K B; Milton, K; Myers, C W

1992-01-01

A frog used for "hunting magic" by several groups of Panoan-speaking Indians in the borderline between Brazil and Peru is identified as Phyllomedusa bicolor. This frog's skin secretion, which the Indians introduce into the body through fresh burns, is rich in peptides. These include vasoactive peptides, opioid peptides, and a peptide that we have named adenoregulin, with the sequence GLWSKIKEVGKEAAKAAAKAAGKAALGAVSEAV as determined from mass spectrometry and Edman degradation. The natural peptide may contain a D amino acid residue, since it is not identical in chromatographic properties to the synthetic peptide. Adenoregulin enhances binding of agonists to A1 adenosine receptors; it is accompanied in the skin secretion by peptides that inhibit binding. The vasoactive peptide sauvagine, the opioid peptides, and adenoregulin and related peptides affect behavior in mice and presumably contribute to the behavioral sequelae observed in humans. Images PMID:1438301
Frog secretions and hunting magic in the upper Amazon: identification of a peptide that interacts with an adenosine receptor.

PubMed

Daly, J W; Caceres, J; Moni, R W; Gusovsky, F; Moos, M; Seamon, K B; Milton, K; Myers, C W

1992-11-15

A frog used for "hunting magic" by several groups of Panoan-speaking Indians in the borderline between Brazil and Peru is identified as Phyllomedusa bicolor. This frog's skin secretion, which the Indians introduce into the body through fresh burns, is rich in peptides. These include vasoactive peptides, opioid peptides, and a peptide that we have named adenoregulin, with the sequence GLWSKIKEVGKEAAKAAAKAAGKAALGAVSEAV as determined from mass spectrometry and Edman degradation. The natural peptide may contain a D amino acid residue, since it is not identical in chromatographic properties to the synthetic peptide. Adenoregulin enhances binding of agonists to A1 adenosine receptors; it is accompanied in the skin secretion by peptides that inhibit binding. The vasoactive peptide sauvagine, the opioid peptides, and adenoregulin and related peptides affect behavior in mice and presumably contribute to the behavioral sequelae observed in humans.
Antifungal mechanism of a novel antifungal protein from pumpkin rinds against various fungal pathogens.

PubMed

Park, Seong-Cheol; Kim, Jin-Young; Lee, Jong-Kook; Hwang, Indeok; Cheong, Hyeonsook; Nah, Jae-Woon; Hahm, Kyung-Soo; Park, Yoonkyung

2009-10-14

A novel antifungal protein (Pr-2) was identified from pumpkin rinds using water-soluble extraction, ultrafiltration, cation exchange chromatography, and reverse-phase high-performance liquid chromatography. Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry indicated that the protein had a molecular mass of 14865.57 Da. Automated Edman degradation showed that the N-terminal sequence of Pr-2 was QGIGVGDNDGKRGKR-. The Pr-2 protein strongly inhibited in vitro growth of Botrytis cinerea, Colletotrichum coccodes, Fusarium solani, Fusarium oxysporum, and Trichoderma harzianum at 10-20 microM. The results of confocal laser scanning microscopy and SYTOX Green uptake demonstrated that its effective region was the membrane of the fungal cell surface. In addition, this protein was found to be noncytotoxic and heat-stable. Taken together, the results of this study indicate that Pr-2 is a good candidate for use as a natural antifungal agent.
Streptococcal phosphoenolpyruvate-sugar phosphotransferase system: amino acid sequence and site of ATP-dependent phosphorylation of HPr

DOE Office of Scientific and Technical Information (OSTI.GOV)

Deutscher, J.; Pevec, B.; Beyreuther, K.

1986-10-21

The amino acid sequence of histidine-containing protein (HPr) from Streptococcus faecalis has been determined by direct Edman degradation of intact HPr and by amino acid sequence analysis of tryptic peptides, V8 proteolyptic peptides, thermolytic peptides, and cyanogen bromide cleavage products. HPr from S. faecalis was found to contain 89 amino acid residues, corresponding to a molecular weight of 9438. The amino acid sequence of HPr from S. faecalis shows extended homology to the primary structure of HPr proteins from other bacteria. Besides the phosphoenolpyruvate-dependent phosphorylation of a histidyl residue in HPr, catalyzed by enzyme I of the bacterial phosphotransferase system,more » HPr was also found to be phosphorylated at a seryl residue in an ATP-dependent protein kinase catalyzed reaction. The site of ATP-dependent phosphorylation in HPr of S faecalis has now been determined. (/sup 32/P)P-Ser-HPr was digested with three different proteases, and in each case, a single labeled peptide was isolated. Following digestion with subtilisin, they obtained a peptide with the sequence -(P)Ser-Ile-Met-. Using chymotrypsin, they isolated a peptide with the sequence -Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-Gly-Val-Met-. The longest labeled peptide was obtained with V8 staphylococcal protease. According to amino acid analysis, this peptide contained 36 out of the 89 amino acid residues of HPr. The following sequence of 12 amino acid residues of the V8 peptide was determined: -Tyr-Lys-Gly-Lys-Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-. Thus, the site of ATP-dependent phosphorylation was determined to be Ser-46 within the primary structure of HPr.« less
An EThcD-Based Method for Discrimination of Leucine and Isoleucine Residues in Tryptic Peptides

NASA Astrophysics Data System (ADS)

Zhokhov, Sergey S.; Kovalyov, Sergey V.; Samgina, Tatiana Yu.; Lebedev, Albert T.

2017-08-01

An EThcD-based approach for the reliable discrimination of isomeric leucine and isoleucine residues in peptide de novo sequencing procedure has been proposed. A multistage fragmentation of peptide ions was performed with Orbitrap Elite mass spectrometer in electrospray ionization mode. At the first stage, z-ions were produced by ETD or ETcaD fragmentation of doubly or triply charged peptide precursor ions. These primary ions were further fragmented by HCD with broad-band ion isolation, and the resulting w-ions showed different mass for leucine and isoleucine residues. The procedure did not require manual isolation of specific z-ions prior to HCD stage. Forty-three tryptic peptides (3 to 27 residues) obtained by trypsinolysis of human serum albumin (HSA) and gp188 protein were analyzed. To demonstrate a proper solution for radical site migration problem, three non-tryptic peptides were also analyzed. A total of 93 leucine and isoleucine residues were considered and 83 of them were correctly identified. The developed approach can be a reasonable substitution for additional Edman degradation procedure, which is still used in peptide sequencing for leucine and isoleucine discrimination.
Definition of the HLA-A29 peptide ligand motif allows prediction of potential T-cell epitopes from the retinal soluble antigen, a candidate autoantigen in birdshot retinopathy.

PubMed Central

Boisgerault, F; Khalil, I; Tieng, V; Connan, F; Tabary, T; Cohen, J H; Choppin, J; Charron, D; Toubert, A

1996-01-01

The peptide-binding motif of HLA-A29, the predisposing allele for birdshot retinopathy, was determined after acid-elution of endogenous peptides from purified HLA-A29 molecules. Individual and pooled HPLC fractions were sequenced by Edman degradation. Major anchor residues could be defined as glutamate at the second position of the peptide and as tyrosine at the carboxyl terminus. In vitro binding of polyglycine synthetic peptides to purified HLA-A29 molecules also revealed the need for an auxiliary anchor residue at the third position, preferably phenylalanine. By using this motif, we synthesized six peptides from the retinal soluble antigen, a candidate autoantigen in autoimmune uveoretinitis. Their in vitro binding was tested on HLA-A29 and also on HLA-B44 and HLA-B61, two alleles sharing close peptide-binding motifs. Two peptides derived from the carboxyl-terminal sequence of the human retinal soluble antigen bound efficiently to HLA-A29. This study could contribute to the prediction of T-cell epitopes from retinal autoantigens implicated in birdshot retinopathy. PMID:8622959
[Purification, characterization and partial primary structure analysis of rutin-degrading enzyme in tartary buckwheat seeds].

PubMed

Zhang, Yuwei; Li, Jie; Yuan, Yong; Gu, Jijuan; Chen, Peng

2017-05-25

Rutin-degrading enzymes (RDE) can degrade rutin into poorly water soluble compound, quercetin, and cause the bitter taste in tartary buckwheat. In the present study RDE from Yu 6-21 tartary buckwheat seeds was purified by ammonium sulphate precipitation, followed by hydrophobic interaction chromatography on Phenyl Sepharose CL-4B, ion exchange chromatography on CM-Cellulose and gel filtration chromatography on Sephadex G-150. Purified RDE showed single band with molecular weight of 66 kDa on SDS-PAGE. The optimum pH and temperature of RDE were 5.0 and 50 ℃ respectively. The Km was 0.27 mmol/L, and the Vmax was 39.68 U/mg. The RDE activity could be inhibited by Cu²⁺, Zn²⁺, Mn²⁺ and EDTA, and showed tolerance to 50% methanol (V/V). The N terminal sequence (TVSRSSFPDGFLFGL) was obtained by Edman degradation method and 15 internal peptide sequences were determined by MALDI-TOF-MS (matrix-assisted laser desorption ionization time of flight mass spectrometry). These results established the foundations for identification of the candidate gene of RDE via transcriptome data and further studying RDE biological function.

Structural and biological characterization of three novel mastoparan peptides from the venom of the neotropical social wasp Protopolybia exigua (Saussure).

PubMed

Mendes, Maria Anita; de Souza, Bibiana Monson; Palma, Mario Sergio

2005-01-01

The venom of the Neotropical social wasp Protopolybia exigua(Saussure) was fractionated by RP-HPLC resulting in the elution of 20 fractions. The homogeneity of the preparations were checked out by using ESI-MS analysis and the fractions 15, 17 and 19 (eluted at the most hydrophobic conditions) were enough pure to be sequenced by Edman degradation chemistry, resulting in the following sequences: Protopolybia MPI I-N-W-L-K-L-G-K-K-V-S-A-I-L-NH2 Protopolybia-MP II I-N-W-K-A-I-I-E-A-A-K-Q-A-L-NH2 Protopolybia-MP III I-N-W-L-K-L-G-K-A-V-I-D-A-L-NH2 All the peptides were manually synthesized on-solid phase and functionally characterized. Protopolybia-MP I is a hemolytic mastoparan, probably acting on mast cells by assembling in plasma membrane, resulting in pore formation; meanwhile, the peptides Protopolybia-MP II and -MP III were characterized as a non-hemolytic mast cell degranulator toxins, which apparently act by virtue of their binding to G-protein receptor, activating the mast cell degranulation.
A novel cysteine-rich antifungal peptide ToAMP4 from Taraxacum officinale Wigg. flowers.

PubMed

Astafieva, A A; Rogozhin, Eugene A; Andreev, Yaroslav A; Odintsova, T I; Kozlov, S A; Grishin, Eugene V; Egorov, Tsezi A

2013-09-01

A novel peptide named ToAMP4 was isolated from Taraxacum officinale Wigg. flowers by a combination of acetic acid extraction and different types of chromatography: affinity, size-exclusion, and RP-HPLC. The amino acid sequence of ToAMP4 was determined by automated Edman degradation. The peptide is basic, consists of 41 amino acids, and incorporates three disulphide bonds. Due to the unusual cysteine spacing pattern, ToAMP4 does not belong to any known plant AMP family, but classifies together with two other antimicrobial peptides ToAMP1 and ToAMP2 previously isolated from the dandelion flowers. To study the biological activity of ToAMP4, it was successfully produced in a prokaryotic expression system as a fusion protein with thioredoxin. The recombinant peptide was shown to be identical to the native ToAMP4 by chromatographic behavior, molecular mass, and N-terminal amino acid sequence. The peptide displays broad-spectrum antifungal activity against important phytopathogens. Two ToAMP4-mediated inhibition strategies depending on the fungus were demonstrated. The results obtained add to our knowledge on the structural and functional diversity of AMPs in plants. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
Membrane fractions active in poliovirus RNA replication contain VPg precursor polypeptides

DOE Office of Scientific and Technical Information (OSTI.GOV)

Takegami, T.; Semler, B.L.; Anderson, C.W.

1983-01-01

The poliovirus specific polypeptide P3-9 is of special interest for studies of viral RNA replication because it contains a hydrophobic region and, separated by only seven amino acids from that region, the amino acid sequence of the genome-linked protein VPg. Membraneous complexes of poliovirus-infected HeLa cells that contain poliovirus RNA replicating proteins have been analyzed for the presence of P3-9 by immunoprecipitation. Incubation of a membrane fraction rich in P3-9 with proteinase leaves the C-terminal 69 amino acids of P3-9 intact, an observation suggesting that this portion is protected by its association with the cellular membrane. These studies have alsomore » revealed two hitherto undescribed viral polypeptides consisting of amino acid sequences of the P2 andf P3 regions of the polyprotein. Sequence analysis by stepwise Edman degradation show that these proteins are 3b/9 (M/sub r/77,000) and X/9 (M/sub r/50,000). 3b/9 and X/9 are membrane bound and are turned over rapidly and may be direct precursors to proteins P2-X and P3-9 of the RNA replication complex. P2-X, a polypeptide void of hydrophobic amino acid sequences but also found associated with membranes, is rapidly degraded when the membraneous complex is treated with trypsin. It is speculated that P2-X is associated with membranes by its affinity to the N-terminus of P3-9.« less
A new cofactor in prokaryotic enzyme: Tryptophan tryptophylquinone as the redox prosthetic group in methylamine dehydrogenase

DOE Office of Scientific and Technical Information (OSTI.GOV)

McIntire, W.S.; Wemmer, D.E.; Chistoserdov, A.

Methylamine dehydrogenase (MADH), an {alpha}{sub 2}{beta}{sub 2} enzyme from numerous methylotrophic soil bacteria, contains a novel quinonoid redox prosthetic group that is covalently bound to its small {beta} subunit through two amino acyl residues. A comparison of the amino acid sequence deduced from the gene sequence of the small subunit for the enzyme from Methylobacterium extorquens AM1 with the published amino acid sequence obtained by Edman degradation method, allowed the identification of the amino acyl constituents of the cofactor as two tryptophyl residues. This information was crucial for interpreting {sup 1}H and {sup 13}C nuclear magnetic resonance, and mass spectralmore » data collected for the semicarbazide- and carboxymethyl-derivatized bis(tripeptidyl)-cofactor of MADH from bacterium W3A1. The cofactor is composed of two cross-linked tryptophyl residues. Although there are many possible isomers, only one is consistent with all the data: The first tryptophyl residue in the peptide sequence exists as an indole-6,7-dione, and is attached at its 4 position to the 2 position of the second, otherwise unmodified, indole side group. Contrary to earlier reports, the cofactor of MADH is not 2,7,9-tricarboxypyrroloquinoline quinone (PQQ), a derivative thereof, of pro-PQQ. This appears to be the only example of two cross-linked, modified amino acyl residues having a functional role in the active site of an enzyme, in the absence of other cofactors or metal ions.« less
Echinoderm phosphorylated matrix proteins UTMP16 and UTMP19 have different functions in sea urchin tooth mineralization.

PubMed

Alvares, Keith; Dixit, Saryu N; Lux, Elizabeth; Veis, Arthur

2009-09-18

Studies of mineralization of embryonic spicules and of the sea urchin genome have identified several putative mineralization-related proteins. These predicted proteins have not been isolated or confirmed in mature mineralized tissues. Mature Lytechinus variegatus teeth were demineralized with 0.6 N HCl after prior removal of non-mineralized constituents with 4.0 M guanidinium HCl. The HCl-extracted proteins were fractionated on ceramic hydroxyapatite and separated into bound and unbound pools. Gel electrophoresis compared the protein distributions. The differentially present bands were purified and digested with trypsin, and the tryptic peptides were separated by high pressure liquid chromatography. NH2-terminal sequences were determined by Edman degradation and compared with the genomic sequence bank data. Two of the putative mineralization-related proteins were found. Their complete amino acid sequences were cloned from our L. variegatus cDNA library. Apatite-binding UTMP16 was found to be present in two isoforms; both isoforms had a signal sequence, a Ser-Asp-rich extracellular matrix domain, and a transmembrane and cytosolic insertion sequence. UTMP19, although rich in Glu and Thr did not bind to apatite. It had neither signal peptide nor transmembrane domain but did have typical nuclear localization and nuclear exit signal sequences. Both proteins were phosphorylated and good substrates for phosphatase. Immunolocalization studies with anti-UTMP16 show it to concentrate at the syncytial membranes in contact with the mineral. On the basis of our TOF-SIMS analyses of magnesium ion and Asp mapping of the mineral phase composition, we speculate that UTMP16 may be important in establishing the high magnesium columns that fuse the calcite plates together to enhance the mechanical strength of the mineralized tooth.
Two new 4-Cys conotoxins (framework 14) of the vermivorous snail Conus austini from the Gulf of Mexico with activity in the central nervous system of mice

PubMed Central

Zugasti-Cruz, Alejandro; Falcón, Andrés; Heimer de la Cotera, Edgar P.; Olivera, Baldomero M.; Aguilar, Manuel B.

2008-01-01

As part of continuing studies of the venom components present in Conus austini (syn.: Conus cancellatus), a vermivorous cone snail collected in the western Gulf of Mexico, Mexico, two major peptides, as14a and as14b, were purified and characterized. Their amino acid sequences were determined by automatic Edman sequencing after reduction and alkylation. Their molecular masses, established by matrix-assisted laser desorption ionization time-of-flight mass spectrometry, confirmed the chemical analyses and indicated that as14a and as14b have free C-termini. Each peptide contains four Cys residues arranged in a pattern (C-C-C-C, framework 14). The primary structure of as14a is GGVGRCIYNCMNSGGGLNFIQCKTMCY (experimental monoisotopic mass 2,883.92 Da; calculated monoisotopic mass 2,884.20 Da), whereas that of as14b is RWDVDQCIYYCLNGVVGYSYTECQTMCT (experimental monoisotopic mass 3,308.63 Da; calculated monoisotopic mass 3,308.34 Da). Both purified peptides elicited scratching and grooming activity in mice, and as14b also caused body and rear limb extension and tail curling immediately upon injection. The high sequence similarity of peptide as14a with peptide vil14a from the vermivorous C. villepinii suggests that the former might block K+ channels. PMID:18206266
PhcrTx2, a New Crab-Paralyzing Peptide Toxin from the Sea Anemone Phymanthus crucifer

PubMed Central

Garateix, Anoland; Salceda, Emilio; Zaharenko, André Junqueira; Pons, Tirso; Santos, Yúlica; Arreguín, Roberto; Ständker, Ludger; Forssmann, Wolf-Georg; Tytgat, Jan; Vega, Rosario

2018-01-01

Sea anemones produce proteinaceous toxins for predation and defense, including peptide toxins that act on a large variety of ion channels of pharmacological and biomedical interest. Phymanthus crucifer is commonly found in the Caribbean Sea; however, the chemical structure and biological activity of its toxins remain unknown, with the exception of PhcrTx1, an acid-sensing ion channel (ASIC) inhibitor. Therefore, in the present work, we focused on the isolation and characterization of new P. crucifer toxins by chromatographic fractionation, followed by a toxicity screening on crabs, an evaluation of ion channels, and sequence analysis. Five groups of toxic chromatographic fractions were found, and a new paralyzing toxin was purified and named PhcrTx2. The toxin inhibited glutamate-gated currents in snail neurons (maximum inhibition of 35%, IC50 4.7 µM), and displayed little or no influence on voltage-sensitive sodium/potassium channels in snail and rat dorsal root ganglion (DRG) neurons, nor on a variety of cloned voltage-gated ion channels. The toxin sequence was fully elucidated by Edman degradation. PhcrTx2 is a new β-defensin-fold peptide that shares a sequence similarity to type 3 potassium channels toxins. However, its low activity on the evaluated ion channels suggests that its molecular target remains unknown. PhcrTx2 is the first known paralyzing toxin in the family Phymanthidae. PMID:29414882
Glycosaminoglycan Chain of Dentin Sialoprotein Proteoglycan

PubMed Central

Zhu, Q.; Sun, Y.; Prasad, M.; Wang, X.; Yamoah, A.K.; Li, Y.; Feng, J.; Qin, C.

2010-01-01

Dentin sialophosphoprotein (DSPP) is processed into dentin sialoprotein (DSP) and dentin phosphoprotein. A molecular variant of rat DSP, referred to as “HMW-DSP”, has been speculated to be a proteoglycan form of DSP. To determine if HMW-DSP is the proteoglycan form of DSP and to identify the glycosaminoglycan side-chain attachment site(s), we further characterized HMW-DSP. Chondroitinase ABC treatment reduced the migration rate for portions of rat HMW-DSP to the level of DSP. Disaccharide analysis showed that rat HMW-DSP contains glycosaminoglycan chains made of chondroitin-4-sulfate and has an average of 31-32 disaccharides/mol. These observations confirmed that HMW-DSP is the proteoglycan form of DSP (renamed “DSP-PG”). Edman degradation and mass spectrometric analyses of tryptic peptides from rat DSP-PG, along with substitution analyses of candidate Ser residues in mouse DSPP, confirmed that 2 glycosaminoglycan chains are attached to Ser241 and Ser253 in the rat, or Ser242 and Ser254 in the mouse DSPP sequence. PMID:20400719
Glycosaminoglycan chain of dentin sialoprotein proteoglycan.

PubMed

Zhu, Q; Sun, Y; Prasad, M; Wang, X; Yamoah, A K; Li, Y; Feng, J; Qin, C

2010-08-01

Dentin sialophosphoprotein (DSPP) is processed into dentin sialoprotein (DSP) and dentin phosphoprotein. A molecular variant of rat DSP, referred to as "HMW-DSP", has been speculated to be a proteoglycan form of DSP. To determine if HMW-DSP is the proteoglycan form of DSP and to identify the glycosaminoglycan side-chain attachment site(s), we further characterized HMW-DSP. Chondroitinase ABC treatment reduced the migration rate for portions of rat HMW-DSP to the level of DSP. Disaccharide analysis showed that rat HMW-DSP contains glycosaminoglycan chains made of chondroitin-4-sulfate and has an average of 31-32 disaccharides/mol. These observations confirmed that HMW-DSP is the proteoglycan form of DSP (renamed "DSP-PG"). Edman degradation and mass spectrometric analyses of tryptic peptides from rat DSP-PG, along with substitution analyses of candidate Ser residues in mouse DSPP, confirmed that 2 glycosaminoglycan chains are attached to Ser(241) and Ser(253) in the rat, or Ser(242) and Ser(254) in the mouse DSPP sequence.
Structural characterization of osmoregulator peptides from the brain of the leech Theromyzon tessulatum: IPEPYVWD and IPEPYVWD-amide.

PubMed

Salzet, M; Vandenbulcke, F; Verger-Bocquet, M

1996-12-31

Neurons immunoreactive to an antiserum (a-OT) directed specifically against the C-terminal part (prolyl-leucyl-glycinamide) of vertebrate oxytocin (OT) were detected in the brain of the leech Theromyzon tessulatum. With high pressure gel permeation chromatography followed by reversed-phase HPLC on brain extracts, evidence was given of the presence of three peptides (P1, P2, P3) immunoreactive to a-OT. Results of injection experiments in T. tessulatum and of titrations of each peptide at the different physiological stages of the animals which showed a peak in peptide P1 amount at stage 3B, indicated that P1 is the active OT-like peptide. Using three steps of reversed-phase HPLC, Edman degradation and electrospray mass spectrometry, two sequences for P1 (IPEPYVWD and IPEPYVWD-amide) were found. These peptides differ from peptides to the oxytocin/vasopressin family and are unique in the animal kingdom. Confirmation of their action on the hydric balance and their distribution in the CNS were presented.
Lacticin LC14, a new bacteriocin produced by Lactococcus lactis BMG6.14: isolation, purification and partial characterization.

PubMed

Lasta, Samar; Ouzari, Hadda; Andreotti, Nicolas; Fajloun, Ziad; Mansuelle, Pascal; Boudabous, Abdellatif; Sampieri, Francois; Sabatier, Jean Marc

2012-08-01

A new bacteriocin, lacticin LC14, produced by Lactococcus lactis BMG6.14, was isolated and characterized. It was purified to homogeneity from overnight broth culture by ammonium sulfate precipitation, Sep-Pak chromatography, and two steps of reversed-phase HPLC. Lacticin LC14 showed bactericidal-type antimicrobial activity against several lactic acid bacteria and pathogenic strains including Listeria monocytogenes. It was inactivated by proteinase K and pronase E, but was resistant to papain, lysozyme, lipase and catalase. Lacticin LC14 was heat resistant, stable over a wide range of pH (2-10) and after treatment by solvents and detergents. Its N-terminal end was found unreactive towards Edman sequencing. Based on MALDI-TOF mass spectrometry, its molecular mass was 3333.7 Da. LC14 amino acid composition revealed a high proportion of hydrophobic residues, but no modified ones. LC14 may be able to challenge other well known other bacteriocins in probiotic and therapeutic applications.
[The primary structure of the alpha-amylase inhibitor Hoe 467A from Streptomyces tendae 4158. A new class of inhibitors].

PubMed

Aschauer, H; Vértesy, L; Nesemann, G; Braunitzer, G

1983-10-01

The native or modified alpha-amylase inhibitor Hoe 467A - isolated from the culture medium of Streptomyces tendae 4158 - and overlapping peptides were degraded by the automatic Edman technique. The oxidized or aminoethylated or oxidized and maleoylated inhibitor was digested with trypsin and the native inhibitor with pepsin. Further digestion with Staphylococcus aureus proteinase was also carried out. After peptic digestion two cystin peptides were isolated, which allowed the establishment of the disulfide bonds. The alpha-amylase inhibitor is a polypeptid consisting of 74 amino-acid residues with a molecular mass of 7958 Da. The inhibitor is composed of all naturally occurring amino acids except methionine and phenylalanine and shows no sequence homology to known inhibitors. The clinical and pharmacological importance in respect to the inhibitors ability for inactivation of human salivary and pancreatic alpha-amylase is discussed. Especially the proteinase resistance of the inhibitor enables a clinical application in human (e.g. Diabetes mellitus) per os.
A Low Molecular Weight Protein from the Sea Anemone Anemonia viridis with an Anti-Angiogenic Activity

PubMed Central

Loret, Erwann P.; Luis, José; Nuccio, Christopher; Villard, Claude; Mansuelle, Pascal; Lebrun, Régine; Villard, Pierre Henri

2018-01-01

Sea anemones are a remarkable source of active principles due to a decentralized venom system. New blood vessel growth or angiogenesis is a very promising target against cancer, but the few available antiangiogenic compounds have limited efficacy. In this study, a protein fraction, purified from tentacles of Anemonia viridis, was able to limit endothelial cells proliferation and angiogenesis at low concentration (14 nM). Protein sequences were determined with Edman degradation and mass spectrometry in source decay and revealed homologies with Blood Depressing Substance (BDS) sea anemones. The presence of a two-turn alpha helix observed with circular dichroism and a trypsin activity inhibition suggested that the active principle could be a Kunitz-type inhibitor, which may interact with an integrin due to an Arginine Glycin Aspartate (RGD) motif. Molecular modeling showed that this RGD motif was well exposed to solvent. This active principle could improve antiangiogenic therapy from existing antiangiogenic compounds binding on the Vascular Endothelial Growth Factor (VEGF). PMID:29671760
Purification and characterization of a strong fibrinolytic enzyme (nattokinase) in the vegetable cheese natto, a popular soybean fermented food in Japan.

PubMed

Fujita, M; Nomura, K; Hong, K; Ito, Y; Asada, A; Nishimuro, S

1993-12-30

A strong fibrinolytic enzyme (nattokinase) was purified from the vegetable cheese natto. Nattokinase was extracted from natto with saline and isolated by sequential use of hydrophobic chromatography on Butyl-Toyopearl, ion-exchange chromatography on CM-Toyopearl, and gel-filtration on Sephadex G-50. The isolated protein gave a single sharp band on SDS-PAGE either before or after reduction. The sequence, as determined by automated Edman degradation of the uncleaved molecule and its enzymatically derived peptide, consisted of a total 275 amino acid residues (M.W = 27,728) and exhibited a high homology with the subtilisins. The purified nattokinase digested not only fibrin but also several synthetic substrates. Among the synthetic substrates, the most sensitive substrate was Suc-Ala-Ala-Pro-Phe-pNA for subtilisin. PMSF inhibited both the fibrinolytic activity and the amidolytic activity. The results indicate that nattokinase is a subtilisin-like serine protease.
Characterization of the novel antifungal protein PgAFP and the encoding gene of Penicillium chrysogenum.

PubMed

Rodríguez-Martín, Andrea; Acosta, Raquel; Liddell, Susan; Núñez, Félix; Benito, M José; Asensio, Miguel A

2010-04-01

The strain RP42C from Penicillium chrysogenum produces a small protein PgAFP that inhibits the growth of some toxigenic molds. The molecular mass of the protein determined by electrospray ionization mass spectrometry (ESI-MS) was 6 494Da. PgAFP showed a cationic character with an estimated pI value of 9.22. Upon chemical and enzymatic treatments of PgAFP, no evidence for N- or O-glycosylations was obtained. Five partial sequences of PgAFP were obtained by Edman degradation and by ESI-MS/MS after trypsin and chymotrypsin digestions. Using degenerate primers from these peptide sequences, a segment of 70bp was amplified by PCR from pgafp gene. 5'- and 3'-ends of pgafp were obtained by RACE-PCR with gene-specific primers designed from the 70bp segment. The complete pgafp sequence of 404bp was obtained using primers designed from 5'- and 3'-ends. Comparison of genomic and cDNA sequences revealed a 279bp coding region interrupted by two introns of 63 and 62bp. The precursor of the antifungal protein consists of 92 amino acids and appears to be processed to the mature 58 amino acids PgAFP. The deduced amino acid sequence of the mature protein shares 79% identity to the antifungal protein Anafp from Aspergillus niger. PgAFP is a new protein that belongs to the group of small, cysteine-rich, and basic proteins with antifungal activity produced by ascomycetes. Given that P. chrysogenum is regarded as safe mold commonly found in foods, PgAFP may be useful to prevent growth of toxigenic molds in food and agricultural products. Copyright (c) 2009 Elsevier Inc. All rights reserved.
Characterization of the Organic Component of Low-Molecular-Weight Chromium-Binding Substance and Its Binding of Chromium123

PubMed Central

Chen, Yuan; Watson, Heather M.; Gao, Junjie; Sinha, Sarmistha Halder; Cassady, Carolyn J.; Vincent, John B.

2011-01-01

Chromium was proposed to be an essential element over 50 y ago and was shown to have therapeutic potential in treating the symptoms of type 2 diabetes; however, its mechanism of action at a molecular level is unknown. One chromium-binding biomolecule, low-molecular weight chromium-binding substance (LMWCr or chromodulin), has been found to be biologically active in in vitro assays and proposed as a potential candidate for the in vivo biologically active form of chromium. Characterization of the organic component of LMWCr has proven difficult. Treating bovine LMWCr with trifluoroacetic acid followed by purification on a graphite powder micro-column generates a heptapeptide fragment of LMWCr. The peptide sequence of the fragment was analyzed by MS and tandem MS (MS/MS and MS/MS/MS) using collision-induced dissociation and post-source decay. Two candidate sequences, pEEEEGDD and pEEEGEDD (where pE is pyroglutamate), were identified from the MS/MS experiments; additional tandem MS suggests the sequence is pEEEEGDD. The N-terminal glutamate residues explain the inability to sequence LMWCr by the Edman method. Langmuir isotherms and Hill plots were used to analyze the binding constants of chromic ions to synthetic peptides similar in composition to apoLMWCr. The sequence pEEEEGDD was found to bind 4 chromic ions per peptide with nearly identical cooperativity and binding constants to those of apoLMWCr. This work should lead to further studies elucidating or eliminating a potential role for LMWCr in treating the symptoms of type 2 diabetes and other conditions resulting from improper carbohydrate and lipid metabolism. PMID:21593351
Biochemical and molecular characterization of the venom from the Cuban scorpion Rhopalurus junceus.

PubMed

García-Gómez, B I; Coronas, F I V; Restano-Cassulini, R; Rodríguez, R R; Possani, L D

2011-07-01

This communication describes the first general biochemical, molecular and functional characterization of the venom from the Cuban blue scorpion Rhopalurus junceus, which is often used as a natural product for anti-cancer therapy in Cuba. The soluble venom of this arachnid is not toxic to mice, injected intraperitoneally at doses up to 200 μg/20 g body weight, but it is deadly to insects at doses of 10 μg per animal. The venom causes typical alpha and beta-effects on Na+ channels, when assayed using patch-clamp techniques in neuroblastoma cells in vitro. It also affects K+ currents conducted by ERG (ether-a-go-go related gene) channels. The soluble venom was shown to display phospholipase, hyaluronidase and anti-microbial activities. High performance liquid chromatography of the soluble venom can separate at least 50 components, among which are peptides lethal to crickets. Four such peptides were isolated to homogeneity and their molecular masses and N-terminal amino acid sequence were determined. The major component (RjAa12f) was fully sequenced by Edman degradation. It contains 64 amino acid residues and four disulfide bridges, similar to other known scorpion toxins. A cDNA library prepared from the venomous glands of one scorpion allowed cloning 18 genes that code for peptides of the venom, including RjA12f and eleven other closely related genes. Sequence analyses and phylogenetic reconstruction of the amino acid sequences deduced from the cloned genes showed that this scorpion contains sodium channel like toxin sequences clearly segregated into two monophyletic clusters. Considering the complex set of effects on Na+ currents verified here, this venom certainly warrant further investigation. Copyright © 2011 Elsevier Ltd. All rights reserved.
Carolina View, Vol II, 1986.

ERIC Educational Resources Information Center

Davis, Char W., Ed.; Small, LaVeta T., Ed.

1986-01-01

Diverse issues in higher education are addressed in 19 articles. Titles and authors are as follows: "2001: Formulation of a Vision" (Kenneth L. Schwab); Trustees' Roles and Student Issues" (Davis Powers); "What Do Students and Faculty Talk about When They Eat Meals Together?" (John H. Schuh, Neal Edman); "Student…
Purification and complete amino acid sequence of a new type of sweet protein taste-modifying activity, curculin.

PubMed

Yamashita, H; Theerasilp, S; Aiuchi, T; Nakaya, K; Nakamura, Y; Kurihara, Y

1990-09-15

A new taste-modifying protein named curculin was extracted with 0.5 M NaCl from the fruits of Curculigo latifolia and purified by ammonium sulfate fractionation, CM-Sepharose ion-exchange chromatography, and gel filtration. Purified curculin thus obtained gave a single band having a Mr of 12,000 on sodium dodecyl sulfate-polyacrylamide gel electrophoresis in the presence of 8 M urea. The molecular weight determined by low-angle laser light scattering was 27,800. These results suggest that native curculin is a dimer of a 12,000-Da polypeptide. The complete amino acid sequence of curculin was determined by automatic Edman degradation. Curculin consists of 114 residues. Curculin itself elicits a sweet taste. After curculin, water elicits a sweet taste, and sour substances induce a stronger sense of sweetness. No protein with both sweet-tasting and taste-modifying activities has ever been found. There are five sets of tripeptides common to miraculin (a taste-modifying protein), six sets of tripeptides common to thaumatin (a sweet protein), and two sets of tripeptides common to monellin (a sweet protein). Anti-miraculin serum was not immunologically reactive with curculin. The mechanism of the taste-modifying action of curculin is discussed.
Comparative proteomic analysis of male and female venoms from the Cuban scorpion Rhopalurus junceus.

PubMed

Rodríguez-Ravelo, Rodolfo; Batista, Cesar V F; Coronas, Fredy I V; Zamudio, Fernando Z; Hernández-Orihuela, Lorena; Espinosa-López, Georgina; Ruiz-Urquiola, Ariel; Possani, Lourival D

2015-12-01

A complete mass spectrometry analysis of venom components from male and female scorpions of the species Rhophalurus junceus of Cuba is reported. In the order of 200 individual molecular masses were identified in both venoms, from which 63 are identical in male and females genders. It means that a significant difference of venom components exists between individuals of different sexes, but the most abundant components are present in both sexes. The relative abundance of identical components is different among the genders. Three well defined groups of different peptides were separated and identified. The first group corresponds to peptides with molecular masses of 1000-2000 Da; the second to peptides with 3500-4500 Da molecular weight, and the third with 6500-8000 Da molecular weights. A total of 86 peptides rich in disulfide bridges were found in the venoms, 27 with three disulfide bridges and 59 with four disulfide bridges. LC-MS/MS analysis allowed the identification and amino acid sequence determination of 31 novel peptides in male venom. Two new putative K(+)-channel peptides were sequences by Edman degradation. They contain 37 amino acid residues, packed by three disulfide bridges and were assigned the systematic numbers: α-KTx 1.18 and α-KTx 2.15. Copyright © 2015 Elsevier Ltd. All rights reserved.

Purification and Molecular Characterization of the Novel Highly Potent Bacteriocin TSU4 Produced by Lactobacillus animalis TSU4.

PubMed

Sahoo, Tapasa Kumar; Jena, Prasant Kumar; Patel, Amiya Kumar; Seshadri, Sriram

2015-09-01

Bacterial infections causing fish diseases and spoilage during fish food processing and storage are major concerns in aquaculture. Use of bacteriocins has recently been considered as an effective strategy for prevention of bacterial infections. A novel bacteriocin produced by Catla catla gut isolates, Lactobacillus animalis TSU4, designated as bacteriocin TSU4 was purified to homogeneity by a three-step protocol. The molecular mass of bacteriocin TSU4 was 4117 Da determined by Q-TOF LC/MS analysis. Its isoelectric point was ~9. Secondary conformation obtained by circular dichroism spectroscopy showed molecular conformation with significant proportions of the structure in α-helix (23.7 %) and β-sheets (17.1 %). N-terminal sequencing was carried out by the Edman degradation method; partial sequence identified was NH2-SMSGFSKPHD. Bacteriocin TSU4 exhibited a wide range of antimicrobial activity, pH and thermal stability. It showed a bacteriocidal mode of action against the indicator strain Aeromonas hydrophila MTCC 646. Bacteriocin TSU4 is the first reported bacteriocin produced by fish isolate Lactobacillus animalis. The characterization of bacteriocin TSU4 suggested that it is a novel bacteriocin with potential value against infections of bacteria such as A. hydrophila MTCC 646 and Pseudomonas aeruginosa MTCC 1688 and application to prevent spoilage during food preservation.
Structural characterization of peptides derived from prosomatostatins I and II isolated from the pancreatic islets of two species of teleostean fish: the daddy sculpin and the flounder.

PubMed

Conlon, J M; Davis, M S; Falkmer, S; Thim, L

1987-11-02

The primary structures of three peptides from extracts from the pancreatic islets of the daddy sculpin (Cottus scorpius) and three analogous peptides from the islets of the flounder (Platichthys flesus), two species of teleostean fish, have been determined by automated Edman degradation. The structures of the flounder peptides were confirmed by fast-atom bombardment mass spectrometry. The peptides show strong homology to residues (49-60), (63-96) and (98-125) of the predicted sequence of preprosomatostatin II from the anglerfish (Lophius americanus). The amino acid sequences of the peptides suggest that, in the sculpin, prosomatostatin II is cleaved at a dibasic amino acid residue processing site (corresponding to Lys61-Arg62 in anglerfish preprosomatostatin II). The resulting fragments are further cleaved at monobasic residue processing sites (corresponding to Arg48 and Arg97 in anglerfish preprosomatostatin II). In the flounder the same dibasic residue processing site is utilised but cleavage at different monobasic sites takes place (corresponding to Arg50 and Arg97 in anglerfish preprosomatostatin II). A peptide identical to mammalian somatostatin-14 was also isolated from the islets of both species and is presumed to represent a cleavage product of prosomatostatin I.
R(-)-4-(3-Isothiocyanatopyrrolidin-1-yl)-7-(N,N-dimethylaminosulfonyl)-2,1,3-benzoxadiazole, a fluorescent chiral tagging reagent: sensitive resolution of chiral amines and amino acids by reversed-phase liquid chromatography.

PubMed

Toyo'oka, T; Jin, D; Tomoi, N; Oe, T; Hiranuma, H

2001-02-01

The usefulness of R(-)-4-(3-isothiocyanatopyrrolidin-1-yl)-7-(N,N-dimethylaminosulfonyl)-2,1,3-benzoxadiazole [R(-)-DBD-PyNCS], a fluorescent chiral tagging reagent, for the determination of racemic amines and amino acids, was studied. The reagent reacted with beta-blockers selected as representative secondary amines to produce corresponding fluorescent diastereomers (excitation at 460 nm and emission at 550 nm). The yields of the derivatization reaction were dependent on the stereostructure arround the NH group in beta-blockers. The resulting diastereomers were completely separated with single chromatographic run using linear gradient elutions by reversed-phase chromatography. R(-)-DBD-PyNCS was also applied to the determination of DL-amino acid, considered to be one of the primary amines, in human urine and foodstuffs. DL-amino acids tested equally reacted with the reagent, and the thiocarbamoyl derivatives were separated with an ODS column. The epimerization during the derivatization reaction was negligible judging from the resolution of opposite diastereomers on the chromatogram. The occurence of D-amino acids (D-Ala, D-Ser, D-Asp and/or D-Glu) was identified in the samples tested. The structures and the purities were elucidated with on-line HPLC-MS. The chiral reagent possessing an isothiocyanate group (-NCS) in the structure seems to be applicable to continuous sequential analysis of peptides containing D-amino acids. The thiocarbamoyl derivatives obtained from the reaction with DL-amino acids were converted to thiohydantoins via thiazolinones in acidic medium. The thiohydantoins produced from acidic, basic, neutral, hydroxyl and aromatic amino acids were completely separated with isocratic elutions using acidic mobile phase containing 0.1% TFA. The separations were sufficient for the identification of DL-amino acid in peptide sequences. Although the epimerization during the conversion reaction to thiohydantoins was not avoidable, the descrimination of D- and L-configuration was demonstrated with some commercially available peptides such as beta-lipotropin and [D-Ala2]-deltorphin II. The Edman degaradation method using R(-)-DBD-PyNCS was also adopted to autoanlaysis by gas-phase sequencer. The separation and the detection (UV 254 nm) conditions of the derivatives were used without any change from those for the Edman degradation method using PITC as the tagging reagent. The three DL-amino acid residues (Tyr, Ala and Gly) in [L-Ala2]-leucine-enkephalin and [D-Ala2]-leucine-enkephalin were perfectly identidied with the autoanalysis.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Fox, J.W.; Elzinga, M.; Tu, A.T.

The primary structure of myotoxin a, a myotoxin protein from the venom of the North American rattlesnake Crotalus viridis viridis, was determined and the position of the disulfide bonds assigned. The toxin was isolated, carboxymethylated, and cleaved by cyanogen bromide, and the resultant peptides were isolated. The cyanogen bromide peptides were subjected to amino acid sequence analysis. In order to assign the positions of the three disulfide bonds, the native toxin was cleaved sequentially with cyanogen bromide and trypsin. A two peptide unit connected by one disulfide bond was isolated and characterized, and a three-peptide unit connected by two disulfidemore » bonds was isolated. One peptide in the three-peptide unit was identified as Cys-Cys-Lys. In order to establish the linkages between the peptides and Cys-Cys-Lys, one cycle of Edman degradation was carried out such that the Cys-Cys bond was cleaved. Upon isolation and analysis of the cleavage products, the disulfide bonds connecting the three peptides were determined. The positions of the disulfide bridges of myotoxin a were determined to be totally different from those of neurotoxins isolated from snake venoms. The sequence of myotoxin a was compared with the sequences of other snake venom toxins using the computer program RELATE to determine whether myotoxin a is similar to any other types of toxins. From the computer analysis, myotoxin a did not show any close relationship to other toxins except crotamine from the South American rattlesnake Crotalus durissus terrificus.« less
Production, purification, sequencing and activity spectra of mutacins D-123.1 and F-59.1

PubMed Central

2011-01-01

Background The increase in bacterial resistance to antibiotics impels the development of new anti-bacterial substances. Mutacins (bacteriocins) are small antibacterial peptides produced by Streptococcus mutans showing activity against bacterial pathogens. The objective of the study was to produce and characterise additional mutacins in order to find new useful antibacterial substances. Results Mutacin F-59.1 was produced in liquid media by S. mutans 59.1 while production of mutacin D-123.1 by S. mutans 123.1 was obtained in semi-solid media. Mutacins were purified by hydrophobic chromatography. The amino acid sequences of the mutacins were obtained by Edman degradation and their molecular mass was determined by mass spectrometry. Mutacin F-59.1 consists of 25 amino acids, containing the YGNGV consensus sequence of pediocin-like bacteriocins with a molecular mass calculated at 2719 Da. Mutacin D-123.1 has an identical molecular mass (2364 Da) with the same first 9 amino acids as mutacin I. Mutacins D-123.1 and F-59.1 have wide activity spectra inhibiting human and food-borne pathogens. The lantibiotic mutacin D-123.1 possesses a broader activity spectrum than mutacin F-59.1 against the bacterial strains tested. Conclusion Mutacin F-59.1 is the first pediocin-like bacteriocin identified and characterised that is produced by Streptococcus mutans. Mutacin D-123.1 appears to be identical to mutacin I previously identified in different strains of S. mutans. PMID:21477375
Production, purification, sequencing and activity spectra of mutacins D-123.1 and F-59.1.

PubMed

Nicolas, Guillaume G; LaPointe, Gisèle; Lavoie, Marc C

2011-04-10

The increase in bacterial resistance to antibiotics impels the development of new anti-bacterial substances. Mutacins (bacteriocins) are small antibacterial peptides produced by Streptococcus mutans showing activity against bacterial pathogens. The objective of the study was to produce and characterise additional mutacins in order to find new useful antibacterial substances. Mutacin F-59.1 was produced in liquid media by S. mutans 59.1 while production of mutacin D-123.1 by S. mutans 123.1 was obtained in semi-solid media. Mutacins were purified by hydrophobic chromatography. The amino acid sequences of the mutacins were obtained by Edman degradation and their molecular mass was determined by mass spectrometry. Mutacin F-59.1 consists of 25 amino acids, containing the YGNGV consensus sequence of pediocin-like bacteriocins with a molecular mass calculated at 2719 Da. Mutacin D-123.1 has an identical molecular mass (2364 Da) with the same first 9 amino acids as mutacin I. Mutacins D-123.1 and F-59.1 have wide activity spectra inhibiting human and food-borne pathogens. The lantibiotic mutacin D-123.1 possesses a broader activity spectrum than mutacin F-59.1 against the bacterial strains tested. Mutacin F-59.1 is the first pediocin-like bacteriocin identified and characterised that is produced by Streptococcus mutans. Mutacin D-123.1 appears to be identical to mutacin I previously identified in different strains of S. mutans.
Novel proline-hydroxyproline glycopeptides from the dandelion (Taraxacum officinale Wigg.) flowers: de novo sequencing and biological activity.

PubMed

Astafieva, Alexandra A; Enyenihi, Atim A; Rogozhin, Eugene A; Kozlov, Sergey A; Grishin, Eugene V; Odintsova, Tatyana I; Zubarev, Roman A; Egorov, Tsezi A

2015-09-01

Two novel homologous peptides named ToHyp1 and ToHyp2 that show no similarity to any known proteins were isolated from Taraxacum officinale Wigg. flowers by multidimensional liquid chromatography. Amino acid and mass spectrometry analyses demonstrated that the peptides have unusual structure: they are cysteine-free, proline-hydroxyproline-rich and post-translationally glycosylated by pentoses, with 5 carbohydrates in ToHyp2 and 10 in ToHyp1. The ToHyp2 peptide with a monoisotopic molecular mass of 4350.3Da was completely sequenced by a combination of Edman degradation and de novo sequencing via top down multistage collision induced dissociation (CID) and higher energy dissociation (HCD) tandem mass spectrometry (MS(n)). ToHyp2 consists of 35 amino acids, contains eighteen proline residues, of which 8 prolines are hydroxylated. The peptide displays antifungal activity and inhibits growth of Gram-positive and Gram-negative bacteria. We further showed that carbohydrate moieties have no significant impact on the peptide structure, but are important for antifungal activity although not absolutely necessary. The deglycosylated ToHyp2 peptide was less active against the susceptible fungus Bipolaris sorokiniana than the native peptide. Unique structural features of the ToHyp2 peptide place it into a new family of plant defense peptides. The discovery of ToHyp peptides in T. officinale flowers expands the repertoire of molecules of plant origin with practical applications. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Identification of the regulatory autophosphorylation site of autophosphorylation-dependent protein kinase (auto-kinase). Evidence that auto-kinase belongs to a member of the p21-activated kinase family.

PubMed

Yu, J S; Chen, W J; Ni, M H; Chan, W H; Yang, S D

1998-08-15

Autophosphorylation-dependent protein kinase (auto-kinase) was identified from pig brain and liver on the basis of its unique autophosphorylation/activation property [Yang, Fong, Yu and Liu (1987) J. Biol. Chem. 262, 7034-7040; Yang, Chang and Soderling (1987) J. Biol. Chem. 262, 9421-9427]. Its substrate consensus sequence motif was determined as being -R-X-(X)-S*/T*-X3-S/T-. To characterize auto-kinase further, we partly sequenced the kinase purified from pig liver. The N-terminal sequence (VDGGAKTSDKQKKKAXMTDE) and two internal peptide sequences (EKLRTIV and LQNPEK/ILTP/FI) of auto-kinase were obtained. These sequences identify auto-kinase as a C-terminal catalytic fragment of p21-activated protein kinase 2 (PAK2 or gamma-PAK) lacking its N-terminal regulatory region. Auto-kinase can be recognized by an antibody raised against the C-terminal peptide of human PAK2 by immunoblotting. Furthermore the autophosphorylation site sequence of auto-kinase was successfully predicted on the basis of its substrate consensus sequence motif and the known PAK2 sequence, and was further demonstrated to be RST(P)MVGTPYWMAPEVVTR by phosphoamino acid analysis, manual Edman degradation and phosphopeptide mapping via the help of phosphorylation site analysis of a synthetic peptide corresponding to the sequence of PAK2 from residues 396 to 418. During the activation process, auto-kinase autophosphorylates mainly on a single threonine residue Thr402 (according to the sequence numbering of human PAK2). In addition, a phospho-specific antibody against a synthetic phosphopeptide containing this identified sequence was generated and shown to be able to differentially recognize the activated auto-kinase autophosphorylated at Thr402 but not the non-phosphorylated/inactive auto-kinase. Immunoblot analysis with this phospho-specific antibody further revealed that the change in phosphorylation level of Thr402 of auto-kinase was well correlated with the activity change of the kinase during both autophosphorylation/activation and protein phosphatase-mediated dephosphorylation/inactivation processes. Taken together, our results identify Thr402 as the regulatory autophosphorylation site of auto-kinase, which is a C-terminal catalytic fragment of PAK2.
Identification of the regulatory autophosphorylation site of autophosphorylation-dependent protein kinase (auto-kinase). Evidence that auto-kinase belongs to a member of the p21-activated kinase family.

PubMed Central

Yu, J S; Chen, W J; Ni, M H; Chan, W H; Yang, S D

1998-01-01

Autophosphorylation-dependent protein kinase (auto-kinase) was identified from pig brain and liver on the basis of its unique autophosphorylation/activation property [Yang, Fong, Yu and Liu (1987) J. Biol. Chem. 262, 7034-7040; Yang, Chang and Soderling (1987) J. Biol. Chem. 262, 9421-9427]. Its substrate consensus sequence motif was determined as being -R-X-(X)-S*/T*-X3-S/T-. To characterize auto-kinase further, we partly sequenced the kinase purified from pig liver. The N-terminal sequence (VDGGAKTSDKQKKKAXMTDE) and two internal peptide sequences (EKLRTIV and LQNPEK/ILTP/FI) of auto-kinase were obtained. These sequences identify auto-kinase as a C-terminal catalytic fragment of p21-activated protein kinase 2 (PAK2 or gamma-PAK) lacking its N-terminal regulatory region. Auto-kinase can be recognized by an antibody raised against the C-terminal peptide of human PAK2 by immunoblotting. Furthermore the autophosphorylation site sequence of auto-kinase was successfully predicted on the basis of its substrate consensus sequence motif and the known PAK2 sequence, and was further demonstrated to be RST(P)MVGTPYWMAPEVVTR by phosphoamino acid analysis, manual Edman degradation and phosphopeptide mapping via the help of phosphorylation site analysis of a synthetic peptide corresponding to the sequence of PAK2 from residues 396 to 418. During the activation process, auto-kinase autophosphorylates mainly on a single threonine residue Thr402 (according to the sequence numbering of human PAK2). In addition, a phospho-specific antibody against a synthetic phosphopeptide containing this identified sequence was generated and shown to be able to differentially recognize the activated auto-kinase autophosphorylated at Thr402 but not the non-phosphorylated/inactive auto-kinase. Immunoblot analysis with this phospho-specific antibody further revealed that the change in phosphorylation level of Thr402 of auto-kinase was well correlated with the activity change of the kinase during both autophosphorylation/activation and protein phosphatase-mediated dephosphorylation/inactivation processes. Taken together, our results identify Thr402 as the regulatory autophosphorylation site of auto-kinase, which is a C-terminal catalytic fragment of PAK2. PMID:9693111
Biosynthesis of riboflavin: an unusual riboflavin synthase of Methanobacterium thermoautotrophicum.

PubMed Central

Eberhardt, S; Korn, S; Lottspeich, F; Bacher, A

1997-01-01

Riboflavin synthase was purified by a factor of about 1,500 from cell extract of Methanobacterium thermoautotrophicum. The enzyme had a specific activity of about 2,700 nmol mg(-1) h(-1) at 65 degrees C, which is relatively low compared to those of riboflavin synthases of eubacteria and yeast. Amino acid sequences obtained after proteolytic cleavage had no similarity with known riboflavin synthases. The gene coding for riboflavin synthase (designated ribC) was subsequently cloned by marker rescue with a ribC mutant of Escherichia coli. The ribC gene of M. thermoautotrophicum specifies a protein of 153 amino acid residues. The predicted amino acid sequence agrees with the information gleaned from Edman degradation of the isolated protein and shows 67% identity with the sequence predicted for the unannotated reading frame MJ1184 of Methanococcus jannaschii. The ribC gene is adjacent to a cluster of four genes with similarity to the genes cbiMNQO of Salmonella typhimurium, which form part of the cob operon (this operon contains most of the genes involved in the biosynthesis of vitamin B12). The amino acid sequence predicted by the ribC gene of M. thermoautotrophicum shows no similarity whatsoever to the sequences of riboflavin synthases of eubacteria and yeast. Most notably, the M. thermoautotrophicum protein does not show the internal sequence homology characteristic of eubacterial and yeast riboflavin synthases. The protein of M. thermoautotrophicum can be expressed efficiently in a recombinant E. coli strain. The specific activity of the purified, recombinant protein is 1,900 nmol mg(-1) h(-1) at 65 degrees C. In contrast to riboflavin synthases from eubacteria and fungi, the methanobacterial enzyme has an absolute requirement for magnesium ions. The 5' phosphate of 6,7-dimethyl-8-ribityllumazine does not act as a substrate. The findings suggest that riboflavin synthase has evolved independently in eubacteria and methanobacteria. PMID:9139911
Comparative analysis of the XopD T3S effector family in plant pathogenic bacteria

PubMed Central

Kim, Jung-Gun; Taylor, Kyle W.; Mudgett, Mary Beth

2011-01-01

SUMMARY XopD is a type III effector protein that is required for Xanthomonas campestris pathovar vesicatoria (Xcv) growth in tomato. It is a modular protein consisting of an N-terminal DNA-binding domain, two EAR transcriptional repressor motifs, and a C-terminal SUMO protease. In tomato, XopD functions as a transcriptional repressor, resulting in the suppression of defense responses at late stages of infection. A survey of available genome sequences for phytopathogenic bacteria revealed that XopD homologs are limited to species within three Genera of Proteobacteria – Xanthomonas, Acidovorax, and Pseudomonas. While the EAR motif(s) and SUMO protease domain are conserved in all the XopD-like proteins, variation exists in the length and sequence identity of the N-terminal domains. Comparative analysis of the DNA sequences surrounding xopD and xopD-like genes led to revised annotation of the xopD gene. Edman degradation sequence analysis and functional complementation studies confirmed that the xopD gene from Xcv encodes a 760 amino acid protein with a longer N-terminal domain than previously predicted. None of the XopD-like proteins studied complemented Xcv ΔxopD mutant phenotypes in tomato leaves suggesting that the N-terminus of XopD defines functional specificity. Xcv ΔxopD strains expressing chimeric fusion proteins containing the N-terminus of XopD fused to the EAR motif(s) and SUMO protease domain of the XopD-like protein from Xanthomonas campestris pathovar campestris strain B100 were fully virulent in tomato demonstrating that the N-terminus of XopD controls specificity in tomato. PMID:21726373
Isolation and identification of an extracellular subtilisin-like serine protease secreted by the bat pathogen Pseudogymnoascus destructans.

PubMed

Pannkuk, Evan L; Risch, Thomas S; Savary, Brett J

2015-01-01

White nose syndrome (WNS) is a cutaneous fungal disease of bats. WNS is responsible for unprecedented mortalities in North American cave bat populations. There have been few descriptions of enzyme activities that may function in WNS host/pathogen interactions, while no study has isolated and described secreted proteases. To address the hypothesis that Pseudogymnoascus destructans secretes extracellular proteases that function in wing necrosis during WNS infection, the object of this study was to culture P. destructans on various media, then isolate and structurally identify those proteases accumulated stably in the culture medium. We found a single dominant protease activity on minimal nutrient broth enriched with protein substrates, which was strongly inhibited by phenylmethylsulfonyl fluoride. This P. destructans serine protease (PdSP1) was isolated by preparative isoelectric focusing and concanavalin A lectin affinity chromatography. PdSP1 showed a molecular weight 27,900 (estimated by SDS-PAGE), broad pH optimum 6-8, and temperature optimum 60°C. Structural characterization of PdSP1 by MALDI-TOF MS, Orbitrap MS/MS, and Edman amino-terminal peptide sequencing matched it directly to a hypothetical protein accession from the sequenced P. destructans genome that is further identified as a MEROPS family S8A subtilisin-like serine peptidase. Two additional isoforms, PdSP2 and PdSP3, were identified in the P. destructans genome with 90% and 53% homology, respectively. P. destructans S8A serine proteases showed closer sequence conservation to P. pannorum and plant pathogenic fungi than to human pathogenic dermatophytes. Peptide-specific polyclonal antibodies developed from the PdSP1 sequence detected the protein in western blots. These subtilisin-like serine proteases are candidates for further functional studies in WNS host-pathogen interaction.
Isolation and Identification of an Extracellular Subtilisin-Like Serine Protease Secreted by the Bat Pathogen Pseudogymnoascus destructans

PubMed Central

Pannkuk, Evan L.; Risch, Thomas S.; Savary, Brett J.

2015-01-01

White nose syndrome (WNS) is a cutaneous fungal disease of bats. WNS is responsible for unprecedented mortalities in North American cave bat populations. There have been few descriptions of enzyme activities that may function in WNS host/pathogen interactions, while no study has isolated and described secreted proteases. To address the hypothesis that Pseudogymnoascus destructans secretes extracellular proteases that function in wing necrosis during WNS infection, the object of this study was to culture P. destructans on various media, then isolate and structurally identify those proteases accumulated stably in the culture medium. We found a single dominant protease activity on minimal nutrient broth enriched with protein substrates, which was strongly inhibited by phenylmethylsulfonyl fluoride. This P. destructans serine protease (PdSP1) was isolated by preparative isoelectric focusing and concanavalin A lectin affinity chromatography. PdSP1 showed a molecular weight 27,900 (estimated by SDS-PAGE), broad pH optimum 6-8, and temperature optimum 60°C. Structural characterization of PdSP1 by MALDI-TOF MS, Orbitrap MS/MS, and Edman amino-terminal peptide sequencing matched it directly to a hypothetical protein accession from the sequenced P. destructans genome that is further identified as a MEROPS family S8A subtilisin-like serine peptidase. Two additional isoforms, PdSP2 and PdSP3, were identified in the P. destructans genome with 90% and 53% homology, respectively. P. destructans S8A serine proteases showed closer sequence conservation to P. pannorum and plant pathogenic fungi than to human pathogenic dermatophytes. Peptide-specific polyclonal antibodies developed from the PdSP1 sequence detected the protein in western blots. These subtilisin-like serine proteases are candidates for further functional studies in WNS host-pathogen interaction. PMID:25785714
The Acheta domesticus Densovirus, Isolated from the European House Cricket, Has Evolved an Expression Strategy Unique among Parvoviruses▿†

PubMed Central

Liu, Kaiyu; Li, Yi; Jousset, Françoise-Xavière; Zadori, Zoltan; Szelei, Jozsef; Yu, Qian; Pham, Hanh Thi; Lépine, François; Bergoin, Max; Tijssen, Peter

2011-01-01

The Acheta domesticus densovirus (AdDNV), isolated from crickets, has been endemic in Europe for at least 35 years. Severe epizootics have also been observed in American commercial rearings since 2009 and 2010. The AdDNV genome was cloned and sequenced for this study. The transcription map showed that splicing occurred in both the nonstructural (NS) and capsid protein (VP) multicistronic RNAs. The splicing pattern of NS mRNA predicted 3 nonstructural proteins (NS1 [576 codons], NS2 [286 codons], and NS3 [213 codons]). The VP gene cassette contained two VP open reading frames (ORFs), of 597 (ORF-A) and 268 (ORF-B) codons. The VP2 sequence was shown by N-terminal Edman degradation and mass spectrometry to correspond with ORF-A. Mass spectrometry, sequencing, and Western blotting of baculovirus-expressed VPs versus native structural proteins demonstrated that the VP1 structural protein was generated by joining ORF-A and -B via splicing (splice II), eliminating the N terminus of VP2. This splice resulted in a nested set of VP1 (816 codons), VP3 (467 codons), and VP4 (429 codons) structural proteins. In contrast, the two splices within ORF-B (Ia and Ib) removed the donor site of intron II and resulted in VP2, VP3, and VP4 expression. ORF-B may also code for several nonstructural proteins, of 268, 233, and 158 codons. The small ORF-B contains the coding sequence for a phospholipase A2 motif found in VP1, which was shown previously to be critical for cellular uptake of the virus. These splicing features are unique among parvoviruses and define a new genus of ambisense densoviruses. PMID:21775445
Identification of a Novel Small Cysteine-Rich Protein in the Fraction from the Biocontrol Fusarium oxysporum Strain CS-20 that Mitigates Fusarium Wilt Symptoms and Triggers Defense Responses in Tomato

PubMed Central

Shcherbakova, Larisa A.; Odintsova, Tatyana I.; Stakheev, Alexander A.; Fravel, Deborah R.; Zavriev, Sergey K.

2016-01-01

The biocontrol effect of the non-pathogenic Fusarium oxysporum strain CS-20 against the tomato wilt pathogen F. oxysporum f. sp. lycopersici (FOL) has been previously reported to be primarily plant-mediated. This study shows that CS-20 produces proteins, which elicit defense responses in tomato plants. Three protein-containing fractions were isolated from CS-20 biomass using size exclusion chromatography. Exposure of seedling roots to one of these fractions prior to inoculation with pathogenic FOL strains significantly reduced wilt severity. This fraction initiated an ion exchange response in cultured tomato cells resulting in a reversible alteration of extracellular pH; increased tomato chitinase activity, and induced systemic resistance by enhancing PR-1 expression in tomato leaves. Two other protein fractions were inactive in seedling protection. The main polypeptide (designated CS20EP), which was specifically present in the defense-inducing fraction and was not detected in inactive protein fractions, was identified. The nucleotide sequence encoding this protein was determined, and its complete amino acid sequence was deduced from direct Edman degradation (25 N-terminal amino acid residues) and DNA sequencing. The CS20EP was found to be a small basic cysteine-rich protein with a pI of 9.87 and 23.43% of hydrophobic amino acid residues. BLAST search in the NCBI database showed that the protein is new; however, it displays 48% sequence similarity with a hypothetical protein FGSG_10784 from F. graminearum strain PH-1. The contribution of CS20EP to elicitation of tomato defense responses resulting in wilt mitigating is discussed. PMID:26779237
Characterization of a periplasmic S1-like nuclease coded by the Mesorhizobium loti symbiosis island

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pimkin, Maxim; Miller, C. Glenn; Blakesley, Lauryn

DNA sequences encoding hypothetical proteins homologous to S1 nuclease from Aspergillus oryzae are found in many organisms including fungi, plants, pathogenic bacteria, and eukaryotic parasites. One of these is the M1 nuclease of Mesorhizobium loti which we demonstrate herein to be an enzymatically active, soluble, and stable S1 homolog that lacks the extensive mannosyl-glycosylation found in eukaryotic S1 nuclease homologs. We have expressed the cloned M1 protein in M. loti and purified recombinant native M1 to near homogeneity and have also isolated a homogeneous M1 carboxy-terminal hexahistidine tag fusion protein. Mass spectrometry and N-terminal Edman degradation sequencing confirmed the proteinmore » identity. The enzymatic properties of the purified M1 nuclease are similar to those of S1. At acidic pH M1 is 25 times more active on single-stranded DNA than on double-stranded DNA and 3 times more active on single-stranded DNA than on single-stranded RNA. At neutral pH the RNase activity of M1 exceeds the DNase activity. M1 nicks supercoiled RF-I plasmid DNA and rapidly cuts the phosphodiester bond across from the nick in the resultant relaxed RF-II plasmid DNA. Therefore, M1 represents an active bacterial S1 homolog in spite of great sequence divergence. The biochemical characterization of M1 nuclease supports our sequence alignment that reveals the minimal 21 amino acid residues that are necessarily conserved for the structure and functions of this enzyme family. The ability of M1 to degrade RNA at neutral pH implies previously unappreciated roles of these nucleases in biological systems.« less
An asparagine residue at the N-terminus affects the maturation process of low molecular weight glutenin subunits of wheat endosperm

PubMed Central

2014-01-01

Background Wheat glutenin polymers are made up of two main subunit types, the high- (HMW-GS) and low- (LMW-GS) molecular weight subunits. These latter are represented by heterogeneous proteins. The most common, based on the first amino acid of the mature sequence, are known as LMW-m and LMW-s types. The mature sequences differ as a consequence of three extra amino acids (MET-) at the N-terminus of LMW-m types. The nucleotide sequences of their encoding genes are, however, nearly identical, so that the relationship between gene and protein sequences is difficult to ascertain. It has been hypothesized that the presence of an asparagine residue in position 23 of the complete coding sequence for the LMW-s type might account for the observed three-residue shortened sequence, as a consequence of cleavage at the asparagine by an asparaginyl endopeptidase. Results We performed site-directed mutagenesis of a LMW-s gene to replace asparagine at position 23 with threonine and thus convert it to a candidate LMW-m type gene. Similarly, a candidate LMW-m type gene was mutated at position 23 to replace threonine with asparagine. Next, we produced transgenic durum wheat (cultivar Svevo) lines by introducing the mutated versions of the LMW-m and LMW-s genes, along with the wild type counterpart of the LMW-m gene. Proteomic comparisons between the transgenic and null segregant plants enabled identification of transgenic proteins by mass spectrometry analyses and Edman N-terminal sequencing. Conclusions Our results show that the formation of LMW-s type relies on the presence of an asparagine residue close to the N-terminus generated by signal peptide cleavage, and that LMW-GS can be quantitatively processed most likely by vacuolar asparaginyl endoproteases, suggesting that those accumulated in the vacuole are not sequestered into stable aggregates that would hinder the action of proteolytic enzymes. Rather, whatever is the mechanism of glutenin polymer transport to the vacuole, the proteins remain available for proteolytic processing, and can be converted to the mature form by the removal of a short N-terminal sequence. PMID:24629124
Molecular Cloning and Characterization of Novel Morus alba Germin-Like Protein Gene Which Encodes for a Silkworm Gut Digestion-Resistant Antimicrobial Protein

PubMed Central

Patnaik, Bharat Bhusan; Kim, Dong Hyun; Oh, Seung Han; Song, Yong-Su; Chanh, Nguyen Dang Minh; Kim, Jong Sun; Jung, Woo-jin; Saha, Atul Kumar; Bindroo, Bharat Bhushan; Han, Yeon Soo

2012-01-01

Background Silkworm fecal matter is considered one of the richest sources of antimicrobial and antiviral protein (substances) and such economically feasible and eco-friendly proteins acting as secondary metabolites from the insect system can be explored for their practical utility in conferring broad spectrum disease resistance against pathogenic microbial specimens. Methodology/Principal Findings Silkworm fecal matter extracts prepared in 0.02 M phosphate buffer saline (pH 7.4), at a temperature of 60°C was subjected to 40% saturated ammonium sulphate precipitation and purified by gel-filtration chromatography (GFC). SDS-PAGE under denaturing conditions showed a single band at about 21.5 kDa. The peak fraction, thus obtained by GFC wastested for homogeneityusing C18reverse-phase high performance liquid chromatography (HPLC). The activity of the purified protein was tested against selected Gram +/− bacteria and phytopathogenic Fusarium species with concentration-dependent inhibitionrelationship. The purified bioactive protein was subjected to matrix-assisted laser desorption and ionization-time of flight mass spectrometry (MALDI-TOF-MS) and N-terminal sequencing by Edman degradation towards its identification. The N-terminal first 18 amino acid sequence following the predicted signal peptide showed homology to plant germin-like proteins (Glp). In order to characterize the full-length gene sequence in detail, the partial cDNA was cloned and sequenced using degenerate primers, followed by 5′- and 3′-rapid amplification of cDNA ends (RACE-PCR). The full-length cDNA sequence composed of 630 bp encoding 209 amino acids and corresponded to germin-like proteins (Glps) involved in plant development and defense. Conclusions/Significance The study reports, characterization of novel Glpbelonging to subfamily 3 from M. alba by the purification of mature active protein from silkworm fecal matter. The N-terminal amino acid sequence of the purified protein was found similar to the deduced amino acid sequence (without the transit peptide sequence) of the full length cDNA from M. alba. PMID:23284650
DOE Office of Scientific and Technical Information (OSTI.GOV)

James Graham, Robert Leslie; Graham, Ciaren; McClean, Stephen

A novel undecapeptide has been isolated and structurally characterized from the venoms of three species of New World pit vipers from the subfamily, Crotalinae. These include the Mexican moccasin (Agkistrodon bilineatus), the prairie rattlesnake (Crotalus viridis viridis), and the South American bushmaster (Lachesis muta). The peptide was purified from all three venoms using a combination of gel permeation chromatography and reverse-phase HPLC. Automated Edman degradation sequencing and MALDI-TOF mass spectrometry established its peptide primary structure as: Thr-Pro-Pro-Ala-Gly-Pro-Asp-Val-Gly-Pro-Arg-OH, with a non-protonated molecular mass of 1063.18 Da. A synthetic replicate of the peptide was found to be an antagonist of bradykinin actionmore » at the rat vascular B2 receptor. This is the first bradykinin inhibitory peptide isolated from snake venom. Database searching revealed the peptide to be highly structurally related (10/11 residues) with a domain residing between the bradykinin-potentiating peptide and C-type natriuretic peptide domains of a recently cloned precursor from tropical rattlesnake (Crotalus durissus terrificus) venom gland. BIP thus represents a novel biological entity from snake venom.« less
Antimicrobial proline-rich peptides from the hemolymph of marine snail Rapana venosa.

PubMed

Dolashka, Pavlina; Moshtanska, Vesela; Borisova, Valika; Dolashki, Aleksander; Stevanovic, Stefan; Dimanov, Tzvetan; Voelter, Wolfgang

2011-07-01

Hemolymph of Rapana venosa snails is a complex mixture of biochemically and pharmacologically active components such as peptides and proteins. Antimicrobial peptides are gaining attention as antimicrobial alternatives to chemical food preservatives and commonly used antibiotics. Therefore, for the first time we have explored the isolation, identification and characterisation of 11 novel antimicrobial peptides produced by the hemolymph of molluscs. The isolated peptides from the hemolymph applying ultrafiltration and reverse-phase high-performance liquid chromatography (RP-HPLC) have molecular weights between 3000 and 9500 Da, determined by mass spectrometric analysis. The N-terminal sequences of the peptides identified by Edman degradation matched no peptides in the MASCOT search database, indicating novel proline-rich peptides. UV spectra revealed that these substances possessed the characteristics of protein peptides with acidic isoelectric points. However, no Cotton effects were observed between 190 and 280 nm by circular dichroism spectroscopy. Four of the pro-rich peptides also showed strong antimicrobial activities against tested microorganisms including Gram-positive and Gram-negative bacteria. Copyright © 2011 Elsevier Inc. All rights reserved.

Rapid removal of acetimidoyl groups from proteins and peptides. Applications to primary structure determination.

PubMed Central

Dubois, G C; Robinson, E A; Inman, J K; Perham, R N; Appella, E

1981-01-01

Methylamine buffers can be used for the rapid quantitative removal of acetimidoyl groups from proteins and peptides modified by treatment with ethyl or methyl acetimidate. The half-life for displacement of acetimidoyl groups from fully amidinated proteins incubated in 3.44 M-methylamine/HCl buffer at pH 11.5 and 25 degrees C was approx. 26 min; this half life is 29 times less than that observed in ammonia/HCl buffer under the same conditions of pH and amine concentration. Incubation of acetimidated proteins with methylamine for 4 h resulted in greater than 95% removal of acetimidoyl groups. No deleterious effects on primary structure were detected by amino acid analysis or by automated Edman degradation. Reversible amidination of lysine residues, in conjunction with tryptic digestion, has been successfully applied to the determination of the amino acid sequence of an acetimidated mouse immunoglobulin heavy chain peptide. The regeneration of amino groups in amidinated proteins and peptides by methylaminolysis makes amidination a valuable alternative to citraconoylation and maleoylation in structural studies. PMID:6803762
An insecticidal toxin from Nephila clavata spider venom.

PubMed

Jin, Lin; Fang, Mingqian; Chen, Mengrou; Zhou, Chunling; Ombati, Rose; Hakim, Md Abdul; Mo, Guoxiang; Lai, Ren; Yan, Xiuwen; Wang, Yumin; Yang, Shilong

2017-07-01

Spiders are the most successful insect predators given that they use their venom containing insecticidal peptides as biochemical weapons for preying. Due to the high specificity and potency of peptidic toxins, discoveries of insecticidal toxins from spider venom have provided an opportunity to obtain natural compounds for agricultural applications without affecting human health. In this study, a novel insecticidal toxin (μ-NPTX-Nc1a) was identified and characterized from the venom of Nephila clavata. Its primary sequence is GCNPDCTGIQCGWPRCPGGQNPVMDKCVSCCPFCPPKSAQG which was determined by automated Edman degradation, cDNA cloning, and MS/MS analysis. BLAST search indicated that Nc1a shows no similarity with known peptides or proteins, indicating that Nc1a belongs to a novel family of insecticidal peptide. Nc1a displayed inhibitory effects on Na V and K V channels in cockroach dorsal unpaired median neurons. The median lethal dose (LD50) of Nc1a on cockroach was 573 ng/g. Herein, a study that identifies a novel insecticidal toxin, which can be a potential candidate and/or template for the development of bioinsecticides, is presented.
Multi-omic Mitoprotease Profiling Defines a Role for Oct1p in Coenzyme Q Production.

PubMed

Veling, Mike T; Reidenbach, Andrew G; Freiberger, Elyse C; Kwiecien, Nicholas W; Hutchins, Paul D; Drahnak, Michael J; Jochem, Adam; Ulbrich, Arne; Rush, Matthew J P; Russell, Jason D; Coon, Joshua J; Pagliarini, David J

2017-12-07

Mitoproteases are becoming recognized as key regulators of diverse mitochondrial functions, although their direct substrates are often difficult to discern. Through multi-omic profiling of diverse Saccharomyces cerevisiae mitoprotease deletion strains, we predicted numerous associations between mitoproteases and distinct mitochondrial processes. These include a strong association between the mitochondrial matrix octapeptidase Oct1p and coenzyme Q (CoQ) biosynthesis-a pathway essential for mitochondrial respiration. Through Edman sequencing and in vitro and in vivo biochemistry, we demonstrated that Oct1p directly processes the N terminus of the CoQ-related methyltransferase, Coq5p, which markedly improves its stability. A single mutation to the Oct1p recognition motif in Coq5p disrupted its processing in vivo, leading to CoQ deficiency and respiratory incompetence. This work defines the Oct1p processing of Coq5p as an essential post-translational event for proper CoQ production. Additionally, our data visualization tool enables efficient exploration of mitoprotease profiles that can serve as the basis for future mechanistic investigations. Copyright © 2017 Elsevier Inc. All rights reserved.
Purification and characteristics of a novel bacteriocin produced by Enterococcus faecalis L11 isolated from Chinese traditional fermented cucumber.

PubMed

Gao, Yurong; Li, Benling; Li, Dapeng; Zhang, Liyuan

2016-05-01

To purify and characterize a novel bacteriocin with broad inhibitory spectrum produced by an isolate of Enterococcus faecalis from Chinese fermented cucumber. E. faecalis L11 produced a bacteriocin with antimicrobial activity against both Escherichia coli and Staphylococcus aureus. The amino acid sequence of the purified bacteriocin, enterocin L11, was assayed by Edman degradation method. It differs from other class II bacteriocins and exhibited a broad antimicrobial activity against not only Gram-positive bacteria, including Bacillus subtilis, S. aureus, Listeria monocytogenes, Sarcina flava, Lactobacillus acidophilus, L. plantarum, L. delbrueckii subsp. delbrueckii, L. delbrueckii subsp. bulgaricus and Streptococcus thermophilus, but also some Gram-negative bacteria including Salmonella typhimurium, E. coli and Shigella flexneri. Enterocin L11 retained 91 % of its activity after holding at 121 °C for 30 min. It was also resistant to acids and alkalis. Enterocin L11 is a novel broad-spectrum Class II bacteriocin produced by E. faecalis L11, and may have potential as a food biopreservative.
PH-sauvagine from the skin secretion of Phyllomedusa hypochondrialis: A novel CRF-like peptide with smooth muscle contraction activity.

PubMed

Zhou, Yu; Shaw, Chris; Chen, Tianbao

2015-09-15

Amphibian skin, and particularly that of south/Central American phyllomedusine frogs, is supposed to be "a huge factory and store house of a variety of active peptides". The 40 amino acid amphibian CRF-like peptide, sauvagine, is a prototype member of a unique family of these Phyllomedusa skin peptides. In this study, we describe for the first time the structure of a mature novel peptide from the skin secretion of the South American orange-legged leaf frog, Phyllomedusa hypochondrialis, which belongs to the amphibian CRF/sauvagine family. Partial amino acid sequence from the N-terminal was obtained by automated Edman degradation with the following structure: pGlu-GPPISIDLNMELLRNMIEI-. The biosynthetic precursor of this novel sauvagine peptide, consisted of 85 amino acid residues and was deduced from cDNA library constructed from the same skin secretion. Compared with the standard sauvagine from the frog, Phyllomedusa sauvagei, this novel peptide was found to exert similar contraction effects on isolated guinea-pig colon and rat urinary bladder smooth muscle preparations. Copyright © 2015 Elsevier Ltd. All rights reserved.
Crab digestive phospholipase: a new invertebrate member.

PubMed

Cherif, Slim; Ben Bacha, Abir; Ben Ali, Yassine; Horchani, Habib; Rekik, Wiem; Gargouri, Youssef

2010-01-01

Crab digestive phospholipase (CDPL) was purified from the hepatopancreas of Carcinus mediterraneus crabs. Homogeneous enzyme was obtained after two chromatography steps: anion exchange and size exclusion HPLC column. Homogeneous CDPL has a molecular mass of 14 kDa as determined by SDS/PAGE analysis. Unlike known digestive phospholipases like porcine PLA(2) (PPPL), CDPL displayed its maximal activity at 50 degrees C and not at 37 degrees C. A specific activity of 40 U/mg for the purified CDPL was measured using PC as substrate under optimal conditions (pH 8 and 50 degrees C) in the presence of 8 mM sodium deoxycholate (NaDC) and 10 mM CaCl(2). In contrast to PPPL, purified CDPL was completely inactivated at 60 degrees C. The N-terminal sequence was determined by automatic Edman degradation. No similarity between 12 N-terminal amino acid residues of CDPL was found with those of known digestive phospholipases. CDPL appears to be a new member of invertebrate phospholipases, and it is potentially useful for treat phospholipid-rich industrial effluents, or to synthesize useful chemical compounds which can be used in the food industry.
Evidence for the proteolytic processing of dentin matrix protein 1. Identification and characterization of processed fragments and cleavage sites.

PubMed

Qin, Chunlin; Brunn, Jan C; Cook, Richard G; Orkiszewski, Ralph S; Malone, James P; Veis, Arthur; Butler, William T

2003-09-05

Full-length cDNA coding for dentin matrix protein 1 (DMP1) has been cloned and sequenced, but the corresponding complete protein has not been isolated. In searching for naturally occurring DMP1, we recently discovered that the extracellular matrix of bone contains fragments originating from DMP1. Shortened forms of DMP1, termed 37K and 57K fragments, were treated with alkaline phosphatase and then digested with trypsin. The resultant peptides were purified by a two-dimensional method: size exclusion followed by reversed-phase high performance liquid chromatography. Purified peptides were sequenced by Edman degradation and mass spectrometry, and the sequences compared with the DMP1 sequence predicted from cDNA. Extensive sequencing of tryptic peptides revealed that the 37K fragments originated from the NH2-terminal region, and the 57K fragments were from the COOH-terminal part of DMP1. Phosphate analysis indicated that the 37K fragments contained 12 phosphates, and the 57K fragments had 41. From 37K fragments, two peptides lacked a COOH-terminal lysine or arginine; instead they ended at Phe173 and Ser180 and were thus COOH termini of 37K fragments. Two peptides were from the NH2 termini of 57K fragments, starting at Asp218 and Asp222. These findings indicated that DMP1 is proteolytically cleaved at four bonds, Phe173-Asp174, Ser180-Asp181, Ser217-Asp218, and Gln221-Asp222, forming eight fragments. The uniformity of cleavages at the NH2-terminal peptide bonds of aspartyl residues suggests that a single proteinase is involved. Based on its reported specificity, we hypothesize that these scissions are catalyzed by PHEX protein. We envision that the proteolytic processing of DMP1 plays a crucial role during osteogenesis and dentinogenesis.
Characterization of on-target generated tryptic peptides from Giberella zeae conidia spore proteins by means of matrix-assisted laser desorption/ionization mass spectrometry.

PubMed

Dong, Hongjuan; Marchetti-Deschmann, Martina; Allmaier, Günter

2014-01-01

Traditionally characterization of microbial proteins is performed by a complex sequence of steps with the final step to be either Edman sequencing or mass spectrometry, which generally takes several weeks or months to be complete. In this work, we proposed a strategy for the characterization of tryptic peptides derived from Giberella zeae (anamorph: Fusarium graminearum) proteins in parallel to intact cell mass spectrometry (ICMS) in which no complicated and time-consuming steps were needed. Experimentally, after a simple washing treatment of the spores, the aliquots of the intact G. zeae macro conidia spores solution, were deposited two times onto one MALDI (matrix-assisted laser desorption ionization) mass spectrometry (MS) target (two spots). One spot was used for ICMS and the second spot was subject to a brief on-target digestion with bead-immobilized or non-immobilized trypsin. Subsequently, one spot was analyzed immediately by MALDI MS in the linear mode (ICMS) whereas the second spot containing the digested material was investigated by MALDI MS in the reflectron mode ("peptide mass fingerprint") followed by protonated peptide selection for MS/MS (post source decay (PSD) fragment ion) analysis. Based on the formed fragment ions of selected tryptic peptides a complete or partial amino acid sequence was generated by manual de novo sequencing. These sequence data were used for homology search for protein identification. Finally four different peptides of varying abundances have been identified successfully allowing the verification that our desorbed/ionized surface compounds were indeed derived from proteins. The presence of three different proteins could be found unambiguously. Interestingly, one of these proteins is belonging to the ribosomal superfamily which indicates that not only surface-associated proteins were digested. This strategy minimized the amount of time and labor required for obtaining deeper information on spore preparations within the nowadays widely used ICMS approach. Copyright © 2013 Elsevier Ltd. All rights reserved.
Characterization of papain-like isoenzymes from latex of Asclepias curassavica by molecular biology validated by proteomic approach.

PubMed

Obregón, Walter D; Liggieri, Constanza S; Trejo, Sebastian A; Avilés, Francesc X; Vairo-Cavalli, Sandra E; Priolo, Nora S

2009-01-01

Latices from Asclepias spp are used in wound healing and the treatment of some digestive disorders. These pharmacological actions have been attributed to the presence of cysteine proteases in these milky latices. Asclepias curassavica (Asclepiadaceae), "scarlet milkweed" is a perennial subshrub native to South America. In the current paper we report a new approach directed at the selective biochemical and molecular characterization of asclepain cI (acI) and asclepain cII (acII), the enzymes responsible for the proteolytic activity of the scarlet milkweed latex. SDS-PAGE spots of both purified peptidases were digested with trypsin and Peptide Mass Fingerprints (PMFs) obtained showed no equivalent peptides. No identification was possible by MASCOT search due to the paucity of information concerning Asclepiadaceae latex cysteine proteinases available in databases. From total RNA extracted from latex samples, cDNA of both peptidases was obtained by RT-PCR using degenerate primers encoding Asclepiadaceae cysteine peptidase conserved domains. Theoretical PMFs of partial polypeptide sequences obtained by cloning (186 and 185 amino acids) were compared with empirical PMFs, confirming that the sequences of 186 and 185 amino acids correspond to acI and acII, respectively. N-terminal sequences of acI and acII, characterized by Edman sequencing, were overlapped with those coming from the cDNA to obtain the full-length sequence of both mature peptidases (212 and 211 residues respectively). Alignment and phylogenetic analysis confirmed that acI and acII belong to the subfamily C1A forming a new group of papain-like cysteine peptidases together with asclepain f from Asclepias fruticosa. We conclude that PMF could be adopted as an excellent tool to differentiate, in a fast and unequivocal way, peptidases with very similar physicochemical and functional properties, with advantages over other conventional methods (for instance enzyme kinetics) that are time consuming and afford less reliable results.
Protein sequence analysis, cloning, and expression of flammutoxin, a pore-forming cytolysin from Flammulina velutipes. Maturation of dimeric precursor to monomeric active form by carboxyl-terminal truncation.

PubMed

Tomita, Toshio; Mizumachi, Yoshihiro; Chong, Kang; Ogawa, Kanako; Konishi, Norihide; Sugawara-Tomita, Noriko; Dohmae, Naoshi; Hashimoto, Yohichi; Takio, Koji

2004-12-24

Flammutoxin (FTX), a 31-kDa pore-forming cytolysin from Flammulina velutipes, is specifically expressed during the fruiting body formation. We cloned and expressed the cDNA encoding a 272-residue protein with an identical N-terminal sequence with that of FTX but failed to obtain hemolytically active protein. This, together with the presence of multiple FTX family proteins in the mushroom, prompted us to determine the complete primary structure of FTX by protein sequence analysis. The N-terminal 72 and C-terminal 107 residues were sequenced by Edman degradation of the fragments generated from the alkylated FTX by enzymatic digestions with Achromobacter protease I or Staphylococcus aureus V8 protease and by chemical cleavages with CNBr, hydroxylamine, or 1% formic acid. The central part of FTX was sequenced with a surface-adhesive 7-kDa fragment, which was generated by a tryptic digestion of FTX and recovered by rinsing the wall of a test tube with 6 M guanidine HCl. The 7-kDa peptide was cleaved with 12 M HCl, thermolysin, or S. aureus V8 protease to produce smaller peptides for sequence analysis. As a result, FTX consisted of 251 residues, and protein and nucleotide sequences were in accord except for the lack of the initial Met and the C-terminal 20 residues in protein. Recombinant FTX (rFTX) with or without the C-terminal 20 residues (rFTX271 or rFTX251, respectively) was prepared to study the maturation process of FTX. Like natural FTX, rFTX251 existed as a monomer in solution and assembled into an SDS-stable, ring-shaped pore complex on human erythrocytes, causing hemolysis. In contrast, rFTX271, existing as a dimer in solution, bound to the cells but failed to form pore complex. The dimeric rFTX271 was converted to hemolytically active monomers upon the cleavage between Lys(251) and Met(252) by trypsin.
A chondroitin sulfate chain attached to the bone dentin matrix protein 1 NH2-terminal fragment.

PubMed

Qin, Chunlin; Huang, Bingzhen; Wygant, James N; McIntyre, Bradley W; McDonald, Charles H; Cook, Richard G; Butler, William T

2006-03-24

Dentin matrix protein 1 (DMP1) is an acidic noncollagenous protein shown by gene ablations to be critical for the proper mineralization of bone and dentin. In the extracellular matrix of these tissues DMP1 is present as fragments representing the NH2-terminal (37 kDa) and COOH-terminal (57 kDa) portions of the cDNA-deduced amino acid sequence. During our separation of bone noncollagenous proteins, we observed a high molecular weight, DMP1-related component (designated DMP1-PG). We purified DMP1-PG with a monoclonal anti-DMP1 antibody affinity column. Amino acid analysis and Edman degradation of tryptic peptides proved that the core protein for DMP1-PG is the 37-kDa fragment of DMP1. Chondroitinase treatments demonstrated that the slower migration rate of DMP1-PG is due to the presence of glycosaminoglycan. Quantitative disaccharide analysis indicated that the glycosaminoglycan is made predominantly of chondroitin 4-sulfate. Further analysis on tryptic peptides led us to conclude that a single glycosaminoglycan chain is linked to the core protein via Ser74, located in the Ser74-Gly75 dipeptide, an amino acid sequence specific for the attachment of glycosaminoglycans. Our findings show that in addition to its existence as a phosphoprotein, the NH2-terminal fragment from DMP1 occurs as a proteoglycan. Amino acid sequence alignment analysis showed that the Ser74-Gly75 dipeptide and its flanking regions are highly conserved among a wide range of species from caiman to the Homo sapiens, indicating that this glycosaminoglycan attachment domain has survived an extremely long period of evolution pressure, suggesting that the glycosaminoglycan may be critical for the basic biological functions of DMP1.
Characterization of Am IT, an anti-insect β-toxin isolated from the venom of scorpion Androctonus mauretanicus.

PubMed

Oukkache, Naoual; ElJaoudi, Rachid; Chgoury, Fatima; Rocha, Marisa Teixeira; Sabatier, Jean-Marc

2015-06-25

In the present study, a 'novel' toxin, called Am IT from the venom of scorpion Androctonus mauretanicus is isolated and characterized. A detailed analysis of the action of Am IT on insect axonal sodium currents is reported. Am IT was purified through gel filtration followed by C18 reversed-phase HPLC. Toxicity of Am IT in vivo was assessed on male German cockroach (Blattella germanica) larvae and C57/BL6 mice. Cross-reactivity of Am IT with two β-toxins was evidenced using (125)I-iodinated toxin-based radioimmunoassays with synaptosomal preparations from rat brain. The complete amino acid sequence of Am IT was finally determined by Edman sequencing. Am IT was observed to compete with AaH IT4 purified from the venom of scorpion Androctonus australis in binding assays. It was recognized by an antibody raised against a β-type toxin, which indicated some structural similarity with β-toxins (or related toxin family). The 'novel' toxin exhibited dual activity since it competed with anti-mammal toxins in binding assays as well as showed contracting activity to insect. The toxin competed with radio-labeled β-toxin Css IV by binding to Na(+) channels of rat brain synaptosomes. Analysis of toxin amino acid sequences showed that Am IT shares high structural identity (92%) with AaH IT4. In conclusion, Am IT not only reveals an anti-insect compound properties secreted by 'Old World' scorpions, paralyzing insect larvae by binding to Na(+) channels on larvae's nerve-cell membranes, but also exerts toxic activity in mice, which is similar to anti-mammal toxins from 'New World' scorpions (North and South Americas). Therefore, Am IT appears to be structurally and functionally similar to AaH IT4.
Skin secretion peptides: the molecular facet of the deimatic behavior of the four-eyed frog, Physalaemus nattereri (Anura, Leptodactylidae).

PubMed

Barbosa, Eder Alves; Iembo, Tatiane; Martins, Graciella Ribeiro; Silva, Luciano Paulino; Prates, Maura Vianna; Andrade, Alan Carvalho; Bloch, Carlos

2015-11-15

Amphibians can produce a large amount of bioactive peptides over the skin. In order to map the precise tissue localization of these compounds and evaluate their functions, mass spectrometry imaging (MSI) and gene expression studies were used to investigate a possible correlation between molecules involved in the antimicrobial defense mechanisms and anti-predatory behavior by Physalaemus nattereri. Total skin secretion of P. nattereri was analyzed by classical Protein Chemistry and proteomic techniques. Intact inguinal macroglands were dissected from the rest of the skin and both tissues were analyzed by MSI and real-time polymerase chain reaction (RT-PCR) experiments. Peptides were primarily identified by de novo sequencing, automatic Edman degradation and cDNA data. Fifteen bradykinin (BK)-related peptides and two antimicrobial peptides were sequenced and mapped by MSI on the inguinal macrogland and the rest of P. nattereri skin. RT-PCR results revealed that BK-related peptide levels of expression were about 30,000 times higher on the inguinal macroglands than on the any other region of the skin, whilst antimicrobial peptide ions appear to be evenly distributed in both investigated regions. The presence of antimicrobial peptides in all investigated tissue regions is in accordance with the defensive role against microorganisms thoroughly demonstrated in the literature, whereas BK-related molecules are largely found on the inguinal macroglands suggesting an intriguing link between their noxious activities against potential predators of P. nattereri and the frog's deimatic behavior. Copyright © 2015 John Wiley & Sons, Ltd.
AcT-2: a novel myotropic and antimicrobial type 2 tryptophyllin from the skin secretion of the Central American red-eyed leaf frog, Agalychnis callidryas.

PubMed

Ge, Lilin; Lyu, Peng; Zhou, Mei; Zhang, Huiling; Wan, Yuantai; Li, Bin; Li, Renjie; Wang, Lei; Chen, Tianbao; Shaw, Chris

2014-01-01

Tryptophyllins are a diverse family of amphibian peptides originally found in extracts of phyllomedusine frog skin by chemical means. Their biological activities remain obscure. Here we describe the isolation and preliminary pharmacological characterization of a novel type 2 tryptophyllin, named AcT-2, from the skin secretion of the red-eyed leaf frog, Agalychnis callidryas. The peptide was initially identified during smooth muscle pharmacological screening of skin secretion HPLC fractions and the unique primary structure--GMRPPWF-NH2--was established by both Edman degradation and electrospray MS/MS fragmentation sequencing. A. cDNA encoding the biosynthetic precursor of AcT-2 was successfully cloned from a skin secretion-derived cDNA library by means of RACE PCR and this contained an open-reading frame consisting of 62 amino acid residues with a single AcT-2 encoding sequence located towards the C-terminus. A synthetic replicate of AcT-2 was found to relax arterial smooth muscle (EC50 = 5.1 nM) and to contract rat urinary bladder smooth muscle (EC50 = 9.3 μ M). The peptide could also inhibit the growth of the microorganisms, Staphylococcus aureus, (MIC = 256 mg/L) Escherichia coli (MIC = 512 mg/L), and Candida albicans (128 mg/L). AcT-2 is thus the first amphibian skin tryptophyllin found to possess both myotropic and antimicrobial activities.
Perlinhibin, a Cysteine-, Histidine-, and Arginine-Rich Miniprotein from Abalone (Haliotis laevigata) Nacre, Inhibits In Vitro Calcium Carbonate Crystallization

PubMed Central

Mann, Karlheinz; Siedler, Frank; Treccani, Laura; Heinemann, Fabian; Fritz, Monika

2007-01-01

We have isolated a 4.785 Da protein from the nacreous layer of the sea snail Haliotis laevigata (greenlip abalone) shell after demineralization with acetic acid. The sequence of 41 amino acids was determined by Edman degradation supported by mass spectrometry. The most abundant amino acids were cysteine (19.5%), histidine (17%), and arginine (14.6%). The positively charged amino acids were almost counterbalanced by negatively charged ones resulting in a calculated isoelectric point of 7.86. Atomic-force microscopy studies of the interaction of the protein with calcite surfaces in supersaturated calcium carbonate solution or calcium chloride solution showed that the protein bound specifically to calcite steps, inhibiting further crystal growth at these sites in carbonate solution and preventing crystal dissolution when carbonate was substituted with chloride. Therefore this protein was named perlinhibin. X-ray diffraction investigation of the crystal after atomic-force microscopy growth experiments showed that the formation of aragonite was induced on the calcite substrate around holes caused by perlinhibin crystal-growth inhibition. The strong interaction of the protein with calcium carbonate was also shown by vapor diffusion crystallization. In the presence of the protein, the crystal surfaces were covered with holes due to protein binding and local inhibition of crystal growth. In addition to perlinhibin, we isolated and sequenced a perlinhibin-related protein, indicating that perlinhibin may be a member of a family of closely related proteins. PMID:17496038
Structural characterization and comparative modeling of PD-Ls 1-3, type 1 ribosome-inactivating proteins from summer leaves of Phytolacca dioica L.

PubMed

Di Maro, Antimo; Chambery, Angela; Carafa, Vincenzo; Costantini, Susan; Colonna, Giovanni; Parente, Augusto

2009-03-01

The amino acid sequence and glycan structure of PD-L1, PD-L2 and PD-L3, type 1 ribosome-inactivating proteins isolated from Phytolacca dioica L. leaves, were determined using a combined approach based on peptide mapping, Edman degradation and ESI-Q-TOF MS in precursor ion discovery mode. The comparative analysis of the 261 amino acid residue sequences showed that PD-L1 and PD-L2 have identical primary structure, as it is the case of PD-L3 and PD-L4. Furthermore, the primary structure of PD-Ls 1-2 and PD-Ls 3-4 have 81.6% identity (85.1% similarity). The ESI-Q-TOF MS analysis confirmed that PD-Ls 1-3 were glycosylated at different sites. In particular, PD-L1 contained three glycidic chains with the well known paucidomannosidic structure (Man)(3) (GlcNAc)(2) (Fuc)(1) (Xyl)(1) linked to Asn10, Asn43 and Asn255. PD-L2 was glycosylated at Asn10 and Asn43, and PD-L3 was glycosylated only at Asn10. PD-L4 was confirmed to be not glycosylated. Despite an overall high structural similarity, the comparative modeling of PD-L1, PD-L2, PD-L3 and PD-L4 has shown potential influences of the glycidic chains on their adenine polynucleotide glycosylase activity on different substrates.
Olfactory Proteins Mediating Chemical Communication in the Navel Orangeworm Moth, Amyelois transitella

PubMed Central

Leal, Walter S.; Ishida, Yuko; Pelletier, Julien; Xu, Wei; Rayo, Josep; Xu, Xianzhong; Ames, James B.

2009-01-01

Background The navel orangeworm, Amyelois transitella Walker (Lepidoptera: Pyralidae), is the most serious insect pest of almonds and pistachios in California for which environmentally friendly alternative methods of control — like pheromone-based approaches — are highly desirable. Some constituents of the sex pheromone are unstable and could be replaced with parapheromones, which may be designed on the basis of molecular interaction of pheromones and pheromone-detecting olfactory proteins. Methodology By analyzing extracts from olfactory and non-olfactory tissues, we identified putative olfactory proteins, obtained their N-terminal amino acid sequences by Edman degradation, and used degenerate primers to clone the corresponding cDNAs by SMART RACE. Additionally, we used degenerate primers based on conserved sequences of known proteins to fish out other candidate olfactory genes. We expressed the gene encoding a newly identified pheromone-binding protein, which was analyzed by circular dichroism, fluorescence, and nuclear magnetic resonance, and used in a binding assay to assess affinity to pheromone components. Conclusion We have cloned nine cDNAs encoding olfactory proteins from the navel orangeworm, including two pheromone-binding proteins, two general odorant-binding proteins, one chemosensory protein, one glutathione S-transferase, one antennal binding protein X, one sensory neuron membrane protein, and one odorant receptor. Of these, AtraPBP1 is highly enriched in male antennae. Fluorescence, CD and NMR studies suggest a dramatic pH-dependent conformational change, with high affinity to pheromone constituents at neutral pH and no binding at low pH. PMID:19789654
Processing of an anglerfish somatostatin precursor to a hydroxylysine-containing somatostatin 28.

PubMed Central

Spiess, J; Noe, B D

1985-01-01

A novel 28-residue somatostatin (SS) has been isolated from anglerfish pancreatic islets and characterized by complete Edman degradation, peptide mapping, and amino acid analysis. The primary structure of this anglerfish SS-28 (aSS-28) containing hydroxylysine (Hyl) was established to be H-Ser-Val-Asp-Ser-Thr-Asn-Asn-Leu-Pro-Pro-Arg-Glu-Arg-Lys-Ala-Gly-Cys- Lys-Asn-Phe-Tyr-Trp-Hyl-Gly-Phe-Thr-Ser-Cys-OH. This sequence (with the exception of hydroxylysine-23, which is replaced by lysine) is identical to the sequence of the COOH-terminal 28 residues of prepro-SS II predicted on the basis of cDNA analysis [Hobart, P., Crawford, R., Shen, L., Pictet, R. & Rutter, W. J. (1980) Nature (London) 288, 137-141]. This is the first instance in which hydroxylysine (to date characteristically observed in collagen or collagen-like structures) has been found in a potential regulatory peptide. Chromatographic characterization of peptides, radiolabeled in islet culture, revealed that aSS-28 contained 10-12% of the radioactivity incorporated into the 8000- to 1000-dalton SS-like polypeptides, whereas 88-90% of this radioactivity was detected in anglerfish SS-14. It appears probable that aSS-28 represents the predominant primary cleavage product derived from prepro-SS II by cleavage at the COOH-terminal side of a single arginine. Based on knowledge of the collagen biosynthesis, it is speculated that hydroxylation may take place as an early post-translational event. Images PMID:2857489
Methods and devices for protein assays

DOEpatents

Chhabra, Swapnil [San Jose, CA; Cintron, Jose M [Indianapolis, IN; Shediac, Renee [Oakland, CA

2009-11-03

Methods and devices for protein assays based on Edman degradation in microfluidic channels are disclosed herein. As disclosed, the cleaved amino acid residues may be immobilized in an array format and identified by detectable labels, such as antibodies, which specifically bind given amino acid residues. Alternatively, the antibodies are immobilized in an array format and the cleaved amino acids are labeled identified by being bound by the antibodies in the array.
Ocellatin peptides from the skin secretion of the South American frog Leptodactylus labyrinthicus (Leptodactylidae): characterization, antimicrobial activities and membrane interactions.

PubMed

Gusmão, Karla A G; Dos Santos, Daniel M; Santos, Virgílio M; Cortés, María Esperanza; Reis, Pablo V M; Santos, Vera L; Piló-Veloso, Dorila; Verly, Rodrigo M; de Lima, Maria Elena; Resende, Jarbas M

2017-01-01

The availability of antimicrobial peptides from several different natural sources has opened an avenue for the discovery of new biologically active molecules. To the best of our knowledge, only two peptides isolated from the frog Leptodactylus labyrinthicus , namely pentadactylin and ocellatin-F1, have shown antimicrobial activities. Therefore, in order to explore the antimicrobial potential of this species, we have investigated the biological activities and membrane interactions of three peptides isolated from the anuran skin secretion. Three peptide primary structures were determined by automated Edman degradation. These sequences were prepared by solid-phase synthesis and submitted to activity assays against gram-positive and gram-negative bacteria and against two fungal strains. The hemolytic properties of the peptides were also investigated in assays with rabbit blood erythrocytes. The conformational preferences of the peptides and their membrane interactions have been investigated by circular dichroism spectroscopy and liposome dye release assays. The amino acid compositions of three ocellatins were determined and the sequences exhibit 100% homology for the first 22 residues (ocellatin-LB1 sequence). Ocellatin-LB2 carries an extra Asn residue and ocellatin-F1 extra Asn-Lys-Leu residues at C-terminus. Ocellatin-F1 presents a stronger antibiotic potential and a broader spectrum of activities compared to the other peptides. The membrane interactions and pore formation capacities of the peptides correlate directly with their antimicrobial activities, i.e., ocellatin-F1 > ocellatin-LB1 > ocellatin-LB2. All peptides acquire high helical contents in membrane environments. However, ocellatin-F1 shows in average stronger helical propensities. The obtained results indicate that the three extra amino acid residues at the ocellatin-F1 C-terminus play an important role in promoting stronger peptide-membrane interactions and antimicrobial properties. The extra Asn-23 residue present in ocellatin-LB2 sequence seems to decrease its antimicrobial potential and the strength of the peptide-membrane interactions.

Antagonistic Activities of Novel Peptides from Bacillus amyloliquefaciens PT14 against Fusarium solani and Fusarium oxysporum.

PubMed

Kim, Young Gwon; Kang, Hee Kyoung; Kwon, Kee-Deok; Seo, Chang Ho; Lee, Hyang Burm; Park, Yoonkyung

2015-12-09

Bacillus species have recently drawn attention due to their potential use in the biological control of fungal diseases. This paper reports on the antifungal activity of novel peptides isolated from Bacillus amyloliquefaciens PT14. Reverse-phase high-performance liquid chromatography revealed that B. amyloliquefaciens PT14 produces five peptides (PT14-1, -2, -3, -4a, and -4b) that exhibit antifungal activity but are inactive against bacterial strains. In particular, PT14-3 and PT14-4a showed broad-spectrum antifungal activity against Fusarium solani and Fusarium oxysporum. The PT14-4a N-terminal amino acid sequence was identified through Edman degradation, and a BLAST homology analysis showed it not to be identical to any other protein or peptide. PT14-4a displayed strong fungicidal activity with minimal inhibitory concentrations of 3.12 mg/L (F. solani) and 6.25 mg/L (F. oxysporum), inducing severe morphological deformation in the conidia and hyphae. On the other hand, PT14-4a had no detectable hemolytic activity. This suggests PT14-4a has the potential to serve as an antifungal agent in clinical therapeutic and crop-protection applications.
The substrate degradome of meprin metalloproteases reveals an unexpected proteolytic link between meprin β and ADAM10.

PubMed

Jefferson, Tamara; Auf dem Keller, Ulrich; Bellac, Caroline; Metz, Verena V; Broder, Claudia; Hedrich, Jana; Ohler, Anke; Maier, Wladislaw; Magdolen, Viktor; Sterchi, Erwin; Bond, Judith S; Jayakumar, Arumugam; Traupe, Heiko; Chalaris, Athena; Rose-John, Stefan; Pietrzik, Claus U; Postina, Rolf; Overall, Christopher M; Becker-Pauly, Christoph

2013-01-01

The in vivo roles of meprin metalloproteases in pathophysiological conditions remain elusive. Substrates define protease roles. Therefore, to identify natural substrates for human meprin α and β we employed TAILS (terminal amine isotopic labeling of substrates), a proteomics approach that enriches for N-terminal peptides of proteins and cleavage fragments. Of the 151 new extracellular substrates we identified, it was notable that ADAM10 (a disintegrin and metalloprotease domain-containing protein 10)-the constitutive α-secretase-is activated by meprin β through cleavage of the propeptide. To validate this cleavage event, we expressed recombinant proADAM10 and after preincubation with meprin β, this resulted in significantly elevated ADAM10 activity. Cellular expression in murine primary fibroblasts confirmed activation. Other novel substrates including extracellular matrix proteins, growth factors and inhibitors were validated by western analyses and enzyme activity assays with Edman sequencing confirming the exact cleavage sites identified by TAILS. Cleavages in vivo were confirmed by comparing wild-type and meprin(-/-) mice. Our finding of cystatin C, elafin and fetuin-A as substrates and natural inhibitors for meprins reveal new mechanisms in the regulation of protease activity important for understanding pathophysiological processes.
Personal Protective Measures Against Insects and Other Arthropods of Military Significance

DTIC Science & Technology

2009-10-01

responsibility, although it is also an important adjunct to unit- level and higher echelon preventive medicine countermeasures. Military personnel must be aware...fabric treatment level of 0.52% weight by weight of active ingredient. (d) Several steps are essential in properly using this treatment method...Control as a Force Multiplier. Defense 90. pp. 26-35. Elridge, B.F. and Edman, J.D. 2004. Medical Entomology: A Textbook on Public Health and
The primary structure of fatty-acid-binding protein from nurse shark liver. Structural and evolutionary relationship to the mammalian fatty-acid-binding protein family.

PubMed

Medzihradszky, K F; Gibson, B W; Kaur, S; Yu, Z H; Medzihradszky, D; Burlingame, A L; Bass, N M

1992-02-01

The primary structure of a fatty-acid-binding protein (FABP) isolated from the liver of the nurse shark (Ginglymostoma cirratum) was determined by high-performance tandem mass spectrometry (employing multichannel array detection) and Edman degradation. Shark liver FABP consists of 132 amino acids with an acetylated N-terminal valine. The chemical molecular mass of the intact protein determined by electrospray ionization mass spectrometry (Mr = 15124 +/- 2.5) was in good agreement with that calculated from the amino acid sequence (Mr = 15121.3). The amino acid sequence of shark liver FABP displays significantly greater similarity to the FABP expressed in mammalian heart, peripheral nerve myelin and adipose tissue (61-53% sequence similarity) than to the FABP expressed in mammalian liver (22% similarity). Phylogenetic trees derived from the comparison of the shark liver FABP amino acid sequence with the members of the mammalian fatty-acid/retinoid-binding protein gene family indicate the initial divergence of an ancestral gene into two major subfamilies: one comprising the genes for mammalian liver FABP and gastrotropin, the other comprising the genes for mammalian cellular retinol-binding proteins I and II, cellular retinoic-acid-binding protein myelin P2 protein, adipocyte FABP, heart FABP and shark liver FABP, the latter having diverged from the ancestral gene that ultimately gave rise to the present day mammalian heart-FABP, adipocyte FABP and myelin P2 protein sequences. The sequence for intestinal FABP from the rat could be assigned to either subfamily, depending on the approach used for phylogenetic tree construction, but clearly diverged at a relatively early evolutionary time point. Indeed, sequences proximately ancestral or closely related to mammalian intestinal FABP, liver FABP, gastrotropin and the retinoid-binding group of proteins appear to have arisen prior to the divergence of shark liver FABP and should therefore also be present in elasmobranchs. The presence in shark liver of an FABP which differs substantially in primary structure from mammalian liver FABP, while being closely related to the FABP expressed in mammalian heart muscle, peripheral nerve myelin and adipocytes, opens a further dimension regarding the question of the existence of structure-dependent and tissue-specific specialization of FABP function in lipid metabolism.
Venom gland transcriptomic and venom proteomic analyses of the scorpion Megacormus gertschi Díaz-Najera, 1966 (Scorpiones: Euscorpiidae: Megacorminae).

PubMed

Santibáñez-López, Carlos E; Cid-Uribe, Jimena I; Zamudio, Fernando Z; Batista, Cesar V F; Ortiz, Ernesto; Possani, Lourival D

2017-07-01

The soluble venom from the Mexican scorpion Megacormus gertschi of the family Euscorpiidae was obtained and its biological effects were tested in several animal models. This venom is not toxic to mice at doses of 100 μg per 20 g of mouse weight, while being lethal to arthropods (insects and crustaceans), at doses of 20 μg (for crickets) and 100 μg (for shrimps) per animal. Samples of the venom were separated by high performance liquid chromatography and circa 80 distinct chromatographic fractions were obtained from which 67 components have had their molecular weights determined by mass spectrometry analysis. The N-terminal amino acid sequence of seven protein/peptides were obtained by Edman degradation and are reported. Among the high molecular weight components there are enzymes with experimentally-confirmed phospholipase activity. A pair of telsons from this scorpion species was dissected, from which total RNA was extracted and used for cDNA library construction. Massive sequencing by the Illumina protocol, followed by de novo assembly, resulted in a total of 110,528 transcripts. From those, we were able to annotate 182, which putatively code for peptides/proteins with sequence similarity to previously-reported venom components available from different protein databases. Transcripts seemingly coding for enzymes showed the richest diversity, with 52 sequences putatively coding for proteases, 20 for phospholipases, 8 for lipases and 5 for hyaluronidases. The number of different transcripts potentially coding for peptides with sequence similarity to those that affect ion channels was 19, for putative antimicrobial peptides 19, and for protease inhibitor-like peptides, 18. Transcripts seemingly coding for other venom components were identified and described. The LC/MS analysis of a trypsin-digested venom aliquot resulted in 23 matches with the translated transcriptome database, which validates the transcriptome. The proteomic and transcriptomic analyses reported here constitute the first approach to study the venom components from a scorpion species belonging to the family Euscorpiidae. The data certainly show that this venom is different from all the ones described thus far in the literature. Copyright © 2017 Elsevier Ltd. All rights reserved.
Identification of cysteine-644 as the covalent site of attachment of dexamethasone 21-mesylate to murine glucocorticoid receptors in WEHI-7 cells

DOE Office of Scientific and Technical Information (OSTI.GOV)

Smith, L.I.; Bodwell, J.E.; Mendel, D.B.

1988-05-17

Dexamethasone 21-mesylate is a highly specific synthetic glucocorticoid derivative that binds covalently to glucocorticoid receptors via sulfhydryl groups. The authors have identified the amino acid that reacts with the dexamethasone 21-mesylate by using enzymatic digestion and microsequencing for radiolabel. Nonactivated glucocorticoid receptors obtained from labeling intact WEHI-7 mouse thymoma cells with (/sup 3/H)dexamethasone 21-mesylate were immunopurified and analyzed by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. Trypsin digestion followed by reversed-phase high-performance liquid chromatography (reversed-phase HPLC) produced a single (/sup 3/H)dexamethasone 21-mesylate labeled peptide. Automated Edman degradation of this peptide revealed that the (/sup 3/H)dexamethasone 21-mesylate was located at position 5 frommore » the amino terminus. Dual-isotope labeling studies with (/sup 3/H)dexamethasone 21-mesylate and (/sup 35/S)methionine demonstrated that this peptide contained methionine. Staphylococcus aureus V8 protease digestion of (/sup 3/H)dexamethasone 21-mesylate labeled steroid-binding subunits generated a different radiolabeled peptide containing label at position 7 from the amino terminus. On the basis of the published amino acid sequence of the murine glucocorticoid receptor, their data clearly identify cysteine-644 as the single residue in the steroid-binding domain that covalently binds dexamethasone 21-mesylate. They have confirmed this finding by demonstrating that a synthetic peptide representing the amino acid sequence 640-650 of the murine glucocorticoid receptor behaves in an identical manner on reversed-phase HPLC as the trypsin-generated peptide from intact cells.« less
When is Mass Spectrometry Combined with Affinity Approaches Essential? A Case Study of Tyrosine Nitration in Proteins

NASA Astrophysics Data System (ADS)

Petre, Brînduşa-Alina; Ulrich, Martina; Stumbaum, Mihaela; Bernevic, Bogdan; Moise, Adrian; Döring, Gerd; Przybylski, Michael

2012-11-01

Tyrosine nitration in proteins occurs under physiologic conditions and is increased at disease conditions associated with oxidative stress, such as inflammation and Alzheimer's disease. Identification and quantification of tyrosine-nitrations are crucial for understanding nitration mechanism(s) and their functional consequences. Mass spectrometry (MS) is best suited to identify nitration sites, but is hampered by low stabilities and modification levels and possible structural changes induced by nitration. In this insight, we discuss methods for identifying and quantifying nitration sites by proteolytic affinity extraction using nitrotyrosine (NT)-specific antibodies, in combination with electrospray-MS. The efficiency of this approach is illustrated by identification of specific nitration sites in two proteins in eosinophil granules from several biological samples, eosinophil-cationic protein (ECP) and eosinophil-derived neurotoxin (EDN). Affinity extraction combined with Edman sequencing enabled the quantification of nitration levels, which were found to be 8 % and 15 % for ECP and EDN, respectively. Structure modeling utilizing available crystal structures and affinity studies using synthetic NT-peptides suggest a tyrosine nitration sequence motif comprising positively charged residues in the vicinity of the NT- residue, located at specific surface- accessible sites of the protein structure. Affinities of Tyr-nitrated peptides from ECP and EDN to NT-antibodies, determined by online bioaffinity- MS, provided nanomolar KD values. In contrast, false-positive identifications of nitrations were obtained in proteins from cystic fibrosis patients upon using NT-specific antibodies, and were shown to be hydroxy-tyrosine modifications. These results demonstrate affinity- mass spectrometry approaches to be essential for unequivocal identification of biological tyrosine nitrations.
Thionin-like peptides from Capsicum annuum fruits with high activity against human pathogenic bacteria and yeasts.

PubMed

Taveira, Gabriel B; Mathias, Luciana S; da Motta, Olney V; Machado, Olga L T; Rodrigues, Rosana; Carvalho, André O; Teixeira-Ferreira, André; Perales, Jonas; Vasconcelos, Ilka M; Gomes, Valdirene M

2014-01-01

Plants defend themselves against pathogens with production of antimicrobial peptides (AMPs). Herein we describe the discovery of a new antifungal and antibacterial peptide from fruits of Capsicum annuum that showed similarity to an already well characterized family of plant AMPs, thionins. Other fraction composed of two peptides, in which the major peptide also showed similarity to thionins. Among the obtained fractions, fraction 1, which is composed of a single peptide of 7 kDa, was sequenced by Edman method and its comparative sequence analysis in database (nr) showed similarity to thionin-like peptides. Tests against microorganisms, fraction 1 presented inhibitory activity to the cells of yeast Saccharomyces cerevisiae, Candida albicans, and Candida tropicalis and caused growth reduction to the bacteria species Escherichia coli and Pseudomonas aeruginosa. Fraction 3 caused inhibitory activity only for C. albicans and C. tropicalis. This fraction was composed of two peptides of ∼7 and 10 kDa, and the main protein band correspondent to the 7 kDa peptide, also showed similarity to thionins. This plasma membrane permeabilization assay demonstrates that the peptides present in the fractions 1 and 3 induced changes in the membranes of all yeast strains, leading to their permeabilization. Fraction 1 was capable of inhibiting acidification of the medium of glucose-induced S. cerevisiae cells 78% after an incubation time of 30 min, and opposite result was obtained for C. albicans. Experiments demonstrate that the fraction 1 and 3 were toxic and induced changes in the membranes of all yeast strains, leading to their permeabilization. Copyright © 2013 Wiley Periodicals, Inc.
The adipokinetic hormone of the coleopteran suborder Adephaga: Structure, function, and comparison of distribution in other insects.

PubMed

Gäde, Gerd; Marco, Heather G

2017-07-01

The aim of the current study is to identify the adipokinetic hormone(s) (AKHs) of a basal suborder of the species-rich Coleoptera, the Adephaga, and possibly learn more about the ancestral AKH of beetles. Moreover, we wanted to compare the ancestral AKH with AKHs of more advanced beetles, of which a number are pest insects. This would allow us to assess whether AKH mimetics would be suitable as insecticides, that is, be harmful to the pest species but not to the beneficial species. Nine species of the Adephaga were investigated and all synthesize only one octapeptide in the corpus cardiacum, as revealed by Edman degradation sequencing techniques or by mass spectrometry. The amino acid sequence pGlu-Leu-Asn-Phe-Ser-Thr-Gly-Trp corresponds to Schgr-AKH-II that was first identified in the desert locust. It is assumed that Schgr-AKH-II-the peptide of a basal beetle clade-is the ancestral AKH for beetles. Some other beetle families, as well as some Hymenoptera (including honey bees) also contain this peptide, whereas most of the pest beetle species have different AKHs. This argues that those peptides and their receptors should be explored for developing mimetics with insecticidal properties. A scenario where Schgr-AKH-II (the only AKH of Adephaga) is used as basic molecular structure to derive almost all other known beetle AKHs via single step mutations is very likely, and supports the interpretation that Schgr-AKH-II is the ancestral AKH of Coleoptera. © 2017 Wiley Periodicals, Inc.
Identification and immunologic characterization of an allergen, alliin lyase, from garlic (Allium sativum).

PubMed

Kao, Shao-Hsuan; Hsu, Ching-Hsian; Su, Song-Nan; Hor, Wei-Ting; Chang T, Wen-Hong; Chow, Lu-Ping

2004-01-01

Garlic (Allium sativum) is one of the most common relishes used in cooking worldwide. Very few garlic allergens have been reported, and garlic allergy has been rarely studied. The aim of the study was to identify allergenic proteins in garlic and to investigate their importance in allergies to other Allium species (leek, shallot, and onion). A crude extract of garlic proteins was separated by SDS-PAGE and 2-dimensional electrophoresis; immunoblotting was then performed with the use of individual and pooled sera from patients with garlic allergy, and the major IgE-binding proteins were analyzed by amino acid sequencing and mass spectrometry. The putative allergens were further purified by chromatography; the antigenicity, allergenicity, and IgE-binding cross-reactivity of the purified protein were then studied by immunoblotting, periodate oxidation, skin tests, and IgE-binding inhibition assays. A major allergen, alliin lyase, was identified by mass spectrometry and Edman sequencing and purified to homogeneity through the use of a simple 2-step chromatographic method. Skin tests showed that the purified protein elicited IgE-mediated hypersensitive responses in patients with garlic allergy. Periodate oxidation showed that carbohydrate groups were involved in the antigenicity, allergenicity, and cross-reactivity. Garlic alliin lyase showed strong cross-reactivity with alliin lyases from other Allium species, namely leek, shallot, and onion. Alliin lyase was found to be a major garlic allergen in a garlic-allergic group of patients in Taiwan. The wide distribution of alliin lyase in Allium suggests it may be a new cross-reactive allergen.
Peroxisomal copper, zinc superoxide dismutase. Characterization of the isoenzyme from watermelon cotyledons.

PubMed Central

Bueno, P; Varela, J; Gimeénez-Gallego, G; del Río, L A

1995-01-01

The biochemical and immunochemical characterization of a superoxide dismutase (SOD, EC 1.15.1.1) from peroxisomal origin has been carried out. The enzyme is a Cu,Zn-containing SOD (CuZn-SOD) located in the matrix of peroxisomes from watermelon (Citrullus vulgaris Schrad.) cotyledons (L.M. Sandalio and L.A. del Río [1988] Plant Physiol 88: 1215-1218). The amino acid composition of the enzyme was determined. Analysis by reversed-phase high-performance liquid chromatography of the peroxisomal CuZn-SOD incubated with 6 M guanidine-HCl indicated that this enzyme contained a noncovalently bound chromophore group that was responsible for the absorbance peak of the native enzyme at 260 nm. The amino acid sequence of the peroxisomal CuZn-SOD was determined by Edman degradation. Comparison of its sequence with those reported for other plant SODs revealed homologies of about 70% with cytosolic CuZn-SODs and of 90% with chloroplastic CuZn-SODs. The peroxisomal SOD has a high thermal stability and resistance to inactivation by hydrogen peroxide. A polyclonal antibody was raised against peroxisomal CuZn-SOD, and by western blotting the antibody cross-reacted with plant CuZn-SODs but did not recognize either plant Mn-SOD or bacterial Fe-SOD. The antiSOD-immunoglobulin G showed a weak cross-reaction with bovine erythrocytes and liver CuZn-SODs, and also with cell-free extracts from trout liver. The possible function of this CuZn-SOD in the oxidative metabolism of peroxisomes is discussed. PMID:7630940
Antibacterial activity in bovine lactoferrin-derived peptides.

PubMed Central

Hoek, K S; Milne, J M; Grieve, P A; Dionysius, D A; Smith, R

1997-01-01

Several peptides sharing high sequence homology with lactoferricin B (Lf-cin B) were generated from bovine lactoferrin (Lf) with recombinant chymosin. Two peptides were copurified, one identical to Lf-cin B and another differing from Lf-cin B by the inclusion of a C-terminal alanine (lactoferricin). Two other peptides were copurified from chymosin-hydrolyzed Lf, one differing from Lf-cin B by the inclusion of C-terminal alanyl-leucine and the other being a heterodimer linked by a disulfide bond. These peptides were isolated in a single step from chymosin-hydrolyzed Lf by membrane ion-exchange chromatography and were purified by reverse-phase high-pressure liquid chromatography (HPLC). They were characterized by N-terminal Edman sequencing, mass spectrometry, and antibacterial activity determination. Pure lactoferricin, prepared from pepsin-hydrolyzed Lf, was purified by standard chromatography techniques. This peptide was analyzed against a number of gram-positive and gram-negative bacteria before and after reduction of its disulfide bond or cleavage after its single methionine residue and was found to inhibit the growth of all the test bacteria at a concentration of 8 microM or less. Subfragments of lactoferricin were isolated from reduced and cleaved peptide by reverse-phase HPLC. Subfragment 1 (residues 1 to 10) was active against most of the test microorganisms at concentrations of 10 to 50 microM. Subfragment 2 (residues 11 to 26) was active against only a few microorganisms at concentrations up to 100 microM. These antibacterial studies indicate that the activity of lactoferricin is mainly, but not wholly, due to its N-terminal region. PMID:8980754
Biochemical and Genetic Evidence that Enterococcus faecium L50 Produces Enterocins L50A and L50B, the sec-Dependent Enterocin P, and a Novel Bacteriocin Secreted without an N-Terminal Extension Termed Enterocin Q

PubMed Central

Cintas, Luis M.; Casaus, Pilar; Herranz, Carmen; Håvarstein, Leiv Sigve; Holo, Helge; Hernández, Pablo E.; Nes, Ingolf F.

2000-01-01

Enterococcus faecium L50 grown at 16 to 32°C produces enterocin L50 (EntL50), consisting of EntL50A and EntL50B, two unmodified non-pediocin-like peptides synthesized without an N-terminal leader sequence or signal peptide. However, the bacteriocin activity found in the cell-free culture supernatants following growth at higher temperatures (37 to 47°C) is not due to EntL50. A purification procedure including cation-exchange, hydrophobic interaction, and reverse-phase liquid chromatography has shown that the antimicrobial activity is due to two different bacteriocins. Amino acid sequences obtained by Edman degradation and DNA sequencing analyses revealed that one is identical to the sec-dependent pediocin-like enterocin P produced by E. faecium P13 (L. M. Cintas, P. Casaus, L. S. Håvarstein, P. E. Hernández, and I. F. Nes, Appl. Environ. Microbiol. 63:4321–4330, 1997) and the other is a novel unmodified non-pediocin-like bacteriocin termed enterocin Q (EntQ), with a molecular mass of 3,980. DNA sequencing analysis of a 963-bp region of E. faecium L50 containing the enterocin P structural gene (entP) and the putative immunity protein gene (entiP) reveals a genetic organization identical to that previously found in E. faecium P13. DNA sequencing analysis of a 1,448-bp region identified two consecutive but diverging open reading frames (ORFs) of which one, termed entQ, encodes a 34-amino-acid protein whose deduced amino acid sequence was identical to that obtained for EntQ by amino acid sequencing, showing that EntQ, similarly to EntL50A and EntL50B, is synthesized without an N-terminal leader sequence or signal peptide. The second ORF, termed orf2, was located immediately upstream of and in opposite orientation to entQ and encodes a putative immunity protein composed of 221 amino acids. Bacteriocin production by E. faecium L50 showed that EntP and EntQ are produced in the temperature range from 16 to 47°C and maximally detected at 47 and 37 to 47°C, respectively, while EntL50A and EntL50B are maximally synthesized at 16 to 25°C and are not detected at 37°C or above. PMID:11073927
Angiotensin I-Converting Enzyme Inhibitor Derived from Cross-Linked Oyster Protein

PubMed Central

Xie, Cheng-Liang; Kim, Jin-Soo; Ha, Jong-Myung; Choung, Se-Young

2014-01-01

Following cross-linking by microbial transglutaminase, modified oyster proteins were hydrolyzed to improve inhibitory activity against angiotensin-converting enzyme (ACE) inhibitory activity with the use of a single protease, or a combination of six proteases. The oyster hydrolysate with the lowest 50% ACE inhibitory concentration (IC50) of 0.40 mg/mL was obtained by two-step hydrolysis of the cross-linked oyster protein using Protamex and Neutrase. Five ACE inhibitory peptides were purified from the oyster hydrolysate using a multistep chromatographic procedure comprised of ion-exchange, size exclusion, and reversed-phase liquid chromatography. Their sequences were identified as TAY, VK, KY, FYN, and YA, using automated Edman degradation and mass spectrometry. These peptides were synthesized, and their IC50 values were measured to be 16.7, 29.0, 51.5, 68.2, and 93.9 μM, respectively. Toxicity of the peptides on the HepG2 cell line was not detected. The oyster hydrolysate also significantly decreased the systolic blood pressure of spontaneously hypertensive rats (SHR). The antihypertensive effect of the oyster hydrolysate on SHR was rapid and long-lasting, compared to commercially obtained sardine hydrolysate. These results suggest that the oyster hydrolysate could be a source of effective nutraceuticals against hypertension. PMID:25140307
Purification and characterization of an antibacterial and anti-inflammatory polypeptide from Arca subcrenata.

PubMed

Chen, Yuyan; Li, Chunlei; Zhu, Jianhua; Xie, Wangshi; Hu, Xianjing; Song, Liyan; Zi, Jiachen; Yu, Rongmin

2017-03-01

A polypeptide coded as PGC was isolated from Arca subcrenata muscle using ion exchange, Sephadex G-50 gel chromatography and RP-HPLC. PGC was identified to be a homogeneous compound by Native-PAGE and the purity was more than 98.9% measured by HPLC. The isoelectric point of PGC was determined to be 9.76 by IEF-PAGE. The molecular weight was determined to be 15,973.0Da by ESI-MS/MS. The conformational structure of PGC was characterized by UV-vis, FT-IR and CD spectroscopy. N terminal amino acid sequence of PGC was shown as PSVYDAAAQLTADVKKDLRDSWKVIGGDKKGNGVA by Edman degradation. The results demonstrated that there is a high degree of homology between PGC and the subunit from hemoglobin, and proposed that PGC is the depolymerized polypeptide of Hemoglobin I (HbI) from A. subcrenata. The evaluation of biological activities showed that the diameters of the inhibitory ring of PGC on Escherichia coli and Staphylococcus aureus were 14.5±0.44mm and 16.5±1.15mm, respectively. The IC 50 of inhibition rate for PGC on NO production was 9.60±0.71μg/mL. Therefore, PGC might be developed as one of potential antibacterial and anti-inflammatory agents. Copyright © 2016 Elsevier B.V. All rights reserved.
Asymmetric arginine dimethylation of heterogeneous nuclear ribonucleoprotein K by protein-arginine methyltransferase 1 inhibits its interaction with c-Src.

PubMed

Ostareck-Lederer, Antje; Ostareck, Dirk H; Rucknagel, Karl P; Schierhorn, Angelika; Moritz, Bodo; Huttelmaier, Stefan; Flach, Nadine; Handoko, Lusy; Wahle, Elmar

2006-04-21

Arginine methylation is a post-translational modification found in many RNA-binding proteins. Heterogeneous nuclear ribonucleoprotein K (hnRNP K) from HeLa cells was shown, by mass spectrometry and Edman degradation, to contain asymmetric N(G),N(G)-dimethylarginine at five positions in its amino acid sequence (Arg256, Arg258, Arg268, Arg296, and Arg299). Whereas these five residues were quantitatively modified, Arg303 was asymmetrically dimethylated in <33% of hnRNP K and Arg287 was monomethylated in <10% of the protein. All other arginine residues were unmethylated. Protein-arginine methyltransferase 1 was identified as the only enzyme methylating hnRNP K in vitro and in vivo. An hnRNP K variant in which the five quantitatively modified arginine residues had been substituted was not methylated. Methylation of arginine residues by protein-arginine methyltransferase 1 did not influence the RNA-binding activity, the translation inhibitory function, or the cellular localization of hnRNP K but reduced the interaction of hnRNP K with the tyrosine kinase c-Src. This led to an inhibition of c-Src activation and hnRNP K phosphorylation. These findings support the role of arginine methylation in the regulation of protein-protein interactions.
Conformational and Functional Effects Induced by D- and L-Amino Acid Epimerization on a Single Gene Encoded Peptide from the Skin Secretion of Hypsiboas punctatus

PubMed Central

de Magalhães, Mariana T. Q.; Barbosa, Eder A.; Prates, Maura V.; Verly, Rodrigo M.; Munhoz, Victor Hugo O.; de Araújo, Ivan E.; Bloch, Carlos

2013-01-01

Skin secretion of Hypsiboas punctatus is the source of a complex mixture of bioactive compounds where peptides and small proteins prevail, similarly to many other amphibians. Among dozens of molecules isolated from H. punctatus in a proteomic based approach, we report here the structural and functional studies of a novel peptide named Phenylseptin (FFFDTLKNLAGKVIGALT-NH2) that was purified as two naturally occurring D- and L-Phes configurations. The amino acid epimerization and C-terminal amidation for both molecules were confirmed by a combination of techniques including reverse-phase UFLC, ion mobility mass spectrometry, high resolution MS/MS experiments, Edman degradation, cDNA sequencing and solid-phase peptide synthesis. RMSD analysis of the twenty lowest-energy 1H NMR structures of each peptide revealed a major 90° difference between the two backbones at the first four N-terminal residues and substantial orientation changes of their respective side chains. These structural divergences were considered to be the primary cause of the in vitro quantitative differences in antimicrobial activities between the two molecules. Finally, both molecules elicited equally aversive reactions in mice when delivered orally, an effect that depended entirely on peripheral gustatory pathways. PMID:23565145
Structure and biological activities of eumenine mastoparan-AF (EMP-AF), a new mast cell degranulating peptide in the venom of the solitary wasp (Anterhynchium flavomarginatum micado).

PubMed

Konno, K; Hisada, M; Naoki, H; Itagaki, Y; Kawai, N; Miwa, A; Yasuhara, T; Morimoto, Y; Nakata, Y

2000-11-01

A new mast cell degranulating peptide, eumenine mastoparan-AF (EMP-AF), was isolated from the venom of the solitary wasp Anterhynchium flavomarginatum micado, the most common eumenine wasp found in Japan. The structure was analyzed by FAB-MS/MS together with Edman degradation, which was corroborated by solid-phase synthesis. The sequence of EMP-AF, Ile-Asn-Leu-Leu-Lys-Ile-Ala-Lys-Gly-Ile-Ile-Lys-Ser-Leu-NH(2), was similar to that of mastoparan, a mast cell degranulating peptide from a hornet venom; tetradecapeptide with C-terminus amidated and rich in hydrophobic and basic amino acids. In fact, EMP-AF exhibited similar activity to mastoparan in stimulating degranulation from rat peritoneal mast cells and RBL-2H3 cells. It also showed significant hemolytic activity in human erythrocytes. Therefore, this is the first example that a mast cell degranulating peptide is found in the solitary wasp venom. Besides the degranulation and hemolytic activity, EMP-AF also affects on neuromuscular transmission in the lobster walking leg preparation. Three analogs EMP-AF-1 approximately 3 were snythesized and biologically tested together with EMP-AF, resulting in the importance of the C-terminal amide structure for biological activities.
Studies on the plasma membrane H sup + -ATPase of oat roots: Preparation and assay, cytological localization, and sulfhydryl chemistry

DOE Office of Scientific and Technical Information (OSTI.GOV)

Katz, D.B.

1989-01-01

Biochemical and cytological studies were performed on the plasma membrane proton pump (H{sup +}-ATPase) of oat roots (Avena sativa cv. Stout). H{sup +}-ATPase activity in oat root plasma membranes is inhibited by N-ethylmaleimide (NEM), a covalent modifier of protein sulfhydryl groups. The rate of inhibition is reduced in the presence of ADP or MgADP. An M{sub r} = 100,000 plasma membrane polypeptide showed reduced labelling by ({sup 3}H)NEM in the presence of ADP. When tryptic peptides from ({sup 3}H)NEM-labeled M{sub r} = 100,000 polypeptide were separated by reverse-phase high-pressure liquid chromatography (HPLC), only one radioactive peak consistently showed labeling inmore » the presence of ADP. In order to determine the location and identity of the NEM-reactive residue, the radioactive peptide in this peak was further purified by HPLC. The amino acid sequence(s) in the resulting sample were then determined by Edman degradation on an automated gas-phase sequenator. The PTH-amino acids released at each cycle of the degradation were separated by HPLC. Analysis of the chromatograms suggested that the radio-labeled residue was located in a peptide of sequence V-E-N-Q-D-A-I-D-A-C{sup *}-M-V-G-M-L-A-D-P-K. The NEM-reactive residue was cysteine, based on the retention time of the radioactivity released. The ATP-hydrolyzing activity observed in electron micrographs by lead-precipitation of enzymically released inorganic phosphate was compared with that observed in in vitro assays of the soluble and plasma membrane fractions of oat root homogenates. Although an ATP-hydrolyzing activity was observed on the plasma membrane in the electron micrographs, its substrate specificity and inhibitor sensitivity was identical to that observed for phosphatase activity.« less
Enterococcus faecium F58, a bacteriocinogenic strain naturally occurring in Jben, a soft, farmhouse goat's cheese made in Morocco.

PubMed

Achemchem, F; Martínez-Bueno, M; Abrini, J; Valdivia, E; Maqueda, M

2005-01-01

Characterization of Ent F-58 produced by Enterococcus faecium strain F58 isolated from Jben, a soft, farmhouse goat's cheese manufactured without starter cultures. E. faecium strain F58 was isolated because of its broad inhibitory spectrum, including activity against food-borne pathogenic and spoilage bacteria. The antimicrobial substance was produced during the growth phase, with maximum production after 16-20 h of incubation at 30 degrees C, and was stable over a wide pH range (4-8) and at high temperatures (5 min at 100 degrees C). The enterocin was purified to homogeneity using cation exchange and hydrophobic interaction on C-18 and reverse-phase high-performance liquid chromatography. The activity was eluted as two individual active fractions (F-58A and F-58B) and matrix-assisted laser desorption/ionization time-of-flight mass spectrometry analysis showed masses of 5210.5 and 5234.3 Da respectively. Both peptides were partially sequenced by Edman degradation, and amino-acid sequencing revealed high similarity with enterocin L50 (I). PCR-amplified fragments containing the structural genes for F-58 A and B were located in a 22-kb plasmid harboured by this strain. We verified that it also holds the structural gene for P-like enterocin. E. faecium strain F58 from Jben cheese, a producer of enterocin L50, exerts an inhibitory effect against strains of genera such as Listeria, Staphylococcus, Clostridium, Brochothrix and Bacillus. Enterocin was characterized according to its functional and biological properties, purification to homogeneity and an analysis of its amino acid and genetic sequences. E. faecium strain F58 is a newly discovered producer of enterocin L50, the biotechnological characteristics of which indicate its potential for application as a protective agent against pathogens and spoilage bacteria in foods.

Purification and Characterization of Suicin 65, a Novel Class I Type B Lantibiotic Produced by Streptococcus suis.

PubMed

Vaillancourt, Katy; LeBel, Geneviève; Frenette, Michel; Fittipaldi, Nahuel; Gottschalk, Marcelo; Grenier, Daniel

2015-01-01

Bacteriocins are antimicrobial peptides of bacterial origin that are considered as a promising alternative to the use of conventional antibiotics. Recently, our laboratory reported the purification and characterization of two lantibiotics, suicin 90-1330 and suicin 3908, produced by the swine pathogen and zoonotic agent Streptococcus suis (serotype 2). In this study, a novel bacteriocin produced by S. suis has been identified and characterized. The producing strain S. suis 65 (serotype 2) was found to belong to the sequence type 28, that includes strains known to be weakly or avirulent in a mouse model. The bacteriocin, whose production was only possible following growth on solid culture medium, was purified to homogeneity by cationic exchange and reversed-phase high-pressure liquid chromatography. The bacteriocin, named suicin 65, was heat, pH and protease resistant. Suicin 65 was active against all S. suis isolates tested, including antibiotic resistant strains. Amino acid sequencing of the purified bacteriocin by Edman degradation revealed the presence of modified amino acids suggesting a lantibiotic. Using the partial sequence obtained, a blast was performed against published genomes of S. suis and allowed to identify a putative lantibiotic locus in the genome of S. suis 89-1591. From this genome, primers were designed and the gene cluster involved in the production of suicin 65 by S. suis 65 was amplified by PCR. Sequence analysis revealed the presence of ten open reading frames, including a duplicate of the structural gene. The structural genes (sssA and sssA') of suicin 65 encodes a 25-amino acid residue leader peptide and a 26-amino acid residue mature peptide yielding an active bacteriocin with a deducted molecular mass of 3,005 Da. Mature suicin 65 showed a high degree of identity with class I type B lantibiotics (globular structure) produced by Streptococcus pyogenes (streptococcin FF22; 84.6%), Streptococcus macedonicus (macedocin ACA-DC 198; 84.6%), and Lactococcus lactis subsp. lactis (lacticin 481; 74.1%). Further studies will evaluate the ability of suicin 65 or the producing strain to prevent experimental S. suis infections in pigs.
Purification and Characterization of Suicin 65, a Novel Class I Type B Lantibiotic Produced by Streptococcus suis

PubMed Central

Vaillancourt, Katy; LeBel, Geneviève; Frenette, Michel; Fittipaldi, Nahuel; Gottschalk, Marcelo; Grenier, Daniel

2015-01-01

Bacteriocins are antimicrobial peptides of bacterial origin that are considered as a promising alternative to the use of conventional antibiotics. Recently, our laboratory reported the purification and characterization of two lantibiotics, suicin 90–1330 and suicin 3908, produced by the swine pathogen and zoonotic agent Streptococcus suis (serotype 2). In this study, a novel bacteriocin produced by S. suis has been identified and characterized. The producing strain S. suis 65 (serotype 2) was found to belong to the sequence type 28, that includes strains known to be weakly or avirulent in a mouse model. The bacteriocin, whose production was only possible following growth on solid culture medium, was purified to homogeneity by cationic exchange and reversed-phase high-pressure liquid chromatography. The bacteriocin, named suicin 65, was heat, pH and protease resistant. Suicin 65 was active against all S. suis isolates tested, including antibiotic resistant strains. Amino acid sequencing of the purified bacteriocin by Edman degradation revealed the presence of modified amino acids suggesting a lantibiotic. Using the partial sequence obtained, a blast was performed against published genomes of S. suis and allowed to identify a putative lantibiotic locus in the genome of S. suis 89–1591. From this genome, primers were designed and the gene cluster involved in the production of suicin 65 by S. suis 65 was amplified by PCR. Sequence analysis revealed the presence of ten open reading frames, including a duplicate of the structural gene. The structural genes (sssA and sssA’) of suicin 65 encodes a 25-amino acid residue leader peptide and a 26-amino acid residue mature peptide yielding an active bacteriocin with a deducted molecular mass of 3,005 Da. Mature suicin 65 showed a high degree of identity with class I type B lantibiotics (globular structure) produced by Streptococcus pyogenes (streptococcin FF22; 84.6%), Streptococcus macedonicus (macedocin ACA-DC 198; 84.6%), and Lactococcus lactis subsp. lactis (lacticin 481; 74.1%). Further studies will evaluate the ability of suicin 65 or the producing strain to prevent experimental S. suis infections in pigs. PMID:26709705
Peptidomics of Three Bothrops Snake Venoms: Insights Into the Molecular Diversification of Proteomes and Peptidomes*

PubMed Central

Tashima, Alexandre K.; Zelanis, André; Kitano, Eduardo S.; Ianzer, Danielle; Melo, Robson L.; Rioli, Vanessa; Sant'anna, Sávio S.; Schenberg, Ana C. G.; Camargo, Antônio C. M.; Serrano, Solange M. T.

2012-01-01

Snake venom proteomes/peptidomes are highly complex and maintenance of their integrity within the gland lumen is crucial for the expression of toxin activities. There has been considerable progress in the field of venom proteomics, however, peptidomics does not progress as fast, because of the lack of comprehensive venom sequence databases for analysis of MS data. Therefore, in many cases venom peptides have to be sequenced manually by MS/MS analysis or Edman degradation. This is critical for rare snake species, as is the case of Bothrops cotiara (BC) and B. fonsecai (BF), which are regarded as near threatened with extinction. In this study we conducted a comprehensive analysis of the venom peptidomes of BC, BF, and B. jararaca (BJ) using a combination of solid-phase extraction and reversed-phase HPLC to fractionate the peptides, followed by nano-liquid chromatography-tandem MS (LC-MS/MS) or direct infusion electrospray ionization-(ESI)-MS/MS or MALDI-MS/MS analyses. We detected marked differences in the venom peptidomes and identified peptides ranging from 7 to 39 residues in length by de novo sequencing. Forty-four unique sequences were manually identified, out of which 30 are new peptides, including 17 bradykinin-potentiating peptides, three poly-histidine-poly-glycine peptides and interestingly, 10 l-amino acid oxidase fragments. Some of the new bradykinin-potentiating peptides display significant bradykinin potentiating activity. Automated database search revealed fragments from several toxins in the peptidomes, mainly from l-amino acid oxidase, and allowed the determination of the peptide bond specificity of proteinases and amino acid occurrences for the P4-P4′ sites. We also demonstrate that the venom lyophilization/resolubilization process greatly increases the complexity of the peptidome because of the imbalance caused to the venom proteome and the consequent activity of proteinases on venom components. The use of proteinase inhibitors clearly showed different outcomes in the peptidome characterization and suggested that degradomic-peptidomic analysis of snake venoms is highly sensitive to the conditions of sampling procedures. PMID:22869554
Purification and characterisation of an antifungal protein, MCha-Pr, from the intercellular fluid of bitter gourd (Momordica charantia) leaves.

PubMed

Zhang, Beibei; Xie, Chengjian; Wei, Yunming; Li, Jing; Yang, Xingyong

2015-03-01

An antifungal protein, designated MCha-Pr, was isolated from the intercellular fluid of bitter gourd (Momordica charantia) leaves during a screen for potent antimicrobial proteins from plants. The isolation procedure involved a combination of extraction, ammonium sulphate precipitation, gel filtration on Bio-Gel P-6, ion exchange chromatography on CM-Sephadex, an additional gel filtration on HiLoad 16/60 Superdex 30, and finally, HPLC on a SOURCE 5RPC column. Matrix-assisted laser desorption/ionisation time-of-flight mass spectrometry indicated that the protein had a molecular mass of 25733.46Da. Automated Edman degradation was used to determine the N-terminal sequence of MCha-Pr, and the amino acid sequence was identified as V-E-Y-T-I-T-G-N-A-G-N-T-P-G-G. The MCha-Pr protein has some similarity to the pathogenesis-related proteins from Atropa belladonna (deadly nightshade), Solanum tuberosum (potato), Ricinus communis (castor bean), and Nicotiana tabacum (tobacco). Analysis of the circular dichroism spectra indicated that MCha-Pr predominantly contains α-helix and β-sheet structures. MCha-Pr had inhibitory effects towards a variety of fungal species and the 50% inhibition of fungal growth (IC50) for Alternaria brassicae, Cercospora personata, Fusarium oxysporum, Mucor sp., and Rhizoctonia solani are 33 μM, 42 μM, 37 μM, 40 μM, and 48 μM, respectively. In addition, this antifungal protein can inhibit the germination of A. brassicae spores at 12.5 μM. These results suggest that MCha-Pr in bitter gourd leaves plays a protective role against phytopathogens and has a wide antimicrobial spectrum. Copyright © 2014 Elsevier Inc. All rights reserved.
Purification and Characterization of Plantaricin JLA-9: A Novel Bacteriocin against Bacillus spp. Produced by Lactobacillus plantarum JLA-9 from Suan-Tsai, a Traditional Chinese Fermented Cabbage.

PubMed

Zhao, Shengming; Han, Jinzhi; Bie, Xiaomei; Lu, Zhaoxin; Zhang, Chong; Lv, Fengxia

2016-04-06

Bacteriocins are ribosomally synthesized peptides with antimicrobial activity produced by numerous bacteria. A novel bacteriocin-producing strain, Lactobacillus plantarum JLA-9, isolated from Suan-Tsai, a traditional Chinese fermented cabbage, was screened and identified by its physiobiochemical characteristics and 16S rDNA sequence analysis. A new bacteriocin, designated plantaricin JLA-9, was purified using butanol extraction, gel filtration, and reverse-phase high-performance liquid chromatography. The molecular mass of plantaricin JLA-9 was shown to be 1044 Da by MALDI-TOF-MS analyses. The amino acid sequence of plantaricin JLA-9 was predicted to be FWQKMSFA by MALDI-TOF-MS/MS, which was confirmed by Edman degradation. This bacteriocin exhibited broad-spectrum antibacterial activity against Gram-positive and Gram-negative bacteria, especially Bacillus spp., high thermal stability (20 min, 121 °C), and narrow pH stability (pH 2.0-7.0). It was sensitive to α-chymotrypsin, pepsin, alkaline protease, and papain. The mode of action of this bacteriocin responsible for outgrowth inhibition of Bacillus cereus spores was studied. Plantaricin JLA-9 had no detectable effects on germination initiation over 1 h on monitoring the hydration, heat resistance, and 2,6-pyridinedicarboxylic acid (DPA) release of spores. Rather, germination initiation is a prerequisite for the action of plantaricin JLA-9. Plantaricin JLA-9 inhibited growth by preventing the establishment of oxidative metabolism and disrupting membrane integrity in germinating spores within 2 h. The results suggest that plantaricin JLA-9 has potential applications in the control of Bacillus spp. in the food industry.
Bioinsecticidal activity of a novel Kunitz trypsin inhibitor from Catanduva (Piptadenia moniliformis) seeds.

PubMed

Cruz, Ana C B; Massena, Fábio S; Migliolo, Ludovico; Macedo, Leonardo L P; Monteiro, Norberto K V; Oliveira, Adeliana S; Macedo, Francisco P; Uchoa, Adriana F; Grossi de Sá, Maria F; Vasconcelos, Ilka M; Murad, Andre M; Franco, Octavio L; Santos, Elizeu A

2013-09-01

The present study aims to provide new in vitro and in vivo biochemical information about a novel Kunitz trypsin inhibitor purified from Piptadenia moniliformis seeds. The purification process was performed using TCA precipitation, Trypsin-Sepharose and reversed-phase C18 HPLC chromatography. The inhibitor, named PmTKI, showed an apparent molecular mass of around 19 kDa, visualized by SDS-PAGE, which was confirmed by mass spectrometry MALDI-ToF demonstrating a monoisotopic mass of 19.296 Da. The inhibitor was in vitro active against trypsin, chymotrypsin and papain. Moreover, kinetic enzymatic studies were performed aiming to understand the inhibition mode of PmTKI, which competitively inhibits the target enzyme, presenting Ki values of 1.5 × 10(-8) and 3.0 × 10(-1) M against trypsin and chymotrypsin, respectively. Also, the inhibitory activity was assayed at different pH ranges, temperatures and reduction environments (DTT). The inhibitor was stable in all conditions maintaining an 80% residual activity. N-terminal sequence was obtained by Edman degradation and the primary sequence presented identity with members of Kunitz-type inhibitors from the same subfamily. Finally after biochemical characterization the inhibitory effect was evaluated in vitro on insect digestive enzymes from different orders, PmTKI demonstrated remarkable activity against enzymes from Anthonomus grandis (90%), Plodia interpuncptella (60%), and Ceratitis capitata (70%). Furthermore, in vivo bioinsecticidal assays of C. capitata larvae were also performed and the concentration of PmTKI (w/w) in an artificial diet required to LD50 and ED50 larvae were 0.37 and 0.3% respectively. In summary, data reported here shown the biotechnological potential of PmTKI for insect pest control. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
Design and testing for a nontagged F1-V fusion protein as vaccine antigen against bubonic and pneumonic plague.

PubMed

Powell, Bradford S; Andrews, Gerard P; Enama, Jeffrey T; Jendrek, Scott; Bolt, Chris; Worsham, Patricia; Pullen, Jeffrey K; Ribot, Wilson; Hines, Harry; Smith, Leonard; Heath, David G; Adamovicz, Jeffrey J

2005-01-01

A two-component recombinant fusion protein antigen was re-engineered and tested as a medical counter measure against the possible biological threat of aerosolized Yersinia pestis. The active component of the proposed subunit vaccine combines the F1 capsular protein and V virulence antigen of Y. pestis and improves upon the design of an earlier histidine-tagged fusion protein. In the current study, different production strains were screened for suitable expression and a purification process was optimized to isolate an F1-V fusion protein absent extraneous coding sequences. Soluble F1-V protein was isolated to 99% purity by sequential liquid chromatography including capture and refolding of urea-denatured protein via anion exchange, followed by hydrophobic interaction, concentration, and then transfer into buffered saline for direct use after frozen storage. Protein identity and primary structure were verified by mass spectrometry and Edman sequencing, confirming a purified product of 477 amino acids and removal of the N-terminal methionine. Purity, quality, and higher-order structure were compared between lots using RP-HPLC, intrinsic fluorescence, CD spectroscopy, and multi-angle light scattering spectroscopy, all of which indicated a consistent and properly folded product. As formulated with aluminum hydroxide adjuvant and administered in a single subcutaneous dose, this new F1-V protein also protected mice from wild-type and non-encapsulated Y. pestis challenge strains, modeling prophylaxis against pneumonic and bubonic plague. These findings confirm that the fusion protein architecture provides superior protection over the former licensed product, establish a foundation from which to create a robust production process, and set forth assays for the development of F1-V as the active pharmaceutical ingredient of the next plague vaccine.
Turkish scorpion Buthacus macrocentrus: general characterization of the venom and description of Bu1, a potent mammalian Na⁺-channel α-toxin.

PubMed

Caliskan, F; Quintero-Hernández, V; Restano-Cassulini, R; Batista, C V F; Zamudio, F Z; Coronas, F I; Possani, L D

2012-03-01

The venom of the scorpion Buthacus macrocentrus of Turkey was fractionated by high performance liquid chromatography (HPLC) and its mass finger print analysis was obtained by spectrometry. More than 70 different fractions were obtained, allowing the determination of the molecular masses of at least 60 peptides ranging between 648 and 44,336 Da. The venom is enriched with peptides containing molecular masses between 3200-4500 Da, and 6000-7500 Da. They very likely correspond to K⁺-channel and Na⁺-channel specific peptides, respectively, as expected from venoms of scorpions of the family Buthidae, already determined for other species. The major component obtained from HPLC was shown to be lethal to mice and was further purified and characterized. It contains 65 amino acid residues maintained closely packed by 4 disulfide bridges, and shows a molecular weight of 7263 Da. Additionally, a cDNA from the venomous glands of this scorpion was used in conjunction with sequence data from Edman degradation and mass spectrometry for cloning the gene that codes for Bu1 as we named this toxin. This gene codes for a 67 amino acid residues peptide, where the two last are eliminated post-translationally for production of an amidated C-terminal arginine. Its sequence is closely related to toxins from the species Leiurus quinquestriatus, as revealed by a phylogenetic tree analysis. Electrophysiological results conducted with Bu1 using patch-clamp techniques indicate that it modifies the Na⁺ currents, in a similar way as other well known α-scorpion toxins. These results support the conclusion that this species of scorpions is dangerous to humans, having an epidemiological interest for the country. Copyright © 2012 Elsevier Ltd. All rights reserved.
Purification and Characterization of a Novel Anti-Campylobacter Bacteriocin Produced by Lactobacillus curvatus DN317.

PubMed

Zommiti, Mohamed; Almohammed, Hamdan; Ferchichi, Mounir

2016-12-01

The lactic acid bacteria (LAB) microbiota of Saudi chicken ceca was determined. From 60 samples, 204 isolates of lactic acid bacteria were obtained. Three isolates produced antimicrobial activities against Campylobacter jejuni, Listeria monocytogenes, and Bacillus subtilis. The isolate DN317, which had the highest activity against Campylobacter jejuni ATCC 33560, was identified as Lactobacillus curvatus (GenBank accession numbers: KX353849 and KX353850). Full inhibitory activity was observed after a 2-h incubation with the supernatant at pH values between 4 and 8. Only 16% of the activity was conserved after a treatment at 121 °C for 15 min. The use of proteinase K, pepsin, chymotrypsin, trypsin, papain, and lysozyme drastically reduced the antimicrobial activity. However, lipase, catalase, and lysozyme had no effect on this activity. The active peptide produced by Lactobacillus curvatus DN317 was purified by precipitation with an 80% saturated ammonium sulfate solution, and two steps of reversed phase HPLC on a C18 column. The molecular weight of this peptide was 4448 Da as determined by MALDI-ToF. N-terminal sequence analysis using Edman degradation revealed 47 amino acid residues (UniProt Knowledgebase accession number C0HK82) revealing homology with the amino acid sequences of sakacin P and curvaticin L442. The antimicrobial activity of the bacteriocin, namely curvaticin DN317, was found to be bacteriostatic against Campylobacter jejuni ATCC 33560. The use of microbial antagonism by LAB is one of the best ways to control microorganisms safely in foods. This result constitutes a reasonable advance in the antimicrobial field because of its potential applications in food technology.
Tri-domain Bifunctional Inhibitor of Metallocarboxypeptidases A and Serine Proteases Isolated from Marine Annelid Sabellastarte magnifica*

PubMed Central

Alonso-del-Rivero, Maday; Trejo, Sebastian A.; Reytor, Mey L.; Rodriguez-de-la-Vega, Monica; Delfin, Julieta; Diaz, Joaquin; González-González, Yamile; Canals, Francesc; Chavez, Maria Angeles; Aviles, Francesc X.

2012-01-01

This study describes a novel bifunctional metallocarboxypeptidase and serine protease inhibitor (SmCI) isolated from the tentacle crown of the annelid Sabellastarte magnifica. SmCI is a 165-residue glycoprotein with a molecular mass of 19.69 kDa (mass spectrometry) and 18 cysteine residues forming nine disulfide bonds. Its cDNA was cloned and sequenced by RT-PCR and nested PCR using degenerated oligonucleotides. Employing this information along with data derived from automatic Edman degradation of peptide fragments, the SmCI sequence was fully characterized, indicating the presence of three bovine pancreatic trypsin inhibitor/Kunitz domains and its high homology with other Kunitz serine protease inhibitors. Enzyme kinetics and structural analyses revealed SmCI to be an inhibitor of human and bovine pancreatic metallocarboxypeptidases of the A-type (but not B-type), with nanomolar Ki values. SmCI is also capable of inhibiting bovine pancreatic trypsin, chymotrypsin, and porcine pancreatic elastase in varying measures. When the inhibitor and its nonglycosylated form (SmCI N23A mutant) were overproduced recombinantly in a Pichia pastoris system, they displayed the dual inhibitory properties of the natural form. Similarly, two bi-domain forms of the inhibitor (recombinant rSmCI D1-D2 and rSmCI D2-D3) as well as its C-terminal domain (rSmCI-D3) were also overproduced. Of these fragments, only the rSmCI D1-D2 bi-domain retained inhibition of metallocarboxypeptidase A but only partially, indicating that the whole tri-domain structure is required for such capability in full. SmCI is the first proteinaceous inhibitor of metallocarboxypeptidases able to act as well on another mechanistic class of proteases (serine-type) and is the first of this kind identified in nature. PMID:22411994
Actiflagelin, a new sperm activator isolated from Walterinnesia aegyptia venom using phenotypic screening.

PubMed

Abd El-Aziz, Tarek Mohamed; Al Khoury, Sawsan; Jaquillard, Lucie; Triquigneaux, Mathilde; Martinez, Guillaume; Bourgoin-Voillard, Sandrine; Sève, Michel; Arnoult, Christophe; Beroud, Rémy; De Waard, Michel

2018-01-01

Sperm contains a wealth of cell surface receptors and ion channels that are required for most of its basic functions such as motility and acrosome reaction. Conversely, animal venoms are enriched in bioactive compounds that primarily target those ion channels and cell surface receptors. We hypothesized, therefore, that animal venoms should be rich enough in sperm-modulating compounds for a drug discovery program. Our objective was to demonstrate this fact by using a sperm-based phenotypic screening to identify positive modulators from the venom of Walterinnesia aegyptia . Herein, as proof of concept that venoms contain interesting compounds for sperm physiology, we fractionated Walterinnesia aegyptia snake venom by RP-HPLC and screened for bioactive fractions capable of accelerating mouse sperm motility (primary screening). Next, we purified each compound from the positive fraction by cation exchange and identified the bioactive peptide by secondary screening. The peptide sequence was established by Edman sequencing of the reduced/alkylated compound combined to LC-ESI-QTOF MS/MS analyses of reduced/alkylated fragment peptides following trypsin or V8 protease digestion. Using this two-step purification protocol combined to cell phenotypic screening, we identified a new toxin of 7329.38 Da (actiflagelin) that activates sperm motility in vitro from OF1 male mice. Actiflagelin is 63 amino acids in length and contains five disulfide bridges along the proposed pattern of disulfide connectivity C 1 -C 5 , C 2 -C 3 , C 4 -C 6 , C 7 -C 8 and C 9 -C 10 . Modeling of its structure suggests that it belongs to the family of three finger toxins with a noticeable homology with bucandin, a peptide from Bungarus candidus venom. This report demonstrates the feasibility of identifying profertility compounds that may be of therapeutic potential for infertility cases where motility is an issue.
Blood-feeding Behaviors of Anopheles stephensi But Not Phlebotomus papatasi are Influenced by Actively Warming Guinea Pigs (Cavia porcellus) Under General Anesthesia

DTIC Science & Technology

2015-01-01

response in mosquitoes. Proc Natl Acad Sci 108:8026–8029. Davis EE, Sokolove PG. 1975. Temperature responses of antennal receptors of the mosquito, Aedes ...books/NBK54050/ Peterson D, Brown A. 1951. Studies of the responses of the female Aedes mosquito. Part III. The response of Aedes aegypti (L.) to a warm...Hyg 33:1232–1238. Walker ED, Edman JD. 1985. Feeding-site selection and blood-feeding behavior of Aedes triseriatus (Diptera: Culicidae) on rodent
Combining Search Engines for Comparative Proteomics

PubMed Central

Tabb, David

2012-01-01

Many proteomics laboratories have found spectral counting to be an ideal way to recognize biomarkers that differentiate cohorts of samples. This approach assumes that proteins that differ in quantity between samples will generate different numbers of identifiable tandem mass spectra. Increasingly, researchers are employing multiple search engines to maximize the identifications generated from data collections. This talk evaluates four strategies to combine information from multiple search engines in comparative proteomics. The “Count Sum” model pools the spectra across search engines. The “Vote Counting” model combines the judgments from each search engine by protein. Two other models employ parametric and non-parametric analyses of protein-specific p-values from different search engines. We evaluated the four strategies in two different data sets. The ABRF iPRG 2009 study generated five LC-MS/MS analyses of “red” E. coli and five analyses of “yellow” E. coli. NCI CPTAC Study 6 generated five concentrations of Sigma UPS1 spiked into a yeast background. All data were identified with X!Tandem, Sequest, MyriMatch, and TagRecon. For both sample types, “Vote Counting” appeared to manage the diverse identification sets most effectively, yielding heightened discrimination as more search engines were added.
A photoreceptor calcium binding protein is recognized by autoantibodies obtained from patients with cancer-associated retinopathy

PubMed Central

1991-01-01

Cancer-associated retinopathy (CAR), a paraneoplastic syndrome, is characterized by the degeneration of retinal photoreceptors under conditions where the tumor and its metastases have not invaded the eye. The retinopathy often is apparent before the diagnosis of cancer and may be associated with autoantibodies that react with specific sites in the retina. We have examined the sera from patients with CAR to further characterize the retinal antigen. Western blot analysis of human retinal proteins reveals a prominent band at 26 kD that is labeled by the CAR antisera. Antibodies to the 26-kD protein were affinity- purified from complex CAR antisera and used for EM-immunocytochemical localization of the protein to the nuclei, inner and outer segments of both rod and cone cells. Other antibodies obtained from the CAR sera did not label photoreceptors. Using the affinity-purified antibodies for detection, the 26-kD protein, designated p26, was purified to homogeneity from the outer segments of bovine rod photoreceptor cells by Phenyl-Sepharose and ion exchange chromatography. Partial amino acid sequence of p26 was determined by gas phase Edman degradation and revealed extensive homology with a cone-specific protein, visinin. Based upon structural relatedness, both the p26 rod protein and visinin are members of the calmodulin family and contain calcium binding domains of the E-F hand structure. PMID:1999465
Isolation and structure elucidation of neuropeptides of the AKH/RPCH family in long-horned grasshoppers (Ensifera).

PubMed

Gäde, G

1992-11-01

An identical neuropeptide was isolated by reversed-phase high-performance liquid chromatography from the corpora cardiaca of the king cricket, Libanasidus vittatus, and the two armoured ground crickets, Heterodes namaqua and Acanthoproctus cervinus. The crude gland extracts had adipokinetic activity in migratory locusts, hypertrehalosaemic activity in American cockroaches and a slight hypertrehalosaemic, but no adipokinetic, effect in armoured ground crickets. The primary structure of this neuropeptide was determined by pulsed-liquid phase sequencing employing Edman chemistry after enzymically deblocking the N-terminal 5-oxopyrrolidine-2-carboxylic acid residue. The C-terminus was also blocked, as indicated by the lack of digestion by carboxypeptidase A. The peptide was assigned the structure [symbol: see text]Glu-Leu-Asn-Phe-Ser-Thr-Gly-TrpNH2, previously designated Scg-AKH-II. The corpora cardiaca of the cricket Gryllodes sigillatus contained a neuropeptide which differed in retention time from the one isolated from the king and armoured ground crickets. The structure was assigned as [symbol: see text]Glu-Val-Asn-Phe-Ser-Thr-Gly-TrpNH2, previously designated Grb-AKH. This octapeptide caused hyperlipaemia in its donor species. The presence of the same peptide, Scg-AKH-II, in the two primitive infraorders of Ensifera, and the different peptide, Grb-AKH, in the most advanced infraorder of Ensifera, supports the evolutionary trends assigned formerly from morphological and physiological evidence.
Isolation and characterization of a leech neuropeptide in rat brains: coupling to nitric oxide release in leech, rat and human tissues.

PubMed

Salzet, M; Salzet, B; Sáutière, P; Lésage, J; Beauvillain, J C; Bilfinger, T V; Rialas, C; Bjenning, C; Stefano, G B

1998-03-30

The osmoregulator peptide (leech osmoregulatory factor, LORF; IPEPYVWD) was first found in the leech central nervous system (CNS). Given the fact that certain peptides can be found in mammals and invertebrates, e.g., opioid, we examined rat brains to determine if LORF was present. This peptide was found and isolated by successive reversed-phase HPLC purification steps and characterized by electrospray mass spectrometry measurement. It was sequenced by Edman degradation and quantified in different tissues by ELISA. Our results demonstrate the presence of LORF in the hypothalamus, thalamus, and striatum (6 pmol/mg of protein extract) and in other brain areas at lower levels. This octapeptide is also present in the rat duodenum and liver (10 to 14 pmol/mg) and at lower levels in heart, lung, pancreas and caudal spinal cord (< 5 pmol/mg). The testes, adrenals and kidneys have the lowest levels of all the tissues examined (ca. 0.5 pmol/mg of protein). Furthermore, we also demonstrate that LORF is coupled to nitric oxide (NO) release in leech CNS, rat hypothalamus and human saphenous vein in a manner which is inhibited by a nitric oxide synthase inhibitor as well as an antibody directed toward LORF. The study demonstrates that LORF, and its function in relation to NO release, has been conserved over more than 400 million years of evolution.
A broad-spectrum antimicrobial activity of Bacillus subtilis RLID 12.1.

PubMed

Ramachandran, Ramya; Chalasani, Ajay Ghosh; Lal, Ram; Roy, Utpal

2014-01-01

In the present study, an attempt was made to biochemically characterize the antimicrobial substance from the soil isolate designated as RLID 12.1 and explore its potential applications in biocontrol of drug-resistant pathogens. The antimicrobial potential of the wild-type isolate belonging to the genus Bacillus was determined by the cut-well agar assay. The production of antimicrobial compound was recorded maximum at late exponential growth phase. The ultrafiltered concentrate was insensitive to organic solvents, metal salts, surfactants, and proteolytic and nonproteolytic enzymes. The concentrate was highly heat stable and active over a wide range of pH values. Partial purification, zymogram analysis, and TLC were performed to determine the preliminary biochemical nature. The molecular weight of the antimicrobial peptide was determined to be less than 2.5 kDa in 15% SDS-PAGE and in zymogram analysis against Streptococcus pyogenes. The N-terminal amino acid sequence by Edman degradation was partially determined to be T-P-P-Q-S-X-L-X-X-G, which shows very insignificant identity to other antimicrobial peptides from bacteria. The minimum inhibitory concentrations of dialysed and partially purified ion exchange fractions were determined against some selected gram-positive and gram-negative bacteria and some pathogenic yeasts. The presence of three important antimicrobial peptide biosynthesis genes ituc, fend, and bmyb was determined by PCR.
New Kunitz-Type HCRG Polypeptides from the Sea Anemone Heteractis crispa

PubMed Central

Gladkikh, Irina; Monastyrnaya, Margarita; Zelepuga, Elena; Sintsova, Oksana; Tabakmakher, Valentin; Gnedenko, Oksana; Ivanov, Alexis; Hua, Kuo-Feng; Kozlovskaya, Emma

2015-01-01

Sea anemones are a rich source of Kunitz-type polypeptides that possess not only protease inhibitor activity, but also Kv channels toxicity, analgesic, antihistamine, and anti-inflammatory activities. Two Kunitz-type inhibitors belonging to a new Heteractis crispa RG (HCRG) polypeptide subfamily have been isolated from the sea anemone Heteractis crispa. The amino acid sequences of HCRG1 and HCRG2 identified using the Edman degradation method share up to 95% of their identity with the representatives of the HCGS polypeptide multigene subfamily derived from H. crispa cDNA. Polypeptides are characterized by positively charged Arg at the N-terminus as well as P1 Lys residue at their canonical binding loop, identical to those of bovine pancreatic trypsin inhibitor (BPTI). These polypeptides are shown by our current evidence to be more potent inhibitors of trypsin than the known representatives of the HCGS subfamily with P1Thr. The kinetic and thermodynamic characteristics of the intermolecular interactions between inhibitors and serine proteases were determined by the surface plasmon resonance (SPR) method. Residues functionally important for polypeptide binding to trypsin were revealed using molecular modeling methods. Furthermore, HCRG1 and HCRG2 possess anti-inflammatory activity, reducing tumor necrosis factor-α (TNF-α) and interleukin 6 (IL-6) secretions, as well as proIL-1β expression in lipopolysaccharide (LPS)-activated macrophages. However, there was no effect on nitric oxide (NO) generation. PMID:26404319
Ligatoxin B, a new cytotoxic protein with a novel helix-turn-helix DNA-binding domain from the mistletoe Phoradendron liga.

PubMed Central

Li, Shi-Sheng; Gullbo, Joachim; Lindholm, Petra; Larsson, Rolf; Thunberg, Eva; Samuelsson, Gunnar; Bohlin, Lars; Claeson, Per

2002-01-01

A new basic protein, designated ligatoxin B, containing 46 amino acid residues has been isolated from the mistletoe Phoradendron liga (Gill.) Eichl. (Viscaceae). The protein's primary structure, determined unambiguously using a combination of automated Edman degradation, trypsin enzymic digestion, and tandem MS analysis, was 1-KSCCPSTTAR-NIYNTCRLTG-ASRSVCASLS-GCKIISGSTC-DSGWNH-46. Ligatoxin B exhibited in vitro cytotoxic activities on the human lymphoma cell line U-937-GTB and the primary multidrug-resistant renal adenocarcinoma cell line ACHN, with IC50 values of 1.8 microM and 3.2 microM respectively. Sequence alignment with other thionins identified a new member of the class 3 thionins, ligatoxin B, which is similar to the earlier described ligatoxin A. As predicted by the method of homology modelling, ligatoxin B shares a three-dimensional structure with the viscotoxins and purothionins and so may have the same mode of cytotoxic action. The novel similarities observed by structural comparison of the helix-turn-helix (HTH) motifs of the thionins, including ligatoxin B, and the HTH DNA-binding proteins, led us to propose the working hypothesis that thionins represent a new group of DNA-binding proteins. This working hypothesis could be useful in further dissecting the molecular mechanisms of thionin cytotoxicity and of thionin opposition to multidrug resistance, and useful in clarifying the physiological function of thionins in plants. PMID:12049612
Characterization of a Cadmium-Binding Complex of Cabbage Leaves 1

PubMed Central

Wagner, George J.

1984-01-01

The chemical nature of a principal, inducible cadmium-binding complex which accumulates in cabbage leaves (Wagner and Trotter 1982 Plant Physiol 69: 804-809) was studied and compared with that of animal metallothionein and copper-binding proteins isolated from various organisms. The apparent molecular weight of native cabbage complex and carboxymethylated ligand of the complex under native conditions as determined by gel filtration was about 10,000 daltons. Under denaturing conditions their apparent molecular weights were about 2000 daltons. Ligand of native complex contained 37, 28, and 9 residue per cent of glutamic acid-glutamine, cysteine, and glycine, respectively, and low aromatic residue, serine and lysine content. The high acidic and low hydrophobic residue content explain the behavior of complex on electrophoresis in the presence and absence of sodium dodecyl sulfate. Its isoelectric point was below 4.0 and it bound 4 to 6 moles cadmium per mole ligand in what appear to be cadmium-mercaptide chromophores. The complex was found to be heat stable, relatively protease insensitive, and lacking in disulfide bonds. Attempts to determine the primary sequence of reduced native complex and carboxymethylated, cleaved ligand using the Edman degradation procedure were unsuccessful. An electrophoretic procedure is described for preparative isolation of purified complex and a method is described for monitoring ligand of complex as its fluorescent dibromobimane adduct. Images Fig. 1 Fig. 3 PMID:16663927

Isolation and Purification of Water Soluble Proteins from Ginger Root (Zingiber officinale) by Two Dimensional Liquid Chromatography

PubMed Central

Sandovall, A.O.; Andrews, K.; Wahab, A.; Choudhary, M.I.; Ahmed, A.

2014-01-01

The RI-INBRE Centralized Core Facility was established in 2003 and participates annually in Undergraduate Summer Research Program. It provides students hands on research experience in key technologies in biomedical sciences. We present here the isolation and purification of water soluble proteins from ginger, a rhizome of the plant, Zingiber officinale. It is an important ingredient of species used in traditional South Asian cuisines. In Indian, Pakistani and Chinese folk medicine, ginger is used for gastro-intestinal disorders, nausea, vomiting, inflammatory diseases, muscle and joint pain. Limited studies have been reported on the bioactive proteins from ginger extract. The water soluble proteins were extracted from ginger root and successfully purified to homogeneity by using two-dimensional liquid chromatography (FPLC/RP-HPLC) approach. The ginger root was washed with distilled water; skin removed and then emulsified using an electric blender. Sample was stirred for four days at 4°C with and without protease inhibitor. Purification of a 42kDa protein was achieved by employing gel filtration, ion-exchange and reversed phase HPLC. The homogeneity of the protein was confirmed by SDS-PAGE gel electrophoresis and MALDI-TOF mass spectrometry. Future work will be conducted on the protein characterization using mass spectrometry and Edman protein sequencing. Supported by grant 5P20GM103430 from the National Institute of General Medical Sciences, NIH, USA.
Substrate specificity of mitochondrial intermediate peptidase analysed by a support-bound peptide library

PubMed Central

Marcondes, M.F.M.; Alves, F.M.; Assis, D.M.; Hirata, I.Y.; Juliano, L.; Oliveira, V.; Juliano, M.A.

2015-01-01

The substrate specificity of recombinant human mitochondrial intermediate peptidase (hMIP) using a synthetic support-bound FRET peptide library is presented. The collected fluorescent beads, which contained the hydrolysed peptides generated by hMIP, were sequenced by Edman degradation. The results showed that this peptidase presents a remarkable preference for polar uncharged residues at P1 and P1′ substrate positions: Ser = Gln > Thr at P1 and Ser > Thr at P1′. Non-polar residues were frequent at the substrate P3, P2, P2′ and P3′ positions. Analysis of the predicted MIP processing sites in imported mitochondrial matrix proteins shows these cleavages indeed occur between polar uncharged residues. Previous analysis of these processing sites indicated the importance of positions far from the MIP cleavage site, namely the presence of a hydrophobic residue (Phe or Leu) at P8 and a polar uncharged residue (Ser or Thr) at P5. To evaluate this, additional kinetic analyses were carried out, using fluorogenic substrates synthesized based on the processing sites attributed to MIP. The results described here underscore the importance of the P1 and P1′ substrate positions for the hydrolytic activity of hMIP. The information presented in this work will help in the design of new substrate-based inhibitors for this peptidase. PMID:26082885
Identification of E-cadherin signature motifs functioning as cleavage sites for Helicobacter pylori HtrA

NASA Astrophysics Data System (ADS)

Schmidt, Thomas P.; Perna, Anna M.; Fugmann, Tim; Böhm, Manja; Jan Hiss; Haller, Sarah; Götz, Camilla; Tegtmeyer, Nicole; Hoy, Benjamin; Rau, Tilman T.; Neri, Dario; Backert, Steffen; Schneider, Gisbert; Wessler, Silja

2016-03-01

The cell adhesion protein and tumour suppressor E-cadherin exhibits important functions in the prevention of gastric cancer. As a class-I carcinogen, Helicobacter pylori (H. pylori) has developed a unique strategy to interfere with E-cadherin functions. In previous studies, we have demonstrated that H. pylori secretes the protease high temperature requirement A (HtrA) which cleaves off the E-cadherin ectodomain (NTF) on epithelial cells. This opens cell-to-cell junctions, allowing bacterial transmigration across the polarised epithelium. Here, we investigated the molecular mechanism of the HtrA-E-cadherin interaction and identified E-cadherin cleavage sites for HtrA. Mass-spectrometry-based proteomics and Edman degradation revealed three signature motifs containing the [VITA]-[VITA]-x-x-D-[DN] sequence pattern, which were preferentially cleaved by HtrA. Based on these sites, we developed a substrate-derived peptide inhibitor that selectively bound and inhibited HtrA, thereby blocking transmigration of H. pylori. The discovery of HtrA-targeted signature sites might further explain why we detected a stable 90 kDa NTF fragment during H. pylori infection, but also additional E-cadherin fragments ranging from 105 kDa to 48 kDa in in vitro cleavage experiments. In conclusion, HtrA targets E-cadherin signature sites that are accessible in in vitro reactions, but might be partially masked on epithelial cells through functional homophilic E-cadherin interactions.
Angiotensin I-converting-enzyme-inhibitory and antibacterial peptides from Lactobacillus helveticus PR4 proteinase-hydrolyzed caseins of milk from six species.

PubMed

Minervini, F; Algaron, F; Rizzello, C G; Fox, P F; Monnet, V; Gobbetti, M

2003-09-01

Sodium caseinates prepared from bovine, sheep, goat, pig, buffalo or human milk were hydrolyzed by a partially purified proteinase of Lactobacillus helveticus PR4. Peptides in each hydrolysate were fractionated by reversed-phase fast-protein liquid chromatography. The fractions which showed the highest angiotensin I-converting-enzyme (ACE)-inhibitory or antibacterial activity were sequenced by mass spectrum and Edman degradation analyses. Various ACE-inhibitory peptides were found in the hydrolysates: the bovine alpha(S1)-casein (alpha(S1)-CN) 24-47 fragment (f24-47), f169-193, and beta-CN f58-76; ovine alpha(S1)-CN f1-6 and alpha(S2)-CN f182-185 and f186-188; caprine beta-CN f58-65 and alpha(S2)-CN f182-187; buffalo beta-CN f58-66; and a mixture of three tripeptides originating from human beta-CN. A mixture of peptides with a C-terminal sequence, Pro-Gly-Pro, was found in the most active fraction of the pig sodium caseinate hydrolysate. The highest ACE-inhibitory activity of some peptides corresponded to the concentration of the ACE inhibitor (S)-N-(1-[ethoxycarbonyl]-3-phenylpropyl)-ala-pro maleate (enalapril) of 49.253 micro g/ml (100 micro mol/liter). Several of the above sequences had features in common with other ACE-inhibitory peptides reported in the literature. The 50% inhibitory concentration (IC(50)) of some of the crude peptide fractions was very low (16 to 100 micro g/ml). Some identified peptides were chemically synthesized, and the ACE-inhibitory activity and IC(50)s were confirmed. An antibacterial peptide corresponding to beta-CN f184-210 was identified in human sodium caseinate hydrolysate. It showed a very large spectrum of inhibition against gram-positive and -negative bacteria, including species of potential clinical interest, such as Enterococcus faecium, Bacillus megaterium, Escherichia coli, Listeria innocua, Salmonella spp., Yersinia enterocolitica, and Staphylococcus aureus. The MIC for E. coli F19 was ca. 50 micro g/ml. Once generated, the bioactive peptides were resistant to further degradation by proteinase of L. helveticus PR4 or by trypsin and chymotrypsin.
Characterization of the first K⁺ channel blockers from the venom of the Moroccan scorpion Buthus occitanus Paris.

PubMed

Martin-Eauclaire, Marie-France; Céard, Brigitte; Belghazi, Maya; Lebrun, Régine; Bougis, Pierre E

2013-12-01

The availability of a large variety of specific blockers, which inhibit different K(+) currents, would help to elucidate their differences in physiological function. Short peptide toxins isolated from scorpion venoms are able to block voltage-dependent or Ca(2+)-activated K(+) channels. Here, we have studied the venom of the Moroccan scorpion Buthus occitanus Paris (BoP) in order to find new peptides, which could enlarge our structure-function relationship knowledge on the Kv1.3 blocker Kaliotoxin (KTX) that belongs to the α-KTx3.1 family. Indeed and since more a decade, KTX is widely used by international investigators because it exhibits a quite sharp specificity and a high-affinity for the Kv1.3 channel, which is not only a neuronal channel but also a therapeutic target for diverse autoimmune diseases such as multiple sclerosis, type 1 diabetes, and rheumatoid arthritis. The BoP venom was first investigated using HPLC and MALDI-TOF/MS. Further, the HPLC fractions were screened by ELISA with antibodies raised against KTX. These antibodies recognized at least three components toxic in mice by intracerebroventricular injection. They were further pharmacologically characterized by competition using (125)I-KTX bound to its specific binding sites on rat brain synaptosomes. A single component (4161 Da) inhibited totally the (125)I-KTX binding and with high-affinity (IC50 = 0.1 nM), while the two other components poorly competed with (IC50 > 100 nM). These toxins were sequenced in full by Edman's degradation. The high-affinity ligand (BoPKTX) shares 86% sequence identity with KTX and was classified as toxin α-KTx3.17. The two others peptides (BoP1 and BoP2, 4093 Da and 4121 Da, respectively) only differ by a Lys/Arg mutation. Their amino acid sequences were related to Martentoxin, which has been characterized from the Chinese scorpion Buthus martenzi Karch and described as both a BKCa and Kv1.3 blocker. Accordingly, they belong to the α-KTx16 family. Copyright © 2013 Elsevier Ltd. All rights reserved.
6-phospho-alpha-D-glucosidase from Fusobacterium mortiferum: cloning, expression, and assignment to family 4 of the glycosylhydrolases.

PubMed Central

Bouma, C L; Reizer, J; Reizer, A; Robrish, S A; Thompson, J

1997-01-01

The Fusobacterium mortiferum malH gene, encoding 6-phospho-alpha-glucosidase (maltose 6-phosphate hydrolase; EC 3.2.1.122), has been isolated, characterized, and expressed in Escherichia coli. The relative molecular weight of the polypeptide encoded by malH (441 residues; Mr of 49,718) was in agreement with the estimated value (approximately 49,000) obtained by sodium dodecyl sulfate-polyacrylamide gel electrophoresis for the enzyme purified from F. mortiferum. The N-terminal sequence of the MalH protein obtained by Edman degradation corresponded to the first 32 amino acids deduced from the malH sequence. The enzyme produced by the strain carrying the cloned malH gene cleaved [U-14C]maltose 6-phosphate to glucose 6-phosphate (Glc6P) and glucose. The substrate analogs p-nitrophenyl-alpha-D-glucopyranoside 6-phosphate (pNP alphaGlc6P) and 4-methylumbelliferyl-alpha-D-glucopyranoside 6-phosphate (4MU alphaGlc6P) were hydrolyzed to yield Glc6P and the yellow p-nitrophenolate and fluorescent 4-methylumbelliferyl aglycons, respectively. The 6-phospho-alpha-glucosidase expressed in E. coli (like the enzyme purified from F. mortiferum) required Fe2+, Mn2+, Co2+, or Ni2+ for activity and was inhibited in air. Synthesis of maltose 6-phosphate hydrolase from the cloned malH gene in E. coli was modulated by addition of various sugars to the growth medium. Computer-based analyses of MalH and its homologs revealed that the phospho-alpha-glucosidase from F. mortiferum belongs to the seven-member family 4 of the glycosylhydrolase superfamily. The cloned 2.2-kb Sau3AI DNA fragment from F. mortiferum contained a second partial open reading frame of 83 residues (designated malB) that was located immediately upstream of malH. The high degree of sequence identity of MalB with IIB(Glc)-like proteins of the phosphoenol pyruvate dependent:sugar phosphotransferase system suggests participation of MalB in translocation of maltose and related alpha-glucosides in F. mortiferum. PMID:9209025
LC-MS/MS screening strategy for unknown adducts to N-terminal valine in hemoglobin applied to smokers and nonsmokers.

PubMed

Carlsson, Henrik; von Stedingk, Hans; Nilsson, Ulrika; Törnqvist, Margareta

2014-12-15

Electrophilically reactive compounds have the ability to form adducts with nucleophilic sites in DNA and proteins, constituting a risk for toxic effects. Mass spectrometric detection of adducts to N-terminal valine in hemoglobin (Hb) after detachment by modified Edman degradation procedures is one approach for in vivo monitoring of exposure to electrophilic compounds/metabolites. So far, applications have been limited to one or a few selected reactive species, such as acrylamide and its metabolite glycidamide. This article presents a novel screening strategy for unknown Hb adducts to be used as a basis for an adductomic approach. The method is based on a modified Edman procedure, FIRE, specifically developed for LC-MS/MS analysis of N-terminal valine adducts in Hb detached as fluorescein thiohydantoin (FTH) derivatives. The aim is to detect and identify a priori unknown Hb adducts in human blood samples. Screening of valine adducts was performed by stepwise scanning of precursor ions in small mass increments, monitoring four fragments common for the FTH derivative of valine with different N-substitutions in the multiple-reaction mode, covering a mass range of 135 Da (m/z 503-638). Samples from six smokers and six nonsmokers were analyzed. Control experiments were performed to compare these results with known adducts and to check for artifactual formation of adducts. In all samples of smokers and nonsmokers, seven adducts were identified, of which six have previously been studied. Nineteen unknown adducts were observed, and 14 of those exhibited fragmentation patterns similar to earlier studied FTH derivatives of adducts to valine. Identification of the unknown adducts will be the focus of future work. The presented methodology is a promising screening tool using Hb adducts to indicate exposure to potentially toxic electrophilic compounds and metabolites.
Atypical Genetic Locus Associated with Constitutive Production of Enterocin B by Enterococcus faecium BFE 900

PubMed Central

Franz, Charles M. A. P.; Worobo, Randy W.; Quadri, Luis E. N.; Schillinger, Ulrich; Holzapfel, Wilhelm H.; Vederas, John C.; Stiles, Michael E.

1999-01-01

A purified bacteriocin produced by Enterococcus faecium BFE 900 isolated from black olives was shown by Edman degradation and mass spectrometric analyses to be identical to enterocin B produced by E. faecium T136 from meat (P. Casaus, T. Nilsen, L. M. Cintas, I. F. Nes, P. E. Hernández, and H. Holo, Microbiology 143:2287–2294, 1997). The structural gene was located on a 2.2-kb HindIII fragment and a 12.0-kb EcoRI chromosomal fragment. The genetic characteristics and production of EntB by E. faecium BFE 900 differed from that described so far by the presence of a conserved sequence like a regulatory box upstream of the EntB gene, and its production was constitutive and not regulated. The 2.2-kb chromosomal fragment contained the hitherto undetected immunity gene for EntB in an atypical orientation that is the reverse of that of the structural gene. Typical transport and other genes associated with bacteriocin production were not detected on the 12.0-kb chromosomal fragment containing the EntB structural gene. This makes the EntB genetic system different from most other bacteriocin systems, where transport and possible regulatory genes are clustered. EntB was subcloned and expressed by the dedicated secretion machinery of Carnobacterium piscicola LV17A. The structural gene was amplified by PCR, fused to the divergicin A signal peptide, and expressed by the general secretory pathway in Enterococcus faecalis ATCC 19433. PMID:10224016
Structural and mechanistic studies on. beta. -hydroxydecanoly thioester dehydrase and its inhibition by the suicide substrate, 3-decynoic acid, n-acetylcysteamine thioester

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, W.B.

..beta..-Hydroxydecanoyl thioester dehydrase catalyzes the interconversion of thioesters of (R)-3-hydroxydecanoic acid, (E)-2-decenoic acid, and (Z)-3-decenoic acid. Dehydrase is irreversibly inactivated by the N-acetylcysteamine thioester of 3-decynoic acid (3-decynoyl-NAC). This is the classic example of suicide enzyme inactivation. The structure of the dehydrase-inactivator adduct is still unclear. The purpose of this thesis is to determine the structure of the inactivator moiety and the stoichiometry of the inactivation of this dimeric enzyme, as well as to conduct structural studies on dehydrase itself. 3-(2-/sup 13/C)Decynoyl-NAC was synthesized and incubated with homogeneous dehydrase. The spectra showed that dehydrase adds to the inactivator so asmore » to quickly produce an (E)-3-(N/sup im/-histidinyl)-3-decenoyl thioester adduct at the active site. This species is slowly converted to the 2-decenoyl thioester congener. Titration of dehydrase with 3-(2/sup 13/C)decynoyl-NAC under these conditions clearly indicated that 2 moles of inactivator are bound to each mole of dehydrase dimer. These experiments provide a self-consistent picture of dehydrase inactivation by 3-decynoyl-NAC and normal dehydrase-catalyzed reactions. Dehydrase was cleaved by chemical fragmentation, and the resulting mixture of peptides were separated by reversed-phase HPLC. Partial N-terminal sequences of purified peptides were obtained by automated Edman technology« less
Molecular Cloning and Pharmacological Properties of an Acidic PLA2 from Bothrops pauloensis Snake Venom

PubMed Central

Ferreira, Francis Barbosa; Gomes, Mário Sérgio Rocha; Naves de Souza, Dayane Lorena; Gimenes, Sarah Natalie Cirilo; Castanheira, Letícia Eulalio; Borges, Márcia Helena; Rodrigues, Renata Santos; Yoneyama, Kelly Aparecida Geraldo; Homsi Brandeburgo, Maria Inês; Rodrigues, Veridiana M.

2013-01-01

In this work, we describe the molecular cloning and pharmacological properties of an acidic phospholipase A2 (PLA2) isolated from Bothrops pauloensis snake venom. This enzyme, denominated BpPLA2-TXI, was purified by four chromatographic steps and represents 2.4% of the total snake venom protein content. BpPLA2-TXI is a monomeric protein with a molecular mass of 13.6 kDa, as demonstrated by Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF) analysis and its theoretical isoelectric point was 4.98. BpPLA2-TXI was catalytically active and showed some pharmacological effects such as inhibition of platelet aggregation induced by collagen or ADP and also induced edema and myotoxicity. BpPLA2-TXI displayed low cytotoxicity on TG-180 (CCRF S 180 II) and Ovarian Carcinoma (OVCAR-3), whereas no cytotoxicity was found in regard to MEF (Mouse Embryonic Fibroblast) and Sarcoma 180 (TIB-66). The N-terminal sequence of forty-eight amino acid residues was determined by Edman degradation. In addition, the complete primary structure of 122 amino acids was deduced by cDNA from the total RNA of the venom gland using specific primers, and it was significantly similar to other acidic D49 PLA2s. The phylogenetic analyses showed that BpPLA2-TXI forms a group with other acidic D49 PLA2s from the gender Bothrops, which are characterized by a catalytic activity associated with anti-platelet effects. PMID:24304676
Purification, properties, and N-terminal amino acid sequence of homogeneous Escherichia coli 2-amino-3-ketobutyrate CoA ligase, a pyridoxal phosphate-dependent enzyme.

PubMed

Mukherjee, J J; Dekker, E E

1987-10-25

Starting with 100 g (wet weight) of a mutant of Escherichia coli K-12 forced to grow on L-threonine as sole carbon source, we developed a 6-step procedure that provides 30-40 mg of homogeneous 2-amino-3-ketobutyrate CoA ligase (also called aminoacetone synthetase or synthase). This ligase, which catalyzes the cleavage/condensation reaction between 2-amino-3-ketobutyrate (the presumed product of the L-threonine dehydrogenase-catalyzed reaction) and glycine + acetyl-CoA, has an apparent molecular weight approximately equal to 85,000 and consists of two identical (or nearly identical) subunits with Mr = 42,000. Computer analysis of amino acid composition data, which gives the best fit nearest integer ratio for each residue, indicates a total of 387 amino acids/subunit with a calculated Mr = 42,093. Stepwise Edman degradation provided the N-terminal sequence of the first 21 amino acids. It is a pyridoxal phosphate-dependent enzyme since (a) several carbonyl reagents caused greater than 90% loss of activity, (b) dialysis against buffer containing hydroxylamine resulted in 89% loss of activity coincident with an 86% decrease in absorptivity at 428 nm, (c) incubation of the apoenzyme with 20 microM pyridoxal phosphate showed a parallel recovery (greater than 90%) of activity and 428-nm absorptivity, and (d) reduction of the holoenzyme with NaBH4 resulted in complete inactivation, disappearance of a new absorption maximum at 333 nm. Strict specificity for glycine is shown but acetyl-CoA (100%), n-propionyl-CoA (127%), or n-butyryl-CoA (16%) is utilized in the condensation reaction. Apparent Km values for acetyl-CoA, n-propionyl-CoA, and glycine are 59 microM, 80 microM, and 12 mM, respectively; the pH optimum = 7.5. Added divalent metal ions or sulfhydryl compounds inhibited catalysis of the condensation reaction.
Biochemical, physicochemical and molecular characterization of a genuine 2-Cys-peroxiredoxin purified from cowpea [Vigna unguiculata (L.) Walpers] leaves.

PubMed

Silva, Fredy D A; Vasconcelos, Ilka M; Lobo, Marina D P; de Castro, Patrícia G; Magalhães, Vladimir G; de Freitas, Cléverson D T; Carlini, Célia R R S; Pinto, Paulo M; Beltramini, Leila M; Filho, José H A; Barros, Eduardo B; Alencar, Luciana M R; Grangeiro, Thalles B; Oliveira, José T A

2012-07-01

Peroxiredoxins have diverse functions in cellular defense-signaling pathways. 2-Cys-peroxiredoxins (2-Cys-Prx) reduce H2O2 and alkyl-hydroperoxide. This study describes the purification and characterization of a genuine 2-Cys-Prx from Vigna unguiculata (Vu-2-Cys-Prx). Vu-2-Cys-Prx was purified from leaves by ammonium sulfate fractionation, chitin affinity and ion exchange chromatography. Vu-2-Cys-Prx reduces H2O2 using NADPH and DTT. Vu-2-Cys-Prx is a 44 kDa (SDS-PAGE)/46 kDa (exclusion chromatography) protein that appears as a 22 kDa molecule under reducing conditions, indicating that it is a homodimer linked intermolecularly by disulfide bonds and has a pI range of 4.56–4.72; its NH2-terminal sequence was similar to 2-Cys-Prx from Phaseolus vulgaris (96%) and Populus tricocarpa (96%). Analysis by ESI-Q-TOF MS/MS showed a molecular mass/pI of 28.622 kDa/5.18. Vu-2-Cys-Prx has 8% α-helix, 39% β-sheet, 22% of turns and 31% of unordered forms. Vu-2-Cys-Prx was heat stable, has optimal activity at pH 7.0, and prevented plasmid DNA degradation. Atomic force microscopy shows that Vu-2-Cys-Prx oligomerized in decamers which might be associated with its molecular chaperone activity that prevented denaturation of insulin and citrate synthase. Its cDNA analysis showed that the redox-active Cys52 residue and the amino acids Pro45, Thr49 and Arg128 are conserved as in other 2-Cys-Prx. The biochemical and molecular features of Vu-2-Cys-Prx are similar to other members of 2-Cys-Prx family. To date, only one publication reported on the purification of native 2-Cys-Prx from leaves and the subsequent analysis by N-terminal Edman sequencing, which is crucial for construction of stromal recombinant 2-Cys-Prx proteins.
Sequence of the radioactive tryptic peptide obtained after inactivating the F1-ATPase of the thermophilic bacterium PS3 with 5'-p-fluorosulfonylbenzoyl(3H)adenosine at 65 degrees C

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bullough, D.A.; Yoshida, M.; Allison, W.S.

1986-02-01

Following a lag of about 30 min, the F1-ATPase from the thermophilic bacterium, PS3 (TF1), was inactivated slowly by 0.8 mM 5'-p-fluorosulfonylbenzoyladenosine (FSBA) at 23 degrees C and pH 7.0. When the enzyme was treated with 0.2 mM FSBA at pH 7.0 and 23 degrees C for 15 min and gel-filtered, no enzyme activity was lost. However, the lag in inactivation was abolished when the enzyme was subsequently incubated with 2.0 mM FSBA at 23 degrees C in the pH range from 6.8 to 10.0. The pH-inactivation profile obtained under these conditions revealed a pK alpha of about 9.3 whichmore » was associated with the inactivation. When pretreated TF1 was inactivated at 23 degrees C with (3H)FSBA by about 90%, greater than 20 mol of (3H)SBA was incorporated per mole of enzyme. TF1 was inactivated rapidly by 0.8 mM FSBA at pH 6.4 and 65 degrees C, and no lag was observed. Following inactivation of TF1 with 0.8 mM (3H)FSBA at 65 degrees C and pH 6.4, about 10 mol of (3H)SBA was incorporated per mole of enzyme. When a tryptic digest of the labeled enzyme was fractionated by reversed-phase high-performance liquid chromatography, a single major radioactive peptide was isolated. When subjected to automatic Edman degradation, this peptide was shown to have the amino acid sequence: A-L-A-P-E-I-V-G-E-E-H-X-Q-V-A-R, where X indicates that a phenylthiohydantoin derivative was not detected in cycle 12. However, from the DNA sequence of the gene encoding the subunit of TF1 (Y. Kagawa, M. Ishizuka, T. Saishu, and S. Nakao (1985)), this position has been shown to be occupied by tyrosine. This tyrosine is homologous with beta-Tyr-368 of the bovine mitochondrial F1-ATPase (MF1) the modification of which is responsible for the inactivation MF1 by FSBA.« less
Angiotensin I-Converting-Enzyme-Inhibitory and Antibacterial Peptides from Lactobacillus helveticus PR4 Proteinase-Hydrolyzed Caseins of Milk from Six Species

PubMed Central

Minervini, F.; Algaron, F.; Rizzello, C. G.; Fox, P. F.; Monnet, V.; Gobbetti, M.

2003-01-01

Sodium caseinates prepared from bovine, sheep, goat, pig, buffalo or human milk were hydrolyzed by a partially purified proteinase of Lactobacillus helveticus PR4. Peptides in each hydrolysate were fractionated by reversed-phase fast-protein liquid chromatography. The fractions which showed the highest angiotensin I-converting-enzyme (ACE)-inhibitory or antibacterial activity were sequenced by mass spectrum and Edman degradation analyses. Various ACE-inhibitory peptides were found in the hydrolysates: the bovine αS1-casein (αS1-CN) 24-47 fragment (f24-47), f169-193, and β-CN f58-76; ovine αS1-CN f1-6 and αS2-CN f182-185 and f186-188; caprine β-CN f58-65 and αS2-CN f182-187; buffalo β-CN f58-66; and a mixture of three tripeptides originating from human β-CN. A mixture of peptides with a C-terminal sequence, Pro-Gly-Pro, was found in the most active fraction of the pig sodium caseinate hydrolysate. The highest ACE-inhibitory activity of some peptides corresponded to the concentration of the ACE inhibitor (S)-N-(1-[ethoxycarbonyl]-3-phenylpropyl)-ala-pro maleate (enalapril) of 49.253 μg/ml (100 μmol/liter). Several of the above sequences had features in common with other ACE-inhibitory peptides reported in the literature. The 50% inhibitory concentration (IC50) of some of the crude peptide fractions was very low (16 to 100 μg/ml). Some identified peptides were chemically synthesized, and the ACE-inhibitory activity and IC50s were confirmed. An antibacterial peptide corresponding to β-CN f184-210 was identified in human sodium caseinate hydrolysate. It showed a very large spectrum of inhibition against gram-positive and -negative bacteria, including species of potential clinical interest, such as Enterococcus faecium, Bacillus megaterium, Escherichia coli, Listeria innocua, Salmonella spp., Yersinia enterocolitica, and Staphylococcus aureus. The MIC for E. coli F19 was ca. 50 μg/ml. Once generated, the bioactive peptides were resistant to further degradation by proteinase of L. helveticus PR4 or by trypsin and chymotrypsin. PMID:12957917
Purification and partial characterization of an early pregnancy factor-induced suppressor factor (EPF-S1).

PubMed

Rolfe, B A; Athanasas-Platsis, S; Hoskin, M J; Morton, H; Cavanagh, A C

1995-06-01

The immunomodulatory properties of early pregnancy factor (EPF) are mediated through induction of at least two lymphokines, designated EPF-S1 and EPF-S2 (previously estimated M(r) 15,000 and 55,000 respectively). The activity of the former is MHC-restricted while the latter is restricted to a locus (or loci) outside the MHC. The present study established further criteria by which EPF-S1 and EPF-S2 might be distinguished from each other and compared with other suppressor factors. In addition, techniques have been developed to purify EPF-S1 to homogeneity. Congenic mouse strains were used to map the genetic restriction of EPF-S2 in the rosette inhibition test and high performance gel permeation chromatography was used to demonstrate that EPF-S1 induces EPF-S2 but not vice versa. Further studies then focused on isolation of this first component of the cascade, EPF-S1, from immune ascites (from growth in athymic mice of the anti-EPF-S1 producing rat-mouse hybridoma R2T gamma, in which EPF-S1 is complexed to antibody). Techniques used were acidification followed by application to Sep-pak C18 cartridges, high performance cation-exchange chromatography and two reversed-phased HPLC steps on a C3 column. Purified material was analyzed by SDS-PAGE and Edman degradation. Approximately 10 micrograms EPF-S1 were isolated fom 60 ml ascitic fluid. Homogeneity of the purified material was demonstrated by SDS-PAGE, where it ran as a single band of approximate M(r) 12,000 coincident with biological activity. Attempts at Edman degradation indicate that the molecule is N-blocked. Definitive primary characterization of EPF-S1 must await the preparation and isolation of proteolytic fragments of the molecule, but the present studies establish conditions which make such structural analysis possible.
Maltaricin CPN, a new class IIa bacteriocin produced by Carnobacterium maltaromaticum CPN isolated from mould-ripened cheese.

PubMed

Hammi, I; Delalande, F; Belkhou, R; Marchioni, E; Cianferani, S; Ennahar, S

2016-11-01

The purpose of this study was to isolate, characterize and determine the structure and the antibacterial activities of a bacteriocin produced by Carnobacterium maltaromaticum CPN, a strain isolated from unpasteurized milk Camembert cheese. This bacteriocin, termed maltaricin CPN, was produced at higher amounts in MRS broth at temperatures between 15°C and 25°C. It was purified to homogeneity from culture supernatant by using a simple method consisting of cation-exchange and reversed-phase chromatographies. Mass spectrometry showed that maltaricin was a 4427·29 Da bacteriocin. Its amino acid sequence was determined by Edman degradation which showed that it had close similarity with bacteriocins of the class IIa. Maltaricin CPN consisted in fact of 44 unmodified amino acids including two cysteine residues at positions 9 and 14 linked by a disulphide bond. The antimicrobial activity of maltaricin CPN covered a range of bacteria, with strong activity against many species of Gram-positive bacteria, especially the food-borne pathogen Listeria monocytogenes, but no activity against Gram-negative ones. In the studied conditions, C. maltaromaticum CPN produced a new class IIa bacteriocin with strong anti-Listeria activity. The study covers the purification and the structural characterization of a new bacteriocin produced by strain C. maltaromaticum CPN isolated from Camembert cheese. Its activity against strains of L. monocytogenes and higher production rates at relatively low temperatures show potential technological applications to improve the safety of refrigerated food. © 2016 The Society for Applied Microbiology.
Some biochemical and histochemical properties of human liver serine dehydratase.

PubMed

Kashii, Tatsuhiko; Gomi, Tomoharu; Oya, Takeshi; Ishii, Yoko; Oda, Hirofumi; Maruyama, Muneharu; Kobayashi, Masashi; Masuda, Tohru; Yamazaki, Mitsuaki; Nagata, Takuya; Tsukada, Kazuhiro; Nakajima, Akinori; Tatsu, Kazuhito; Mori, Hisashi; Takusagawa, Fusao; Ogawa, Hirofumi; Pitot, Henry C

2005-03-01

In rat, serine dehydratase (SDH) is abundant in the liver and known to be a gluconeogenic enzyme, while there is little information about the biochemical property of human liver serine dehydratase because of its low content and difficulty in obtaining fresh materials. To circumvent these problems, we purified recombinant enzyme from Escherichia coli, and compared some properties between human and rat liver serine dehydratases. Edman degradation showed that the N-terminal sequence of about 75% of human serine dehydratase starts from MetSTART-Met2-Ser3- and the rest from Ser3-, whereas the N-terminus of rat enzyme begins from the second codon of MetSTART-Ala2-. The heterogeneity of the purified preparation was totally confirmed by mass spectrometry. Accordingly, this observation in part fails to follow the general rule that the first Met is not removed when the side chain of the penultimate amino acid is bulky such as Met, Arg, Lys, etc. There existed the obvious differences in the local structures between the two enzymes as revealed by limited-proteolysis experiments using trypsin and Staphylococcus aureus V8 protease. The most prominent difference was found histochemically: expression of rat liver serine dehydratase is confined to the periportal region in which many enzymes involved in gluconeogenesis and urea cycle are known to coexist, whereas human liver serine dehydratase resides predominantly in the perivenous region. These findings provide an additional support to the previous notion suggested by physiological experiments that contribution of serine dehydratase to gluconeogenesis is negligible or little in human liver.
The human gastrin precursor. Characterization of phosphorylated forms and fragments.

PubMed Central

Varro, A; Desmond, H; Pauwels, S; Gregory, H; Young, J; Dockray, G J

1988-01-01

There is a potential phosphorylation site in the C-terminal region of the precursor for the acid-stimulating hormone gastrin, which is immediately adjacent to an important cleavage point. In the present study we have sought to identify, separate, quantify and characterize phosphorylated and unphosphorylated forms of human progastrin and its fragments. Identification was made by two radioimmunoassays: (a) a novel assay employing an antibody raised to intact human progastrin; and (b) an assay using antibody reacting with the C-terminal tryptic fragment of human progastrin, as well as progastrin itself. Two forms of human progastrin isolated from a gastrinoma were separated by ion-exchange h.p.l.c., and had similar elution positions on reverse-phase h.p.l.c. and on gel filtration. The more acidic peptide contained close to equimolar amounts of phosphate. On trypsinization, peptides were released that co-eluted on ion-exchange h.p.l.c. with, and had the immunochemical properties of, naturally occurring C-terminal fragments of progastrin. One of the latter was isolated and shown by Edman degradation after derivatization with ethanethiol to have the sequence Ser (P)-Ala-Glu-Asp-Glu-Asn. Similar peptides occur in antral mucosa resected from ulcer patients. The unphosphorylated forms of progastrin predominated, whereas the phosphorylated forms of the C-terminal fragments were predominant. This distribution could be explained by preferential cleavage of phosphorylated progastrin. We conclude that in human progastrin, Ser-96 can occur in the phosphorylated form; this residue immediately follows a pair of basic residues (Arg-Arg) that are cleaved during synthesis of the biologically active product. PMID:3223964
Tripeptidyl peptidase II. An oligomeric protease complex from Arabidopsis.

PubMed

Book, Adam J; Yang, Peizhen; Scalf, Mark; Smith, Lloyd M; Vierstra, Richard D

2005-06-01

The breakdown of most nuclear and cytoplasmic proteins involves their partial cleavage by the 26S proteasome followed by further disassembly to free amino acids by the combined action of endo- and exopeptidases. In animals, one important intermediate exopeptidase is tripeptidyl peptidase (TPP)II, which digests peptide products of the 26S proteasome and other endopeptidases into tripeptides. Here, we describe the purification and characterization of TPPII from Arabidopsis (Arabidopsis thaliana). Like its animal counterparts, Arabidopsis TPPII exists as a soluble, approximately 5- to 9-MD complex. Two related species of 153 and 142 kD are present in the purified preparations that are derived from a single TPP2 gene. Sequencing by Edman degradation of the intact polypeptides and mass spectrometry of proteolytic fragments demonstrated that the 142-kD form mainly differs from the 153-kD form by a truncation at the C-terminal end. This serine protease is a member of the subtilisin superfamily and is sensitive to the inhibitors alanine-alanine-phenylalanine-chloromethylketone and butabindide, which are diagnostic for the TPPII subfamily. The Arabidopsis TPP2 gene is widely expressed in many tissue types with related genes evident in other plant genomes. Whereas the 26S proteasome is essential, TPPII appears not as important for plant physiology. An Arabidopsis T-DNA mutant defective in TPP2 expression displays no phenotypic abnormalities and is not hypersensitive to either amino acid analogs or the 26S proteasome inhibitor MG132. As a consequence, plants likely contain other intermediate exopeptidases that assist in amino acid recycling.
Tripeptidyl Peptidase II. An Oligomeric Protease Complex from Arabidopsis1

PubMed Central

Book, Adam J.; Yang, Peizhen; Scalf, Mark; Smith, Lloyd M.; Vierstra, Richard D.

2005-01-01

The breakdown of most nuclear and cytoplasmic proteins involves their partial cleavage by the 26S proteasome followed by further disassembly to free amino acids by the combined action of endo- and exopeptidases. In animals, one important intermediate exopeptidase is tripeptidyl peptidase (TPP)II, which digests peptide products of the 26S proteasome and other endopeptidases into tripeptides. Here, we describe the purification and characterization of TPPII from Arabidopsis (Arabidopsis thaliana). Like its animal counterparts, Arabidopsis TPPII exists as a soluble, approximately 5- to 9-MD complex. Two related species of 153 and 142 kD are present in the purified preparations that are derived from a single TPP2 gene. Sequencing by Edman degradation of the intact polypeptides and mass spectrometry of proteolytic fragments demonstrated that the 142-kD form mainly differs from the 153-kD form by a truncation at the C-terminal end. This serine protease is a member of the subtilisin superfamily and is sensitive to the inhibitors alanine-alanine-phenylalanine-chloromethylketone and butabindide, which are diagnostic for the TPPII subfamily. The Arabidopsis TPP2 gene is widely expressed in many tissue types with related genes evident in other plant genomes. Whereas the 26S proteasome is essential, TPPII appears not as important for plant physiology. An Arabidopsis T-DNA mutant defective in TPP2 expression displays no phenotypic abnormalities and is not hypersensitive to either amino acid analogs or the 26S proteasome inhibitor MG132. As a consequence, plants likely contain other intermediate exopeptidases that assist in amino acid recycling. PMID:15908606

Multiple forms of statherin in human salivary secretions.

PubMed

Jensen, J L; Lamkin, M S; Troxler, R F; Oppenheim, F G

1991-01-01

Sequential chromatography of hydroxyapatite-adsorbed salivary proteins from submandibular/sublingual secretions on Sephadex G-50 and reversed-phase HPLC resulted in the purification of statherin and several statherin variants. Amino acid analysis, Edman degradation and carboxypeptidase digestion of the obtained protein fractions led to the determination of the complete primary structures of statherin SV1, statherin SV2, and statherin SV3. SV1 is identical to statherin but lacks the carboxyl-terminal phenylalanine residue. SV2, lacking residues 6-15, is otherwise identical to statherin. SV3 is identical to SV2 but lacks the carboxyl-terminal phenylalanine. These results provide the first evidence for multiple forms of statherin which are probably derived both by post-translational modification and alternative splicing of the statherin gene.
Preparation and characterization of 5-(4-hydroxy-3-nitrobenzyl)-3-phenyl-2-thiohydantoin, the phenylthiohydantoin derivative of 3-nitrotyrosine.

PubMed

Lilova, A; Kleinschmidt, T; Nedkov, P; Braunitzer, G

1986-10-01

The phenylthiocarbamoyl derivative of 3-nitrotyrosine was synthesized according to the known Edman method and then converted to its phenylthiohydantoin derivative [5-(4-hydroxy-3-nitrobenzyl)-3-phenyl-2-thiohydantion] by incubation in 0.5M HCl for 24 h at room temperature. After drying over P2O5 the chromatographically pure substance could be obtained by double recrystallization from hot acetic acid. It could be established that a shorter incubation time leads to an incomplete conversion and higher temperatures cause polymerization of the product. The compounds could be characterized by thin-layer and high-performance liquid chromatography, melting point, elemental analysis as well as NMR- and absorption spectroscopy.
Venoms of Centruroides and Tityus species from Panama and their main toxic fractions.

PubMed

Salazar, Marcos H; Arenas, Iván; Corrales-García, Ligia L; Miranda, Roberto; Vélez, Sara; Sánchez, Jairo; Mendoza, Karla; Cleghorn, John; Zamudio, Fernando Z; Castillo, Adolfo; Possani, Lourival D; Corzo, Gerardo; Acosta, Hildaura

2018-01-01

The scorpionism in Panama is notorious for the confluence and coexistence of buthid scorpions from the genera Centruroides and Tityus. This communication describes an overview of the larger representative toxic venom fractions from eight dangerous buthid scorpion species of Panama: Centruroides (C. granosus, C. bicolor, C. limbatus and C. panamensis) and Tityus (T. (A.) asthenes, T. (A.) festae, T. (T.) cerroazul and T. (A.) pachyurus). Their venoms were separated by HPLC and the corresponding sub-fractions were tested for lethality effects on mice and insects. Many fractions toxic to either mice or insects, or both, were found and have had their molecular masses determined by mass spectrometry analysis. The great majority of the lethal components had a molecular mass close to 7000 Da, assumed to be peptides that recognize Na + -channels, responsible for the toxicity symptoms observed in other buthids scorpion venoms. A toxic peptide isolated from the venom of T. pachyurus was sequenced by Edman degradation, allowing the synthesis of nucleotide probe for cloning the correspondent gene. The mature toxin based on the cDNA sequencing has the C-terminal residue amidated, contains 62 amino acid packed by 4 disulfide linkages, with molecular mass of 7099.1 Da. This same toxic peptide seems to be present in scorpions of the species T. pachyurus collected in 5 different regions of Panama, although the overall HPLC profile is quite different. The most diverse neurotoxic venom components from the genus Centruroides were found in the species C. panamensis, whereas T. cerroazul was the one from the genus Tityus. The most common neurotoxins were observed in the venoms of T. festae, T. asthenes and T. pachyurus with closely related molecular masses of 7099.1 and 7332 Da. The information reported here is considered very important for future generation of a neutralizing antivenom against scorpions from Panama. Furthermore, it will contribute to the growing interest in using bioactive toxins from scorpions for drug discovery purposes. Copyright © 2017 Elsevier Ltd. All rights reserved.
A computational module assembled from different protease family motifs identifies PI PLC from Bacillus cereus as a putative prolyl peptidase with a serine protease scaffold.

PubMed

Rendón-Ramírez, Adela; Shukla, Manish; Oda, Masataka; Chakraborty, Sandeep; Minda, Renu; Dandekar, Abhaya M; Ásgeirsson, Bjarni; Goñi, Félix M; Rao, Basuthkar J

2013-01-01

Proteolytic enzymes have evolved several mechanisms to cleave peptide bonds. These distinct types have been systematically categorized in the MEROPS database. While a BLAST search on these proteases identifies homologous proteins, sequence alignment methods often fail to identify relationships arising from convergent evolution, exon shuffling, and modular reuse of catalytic units. We have previously established a computational method to detect functions in proteins based on the spatial and electrostatic properties of the catalytic residues (CLASP). CLASP identified a promiscuous serine protease scaffold in alkaline phosphatases (AP) and a scaffold recognizing a β-lactam (imipenem) in a cold-active Vibrio AP. Subsequently, we defined a methodology to quantify promiscuous activities in a wide range of proteins. Here, we assemble a module which encapsulates the multifarious motifs used by protease families listed in the MEROPS database. Since APs and proteases are an integral component of outer membrane vesicles (OMV), we sought to query other OMV proteins, like phospholipase C (PLC), using this search module. Our analysis indicated that phosphoinositide-specific PLC from Bacillus cereus is a serine protease. This was validated by protease assays, mass spectrometry and by inhibition of the native phospholipase activity of PI-PLC by the well-known serine protease inhibitor AEBSF (IC50 = 0.018 mM). Edman degradation analysis linked the specificity of the protease activity to a proline in the amino terminal, suggesting that the PI-PLC is a prolyl peptidase. Thus, we propose a computational method of extending protein families based on the spatial and electrostatic congruence of active site residues.
Conorfamide-Sr2, a gamma-carboxyglutamate-containing FMRFamide-related peptide from the venom of Conus spurius with activity in mice and mollusks

PubMed Central

Aguilar, Manuel B.; Luna-Ramírez, Karen S.; Echeverría, Daniel; Falcón, Andrés; Olivera, Baldomero M.; Heimer de la Cotera, Edgar P.; Maillo, María

2008-01-01

A novel peptide, conorfamide-Sr2 (CNF-Sr2), was purified from the venom extract of Conus spurius, collected in the Caribbean Sea off the Yucatan Peninsula. Its primary structure was determined by automated Edman degradation and amino acid analysis, and confirmed by electrospray ionization mass spectrometry. Conorfamide-Sr2 contains 12 amino acids and no Cys residues, and it is only the second FMRFamide-related peptide isolated from a venom. Its primary structure GPMγDPLγIIRI-nh2, (γ, gamma-carboxyglutamate;-nh2, amidated C-terminus; calculated monoisotopic mass, 1,468.72 Da; experimental monoisotopic mass, 1,468.70 Da) shows two features that are unusual among FMRFamide-related peptides (FaRPs, also known as RFamide peptides), namely the novel presence of gamma-carboxyglutamate, and a rather uncommon C-terminal residue, Ile. CNF-Sr2 exhibits paralytic activity in the limpet Patella opea and causes hyperactivity in the freshwater snail Pomacea paludosa and in the mouse. The sequence similarities of CNF-Sr2 with FaRPs from marine and freshwater mollusks and mice might explain its biological effects in these organisms. It also resembles FaRPs from polychaetes (the prey of C. spurius), which suggests a natural biological role. Based on these similarities, CNF-Sr2 might interact with receptors of these three distinct types of FaRPs, G-protein-coupled receptors, Na+ channels activated by FMRFamide (FaNaCs), and acid-sensing ion channels (ASICs). The biological activities of CNF-Sr2 in mollusks and mice make it a potential tool to study molecular targets in these and other organisms. PMID:18201803
Identification of amino acids in the N-terminal SH2 domain of phospholipase C gamma 1 important in the interaction with epidermal growth factor receptor.

PubMed

Gergel, J R; McNamara, D J; Dobrusin, E M; Zhu, G; Saltiel, A R; Miller, W T

1994-12-13

Photoaffinity labeling and site-directed mutagenesis have been used to identify amino acid residues of the phospholipase C gamma 1 (PLC gamma 1) N-terminal SH2 domain involved in recognition of the activated epidermal growth factor receptor (EGFR). The photoactive amino acid p-benzoylphenylalanine (Bpa) was incorporated into phosphotyrosine-containing peptides derived from EGFR autophosphorylation sites Tyr992 and Tyr1068. Irradiation of these labels in the presence of SH2 domains showed cross-linking which was time-dependent and specific; labeling was inhibited with non-Bpa-containing peptides from EGFR in molar excess. The phosphotyrosine residue on the peptides was important for SH2 recognition, as dephosphorylated peptides did not cross-link. Radiolabeled peptides were used to identify sites of cross-linking to the N-terminal SH2 of PLC gamma 1. Bpa peptide-SH2 complexes were digested with trypsin, and radioactive fragments were purified by HPLC and analyzed by Edman sequencing. These experiments showed Arg562 and an additional site in the alpha A-beta B region of the SH2 domain, most likely Glu587, to be labeled by the Tyr992-derived peptide. Similar analysis of the reaction with the Tyr1068-derived photoaffinity label identified Leu653 as the cross-linked site. Mutation of the neighboring residues of Glu587 decreased photo-cross-linking, emphasizing the importance of this region of the molecule for recognition. These results are consistent with evidence from the v-Src crystal structure and implicate the loop spanning residues Gln640-Ser654 of PLC gamma 1 in specific recognition of phosphopeptides.
Characterization of toxin III of the scorpion Leiurus quinquestriatus quinquestriatus: a new type of alpha-toxin highly toxic both to mammals and insects.

PubMed

Kopeyan, C; Mansuelle, P; Martin-Eauclaire, M F; Rochat, H; Miranda, F

1993-01-01

The primary structure of toxin III of Leiurus quinquestriatus quinquestriatus (Lqq III) was elucidated by automatic Edman degradation of the reduced and S-carboxymethylated protein and derived tryptic peptides. Like other scorpion toxins that are active on sodium channels, Lqq III, consisting of 64 amino acids, is a 7 kDa single-chain polypeptide crosslinked by four disulfide bridges. It belongs to the alpha-toxin group, as judged by competition experiments with 125I AaH II for binding to rat brain synaptosomes (K0.5 = 7 x 10(-7) M). Lqq III is the first alpha-toxin to be characterized that is highly toxic to mice [LD50 = 50 micrograms (7.1 nmol)/kg body wt], by subcutaneous injection, insects Blatella germanica [LD50 = 60 ng (8.5 pmol)/g body wt.] and Musca domestica [LD50 = 120 ng (17 pmol)/g body wt]. When tested via the intracerebroventricular route, the toxicity for mice [55 micrograms (8 nmol)/kg] was of the same order as that found by subcutaneous injection, indicating that Lqq III has a higher affinity for peripheral sodium channels that for those of the central nervous system. There are three differences between the sequences of Lqq III and Lqh alpha IT, an alpha-toxin isolated from the venom of Leiurus quinquestriatus hebraeus. These substitutions are found at positions 20, 24, and 64 (Ser-->Ala,Asp-->Glu and His-->Arg, respectively). Surprisingly Lqh alpha IT is only weakly active in mice [LD50 = 5 mg (0.7 mumol)/kg], while in insects its toxicity is similar to that of Lqq III [140 ng (20 pmol)/g body wt blowfly larvae]. These observations are relevant to the definition of scorpion toxin structure-activity relationships.
New APETx-like peptides from sea anemone Heteractis crispa modulate ASIC1a channels.

PubMed

Kalina, Rimma; Gladkikh, Irina; Dmitrenok, Pavel; Chernikov, Oleg; Koshelev, Sergey; Kvetkina, Aleksandra; Kozlov, Sergey; Kozlovskaya, Emma; Monastyrnaya, Margarita

2018-06-01

Sea anemones are an abundant source of various biologically active peptides. The hydrophobic 20% ethanol fraction of tropical sea anemone Heteractis crispa was shown to contain at least 159 peptide compounds including neurotoxins, proteinase and α-amylase inhibitors, as well as modulators of acid-sensing ion channels (ASICs). The three new peptides, π-AnmTX Hcr 1b-2, -3, and -4 (41 aa) (short names Hcr 1b-2, -3, -4), identified by a combination of reversed-phase liquid chromatography and mass spectrometry were found to belong to the class 1b sea anemone neurotoxins. The amino acid sequences of these peptides were determined by Edman degradation and tandem mass spectrometry. The percent of identity of Hcr 1b-2, -3, and -4 with well-known ASIC3 inhibitors Hcr 1b-1 from H. crispa and APETx2 from Anthopleura elegantissima is 95-78% and 46-49%, respectively. Electrophysiological experiments on homomeric ASIC channels expressed in Xenopus laevis oocytes establish that these peptides are the first inhibitors of ASIC1a derived from sea anemone venom. The major peptide, Hcr 1b-2, inhibited both rASIC1a (IC 50 4.8 ± 0.3 μM; nH 0.92 ± 0.05) and rASIC3 (IC 50 15.9 ± 1.1 μM; nH 1.0 ± 0.05). The maximum inhibition at saturating peptide concentrations reached 64% and 81%, respectively. In the model of acid-induced muscle pain Hcr 1b-2 was also shown to exhibit an antihyperalgesic effect, significantly reducing of the pain threshold of experimental animals. Copyright © 2018 Elsevier Inc. All rights reserved.
Purification, characterization and specificity of chondroitin lyases and glycuronidase from Flavobacterium heparinum.

PubMed Central

Gu, K; Linhardt, R J; Laliberté, M; Gu, K; Zimmermann, J

1995-01-01

The chondroitin lyases from Flavobacterium heparinum (Cytophaga heparinia) have been widely used in depolymerization of glycosaminoglycan and proteoglycan chondroitin sulphates. Oligosaccharide products derived from chondroitin sulphate can be further degraded by glycuronidases and sulphatases obtained from the same organism. There has been no reported purification of these enzymes to homogeneity nor is there any information on their physical and kinetic characteristics. The absence of pure enzymes has resulted in a lack of understanding of the optimal conditions for their catalytic activity and their substrate specificity. This has limited the use of these enzymes as reagents for preparation of oligosaccharides for structure and activity studies. Reproducible schemes to purify a chondroitin AC lyase, a glycuronidase and chondroitin B lyase from Flavobacterium heparinum to apparent homogeneity are described. Chondroitin AC lyase (chondroitinase AC, EC 4.2.2.5), glycuronidase [chondro-(1-->3)-glycuronidase, no EC number] and chondroitin B lyase (chondroitinase B, no EC number) have M(r) values (assessed by SDS/PAGE) of 74,000, 41,800 and 55,200 respectively, and isoelectric points (determined by isoelectric focusing) of 8.85, 9.28 and 9.05 respectively. Chondroitin lyase AC and B contain pyroglutamic acid at their N-termini precluding their analysis by Edman degradation. Deblocking with pyroglutamate aminopeptidase facilitated the determination of their N-terminal sequences. The kinetic properties of these enzymes have been determined as well as the optimum conditions for their catalytic activity. The specificity of the glycouronidase, determined using 17 different disaccharide substrates, shows that it only acts on unsulphated or 6-O-sulphated 1-->3 linkages. The chondroitin lyases are both endolytic enzymes, and oligosaccharide mapping shows their expected specificity towards the chondroitin and dermatan sulphate polymers. Images Figure 2 Figure 4 PMID:8526872
Formation of pyroglutamic acid from N-terminal glutamic acid in immunoglobulin gamma antibodies.

PubMed

Chelius, Dirk; Jing, Kay; Lueras, Alexis; Rehder, Douglas S; Dillon, Thomas M; Vizel, Alona; Rajan, Rahul S; Li, Tiansheng; Treuheit, Michael J; Bondarenko, Pavel V

2006-04-01

The status of the N-terminus of proteins is important for amino acid sequencing by Edman degradation, protein identification by shotgun and top-down techniques, and to uncover biological functions, which may be associated with modifications. In this study, we investigated the pyroglutamic acid formation from N-terminal glutamic acid residues in recombinant monoclonal antibodies. Almost half the antibodies reported in the literature contain a glutamic acid residue at the N-terminus of the light or the heavy chain. Our reversed-phase high-performance liquid chromatography-mass spectrometry method could separate the pyroglutamic acid-containing light chains from the native light chains of reduced and alkylated recombinant monoclonal antibodies. Tryptic peptide mapping and tandem mass spectrometry of the reduced and alkylated proteins was used for the identification of the pyroglutamic acid. We identified the formation of pyroglutamic acid from N-terminal glutamic acid in the heavy chains and light chains of several antibodies, indicating that this nonenzymatic reaction does occur very commonly and can be detected after a few weeks of incubation at 37 and 45 degrees C. The rate of this reaction was measured in several aqueous buffers with different pH values, showing minimal formation of pyroglutamic acid at pH 6.2 and increased formation of pyroglutamic acid at pH 4 and pH 8. The half-life of the N-terminal glutamic acid was approximately 9 months in a pH 4.1 buffer at 45 degrees C. To our knowledge, we showed for the first time that glutamic acid residues located at the N-terminus of proteins undergo pyroglutamic acid formation in vitro.
Venom from the snake Bothrops asper Garman. Purification and characterization of three phospholipases A2

PubMed Central

Anagón, Alejandro C.; Molinar, Ricardo R.; Possani, Lourival D.; Fletcher, Paul L.; Cronan, John E.; Julia, Jordi Z.

1980-01-01

The water-soluble venom of Bothrops asper Garman (San Juan Evangelista, Veracruz, México) showed 15 polypeptide bands on polyacrylamide-gel electrophoresis. This material exhibited phospholipase, hyaluronidase, N-benzoyl-l-arginine ethyl hydrolase, N-benzoyl-l-tyrosine ethyl hydrolase and phosphodiesterase activity, but no alkaline phosphatase or acid phosphatase activity. Fractionation on Sephadex G-75 afforded seven protein fractions, which were apparently less toxic than the whole venom (LD50=4.3μg/g mouse wt.). Subsequent separation of the phospholipase-positive fraction (II) on DEAE-cellulose with potassium phosphate buffers (pH7.55) gave several fractions, two being phospholipase-positive (II.6 and II.8). These fractions were further purified on DEAE-cellulose columns with potassium phosphate buffers (pH8.6). Fraction II.8.4 was rechromatographed in the same DEAE-cellulose column, giving a pure protein designated phospholipase 1. The fraction II.6.3 was further separated by gel disc electrophoresis yielding two more pure proteins designated phospholipase 2 and phospholipase 3. Analysis of phospholipids hydrolysed by these enzymes have shown that all three phospholipases belong to type A2. Amino acid analysis has shown that phospholipase A2 (type 1) has 97 residues with a calculated mol.wt. of 10978±11. Phospholipase A2 (type 2) has 96 residues with a mol.wt. of 10959±11. Phospholipase A2 (type 3) has 266 residues with 16 half-cystine residues and a calculated mol.wt of 29042±31. Automated Edman degradation showed the N-terminal sequence to be: Asx-Leu-Trp-Glx-Phe-Gly-Glx-Met-Met-Ser-Asx-Val- Met-Arg-Lys-Asx-Val-Val-Phe-Lys-Tyr-Leu- for phospholipase A2 (type 2). ImagesFig. 1. PMID:7387631
Isolation and biochemical characterization of a γ-type phospholipase A2 inhibitor from Crotalus durissus collilineatus snake serum.

PubMed

Gimenes, Sarah Natalie Cirilo; Ferreira, Francis Barbosa; Silveira, Ana Carolina Portella; Rodrigues, Renata Santos; Yoneyama, Kelly Aparecida Geraldo; Izabel Dos Santos, Juliana; Fontes, Marcos Roberto de Mattos; de Campos Brites, Vera Lúcia; Santos, André Luiz Quagliatto; Borges, Márcia Helena; Lopes, Daiana Silva; Rodrigues, Veridiana M

2014-04-01

In the present work, we describe the isolation and partial structural and biochemical characterization of the first phospholipase A2 inhibitor (γPLI) from Crotalus durissus collilineatus (Cdc) snake serum. Initially, the Cdc serum was subjected to a Q-Sepharose ion exchange column, producing six peaks at 280 nm absorbance (Q1-Q6). Subsequently, Q4 fraction was submitted to affinity chromatography with immobilized PLA2 BnSP-7, a step that resulted in two fractions (NHS-1 and NHS-2). The latter contained the inhibitor, denominated γCdcPLI. The molecular mass of γCdcPLI, determined by Matrix-Assisted Laser Desorption Ionization Time-of-Flight (MALDI-TOF), was 22,340 Da. Partial sequences obtained by Edman degradation and by mass spectrometry (MALDI-TOF/TOF), showed similarity, as expected, to other related inhibitors. Circular dichroism (CD) analysis showed the presence of approximately 22% alpha helices and 29% beta sheets in the protein secondary structure. Additionally, CD studies also indicated no significant changes in the secondary structure of γCdcPLI when it is complexed to BpPLA2-TXI. On the other hand, dynamic light scattering (DLS) assays showed a temperature-dependent oligomerization behavior for this inhibitor. Biochemical analyses showed γCdcPLI was able to inhibit the enzymatic, cytotoxic and myotoxic activities of PLA2s. Structural and functional studies performed on this inhibitor may elucidate the action mechanisms of PLA2 inhibitors. In addition, we hope this study may contribute to investigating the potential use of these inhibitors for the treatment of snakebite or inflammatory diseases in which PLA2s may be involved. Copyright © 2014 Elsevier Ltd. All rights reserved.
cDNA cloning and immunological characterization of a newly identified enolase allergen from Penicillium citrinum and Aspergillus fumigatus.

PubMed

Lai, Hsiu-Yu; Tam, Ming F; Tang, Ren-Bin; Chou, Hong; Chang, Ching-Yun; Tsai, Jaw-Ji; Shen, Horng-Der

2002-03-01

Penicillium citrinum and Aspergillus fumigatus are prevalent indoor airborne fungal species that have been implicated in human respiratory allergic disorders. It is important to understand the allergenic profile of these fungal species. The purpose of the present study is to characterize a newly identified enolase allergen from P. citrinum and A. fumigatus. Fungal proteins were separated by two-dimensional (2D) gel electrophoresis and blotted onto polyvinylidene difluoride membranes. Protein spots that reacted with IgE antibodies in serum samples from asthmatic patients were identified and the N-terminal amino acid sequences were determined by Edman degradation. The peptide sequences obtained were utilized in cloning the cDNA of the allergen genes by reverse transcriptase-polymerase chain reaction and the 5'- and 3'-rapid amplification cDNA end reactions. Our results from 2D immunoblotting identified a 47-kD IgE-reactive component in the extracts of P. citrinum and A. fumigatus. The N-terminal amino acid sequences of the 47-kD proteins are homologous to those of fungal enolases. The corresponding enolase cDNA from P. citrinum contains 1,552 bp and encodes a protein of 438 residues. In A. fumigatus, the isolated enolase cDNA has 1,649 bp and contains a 438-amino acid open reading frame. The deduced amino acid sequences of these two enolases have 94% identity. These enolases from P. citrinum and A. fumigatus were expressed in Escherichia coli as a His-tagged protein and designated as rPen c 22 and rAsp f 22, respectively. Sera from 7 (30%) of the 23 Penicillium-sensitized asthmatic patients showed IgE binding to the 47-kD P. citrinum component (Pen c 22) and rPen c 22. In addition, six of seven Pen c 22-positive serum samples have IgE immunoblot reactivity to the 47-kD A. fumigatus component (Asp f 22) and rAsp f 22. A polyclonal rabbit antiserum generated against the N-terminal peptide of Pen c 22 can react with Pen c 22, rPen c 22, Asp f 22 and rAsp f 22. In addition, the presence of IgE cross-reactivity between rPen c 22 and rAsp f 22 and between enolases from A. fumigatus and Alternaria alternata was also detected by immunoblot inhibition. These results demonstrated that a novel enolase allergen from P. citrinum (Pen c 22) and A. fumigatus (Asp f 22) was identified. In addition, IgE cross-reactivity between enolase allergens from A. fumigatus and P. citrinum and between enolases from A. fumigatus and A. alternata was also detected. Results obtained provide more information on fungal enolase allergens. Copyright 2002 S. Karger AG, Basel
Primary structure of a guanyl-specific ribonuclease from the fungus Penicillium brevicompactum

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kulikov, V.A.; Shlyapnikov, S.V.; Yakovlev, G.I.

1986-01-01

By the automatic Edman degradation of the intact S-carboxymethylated protein and a mixture of the products of its proteolytic cleavage at Arg, Lys, and Glu residues, together with results on the kinetics of the proteolysis of the protein under the action of carboxypeptidase Y, the primary structure of the extracellular guanyl-specific RNase of the fungus Penicillium brevicompactum has been determined. The RNase contains 102 amino acid residues: 7 Asp, 7 Asn, 9 Thr, 11 Ser, 4 Glu, 1 Gln, 4 Pro, 10 Gly, 11 Ala, 4 Cys, 7 Val, 4 Ile, 3 Leu, 9 Tyr, 5 Phe, 2 Lys, 3more » His, 1 Arg (M/sub r/ 10,801). It has been established that four hemicystine residues of the P. compactum RNase form, in pairs, two disulfide bonds« less
Biomimetic Synthesis of Macahydantoins A and B from Lepidium meyenii, and Structure Revision of Macahydantoin B as a Class of Thiohydantoin with a 4-Methyl-hexahydropyrrolo[1,2-c]imidazole Skeleton.

PubMed

Zhou, Min; Ma, Hang-Ying; Xing, Huan-Huan; Li, Ping; Li, Gan-Peng; Geng, Hui-Chun; Hu, Qiu-Fen; Yang, Guang-Yu

2017-09-15

Phytochemical investigation on Lepidium meyenii led to the discovery of macahydantoin C (3), a new thiohydantoin with a 1,3-diazabicyclo[3.3.1]nonane core, the spectral properties of which indicate a potential structural misassignment of its previously reported analogue, macahydantoin B (2a). To probe this hypothesis, a concise, scalable, and biomimetic synthesis of the originally proposed 2a and its revised structure (2b) was efficiently accomplished using the modified Edman degradation as the key step from commercially available materials in 65% (three steps) and 52% (three steps) overall yields, respectively. These synthetic endeavors undoubtedly reassigned the structure of macahydantoin B as an unreported type of thiohydantoin featuring a 4-methyl-hexahydropyrrolo[1,2-c]imidazole scaffold.
Desleucyl-Oritavancin with a Damaged d-Ala-d-Ala Binding Site Inhibits the Transpeptidation Step of Cell-Wall Biosynthesis in Whole Cells of Staphylococcus aureus.

PubMed

Kim, Sung Joon; Singh, Manmilan; Sharif, Shasad; Schaefer, Jacob

2017-03-14

We have used solid-state nuclear magnetic resonance to characterize the exact nature of the dual mode of action of oritavancin in preventing cell-wall assembly in Staphylococcus aureus. Measurements performed on whole cells labeled selectively in vivo have established that des-N-methylleucyl-N-4-(4-fluorophenyl)benzyl-chloroeremomycin, an Edman degradation product of [ 19 F]oritavancin, which has a damaged d-Ala-d-Ala binding aglycon, is a potent inhibitor of the transpeptidase activity of cell-wall biosynthesis. The desleucyl drug binds to partially cross-linked peptidoglycan by a cleft formed between the drug aglycon and its biphenyl hydrophobic side chain. This type of binding site is present in other oritavancin-like glycopeptides, which suggests that for these drugs a similar transpeptidase inhibition occurs.
Synthesis and screening of one-bead-one-compound cyclic peptide libraries.

PubMed

Qian, Ziqing; Upadhyaya, Punit; Pei, Dehua

2015-01-01

Cyclic peptides have been a rich source of biologically active molecules. Herein we present a method for the combinatorial synthesis and screening of large one-bead-one-compound (OBOC) libraries of cyclic peptides against biological targets such as proteins. Up to ten million different cyclic peptides are rapidly synthesized on TentaGel microbeads by the split-and-pool synthesis method and subjected to a multistage screening protocol which includes magnetic sorting, on-bead enzyme-linked and fluorescence-based assays, and in-solution binding analysis of cyclic peptides selectively released from single beads by fluorescence anisotropy. Finally, the most active hit(s) is identified by the partial Edman degradation-mass spectrometry (PED-MS) method. This method allows a single researcher to synthesize and screen up to ten million cyclic peptides and identify the most active ligand(s) in ~1 month, without the time-consuming and expensive hit resynthesis or the use of any special equipment.
Poliovirus RNA synthesis in vitro: structural elements and antibody inhibition

DOE Office of Scientific and Technical Information (OSTI.GOV)

Semler, B.L.; Hanecak, R.; Dorner, L.F.

1983-01-01

The poliovirus RNA polymerase complex has been analyzed by immunoautoradiography using antibody probes derived from purified replicase (P3) region viral polypeptides. Antibody preparations made against the polio RNA polymerase, P3-4b, detected a previously unreported cellular protein that copurifies with the RNA polymerase. An IgG fraction purified from rabbit antiserum to polypeptide P3-2, a precursor fo the RNA polymerase, specifically inhibits poliovirus RNA synthesis in vitro. The authors have also immunoprecipitated a 60,000-dalton protein (P3-4a) with antiserum to protein P3-4b and have determined the precise genomic map position of this protein by automated Edman degradation. Protein P3-4a originates by cleavage ofmore » the RNA polymerase precursor at a glutamine-glucine amino acid pair not previously reported to be a viral cleavage site.« less
Characterization of the major cyanogen bromide fragment of alpha-A crystallin

NASA Technical Reports Server (NTRS)

Ifeanyi, F.; Takemoto, L.; Spooner, B. S. (Principal Investigator)

1991-01-01

Alpha crystallin from the bovine lens has been digested with cyanogen bromide, and the major fragment (CB-1) has been purified using reverse phase HPLC. Characterization of this fragment by Edman degradation and antisera to synthetic peptides indicates that it originates from alpha-A crystallin, but lacks the N-terminal methionine and the last 35 amino acids from the C-terminus of the molecule. The purified CB-1 fragment binds as well as native alpha crystallin to lens membrane, but is unable to self-assemble into the correct size of high molecular weight oligomeric complexes characteristic of the intact alpha-A chain. Together, these results demonstrate that the alpha-A chain is comprised of at least two functional domains, one of which is involved in binding of alpha-A crystallin to lens membrane, and another which is necessary for correct self-assembly of the molecule into high molecular weight oligomers.
Structural characterization of a novel neuropeptide from the central nervous system of the leech Erpobdella octoculata. The leech osmoregulator factor.

PubMed

Salzet, M; Bulet, P; Weber, W M; Clauss, W; Verger-Bocquet, M; Malecha, J

1996-03-22

Purification of a material immunoreactive to an antiserum against the C-terminal part of the oxytocin (Pro-Leu-Gly-amide) and present in the central nervous system of the Pharyngobdellid leech Erpobdella octoculata was performed by reversed-phase high performance liquid chromatography combined with both enzyme-linked immunosorbent and dot immunobinding assays for oxytocin. The amino acid sequence of the purified peptide (Ile-Pro-Glu-Pro-Tyr-Val-Trp-Asp) was established by Edman degradation and confirmed by electrospray mass spectrometry measurement. When injected in leeches, purified or synthetic peptides exert an anti-diuretic effect, the most effective ranged between 10 pmol and 1 nmol. They provoked an uptake of water 1-2 h post-injection. Furthermore, electrophysiological experiments conducted in the leech Hirudo medicinalis revealed an inhibition of the potency of Na+ conductances of leech skin by this peptide. Immunocytochemical studies with an antiserum against synthetic oxytocin-like molecule provided the cytological basis for existence of a neuropeptide, since large amounts of immunoreactive neurons were detected in the central nervous systems of E. octoculata. The purified molecule is both different to peptides of the oxytocin/vasopressin family and is a novel neuropeptide in the animal kingdom. It was named the leech osmoregulator factor (LORF). An identification of the proteins immunoreactive to an antiserum against oxytocin performed at the level of both central nervous systems extracts and in vitro central nervous system-translated RNA products indicated that in the two cases, a single protein was detected. These proteins with a molecular masses of, respectively, approximately 34 kDa (homodimer of 17 kDa) for the central nervous systems extracts and approximately 19 kDa for in vitro central nervous system-translated RNA products were not recognized by the antiserum against MSEL- and VLDV-neurophysin (proteins associated to oxytocin and vasopressin), confirming that LORF did not belong to the oxytocin/vasopressin family.

Purification of a NAD(P) reductase-like protein from the thermogenic appendix of the Sauromatum guttatum inflorescence.

PubMed

Skubatz, Hanna; Howald, William N

2013-03-01

A NAD(P) reductase-like protein with a molecular mass of 34.146 ± 34 Da was purified to homogeneity from the appendix of the inflorescence of the Sauromatum guttatum. On-line liquid chromatography/electrospray ionization-mass spectrometry was used to isolate and quantify the protein. For the identification of the protein, liquid chromatography/electrospray ionization-tandem mass spectrometry analysis of tryptic digests of the protein was carried out. The acquired mass spectra were used for database searching, which led to the identification of a single tryptic peptide. The 12 amino acid tryptic peptide (FLPSEFGNDVDR) was found to be identical to amino acid residues at the positions 108-120 of isoflavone reductase in the Arabidopsis genome. A BLAST search identified this sequence region as unique and specific to a class of NAD(P)-dependent reductases involved in phenylpropanoid biosynthesis. Edman degradation revealed that the protein was N-terminally blocked. The amount of the protein (termed RL, NAD(P) reductase-like protein) increased 60-fold from D-4 (4 days before inflorescence-opening, designated as D-day) to D-Day, and declined the following day, when heat-production ceased. When salicylic acid, the endogenous trigger of heat-production in the Sauromatum appendix, was applied to premature appendices, a fivefold decrease in the amount of RL was detected in the treated section relative to the non-treated section. About 40 % of RL was found in the cytoplasm. Another 30 % was detected in Percoll-purified mitochondria and the rest, about 30 % was associated with a low speed centrifugation pellet due to nuclei and amyloplast localization. RL was also found in other thermogenic plants and detected in Arabidopsis leaves. The function of RL in thermogenic and non-thermogenic plants requires further investigation.
Novel haemoglobin-derived antimicrobial peptides from chicken (Gallus gallus) blood: purification, structural aspects and biological activity.

PubMed

Vasilchenko, A S; Rogozhin, E A; Vasilchenko, A V; Kartashova, O L; Sycheva, M V

2016-12-01

To purify and characterize antimicrobial peptides derived from the acid extract of Gallus gallus blood cells. Two polypeptides (i.e. CHb-1 and CHb-2) with antibacterial activity were detected in the acidic extract of blood cells from chicken (G. gallus). The isolated peptides that possessed a potent antibacterial activity were purified using a two-step chromatography procedure that involved solid-phase extraction of a total protein/peptide extract followed by thin fractionation by reversed-phase high performance liquid chromatography (RP-HPLC). The molecular masses of the purified peptides were similar and were 4824·4 and 4825·2 Da, which have been measured by matrix-assisted laser desorption/ionization mass spectrometry (MALDI TOF MS). Their amino acid sequences were determined by Edman degradation and showed that the peptides were fully identical to the two fragments of G. gallus α-haemoglobin localized into different subunits (A and D respectively). The peptides were active in micromolar concentrations against Gram-negative Escherichia coli K12 TG1. Using the 1-N-phenylnaphthylamine, the FITC-dextran labelled probes and the live/dead staining allowed to show the hemocidin mode of action and estimate the pore size. In this study, for the first time, α-haemoglobin from chicken (G. gallus) has been investigated as a donor of the two high homologous native peptide fragments that possess potent antibacterial activity in vitro. These are membrane-active peptides and their mechanism of action against E. coli involves a toroidal pore formation. The obtained results expand the perception of the role of haemoglobin in a living system, describing it as a source of multifunction substances. Additionally, the data presented in this paper may contribute to the development of new, cost-effective, antimicrobial agents. © 2016 The Society for Applied Microbiology.
Structure Predictions of Two Bauhinia variegata Lectins Reveal Patterns of C-Terminal Properties in Single Chain Legume Lectins

PubMed Central

Moreira, Gustavo M. S. G.; Conceição, Fabricio R.; McBride, Alan J. A.; Pinto, Luciano da S.

2013-01-01

Bauhinia variegata lectins (BVL-I and BVL-II) are single chain lectins isolated from the plant Bauhinia variegata. Single chain lectins undergo post-translational processing on its N-terminal and C-terminal regions, which determines their physiological targeting, carbohydrate binding activity and pattern of quaternary association. These two lectins are isoforms, BVL-I being highly glycosylated, and thus far, it has not been possible to determine their structures. The present study used prediction and validation algorithms to elucidate the likely structures of BVL-I and -II. The program Bhageerath-H was chosen from among three different structure prediction programs due to its better overall reliability. In order to predict the C-terminal region cleavage sites, other lectins known to have this modification were analysed and three rules were created: (1) the first amino acid of the excised peptide is small or hydrophobic; (2) the cleavage occurs after an acid, polar, or hydrophobic residue, but not after a basic one; and (3) the cleavage spot is located 5-8 residues after a conserved Leu amino acid. These rules predicted that BVL-I and –II would have fifteen C-terminal residues cleaved, and this was confirmed experimentally by Edman degradation sequencing of BVL-I. Furthermore, the C-terminal analyses predicted that only BVL-II underwent α-helical folding in this region, similar to that seen in SBA and DBL. Conversely, BVL-I and -II contained four conserved regions of a GS-I association, providing evidence of a previously undescribed X4+unusual oligomerisation between the truncated BVL-I and the intact BVL-II. This is the first report on the structural analysis of lectins from Bauhinia spp. and therefore is important for the characterisation C-terminal cleavage and patterns of quaternary association of single chain lectins. PMID:24260572
Structure predictions of two Bauhinia variegata lectins reveal patterns of C-terminal properties in single chain legume lectins.

PubMed

Moreira, Gustavo M S G; Conceição, Fabricio R; McBride, Alan J A; Pinto, Luciano da S

2013-01-01

Bauhinia variegata lectins (BVL-I and BVL-II) are single chain lectins isolated from the plant Bauhinia variegata. Single chain lectins undergo post-translational processing on its N-terminal and C-terminal regions, which determines their physiological targeting, carbohydrate binding activity and pattern of quaternary association. These two lectins are isoforms, BVL-I being highly glycosylated, and thus far, it has not been possible to determine their structures. The present study used prediction and validation algorithms to elucidate the likely structures of BVL-I and -II. The program Bhageerath-H was chosen from among three different structure prediction programs due to its better overall reliability. In order to predict the C-terminal region cleavage sites, other lectins known to have this modification were analysed and three rules were created: (1) the first amino acid of the excised peptide is small or hydrophobic; (2) the cleavage occurs after an acid, polar, or hydrophobic residue, but not after a basic one; and (3) the cleavage spot is located 5-8 residues after a conserved Leu amino acid. These rules predicted that BVL-I and -II would have fifteen C-terminal residues cleaved, and this was confirmed experimentally by Edman degradation sequencing of BVL-I. Furthermore, the C-terminal analyses predicted that only BVL-II underwent α-helical folding in this region, similar to that seen in SBA and DBL. Conversely, BVL-I and -II contained four conserved regions of a GS-I association, providing evidence of a previously undescribed X4+unusual oligomerisation between the truncated BVL-I and the intact BVL-II. This is the first report on the structural analysis of lectins from Bauhinia spp. and therefore is important for the characterisation C-terminal cleavage and patterns of quaternary association of single chain lectins.
Probing the reactivity of nucleophile residues in human 2,3-diphosphoglycerate/deoxy-hemoglobin complex by aspecific chemical modifications.

PubMed

Scaloni, A; Ferranti, P; De Simone, G; Mamone, G; Sannolo, N; Malorni, A

1999-06-11

The use of aspecific methylation reaction in combination with MS procedures has been employed for the characterization of the nucleophilic residues present on the molecular surface of the human 2,3-diphosphoglycerate/deoxy-hemoglobin complex. In particular, direct molecular weight determinations by ESMS allowed to control the reaction conditions, limiting the number of methyl groups introduced in the modified globin chains. A combined LCESMS-Edman degradation approach for the analysis of the tryptic peptide mixtures yielded to the exact identification of methylation sites together with the quantitative estimation of their degree of modification. The reactivities observed were directly correlated with the pKa and the relative surface accessibility of the nucleophilic residues, calculated from the X-ray crystallographic structure of the protein. The results here described indicate that this methodology can be efficiently used in aspecific modification experiments directed to the molecular characterization of the surface topology in proteins and protein complexes.
Structural characterization of thioether-bridged bacteriocins.

PubMed

Lohans, Christopher T; Vederas, John C

2014-01-01

Bacteriocins are a group of ribosomally synthesized antimicrobial peptides produced by bacteria, some of which are extensively post-translationally modified. Some bacteriocins, namely the lantibiotics and sactibiotics, contain one or more thioether bridges. However, these modifications complicate the structural elucidation of these bacteriocins using conventional techniques. This review will discuss the techniques and strategies that have been applied to determine the primary structures of lantibiotics and sactibiotics. A major challenge is to identify the topology of thioether bridges in these peptides (i.e., which amino-acid residues are involved in which bridges). Edman degradation, NMR spectroscopy and tandem MS have all been commonly applied to characterize these bacteriocins, but can be incompatible with the post-translational modifications present. Chemical modifications to the modified residues, such as desulfurization and reduction, make the treated bacteriocins more compatible to analysis by these standard peptide analytical techniques. Despite their differences in structure, similar strategies have proved useful to study the structures of both lantibiotics and sactibiotics.
Monitoring abacavir bioactivation in humans: screening for an aldehyde metabolite.

PubMed

Grilo, Nádia M; Antunes, Alexandra M M; Caixas, Umbelina; Marinho, Aline T; Charneira, Catarina; Conceição Oliveira, M; Monteiro, Emília C; Matilde Marques, M; Pereira, Sofia A

2013-05-10

The anti-HIV drug abacavir is associated with idiosyncratic hypersensitivity reactions and cardiotoxicity. Although the mechanism underlying abacavir-toxicity is not fully understood, drug bioactivation to reactive metabolites may be involved. This work was aimed at identifying abacavir-protein adducts in the hemoglobin of HIV patients as biomarkers of abacavir bioactivation and protein modification. The protocol received prior approval from the Hospital Ethics Committee, patients gave their written informed consent and adherence was controlled through a questionnaire. Abacavir-derived Edman adducts with the N-terminal valine of hemoglobin were analyzed by an established liquid chromatography-electrospray ionization-tandem mass spectrometry method. Abacavir-valine adducts were detected in three out of ten patients. This work represents the first evidence of abacavir-protein adduct formation in humans. The data confirm the ability of abacavir to modify self-proteins and suggest that the molecular mechanism(s) of some abacavir-induced adverse reactions may require bioactivation. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Porifera Lectins: Diversity, Physiological Roles and Biotechnological Potential.

PubMed

Gardères, Johan; Bourguet-Kondracki, Marie-Lise; Hamer, Bojan; Batel, Renato; Schröder, Heinz C; Müller, Werner E G

2015-08-07

An overview on the diversity of 39 lectins from the phylum Porifera is presented, including 38 lectins, which were identified from the class of demosponges, and one lectin from the class of hexactinellida. Their purification from crude extracts was mainly performed by using affinity chromatography and gel filtration techniques. Other protocols were also developed in order to collect and study sponge lectins, including screening of sponge genomes and expression in heterologous bacterial systems. The characterization of the lectins was performed by Edman degradation or mass spectrometry. Regarding their physiological roles, sponge lectins showed to be involved in morphogenesis and cell interaction, biomineralization and spiculogenesis, as well as host defense mechanisms and potentially in the association between the sponge and its microorganisms. In addition, these lectins exhibited a broad range of bioactivities, including modulation of inflammatory response, antimicrobial and cytotoxic activities, as well as anticancer and neuromodulatory activity. In view of their potential pharmacological applications, sponge lectins constitute promising molecules of biotechnological interest.
Porifera Lectins: Diversity, Physiological Roles and Biotechnological Potential

PubMed Central

Gardères, Johan; Bourguet-Kondracki, Marie-Lise; Hamer, Bojan; Batel, Renato; Schröder, Heinz C.; Müller, Werner E. G.

2015-01-01

An overview on the diversity of 39 lectins from the phylum Porifera is presented, including 38 lectins, which were identified from the class of demosponges, and one lectin from the class of hexactinellida. Their purification from crude extracts was mainly performed by using affinity chromatography and gel filtration techniques. Other protocols were also developed in order to collect and study sponge lectins, including screening of sponge genomes and expression in heterologous bacterial systems. The characterization of the lectins was performed by Edman degradation or mass spectrometry. Regarding their physiological roles, sponge lectins showed to be involved in morphogenesis and cell interaction, biomineralization and spiculogenesis, as well as host defense mechanisms and potentially in the association between the sponge and its microorganisms. In addition, these lectins exhibited a broad range of bioactivities, including modulation of inflammatory response, antimicrobial and cytotoxic activities, as well as anticancer and neuromodulatory activity. In view of their potential pharmacological applications, sponge lectins constitute promising molecules of biotechnological interest. PMID:26262628
(+)-Meyeniins A-C, Novel Hexahydroimidazo[1,5-c]thiazole Derivatives from the Tubers of Lepidium meyenii: Complete Structural Elucidation by Biomimetic Synthesis and Racemic Crystallization.

PubMed

Zhou, Min; Ma, Hang-Ying; Liu, Zhi-Hua; Yang, Guang-Yu; Du, Gang; Ye, Yan-Qing; Li, Gan-Peng; Hu, Qiu-Fen

2017-03-08

(+)-Meyeniins A-C (1-3), a novel class of sulfur-containing hexahydroimidazo[1,5-c]thiazole derivatives, were isolated from the tubers of Lepidium meyenii (maca) cultivated in Lijiang, Yunnan province, China. Guided by their biosynthetic hypothesis, a stereocontrolled biomimetic synthesis of meyeniins A-C and their individual enantiomers was efficiently accomplished by a combination of a condensation reaction and Edman degradation. The formation of high-quality crystals for X-ray crystallography occurred much more readily from a racemic mixture of (±)-meyeniin A than with the single enantiomer alone in this case. These extensive strategies, combined with circular dichroism (CD) spectra, allowed the complete structural assignments of (+)-meyeniins A-C. Among them, (+)-meyeniin A showed moderate selective cytotoxicities against the HL-60, A549 and MCF-7 human cell lines with IC 50 values of 14.41, 32.22, and 33.14 μM, respectively. To some extent, these findings support traditional applications of maca as healthy nutritional supplements or functional foods for cancer prevention.
Evaluation of IgE reactivity of active and thermally inactivated actinidin, a biomarker of kiwifruit allergy.

PubMed

Grozdanovic, Milica; Popovic, Milica; Polovic, Natalija; Burazer, Lidija; Vuckovic, Olga; Atanaskovic-Markovic, Marina; Lindner, Buko; Petersen, Arnd; Gavrovic-Jankulovic, Marija

2012-03-01

Actinidin, an abundant cysteine protease from kiwifruit, is a specific biomarker of isolated allergy to kiwifruit. This study evaluates the IgE-binding properties of biologically active and thermally inactivated actinidin. Employing two different activity assays (caseinolytic assay and zymogram with gelatin) we showed that actinidin obtained from kiwifruit extract under native conditions represents a mixture of inactive and active enzyme. The structural integrity of actinidin was confirmed by SDS-PAGE, Edman degradation, mass fingerprint and Western blot with polyclonal antibodies. Although it was capable of inducing positive skin prick test reactions, we failed to detect IgE reactivity of active actinidin in Western blot with patient sera. Thermally inactivated actinidin exhibited IgE reactivity both in vivo and in vitro, indicating that heat processed kiwifruit products may induce clinical reactivity. These findings imply that apart from the allergenic epitopes on its surface, actinidin also contains hidden epitopes inside the protein which become accessible to IgE upon thermal treatment. Copyright Â© 2011 Elsevier Ltd. All rights reserved.
Comparison of Different IMAC Techniques Used for Enrichment of Phosphorylated Peptides

PubMed Central

Kånge, Rikard; Selditz, Ulrike; Granberg, Maria; Lindberg, Ulrika; Ekstrand, Gunnar; Ek, Bo; Gustafsson, Magnus

2005-01-01

Four commercially available immobilized metal ion affinity chromatography (IMAC) methods for phosphopeptide enrichment were compared using small volumes and concentrations of phosphopeptide mixtures with or without extra-added bovine serum albumin (BSA) nonphosphorylated peptides. Addition of abundant tryptic BSA peptides to the phosphopeptide mixture increases the demand for selective IMAC capture. While SwellGel gallium Discs, IPAC Metal Chelating Resin, and ZipTipMC Pipette Tips allow for the possibility of enriching phosphopeptides, the Gyrolab MALDI IMAC1 also presents the possibility of verifying existing phosphopeptides after a dephosphorylation step. Phosphate-containing peptides are identified through a mass shift between phosphorylated and dephosphorylated spectra of 80 Da (or multiples of 80 Da). This verification is useful if the degree of phosphorylation is low in the sample or if the ionization is unfavorable, which often is the case for phosphopeptides. A peptide mixture in which phosphorylated serine, threonine, and tyrosine were represented was diluted in steps and thereafter enriched using the four different IMAC methods prior to analyses with matrix assisted laser desorption/ionization mass spectrometry. The enrichment of phosphopeptides using SwellGel Gallium Discs or Gyrolab MALDI IMAC1 was not significantly affected by the addition of abundant BSA peptides added to the sample mixture, and the achieved detection limits using these techniques were also the lowest. All four of the included phosphopeptides were detected by MALDI-MS only after enrichment using the Gyrolab MALDI IMAC1 compact disc (CD) and detection down to low femtomole levels was possible. Furthermore, selectivity, reproducibility, and detection for a number of other phosphopeptides using the IMAC CD are reported herein. For example, two phosphopeptides sent out in a worldwide survey performed by the Proteomics Research Group (PRG03) of the Association of Biomolecular Resource Facilities (ABRF) were detected and verified by means of the 80 Da mass shift achieved by on-column dephosphorylation. PMID:16030316
Acidic α-galactosidase is the most abundant nectarin in floral nectar of common tobacco (Nicotiana tabacum)

PubMed Central

Zha, Hong-Guang; Flowers, V. Lynn; Yang, Min; Chen, Ling-Yang; Sun, Hang

2012-01-01

Background and Aims To date, most floral nectarins (nectar proteins) are reported to function in nectar defence, particularly for insect-pollinated outcrossing species. We compared nectarin composition and abundance in selfing common tobacco (Nicotiana tobaccum) with outcrossing ornamental tobacco plants to elucidate the functional difference of nectarins in different reproductive systems. Methods Common tobacco (CT) nectarins were separated by SDS-PAGE and the N terminus of the most abundant nectarin was sequenced via Edman degradation. The full-length nectarin gene was amplified and cloned from genomic DNA and mRNA with hiTail-PCR and RACE (rapid amplification of cDNA ends), and expression patterns were then investigated in different tissues using semi-quantitative reverse transcriptase PCR. Additionally, high-performance liquid chromatography and enzymatic analyses of nectar sugar composition, and other biochemical traits and functions of the novel nectarin were studied. Key Results The most abundant nectarin in CT nectar is an acidic α-galactosidase, here designated NTα-Gal. This compound has a molecular mass of 40 013 Da and a theoretical pI of 5·33. NTα-Gal has a conserved α-Gal characteristic signature, encodes a mature protein of 364 amino acids and is expressed in different organs. Compared with 27 other melliferous plant species from different families, CT floral nectar demonstrated the highest α-Gal activity, which is inhibited by d-galactose. Raffinose family oligosaccharides were not detected in CT nectar, indicating that NTα-Gal does not function in post-secretory hydrolysis. Moreover, tobacco plant fruits did not develop intact skin with galactose inhibition of NTα-Gal activity in nectar, suggesting that NTα-Gal induces cell-wall surface restructuring during the initial stages of fruit development. Conclusions α-Gal was the most abundant nectarin in selfing CT plants, but was not detected in the nectar of strictly outcrossing sister tobacco species. No function was demonstrated in antimicrobial defence. Therefore, floral nectarins in selfing species maintain their functional significance in reproductive organ development. PMID:22271925
Daboxin P, a Major Phospholipase A2 Enzyme from the Indian Daboia russelii russelii Venom Targets Factor X and Factor Xa for Its Anticoagulant Activity

PubMed Central

Iyer, Janaki Krishnamurthy; Shih, Norrapat; Majumder, Munmi; Mattaparthi, Venkata Satish Kumar; Mukhopadhyay, Rupak; Doley, Robin

2016-01-01

In the present study a major protein has been purified from the venom of Indian Daboia russelii russelii using gel filtration, ion exchange and Rp-HPLC techniques. The purified protein, named daboxin P accounts for ~24% of the total protein of the crude venom and has a molecular mass of 13.597 kDa. It exhibits strong anticoagulant and phospholipase A2 activity but is devoid of any cytotoxic effect on the tested normal or cancerous cell lines. Its primary structure was deduced by N-terminal sequencing and chemical cleavage using Edman degradation and tandem mass spectrometry. It is composed of 121 amino acids with 14 cysteine residues and catalytically active His48 -Asp49 pair. The secondary structure of daboxin P constitutes 42.73% of α-helix and 12.36% of β-sheet. It is found to be stable at acidic (pH 3.0) and neutral pH (pH 7.0) and has a Tm value of 71.59 ± 0.46°C. Daboxin P exhibits anticoagulant effect under in-vitro and in-vivo conditions. It does not inhibit the catalytic activity of the serine proteases but inhibits the activation of factor X to factor Xa by the tenase complexes both in the presence and absence of phospholipids. It also inhibits the tenase complexes when active site residue (His48) was alkylated suggesting its non-enzymatic mode of anticoagulant activity. Moreover, it also inhibits prothrombinase complex when pre-incubated with factor Xa prior to factor Va addition. Fluorescence emission spectroscopy and affinity chromatography suggest the probable interaction of daboxin P with factor X and factor Xa. Molecular docking analysis reveals the interaction of the Ca+2 binding loop; helix C; anticoagulant region and C-terminal region of daboxin P with the heavy chain of factor Xa. This is the first report of a phospholipase A2 enzyme from Indian viper venom which targets both factor X and factor Xa for its anticoagulant activity. PMID:27089306
Okadaic acid-induced, naringin-sensitive phosphorylation of glycine N-methyltransferase in isolated rat hepatocytes.

PubMed Central

Møller, Michael T N; Samari, Hamid R; Fengsrud, Monica; Strømhaug, Per E; øStvold, Anne C; Seglen, Per O

2003-01-01

Glycine N-methyltransferase (GNMT) is an abundant cytosolic enzyme that catalyses the methylation of glycine into sarcosine, coupled with conversion of the methyl donor, S -adenosylmethionine (AdoMet), into S -adenosylhomocysteine (AdoHcy). GNMT is believed to play a role in monitoring the AdoMet/AdoHcy ratio, and hence the cellular methylation capacity, but regulation of the enzyme itself is not well understood. In the present study, treatment of isolated rat hepatocytes with the protein phosphatase inhibitor okadaic acid, was found to induce an overphosphorylation of GNMT, as shown by proteomic analysis. The analysis comprised two-dimensional gel electrophoretic separation of (32)P-labelled phosphoproteins and identification of individual protein spots by matrix-assisted laser-desorption ionization-time-of-flight mass spectrometry. The identity of GNMT was verified by N-terminal Edman sequencing of tryptic peptides. Chromatographic separation of proteolytic peptides and (32)P-labelled amino acids suggested that GNMT was phosphorylated within a limited region, and only at serine residues. GNMT phosphorylation could be suppressed by naringin, an okadaic acid-antagonistic flavonoid. To assess the possible functional role of GNMT phosphorylation, the effect of okadaic acid on hepatocytic AdoMet and AdoHcy levels was examined, using HPLC separation for metabolite analysis. Surprisingly, okadaic acid was found to have no effect on the basal levels of AdoMet or AdoHcy. An accelerated AdoMet-AdoHcy flux, induced by the addition of methionine (1 mM), was likewise unaffected by okadaic acid. 5-Aminoimidazole-4-carboxamide riboside, an activator of the hepatocytic AMP-activated protein kinase, similarly induced GNMT phosphorylation without affecting AdoMet and AdoHcy levels. Activation of cAMP-dependent protein kinase by dibutyryl-cAMP, reported to cause GNMT phosphorylation under cell-free conditions, also had little effect on hepatocytic AdoMet and AdoHcy levels. Phosphorylation of GNMT would thus seem to play no role in regulation of the intracellular AdoMet/AdoHcy ratio, but could be involved in other GNMT functions, such as the binding of folates or aromatic hydrocarbons. PMID:12697024
Tryptic digestion of human GPIIIa. Isolation and biochemical characterization of the 23 kDa N-terminal glycopeptide carrying the antigenic determinant for a monoclonal antibody (P37) which inhibits platelet aggregation.

PubMed Central

Calvete, J J; Rivas, G; Maruri, M; Alvarez, M V; McGregor, J L; Hew, C L; Gonzalez-Rodriguez, J

1988-01-01

Early digestion of pure human platelet glycoprotein IIIa (GPIIIa) leads to a single cleavage of the molecule at 23 kDa far from one of the terminal amino acids. Automated Edman degradation demonstrates that GPIIIa and the smaller (23 kDa) tryptic fragment share the same N-terminal amino acid sequence. A further cleavage occurs in the larger fragment (80 kDa), reducing its apparent molecular mass by 10 kDa. The 23 kDa fragment remains attached to the larger ones in unreduced samples. Stepwise reduction of early digested GPIIIa with dithioerythritol selectively reduces the single disulphide bond joining the smaller (23 kDa) to the larger (80/70 kDa) fragments. Two fractions were obtained by size-exclusion chromatography of early digested GPIIIa after partial or full reduction and alkylation. The larger-size fraction contains the 80/70 kDa fragments, while the 23 kDa fragment is isolated in the smaller. The amino acid compositions of these fractions do not differ very significantly from the composition of GPIIIa; however the 23 kDa fragment contains only 10.2% by weight of sugars and is richer in neuraminic acid. Disulphide bonds are distributed four in the 23 kDa glycopeptide and 20-21 in the 80/70 kDa glycopeptide. The epitope for P37, a monoclonal antibody which inhibits platelet aggregation [Melero & González-Rodríguez (1984) Eur. J. Biochem. 141, 421-427] is situated within the first 17 kDa of the N-terminal region of GPIIIa, which gives a special functional interest to this extracellular region of GPIIIa. On the other hand, the epitopes for GPIIIa-specific monoclonal antibodies, P6, P35, P40 and P97, which do not interfere with platelet aggregation, are located within the larger tryptic fragment (80/70 kDa). Thus, the antigenic areas available in the extracellular surface of GPIIIa for these five monoclonal antibodies are now more precisely delineated. Images Fig. 1. Fig. 2. Fig. 3. Fig. 4. PMID:2455507
Identification of tyrosine phosphorylation sites in human Gab-1 protein by EGF receptor kinase in vitro.

PubMed

Lehr, S; Kotzka, J; Herkner, A; Klein, E; Siethoff, C; Knebel, B; Noelle, V; Brüning, J C; Klein, H W; Meyer, H E; Krone, W; Müller-Wieland, D

1999-01-05

Grb2-associated binder-1 (Gab-1) has been identified recently in a cDNA library of glioblastoma tumors and appears to play a central role in cellular growth response, transformation, and apoptosis. Structural and functional features indicate that Gab-1 is a multisubstrate docking protein downstream in the signaling pathways of different receptor tyrosine kinases, including the epidermal growth factor receptor (EGFR). Therefore, the aim of the study was to characterize the phosphorylation of recombinant human Gab-1 (hGab-1) protein by EGFR in vitro. Using the pGEX system to express the entire protein and different domains of hGab-1 as glutathione S-transferase proteins, kinetic data for phosphorylation of these proteins by wheat germ agglutinine-purified EGFR and the recombinant EGFR (rEGFR) receptor kinase domain were determined. Our data revealed similar affinities of hGab-1-C for both receptor preparations (KM = 2.7 microM for rEGFR vs 3.2 microM for WGA EGFR) as well as for the different recombinant hGab-1 domains. To identify the specific EGFR phosphorylation sites, hGab-1-C was sequenced by Edman degradation and mass spectrometry. The entire protein was phosphorylated by rEGFR at eight tyrosine residues (Y285, Y373, Y406, Y447, Y472, Y619, Y657, and Y689). Fifty percent of the identified radioactivity was incorporated in tyrosine Y657 as the predominant peak in HPLC analysis, a site exhibiting features of a potential Syp (PTP1D) binding site. Accordingly, GST-pull down assays with A431 and HepG2 cell lysates showed that phosphorylated intact hGab-1 was able to bind Syp. This binding appears to be specific, because it was abolished by changing the Y657 of hGab-1 to F657. These results demonstrate that hGab-1 is a high-affinity substrate for the EGFR and the major tyrosine phosphorylation site Y657 in the C terminus is a specific binding site for the tyrosine phosphatase Syp.
Self-consistency tests of large-scale dynamics parameterizations for single-column modeling

DOE PAGES

Edman, Jacob P.; Romps, David M.

2015-03-18

Large-scale dynamics parameterizations are tested numerically in cloud-resolving simulations, including a new version of the weak-pressure-gradient approximation (WPG) introduced by Edman and Romps (2014), the weak-temperature-gradient approximation (WTG), and a prior implementation of WPG. We perform a series of self-consistency tests with each large-scale dynamics parameterization, in which we compare the result of a cloud-resolving simulation coupled to WTG or WPG with an otherwise identical simulation with prescribed large-scale convergence. In self-consistency tests based on radiative-convective equilibrium (RCE; i.e., no large-scale convergence), we find that simulations either weakly coupled or strongly coupled to either WPG or WTG are self-consistent, butmore » WPG-coupled simulations exhibit a nonmonotonic behavior as the strength of the coupling to WPG is varied. We also perform self-consistency tests based on observed forcings from two observational campaigns: the Tropical Warm Pool International Cloud Experiment (TWP-ICE) and the ARM Southern Great Plains (SGP) Summer 1995 IOP. In these tests, we show that the new version of WPG improves upon prior versions of WPG by eliminating a potentially troublesome gravity-wave resonance.« less
Hemoglobin adducts as biomarkers of 1,3-butadiene in occupationally low exposed Italian workers and a few diesel-exposed miners.

PubMed

Begemann, P; Upton, P B; Ranasinghe, A; Swenberg, J A; Soleo, L; Vimercati, L; Gelormini, A; Fustinoni, S; Zwirner-Baier, I; Neumann, H G

2001-06-01

Hemoglobin adducts were determined as biomarkers of 1,3-butadiene (BD) in 30 workers and 10 controls from an Italian BD plant and in 14 diesel-exposed miners. N-(2,3,4-trihydroxybutyl)valine (THBVal), an N-terminal valine globin adduct of reactive butadiene metabolites, was analyzed by gas chromatography/high resolution mass spectrometry after a modified Edman degradation and further acetylation. The BD exposure for the plant workers was 31 microg/m(3) (personal sampling). Whereas there was no detectable difference in hemoglobin adduct levels (range 17.7-61.4 pmol/g globin) between the total group of exposed and controls, slight but significant differences could be found between two subgroups of workers from different production units as well as one subgroup and controls (P<0.05), between smoking (n=13) and non-smoking exposed workers (n=17; P=0.066) as well as between smoking exposed workers and controls (P=0.055). Adduct levels of the miners (all non-smokers) were in the same range as those of the Italian BD-workers and controls. The internal exposure and strain measured by THBVal levels resulting from a very low occupational BD exposure was in the range of the contribution of moderate smoking.
DEPRESSION AND INTERNALLY DIRECTED AGGRESSION: GENETIC AND ENVIRONMENTAL CONTRIBUTIONS

PubMed Central

Haddad, Suzanne K.; Neiderhiser, Jenae M.; Spotts, Erica L.; Ganiban, Jody; Lichtenstein, Paul; Reiss, David

2013-01-01

This study uses behavior genetic (BG) methodology to investigate Freud’s theory of depression as aggression directed toward the self (1930) and the extent to which genetically and environmentally influenced aggressive tendencies contribute to depressive symptoms. Data from the Twin and Offspring Study in Sweden (TOSS) is used to demonstrate how, in estimating shared and unique environmental influences, BG methods can inform psychoanalytic theory and practice, particularly because of their shared emphasis on the importance of individual experience in development. The TOSS sample consists of 909 pairs of adult twins, their partners, and one adolescent child per couple. The Center for Epidemiologic Studies Depression Scale (Radloff 1977) was used to measure depressive symptoms and the Karolinska Scales of Personality (Schalling and Edman 1993) to measure internally directed aggression. Genetic analyses indicated that for both men and women, their unique experiences as well as genetic factors contributed equally to the association between internally directed aggression and depressive symptoms. These findings support Freud’s theory that constitutionally based differences in aggression, along with individual experiences, contribute to a person’s depressive symptoms. Establishing that an individual’s unique, not shared, experiences and perceptions contribute to depressive symptoms and internally directed aggression reinforces the use of patient-specific treatment approaches implemented in psychoanalytic psychotherapy or psychoanalysis. PMID:18515705

The fsr Quorum-Sensing System and Cognate Gelatinase Orchestrate the Expression and Processing of Proprotein EF_1097 into the Mature Antimicrobial Peptide Enterocin O16.

PubMed

Dundar, Halil; Brede, Dag A; La Rosa, Sabina Leanti; El-Gendy, Ahmed Osama; Diep, Dzung B; Nes, Ingolf F

2015-07-01

A novel antimicrobial peptide designated enterocin O16 was purified from Enterococcus faecalis. Mass spectrometry showed a monoisotopic mass of 7,231 Da, and N-terminal Edman degradation identified a 29-amino-acid sequence corresponding to residues 90 to 119 of the EF_1097 protein. Bioinformatic analysis showed that enterocin O16 is composed of the 68 most C-terminal residues of the EF_1097 protein. Introduction of an in-frame isogenic deletion in the ef1097 gene abolished the production of enterocin O16. Enterocin O16 has a narrow inhibitory spectrum, as it inhibits mostly lactobacilli. Apparently, E. faecalis is intrinsically resistant to the antimicrobial peptide, as no immunity connected to the production of enterocin O16 could be identified. ef1097 has previously been identified as one of three loci regulated by the fsr quorum-sensing system. The introduction of a nonsense mutation into fsrB consistently impaired enterocin O16 production, but externally added gelatinase biosynthesis-activating pheromone restored the antimicrobial activity. Functional genetic analysis showed that the EF_1097 proprotein is processed extracellularly into enterocin O16 by the metalloprotease GelE. Thus, it is evident that the fsr quorum-sensing system constitutes the regulatory unit that controls the expression of the EF_1097 precursor protein and the protease GelE and that the latter is required for the formation of enterocin O16. On the basis of these results, this study identified antibacterial antagonism as a novel aspect related to the function of fsr and provides a rationale for why ef1097 is part of the fsr regulon. The fsr quorum-sensing system modulates important physiological functions in E. faecalis via the activity of GelE. The present study presents a new facet of fsr signaling. The system controls the expression of three primary target operons (fsrABCD, gelE-sprE, and ef1097-ef1097b). We demonstrate that the concerted expression of these operons constitutes the elements necessary for the production of a bacteriocin-type peptide and that antimicrobial antagonism is an intrinsic function of fsr. The bacteriocin enterocin O16 consists of the 68 most C-terminal residues of the EF_1097 secreted proprotein. The GelE protease processes the EF_1097 proprotein into enterocin O16. In this manner, fsr signaling enables E. faecalis populations to express antimicrobial activity in a cell density-dependent manner. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
The fsr Quorum-Sensing System and Cognate Gelatinase Orchestrate the Expression and Processing of Proprotein EF_1097 into the Mature Antimicrobial Peptide Enterocin O16

PubMed Central

Dundar, Halil; Brede, Dag A.; La Rosa, Sabina Leanti; El-Gendy, Ahmed Osama; Diep, Dzung B.

2015-01-01

ABSTRACT A novel antimicrobial peptide designated enterocin O16 was purified from Enterococcus faecalis. Mass spectrometry showed a monoisotopic mass of 7,231 Da, and N-terminal Edman degradation identified a 29-amino-acid sequence corresponding to residues 90 to 119 of the EF_1097 protein. Bioinformatic analysis showed that enterocin O16 is composed of the 68 most C-terminal residues of the EF_1097 protein. Introduction of an in-frame isogenic deletion in the ef1097 gene abolished the production of enterocin O16. Enterocin O16 has a narrow inhibitory spectrum, as it inhibits mostly lactobacilli. Apparently, E. faecalis is intrinsically resistant to the antimicrobial peptide, as no immunity connected to the production of enterocin O16 could be identified. ef1097 has previously been identified as one of three loci regulated by the fsr quorum-sensing system. The introduction of a nonsense mutation into fsrB consistently impaired enterocin O16 production, but externally added gelatinase biosynthesis-activating pheromone restored the antimicrobial activity. Functional genetic analysis showed that the EF_1097 proprotein is processed extracellularly into enterocin O16 by the metalloprotease GelE. Thus, it is evident that the fsr quorum-sensing system constitutes the regulatory unit that controls the expression of the EF_1097 precursor protein and the protease GelE and that the latter is required for the formation of enterocin O16. On the basis of these results, this study identified antibacterial antagonism as a novel aspect related to the function of fsr and provides a rationale for why ef1097 is part of the fsr regulon. IMPORTANCE The fsr quorum-sensing system modulates important physiological functions in E. faecalis via the activity of GelE. The present study presents a new facet of fsr signaling. The system controls the expression of three primary target operons (fsrABCD, gelE-sprE, and ef1097-ef1097b). We demonstrate that the concerted expression of these operons constitutes the elements necessary for the production of a bacteriocin-type peptide and that antimicrobial antagonism is an intrinsic function of fsr. The bacteriocin enterocin O16 consists of the 68 most C-terminal residues of the EF_1097 secreted proprotein. The GelE protease processes the EF_1097 proprotein into enterocin O16. In this manner, fsr signaling enables E. faecalis populations to express antimicrobial activity in a cell density-dependent manner. PMID:25733609
Production and structural characterization of amino terminally histidine tagged human oncostatin M in E. coli.

PubMed

Sporeno, E; Barbato, G; Graziani, R; Pucci, P; Nitti, G; Paonessa, G

1994-05-01

Oncostatin M is a cytokine that acts as a growth regulator on a wide variety of cells and has diverse biological activities including acute phase protein induction, LDL receptor up-regulation and cell-specific gene expression. In order to gather information about the Onc M structure, we established a protocol for large scale production and single step purification of this functional cytokine from bacterial cells. The cDNA of human Onc M was cloned by RT-PCR from total RNA of PMA induced U937 cells. After the addition of a six histidine tag at the N-terminus, the coding region of mature Onc M was cloned in the pT7.7 expression vector. Histidine tagged Onc M was overexpressed in bacterial cells and purified to homogeneity in one step on a metal chelating column. We found that recombinant 6xHis-OncM remains fully active in a growth inhibition assay. Structural characterization of the purified protein was performed by electrospray mass spectrometry, automated Edman degradation and peptide mapping by high-pressure liquid chromatography/fast-atom-bombardment mass spectrometry. Thermal and pH stability dependence of Onc M was assessed by circular dichroism spectroscopy; the helical content is about 50%, in agreement with the four helix bundle fold postulated for cytokines that bind haematopoietic receptors of type I.
Purification and characterization of a lectin from the white shrimp Litopenaeus setiferus (Crustacea decapoda) hemolymph.

PubMed

Alpuche, Juan; Pereyra, Ali; Agundis, Concepción; Rosas, Carlos; Pascual, Cristina; Slomianny, Marie-Christine; Vázquez, Lorena; Zenteno, Edgar

2005-06-20

A 291-kDa lectin (LsL) was purified from the hemolymph of the white shrimp Litopenaeus setiferus by affinity chromatography on glutaraldehyde-fixed stroma from rabbit erythrocytes. LsL is a heterotetramer of two 80-kDa and two 52-kDa subunits, with no covalently-liked carbohydrate, and mainly composed by aspartic and glutamic acids, glycine and alanine, with relatively lower methionine and cysteine contents. Edman degradation indicated that the NH2-terminal of the 80-kDa subunit is composed DASNAQKQHDVNFLL, whereas the NH2-terminal of the 52-kDa subunit is blocked. The peptide mass fingerprint of LsL was predicted from tryptic peptides from each subunit by MALDI-TOF, and revealed that each subunit showed 23 and 22%, respectively, homology with the hemocyanin precursor from Litopenaeus vannamei. Circular dichroism analysis revealed beta sheet and alpha helix contents of 52.7 and 6.1%, respectively. LsL agglutinate at higher titers guinea pig, murine, and rabbit erythrocytes its activity is divalent cation-dependent. N-acetylated sugars, such as GlcNAc, GalNAc, and NeuAc, were the most effective inhibitors of the LsL hemagglutinating activity. Sialylated O-glycosylated proteins, such as bovine submaxillary gland mucin, human IgA, and fetuin, showed stronger inhibitory activity than sialylated N-glycosylated proteins, such as human orosomucoid, IgG, transferrin, and lactoferrin. Desialylation of erythrocytes or inhibitory glycoproteins abolished their capacity to bind LsL, confirming the relevance of sialic acid in LsL-ligand interactions.
Ubiquitylation Functions in the Calcium Carbonate Biomineralization in the Extracellular Matrix

PubMed Central

Fang, Dong; Pan, Cong; Lin, Huijuan; Lin, Ya; Xu, Guangrui; Zhang, Guiyou; Wang, Hongzhong; Xie, Liping; Zhang, Rongqing

2012-01-01

Mollusks shell formation is mediated by matrix proteins and many of these proteins have been identified and characterized. However, the mechanisms of protein control remain unknown. Here, we report the ubiquitylation of matrix proteins in the prismatic layer of the pearl oyster, Pinctada fucata. The presence of ubiquitylated proteins in the prismatic layer of the shell was detected with a combination of western blot and immunogold assays. The coupled ubiquitins were separated and identified by Edman degradation and liquid chromatography/mass spectrometry (LC/MS). Antibody injection in vivo resulted in large amounts of calcium carbonate randomly accumulating on the surface of the nacreous layer. These ubiquitylated proteins could bind to specific faces of calcite and aragonite, which are the two main mineral components of the shell. In the in vitro calcium carbonate crystallization assay, they could reduce the rate of calcium carbonate precipitation and induce the calcite formation. Furthermore, when the attached ubiquitins were removed, the functions of the EDTA-soluble matrix of the prismatic layer were changed. Their potency to inhibit precipitation of calcium carbonate was decreased and their influence on the morphology of calcium carbonate crystals was changed. Taken together, ubiquitylation is involved in shell formation. Although the ubiquitylation is supposed to be involved in every aspect of biophysical processes, our work connected the biomineralization-related proteins and the ubiquitylation mechanism in the extracellular matrix for the first time. This would promote our understanding of the shell biomineralization and the ubiquitylation processes. PMID:22558208
Radioiodination of scorpion and snake toxins. [/sup 125/I, /sup 127/I

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rochat, H.; Tessier, M.; Miranda, F.

1977-10-01

Several scorpion and snake toxins were radioiodinated using the lactoperoxydase method of (/sup 125/I)iodide oxidation. Two techniques of labeling were set up: Using carrier-free Na/sup 125/I and 5 ..mu..g of toxin, about one iodine atom was incorporated per mole of protein without loss of toxicity. Specific radioactivities about 2,000 Ci/mmol (280 ..mu..Ci/..mu..g) were obtained. The modified toxin, purified by immunoprecipitation with an antiserum prepared against the native toxin, was obtained in a short time (4 hr), with a good yield (50 to 80%), and in a small volume (1 ml). Using Na/sup 127/I traced with Na /sup 125/I and largermore » amounts (200 ..mu..g) of toxin, more than one iodine atom was incorporated per mole of protein without loss of activity. Lower specific radioactivities (1 to 1.5 Ci/mmol) were obtained. The iodinated toxins were purified by gel filtration of the radioiodination mixtures on a column made of two layers of Sephadex (G-15 and G-50). The modified proteins were extensively analyzed by paper electrophoresis and polyacrylamide gel electrophoresis. Their content of monoiodotyrosine and diiodotyrosine was estimated and, in the case of toxin I of Androctonus australis Hector, it was possible to follow the iodination rate of its three tyrosine residues by automatic Edman degradation. The mode of purification of the iodinated scorpion toxins affects their behavior on molecular sieving on Sephadex G-50 and on electrophoresis on polyacrylamide gel. The results are discussed.« less
Antimicrobial peptides isolated from Phyllomedusa nordestina (Amphibia) alter the permeability of plasma membrane of Leishmania and Trypanosoma cruzi.

PubMed

Pinto, Erika Gracielle; Pimenta, Daniel C; Antoniazzi, Marta Maria; Jared, Carlos; Tempone, Andre Gustavo

2013-12-01

Nature has provided inspiration for Drug Discovery studies and amphibian secretions have been used as a promising source of effective peptides which could be explored as novel drug prototypes for neglected parasitic diseases as Leishmaniasis and Chagas disease. In this study, we isolated four antimicrobial peptides (AMPs) from Phyllomedusa nordestina secretion, and studied their effectiveness against Leishmania (L.) infantum and Trypanosoma cruzi. The antiparasitic fractions were characterized by mass spectrometry and Edman degradation, leading to the identification of dermaseptins 1 and 4 and phylloseptins 7 and 8. T. cruzi trypomastigotes were susceptible to peptides, showing IC50 values in the range concentration of 0.25-0.68 μM. Leishmania (L.) infantum showed susceptibility to phylloseptin 7, presenting an IC50 value of 10 μM. Except for phylloseptin 7 which moderate showed cytotoxicity (IC50=34 μM), the peptides induced no cellular damage to mammalian cells. The lack of mitochondrial oxidative activity of parasites detected by the MTT assay, suggested that peptides were leishmanicidal and trypanocidal. By using the fluorescent probe SYTOX(®) Green, dermaseptins 1 and 4 and phylloseptins 7 and 8 showed time-dependent plasma membrane permeabilization of T. cruzi; phylloseptin 7 also showed a similar effect in Leishmania parasites. The present study demonstrates for the first time that AMPs target the plasma membrane of Leishmania and T. cruzi, leading to cellular death. Considering the potential of amphibian peptides against protozoan parasites and the reduced mammalian toxicity, they may contribute as scaffolds for drug design studies. Copyright © 2013 Elsevier Inc. All rights reserved.
Cross-reactivity to fish and chicken meat - a new clinical syndrome.

PubMed

Kuehn, A; Codreanu-Morel, F; Lehners-Weber, C; Doyen, V; Gomez-André, S-A; Bienvenu, F; Fischer, J; Ballardini, N; van Hage, M; Perotin, J-M; Silcret-Grieu, S; Chabane, H; Hentges, F; Ollert, M; Hilger, C; Morisset, M

2016-12-01

Fish is one of the most allergenic foods. While clinical cross-reactivity among different fishes is a widely accepted feature of fish allergy, associations with other food allergies are not well understood. This study aims at analyzing the relevance of clinical cross-reactivity between fish and chicken meat in patients with allergy to chicken meat without sensitization to hen's eggs. Patients with food allergy to fish and chicken meat (n = 29) or chicken meat only (n = 7) were recruited. IgE-reactive chicken proteins were identified (Edman, MS analysis) and quantified (ELISA). Allergens were used in IgE ELISA and skin testing. Chicken parvalbumin and two new allergens, aldolase and enolase, were identified at 12, 40, and 50 kDa, respectively. They were recognized by sIgE of 61%, 75%, and 83% of all patient sera which were in the majority of the cases positive for the fish homologues as well. Fish and chicken meat allergens were highly cross-reactive while high inhibition rates with fish or chicken allergens correlated with the patients' primary sensitization to fish or chicken. In cooked or roasted foods, enolase and aldolase were detectable in chicken breast while parvalbumin was detectable in chicken legs and wings. Fish and chicken meat are cross-reactive foods; both fish-allergic and chicken meat-allergic patients might be at risk of developing a food allergy to chicken meat or to fish, respectively. This clinical phenomenon is proposed to be termed 'fish-chicken syndrome' with cross-reactive allergens involved being parvalbumins, enolases, and aldolases. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Heat shock proteins on the human sperm surface.

PubMed

Naaby-Hansen, Soren; Herr, John C

2010-01-01

The sperm plasma membrane is known to be critical to fertilization and to be highly regionalized into domains of head, mid- and principal pieces. However, the molecular composition of the sperm plasma membrane and its alterations during genital tract passage, capacitation and the acrosome reaction remains to be fully dissected. A two-dimensional gel-based proteomic study previously identified 98 human sperm proteins which were accessible for surface labelling with both biotin and radioiodine. In this report twelve dually labelled protein spots were excised from stained gels or PDVF membranes and analysed by mass spectrometry (MS) and Edman degradation. Seven members from four different heat shock protein (HSP) families were identified including HYOU1 (ORP150), HSPC1 (HSP86), HSPA5 (Bip), HSPD1 (HSP60), and several isoforms of the two testis-specific HSP70 chaperones HSPA2 and HSPA1L. An antiserum raised against the testis-specific HSPA2 chaperone reacted with three 65kDa HSPA2 isoforms and three high molecular weight surface proteins (78-79kDa, 84kDa and 90-93kDa). These proteins, together with seven 65kDa HSP70 forms, reacted with human anti-sperm IgG antibodies that blocked in vitro fertilization in humans. Three of these surface biotinylated human sperm antigens were immunoprecipitated with a rabbit antiserum raised against a linear peptide epitope in Chlamydia trachomatis HSP70. The results indicate diverse HSP chaperones are accessible for surface labelling on human sperm. Some of these share epitopes with C. trachomatis HSP70, suggesting an association between genital tract infection, immunity to HSP70 and reproductive failure. 2009 Elsevier Ireland Ltd. All rights reserved.
Occupational exposure to acrylamide in closed system production plants: air levels and biomonitoring.

PubMed

Moorman, William J; Reutman, Susan S; Shaw, Peter B; Blade, Leo Michael; Marlow, David; Vesper, Hubert; Clark, John C; Schrader, Steven M

2012-01-01

The aim of this study was to evaluate biomarkers of acrylamide exposure, including hemoglobin adducts and urinary metabolites in acrylamide production workers. Biomarkers are integrated measures of the internal dose, and it is total acrylamide dose from all routes and sources that may present health risks. Workers from three companies were studied. Workers potentially exposed to acrylamide monomer wore personal breathing-zone air samplers. Air samples and surface-wipe samples were collected and analyzed for acrylamide. General-area air samples were collected in chemical processing units and control rooms. Hemoglobin adducts were isolated from ethylenediamine teraacetic acid (EDTA)-whole blood, and adducts of acrylamide and glycidamide, at the N-terminal valines of hemoglobin, were cleaved from the protein chain by use of a modified Edman reaction. Full work-shift, personal breathing zone, and general-area air samples were collected and analyzed for particulate and acrylamide monomer vapor. The highest general-area concentration of acrylamide vapor was 350 μg/cm(3) in monomer production. Personal breathing zone and general-area concentrations of acrylamide vapor were found to be highest in monomer production operations, and lower levels were in the polymer production operations. Adduct levels varied widely among workers, with the highest in workers in the monomer and polymer production areas. The acrylamide adduct range was 15-1884 pmol/g; glycidamide adducts ranged from 17.8 to 1376 p/mol/g. The highest acrylamide and glycidamide adduct levels were found among monomer production process operators. The primary urinary metabolite N-acetyl-S-(2-carbamoylethyl) cysteine (NACEC) ranged from the limit of detection to 15.4 μg/ml. Correlation of workplace exposure and sentinel health effects is needed to determine and control safe levels of exposure for regulatory standards.
Identification of Propofol Binding Sites in a Nicotinic Acetylcholine Receptor with a Photoreactive Propofol Analog*

PubMed Central

Jayakar, Selwyn S.; Dailey, William P.; Eckenhoff, Roderic G.; Cohen, Jonathan B.

2013-01-01

Propofol, a widely used intravenous general anesthetic, acts at anesthetic concentrations as a positive allosteric modulator of γ-aminobutyric acid type A receptors and at higher concentration as an inhibitor of nicotinic acetylcholine receptors (nAChRs). Here, we characterize propofol binding sites in a muscle-type nAChR by use of a photoreactive analog of propofol, 2-isopropyl-5-[3-(trifluoromethyl)-3H-diazirin-3-yl]phenol (AziPm). Based upon radioligand binding assays, AziPm stabilized the Torpedo nAChR in the resting state, whereas propofol stabilized the desensitized state. nAChR-rich membranes were photolabeled with [3H]AziPm, and labeled amino acids were identified by Edman degradation. [3H]AziPm binds at three sites within the nAChR transmembrane domain: (i) an intrasubunit site in the δ subunit helix bundle, photolabeling in the nAChR desensitized state (+agonist) δM2-18′ and two residues in δM1 (δPhe-232 and δCys-236); (ii) in the ion channel, photolabeling in the nAChR resting, closed channel state (−agonist) amino acids in the M2 helices (αM2-6′, βM2-6′ and -13′, and δM2-13′) that line the channel lumen (with photolabeling reduced by >90% in the desensitized state); and (iii) at the γ-α interface, photolabeling αM2-10′. Propofol enhanced [3H]AziPm photolabeling at αM2-10′. Propofol inhibited [3H]AziPm photolabeling within the δ subunit helix bundle at lower concentrations (IC50 = 40 μm) than it inhibited ion channel photolabeling (IC50 = 125 μm). These results identify for the first time a single intrasubunit propofol binding site in the nAChR transmembrane domain and suggest that this is the functionally relevant inhibitory binding site. PMID:23300078
An isotope-dilution UPLC-MS/MS technique for the human biomonitoring of the internal exposure to glycidol via a valine adduct at the N-terminus of hemoglobin.

PubMed

Hielscher, Jan; Monien, Bernhard H; Abraham, Klaus; Jessel, Sönke; Seidel, Albrecht; Lampen, Alfonso

2017-08-01

Fatty acid esters of glycidol (glycidyl esters) are processing contaminants generated as a byproduct of the industrial deodorization of vegetable oils and fats. Oral intake of glycidyl esters leads to the release of glycidol in the gastrointestinal tract. Glycidol is carcinogenic, genotoxic and teratogenic in rodents. It is rated as probably carcinogenic to humans (IARC group 2A). The determination of internal exposure of glycidol may support the assessment of the possible human health risks related to glycidyl ester intake. For this purpose, hemoglobin adducts of glycidol may be suitable biomarkers reflecting the cumulative exposure of up to four months. We applied a modified Edman degradation to assess the glycidol adduct at the N-terminal valine, N-(2,3-dihydroxypropyl)-valine (2,3-diHOPr-Val), of hemoglobin. The modified valine was cleaved with fluorescein-5-isothiocyanate (FITC), resulting in the formation of N-(2,3-dihydroxypropyl)-valine fluorescein thiohydantoin (DHP-Val-FTH). An isotope-dilution technique was developed for the quantification of the thiohydantoin analyte by ultra performance liquid chromatography-tandem mass spectrometry (UPLC-MS/MS) and DHP-Val-d 7 -FTH as reference standard. The limit of detection was 4 fmol DHP-Val-FTH per injection corresponding to 0.7pmol 2,3-diHOPr-Val/g hemoglobin. The adduct levels in blood samples of 12 non-smoking participants were in the range of 2.2-4.9pmol 2,3-diHOPr-Val/g hemoglobin. The current work presents the first isotope-dilution technique using UPLC-MS/MS for the quantification of 2,3-diHOPr-Val at the N-terminus of hemoglobin as a sensitive and convenient alternative to earlier GC-MS methods. Copyright © 2017 Elsevier B.V. All rights reserved.
Purification and characterization of two bacteriocins produced by lactic acid bacteria isolated from Mongolian airag.

PubMed

Batdorj, B; Dalgalarrondo, M; Choiset, Y; Pedroche, J; Métro, F; Prévost, H; Chobert, J-M; Haertlé, T

2006-10-01

The aim of this study was to isolate and identify bacteriocin-producing lactic acid bacteria (LAB) issued from Mongolian airag (traditional fermented mare's milk), and to purify and characterize bacteriocins produced by these LAB. Identification of the bacteria (Enterococcus durans) was carried out on the basis of its morphological, biochemical characteristics and carbohydrate fermentation profile and by API50CH kit and 16S rDNA analyses. The pH-neutral cell-free supernatant of this bacterium inhibited the growth of several Lactobacillus spp. and food-borne pathogens including Escherichia coli, Staphylococcus aureus and Listeria innocua. The antimicrobial agent (enterocin A5-11) was heat stable and was not sensitive to acid and alkaline conditions (pH 2-10), but was sensitive to several proteolytic enzymes. Its inhibitory activity was completely eliminated after treatment with proteinase K and alpha-chymotrypsin. The activity was however not completely inactivated by other proteases including trypsin and pepsin. Three-step purification procedure with high recovery yields was developed to separate two bacteriocins. The applied procedure allowed the recovery of 16% and 64% of enterocins A5-11A and A5-11B, respectively, present in the culture supernatant with purity higher than 99%. SDS-PAGE analyses revealed that enterocin A5-11 has a molecular mass of 5000 Da and mass spectrometry analyses demonstrates molecular masses of 5206 and 5218 Da for fractions A and B, respectively. Amino acid analyses of both enterocins indicated significant quantitative difference in their contents in threonine, alanine, isoleucine and leucine. Their N-termini were blocked hampering straightforward Edman degradation. Bacteriocins A5-11A and B from Ent. durans belong to the class II of bacteriocins. Judging from molecular masses, amino acid composition and spectrum of activities, bacteriocins A5-11A and B from Ent. durans show high degree of similarity with enterocins L50A and L50B isolated from Enterococcus faecium (Cintas et al. 1998, 2000) and with enterocin I produced by Ent. faecium 6T1a, a strain originally isolated from a Spanish-style green olive fermentation (Floriano et al. 1998).
Monitoring exposure to acrylonitrile using adducts with N-terminal valine in hemoglobin.

PubMed

Osterman-Golkar, S M; MacNeela, J P; Turner, M J; Walker, V E; Swenberg, J A; Sumner, S J; Youtsey, N; Fennell, T R

1994-12-01

Human exposure to acrylonitrile (ACN), a carcinogen in rats, may occur in industrial settings, through waste water and tobacco smoke. ACN is an electrophilic compound and binds covalently to nucleophilic sites in macromolecules. Measurements of adducts with hemoglobin could be utilized for improved exposure assessments. In this study, a method for quantification of N-(2-cyanoethyl)valine (CEVal), the product of reaction of ACN with N-terminal valine in hemoglobin has been developed. The method is based on the N-alkyl Edman procedure, which involves derivatization of the globin with pentafluorophenyl isothiocyanate and gas chromatographic-mass spectrometric analysis of the resulting thiohydantoin. An internal standard was prepared by reacting valylglycylglycine with [2H3]ACN, spiked with [14C]ACN to a known sp. act. Levels of CEVal were measured in globin from rats exposed to 3-300 p.p.m. ACN in drinking water for 105 days and from humans (four smokers and four non-smokers). CEVal was detected at all exposure levels in the drinking water study. The relationship between adduct level and water concentration was linear at concentrations of 10 p.p.m. (corresponding to an average daily uptake of c. 0.74 mg ACN/kg body wt during the 65 days prior to sacrifice) and below, with a slope of 37.7 pmol CEVal/g globin/p.p.m. At higher concentrations, adduct levels increased sublinearly, indicating saturation of a metabolic process for elimination of ACN. Comparison of adduct formation with the estimated dose (mg/kg/day) of ACN indicated that at low dose (0-10 p.p.m.) CEVal = 0.508 x ACN dose + 0.048 and at high dose (35-300 p.p.m.) CEVal = 1.142 x ACN dose - 1.098. Globin from the smokers (10-20 cigarettes/day) contained about 90 pmol CEVal/g, whereas the adduct levels in globin from non-smokers were below the detection limit. The analytical sensitivity should be sufficient to allow monitoring of occupationally exposed workers at levels well below the current Occupational Safety and Health Administration standard of 2 p.p.m.
Analyses of (1-chloroethenyl)oxirane headspace and hemoglobin N-valine adducts in erythrocytes indicate selective detoxification of (1-chloroethenyl)oxirane enantiomers.

PubMed

Hurst, Harrell E; Ali, Md Yeakub

2007-03-20

Chloroprene (2-chloro-1,3-butadiene, CAS 126-99-8, CP) is a colorless volatile liquid used in manufacture of polychloroprene, a synthetic rubber polymer. National Toxicology Program inhalation studies of CP in rats and mice gave clear evidence of carcinogenic activity. CP is metabolized by CYP2E1 to electrophilic epoxides, including R- and S-(1-chloroethenyl)oxirane (CEO), which form adducts with nucleic acids and other nucleophiles including glutathione and hemoglobin. As detection of these epoxide metabolites in vivo is technically challenging, measurements of CEO-Hb adducts may provide biomarkers of exposure to bioactivated metabolites of CP. The present studies involved exposure of C57BL/6 mouse erythrocytes (RBC) in vitro to pure enantiomers of CEO. Headspace analysis of CEO using Cyclodex-B capillary GC/MS with selected ion monitoring enabled separation, specific detection, and quantification of CEO enantiomers as reactions proceeded in vitro with RBC. These analyses indicated that R-CEO was much more persistent when incubated in vitro with RBC, while S-CEO disappeared rapidly. After periods of exposure of RBC to various concentrations of R- or S-CEO, erythrocytes were lysed and globin isolated. Covalent adducts, formed by reaction of CEO with N-terminal valine in Hb, were analyzed following Edman cleavage and trimethylsilylation. SIM-GC/MS analyses using a 5%-phenyl-dimethylsiloxane capillary column enabled quantification of CEO-Hb adducts. These analyses produced two chromatographic peaks of CEO-valine adduct derivatives, which were tentatively identified from mass spectra, reaction, and abundance data to be 1-(3-chloro-2-trimethylsilyloxybut-3-en-1-yl)-5-isopropyl-3-phenyl-2-thiohydantoin and 1-[2-chloro-1-(trimethylsilyloxymethyl)prop-2-en-1-yl]-5-isopropyl-3-phenyl-2-thiohydantoin. Analyses quantified significantly greater levels of adducts formed from R-CEO than from S-CEO. Studies involving pretreatment of RBC with glutathione-depleting diethyl maleate diminished the selective detoxification of S-CEO, and suggest enantiomeric selectivity of mouse glutathione-S-transferase as a mechanism of differential detoxification of CEO enantiomers. These results indicate more rapid detoxification of S-CEO by mouse RBC in vitro, while R-CEO may persist to react with cellular nucleophiles.
Is sequence awareness mandatory for perceptual sequence learning: An assessment using a pure perceptual sequence learning design.

PubMed

Deroost, Natacha; Coomans, Daphné

2018-02-01

We examined the role of sequence awareness in a pure perceptual sequence learning design. Participants had to react to the target's colour that changed according to a perceptual sequence. By varying the mapping of the target's colour onto the response keys, motor responses changed randomly. The effect of sequence awareness on perceptual sequence learning was determined by manipulating the learning instructions (explicit versus implicit) and assessing the amount of sequence awareness after the experiment. In the explicit instruction condition (n = 15), participants were instructed to intentionally search for the colour sequence, whereas in the implicit instruction condition (n = 15), they were left uninformed about the sequenced nature of the task. Sequence awareness after the sequence learning task was tested by means of a questionnaire and the process-dissociation-procedure. The results showed that the instruction manipulation had no effect on the amount of perceptual sequence learning. Based on their report to have actively applied their sequence knowledge during the experiment, participants were subsequently regrouped in a sequence strategy group (n = 14, of which 4 participants from the implicit instruction condition and 10 participants from the explicit instruction condition) and a no-sequence strategy group (n = 16, of which 11 participants from the implicit instruction condition and 5 participants from the explicit instruction condition). Only participants of the sequence strategy group showed reliable perceptual sequence learning and sequence awareness. These results indicate that perceptual sequence learning depends upon the continuous employment of strategic cognitive control processes on sequence knowledge. Sequence awareness is suggested to be a necessary but not sufficient condition for perceptual learning to take place. Copyright © 2018 Elsevier B.V. All rights reserved.
RIKEN Integrated Sequence Analysis (RISA) System—384-Format Sequencing Pipeline with 384 Multicapillary Sequencer

PubMed Central

Shibata, Kazuhiro; Itoh, Masayoshi; Aizawa, Katsunori; Nagaoka, Sumiharu; Sasaki, Nobuya; Carninci, Piero; Konno, Hideaki; Akiyama, Junichi; Nishi, Katsuo; Kitsunai, Tokuji; Tashiro, Hideo; Itoh, Mari; Sumi, Noriko; Ishii, Yoshiyuki; Nakamura, Shin; Hazama, Makoto; Nishine, Tsutomu; Harada, Akira; Yamamoto, Rintaro; Matsumoto, Hiroyuki; Sakaguchi, Sumito; Ikegami, Takashi; Kashiwagi, Katsuya; Fujiwake, Syuji; Inoue, Kouji; Togawa, Yoshiyuki; Izawa, Masaki; Ohara, Eiji; Watahiki, Masanori; Yoneda, Yuko; Ishikawa, Tomokazu; Ozawa, Kaori; Tanaka, Takumi; Matsuura, Shuji; Kawai, Jun; Okazaki, Yasushi; Muramatsu, Masami; Inoue, Yorinao; Kira, Akira; Hayashizaki, Yoshihide

2000-01-01

The RIKEN high-throughput 384-format sequencing pipeline (RISA system) including a 384-multicapillary sequencer (the so-called RISA sequencer) was developed for the RIKEN mouse encyclopedia project. The RISA system consists of colony picking, template preparation, sequencing reaction, and the sequencing process. A novel high-throughput 384-format capillary sequencer system (RISA sequencer system) was developed for the sequencing process. This system consists of a 384-multicapillary auto sequencer (RISA sequencer), a 384-multicapillary array assembler (CAS), and a 384-multicapillary casting device. The RISA sequencer can simultaneously analyze 384 independent sequencing products. The optical system is a scanning system chosen after careful comparison with an image detection system for the simultaneous detection of the 384-capillary array. This scanning system can be used with any fluorescent-labeled sequencing reaction (chain termination reaction), including transcriptional sequencing based on RNA polymerase, which was originally developed by us, and cycle sequencing based on thermostable DNA polymerase. For long-read sequencing, 380 out of 384 sequences (99.2%) were successfully analyzed and the average read length, with more than 99% accuracy, was 654.4 bp. A single RISA sequencer can analyze 216 kb with >99% accuracy in 2.7 h (90 kb/h). For short-read sequencing to cluster the 3′ end and 5′ end sequencing by reading 350 bp, 384 samples can be analyzed in 1.5 h. We have also developed a RISA inoculator, RISA filtrator and densitometer, RISA plasmid preparator which can handle throughput of 40,000 samples in 17.5 h, and a high-throughput RISA thermal cycler which has four 384-well sites. The combination of these technologies allowed us to construct the RISA system consisting of 16 RISA sequencers, which can process 50,000 DNA samples per day. One haploid genome shotgun sequence of a higher organism, such as human, mouse, rat, domestic animals, and plants, can be revealed by seven RISA systems within one month. PMID:11076861
Synchronized excitability in a network enables generation of internal neuronal sequences

PubMed Central

Wang, Yingxue; Roth, Zachary; Pastalkova, Eva

2016-01-01

Hippocampal place field sequences are supported by sensory cues and network internal mechanisms. In contrast, sharp-wave (SPW) sequences, theta sequences, and episode field sequences are internally generated. The relationship of these sequences to memory is unclear. SPW sequences have been shown to support learning and have been assumed to also support episodic memory. Conversely, we demonstrate these SPW sequences were present in trained rats even after episodic memory was impaired and after other internal sequences – episode field and theta sequences – were eliminated. SPW sequences did not support memory despite continuing to ‘replay’ all task-related sequences – place- field and episode field sequences. Sequence replay occurred selectively during synchronous increases of population excitability -- SPWs. Similarly, theta sequences depended on the presence of repeated synchronized waves of excitability – theta oscillations. Thus, we suggest that either intermittent or rhythmic synchronized changes of excitability trigger sequential firing of neurons, which in turn supports learning and/or memory. DOI: http://dx.doi.org/10.7554/eLife.20697.001 PMID:27677848
Method for identifying and quantifying nucleic acid sequence aberrations

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1998-01-01

A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.
Method for identifying and quantifying nucleic acid sequence aberrations

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1998-07-21

A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.

Sequencing artifacts in the type A influenza databases and attempts to correct them.

PubMed

Suarez, David L; Chester, Nikki; Hatfield, Jason

2014-07-01

There are over 276 000 influenza gene sequences in public databases, with the quality of the sequences determined by the contributor. As part of a high school class project, influenza sequences with possible errors were identified in the public databases based on the size of the gene being longer than expected, with the hypothesis that these sequences would have an error. Students contacted sequence submitters alerting them of the possible sequence issue(s) and requested they the suspect sequence(s) be correct as appropriate. Type A influenza viruses were screened, and gene segments longer than the accepted size were identified for further analysis. Attention was placed on sequences with additional nucleotides upstream or downstream of the highly conserved non-coding ends of the viral segments. A total of 1081 sequences were identified that met this criterion. Three types of errors were commonly observed: non-influenza primer sequence wasn't removed from the sequence; PCR product was cloned and plasmid sequence was included in the sequence; and Taq polymerase added an adenine at the end of the PCR product. Internal insertions of nucleotide sequence were also commonly observed, but in many cases it was unclear if the sequence was correct or actually contained an error. A total of 215 sequences, or 22.8% of the suspect sequences, were corrected in the public databases in the first year of the student project. Unfortunately 138 additional sequences with possible errors were added to the databases in the second year. Additional awareness of the need for data integrity of sequences submitted to public databases is needed to fully reap the benefits of these large data sets. © 2014 The Authors. Influenza and Other Respiratory Viruses Published by John Wiley & Sons Ltd.
First complete genome sequence of infectious laryngotracheitis virus

PubMed Central

2011-01-01

Background Infectious laryngotracheitis virus (ILTV) is an alphaherpesvirus that causes acute respiratory disease in chickens worldwide. To date, only one complete genomic sequence of ILTV has been reported. This sequence was generated by concatenating partial sequences from six different ILTV strains. Thus, the full genomic sequence of a single (individual) strain of ILTV has not been determined previously. This study aimed to use high throughput sequencing technology to determine the complete genomic sequence of a live attenuated vaccine strain of ILTV. Results The complete genomic sequence of the Serva vaccine strain of ILTV was determined, annotated and compared to the concatenated ILTV reference sequence. The genome size of the Serva strain was 152,628 bp, with a G + C content of 48%. A total of 80 predicted open reading frames were identified. The Serva strain had 96.5% DNA sequence identity with the concatenated ILTV sequence. Notably, the concatenated ILTV sequence was found to lack four large regions of sequence, including 528 bp and 594 bp of sequence in the UL29 and UL36 genes, respectively, and two copies of a 1,563 bp sequence in the repeat regions. Considerable differences in the size of the predicted translation products of 4 other genes (UL54, UL30, UL37 and UL38) were also identified. More than 530 single-nucleotide polymorphisms (SNPs) were identified. Most SNPs were located within three genomic regions, corresponding to sequence from the SA-2 ILTV vaccine strain in the concatenated ILTV sequence. Conclusions This is the first complete genomic sequence of an individual ILTV strain. This sequence will facilitate future comparative genomic studies of ILTV by providing an appropriate reference sequence for the sequence analysis of other ILTV strains. PMID:21501528
Locating Sequence on FPC Maps and Selecting a Minimal Tiling Path

PubMed Central

Engler, Friedrich W.; Hatfield, James; Nelson, William; Soderlund, Carol A.

2003-01-01

This study discusses three software tools, the first two aid in integrating sequence with an FPC physical map and the third automatically selects a minimal tiling path given genomic draft sequence and BAC end sequences. The first tool, FSD (FPC Simulated Digest), takes a sequenced clone and adds it back to the map based on a fingerprint generated by an in silico digest of the clone. This allows verification of sequenced clone positions and the integration of sequenced clones that were not originally part of the FPC map. The second tool, BSS (Blast Some Sequence), takes a query sequence and positions it on the map based on sequence associated with the clones in the map. BSS has multiple uses as follows: (1) When the query is a file of marker sequences, they can be added as electronic markers. (2) When the query is draft sequence, the results of BSS can be used to close gaps in a sequenced clone or the physical map. (3) When the query is a sequenced clone and the target is BAC end sequences, one may select the next clone for sequencing using both sequence comparison results and map location. (4) When the query is whole-genome draft sequence and the target is BAC end sequences, the results can be used to select many clones for a minimal tiling path at once. The third tool, pickMTP, automates the majority of this last usage of BSS. Results are presented using the rice FPC map, BAC end sequences, and whole-genome shotgun from Syngenta. PMID:12915486
Program for Editing Spacecraft Command Sequences

NASA Technical Reports Server (NTRS)

Gladden, Roy; Waggoner, Bruce; Kordon, Mark; Hashemi, Mahnaz; Hanks, David; Salcedo, Jose

2006-01-01

Sequence Translator, Editor, and Expander Resource (STEER) is a computer program that facilitates construction of sequences and blocks of sequences (hereafter denoted generally as sequence products) for commanding a spacecraft. STEER also provides mechanisms for translating among various sequence product types and quickly expanding activities of a given sequence in chronological order for review and analysis of the sequence. To date, construction of sequence products has generally been done by use of such clumsy mechanisms as text-editor programs, translating among sequence product types has been challenging, and expanding sequences to time-ordered lists has involved arduous processes of converting sequence products to "real" sequences and running them through Class-A software (defined, loosely, as flight and ground software critical to a spacecraft mission). Also, heretofore, generating sequence products in standard formats has been troublesome because precise formatting and syntax are required. STEER alleviates these issues by providing a graphical user interface containing intuitive fields in which the user can enter the necessary information. The STEER expansion function provides a "quick and dirty" means of seeing how a sequence and sequence block would expand into a chronological list, without need to use of Class-A software.
Multimodal sequence learning.

PubMed

Kemény, Ferenc; Meier, Beat

2016-02-01

While sequence learning research models complex phenomena, previous studies have mostly focused on unimodal sequences. The goal of the current experiment is to put implicit sequence learning into a multimodal context: to test whether it can operate across different modalities. We used the Task Sequence Learning paradigm to test whether sequence learning varies across modalities, and whether participants are able to learn multimodal sequences. Our results show that implicit sequence learning is very similar regardless of the source modality. However, the presence of correlated task and response sequences was required for learning to take place. The experiment provides new evidence for implicit sequence learning of abstract conceptual representations. In general, the results suggest that correlated sequences are necessary for implicit sequence learning to occur. Moreover, they show that elements from different modalities can be automatically integrated into one unitary multimodal sequence. Copyright © 2015 Elsevier B.V. All rights reserved.
A trace display and editing program for data from fluorescence based sequencing machines.

PubMed

Gleeson, T; Hillier, L

1991-12-11

'Ted' (Trace editor) is a graphical editor for sequence and trace data from automated fluorescence sequencing machines. It provides facilities for viewing sequence and trace data (in top or bottom strand orientation), for editing the base sequence, for automated or manual trimming of the head (vector) and tail (uncertain data) from the sequence, for vertical and horizontal trace scaling, for keeping a history of sequence editing, and for output of the edited sequence. Ted has been used extensively in the C.elegans genome sequencing project, both as a stand-alone program and integrated into the Staden sequence assembly package, and has greatly aided in the efficiency and accuracy of sequence editing. It runs in the X windows environment on Sun workstations and is available from the authors. Ted currently supports sequence and trace data from the ABI 373A and Pharmacia A.L.F. sequencers.
Sequences show rapid motor transfer and spatial translation in the oculomotor system.

PubMed

Stainer, Matthew J; Carpenter, R H S; Brotchie, Peter; Anderson, Andrew J

2016-07-01

Every day we perform learnt sequences of actions that seem to happen almost without awareness. It has been argued that for learning such sequences parallel learning networks exist - one using spatial coordinates and one using motor coordinates - with sequence acquisition involving a progressive shift from the former to the latter as a sequence is rehearsed. When sequences are interrupted by an out-of-sequence target, there is a delay in the response to the target, and so here we transiently interrupt oculomotor sequences to probe the influence of oculomotor rehearsal and spatial coordinates in sequence acquisition. For our main experiments, we used a repeating sequences of eight targets in length that was first learnt either using saccadic eye movements (left/right), manual responses (left/right or up/down) or as a sequence of colour (blue/red) requiring no motor response. The sequence was immediately repeated for saccadic eye movements, during which the influence of on out-of-sequence target (an interruption) was assessed. When a sequence is learnt beforehand in an abstract way (for example, as a sequence of colours or of orthogonally mapped manual responses), interruptions are immediately disruptive to latency, suggesting neither motor rehearsal nor specific spatial coordinates are essential for encoding sequences of actions and that sequences - no matter how they are encoded - can be rapidly translated into oculomotor coordinates. The magnitude of a disruption does, however, correspond to how well a sequence is learnt: introducing an interruption to an extended sequence before it was reliably learnt reduces the magnitude of the latency disruption. Copyright © 2016 Elsevier Ltd. All rights reserved.
Subgrouping Automata: automatic sequence subgrouping using phylogenetic tree-based optimum subgrouping algorithm.

PubMed

Seo, Joo-Hyun; Park, Jihyang; Kim, Eun-Mi; Kim, Juhan; Joo, Keehyoung; Lee, Jooyoung; Kim, Byung-Gee

2014-02-01

Sequence subgrouping for a given sequence set can enable various informative tasks such as the functional discrimination of sequence subsets and the functional inference of unknown sequences. Because an identity threshold for sequence subgrouping may vary according to the given sequence set, it is highly desirable to construct a robust subgrouping algorithm which automatically identifies an optimal identity threshold and generates subgroups for a given sequence set. To meet this end, an automatic sequence subgrouping method, named 'Subgrouping Automata' was constructed. Firstly, tree analysis module analyzes the structure of tree and calculates the all possible subgroups in each node. Sequence similarity analysis module calculates average sequence similarity for all subgroups in each node. Representative sequence generation module finds a representative sequence using profile analysis and self-scoring for each subgroup. For all nodes, average sequence similarities are calculated and 'Subgrouping Automata' searches a node showing statistically maximum sequence similarity increase using Student's t-value. A node showing the maximum t-value, which gives the most significant differences in average sequence similarity between two adjacent nodes, is determined as an optimum subgrouping node in the phylogenetic tree. Further analysis showed that the optimum subgrouping node from SA prevents under-subgrouping and over-subgrouping. Copyright © 2013. Published by Elsevier Ltd.
Large-Scale Concatenation cDNA Sequencing

PubMed Central

Yu, Wei; Andersson, Björn; Worley, Kim C.; Muzny, Donna M.; Ding, Yan; Liu, Wen; Ricafrente, Jennifer Y.; Wentland, Meredith A.; Lennon, Greg; Gibbs, Richard A.

1997-01-01

A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7–2.0 kb were sequenced by concatenated cDNA sequencing (CCS), whereby multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences and a similar efficiency compared with other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%–97% identity) to either known human or other species genes. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching. [All 65 cDNA clone sequences described in this paper have been submitted to the GenBank data library under accession nos. U79240–U79304.] PMID:9110174
Finding functional features in Saccharomyces genomes by phylogenetic footprinting.

PubMed

Cliften, Paul; Sudarsanam, Priya; Desikan, Ashwin; Fulton, Lucinda; Fulton, Bob; Majors, John; Waterston, Robert; Cohen, Barak A; Johnston, Mark

2003-07-04

The sifting and winnowing of DNA sequence that occur during evolution cause nonfunctional sequences to diverge, leaving phylogenetic footprints of functional sequence elements in comparisons of genome sequences. We searched for such footprints among the genome sequences of six Saccharomyces species and identified potentially functional sequences. Comparison of these sequences allowed us to revise the catalog of yeast genes and identify sequence motifs that may be targets of transcriptional regulatory proteins. Some of these conserved sequence motifs reside upstream of genes with similar functional annotations or similar expression patterns or those bound by the same transcription factor and are thus good candidates for functional regulatory sequences.
Homogeneity of the 16S rDNA sequence among geographically disparate isolates of Taylorella equigenitalis

PubMed Central

Matsuda, M; Tazumi, A; Kagawa, S; Sekizuka, T; Murayama, O; Moore, JE; Millar, BC

2006-01-01

Background At present, six accessible sequences of 16S rDNA from Taylorella equigenitalis (T. equigenitalis) are available, whose sequence differences occur at a few nucleotide positions. Thus it is important to determine these sequences from additional strains in other countries, if possible, in order to clarify any anomalies regarding 16S rDNA sequence heterogeneity. Here, we clone and sequence the approximate full-length 16S rDNA from additional strains of T. equigenitalis isolated in Japan, Australia and France and compare these sequences to the existing published sequences. Results Clarification of any anomalies regarding 16S rDNA sequence heterogeneity of T. equigenitalis was carried out. When cloning, sequencing and comparison of the approximate full-length 16S rDNA from 17 strains of T. equigenitalis isolated in Japan, Australia and France, nucleotide sequence differences were demonstrated at the six loci in the 1,469 nucleotide sequence. Moreover, 12 polymorphic sites occurred among 23 sequences of the 16S rDNA, including the six reference sequences. Conclusion High sequence similarity (99.5% or more) was observed throughout, except from nucleotide positions 138 to 501 where substitutions and deletions were noted. PMID:16398935
Use of the Minion nanopore sequencer for rapid sequencing of avian influenza virus isolates

USDA-ARS?s Scientific Manuscript database

A relatively new sequencing technology, the MinION nanopore sequencer, provides a platform that is smaller, faster, and cheaper than existing Next Generation Sequence (NGS) technologies. The MinION sequences of individual strands of DNA and can produce millions of sequencing reads. The cost of the s...
Feedback shift register sequences versus uniformly distributed random sequences for correlation chromatography

NASA Technical Reports Server (NTRS)

Kaljurand, M.; Valentin, J. R.; Shao, M.

1996-01-01

Two alternative input sequences are commonly employed in correlation chromatography (CC). They are sequences derived according to the algorithm of the feedback shift register (i.e., pseudo random binary sequences (PRBS)) and sequences derived by using the uniform random binary sequences (URBS). These two sequences are compared. By applying the "cleaning" data processing technique to the correlograms that result from these sequences, we show that when the PRBS is used the S/N of the correlogram is much higher than the one resulting from using URBS.
Process of labeling specific chromosomes using recombinant repetitive DNA

DOEpatents

Moyzis, R.K.; Meyne, J.

1988-02-12

Chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family members and consensus sequences of the repetitive DNA families for the chromosome preferential sequences. The selected low homology regions are then hybridized with chromosomes to determine those low homology regions hybridized with a specific chromosome under normal stringency conditions.
MRO Sequence Checking Tool

NASA Technical Reports Server (NTRS)

Fisher, Forest; Gladden, Roy; Khanampornpan, Teerapat

2008-01-01

The MRO Sequence Checking Tool program, mro_check, automates significant portions of the MRO (Mars Reconnaissance Orbiter) sequence checking procedure. Though MRO has similar checks to the ODY s (Mars Odyssey) Mega Check tool, the checks needed for MRO are unique to the MRO spacecraft. The MRO sequence checking tool automates the majority of the sequence validation procedure and check lists that are used to validate the sequences generated by MRO MPST (mission planning and sequencing team). The tool performs more than 50 different checks on the sequence. The automation varies from summarizing data about the sequence needed for visual verification of the sequence, to performing automated checks on the sequence and providing a report for each step. To allow for the addition of new checks as needed, this tool is built in a modular fashion.
Comparison of next generation sequencing technologies for transcriptome characterization

PubMed Central

2009-01-01

Background We have developed a simulation approach to help determine the optimal mixture of sequencing methods for most complete and cost effective transcriptome sequencing. We compared simulation results for traditional capillary sequencing with "Next Generation" (NG) ultra high-throughput technologies. The simulation model was parameterized using mappings of 130,000 cDNA sequence reads to the Arabidopsis genome (NCBI Accession SRA008180.19). We also generated 454-GS20 sequences and de novo assemblies for the basal eudicot California poppy (Eschscholzia californica) and the magnoliid avocado (Persea americana) using a variety of methods for cDNA synthesis. Results The Arabidopsis reads tagged more than 15,000 genes, including new splice variants and extended UTR regions. Of the total 134,791 reads (13.8 MB), 119,518 (88.7%) mapped exactly to known exons, while 1,117 (0.8%) mapped to introns, 11,524 (8.6%) spanned annotated intron/exon boundaries, and 3,066 (2.3%) extended beyond the end of annotated UTRs. Sequence-based inference of relative gene expression levels correlated significantly with microarray data. As expected, NG sequencing of normalized libraries tagged more genes than non-normalized libraries, although non-normalized libraries yielded more full-length cDNA sequences. The Arabidopsis data were used to simulate additional rounds of NG and traditional EST sequencing, and various combinations of each. Our simulations suggest a combination of FLX and Solexa sequencing for optimal transcriptome coverage at modest cost. We have also developed ESTcalc http://fgp.huck.psu.edu/NG_Sims/ngsim.pl, an online webtool, which allows users to explore the results of this study by specifying individualized costs and sequencing characteristics. Conclusion NG sequencing technologies are a highly flexible set of platforms that can be scaled to suit different project goals. In terms of sequence coverage alone, the NG sequencing is a dramatic advance over capillary-based sequencing, but NG sequencing also presents significant challenges in assembly and sequence accuracy due to short read lengths, method-specific sequencing errors, and the absence of physical clones. These problems may be overcome by hybrid sequencing strategies using a mixture of sequencing methodologies, by new assemblers, and by sequencing more deeply. Sequencing and microarray outcomes from multiple experiments suggest that our simulator will be useful for guiding NG transcriptome sequencing projects in a wide range of organisms. PMID:19646272
Piscine reovirus: Genomic and molecular phylogenetic analysis from farmed and wild salmonids collected on the Canada/US Pacific Coast

USGS Publications Warehouse

Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul S.; Richmond, Zina; Purcell, Maureen K.; Johns, Robert; Johnson, Stewart C.; Sakasida, Sonja M.

2015-01-01

Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period.
Piscine Reovirus: Genomic and Molecular Phylogenetic Analysis from Farmed and Wild Salmonids Collected on the Canada/US Pacific Coast

PubMed Central

Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul; Richmond, Zina; Johns, Robert; Purcell, Maureen K.; Johnson, Stewart C.; Saksida, Sonja M.

2015-01-01

Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period. PMID:26536673
Operating characteristics of the implicit learning system supporting serial interception sequence learning.

PubMed

Sanchez, Daniel J; Reber, Paul J

2012-04-01

The memory system that supports implicit perceptual-motor sequence learning relies on brain regions that operate separately from the explicit, medial temporal lobe memory system. The implicit learning system therefore likely has distinct operating characteristics and information processing constraints. To attempt to identify the limits of the implicit sequence learning mechanism, participants performed the serial interception sequence learning (SISL) task with covertly embedded repeating sequences that were much longer than most previous studies: ranging from 30 to 60 (Experiment 1) and 60 to 90 (Experiment 2) items in length. Robust sequence-specific learning was observed for sequences up to 80 items in length, extending the known capacity of implicit sequence learning. In Experiment 3, 12-item repeating sequences were embedded among increasing amounts of irrelevant nonrepeating sequences (from 20 to 80% of training trials). Despite high levels of irrelevant trials, learning occurred across conditions. A comparison of learning rates across all three experiments found a surprising degree of constancy in the rate of learning regardless of sequence length or embedded noise. Sequence learning appears to be constant with the logarithm of the number of sequence repetitions practiced during training. The consistency in learning rate across experiments and conditions implies that the mechanisms supporting implicit sequence learning are not capacity-constrained by very long sequences nor adversely affected by high rates of irrelevant sequences during training.
[Study on ITS sequences of Aconitum vilmorinianum and its medicinal adulterant].

PubMed

Zhang, Xiao-nan; Du, Chun-hua; Fu, De-huan; Gao, Li; Zhou, Pei-jun; Wang, Li

2012-09-01

To analyze and compare the ITS sequences of Aconitum vilmorinianum and its medicinal adulterant Aconitum austroyunnanense. Total genomic DNA were extracted from sample materials by improved CTAB method, ITS sequences of samples were amplified using PCR systems, directly sequenced and analyzed using software DNAStar, ClustalX1.81 and MEGA 4.0. 299 consistent sites, 19 variable sites and 13 informative sites were found in ITS1 sequences, 162 consistent sites, 2 variable sites and 1 informative sites were found in 5.8S sequences, 217 consistent sites, 3 variable sites and 1 informative site were found in ITS2 sequences. Base transition and transversion was not found only in 5.8S sequences, 2 sites transition and 1 site transversion were found in ITS1 sequences, only 1 site transversion was found in ITS2 sequences comparting the ITS sequences data matrix. By analyzing the ITS sequences data matrix from 2 population of Aconitum vilmorinianum and 3 population of Aconitum austroyunnanense, we found a stable informative site at the 596th base in ITS2 sequences, in all the samples of Aconitum vilmorinianum the base was C, and in all the samples of Aconitum austroyunnanense the base was A. Aconitum vilmorinianum and Aconitum austroyunnanense can be identified by their characters of ITS sequences, and the variable sites in ITS1 sequences are more than in ITS2 sequences.

Long-range barcode labeling-sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chen, Feng; Zhang, Tao; Singh, Kanwar K.

Methods for sequencing single large DNA molecules by clonal multiple displacement amplification using barcoded primers. Sequences are binned based on barcode sequences and sequenced using a microdroplet-based method for sequencing large polynucleotide templates to enable assembly of haplotype-resolved complex genomes and metagenomes.
Coordinate cytokine regulatory sequences

DOEpatents

Frazer, Kelly A.; Rubin, Edward M.; Loots, Gabriela G.

2005-05-10

The present invention provides CNS sequences that regulate the cytokine gene expression, expression cassettes and vectors comprising or lacking the CNS sequences, host cells and non-human transgenic animals comprising the CNS sequences or lacking the CNS sequences. The present invention also provides methods for identifying compounds that modulate the functions of CNS sequences as well as methods for diagnosing defects in the CNS sequences of patients.
Advantages of genome sequencing by long-read sequencer using SMRT technology in medical area.

PubMed

Nakano, Kazuma; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Ashimine, Noriko; Ohki, Shun; Shinzato, Misuzu; Minami, Maiko; Nakanishi, Tetsuhiro; Teruya, Kuniko; Satou, Kazuhito; Hirano, Takashi

2017-07-01

PacBio RS II is the first commercialized third-generation DNA sequencer able to sequence a single molecule DNA in real-time without amplification. PacBio RS II's sequencing technology is novel and unique, enabling the direct observation of DNA synthesis by DNA polymerase. PacBio RS II confers four major advantages compared to other sequencing technologies: long read lengths, high consensus accuracy, a low degree of bias, and simultaneous capability of epigenetic characterization. These advantages surmount the obstacle of sequencing genomic regions such as high/low G+C, tandem repeat, and interspersed repeat regions. Moreover, PacBio RS II is ideal for whole genome sequencing, targeted sequencing, complex population analysis, RNA sequencing, and epigenetics characterization. With PacBio RS II, we have sequenced and analyzed the genomes of many species, from viruses to humans. Herein, we summarize and review some of our key genome sequencing projects, including full-length viral sequencing, complete bacterial genome and almost-complete plant genome assemblies, and long amplicon sequencing of a disease-associated gene region. We believe that PacBio RS II is not only an effective tool for use in the basic biological sciences but also in the medical/clinical setting.
Sequence verification of synthetic DNA by assembly of sequencing reads

PubMed Central

Wilson, Mandy L.; Cai, Yizhi; Hanlon, Regina; Taylor, Samantha; Chevreux, Bastien; Setubal, João C.; Tyler, Brett M.; Peccoud, Jean

2013-01-01

Gene synthesis attempts to assemble user-defined DNA sequences with base-level precision. Verifying the sequences of construction intermediates and the final product of a gene synthesis project is a critical part of the workflow, yet one that has received the least attention. Sequence validation is equally important for other kinds of curated clone collections. Ensuring that the physical sequence of a clone matches its published sequence is a common quality control step performed at least once over the course of a research project. GenoREAD is a web-based application that breaks the sequence verification process into two steps: the assembly of sequencing reads and the alignment of the resulting contig with a reference sequence. GenoREAD can determine if a clone matches its reference sequence. Its sophisticated reporting features help identify and troubleshoot problems that arise during the sequence verification process. GenoREAD has been experimentally validated on thousands of gene-sized constructs from an ORFeome project, and on longer sequences including whole plasmids and synthetic chromosomes. Comparing GenoREAD results with those from manual analysis of the sequencing data demonstrates that GenoREAD tends to be conservative in its diagnostic. GenoREAD is available at www.genoread.org. PMID:23042248
The Genome Sequencer FLX System--longer reads, more applications, straight forward bioinformatics and more complete data sets.

PubMed

Droege, Marcus; Hill, Brendon

2008-08-31

The Genome Sequencer FLX System (GS FLX), powered by 454 Sequencing, is a next-generation DNA sequencing technology featuring a unique mix of long reads, exceptional accuracy, and ultra-high throughput. It has been proven to be the most versatile of all currently available next-generation sequencing technologies, supporting many high-profile studies in over seven applications categories. GS FLX users have pursued innovative research in de novo sequencing, re-sequencing of whole genomes and target DNA regions, metagenomics, and RNA analysis. 454 Sequencing is a powerful tool for human genetics research, having recently re-sequenced the genome of an individual human, currently re-sequencing the complete human exome and targeted genomic regions using the NimbleGen sequence capture process, and detected low-frequency somatic mutations linked to cancer.
Method to amplify variable sequences without imposing primer sequences

DOEpatents

Bradbury, Andrew M.; Zeytun, Ahmet

2006-11-14

The present invention provides methods of amplifying target sequences without including regions flanking the target sequence in the amplified product or imposing amplification primer sequences on the amplified product. Also provided are methods of preparing a library from such amplified target sequences.
Sequence Complexity of Chromosome 3 in Caenorhabditis elegans

PubMed Central

Pierro, Gaetano

2012-01-01

The nucleotide sequences complexity in chromosome 3 of Caenorhabditis elegans (C. elegans) is studied. The complexity of these sequences is compared with some random sequences. Moreover, by using some parameters related to complexity such as fractal dimension and frequency, indicator matrix is given a first classification of sequences of C. elegans. In particular, the sequences with highest and lowest fractal value are singled out. It is shown that the intrinsic nature of the low fractal dimension sequences has many common features with the random sequences. PMID:22919380
Chromosome specific repetitive DNA sequences

DOEpatents

Moyzis, Robert K.; Meyne, Julianne

1991-01-01

A method is provided for determining specific nucleotide sequences useful in forming a probe which can identify specific chromosomes, preferably through in situ hybridization within the cell itself. In one embodiment, chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family me This invention is the result of a contract with the Department of Energy (Contract No. W-7405-ENG-36).
Children's discrimination of vowel sequences

NASA Astrophysics Data System (ADS)

Coady, Jeffry A.; Kluender, Keith R.; Evans, Julia

2003-10-01

Children's ability to discriminate sequences of steady-state vowels was investigated. Vowels (as in ``beet,'' ``bat,'' ``bought,'' and ``boot'') were synthesized at durations of 40, 80, 160, 320, 640, and 1280 ms. Four different vowel sequences were created by concatenating different orders of vowels for each duration, separated by 10-ms intervening silence. Thus, sequences differed in vowel order and duration (rate). Sequences were 12 s in duration, with amplitude ramped linearly over the first and last 2 s. Sequence pairs included both same (identical sequences) and different trials (sequences with vowels in different orders). Sequences with vowel of equal duration were presented on individual trials. Children aged 7;0 to 10;6 listened to pairs of sequences (with 100 ms between sequences) and responded whether sequences sounded the same or different. Results indicate that children are best able to discriminate sequences of intermediate-duration vowels, typical of conversational speaking rate. Children were less accurate with both shorter and longer vowels. Results are discussed in terms of auditory processing (shortest vowels) and memory (longest vowels). [Research supported by NIDCD DC-05263, DC-04072, and DC-005650.
Individual sequences in large sets of gene sequences may be distinguished efficiently by combinations of shared sub-sequences

PubMed Central

Gibbs, Mark J; Armstrong, John S; Gibbs, Adrian J

2005-01-01

Background Most current DNA diagnostic tests for identifying organisms use specific oligonucleotide probes that are complementary in sequence to, and hence only hybridise with the DNA of one target species. By contrast, in traditional taxonomy, specimens are usually identified by 'dichotomous keys' that use combinations of characters shared by different members of the target set. Using one specific character for each target is the least efficient strategy for identification. Using combinations of shared bisectionally-distributed characters is much more efficient, and this strategy is most efficient when they separate the targets in a progressively binary way. Results We have developed a practical method for finding minimal sets of sub-sequences that identify individual sequences, and could be targeted by combinations of probes, so that the efficient strategy of traditional taxonomic identification could be used in DNA diagnosis. The sizes of minimal sub-sequence sets depended mostly on sequence diversity and sub-sequence length and interactions between these parameters. We found that 201 distinct cytochrome oxidase subunit-1 (CO1) genes from moths (Lepidoptera) were distinguished using only 15 sub-sequences 20 nucleotides long, whereas only 8–10 sub-sequences 6–10 nucleotides long were required to distinguish the CO1 genes of 92 species from the 9 largest orders of insects. Conclusion The presence/absence of sub-sequences in a set of gene sequences can be used like the questions in a traditional dichotomous taxonomic key; hybridisation probes complementary to such sub-sequences should provide a very efficient means for identifying individual species, subtypes or genotypes. Sequence diversity and sub-sequence length are the major factors that determine the numbers of distinguishing sub-sequences in any set of sequences. PMID:15817134
Comparison of an In Vitro Diagnostic Next-Generation Sequencing Assay with Sanger Sequencing for HIV-1 Genotypic Resistance Testing.

PubMed

Tzou, Philip L; Ariyaratne, Pramila; Varghese, Vici; Lee, Charlie; Rakhmanaliev, Elian; Villy, Carolin; Yee, Meiqi; Tan, Kevin; Michel, Gerd; Pinsky, Benjamin A; Shafer, Robert W

2018-06-01

The ability of next-generation sequencing (NGS) technologies to detect low frequency HIV-1 drug resistance mutations (DRMs) not detected by dideoxynucleotide Sanger sequencing has potential advantages for improved patient outcomes. We compared the performance of an in vitro diagnostic (IVD) NGS assay, the Sentosa SQ HIV genotyping assay for HIV-1 genotypic resistance testing, with Sanger sequencing on 138 protease/reverse transcriptase (RT) and 39 integrase sequences. The NGS assay used a 5% threshold for reporting low-frequency variants. The level of complete plus partial nucleotide sequence concordance between Sanger sequencing and NGS was 99.9%. Among the 138 protease/RT sequences, a mean of 6.4 DRMs was identified by both Sanger and NGS, a mean of 0.5 DRM was detected by NGS alone, and a mean of 0.1 DRM was detected by Sanger sequencing alone. Among the 39 integrase sequences, a mean of 1.6 DRMs was detected by both Sanger sequencing and NGS and a mean of 0.15 DRM was detected by NGS alone. Compared with Sanger sequencing, NGS estimated higher levels of resistance to one or more antiretroviral drugs for 18.2% of protease/RT sequences and 5.1% of integrase sequences. There was little evidence for technical artifacts in the NGS sequences, but the G-to-A hypermutation was detected in three samples. In conclusion, the IVD NGS assay evaluated in this study was highly concordant with Sanger sequencing. At the 5% threshold for reporting minority variants, NGS appeared to attain a modestly increased sensitivity for detecting low-frequency DRMs without compromising sequence accuracy. Copyright © 2018 American Society for Microbiology.
Application of Quaternion in improving the quality of global sequence alignment scores for an ambiguous sequence target in Streptococcus pneumoniae DNA

NASA Astrophysics Data System (ADS)

Lestari, D.; Bustamam, A.; Novianti, T.; Ardaneswari, G.

2017-07-01

DNA sequence can be defined as a succession of letters, representing the order of nucleotides within DNA, using a permutation of four DNA base codes including adenine (A), guanine (G), cytosine (C), and thymine (T). The precise code of the sequences is determined using DNA sequencing methods and technologies, which have been developed since the 1970s and currently become highly developed, advanced and highly throughput sequencing technologies. So far, DNA sequencing has greatly accelerated biological and medical research and discovery. However, in some cases DNA sequencing could produce any ambiguous and not clear enough sequencing results that make them quite difficult to be determined whether these codes are A, T, G, or C. To solve these problems, in this study we can introduce other representation of DNA codes namely Quaternion Q = (PA, PT, PG, PC), where PA, PT, PG, PC are the probability of A, T, G, C bases that could appear in Q and PA + PT + PG + PC = 1. Furthermore, using Quaternion representations we are able to construct the improved scoring matrix for global sequence alignment processes, by applying a dot product method. Moreover, this scoring matrix produces better and higher quality of the match and mismatch score between two DNA base codes. In implementation, we applied the Needleman-Wunsch global sequence alignment algorithm using Octave, to analyze our target sequence which contains some ambiguous sequence data. The subject sequences are the DNA sequences of Streptococcus pneumoniae families obtained from the Genebank, meanwhile the target DNA sequence are received from our collaborator database. As the results we found the Quaternion representations improve the quality of the sequence alignment score and we can conclude that DNA sequence target has maximum similarity with Streptococcus pneumoniae.
Sequence Bundles: a novel method for visualising, discovering and exploring sequence motifs

PubMed Central

2014-01-01

Background We introduce Sequence Bundles--a novel data visualisation method for representing multiple sequence alignments (MSAs). We identify and address key limitations of the existing bioinformatics data visualisation methods (i.e. the Sequence Logo) by enabling Sequence Bundles to give salient visual expression to sequence motifs and other data features, which would otherwise remain hidden. Methods For the development of Sequence Bundles we employed research-led information design methodologies. Sequences are encoded as uninterrupted, semi-opaque lines plotted on a 2-dimensional reconfigurable grid. Each line represents a single sequence. The thickness and opacity of the stack at each residue in each position indicates the level of conservation and the lines' curved paths expose patterns in correlation and functionality. Several MSAs can be visualised in a composite image. The Sequence Bundles method is designed to favour a tangible, continuous and intuitive display of information. Results We have developed a software demonstration application for generating a Sequence Bundles visualisation of MSAs provided for the BioVis 2013 redesign contest. A subsequent exploration of the visualised line patterns allowed for the discovery of a number of interesting features in the dataset. Reported features include the extreme conservation of sequences displaying a specific residue and bifurcations of the consensus sequence. Conclusions Sequence Bundles is a novel method for visualisation of MSAs and the discovery of sequence motifs. It can aid in generating new insight and hypothesis making. Sequence Bundles is well disposed for future implementation as an interactive visual analytics software, which can complement existing visualisation tools. PMID:25237395
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

DOEpatents

Gardner, Shea N [San Leandro, CA; Mariella, Jr., Raymond P.; Christian, Allen T [Tracy, CA; Young, Jennifer A [Berkeley, CA; Clague, David S [Livermore, CA

2011-01-18

A method of fabricating a DNA molecule of user-defined sequence. The method comprises the steps of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an even or odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths. In one embodiment starting sequence fragments are of different lengths, n, n+1, n+2, etc.
Program Synthesizes UML Sequence Diagrams

NASA Technical Reports Server (NTRS)

Barry, Matthew R.; Osborne, Richard N.

2006-01-01

A computer program called "Rational Sequence" generates Universal Modeling Language (UML) sequence diagrams of a target Java program running on a Java virtual machine (JVM). Rational Sequence thereby performs a reverse engineering function that aids in the design documentation of the target Java program. Whereas previously, the construction of sequence diagrams was a tedious manual process, Rational Sequence generates UML sequence diagrams automatically from the running Java code.
Probabilistic Motor Sequence Yields Greater Offline and Less Online Learning than Fixed Sequence

PubMed Central

Du, Yue; Prashad, Shikha; Schoenbrun, Ilana; Clark, Jane E.

2016-01-01

It is well acknowledged that motor sequences can be learned quickly through online learning. Subsequently, the initial acquisition of a motor sequence is boosted or consolidated by offline learning. However, little is known whether offline learning can drive the fast learning of motor sequences (i.e., initial sequence learning in the first training session). To examine offline learning in the fast learning stage, we asked four groups of young adults to perform the serial reaction time (SRT) task with either a fixed or probabilistic sequence and with or without preliminary knowledge (PK) of the presence of a sequence. The sequence and PK were manipulated to emphasize either procedural (probabilistic sequence; no preliminary knowledge (NPK)) or declarative (fixed sequence; with PK) memory that were found to either facilitate or inhibit offline learning. In the SRT task, there were six learning blocks with a 2 min break between each consecutive block. Throughout the session, stimuli followed the same fixed or probabilistic pattern except in Block 5, in which stimuli appeared in a random order. We found that PK facilitated the learning of a fixed sequence, but not a probabilistic sequence. In addition to overall learning measured by the mean reaction time (RT), we examined the progressive changes in RT within and between blocks (i.e., online and offline learning, respectively). It was found that the two groups who performed the fixed sequence, regardless of PK, showed greater online learning than the other two groups who performed the probabilistic sequence. The groups who performed the probabilistic sequence, regardless of PK, did not display online learning, as indicated by a decline in performance within the learning blocks. However, they did demonstrate remarkably greater offline improvement in RT, which suggests that they are learning the probabilistic sequence offline. These results suggest that in the SRT task, the fast acquisition of a motor sequence is driven by concurrent online and offline learning. In addition, as the acquisition of a probabilistic sequence requires greater procedural memory compared to the acquisition of a fixed sequence, our results suggest that offline learning is more likely to take place in a procedural sequence learning task. PMID:26973502
Probabilistic Motor Sequence Yields Greater Offline and Less Online Learning than Fixed Sequence.

PubMed

Du, Yue; Prashad, Shikha; Schoenbrun, Ilana; Clark, Jane E

2016-01-01

It is well acknowledged that motor sequences can be learned quickly through online learning. Subsequently, the initial acquisition of a motor sequence is boosted or consolidated by offline learning. However, little is known whether offline learning can drive the fast learning of motor sequences (i.e., initial sequence learning in the first training session). To examine offline learning in the fast learning stage, we asked four groups of young adults to perform the serial reaction time (SRT) task with either a fixed or probabilistic sequence and with or without preliminary knowledge (PK) of the presence of a sequence. The sequence and PK were manipulated to emphasize either procedural (probabilistic sequence; no preliminary knowledge (NPK)) or declarative (fixed sequence; with PK) memory that were found to either facilitate or inhibit offline learning. In the SRT task, there were six learning blocks with a 2 min break between each consecutive block. Throughout the session, stimuli followed the same fixed or probabilistic pattern except in Block 5, in which stimuli appeared in a random order. We found that PK facilitated the learning of a fixed sequence, but not a probabilistic sequence. In addition to overall learning measured by the mean reaction time (RT), we examined the progressive changes in RT within and between blocks (i.e., online and offline learning, respectively). It was found that the two groups who performed the fixed sequence, regardless of PK, showed greater online learning than the other two groups who performed the probabilistic sequence. The groups who performed the probabilistic sequence, regardless of PK, did not display online learning, as indicated by a decline in performance within the learning blocks. However, they did demonstrate remarkably greater offline improvement in RT, which suggests that they are learning the probabilistic sequence offline. These results suggest that in the SRT task, the fast acquisition of a motor sequence is driven by concurrent online and offline learning. In addition, as the acquisition of a probabilistic sequence requires greater procedural memory compared to the acquisition of a fixed sequence, our results suggest that offline learning is more likely to take place in a procedural sequence learning task.
Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags

PubMed Central

de Souza, Sandro J.; Camargo, Anamaria A.; Briones, Marcelo R. S.; Costa, Fernando F.; Nagai, Maria Aparecida; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; de Fátima Sonati, Maria; Tajara, Eloiza H.; Valentini, Sandro R.; Acencio, Marcio; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Bengtson, Mário Henrique; Carraro, Dirce M.; Carvalho, Alex F.; Carvalho, Lúcia Helena; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Costa, Maria Cristina R.; Curcio, Cyntia; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Leite, Luciana C. C.; Maia, Gustavo; Majumder, Paromita; Marins, Mozart; Matsukuma, Adriana; Melo, Analy S. A.; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana Gilbert; Rahal, Paula; Rainho, Claudia A.; da Ro's, Nancy; de Sá, Renata G.; Sales, Magaly M.; da Silva, Neusa P.; Silva, Tereza C.; da Silva, Wilson; Simão, Daniel F.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Zalcberg, Heloisa; Brentani, Ricardo R.; Reis, Luis F. L.; Dias-Neto, Emmanuel; Simpson, Andrew J. G.

2000-01-01

Transcribed sequences in the human genome can be identified with confidence only by alignment with sequences derived from cDNAs synthesized from naturally occurring mRNAs. We constructed a set of 250,000 cDNAs that represent partial expressed gene sequences and that are biased toward the central coding regions of the resulting transcripts. They are termed ORF expressed sequence tags (ORESTES). The 250,000 ORESTES were assembled into 81,429 contigs. Of these, 1,181 (1.45%) were found to match sequences in chromosome 22 with at least one ORESTES contig for 162 (65.6%) of the 247 known genes, for 67 (44.6%) of the 150 related genes, and for 45 of the 148 (30.4%) EST-predicted genes on this chromosome. Using a set of stringent criteria to validate our sequences, we identified a further 219 previously unannotated transcribed sequences on chromosome 22. Of these, 171 were in fact also defined by EST or full length cDNA sequences available in GenBank but not utilized in the initial annotation of the first human chromosome sequence. Thus despite representing less than 15% of all expressed human sequences in the public databases at the time of the present analysis, ORESTES sequences defined 48 transcribed sequences on chromosome 22 not defined by other sequences. All of the transcribed sequences defined by ORESTES coincided with DNA regions predicted as encoding exons by genscan. (http://genes.mit.edu/GENSCAN.html). PMID:11070084
The recurrence sequences via Sylvester matrices

NASA Astrophysics Data System (ADS)

Karaduman, Erdal; Deveci, Ömür

2017-07-01

In this work, we define the Pell-Jacobsthal-Slyvester sequence and the Jacobsthal-Pell-Slyvester sequence by using the Slyvester matrices which are obtained from the characteristic polynomials of the Pell and Jacobsthal sequences and then, we study the sequences defined modulo m. Also, we obtain the cyclic groups and the semigroups from the generating matrices of these sequences when read modulo m and then, we derive the relationships among the orders of the cyclic groups and the periods of the sequences. Furthermore, we redefine Pell-Jacobsthal-Slyvester sequence and the Jacobsthal-Pell-Slyvester sequence by means of the elements of the groups and then, we examine them in the finite groups.
BAC sequencing using pooled methods.

PubMed

Saski, Christopher A; Feltus, F Alex; Parida, Laxmi; Haiminen, Niina

2015-01-01

Shotgun sequencing and assembly of a large, complex genome can be both expensive and challenging to accurately reconstruct the true genome sequence. Repetitive DNA arrays, paralogous sequences, polyploidy, and heterozygosity are main factors that plague de novo genome sequencing projects that typically result in highly fragmented assemblies and are difficult to extract biological meaning. Targeted, sub-genomic sequencing offers complexity reduction by removing distal segments of the genome and a systematic mechanism for exploring prioritized genomic content through BAC sequencing. If one isolates and sequences the genome fraction that encodes the relevant biological information, then it is possible to reduce overall sequencing costs and efforts that target a genomic segment. This chapter describes the sub-genome assembly protocol for an organism based upon a BAC tiling path derived from a genome-scale physical map or from fine mapping using BACs to target sub-genomic regions. Methods that are described include BAC isolation and mapping, DNA sequencing, and sequence assembly.

Long-range correlations and charge transport properties of DNA sequences

NASA Astrophysics Data System (ADS)

Liu, Xiao-liang; Ren, Yi; Xie, Qiong-tao; Deng, Chao-sheng; Xu, Hui

2010-04-01

By using Hurst's analysis and transfer approach, the rescaled range functions and Hurst exponents of human chromosome 22 and enterobacteria phage lambda DNA sequences are investigated and the transmission coefficients, Landauer resistances and Lyapunov coefficients of finite segments based on above genomic DNA sequences are calculated. In a comparison with quasiperiodic and random artificial DNA sequences, we find that λ-DNA exhibits anticorrelation behavior characterized by a Hurst exponent 0.5
Single molecule sequencing of the M13 virus genome without amplification

PubMed Central

Zhao, Luyang; Deng, Liwei; Li, Gailing; Jin, Huan; Cai, Jinsen; Shang, Huan; Li, Yan; Wu, Haomin; Xu, Weibin; Zeng, Lidong; Zhang, Renli; Zhao, Huan; Wu, Ping; Zhou, Zhiliang; Zheng, Jiao; Ezanno, Pierre; Yang, Andrew X.; Yan, Qin; Deem, Michael W.; He, Jiankui

2017-01-01

Next generation sequencing (NGS) has revolutionized life sciences research. However, GC bias and costly, time-intensive library preparation make NGS an ill fit for increasing sequencing demands in the clinic. A new class of third-generation sequencing platforms has arrived to meet this need, capable of directly measuring DNA and RNA sequences at the single-molecule level without amplification. Here, we use the new GenoCare single-molecule sequencing platform from Direct Genomics to sequence the genome of the M13 virus. Our platform detects single-molecule fluorescence by total internal reflection microscopy, with sequencing-by-synthesis chemistry. We sequenced the genome of M13 to a depth of 316x, with 100% coverage. We determined a consensus sequence accuracy of 100%. In contrast to GC bias inherent to NGS results, we demonstrated that our single-molecule sequencing method yields minimal GC bias. PMID:29253901
Single molecule sequencing of the M13 virus genome without amplification.

PubMed

Zhao, Luyang; Deng, Liwei; Li, Gailing; Jin, Huan; Cai, Jinsen; Shang, Huan; Li, Yan; Wu, Haomin; Xu, Weibin; Zeng, Lidong; Zhang, Renli; Zhao, Huan; Wu, Ping; Zhou, Zhiliang; Zheng, Jiao; Ezanno, Pierre; Yang, Andrew X; Yan, Qin; Deem, Michael W; He, Jiankui

2017-01-01

Next generation sequencing (NGS) has revolutionized life sciences research. However, GC bias and costly, time-intensive library preparation make NGS an ill fit for increasing sequencing demands in the clinic. A new class of third-generation sequencing platforms has arrived to meet this need, capable of directly measuring DNA and RNA sequences at the single-molecule level without amplification. Here, we use the new GenoCare single-molecule sequencing platform from Direct Genomics to sequence the genome of the M13 virus. Our platform detects single-molecule fluorescence by total internal reflection microscopy, with sequencing-by-synthesis chemistry. We sequenced the genome of M13 to a depth of 316x, with 100% coverage. We determined a consensus sequence accuracy of 100%. In contrast to GC bias inherent to NGS results, we demonstrated that our single-molecule sequencing method yields minimal GC bias.
DNA sequencing using polymerase substrate-binding kinetics

PubMed Central

Previte, Michael John Robert; Zhou, Chunhong; Kellinger, Matthew; Pantoja, Rigo; Chen, Cheng-Yao; Shi, Jin; Wang, BeiBei; Kia, Amirali; Etchin, Sergey; Vieceli, John; Nikoomanzar, Ali; Bomati, Erin; Gloeckner, Christian; Ronaghi, Mostafa; He, Molly Min

2015-01-01

Next-generation sequencing (NGS) has transformed genomic research by decreasing the cost of sequencing. However, whole-genome sequencing is still costly and complex for diagnostics purposes. In the clinical space, targeted sequencing has the advantage of allowing researchers to focus on specific genes of interest. Routine clinical use of targeted NGS mandates inexpensive instruments, fast turnaround time and an integrated and robust workflow. Here we demonstrate a version of the Sequencing by Synthesis (SBS) chemistry that potentially can become a preferred targeted sequencing method in the clinical space. This sequencing chemistry uses natural nucleotides and is based on real-time recording of the differential polymerase/DNA-binding kinetics in the presence of correct or mismatch nucleotides. This ensemble SBS chemistry has been implemented on an existing Illumina sequencing platform with integrated cluster amplification. We discuss the advantages of this sequencing chemistry for targeted sequencing as well as its limitations for other applications. PMID:25612848
Memory and learning with rapid audiovisual sequences

PubMed Central

Keller, Arielle S.; Sekuler, Robert

2015-01-01

We examined short-term memory for sequences of visual stimuli embedded in varying multisensory contexts. In two experiments, subjects judged the structure of the visual sequences while disregarding concurrent, but task-irrelevant auditory sequences. Stimuli were eight-item sequences in which varying luminances and frequencies were presented concurrently and rapidly (at 8 Hz). Subjects judged whether the final four items in a visual sequence identically replicated the first four items. Luminances and frequencies in each sequence were either perceptually correlated (Congruent) or were unrelated to one another (Incongruent). Experiment 1 showed that, despite encouragement to ignore the auditory stream, subjects' categorization of visual sequences was strongly influenced by the accompanying auditory sequences. Moreover, this influence tracked the similarity between a stimulus's separate audio and visual sequences, demonstrating that task-irrelevant auditory sequences underwent a considerable degree of processing. Using a variant of Hebb's repetition design, Experiment 2 compared musically trained subjects and subjects who had little or no musical training on the same task as used in Experiment 1. Test sequences included some that intermittently and randomly recurred, which produced better performance than sequences that were generated anew for each trial. The auditory component of a recurring audiovisual sequence influenced musically trained subjects more than it did other subjects. This result demonstrates that stimulus-selective, task-irrelevant learning of sequences can occur even when such learning is an incidental by-product of the task being performed. PMID:26575193
Memory and learning with rapid audiovisual sequences.

PubMed

Keller, Arielle S; Sekuler, Robert

2015-01-01

We examined short-term memory for sequences of visual stimuli embedded in varying multisensory contexts. In two experiments, subjects judged the structure of the visual sequences while disregarding concurrent, but task-irrelevant auditory sequences. Stimuli were eight-item sequences in which varying luminances and frequencies were presented concurrently and rapidly (at 8 Hz). Subjects judged whether the final four items in a visual sequence identically replicated the first four items. Luminances and frequencies in each sequence were either perceptually correlated (Congruent) or were unrelated to one another (Incongruent). Experiment 1 showed that, despite encouragement to ignore the auditory stream, subjects' categorization of visual sequences was strongly influenced by the accompanying auditory sequences. Moreover, this influence tracked the similarity between a stimulus's separate audio and visual sequences, demonstrating that task-irrelevant auditory sequences underwent a considerable degree of processing. Using a variant of Hebb's repetition design, Experiment 2 compared musically trained subjects and subjects who had little or no musical training on the same task as used in Experiment 1. Test sequences included some that intermittently and randomly recurred, which produced better performance than sequences that were generated anew for each trial. The auditory component of a recurring audiovisual sequence influenced musically trained subjects more than it did other subjects. This result demonstrates that stimulus-selective, task-irrelevant learning of sequences can occur even when such learning is an incidental by-product of the task being performed.
SeqTrim: a high-throughput pipeline for pre-processing any type of sequence read

PubMed Central

2010-01-01

Background High-throughput automated sequencing has enabled an exponential growth rate of sequencing data. This requires increasing sequence quality and reliability in order to avoid database contamination with artefactual sequences. The arrival of pyrosequencing enhances this problem and necessitates customisable pre-processing algorithms. Results SeqTrim has been implemented both as a Web and as a standalone command line application. Already-published and newly-designed algorithms have been included to identify sequence inserts, to remove low quality, vector, adaptor, low complexity and contaminant sequences, and to detect chimeric reads. The availability of several input and output formats allows its inclusion in sequence processing workflows. Due to its specific algorithms, SeqTrim outperforms other pre-processors implemented as Web services or standalone applications. It performs equally well with sequences from EST libraries, SSH libraries, genomic DNA libraries and pyrosequencing reads and does not lead to over-trimming. Conclusions SeqTrim is an efficient pipeline designed for pre-processing of any type of sequence read, including next-generation sequencing. It is easily configurable and provides a friendly interface that allows users to know what happened with sequences at every pre-processing stage, and to verify pre-processing of an individual sequence if desired. The recommended pipeline reveals more information about each sequence than previously described pre-processors and can discard more sequencing or experimental artefacts. PMID:20089148
Efficient Identification of Murine M2 Macrophage Peptide Targeting Ligands by Phage Display and Next-Generation Sequencing.

PubMed

Liu, Gary W; Livesay, Brynn R; Kacherovsky, Nataly A; Cieslewicz, Maryelise; Lutz, Emi; Waalkes, Adam; Jensen, Michael C; Salipante, Stephen J; Pun, Suzie H

2015-08-19

Peptide ligands are used to increase the specificity of drug carriers to their target cells and to facilitate intracellular delivery. One method to identify such peptide ligands, phage display, enables high-throughput screening of peptide libraries for ligands binding to therapeutic targets of interest. However, conventional methods for identifying target binders in a library by Sanger sequencing are low-throughput, labor-intensive, and provide a limited perspective (<0.01%) of the complete sequence space. Moreover, the small sample space can be dominated by nonspecific, preferentially amplifying "parasitic sequences" and plastic-binding sequences, which may lead to the identification of false positives or exclude the identification of target-binding sequences. To overcome these challenges, we employed next-generation Illumina sequencing to couple high-throughput screening and high-throughput sequencing, enabling more comprehensive access to the phage display library sequence space. In this work, we define the hallmarks of binding sequences in next-generation sequencing data, and develop a method that identifies several target-binding phage clones for murine, alternatively activated M2 macrophages with a high (100%) success rate: sequences and binding motifs were reproducibly present across biological replicates; binding motifs were identified across multiple unique sequences; and an unselected, amplified library accurately filtered out parasitic sequences. In addition, we validate the Multiple Em for Motif Elicitation tool as an efficient and principled means of discovering binding sequences.
Integration of Temporal and Ordinal Information During Serial Interception Sequence Learning

PubMed Central

Gobel, Eric W.; Sanchez, Daniel J.; Reber, Paul J.

2011-01-01

The expression of expert motor skills typically involves learning to perform a precisely timed sequence of movements (e.g., language production, music performance, athletic skills). Research examining incidental sequence learning has previously relied on a perceptually-cued task that gives participants exposure to repeating motor sequences but does not require timing of responses for accuracy. Using a novel perceptual-motor sequence learning task, learning a precisely timed cued sequence of motor actions is shown to occur without explicit instruction. Participants learned a repeating sequence through practice and showed sequence-specific knowledge via a performance decrement when switched to an unfamiliar sequence. In a second experiment, the integration of representation of action order and timing sequence knowledge was examined. When either action order or timing sequence information was selectively disrupted, performance was reduced to levels similar to completely novel sequences. Unlike prior sequence-learning research that has found timing information to be secondary to learning action sequences, when the task demands require accurate action and timing information, an integrated representation of these types of information is acquired. These results provide the first evidence for incidental learning of fully integrated action and timing sequence information in the absence of an independent representation of action order, and suggest that this integrative mechanism may play a material role in the acquisition of complex motor skills. PMID:21417511
Method and apparatus for biological sequence comparison

DOEpatents

Marr, T.G.; Chang, W.I.

1997-12-23

A method and apparatus are disclosed for comparing biological sequences from a known source of sequences, with a subject (query) sequence. The apparatus takes as input a set of target similarity levels (such as evolutionary distances in units of PAM), and finds all fragments of known sequences that are similar to the subject sequence at each target similarity level, and are long enough to be statistically significant. The invention device filters out fragments from the known sequences that are too short, or have a lower average similarity to the subject sequence than is required by each target similarity level. The subject sequence is then compared only to the remaining known sequences to find the best matches. The filtering member divides the subject sequence into overlapping blocks, each block being sufficiently large to contain a minimum-length alignment from a known sequence. For each block, the filter member compares the block with every possible short fragment in the known sequences and determines a best match for each comparison. The determined set of short fragment best matches for the block provide an upper threshold on alignment values. Regions of a certain length from the known sequences that have a mean alignment value upper threshold greater than a target unit score are concatenated to form a union. The current block is compared to the union and provides an indication of best local alignment with the subject sequence. 5 figs.
Method and apparatus for biological sequence comparison

DOEpatents

Marr, Thomas G.; Chang, William I-Wei

1997-01-01

A method and apparatus for comparing biological sequences from a known source of sequences, with a subject (query) sequence. The apparatus takes as input a set of target similarity levels (such as evolutionary distances in units of PAM), and finds all fragments of known sequences that are similar to the subject sequence at each target similarity level, and are long enough to be statistically significant. The invention device filters out fragments from the known sequences that are too short, or have a lower average similarity to the subject sequence than is required by each target similarity level. The subject sequence is then compared only to the remaining known sequences to find the best matches. The filtering member divides the subject sequence into overlapping blocks, each block being sufficiently large to contain a minimum-length alignment from a known sequence. For each block, the filter member compares the block with every possible short fragment in the known sequences and determines a best match for each comparison. The determined set of short fragment best matches for the block provide an upper threshold on alignment values. Regions of a certain length from the known sequences that have a mean alignment value upper threshold greater than a target unit score are concatenated to form a union. The current block is compared to the union and provides an indication of best local alignment with the subject sequence.
A Deep-Coverage Tomato BAC Library and Prospects Toward Development of an STC Framework for Genome Sequencing

PubMed Central

Budiman, Muhammad A.; Mao, Long; Wood, Todd C.; Wing, Rod A.

2000-01-01

Recently a new strategy using BAC end sequences as sequence-tagged connectors (STCs) was proposed for whole-genome sequencing projects. In this study, we present the construction and detailed characterization of a 15.0 haploid genome equivalent BAC library for the cultivated tomato, Lycopersicon esculentum cv. Heinz 1706. The library contains 129,024 clones with an average insert size of 117.5 kb and a chloroplast content of 1.11%. BAC end sequences from 1490 ends were generated and analyzed as a preliminary evaluation for using this library to develop an STC framework to sequence the tomato genome. A total of 1205 BAC end sequences (80.9%) were obtained, with an average length of 360 high-quality bases, and were searched against the GenBank database. Using a cutoff expectation value of <10−6, and combining the results from BLASTN, BLASTX, and TBLASTX searches, 24.3% of the BAC end sequences were similar to known sequences, of which almost half (48.7%) share sequence similarities to retrotransposons and 7% to known genes. Some of the transposable element sequences were the first reported in tomato, such as sequences similar to maize transposon Activator (Ac) ORF and tobacco pararetrovirus-like sequences. Interestingly, there were no BAC end sequences similar to the highly repeated TGRI and TGRII elements. However, the majority (70.3%) of STCs did not share significant sequence similarities to any sequences in GenBank at either the DNA or predicted protein levels, indicating that a large portion of the tomato genome is still unknown. Our data demonstrate that this BAC library is suitable for developing an STC database to sequence the tomato genome. The advantages of developing an STC framework for whole-genome sequencing of tomato are discussed. [The BAC end sequences described in this paper have been deposited in the GenBank data library under accession nos. AQ367111–AQ368361.] PMID:10645957
The use of coded PCR primers enables high-throughput sequencing of multiple homolog amplification products by 454 parallel sequencing.

PubMed

Binladen, Jonas; Gilbert, M Thomas P; Bollback, Jonathan P; Panitz, Frank; Bendixen, Christian; Nielsen, Rasmus; Willerslev, Eske

2007-02-14

The invention of the Genome Sequence 20 DNA Sequencing System (454 parallel sequencing platform) has enabled the rapid and high-volume production of sequence data. Until now, however, individual emulsion PCR (emPCR) reactions and subsequent sequencing runs have been unable to combine template DNA from multiple individuals, as homologous sequences cannot be subsequently assigned to their original sources. We use conventional PCR with 5'-nucleotide tagged primers to generate homologous DNA amplification products from multiple specimens, followed by sequencing through the high-throughput Genome Sequence 20 DNA Sequencing System (GS20, Roche/454 Life Sciences). Each DNA sequence is subsequently traced back to its individual source through 5'tag-analysis. We demonstrate that this new approach enables the assignment of virtually all the generated DNA sequences to the correct source once sequencing anomalies are accounted for (miss-assignment rate<0.4%). Therefore, the method enables accurate sequencing and assignment of homologous DNA sequences from multiple sources in single high-throughput GS20 run. We observe a bias in the distribution of the differently tagged primers that is dependent on the 5' nucleotide of the tag. In particular, primers 5' labelled with a cytosine are heavily overrepresented among the final sequences, while those 5' labelled with a thymine are strongly underrepresented. A weaker bias also exists with regards to the distribution of the sequences as sorted by the second nucleotide of the dinucleotide tags. As the results are based on a single GS20 run, the general applicability of the approach requires confirmation. However, our experiments demonstrate that 5'primer tagging is a useful method in which the sequencing power of the GS20 can be applied to PCR-based assays of multiple homologous PCR products. The new approach will be of value to a broad range of research areas, such as those of comparative genomics, complete mitochondrial analyses, population genetics, and phylogenetics.
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

DOEpatents

Gardner, Shea N; Mariella, Jr., Raymond P; Christian, Allen T; Young, Jennifer A; Clague, David S

2013-06-25

A method of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths.
Repetitive sequences in plant nuclear DNA: types, distribution, evolution and function.

PubMed

Mehrotra, Shweta; Goyal, Vinod

2014-08-01

Repetitive DNA sequences are a major component of eukaryotic genomes and may account for up to 90% of the genome size. They can be divided into minisatellite, microsatellite and satellite sequences. Satellite DNA sequences are considered to be a fast-evolving component of eukaryotic genomes, comprising tandemly-arrayed, highly-repetitive and highly-conserved monomer sequences. The monomer unit of satellite DNA is 150-400 base pairs (bp) in length. Repetitive sequences may be species- or genus-specific, and may be centromeric or subtelomeric in nature. They exhibit cohesive and concerted evolution caused by molecular drive, leading to high sequence homogeneity. Repetitive sequences accumulate variations in sequence and copy number during evolution, hence they are important tools for taxonomic and phylogenetic studies, and are known as "tuning knobs" in the evolution. Therefore, knowledge of repetitive sequences assists our understanding of the organization, evolution and behavior of eukaryotic genomes. Repetitive sequences have cytoplasmic, cellular and developmental effects and play a role in chromosomal recombination. In the post-genomics era, with the introduction of next-generation sequencing technology, it is possible to evaluate complex genomes for analyzing repetitive sequences and deciphering the yet unknown functional potential of repetitive sequences. Copyright © 2014 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.
Why barcode? High-throughput multiplex sequencing of mitochondrial genomes for molecular systematics.

PubMed

Timmermans, M J T N; Dodsworth, S; Culverwell, C L; Bocak, L; Ahrens, D; Littlewood, D T J; Pons, J; Vogler, A P

2010-11-01

Mitochondrial genome sequences are important markers for phylogenetics but taxon sampling remains sporadic because of the great effort and cost required to acquire full-length sequences. Here, we demonstrate a simple, cost-effective way to sequence the full complement of protein coding mitochondrial genes from pooled samples using the 454/Roche platform. Multiplexing was achieved without the need for expensive indexing tags ('barcodes'). The method was trialled with a set of long-range polymerase chain reaction (PCR) fragments from 30 species of Coleoptera (beetles) sequenced in a 1/16th sector of a sequencing plate. Long contigs were produced from the pooled sequences with sequencing depths ranging from ∼10 to 100× per contig. Species identity of individual contigs was established via three 'bait' sequences matching disparate parts of the mitochondrial genome obtained by conventional PCR and Sanger sequencing. This proved that assembly of contigs from the sequencing pool was correct. Our study produced sequences for 21 nearly complete and seven partial sets of protein coding mitochondrial genes. Combined with existing sequences for 25 taxa, an improved estimate of basal relationships in Coleoptera was obtained. The procedure could be employed routinely for mitochondrial genome sequencing at the species level, to provide improved species 'barcodes' that currently use the cox1 gene only.
Novel methodologies for spectral classification of exon and intron sequences

NASA Astrophysics Data System (ADS)

Kwan, Hon Keung; Kwan, Benjamin Y. M.; Kwan, Jennifer Y. Y.

2012-12-01

Digital processing of a nucleotide sequence requires it to be mapped to a numerical sequence in which the choice of nucleotide to numeric mapping affects how well its biological properties can be preserved and reflected from nucleotide domain to numerical domain. Digital spectral analysis of nucleotide sequences unfolds a period-3 power spectral value which is more prominent in an exon sequence as compared to that of an intron sequence. The success of a period-3 based exon and intron classification depends on the choice of a threshold value. The main purposes of this article are to introduce novel codes for 1-sequence numerical representations for spectral analysis and compare them to existing codes to determine appropriate representation, and to introduce novel thresholding methods for more accurate period-3 based exon and intron classification of an unknown sequence. The main findings of this study are summarized as follows: Among sixteen 1-sequence numerical representations, the K-Quaternary Code I offers an attractive performance. A windowed 1-sequence numerical representation (with window length of 9, 15, and 24 bases) offers a possible speed gain over non-windowed 4-sequence Voss representation which increases as sequence length increases. A winner threshold value (chosen from the best among two defined threshold values and one other threshold value) offers a top precision for classifying an unknown sequence of specified fixed lengths. An interpolated winner threshold value applicable to an unknown and arbitrary length sequence can be estimated from the winner threshold values of fixed length sequences with a comparable performance. In general, precision increases as sequence length increases. The study contributes an effective spectral analysis of nucleotide sequences to better reveal embedded properties, and has potential applications in improved genome annotation.
Sequence and facies architecture of the upper Blackhawk Formation and the Lower Castlegate Sandstone (Upper Cretaceous), Book Cliffs, Utah, USA

NASA Astrophysics Data System (ADS)

Yoshida, S.

2000-11-01

High-frequency stratigraphic sequences that comprise the Desert Member of the Blackhawk Formation, the Lower Castlegate Sandstone, and the Buck Tongue in the Green River area of Utah display changes in sequence architecture from marine deposits to marginal marine deposits to an entirely nonmarine section. Facies and sequence architecture differ above and below the regionally extensive Castlegate sequence boundary, which separates two low-frequency (106-year cyclicity) sequences. Below this surface, high-frequency sequences are identified and interpreted as comprising the highstand systems tract of the low-frequency Blackhawk sequence. Each high-frequency sequence has a local incised valley system on top of the wave-dominated delta, and coastal plain to shallow marine deposits are preserved. Above the Castlegate sequence boundary, in contrast, a regionally extensive sheet sandstone of fluvial to estuarine origin with laterally continuous internal erosional surfaces occurs. These deposits above the Castlegate sequence boundary are interpreted as the late lowstand to early transgressive systems tracts of the low-frequency Castlegate sequence. The base-level changes that generated both the low- and high-frequency sequences are attributed to crustal response to fluctuations in compressive intraplate stress on two different time scales. The low-frequency stratigraphic sequences are attributed to changes in the long-term regional subsidence rate and regional tilting of foreland basin fill. High-frequency sequences probably reflect the response of anisotropic basement to tectonism. Sequence architecture changes rapidly across the faulted margin of the underlying Paleozoic Paradox Basin. The high-frequency sequences are deeply eroded and stack above the Paradox Basin, but display less relief and become conformable updip. These features indicate that the area above the Paradox Basin was more prone to vertical structural movements during formation of the Blackhawk-Lower Castlegate succession.
Equally parsimonious pathways through an RNA sequence space are not equally likely

NASA Technical Reports Server (NTRS)

Lee, Y. H.; DSouza, L. M.; Fox, G. E.

1997-01-01

An experimental system for determining the potential ability of sequences resembling 5S ribosomal RNA (rRNA) to perform as functional 5S rRNAs in vivo in the Escherichia coli cellular environment was devised previously. Presumably, the only 5S rRNA sequences that would have been fixed by ancestral populations are ones that were functionally valid, and hence the actual historical paths taken through RNA sequence space during 5S rRNA evolution would have most likely utilized valid sequences. Herein, we examine the potential validity of all sequence intermediates along alternative equally parsimonious trajectories through RNA sequence space which connect two pairs of sequences that had previously been shown to behave as valid 5S rRNAs in E. coli. The first trajectory requires a total of four changes. The 14 sequence intermediates provide 24 apparently equally parsimonious paths by which the transition could occur. The second trajectory involves three changes, six intermediate sequences, and six potentially equally parsimonious paths. In total, only eight of the 20 sequence intermediates were found to be clearly invalid. As a consequence of the position of these invalid intermediates in the sequence space, seven of the 30 possible paths consisted of exclusively valid sequences. In several cases, the apparent validity/invalidity of the intermediate sequences could not be anticipated on the basis of current knowledge of the 5S rRNA structure. This suggests that the interdependencies in RNA sequence space may be more complex than currently appreciated. If ancestral sequences predicted by parsimony are to be regarded as actual historical sequences, then the present results would suggest that they should also satisfy a validity requirement and that, in at least limited cases, this conjecture can be tested experimentally.
Aftershock occurrence rate decay for individual sequences and catalogs

NASA Astrophysics Data System (ADS)

Nyffenegger, Paul A.

One of the earliest observations of the Earth's seismicity is that the rate of aftershock occurrence decays with time according to a power law commonly known as modified Omori-law (MOL) decay. However, the physical reasons for aftershock occurrence and the empirical decay in rate remain unclear despite numerous models that yield similar rate decay behavior. Key problems in relating the observed empirical relationship to the physical conditions of the mainshock and fault are the lack of studies including small magnitude mainshocks and the lack of uniformity between studies. We use simulated aftershock sequences to investigate the factors which influence the maximum likelihood (ML) estimate of the Omori-law p value, the parameter describing aftershock occurrence rate decay, for both individual aftershock sequences and "stacked" or superposed sequences. Generally the ML estimate of p is accurate, but since the ML estimated uncertainty is unaffected by whether the sequence resembles an MOL model, a goodness-of-fit test such as the Anderson-Darling statistic is necessary. While stacking aftershock sequences permits the study of entire catalogs and sequences with small aftershock populations, stacking introduces artifacts. The p value for stacked sequences is approximately equal to the mean of the individual sequence p values. We apply single-link cluster analysis to identify all aftershock sequences from eleven regional seismicity catalogs. We observe two new mathematically predictable empirical relationships for the distribution of aftershock sequence populations. The average properties of aftershock sequences are not correlated with tectonic environment, but aftershock populations and p values do show a depth dependence. The p values show great variability with time, and large values or changes in p sometimes precedes major earthquakes. Studies of teleseismic earthquake catalogs over the last twenty years have led seismologists to question seismicity models and aftershock sequence decay for deep sequences. For seven exceptional deep sequences, we conclude that MOL decay adequately describes these sequences, and little difference exists compared to shallow sequences. However, they do include larger aftershock populations compared to most deep sequences. These results imply that p values for deep sequences are larger than those for intermediate depth sequences.

"First generation" automated DNA sequencing technology.

PubMed

Slatko, Barton E; Kieleczawa, Jan; Ju, Jingyue; Gardner, Andrew F; Hendrickson, Cynthia L; Ausubel, Frederick M

2011-10-01

Beginning in the 1980s, automation of DNA sequencing has greatly increased throughput, reduced costs, and enabled large projects to be completed more easily. The development of automation technology paralleled the development of other aspects of DNA sequencing: better enzymes and chemistry, separation and imaging technology, sequencing protocols, robotics, and computational advancements (including base-calling algorithms with quality scores, database developments, and sequence analysis programs). Despite the emergence of high-throughput sequencing platforms, automated Sanger sequencing technology remains useful for many applications. This unit provides background and a description of the "First-Generation" automated DNA sequencing technology. It also includes protocols for using the current Applied Biosystems (ABI) automated DNA sequencing machines. © 2011 by John Wiley & Sons, Inc.
Site directed recombination

DOEpatents

Jurka, Jerzy W.

1997-01-01

Enhanced homologous recombination is obtained by employing a consensus sequence which has been found to be associated with integration of repeat sequences, such as Alu and ID. The consensus sequence or sequence having a single transition mutation determines one site of a double break which allows for high efficiency of integration at the site. By introducing single or double stranded DNA having the consensus sequence flanking region joined to a sequence of interest, one can reproducibly direct integration of the sequence of interest at one or a limited number of sites. In this way, specific sites can be identified and homologous recombination achieved at the site by employing a second flanking sequence associated with a sequence proximal to the 3'-nick.
The sequence of sequencers: The history of sequencing DNA

PubMed Central

Heather, James M.; Chain, Benjamin

2016-01-01

Determining the order of nucleic acid residues in biological samples is an integral component of a wide variety of research applications. Over the last fifty years large numbers of researchers have applied themselves to the production of techniques and technologies to facilitate this feat, sequencing DNA and RNA molecules. This time-scale has witnessed tremendous changes, moving from sequencing short oligonucleotides to millions of bases, from struggling towards the deduction of the coding sequence of a single gene to rapid and widely available whole genome sequencing. This article traverses those years, iterating through the different generations of sequencing technology, highlighting some of the key discoveries, researchers, and sequences along the way. PMID:26554401
Integrating alignment-based and alignment-free sequence similarity measures for biological sequence classification.

PubMed

Borozan, Ivan; Watt, Stuart; Ferretti, Vincent

2015-05-01

Alignment-based sequence similarity searches, while accurate for some type of sequences, can produce incorrect results when used on more divergent but functionally related sequences that have undergone the sequence rearrangements observed in many bacterial and viral genomes. Here, we propose a classification model that exploits the complementary nature of alignment-based and alignment-free similarity measures with the aim to improve the accuracy with which DNA and protein sequences are characterized. Our model classifies sequences using a combined sequence similarity score calculated by adaptively weighting the contribution of different sequence similarity measures. Weights are determined independently for each sequence in the test set and reflect the discriminatory ability of individual similarity measures in the training set. Because the similarity between some sequences is determined more accurately with one type of measure rather than another, our classifier allows different sets of weights to be associated with different sequences. Using five different similarity measures, we show that our model significantly improves the classification accuracy over the current composition- and alignment-based models, when predicting the taxonomic lineage for both short viral sequence fragments and complete viral sequences. We also show that our model can be used effectively for the classification of reads from a real metagenome dataset as well as protein sequences. All the datasets and the code used in this study are freely available at https://collaborators.oicr.on.ca/vferretti/borozan_csss/csss.html. ivan.borozan@gmail.com Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
Integrating alignment-based and alignment-free sequence similarity measures for biological sequence classification

PubMed Central

Borozan, Ivan; Watt, Stuart; Ferretti, Vincent

2015-01-01

Motivation: Alignment-based sequence similarity searches, while accurate for some type of sequences, can produce incorrect results when used on more divergent but functionally related sequences that have undergone the sequence rearrangements observed in many bacterial and viral genomes. Here, we propose a classification model that exploits the complementary nature of alignment-based and alignment-free similarity measures with the aim to improve the accuracy with which DNA and protein sequences are characterized. Results: Our model classifies sequences using a combined sequence similarity score calculated by adaptively weighting the contribution of different sequence similarity measures. Weights are determined independently for each sequence in the test set and reflect the discriminatory ability of individual similarity measures in the training set. Because the similarity between some sequences is determined more accurately with one type of measure rather than another, our classifier allows different sets of weights to be associated with different sequences. Using five different similarity measures, we show that our model significantly improves the classification accuracy over the current composition- and alignment-based models, when predicting the taxonomic lineage for both short viral sequence fragments and complete viral sequences. We also show that our model can be used effectively for the classification of reads from a real metagenome dataset as well as protein sequences. Availability and implementation: All the datasets and the code used in this study are freely available at https://collaborators.oicr.on.ca/vferretti/borozan_csss/csss.html. Contact: ivan.borozan@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25573913
Studying long 16S rDNA sequences with ultrafast-metagenomic sequence classification using exact alignments (Kraken).

PubMed

Valenzuela-González, Fabiola; Martínez-Porchas, Marcel; Villalpando-Canchola, Enrique; Vargas-Albores, Francisco

2016-03-01

Ultrafast-metagenomic sequence classification using exact alignments (Kraken) is a novel approach to classify 16S rDNA sequences. The classifier is based on mapping short sequences to the lowest ancestor and performing alignments to form subtrees with specific weights in each taxon node. This study aimed to evaluate the classification performance of Kraken with long 16S rDNA random environmental sequences produced by cloning and then Sanger sequenced. A total of 480 clones were isolated and expanded, and 264 of these clones formed contigs (1352 ± 153 bp). The same sequences were analyzed using the Ribosomal Database Project (RDP) classifier. Deeper classification performance was achieved by Kraken than by the RDP: 73% of the contigs were classified up to the species or variety levels, whereas 67% of these contigs were classified no further than the genus level by the RDP. The results also demonstrated that unassembled sequences analyzed by Kraken provide similar or inclusively deeper information. Moreover, sequences that did not form contigs, which are usually discarded by other programs, provided meaningful information when analyzed by Kraken. Finally, it appears that the assembly step for Sanger sequences can be eliminated when using Kraken. Kraken cumulates the information of both sequence senses, providing additional elements for the classification. In conclusion, the results demonstrate that Kraken is an excellent choice for use in the taxonomic assignment of sequences obtained by Sanger sequencing or based on third generation sequencing, of which the main goal is to generate larger sequences. Copyright © 2016 Elsevier B.V. All rights reserved.
Rapid identification of causative species in patients with Old World leishmaniasis.

PubMed Central

Minodier, P; Piarroux, R; Gambarelli, F; Joblet, C; Dumon, H

1997-01-01

Conventional methods for the identification of species of Leishmania parasite causing infections have limitations. By using a DNA-based alternative, the present study tries to develop a new tool for this purpose. Thirty-three patients living in Marseilles (in the south of France) were suffering from visceral or cutaneous leishmaniasis. DNA of the parasite in clinical samples (bone marrow, peripheral blood, or skin) from these patients were amplified by PCR and were directly sequenced. The sequences observed were compared to these of 30 strains of the genus causing Old World leishmaniasis collected in Europe, Africa, or Asia. In the analysis of the sequences of the strains, two different sequence patterns for Leishmania infantum, one sequence for Leishmania donovani, one sequence for Leishmania major, two sequences for Leishmania tropica, and one sequence for Leishmania aethiopica were obtained. Four sequences were observed among the strains from the patients: one was similar to the sequence for the L. major strains, two were identical to the sequences for the L. infantum strains, and the last sequence was not observed within the strains but had a high degree of homology with the sequences of the L. infantum and L. donovani strains. The L. infantum strains from all immunocompetent patients had the same sequence. The L. infantum strains from immunodeficient patients suffering from visceral leishmaniasis had three different sequences. This fact might signify that some variants of L. infantum acquire pathogenicity exclusively in immunocompromised patients. To dispense with the sequencing step, a restriction assay with HaeIII was used. Some restriction patterns might support genetic exchanges in members of the genus Leishmania. PMID:9316906
Science sequence design

NASA Technical Reports Server (NTRS)

Koskela, P. E.; Bollman, W. E.; Freeman, J. E.; Helton, M. R.; Reichert, R. J.; Travers, E. S.; Zawacki, S. J.

1973-01-01

The activities of the following members of the Navigation Team are recorded: the Science Sequence Design Group, responsible for preparing the final science sequence designs; the Advanced Sequence Planning Group, responsible for sequence planning; and the Science Recommendation Team (SRT) representatives, responsible for conducting the necessary sequence design interfaces with the teams during the mission. The interface task included science support in both advance planning and daily operations. Science sequences designed during the mission are also discussed.
The first genome sequences of human bocaviruses from Vietnam

PubMed Central

Thanh, Tran Tan; Van, Hoang Minh Tu; Hong, Nguyen Thi Thu; Nhu, Le Nguyen Truc; Anh, Nguyen To; Tuan, Ha Manh; Hien, Ho Van; Tuong, Nguyen Manh; Kien, Trinh Trung; Khanh, Truong Huu; Nhan, Le Nguyen Thanh; Hung, Nguyen Thanh; Chau, Nguyen Van Vinh; Thwaites, Guy; van Doorn, H. Rogier; Tan, Le Van

2017-01-01

As part of an ongoing effort to generate complete genome sequences of hand, foot and mouth disease-causing enteroviruses directly from clinical specimens, two complete coding sequences and two partial genomic sequences of human bocavirus 1 (n=3) and 2 (n=1) were co-amplified and sequenced, representing the first genome sequences of human bocaviruses from Vietnam. The sequences may aid future study aiming at understanding the evolution of the virus. PMID:28090592
Meeting the challenges of non-referenced genome assembly from short-read sequence data

Treesearch

M. Parks; A. Liston; R. Cronn

2010-01-01

Massively parallel sequencing technologies (MPST) offer unprecedented opportunities for novel sequencing projects. MPST, while offering tremendous sequencing capacity, are typically most effective in resequencing projects (as opposed to the sequencing of novel genomes) due to the fact that sequence is returned in relatively short reads. Nonetheless, there is great...
Experimental investigation of an RNA sequence space

NASA Technical Reports Server (NTRS)

Lee, Youn-Hyung; Dsouza, Lisa; Fox, George E.

1993-01-01

Modern rRNAs are the historic consequence of an ongoing evolutionary exploration of a sequence space. These extant sequences belong to a special subset of the sequence space that is comprised only of those primary sequences that can validly perform the biological function(s) required of the particular RNA. If it were possible to readily identify all such valid sequences, stochastic predictions could be made about the relative likelihood of various evolutionary pathways available to an RNA. Herein an experimental system which can assess whether a particular sequence is likely to have validity as a eubacterial 5S rRNA is described. A total of ten naturally occurring, and hence known to be valid, sequences and two point mutants of unknown validity were used to test the usefulness of the approach. Nine of the ten valid sequences tested positive whereas both mutants tested as clearly defective. The tenth valid sequence gave results that would be interpreted as reflecting a borderline status were the answer not known. These results demonstrate that it is possible to experimentally determine which sequences in local regions of the sequence space are potentially valid 5S rRNAs.
On the role of the SMA in the discrete sequence production task: a TMS study. Transcranial Magnetic Stimulation.

PubMed

Verwey, Willem B; Lammens, Robin; van Honk, Jack

2002-01-01

Participants practiced two discrete six-key sequences for a total of 420 trials. The 1 x 6 sequence had a unique order of key presses while the 2 x 3 sequence involved repetition of a three-key segment. Both sequences showed a long interkey interval halfway the sequence indicating hierarchical sequence control in that not only the 2 x 3 but also the 1 x 6 sequence was executed as two successive motor chunks. Besides, the second part of both sequences was executed faster than the first part. This supports the earlier notion of a motor processor executing the elements of familiar motor chunks and a cognitive processor triggering either these motor chunks or individual sequence elements. Low-frequency, off-line transcranial magnetic stimulation (TMS) of the supplementary motor area (SMA) counteracted normal improvement with practice of key presses at all sequence positions. Together, these results are in line with the notion that with moderate practice, the SMA executes short sequence fragments that are concatenated by other brain structures.
First-order and higher order sequence learning in specific language impairment.

PubMed

Clark, Gillian M; Lum, Jarrad A G

2017-02-01

A core claim of the procedural deficit hypothesis of specific language impairment (SLI) is that the disorder is associated with poor implicit sequence learning. This study investigated whether implicit sequence learning problems in SLI are present for first-order conditional (FOC) and higher order conditional (HOC) sequences. Twenty-five children with SLI and 27 age-matched, nonlanguage-impaired children completed 2 serial reaction time tasks. On 1 version, the sequence to be implicitly learnt comprised a FOC sequence and on the other a HOC sequence. Results showed that the SLI group learned the HOC sequence (η p ² = .285, p = .005) but not the FOC sequence (η p ² = .099, p = .118). The control group learned both sequences (FOC η p ² = .497, HOC η p 2= .465, ps < .001). The SLI group's difficulty learning the FOC sequence is consistent with the procedural deficit hypothesis. However, the study provides new evidence that multiple mechanisms may underpin the learning of FOC and HOC sequences. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1998-03-24

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.
Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1998-01-01

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.
Digital RNA sequencing minimizes sequence-dependent bias and amplification noise with optimized single-molecule barcodes

PubMed Central

Shiroguchi, Katsuyuki; Jia, Tony Z.; Sims, Peter A.; Xie, X. Sunney

2012-01-01

RNA sequencing (RNA-Seq) is a powerful tool for transcriptome profiling, but is hampered by sequence-dependent bias and inaccuracy at low copy numbers intrinsic to exponential PCR amplification. We developed a simple strategy for mitigating these complications, allowing truly digital RNA-Seq. Following reverse transcription, a large set of barcode sequences is added in excess, and nearly every cDNA molecule is uniquely labeled by random attachment of barcode sequences to both ends. After PCR, we applied paired-end deep sequencing to read the two barcodes and cDNA sequences. Rather than counting the number of reads, RNA abundance is measured based on the number of unique barcode sequences observed for a given cDNA sequence. We optimized the barcodes to be unambiguously identifiable, even in the presence of multiple sequencing errors. This method allows counting with single-copy resolution despite sequence-dependent bias and PCR-amplification noise, and is analogous to digital PCR but amendable to quantifying a whole transcriptome. We demonstrated transcriptome profiling of Escherichia coli with more accurate and reproducible quantification than conventional RNA-Seq. PMID:22232676
Deep sequencing of evolving pathogen populations: applications, errors, and bioinformatic solutions

PubMed Central

2014-01-01

Deep sequencing harnesses the high throughput nature of next generation sequencing technologies to generate population samples, treating information contained in individual reads as meaningful. Here, we review applications of deep sequencing to pathogen evolution. Pioneering deep sequencing studies from the virology literature are discussed, such as whole genome Roche-454 sequencing analyses of the dynamics of the rapidly mutating pathogens hepatitis C virus and HIV. Extension of the deep sequencing approach to bacterial populations is then discussed, including the impacts of emerging sequencing technologies. While it is clear that deep sequencing has unprecedented potential for assessing the genetic structure and evolutionary history of pathogen populations, bioinformatic challenges remain. We summarise current approaches to overcoming these challenges, in particular methods for detecting low frequency variants in the context of sequencing error and reconstructing individual haplotypes from short reads. PMID:24428920
ORFer--retrieval of protein sequences and open reading frames from GenBank and storage into relational databases or text files.

PubMed

Büssow, Konrad; Hoffmann, Steve; Sievert, Volker

2002-12-19

Functional genomics involves the parallel experimentation with large sets of proteins. This requires management of large sets of open reading frames as a prerequisite of the cloning and recombinant expression of these proteins. A Java program was developed for retrieval of protein and nucleic acid sequences and annotations from NCBI GenBank, using the XML sequence format. Annotations retrieved by ORFer include sequence name, organism and also the completeness of the sequence. The program has a graphical user interface, although it can be used in a non-interactive mode. For protein sequences, the program also extracts the open reading frame sequence, if available, and checks its correct translation. ORFer accepts user input in the form of single or lists of GenBank GI identifiers or accession numbers. It can be used to extract complete sets of open reading frames and protein sequences from any kind of GenBank sequence entry, including complete genomes or chromosomes. Sequences are either stored with their features in a relational database or can be exported as text files in Fasta or tabulator delimited format. The ORFer program is freely available at http://www.proteinstrukturfabrik.de/orfer. The ORFer program allows for fast retrieval of DNA sequences, protein sequences and their open reading frames and sequence annotations from GenBank. Furthermore, storage of sequences and features in a relational database is supported. Such a database can supplement a laboratory information system (LIMS) with appropriate sequence information.
Targeted Re-Sequencing Emulsion PCR Panel for Myopathies: Results in 94 Cases.

PubMed

Punetha, Jaya; Kesari, Akanchha; Uapinyoying, Prech; Giri, Mamta; Clarke, Nigel F; Waddell, Leigh B; North, Kathryn N; Ghaoui, Roula; O'Grady, Gina L; Oates, Emily C; Sandaradura, Sarah A; Bönnemann, Carsten G; Donkervoort, Sandra; Plotz, Paul H; Smith, Edward C; Tesi-Rocha, Carolina; Bertorini, Tulio E; Tarnopolsky, Mark A; Reitter, Bernd; Hausmanowa-Petrusewicz, Irena; Hoffman, Eric P

2016-05-27

Molecular diagnostics in the genetic myopathies often requires testing of the largest and most complex transcript units in the human genome (DMD, TTN, NEB). Iteratively targeting single genes for sequencing has traditionally entailed high costs and long turnaround times. Exome sequencing has begun to supplant single targeted genes, but there are concerns regarding coverage and needed depth of the very large and complex genes that frequently cause myopathies. To evaluate efficiency of next-generation sequencing technologies to provide molecular diagnostics for patients with previously undiagnosed myopathies. We tested a targeted re-sequencing approach, using a 45 gene emulsion PCR myopathy panel, with subsequent sequencing on the Illumina platform in 94 undiagnosed patients. We compared the targeted re-sequencing approach to exome sequencing for 10 of these patients studied. We detected likely pathogenic mutations in 33 out of 94 patients with a molecular diagnostic rate of approximately 35%. The remaining patients showed variants of unknown significance (35/94 patients) or no mutations detected in the 45 genes tested (26/94 patients). Mutation detection rates for targeted re-sequencing vs. whole exome were similar in both methods; however exome sequencing showed better distribution of reads and fewer exon dropouts. Given that costs of highly parallel re-sequencing and whole exome sequencing are similar, and that exome sequencing now takes considerably less laboratory processing time than targeted re-sequencing, we recommend exome sequencing as the standard approach for molecular diagnostics of myopathies.
Shotgun Protein Sequencing with Meta-contig Assembly*

PubMed Central

Guthals, Adrian; Clauser, Karl R.; Bandeira, Nuno

2012-01-01

Full-length de novo sequencing from tandem mass (MS/MS) spectra of unknown proteins such as antibodies or proteins from organisms with unsequenced genomes remains a challenging open problem. Conventional algorithms designed to individually sequence each MS/MS spectrum are limited by incomplete peptide fragmentation or low signal to noise ratios and tend to result in short de novo sequences at low sequencing accuracy. Our shotgun protein sequencing (SPS) approach was developed to ameliorate these limitations by first finding groups of unidentified spectra from the same peptides (contigs) and then deriving a consensus de novo sequence for each assembled set of spectra (contig sequences). But whereas SPS enables much more accurate reconstruction of de novo sequences longer than can be recovered from individual MS/MS spectra, it still requires error-tolerant matching to homologous proteins to group smaller contig sequences into full-length protein sequences, thus limiting its effectiveness on sequences from poorly annotated proteins. Using low and high resolution CID and high resolution HCD MS/MS spectra, we address this limitation with a Meta-SPS algorithm designed to overlap and further assemble SPS contigs into Meta-SPS de novo contig sequences extending as long as 100 amino acids at over 97% accuracy without requiring any knowledge of homologous protein sequences. We demonstrate Meta-SPS using distinct MS/MS data sets obtained with separate enzymatic digestions and discuss how the remaining de novo sequencing limitations relate to MS/MS acquisition settings. PMID:22798278

Shotgun protein sequencing with meta-contig assembly.

PubMed

Guthals, Adrian; Clauser, Karl R; Bandeira, Nuno

2012-10-01

Full-length de novo sequencing from tandem mass (MS/MS) spectra of unknown proteins such as antibodies or proteins from organisms with unsequenced genomes remains a challenging open problem. Conventional algorithms designed to individually sequence each MS/MS spectrum are limited by incomplete peptide fragmentation or low signal to noise ratios and tend to result in short de novo sequences at low sequencing accuracy. Our shotgun protein sequencing (SPS) approach was developed to ameliorate these limitations by first finding groups of unidentified spectra from the same peptides (contigs) and then deriving a consensus de novo sequence for each assembled set of spectra (contig sequences). But whereas SPS enables much more accurate reconstruction of de novo sequences longer than can be recovered from individual MS/MS spectra, it still requires error-tolerant matching to homologous proteins to group smaller contig sequences into full-length protein sequences, thus limiting its effectiveness on sequences from poorly annotated proteins. Using low and high resolution CID and high resolution HCD MS/MS spectra, we address this limitation with a Meta-SPS algorithm designed to overlap and further assemble SPS contigs into Meta-SPS de novo contig sequences extending as long as 100 amino acids at over 97% accuracy without requiring any knowledge of homologous protein sequences. We demonstrate Meta-SPS using distinct MS/MS data sets obtained with separate enzymatic digestions and discuss how the remaining de novo sequencing limitations relate to MS/MS acquisition settings.
Divergent nuclear 18S rDNA paralogs in a turkey coccidium, Eimeria meleagrimitis, complicate molecular systematics and identification.

PubMed

El-Sherry, Shiem; Ogedengbe, Mosun E; Hafeez, Mian A; Barta, John R

2013-07-01

Multiple 18S rDNA sequences were obtained from two single-oocyst-derived lines of each of Eimeria meleagrimitis and Eimeria adenoeides. After analysing the 15 new 18S rDNA sequences from two lines of E. meleagrimitis and 17 new sequences from two lines of E. adenoeides, there were clear indications that divergent, paralogous 18S rDNA copies existed within the nuclear genome of E. meleagrimitis. In contrast, mitochondrial cytochrome c oxidase subunit I (COI) partial sequences from all lines of a particular Eimeria sp. were identical and, in phylogenetic analyses, COI sequences clustered unambiguously in monophyletic and highly-supported clades specific to individual Eimeria sp. Phylogenetic analysis of the new 18S rDNA sequences from E. meleagrimitis showed that they formed two distinct clades: Type A with four new sequences; and Type B with nine new sequences; both Types A and B sequences were obtained from each of the single-oocyst-derived lines of E. meleagrimitis. Together these rDNA types formed a well-supported E. meleagrimitis clade. Types A and B 18S rDNA sequences from E. meleagrimitis had a mean sequence identity of only 97.4% whereas mean sequence identity within types was 99.1-99.3%. The observed intraspecific sequence divergence among E. meleagrimitis 18S rDNA sequence types was even higher (approximately 2.6%) than the interspecific sequence divergence present between some well-recognized species such as Eimeria tenella and Eimeria necatrix (1.1%). Our observations suggest that, unlike COI sequences, 18S rDNA sequences are not reliable molecular markers to be used alone for species identification with coccidia, although 18S rDNA sequences have clear utility for phylogenetic reconstruction of apicomplexan parasites at the genus and higher taxonomic ranks. Copyright © 2013. Published by Elsevier Ltd.
Existence of host-related DNA sequences in the schistosome genome.

PubMed

Iwamura, Y; Irie, Y; Kominami, R; Nara, T; Yasuraoka, K

1991-06-01

DNA sequences homologous to the mouse intracisternal A particle and endogenous type C retrovirus were detected in the DNAs of Schistosoma japonicum adults and S. mansoni eggs. Furthermore, other kinds of repetitive sequences in the host genome such as mouse type 1 Alu sequence (B1), mouse type 2 Alu sequence (B2) and mo-2 sequence, a mouse mini-satellite, were also detected in the DNAs from adults and eggs of S. japonicum and eggs of S. mansoni. Almost all of the sequences described above were absent in the DNAs of S. mansoni adults. The DNA fingerprints of schistosomes, using the mo-2 sequence, were indistinguishable from each other and resembled those of their murine hosts. Moreover, the mo-2 sequence was hypermethylated in the DNAs of schistosomes and its amount was variable in them. These facts indicate that host-related sequences are actually present in schistosomes and that the mo-2 repetitive sequence exists probably in extra-chromosome.
The complete CDS of the prion protein (PRNP) gene of African lion (Panthera leo).

PubMed

Maj, Andrzej; Spellman, Garth M; Sarver, Shane K

2008-04-01

We provide the complete PRNP CDS sequence for the African lion, which is different from the previously published sequence and more similar to other carnivore sequences. The newly obtained prion protein sequence differs from the domestic cat sequence at three amino acid positions and contains only four octapeptide repeats. We recommend that this sequence be used as the reference sequence for future studies of the PRNP gene for this species.
Tidying Up International Nucleotide Sequence Databases: Ecological, Geographical and Sequence Quality Annotation of ITS Sequences of Mycorrhizal Fungi

PubMed Central

Tedersoo, Leho; Abarenkov, Kessy; Nilsson, R. Henrik; Schüssler, Arthur; Grelet, Gwen-Aëlle; Kohout, Petr; Oja, Jane; Bonito, Gregory M.; Veldre, Vilmar; Jairus, Teele; Ryberg, Martin; Larsson, Karl-Henrik; Kõljalg, Urmas

2011-01-01

Sequence analysis of the ribosomal RNA operon, particularly the internal transcribed spacer (ITS) region, provides a powerful tool for identification of mycorrhizal fungi. The sequence data deposited in the International Nucleotide Sequence Databases (INSD) are, however, unfiltered for quality and are often poorly annotated with metadata. To detect chimeric and low-quality sequences and assign the ectomycorrhizal fungi to phylogenetic lineages, fungal ITS sequences were downloaded from INSD, aligned within family-level groups, and examined through phylogenetic analyses and BLAST searches. By combining the fungal sequence database UNITE and the annotation and search tool PlutoF, we also added metadata from the literature to these accessions. Altogether 35,632 sequences belonged to mycorrhizal fungi or originated from ericoid and orchid mycorrhizal roots. Of these sequences, 677 were considered chimeric and 2,174 of low read quality. Information detailing country of collection, geographical coordinates, interacting taxon and isolation source were supplemented to cover 78.0%, 33.0%, 41.7% and 96.4% of the sequences, respectively. These annotated sequences are publicly available via UNITE (http://unite.ut.ee/) for downstream biogeographic, ecological and taxonomic analyses. In European Nucleotide Archive (ENA; http://www.ebi.ac.uk/ena/), the annotated sequences have a special link-out to UNITE. We intend to expand the data annotation to additional genes and all taxonomic groups and functional guilds of fungi. PMID:21949797
The rapid evolution of molecular genetic diagnostics in neuromuscular diseases.

PubMed

Volk, Alexander E; Kubisch, Christian

2017-10-01

The development of massively parallel sequencing (MPS) has revolutionized molecular genetic diagnostics in monogenic disorders. The present review gives a brief overview of different MPS-based approaches used in clinical diagnostics of neuromuscular disorders (NMDs) and highlights their advantages and limitations. MPS-based approaches like gene panel sequencing, (whole) exome sequencing, (whole) genome sequencing, and RNA sequencing have been used to identify the genetic cause in NMDs. Although gene panel sequencing has evolved as a standard test for heterogeneous diseases, it is still debated, mainly because of financial issues and unsolved problems of variant interpretation, whether genome sequencing (and to a lesser extent also exome sequencing) of single patients can already be regarded as routine diagnostics. However, it has been shown that the inclusion of parents and additional family members often leads to a substantial increase in the diagnostic yield in exome-wide/genome-wide MPS approaches. In addition, MPS-based RNA sequencing just enters the research and diagnostic scene. Next-generation sequencing increasingly enables the detection of the genetic cause in highly heterogeneous diseases like NMDs in an efficient and affordable way. Gene panel sequencing and family-based exome sequencing have been proven as potent and cost-efficient diagnostic tools. Although clinical validation and interpretation of genome sequencing is still challenging, diagnostic RNA sequencing represents a promising tool to bypass some hurdles of diagnostics using genomic DNA.
The use of sequence-based SSR mining for the development of a vast collection of microsatellites in Aquilegia Formosa

Treesearch

Brandon Schlautman; Vera Pfeiffer; Juan Zalapa; Johanne Brunet

2014-01-01

Numerous microsatellite markers were developed for Aquilegia formosafrom sequences deposited within the Expressed Sequence Tag (EST), Genomic Survey Sequence (GSS), and Nucleotide databases in NCBI. Microsatellites (SSRs) were identified and primers were designed for 9 SSR containing sequences in the Nucleotide database, 3803 sequences in the EST...
Sequence repeats and protein structure

NASA Astrophysics Data System (ADS)

Hoang, Trinh X.; Trovato, Antonio; Seno, Flavio; Banavar, Jayanth R.; Maritan, Amos

2012-11-01

Repeats are frequently found in known protein sequences. The level of sequence conservation in tandem repeats correlates with their propensities to be intrinsically disordered. We employ a coarse-grained model of a protein with a two-letter amino acid alphabet, hydrophobic (H) and polar (P), to examine the sequence-structure relationship in the realm of repeated sequences. A fraction of repeated sequences comprises a distinct class of bad folders, whose folding temperatures are much lower than those of random sequences. Imperfection in sequence repetition improves the folding properties of the bad folders while deteriorating those of the good folders. Our results may explain why nature has utilized repeated sequences for their versatility and especially to design functional proteins that are intrinsically unstructured at physiological temperatures.
The sequence of sequencers: The history of sequencing DNA.

PubMed

Heather, James M; Chain, Benjamin

2016-01-01

Determining the order of nucleic acid residues in biological samples is an integral component of a wide variety of research applications. Over the last fifty years large numbers of researchers have applied themselves to the production of techniques and technologies to facilitate this feat, sequencing DNA and RNA molecules. This time-scale has witnessed tremendous changes, moving from sequencing short oligonucleotides to millions of bases, from struggling towards the deduction of the coding sequence of a single gene to rapid and widely available whole genome sequencing. This article traverses those years, iterating through the different generations of sequencing technology, highlighting some of the key discoveries, researchers, and sequences along the way. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
An improved model for whole genome phylogenetic analysis by Fourier transform.

PubMed

Yin, Changchuan; Yau, Stephen S-T

2015-10-07

DNA sequence similarity comparison is one of the major steps in computational phylogenetic studies. The sequence comparison of closely related DNA sequences and genomes is usually performed by multiple sequence alignments (MSA). While the MSA method is accurate for some types of sequences, it may produce incorrect results when DNA sequences undergone rearrangements as in many bacterial and viral genomes. It is also limited by its computational complexity for comparing large volumes of data. Previously, we proposed an alignment-free method that exploits the full information contents of DNA sequences by Discrete Fourier Transform (DFT), but still with some limitations. Here, we present a significantly improved method for the similarity comparison of DNA sequences by DFT. In this method, we map DNA sequences into 2-dimensional (2D) numerical sequences and then apply DFT to transform the 2D numerical sequences into frequency domain. In the 2D mapping, the nucleotide composition of a DNA sequence is a determinant factor and the 2D mapping reduces the nucleotide composition bias in distance measure, and thus improving the similarity measure of DNA sequences. To compare the DFT power spectra of DNA sequences with different lengths, we propose an improved even scaling algorithm to extend shorter DFT power spectra to the longest length of the underlying sequences. After the DFT power spectra are evenly scaled, the spectra are in the same dimensionality of the Fourier frequency space, then the Euclidean distances of full Fourier power spectra of the DNA sequences are used as the dissimilarity metrics. The improved DFT method, with increased computational performance by 2D numerical representation, can be applicable to any DNA sequences of different length ranges. We assess the accuracy of the improved DFT similarity measure in hierarchical clustering of different DNA sequences including simulated and real datasets. The method yields accurate and reliable phylogenetic trees and demonstrates that the improved DFT dissimilarity measure is an efficient and effective similarity measure of DNA sequences. Due to its high efficiency and accuracy, the proposed DFT similarity measure is successfully applied on phylogenetic analysis for individual genes and large whole bacterial genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.
Kit for detecting nucleic acid sequences using competitive hybridization probes

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

2001-01-01

A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the target sequence.
PFAAT version 2.0: a tool for editing, annotating, and analyzing multiple sequence alignments.

PubMed

Caffrey, Daniel R; Dana, Paul H; Mathur, Vidhya; Ocano, Marco; Hong, Eun-Jong; Wang, Yaoyu E; Somaroo, Shyamal; Caffrey, Brian E; Potluri, Shobha; Huang, Enoch S

2007-10-11

By virtue of their shared ancestry, homologous sequences are similar in their structure and function. Consequently, multiple sequence alignments are routinely used to identify trends that relate to function. This type of analysis is particularly productive when it is combined with structural and phylogenetic analysis. Here we describe the release of PFAAT version 2.0, a tool for editing, analyzing, and annotating multiple sequence alignments. Support for multiple annotations is a key component of this release as it provides a framework for most of the new functionalities. The sequence annotations are accessible from the alignment and tree, where they are typically used to label sequences or hyperlink them to related databases. Sequence annotations can be created manually or extracted automatically from UniProt entries. Once a multiple sequence alignment is populated with sequence annotations, sequences can be easily selected and sorted through a sophisticated search dialog. The selected sequences can be further analyzed using statistical methods that explicitly model relationships between the sequence annotations and residue properties. Residue annotations are accessible from the alignment viewer and are typically used to designate binding sites or properties for a particular residue. Residue annotations are also searchable, and allow one to quickly select alignment columns for further sequence analysis, e.g. computing percent identities. Other features include: novel algorithms to compute sequence conservation, mapping conservation scores to a 3D structure in Jmol, displaying secondary structure elements, and sorting sequences by residue composition. PFAAT provides a framework whereby end-users can specify knowledge for a protein family in the form of annotation. The annotations can be combined with sophisticated analysis to test hypothesis that relate to sequence, structure and function.
Molecular cloning and nucleotide sequence of the alpha and beta subunits of allophycocyanin from the cyanelle genome of Cyanophora paradoxa.

PubMed Central

Bryant, D A; de Lorimier, R; Lambert, D H; Dubbs, J M; Stirewalt, V L; Stevens, S E; Porter, R D; Tam, J; Jay, E

1985-01-01

The genes for the alpha- and beta-subunit apoproteins of allophycocyanin (AP) were isolated from the cyanelle genome of Cyanophora paradoxa and subjected to nucleotide sequence analysis. The AP beta-subunit apoprotein gene was localized to a 7.8-kilobase-pair Pst I restriction fragment from cyanelle DNA by hybridization with a tetradecameric oligonucleotide probe. Sequence analysis using that oligonucleotide and its complement as primers for the dideoxy chain-termination sequencing method confirmed the presence of both AP alpha- and beta-subunit genes on this restriction fragment. Additional oligonucleotide primers were synthesized as sequencing progressed and were used to determine rapidly the nucleotide sequence of a 1336-base-pair region of this cloned fragment. This strategy allowed the sequencing to be completed without a detailed restriction map and without extensive and time-consuming subcloning. The sequenced region contains two open reading frames whose deduced amino acid sequences are 81-85% homologous to cyanobacterial and red algal AP subunits whose amino acid sequences have been determined. The two open reading frames are in the same orientation and are separated by 39 base pairs. AP alpha is 5' to AP beta and both coding sequences are preceded by a polypurine, Shine-Dalgarno-type sequence. Sequences upstream from AP alpha closely resemble the Escherichia coli consensus promoter sequences and also show considerable homology to promoter sequences for several chloroplast-encoded psbA genes. A 56-base-pair palindromic sequence downstream from the AP beta gene could play a role in the termination of transcription or translation. The allophycocyanin apoprotein subunit genes are located on the large single-copy region of the cyanelle genome. PMID:2987916
Sequence dependent aggregation of peptides and fibril formation

NASA Astrophysics Data System (ADS)

Hung, Nguyen Ba; Le, Duy-Manh; Hoang, Trinh X.

2017-09-01

Deciphering the links between amino acid sequence and amyloid fibril formation is key for understanding protein misfolding diseases. Here we use Monte Carlo simulations to study the aggregation of short peptides in a coarse-grained model with hydrophobic-polar (HP) amino acid sequences and correlated side chain orientations for hydrophobic contacts. A significant heterogeneity is observed in the aggregate structures and in the thermodynamics of aggregation for systems of different HP sequences and different numbers of peptides. Fibril-like ordered aggregates are found for several sequences that contain the common HPH pattern, while other sequences may form helix bundles or disordered aggregates. A wide variation of the aggregation transition temperatures among sequences, even among those of the same hydrophobic fraction, indicates that not all sequences undergo aggregation at a presumable physiological temperature. The transition is found to be the most cooperative for sequences forming fibril-like structures. For a fibril-prone sequence, it is shown that fibril formation follows the nucleation and growth mechanism. Interestingly, a binary mixture of peptides of an aggregation-prone and a non-aggregation-prone sequence shows the association and conversion of the latter to the fibrillar structure. Our study highlights the role of a sequence in selecting fibril-like aggregates and also the impact of a structural template on fibril formation by peptides of unrelated sequences.
The primary structure of the Saccharomyces cerevisiae gene for 3-phosphoglycerate kinase.

PubMed Central

Hitzeman, R A; Hagie, F E; Hayflick, J S; Chen, C Y; Seeburg, P H; Derynck, R

1982-01-01

The DNA sequence of the gene for the yeast glycolytic enzyme, 3-phosphoglycerate kinase (PGK), has been obtained by sequencing part of a 3.1 kbp HindIII fragment obtained from the yeast genome. The structural gene sequence corresponds to a reading frame of 1251 bp coding for 416 amino acids with no intervening DNA sequences. The amino acid sequence is approximately 65 percent homologous with human and horse PGK protein sequences and is in general agreement with the published protein sequence for yeast PGK. As for other highly expressed structural genes in yeast, the coding sequence is highly codon biased with 95 percent of the amino acids coded for by a select 25 codons (out of 61 possible). Besides structural DNA sequence, 291 bp of 5'-flanking sequence and 286 bp of 3'-flanking sequence were determined. Transcription starts 36 nucleotides upstream from the translational start and stops 86-93 nucleotides downstream from the translational stop. These results suggest a non-polyadenylated mRNA length of 1373 to 1380 nucleotides, which is consistent with the observed length of 1500 nucleotides for polyadenylated PGK mRNA. A sequence TATATATAAA is found at 145 nucleotides upstream from the translational start. This sequence resembles the TATAAA box that is possibly associated with RNA polymerase II binding. Images PMID:6296791
Statistical properties of filtered pseudorandom digital sequences formed from the sum of maximum-length sequences

NASA Technical Reports Server (NTRS)

Wallace, G. R.; Weathers, G. D.; Graf, E. R.

1973-01-01

The statistics of filtered pseudorandom digital sequences called hybrid-sum sequences, formed from the modulo-two sum of several maximum-length sequences, are analyzed. The results indicate that a relation exists between the statistics of the filtered sequence and the characteristic polynomials of the component maximum length sequences. An analysis procedure is developed for identifying a large group of sequences with good statistical properties for applications requiring the generation of analog pseudorandom noise. By use of the analysis approach, the filtering process is approximated by the convolution of the sequence with a sum of unit step functions. A parameter reflecting the overall statistical properties of filtered pseudorandom sequences is derived. This parameter is called the statistical quality factor. A computer algorithm to calculate the statistical quality factor for the filtered sequences is presented, and the results for two examples of sequence combinations are included. The analysis reveals that the statistics of the signals generated with the hybrid-sum generator are potentially superior to the statistics of signals generated with maximum-length generators. Furthermore, fewer calculations are required to evaluate the statistics of a large group of hybrid-sum generators than are required to evaluate the statistics of the same size group of approximately equivalent maximum-length sequences.
Effects of the Ion PGM™ Hi-Q™ sequencing chemistry on sequence data quality.

PubMed

Churchill, Jennifer D; King, Jonathan L; Chakraborty, Ranajit; Budowle, Bruce

2016-09-01

Massively parallel sequencing (MPS) offers substantial improvements over current forensic DNA typing methodologies such as increased resolution, scalability, and throughput. The Ion PGM™ is a promising MPS platform for analysis of forensic biological evidence. The system employs a sequencing-by-synthesis chemistry on a semiconductor chip that measures a pH change due to the release of hydrogen ions as nucleotides are incorporated into the growing DNA strands. However, implementation of MPS into forensic laboratories requires a robust chemistry. Ion Torrent's Hi-Q™ Sequencing Chemistry was evaluated to determine if it could improve on the quality of the generated sequence data in association with selected genetic marker targets. The whole mitochondrial genome and the HID-Ion STR 10-plex panel were sequenced on the Ion PGM™ system with the Ion PGM™ Sequencing 400 Kit and the Ion PGM™ Hi-Q™ Sequencing Kit. Concordance, coverage, strand balance, noise, and deletion ratios were assessed in evaluating the performance of the Ion PGM™ Hi-Q™ Sequencing Kit. The results indicate that reliable, accurate data are generated and that sequencing through homopolymeric regions can be improved with the use of Ion Torrent's Hi-Q™ Sequencing Chemistry. Overall, the quality of the generated sequencing data supports the potential for use of the Ion PGM™ in forensic genetic laboratories.
Fundamental Bounds for Sequence Reconstruction from Nanopore Sequencers.

PubMed

Magner, Abram; Duda, Jarosław; Szpankowski, Wojciech; Grama, Ananth

2016-06-01

Nanopore sequencers are emerging as promising new platforms for high-throughput sequencing. As with other technologies, sequencer errors pose a major challenge for their effective use. In this paper, we present a novel information theoretic analysis of the impact of insertion-deletion (indel) errors in nanopore sequencers. In particular, we consider the following problems: (i) for given indel error characteristics and rate, what is the probability of accurate reconstruction as a function of sequence length; (ii) using replicated extrusion (the process of passing a DNA strand through the nanopore), what is the number of replicas needed to accurately reconstruct the true sequence with high probability? Our results provide a number of important insights: (i) the probability of accurate reconstruction of a sequence from a single sample in the presence of indel errors tends quickly (i.e., exponentially) to zero as the length of the sequence increases; and (ii) replicated extrusion is an effective technique for accurate reconstruction. We show that for typical distributions of indel errors, the required number of replicas is a slow function (polylogarithmic) of sequence length - implying that through replicated extrusion, we can sequence large reads using nanopore sequencers. Moreover, we show that in certain cases, the required number of replicas can be related to information-theoretic parameters of the indel error distributions.
AlignMe—a membrane protein sequence alignment web server

PubMed Central

Stamm, Marcus; Staritzbichler, René; Khafizov, Kamil; Forrest, Lucy R.

2014-01-01

We present a web server for pair-wise alignment of membrane protein sequences, using the program AlignMe. The server makes available two operational modes of AlignMe: (i) sequence to sequence alignment, taking two sequences in fasta format as input, combining information about each sequence from multiple sources and producing a pair-wise alignment (PW mode); and (ii) alignment of two multiple sequence alignments to create family-averaged hydropathy profile alignments (HP mode). For the PW sequence alignment mode, four different optimized parameter sets are provided, each suited to pairs of sequences with a specific similarity level. These settings utilize different types of inputs: (position-specific) substitution matrices, secondary structure predictions and transmembrane propensities from transmembrane predictions or hydrophobicity scales. In the second (HP) mode, each input multiple sequence alignment is converted into a hydrophobicity profile averaged over the provided set of sequence homologs; the two profiles are then aligned. The HP mode enables qualitative comparison of transmembrane topologies (and therefore potentially of 3D folds) of two membrane proteins, which can be useful if the proteins have low sequence similarity. In summary, the AlignMe web server provides user-friendly access to a set of tools for analysis and comparison of membrane protein sequences. Access is available at http://www.bioinfo.mpg.de/AlignMe PMID:24753425
Quantitative phenotyping via deep barcode sequencing.

PubMed

Smith, Andrew M; Heisler, Lawrence E; Mellor, Joseph; Kaper, Fiona; Thompson, Michael J; Chee, Mark; Roth, Frederick P; Giaever, Guri; Nislow, Corey

2009-10-01

Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or "Bar-seq," outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied to a complex chemogenomic assay, Bar-seq quantitatively identifies drug targets, with performance superior to the benchmark microarray assay. We also show that Bar-seq is well-suited for a multiplex format. We completely re-sequenced and re-annotated the yeast deletion collection using deep sequencing, found that approximately 20% of the barcodes and common priming sequences varied from expectation, and used this revised list of barcode sequences to improve data quality. Together, this new assay and analysis routine provide a deep-sequencing-based toolkit for identifying gene-environment interactions on a genome-wide scale.

Polypeptide having or assisting in carbohydrate material degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2016-02-16

The invention relates to a polypeptide which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof

DOE Office of Scientific and Technical Information (OSTI.GOV)

Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well asmore » the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.« less
Polypeptide having swollenin activity and uses thereof

DOEpatents

Schoonneveld-Bergmans, Margot Elizabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica D; Damveld, Robbertus Antonius

2015-11-04

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Whole genome sequence analysis of unidentified genetically modified papaya for development of a specific detection method.

PubMed

Nakamura, Kosuke; Kondo, Kazunari; Akiyama, Hiroshi; Ishigaki, Takumi; Noguchi, Akio; Katsumata, Hiroshi; Takasaki, Kazuto; Futo, Satoshi; Sakata, Kozue; Fukuda, Nozomi; Mano, Junichi; Kitta, Kazumi; Tanaka, Hidenori; Akashi, Ryo; Nishimaki-Mogami, Tomoko

2016-08-15

Identification of transgenic sequences in an unknown genetically modified (GM) papaya (Carica papaya L.) by whole genome sequence analysis was demonstrated. Whole genome sequence data were generated for a GM-positive fresh papaya fruit commodity detected in monitoring using real-time polymerase chain reaction (PCR). The sequences obtained were mapped against an open database for papaya genome sequence. Transgenic construct- and event-specific sequences were identified as a GM papaya developed to resist infection from a Papaya ringspot virus. Based on the transgenic sequences, a specific real-time PCR detection method for GM papaya applicable to various food commodities was developed. Whole genome sequence analysis enabled identifying unknown transgenic construct- and event-specific sequences in GM papaya and development of a reliable method for detecting them in papaya food commodities. Copyright © 2016 Elsevier Ltd. All rights reserved.
Polypeptide having beta-glucosidase activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel; Damveld, Robbertus Antonius

2015-09-01

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having cellobiohydrolase activity and uses thereof

DOEpatents

Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

2015-09-15

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having acetyl xylan esterase activity and uses thereof

DOEpatents

Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2015-10-20

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having carbohydrate degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica Diana; Damveld, Robbertus Antonius

2015-08-18

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
GATA: A graphic alignment tool for comparative sequenceanalysis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nix, David A.; Eisen, Michael B.

2005-01-01

Several problems exist with current methods used to align DNA sequences for comparative sequence analysis. Most dynamic programming algorithms assume that conserved sequence elements are collinear. This assumption appears valid when comparing orthologous protein coding sequences. Functional constraints on proteins provide strong selective pressure against sequence inversions, and minimize sequence duplications and feature shuffling. For non-coding sequences this collinearity assumption is often invalid. For example, enhancers contain clusters of transcription factor binding sites that change in number, orientation, and spacing during evolution yet the enhancer retains its activity. Dotplot analysis is often used to estimate non-coding sequence relatedness. Yet dotmore » plots do not actually align sequences and thus cannot account well for base insertions or deletions. Moreover, they lack an adequate statistical framework for comparing sequence relatedness and are limited to pairwise comparisons. Lastly, dot plots and dynamic programming text outputs fail to provide an intuitive means for visualizing DNA alignments.« less
Deep Sequencing to Identify the Causes of Viral Encephalitis

PubMed Central

Chan, Benjamin K.; Wilson, Theodore; Fischer, Kael F.; Kriesel, John D.

2014-01-01

Deep sequencing allows for a rapid, accurate characterization of microbial DNA and RNA sequences in many types of samples. Deep sequencing (also called next generation sequencing or NGS) is being developed to assist with the diagnosis of a wide variety of infectious diseases. In this study, seven frozen brain samples from deceased subjects with recent encephalitis were investigated. RNA from each sample was extracted, randomly reverse transcribed and sequenced. The sequence analysis was performed in a blinded fashion and confirmed with pathogen-specific PCR. This analysis successfully identified measles virus sequences in two brain samples and herpes simplex virus type-1 sequences in three brain samples. No pathogen was identified in the other two brain specimens. These results were concordant with pathogen-specific PCR and partially concordant with prior neuropathological examinations, demonstrating that deep sequencing can accurately identify viral infections in frozen brain tissue. PMID:24699691
Method for phosphorothioate antisense DNA sequencing by capillary electrophoresis with UV detection.

PubMed

Froim, D; Hopkins, C E; Belenky, A; Cohen, A S

1997-11-01

The progress of antisense DNA therapy demands development of reliable and convenient methods for sequencing short single-stranded oligonucleotides. A method of phosphorothioate antisense DNA sequencing analysis using UV detection coupled to capillary electrophoresis (CE) has been developed based on a modified chain termination sequencing method. The proposed method reduces the sequencing cost since it uses affordable CE-UV instrumentation and requires no labeling with minimal sample processing before analysis. Cycle sequencing with ThermoSequenase generates quantities of sequencing products that are readily detectable by UV. Discrimination of undesired components from sequencing products in the reaction mixture, previously accomplished by fluorescent or radioactive labeling, is now achieved by bringing concentrations of undesired components below the UV detection range which yields a 'clean', well defined sequence. UV detection coupled with CE offers additional conveniences for sequencing since it can be accomplished with commercially available CE-UV equipment and is readily amenable to automation.
Method for phosphorothioate antisense DNA sequencing by capillary electrophoresis with UV detection.

PubMed Central

Froim, D; Hopkins, C E; Belenky, A; Cohen, A S

1997-01-01

The progress of antisense DNA therapy demands development of reliable and convenient methods for sequencing short single-stranded oligonucleotides. A method of phosphorothioate antisense DNA sequencing analysis using UV detection coupled to capillary electrophoresis (CE) has been developed based on a modified chain termination sequencing method. The proposed method reduces the sequencing cost since it uses affordable CE-UV instrumentation and requires no labeling with minimal sample processing before analysis. Cycle sequencing with ThermoSequenase generates quantities of sequencing products that are readily detectable by UV. Discrimination of undesired components from sequencing products in the reaction mixture, previously accomplished by fluorescent or radioactive labeling, is now achieved by bringing concentrations of undesired components below the UV detection range which yields a 'clean', well defined sequence. UV detection coupled with CE offers additional conveniences for sequencing since it can be accomplished with commercially available CE-UV equipment and is readily amenable to automation. PMID:9336449
Orthogonal Polynomials Associated with Complementary Chain Sequences

NASA Astrophysics Data System (ADS)

Behera, Kiran Kumar; Sri Ranga, A.; Swaminathan, A.

2016-07-01

Using the minimal parameter sequence of a given chain sequence, we introduce the concept of complementary chain sequences, which we view as perturbations of chain sequences. Using the relation between these complementary chain sequences and the corresponding Verblunsky coefficients, the para-orthogonal polynomials and the associated Szegő polynomials are analyzed. Two illustrations, one involving Gaussian hypergeometric functions and the other involving Carathéodory functions are also provided. A connection between these two illustrations by means of complementary chain sequences is also observed.
The sequence measurement system of the IR camera

NASA Astrophysics Data System (ADS)

Geng, Ai-hui; Han, Hong-xia; Zhang, Hai-bo

2011-08-01

Currently, the IR cameras are broadly used in the optic-electronic tracking, optic-electronic measuring, fire control and optic-electronic countermeasure field, but the output sequence of the most presently applied IR cameras in the project is complex and the giving sequence documents from the leave factory are not detailed. Aiming at the requirement that the continuous image transmission and image procession system need the detailed sequence of the IR cameras, the sequence measurement system of the IR camera is designed, and the detailed sequence measurement way of the applied IR camera is carried out. The FPGA programming combined with the SignalTap online observation way has been applied in the sequence measurement system, and the precise sequence of the IR camera's output signal has been achieved, the detailed document of the IR camera has been supplied to the continuous image transmission system, image processing system and etc. The sequence measurement system of the IR camera includes CameraLink input interface part, LVDS input interface part, FPGA part, CameraLink output interface part and etc, thereinto the FPGA part is the key composed part in the sequence measurement system. Both the video signal of the CmaeraLink style and the video signal of LVDS style can be accepted by the sequence measurement system, and because the image processing card and image memory card always use the CameraLink interface as its input interface style, the output signal style of the sequence measurement system has been designed into CameraLink interface. The sequence measurement system does the IR camera's sequence measurement work and meanwhile does the interface transmission work to some cameras. Inside the FPGA of the sequence measurement system, the sequence measurement program, the pixel clock modification, the SignalTap file configuration and the SignalTap online observation has been integrated to realize the precise measurement to the IR camera. Te sequence measurement program written by the verilog language combining the SignalTap tool on line observation can count the line numbers in one frame, pixel numbers in one line and meanwhile account the line offset and row offset of the image. Aiming at the complex sequence of the IR camera's output signal, the sequence measurement system of the IR camera accurately measures the sequence of the project applied camera, supplies the detailed sequence document to the continuous system such as image processing system and image transmission system and gives out the concrete parameters of the fval, lval, pixclk, line offset and row offset. The experiment shows that the sequence measurement system of the IR camera can get the precise sequence measurement result and works stably, laying foundation for the continuous system.
Evaluation of 16S Rrna amplicon sequencing using two next-generation sequencing technologies for phylogenetic analysis of the rumen bacterial community in steers

USDA-ARS?s Scientific Manuscript database

Next generation sequencing technologies have vastly changed the approach of sequencing of the 16S rRNA gene for studies in microbial ecology. Three distinct technologies are available for large-scale 16S sequencing. All three are subject to biases introduced by sequencing error rates, amplificatio...
Evaluation of 16S rRNA amplicon sequencing using two next-generation sequencing technologies for phylogenetic analysis of the rumen bacterial community in steers

USDA-ARS?s Scientific Manuscript database

Next generation sequencing technologies have vastly changed the approach of sequencing of the 16S rRNA gene for studies in microbial ecology. Three distinct technologies are available for large-scale 16S sequencing. All three are subject to biases introduced by sequencing error rates, amplificatio...
Noncoding sequence classification based on wavelet transform analysis: part I

NASA Astrophysics Data System (ADS)

Paredes, O.; Strojnik, M.; Romo-Vázquez, R.; Vélez Pérez, H.; Ranta, R.; Garcia-Torales, G.; Scholl, M. K.; Morales, J. A.

2017-09-01

DNA sequences in human genome can be divided into the coding and noncoding ones. Coding sequences are those that are read during the transcription. The identification of coding sequences has been widely reported in literature due to its much-studied periodicity. Noncoding sequences represent the majority of the human genome. They play an important role in gene regulation and differentiation among the cells. However, noncoding sequences do not exhibit periodicities that correlate to their functions. The ENCODE (Encyclopedia of DNA elements) and Epigenomic Roadmap Project projects have cataloged the human noncoding sequences into specific functions. We study characteristics of noncoding sequences with wavelet analysis of genomic signals.
Full genome sequence of Rocio virus reveal substantial variations from the prototype Rocio virus SPH 34675 sequence.

PubMed

Setoh, Yin Xiang; Amarilla, Alberto A; Peng, Nias Y; Slonchak, Andrii; Periasamy, Parthiban; Figueiredo, Luiz T M; Aquino, Victor H; Khromykh, Alexander A

2018-01-01

Rocio virus (ROCV) is an arbovirus belonging to the genus Flavivirus, family Flaviviridae. We present an updated sequence of ROCV strain SPH 34675 (GenBank: AY632542.4), the only available full genome sequence prior to this study. Using next-generation sequencing of the entire genome, we reveal substantial sequence variation from the prototype sequence, with 30 nucleotide differences amounting to 14 amino acid changes, as well as significant changes to predicted 3'UTR RNA structures. Our results present an updated and corrected sequence of a potential emerging human-virulent flavivirus uniquely indigenous to Brazil (GenBank: MF461639).
What is a melody? On the relationship between pitch and brightness of timbre.

PubMed

Cousineau, Marion; Carcagno, Samuele; Demany, Laurent; Pressnitzer, Daniel

2013-01-01

Previous studies showed that the perceptual processing of sound sequences is more efficient when the sounds vary in pitch than when they vary in loudness. We show here that sequences of sounds varying in brightness of timbre are processed with the same efficiency as pitch sequences. The sounds used consisted of two simultaneous pure tones one octave apart, and the listeners' task was to make same/different judgments on pairs of sequences varying in length (one, two, or four sounds). In one condition, brightness of timbre was varied within the sequences by changing the relative level of the two pure tones. In other conditions, pitch was varied by changing fundamental frequency, or loudness was varied by changing the overall level. In all conditions, only two possible sounds could be used in a given sequence, and these two sounds were equally discriminable. When sequence length increased from one to four, discrimination performance decreased substantially for loudness sequences, but to a smaller extent for brightness sequences and pitch sequences. In the latter two conditions, sequence length had a similar effect on performance. These results suggest that the processes dedicated to pitch and brightness analysis, when probed with a sequence-discrimination task, share unexpected similarities.
Prefrontal neural correlates of memory for sequences.

PubMed

Averbeck, Bruno B; Lee, Daeyeol

2007-02-28

The sequence of actions appropriate to solve a problem often needs to be discovered by trial and error and recalled in the future when faced with the same problem. Here, we show that when monkeys had to discover and then remember a sequence of decisions across trials, ensembles of prefrontal cortex neurons reflected the sequence of decisions the animal would make throughout the interval between trials. This signal could reflect either an explicit memory process or a sequence-planning process that begins far in advance of the actual sequence execution. This finding extended to error trials such that, when the neural activity during the intertrial interval specified the wrong sequence, the animal also attempted to execute an incorrect sequence. More specifically, we used a decoding analysis to predict the sequence the monkey was planning to execute at the end of the fore-period, just before sequence execution. When this analysis was applied to error trials, we were able to predict where in the sequence the error would occur, up to three movements into the future. This suggests that prefrontal neural activity can retain information about sequences between trials, and that regardless of whether information is remembered correctly or incorrectly, the prefrontal activity veridically reflects the animal's action plan.

Local alignment of two-base encoded DNA sequence

PubMed Central

Homer, Nils; Merriman, Barry; Nelson, Stanley F

2009-01-01

Background DNA sequence comparison is based on optimal local alignment of two sequences using a similarity score. However, some new DNA sequencing technologies do not directly measure the base sequence, but rather an encoded form, such as the two-base encoding considered here. In order to compare such data to a reference sequence, the data must be decoded into sequence. The decoding is deterministic, but the possibility of measurement errors requires searching among all possible error modes and resulting alignments to achieve an optimal balance of fewer errors versus greater sequence similarity. Results We present an extension of the standard dynamic programming method for local alignment, which simultaneously decodes the data and performs the alignment, maximizing a similarity score based on a weighted combination of errors and edits, and allowing an affine gap penalty. We also present simulations that demonstrate the performance characteristics of our two base encoded alignment method and contrast those with standard DNA sequence alignment under the same conditions. Conclusion The new local alignment algorithm for two-base encoded data has substantial power to properly detect and correct measurement errors while identifying underlying sequence variants, and facilitating genome re-sequencing efforts based on this form of sequence data. PMID:19508732
Application of representational difference analysis to identify genomic differences between Bradyrhizobium elkanii and B. Japonicum species.

PubMed

Soares, René Arderius; Passaglia, Luciane Maria Pereira

2010-10-01

Bradyrhizobium elkanii is successfully used in the formulation of commercial inoculants and, together with B. japonicum, it fully supplies the plant nitrogen demands. Despite the similarity between B. japonicum and B. elkanii species, several works demonstrated genetic and physiological differences between them. In this work Representational Difference Analysis (RDA) was used for genomic comparison between B. elkanii SEMIA 587, a crop inoculant strain, and B. japonicum USDA 110, a reference strain. Two hundred sequences were obtained. From these, 46 sequences belonged exclusively to the genome of B. elkanii strain, and 154 showed similarity to sequences from B. japonicum genome. From the 46 sequences with no similarity to sequences from B. japonicum, 39 showed no similarity to sequences in public databases and seven showed similarity to sequences of genes coding for known proteins. These seven sequences were divided in three groups: similar to sequences from other Bradyrhizobium strains, similar to sequences from other nitrogen-fixing bacteria, and similar to sequences from non nitrogen-fixing bacteria. These new sequences could be used as DNA markers in order to investigate the rates of genetic material gain and loss in natural Bradyrhizobium strains.
Augmented brain function by coordinated reset stimulation with slowly varying sequences.

PubMed

Zeitler, Magteld; Tass, Peter A

2015-01-01

Several brain disorders are characterized by abnormally strong neuronal synchrony. Coordinated Reset (CR) stimulation was developed to selectively counteract abnormal neuronal synchrony by desynchronization. For this, phase resetting stimuli are delivered to different subpopulations in a timely coordinated way. In neural networks with spike timing-dependent plasticity CR stimulation may eventually lead to an anti-kindling, i.e., an unlearning of abnormal synaptic connectivity and abnormal synchrony. The spatiotemporal sequence by which all stimulation sites are stimulated exactly once is called the stimulation site sequence, or briefly sequence. So far, in simulations, pre-clinical and clinical applications CR was applied either with fixed sequences or rapidly varying sequences (RVS). In this computational study we show that appropriate repetition of the sequence with occasional random switching to the next sequence may significantly improve the anti-kindling effect of CR. To this end, a sequence is applied many times before randomly switching to the next sequence. This new method is called SVS CR stimulation, i.e., CR with slowly varying sequences. In a neuronal network with strong short-range excitatory and weak long-range inhibitory dynamic couplings SVS CR stimulation turns out to be superior to CR stimulation with fixed sequences or RVS.
Augmented brain function by coordinated reset stimulation with slowly varying sequences

PubMed Central

Zeitler, Magteld; Tass, Peter A.

2015-01-01

Several brain disorders are characterized by abnormally strong neuronal synchrony. Coordinated Reset (CR) stimulation was developed to selectively counteract abnormal neuronal synchrony by desynchronization. For this, phase resetting stimuli are delivered to different subpopulations in a timely coordinated way. In neural networks with spike timing-dependent plasticity CR stimulation may eventually lead to an anti-kindling, i.e., an unlearning of abnormal synaptic connectivity and abnormal synchrony. The spatiotemporal sequence by which all stimulation sites are stimulated exactly once is called the stimulation site sequence, or briefly sequence. So far, in simulations, pre-clinical and clinical applications CR was applied either with fixed sequences or rapidly varying sequences (RVS). In this computational study we show that appropriate repetition of the sequence with occasional random switching to the next sequence may significantly improve the anti-kindling effect of CR. To this end, a sequence is applied many times before randomly switching to the next sequence. This new method is called SVS CR stimulation, i.e., CR with slowly varying sequences. In a neuronal network with strong short-range excitatory and weak long-range inhibitory dynamic couplings SVS CR stimulation turns out to be superior to CR stimulation with fixed sequences or RVS. PMID:25873867
A Reference Viral Database (RVDB) To Enhance Bioinformatics Analysis of High-Throughput Sequencing for Novel Virus Detection

PubMed Central

Goodacre, Norman; Aljanahi, Aisha; Nandakumar, Subhiksha; Mikailov, Mike

2018-01-01

ABSTRACT Detection of distantly related viruses by high-throughput sequencing (HTS) is bioinformatically challenging because of the lack of a public database containing all viral sequences, without abundant nonviral sequences, which can extend runtime and obscure viral hits. Our reference viral database (RVDB) includes all viral, virus-related, and virus-like nucleotide sequences (excluding bacterial viruses), regardless of length, and with overall reduced cellular sequences. Semantic selection criteria (SEM-I) were used to select viral sequences from GenBank, resulting in a first-generation viral database (VDB). This database was manually and computationally reviewed, resulting in refined, semantic selection criteria (SEM-R), which were applied to a new download of updated GenBank sequences to create a second-generation VDB. Viral entries in the latter were clustered at 98% by CD-HIT-EST to reduce redundancy while retaining high viral sequence diversity. The viral identity of the clustered representative sequences (creps) was confirmed by BLAST searches in NCBI databases and HMMER searches in PFAM and DFAM databases. The resulting RVDB contained a broad representation of viral families, sequence diversity, and a reduced cellular content; it includes full-length and partial sequences and endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Testing of RVDBv10.2, with an in-house HTS transcriptomic data set indicated a significantly faster run for virus detection than interrogating the entirety of the NCBI nonredundant nucleotide database, which contains all viral sequences but also nonviral sequences. RVDB is publically available for facilitating HTS analysis, particularly for novel virus detection. It is meant to be updated on a regular basis to include new viral sequences added to GenBank. IMPORTANCE To facilitate bioinformatics analysis of high-throughput sequencing (HTS) data for the detection of both known and novel viruses, we have developed a new reference viral database (RVDB) that provides a broad representation of different virus species from eukaryotes by including all viral, virus-like, and virus-related sequences (excluding bacteriophages), regardless of their size. In particular, RVDB contains endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Sequences were clustered to reduce redundancy while retaining high viral sequence diversity. A particularly useful feature of RVDB is the reduction of cellular sequences, which can enhance the run efficiency of large transcriptomic and genomic data analysis and increase the specificity of virus detection. PMID:29564396
A Reference Viral Database (RVDB) To Enhance Bioinformatics Analysis of High-Throughput Sequencing for Novel Virus Detection.

PubMed

Goodacre, Norman; Aljanahi, Aisha; Nandakumar, Subhiksha; Mikailov, Mike; Khan, Arifa S

2018-01-01

Detection of distantly related viruses by high-throughput sequencing (HTS) is bioinformatically challenging because of the lack of a public database containing all viral sequences, without abundant nonviral sequences, which can extend runtime and obscure viral hits. Our reference viral database (RVDB) includes all viral, virus-related, and virus-like nucleotide sequences (excluding bacterial viruses), regardless of length, and with overall reduced cellular sequences. Semantic selection criteria (SEM-I) were used to select viral sequences from GenBank, resulting in a first-generation viral database (VDB). This database was manually and computationally reviewed, resulting in refined, semantic selection criteria (SEM-R), which were applied to a new download of updated GenBank sequences to create a second-generation VDB. Viral entries in the latter were clustered at 98% by CD-HIT-EST to reduce redundancy while retaining high viral sequence diversity. The viral identity of the clustered representative sequences (creps) was confirmed by BLAST searches in NCBI databases and HMMER searches in PFAM and DFAM databases. The resulting RVDB contained a broad representation of viral families, sequence diversity, and a reduced cellular content; it includes full-length and partial sequences and endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Testing of RVDBv10.2, with an in-house HTS transcriptomic data set indicated a significantly faster run for virus detection than interrogating the entirety of the NCBI nonredundant nucleotide database, which contains all viral sequences but also nonviral sequences. RVDB is publically available for facilitating HTS analysis, particularly for novel virus detection. It is meant to be updated on a regular basis to include new viral sequences added to GenBank. IMPORTANCE To facilitate bioinformatics analysis of high-throughput sequencing (HTS) data for the detection of both known and novel viruses, we have developed a new reference viral database (RVDB) that provides a broad representation of different virus species from eukaryotes by including all viral, virus-like, and virus-related sequences (excluding bacteriophages), regardless of their size. In particular, RVDB contains endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Sequences were clustered to reduce redundancy while retaining high viral sequence diversity. A particularly useful feature of RVDB is the reduction of cellular sequences, which can enhance the run efficiency of large transcriptomic and genomic data analysis and increase the specificity of virus detection.
Memory for sequences of events impaired in typical aging.

PubMed

Allen, Timothy A; Morris, Andrea M; Stark, Shauna M; Fortin, Norbert J; Stark, Craig E L

2015-03-01

Typical aging is associated with diminished episodic memory performance. To improve our understanding of the fundamental mechanisms underlying this age-related memory deficit, we previously developed an integrated, cross-species approach to link converging evidence from human and animal research. This novel approach focuses on the ability to remember sequences of events, an important feature of episodic memory. Unlike existing paradigms, this task is nonspatial, nonverbal, and can be used to isolate different cognitive processes that may be differentially affected in aging. Here, we used this task to make a comprehensive comparison of sequence memory performance between younger (18-22 yr) and older adults (62-86 yr). Specifically, participants viewed repeated sequences of six colored, fractal images and indicated whether each item was presented "in sequence" or "out of sequence." Several out of sequence probe trials were used to provide a detailed assessment of sequence memory, including: (i) repeating an item from earlier in the sequence ("Repeats"; e.g., AB A: DEF), (ii) skipping ahead in the sequence ("Skips"; e.g., AB D: DEF), and (iii) inserting an item from a different sequence into the same ordinal position ("Ordinal Transfers"; e.g., AB 3: DEF). We found that older adults performed as well as younger controls when tested on well-known and predictable sequences, but were severely impaired when tested using novel sequences. Importantly, overall sequence memory performance in older adults steadily declined with age, a decline not detected with other measures (RAVLT or BPS-O). We further characterized this deficit by showing that performance of older adults was severely impaired on specific probe trials that required detailed knowledge of the sequence (Skips and Ordinal Transfers), and was associated with a shift in their underlying mnemonic representation of the sequences. Collectively, these findings provide unambiguous evidence that the capacity to remember sequences of events is fundamentally affected by typical aging. © 2015 Allen et al.; Published by Cold Spring Harbor Laboratory Press.
Identification of a precursor genomic segment that provided a sequence unique to glycophorin B and E genes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Onda, M.; Kudo, S.; Fukuda, M.

Human glycophorin A, B, and E (GPA, GPB, and GPE) genes belong to a gene family located at the long arm of chromosome 4. These three genes are homologous from the 5'-flanking sequence to the Alu sequence, which is 1 kb downstream from the exon encoding the transmembrane domain. Analysis of the Alu sequence and flanking direct repeat sequences suggested that the GPA gene most closely resembles the ancestral gene, whereas the GPB and GPE gene arose by homologous recombination within the Alu sequence, acquiring 3' sequences from an unrelated precursor genomic segment. Here the authors describe the identification ofmore » this putative precursor genomic segment. A human genomic library was screened by using the sequence of the 3' region of the GPB gene as a probe. The genomic clones isolated were found to contain an Alu sequence that appeared to be involved in the recombination. Downstream from the Alu sequence, the nucleotide sequence of the precursor genomic segment is almost identical to that of the GPB or GPE gene. In contrast, the upstream sequence of the genomic segment differs entirely from that of the GPA, GPB, and GPE genes. Conservation of the direct repeats flanking the Alu sequence of the genomic segment strongly suggests that the sequence of this genomic segment has been maintained during evolution. This identified genomic segment was found to reside downstream from the GPA gene by both gene mapping and in situ chromosomal localization. The precursor genomic segment was also identified in the orangutan genome, which is known to lack GPB and GPE genes. These results indicate that one of the duplicated ancestral glycophorin genes acquired a unique 3' sequence by unequal crossing-over through its Alu sequence and the further downstream Alu sequence present in the duplicated gene. Further duplication and divergence of this gene yielded the GPB and GPE genes. 37 refs., 5 figs.« less
Integrated sequence stratigraphy of the postimpact sediments from the Eyreville core holes, Chesapeake Bay impact structure inner basin

USGS Publications Warehouse

Browning, J.V.; Miller, K.G.; McLaughlin, P.P.; Edwards, L.E.; Kulpecz, A.A.; Powars, D.S.; Wade, B.S.; Feigenson, M.D.; Wright, J.D.

2009-01-01

The Eyreville core holes provide the first continuously cored record of postimpact sequences from within the deepest part of the central Chesapeake Bay impact crater. We analyzed the upper Eocene to Pliocene postimpact sediments from the Eyreville A and C core holes for lithology (semiquantitative measurements of grain size and composition), sequence stratigraphy, and chronostratigraphy. Age is based primarily on Sr isotope stratigraphy supplemented by biostratigraphy (dinocysts, nannofossils, and planktonic foraminifers); age resolution is approximately ??0.5 Ma for early Miocene sequences and approximately ??1.0 Ma for younger and older sequences. Eocene-lower Miocene sequences are subtle, upper middle to lower upper Miocene sequences are more clearly distinguished, and upper Miocene- Pliocene sequences display a distinct facies pattern within sequences. We recognize two upper Eocene, two Oligocene, nine Miocene, three Pliocene, and one Pleistocene sequence and correlate them with those in New Jersey and Delaware. The upper Eocene through Pleistocene strata at Eyreville record changes from: (1) rapidly deposited, extremely fi ne-grained Eocene strata that probably represent two sequences deposited in a deep (>200 m) basin; to (2) highly dissected Oligocene (two very thin sequences) to lower Miocene (three thin sequences) with a long hiatus; to (3) a thick, rapidly deposited (43-73 m/Ma), very fi ne-grained, biosiliceous middle Miocene (16.5-14 Ma) section divided into three sequences (V5-V3) deposited in middle neritic paleoenvironments; to (4) a 4.5-Ma-long hiatus (12.8-8.3 Ma); to (5) sandy, shelly upper Miocene to Pliocene strata (8.3-2.0 Ma) divided into six sequences deposited in shelf and shoreface environments; and, last, to (6) a sandy middle Pleistocene paralic sequence (~400 ka). The Eyreville cores thus record the fi lling of a deep impact-generated basin where the timing of sequence boundaries is heavily infl uenced by eustasy. ?? 2009 The Geological Society of America.
Are commercial providers a viable option for clinical bacterial sequencing?

PubMed

Raven, Kathy; Blane, Beth; Churcher, Carol; Parkhill, Julian; Peacock, Sharon J

2018-04-05

Bacterial whole-genome sequencing in the clinical setting has the potential to bring major improvements to infection control and clinical practice. Sequencing instruments are not currently available in the majority of routine microbiology laboratories worldwide, but an alternative is to use external sequencing providers. To foster discussion around this we investigated whether send-out services were a viable option. Four providers offering MiSeq sequencing were selected based on cost and evaluated based on the service provided and sequence data quality. DNA was prepared from five methicillin-resistant Staphylococcus aureus (MRSA) isolates, four of which were investigated during a previously published outbreak in the UK together with a reference MRSA isolate (ST22 HO 5096 0412). Cost of sequencing per isolate ranged from £155 to £342 and turnaround times from DNA postage to arrival of sequence data ranged from 12 to 63 days. Comparison of commercially generated genomes against the original sequence data demonstrated very high concordance, with no more than one single nucleotide polymorphism (SNP) difference on core genome mapping between the original sequences and the new sequence for all four providers. Multilocus sequence type could not be assigned based on assembly for the two cheapest sequence providers due to fragmented assemblies probably caused by a lower output of sequence data per isolate. Our results indicate that external providers returned highly accurate genome data, but that improvements are required in turnaround time to make this a viable option for use in clinical practice.
Universal sequence map (USM) of arbitrary discrete sequences

PubMed Central

2002-01-01

Background For over a decade the idea of representing biological sequences in a continuous coordinate space has maintained its appeal but not been fully realized. The basic idea is that any sequence of symbols may define trajectories in the continuous space conserving all its statistical properties. Ideally, such a representation would allow scale independent sequence analysis – without the context of fixed memory length. A simple example would consist on being able to infer the homology between two sequences solely by comparing the coordinates of any two homologous units. Results We have successfully identified such an iterative function for bijective mappingψ of discrete sequences into objects of continuous state space that enable scale-independent sequence analysis. The technique, named Universal Sequence Mapping (USM), is applicable to sequences with an arbitrary length and arbitrary number of unique units and generates a representation where map distance estimates sequence similarity. The novel USM procedure is based on earlier work by these and other authors on the properties of Chaos Game Representation (CGR). The latter enables the representation of 4 unit type sequences (like DNA) as an order free Markov Chain transition table. The properties of USM are illustrated with test data and can be verified for other data by using the accompanying web-based tool:http://bioinformatics.musc.edu/~jonas/usm/. Conclusions USM is shown to enable a statistical mechanics approach to sequence analysis. The scale independent representation frees sequence analysis from the need to assume a memory length in the investigation of syntactic rules. PMID:11895567
Database-independent Protein Sequencing (DiPS) Enables Full-length de Novo Protein and Antibody Sequence Determination.

PubMed

Savidor, Alon; Barzilay, Rotem; Elinger, Dalia; Yarden, Yosef; Lindzen, Moshit; Gabashvili, Alexandra; Adiv Tal, Ophir; Levin, Yishai

2017-06-01

Traditional "bottom-up" proteomic approaches use proteolytic digestion, LC-MS/MS, and database searching to elucidate peptide identities and their parent proteins. Protein sequences absent from the database cannot be identified, and even if present in the database, complete sequence coverage is rarely achieved even for the most abundant proteins in the sample. Thus, sequencing of unknown proteins such as antibodies or constituents of metaproteomes remains a challenging problem. To date, there is no available method for full-length protein sequencing, independent of a reference database, in high throughput. Here, we present Database-independent Protein Sequencing, a method for unambiguous, rapid, database-independent, full-length protein sequencing. The method is a novel combination of non-enzymatic, semi-random cleavage of the protein, LC-MS/MS analysis, peptide de novo sequencing, extraction of peptide tags, and their assembly into a consensus sequence using an algorithm named "Peptide Tag Assembler." As proof-of-concept, the method was applied to samples of three known proteins representing three size classes and to a previously un-sequenced, clinically relevant monoclonal antibody. Excluding leucine/isoleucine and glutamic acid/deamidated glutamine ambiguities, end-to-end full-length de novo sequencing was achieved with 99-100% accuracy for all benchmarking proteins and the antibody light chain. Accuracy of the sequenced antibody heavy chain, including the entire variable region, was also 100%, but there was a 23-residue gap in the constant region sequence. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Sequence quality analysis tool for HIV type 1 protease and reverse transcriptase.

PubMed

Delong, Allison K; Wu, Mingham; Bennett, Diane; Parkin, Neil; Wu, Zhijin; Hogan, Joseph W; Kantor, Rami

2012-08-01

Access to antiretroviral therapy is increasing globally and drug resistance evolution is anticipated. Currently, protease (PR) and reverse transcriptase (RT) sequence generation is increasing, including the use of in-house sequencing assays, and quality assessment prior to sequence analysis is essential. We created a computational HIV PR/RT Sequence Quality Analysis Tool (SQUAT) that runs in the R statistical environment. Sequence quality thresholds are calculated from a large dataset (46,802 PR and 44,432 RT sequences) from the published literature ( http://hivdb.Stanford.edu ). Nucleic acid sequences are read into SQUAT, identified, aligned, and translated. Nucleic acid sequences are flagged if with >five 1-2-base insertions; >one 3-base insertion; >one deletion; >six PR or >18 RT ambiguous bases; >three consecutive PR or >four RT nucleic acid mutations; >zero stop codons; >three PR or >six RT ambiguous amino acids; >three consecutive PR or >four RT amino acid mutations; >zero unique amino acids; or <0.5% or >15% genetic distance from another submitted sequence. Thresholds are user modifiable. SQUAT output includes a summary report with detailed comments for troubleshooting of flagged sequences, histograms of pairwise genetic distances, neighbor joining phylogenetic trees, and aligned nucleic and amino acid sequences. SQUAT is a stand-alone, free, web-independent tool to ensure use of high-quality HIV PR/RT sequences in interpretation and reporting of drug resistance, while increasing awareness and expertise and facilitating troubleshooting of potentially problematic sequences.
Effects of pre- and pro-sequence of thaumatin on the secretion by Pichia pastoris.

PubMed

Ide, Nobuyuki; Masuda, Tetsuya; Kitabatake, Naofumi

2007-11-23

Thaumatin is a 22-kDa sweet-tasting protein containing eight disulfide bonds. When thaumatin is expressed in Pichia pastoris using the thaumatin cDNA fused with both the alpha-factor signal sequence and the Kex2 protease cleavage site from Saccharomyces cerevisiae, the N-terminal sequence of the secreted thaumatin molecule is not processed correctly. To examine the role of the thaumatin cDNA-encoded N-terminal pre-sequence and C-terminal pro-sequence on the processing of thaumatin and efficiency of thaumatin production in P. pastoris, four expression plasmids with different pre-sequence and pro-sequence were constructed and transformed into P. pastoris. The transformants containing pre-thaumatin gene that has the native plant signal, secreted thaumatin molecules in the medium. The N-terminal amino acid sequence of the secreted thaumatin molecule was processed correctly. The production yield of thaumatin was not affected by the C-terminal pro-sequence, and the pro-sequence was not processed in P. pastoris, indicating that pro-sequence is not necessary for thaumatin synthesis.
From Conventional to Next Generation Sequencing of Epstein-Barr Virus Genomes.

PubMed

Kwok, Hin; Chiang, Alan Kwok Shing

2016-02-24

Genomic sequences of Epstein-Barr virus (EBV) have been of interest because the virus is associated with cancers, such as nasopharyngeal carcinoma, and conditions such as infectious mononucleosis. The progress of whole-genome EBV sequencing has been limited by the inefficiency and cost of the first-generation sequencing technology. With the advancement of next-generation sequencing (NGS) and target enrichment strategies, increasing number of EBV genomes has been published. These genomes were sequenced using different approaches, either with or without EBV DNA enrichment. This review provides an overview of the EBV genomes published to date, and a description of the sequencing technology and bioinformatic analyses employed in generating these sequences. We further explored ways through which the quality of sequencing data can be improved, such as using DNA oligos for capture hybridization, and longer insert size and read length in the sequencing runs. These advances will enable large-scale genomic sequencing of EBV which will facilitate a better understanding of the genetic variations of EBV in different geographic regions and discovery of potentially pathogenic variants in specific diseases.
Mass spectrometry-based protein identification by integrating de novo sequencing with database searching.

PubMed

Wang, Penghao; Wilson, Susan R

2013-01-01

Mass spectrometry-based protein identification is a very challenging task. The main identification approaches include de novo sequencing and database searching. Both approaches have shortcomings, so an integrative approach has been developed. The integrative approach firstly infers partial peptide sequences, known as tags, directly from tandem spectra through de novo sequencing, and then puts these sequences into a database search to see if a close peptide match can be found. However the current implementation of this integrative approach has several limitations. Firstly, simplistic de novo sequencing is applied and only very short sequence tags are used. Secondly, most integrative methods apply an algorithm similar to BLAST to search for exact sequence matches and do not accommodate sequence errors well. Thirdly, by applying these methods the integrated de novo sequencing makes a limited contribution to the scoring model which is still largely based on database searching. We have developed a new integrative protein identification method which can integrate de novo sequencing more efficiently into database searching. Evaluated on large real datasets, our method outperforms popular identification methods.
Metagenome assembly through clustering of next-generation sequencing data using protein sequences.

PubMed

Sim, Mikang; Kim, Jaebum

2015-02-01

The study of environmental microbial communities, called metagenomics, has gained a lot of attention because of the recent advances in next-generation sequencing (NGS) technologies. Microbes play a critical role in changing their environments, and the mode of their effect can be solved by investigating metagenomes. However, the difficulty of metagenomes, such as the combination of multiple microbes and different species abundance, makes metagenome assembly tasks more challenging. In this paper, we developed a new metagenome assembly method by utilizing protein sequences, in addition to the NGS read sequences. Our method (i) builds read clusters by using mapping information against available protein sequences, and (ii) creates contig sequences by finding consensus sequences through probabilistic choices from the read clusters. By using simulated NGS read sequences from real microbial genome sequences, we evaluated our method in comparison with four existing assembly programs. We found that our method could generate relatively long and accurate metagenome assemblies, indicating that the idea of using protein sequences, as a guide for the assembly, is promising. Copyright © 2015 Elsevier B.V. All rights reserved.
Representation of DNA sequences in genetic codon context with applications in exon and intron prediction.

PubMed

Yin, Changchuan

2015-04-01

To apply digital signal processing (DSP) methods to analyze DNA sequences, the sequences first must be specially mapped into numerical sequences. Thus, effective numerical mappings of DNA sequences play key roles in the effectiveness of DSP-based methods such as exon prediction. Despite numerous mappings of symbolic DNA sequences to numerical series, the existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC) in which the numerical values are optimized by simulation annealing to maximize the 3-periodicity signal to noise ratio (SNR). The optimized GCC representation is then applied in exon and intron prediction by Short-Time Fourier Transform (STFT) approach. The results show the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.
An efficient approach to BAC based assembly of complex genomes.

PubMed

Visendi, Paul; Berkman, Paul J; Hayashi, Satomi; Golicz, Agnieszka A; Bayer, Philipp E; Ruperao, Pradeep; Hurgobin, Bhavna; Montenegro, Juan; Chan, Chon-Kit Kenneth; Staňková, Helena; Batley, Jacqueline; Šimková, Hana; Doležel, Jaroslav; Edwards, David

2016-01-01

There has been an exponential growth in the number of genome sequencing projects since the introduction of next generation DNA sequencing technologies. Genome projects have increasingly involved assembly of whole genome data which produces inferior assemblies compared to traditional Sanger sequencing of genomic fragments cloned into bacterial artificial chromosomes (BACs). While whole genome shotgun sequencing using next generation sequencing (NGS) is relatively fast and inexpensive, this method is extremely challenging for highly complex genomes, where polyploidy or high repeat content confounds accurate assembly, or where a highly accurate 'gold' reference is required. Several attempts have been made to improve genome sequencing approaches by incorporating NGS methods, to variable success. We present the application of a novel BAC sequencing approach which combines indexed pools of BACs, Illumina paired read sequencing, a sequence assembler specifically designed for complex BAC assembly, and a custom bioinformatics pipeline. We demonstrate this method by sequencing and assembling BAC cloned fragments from bread wheat and sugarcane genomes. We demonstrate that our assembly approach is accurate, robust, cost effective and scalable, with applications for complete genome sequencing in large and complex genomes.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies.

PubMed

Utturkar, Sagar M; Klingeman, Dawn M; Hurt, Richard A; Brown, Steven D

2017-01-01

This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.

Sequence memory based on coherent spin-interaction neural networks.

PubMed

Xia, Min; Wong, W K; Wang, Zhijie

2014-12-01

Sequence information processing, for instance, the sequence memory, plays an important role on many functions of brain. In the workings of the human brain, the steady-state period is alterable. However, in the existing sequence memory models using heteroassociations, the steady-state period cannot be changed in the sequence recall. In this work, a novel neural network model for sequence memory with controllable steady-state period based on coherent spininteraction is proposed. In the proposed model, neurons fire collectively in a phase-coherent manner, which lets a neuron group respond differently to different patterns and also lets different neuron groups respond differently to one pattern. The simulation results demonstrating the performance of the sequence memory are presented. By introducing a new coherent spin-interaction sequence memory model, the steady-state period can be controlled by dimension parameters and the overlap between the input pattern and the stored patterns. The sequence storage capacity is enlarged by coherent spin interaction compared with the existing sequence memory models. Furthermore, the sequence storage capacity has an exponential relationship to the dimension of the neural network.
A Next-Generation Sequencing Primer—How Does It Work and What Can It Do?

PubMed Central

Alekseyev, Yuriy O.; Fazeli, Roghayeh; Yang, Shi; Basran, Raveen; Miller, Nancy S.

2018-01-01

Next-generation sequencing refers to a high-throughput technology that determines the nucleic acid sequences and identifies variants in a sample. The technology has been introduced into clinical laboratory testing and produces test results for precision medicine. Since next-generation sequencing is relatively new, graduate students, medical students, pathology residents, and other physicians may benefit from a primer to provide a foundation about basic next-generation sequencing methods and applications, as well as specific examples where it has had diagnostic and prognostic utility. Next-generation sequencing technology grew out of advances in multiple fields to produce a sophisticated laboratory test with tremendous potential. Next-generation sequencing may be used in the clinical setting to look for specific genetic alterations in patients with cancer, diagnose inherited conditions such as cystic fibrosis, and detect and profile microbial organisms. This primer will review DNA sequencing technology, the commercialization of next-generation sequencing, and clinical uses of next-generation sequencing. Specific applications where next-generation sequencing has demonstrated utility in oncology are provided. PMID:29761157
Towards predicting the encoding capability of MR fingerprinting sequences.

PubMed

Sommer, K; Amthor, T; Doneva, M; Koken, P; Meineke, J; Börnert, P

2017-09-01

Sequence optimization and appropriate sequence selection is still an unmet need in magnetic resonance fingerprinting (MRF). The main challenge in MRF sequence design is the lack of an appropriate measure of the sequence's encoding capability. To find such a measure, three different candidates for judging the encoding capability have been investigated: local and global dot-product-based measures judging dictionary entry similarity as well as a Monte Carlo method that evaluates the noise propagation properties of an MRF sequence. Consistency of these measures for different sequence lengths as well as the capability to predict actual sequence performance in both phantom and in vivo measurements was analyzed. While the dot-product-based measures yielded inconsistent results for different sequence lengths, the Monte Carlo method was in a good agreement with phantom experiments. In particular, the Monte Carlo method could accurately predict the performance of different flip angle patterns in actual measurements. The proposed Monte Carlo method provides an appropriate measure of MRF sequence encoding capability and may be used for sequence optimization. Copyright © 2017 Elsevier Inc. All rights reserved.
Use of sequence-independent-single-primer-amplification (SISPA) for whole genome sequencing using illumina MiSeq platform for avian influenza virus, Newcastle disease virus, and infectious bronchitis virus

USDA-ARS?s Scientific Manuscript database

Over the past decade, Next Generation Sequencing (NGS) technologies, also called deep sequencing, have continued to evolve, increasing capacity and lower the cost necessary for large genome sequencing projects. The one of the advantage of NGS platforms is the possibility to sequence the samples with...
New Sequences with Low Correlation and Large Family Size

NASA Astrophysics Data System (ADS)

Zeng, Fanxin

In direct-sequence code-division multiple-access (DS-CDMA) communication systems and direct-sequence ultra wideband (DS-UWB) radios, sequences with low correlation and large family size are important for reducing multiple access interference (MAI) and accepting more active users, respectively. In this paper, a new collection of families of sequences of length pn-1, which includes three constructions, is proposed. The maximum number of cyclically distinct families without GMW sequences in each construction is φ(pn-1)/n·φ(pm-1)/m, where p is a prime number, n is an even number, and n=2m, and these sequences can be binary or polyphase depending upon choice of the parameter p. In Construction I, there are pn distinct sequences within each family and the new sequences have at most d+2 nontrivial periodic correlation {-pm-1, -1, pm-1, 2pm-1,…,dpm-1}. In Construction II, the new sequences have large family size p2n and possibly take the nontrivial correlation values in {-pm-1, -1, pm-1, 2pm-1,…,(3d-4)pm-1}. In Construction III, the new sequences possess the largest family size p(d-1)n and have at most 2d correlation levels {-pm-1, -1,pm-1, 2pm-1,…,(2d-2)pm-1}. Three constructions are near-optimal with respect to the Welch bound because the values of their Welch-Ratios are moderate, WR_??_d, WR_??_3d-4 and WR_??_2d-2, respectively. Each family in Constructions I, II and III contains a GMW sequence. In addition, Helleseth sequences and Niho sequences are special cases in Constructions I and III, and their restriction conditions to the integers m and n, pm≠2 (mod 3) and n≅0 (mod 4), respectively, are removed in our sequences. Our sequences in Construction III include the sequences with Niho type decimation 3·2m-2, too. Finally, some open questions are pointed out and an example that illustrates the performance of these sequences is given.
Fast discovery and visualization of conserved regions in DNA sequences using quasi-alignment

PubMed Central

2013-01-01

Background Next Generation Sequencing techniques are producing enormous amounts of biological sequence data and analysis becomes a major computational problem. Currently, most analysis, especially the identification of conserved regions, relies heavily on Multiple Sequence Alignment and its various heuristics such as progressive alignment, whose run time grows with the square of the number and the length of the aligned sequences and requires significant computational resources. In this work, we present a method to efficiently discover regions of high similarity across multiple sequences without performing expensive sequence alignment. The method is based on approximating edit distance between segments of sequences using p-mer frequency counts. Then, efficient high-throughput data stream clustering is used to group highly similar segments into so called quasi-alignments. Quasi-alignments have numerous applications such as identifying species and their taxonomic class from sequences, comparing sequences for similarities, and, as in this paper, discovering conserved regions across related sequences. Results In this paper, we show that quasi-alignments can be used to discover highly similar segments across multiple sequences from related or different genomes efficiently and accurately. Experiments on a large number of unaligned 16S rRNA sequences obtained from the Greengenes database show that the method is able to identify conserved regions which agree with known hypervariable regions in 16S rRNA. Furthermore, the experiments show that the proposed method scales well for large data sets with a run time that grows only linearly with the number and length of sequences, whereas for existing multiple sequence alignment heuristics the run time grows super-linearly. Conclusion Quasi-alignment-based algorithms can detect highly similar regions and conserved areas across multiple sequences. Since the run time is linear and the sequences are converted into a compact clustering model, we are able to identify conserved regions fast or even interactively using a standard PC. Our method has many potential applications such as finding characteristic signature sequences for families of organisms and studying conserved and variable regions in, for example, 16S rRNA. PMID:24564200
Fast discovery and visualization of conserved regions in DNA sequences using quasi-alignment.

PubMed

Nagar, Anurag; Hahsler, Michael

2013-01-01

Next Generation Sequencing techniques are producing enormous amounts of biological sequence data and analysis becomes a major computational problem. Currently, most analysis, especially the identification of conserved regions, relies heavily on Multiple Sequence Alignment and its various heuristics such as progressive alignment, whose run time grows with the square of the number and the length of the aligned sequences and requires significant computational resources. In this work, we present a method to efficiently discover regions of high similarity across multiple sequences without performing expensive sequence alignment. The method is based on approximating edit distance between segments of sequences using p-mer frequency counts. Then, efficient high-throughput data stream clustering is used to group highly similar segments into so called quasi-alignments. Quasi-alignments have numerous applications such as identifying species and their taxonomic class from sequences, comparing sequences for similarities, and, as in this paper, discovering conserved regions across related sequences. In this paper, we show that quasi-alignments can be used to discover highly similar segments across multiple sequences from related or different genomes efficiently and accurately. Experiments on a large number of unaligned 16S rRNA sequences obtained from the Greengenes database show that the method is able to identify conserved regions which agree with known hypervariable regions in 16S rRNA. Furthermore, the experiments show that the proposed method scales well for large data sets with a run time that grows only linearly with the number and length of sequences, whereas for existing multiple sequence alignment heuristics the run time grows super-linearly. Quasi-alignment-based algorithms can detect highly similar regions and conserved areas across multiple sequences. Since the run time is linear and the sequences are converted into a compact clustering model, we are able to identify conserved regions fast or even interactively using a standard PC. Our method has many potential applications such as finding characteristic signature sequences for families of organisms and studying conserved and variable regions in, for example, 16S rRNA.
Introduction of the hybcell-based compact sequencing technology and comparison to state-of-the-art methodologies for KRAS mutation detection.

PubMed

Zopf, Agnes; Raim, Roman; Danzer, Martin; Niklas, Norbert; Spilka, Rita; Pröll, Johannes; Gabriel, Christian; Nechansky, Andreas; Roucka, Markus

2015-03-01

The detection of KRAS mutations in codons 12 and 13 is critical for anti-EGFR therapy strategies; however, only those methodologies with high sensitivity, specificity, and accuracy as well as the best cost and turnaround balance are suitable for routine daily testing. Here we compared the performance of compact sequencing using the novel hybcell technology with 454 next-generation sequencing (454-NGS), Sanger sequencing, and pyrosequencing, using an evaluation panel of 35 specimens. A total of 32 mutations and 10 wild-type cases were reported using 454-NGS as the reference method. Specificity ranged from 100% for Sanger sequencing to 80% for pyrosequencing. Sanger sequencing and hybcell-based compact sequencing achieved a sensitivity of 96%, whereas pyrosequencing had a sensitivity of 88%. Accuracy was 97% for Sanger sequencing, 85% for pyrosequencing, and 94% for hybcell-based compact sequencing. Quantitative results were obtained for 454-NGS and hybcell-based compact sequencing data, resulting in a significant correlation (r = 0.914). Whereas pyrosequencing and Sanger sequencing were not able to detect multiple mutated cell clones within one tumor specimen, 454-NGS and the hybcell-based compact sequencing detected multiple mutations in two specimens. Our comparison shows that the hybcell-based compact sequencing is a valuable alternative to state-of-the-art methodologies used for detection of clinically relevant point mutations.
tRNADB-CE: tRNA gene database well-timed in the era of big sequence data.

PubMed

Abe, Takashi; Inokuchi, Hachiro; Yamada, Yuko; Muto, Akira; Iwasaki, Yuki; Ikemura, Toshimichi

2014-01-01

The tRNA gene data base curated by experts "tRNADB-CE" (http://trna.ie.niigata-u.ac.jp) was constructed by analyzing 1,966 complete and 5,272 draft genomes of prokaryotes, 171 viruses', 121 chloroplasts', and 12 eukaryotes' genomes plus fragment sequences obtained by metagenome studies of environmental samples. 595,115 tRNA genes in total, and thus two times of genes compiled previously, have been registered, for which sequence, clover-leaf structure, and results of sequence-similarity and oligonucleotide-pattern searches can be browsed. To provide collective knowledge with help from experts in tRNA researches, we added a column for enregistering comments to each tRNA. By grouping bacterial tRNAs with an identical sequence, we have found high phylogenetic preservation of tRNA sequences, especially at the phylum level. Since many species-unknown tRNAs from metagenomic sequences have sequences identical to those found in species-known prokaryotes, the identical sequence group (ISG) can provide phylogenetic markers to investigate the microbial community in an environmental ecosystem. This strategy can be applied to a huge amount of short sequences obtained from next-generation sequencers, as showing that tRNADB-CE is a well-timed database in the era of big sequence data. It is also discussed that batch-learning self-organizing-map with oligonucleotide composition is useful for efficient knowledge discovery from big sequence data.
Automated Sanger Analysis Pipeline (ASAP): A Tool for Rapidly Analyzing Sanger Sequencing Data with Minimum User Interference.

PubMed

Singh, Aditya; Bhatia, Prateek

2016-12-01

Sanger sequencing platforms, such as applied biosystems instruments, generate chromatogram files. Generally, for 1 region of a sequence, we use both forward and reverse primers to sequence that area, in that way, we have 2 sequences that need to be aligned and a consensus generated before mutation detection studies. This work is cumbersome and takes time, especially if the gene is large with many exons. Hence, we devised a rapid automated command system to filter, build, and align consensus sequences and also optionally extract exonic regions, translate them in all frames, and perform an amino acid alignment starting from raw sequence data within a very short time. In full capabilities of Automated Mutation Analysis Pipeline (ASAP), it is able to read "*.ab1" chromatogram files through command line interface, convert it to the FASTQ format, trim the low-quality regions, reverse-complement the reverse sequence, create a consensus sequence, extract the exonic regions using a reference exonic sequence, translate the sequence in all frames, and align the nucleic acid and amino acid sequences to reference nucleic acid and amino acid sequences, respectively. All files are created and can be used for further analysis. ASAP is available as Python 3.x executable at https://github.com/aditya-88/ASAP. The version described in this paper is 0.28.
TaxI: a software tool for DNA barcoding using distance methods

PubMed Central

Steinke, Dirk; Vences, Miguel; Salzburger, Walter; Meyer, Axel

2005-01-01

DNA barcoding is a promising approach to the diagnosis of biological diversity in which DNA sequences serve as the primary key for information retrieval. Most existing software for evolutionary analysis of DNA sequences was designed for phylogenetic analyses and, hence, those algorithms do not offer appropriate solutions for the rapid, but precise analyses needed for DNA barcoding, and are also unable to process the often large comparative datasets. We developed a flexible software tool for DNA taxonomy, named TaxI. This program calculates sequence divergences between a query sequence (taxon to be barcoded) and each sequence of a dataset of reference sequences defined by the user. Because the analysis is based on separate pairwise alignments this software is also able to work with sequences characterized by multiple insertions and deletions that are difficult to align in large sequence sets (i.e. thousands of sequences) by multiple alignment algorithms because of computational restrictions. Here, we demonstrate the utility of this approach with two datasets of fish larvae and juveniles from Lake Constance and juvenile land snails under different models of sequence evolution. Sets of ribosomal 16S rRNA sequences, characterized by multiple indels, performed as good as or better than cox1 sequence sets in assigning sequences to species, demonstrating the suitability of rRNA genes for DNA barcoding. PMID:16214755
Sequencing, Analysis, and Annotation of Expressed Sequence Tags for Camelus dromedarius

PubMed Central

Al-Swailem, Abdulaziz M.; Shehata, Maher M.; Abu-Duhier, Faisel M.; Al-Yamani, Essam J.; Al-Busadah, Khalid A.; Al-Arawi, Mohammed S.; Al-Khider, Ali Y.; Al-Muhaimeed, Abdullah N.; Al-Qahtani, Fahad H.; Manee, Manee M.; Al-Shomrani, Badr M.; Al-Qhtani, Saad M.; Al-Harthi, Amer S.; Akdemir, Kadir C.; Otu, Hasan H.

2010-01-01

Despite its economical, cultural, and biological importance, there has not been a large scale sequencing project to date for Camelus dromedarius. With the goal of sequencing complete DNA of the organism, we first established and sequenced camel EST libraries, generating 70,272 reads. Following trimming, chimera check, repeat masking, cluster and assembly, we obtained 23,602 putative gene sequences, out of which over 4,500 potentially novel or fast evolving gene sequences do not carry any homology to other available genomes. Functional annotation of sequences with similarities in nucleotide and protein databases has been obtained using Gene Ontology classification. Comparison to available full length cDNA sequences and Open Reading Frame (ORF) analysis of camel sequences that exhibit homology to known genes show more than 80% of the contigs with an ORF>300 bp and ∼40% hits extending to the start codons of full length cDNAs suggesting successful characterization of camel genes. Similarity analyses are done separately for different organisms including human, mouse, bovine, and rat. Accompanying web portal, CAGBASE (http://camel.kacst.edu.sa/), hosts a relational database containing annotated EST sequences and analysis tools with possibility to add sequences from public domain. We anticipate our results to provide a home base for genomic studies of camel and other comparative studies enabling a starting point for whole genome sequencing of the organism. PMID:20502665
Multiplexed fragaria chloroplast genome sequencing

Treesearch

W. Njuguna; A. Liston; R. Cronn; N.V. Bassil

2010-01-01

A method to sequence multiple chloroplast genomes using ultra high throughput sequencing technologies was recently described. Complete chloroplast genome sequences can resolve phylogenetic relationships at low taxonomic levels and identify informative point mutations and indels. The objective of this research was to sequence multiple Fragaria...
Detection of a divergent variant of grapevine virus F by next-generation sequencing.

PubMed

Molenaar, Nicholas; Burger, Johan T; Maree, Hans J

2015-08-01

The complete genome sequence of a South African isolate of grapevine virus F (GVF) is presented. It was first detected by metagenomic next-generation sequencing of field samples and validated through direct Sanger sequencing. The genome sequence of GVF isolate V5 consists of 7539 nucleotides and contains a poly(A) tail. It has a typical vitivirus genome arrangement that comprises five open reading frames (ORFs), which share only 88.96 % nucleotide sequence identity with the existing complete GVF genome sequence (JX105428).
Nuclear counterparts of the cytoplasmic mitochondrial 12S rRNA gene: a problem of ancient DNA and molecular phylogenies.

PubMed

van der Kuyl, A C; Kuiken, C L; Dekker, J T; Perizonius, W R; Goudsmit, J

1995-06-01

Monkey mummy bones and teeth originating from the North Saqqara Baboon Galleries (Egypt), soft tissue from a mummified baboon in a museum collection, and nineteenth/twentieth-century skin fragments from mangabeys were used for DNA extraction and PCR amplification of part of the mitochondrial 12S rRNA gene. Sequences aligning with the 12S rRNA gene were recovered but were only distantly related to contemporary monkey mitochondrial 12S rRNA sequences. However, many of these sequences were identical or closely related to human nuclear DNA sequences resembling mitochondrial 12S rRNA (isolated from a cell line depleted in mitochondria) and therefore have to be considered contamination. Subsequently in a separate study we were able to recover genuine mitochondrial 12S rRNA sequences from many extant species of nonhuman Old World primates and sequences closely resembling the human nuclear integrations. Analysis of all sequences by the neighbor-joining (NJ) method indicated that mitochondrial DNA sequences and their nuclear counterparts can be divided into two distinct clusters. One cluster contained all temporary cytoplasmic mitochondrial DNA sequences and approximately half of the monkey nuclear mitochondriallike sequences. A second cluster contained most human nuclear sequences and the other half of monkey nuclear sequences with a separate branch leading to human and gorilla mitochondrial and nuclear sequences. Sequences recovered from ancient materials were equally divided between the two clusters. These results constitute a warning for when working with ancient DNA or performing phylogenetic analysis using mitochondrial DNA as a target sequence: Nuclear counterparts of mitochondrial genes may lead to faulty interpretation of results.
Three-dimensional sampling perfection with application-optimised contrasts using a different flip angle evolutions sequence for routine imaging of the spine: preliminary experience

PubMed Central

Tins, B; Cassar-Pullicino, V; Haddaway, M; Nachtrab, U

2012-01-01

Objectives The bulk of spinal imaging is still performed with conventional two-dimensional sequences. This study assesses the suitability of three-dimensional sampling perfection with application-optimised contrasts using a different flip angle evolutions (SPACE) sequence for routine spinal imaging. Methods 62 MRI examinations of the spine were evaluated by 2 examiners in consensus for the depiction of anatomy and presence of artefact. We noted pathologies that might be missed using the SPACE sequence only or the SPACE and a sagittal T1 weighted sequence. The reference standards were sagittal and axial T1 weighted and T2 weighted sequences. At a later date the evaluation was repeated by one of the original examiners and an additional examiner. Results There was good agreement of the single evaluations and consensus evaluation for the conventional sequences: κ>0.8, confidence interval (CI)>0.6–1.0. For the SPACE sequence, depiction of anatomy was very good for 84% of cases, with high interobserver agreement, but there was poor interobserver agreement for other cases. For artefact assessment of SPACE, κ=0.92, CI=0.92–1.0. The SPACE sequence was superior to conventional sequences for depiction of anatomy and artefact resistance. The SPACE sequence occasionally missed bone marrow oedema. In conjunction with sagittal T1 weighted sequences, no abnormality was missed. The isotropic SPACE sequence was superior to conventional sequences in imaging difficult anatomy such as in scoliosis and spondylolysis. Conclusion The SPACE sequence allows excellent assessment of anatomy owing to high spatial resolution and resistance to artefact. The sensitivity for bone marrow abnormalities is limited. PMID:22374284
Three-dimensional sampling perfection with application-optimised contrasts using a different flip angle evolutions sequence for routine imaging of the spine: preliminary experience.

PubMed

Tins, B; Cassar-Pullicino, V; Haddaway, M; Nachtrab, U

2012-08-01

The bulk of spinal imaging is still performed with conventional two-dimensional sequences. This study assesses the suitability of three-dimensional sampling perfection with application-optimised contrasts using a different flip angle evolutions (SPACE) sequence for routine spinal imaging. 62 MRI examinations of the spine were evaluated by 2 examiners in consensus for the depiction of anatomy and presence of artefact. We noted pathologies that might be missed using the SPACE sequence only or the SPACE and a sagittal T(1) weighted sequence. The reference standards were sagittal and axial T(1) weighted and T(2) weighted sequences. At a later date the evaluation was repeated by one of the original examiners and an additional examiner. There was good agreement of the single evaluations and consensus evaluation for the conventional sequences: κ>0.8, confidence interval (CI)>0.6-1.0. For the SPACE sequence, depiction of anatomy was very good for 84% of cases, with high interobserver agreement, but there was poor interobserver agreement for other cases. For artefact assessment of SPACE, κ=0.92, CI=0.92-1.0. The SPACE sequence was superior to conventional sequences for depiction of anatomy and artefact resistance. The SPACE sequence occasionally missed bone marrow oedema. In conjunction with sagittal T(1) weighted sequences, no abnormality was missed. The isotropic SPACE sequence was superior to conventional sequences in imaging difficult anatomy such as in scoliosis and spondylolysis. The SPACE sequence allows excellent assessment of anatomy owing to high spatial resolution and resistance to artefact. The sensitivity for bone marrow abnormalities is limited.
ChromatoGate: A Tool for Detecting Base Mis-Calls in Multiple Sequence Alignments by Semi-Automatic Chromatogram Inspection

PubMed Central

Alachiotis, Nikolaos; Vogiatzi, Emmanouella; Pavlidis, Pavlos; Stamatakis, Alexandros

2013-01-01

Automated DNA sequencers generate chromatograms that contain raw sequencing data. They also generate data that translates the chromatograms into molecular sequences of A, C, G, T, or N (undetermined) characters. Since chromatogram translation programs frequently introduce errors, a manual inspection of the generated sequence data is required. As sequence numbers and lengths increase, visual inspection and manual correction of chromatograms and corresponding sequences on a per-peak and per-nucleotide basis becomes an error-prone, time-consuming, and tedious process. Here, we introduce ChromatoGate (CG), an open-source software that accelerates and partially automates the inspection of chromatograms and the detection of sequencing errors for bidirectional sequencing runs. To provide users full control over the error correction process, a fully automated error correction algorithm has not been implemented. Initially, the program scans a given multiple sequence alignment (MSA) for potential sequencing errors, assuming that each polymorphic site in the alignment may be attributed to a sequencing error with a certain probability. The guided MSA assembly procedure in ChromatoGate detects chromatogram peaks of all characters in an alignment that lead to polymorphic sites, given a user-defined threshold. The threshold value represents the sensitivity of the sequencing error detection mechanism. After this pre-filtering, the user only needs to inspect a small number of peaks in every chromatogram to correct sequencing errors. Finally, we show that correcting sequencing errors is important, because population genetic and phylogenetic inferences can be misled by MSAs with uncorrected mis-calls. Our experiments indicate that estimates of population mutation rates can be affected two- to three-fold by uncorrected errors. PMID:24688709
ChromatoGate: A Tool for Detecting Base Mis-Calls in Multiple Sequence Alignments by Semi-Automatic Chromatogram Inspection.

PubMed

Alachiotis, Nikolaos; Vogiatzi, Emmanouella; Pavlidis, Pavlos; Stamatakis, Alexandros

2013-01-01

Automated DNA sequencers generate chromatograms that contain raw sequencing data. They also generate data that translates the chromatograms into molecular sequences of A, C, G, T, or N (undetermined) characters. Since chromatogram translation programs frequently introduce errors, a manual inspection of the generated sequence data is required. As sequence numbers and lengths increase, visual inspection and manual correction of chromatograms and corresponding sequences on a per-peak and per-nucleotide basis becomes an error-prone, time-consuming, and tedious process. Here, we introduce ChromatoGate (CG), an open-source software that accelerates and partially automates the inspection of chromatograms and the detection of sequencing errors for bidirectional sequencing runs. To provide users full control over the error correction process, a fully automated error correction algorithm has not been implemented. Initially, the program scans a given multiple sequence alignment (MSA) for potential sequencing errors, assuming that each polymorphic site in the alignment may be attributed to a sequencing error with a certain probability. The guided MSA assembly procedure in ChromatoGate detects chromatogram peaks of all characters in an alignment that lead to polymorphic sites, given a user-defined threshold. The threshold value represents the sensitivity of the sequencing error detection mechanism. After this pre-filtering, the user only needs to inspect a small number of peaks in every chromatogram to correct sequencing errors. Finally, we show that correcting sequencing errors is important, because population genetic and phylogenetic inferences can be misled by MSAs with uncorrected mis-calls. Our experiments indicate that estimates of population mutation rates can be affected two- to three-fold by uncorrected errors.
PET-Tool: a software suite for comprehensive processing and managing of Paired-End diTag (PET) sequence data.

PubMed

Chiu, Kuo Ping; Wong, Chee-Hong; Chen, Qiongyu; Ariyaratne, Pramila; Ooi, Hong Sain; Wei, Chia-Lin; Sung, Wing-Kin Ken; Ruan, Yijun

2006-08-25

We recently developed the Paired End diTag (PET) strategy for efficient characterization of mammalian transcriptomes and genomes. The paired end nature of short PET sequences derived from long DNA fragments raised a new set of bioinformatics challenges, including how to extract PETs from raw sequence reads, and correctly yet efficiently map PETs to reference genome sequences. To accommodate and streamline data analysis of the large volume PET sequences generated from each PET experiment, an automated PET data process pipeline is desirable. We designed an integrated computation program package, PET-Tool, to automatically process PET sequences and map them to the genome sequences. The Tool was implemented as a web-based application composed of four modules: the Extractor module for PET extraction; the Examiner module for analytic evaluation of PET sequence quality; the Mapper module for locating PET sequences in the genome sequences; and the Project Manager module for data organization. The performance of PET-Tool was evaluated through the analyses of 2.7 million PET sequences. It was demonstrated that PET-Tool is accurate and efficient in extracting PET sequences and removing artifacts from large volume dataset. Using optimized mapping criteria, over 70% of quality PET sequences were mapped specifically to the genome sequences. With a 2.4 GHz LINUX machine, it takes approximately six hours to process one million PETs from extraction to mapping. The speed, accuracy, and comprehensiveness have proved that PET-Tool is an important and useful component in PET experiments, and can be extended to accommodate other related analyses of paired-end sequences. The Tool also provides user-friendly functions for data quality check and system for multi-layer data management.

Making sense of deep sequencing

PubMed Central

Goldman, D.; Domschke, K.

2016-01-01

This review, the first of an occasional series, tries to make sense of the concepts and uses of deep sequencing of polynucleic acids (DNA and RNA). Deep sequencing, synonymous with next-generation sequencing, high-throughput sequencing and massively parallel sequencing, includes whole genome sequencing but is more often and diversely applied to specific parts of the genome captured in different ways, for example the highly expressed portion of the genome known as the exome and portions of the genome that are epigenetically marked either by DNA methylation, the binding of proteins including histones, or that are in different configurations and thus more or less accessible to enzymes that cleave DNA. Deep sequencing of RNA (RNASeq) reverse-transcribed to complementary DNA is invaluable for measuring RNA expression and detecting changes in RNA structure. Important concepts in deep sequencing include the length and depth of sequence reads, mapping and assembly of reads, sequencing error, haplotypes, and the propensity of deep sequencing, as with other types of ‘big data’, to generate large numbers of errors, requiring monitoring for methodologic biases and strategies for replication and validation. Deep sequencing yields a unique genetic fingerprint that can be used to identify a person, and a trove of predictors of genetic medical diseases. Deep sequencing to identify epigenetic events including changes in DNA methylation and RNA expression can reveal the history and impact of environmental exposures. Because of the power of sequencing to identify and deliver biomedically significant information about a person and their blood relatives, it creates ethical dilemmas and practical challenges in research and clinical care, for example the decision and procedures to report incidental findings that will increasingly and frequently be discovered. PMID:24925306
A statistical method for the detection of variants from next-generation resequencing of DNA pools.

PubMed

Bansal, Vikas

2010-06-15

Next-generation sequencing technologies have enabled the sequencing of several human genomes in their entirety. However, the routine resequencing of complete genomes remains infeasible. The massive capacity of next-generation sequencers can be harnessed for sequencing specific genomic regions in hundreds to thousands of individuals. Sequencing-based association studies are currently limited by the low level of multiplexing offered by sequencing platforms. Pooled sequencing represents a cost-effective approach for studying rare variants in large populations. To utilize the power of DNA pooling, it is important to accurately identify sequence variants from pooled sequencing data. Detection of rare variants from pooled sequencing represents a different challenge than detection of variants from individual sequencing. We describe a novel statistical approach, CRISP [Comprehensive Read analysis for Identification of Single Nucleotide Polymorphisms (SNPs) from Pooled sequencing] that is able to identify both rare and common variants by using two approaches: (i) comparing the distribution of allele counts across multiple pools using contingency tables and (ii) evaluating the probability of observing multiple non-reference base calls due to sequencing errors alone. Information about the distribution of reads between the forward and reverse strands and the size of the pools is also incorporated within this framework to filter out false variants. Validation of CRISP on two separate pooled sequencing datasets generated using the Illumina Genome Analyzer demonstrates that it can detect 80-85% of SNPs identified using individual sequencing while achieving a low false discovery rate (3-5%). Comparison with previous methods for pooled SNP detection demonstrates the significantly lower false positive and false negative rates for CRISP. Implementation of this method is available at http://polymorphism.scripps.edu/~vbansal/software/CRISP/.
Sequence modelling and an extensible data model for genomic database

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Peter Wei-Der

1992-01-01

The Human Genome Project (HGP) plans to sequence the human genome by the beginning of the next century. It will generate DNA sequences of more than 10 billion bases and complex marker sequences (maps) of more than 100 million markers. All of these information will be stored in database management systems (DBMSs). However, existing data models do not have the abstraction mechanism for modelling sequences and existing DBMS's do not have operations for complex sequences. This work addresses the problem of sequence modelling in the context of the HGP and the more general problem of an extensible object data modelmore » that can incorporate the sequence model as well as existing and future data constructs and operators. First, we proposed a general sequence model that is application and implementation independent. This model is used to capture the sequence information found in the HGP at the conceptual level. In addition, abstract and biological sequence operators are defined for manipulating the modelled sequences. Second, we combined many features of semantic and object oriented data models into an extensible framework, which we called the Extensible Object Model'', to address the need of a modelling framework for incorporating the sequence data model with other types of data constructs and operators. This framework is based on the conceptual separation between constructors and constraints. We then used this modelling framework to integrate the constructs for the conceptual sequence model. The Extensible Object Model is also defined with a graphical representation, which is useful as a tool for database designers. Finally, we defined a query language to support this model and implement the query processor to demonstrate the feasibility of the extensible framework and the usefulness of the conceptual sequence model.« less
Sequence modelling and an extensible data model for genomic database

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Peter Wei-Der

1992-01-01

The Human Genome Project (HGP) plans to sequence the human genome by the beginning of the next century. It will generate DNA sequences of more than 10 billion bases and complex marker sequences (maps) of more than 100 million markers. All of these information will be stored in database management systems (DBMSs). However, existing data models do not have the abstraction mechanism for modelling sequences and existing DBMS`s do not have operations for complex sequences. This work addresses the problem of sequence modelling in the context of the HGP and the more general problem of an extensible object data modelmore » that can incorporate the sequence model as well as existing and future data constructs and operators. First, we proposed a general sequence model that is application and implementation independent. This model is used to capture the sequence information found in the HGP at the conceptual level. In addition, abstract and biological sequence operators are defined for manipulating the modelled sequences. Second, we combined many features of semantic and object oriented data models into an extensible framework, which we called the ``Extensible Object Model``, to address the need of a modelling framework for incorporating the sequence data model with other types of data constructs and operators. This framework is based on the conceptual separation between constructors and constraints. We then used this modelling framework to integrate the constructs for the conceptual sequence model. The Extensible Object Model is also defined with a graphical representation, which is useful as a tool for database designers. Finally, we defined a query language to support this model and implement the query processor to demonstrate the feasibility of the extensible framework and the usefulness of the conceptual sequence model.« less
The fast changing landscape of sequencing technologies and their impact on microbial genome assemblies and annotation.

PubMed

Mavromatis, Konstantinos; Land, Miriam L; Brettin, Thomas S; Quest, Daniel J; Copeland, Alex; Clum, Alicia; Goodwin, Lynne; Woyke, Tanja; Lapidus, Alla; Klenk, Hans Peter; Cottingham, Robert W; Kyrpides, Nikos C

2012-01-01

The emergence of next generation sequencing (NGS) has provided the means for rapid and high throughput sequencing and data generation at low cost, while concomitantly creating a new set of challenges. The number of available assembled microbial genomes continues to grow rapidly and their quality reflects the quality of the sequencing technology used, but also of the analysis software employed for assembly and annotation. In this work, we have explored the quality of the microbial draft genomes across various sequencing technologies. We have compared the draft and finished assemblies of 133 microbial genomes sequenced at the Department of Energy-Joint Genome Institute and finished at the Los Alamos National Laboratory using a variety of combinations of sequencing technologies, reflecting the transition of the institute from Sanger-based sequencing platforms to NGS platforms. The quality of the public assemblies and of the associated gene annotations was evaluated using various metrics. Results obtained with the different sequencing technologies, as well as their effects on downstream processes, were analyzed. Our results demonstrate that the Illumina HiSeq 2000 sequencing system, the primary sequencing technology currently used for de novo genome sequencing and assembly at JGI, has various advantages in terms of total sequence throughput and cost, but it also introduces challenges for the downstream analyses. In all cases assembly results although on average are of high quality, need to be viewed critically and consider sources of errors in them prior to analysis. These data follow the evolution of microbial sequencing and downstream processing at the JGI from draft genome sequences with large gaps corresponding to missing genes of significant biological role to assemblies with multiple small gaps (Illumina) and finally to assemblies that generate almost complete genomes (Illumina+PacBio).
International interlaboratory study comparing single organism 16S rRNA gene sequencing data: Beyond consensus sequence comparisons

PubMed Central

Olson, Nathan D.; Lund, Steven P.; Zook, Justin M.; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S.; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B.

2015-01-01

This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies. PMID:27077030
Partial gene sequences for the A subunit of methyl-coenzyme M reductase (mcrI) as a phylogenetic tool for the family Methanosarcinaceae

NASA Technical Reports Server (NTRS)

Springer, E.; Sachs, M. S.; Woese, C. R.; Boone, D. R.

1995-01-01

Representatives of the family Methanosarcinaceae were analyzed phylogenetically by comparing partial sequences of their methyl-coenzyme M reductase (mcrI) genes. A 490-bp fragment from the A subunit of the gene was selected, amplified by the PCR, cloned, and sequenced for each of 25 strains belonging to the Methanosarcinaceae. The sequences obtained were aligned with the corresponding portions of five previously published sequences, and all of the sequences were compared to determine phylogenetic distances by Fitch distance matrix methods. We prepared analogous trees based on 16S rRNA sequences; these trees corresponded closely to the mcrI trees, although the mcrI sequences of pairs of organisms had 3.01 +/- 0.541 times more changes than the respective pairs of 16S rRNA sequences, suggesting that the mcrI fragment evolved about three times more rapidly than the 16S rRNA gene. The qualitative similarity of the mcrI and 16S rRNA trees suggests that transfer of genetic information between dissimilar organisms has not significantly affected these sequences, although we found inconsistencies between some mcrI distances that we measured and and previously published DNA reassociation data. It is unlikely that multiple mcrI isogenes were present in the organisms that we examined, because we found no major discrepancies in multiple determinations of mcrI sequences from the same organism. Our primers for the PCR also match analogous sites in the previously published mcrII sequences, but all of the sequences that we obtained from members of the Methanosarcinaceae were more closely related to mcrI sequences than to mcrII sequences, suggesting that members of the Methanosarcinaceae do not have distinct mcrII genes.
DNA sequence analysis of ARS elements from chromosome III of Saccharomyces cerevisiae: identification of a new conserved sequence.

PubMed Central

Palzkill, T G; Oliver, S G; Newlon, C S

1986-01-01

Four fragments of Saccharomyces cerevisiae chromosome III DNA which carry ARS elements have been sequenced. Each fragment contains multiple copies of sequences that have at least 10 out of 11 bases of homology to a previously reported 11 bp core consensus sequence. A survey of these new ARS sequences and previously reported sequences revealed the presence of an additional 11 bp conserved element located on the 3' side of the T-rich strand of the core consensus. Subcloning analysis as well as deletion and transposon insertion mutagenesis of ARS fragments support a role for 3' conserved sequence in promoting ARS activity. PMID:3529036
Single-cell genomic sequencing using Multiple Displacement Amplification.

PubMed

Lasken, Roger S

2007-10-01

Single microbial cells can now be sequenced using DNA amplified by the Multiple Displacement Amplification (MDA) reaction. The few femtograms of DNA in a bacterium are amplified into micrograms of high molecular weight DNA suitable for DNA library construction and Sanger sequencing. The MDA-generated DNA also performs well when used directly as template for pyrosequencing by the 454 Life Sciences method. While MDA from single cells loses some of the genomic sequence, this approach will greatly accelerate the pace of sequencing from uncultured microbes. The genetically linked sequences from single cells are also a powerful tool to be used in guiding genomic assembly of shotgun sequences of multiple organisms from environmental DNA extracts (metagenomic sequences).
MALDI Top-Down sequencing: calling N- and C-terminal protein sequences with high confidence and speed.

PubMed

Suckau, Detlev; Resemann, Anja

2009-12-01

The ability to match Top-Down protein sequencing (TDS) results by MALDI-TOF to protein sequences by classical protein database searching was evaluated in this work. Resulting from these analyses were the protein identity, the simultaneous assignment of the N- and C-termini and protein sequences of up to 70 residues from either terminus. In combination with de novo sequencing using the MALDI-TDS data, even fusion proteins were assigned and the detailed sequence around the fusion site was elucidated. MALDI-TDS allowed to efficiently match protein sequences quickly and to validate recombinant protein structures-in particular, protein termini-on the level of undigested proteins.
Computational analysis of sequence selection mechanisms.

PubMed

Meyerguz, Leonid; Grasso, Catherine; Kleinberg, Jon; Elber, Ron

2004-04-01

Mechanisms leading to gene variations are responsible for the diversity of species and are important components of the theory of evolution. One constraint on gene evolution is that of protein foldability; the three-dimensional shapes of proteins must be thermodynamically stable. We explore the impact of this constraint and calculate properties of foldable sequences using 3660 structures from the Protein Data Bank. We seek a selection function that receives sequences as input, and outputs survival probability based on sequence fitness to structure. We compute the number of sequences that match a particular protein structure with energy lower than the native sequence, the density of the number of sequences, the entropy, and the "selection" temperature. The mechanism of structure selection for sequences longer than 200 amino acids is approximately universal. For shorter sequences, it is not. We speculate on concrete evolutionary mechanisms that show this behavior.
Nonparametric Combinatorial Sequence Models

NASA Astrophysics Data System (ADS)

Wauthier, Fabian L.; Jordan, Michael I.; Jojic, Nebojsa

This work considers biological sequences that exhibit combinatorial structures in their composition: groups of positions of the aligned sequences are "linked" and covary as one unit across sequences. If multiple such groups exist, complex interactions can emerge between them. Sequences of this kind arise frequently in biology but methodologies for analyzing them are still being developed. This paper presents a nonparametric prior on sequences which allows combinatorial structures to emerge and which induces a posterior distribution over factorized sequence representations. We carry out experiments on three sequence datasets which indicate that combinatorial structures are indeed present and that combinatorial sequence models can more succinctly describe them than simpler mixture models. We conclude with an application to MHC binding prediction which highlights the utility of the posterior distribution induced by the prior. By integrating out the posterior our method compares favorably to leading binding predictors.
Real-Time DNA Sequencing in the Antarctic Dry Valleys Using the Oxford Nanopore Sequencer

PubMed Central

Johnson, Sarah S.; Zaikova, Elena; Goerlitz, David S.; Bai, Yu; Tighe, Scott W.

2017-01-01

The ability to sequence DNA outside of the laboratory setting has enabled novel research questions to be addressed in the field in diverse areas, ranging from environmental microbiology to viral epidemics. Here, we demonstrate the application of offline DNA sequencing of environmental samples using a hand-held nanopore sequencer in a remote field location: the McMurdo Dry Valleys, Antarctica. Sequencing was performed using a MK1B MinION sequencer from Oxford Nanopore Technologies (ONT; Oxford, United Kingdom) that was equipped with software to operate without internet connectivity. One-direction (1D) genomic libraries were prepared using portable field techniques on DNA isolated from desiccated microbial mats. By adequately insulating the sequencer and laptop, it was possible to run the sequencing protocol for up to 2½ h under arduous conditions. PMID:28337073
WebLogo: A Sequence Logo Generator

PubMed Central

Crooks, Gavin E.; Hon, Gary; Chandonia, John-Marc; Brenner, Steven E.

2004-01-01

WebLogo generates sequence logos, graphical representations of the patterns within a multiple sequence alignment. Sequence logos provide a richer and more precise description of sequence similarity than consensus sequences and can rapidly reveal significant features of the alignment otherwise difficult to perceive. Each logo consists of stacks of letters, one stack for each position in the sequence. The overall height of each stack indicates the sequence conservation at that position (measured in bits), whereas the height of symbols within the stack reflects the relative frequency of the corresponding amino or nucleic acid at that position. WebLogo has been enhanced recently with additional features and options, to provide a convenient and highly configurable sequence logo generator. A command line interface and the complete, open WebLogo source code are available for local installation and customization. PMID:15173120
Single-cell genome sequencing at ultra-high-throughput with microfluidic droplet barcoding.

PubMed

Lan, Freeman; Demaree, Benjamin; Ahmed, Noorsher; Abate, Adam R

2017-07-01

The application of single-cell genome sequencing to large cell populations has been hindered by technical challenges in isolating single cells during genome preparation. Here we present single-cell genomic sequencing (SiC-seq), which uses droplet microfluidics to isolate, fragment, and barcode the genomes of single cells, followed by Illumina sequencing of pooled DNA. We demonstrate ultra-high-throughput sequencing of >50,000 cells per run in a synthetic community of Gram-negative and Gram-positive bacteria and fungi. The sequenced genomes can be sorted in silico based on characteristic sequences. We use this approach to analyze the distributions of antibiotic-resistance genes, virulence factors, and phage sequences in microbial communities from an environmental sample. The ability to routinely sequence large populations of single cells will enable the de-convolution of genetic heterogeneity in diverse cell populations.
A measurement of disorder in binary sequences

NASA Astrophysics Data System (ADS)

Gong, Longyan; Wang, Haihong; Cheng, Weiwen; Zhao, Shengmei

2015-03-01

We propose a complex quantity, AL, to characterize the degree of disorder of L-length binary symbolic sequences. As examples, we respectively apply it to typical random and deterministic sequences. One kind of random sequences is generated from a periodic binary sequence and the other is generated from the logistic map. The deterministic sequences are the Fibonacci and Thue-Morse sequences. In these analyzed sequences, we find that the modulus of AL, denoted by |AL | , is a (statistically) equivalent quantity to the Boltzmann entropy, the metric entropy, the conditional block entropy and/or other quantities, so it is a useful quantitative measure of disorder. It can be as a fruitful index to discern which sequence is more disordered. Moreover, there is one and only one value of |AL | for the overall disorder characteristics. It needs extremely low computational costs. It can be easily experimentally realized. From all these mentioned, we believe that the proposed measure of disorder is a valuable complement to existing ones in symbolic sequences.
Quantitative phenotyping via deep barcode sequencing

PubMed Central

Smith, Andrew M.; Heisler, Lawrence E.; Mellor, Joseph; Kaper, Fiona; Thompson, Michael J.; Chee, Mark; Roth, Frederick P.; Giaever, Guri; Nislow, Corey

2009-01-01

Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or “Bar-seq,” outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied to a complex chemogenomic assay, Bar-seq quantitatively identifies drug targets, with performance superior to the benchmark microarray assay. We also show that Bar-seq is well-suited for a multiplex format. We completely re-sequenced and re-annotated the yeast deletion collection using deep sequencing, found that ∼20% of the barcodes and common priming sequences varied from expectation, and used this revised list of barcode sequences to improve data quality. Together, this new assay and analysis routine provide a deep-sequencing-based toolkit for identifying gene–environment interactions on a genome-wide scale. PMID:19622793
Resurgence of Integrated Behavioral Units

PubMed Central

Bachá-Méndez, Gustavo; Reid, Alliston K; Mendoza-Soylovna, Adela

2007-01-01

Two experiments with rats examined the dynamics of well-learned response sequences when reinforcement contingencies were changed. Both experiments contained four phases, each of which reinforced a 2-response sequence of lever presses until responding was stable. The contingencies then were shifted to a new reinforced sequence until responding was again stable. Extinction-induced resurgence of previously reinforced, and then extinguished, heterogeneous response sequences was observed in all subjects in both experiments. These sequences were demonstrated to be integrated behavioral units, controlled by processes acting at the level of the entire sequence. Response-level processes were also simultaneously operative. Errors in sequence production were strongly influenced by the terminal, not the initial, response in the currently reinforced sequence, but not by the previously reinforced sequence. These studies demonstrate that sequence-level and response-level processes can operate simultaneously in integrated behavioral units. Resurgence and the development of integrated behavioral units may be dissociated; thus the observation of one does not necessarily imply the other. PMID:17345948
Effect of Next-Generation Exome Sequencing Depth for Discovery of Diagnostic Variants.

PubMed

Kim, Kyung; Seong, Moon-Woo; Chung, Won-Hyong; Park, Sung Sup; Leem, Sangseob; Park, Won; Kim, Jihyun; Lee, KiYoung; Park, Rae Woong; Kim, Namshin

2015-06-01

Sequencing depth, which is directly related to the cost and time required for the generation, processing, and maintenance of next-generation sequencing data, is an important factor in the practical utilization of such data in clinical fields. Unfortunately, identifying an exome sequencing depth adequate for clinical use is a challenge that has not been addressed extensively. Here, we investigate the effect of exome sequencing depth on the discovery of sequence variants for clinical use. Toward this, we sequenced ten germ-line blood samples from breast cancer patients on the Illumina platform GAII(x) at a high depth of ~200×. We observed that most function-related diverse variants in the human exonic regions could be detected at a sequencing depth of 120×. Furthermore, investigation using a diagnostic gene set showed that the number of clinical variants identified using exome sequencing reached a plateau at an average sequencing depth of about 120×. Moreover, the phenomena were consistent across the breast cancer samples.
Carbohydrate degrading polypeptide and uses thereof

DOEpatents

Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

2015-10-20

The invention relates to a polypeptide having carbohydrate material degrading activity which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 4, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional protein and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

High-Resolution Sequence-Function Mapping of Full-Length Proteins

PubMed Central

Kowalsky, Caitlin A.; Klesmith, Justin R.; Stapleton, James A.; Kelly, Vince; Reichkitzer, Nolan; Whitehead, Timothy A.

2015-01-01

Comprehensive sequence-function mapping involves detailing the fitness contribution of every possible single mutation to a gene by comparing the abundance of each library variant before and after selection for the phenotype of interest. Deep sequencing of library DNA allows frequency reconstruction for tens of thousands of variants in a single experiment, yet short read lengths of current sequencers makes it challenging to probe genes encoding full-length proteins. Here we extend the scope of sequence-function maps to entire protein sequences with a modular, universal sequence tiling method. We demonstrate the approach with both growth-based selections and FACS screening, offer parameters and best practices that simplify design of experiments, and present analytical solutions to normalize data across independent selections. Using this protocol, sequence-function maps covering full sequences can be obtained in four to six weeks. Best practices introduced in this manuscript are fully compatible with, and complementary to, other recently published sequence-function mapping protocols. PMID:25790064
On the joint spectral density of bivariate random sequences. Thesis Technical Report No. 21

NASA Technical Reports Server (NTRS)

Aalfs, David D.

1995-01-01

For univariate random sequences, the power spectral density acts like a probability density function of the frequencies present in the sequence. This dissertation extends that concept to bivariate random sequences. For this purpose, a function called the joint spectral density is defined that represents a joint probability weighing of the frequency content of pairs of random sequences. Given a pair of random sequences, the joint spectral density is not uniquely determined in the absence of any constraints. Two approaches to constraining the sequences are suggested: (1) assume the sequences are the margins of some stationary random field, (2) assume the sequences conform to a particular model that is linked to the joint spectral density. For both approaches, the properties of the resulting sequences are investigated in some detail, and simulation is used to corroborate theoretical results. It is concluded that under either of these two constraints, the joint spectral density can be computed from the non-stationary cross-correlation.
Applications of Single-Cell Sequencing for Multiomics.

PubMed

Xu, Yungang; Zhou, Xiaobo

2018-01-01

Single-cell sequencing interrogates the sequence or chromatin information from individual cells with advanced next-generation sequencing technologies. It provides a higher resolution of cellular differences and a better understanding of the underlying genetic and epigenetic mechanisms of an individual cell in the context of its survival and adaptation to microenvironment. However, it is more challenging to perform single-cell sequencing and downstream data analysis, owing to the minimal amount of starting materials, sample loss, and contamination. In addition, due to the picogram level of the amount of nucleic acids used, heavy amplification is often needed during sample preparation of single-cell sequencing, resulting in the uneven coverage, noise, and inaccurate quantification of sequencing data. All these unique properties raise challenges in and thus high demands for computational methods that specifically fit single-cell sequencing data. We here comprehensively survey the current strategies and challenges for multiple single-cell sequencing, including single-cell transcriptome, genome, and epigenome, beginning with a brief introduction to multiple sequencing techniques for single cells.
Innovative /ye/ and /we/ sequences in recent loans in Japanese

NASA Astrophysics Data System (ADS)

Vance, Timothy; Matsugu, Yuka

2005-04-01

The GV sequences /ye/ and /we/ do not occur in Japanese except perhaps in recent loans. Katakana spellings of the relevant loans in authoritative dictionaries are inconsistent, and it is not clear whether native speakers treat them as containing the GV sequences /ye/ and /we/ or as containing the VV sequences /ie/ and /ue/. Native speakers of Japanese with minimal exposure to spoken English were recorded producing some relevant loans in response to picture prompts. The same speakers were also recorded producing some native words containing uncontroversial /ie/ and /ue/ sequences. All the productions are being analyzed acoustically to determine whether they show the expected contrast between GV and VV sequences. A VV sequence is disyllabic (and bimoraic) and should therefore have greater duration and more gradual formant movements than a monosyllabic (and monomoraic) GV sequence. Utterance-initially, a VV sequence should have a LH pitch pattern and should be preceded by a nondistinctive glottal stop, whereas a GV sequence should have a H pitch pattern and should have smooth onset.
[The principle and application of the single-molecule real-time sequencing technology].

PubMed

Yanhu, Liu; Lu, Wang; Li, Yu

2015-03-01

Last decade witnessed the explosive development of the third-generation sequencing strategy, including single-molecule real-time sequencing (SMRT), true single-molecule sequencing (tSMSTM) and the single-molecule nanopore DNA sequencing. In this review, we summarize the principle, performance and application of the SMRT sequencing technology. Compared with the traditional Sanger method and the next-generation sequencing (NGS) technologies, the SMRT approach has several advantages, including long read length, high speed, PCR-free and the capability of direct detection of epigenetic modiﬁcations. However, the disadvantage of its low accuracy, most of which resulted from insertions and deletions, is also notable. So, the raw sequence data need to be corrected before assembly. Up to now, the SMRT is a good fit for applications in the de novo genomic sequencing and the high-quality assemblies of small genomes. In the future, it is expected to play an important role in epigenetics, transcriptomic sequencing, and assemblies of large genomes.
Life Cycle Evolution and Systematics of Campanulariid Hydrozoans

DTIC Science & Technology

2004-09-01

kit according to manufacturer’s protocol. Purified PCR product was cycle-sequenced using either Big Dye 2 or 3 sequencing chemistry (ABI), following...ethidium bromide and purified with PCR purification kits (Qiagen). Purified products were cycle- sequenced with either Big Dye 2 or 3 sequencing chemistry...PCR purification kit (Qiagen). The purified product was cycle-sequenced using Big Dye 2 sequencing chemistry (ABI) following the manufacturer’s
What is a melody? On the relationship between pitch and brightness of timbre

PubMed Central

Cousineau, Marion; Carcagno, Samuele; Demany, Laurent; Pressnitzer, Daniel

2014-01-01

Previous studies showed that the perceptual processing of sound sequences is more efficient when the sounds vary in pitch than when they vary in loudness. We show here that sequences of sounds varying in brightness of timbre are processed with the same efficiency as pitch sequences. The sounds used consisted of two simultaneous pure tones one octave apart, and the listeners’ task was to make same/different judgments on pairs of sequences varying in length (one, two, or four sounds). In one condition, brightness of timbre was varied within the sequences by changing the relative level of the two pure tones. In other conditions, pitch was varied by changing fundamental frequency, or loudness was varied by changing the overall level. In all conditions, only two possible sounds could be used in a given sequence, and these two sounds were equally discriminable. When sequence length increased from one to four, discrimination performance decreased substantially for loudness sequences, but to a smaller extent for brightness sequences and pitch sequences. In the latter two conditions, sequence length had a similar effect on performance. These results suggest that the processes dedicated to pitch and brightness analysis, when probed with a sequence-discrimination task, share unexpected similarities. PMID:24478638
Microbial community analysis of the hypersaline water of the Dead Sea using high-throughput amplicon sequencing.

PubMed

Jacob, Jacob H; Hussein, Emad I; Shakhatreh, Muhamad Ali K; Cornelison, Christopher T

2017-10-01

Amplicon sequencing using next-generation technology (bTEFAP ® ) has been utilized in describing the diversity of Dead Sea microbiota. The investigated area is a well-known salt lake in the western part of Jordan found in the lowest geographical location in the world (more than 420 m below sea level) and characterized by extreme salinity (approximately, 34%) in addition to other extreme conditions (low pH, unique ionic composition different from sea water). DNA was extracted from Dead Sea water. A total of 314,310 small subunit RNA (SSU rRNA) sequences were parsed, and 288,452 sequences were then clustered. For alpha diversity analysis, sample was rarefied to 3,000 sequences. The Shannon-Wiener index curve plot reached a plateau at approximately 3,000 sequences indicating that sequencing depth was sufficient to capture the full scope of microbial diversity. Archaea was found to be dominating the sequences (52%), whereas Bacteria constitute 45% of the sequences. Altogether, prokaryotic sequences (which constitute 97% of all sequences) were found to predominate. The findings expand on previous studies by using high-throughput amplicon sequencing to describe the microbial community in an environment which in recent years has been shown to hide some interesting diversity. © 2017 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.
JANE: efficient mapping of prokaryotic ESTs and variable length sequence reads on related template genomes

PubMed Central

2009-01-01

Background ESTs or variable sequence reads can be available in prokaryotic studies well before a complete genome is known. Use cases include (i) transcriptome studies or (ii) single cell sequencing of bacteria. Without suitable software their further analysis and mapping would have to await finalization of the corresponding genome. Results The tool JANE rapidly maps ESTs or variable sequence reads in prokaryotic sequencing and transcriptome efforts to related template genomes. It provides an easy-to-use graphics interface for information retrieval and a toolkit for EST or nucleotide sequence function prediction. Furthermore, we developed for rapid mapping an enhanced sequence alignment algorithm which reassembles and evaluates high scoring pairs provided from the BLAST algorithm. Rapid assembly on and replacement of the template genome by sequence reads or mapped ESTs is achieved. This is illustrated (i) by data from Staphylococci as well as from a Blattabacteria sequencing effort, (ii) mapping single cell sequencing reads is shown for poribacteria to sister phylum representative Rhodopirellula Baltica SH1. The algorithm has been implemented in a web-server accessible at http://jane.bioapps.biozentrum.uni-wuerzburg.de. Conclusion Rapid prokaryotic EST mapping or mapping of sequence reads is achieved applying JANE even without knowing the cognate genome sequence. PMID:19943962
Sedimentary sequence evolution in a Foredeep basin: Eastern Venezuela

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bejarano, C.; Funes, D.; Sarzalho, S.

1996-08-01

Well log-seismic sequence stratigraphy analysis in the Eastern Venezuela Foreland Basin leads to study of the evolution of sedimentary sequences onto the Cretaceous-Paleocene passive margin. This basin comprises two different foredeep sub-basins: The Guarico subbasin to the west, older, and the Maturin sub-basin to the east, younger. A foredeep switching between these two sub-basins is observed at 12.5 m.y. Seismic interpretation and well log sections across the study area show sedimentary sequences with transgressive sands and coastal onlaps to the east-southeast for the Guarico sub-basin, as well as truncations below the switching sequence (12.5 m.y.), and the Maturin sub-basin showsmore » apparent coastal onlaps to the west-northwest, as well as a marine onlap (deeper water) in the west, where it starts to establish. Sequence stratigraphy analysis of these sequences with well logs allowed the study of the evolution of stratigraphic section from Paleocene to middle Miocene (68.0-12.0 m.y.). On the basis of well log patterns, the sequences were divided in regressive-transgressive-regressive sedimentary cycles caused by changes in relative sea level. Facies distributions were analyzed and the sequences were divided into simple sequences or sub- sequences of a greater frequencies than third order depositional sequences.« less
FOUNTAIN: A JAVA open-source package to assist large sequencing projects

PubMed Central

Buerstedde, Jean-Marie; Prill, Florian

2001-01-01

Background Better automation, lower cost per reaction and a heightened interest in comparative genomics has led to a dramatic increase in DNA sequencing activities. Although the large sequencing projects of specialized centers are supported by in-house bioinformatics groups, many smaller laboratories face difficulties managing the appropriate processing and storage of their sequencing output. The challenges include documentation of clones, templates and sequencing reactions, and the storage, annotation and analysis of the large number of generated sequences. Results We describe here a new program, named FOUNTAIN, for the management of large sequencing projects . FOUNTAIN uses the JAVA computer language and data storage in a relational database. Starting with a collection of sequencing objects (clones), the program generates and stores information related to the different stages of the sequencing project using a web browser interface for user input. The generated sequences are subsequently imported and annotated based on BLAST searches against the public databases. In addition, simple algorithms to cluster sequences and determine putative polymorphic positions are implemented. Conclusions A simple, but flexible and scalable software package is presented to facilitate data generation and storage for large sequencing projects. Open source and largely platform and database independent, we wish FOUNTAIN to be improved and extended in a community effort. PMID:11591214
Sequence-specific bias correction for RNA-seq data using recurrent neural networks.

PubMed

Zhang, Yao-Zhong; Yamaguchi, Rui; Imoto, Seiya; Miyano, Satoru

2017-01-25

The recent success of deep learning techniques in machine learning and artificial intelligence has stimulated a great deal of interest among bioinformaticians, who now wish to bring the power of deep learning to bare on a host of bioinformatical problems. Deep learning is ideally suited for biological problems that require automatic or hierarchical feature representation for biological data when prior knowledge is limited. In this work, we address the sequence-specific bias correction problem for RNA-seq data redusing Recurrent Neural Networks (RNNs) to model nucleotide sequences without pre-determining sequence structures. The sequence-specific bias of a read is then calculated based on the sequence probabilities estimated by RNNs, and used in the estimation of gene abundance. We explore the application of two popular RNN recurrent units for this task and demonstrate that RNN-based approaches provide a flexible way to model nucleotide sequences without knowledge of predetermined sequence structures. Our experiments show that training a RNN-based nucleotide sequence model is efficient and RNN-based bias correction methods compare well with the-state-of-the-art sequence-specific bias correction method on the commonly used MAQC-III data set. RNNs provides an alternative and flexible way to calculate sequence-specific bias without explicitly pre-determining sequence structures.
Intra-Genomic Internal Transcribed Spacer Region Sequence Heterogeneity and Molecular Diagnosis in Clinical Microbiology.

PubMed

Zhao, Ying; Tsang, Chi-Ching; Xiao, Meng; Cheng, Jingwei; Xu, Yingchun; Lau, Susanna K P; Woo, Patrick C Y

2015-10-22

Internal transcribed spacer region (ITS) sequencing is the most extensively used technology for accurate molecular identification of fungal pathogens in clinical microbiology laboratories. Intra-genomic ITS sequence heterogeneity, which makes fungal identification based on direct sequencing of PCR products difficult, has rarely been reported in pathogenic fungi. During the process of performing ITS sequencing on 71 yeast strains isolated from various clinical specimens, direct sequencing of the PCR products showed ambiguous sequences in six of them. After cloning the PCR products into plasmids for sequencing, interpretable sequencing electropherograms could be obtained. For each of the six isolates, 10-49 clones were selected for sequencing and two to seven intra-genomic ITS copies were detected. The identities of these six isolates were confirmed to be Candida glabrata (n=2), Pichia (Candida) norvegensis (n=2), Candida tropicalis (n=1) and Saccharomyces cerevisiae (n=1). Multiple sequence alignment revealed that one to four intra-genomic ITS polymorphic sites were present in the six isolates, and all these polymorphic sites were located in the ITS1 and/or ITS2 regions. We report and describe the first evidence of intra-genomic ITS sequence heterogeneity in four different pathogenic yeasts, which occurred exclusively in the ITS1 and ITS2 spacer regions for the six isolates in this study.
Intra-Genomic Internal Transcribed Spacer Region Sequence Heterogeneity and Molecular Diagnosis in Clinical Microbiology

PubMed Central

Zhao, Ying; Tsang, Chi-Ching; Xiao, Meng; Cheng, Jingwei; Xu, Yingchun; Lau, Susanna K. P.; Woo, Patrick C. Y.

2015-01-01

Internal transcribed spacer region (ITS) sequencing is the most extensively used technology for accurate molecular identification of fungal pathogens in clinical microbiology laboratories. Intra-genomic ITS sequence heterogeneity, which makes fungal identification based on direct sequencing of PCR products difficult, has rarely been reported in pathogenic fungi. During the process of performing ITS sequencing on 71 yeast strains isolated from various clinical specimens, direct sequencing of the PCR products showed ambiguous sequences in six of them. After cloning the PCR products into plasmids for sequencing, interpretable sequencing electropherograms could be obtained. For each of the six isolates, 10–49 clones were selected for sequencing and two to seven intra-genomic ITS copies were detected. The identities of these six isolates were confirmed to be Candida glabrata (n = 2), Pichia (Candida) norvegensis (n = 2), Candida tropicalis (n = 1) and Saccharomyces cerevisiae (n = 1). Multiple sequence alignment revealed that one to four intra-genomic ITS polymorphic sites were present in the six isolates, and all these polymorphic sites were located in the ITS1 and/or ITS2 regions. We report and describe the first evidence of intra-genomic ITS sequence heterogeneity in four different pathogenic yeasts, which occurred exclusively in the ITS1 and ITS2 spacer regions for the six isolates in this study. PMID:26506340
New families of site-specific repetitive DNA sequences that comprise constitutive heterochromatin of the Syrian hamster (Mesocricetus auratus, Cricetinae, Rodentia).

PubMed

Yamada, Kazuhiko; Kamimura, Eikichi; Kondo, Mariko; Tsuchiya, Kimiyuki; Nishida-Umehara, Chizuko; Matsuda, Yoichi

2006-02-01

We molecularly cloned new families of site-specific repetitive DNA sequences from BglII- and EcoRI-digested genomic DNA of the Syrian hamster (Mesocricetus auratus, Cricetrinae, Rodentia) and characterized them by chromosome in situ hybridization and filter hybridization. They were classified into six different types of repetitive DNA sequence families according to chromosomal distribution and genome organization. The hybridization patterns of the sequences were consistent with the distribution of C-positive bands and/or Hoechst-stained heterochromatin. The centromeric major satellite DNA and sex chromosome-specific and telomeric region-specific repetitive sequences were conserved in the same genus (Mesocricetus) but divergent in different genera. The chromosome-2-specific sequence was conserved in two genera, Mesocricetus and Cricetulus, and a low copy number of repetitive sequences on the heterochromatic chromosome arms were conserved in the subfamily Cricetinae but not in the subfamily Calomyscinae. By contrast, the other type of repetitive sequences on the heterochromatic chromosome arms, which had sequence similarities to a LINE sequence of rodents, was conserved through the three subfamilies, Cricetinae, Calomyscinae and Murinae. The nucleotide divergence of the repetitive sequences of heterochromatin was well correlated with the phylogenetic relationships of the Cricetinae species, and each sequence has been independently amplified and diverged in the same genome.
Identification of Genomic Insertion and Flanking Sequence of G2-EPSPS and GAT Transgenes in Soybean Using Whole Genome Sequencing Method.

PubMed

Guo, Bingfu; Guo, Yong; Hong, Huilong; Qiu, Li-Juan

2016-01-01

Molecular characterization of sequence flanking exogenous fragment insertion is essential for safety assessment and labeling of genetically modified organism (GMO). In this study, the T-DNA insertion sites and flanking sequences were identified in two newly developed transgenic glyphosate-tolerant soybeans GE-J16 and ZH10-6 based on whole genome sequencing (WGS) method. More than 22.4 Gb sequence data (∼21 × coverage) for each line was generated on Illumina HiSeq 2500 platform. The junction reads mapped to boundaries of T-DNA and flanking sequences in these two events were identified by comparing all sequencing reads with soybean reference genome and sequence of transgenic vector. The putative insertion loci and flanking sequences were further confirmed by PCR amplification, Sanger sequencing, and co-segregation analysis. All these analyses supported that exogenous T-DNA fragments were integrated in positions of Chr19: 50543767-50543792 and Chr17: 7980527-7980541 in these two transgenic lines. Identification of genomic insertion sites of G2-EPSPS and GAT transgenes will facilitate the utilization of their glyphosate-tolerant traits in soybean breeding program. These results also demonstrated that WGS was a cost-effective and rapid method for identifying sites of T-DNA insertions and flanking sequences in soybean.
Geoseq: a tool for dissecting deep-sequencing datasets.

PubMed

Gurtowski, James; Cancio, Anthony; Shah, Hardik; Levovitz, Chaya; George, Ajish; Homann, Robert; Sachidanandam, Ravi

2010-10-12

Datasets generated on deep-sequencing platforms have been deposited in various public repositories such as the Gene Expression Omnibus (GEO), Sequence Read Archive (SRA) hosted by the NCBI, or the DNA Data Bank of Japan (ddbj). Despite being rich data sources, they have not been used much due to the difficulty in locating and analyzing datasets of interest. Geoseq http://geoseq.mssm.edu provides a new method of analyzing short reads from deep sequencing experiments. Instead of mapping the reads to reference genomes or sequences, Geoseq maps a reference sequence against the sequencing data. It is web-based, and holds pre-computed data from public libraries. The analysis reduces the input sequence to tiles and measures the coverage of each tile in a sequence library through the use of suffix arrays. The user can upload custom target sequences or use gene/miRNA names for the search and get back results as plots and spreadsheet files. Geoseq organizes the public sequencing data using a controlled vocabulary, allowing identification of relevant libraries by organism, tissue and type of experiment. Analysis of small sets of sequences against deep-sequencing datasets, as well as identification of public datasets of interest, is simplified by Geoseq. We applied Geoseq to, a) identify differential isoform expression in mRNA-seq datasets, b) identify miRNAs (microRNAs) in libraries, and identify mature and star sequences in miRNAS and c) to identify potentially mis-annotated miRNAs. The ease of using Geoseq for these analyses suggests its utility and uniqueness as an analysis tool.
Novel primers for complete mitochondrial cytochrome b genesequencing in mammals

USGS Publications Warehouse

Naidu, Ashwin; Fitak, Robert R.; Munguia-Vega, Adrian; Culver, Melanie

2011-01-01

Sequence-based species identification relies on the extent and integrity of sequence data available in online databases such as GenBank. When identifying species from a sample of unknown origin, partial DNA sequences obtained from the sample are aligned against existing sequences in databases. When the sequence from the matching species is not present in the database, high-scoring alignments with closely related sequences might produce unreliable results on species identity. For species identification in mammals, the cytochrome b (cyt b) gene has been identified to be highly informative; thus, large amounts of reference sequence data from the cyt b gene are much needed. To enhance availability of cyt b gene sequence data on a large number of mammalian species in GenBank and other such publicly accessible online databases, we identified a primer pair for complete cyt b gene sequencing in mammals. Using this primer pair, we successfully PCR amplified and sequenced the complete cyt b gene from 40 of 44 mammalian species representing 10 orders of mammals. We submitted 40 complete, correctly annotated, cyt b protein coding sequences to GenBank. To our knowledge, this is the first single primer pair to amplify the complete cyt b gene in a broad range of mammalian species. This primer pair can be used for the addition of new cyt b gene sequences and to enhance data available on species represented in GenBank. The availability of novel and complete gene sequences as high-quality reference data can improve the reliability of sequence-based species identification.
Chronology of Eocene-Miocene sequences on the New Jersey shallow shelf: implications for regional, interregional, and global correlations

USGS Publications Warehouse

Browning, James V.; Miller, Kenneth G.; Sugarman, Peter J.; Barron, John; McCarthy, Francine M.G.; Kulhanek, Denise K.; Katz, Miriam E.; Feigenson, Mark D.

2013-01-01

Integrated Ocean Drilling Program Expedition 313 continuously cored and logged latest Eocene to early-middle Miocene sequences at three sites (M27, M28, and M29) on the inner-middle continental shelf offshore New Jersey, providing an opportunity to evaluate the ages, global correlations, and significance of sequence boundaries. We provide a chronology for these sequences using integrated strontium isotopic stratigraphy and biostratigraphy (primarily calcareous nannoplankton, diatoms, and dinocysts [dinoflagellate cysts]). Despite challenges posed by shallow-water sediments, age resolution is typically ±0.5 m.y. and in many sequences is as good as ±0.25 m.y. Three Oligocene sequences were sampled at Site M27 on sequence bottomsets. Fifteen early to early-middle Miocene sequences were dated at Sites M27, M28, and M29 across clinothems in topsets, foresets (where the sequences are thickest), and bottomsets. A few sequences have coarse (∼1 m.y.) or little age constraint due to barren zones; we constrain the age estimates of these less well dated sequences by applying the principle of superposition, i.e., sediments above sequence boundaries in any site are younger than the sediments below the sequence boundaries at other sites. Our age control provides constraints on the timing of deposition in the clinothem; sequences on the topsets are generally the youngest in the clinothem, whereas the bottomsets generally are the oldest. The greatest amount of time is represented on foresets, although we have no evidence for a correlative conformity. Our chronology provides a baseline for regional and interregional correlations and sea-level reconstructions: (1) we correlate a major increase in sedimentation rate precisely with the timing of the middle Miocene climate changes associated with the development of a permanent East Antarctic Ice Sheet; and (2) the timing of sequence boundaries matches the deep-sea oxygen isotopic record, implicating glacioeustasy as a major driver for forming sequence boundaries.
Memory for sequences of events impaired in typical aging

PubMed Central

Allen, Timothy A.; Morris, Andrea M.; Stark, Shauna M.; Fortin, Norbert J.

2015-01-01

Typical aging is associated with diminished episodic memory performance. To improve our understanding of the fundamental mechanisms underlying this age-related memory deficit, we previously developed an integrated, cross-species approach to link converging evidence from human and animal research. This novel approach focuses on the ability to remember sequences of events, an important feature of episodic memory. Unlike existing paradigms, this task is nonspatial, nonverbal, and can be used to isolate different cognitive processes that may be differentially affected in aging. Here, we used this task to make a comprehensive comparison of sequence memory performance between younger (18–22 yr) and older adults (62–86 yr). Specifically, participants viewed repeated sequences of six colored, fractal images and indicated whether each item was presented “in sequence” or “out of sequence.” Several out of sequence probe trials were used to provide a detailed assessment of sequence memory, including: (i) repeating an item from earlier in the sequence (“Repeats”; e.g., ABADEF), (ii) skipping ahead in the sequence (“Skips”; e.g., ABDDEF), and (iii) inserting an item from a different sequence into the same ordinal position (“Ordinal Transfers”; e.g., AB3DEF). We found that older adults performed as well as younger controls when tested on well-known and predictable sequences, but were severely impaired when tested using novel sequences. Importantly, overall sequence memory performance in older adults steadily declined with age, a decline not detected with other measures (RAVLT or BPS-O). We further characterized this deficit by showing that performance of older adults was severely impaired on specific probe trials that required detailed knowledge of the sequence (Skips and Ordinal Transfers), and was associated with a shift in their underlying mnemonic representation of the sequences. Collectively, these findings provide unambiguous evidence that the capacity to remember sequences of events is fundamentally affected by typical aging. PMID:25691514

Theta oscillations promote temporal sequence learning.

PubMed

Crivelli-Decker, Jordan; Hsieh, Liang-Tien; Clarke, Alex; Ranganath, Charan

2018-05-17

Many theoretical models suggest that neural oscillations play a role in learning or retrieval of temporal sequences, but the extent to which oscillations support sequence representation remains unclear. To address this question, we used scalp electroencephalography (EEG) to examine oscillatory activity over learning of different object sequences. Participants made semantic decisions on each object as they were presented in a continuous stream. For three "Consistent" sequences, the order of the objects was always fixed. Activity during Consistent sequences was compared to "Random" sequences that consisted of the same objects presented in a different order on each repetition. Over the course of learning, participants made faster semantic decisions to objects in Consistent, as compared to objects in Random sequences. Thus, participants were able to use sequence knowledge to predict upcoming items in Consistent sequences. EEG analyses revealed decreased oscillatory power in the theta (4-7 Hz) band at frontal sites following decisions about objects in Consistent sequences, as compared with objects in Random sequences. The theta power difference between Consistent and Random only emerged in the second half of the task, as participants were more effectively able to predict items in Consistent sequences. Moreover, we found increases in parieto-occipital alpha (10-13 Hz) and beta (14-28 Hz) power during the pre-response period for objects in Consistent sequences, relative to objects in Random sequences. Linear mixed effects modeling revealed that single trial theta oscillations were related to reaction time for future objects in a sequence, whereas beta and alpha oscillations were only predictive of reaction time on the current trial. These results indicate that theta and alpha/beta activity preferentially relate to future and current events, respectively. More generally our findings highlight the importance of band-specific neural oscillations in the learning of temporal order information. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Application of Modified Spin-Echo–based Sequences for Hepatic MR Elastography: Evaluation, Comparison with the Conventional Gradient-Echo Sequence, and Preliminary Clinical Experience

PubMed Central

Mariappan, Yogesh K.; Dzyubak, Bogdan; Glaser, Kevin J.; Venkatesh, Sudhakar K.; Sirlin, Claude B.; Hooker, Jonathan; McGee, Kiaran P.

2017-01-01

Purpose To (a) evaluate modified spin-echo (SE) magnetic resonance (MR) elastographic sequences for acquiring MR images with improved signal-to-noise ratio (SNR) in patients in whom the standard gradient-echo (GRE) MR elastographic sequence yields low hepatic signal intensity and (b) compare the stiffness values obtained with these sequences with those obtained with the conventional GRE sequence. Materials and Methods This HIPAA-compliant retrospective study was approved by the institutional review board; the requirement to obtain informed consent was waived. Data obtained with modified SE and SE echo-planar imaging (EPI) MR elastographic pulse sequences with short echo times were compared with those obtained with the conventional GRE MR elastographic sequence in two patient cohorts, one that exhibited adequate liver signal intensity and one that exhibited low liver signal intensity. Shear stiffness values obtained with the three sequences in 130 patients with successful GRE-based examinations were retrospectively tested for statistical equivalence by using a 5% margin. In 47 patients in whom GRE examinations were considered to have failed because of low SNR, the SNR and confidence level with the SE-based sequences were compared with those with the GRE sequence. Results The results of this study helped confirm the equivalence of SE MR elastography and SE-EPI MR elastography to GRE MR elastography (P = .0212 and P = .0001, respectively). The SE and SE-EPI MR elastographic sequences provided substantially improved SNR and stiffness inversion confidence level in 47 patients in whom GRE MR elastography had failed. Conclusion Modified SE-based MR elastographic sequences provide higher SNR MR elastographic data and reliable stiffness measurements; thus, they enable quantification of stiffness in patients in whom the conventional GRE MR elastographic sequence failed owing to low signal intensity. The equivalence of the three sequences indicates that the current diagnostic thresholds are applicable to SE MR elastographic sequences for assessing liver fibrosis. © RSNA, 2016 PMID:27509543
(Pea)nuts and bolts of visual narrative: Structure and meaning in sequential image comprehension

PubMed Central

Cohn, Neil; Paczynski, Martin; Jackendoff, Ray; Holcomb, Phillip J.; Kuperberg, Gina R.

2012-01-01

Just as syntax differentiates coherent sentences from scrambled word strings, the comprehension of sequential images must also use a cognitive system to distinguish coherent narrative sequences from random strings of images. We conducted experiments analogous to two classic studies of language processing to examine the contributions of narrative structure and semantic relatedness to processing sequential images. We compared four types of comic strips: 1) Normal sequences with both structure and meaning, 2) Semantic Only sequences (in which the panels were related to a common semantic theme, but had no narrative structure), 3) Structural Only sequences (narrative structure but no semantic relatedness), and 4) Scrambled sequences of randomly-ordered panels. In Experiment 1, participants monitored for target panels in sequences presented panel-by-panel. Reaction times were slowest to panels in Scrambled sequences, intermediate in both Structural Only and Semantic Only sequences, and fastest in Normal sequences. This suggests that both semantic relatedness and narrative structure offer advantages to processing. Experiment 2 measured ERPs to all panels across the whole sequence. The N300/N400 was largest to panels in both the Scrambled and Structural Only sequences, intermediate in Semantic Only sequences and smallest in the Normal sequences. This implies that a combination of narrative structure and semantic relatedness can facilitate semantic processing of upcoming panels (as reflected by the N300/N400). Also, panels in the Scrambled sequences evoked a larger left-lateralized anterior negativity than panels in the Structural Only sequences. This localized effect was distinct from the N300/N400, and appeared despite the fact that these two sequence types were matched on local semantic relatedness between individual panels. These findings suggest that sequential image comprehension uses a narrative structure that may be independent of semantic relatedness. Altogether, we argue that the comprehension of visual narrative is guided by an interaction between structure and meaning. PMID:22387723
A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses

USDA-ARS?s Scientific Manuscript database

Background: Next-generation sequencing (NGS) allows ultra-deep sequencing of nucleic acids. The use of sequence-independent amplification of viral nucleic acids without utilization of target-specific primers provides advantages over traditional sequencing methods and allows detection of unsuspected ...
Genome Wide Characterization of Simple Sequence Repeats in Cucumber

USDA-ARS?s Scientific Manuscript database

The whole genome sequence of the cucumber cultivar Gy14 was recently sequenced at 15× coverage with the Roche 454 Titanium technology. The microsatellite DNA sequences (simple sequence repeats, SSRs) in the assembled scaffolds were computationally explored and characterized. A total of 112,073 SSRs ...
Ion Torren Semiconductor Sequencing Allows Rapid, Low Cost Sequencing of the Human Exome (7th Annual SFAF Meeting, 2012)

ScienceCinema

Jenkins, David

2018-01-10

David Jenkins on "Ion Torrent semiconductor sequencing allows rapid, low-cost sequencing of the human exome" at the 2012 Sequencing, Finishing, Analysis in the Future Meeting held June 5-7, 2012 in Santa Fe, New Mexico.
Ion Torren Semiconductor Sequencing Allows Rapid, Low Cost Sequencing of the Human Exome (7th Annual SFAF Meeting, 2012)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jenkins, David

David Jenkins on "Ion Torrent semiconductor sequencing allows rapid, low-cost sequencing of the human exome" at the 2012 Sequencing, Finishing, Analysis in the Future Meeting held June 5-7, 2012 in Santa Fe, New Mexico.
Genome sequence of Phytophthora ramorum: implications for management

Treesearch

Brett Tyler; Sucheta Tripathy; Nik Grunwald; Kurt Lamour; Kelly Ivors; Matteo Garbelotto; Daniel Rokhsar; Nik Putnam; Igor Grigoriev; Jeffrey Boore

2006-01-01

A draft genome sequence has been determined for Phytophthora ramorum, together with a draft sequence of the soybean pathogen Phytophthora sojae. The P. ramorum genome was sequenced to a depth of 7-fold coverage, while the P. sojae genome was sequenced to a depth of 9-fold coverage. The genome...
Teaching Task Sequencing via Verbal Mediation.

ERIC Educational Resources Information Center

Rusch, Frank R.; And Others

1987-01-01

Verbal sequence training was used to teach a moderately mentally retarded woman to sequence job-related tasks. Learning to say the tasks in the proper sequence resulted in the employee performing her tasks in that sequence, and the employee was capable of mediating her own work behavior when scheduled changes occurred. (Author/JDD)
Sequencing Adventure Activities: A New Perspective.

ERIC Educational Resources Information Center

Bisson, Christian

Sequencing in adventure education involves putting activities in an order appropriate to the needs of the group. Contrary to the common assumption that each adventure sequence is unique, a review of literature concerning five sequencing models reveals a certain universality. These models present sequences that move through four phases: group…
Application of population sequencing (POPSEQ) for ordering and inputting genotyping-by-sequencing markers in hexaploid wheat

USDA-ARS?s Scientific Manuscript database

The advancement of next-generation sequencing technologies in conjunction with new bioinformatics tools enabled fine-tuning of sequence-based high resolution mapping strategies for complex genomes. Although genotyping-by-sequencing (GBS) provides a large number of markers, its application for assoc...
77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

Federal Register 2010, 2011, 2012, 2013, 2014

2012-10-29

... DEPARTMENT OF COMMERCE Patent and Trademark Office Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request... Patent applications that contain nucleotide and/or amino acid sequence disclosures must include a copy of...
A Glance at Microsatellite Motifs from 454 Sequencing Reads of Watermelon Genomic DNA

USDA-ARS?s Scientific Manuscript database

A single 454 (Life Sciences Sequencing Technology) run of Charleston Gray watermelon (Citrullus lanatus var. lanatus) genomic DNA was performed and sequence data were assembled. A large scale identification of simple sequence repeat (SSR) was performed and SSR sequence data were used for the develo...
Multi-platform next-generation sequencing of the domestic turkey (Meleagris gallopavo) genome assembly and analysis

USDA-ARS?s Scientific Manuscript database

Next-generation sequencing technologies were used to rapidly and efficiently sequence the genome of the domestic turkey (Meleagris gallopavo). The current genome assembly (~1.1 Gb) includes 917 Mb of sequence assigned to chromosomes. Innate heterozygosity of the sequenced bird allowed discovery of...
Bellerophon: A program to detect chimeric sequences in multiple sequence alignments

DOE Office of Scientific and Technical Information (OSTI.GOV)

Huber, Thomas; Faulkner, Geoffrey; Hugenholtz, Philip

2003-12-23

Bellerophon is a program for detecting chimeric sequences in multiple sequence datasets by an adaption of partial treeing analysis. Bellerophon was specifically developed to detect 16S rRNA gene chimeras in PCR-clone libraries of environmental samples but can be applied to other nucleotide sequence alignments.
A Code Division Multiple Access Communication System for the Low Frequency Band.

DTIC Science & Technology

1983-04-01

frequency channels spread-spectrum communication / complex sequences, orthogonal codes impulsive noise 20. ABSTRACT (Continue an reverse side It...their transmissions with signature sequences. Our LF/CDMA scheme is different in that each user’s signature sequence set consists of M orthogonal ...signature sequences. Our LF/CDMA scheme is different in that each user’s signature sequence set consists of M orthogonal sequences and thus log 2 M
A vision for ubiquitous sequencing

PubMed Central

Erlich, Yaniv

2015-01-01

Genomics has recently celebrated reaching the $1000 genome milestone, making affordable DNA sequencing a reality. With this goal successfully completed, the next goal of the sequencing revolution can be sequencing sensors—miniaturized sequencing devices that are manufactured for real-time applications and deployed in large quantities at low costs. The first part of this manuscript envisions applications that will benefit from moving the sequencers to the samples in a range of domains. In the second part, the manuscript outlines the critical barriers that need to be addressed in order to reach the goal of ubiquitous sequencing sensors. PMID:26430149
Data compression of discrete sequence: A tree based approach using dynamic programming

NASA Technical Reports Server (NTRS)

Shivaram, Gurusrasad; Seetharaman, Guna; Rao, T. R. N.

1994-01-01

A dynamic programming based approach for data compression of a ID sequence is presented. The compression of an input sequence of size N to that of a smaller size k is achieved by dividing the input sequence into k subsequences and replacing the subsequences by their respective average values. The partitioning of the input sequence is carried with the intention of reducing the mean squared error in the reconstructed sequence. The complexity involved in finding the partitions which would result in such an optimal compressed sequence is reduced by using the dynamic programming approach, which is presented.
Whole-genome sequencing for comparative genomics and de novo genome assembly.

PubMed

Benjak, Andrej; Sala, Claudia; Hartkoorn, Ruben C

2015-01-01

Next-generation sequencing technologies for whole-genome sequencing of mycobacteria are rapidly becoming an attractive alternative to more traditional sequencing methods. In particular this technology is proving useful for genome-wide identification of mutations in mycobacteria (comparative genomics) as well as for de novo assembly of whole genomes. Next-generation sequencing however generates a vast quantity of data that can only be transformed into a usable and comprehensible form using bioinformatics. Here we describe the methodology one would use to prepare libraries for whole-genome sequencing, and the basic bioinformatics to identify mutations in a genome following Illumina HiSeq or MiSeq sequencing, as well as de novo genome assembly following sequencing using Pacific Biosciences (PacBio).
Library construction for next-generation sequencing: Overviews and challenges

PubMed Central

Head, Steven R.; Komori, H. Kiyomi; LaMere, Sarah A.; Whisenant, Thomas; Van Nieuwerburgh, Filip; Salomon, Daniel R.; Ordoukhanian, Phillip

2014-01-01

High-throughput sequencing, also known as next-generation sequencing (NGS), has revolutionized genomic research. In recent years, NGS technology has steadily improved, with costs dropping and the number and range of sequencing applications increasing exponentially. Here, we examine the critical role of sequencing library quality and consider important challenges when preparing NGS libraries from DNA and RNA sources. Factors such as the quantity and physical characteristics of the RNA or DNA source material as well as the desired application (i.e., genome sequencing, targeted sequencing, RNA-seq, ChIP-seq, RIP-seq, and methylation) are addressed in the context of preparing high quality sequencing libraries. In addition, the current methods for preparing NGS libraries from single cells are also discussed. PMID:24502796

PCR Amplification Strategies towards full-length HIV-1 Genome sequencing.

PubMed

Liu, Chao Chun; Ji, Hezhao

2018-06-26

The advent of next generation sequencing has enabled greater resolution of viral diversity and improved feasibility of full viral genome sequencing allowing routine HIV-1 full genome sequencing in both research and diagnostic settings. Regardless of the sequencing platform selected, successful PCR amplification of the HIV-1 genome is essential for sequencing template preparation. As such, full HIV-1 genome amplification is a crucial step in dictating the successful and reliable sequencing downstream. Here we reviewed existing PCR protocols leading to HIV-1 full genome sequencing. In addition to the discussion on basic considerations on relevant PCR design, the advantages as well as the pitfalls of published protocols were reviewed. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
RDNAnalyzer: A tool for DNA secondary structure prediction and sequence analysis.

PubMed

Afzal, Muhammad; Shahid, Ahmad Ali; Shehzadi, Abida; Nadeem, Shahid; Husnain, Tayyab

2012-01-01

RDNAnalyzer is an innovative computer based tool designed for DNA secondary structure prediction and sequence analysis. It can randomly generate the DNA sequence or user can upload the sequences of their own interest in RAW format. It uses and extends the Nussinov dynamic programming algorithm and has various application for the sequence analysis. It predicts the DNA secondary structure and base pairings. It also provides the tools for routinely performed sequence analysis by the biological scientists such as DNA replication, reverse compliment generation, transcription, translation, sequence specific information as total number of nucleotide bases, ATGC base contents along with their respective percentages and sequence cleaner. RDNAnalyzer is a unique tool developed in Microsoft Visual Studio 2008 using Microsoft Visual C# and Windows Presentation Foundation and provides user friendly environment for sequence analysis. It is freely available. http://www.cemb.edu.pk/sw.html RDNAnalyzer - Random DNA Analyser, GUI - Graphical user interface, XAML - Extensible Application Markup Language.
Long sequence correlation coprocessor

NASA Astrophysics Data System (ADS)

Gage, Douglas W.

1994-09-01

A long sequence correlation coprocessor (LSCC) accelerates the bitwise correlation of arbitrarily long digital sequences by calculating in parallel the correlation score for 16, for example, adjacent bit alignments between two binary sequences. The LSCC integrated circuit is incorporated into a computer system with memory storage buffers and a separate general purpose computer processor which serves as its controller. Each of the LSCC's set of sequential counters simultaneously tallies a separate correlation coefficient. During each LSCC clock cycle, computer enable logic associated with each counter compares one bit of a first sequence with one bit of a second sequence to increment the counter if the bits are the same. A shift register assures that the same bit of the first sequence is simultaneously compared to different bits of the second sequence to simultaneously calculate the correlation coefficient by the different counters to represent different alignments of the two sequences.
It’s More Than Stamp Collecting: How Genome Sequencing Can Unify Biological Research

PubMed Central

Richards, Stephen

2015-01-01

The availability of reference genome sequences, especially the human reference, has revolutionized the study of biology. However, whilst the genomes of some species have been fully sequenced, a wide range of biological problems still cannot be effectively studied for lack of genome sequence information. Here, I identify neglected areas of biology and describe how both targeted species sequencing and more broad taxonomic surveys of the tree of life can address important biological questions. I enumerate the significant benefits that would accrue from sequencing a broader range of taxa, as well as discuss the technical advances in sequencing and assembly methods that would allow for wide-ranging application of whole-genome analysis. Finally, I suggest that in addition to “Big Science” survey initiatives to sequence the tree of life, a modified infrastructure-funding paradigm would better support reference genome sequence generation for research communities most in need. PMID:26003218
It's more than stamp collecting: how genome sequencing can unify biological research.

PubMed

Richards, Stephen

2015-07-01

The availability of reference genome sequences, especially the human reference, has revolutionized the study of biology. However, while the genomes of some species have been fully sequenced, a wide range of biological problems still cannot be effectively studied for lack of genome sequence information. Here, I identify neglected areas of biology and describe how both targeted species sequencing and more broad taxonomic surveys of the tree of life can address important biological questions. I enumerate the significant benefits that would accrue from sequencing a broader range of taxa, as well as discuss the technical advances in sequencing and assembly methods that would allow for wide-ranging application of whole-genome analysis. Finally, I suggest that in addition to 'big science' survey initiatives to sequence the tree of life, a modified infrastructure-funding paradigm would better support reference genome sequence generation for research communities most in need. Copyright © 2015 Elsevier Ltd. All rights reserved.
Sequence information signal processor

DOEpatents

Peterson, John C.; Chow, Edward T.; Waterman, Michael S.; Hunkapillar, Timothy J.

1999-01-01

An electronic circuit is used to compare two sequences, such as genetic sequences, to determine which alignment of the sequences produces the greatest similarity. The circuit includes a linear array of series-connected processors, each of which stores a single element from one of the sequences and compares that element with each successive element in the other sequence. For each comparison, the processor generates a scoring parameter that indicates which segment ending at those two elements produces the greatest degree of similarity between the sequences. The processor uses the scoring parameter to generate a similar scoring parameter for a comparison between the stored element and the next successive element from the other sequence. The processor also delivers the scoring parameter to the next processor in the array for use in generating a similar scoring parameter for another pair of elements. The electronic circuit determines which processor and alignment of the sequences produce the scoring parameter with the highest value.
Identification of Sequence Specificity of 5-Methylcytosine Oxidation by Tet1 Protein with High-Throughput Sequencing.

PubMed

Kizaki, Seiichiro; Chandran, Anandhakumar; Sugiyama, Hiroshi

2016-03-02

Tet (ten-eleven translocation) family proteins have the ability to oxidize 5-methylcytosine (mC) to 5-hydroxymethylcytosine (hmC), 5-formylcytosine (fC), and 5-carboxycytosine (caC). However, the oxidation reaction of Tet is not understood completely. Evaluation of genomic-level epigenetic changes by Tet protein requires unbiased identification of the highly selective oxidation sites. In this study, we used high-throughput sequencing to investigate the sequence specificity of mC oxidation by Tet1. A 6.6×10(4) -member mC-containing random DNA-sequence library was constructed. The library was subjected to Tet-reactive pulldown followed by high-throughput sequencing. Analysis of the obtained sequence data identified the Tet1-reactive sequences. We identified mCpG as a highly reactive sequence of Tet1 protein. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Rapid Threat Organism Recognition Pipeline

DOE Office of Scientific and Technical Information (OSTI.GOV)

Williams, Kelly P.; Solberg, Owen D.; Schoeniger, Joseph S.

2013-05-07

The RAPTOR computational pipeline identifies microbial nucleic acid sequences present in sequence data from clinical samples. It takes as input raw short-read genomic sequence data (in particular, the type generated by the Illumina sequencing platforms) and outputs taxonomic evaluation of detected microbes in various human-readable formats. This software was designed to assist in the diagnosis or characterization of infectious disease, by detecting pathogen sequences in nucleic acid sequence data from clinical samples. It has also been applied in the detection of algal pathogens, when algal biofuel ponds became unproductive. RAPTOR first trims and filters genomic sequence reads based on qualitymore » and related considerations, then performs a quick alignment to the human (or other host) genome to filter out host sequences, then performs a deeper search against microbial genomes. Alignment to a protein sequence database is optional. Alignment results are summarized and placed in a taxonomic framework using the Lowest Common Ancestor algorithm.« less
Next-Generation Sequencing Platforms

NASA Astrophysics Data System (ADS)

Mardis, Elaine R.

2013-06-01

Automated DNA sequencing instruments embody an elegant interplay among chemistry, engineering, software, and molecular biology and have built upon Sanger's founding discovery of dideoxynucleotide sequencing to perform once-unfathomable tasks. Combined with innovative physical mapping approaches that helped to establish long-range relationships between cloned stretches of genomic DNA, fluorescent DNA sequencers produced reference genome sequences for model organisms and for the reference human genome. New types of sequencing instruments that permit amazing acceleration of data-collection rates for DNA sequencing have been developed. The ability to generate genome-scale data sets is now transforming the nature of biological inquiry. Here, I provide an historical perspective of the field, focusing on the fundamental developments that predated the advent of next-generation sequencing instruments and providing information about how these instruments work, their application to biological research, and the newest types of sequencers that can extract data from single DNA molecules.
Contributions from associative and explicit sequence knowledge to the execution of discrete keying sequences.

PubMed

Verwey, Willem B

2015-05-01

Research has provided many indications that highly practiced 6-key sequences are carried out in a chunking mode in which key-specific stimuli past the first are largely ignored. When in such sequences a deviating stimulus occasionally occurs at an unpredictable location, participants fall back to responding to individual stimuli (Verwey & Abrahamse, 2012). The observation that in such a situation execution still benefits from prior practice has been attributed to the possibility to operate in an associative mode. To better understand the contribution to the execution of keying sequences of motor chunks, associative sequence knowledge and also of explicit sequence knowledge, the present study tested three alternative accounts for the earlier finding of an execution rate increase at the end of 6-key sequences performed in the associative mode. The results provide evidence that the earlier observed execution rate increase can be attributed to the use of explicit sequence knowledge. In the present experiment this benefit was limited to sequences that are executed at the moderately fast rates of the associative mode, and occurred at both the earlier and final elements of the sequences. Copyright © 2015 Elsevier B.V. All rights reserved.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.

This study characterized regions of DNA which remained unassembled by either PacBio and Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted.more » PacBio assemblies had few limitations overall and gaps were explained as cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.« less
Value of a single-shot turbo spin-echo pulse sequence for assessing the architecture of the subarachnoid space and the constitutive nature of cerebrospinal fluid.

PubMed

Pease, Anthony; Sullivan, Stacey; Olby, Natasha; Galano, Heather; Cerda-Gonzalez, Sophia; Robertson, Ian D; Gavin, Patrick; Thrall, Donald

2006-01-01

Three case history reports are presented to illustrate the value of the single-shot turbo spin-echo pulse sequence for assessment of the subarachnoid space. The use of the single-shot turbo spin-echo pulse sequence, which is a heavily T2-weighted sequence, allows for a rapid, noninvasive evaluation of the subarachnoid space by using the high signal from cerebrospinal fluid. This sequence can be completed in seconds rather than the several minutes required for a T2-fast spin-echo sequence. Unlike the standard T2-fast spin-echo sequence, a single-shot turbo spin-echo pulse sequence also provides qualitative information about the protein and the cellular content of the cerebrospinal fluid, such as in patients with inflammatory debris or hemorrhage in the cerebrospinal fluid. Although the resolution of the single-shot turbo spin-echo pulse sequence images is relatively poor compared with more conventional sequences, the qualitative information about the subarachnoid space and cerebrospinal fluid and the rapid acquisition time, make it a useful sequence to include in standard protocols of spinal magnetic resonance imaging.
Protein sequence annotation in the genome era: the annotation concept of SWISS-PROT+TREMBL.

PubMed

Apweiler, R; Gateau, A; Contrino, S; Martin, M J; Junker, V; O'Donovan, C; Lang, F; Mitaritonna, N; Kappus, S; Bairoch, A

1997-01-01

SWISS-PROT is a curated protein sequence database which strives to provide a high level of annotation, a minimal level of redundancy and high level of integration with other databases. Ongoing genome sequencing projects have dramatically increased the number of protein sequences to be incorporated into SWISS-PROT. Since we do not want to dilute the quality standards of SWISS-PROT by incorporating sequences without proper sequence analysis and annotation, we cannot speed up the incorporation of new incoming data indefinitely. However, as we also want to make the sequences available as fast as possible, we introduced TREMBL (TRanslation of EMBL nucleotide sequence database), a supplement to SWISS-PROT. TREMBL consists of computer-annotated entries in SWISS-PROT format derived from the translation of all coding sequences (CDS) in the EMBL nucleotide sequence database, except for CDS already included in SWISS-PROT. While TREMBL is already of immense value, its computer-generated annotation does not match the quality of SWISS-PROTs. The main difference is in the protein functional information attached to sequences. With this in mind, we are dedicating substantial effort to develop and apply computer methods to enhance the functional information attached to TREMBL entries.
rpoB Gene Sequencing for Identification of Corynebacterium Species

PubMed Central

Khamis, Atieh; Raoult, Didier; La Scola, Bernard

2004-01-01

The genus Corynebacterium is a heterogeneous group of species comprising human and animal pathogens and environmental bacteria. It is defined on the basis of several phenotypic characters and the results of DNA-DNA relatedness and, more recently, 16S rRNA gene sequencing. However, the 16S rRNA gene is not polymorphic enough to ensure reliable phylogenetic studies and needs to be completely sequenced for accurate identification. The almost complete rpoB sequences of 56 Corynebacterium species were determined by both PCR and genome walking methods. In all cases the percent similarities between different species were lower than those observed by 16S rRNA gene sequencing, even for those species with degrees of high similarity. Several clusters supported by high bootstrap values were identified. In order to propose a method for strain identification which does not require sequencing of the complete rpoB sequence (approximately 3,500 bp), we identified an area with a high degree of polymorphism, bordered by conserved sequences that can be used as universal primers for PCR amplification and sequencing. The sequence of this fragment (434 to 452 bp) allows accurate species identification and may be used in the future for routine sequence-based identification of Corynebacterium species. PMID:15364970
A safe an easy method for building consensus HIV sequences from 454 massively parallel sequencing data.

PubMed

Fernández-Caballero Rico, Jose Ángel; Chueca Porcuna, Natalia; Álvarez Estévez, Marta; Mosquera Gutiérrez, María Del Mar; Marcos Maeso, María Ángeles; García, Federico

2018-02-01

To show how to generate a consensus sequence from the information of massive parallel sequences data obtained from routine HIV anti-retroviral resistance studies, and that may be suitable for molecular epidemiology studies. Paired Sanger (Trugene-Siemens) and next-generation sequencing (NGS) (454 GSJunior-Roche) HIV RT and protease sequences from 62 patients were studied. NGS consensus sequences were generated using Mesquite, using 10%, 15%, and 20% thresholds. Molecular evolutionary genetics analysis (MEGA) was used for phylogenetic studies. At a 10% threshold, NGS-Sanger sequences from 17/62 patients were phylogenetically related, with a median bootstrap-value of 88% (IQR83.5-95.5). Association increased to 36/62 sequences, median bootstrap 94% (IQR85.5-98)], using a 15% threshold. Maximum association was at the 20% threshold, with 61/62 sequences associated, and a median bootstrap value of 99% (IQR98-100). A safe method is presented to generate consensus sequences from HIV-NGS data at 20% threshold, which will prove useful for molecular epidemiological studies. Copyright © 2016 Elsevier España, S.L.U. and Sociedad Española de Enfermedades Infecciosas y Microbiología Clínica. All rights reserved.
Method for isolating chromosomal DNA in preparation for hybridization in suspension

DOEpatents

Lucas, Joe N.

2000-01-01

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. Chromosomal DNA in a sample containing cell debris is prepared for hybridization in suspension by treating the mixture with RNase. The treated DNA can also be fixed prior to hybridization.
Megabase sequencing of human genome by ordered-shotgun-sequencing (OSS) strategy

NASA Astrophysics Data System (ADS)

Chen, Ellson Y.

1997-05-01

So far we have used OSS strategy to sequence over 2 megabases DNA in large-insert clones from regions of human X chromosomes with different characteristic levels of GC content. The method starts by randomly fragmenting a BAC, YAC or PAC to 8-12 kb pieces and subcloning those into lambda phage. Insert-ends of these clones are sequenced and overlapped to create a partial map. Complete sequencing is then done on a minimal tiling path of selected subclones, recursively focusing on those at the edges of contigs to facilitate mergers of clones across the entire target. To reduce manual labor, PCR processes have been adapted to prepare sequencing templates throughout the entire operation. The streamlined process can thus lend itself to further automation. The OSS approach is suitable for large- scale genomic sequencing, providing considerable flexibility in the choice of subclones or regions for more or less intensive sequencing. For example, subclones containing contaminating host cell DNA or cloning vector can be recognized and ignored with minimal sequencing effort; regions overlapping a neighboring clone already sequenced need not be redone; and segments containing tandem repeats or long repetitive sequences can be spotted early on and targeted for additional attention.
The future scalability of pH-based genome sequencers: A theoretical perspective

NASA Astrophysics Data System (ADS)

Go, Jonghyun; Alam, Muhammad A.

2013-10-01

Sequencing of human genome is an essential prerequisite for personalized medicine and early prognosis of various genetic diseases. The state-of-art, high-throughput genome sequencing technologies provide improved sequencing; however, their reliance on relatively expensive optical detection schemes has prevented wide-spread adoption of the technology in routine care. In contrast, the recently announced pH-based electronic genome sequencers achieve fast sequencing at low cost because of the compatibility with the current microelectronics technology. While the progress in technology development has been rapid, the physics of the sequencing chips and the potential for future scaling (and therefore, cost reduction) remain unexplored. In this article, we develop a theoretical framework and a scaling theory to explain the principle of operation of the pH-based sequencing chips and use the framework to explore various perceived scaling limits of the technology related to signal to noise ratio, well-to-well crosstalk, and sequencing accuracy. We also address several limitations inherent to the key steps of pH-based genome sequencers, which are widely shared by many other sequencing platforms in the market but remained unexplained properly so far.
Pulse sequence programming in a dynamic visual environment: SequenceTree.

PubMed

Magland, Jeremy F; Li, Cheng; Langham, Michael C; Wehrli, Felix W

2016-01-01

To describe SequenceTree, an open source, integrated software environment for implementing MRI pulse sequences and, ideally, exporting them to actual MRI scanners. The software is a user-friendly alternative to vendor-supplied pulse sequence design and editing tools and is suited for programmers and nonprogrammers alike. The integrated user interface was programmed using the Qt4/C++ toolkit. As parameters and code are modified, the pulse sequence diagram is automatically updated within the user interface. Several aspects of pulse programming are handled automatically, allowing users to focus on higher-level aspects of sequence design. Sequences can be simulated using a built-in Bloch equation solver and then exported for use on a Siemens MRI scanner. Ideally, other types of scanners will be supported in the future. SequenceTree has been used for 8 years in our laboratory and elsewhere and has contributed to more than 50 peer-reviewed publications in areas such as cardiovascular imaging, solid state and nonproton NMR, MR elastography, and high-resolution structural imaging. SequenceTree is an innovative, open source, visual pulse sequence environment for MRI combining simplicity with flexibility and is ideal both for advanced users and users with limited programming experience. © 2015 Wiley Periodicals, Inc.
Investigation of Human Cancers for Retrovirus by Low-Stringency Target Enrichment and High-Throughput Sequencing.

PubMed

Vinner, Lasse; Mourier, Tobias; Friis-Nielsen, Jens; Gniadecki, Robert; Dybkaer, Karen; Rosenberg, Jacob; Langhoff, Jill Levin; Cruz, David Flores Santa; Fonager, Jannik; Izarzugaza, Jose M G; Gupta, Ramneek; Sicheritz-Ponten, Thomas; Brunak, Søren; Willerslev, Eske; Nielsen, Lars Peter; Hansen, Anders Johannes

2015-08-19

Although nearly one fifth of all human cancers have an infectious aetiology, the causes for the majority of cancers remain unexplained. Despite the enormous data output from high-throughput shotgun sequencing, viral DNA in a clinical sample typically constitutes a proportion of host DNA that is too small to be detected. Sequence variation among virus genomes complicates application of sequence-specific, and highly sensitive, PCR methods. Therefore, we aimed to develop and characterize a method that permits sensitive detection of sequences despite considerable variation. We demonstrate that our low-stringency in-solution hybridization method enables detection of <100 viral copies. Furthermore, distantly related proviral sequences may be enriched by orders of magnitude, enabling discovery of hitherto unknown viral sequences by high-throughput sequencing. The sensitivity was sufficient to detect retroviral sequences in clinical samples. We used this method to conduct an investigation for novel retrovirus in samples from three cancer types. In accordance with recent studies our investigation revealed no retroviral infections in human B-cell lymphoma cells, cutaneous T-cell lymphoma or colorectal cancer biopsies. Nonetheless, our generally applicable method makes sensitive detection possible and permits sequencing of distantly related sequences from complex material.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.