peptide sequencing algorithms: Topics by Science.gov

Sample records for peptide sequencing algorithms

Combining De Novo Peptide Sequencing Algorithms, A Synergistic Approach to Boost Both Identifications and Confidence in Bottom-up Proteomics.

PubMed

Blank-Landeshammer, Bernhard; Kollipara, Laxmikanth; Biß, Karsten; Pfenninger, Markus; Malchow, Sebastian; Shuvaev, Konstantin; Zahedi, René P; Sickmann, Albert

2017-09-01

Complex mass spectrometry based proteomics data sets are mostly analyzed by protein database searches. While this approach performs considerably well for sequenced organisms, direct inference of peptide sequences from tandem mass spectra, i.e., de novo peptide sequencing, oftentimes is the only way to obtain information when protein databases are absent. However, available algorithms suffer from drawbacks such as lack of validation and often high rates of false positive hits (FP). Here we present a simple method of combining results from commonly available de novo peptide sequencing algorithms, which in conjunction with minor tweaks in data acquisition ensues lower empirical FDR compared to the analysis using single algorithms. Results were validated using state-of-the art database search algorithms as well specifically synthesized reference peptides. Thus, we could increase the number of PSMs meeting a stringent FDR of 5% more than 3-fold compared to the single best de novo sequencing algorithm alone, accounting for an average of 11 120 PSMs (combined) instead of 3476 PSMs (alone) in triplicate 2 h LC-MS runs of tryptic HeLa digestion.
Open-pNovo: De Novo Peptide Sequencing with Thousands of Protein Modifications.

PubMed

Yang, Hao; Chi, Hao; Zhou, Wen-Jing; Zeng, Wen-Feng; He, Kun; Liu, Chao; Sun, Rui-Xiang; He, Si-Min

2017-02-03

De novo peptide sequencing has improved remarkably, but sequencing full-length peptides with unexpected modifications is still a challenging problem. Here we present an open de novo sequencing tool, Open-pNovo, for de novo sequencing of peptides with arbitrary types of modifications. Although the search space increases by ∼300 times, Open-pNovo is close to or even ∼10-times faster than the other three proposed algorithms. Furthermore, considering top-1 candidates on three MS/MS data sets, Open-pNovo can recall over 90% of the results obtained by any one traditional algorithm and report 5-87% more peptides, including 14-250% more modified peptides. On a high-quality simulated data set, ∼85% peptides with arbitrary modifications can be recalled by Open-pNovo, while hardly any results can be recalled by others. In summary, Open-pNovo is an excellent tool for open de novo sequencing and has great potential for discovering unexpected modifications in the real biological applications.
Application of a fast sorting algorithm to the assignment of mass spectrometric cross-linking data.

PubMed

Petrotchenko, Evgeniy V; Borchers, Christoph H

2014-09-01

Cross-linking combined with MS involves enzymatic digestion of cross-linked proteins and identifying cross-linked peptides. Assignment of cross-linked peptide masses requires a search of all possible binary combinations of peptides from the cross-linked proteins' sequences, which becomes impractical with increasing complexity of the protein system and/or if digestion enzyme specificity is relaxed. Here, we describe the application of a fast sorting algorithm to search large sequence databases for cross-linked peptide assignments based on mass. This same algorithm has been used previously for assigning disulfide-bridged peptides (Choi et al., ), but has not previously been applied to cross-linking studies. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Engineering peptide ligase specificity by proteomic identification of ligation sites.

PubMed

Weeks, Amy M; Wells, James A

2018-01-01

Enzyme-catalyzed peptide ligation is a powerful tool for site-specific protein bioconjugation, but stringent enzyme-substrate specificity limits its utility. We developed an approach for comprehensively characterizing peptide ligase specificity for N termini using proteome-derived peptide libraries. We used this strategy to characterize the ligation efficiency for >25,000 enzyme-substrate pairs in the context of the engineered peptide ligase subtiligase and identified a family of 72 mutant subtiligases with activity toward N-terminal sequences that were previously recalcitrant to modification. We applied these mutants individually for site-specific bioconjugation of purified proteins, including antibodies, and in algorithmically selected combinations for sequencing of the cellular N terminome with reduced sequence bias. We also developed a web application to enable algorithmic selection of the most efficient subtiligase variant(s) for bioconjugation to user-defined sequences. Our methods provide a new toolbox of enzymes for site-specific protein modification and a general approach for rapidly defining and engineering peptide ligase specificity.
Introducing folding stability into the score function for computational design of RNA-binding peptides boosts the probability of success.

PubMed

Xiao, Xingqing; Agris, Paul F; Hall, Carol K

2016-05-01

A computational strategy that integrates our peptide search algorithm with atomistic molecular dynamics simulation was used to design rational peptide drugs that recognize and bind to the anticodon stem and loop domain (ASL(Lys3)) of human tRNAUUULys3 for the purpose of interrupting HIV replication. The score function of the search algorithm was improved by adding a peptide stability term weighted by an adjustable factor λ to the peptide binding free energy. The five best peptide sequences associated with five different values of λ were determined using the search algorithm and then input in atomistic simulations to examine the stability of the peptides' folded conformations and their ability to bind to ASL(Lys3). Simulation results demonstrated that setting an intermediate value of λ achieves a good balance between optimizing the peptide's binding ability and stabilizing its folded conformation during the sequence evolution process, and hence leads to optimal binding to the target ASL(Lys3). Thus, addition of a peptide stability term significantly improves the success rate for our peptide design search. © 2016 Wiley Periodicals, Inc.
A serendipitous survey of prediction algorithms for amyloidogenicity

PubMed Central

Roland, Bartholomew P.; Kodali, Ravindra; Mishra, Rakesh; Wetzel, Ronald

2014-01-01

SUMMARY The 17- amino acid N-terminal segment of the Huntingtin protein, httNT, grows into stable α-helix rich oligomeric aggregates when incubated under physiological conditions. We examined 15 scrambled sequence versions of an httNT peptide for their stabilities against aggregation in aqueous solution at low micromolar concentration and physiological conditions. Surprisingly, given their derivation from a sequence that readily assembles into highly stable α-helical aggregates that fail to convert into β-structure, we found that three of these scrambled peptides rapidly grow into amyloid-like fibrils, while two others also develop amyloid somewhat more slowly. The other 10 scrambled peptides do not detectibly form any aggregates after 100 hrs incubation under these conditions. We then analyzed these sequences using four previously described algorithms for predicting the tendencies of peptides to grow into amyloid or other β-aggregates. We found that these algorithms – Zyggregator, Tango, Waltz and Zipper – varied greatly in the number of sequences predicted to be amyloidogenic and in their abilities to correctly identify the amyloid forming members of scrambled peptide collection. The results are discussed in the context of a review of the sequence and structural factors currently thought to be important in determining amyloid formation kinetics and thermodynamics. PMID:23893755
In silico Derivation of HLA-Specific Alloreactivity Potential from Whole Exome Sequencing of Stem-Cell Transplant Donors and Recipients: Understanding the Quantitative Immunobiology of Allogeneic Transplantation

PubMed Central

Jameson-Lee, Max; Koparde, Vishal; Griffith, Phil; Scalora, Allison F.; Sampson, Juliana K.; Khalid, Haniya; Sheth, Nihar U.; Batalo, Michael; Serrano, Myrna G.; Roberts, Catherine H.; Hess, Michael L.; Buck, Gregory A.; Neale, Michael C.; Manjili, Masoud H.; Toor, Amir Ahmed

2014-01-01

Donor T-cell mediated graft versus host (GVH) effects may result from the aggregate alloreactivity to minor histocompatibility antigens (mHA) presented by the human leukocyte antigen (HLA) molecules in each donor–recipient pair undergoing stem-cell transplantation (SCT). Whole exome sequencing has previously demonstrated a large number of non-synonymous single nucleotide polymorphisms (SNP) present in HLA-matched recipients of SCT donors (GVH direction). The nucleotide sequence flanking each of these SNPs was obtained and the amino acid sequence determined. All the possible nonameric peptides incorporating the variant amino acid resulting from these SNPs were interrogated in silico for their likelihood to be presented by the HLA class I molecules using the Immune Epitope Database stabilized matrix method (SMM) and NetMHCpan algorithms. The SMM algorithm predicted that a median of 18,396 peptides weakly bound HLA class I molecules in individual SCT recipients, and 2,254 peptides displayed strong binding. A similar library of presented peptides was identified when the data were interrogated using the NetMHCpan algorithm. The bioinformatic algorithm presented here demonstrates that there may be a high level of mHA variation in HLA-matched individuals, constituting a HLA-specific alloreactivity potential. PMID:25414699
In silico Derivation of HLA-Specific Alloreactivity Potential from Whole Exome Sequencing of Stem-Cell Transplant Donors and Recipients: Understanding the Quantitative Immunobiology of Allogeneic Transplantation.

PubMed

Jameson-Lee, Max; Koparde, Vishal; Griffith, Phil; Scalora, Allison F; Sampson, Juliana K; Khalid, Haniya; Sheth, Nihar U; Batalo, Michael; Serrano, Myrna G; Roberts, Catherine H; Hess, Michael L; Buck, Gregory A; Neale, Michael C; Manjili, Masoud H; Toor, Amir Ahmed

2014-01-01

Donor T-cell mediated graft versus host (GVH) effects may result from the aggregate alloreactivity to minor histocompatibility antigens (mHA) presented by the human leukocyte antigen (HLA) molecules in each donor-recipient pair undergoing stem-cell transplantation (SCT). Whole exome sequencing has previously demonstrated a large number of non-synonymous single nucleotide polymorphisms (SNP) present in HLA-matched recipients of SCT donors (GVH direction). The nucleotide sequence flanking each of these SNPs was obtained and the amino acid sequence determined. All the possible nonameric peptides incorporating the variant amino acid resulting from these SNPs were interrogated in silico for their likelihood to be presented by the HLA class I molecules using the Immune Epitope Database stabilized matrix method (SMM) and NetMHCpan algorithms. The SMM algorithm predicted that a median of 18,396 peptides weakly bound HLA class I molecules in individual SCT recipients, and 2,254 peptides displayed strong binding. A similar library of presented peptides was identified when the data were interrogated using the NetMHCpan algorithm. The bioinformatic algorithm presented here demonstrates that there may be a high level of mHA variation in HLA-matched individuals, constituting a HLA-specific alloreactivity potential.
Overcoming Species Boundaries in Peptide Identification with Bayesian Information Criterion-driven Error-tolerant Peptide Search (BICEPS)*

PubMed Central

Renard, Bernhard Y.; Xu, Buote; Kirchner, Marc; Zickmann, Franziska; Winter, Dominic; Korten, Simone; Brattig, Norbert W.; Tzur, Amit; Hamprecht, Fred A.; Steen, Hanno

2012-01-01

Currently, the reliable identification of peptides and proteins is only feasible when thoroughly annotated sequence databases are available. Although sequencing capacities continue to grow, many organisms remain without reliable, fully annotated reference genomes required for proteomic analyses. Standard database search algorithms fail to identify peptides that are not exactly contained in a protein database. De novo searches are generally hindered by their restricted reliability, and current error-tolerant search strategies are limited by global, heuristic tradeoffs between database and spectral information. We propose a Bayesian information criterion-driven error-tolerant peptide search (BICEPS) and offer an open source implementation based on this statistical criterion to automatically balance the information of each single spectrum and the database, while limiting the run time. We show that BICEPS performs as well as current database search algorithms when such algorithms are applied to sequenced organisms, whereas BICEPS only uses a remotely related organism database. For instance, we use a chicken instead of a human database corresponding to an evolutionary distance of more than 300 million years (International Chicken Genome Sequencing Consortium (2004) Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature 432, 695–716). We demonstrate the successful application to cross-species proteomics with a 33% increase in the number of identified proteins for a filarial nematode sample of Litomosoides sigmodontis. PMID:22493179
UVnovo: A De Novo Sequencing Algorithm Using Single Series of Fragment Ions via Chromophore Tagging and 351 nm Ultraviolet Photodissociation Mass Spectrometry

PubMed Central

Robotham, Scott A.; Horton, Andrew P.; Cannon, Joe R.; Cotham, Victoria C.; Marcotte, Edward M.; Brodbelt, Jennifer S.

2016-01-01

De novo peptide sequencing by mass spectrometry represents an important strategy for characterizing novel peptides and proteins, in which a peptide’s amino acid sequence is inferred directly from the precursor peptide mass and tandem mass spectrum (MS/MS or MS3) fragment ions, without comparison to a reference proteome. This method is ideal for organisms or samples lacking a complete or well-annotated reference sequence set. One of the major barriers to de novo spectral interpretation arises from confusion of N- and C-terminal ion series due to the symmetry between b and y ion pairs created by collisional activation methods (or c, z ions for electron-based activation methods). This is known as the ‘antisymmetric path problem’ and leads to inverted amino acid subsequences within a de novo reconstruction. Here, we combine several key strategies for de novo peptide sequencing into a single high-throughput pipeline: high efficiency carbamylation blocks lysine side chains, and subsequent tryptic digestion and N-terminal peptide derivatization with the ultraviolet chromophore AMCA yields peptides susceptible to 351 nm ultraviolet photodissociation (UVPD). UVPD-MS/MS of the AMCA-modified peptides then predominantly produces y ions in the MS/MS spectra, specifically addressing the antisymmetric path problem. Finally, the program UVnovo applies a random forest algorithm to automatically learn from and then interpret UVPD mass spectra, passing results to a hidden Markov model for de novo sequence prediction and scoring. We show this combined strategy provides high performance de novo peptide sequencing, enabling the de novo sequencing of thousands of peptides from an E. coli lysate at high confidence. PMID:26938041
Incorporating sequence information into the scoring function: a hidden Markov model for improved peptide identification.

PubMed

Khatun, Jainab; Hamlett, Eric; Giddings, Morgan C

2008-03-01

The identification of peptides by tandem mass spectrometry (MS/MS) is a central method of proteomics research, but due to the complexity of MS/MS data and the large databases searched, the accuracy of peptide identification algorithms remains limited. To improve the accuracy of identification we applied a machine-learning approach using a hidden Markov model (HMM) to capture the complex and often subtle links between a peptide sequence and its MS/MS spectrum. Our model, HMM_Score, represents ion types as HMM states and calculates the maximum joint probability for a peptide/spectrum pair using emission probabilities from three factors: the amino acids adjacent to each fragmentation site, the mass dependence of ion types and the intensity dependence of ion types. The Viterbi algorithm is used to calculate the most probable assignment between ion types in a spectrum and a peptide sequence, then a correction factor is added to account for the propensity of the model to favor longer peptides. An expectation value is calculated based on the model score to assess the significance of each peptide/spectrum match. We trained and tested HMM_Score on three data sets generated by two different mass spectrometer types. For a reference data set recently reported in the literature and validated using seven identification algorithms, HMM_Score produced 43% more positive identification results at a 1% false positive rate than the best of two other commonly used algorithms, Mascot and X!Tandem. HMM_Score is a highly accurate platform for peptide identification that works well for a variety of mass spectrometer and biological sample types. The program is freely available on ProteomeCommons via an OpenSource license. See http://bioinfo.unc.edu/downloads/ for the download link.
A novel algorithm for validating peptide identification from a shotgun proteomics search engine.

PubMed

Jian, Ling; Niu, Xinnan; Xia, Zhonghang; Samir, Parimal; Sumanasekera, Chiranthani; Mu, Zheng; Jennings, Jennifer L; Hoek, Kristen L; Allos, Tara; Howard, Leigh M; Edwards, Kathryn M; Weil, P Anthony; Link, Andrew J

2013-03-01

Liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) has revolutionized the proteomics analysis of complexes, cells, and tissues. In a typical proteomic analysis, the tandem mass spectra from a LC-MS/MS experiment are assigned to a peptide by a search engine that compares the experimental MS/MS peptide data to theoretical peptide sequences in a protein database. The peptide spectra matches are then used to infer a list of identified proteins in the original sample. However, the search engines often fail to distinguish between correct and incorrect peptides assignments. In this study, we designed and implemented a novel algorithm called De-Noise to reduce the number of incorrect peptide matches and maximize the number of correct peptides at a fixed false discovery rate using a minimal number of scoring outputs from the SEQUEST search engine. The novel algorithm uses a three-step process: data cleaning, data refining through a SVM-based decision function, and a final data refining step based on proteolytic peptide patterns. Using proteomics data generated on different types of mass spectrometers, we optimized the De-Noise algorithm on the basis of the resolution and mass accuracy of the mass spectrometer employed in the LC-MS/MS experiment. Our results demonstrate De-Noise improves peptide identification compared to other methods used to process the peptide sequence matches assigned by SEQUEST. Because De-Noise uses a limited number of scoring attributes, it can be easily implemented with other search engines.
Multi-species Identification of Polymorphic Peptide Variants via Propagation in Spectral Networks*

PubMed Central

Bandeira, Nuno

2016-01-01

Peptide and protein identification remains challenging in organisms with poorly annotated or rapidly evolving genomes, as are commonly encountered in environmental or biofuels research. Such limitations render tandem mass spectrometry (MS/MS) database search algorithms ineffective as they lack corresponding sequences required for peptide-spectrum matching. We address this challenge with the spectral networks approach to (1) match spectra of orthologous peptides across multiple related species and then (2) propagate peptide annotations from identified to unidentified spectra. We here present algorithms to assess the statistical significance of spectral alignments (Align-GF), reduce the impurity in spectral networks, and accurately estimate the error rate in propagated identifications. Analyzing three related Cyanothece species, a model organism for biohydrogen production, spectral networks identified peptides from highly divergent sequences from networks with dozens of variant peptides, including thousands of peptides in species lacking a sequenced genome. Our analysis further detected the presence of many novel putative peptides even in genomically characterized species, thus suggesting the possibility of gaps in our understanding of their proteomic and genomic expression. A web-based pipeline for spectral networks analysis is available at http://proteomics.ucsd.edu/software. PMID:27609420
An FPGA Implementation to Detect Selective Cationic Antibacterial Peptides

PubMed Central

Polanco González, Carlos; Nuño Maganda, Marco Aurelio; Arias-Estrada, Miguel; del Rio, Gabriel

2011-01-01

Exhaustive prediction of physicochemical properties of peptide sequences is used in different areas of biological research. One example is the identification of selective cationic antibacterial peptides (SCAPs), which may be used in the treatment of different diseases. Due to the discrete nature of peptide sequences, the physicochemical properties calculation is considered a high-performance computing problem. A competitive solution for this class of problems is to embed algorithms into dedicated hardware. In the present work we present the adaptation, design and implementation of an algorithm for SCAPs prediction into a Field Programmable Gate Array (FPGA) platform. Four physicochemical properties codes useful in the identification of peptide sequences with potential selective antibacterial activity were implemented into an FPGA board. The speed-up gained in a single-copy implementation was up to 108 times compared with a single Intel processor cycle for cycle. The inherent scalability of our design allows for replication of this code into multiple FPGA cards and consequently improvements in speed are possible. Our results show the first embedded SCAPs prediction solution described and constitutes the grounds to efficiently perform the exhaustive analysis of the sequence-physicochemical properties relationship of peptides. PMID:21738652
TANDEM: matching proteins with tandem mass spectra.

PubMed

Craig, Robertson; Beavis, Ronald C

2004-06-12

Tandem mass spectra obtained from fragmenting peptide ions contain some peptide sequence specific information, but often there is not enough information to sequence the original peptide completely. Several proprietary software applications have been developed to attempt to match the spectra with a list of protein sequences that may contain the sequence of the peptide. The application TANDEM was written to provide the proteomics research community with a set of components that can be used to test new methods and algorithms for performing this type of sequence-to-data matching. The source code and binaries for this software are available at http://www.proteome.ca/opensource.html, for Windows, Linux and Macintosh OSX. The source code is made available under the Artistic License, from the authors.
High-throughput Database Search and Large-scale Negative Polarity Liquid Chromatography–Tandem Mass Spectrometry with Ultraviolet Photodissociation for Complex Proteomic Samples*

PubMed Central

Madsen, James A.; Xu, Hua; Robinson, Michelle R.; Horton, Andrew P.; Shaw, Jared B.; Giles, David K.; Kaoud, Tamer S.; Dalby, Kevin N.; Trent, M. Stephen; Brodbelt, Jennifer S.

2013-01-01

The use of ultraviolet photodissociation (UVPD) for the activation and dissociation of peptide anions is evaluated for broader coverage of the proteome. To facilitate interpretation and assignment of the resulting UVPD mass spectra of peptide anions, the MassMatrix database search algorithm was modified to allow automated analysis of negative polarity MS/MS spectra. The new UVPD algorithms were developed based on the MassMatrix database search engine by adding specific fragmentation pathways for UVPD. The new UVPD fragmentation pathways in MassMatrix were rigorously and statistically optimized using two large data sets with high mass accuracy and high mass resolution for both MS1 and MS2 data acquired on an Orbitrap mass spectrometer for complex Halobacterium and HeLa proteome samples. Negative mode UVPD led to the identification of 3663 and 2350 peptides for the Halo and HeLa tryptic digests, respectively, corresponding to 655 and 645 peptides that were unique when compared with electron transfer dissociation (ETD), higher energy collision-induced dissociation, and collision-induced dissociation results for the same digests analyzed in the positive mode. In sum, 805 and 619 proteins were identified via UVPD for the Halobacterium and HeLa samples, respectively, with 49 and 50 unique proteins identified in contrast to the more conventional MS/MS methods. The algorithm also features automated charge determination for low mass accuracy data, precursor filtering (including intact charge-reduced peaks), and the ability to combine both positive and negative MS/MS spectra into a single search, and it is freely open to the public. The accuracy and specificity of the MassMatrix UVPD search algorithm was also assessed for low resolution, low mass accuracy data on a linear ion trap. Analysis of a known mixture of three mitogen-activated kinases yielded similar sequence coverage percentages for UVPD of peptide anions versus conventional collision-induced dissociation of peptide cations, and when these methods were combined into a single search, an increase of up to 13% sequence coverage was observed for the kinases. The ability to sequence peptide anions and cations in alternating scans in the same chromatographic run was also demonstrated. Because ETD has a significant bias toward identifying highly basic peptides, negative UVPD was used to improve the identification of the more acidic peptides in conjunction with positive ETD for the more basic species. In this case, tryptic peptides from the cytosolic section of HeLa cells were analyzed by polarity switching nanoLC-MS/MS utilizing ETD for cation sequencing and UVPD for anion sequencing. Relative to searching using ETD alone, positive/negative polarity switching significantly improved sequence coverages across identified proteins, resulting in a 33% increase in unique peptide identifications and more than twice the number of peptide spectral matches. PMID:23695934
Two-level QSAR network (2L-QSAR) for peptide inhibitor design based on amino acid properties and sequence positions.

PubMed

Du, Q S; Ma, Y; Xie, N Z; Huang, R B

2014-01-01

In the design of peptide inhibitors the huge possible variety of the peptide sequences is of high concern. In collaboration with the fast accumulation of the peptide experimental data and database, a statistical method is suggested for peptide inhibitor design. In the two-level peptide prediction network (2L-QSAR) one level is the physicochemical properties of amino acids and the other level is the peptide sequence position. The activity contributions of amino acids are the functions of physicochemical properties and the sequence positions. In the prediction equation two weight coefficient sets {ak} and {bl} are assigned to the physicochemical properties and to the sequence positions, respectively. After the two coefficient sets are optimized based on the experimental data of known peptide inhibitors using the iterative double least square (IDLS) procedure, the coefficients are used to evaluate the bioactivities of new designed peptide inhibitors. The two-level prediction network can be applied to the peptide inhibitor design that may aim for different target proteins, or different positions of a protein. A notable advantage of the two-level statistical algorithm is that there is no need for host protein structural information. It may also provide useful insight into the amino acid properties and the roles of sequence positions.
Discovery of phosphorylation motif mixtures in phosphoproteomics data

PubMed Central

Ritz, Anna; Shakhnarovich, Gregory; Salomon, Arthur R.; Raphael, Benjamin J.

2009-01-01

Motivation: Modification of proteins via phosphorylation is a primary mechanism for signal transduction in cells. Phosphorylation sites on proteins are determined in part through particular patterns, or motifs, present in the amino acid sequence. Results: We describe an algorithm that simultaneously discovers multiple motifs in a set of peptides that were phosphorylated by several different kinases. Such sets of peptides are routinely produced in proteomics experiments.Our motif-finding algorithm uses the principle of minimum description length to determine a mixture of sequence motifs that distinguish a foreground set of phosphopeptides from a background set of unphosphorylated peptides. We show that our algorithm outperforms existing motif-finding algorithms on synthetic datasets consisting of mixtures of known phosphorylation sites. We also derive a motif specificity score that quantifies whether or not the phosphoproteins containing an instance of a motif have a significant number of known interactions. Application of our motif-finding algorithm to recently published human and mouse proteomic studies recovers several known phosphorylation motifs and reveals a number of novel motifs that are enriched for interactions with a particular kinase or phosphatase. Our tools provide a new approach for uncovering the sequence specificities of uncharacterized kinases or phosphatases. Availability: Software is available at http:/cs.brown.edu/people/braphael/software.html. Contact: aritz@cs.brown.edu; braphael@cs.brown.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:18996944
MRUniNovo: an efficient tool for de novo peptide sequencing utilizing the hadoop distributed computing framework.

PubMed

Li, Chuang; Chen, Tao; He, Qiang; Zhu, Yunping; Li, Kenli

2017-03-15

Tandem mass spectrometry-based de novo peptide sequencing is a complex and time-consuming process. The current algorithms for de novo peptide sequencing cannot rapidly and thoroughly process large mass spectrometry datasets. In this paper, we propose MRUniNovo, a novel tool for parallel de novo peptide sequencing. MRUniNovo parallelizes UniNovo based on the Hadoop compute platform. Our experimental results demonstrate that MRUniNovo significantly reduces the computation time of de novo peptide sequencing without sacrificing the correctness and accuracy of the results, and thus can process very large datasets that UniNovo cannot. MRUniNovo is an open source software tool implemented in java. The source code and the parameter settings are available at http://bioinfo.hupo.org.cn/MRUniNovo/index.php. s131020002@hnu.edu.cn ; taochen1019@163.com. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Predicting intensity ranks of peptide fragment ions.

PubMed

Frank, Ari M

2009-05-01

Accurate modeling of peptide fragmentation is necessary for the development of robust scoring functions for peptide-spectrum matches, which are the cornerstone of MS/MS-based identification algorithms. Unfortunately, peptide fragmentation is a complex process that can involve several competing chemical pathways, which makes it difficult to develop generative probabilistic models that describe it accurately. However, the vast amounts of MS/MS data being generated now make it possible to use data-driven machine learning methods to develop discriminative ranking-based models that predict the intensity ranks of a peptide's fragment ions. We use simple sequence-based features that get combined by a boosting algorithm into models that make peak rank predictions with high accuracy. In an accompanying manuscript, we demonstrate how these prediction models are used to significantly improve the performance of peptide identification algorithms. The models can also be useful in the design of optimal multiple reaction monitoring (MRM) transitions, in cases where there is insufficient experimental data to guide the peak selection process. The prediction algorithm can also be run independently through PepNovo+, which is available for download from http://bix.ucsd.edu/Software/PepNovo.html.

Predicting Intensity Ranks of Peptide Fragment Ions

PubMed Central

Frank, Ari M.

2009-01-01

Accurate modeling of peptide fragmentation is necessary for the development of robust scoring functions for peptide-spectrum matches, which are the cornerstone of MS/MS-based identification algorithms. Unfortunately, peptide fragmentation is a complex process that can involve several competing chemical pathways, which makes it difficult to develop generative probabilistic models that describe it accurately. However, the vast amounts of MS/MS data being generated now make it possible to use data-driven machine learning methods to develop discriminative ranking-based models that predict the intensity ranks of a peptide's fragment ions. We use simple sequence-based features that get combined by a boosting algorithm in to models that make peak rank predictions with high accuracy. In an accompanying manuscript, we demonstrate how these prediction models are used to significantly improve the performance of peptide identification algorithms. The models can also be useful in the design of optimal MRM transitions, in cases where there is insufficient experimental data to guide the peak selection process. The prediction algorithm can also be run independently through PepNovo+, which is available for download from http://bix.ucsd.edu/Software/PepNovo.html. PMID:19256476
MassSieve: Panning MS/MS peptide data for proteins

PubMed Central

Slotta, Douglas J.; McFarland, Melinda A.; Markey, Sanford P.

2010-01-01

We present MassSieve, a Java-based platform for visualization and parsimony analysis of single and comparative LC-MS/MS database search engine results. The success of mass spectrometric peptide sequence assignment algorithms has led to the need for a tool to merge and evaluate the increasing data set sizes that result from LC-MS/MS-based shotgun proteomic experiments. MassSieve supports reports from multiple search engines with differing search characteristics, which can increase peptide sequence coverage and/or identify conflicting or ambiguous spectral assignments. PMID:20564260
Sequence homology between HLA-bound cytomegalovirus and human peptides: A potential trigger for alloreactivity

PubMed Central

Koparde, Vishal N.; Jameson-Lee, Maximilian; Elnasseh, Abdelrhman G.; Scalora, Allison F.; Kobulnicky, David J.; Serrano, Myrna G.; Roberts, Catherine H.; Buck, Gregory A.; Neale, Michael C.; Nixon, Daniel E.; Toor, Amir A.

2017-01-01

Human cytomegalovirus (hCMV) reactivation may often coincide with the development of graft-versus-host-disease (GVHD) in stem cell transplantation (SCT). Seventy seven SCT donor-recipient pairs (DRP) (HLA matched unrelated donor (MUD), n = 50; matched related donor (MRD), n = 27) underwent whole exome sequencing to identify single nucleotide polymorphisms (SNPs) generating alloreactive peptide libraries for each DRP (9-mer peptide-HLA complexes); Human CMV CROSS (Cross-Reactive Open Source Sequence) database was compiled from NCBI; HLA class I binding affinity for each DRPs HLA was calculated by NetMHCpan 2.8 and hCMV- derived 9-mers algorithmically compared to the alloreactive peptide-HLA complex libraries. Short consecutive (≥6) amino acid (AA) sequence homology matching hCMV to recipient peptides was considered for HLA-bound-peptide (IC50<500nM) cross reactivity. Of the 70,686 hCMV 9-mers contained within the hCMV CROSS database, an average of 29,658 matched the MRD DRP alloreactive peptides and 52,910 matched MUD DRP peptides (p<0.001). In silico analysis revealed multiple high affinity, immunogenic CMV-Human peptide matches (IC50<500 nM) expressed in GVHD-affected tissue-specific manner. hCMV+GVHD was found in 18 patients, 13 developing hCMV viremia before GVHD onset. Analysis of patients with GVHD identified potential cross reactive peptide expression within affected organs. We propose that hCMV peptide sequence homology with human alloreactive peptides may contribute to the pathophysiology of GVHD. PMID:28800601
Sequence homology between HLA-bound cytomegalovirus and human peptides: A potential trigger for alloreactivity.

PubMed

Hall, Charles E; Koparde, Vishal N; Jameson-Lee, Maximilian; Elnasseh, Abdelrhman G; Scalora, Allison F; Kobulnicky, David J; Serrano, Myrna G; Roberts, Catherine H; Buck, Gregory A; Neale, Michael C; Nixon, Daniel E; Toor, Amir A

2017-01-01

Human cytomegalovirus (hCMV) reactivation may often coincide with the development of graft-versus-host-disease (GVHD) in stem cell transplantation (SCT). Seventy seven SCT donor-recipient pairs (DRP) (HLA matched unrelated donor (MUD), n = 50; matched related donor (MRD), n = 27) underwent whole exome sequencing to identify single nucleotide polymorphisms (SNPs) generating alloreactive peptide libraries for each DRP (9-mer peptide-HLA complexes); Human CMV CROSS (Cross-Reactive Open Source Sequence) database was compiled from NCBI; HLA class I binding affinity for each DRPs HLA was calculated by NetMHCpan 2.8 and hCMV- derived 9-mers algorithmically compared to the alloreactive peptide-HLA complex libraries. Short consecutive (≥6) amino acid (AA) sequence homology matching hCMV to recipient peptides was considered for HLA-bound-peptide (IC50<500nM) cross reactivity. Of the 70,686 hCMV 9-mers contained within the hCMV CROSS database, an average of 29,658 matched the MRD DRP alloreactive peptides and 52,910 matched MUD DRP peptides (p<0.001). In silico analysis revealed multiple high affinity, immunogenic CMV-Human peptide matches (IC50<500 nM) expressed in GVHD-affected tissue-specific manner. hCMV+GVHD was found in 18 patients, 13 developing hCMV viremia before GVHD onset. Analysis of patients with GVHD identified potential cross reactive peptide expression within affected organs. We propose that hCMV peptide sequence homology with human alloreactive peptides may contribute to the pathophysiology of GVHD.
Concurrent Automated Sequencing of the Glycan and Peptide Portions of O-Linked Glycopeptide Anions by Ultraviolet Photodissociation Mass Spectrometry

PubMed Central

Madsen, James A.; Ko, Byoung Joon; Xu, Hua; Iwashkiw, Jeremy A.; Robotham, Scott A.; Shaw, Jared B.; Feldman, Mario F.; Brodbelt, Jennifer S.

2013-01-01

O -glycopeptides are often acidic owing to the frequent occurrence of acidic saccharides in the glycan, rendering traditional proteomic workflows that rely on positive mode tandem mass spectrometry (MS/MS) less effective. In this report, we demonstrate the utility of negative mode ultraviolet photodissociation (UVPD) MS for the characterization of acidic O-linked glycopeptide anions. This method was evaluated for a series of singly- and multiply-deprotonated glycopeptides from the model glycoprotein kappa casein, resulting in production of both peptide and glycan product ions that afforded 100% sequence coverage of the peptide and glycan moieties from a single MS/MS event. The most abundant and frequent peptide sequence ions were a/x-type products, which, importantly, were found to retain the labile glycan modifications. The glycan-specific ions mainly arose from glycosidic bond cleavages (B, Y, C, and Z ions) in addition to some less common cross-ring cleavages. Based on the UVPD fragmentation patterns, an automated database searching strategy (based on the MassMatrix algorithm) was designed that is specific for the analysis of glycopeptide anions by UVPD. This algorithm was used to identify glycopeptides from mixtures of glycosylated and non-glycosylated peptides, sequence both glycan and peptide moieties simultaneously, and pinpoint the correct site(s) of glycosylation. This methodology was applied to uncover novel site-specificity of the O-linked glycosylated OmpA/MotB from the “superbug” A. baumannii to help aid in the elucidation of the functional role that protein glycosylation plays in pathogenesis. PMID:24006841
PinaColada: peptide-inhibitor ant colony ad-hoc design algorithm.

PubMed

Zaidman, Daniel; Wolfson, Haim J

2016-08-01

Design of protein-protein interaction (PPI) inhibitors is a major challenge in Structural Bioinformatics. Peptides, especially short ones (5-15 amino acid long), are natural candidates for inhibition of protein-protein complexes due to several attractive features such as high structural compatibility with the protein binding site (mimicking the surface of one of the proteins), small size and the ability to form strong hotspot binding connections with the protein surface. Efficient rational peptide design is still a major challenge in computer aided drug design, due to the huge space of possible sequences, which is exponential in the length of the peptide, and the high flexibility of peptide conformations. In this article we present PinaColada, a novel computational method for the design of peptide inhibitors for protein-protein interactions. We employ a version of the ant colony optimization heuristic, which is used to explore the exponential space ([Formula: see text]) of length n peptide sequences, in combination with our fast robotics motivated PepCrawler algorithm, which explores the conformational space for each candidate sequence. PinaColada is being run in parallel, on a DELL PowerEdge 2.8 GHZ computer with 20 cores and 256 GB memory, and takes up to 24 h to design a peptide of 5-15 amino acids length. An online server available at: http://bioinfo3d.cs.tau.ac.il/PinaColada/. danielza@post.tau.ac.il; wolfson@tau.ac.il. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Computationally assisted screening and design of cell-interactive peptides by a cell-based assay using peptide arrays and a fuzzy neural network algorithm.

PubMed

Kaga, Chiaki; Okochi, Mina; Tomita, Yasuyuki; Kato, Ryuji; Honda, Hiroyuki

2008-03-01

We developed a method of effective peptide screening that combines experiments and computational analysis. The method is based on the concept that screening efficiency can be enhanced from even limited data by use of a model derived from computational analysis that serves as a guide to screening and combining the model with subsequent repeated experiments. Here we focus on cell-adhesion peptides as a model application of this peptide-screening strategy. Cell-adhesion peptides were screened by use of a cell-based assay of a peptide array. Starting with the screening data obtained from a limited, random 5-mer library (643 sequences), a rule regarding structural characteristics of cell-adhesion peptides was extracted by fuzzy neural network (FNN) analysis. According to this rule, peptides with unfavored residues in certain positions that led to inefficient binding were eliminated from the random sequences. In the restricted, second random library (273 sequences), the yield of cell-adhesion peptides having an adhesion rate more than 1.5-fold to that of the basal array support was significantly high (31%) compared with the unrestricted random library (20%). In the restricted third library (50 sequences), the yield of cell-adhesion peptides increased to 84%. We conclude that a repeated cycle of experiments screening limited numbers of peptides can be assisted by the rule-extracting feature of FNN.
Gapped Spectral Dictionaries and Their Applications for Database Searches of Tandem Mass Spectra*

PubMed Central

Jeong, Kyowon; Kim, Sangtae; Bandeira, Nuno; Pevzner, Pavel A.

2011-01-01

Generating all plausible de novo interpretations of a peptide tandem mass (MS/MS) spectrum (Spectral Dictionary) and quickly matching them against the database represent a recently emerged alternative approach to peptide identification. However, the sizes of the Spectral Dictionaries quickly grow with the peptide length making their generation impractical for long peptides. We introduce Gapped Spectral Dictionaries (all plausible de novo interpretations with gaps) that can be easily generated for any peptide length thus addressing the limitation of the Spectral Dictionary approach. We show that Gapped Spectral Dictionaries are small thus opening a possibility of using them to speed-up MS/MS searches. Our MS-GappedDictionary algorithm (based on Gapped Spectral Dictionaries) enables proteogenomics applications (such as searches in the six-frame translation of the human genome) that are prohibitively time consuming with existing approaches. MS-GappedDictionary generates gapped peptides that occupy a niche between accurate but short peptide sequence tags and long but inaccurate full length peptide reconstructions. We show that, contrary to conventional wisdom, some high-quality spectra do not have good peptide sequence tags and introduce gapped tags that have advantages over the conventional peptide sequence tags in MS/MS database searches. PMID:21444829
Large-scale database searching using tandem mass spectra: looking up the answer in the back of the book.

PubMed

Sadygov, Rovshan G; Cociorva, Daniel; Yates, John R

2004-12-01

Database searching is an essential element of large-scale proteomics. Because these methods are widely used, it is important to understand the rationale of the algorithms. Most algorithms are based on concepts first developed in SEQUEST and PeptideSearch. Four basic approaches are used to determine a match between a spectrum and sequence: descriptive, interpretative, stochastic and probability-based matching. We review the basic concepts used by most search algorithms, the computational modeling of peptide identification and current challenges and limitations of this approach for protein identification.
De novo peptide sequencing using CID and HCD spectra pairs.

PubMed

Yan, Yan; Kusalik, Anthony J; Wu, Fang-Xiang

2016-10-01

In tandem mass spectrometry (MS/MS), there are several different fragmentation techniques possible, including, collision-induced dissociation (CID) higher energy collisional dissociation (HCD), electron-capture dissociation (ECD), and electron transfer dissociation (ETD). When using pairs of spectra for de novo peptide sequencing, the most popular methods are designed for CID (or HCD) and ECD (or ETD) spectra because of the complementarity between them. Less attention has been paid to the use of CID and HCD spectra pairs. In this study, a new de novo peptide sequencing method is proposed for these spectra pairs. This method includes a CID and HCD spectra merging criterion and a parent mass correction step, along with improvements to our previously proposed algorithm for sequencing merged spectra. Three pairs of spectral datasets were used to investigate and compare the performance of the proposed method with other existing methods designed for single spectrum (HCD or CID) sequencing. Experimental results showed that full-length peptide sequencing accuracy was increased significantly by using spectra pairs in the proposed method, with the highest accuracy reaching 81.31%. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
UniNovo: a universal tool for de novo peptide sequencing.

PubMed

Jeong, Kyowon; Kim, Sangtae; Pevzner, Pavel A

2013-08-15

Mass spectrometry (MS) instruments and experimental protocols are rapidly advancing, but de novo peptide sequencing algorithms to analyze tandem mass (MS/MS) spectra are lagging behind. Although existing de novo sequencing tools perform well on certain types of spectra [e.g. Collision Induced Dissociation (CID) spectra of tryptic peptides], their performance often deteriorates on other types of spectra, such as Electron Transfer Dissociation (ETD), Higher-energy Collisional Dissociation (HCD) spectra or spectra of non-tryptic digests. Thus, rather than developing a new algorithm for each type of spectra, we develop a universal de novo sequencing algorithm called UniNovo that works well for all types of spectra or even for spectral pairs (e.g. CID/ETD spectral pairs). UniNovo uses an improved scoring function that captures the dependences between different ion types, where such dependencies are learned automatically using a modified offset frequency function. The performance of UniNovo is compared with PepNovo+, PEAKS and pNovo using various types of spectra. The results show that the performance of UniNovo is superior to other tools for ETD spectra and superior or comparable with others for CID and HCD spectra. UniNovo also estimates the probability that each reported reconstruction is correct, using simple statistics that are readily obtained from a small training dataset. We demonstrate that the estimation is accurate for all tested types of spectra (including CID, HCD, ETD, CID/ETD and HCD/ETD spectra of trypsin, LysC or AspN digested peptides). UniNovo is implemented in JAVA and tested on Windows, Ubuntu and OS X machines. UniNovo is available at http://proteomics.ucsd.edu/Software/UniNovo.html along with the manual.
Binomial probability distribution model-based protein identification algorithm for tandem mass spectrometry utilizing peak intensity information.

PubMed

Xiao, Chuan-Le; Chen, Xiao-Zhou; Du, Yang-Li; Sun, Xuesong; Zhang, Gong; He, Qing-Yu

2013-01-04

Mass spectrometry has become one of the most important technologies in proteomic analysis. Tandem mass spectrometry (LC-MS/MS) is a major tool for the analysis of peptide mixtures from protein samples. The key step of MS data processing is the identification of peptides from experimental spectra by searching public sequence databases. Although a number of algorithms to identify peptides from MS/MS data have been already proposed, e.g. Sequest, OMSSA, X!Tandem, Mascot, etc., they are mainly based on statistical models considering only peak-matches between experimental and theoretical spectra, but not peak intensity information. Moreover, different algorithms gave different results from the same MS data, implying their probable incompleteness and questionable reproducibility. We developed a novel peptide identification algorithm, ProVerB, based on a binomial probability distribution model of protein tandem mass spectrometry combined with a new scoring function, making full use of peak intensity information and, thus, enhancing the ability of identification. Compared with Mascot, Sequest, and SQID, ProVerB identified significantly more peptides from LC-MS/MS data sets than the current algorithms at 1% False Discovery Rate (FDR) and provided more confident peptide identifications. ProVerB is also compatible with various platforms and experimental data sets, showing its robustness and versatility. The open-source program ProVerB is available at http://bioinformatics.jnu.edu.cn/software/proverb/ .
Discovery of novel antimicrobial peptides: A transcriptomic study of the sea anemone Cnidopus japonicus.

PubMed

Grafskaia, Ekaterina N; Polina, Nadezhda F; Babenko, Vladislav V; Kharlampieva, Daria D; Bobrovsky, Pavel A; Manuvera, Valentin A; Farafonova, Tatyana E; Anikanov, Nikolay A; Lazarev, Vassili N

2018-04-01

As essential conservative component of the innate immune systems of living organisms, antimicrobial peptides (AMPs) could complement pharmaceuticals that increasingly fail to combat various pathogens exhibiting increased resistance to microbial antibiotics. Among the properties of AMPs that suggest their potential as therapeutic agents, diverse peptides in the venoms of various predators demonstrate antimicrobial activity and kill a wide range of microorganisms. To identify potent AMPs, the study reported here involved a transcriptomic profiling of the tentacle secretion of the sea anemone Cnidopus japonicus. An in silico search algorithm designed to discover toxin-like proteins containing AMPs was developed based on the evaluation of the properties and structural peculiarities of amino acid sequences. The algorithm revealed new proteins of the anemone containing antimicrobial candidate sequences, and 10 AMPs verified using high-throughput proteomics were synthesized. The antimicrobial activity of the candidate molecules was experimentally estimated against Gram-positive and -negative bacteria. Ultimately, three peptides exhibited antimicrobial activity against bacterial strains, which suggests that the method can be applied to reveal new AMPs in the venoms of other predators as well.
Development of a strategy and computational application to select candidate protein analogues with reduced HLA binding and immunogenicity.

PubMed

Dhanda, Sandeep Kumar; Grifoni, Alba; Pham, John; Vaughan, Kerrie; Sidney, John; Peters, Bjoern; Sette, Alessandro

2018-01-01

Unwanted immune responses against protein therapeutics can reduce efficacy or lead to adverse reactions. T-cell responses are key in the development of such responses, and are directed against immunodominant regions within the protein sequence, often associated with binding to several allelic variants of HLA class II molecules (promiscuous binders). Herein, we report a novel computational strategy to predict 'de-immunized' peptides, based on previous studies of erythropoietin protein immunogenicity. This algorithm (or method) first predicts promiscuous binding regions within the target protein sequence and then identifies residue substitutions predicted to reduce HLA binding. Further, this method anticipates the effect of any given substitution on flanking peptides, thereby circumventing the creation of nascent HLA-binding regions. As a proof-of-principle, the algorithm was applied to Vatreptacog α, an engineered Factor VII molecule associated with unintended immunogenicity. The algorithm correctly predicted the two immunogenic peptides containing the engineered residues. As a further validation, we selected and evaluated the immunogenicity of seven substitutions predicted to simultaneously reduce HLA binding for both peptides, five control substitutions with no predicted reduction in HLA-binding capacity, and additional flanking region controls. In vitro immunogenicity was detected in 21·4% of the cultures of peptides predicted to have reduced HLA binding and 11·4% of the flanking regions, compared with 46% for the cultures of the peptides predicted to be immunogenic. This method has been implemented as an interactive application, freely available online at http://tools.iedb.org/deimmunization/. © 2017 John Wiley & Sons Ltd.
A motif detection and classification method for peptide sequences using genetic programming.

PubMed

Tomita, Yasuyuki; Kato, Ryuji; Okochi, Mina; Honda, Hiroyuki

2008-08-01

An exploration of common rules (property motifs) in amino acid sequences has been required for the design of novel sequences and elucidation of the interactions between molecules controlled by the structural or physical environment. In the present study, we developed a new method to search property motifs that are common in peptide sequence data. Our method comprises the following two characteristics: (i) the automatic determination of the position and length of common property motifs by calculating the physicochemical similarity of amino acids, and (ii) the quick and effective exploration of motif candidates that discriminates the positives and negatives by the introduction of genetic programming (GP). Our method was evaluated by two types of model data sets. First, the intentionally buried property motifs were searched in the artificially derived peptide data containing intentionally buried property motifs. As a result, the expected property motifs were correctly extracted by our algorithm. Second, the peptide data that interact with MHC class II molecules were analyzed as one of the models of biologically active peptides with buried motifs in various lengths. Twofold MHC class II binding peptides were identified with the rule using our method, compared to the existing scoring matrix method. In conclusion, our GP based motif searching approach enabled to obtain knowledge of functional aspects of the peptides without any prior knowledge.
Comparative study of classification algorithms for immunosignaturing data

PubMed Central

2012-01-01

Background High-throughput technologies such as DNA, RNA, protein, antibody and peptide microarrays are often used to examine differences across drug treatments, diseases, transgenic animals, and others. Typically one trains a classification system by gathering large amounts of probe-level data, selecting informative features, and classifies test samples using a small number of features. As new microarrays are invented, classification systems that worked well for other array types may not be ideal. Expression microarrays, arguably one of the most prevalent array types, have been used for years to help develop classification algorithms. Many biological assumptions are built into classifiers that were designed for these types of data. One of the more problematic is the assumption of independence, both at the probe level and again at the biological level. Probes for RNA transcripts are designed to bind single transcripts. At the biological level, many genes have dependencies across transcriptional pathways where co-regulation of transcriptional units may make many genes appear as being completely dependent. Thus, algorithms that perform well for gene expression data may not be suitable when other technologies with different binding characteristics exist. The immunosignaturing microarray is based on complex mixtures of antibodies binding to arrays of random sequence peptides. It relies on many-to-many binding of antibodies to the random sequence peptides. Each peptide can bind multiple antibodies and each antibody can bind multiple peptides. This technology has been shown to be highly reproducible and appears promising for diagnosing a variety of disease states. However, it is not clear what is the optimal classification algorithm for analyzing this new type of data. Results We characterized several classification algorithms to analyze immunosignaturing data. We selected several datasets that range from easy to difficult to classify, from simple monoclonal binding to complex binding patterns in asthma patients. We then classified the biological samples using 17 different classification algorithms. Using a wide variety of assessment criteria, we found ‘Naïve Bayes’ far more useful than other widely used methods due to its simplicity, robustness, speed and accuracy. Conclusions ‘Naïve Bayes’ algorithm appears to accommodate the complex patterns hidden within multilayered immunosignaturing microarray data due to its fundamental mathematical properties. PMID:22720696
Enhanced Prediction of Src Homology 2 (SH2) Domain Binding Potentials Using a Fluorescence Polarization-derived c-Met, c-Kit, ErbB, and Androgen Receptor Interactome*

PubMed Central

Leung, Kin K.; Hause, Ronald J.; Barkinge, John L.; Ciaccio, Mark F.; Chuu, Chih-Pin; Jones, Richard B.

2014-01-01

Many human diseases are associated with aberrant regulation of phosphoprotein signaling networks. Src homology 2 (SH2) domains represent the major class of protein domains in metazoans that interact with proteins phosphorylated on the amino acid residue tyrosine. Although current SH2 domain prediction algorithms perform well at predicting the sequences of phosphorylated peptides that are likely to result in the highest possible interaction affinity in the context of random peptide library screens, these algorithms do poorly at predicting the interaction potential of SH2 domains with physiologically derived protein sequences. We employed a high throughput interaction assay system to empirically determine the affinity between 93 human SH2 domains and phosphopeptides abstracted from several receptor tyrosine kinases and signaling proteins. The resulting interaction experiments revealed over 1000 novel peptide-protein interactions and provided a glimpse into the common and specific interaction potentials of c-Met, c-Kit, GAB1, and the human androgen receptor. We used these data to build a permutation-based logistic regression classifier that performed considerably better than existing algorithms for predicting the interaction potential of several SH2 domains. PMID:24728074
Database-independent Protein Sequencing (DiPS) Enables Full-length de Novo Protein and Antibody Sequence Determination.

PubMed

Savidor, Alon; Barzilay, Rotem; Elinger, Dalia; Yarden, Yosef; Lindzen, Moshit; Gabashvili, Alexandra; Adiv Tal, Ophir; Levin, Yishai

2017-06-01

Traditional "bottom-up" proteomic approaches use proteolytic digestion, LC-MS/MS, and database searching to elucidate peptide identities and their parent proteins. Protein sequences absent from the database cannot be identified, and even if present in the database, complete sequence coverage is rarely achieved even for the most abundant proteins in the sample. Thus, sequencing of unknown proteins such as antibodies or constituents of metaproteomes remains a challenging problem. To date, there is no available method for full-length protein sequencing, independent of a reference database, in high throughput. Here, we present Database-independent Protein Sequencing, a method for unambiguous, rapid, database-independent, full-length protein sequencing. The method is a novel combination of non-enzymatic, semi-random cleavage of the protein, LC-MS/MS analysis, peptide de novo sequencing, extraction of peptide tags, and their assembly into a consensus sequence using an algorithm named "Peptide Tag Assembler." As proof-of-concept, the method was applied to samples of three known proteins representing three size classes and to a previously un-sequenced, clinically relevant monoclonal antibody. Excluding leucine/isoleucine and glutamic acid/deamidated glutamine ambiguities, end-to-end full-length de novo sequencing was achieved with 99-100% accuracy for all benchmarking proteins and the antibody light chain. Accuracy of the sequenced antibody heavy chain, including the entire variable region, was also 100%, but there was a 23-residue gap in the constant region sequence. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
De novo protein sequencing by combining top-down and bottom-up tandem mass spectra.

PubMed

Liu, Xiaowen; Dekker, Lennard J M; Wu, Si; Vanduijn, Martijn M; Luider, Theo M; Tolić, Nikola; Kou, Qiang; Dvorkin, Mikhail; Alexandrova, Sonya; Vyatkina, Kira; Paša-Tolić, Ljiljana; Pevzner, Pavel A

2014-07-03

There are two approaches for de novo protein sequencing: Edman degradation and mass spectrometry (MS). Existing MS-based methods characterize a novel protein by assembling tandem mass spectra of overlapping peptides generated from multiple proteolytic digestions of the protein. Because each tandem mass spectrum covers only a short peptide of the target protein, the key to high coverage protein sequencing is to find spectral pairs from overlapping peptides in order to assemble tandem mass spectra to long ones. However, overlapping regions of peptides may be too short to be confidently identified. High-resolution mass spectrometers have become accessible to many laboratories. These mass spectrometers are capable of analyzing molecules of large mass values, boosting the development of top-down MS. Top-down tandem mass spectra cover whole proteins. However, top-down tandem mass spectra, even combined, rarely provide full ion fragmentation coverage of a protein. We propose an algorithm, TBNovo, for de novo protein sequencing by combining top-down and bottom-up MS. In TBNovo, a top-down tandem mass spectrum is utilized as a scaffold, and bottom-up tandem mass spectra are aligned to the scaffold to increase sequence coverage. Experiments on data sets of two proteins showed that TBNovo achieved high sequence coverage and high sequence accuracy.
The utility and limitations of current web-available algorithms to predict peptides recognized by CD4 T cells in response to pathogen infection #

PubMed Central

Chaves, Francisco A.; Lee, Alvin H.; Nayak, Jennifer; Richards, Katherine A.; Sant, Andrea J.

2012-01-01

The ability to track CD4 T cells elicited in response to pathogen infection or vaccination is critical because of the role these cells play in protective immunity. Coupled with advances in genome sequencing of pathogenic organisms, there is considerable appeal for implementation of computer-based algorithms to predict peptides that bind to the class II molecules, forming the complex recognized by CD4 T cells. Despite recent progress in this area, there is a paucity of data regarding their success in identifying actual pathogen-derived epitopes. In this study, we sought to rigorously evaluate the performance of multiple web-available algorithms by comparing their predictions and our results using purely empirical methods for epitope discovery in influenza that utilized overlapping peptides and cytokine Elispots, for three independent class II molecules. We analyzed the data in different ways, trying to anticipate how an investigator might use these computational tools for epitope discovery. We come to the conclusion that currently available algorithms can indeed facilitate epitope discovery, but all shared a high degree of false positive and false negative predictions. Therefore, efficiencies were low. We also found dramatic disparities among algorithms and between predicted IC50 values and true dissociation rates of peptide:MHC class II complexes. We suggest that improved success of predictive algorithms will depend less on changes in computational methods or increased data sets and more on changes in parameters used to “train” the algorithms that factor in elements of T cell repertoire and peptide acquisition by class II molecules. PMID:22467652

SANDPUMA: ensemble predictions of nonribosomal peptide chemistry reveal biosynthetic diversity across Actinobacteria.

PubMed

Chevrette, Marc G; Aicheler, Fabian; Kohlbacher, Oliver; Currie, Cameron R; Medema, Marnix H

2017-10-15

Nonribosomally synthesized peptides (NRPs) are natural products with widespread applications in medicine and biotechnology. Many algorithms have been developed to predict the substrate specificities of nonribosomal peptide synthetase adenylation (A) domains from DNA sequences, which enables prioritization and dereplication, and integration with other data types in discovery efforts. However, insufficient training data and a lack of clarity regarding prediction quality have impeded optimal use. Here, we introduce prediCAT, a new phylogenetics-inspired algorithm, which quantitatively estimates the degree of predictability of each A-domain. We then systematically benchmarked all algorithms on a newly gathered, independent test set of 434 A-domain sequences, showing that active-site-motif-based algorithms outperform whole-domain-based methods. Subsequently, we developed SANDPUMA, a powerful ensemble algorithm, based on newly trained versions of all high-performing algorithms, which significantly outperforms individual methods. Finally, we deployed SANDPUMA in a systematic investigation of 7635 Actinobacteria genomes, suggesting that NRP chemical diversity is much higher than previously estimated. SANDPUMA has been integrated into the widely used antiSMASH biosynthetic gene cluster analysis pipeline and is also available as an open-source, standalone tool. SANDPUMA is freely available at https://bitbucket.org/chevrm/sandpuma and as a docker image at https://hub.docker.com/r/chevrm/sandpuma/ under the GNU Public License 3 (GPL3). chevrette@wisc.edu or marnix.medema@wur.nl. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Multi-species Identification of Polymorphic Peptide Variants via Propagation in Spectral Networks

DOE Office of Scientific and Technical Information (OSTI.GOV)

Na, Seungjin; Payne, Samuel H.; Bandeira, Nuno

The spectral networks approach enables the detection of pairs of spectra from related peptides and thus allows for the propagation of annotations from identified peptides to unidentified spectra. Beyond allowing for unbiased discovery of unexpected post-translational modifications, spectral networks are also applicable to multi-species comparative proteomics or metaproteomics to identify numerous orthologous versions of a protein. We present algorithmic and statistical advances in spectral networks that have made it possible to rigorously assess the statistical significance of spectral pairs and accurately estimate the error rate of identifications via propagation. In the analysis of three related Cyanothece species, a model organismmore » for biohydrogen production, spectral networks identified peptides with highly divergent sequences with up to dozens of variants per peptide, including many novel peptides in species that lack a sequenced genome. Furthermore, spectral networks strongly suggested the presence of novel peptides even in genomically characterized species (i.e. missing from databases) in that a significant portion of unidentified multi-species networks included at least two polymorphic peptide variants.« less
Attractors in Sequence Space: Agent-Based Exploration of MHC I Binding Peptides.

PubMed

Jäger, Natalie; Wisniewska, Joanna M; Hiss, Jan A; Freier, Anja; Losch, Florian O; Walden, Peter; Wrede, Paul; Schneider, Gisbert

2010-01-12

Ant Colony Optimization (ACO) is a meta-heuristic that utilizes a computational analogue of ant trail pheromones to solve combinatorial optimization problems. The size of the ant colony and the representation of the ants' pheromone trails is unique referring to the given optimization problem. In the present study, we employed ACO to generate novel peptides that stabilize MHC I protein on the plasma membrane of a murine lymphoma cell line. A jury of feedforward neural network classifiers served as fitness function for peptide design by ACO. Bioactive murine MHC I H-2K(b) stabilizing as well as nonstabilizing octapeptides were designed, synthesized and tested. These peptides reveal residue motifs that are relevant for MHC I receptor binding. We demonstrate how the performance of the implemented ACO algorithm depends on the colony size and the size of the search space. The actual peptide design process by ACO constitutes a search path in sequence space that can be visualized as trajectories on a self-organizing map (SOM). By projecting the sequence space on a SOM we visualize the convergence of the different solutions that emerge during the optimization process in sequence space. The SOM representation reveals attractors in sequence space for MHC I binding peptides. The combination of ACO and SOM enables systematic peptide optimization. This technique allows for the rational design of various types of bioactive peptides with minimal experimental effort. Here, we demonstrate its successful application to the design of MHC-I binding and nonbinding peptides which exhibit substantial bioactivity in a cell-based assay. Copyright © 2010 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
The application of new software tools to quantitative protein profiling via isotope-coded affinity tag (ICAT) and tandem mass spectrometry: I. Statistically annotated datasets for peptide sequences and proteins identified via the application of ICAT and tandem mass spectrometry to proteins copurifying with T cell lipid rafts.

PubMed

von Haller, Priska D; Yi, Eugene; Donohoe, Samuel; Vaughn, Kelly; Keller, Andrew; Nesvizhskii, Alexey I; Eng, Jimmy; Li, Xiao-jun; Goodlett, David R; Aebersold, Ruedi; Watts, Julian D

2003-07-01

Lipid rafts were prepared according to standard protocols from Jurkat T cells stimulated via T cell receptor/CD28 cross-linking and from control (unstimulated) cells. Co-isolating proteins from the control and stimulated cell preparations were labeled with isotopically normal (d0) and heavy (d8) versions of the same isotope-coded affinity tag (ICAT) reagent, respectively. Samples were combined, proteolyzed, and resultant peptides fractionated via cation exchange chromatography. Cysteine-containing (ICAT-labeled) peptides were recovered via the biotin tag component of the ICAT reagents by avidin-affinity chromatography. On-line micro-capillary liquid chromatography tandem mass spectrometry was performed on both avidin-affinity (ICAT-labeled) and flow-through (unlabeled) fractions. Initial peptide sequence identification was by searching recorded tandem mass spectrometry spectra against a human sequence data base using SEQUEST software. New statistical data modeling algorithms were then applied to the SEQUEST search results. These allowed for discrimination between likely "correct" and "incorrect" peptide assignments, and from these the inferred proteins that they collectively represented, by calculating estimated probabilities that each peptide assignment and subsequent protein identification was a member of the "correct" population. For convenience, the resultant lists of peptide sequences assigned and the proteins to which they corresponded were filtered at an arbitrarily set cut-off of 0.5 (i.e. 50% likely to be "correct") and above and compiled into two separate datasets. In total, these data sets contained 7667 individual peptide identifications, which represented 2669 unique peptide sequences, corresponding to 685 proteins and related protein groups.
Database searching and accounting of multiplexed precursor and product ion spectra from the data independent analysis of simple and complex peptide mixtures.

PubMed

Li, Guo-Zhong; Vissers, Johannes P C; Silva, Jeffrey C; Golick, Dan; Gorenstein, Marc V; Geromanos, Scott J

2009-03-01

A novel database search algorithm is presented for the qualitative identification of proteins over a wide dynamic range, both in simple and complex biological samples. The algorithm has been designed for the analysis of data originating from data independent acquisitions, whereby multiple precursor ions are fragmented simultaneously. Measurements used by the algorithm include retention time, ion intensities, charge state, and accurate masses on both precursor and product ions from LC-MS data. The search algorithm uses an iterative process whereby each iteration incrementally increases the selectivity, specificity, and sensitivity of the overall strategy. Increased specificity is obtained by utilizing a subset database search approach, whereby for each subsequent stage of the search, only those peptides from securely identified proteins are queried. Tentative peptide and protein identifications are ranked and scored by their relative correlation to a number of models of known and empirically derived physicochemical attributes of proteins and peptides. In addition, the algorithm utilizes decoy database techniques for automatically determining the false positive identification rates. The search algorithm has been tested by comparing the search results from a four-protein mixture, the same four-protein mixture spiked into a complex biological background, and a variety of other "system" type protein digest mixtures. The method was validated independently by data dependent methods, while concurrently relying on replication and selectivity. Comparisons were also performed with other commercially and publicly available peptide fragmentation search algorithms. The presented results demonstrate the ability to correctly identify peptides and proteins from data independent acquisition strategies with high sensitivity and specificity. They also illustrate a more comprehensive analysis of the samples studied; providing approximately 20% more protein identifications, compared to a more conventional data directed approach using the same identification criteria, with a concurrent increase in both sequence coverage and the number of modified peptides.
An improved stochastic fractal search algorithm for 3D protein structure prediction.

PubMed

Zhou, Changjun; Sun, Chuan; Wang, Bin; Wang, Xiaojun

2018-05-03

Protein structure prediction (PSP) is a significant area for biological information research, disease treatment, and drug development and so on. In this paper, three-dimensional structures of proteins are predicted based on the known amino acid sequences, and the structure prediction problem is transformed into a typical NP problem by an AB off-lattice model. This work applies a novel improved Stochastic Fractal Search algorithm (ISFS) to solve the problem. The Stochastic Fractal Search algorithm (SFS) is an effective evolutionary algorithm that performs well in exploring the search space but falls into local minimums sometimes. In order to avoid the weakness, Lvy flight and internal feedback information are introduced in ISFS. In the experimental process, simulations are conducted by ISFS algorithm on Fibonacci sequences and real peptide sequences. Experimental results prove that the ISFS performs more efficiently and robust in terms of finding the global minimum and avoiding getting stuck in local minimums.
Shotgun Protein Sequencing with Meta-contig Assembly*

PubMed Central

Guthals, Adrian; Clauser, Karl R.; Bandeira, Nuno

2012-01-01

Full-length de novo sequencing from tandem mass (MS/MS) spectra of unknown proteins such as antibodies or proteins from organisms with unsequenced genomes remains a challenging open problem. Conventional algorithms designed to individually sequence each MS/MS spectrum are limited by incomplete peptide fragmentation or low signal to noise ratios and tend to result in short de novo sequences at low sequencing accuracy. Our shotgun protein sequencing (SPS) approach was developed to ameliorate these limitations by first finding groups of unidentified spectra from the same peptides (contigs) and then deriving a consensus de novo sequence for each assembled set of spectra (contig sequences). But whereas SPS enables much more accurate reconstruction of de novo sequences longer than can be recovered from individual MS/MS spectra, it still requires error-tolerant matching to homologous proteins to group smaller contig sequences into full-length protein sequences, thus limiting its effectiveness on sequences from poorly annotated proteins. Using low and high resolution CID and high resolution HCD MS/MS spectra, we address this limitation with a Meta-SPS algorithm designed to overlap and further assemble SPS contigs into Meta-SPS de novo contig sequences extending as long as 100 amino acids at over 97% accuracy without requiring any knowledge of homologous protein sequences. We demonstrate Meta-SPS using distinct MS/MS data sets obtained with separate enzymatic digestions and discuss how the remaining de novo sequencing limitations relate to MS/MS acquisition settings. PMID:22798278
Shotgun protein sequencing with meta-contig assembly.

PubMed

Guthals, Adrian; Clauser, Karl R; Bandeira, Nuno

2012-10-01

Full-length de novo sequencing from tandem mass (MS/MS) spectra of unknown proteins such as antibodies or proteins from organisms with unsequenced genomes remains a challenging open problem. Conventional algorithms designed to individually sequence each MS/MS spectrum are limited by incomplete peptide fragmentation or low signal to noise ratios and tend to result in short de novo sequences at low sequencing accuracy. Our shotgun protein sequencing (SPS) approach was developed to ameliorate these limitations by first finding groups of unidentified spectra from the same peptides (contigs) and then deriving a consensus de novo sequence for each assembled set of spectra (contig sequences). But whereas SPS enables much more accurate reconstruction of de novo sequences longer than can be recovered from individual MS/MS spectra, it still requires error-tolerant matching to homologous proteins to group smaller contig sequences into full-length protein sequences, thus limiting its effectiveness on sequences from poorly annotated proteins. Using low and high resolution CID and high resolution HCD MS/MS spectra, we address this limitation with a Meta-SPS algorithm designed to overlap and further assemble SPS contigs into Meta-SPS de novo contig sequences extending as long as 100 amino acids at over 97% accuracy without requiring any knowledge of homologous protein sequences. We demonstrate Meta-SPS using distinct MS/MS data sets obtained with separate enzymatic digestions and discuss how the remaining de novo sequencing limitations relate to MS/MS acquisition settings.
The Paragon Algorithm, a next generation search engine that uses sequence temperature values and feature probabilities to identify peptides from tandem mass spectra.

PubMed

Shilov, Ignat V; Seymour, Sean L; Patel, Alpesh A; Loboda, Alex; Tang, Wilfred H; Keating, Sean P; Hunter, Christie L; Nuwaysir, Lydia M; Schaeffer, Daniel A

2007-09-01

The Paragon Algorithm, a novel database search engine for the identification of peptides from tandem mass spectrometry data, is presented. Sequence Temperature Values are computed using a sequence tag algorithm, allowing the degree of implication by an MS/MS spectrum of each region of a database to be determined on a continuum. Counter to conventional approaches, features such as modifications, substitutions, and cleavage events are modeled with probabilities rather than by discrete user-controlled settings to consider or not consider a feature. The use of feature probabilities in conjunction with Sequence Temperature Values allows for a very large increase in the effective search space with only a very small increase in the actual number of hypotheses that must be scored. The algorithm has a new kind of user interface that removes the user expertise requirement, presenting control settings in the language of the laboratory that are translated to optimal algorithmic settings. To validate this new algorithm, a comparison with Mascot is presented for a series of analogous searches to explore the relative impact of increasing search space probed with Mascot by relaxing the tryptic digestion conformance requirements from trypsin to semitrypsin to no enzyme and with the Paragon Algorithm using its Rapid mode and Thorough mode with and without tryptic specificity. Although they performed similarly for small search space, dramatic differences were observed in large search space. With the Paragon Algorithm, hundreds of biological and artifact modifications, all possible substitutions, and all levels of conformance to the expected digestion pattern can be searched in a single search step, yet the typical cost in search time is only 2-5 times that of conventional small search space. Despite this large increase in effective search space, there is no drastic loss of discrimination that typically accompanies the exploration of large search space.
PEPlife: A Repository of the Half-life of Peptides

NASA Astrophysics Data System (ADS)

Mathur, Deepika; Prakash, Satya; Anand, Priya; Kaur, Harpreet; Agrawal, Piyush; Mehta, Ayesha; Kumar, Rajesh; Singh, Sandeep; Raghava, Gajendra P. S.

2016-11-01

Short half-life is one of the key challenges in the field of therapeutic peptides. Various studies have reported enhancement in the stability of peptides using methods like chemical modifications, D-amino acid substitution, cyclization, replacement of labile aminos acids, etc. In order to study this scattered data, there is a pressing need for a repository dedicated to the half-life of peptides. To fill this lacuna, we have developed PEPlife (http://crdd.osdd.net/raghava/peplife), a manually curated resource of experimentally determined half-life of peptides. PEPlife contains 2229 entries covering 1193 unique peptides. Each entry provides detailed information of the peptide, like its name, sequence, half-life, modifications, the experimental assay for determining half-life, biological nature and activity of the peptide. We also maintain SMILES and structures of peptides. We have incorporated web-based modules to offer user-friendly data searching and browsing in the database. PEPlife integrates numerous tools to perform various types of analysis such as BLAST, Smith-Waterman algorithm, GGSEARCH, Jalview and MUSTANG. PEPlife would augment the understanding of different factors that affect the half-life of peptides like modifications, sequence, length, route of delivery of the peptide, etc. We anticipate that PEPlife will be useful for the researchers working in the area of peptide-based therapeutics.
Establishment of HLA-DR4 Transgenic Mice for the Identification of CD4+ T Cell Epitopes of Tumor-Associated Antigens

PubMed Central

Harada, Kumiko; Michibata, Yayoi; Tsukamoto, Hirotake; Senju, Satoru; Tomita, Yusuke; Yuno, Akira; Hirayama, Masatoshi; Abu Sayem, Mohammad; Takeda, Naoki; Shibuya, Isao; Sogo, Shinji; Fujiki, Fumihiro; Sugiyama, Haruo; Eto, Masatoshi; Nishimura, Yasuharu

2013-01-01

Reports have shown that activation of tumor-specific CD4+ helper T (Th) cells is crucial for effective anti-tumor immunity and identification of Th-cell epitopes is critical for peptide vaccine-based cancer immunotherapy. Although computer algorithms are available to predict peptides with high binding affinity to a specific HLA class II molecule, the ability of those peptides to induce Th-cell responses must be evaluated. We have established HLA-DR4 (HLA-DRA*01:01/HLA-DRB1*04:05) transgenic mice (Tgm), since this HLA-DR allele is most frequent (13.6%) in Japanese population, to evaluate HLA-DR4-restricted Th-cell responses to tumor-associated antigen (TAA)-derived peptides predicted to bind to HLA-DR4. To avoid weak binding between mouse CD4 and HLA-DR4, Tgm were designed to express chimeric HLA-DR4/I-Ed, where I-Ed α1 and β1 domains were replaced with those from HLA-DR4. Th cells isolated from Tgm immunized with adjuvant and HLA-DR4-binding cytomegalovirus-derived peptide proliferated when stimulated with peptide-pulsed HLA-DR4-transduced mouse L cells, indicating chimeric HLA-DR4/I-Ed has equivalent antigen presenting capacity to HLA-DR4. Immunization with CDCA155-78 peptide, a computer algorithm-predicted HLA-DR4-binding peptide derived from TAA CDCA1, successfully induced Th-cell responses in Tgm, while immunization of HLA-DR4-binding Wilms' tumor 1 antigen-derived peptide with identical amino acid sequence to mouse ortholog failed. This was overcome by using peptide-pulsed syngeneic bone marrow-derived dendritic cells (BM-DC) followed by immunization with peptide/CFA booster. BM-DC-based immunization of KIF20A494-517 peptide from another TAA KIF20A, with an almost identical HLA-binding core amino acid sequence to mouse ortholog, successfully induced Th-cell responses in Tgm. Notably, both CDCA155-78 and KIF20A494-517 peptides induced human Th-cell responses in PBMCs from HLA-DR4-positive donors. Finally, an HLA-DR4 binding DEPDC1191-213 peptide from a new TAA DEPDC1 overexpressed in bladder cancer induced strong Th-cell responses both in Tgm and in PBMCs from an HLA-DR4-positive donor. Thus, the HLA-DR4 Tgm combined with computer algorithm was useful for preliminary screening of candidate peptides for vaccination. PMID:24386437
iFeature: a python package and web server for features extraction and selection from protein and peptide sequences.

PubMed

Chen, Zhen; Zhao, Pei; Li, Fuyi; Leier, André; Marquez-Lago, Tatiana T; Wang, Yanan; Webb, Geoffrey I; Smith, A Ian; Daly, Roger J; Chou, Kuo-Chen; Song, Jiangning

2018-03-08

Structural and physiochemical descriptors extracted from sequence data have been widely used to represent sequences and predict structural, functional, expression and interaction profiles of proteins and peptides as well as DNAs/RNAs. Here, we present iFeature, a versatile Python-based toolkit for generating various numerical feature representation schemes for both protein and peptide sequences. iFeature is capable of calculating and extracting a comprehensive spectrum of 18 major sequence encoding schemes that encompass 53 different types of feature descriptors. It also allows users to extract specific amino acid properties from the AAindex database. Furthermore, iFeature integrates 12 different types of commonly used feature clustering, selection, and dimensionality reduction algorithms, greatly facilitating training, analysis, and benchmarking of machine-learning models. The functionality of iFeature is made freely available via an online web server and a stand-alone toolkit. http://iFeature.erc.monash.edu/; https://github.com/Superzchen/iFeature/. jiangning.song@monash.edu; kcchou@gordonlifescience.org; roger.daly@monash.edu. Supplementary data are available at Bioinformatics online.
Brute-Force Approach for Mass Spectrometry-Based Variant Peptide Identification in Proteogenomics without Personalized Genomic Data

NASA Astrophysics Data System (ADS)

Ivanov, Mark V.; Lobas, Anna A.; Levitsky, Lev I.; Moshkovskii, Sergei A.; Gorshkov, Mikhail V.

2018-02-01

In a proteogenomic approach based on tandem mass spectrometry analysis of proteolytic peptide mixtures, customized exome or RNA-seq databases are employed for identifying protein sequence variants. However, the problem of variant peptide identification without personalized genomic data is important for a variety of applications. Following the recent proposal by Chick et al. (Nat. Biotechnol. 33, 743-749, 2015) on the feasibility of such variant peptide search, we evaluated two available approaches based on the previously suggested "open" search and the "brute-force" strategy. To improve the efficiency of these approaches, we propose an algorithm for exclusion of false variant identifications from the search results involving analysis of modifications mimicking single amino acid substitutions. Also, we propose a de novo based scoring scheme for assessment of identified point mutations. In the scheme, the search engine analyzes y-type fragment ions in MS/MS spectra to confirm the location of the mutation in the variant peptide sequence.
Scrutinizing MHC-I binding peptides and their limits of variation.

PubMed

Koch, Christian P; Perna, Anna M; Pillong, Max; Todoroff, Nickolay K; Wrede, Paul; Folkers, Gerd; Hiss, Jan A; Schneider, Gisbert

2013-01-01

Designed peptides that bind to major histocompatibility protein I (MHC-I) allomorphs bear the promise of representing epitopes that stimulate a desired immune response. A rigorous bioinformatical exploration of sequence patterns hidden in peptides that bind to the mouse MHC-I allomorph H-2K(b) is presented. We exemplify and validate these motif findings by systematically dissecting the epitope SIINFEKL and analyzing the resulting fragments for their binding potential to H-2K(b) in a thermal denaturation assay. The results demonstrate that only fragments exclusively retaining the carboxy- or amino-terminus of the reference peptide exhibit significant binding potential, with the N-terminal pentapeptide SIINF as shortest ligand. This study demonstrates that sophisticated machine-learning algorithms excel at extracting fine-grained patterns from peptide sequence data and predicting MHC-I binding peptides, thereby considerably extending existing linear prediction models and providing a fresh view on the computer-based molecular design of future synthetic vaccines. The server for prediction is available at http://modlab-cadd.ethz.ch (SLiDER tool, MHC-I version 2012).
Science of Decision Making: A Data-Modeling Approach

DTIC Science & Technology

2013-10-01

were separated on a capillary column using the Dionex UltiMate 3000 (Sunnyvale, CA). The resolved peptides were then sprayed into a linear ion trap...database (3–5). These algorithms assign a peptide sequence, along with a matching score of the experimental ion product mass spectrum, to a theoretical ion ...Bacterial Sample Processing Samples were prepared for liquid chromatography (LC) tandem MS (LC– MS/MS) in a similar manner to that previously reported
Mass spectrometry-based protein identification by integrating de novo sequencing with database searching.

PubMed

Wang, Penghao; Wilson, Susan R

2013-01-01

Mass spectrometry-based protein identification is a very challenging task. The main identification approaches include de novo sequencing and database searching. Both approaches have shortcomings, so an integrative approach has been developed. The integrative approach firstly infers partial peptide sequences, known as tags, directly from tandem spectra through de novo sequencing, and then puts these sequences into a database search to see if a close peptide match can be found. However the current implementation of this integrative approach has several limitations. Firstly, simplistic de novo sequencing is applied and only very short sequence tags are used. Secondly, most integrative methods apply an algorithm similar to BLAST to search for exact sequence matches and do not accommodate sequence errors well. Thirdly, by applying these methods the integrated de novo sequencing makes a limited contribution to the scoring model which is still largely based on database searching. We have developed a new integrative protein identification method which can integrate de novo sequencing more efficiently into database searching. Evaluated on large real datasets, our method outperforms popular identification methods.
[ProteoСat: a tool for planning of proteomic experiments].

PubMed

Skvortsov, V S; Alekseychuk, N N; Khudyakov, D V; Mikurova, A V; Rybina, A V; Novikova, S E; Tikhonova, O V

2015-01-01

ProteoCat is a computer program has been designed to help researchers in the planning of large-scale proteomic experiments. The central part of this program is the subprogram of hydrolysis simulation that supports 4 proteases (trypsin, lysine C, endoproteinases AspN and GluC). For the peptides obtained after virtual hydrolysis or loaded from data file a number of properties important in mass-spectrometric experiments can be calculated or predicted. The data can be analyzed or filtered to reduce a set of peptides. The program is using new and improved modification of our methods developed to predict pI and probability of peptide detection; pI can also be predicted for a number of popular pKa's scales, proposed by other investigators. The algorithm for prediction of peptide retention time was realized similar to the algorithm used in the program SSRCalc. ProteoCat can estimate the coverage of amino acid sequences of proteins under defined limitation on peptides detection, as well as the possibility of assembly of peptide fragments with user-defined size of "sticky" ends. The program has a graphical user interface, written on JAVA and available at http://www.ibmc.msk.ru/LPCIT/ProteoCat.
Understanding and predicting binding between human leukocyte antigens (HLAs) and peptides by network analysis.

PubMed

Luo, Heng; Ye, Hao; Ng, Hui; Shi, Leming; Tong, Weida; Mattes, William; Mendrick, Donna; Hong, Huixiao

2015-01-01

As the major histocompatibility complex (MHC), human leukocyte antigens (HLAs) are one of the most polymorphic genes in humans. Patients carrying certain HLA alleles may develop adverse drug reactions (ADRs) after taking specific drugs. Peptides play an important role in HLA related ADRs as they are the necessary co-binders of HLAs with drugs. Many experimental data have been generated for understanding HLA-peptide binding. However, efficiently utilizing the data for understanding and accurately predicting HLA-peptide binding is challenging. Therefore, we developed a network analysis based method to understand and predict HLA-peptide binding. Qualitative Class I HLA-peptide binding data were harvested and prepared from four major databases. An HLA-peptide binding network was constructed from this dataset and modules were identified by the fast greedy modularity optimization algorithm. To examine the significance of signals in the yielded models, the modularity was compared with the modularity values generated from 1,000 random networks. The peptides and HLAs in the modules were characterized by similarity analysis. The neighbor-edges based and unbiased leverage algorithm (Nebula) was developed for predicting HLA-peptide binding. Leave-one-out (LOO) validations and two-fold cross-validations were conducted to evaluate the performance of Nebula using the constructed HLA-peptide binding network. Nine modules were identified from analyzing the HLA-peptide binding network with a highest modularity compared to all the random networks. Peptide length and functional side chains of amino acids at certain positions of the peptides were different among the modules. HLA sequences were module dependent to some extent. Nebula archived an overall prediction accuracy of 0.816 in the LOO validations and average accuracy of 0.795 in the two-fold cross-validations and outperformed the method reported in the literature. Network analysis is a useful approach for analyzing large and sparse datasets such as the HLA-peptide binding dataset. The modules identified from the network analysis clustered peptides and HLAs with similar sequences and properties of amino acids. Nebula performed well in the predictions of HLA-peptide binding. We demonstrated that network analysis coupled with Nebula is an efficient approach to understand and predict HLA-peptide binding interactions and thus, could further our understanding of ADRs.
Understanding and predicting binding between human leukocyte antigens (HLAs) and peptides by network analysis

PubMed Central

2015-01-01

Background As the major histocompatibility complex (MHC), human leukocyte antigens (HLAs) are one of the most polymorphic genes in humans. Patients carrying certain HLA alleles may develop adverse drug reactions (ADRs) after taking specific drugs. Peptides play an important role in HLA related ADRs as they are the necessary co-binders of HLAs with drugs. Many experimental data have been generated for understanding HLA-peptide binding. However, efficiently utilizing the data for understanding and accurately predicting HLA-peptide binding is challenging. Therefore, we developed a network analysis based method to understand and predict HLA-peptide binding. Methods Qualitative Class I HLA-peptide binding data were harvested and prepared from four major databases. An HLA-peptide binding network was constructed from this dataset and modules were identified by the fast greedy modularity optimization algorithm. To examine the significance of signals in the yielded models, the modularity was compared with the modularity values generated from 1,000 random networks. The peptides and HLAs in the modules were characterized by similarity analysis. The neighbor-edges based and unbiased leverage algorithm (Nebula) was developed for predicting HLA-peptide binding. Leave-one-out (LOO) validations and two-fold cross-validations were conducted to evaluate the performance of Nebula using the constructed HLA-peptide binding network. Results Nine modules were identified from analyzing the HLA-peptide binding network with a highest modularity compared to all the random networks. Peptide length and functional side chains of amino acids at certain positions of the peptides were different among the modules. HLA sequences were module dependent to some extent. Nebula archived an overall prediction accuracy of 0.816 in the LOO validations and average accuracy of 0.795 in the two-fold cross-validations and outperformed the method reported in the literature. Conclusions Network analysis is a useful approach for analyzing large and sparse datasets such as the HLA-peptide binding dataset. The modules identified from the network analysis clustered peptides and HLAs with similar sequences and properties of amino acids. Nebula performed well in the predictions of HLA-peptide binding. We demonstrated that network analysis coupled with Nebula is an efficient approach to understand and predict HLA-peptide binding interactions and thus, could further our understanding of ADRs. PMID:26424483
STEPS: a grid search methodology for optimized peptide identification filtering of MS/MS database search results.

PubMed

Piehowski, Paul D; Petyuk, Vladislav A; Sandoval, John D; Burnum, Kristin E; Kiebel, Gary R; Monroe, Matthew E; Anderson, Gordon A; Camp, David G; Smith, Richard D

2013-03-01

For bottom-up proteomics, there are wide variety of database-searching algorithms in use for matching peptide sequences to tandem MS spectra. Likewise, there are numerous strategies being employed to produce a confident list of peptide identifications from the different search algorithm outputs. Here we introduce a grid-search approach for determining optimal database filtering criteria in shotgun proteomics data analyses that is easily adaptable to any search. Systematic Trial and Error Parameter Selection--referred to as STEPS--utilizes user-defined parameter ranges to test a wide array of parameter combinations to arrive at an optimal "parameter set" for data filtering, thus maximizing confident identifications. The benefits of this approach in terms of numbers of true-positive identifications are demonstrated using datasets derived from immunoaffinity-depleted blood serum and a bacterial cell lysate, two common proteomics sample types. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Fast tandem mass spectra-based protein identification regardless of the number of spectra or potential modifications examined.

PubMed

Falkner, Jayson; Andrews, Philip

2005-05-15

Comparing tandem mass spectra (MSMS) against a known dataset of protein sequences is a common method for identifying unknown proteins; however, the processing of MSMS by current software often limits certain applications, including comprehensive coverage of post-translational modifications, non-specific searches and real-time searches to allow result-dependent instrument control. This problem deserves attention as new mass spectrometers provide the ability for higher throughput and as known protein datasets rapidly grow in size. New software algorithms need to be devised in order to address the performance issues of conventional MSMS protein dataset-based protein identification. This paper describes a novel algorithm based on converting a collection of monoisotopic, centroided spectra to a new data structure, named 'peptide finite state machine' (PFSM), which may be used to rapidly search a known dataset of protein sequences, regardless of the number of spectra searched or the number of potential modifications examined. The algorithm is verified using a set of commercially available tryptic digest protein standards analyzed using an ABI 4700 MALDI TOFTOF mass spectrometer, and a free, open source PFSM implementation. It is illustrated that a PFSM can accurately search large collections of spectra against large datasets of protein sequences (e.g. NCBI nr) using a regular desktop PC; however, this paper only details the method for identifying peptide and subsequently protein candidates from a dataset of known protein sequences. The concept of using a PFSM as a peptide pre-screening technique for MSMS-based search engines is validated by using PFSM with Mascot and XTandem. Complete source code, documentation and examples for the reference PFSM implementation are freely available at the Proteome Commons, http://www.proteomecommons.org and source code may be used both commercially and non-commercially as long as the original authors are credited for their work.
Order within disorder: Aggrecan chondroitin sulphate-attachment region provides new structural insights into protein sequences classified as disordered

PubMed Central

Jowitt, Thomas A; Murdoch, Alan D; Baldock, Clair; Berry, Richard; Day, Joanna M; Hardingham, Timothy E

2010-01-01

Structural investigation of proteins containing large stretches of sequences without predicted secondary structure is the focus of much increased attention. Here, we have produced an unglycosylated 30 kDa peptide from the chondroitin sulphate (CS)-attachment region of human aggrecan (CS-peptide), which was predicted to be intrinsically disordered and compared its structure with the adjacent aggrecan G3 domain. Biophysical analyses, including analytical ultracentrifugation, light scattering, and circular dichroism showed that the CS-peptide had an elongated and stiffened conformation in contrast to the globular G3 domain. The results suggested that it contained significant secondary structure, which was sensitive to urea, and we propose that the CS-peptide forms an elongated wormlike molecule based on a dynamic range of energetically equivalent secondary structures stabilized by hydrogen bonds. The dimensions of the structure predicted from small-angle X-ray scattering analysis were compatible with EM images of fully glycosylated aggrecan and a partly glycosylated aggrecan CS2-G3 construct. The semiordered structure identified in CS-peptide was not predicted by common structural algorithms and identified a potentially distinct class of semiordered structure within sequences currently identified as disordered. Sequence comparisons suggested some evidence for comparable structures in proteins encoded by other genes (PRG4, MUC5B, and CBP). The function of these semiordered sequences may serve to spatially position attached folded modules and/or to present polypeptides for modification, such as glycosylation, and to provide templates for the multiple pleiotropic interactions proposed for disordered proteins. Proteins 2010. © 2010 Wiley-Liss, Inc. PMID:20806220
A Consensus Method for the Prediction of ‘Aggregation-Prone’ Peptides in Globular Proteins

PubMed Central

Tsolis, Antonios C.; Papandreou, Nikos C.; Iconomidou, Vassiliki A.; Hamodrakas, Stavros J.

2013-01-01

The purpose of this work was to construct a consensus prediction algorithm of ‘aggregation-prone’ peptides in globular proteins, combining existing tools. This allows comparison of the different algorithms and the production of more objective and accurate results. Eleven (11) individual methods are combined and produce AMYLPRED2, a publicly, freely available web tool to academic users (http://biophysics.biol.uoa.gr/AMYLPRED2), for the consensus prediction of amyloidogenic determinants/‘aggregation-prone’ peptides in proteins, from sequence alone. The performance of AMYLPRED2 indicates that it functions better than individual aggregation-prediction algorithms, as perhaps expected. AMYLPRED2 is a useful tool for identifying amyloid-forming regions in proteins that are associated with several conformational diseases, called amyloidoses, such as Altzheimer's, Parkinson's, prion diseases and type II diabetes. It may also be useful for understanding the properties of protein folding and misfolding and for helping to the control of protein aggregation/solubility in biotechnology (recombinant proteins forming bacterial inclusion bodies) and biotherapeutics (monoclonal antibodies and biopharmaceutical proteins). PMID:23326595
Designing Antibacterial Peptides with Enhanced Killing Kinetics

PubMed Central

Waghu, Faiza H.; Joseph, Shaini; Ghawali, Sanket; Martis, Elvis A.; Madan, Taruna; Venkatesh, Kareenhalli V.; Idicula-Thomas, Susan

2018-01-01

Antimicrobial peptides (AMPs) are gaining attention as substitutes for antibiotics in order to combat the risk posed by multi-drug resistant pathogens. Several research groups are engaged in design of potent anti-infective agents using natural AMPs as templates. In this study, a library of peptides with high sequence similarity to Myeloid Antimicrobial Peptide (MAP) family were screened using popular online prediction algorithms. These peptide variants were designed in a manner to retain the conserved residues within the MAP family. The prediction algorithms were found to effectively classify peptides based on their antimicrobial nature. In order to improve the activity of the identified peptides, molecular dynamics (MD) simulations, using bilayer and micellar systems could be used to design and predict effect of residue substitution on membranes of microbial and mammalian cells. The inference from MD simulation studies well corroborated with the wet-lab observations indicating that MD-guided rational design could lead to discovery of potent AMPs. The effect of the residue substitution on membrane activity was studied in greater detail using killing kinetic analysis. Killing kinetics studies on Gram-positive, negative and human erythrocytes indicated that a single residue change has a drastic effect on the potency of AMPs. An interesting outcome was a switch from monophasic to biphasic death rate constant of Staphylococcus aureus due to a single residue mutation in the peptide. PMID:29527201
NNAlign: a platform to construct and evaluate artificial neural network models of receptor-ligand interactions.

PubMed

Nielsen, Morten; Andreatta, Massimo

2017-07-03

Peptides are extensively used to characterize functional or (linear) structural aspects of receptor-ligand interactions in biological systems, e.g. SH2, SH3, PDZ peptide-recognition domains, the MHC membrane receptors and enzymes such as kinases and phosphatases. NNAlign is a method for the identification of such linear motifs in biological sequences. The algorithm aligns the amino acid or nucleotide sequences provided as training set, and generates a model of the sequence motif detected in the data. The webserver allows setting up cross-validation experiments to estimate the performance of the model, as well as evaluations on independent data. Many features of the training sequences can be encoded as input, and the network architecture is highly customizable. The results returned by the server include a graphical representation of the motif identified by the method, performance values and a downloadable model that can be applied to scan protein sequences for occurrence of the motif. While its performance for the characterization of peptide-MHC interactions is widely documented, we extended NNAlign to be applicable to other receptor-ligand systems as well. Version 2.0 supports alignments with insertions and deletions, encoding of receptor pseudo-sequences, and custom alphabets for the training sequences. The server is available at http://www.cbs.dtu.dk/services/NNAlign-2.0. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Crescendo: A Protein Sequence Database Search Engine for Tandem Mass Spectra.

PubMed

Wang, Jianqi; Zhang, Yajie; Yu, Yonghao

2015-07-01

A search engine that discovers more peptides reliably is essential to the progress of the computational proteomics. We propose two new scoring functions (L- and P-scores), which aim to capture similar characteristics of a peptide-spectrum match (PSM) as Sequest and Comet do. Crescendo, introduced here, is a software program that implements these two scores for peptide identification. We applied Crescendo to test datasets and compared its performance with widely used search engines, including Mascot, Sequest, and Comet. The results indicate that Crescendo identifies a similar or larger number of peptides at various predefined false discovery rates (FDR). Importantly, it also provides a better separation between the true and decoy PSMs, warranting the future development of a companion post-processing filtering algorithm.
Analysis and Prediction of Myristoylation Sites Using the mRMR Method, the IFS Method and an Extreme Learning Machine Algorithm.

PubMed

Wang, ShaoPeng; Zhang, Yu-Hang; Huang, GuoHua; Chen, Lei; Cai, Yu-Dong

2017-01-01

Myristoylation is an important hydrophobic post-translational modification that is covalently bound to the amino group of Gly residues on the N-terminus of proteins. The many diverse functions of myristoylation on proteins, such as membrane targeting, signal pathway regulation and apoptosis, are largely due to the lipid modification, whereas abnormal or irregular myristoylation on proteins can lead to several pathological changes in the cell. To better understand the function of myristoylated sites and to correctly identify them in protein sequences, this study conducted a novel computational investigation on identifying myristoylation sites in protein sequences. A training dataset with 196 positive and 84 negative peptide segments were obtained. Four types of features derived from the peptide segments following the myristoylation sites were used to specify myristoylatedand non-myristoylated sites. Then, feature selection methods including maximum relevance and minimum redundancy (mRMR), incremental feature selection (IFS), and a machine learning algorithm (extreme learning machine method) were adopted to extract optimal features for the algorithm to identify myristoylation sites in protein sequences, thereby building an optimal prediction model. As a result, 41 key features were extracted and used to build an optimal prediction model. The effectiveness of the optimal prediction model was further validated by its performance on a test dataset. Furthermore, detailed analyses were also performed on the extracted 41 features to gain insight into the mechanism of myristoylation modification. This study provided a new computational method for identifying myristoylation sites in protein sequences. We believe that it can be a useful tool to predict myristoylation sites from protein sequences. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Maximizing the sensitivity and reliability of peptide identification in large-scale proteomic experiments by harnessing multiple search engines.

PubMed

Yu, Wen; Taylor, J Alex; Davis, Michael T; Bonilla, Leo E; Lee, Kimberly A; Auger, Paul L; Farnsworth, Chris C; Welcher, Andrew A; Patterson, Scott D

2010-03-01

Despite recent advances in qualitative proteomics, the automatic identification of peptides with optimal sensitivity and accuracy remains a difficult goal. To address this deficiency, a novel algorithm, Multiple Search Engines, Normalization and Consensus is described. The method employs six search engines and a re-scoring engine to search MS/MS spectra against protein and decoy sequences. After the peptide hits from each engine are normalized to error rates estimated from the decoy hits, peptide assignments are then deduced using a minimum consensus model. These assignments are produced in a series of progressively relaxed false-discovery rates, thus enabling a comprehensive interpretation of the data set. Additionally, the estimated false-discovery rate was found to have good concordance with the observed false-positive rate calculated from known identities. Benchmarking against standard proteins data sets (ISBv1, sPRG2006) and their published analysis, demonstrated that the Multiple Search Engines, Normalization and Consensus algorithm consistently achieved significantly higher sensitivity in peptide identifications, which led to increased or more robust protein identifications in all data sets compared with prior methods. The sensitivity and the false-positive rate of peptide identification exhibit an inverse-proportional and linear relationship with the number of participating search engines.
Basophile: Accurate Fragment Charge State Prediction Improves Peptide Identification Rates

DOE PAGES

Wang, Dong; Dasari, Surendra; Chambers, Matthew C.; ...

2013-03-07

In shotgun proteomics, database search algorithms rely on fragmentation models to predict fragment ions that should be observed for a given peptide sequence. The most widely used strategy (Naive model) is oversimplified, cleaving all peptide bonds with equal probability to produce fragments of all charges below that of the precursor ion. More accurate models, based on fragmentation simulation, are too computationally intensive for on-the-fly use in database search algorithms. We have created an ordinal-regression-based model called Basophile that takes fragment size and basic residue distribution into account when determining the charge retention during CID/higher-energy collision induced dissociation (HCD) of chargedmore » peptides. This model improves the accuracy of predictions by reducing the number of unnecessary fragments that are routinely predicted for highly-charged precursors. Basophile increased the identification rates by 26% (on average) over the Naive model, when analyzing triply-charged precursors from ion trap data. Basophile achieves simplicity and speed by solving the prediction problem with an ordinal regression equation, which can be incorporated into any database search software for shotgun proteomic identification.« less
Expression mapping using a retroviral vector for CD8+ T cell epitopes: definition of a Mycobacterium tuberculosis peptide presented by H2-Dd.

PubMed

Aoshi, Taiki; Suzuki, Mina; Uchijima, Masato; Nagata, Toshi; Koide, Yukio

2005-03-01

Identification of CD8+ T cell epitopes is important because detection of specific CD8+ T cells after infection or immunization requires prior knowledge of epitope specificity. Furthermore, identification of CD8+ T cell epitopes permits the development of specific preventive and therapeutic approaches to both infections and tumors. Thus far, CD8+ T cell epitopes have been identified either using an overlapping peptide library covering an entire protein, or using algorithms designed to identify likely peptides that bind to major histocompatibility complex (MHC) class I molecules. The synthesis of overlapping peptides can be prohibitively expensive, and the algorithm programs used to predict CD8+ T cell epitopes are not always accurate. Here we describe a retroviral expression system that specifically allows longer polypeptides and shorter peptides to be expressed in the cytoplasm, and thereby to be processed onto class I MHC molecules. T cells from mice that were immunized with a DNA vaccine encoding MPT-51 were probed against MHC-compatible cell lines retrovirally transduced with overlapping gene fragments encoding 120-140 amino acids of the MPT-51 molecule. After further testing of shorter peptide sequences, we identified a CD8+ T cell epitope using cell lines expressing a relatively small number of algorithm-predicted candidate epitopes. We found that one of the requirements for cell surface display of the 20-mer peptide was the need for cotranslational ubiquitination. The restriction molecule was identified as Dd following transduction with MHC class I genes followed by transduction with the oligonucleotide encoding the epitope. The retroviral expression system described here is cost-effective, particularly if the target molecule is large, and could be adapted to identifying T cell epitopes recognized in infectious disease and against tumor cell antigens.
Computational and Experimental Validation of B and T-Cell Epitopes of the In Vivo Immune Response to a Novel Malarial Antigen

DTIC Science & Technology

2013-08-16

approach in the context of a novel, immunologically relevant antigen. The limited accuracy of the tested algorithms to predict the in vivo immune responses...overlapping peptides spanning the entire sequence are individually tested for antibody interacting residues. Conformational B cell epitopes, in contrast...a blind assessment of this approach in the context of a novel, immunologically relevant antigen. The limited accuracy of the tested algorithms to
Amino acid signature enables proteins to recognize modified tRNA.

PubMed

Spears, Jessica L; Xiao, Xingqing; Hall, Carol K; Agris, Paul F

2014-02-25

Human tRNA(Lys3)UUU is the primer for HIV replication. The HIV-1 nucleocapsid protein, NCp7, facilitates htRNA(Lys3)UUU recruitment from the host cell by binding to and remodeling the tRNA structure. Human tRNA(Lys3)UUU is post-transcriptionally modified, but until recently, the importance of those modifications in tRNA recognition by NCp7 was unknown. Modifications such as the 5-methoxycarbonylmethyl-2-thiouridine at anticodon wobble position-34 and 2-methylthio-N(6)-threonylcarbamoyladenosine, adjacent to the anticodon at position-37, are important to the recognition of htRNA(Lys3)UUU by NCp7. Several short peptides selected from phage display libraries were found to also preferentially recognize these modifications. Evolutionary algorithms (Monte Carlo and self-consistent mean field) and assisted model building with energy refinement were used to optimize the peptide sequence in silico, while fluorescence assays were developed and conducted to verify the in silico results and elucidate a 15-amino acid signature sequence (R-W-Q/N-H-X2-F-Pho-X-G/A-W-R-X2-G, where X can be most amino acids, and Pho is hydrophobic) that recognized the tRNA's fully modified anticodon stem and loop domain, hASL(Lys3)UUU. Peptides of this sequence specifically recognized and bound modified htRNA(Lys3)UUU with an affinity 10-fold higher than that of the starting sequence. Thus, this approach provides an effective means of predicting sequences of RNA binding peptides that have better binding properties. Such peptides can be used in cell and molecular biology as well as biochemistry to explore RNA binding proteins and to inhibit those protein functions.
PepLine: a software pipeline for high-throughput direct mapping of tandem mass spectrometry data on genomic sequences.

PubMed

Ferro, Myriam; Tardif, Marianne; Reguer, Erwan; Cahuzac, Romain; Bruley, Christophe; Vermat, Thierry; Nugues, Estelle; Vigouroux, Marielle; Vandenbrouck, Yves; Garin, Jérôme; Viari, Alain

2008-05-01

PepLine is a fully automated software which maps MS/MS fragmentation spectra of trypsic peptides to genomic DNA sequences. The approach is based on Peptide Sequence Tags (PSTs) obtained from partial interpretation of QTOF MS/MS spectra (first module). PSTs are then mapped on the six-frame translations of genomic sequences (second module) giving hits. Hits are then clustered to detect potential coding regions (third module). Our work aimed at optimizing the algorithms of each component to allow the whole pipeline to proceed in a fully automated manner using raw nucleic acid sequences (i.e., genomes that have not been "reduced" to a database of ORFs or putative exons sequences). The whole pipeline was tested on controlled MS/MS spectra sets from standard proteins and from Arabidopsis thaliana envelope chloroplast samples. Our results demonstrate that PepLine competed with protein database searching softwares and was fast enough to potentially tackle large data sets and/or high size genomes. We also illustrate the potential of this approach for the detection of the intron/exon structure of genes.
Expanding the cerebrospinal fluid endopeptidome.

PubMed

Hansson, Karl T; Skillbäck, Tobias; Pernevik, Elin; Kern, Silke; Portelius, Erik; Höglund, Kina; Brinkmalm, Gunnar; Holmén-Larsson, Jessica; Blennow, Kaj; Zetterberg, Henrik; Gobom, Johan

2017-03-01

Biomarkers of neurodegenerative disorders are needed to assist in diagnosis, to monitor disease progression and therapeutic interventions, and to provide insight into disease mechanisms. One route to identify such biomarkers is by proteomic and peptidomic analysis of cerebrospinal fluid (CSF). In the current study, we performed an in-depth analysis of the human CSF endopeptidome to establish an inventory that may serve as a basis for future targeted biomarker studies. High-pH RP HPLC was employed for off-line sample prefractionation followed by low-pH nano-LC-MS analysis. Different software programs and scoring algorithms for peptide identification were employed and compared. A total of 18 031 endogenous peptides were identified at a FDR of 1%, increasing the number of known endogenous CSF peptides 10-fold compared to previous studies. The peptides were derived from 2 053 proteins of which more than 60 have been linked to neurodegeneration. Notably, among the findings were six peptides derived from microtubule-associated protein tau, three of which span the diagnostically interesting threonine-181 (Tau-F isoform). Also, 213 peptides from amyloid precursor protein were identified, 58 of which were partially or completely within the sequence of amyloid β 1-40/42, as well as 109 peptides from apolipoprotein E, spanning sequences that discriminate between the E2/E3/E4 isoforms of the protein. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
DeNovoGUI: An Open Source Graphical User Interface for de Novo Sequencing of Tandem Mass Spectra

PubMed Central

2013-01-01

De novo sequencing is a popular technique in proteomics for identifying peptides from tandem mass spectra without having to rely on a protein sequence database. Despite the strong potential of de novo sequencing algorithms, their adoption threshold remains quite high. We here present a user-friendly and lightweight graphical user interface called DeNovoGUI for running parallelized versions of the freely available de novo sequencing software PepNovo+, greatly simplifying the use of de novo sequencing in proteomics. Our platform-independent software is freely available under the permissible Apache2 open source license. Source code, binaries, and additional documentation are available at http://denovogui.googlecode.com. PMID:24295440
DeNovoGUI: an open source graphical user interface for de novo sequencing of tandem mass spectra.

PubMed

Muth, Thilo; Weilnböck, Lisa; Rapp, Erdmann; Huber, Christian G; Martens, Lennart; Vaudel, Marc; Barsnes, Harald

2014-02-07

De novo sequencing is a popular technique in proteomics for identifying peptides from tandem mass spectra without having to rely on a protein sequence database. Despite the strong potential of de novo sequencing algorithms, their adoption threshold remains quite high. We here present a user-friendly and lightweight graphical user interface called DeNovoGUI for running parallelized versions of the freely available de novo sequencing software PepNovo+, greatly simplifying the use of de novo sequencing in proteomics. Our platform-independent software is freely available under the permissible Apache2 open source license. Source code, binaries, and additional documentation are available at http://denovogui.googlecode.com .
Use of a Designed Peptide Array To Infer Dissociation Trends for Nontryptic Peptides in Quadrupole Ion Trap and Quadrupole Time-of-Flight Mass Spectrometry

DOE PAGES

Gaucher, Sara P.; Morrow, Jeffrey A.; Faulon, Jean-Loup M.

2007-09-14

Observed peptide gas-phase fragmentation patterns are a complex function of many variables. In order to systematically probe this phenomenon, an array of 40 peptides was synthesized for study. The array of sequences was designed to hold certain variables (peptide length) constant and randomize or balance others (peptide amino acid distribution and position). A high-quality tandem mass spectrometry (MS/MS) data set was acquired for each peptide for all observed charge states on multiple MS instruments, quadrupole-time-of-flight and quadrupole ion trap. The data were analyzed as a function of total charge state and number of mobile protons. Previously known dissociation trends weremore » observed, validating our approach. In addition, the general influence of basic amino acids on dissociation could be determined because, in contrast to the more widely studied tryptic peptides, the amino acids H, K, and R were positionally distributed. Interestingly, our results suggest that cleavage at all basic amino acids is suppressed when a mobile proton is available. Cleavage at H becomes favored only under conditions where a partially mobile proton is present, a caveat to the previously reported trend of enhanced cleavage at H. In conclusion, all acquired data were used as a benchmark to determine how well these sequences would have been identified in a database search using a common algorithm, Mascot.« less
A Graph-Centric Approach for Metagenome-Guided Peptide and Protein Identification in Metaproteomics

PubMed Central

Tang, Haixu; Li, Sujun; Ye, Yuzhen

2016-01-01

Metaproteomic studies adopt the common bottom-up proteomics approach to investigate the protein composition and the dynamics of protein expression in microbial communities. When matched metagenomic and/or metatranscriptomic data of the microbial communities are available, metaproteomic data analyses often employ a metagenome-guided approach, in which complete or fragmental protein-coding genes are first directly predicted from metagenomic (and/or metatranscriptomic) sequences or from their assemblies, and the resulting protein sequences are then used as the reference database for peptide/protein identification from MS/MS spectra. This approach is often limited because protein coding genes predicted from metagenomes are incomplete and fragmental. In this paper, we present a graph-centric approach to improving metagenome-guided peptide and protein identification in metaproteomics. Our method exploits the de Bruijn graph structure reported by metagenome assembly algorithms to generate a comprehensive database of protein sequences encoded in the community. We tested our method using several public metaproteomic datasets with matched metagenomic and metatranscriptomic sequencing data acquired from complex microbial communities in a biological wastewater treatment plant. The results showed that many more peptides and proteins can be identified when assembly graphs were utilized, improving the characterization of the proteins expressed in the microbial communities. The additional proteins we identified contribute to the characterization of important pathways such as those involved in degradation of chemical hazards. Our tools are released as open-source software on github at https://github.com/COL-IU/Graph2Pro. PMID:27918579
Tempest: Accelerated MS/MS database search software for heterogeneous computing platforms

PubMed Central

Adamo, Mark E.; Gerber, Scott A.

2017-01-01

MS/MS database search algorithms derive a set of candidate peptide sequences from in-silico digest of a protein sequence database, and compute theoretical fragmentation patterns to match these candidates against observed MS/MS spectra. The original Tempest publication described these operations mapped to a CPU-GPU model, in which the CPU generates peptide candidates that are asynchronously sent to a discrete GPU to be scored against experimental spectra in parallel (Milloy et al., 2012). The current version of Tempest expands this model, incorporating OpenCL to offer seamless parallelization across multicore CPUs, GPUs, integrated graphics chips, and general-purpose coprocessors. Three protocols describe how to configure and run a Tempest search, including discussion of how to leverage Tempest's unique feature set to produce optimal results. PMID:27603022
Targeted Feature Detection for Data-Dependent Shotgun Proteomics

PubMed Central

2017-01-01

Label-free quantification of shotgun LC–MS/MS data is the prevailing approach in quantitative proteomics but remains computationally nontrivial. The central data analysis step is the detection of peptide-specific signal patterns, called features. Peptide quantification is facilitated by associating signal intensities in features with peptide sequences derived from MS2 spectra; however, missing values due to imperfect feature detection are a common problem. A feature detection approach that directly targets identified peptides (minimizing missing values) but also offers robustness against false-positive features (by assigning meaningful confidence scores) would thus be highly desirable. We developed a new feature detection algorithm within the OpenMS software framework, leveraging ideas and algorithms from the OpenSWATH toolset for DIA/SRM data analysis. Our software, FeatureFinderIdentification (“FFId”), implements a targeted approach to feature detection based on information from identified peptides. This information is encoded in an MS1 assay library, based on which ion chromatogram extraction and detection of feature candidates are carried out. Significantly, when analyzing data from experiments comprising multiple samples, our approach distinguishes between “internal” and “external” (inferred) peptide identifications (IDs) for each sample. On the basis of internal IDs, two sets of positive (true) and negative (decoy) feature candidates are defined. A support vector machine (SVM) classifier is then trained to discriminate between the sets and is subsequently applied to the “uncertain” feature candidates from external IDs, facilitating selection and confidence scoring of the best feature candidate for each peptide. This approach also enables our algorithm to estimate the false discovery rate (FDR) of the feature selection step. We validated FFId based on a public benchmark data set, comprising a yeast cell lysate spiked with protein standards that provide a known ground-truth. The algorithm reached almost complete (>99%) quantification coverage for the full set of peptides identified at 1% FDR (PSM level). Compared with other software solutions for label-free quantification, this is an outstanding result, which was achieved at competitive quantification accuracy and reproducibility across replicates. The FDR for the feature selection was estimated at a low 1.5% on average per sample (3% for features inferred from external peptide IDs). The FFId software is open-source and freely available as part of OpenMS (www.openms.org). PMID:28673088

Targeted Feature Detection for Data-Dependent Shotgun Proteomics.

PubMed

Weisser, Hendrik; Choudhary, Jyoti S

2017-08-04

Label-free quantification of shotgun LC-MS/MS data is the prevailing approach in quantitative proteomics but remains computationally nontrivial. The central data analysis step is the detection of peptide-specific signal patterns, called features. Peptide quantification is facilitated by associating signal intensities in features with peptide sequences derived from MS2 spectra; however, missing values due to imperfect feature detection are a common problem. A feature detection approach that directly targets identified peptides (minimizing missing values) but also offers robustness against false-positive features (by assigning meaningful confidence scores) would thus be highly desirable. We developed a new feature detection algorithm within the OpenMS software framework, leveraging ideas and algorithms from the OpenSWATH toolset for DIA/SRM data analysis. Our software, FeatureFinderIdentification ("FFId"), implements a targeted approach to feature detection based on information from identified peptides. This information is encoded in an MS1 assay library, based on which ion chromatogram extraction and detection of feature candidates are carried out. Significantly, when analyzing data from experiments comprising multiple samples, our approach distinguishes between "internal" and "external" (inferred) peptide identifications (IDs) for each sample. On the basis of internal IDs, two sets of positive (true) and negative (decoy) feature candidates are defined. A support vector machine (SVM) classifier is then trained to discriminate between the sets and is subsequently applied to the "uncertain" feature candidates from external IDs, facilitating selection and confidence scoring of the best feature candidate for each peptide. This approach also enables our algorithm to estimate the false discovery rate (FDR) of the feature selection step. We validated FFId based on a public benchmark data set, comprising a yeast cell lysate spiked with protein standards that provide a known ground-truth. The algorithm reached almost complete (>99%) quantification coverage for the full set of peptides identified at 1% FDR (PSM level). Compared with other software solutions for label-free quantification, this is an outstanding result, which was achieved at competitive quantification accuracy and reproducibility across replicates. The FDR for the feature selection was estimated at a low 1.5% on average per sample (3% for features inferred from external peptide IDs). The FFId software is open-source and freely available as part of OpenMS ( www.openms.org ).
BlockLogo: visualization of peptide and sequence motif conservation

PubMed Central

Olsen, Lars Rønn; Kudahl, Ulrich Johan; Simon, Christian; Sun, Jing; Schönbach, Christian; Reinherz, Ellis L.; Zhang, Guang Lan; Brusic, Vladimir

2013-01-01

BlockLogo is a web-server application for visualization of protein and nucleotide fragments, continuous protein sequence motifs, and discontinuous sequence motifs using calculation of block entropy from multiple sequence alignments. The user input consists of a multiple sequence alignment, selection of motif positions, type of sequence, and output format definition. The output has BlockLogo along with the sequence logo, and a table of motif frequencies. We deployed BlockLogo as an online application and have demonstrated its utility through examples that show visualization of T-cell epitopes and B-cell epitopes (both continuous and discontinuous). Our additional example shows a visualization and analysis of structural motifs that determine specificity of peptide binding to HLA-DR molecules. The BlockLogo server also employs selected experimentally validated prediction algorithms to enable on-the-fly prediction of MHC binding affinity to 15 common HLA class I and class II alleles as well as visual analysis of discontinuous epitopes from multiple sequence alignments. It enables the visualization and analysis of structural and functional motifs that are usually described as regular expressions. It provides a compact view of discontinuous motifs composed of distant positions within biological sequences. BlockLogo is available at: http://research4.dfci.harvard.edu/cvc/blocklogo/ and http://methilab.bu.edu/blocklogo/ PMID:24001880
An Approach for Peptide Identification by De Novo Sequencing of Mixture Spectra.

PubMed

Liu, Yi; Ma, Bin; Zhang, Kaizhong; Lajoie, Gilles

2017-01-01

Mixture spectra occur quite frequently in a typical wet-lab mass spectrometry experiment, which result from the concurrent fragmentation of multiple precursors. The ability to efficiently and confidently identify mixture spectra is essential to alleviate the existent bottleneck of low mass spectra identification rate. However, most of the traditional computational methods are not suitable for interpreting mixture spectra, because they still take the assumption that the acquired spectra come from the fragmentation of a single precursor. In this manuscript, we formulate the mixture spectra de novo sequencing problem mathematically, and propose a dynamic programming algorithm for the problem. Additionally, we use both simulated and real mixture spectra data sets to verify the merits of the proposed algorithm.
Peptidomic strategy for purification and identification of potential ACE-inhibitory and antioxidant peptides in Tetradesmus obliquus microalgae.

PubMed

Montone, Carmela Maria; Capriotti, Anna Laura; Cavaliere, Chiara; La Barbera, Giorgia; Piovesana, Susy; Zenezini Chiozzi, Riccardo; Laganà, Aldo

2018-06-01

Microalgae are unicellular marine organisms that have promoted complex biochemical pathways to survive in greatly competitive marine environments. They could contain significant amounts of high-quality proteins which, because of their structural diversity, contain a range of yet undiscovered novel bioactive peptides. In this work, a peptidomic platform was developed for the separation and identification of bioactive peptides in protein hydrolysates. In this work, a peptidomic platform was developed for the extraction, separation, and identification of bioactive peptides in protein hydrolysates. Indeed, extraction of proteins from recalcitrant tissues is still a challenge due to their strong cell walls and high levels of non-protein interfering compounds. Therefore, seven different protein extraction protocols, based on mechanical and chemical methods, were tested in order to produce high-quality protein extracts. Proteins obtained by means of the best protocol, consisting of milling the recalcitrant tissue with glass beads, were subjected to enzymatic digestion with Alcalase® and subsequently the hydrolysate was purified by two-dimensional semi-preparative reversed phase liquid chromatography. Fractions were assayed for antioxidant and antihypertensive activities and only the most active ones were finally analyzed by RP nanoHPLC-MS/MS. Around 500 peptide sequences were identified in these fractions. The identified peptides were subjected to an in silico analysis by PeptideRanker algorithm in order to assign a score of bioactivity probability. Twenty-five sequenced peptides were found with potential antioxidant and angiotensin-converting-enzyme-inhibitory activities. Four of these peptides, WPRGYFL, GPDRPKFLGPF, WYGPDRPKFL, SDWDRF, were selected for synthesis and in vitro tested for specific bioactivity, exhibiting good values of antioxidant and ACE-inhibitory activity. Graphical abstract Workflow showing the entire peptidomic approach developed for identification of bioactive peptides in microalgae.
A Mass Spectrometry-Based Predictive Strategy Reveals ADAP1 is Phosphorylated at Tyrosine 364

DOE Office of Scientific and Technical Information (OSTI.GOV)

Littrell, BobbiJo R

The goal of this work was to identify phosphorylation sites within the amino acid sequence of human ADAP1. Using traditional mass spectrometry-based techniques we were unable to produce interpretable spectra demonstrating modification by phosphorylation. This prompted us to employ a strategy in which phosphorylated peptides were first predicted using peptide mapping followed by targeted MS/MS acquisition. ADAP1 was immunoprecipitated from extracts of HEK293 cells stably-transfected with ADAP1 cDNA. Immunoprecipitated ADAP1 was digested with proteolytic enzymes and analyzed by LC-MS in MS1 mode by high-resolution quadrupole time-of-flight mass spectrometry (QTOF-MS). Peptide molecular features were extracted using an untargeted data mining algorithm.more » Extracted peptide neutral masses were matched against the ADAP1 amino acid sequence with phosphorylation included as a predicted modification. Peptides with predicted phosphorylation sites were analyzed by targeted LC-MS2. Acquired MS2 spectra were then analyzed using database search engines to confirm phosphorylation. Spectra of phosphorylated peptides were validated by manual interpretation. Further confirmation was performed by manipulating phospho-peptide abundance using calf intestinal phosphatase (CIP) and the phorbol ester, phorbol 12-myristate 13-acetate (PMA). Of five predicted phosphopeptides, one, comprised of the sequence AVDRPMLPQEYAVEAHFK, was confirmed to be phosphorylated on a Tyrosine at position 364. Pre-treatment of cells with PMA prior to immunoprecipitation increased the ratio of phosphorylated to unphosphorylated peptide as determined by area counts of extracted ion chromatograms (EIC). Addition of CIP to immunoprecipitation reactions eliminated the phosphorylated form. A novel phosphorylation site was identified at Tyrosine 364. Phosphorylation at this site is increased by treatment with PMA. PMA promotes membrane translocation and activation of protein kinase C (PKC), indicating that Tyrosine 364 is phosphorylated by a PKC-dependent mechanism.« less
Elucidation of cross-species proteomic effects in human and hominin bone proteome identification through a bioinformatics experiment.

PubMed

Welker, F

2018-02-20

The study of ancient protein sequences is increasingly focused on the analysis of older samples, including those of ancient hominins. The analysis of such ancient proteomes thereby potentially suffers from "cross-species proteomic effects": the loss of peptide and protein identifications at increased evolutionary distances due to a larger number of protein sequence differences between the database sequence and the analyzed organism. Error-tolerant proteomic search algorithms should theoretically overcome this problem at both the peptide and protein level; however, this has not been demonstrated. If error-tolerant searches do not overcome the cross-species proteomic issue then there might be inherent biases in the identified proteomes. Here, a bioinformatics experiment is performed to test this using a set of modern human bone proteomes and three independent searches against sequence databases at increasing evolutionary distances: the human (0 Ma), chimpanzee (6-8 Ma) and orangutan (16-17 Ma) reference proteomes, respectively. Incorrectly suggested amino acid substitutions are absent when employing adequate filtering criteria for mutable Peptide Spectrum Matches (PSMs), but roughly half of the mutable PSMs were not recovered. As a result, peptide and protein identification rates are higher in error-tolerant mode compared to non-error-tolerant searches but did not recover protein identifications completely. Data indicates that peptide length and the number of mutations between the target and database sequences are the main factors influencing mutable PSM identification. The error-tolerant results suggest that the cross-species proteomics problem is not overcome at increasing evolutionary distances, even at the protein level. Peptide and protein loss has the potential to significantly impact divergence dating and proteome comparisons when using ancient samples as there is a bias towards the identification of conserved sequences and proteins. Effects are minimized between moderately divergent proteomes, as indicated by almost complete recovery of informative positions in the search against the chimpanzee proteome (≈90%, 6-8 Ma). This provides a bioinformatic background to future phylogenetic and proteomic analysis of ancient hominin proteomes, including the future description of novel hominin amino acid sequences, but also has negative implications for the study of fast-evolving proteins in hominins, non-hominin animals, and ancient bacterial proteins in evolutionary contexts.
Cooperativity among Short Amyloid Stretches in Long Amyloidogenic Sequences

PubMed Central

He, Zhisong; Shi, Xiaohe; Feng, Kaiyan; Ma, Buyong; Cai, Yu-Dong

2012-01-01

Amyloid fibrillar aggregates of polypeptides are associated with many neurodegenerative diseases. Short peptide segments in protein sequences may trigger aggregation. Identifying these stretches and examining their behavior in longer protein segments is critical for understanding these diseases and obtaining potential therapies. In this study, we combined machine learning and structure-based energy evaluation to examine and predict amyloidogenic segments. Our feature selection method discovered that windows consisting of long amino acid segments of ∼30 residues, instead of the commonly used short hexapeptides, provided the highest accuracy. Weighted contributions of an amino acid at each position in a 27 residue window revealed three cooperative regions of short stretch, resemble the β-strand-turn-β-strand motif in A-βpeptide amyloid and β-solenoid structure of HET-s(218–289) prion (C). Using an in-house energy evaluation algorithm, the interaction energy between two short stretches in long segment is computed and incorporated as an additional feature. The algorithm successfully predicted and classified amyloid segments with an overall accuracy of 75%. Our study revealed that genome-wide amyloid segments are not only dependent on short high propensity stretches, but also on nearby residues. PMID:22761773
Tandem mass spectrometry of human tryptic blood peptides calculated by a statistical algorithm and captured by a relational database with exploration by a general statistical analysis system.

PubMed

Bowden, Peter; Beavis, Ron; Marshall, John

2009-11-02

A goodness of fit test may be used to assign tandem mass spectra of peptides to amino acid sequences and to directly calculate the expected probability of mis-identification. The product of the peptide expectation values directly yields the probability that the parent protein has been mis-identified. A relational database could capture the mass spectral data, the best fit results, and permit subsequent calculations by a general statistical analysis system. The many files of the Hupo blood protein data correlated by X!TANDEM against the proteins of ENSEMBL were collected into a relational database. A redundant set of 247,077 proteins and peptides were correlated by X!TANDEM, and that was collapsed to a set of 34,956 peptides from 13,379 distinct proteins. About 6875 distinct proteins were only represented by a single distinct peptide, 2866 proteins showed 2 distinct peptides, and 3454 proteins showed at least three distinct peptides by X!TANDEM. More than 99% of the peptides were associated with proteins that had cumulative expectation values, i.e. probability of false positive identification, of one in one hundred or less. The distribution of peptides per protein from X!TANDEM was significantly different than those expected from random assignment of peptides.
Rapid motif compliance scoring with match weight sets.

PubMed

Venezia, D; O'Hara, P J

1993-02-01

Most current implementations of motif matching in biological sequences have sacrificed the generality of weight matrix scoring for shorter runtimes. The program MOTIF incorporates a weight matrix and a rapid, backtracking tree-search algorithm to score motif compliance with greatly enhanced performance while placing no constraints on the motif. In addition, any positions within a motif can be marked as 'inviolate', thereby requiring an exact match. MOTIF allows a choice of regular expression formats and can use both motif and sequence libraries as either targets or queries. Nucleic acid sequences can optionally be translated by MOTIF in any frame(s) and used against peptide motifs.
Optimal de novo design of MRM experiments for rapid assay development in targeted proteomics.

PubMed

Bertsch, Andreas; Jung, Stephan; Zerck, Alexandra; Pfeifer, Nico; Nahnsen, Sven; Henneges, Carsten; Nordheim, Alfred; Kohlbacher, Oliver

2010-05-07

Targeted proteomic approaches such as multiple reaction monitoring (MRM) overcome problems associated with classical shotgun mass spectrometry experiments. Developing MRM quantitation assays can be time consuming, because relevant peptide representatives of the proteins must be found and their retention time and the product ions must be determined. Given the transitions, hundreds to thousands of them can be scheduled into one experiment run. However, it is difficult to select which of the transitions should be included into a measurement. We present a novel algorithm that allows the construction of MRM assays from the sequence of the targeted proteins alone. This enables the rapid development of targeted MRM experiments without large libraries of transitions or peptide spectra. The approach relies on combinatorial optimization in combination with machine learning techniques to predict proteotypicity, retention time, and fragmentation of peptides. The resulting potential transitions are scheduled optimally by solving an integer linear program. We demonstrate that fully automated construction of MRM experiments from protein sequences alone is possible and over 80% coverage of the targeted proteins can be achieved without further optimization of the assay.
Pepitome: evaluating improved spectral library search for identification complementarity and quality assessment

PubMed Central

Dasari, Surendra; Chambers, Matthew C.; Martinez, Misti A.; Carpenter, Kristin L.; Ham, Amy-Joan L.; Vega-Montoto, Lorenzo J.; Tabb, David L.

2012-01-01

Spectral libraries have emerged as a viable alternative to protein sequence databases for peptide identification. These libraries contain previously detected peptide sequences and their corresponding tandem mass spectra (MS/MS). Search engines can then identify peptides by comparing experimental MS/MS scans to those in the library. Many of these algorithms employ the dot product score for measuring the quality of a spectrum-spectrum match (SSM). This scoring system does not offer a clear statistical interpretation and ignores fragment ion m/z discrepancies in the scoring. We developed a new spectral library search engine, Pepitome, which employs statistical systems for scoring SSMs. Pepitome outperformed the leading library search tool, SpectraST, when analyzing data sets acquired on three different mass spectrometry platforms. We characterized the reliability of spectral library searches by confirming shotgun proteomics identifications through RNA-Seq data. Applying spectral library and database searches on the same sample revealed their complementary nature. Pepitome identifications enabled the automation of quality analysis and quality control (QA/QC) for shotgun proteomics data acquisition pipelines. PMID:22217208
Andromeda: a peptide search engine integrated into the MaxQuant environment.

PubMed

Cox, Jürgen; Neuhauser, Nadin; Michalski, Annette; Scheltema, Richard A; Olsen, Jesper V; Mann, Matthias

2011-04-01

A key step in mass spectrometry (MS)-based proteomics is the identification of peptides in sequence databases by their fragmentation spectra. Here we describe Andromeda, a novel peptide search engine using a probabilistic scoring model. On proteome data, Andromeda performs as well as Mascot, a widely used commercial search engine, as judged by sensitivity and specificity analysis based on target decoy searches. Furthermore, it can handle data with arbitrarily high fragment mass accuracy, is able to assign and score complex patterns of post-translational modifications, such as highly phosphorylated peptides, and accommodates extremely large databases. The algorithms of Andromeda are provided. Andromeda can function independently or as an integrated search engine of the widely used MaxQuant computational proteomics platform and both are freely available at www.maxquant.org. The combination enables analysis of large data sets in a simple analysis workflow on a desktop computer. For searching individual spectra Andromeda is also accessible via a web server. We demonstrate the flexibility of the system by implementing the capability to identify cofragmented peptides, significantly improving the total number of identified peptides.
Tempest: Accelerated MS/MS Database Search Software for Heterogeneous Computing Platforms.

PubMed

Adamo, Mark E; Gerber, Scott A

2016-09-07

MS/MS database search algorithms derive a set of candidate peptide sequences from in silico digest of a protein sequence database, and compute theoretical fragmentation patterns to match these candidates against observed MS/MS spectra. The original Tempest publication described these operations mapped to a CPU-GPU model, in which the CPU (central processing unit) generates peptide candidates that are asynchronously sent to a discrete GPU (graphics processing unit) to be scored against experimental spectra in parallel. The current version of Tempest expands this model, incorporating OpenCL to offer seamless parallelization across multicore CPUs, GPUs, integrated graphics chips, and general-purpose coprocessors. Three protocols describe how to configure and run a Tempest search, including discussion of how to leverage Tempest's unique feature set to produce optimal results. © 2016 by John Wiley & Sons, Inc. Copyright © 2016 John Wiley & Sons, Inc.
iACP-GAEnsC: Evolutionary genetic algorithm based ensemble classification of anticancer peptides by utilizing hybrid feature space.

PubMed

Akbar, Shahid; Hayat, Maqsood; Iqbal, Muhammad; Jan, Mian Ahmad

2017-06-01

Cancer is a fatal disease, responsible for one-quarter of all deaths in developed countries. Traditional anticancer therapies such as, chemotherapy and radiation, are highly expensive, susceptible to errors and ineffective techniques. These conventional techniques induce severe side-effects on human cells. Due to perilous impact of cancer, the development of an accurate and highly efficient intelligent computational model is desirable for identification of anticancer peptides. In this paper, evolutionary intelligent genetic algorithm-based ensemble model, 'iACP-GAEnsC', is proposed for the identification of anticancer peptides. In this model, the protein sequences are formulated, using three different discrete feature representation methods, i.e., amphiphilic Pseudo amino acid composition, g-Gap dipeptide composition, and Reduce amino acid alphabet composition. The performance of the extracted feature spaces are investigated separately and then merged to exhibit the significance of hybridization. In addition, the predicted results of individual classifiers are combined together, using optimized genetic algorithm and simple majority technique in order to enhance the true classification rate. It is observed that genetic algorithm-based ensemble classification outperforms than individual classifiers as well as simple majority voting base ensemble. The performance of genetic algorithm-based ensemble classification is highly reported on hybrid feature space, with an accuracy of 96.45%. In comparison to the existing techniques, 'iACP-GAEnsC' model has achieved remarkable improvement in terms of various performance metrics. Based on the simulation results, it is observed that 'iACP-GAEnsC' model might be a leading tool in the field of drug design and proteomics for researchers. Copyright © 2017 Elsevier B.V. All rights reserved.
Mapping the tumour human leukocyte antigen (HLA) ligandome by mass spectrometry.

PubMed

Freudenmann, Lena Katharina; Marcu, Ana; Stevanović, Stefan

2018-07-01

The entirety of human leukocyte antigen (HLA)-presented peptides is referred to as the HLA ligandome of a cell or tissue, in tumours often termed immunopeptidome. Mapping the tumour immunopeptidome by mass spectrometry (MS) comprehensively views the pathophysiologically relevant antigenic signature of human malignancies. MS is an unbiased approach stringently filtering the candidates to be tested as opposed to epitope prediction algorithms. In the setting of peptide-specific immunotherapies, MS-based strategies significantly diminish the risk of lacking clinical benefit, as they yield highly enriched amounts of truly presented peptides. Early immunopeptidomic efforts were severely limited by technical sensitivity and manual spectra interpretation. The technological progress with development of orbitrap mass analysers and enhanced chromatographic performance led to vast improvements in mass accuracy, sensitivity, resolution, and speed. Concomitantly, bioinformatic tools were developed to process MS data, integrate sequencing results, and deconvolute multi-allelic datasets. This enabled the immense advancement of tumour immunopeptidomics. Studying the HLA-presented peptide repertoire bears high potential for both answering basic scientific questions and translational application. Mapping the tumour HLA ligandome has started to significantly contribute to target identification for the design of peptide-specific cancer immunotherapies in clinical trials and compassionate need treatments. In contrast to prediction algorithms, rare HLA allotypes and HLA class II can be adequately addressed when choosing MS-guided target identification platforms. Herein, we review the identification of tumour HLA ligands focusing on sources, methods, bioinformatic data analysis, translational application, and provide an outlook on future developments. © 2018 John Wiley & Sons Ltd.
Sequencing Cyclic Peptides by Multistage Mass Spectrometry

PubMed Central

Mohimani, Hosein; Yang, Yu-Liang; Liu, Wei-Ting; Hsieh, Pei-Wen; Dorrestein, Pieter C.; Pevzner, Pavel A.

2012-01-01

Some of the most effective antibiotics (e.g., Vancomycin and Daptomycin) are cyclic peptides produced by non-ribosomal biosynthetic pathways. While hundreds of biomedically important cyclic peptides have been sequenced, the computational techniques for sequencing cyclic peptides are still in their infancy. Previous methods for sequencing peptide antibiotics and other cyclic peptides are based on Nuclear Magnetic Resonance spectroscopy, and require large amount (miligrams) of purified materials that, for most compounds, are not possible to obtain. Recently, development of mass spectrometry based methods has provided some hope for accurate sequencing of cyclic peptides using picograms of materials. In this paper we develop a method for sequencing of cyclic peptides by multistage mass spectrometry, and show its advantages over single stage mass spectrometry. The method is tested on known and new cyclic peptides from Bacillus brevis, Dianthus superbus and Streptomyces griseus, as well as a new family of cyclic peptides produced by marine bacteria. PMID:21751357
Broad and Cross-Clade CD4+ T-Cell Responses Elicited by a DNA Vaccine Encoding Highly Conserved and Promiscuous HIV-1 M-Group Consensus Peptides

PubMed Central

Almeida, Rafael Ribeiro; Rosa, Daniela Santoro; Ribeiro, Susan Pereira; Santana, Vinicius Canato; Kallás, Esper Georges; Sidney, John; Sette, Alessandro; Kalil, Jorge; Cunha-Neto, Edecio

2012-01-01

T-cell based vaccine approaches have emerged to counteract HIV-1/AIDS. Broad, polyfunctional and cytotoxic CD4+ T-cell responses have been associated with control of HIV-1 replication, which supports the inclusion of CD4+ T-cell epitopes in vaccines. A successful HIV-1 vaccine should also be designed to overcome viral genetic diversity and be able to confer immunity in a high proportion of immunized individuals from a diverse HLA-bearing population. In this study, we rationally designed a multiepitopic DNA vaccine in order to elicit broad and cross-clade CD4+ T-cell responses against highly conserved and promiscuous peptides from the HIV-1 M-group consensus sequence. We identified 27 conserved, multiple HLA-DR-binding peptides in the HIV-1 M-group consensus sequences of Gag, Pol, Nef, Vif, Vpr, Rev and Vpu using the TEPITOPE algorithm. The peptides bound in vitro to an average of 12 out of the 17 tested HLA-DR molecules and also to several molecules such as HLA-DP, -DQ and murine IAb and IAd. Sixteen out of the 27 peptides were recognized by PBMC from patients infected with different HIV-1 variants and 72% of such patients recognized at least 1 peptide. Immunization with a DNA vaccine (HIVBr27) encoding the identified peptides elicited IFN-γ secretion against 11 out of the 27 peptides in BALB/c mice; CD4+ and CD8+ T-cell proliferation was observed against 8 and 6 peptides, respectively. HIVBr27 immunization elicited cross-clade T-cell responses against several HIV-1 peptide variants. Polyfunctional CD4+ and CD8+ T cells, able to simultaneously proliferate and produce IFN-γ and TNF-α, were also observed. This vaccine concept may cope with HIV-1 genetic diversity as well as provide increased population coverage, which are desirable features for an efficacious strategy against HIV-1/AIDS. PMID:23028895
Insight into the Structure of Amyloid Fibrils from the Analysis of Globular Proteins

PubMed Central

Trovato, Antonio; Chiti, Fabrizio; Maritan, Amos; Seno, Flavio

2006-01-01

The conversion from soluble states into cross-β fibrillar aggregates is a property shared by many different proteins and peptides and was hence conjectured to be a generic feature of polypeptide chains. Increasing evidence is now accumulating that such fibrillar assemblies are generally characterized by a parallel in-register alignment of β-strands contributed by distinct protein molecules. Here we assume a universal mechanism is responsible for β-structure formation and deduce sequence-specific interaction energies between pairs of protein fragments from a statistical analysis of the native folds of globular proteins. The derived fragment–fragment interaction was implemented within a novel algorithm, prediction of amyloid structure aggregation (PASTA), to investigate the role of sequence heterogeneity in driving specific aggregation into ordered self-propagating cross-β structures. The algorithm predicts that the parallel in-register arrangement of sequence portions that participate in the fibril cross-β core is favoured in most cases. However, the antiparallel arrangement is correctly discriminated when present in fibrils formed by short peptides. The predictions of the most aggregation-prone portions of initially unfolded polypeptide chains are also in excellent agreement with available experimental observations. These results corroborate the recent hypothesis that the amyloid structure is stabilised by the same physicochemical determinants as those operating in folded proteins. They also suggest that side chain–side chain interaction across neighbouring β-strands is a key determinant of amyloid fibril formation and of their self-propagating ability. PMID:17173479
DNASynth: a software application to optimization of artificial gene synthesis

NASA Astrophysics Data System (ADS)

Muczyński, Jan; Nowak, Robert M.

2017-08-01

DNASynth is a client-server software application in which the client runs in a web browser. The aim of this program is to support and optimize process of artificial gene synthesizing using Ligase Chain Reaction. Thanks to LCR it is possible to obtain DNA strand coding defined by user peptide. The DNA sequence is calculated by optimization algorithm that consider optimal codon usage, minimal energy of secondary structures and minimal number of required LCR. Additionally absence of sequences characteristic for defined by user set of restriction enzymes is guaranteed. The presented software was tested on synthetic and real data.
Chameleon sequences in neurodegenerative diseases.

PubMed

Bahramali, Golnaz; Goliaei, Bahram; Minuchehr, Zarrin; Salari, Ali

2016-03-25

Chameleon sequences can adopt either alpha helix sheet or a coil conformation. Defining chameleon sequences in PDB (Protein Data Bank) may yield to an insight on defining peptides and proteins responsible in neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on Chameleons, where we developed an algorithm to extract peptide segments with identical sequences, but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data was classified to "helix to strand (HE)", "helix to coil (HC)" and "strand to coil (CE)" alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the most number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe, are the major amino acids in Chameleons. We also found that there are proteins such as Insulin Degrading Enzyme IDE and GTP-binding nuclear protein Ran (RAN) with the most number of chameleons (640 and 405 respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFP's can serve as key proteins in neurodegeneration, and a study on them can shed light on curing and preventing neurodegenerative diseases. Copyright © 2016 Elsevier Inc. All rights reserved.

Chameleon sequences in neurodegenerative diseases

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bahramali, Golnaz; Goliaei, Bahram, E-mail: goliaei@ut.ac.ir; Minuchehr, Zarrin, E-mail: minuchehr@nigeb.ac.ir

2016-03-25

Chameleon sequences can adopt either alpha helix sheet or a coil conformation. Defining chameleon sequences in PDB (Protein Data Bank) may yield to an insight on defining peptides and proteins responsible in neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on Chameleons, where we developed an algorithm to extract peptide segments with identical sequences, but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data was classified to “helix to strand (HE)”, “helix tomore » coil (HC)” and “strand to coil (CE)” alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the most number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe, are the major amino acids in Chameleons. We also found that there are proteins such as Insulin Degrading Enzyme IDE and GTP-binding nuclear protein Ran (RAN) with the most number of chameleons (640 and 405 respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFP's can serve as key proteins in neurodegeneration, and a study on them can shed light on curing and preventing neurodegenerative diseases.« less
CABS-dock web server for the flexible docking of peptides to proteins without prior knowledge of the binding site

PubMed Central

Kurcinski, Mateusz; Jamroz, Michal; Blaszczyk, Maciej; Kolinski, Andrzej; Kmiecik, Sebastian

2015-01-01

Protein–peptide interactions play a key role in cell functions. Their structural characterization, though challenging, is important for the discovery of new drugs. The CABS-dock web server provides an interface for modeling protein–peptide interactions using a highly efficient protocol for the flexible docking of peptides to proteins. While other docking algorithms require pre-defined localization of the binding site, CABS-dock does not require such knowledge. Given a protein receptor structure and a peptide sequence (and starting from random conformations and positions of the peptide), CABS-dock performs simulation search for the binding site allowing for full flexibility of the peptide and small fluctuations of the receptor backbone. This protocol was extensively tested over the largest dataset of non-redundant protein–peptide interactions available to date (including bound and unbound docking cases). For over 80% of bound and unbound dataset cases, we obtained models with high or medium accuracy (sufficient for practical applications). Additionally, as optional features, CABS-dock can exclude user-selected binding modes from docking search or to increase the level of flexibility for chosen receptor fragments. CABS-dock is freely available as a web server at http://biocomp.chem.uw.edu.pl/CABSdock. PMID:25943545
Integrating sampling techniques and inverse virtual screening: toward the discovery of artificial peptide-based receptors for ligands.

PubMed

Pérez, Germán M; Salomón, Luis A; Montero-Cabrera, Luis A; de la Vega, José M García; Mascini, Marcello

2016-05-01

A novel heuristic using an iterative select-and-purge strategy is proposed. It combines statistical techniques for sampling and classification by rigid molecular docking through an inverse virtual screening scheme. This approach aims to the de novo discovery of short peptides that may act as docking receptors for small target molecules when there are no data available about known association complexes between them. The algorithm performs an unbiased stochastic exploration of the sample space, acting as a binary classifier when analyzing the entire peptides population. It uses a novel and effective criterion for weighting the likelihood of a given peptide to form an association complex with a particular ligand molecule based on amino acid sequences. The exploratory analysis relies on chemical information of peptides composition, sequence patterns, and association free energies (docking scores) in order to converge to those peptides forming the association complexes with higher affinities. Statistical estimations support these results providing an association probability by improving predictions accuracy even in cases where only a fraction of all possible combinations are sampled. False positives/false negatives ratio was also improved with this method. A simple rigid-body docking approach together with the proper information about amino acid sequences was used. The methodology was applied in a retrospective docking study to all 8000 possible tripeptide combinations using the 20 natural amino acids, screened against a training set of 77 different ligands with diverse functional groups. Afterward, all tripeptides were screened against a test set of 82 ligands, also containing different functional groups. Results show that our integrated methodology is capable of finding a representative group of the top-scoring tripeptides. The associated probability of identifying the best receptor or a group of the top-ranked receptors is more than double and about 10 times higher, respectively, when compared to classical random sampling methods.
mzStudio: A Dynamic Digital Canvas for User-Driven Interrogation of Mass Spectrometry Data.

PubMed

Ficarro, Scott B; Alexander, William M; Marto, Jarrod A

2017-08-01

Although not yet truly 'comprehensive', modern mass spectrometry-based experiments can generate quantitative data for a meaningful fraction of the human proteome. Importantly for large-scale protein expression analysis, robust data pipelines are in place for identification of un-modified peptide sequences and aggregation of these data to protein-level quantification. However, interoperable software tools that enable scientists to computationally explore and document novel hypotheses for peptide sequence, modification status, or fragmentation behavior are not well-developed. Here, we introduce mzStudio, an open-source Python module built on our multiplierz project. This desktop application provides a highly-interactive graphical user interface (GUI) through which scientists can examine and annotate spectral features, re-search existing PSMs to test different modifications or new spectral matching algorithms, share results with colleagues, integrate other domain-specific software tools, and finally create publication-quality graphics. mzStudio leverages our common application programming interface (mzAPI) for access to native data files from multiple instrument platforms, including ion trap, quadrupole time-of-flight, Orbitrap, matrix-assisted laser desorption ionization, and triple quadrupole mass spectrometers and is compatible with several popular search engines including Mascot, Proteome Discoverer, X!Tandem, and Comet. The mzStudio toolkit enables researchers to create a digital provenance of data analytics and other evidence that support specific peptide sequence assignments.
MHC2NNZ: A novel peptide binding prediction approach for HLA DQ molecules

NASA Astrophysics Data System (ADS)

Xie, Jiang; Zeng, Xu; Lu, Dongfang; Liu, Zhixiang; Wang, Jiao

2017-07-01

The major histocompatibility complex class II (MHC-II) molecule plays a crucial role in immunology. Computational prediction of MHC-II binding peptides can help researchers understand the mechanism of immune systems and design vaccines. Most of the prediction algorithms for MHC-II to date have made large efforts in human leukocyte antigen (HLA, the name of MHC in Human) molecules encoded in the DR locus. However, HLA DQ molecules are equally important and have only been made less progress because it is more difficult to handle them experimentally. In this study, we propose an artificial neural network-based approach called MHC2NNZ to predict peptides binding to HLA DQ molecules. Unlike previous artificial neural network-based methods, MHC2NNZ not only considers sequence similarity features but also captures the chemical and physical properties, and a novel method incorporating these properties is proposed to represent peptide flanking regions (PFR). Furthermore, MHC2NNZ improves the prediction accuracy by combining with amino acid preference at more specific positions of the peptides binding core. By evaluating on 3549 peptides binding to six most frequent HLA DQ molecules, MHC2NNZ is demonstrated to outperform other state-of-the-art MHC-II prediction methods.
Signal-3L: A 3-layer approach for predicting signal peptides.

PubMed

Shen, Hong-Bin; Chou, Kuo-Chen

2007-11-16

Functioning as an "address tag" that directs nascent proteins to their proper cellular and extracellular locations, signal peptides have become a crucial tool in finding new drugs or reprogramming cells for gene therapy. To effectively and timely use such a tool, however, the first important thing is to develop an automated method for rapidly and accurately identifying the signal peptide for a given nascent protein. With the avalanche of new protein sequences generated in the post-genomic era, the challenge has become even more urgent and critical. In this paper, we have developed a novel method for predicting signal peptide sequences and their cleavage sites in human, plant, animal, eukaryotic, Gram-positive, and Gram-negative protein sequences, respectively. The new predictor is called Signal-3L that consists of three prediction engines working, respectively, for the following three progressively deepening layers: (1) identifying a query protein as secretory or non-secretory by an ensemble classifier formed by fusing many individual OET-KNN (optimized evidence-theoretic K nearest neighbor) classifiers operated in various dimensions of PseAA (pseudo amino acid) composition spaces; (2) selecting a set of candidates for the possible signal peptide cleavage sites of a query secretory protein by a subsite-coupled discrimination algorithm; (3) determining the final cleavage site by fusing the global sequence alignment outcome for each of the aforementioned candidates through a voting system. Signal-3L is featured by high success prediction rates with short computational time, and hence is particularly useful for the analysis of large-scale datasets. Signal-3L is freely available as a web-server at http://chou.med.harvard.edu/bioinf/Signal-3L/ or http://202.120.37.186/bioinf/Signal-3L, where, to further support the demand of the related areas, the signal peptides identified by Signal-3L for all the protein entries in Swiss-Prot databank that do not have signal peptide annotations or are annotated with uncertain terms but are classified by Signal-3L as secretory proteins are provided in a downloadable file. The large-scale file is prepared with Microsoft Excel and named "Tab-Signal-3L.xls", and will be updated once a year to include new protein entries and reflect the continuous development of Signal-3L.
Current algorithmic solutions for peptide-based proteomics data generation and identification.

PubMed

Hoopmann, Michael R; Moritz, Robert L

2013-02-01

Peptide-based proteomic data sets are ever increasing in size and complexity. These data sets provide computational challenges when attempting to quickly analyze spectra and obtain correct protein identifications. Database search and de novo algorithms must consider high-resolution MS/MS spectra and alternative fragmentation methods. Protein inference is a tricky problem when analyzing large data sets of degenerate peptide identifications. Combining multiple algorithms for improved peptide identification puts significant strain on computational systems when investigating large data sets. This review highlights some of the recent developments in peptide and protein identification algorithms for analyzing shotgun mass spectrometry data when encountering the aforementioned hurdles. Also explored are the roles that analytical pipelines, public spectral libraries, and cloud computing play in the evolution of peptide-based proteomics. Copyright © 2012 Elsevier Ltd. All rights reserved.
Phosphorylation-dependent mineral-type specificity for apatite-binding peptide sequences.

PubMed

Addison, William N; Miller, Sharon J; Ramaswamy, Janani; Mansouri, Ahmad; Kohn, David H; McKee, Marc D

2010-12-01

Apatite-binding peptides discovered by phage display provide an alternative design method for creating functional biomaterials for bone and tooth tissue repair. A limitation of this approach is the absence of display peptide phosphorylation--a post-translational modification important to mineral-binding proteins. To refine the material specificity of a recently identified apatite-binding peptide, and to determine critical design parameters (net charge, charge distribution, amino acid sequence and composition) controlling peptide affinity for mineral, we investigated the effects of phosphorylation and sequence scrambling on peptide adsorption to four different apatites (bone-like mineral, and three types of apatite containing initially 0, 5.6 and 10.5% carbonate). Phosphorylation of the VTKHLNQISQSY peptide (VTK peptide) led to a 10-fold increase in peptide adsorption (compared to nonphosphorylated peptide) to bone-like mineral, and a 2-fold increase in adsorption to the carbonated apatite, but there was no effect of phosphorylation on peptide affinity to pure hydroxyapatite (without carbonate). Sequence scrambling of the nonphosphorylated VTK peptide enhanced its specificity for the bone-like mineral, but scrambled phosphorylated VTK peptide (pVTK) did not significantly alter mineral-binding suggesting that despite the importance of sequence order and/or charge distribution to mineral-binding, the enhanced binding after phosphorylation exceeds any further enhancement by altered sequence order. Osteoblast culture mineralization was dose-dependently inhibited by pVTK and to a significantly lesser extent by scrambled pVTK, while the nonphosphorylated and scrambled forms had no effect, indicating that inhibition of osteoblast mineralization is dependent on both peptide sequence and charge. Computational modeling of peptide-mineral interactions indicated a favorable change in binding energy upon phosphorylation that was unaffected by scrambling. In conclusion, phosphorylation of serine residues increases peptide specificity for bone-like mineral, whose adsorption is determined primarily by sequence composition and net charge as opposed to sequence order. However, sequence order in addition to net charge modulates the mineralization of osteoblast cultures. The ability of such peptides to inhibit mineralization has potential utility in the management of pathologic calcification. Copyright © 2010 Elsevier Ltd. All rights reserved.
A statistical method for assessing peptide identification confidence in accurate mass and time tag proteomics

PubMed Central

Stanley, Jeffrey R.; Adkins, Joshua N.; Slysz, Gordon W.; Monroe, Matthew E.; Purvine, Samuel O.; Karpievitch, Yuliya V.; Anderson, Gordon A.; Smith, Richard D.; Dabney, Alan R.

2011-01-01

Current algorithms for quantifying peptide identification confidence in the accurate mass and time (AMT) tag approach assume that the AMT tags themselves have been correctly identified. However, there is uncertainty in the identification of AMT tags, as this is based on matching LC-MS/MS fragmentation spectra to peptide sequences. In this paper, we incorporate confidence measures for the AMT tag identifications into the calculation of probabilities for correct matches to an AMT tag database, resulting in a more accurate overall measure of identification confidence for the AMT tag approach. The method is referred to as Statistical Tools for AMT tag Confidence (STAC). STAC additionally provides a Uniqueness Probability (UP) to help distinguish between multiple matches to an AMT tag and a method to calculate an overall false discovery rate (FDR). STAC is freely available for download as both a command line and a Windows graphical application. PMID:21692516
Characterizing Peptide Neutral Losses Induced by Negative Electron-Transfer Dissociation (NETD)

PubMed Central

Rumachik, Neil G.; McAlister, Graeme C.; Russell, Jason D.; Bailey, Derek J.; Wenger, Craig D.; Coon, Joshua J.

2012-01-01

We implemented negative electron-transfer dissociation (NETD) on a hybrid ion trap/Orbitrap mass spectrometer to conduct ion/ion reactions using peptide anions and radical reagent cations. In addition to sequence-informative ladders of a•- and x-type fragment ions, NETD generated intense neutral loss peaks corresponding to the entire or partial side-chain cleavage from amino acids constituting a given peptide. Thus, a critical step towards the characterization of this recently introduced fragmentation technique is a systematic study of synthetic peptides to identify common neutral losses and preferential fragmentation pathways. Examining 46 synthetic peptides with high mass accuracy and high resolution analysis permitted facile determination of the chemical composition of each neutral loss. We identified 19 unique neutral losses from 14 amino acids and three modified amino acids, and assessed the specificity and sensitivity of each neutral loss using a database of 1542 confidently identified peptides generated from NETD shotgun experiments employing high-pH separations and negative electrospray ionization. As residue-specific neutral losses indicate the presence of certain amino acids, we determined that many neutral losses have potential diagnostic utility. We envision this catalogue of neutral losses being incorporated into database search algorithms to improve peptide identification specificity and to further advance characterization of the acidic proteome. PMID:22290482
Multiplex De Novo Sequencing of Peptide Antibiotics

NASA Astrophysics Data System (ADS)

Mohimani, Hosein; Liu, Wei-Ting; Yang, Yu-Liang; Gaudêncio, Susana P.; Fenical, William; Dorrestein, Pieter C.; Pevzner, Pavel A.

Proliferation of drug-resistant diseases raises the challenge of searching for new, more efficient antibiotics. Currently, some of the most effective antibiotics (i.e., Vancomycin and Daptomycin) are cyclic peptides produced by non-ribosomal biosynthetic pathways. The isolation and sequencing of cyclic peptide antibiotics, unlike the same activity with linear peptides, is time-consuming and error-prone. The dominant technique for sequencing cyclic peptides is NMR-based and requires large amounts (milligrams) of purified materials that, for most compounds, are not possible to obtain. Given these facts, there is a need for new tools to sequence cyclic NRPs using picograms of material. Since nearly all cyclic NRPs are produced along with related analogs, we develop a mass spectrometry approach for sequencing all related peptides at once (in contrast to the existing approach that analyzes individual peptides). Our results suggest that instead of attempting to isolate and NMR-sequence the most abundant compound, one should acquire spectra of many related compounds and sequence all of them simultaneously using tandem mass spectrometry. We illustrate applications of this approach by sequencing new variants of cyclic peptide antibiotics from Bacillus brevis, as well as sequencing a previously unknown familiy of cyclic NRPs produced by marine bacteria.
Rational Design of a Transferrin-Binding Peptide Sequence Tailored to Targeted Nanoparticle Internalization.

PubMed

Santi, Melissa; Maccari, Giuseppe; Mereghetti, Paolo; Voliani, Valerio; Rocchiccioli, Silvia; Ucciferri, Nadia; Luin, Stefano; Signore, Giovanni

2017-02-15

The transferrin receptor (TfR) is a promising target in cancer therapy owing to its overexpression in most solid tumors and on the blood-brain barrier. Nanostructures chemically derivatized with transferrin are employed in TfR targeting but often lose their functionality upon injection in the bloodstream. As an alternative strategy, we rationally designed a peptide coating able to bind transferrin on suitable pockets not involved in binding to TfR or iron by using an iterative multiscale-modeling approach coupled with quantitative structure-activity and relationship (QSAR) analysis and evolutionary algorithms. We tested that selected sequences have low aspecific protein adsorption and high binding energy toward transferrin, and one of them is efficiently internalized in cells with a transferrin-dependent pathway. Furthermore, it promotes transferrin-mediated endocytosis of gold nanoparticles by modifying their protein corona and promoting oriented adsorption of transferrin. This strategy leads to highly effective nanostructures, potentially useful in diagnostic and therapeutic applications, which exploit (and do not suffer) the protein solvation for achieving a better targeting.
SVM-Based Prediction of Propeptide Cleavage Sites in Spider Toxins Identifies Toxin Innovation in an Australian Tarantula

PubMed Central

Wong, Emily S. W.; Hardy, Margaret C.; Wood, David; Bailey, Timothy; King, Glenn F.

2013-01-01

Spider neurotoxins are commonly used as pharmacological tools and are a popular source of novel compounds with therapeutic and agrochemical potential. Since venom peptides are inherently toxic, the host spider must employ strategies to avoid adverse effects prior to venom use. It is partly for this reason that most spider toxins encode a protective proregion that upon enzymatic cleavage is excised from the mature peptide. In order to identify the mature toxin sequence directly from toxin transcripts, without resorting to protein sequencing, the propeptide cleavage site in the toxin precursor must be predicted bioinformatically. We evaluated different machine learning strategies (support vector machines, hidden Markov model and decision tree) and developed an algorithm (SpiderP) for prediction of propeptide cleavage sites in spider toxins. Our strategy uses a support vector machine (SVM) framework that combines both local and global sequence information. Our method is superior or comparable to current tools for prediction of propeptide sequences in spider toxins. Evaluation of the SVM method on an independent test set of known toxin sequences yielded 96% sensitivity and 100% specificity. Furthermore, we sequenced five novel peptides (not used to train the final predictor) from the venom of the Australian tarantula Selenotypus plumipes to test the accuracy of the predictor and found 80% sensitivity and 99.6% 8-mer specificity. Finally, we used the predictor together with homology information to predict and characterize seven groups of novel toxins from the deeply sequenced venom gland transcriptome of S. plumipes, which revealed structural complexity and innovations in the evolution of the toxins. The precursor prediction tool (SpiderP) is freely available on ArachnoServer (http://www.arachnoserver.org/spiderP.html), a web portal to a comprehensive relational database of spider toxins. All training data, test data, and scripts used are available from the SpiderP website. PMID:23894279
GuiTope: an application for mapping random-sequence peptides to protein sequences.

PubMed

Halperin, Rebecca F; Stafford, Phillip; Emery, Jack S; Navalkar, Krupa Arun; Johnston, Stephen Albert

2012-01-03

Random-sequence peptide libraries are a commonly used tool to identify novel ligands for binding antibodies, other proteins, and small molecules. It is often of interest to compare the selected peptide sequences to the natural protein binding partners to infer the exact binding site or the importance of particular residues. The ability to search a set of sequences for similarity to a set of peptides may sometimes enable the prediction of an antibody epitope or a novel binding partner. We have developed a software application designed specifically for this task. GuiTope provides a graphical user interface for aligning peptide sequences to protein sequences. All alignment parameters are accessible to the user including the ability to specify the amino acid frequency in the peptide library; these frequencies often differ significantly from those assumed by popular alignment programs. It also includes a novel feature to align di-peptide inversions, which we have found improves the accuracy of antibody epitope prediction from peptide microarray data and shows utility in analyzing phage display datasets. Finally, GuiTope can randomly select peptides from a given library to estimate a null distribution of scores and calculate statistical significance. GuiTope provides a convenient method for comparing selected peptide sequences to protein sequences, including flexible alignment parameters, novel alignment features, ability to search a database, and statistical significance of results. The software is available as an executable (for PC) at http://www.immunosignature.com/software and ongoing updates and source code will be available at sourceforge.net.
Electron-Transfer Ion/Ion Reactions of Doubly Protonated Peptides: Effect of Elevated Bath Gas Temperature

PubMed Central

Pitteri, Sharon J.; Chrisman, Paul A.; McLuckey, Scott A.

2005-01-01

In this study, the electron-transfer dissociation (ETD) behavior of cations derived from 27 different peptides (22 of which are tryptic peptides) has been studied in a 3D quadrupole ion trap mass spectrometer. Ion/ion reactions between peptide cations and nitrobenzene anions have been examined at both room temperature and in an elevated temperature bath gas environment to form ETD product ions. From the peptides studied, the ETD sequence coverage tends to be inversely related to peptide size. At room temperature, very high sequence coverage (~100%) was observed for small peptides (≤7 amino acids). For medium-sized peptides composed of 8–11 amino acids, the average sequence coverage was 46%. Larger peptides with 14 or more amino acids yielded an average sequence coverage of 23%. Elevated-temperature ETD provided increased sequence coverage over room-temperature experiments for the peptides of greater than 7 residues, giving an average of 67% for medium-sized peptides and 63% for larger peptides. Percent ETD, a measure of the extent of electron transfer, has also been calculated for the peptides and also shows an inverse relation with peptide size. Bath gas temperature does not have a consistent effect on percent ETD, however. For the tryptic peptides, fragmentation is localized at the ends of the peptides suggesting that the distribution of charge within the peptide may play an important role in determining fragmentation sites. A triply protonated peptide has also been studied and shows behavior similar to the doubly charged peptides. These preliminary results suggest that for a given charge state there is a maximum size for which high sequence coverage is obtained and that increasing the bath gas temperature can increase this maximum. PMID:16131079
Combinatorial Approach for Large-scale Identification of Linked Peptides from Tandem Mass Spectrometry Spectra*

PubMed Central

Wang, Jian; Anania, Veronica G.; Knott, Jeff; Rush, John; Lill, Jennie R.; Bourne, Philip E.; Bandeira, Nuno

2014-01-01

The combination of chemical cross-linking and mass spectrometry has recently been shown to constitute a powerful tool for studying protein–protein interactions and elucidating the structure of large protein complexes. However, computational methods for interpreting the complex MS/MS spectra from linked peptides are still in their infancy, making the high-throughput application of this approach largely impractical. Because of the lack of large annotated datasets, most current approaches do not capture the specific fragmentation patterns of linked peptides and therefore are not optimal for the identification of cross-linked peptides. Here we propose a generic approach to address this problem and demonstrate it using disulfide-bridged peptide libraries to (i) efficiently generate large mass spectral reference data for linked peptides at a low cost and (ii) automatically train an algorithm that can efficiently and accurately identify linked peptides from MS/MS spectra. We show that using this approach we were able to identify thousands of MS/MS spectra from disulfide-bridged peptides through comparison with proteome-scale sequence databases and significantly improve the sensitivity of cross-linked peptide identification. This allowed us to identify 60% more direct pairwise interactions between the protein subunits in the 20S proteasome complex than existing tools on cross-linking studies of the proteasome complexes. The basic framework of this approach and the MS/MS reference dataset generated should be valuable resources for the future development of new tools for the identification of linked peptides. PMID:24493012
CABS-dock web server for the flexible docking of peptides to proteins without prior knowledge of the binding site.

PubMed

Kurcinski, Mateusz; Jamroz, Michal; Blaszczyk, Maciej; Kolinski, Andrzej; Kmiecik, Sebastian

2015-07-01

Protein-peptide interactions play a key role in cell functions. Their structural characterization, though challenging, is important for the discovery of new drugs. The CABS-dock web server provides an interface for modeling protein-peptide interactions using a highly efficient protocol for the flexible docking of peptides to proteins. While other docking algorithms require pre-defined localization of the binding site, CABS-dock does not require such knowledge. Given a protein receptor structure and a peptide sequence (and starting from random conformations and positions of the peptide), CABS-dock performs simulation search for the binding site allowing for full flexibility of the peptide and small fluctuations of the receptor backbone. This protocol was extensively tested over the largest dataset of non-redundant protein-peptide interactions available to date (including bound and unbound docking cases). For over 80% of bound and unbound dataset cases, we obtained models with high or medium accuracy (sufficient for practical applications). Additionally, as optional features, CABS-dock can exclude user-selected binding modes from docking search or to increase the level of flexibility for chosen receptor fragments. CABS-dock is freely available as a web server at http://biocomp.chem.uw.edu.pl/CABSdock. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Computational studies of sequence-specific driving forces in peptide self-assembly

NASA Astrophysics Data System (ADS)

Jeon, Joohyun

Peptides are biopolymers made from various sequences of twenty different types of amino acids, connected by peptide bonds. There are practically an infinite number of possible sequences and tremendous possible combinations of peptide-peptide interactions. Recently, an increasing number of studies have shown a stark variety of peptide self-assembled nanomaterials whose detailed structures depend on their sequences and environmental factors; these have end uses in medical and bio-electronic applications, for example. To understand the underlying physics of complex peptide self-assembly processes and to delineate sequence specific effects, in this study, I use various simulation tools spanning all-atom molecular dynamics to simple lattice models and quantify the balance of interactions in the peptide self-assembly processes. In contrast to the existing view that peptides' aggregation propensities are proportional to the net sequence hydrophobicity and inversely proportional to the net charge, I show the more nuanced effects of electrostatic interactions, including the cooperative effects between hydrophobic and electrostatic interactions. Notably, I suggest rather unexpected, yet important roles of entropies in the small scale oligomerization processes. Overall, this study broadens our understanding of the role of thermodynamic driving forces in peptide self-assembly.
Phage display selection of peptides that target calcium-binding proteins.

PubMed

Vetter, Stefan W

2013-01-01

Phage display allows to rapidly identify peptide sequences with binding affinity towards target proteins, for example, calcium-binding proteins (CBPs). Phage technology allows screening of 10(9) or more independent peptide sequences and can identify CBP binding peptides within 2 weeks. Adjusting of screening conditions allows selecting CBPs binding peptides that are either calcium-dependent or independent. Obtained peptide sequences can be used to identify CBP target proteins based on sequence homology or to quickly obtain peptide-based CBP inhibitors to modulate CBP-target interactions. The protocol described here uses a commercially available phage display library, in which random 12-mer peptides are displayed on filamentous M13 phages. The library was screened against the calcium-binding protein S100B.
Hexicon 2: Automated Processing of Hydrogen-Deuterium Exchange Mass Spectrometry Data with Improved Deuteration Distribution Estimation

NASA Astrophysics Data System (ADS)

Lindner, Robert; Lou, Xinghua; Reinstein, Jochen; Shoeman, Robert L.; Hamprecht, Fred A.; Winkler, Andreas

2014-06-01

Hydrogen-deuterium exchange (HDX) experiments analyzed by mass spectrometry (MS) provide information about the dynamics and the solvent accessibility of protein backbone amide hydrogen atoms. Continuous improvement of MS instrumentation has contributed to the increasing popularity of this method; however, comprehensive automated data analysis is only beginning to mature. We present Hexicon 2, an automated pipeline for data analysis and visualization based on the previously published program Hexicon (Lou et al. 2010). Hexicon 2 employs the sensitive NITPICK peak detection algorithm of its predecessor in a divide-and-conquer strategy and adds new features, such as chromatogram alignment and improved peptide sequence assignment. The unique feature of deuteration distribution estimation was retained in Hexicon 2 and improved using an iterative deconvolution algorithm that is robust even to noisy data. In addition, Hexicon 2 provides a data browser that facilitates quality control and provides convenient access to common data visualization tasks. Analysis of a benchmark dataset demonstrates superior performance of Hexicon 2 compared with its predecessor in terms of deuteration centroid recovery and deuteration distribution estimation. Hexicon 2 greatly reduces data analysis time compared with manual analysis, whereas the increased number of peptides provides redundant coverage of the entire protein sequence. Hexicon 2 is a standalone application available free of charge under http://hx2.mpimf-heidelberg.mpg.de.

Hexicon 2: automated processing of hydrogen-deuterium exchange mass spectrometry data with improved deuteration distribution estimation.

PubMed

Lindner, Robert; Lou, Xinghua; Reinstein, Jochen; Shoeman, Robert L; Hamprecht, Fred A; Winkler, Andreas

2014-06-01

Hydrogen-deuterium exchange (HDX) experiments analyzed by mass spectrometry (MS) provide information about the dynamics and the solvent accessibility of protein backbone amide hydrogen atoms. Continuous improvement of MS instrumentation has contributed to the increasing popularity of this method; however, comprehensive automated data analysis is only beginning to mature. We present Hexicon 2, an automated pipeline for data analysis and visualization based on the previously published program Hexicon (Lou et al. 2010). Hexicon 2 employs the sensitive NITPICK peak detection algorithm of its predecessor in a divide-and-conquer strategy and adds new features, such as chromatogram alignment and improved peptide sequence assignment. The unique feature of deuteration distribution estimation was retained in Hexicon 2 and improved using an iterative deconvolution algorithm that is robust even to noisy data. In addition, Hexicon 2 provides a data browser that facilitates quality control and provides convenient access to common data visualization tasks. Analysis of a benchmark dataset demonstrates superior performance of Hexicon 2 compared with its predecessor in terms of deuteration centroid recovery and deuteration distribution estimation. Hexicon 2 greatly reduces data analysis time compared with manual analysis, whereas the increased number of peptides provides redundant coverage of the entire protein sequence. Hexicon 2 is a standalone application available free of charge under http://hx2.mpimf-heidelberg.mpg.de.
Function-based classification of carbohydrate-active enzymes by recognition of short, conserved peptide motifs.

PubMed

Busk, Peter Kamp; Lange, Lene

2013-06-01

Functional prediction of carbohydrate-active enzymes is difficult due to low sequence identity. However, similar enzymes often share a few short motifs, e.g., around the active site, even when the overall sequences are very different. To exploit this notion for functional prediction of carbohydrate-active enzymes, we developed a simple algorithm, peptide pattern recognition (PPR), that can divide proteins into groups of sequences that share a set of short conserved sequences. When this method was used on 118 glycoside hydrolase 5 proteins with 9% average pairwise identity and representing four characterized enzymatic functions, 97% of the proteins were sorted into groups correlating with their enzymatic activity. Furthermore, we analyzed 8,138 glycoside hydrolase 13 proteins including 204 experimentally characterized enzymes with 28 different functions. There was a 91% correlation between group and enzyme activity. These results indicate that the function of carbohydrate-active enzymes can be predicted with high precision by finding short, conserved motifs in their sequences. The glycoside hydrolase 61 family is important for fungal biomass conversion, but only a few proteins of this family have been functionally characterized. Interestingly, PPR divided 743 glycoside hydrolase 61 proteins into 16 subfamilies useful for targeted investigation of the function of these proteins and pinpointed three conserved motifs with putative importance for enzyme activity. Furthermore, the conserved sequences were useful for cloning of new, subfamily-specific glycoside hydrolase 61 proteins from 14 fungi. In conclusion, identification of conserved sequence motifs is a new approach to sequence analysis that can predict carbohydrate-active enzyme functions with high precision.
Chemometric analysis of Hymenoptera toxins and defensins: A model for predicting the biological activity of novel peptides from venoms and hemolymph.

PubMed

Saidemberg, Daniel M; Baptista-Saidemberg, Nicoli B; Palma, Mario S

2011-09-01

When searching for prospective novel peptides, it is difficult to determine the biological activity of a peptide based only on its sequence. The "trial and error" approach is generally laborious, expensive and time consuming due to the large number of different experimental setups required to cover a reasonable number of biological assays. To simulate a virtual model for Hymenoptera insects, 166 peptides were selected from the venoms and hemolymphs of wasps, bees and ants and applied to a mathematical model of multivariate analysis, with nine different chemometric components: GRAVY, aliphaticity index, number of disulfide bonds, total residues, net charge, pI value, Boman index, percentage of alpha helix, and flexibility prediction. Principal component analysis (PCA) with non-linear iterative projections by alternating least-squares (NIPALS) algorithm was performed, without including any information about the biological activity of the peptides. This analysis permitted the grouping of peptides in a way that strongly correlated to the biological function of the peptides. Six different groupings were observed, which seemed to correspond to the following groups: chemotactic peptides, mastoparans, tachykinins, kinins, antibiotic peptides, and a group of long peptides with one or two disulfide bonds and with biological activities that are not yet clearly defined. The partial overlap between the mastoparans group and the chemotactic peptides, tachykinins, kinins and antibiotic peptides in the PCA score plot may be used to explain the frequent reports in the literature about the multifunctionality of some of these peptides. The mathematical model used in the present investigation can be used to predict the biological activities of novel peptides in this system, and it may also be easily applied to other biological systems. Copyright © 2011 Elsevier Inc. All rights reserved.
Affinity selection of Nipah and Hendra virus-related vaccine candidates from a complex random peptide library displayed on bacteriophage virus-like particles

DOE Office of Scientific and Technical Information (OSTI.GOV)

Peabody, David S.; Chackerian, Bryce; Ashley, Carlee

The invention relates to virus-like particles of bacteriophage MS2 (MS2 VLPs) displaying peptide epitopes or peptide mimics of epitopes of Nipah Virus envelope glycoprotein that elicit an immune response against Nipah Virus upon vaccination of humans or animals. Affinity selection on Nipah Virus-neutralizing monoclonal antibodies using random sequence peptide libraries on MS2 VLPs selected peptides with sequence similarity to peptide sequences found within the envelope glycoprotein of Nipah itself, thus identifying the epitopes the antibodies recognize. The selected peptide sequences themselves are not necessarily identical in all respects to a sequence within Nipah Virus glycoprotein, and therefore may be referredmore » to as epitope mimics VLPs displaying these epitope mimics can serve as vaccine. On the other hand, display of the corresponding wild-type sequence derived from Nipah Virus and corresponding to the epitope mapped by affinity selection, may also be used as a vaccine.« less
Meta sequence analysis of human blood peptides and their parent proteins.

PubMed

Bowden, Peter; Pendrak, Voitek; Zhu, Peihong; Marshall, John G

2010-04-18

Sequence analysis of the blood peptides and their qualities will be key to understanding the mechanisms that contribute to error in LC-ESI-MS/MS. Analysis of peptides and their proteins at the level of sequences is much more direct and informative than the comparison of disparate accession numbers. A portable database of all blood peptide and protein sequences with descriptor fields and gene ontology terms might be useful for designing immunological or MRM assays from human blood. The results of twelve studies of human blood peptides and/or proteins identified by LC-MS/MS and correlated against a disparate array of genetic libraries were parsed and matched to proteins from the human ENSEMBL, SwissProt and RefSeq databases by SQL. The reported peptide and protein sequences were organized into an SQL database with full protein sequences and up to five unique peptides in order of prevalence along with the peptide count for each protein. Structured query language or BLAST was used to acquire descriptive information in current databases. Sampling error at the level of peptides is the largest source of disparity between groups. Chi Square analysis of peptide to protein distributions confirmed the significant agreement between groups on identified proteins. Copyright 2010. Published by Elsevier B.V.
Anatomy and evolution of database search engines-a central component of mass spectrometry based proteomic workflows.

PubMed

Verheggen, Kenneth; Raeder, Helge; Berven, Frode S; Martens, Lennart; Barsnes, Harald; Vaudel, Marc

2017-09-13

Sequence database search engines are bioinformatics algorithms that identify peptides from tandem mass spectra using a reference protein sequence database. Two decades of development, notably driven by advances in mass spectrometry, have provided scientists with more than 30 published search engines, each with its own properties. In this review, we present the common paradigm behind the different implementations, and its limitations for modern mass spectrometry datasets. We also detail how the search engines attempt to alleviate these limitations, and provide an overview of the different software frameworks available to the researcher. Finally, we highlight alternative approaches for the identification of proteomic mass spectrometry datasets, either as a replacement for, or as a complement to, sequence database search engines. © 2017 Wiley Periodicals, Inc.
Exploring Site-Specific N-Glycosylation Microheterogeneity of Haptoglobin using Glycopeptide CID Tandem Mass Spectra and Glycan Database Search

PubMed Central

Chandler, Kevin Brown; Pompach, Petr; Goldman, Radoslav

2013-01-01

Glycosylation is a common protein modification with a significant role in many vital cellular processes and human diseases, making the characterization of protein-attached glycan structures important for understanding cell biology and disease processes. Direct analysis of protein N-glycosylation by tandem mass spectrometry of glycopeptides promises site-specific elucidation of N-glycan microheterogeneity, something which detached N-glycan and de-glycosylated peptide analyses cannot provide. However, successful implementation of direct N-glycopeptide analysis by tandem mass spectrometry remains a challenge. In this work, we consider algorithmic techniques for the analysis of LC-MS/MS data acquired from glycopeptide-enriched fractions of enzymatic digests of purified proteins. We implement a computational strategy which takes advantage of the properties of CID fragmentation spectra of N-glycopeptides, matching the MS/MS spectra to peptide-glycan pairs from protein sequences and glycan structure databases. Significantly, we also propose a novel false-discovery-rate estimation technique to estimate and manage the number of false identifications. We use a human glycoprotein standard, haptoglobin, digested with trypsin and GluC, enriched for glycopeptides using HILIC chromatography, and analyzed by LC-MS/MS to demonstrate our algorithmic strategy and evaluate its performance. Our software, GlycoPeptideSearch (GPS), assigned glycopeptide identifications to 246 of the spectra at false-discovery-rate 5.58%, identifying 42 distinct haptoglobin peptide-glycan pairs at each of the four haptoglobin N-linked glycosylation sites. We further demonstrate the effectiveness of this approach by analyzing plasma-derived haptoglobin, identifying 136 N-linked glycopeptide spectra at false-discovery-rate 0.4%, representing 15 distinct glycopeptides on at least three of the four N-linked glycosylation sites. The software, GlycoPeptideSearch, is available for download from http://edwardslab.bmcb.georgetown.edu/GPS. PMID:23829323
Molecular basis of branched peptides resistance to enzyme proteolysis.

PubMed

Falciani, Chiara; Lozzi, Luisa; Pini, Alessandro; Corti, Federico; Fabbrini, Monica; Bernini, Andrea; Lelli, Barbara; Niccolai, Neri; Bracci, Luisa

2007-03-01

We found that synthetic peptides in the form of dendrimers become resistant to proteolysis. To determine the molecular basis of this resistance, different bioactive peptides were synthesized in monomeric, two-branched and tetra-branched form and incubated with human plasma and serum. Proteolytic resistance of branched multimeric sequences was compared to that of the same peptides synthesized as multimeric linear molecules. Unmodified peptides and cleaved sequences were detected by high pressure liquid chromatography and mass spectrometry. An increase in peptide copies did not increase peptide resistance in linear multimeric sequences, whereas multimericity progressively enhanced proteolytic stability of branched multimeric peptides. A structure-based hypothesis of branched peptide resistance to proteolysis by metallopeptidases is presented.
Identification and application of self-binding zipper-like sequences in SARS-CoV spike protein.

PubMed

Zhang, Si Min; Liao, Ying; Neo, Tuan Ling; Lu, Yanning; Liu, Ding Xiang; Vahlne, Anders; Tam, James P

2018-05-22

Self-binding peptides containing zipper-like sequences, such as the Leu/Ile zipper sequence within the coiled coil regions of proteins and the cross-β spine steric zippers within the amyloid-like fibrils, could bind to the protein-of-origin through homophilic sequence-specific zipper motifs. These self-binding sequences represent opportunities for the development of biochemical tools and/or therapeutics. Here, we report on the identification of a putative self-binding β-zipper-forming peptide within the severe acute respiratory syndrome-associated coronavirus spike (S) protein and its application in viral detection. Peptide array scanning of overlapping peptides covering the entire length of S protein identified 34 putative self-binding peptides of six clusters, five of which contained octapeptide core consensus sequences. The Cluster I consensus octapeptide sequence GINITNFR was predicted by the Eisenberg's 3D profile method to have high amyloid-like fibrillation potential through steric β-zipper formation. Peptide C6 containing the Cluster I consensus sequence was shown to oligomerize and form amyloid-like fibrils. Taking advantage of this, C6 was further applied to detect the S protein expression in vitro by fluorescence staining. Meanwhile, the coiled-coil-forming Leu/Ile heptad repeat sequences within the S protein were under-represented during peptide array scanning, in agreement with that long peptide lengths were required to attain high helix-mediated interaction avidity. The data suggest that short β-zipper-like self-binding peptides within the S protein could be identified through combining the peptide scanning and predictive methods, and could be exploited as biochemical detection reagents for viral infection. Copyright © 2018. Published by Elsevier Ltd.
Computational design of d-peptide inhibitors of hepatitis delta antigen dimerization

NASA Astrophysics Data System (ADS)

Elkin, Carl D.; Zuccola, Harmon J.; Hogle, James M.; Joseph-McCarthy, Diane

2000-11-01

Hepatitis delta virus (HDV) encodes a single polypeptide called hepatitis delta antigen (DAg). Dimerization of DAg is required for viral replication. The structure of the dimerization region, residues 12 to 60, consists of an anti-parallel coiled coil [Zuccola et al., Structure, 6 (1998) 821]. Multiple Copy Simultaneous Searches (MCSS) of the hydrophobic core region formed by the bend in the helix of one monomer of this structure were carried out for many diverse functional groups. Six critical interaction sites were identified. The Protein Data Bank was searched for backbone templates to use in the subsequent design process by matching to these sites. A 14 residue helix expected to bind to the d-isomer of the target structure was selected as the template. Over 200 000 mutant sequences of this peptide were generated based on the MCSS results. A secondary structure prediction algorithm was used to screen all sequences, and in general only those that were predicted to be highly helical were retained. Approximately 100 of these 14-mers were model built as d-peptides and docked with the l-isomer of the target monomer. Based on calculated interaction energies, predicted helicity, and intrahelical salt bridge patterns, a small number of peptides were selected as the most promising candidates. The ligand design approach presented here is the computational analogue of mirror image phage display. The results have been used to characterize the interactions responsible for formation of this model anti-parallel coiled coil and to suggest potential ligands to disrupt it.
Peptides derivatized with bicyclic quaternary ammonium ionization tags. Sequencing via tandem mass spectrometry.

PubMed

Setner, Bartosz; Rudowska, Magdalena; Klem, Ewelina; Cebrat, Marek; Szewczuk, Zbigniew

2014-10-01

Improving the sensitivity of detection and fragmentation of peptides to provide reliable sequencing of peptides is an important goal of mass spectrometric analysis. Peptides derivatized by bicyclic quaternary ammonium ionization tags: 1-azabicyclo[2.2.2]octane (ABCO) or 1,4-diazabicyclo[2.2.2]octane (DABCO), are characterized by an increased detection sensitivity in electrospray ionization mass spectrometry (ESI-MS) and longer retention times on the reverse-phase (RP) chromatography columns. The improvement of the detection limit was observed even for peptides dissolved in 10 mM NaCl. Collision-induced dissociation tandem mass spectrometry of quaternary ammonium salts derivatives of peptides showed dominant a- and b-type ions, allowing facile sequencing of peptides. The bicyclic ionization tags are stable in collision-induced dissociation experiments, and the resulted fragmentation pattern is not significantly influenced by either acidic or basic amino acid residues in the peptide sequence. Obtained results indicate the general usefulness of the bicyclic quaternary ammonium ionization tags for ESI-MS/MS sequencing of peptides. Copyright © 2014 John Wiley & Sons, Ltd.
sNebula, a network-based algorithm to predict binding between human leukocyte antigens and peptides

PubMed Central

Luo, Heng; Ye, Hao; Ng, Hui Wen; Sakkiah, Sugunadevi; Mendrick, Donna L.; Hong, Huixiao

2016-01-01

Understanding the binding between human leukocyte antigens (HLAs) and peptides is important to understand the functioning of the immune system. Since it is time-consuming and costly to measure the binding between large numbers of HLAs and peptides, computational methods including machine learning models and network approaches have been developed to predict HLA-peptide binding. However, there are several limitations for the existing methods. We developed a network-based algorithm called sNebula to address these limitations. We curated qualitative Class I HLA-peptide binding data and demonstrated the prediction performance of sNebula on this dataset using leave-one-out cross-validation and five-fold cross-validations. This algorithm can predict not only peptides of different lengths and different types of HLAs, but also the peptides or HLAs that have no existing binding data. We believe sNebula is an effective method to predict HLA-peptide binding and thus improve our understanding of the immune system. PMID:27558848
The primary structure of rat liver ribosomal protein L37. Homology with yeast and bacterial ribosomal proteins.

PubMed

Lin, A; McNally, J; Wool, I G

1983-09-10

The covalent structure of the rat liver 60 S ribosomal subunit protein L37 was determined. Twenty-four tryptic peptides were purified and the sequence of each was established; they accounted for all 111 residues of L37. The sequence of the first 30 residues of L37, obtained previously by automated Edman degradation of the intact protein, provided the alignment of the first 9 tryptic peptides. Three peptides (CN1, CN2, and CN3) were produced by cleavage of protein L37 with cyanogen bromide. The sequence of CN1 (65 residues) was established from the sequence of secondary peptides resulting from cleavage with trypsin and chymotrypsin. The sequence of CN1 in turn served to order tryptic peptides 1 through 14. The sequence of CN2 (15 residues) was determined entirely by a micromanual procedure and allowed the alignment of tryptic peptides 14 through 18. The sequence of the NH2-terminal 28 amino acids of CN3 (31 residues) was determined; in addition the complete sequences of the secondary tryptic and chymotryptic peptides were done. The sequence of CN3 provided the order of tryptic peptides 18 through 24. Thus the sequence of the three cyanogen bromide peptides also accounted for the 111 residues of protein L37. The carboxyl-terminal amino acids were identified after carboxypeptidase A treatment. There is a disulfide bridge between half-cystinyl residues at positions 40 and 69. Rat liver ribosomal protein L37 is homologous with yeast YP55 and with Escherichia coli L34. Moreover, there is a segment of 17 residues in rat L37 that occurs, albeit with modifications, in yeast YP55 and in E. coli S4, L20, and L34.
Structures of the transmembrane helices of the G-protein coupled receptor, rhodopsin.

PubMed

Katragadda, M; Chopra, A; Bennett, M; Alderfer, J L; Yeagle, P L; Albert, A D

2001-07-01

An hypothesis is tested that individual peptides corresponding to the transmembrane helices of the membrane protein, rhodopsin, would form helices in solution similar to those in the native protein. Peptides containing the sequences of helices 1, 4 and 5 of rhodopsin were synthesized. Two peptides, with overlapping sequences at their termini, were synthesized to cover each of the helices. The peptides from helix 1 and helix 4 were helical throughout most of their length. The N- and C-termini of all the peptides were disordered and proline caused opening of the helical structure in both helix 1 and helix 4. The peptides from helix 5 were helical in the middle segment of each peptide, with larger disordered regions in the N- and C-termini than for helices 1 and 4. These observations show that there is a strong helical propensity in the amino acid sequences corresponding to the transmembrane domain of this G-protein coupled receptor. In the case of the peptides from helix 4, it was possible to superimpose the structures of the overlapping sequences to produce a construct covering the whole of the sequence of helix 4 of rhodopsin. As similar superposition for the peptides from helix 1 also produced a construct, but somewhat less successfully because of the disordering in the region of sequence overlap. This latter problem was more severe for helix 5 and therefore a single peptide was synthesized for the entire sequence of this helix, and its structure determined. It proved to be helical throughout. Comparison of all these structures with the recent crystal structure of rhodopsin revealed that the peptide structures mimicked the structures seen in the whole protein. Thus similar studies of peptides may provide useful information on the secondary structure of other transmembrane proteins built around helical bundles.
Ammonium sulfate and MALDI in-source decay: a winning combination for sequencing peptides

PubMed Central

Delvolve, Alice; Woods, Amina S.

2009-01-01

In previous papers we highlighted the role of ammonium sulfate in increasing peptide fragmentation by in source decay (ISD). The current work systematically investigated effects of MALDI extraction delay, peptide amino acid composition, matrix and ammonium sulfate concentration on peptides ISD fragmentation. The data confirmed that ammonium sulfate increased peptides signal to noise ratio as well as their in source fragmentation resulting in complete sequence coverage regardless of the amino acid composition. This method is easy, inexpensive and generates the peptides sequence instantly. PMID:19877641
Bromine isotopic signature facilitates de novo sequencing of peptides in free-radical-initiated peptide sequencing (FRIPS) mass spectrometry.

PubMed

Nam, Jungjoo; Kwon, Hyuksu; Jang, Inae; Jeon, Aeran; Moon, Jingyu; Lee, Sun Young; Kang, Dukjin; Han, Sang Yun; Moon, Bongjin; Oh, Han Bin

2015-02-01

We recently showed that free-radical-initiated peptide sequencing mass spectrometry (FRIPS MS) assisted by the remarkable thermochemical stability of (2,2,6,6-tetramethyl-piperidin-1-yl)oxyl (TEMPO) is another attractive radical-driven peptide fragmentation MS tool. Facile homolytic cleavage of the bond between the benzylic carbon and the oxygen of the TEMPO moiety in o-TEMPO-Bz-C(O)-peptide and the high reactivity of the benzylic radical species generated in •Bz-C(O)-peptide are key elements leading to extensive radical-driven peptide backbone fragmentation. In the present study, we demonstrate that the incorporation of bromine into the benzene ring, i.e. o-TEMPO-Bz(Br)-C(O)-peptide, allows unambiguous distinction of the N-terminal peptide fragments from the C-terminal fragments through the unique bromine doublet isotopic signature. Furthermore, bromine substitution does not alter the overall radical-driven peptide backbone dissociation pathways of o-TEMPO-Bz-C(O)-peptide. From a practical perspective, the presence of the bromine isotopic signature in the N-terminal peptide fragments in TEMPO-assisted FRIPS MS represents a useful and cost-effective opportunity for de novo peptide sequencing. Copyright © 2015 John Wiley & Sons, Ltd.
Peptide de novo sequencing of mixture tandem mass spectra

PubMed Central

Hotta, Stéphanie Yuki Kolbeck; Verano‐Braga, Thiago; Kjeldsen, Frank

2016-01-01

The impact of mixture spectra deconvolution on the performance of four popular de novo sequencing programs was tested using artificially constructed mixture spectra as well as experimental proteomics data. Mixture fragmentation spectra are recognized as a limitation in proteomics because they decrease the identification performance using database search engines. De novo sequencing approaches are expected to be even more sensitive to the reduction in mass spectrum quality resulting from peptide precursor co‐isolation and thus prone to false identifications. The deconvolution approach matched complementary b‐, y‐ions to each precursor peptide mass, which allowed the creation of virtual spectra containing sequence specific fragment ions of each co‐isolated peptide. Deconvolution processing resulted in equally efficient identification rates but increased the absolute number of correctly sequenced peptides. The improvement was in the range of 20–35% additional peptide identifications for a HeLa lysate sample. Some correct sequences were identified only using unprocessed spectra; however, the number of these was lower than those where improvement was obtained by mass spectral deconvolution. Tight candidate peptide score distribution and high sensitivity to small changes in the mass spectrum introduced by the employed deconvolution method could explain some of the missing peptide identifications. PMID:27329701
A Support Vector Machine based method to distinguish proteobacterial proteins from eukaryotic plant proteins

PubMed Central

2012-01-01

Background Members of the phylum Proteobacteria are most prominent among bacteria causing plant diseases that result in a diminution of the quantity and quality of food produced by agriculture. To ameliorate these losses, there is a need to identify infections in early stages. Recent developments in next generation nucleic acid sequencing and mass spectrometry open the door to screening plants by the sequences of their macromolecules. Such an approach requires the ability to recognize the organismal origin of unknown DNA or peptide fragments. There are many ways to approach this problem but none have emerged as the best protocol. Here we attempt a systematic way to determine organismal origins of peptides by using a machine learning algorithm. The algorithm that we implement is a Support Vector Machine (SVM). Result The amino acid compositions of proteobacterial proteins were found to be different from those of plant proteins. We developed an SVM model based on amino acid and dipeptide compositions to distinguish between a proteobacterial protein and a plant protein. The amino acid composition (AAC) based SVM model had an accuracy of 92.44% with 0.85 Matthews correlation coefficient (MCC) while the dipeptide composition (DC) based SVM model had a maximum accuracy of 94.67% and 0.89 MCC. We also developed SVM models based on a hybrid approach (AAC and DC), which gave a maximum accuracy 94.86% and a 0.90 MCC. The models were tested on unseen or untrained datasets to assess their validity. Conclusion The results indicate that the SVM based on the AAC and DC hybrid approach can be used to distinguish proteobacterial from plant protein sequences. PMID:23046503
BIOPEP database and other programs for processing bioactive peptide sequences.

PubMed

Minkiewicz, Piotr; Dziuba, Jerzy; Iwaniak, Anna; Dziuba, Marta; Darewicz, Małgorzata

2008-01-01

This review presents the potential for application of computational tools in peptide science based on a sample BIOPEP database and program as well as other programs and databases available via the World Wide Web. The BIOPEP application contains a database of biologically active peptide sequences and a program enabling construction of profiles of the potential biological activity of protein fragments, calculation of quantitative descriptors as measures of the value of proteins as potential precursors of bioactive peptides, and prediction of bonds susceptible to hydrolysis by endopeptidases in a protein chain. Other bioactive and allergenic peptide sequence databases are also presented. Programs enabling the construction of binary and multiple alignments between peptide sequences, the construction of sequence motifs attributed to a given type of bioactivity, searching for potential precursors of bioactive peptides, and the prediction of sites susceptible to proteolytic cleavage in protein chains are available via the Internet as are other approaches concerning secondary structure prediction and calculation of physicochemical features based on amino acid sequence. Programs for prediction of allergenic and toxic properties have also been developed. This review explores the possibilities of cooperation between various programs.
Design and construction of 2A peptide-linked multicistronic vectors.

PubMed

Szymczak-Workman, Andrea L; Vignali, Kate M; Vignali, Dario A A

2012-02-01

The need for reliable, multicistronic vectors for multigene delivery is at the forefront of biomedical technology. This article describes the design and construction of 2A peptide-linked multicistronic vectors, which can be used to express multiple proteins from a single open reading frame (ORF). The small 2A peptide sequences, when cloned between genes, allow for efficient, stoichiometric production of discrete protein products within a single vector through a novel "cleavage" event within the 2A peptide sequence. Expression of more than two genes using conventional approaches has several limitations, most notably imbalanced protein expression and large size. The use of 2A peptide sequences alleviates these concerns. They are small (18-22 amino acids) and have divergent amino-terminal sequences, which minimizes the chance for homologous recombination and allows for multiple, different 2A peptide sequences to be used within a single vector. Importantly, separation of genes placed between 2A peptide sequences is nearly 100%, which allows for stoichiometric and concordant expression of the genes, regardless of the order of placement within the vector.

Library Design-Facilitated High-Throughput Sequencing of Synthetic Peptide Libraries.

PubMed

Vinogradov, Alexander A; Gates, Zachary P; Zhang, Chi; Quartararo, Anthony J; Halloran, Kathryn H; Pentelute, Bradley L

2017-11-13

A methodology to achieve high-throughput de novo sequencing of synthetic peptide mixtures is reported. The approach leverages shotgun nanoliquid chromatography coupled with tandem mass spectrometry-based de novo sequencing of library mixtures (up to 2000 peptides) as well as automated data analysis protocols to filter away incorrect assignments, noise, and synthetic side-products. For increasing the confidence in the sequencing results, mass spectrometry-friendly library designs were developed that enabled unambiguous decoding of up to 600 peptide sequences per hour while maintaining greater than 85% sequence identification rates in most cases. The reliability of the reported decoding strategy was additionally confirmed by matching fragmentation spectra for select authentic peptides identified from library sequencing samples. The methods reported here are directly applicable to screening techniques that yield mixtures of active compounds, including particle sorting of one-bead one-compound libraries and affinity enrichment of synthetic library mixtures performed in solution.
Intravenous phage display identifies peptide sequences that target the burn-injured intestine.

PubMed

Costantini, Todd W; Eliceiri, Brian P; Putnam, James G; Bansal, Vishal; Baird, Andrew; Coimbra, Raul

2012-11-01

The injured intestine is responsible for significant morbidity and mortality after severe trauma and burn; however, targeting the intestine with therapeutics aimed at decreasing injury has proven difficult. We hypothesized that we could use intravenous phage display technology to identify peptide sequences that target the injured intestinal mucosa in a murine model, and then confirm the cross-reactivity of this peptide sequence with ex vivo human gut. Four hours following 30% TBSA burn we performed an in vivo, intravenous systemic administration of phage library containing 10(12) phage in balb/c mice to biopan for gut-targeting peptides. In vivo assessment of the candidate peptide sequences identified after 4 rounds of internalization was performed by injecting 1×10(12) copies of each selected phage clone into sham or burned animals. Internalization into the gut was assessed using quantitative polymerase chain reaction. We then incubated this gut-targeting peptide sequence with human intestine and visualized fluorescence using confocal microscopy. We identified 3 gut-targeting peptide sequences which caused collapse of the phage library (4-1: SGHQLLLNKMP, 4-5: ILANDLTAPGPR, 4-11: SFKPSGLPAQSL). Sequence 4-5 was internalized into the intestinal mucosa of burned animals 9.3-fold higher than sham animals injected with the same sequence (2.9×10(5)vs. 3.1×10(4) particles per mg tissue). Sequences 4-1 and 4-11 were both internalized into the gut, but did not demonstrate specificity for the injured mucosa. Phage sequence 4-11 demonstrated cross-reactivity with human intestine. In the future, this gut-targeting peptide sequence could serve as a platform for the delivery of biotherapeutics. Copyright © 2012 Elsevier Inc. All rights reserved.
Identification of cancer-specific motifs in mimotope profiles of serum antibody repertoire.

PubMed

Gerasimov, Ekaterina; Zelikovsky, Alex; Măndoiu, Ion; Ionov, Yurij

2017-06-07

For fighting cancer, earlier detection is crucial. Circulating auto-antibodies produced by the patient's own immune system after exposure to cancer proteins are promising bio-markers for the early detection of cancer. Since an antibody recognizes not the whole antigen but 4-7 critical amino acids within the antigenic determinant (epitope), the whole proteome can be represented by a random peptide phage display library. This opens the possibility to develop an early cancer detection test based on a set of peptide sequences identified by comparing cancer patients' and healthy donors' global peptide profiles of antibody specificities. Due to the enormously large number of peptide sequences contained in global peptide profiles generated by next generation sequencing, the large number of cancer and control sera is required to identify cancer-specific peptides with high degree of statistical significance. To decrease the number of peptides in profiles generated by nextgen sequencing without losing cancer-specific sequences we used for generation of profiles the phage library enriched by panning on the pool of cancer sera. To further decrease the complexity of profiles we used computational methods for transforming a list of peptides constituting the mimotope profiles to the list motifs formed by similar peptide sequences. We have shown that the amino-acid order is meaningful in mimotope motifs since they contain significantly more peptides than motifs among peptides where amino-acids are randomly permuted. Also the single sample motifs significantly differ from motifs in peptides drawn from multiple samples. Finally, multiple cancer-specific motifs have been identified.
Switch-peptides: design and characterization of controllable super-amyloid-forming host-guest peptides as tools for identifying anti-amyloid agents.

PubMed

Camus, Marie-Stéphanie; Dos Santos, Sonia; Chandravarkar, Arunan; Mandal, Bhubaneswar; Schmid, Adrian W; Tuchscherer, Gabriele; Mutter, Manfred; Lashuel, Hilal A

2008-09-01

Several amyloid-forming proteins are characterized by the presence of hydrophobic and highly amyloidogenic core sequences that play critical roles in the initiation and progression of amyloid fibril formation. Therefore targeting these sequences represents a viable strategy for identifying candidate molecules that could interfere with amyloid formation and toxicity of the parent proteins. However, the highly amyloidogenic and insoluble nature of these sequences has hampered efforts to develop high-throughput fibrillization assays. Here we describe the design and characterization of host-guest switch peptides that can be used for in vitro mechanistic and screening studies that are aimed at discovering aggregation inhibitors that target highly amyloidogenic sequences. These model systems are based on a host-guest system where the amyloidogenic sequence (guest peptide) is flanked by two beta-sheet-promoting (Leu-Ser)(n) oligomers as host sequences. Two host-guest peptides were prepared by using the hydrophobic core of Abeta comprising residues 14-24 (HQKLVFFAEDV) as the guest peptide with switch elements inserted within (peptide 1) or at the N and C termini of the guest peptide (peptide 2). Both model peptides can be triggered to undergo rapid self-assembly and amyloid formation in a highly controllable manner and their fibrillization kinetics is tuneable by manipulating solution conditions (for example, peptide concentration and pH). The fibrillization of both peptides reproduces many features of the full-length Abeta peptides and can be inhibited by known inhibitors of Abeta fibril formation. Our results suggest that this approach can be extended to other amyloid proteins and should facilitate the discovery of small-molecule aggregation inhibitors and the development of more efficacious anti-amyloid agents to treat and/or reverse the pathogenesis of neurodegenerative and systemic amyloid diseases.
Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

NASA Astrophysics Data System (ADS)

McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

2016-05-01

Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.
Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides.

PubMed

McMillen, Chelsea L; Wright, Patience M; Cassady, Carolyn J

2016-05-01

Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.
Determination of the sequences of protein-derived peptides and peptide mixtures by mass spectrometry

PubMed Central

Morris, Howard R.; Williams, Dudley H.; Ambler, Richard P.

1971-01-01

Micro-quantities of protein-derived peptides have been converted into N-acetylated permethyl derivatives, and their sequences determined by low-resolution mass spectrometry without prior knowledge of their amino acid compositions or lengths. A new strategy is suggested for the mass spectrometric sequencing of oligopeptides or proteins, involving gel filtration of protein hydrolysates and subsequent sequence analysis of peptide mixtures. Finally, results are given that demonstrate for the first time the use of mass spectrometry for the analysis of a protein-derived peptide mixture, again without prior knowledge of the protein or components within the mixture. PMID:5158904
Identification of tissue-specific targeting peptide

NASA Astrophysics Data System (ADS)

Jung, Eunkyoung; Lee, Nam Kyung; Kang, Sang-Kee; Choi, Seung-Hoon; Kim, Daejin; Park, Kisoo; Choi, Kihang; Choi, Yun-Jaie; Jung, Dong Hyun

2012-11-01

Using phage display technique, we identified tissue-targeting peptide sets that recognize specific tissues (bone-marrow dendritic cell, kidney, liver, lung, spleen and visceral adipose tissue). In order to rapidly evaluate tissue-specific targeting peptides, we performed machine learning studies for predicting the tissue-specific targeting activity of peptides on the basis of peptide sequence information using four machine learning models and isolated the groups of peptides capable of mediating selective targeting to specific tissues. As a representative liver-specific targeting sequence, the peptide "DKNLQLH" was selected by the sequence similarity analysis. This peptide has a high degree of homology with protein ligands which can interact with corresponding membrane counterparts. We anticipate that our models will be applicable to the prediction of tissue-specific targeting peptides which can recognize the endothelial markers of target tissues.
qPMS9: An Efficient Algorithm for Quorum Planted Motif Search

NASA Astrophysics Data System (ADS)

Nicolae, Marius; Rajasekaran, Sanguthevar

2015-01-01

Discovering patterns in biological sequences is a crucial problem. For example, the identification of patterns in DNA sequences has resulted in the determination of open reading frames, identification of gene promoter elements, intron/exon splicing sites, and SH RNAs, location of RNA degradation signals, identification of alternative splicing sites, etc. In protein sequences, patterns have led to domain identification, location of protease cleavage sites, identification of signal peptides, protein interactions, determination of protein degradation elements, identification of protein trafficking elements, discovery of short functional motifs, etc. In this paper we focus on the identification of an important class of patterns, namely, motifs. We study the (l, d) motif search problem or Planted Motif Search (PMS). PMS receives as input n strings and two integers l and d. It returns all sequences M of length l that occur in each input string, where each occurrence differs from M in at most d positions. Another formulation is quorum PMS (qPMS), where the motif appears in at least q% of the strings. We introduce qPMS9, a parallel exact qPMS algorithm that offers significant runtime improvements on DNA and protein datasets. qPMS9 solves the challenging DNA (l, d)-instances (28, 12) and (30, 13). The source code is available at https://code.google.com/p/qpms9/.
Evaluation of Phage Display Discovered Peptides as Ligands for Prostate-Specific Membrane Antigen (PSMA)

PubMed Central

Edwards, W. Barry

2013-01-01

The aim of this study was to identify potential ligands of PSMA suitable for further development as novel PSMA-targeted peptides using phage display technology. The human PSMA protein was immobilized as a target followed by incubation with a 15-mer phage display random peptide library. After one round of prescreening and two rounds of screening, high-stringency screening at the third round of panning was performed to identify the highest affinity binders. Phages which had a specific binding activity to PSMA in human prostate cancer cells were isolated and the DNA corresponding to the 15-mers were sequenced to provide three consensus sequences: GDHSPFT, SHFSVGS and EVPRLSLLAVFL as well as other sequences that did not display consensus. Two of the peptide sequences deduced from DNA sequencing of binding phages, SHSFSVGSGDHSPFT and GRFLTGGTGRLLRIS were labeled with 5-carboxyfluorescein and shown to bind and co-internalize with PSMA on human prostate cancer cells by fluorescence microscopy. The high stringency requirements yielded peptides with affinities KD∼1 µM or greater which are suitable starting points for affinity maturation. While these values were less than anticipated, the high stringency did yield peptide sequences that apparently bound to different surfaces on PSMA. These peptide sequences could be the basis for further development of peptides for prostate cancer tumor imaging and therapy. PMID:23935860
Accelerating String Set Matching in FPGA Hardware for Bioinformatics Research

PubMed Central

Dandass, Yoginder S; Burgess, Shane C; Lawrence, Mark; Bridges, Susan M

2008-01-01

Background This paper describes techniques for accelerating the performance of the string set matching problem with particular emphasis on applications in computational proteomics. The process of matching peptide sequences against a genome translated in six reading frames is part of a proteogenomic mapping pipeline that is used as a case-study. The Aho-Corasick algorithm is adapted for execution in field programmable gate array (FPGA) devices in a manner that optimizes space and performance. In this approach, the traditional Aho-Corasick finite state machine (FSM) is split into smaller FSMs, operating in parallel, each of which matches up to 20 peptides in the input translated genome. Each of the smaller FSMs is further divided into five simpler FSMs such that each simple FSM operates on a single bit position in the input (five bits are sufficient for representing all amino acids and special symbols in protein sequences). Results This bit-split organization of the Aho-Corasick implementation enables efficient utilization of the limited random access memory (RAM) resources available in typical FPGAs. The use of on-chip RAM as opposed to FPGA logic resources for FSM implementation also enables rapid reconfiguration of the FPGA without the place and routing delays associated with complex digital designs. Conclusion Experimental results show storage efficiencies of over 80% for several data sets. Furthermore, the FPGA implementation executing at 100 MHz is nearly 20 times faster than an implementation of the traditional Aho-Corasick algorithm executing on a 2.67 GHz workstation. PMID:18412963
A Proteomic Workflow Using High-Throughput De Novo Sequencing Towards Complementation of Genome Information for Improved Comparative Crop Science.

PubMed

Turetschek, Reinhard; Lyon, David; Desalegn, Getinet; Kaul, Hans-Peter; Wienkoop, Stefanie

2016-01-01

The proteomic study of non-model organisms, such as many crop plants, is challenging due to the lack of comprehensive genome information. Changing environmental conditions require the study and selection of adapted cultivars. Mutations, inherent to cultivars, hamper protein identification and thus considerably complicate the qualitative and quantitative comparison in large-scale systems biology approaches. With this workflow, cultivar-specific mutations are detected from high-throughput comparative MS analyses, by extracting sequence polymorphisms with de novo sequencing. Stringent criteria are suggested to filter for confidential mutations. Subsequently, these polymorphisms complement the initially used database, which is ready to use with any preferred database search algorithm. In our example, we thereby identified 26 specific mutations in two cultivars of Pisum sativum and achieved an increased number (17 %) of peptide spectrum matches.
Peptide de novo sequencing of mixture tandem mass spectra.

PubMed

Gorshkov, Vladimir; Hotta, Stéphanie Yuki Kolbeck; Verano-Braga, Thiago; Kjeldsen, Frank

2016-09-01

The impact of mixture spectra deconvolution on the performance of four popular de novo sequencing programs was tested using artificially constructed mixture spectra as well as experimental proteomics data. Mixture fragmentation spectra are recognized as a limitation in proteomics because they decrease the identification performance using database search engines. De novo sequencing approaches are expected to be even more sensitive to the reduction in mass spectrum quality resulting from peptide precursor co-isolation and thus prone to false identifications. The deconvolution approach matched complementary b-, y-ions to each precursor peptide mass, which allowed the creation of virtual spectra containing sequence specific fragment ions of each co-isolated peptide. Deconvolution processing resulted in equally efficient identification rates but increased the absolute number of correctly sequenced peptides. The improvement was in the range of 20-35% additional peptide identifications for a HeLa lysate sample. Some correct sequences were identified only using unprocessed spectra; however, the number of these was lower than those where improvement was obtained by mass spectral deconvolution. Tight candidate peptide score distribution and high sensitivity to small changes in the mass spectrum introduced by the employed deconvolution method could explain some of the missing peptide identifications. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Phage display peptide libraries: deviations from randomness and correctives

PubMed Central

Ryvkin, Arie; Ashkenazy, Haim; Weiss-Ottolenghi, Yael; Piller, Chen; Pupko, Tal; Gershoni, Jonathan M

2018-01-01

Abstract Peptide-expressing phage display libraries are widely used for the interrogation of antibodies. Affinity selected peptides are then analyzed to discover epitope mimetics, or are subjected to computational algorithms for epitope prediction. A critical assumption for these applications is the random representation of amino acids in the initial naïve peptide library. In a previous study, we implemented next generation sequencing to evaluate a naïve library and discovered severe deviations from randomness in UAG codon over-representation as well as in high G phosphoramidite abundance causing amino acid distribution biases. In this study, we demonstrate that the UAG over-representation can be attributed to the burden imposed on the phage upon the assembly of the recombinant Protein 8 subunits. This was corrected by constructing the libraries using supE44-containing bacteria which suppress the UAG driven abortive termination. We also demonstrate that the overabundance of G stems from variant synthesis-efficiency and can be corrected using compensating oligonucleotide-mixtures calibrated by mass spectroscopy. Construction of libraries implementing these correctives results in markedly improved libraries that display random distribution of amino acids, thus ensuring that enriched peptides obtained in biopanning represent a genuine selection event, a fundamental assumption for phage display applications. PMID:29420788
Partial amino acid sequence of the branched chain amino acid aminotransferase (TmB) of E. coli JA199 pDU11

DOE Office of Scientific and Technical Information (OSTI.GOV)

Feild, M.J.; Armstrong, F.B.

1987-05-01

E. coli JA199 pDU11 harbors a multicopy plasmid containing the ilv GEDAY gene cluster of S. typhimurium. TmB, gene product of ilv E, was purified, crystallized, and subjected to Edman degradation using a gas phase sequencer. The intact protein yielded an amino terminal 31 residue sequence. Both carboxymethylated apoenzyme and (/sup 3/H)-NaBH-reduced holoenzyme were then subjected to digestion by trypsin. The digests were fractionated using reversed phase HPLC, and the peptides isolated were sequenced. The borohydride-treated holoenzyme was used to isolate the cofactor-binding peptide. The peptide is 27 residues long and a comparison with known sequences of other aminotransferases revealedmore » limited homology. Peptides accounting for 211 of 288 predicted residues have been sequenced, including 9 residues of the carboxyl terminus. Comparison of peptides with the inferred amino acid sequence of the E. coli K-12 enzyme has helped determine the sequence of the amino terminal 59 residues; only two differences between the sequences are noted in this region.« less
Can natural proteins designed with 'inverted' peptide sequences adopt native-like protein folds?

PubMed

Sridhar, Settu; Guruprasad, Kunchur

2014-01-01

We have carried out a systematic computational analysis on a representative dataset of proteins of known three-dimensional structure, in order to evaluate whether it would possible to 'swap' certain short peptide sequences in naturally occurring proteins with their corresponding 'inverted' peptides and generate 'artificial' proteins that are predicted to retain native-like protein fold. The analysis of 3,967 representative proteins from the Protein Data Bank revealed 102,677 unique identical inverted peptide sequence pairs that vary in sequence length between 5-12 and 18 amino acid residues. Our analysis illustrates with examples that such 'artificial' proteins may be generated by identifying peptides with 'similar structural environment' and by using comparative protein modeling and validation studies. Our analysis suggests that natural proteins may be tolerant to accommodating such peptides.
Discovery of Neuropeptides in the Nematode Ascaris suum by Database Mining and Tandem Mass Spectrometry

PubMed Central

Jarecki, Jessica L.; Frey, Brian L.; Smith, Lloyd M.; Stretton, Antony O.

2011-01-01

Liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) was used to discover peptides in extracts of the large parasitic nematode Ascaris suum. This required the assembly of a new database of known and predicted peptides. In addition to those already sequenced, peptides were either previously predicted to be processed from precursor proteins identified in an A. suum library of expressed sequence tags (ESTs), or newly predicted from a library of A. suum genome survey sequences (GSSs). The predicted MS/MS fragmentation patterns of this collection of real and putative peptides were compared with the actual fragmentation patterns found in the MS/MS spectra of peptides fractionated by MS; this enabled individual peptides to be sequenced. Many previously identified peptides were found, and 21 novel peptides were discovered. Thus, this approach is very useful, despite the fact that the available GSS database is still preliminary, having only 1X coverage. PMID:21524146
Sequence dependent aggregation of peptides and fibril formation

NASA Astrophysics Data System (ADS)

Hung, Nguyen Ba; Le, Duy-Manh; Hoang, Trinh X.

2017-09-01

Deciphering the links between amino acid sequence and amyloid fibril formation is key for understanding protein misfolding diseases. Here we use Monte Carlo simulations to study the aggregation of short peptides in a coarse-grained model with hydrophobic-polar (HP) amino acid sequences and correlated side chain orientations for hydrophobic contacts. A significant heterogeneity is observed in the aggregate structures and in the thermodynamics of aggregation for systems of different HP sequences and different numbers of peptides. Fibril-like ordered aggregates are found for several sequences that contain the common HPH pattern, while other sequences may form helix bundles or disordered aggregates. A wide variation of the aggregation transition temperatures among sequences, even among those of the same hydrophobic fraction, indicates that not all sequences undergo aggregation at a presumable physiological temperature. The transition is found to be the most cooperative for sequences forming fibril-like structures. For a fibril-prone sequence, it is shown that fibril formation follows the nucleation and growth mechanism. Interestingly, a binary mixture of peptides of an aggregation-prone and a non-aggregation-prone sequence shows the association and conversion of the latter to the fibrillar structure. Our study highlights the role of a sequence in selecting fibril-like aggregates and also the impact of a structural template on fibril formation by peptides of unrelated sequences.
Effect of sequence and stereochemistry reversal on p53 peptide mimicry.

PubMed

Atzori, Alessio; Baker, Audrey E; Chiu, Mark; Bryce, Richard A; Bonnet, Pascal

2013-01-01

Peptidomimetics effective in modulating protein-protein interactions and resistant to proteolysis have potential in therapeutic applications. An appealing yet underperforming peptidomimetic strategy is to employ D-amino acids and reversed sequences to mimic a lead peptide conformation, either separately or as the combined retro-inverso peptide. In this work, we examine the conformations of inverse, reverse and retro-inverso peptides of p53(15-29) using implicit solvent molecular dynamics simulation and circular dichroism spectroscopy. In order to obtain converged ensembles for the peptides, we find enhanced sampling is required via the replica exchange molecular dynamics method. From these replica exchange simulations, the D-peptide analogues of p53(15-29) result in a predominantly left-handed helical conformation. When the parent sequence is reversed sequence as either the L-peptide and D-peptide, these peptides display a greater helical propensity, feature reflected by NMR and CD studies in TFE/water solvent. The simulations also indicate that, while approximately similar orientations of the side-chains are possible by the peptide analogues, their ability to mimic the parent peptide is severely compromised by backbone orientation (for D-amino acids) and side-chain orientation (for reversed sequences). A retro-inverso peptide is disadvantaged as a mimic in both aspects, and further chemical modification is required to enable this concept to be used fruitfully in peptidomimetic design. The replica exchange molecular simulation approach adopted here, with its ability to provide detailed conformational insights into modified peptides, has potential as a tool to guide structure-based design of new improved peptidomimetics.
Optimization and high-throughput screening of antimicrobial peptides.

PubMed

Blondelle, Sylvie E; Lohner, Karl

2010-01-01

While a well-established process for lead compound discovery in for-profit companies, high-throughput screening is becoming more popular in basic and applied research settings in academia. The development of combinatorial libraries combined with easy and less expensive access to new technologies have greatly contributed to the implementation of high-throughput screening in academic laboratories. While such techniques were earlier applied to simple assays involving single targets or based on binding affinity, they have now been extended to more complex systems such as whole cell-based assays. In particular, the urgent need for new antimicrobial compounds that would overcome the rapid rise of drug-resistant microorganisms, where multiple target assays or cell-based assays are often required, has forced scientists to focus onto high-throughput technologies. Based on their existence in natural host defense systems and their different mode of action relative to commercial antibiotics, antimicrobial peptides represent a new hope in discovering novel antibiotics against multi-resistant bacteria. The ease of generating peptide libraries in different formats has allowed a rapid adaptation of high-throughput assays to the search for novel antimicrobial peptides. Similarly, the availability nowadays of high-quantity and high-quality antimicrobial peptide data has permitted the development of predictive algorithms to facilitate the optimization process. This review summarizes the various library formats that lead to de novo antimicrobial peptide sequences as well as the latest structural knowledge and optimization processes aimed at improving the peptides selectivity.

Constancy and diversity in the flavivirus fusion peptide.

PubMed

Seligman, Stephen J

2008-02-14

Flaviviruses include the mosquito-borne dengue, Japanese encephalitis, yellow fever and West Nile and the tick-borne encephalitis viruses. They are responsible for considerable world-wide morbidity and mortality. Viral entry is mediated by a conserved fusion peptide containing 16 amino acids located in domain II of the envelope protein E. Highly orchestrated conformational changes initiated by exposure to acidic pH accompany the fusion process and are important factors limiting amino acid changes in the fusion peptide that still permit fusion with host cell membranes in both arthropod and vertebrate hosts. The cell-fusing related agents, growing only in mosquitoes or insect cell lines, possess a different homologous peptide. Analysis of 46 named flaviviruses deposited in the Entrez Nucleotides database extended the constancy in the canonical fusion peptide sequences of mosquito-borne, tick-borne and viruses with no known vector to include more recently-sequenced viruses. The mosquito-borne signature amino acid, G104, was also found in flaviviruses with no known vector and with the cell-fusion related viruses. Despite the constancy in the canonical sequences in pathogenic flaviviruses, mutations were surprisingly frequent with a 27% prevalence of nonsynonymous mutations in yellow fever virus fusion peptide sequences, and 0 to 7.4% prevalence in the others. Six of seven yellow fever patients whose virus had fusion peptide mutations died. In the cell-fusing related agents, not enough sequences have been deposited to estimate reliably the prevalence of fusion peptide mutations. However, the canonical sequences homologous to the fusion peptide and the pattern of disulfide linkages in protein E differed significantly from the other flaviviruses. The constancy of the canonical fusion peptide sequences in the arthropod-borne flaviviruses contrasts with the high prevalence of mutations in most individual viruses. The discrepancy may be the result of a survival advantage accompanying sequence diversity (quasispecies) involving the fusion peptide. Limited clinical data with yellow fever virus suggest that the presence of fusion peptide mutants is not associated with a decreased case fatality rate. The cell-fusing related agents may have substantial differences from other flaviviruses in their mechanism of viral entry into the host cell.
Ligand Docking to Intermediate and Close-To-Bound Conformers Generated by an Elastic Network Model Based Algorithm for Highly Flexible Proteins

PubMed Central

Kurkcuoglu, Zeynep; Doruker, Pemra

2016-01-01

Incorporating receptor flexibility in small ligand-protein docking still poses a challenge for proteins undergoing large conformational changes. In the absence of bound structures, sampling conformers that are accessible by apo state may facilitate docking and drug design studies. For this aim, we developed an unbiased conformational search algorithm, by integrating global modes from elastic network model, clustering and energy minimization with implicit solvation. Our dataset consists of five diverse proteins with apo to complex RMSDs 4.7–15 Å. Applying this iterative algorithm on apo structures, conformers close to the bound-state (RMSD 1.4–3.8 Å), as well as the intermediate states were generated. Dockings to a sequence of conformers consisting of a closed structure and its “parents” up to the apo were performed to compare binding poses on different states of the receptor. For two periplasmic binding proteins and biotin carboxylase that exhibit hinge-type closure of two dynamics domains, the best pose was obtained for the conformer closest to the bound structure (ligand RMSDs 1.5–2 Å). In contrast, the best pose for adenylate kinase corresponded to an intermediate state with partially closed LID domain and open NMP domain, in line with recent studies (ligand RMSD 2.9 Å). The docking of a helical peptide to calmodulin was the most challenging case due to the complexity of its 15 Å transition, for which a two-stage procedure was necessary. The technique was first applied on the extended calmodulin to generate intermediate conformers; then peptide docking and a second generation stage on the complex were performed, which in turn yielded a final peptide RMSD of 2.9 Å. Our algorithm is effective in producing conformational states based on the apo state. This study underlines the importance of such intermediate states for ligand docking to proteins undergoing large transitions. PMID:27348230
PGCA: An algorithm to link protein groups created from MS/MS data

PubMed Central

Sasaki, Mayu; Hollander, Zsuzsanna; Smith, Derek; McManus, Bruce; McMaster, W. Robert; Ng, Raymond T.; Cohen Freue, Gabriela V.

2017-01-01

The quantitation of proteins using shotgun proteomics has gained popularity in the last decades, simplifying sample handling procedures, removing extensive protein separation steps and achieving a relatively high throughput readout. The process starts with the digestion of the protein mixture into peptides, which are then separated by liquid chromatography and sequenced by tandem mass spectrometry (MS/MS). At the end of the workflow, recovering the identity of the proteins originally present in the sample is often a difficult and ambiguous process, because more than one protein identifier may match a set of peptides identified from the MS/MS spectra. To address this identification problem, many MS/MS data processing software tools combine all plausible protein identifiers matching a common set of peptides into a protein group. However, this solution introduces new challenges in studies with multiple experimental runs, which can be characterized by three main factors: i) protein groups’ identifiers are local, i.e., they vary run to run, ii) the composition of each group may change across runs, and iii) the supporting evidence of proteins within each group may also change across runs. Since in general there is no conclusive evidence about the absence of proteins in the groups, protein groups need to be linked across different runs in subsequent statistical analyses. We propose an algorithm, called Protein Group Code Algorithm (PGCA), to link groups from multiple experimental runs by forming global protein groups from connected local groups. The algorithm is computationally inexpensive and enables the connection and analysis of lists of protein groups across runs needed in biomarkers studies. We illustrate the identification problem and the stability of the PGCA mapping using 65 iTRAQ experimental runs. Further, we use two biomarker studies to show how PGCA enables the discovery of relevant candidate protein group markers with similar but non-identical compositions in different runs. PMID:28562641
Activation of Adhesion G Protein-coupled Receptors: AGONIST SPECIFICITY OF STACHEL SEQUENCE-DERIVED PEPTIDES.

PubMed

Demberg, Lilian M; Winkler, Jana; Wilde, Caroline; Simon, Kay-Uwe; Schön, Julia; Rothemund, Sven; Schöneberg, Torsten; Prömel, Simone; Liebscher, Ines

2017-03-17

Members of the adhesion G protein-coupled receptor (aGPCR) family carry an agonistic sequence within their large ectodomains. Peptides derived from this region, called the Stachel sequence, can activate the respective receptor. As the conserved core region of the Stachel sequence is highly similar between aGPCRs, the agonist specificity of Stachel sequence-derived peptides was tested between family members using cell culture-based second messenger assays. Stachel peptides derived from aGPCRs of subfamily VI (GPR110/ADGRF1, GPR116/ADGRF5) and subfamily VIII (GPR64/ADGRG2, GPR126/ADGRG6) are able to activate more than one member of the respective subfamily supporting their evolutionary relationship and defining them as pharmacological receptor subtypes. Extended functional analyses of the Stachel sequences and derived peptides revealed agonist promiscuity, not only within, but also between aGPCR subfamilies. For example, the Stachel -derived peptide of GPR110 (subfamily VI) can activate GPR64 and GPR126 (both subfamily VIII). Our results indicate that key residues in the Stachel sequence are very similar between aGPCRs allowing for agonist promiscuity of several Stachel -derived peptides. Therefore, aGPCRs appear to be pharmacologically more closely related than previously thought. Our findings have direct implications for many aGPCR studies, as potential functional overlap has to be considered for in vitro and in vivo studies. However, it also offers the possibility of a broader use of more potent peptides when the original Stachel sequence is less effective. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Targeting the Atypical Chemokine Receptor ACKR3/CXCR7: Phase 1 - Phage Display Peptide Identification and Characterization.

PubMed

Vestal, R D; LaJeunesse, D R; Taylor, E W

2016-01-01

One of the greatest challenges in fighting cancer is cell targeting and biomarker selection. The Atypical Chemokine Receptor ACKR3/CXCR7 is expressed on many cancer cell types, including breast cancer and glioblastoma, and binds the endogenous ligands SDF1/CXCL12 and ITAC/CXCL11. A 20 amino acid region of the ACKR3/CXCR7 N-terminus was synthesized and targeted with the NEB PhD-7 Phage Display Peptide Library. Twenty-nine phages were isolated and heptapeptide inserts sequenced; of these, 23 sequences were unique. A 3D molecular model was created for the ACKR3/CXCR7 N-terminus by mutating the corresponding region of the crystal structure of CXCR4 with bound SDF1/CXCL12. A ClustalW alignment was performed on each peptide sequence using the entire SDF1/CXCL12 sequence as the template. The 23-peptide sequences showed similarity to three distinct regions of the SDF1/CXCL12 molecule. A 3D molecular model was made for each of the phage peptide inserts to visually identify potential areas of steric interference of peptides that simulated CXCL12 regions not in contact with the receptor's Nterminus. An ELISA analysis of the relative binding affinity between the peptides identified 9 peptides with statistically significant results. The candidate pool of 9 peptides was further reduced to 3 peptides based on their affinity for the targeted N-terminus region peptide versus no target peptide present or a scrambled negative control peptide. The results clearly show the Phage Display protocol can be used to target a synthesized region of the ACKR3/CXCR7 N-terminus. The 3 peptides chosen, P20, P3, and P9, will be the basis for further targeting studies.
Gastrointestinal Endogenous Proteins as a Source of Bioactive Peptides - An In Silico Study

PubMed Central

Dave, Lakshmi A.; Montoya, Carlos A.; Rutherfurd, Shane M.; Moughan, Paul J.

2014-01-01

Dietary proteins are known to contain bioactive peptides that are released during digestion. Endogenous proteins secreted into the gastrointestinal tract represent a quantitatively greater supply of protein to the gut lumen than those of dietary origin. Many of these endogenous proteins are digested in the gastrointestinal tract but the possibility that these are also a source of bioactive peptides has not been considered. An in silico prediction method was used to test if bioactive peptides could be derived from the gastrointestinal digestion of gut endogenous proteins. Twenty six gut endogenous proteins and seven dietary proteins were evaluated. The peptides present after gastric and intestinal digestion were predicted based on the amino acid sequence of the proteins and the known specificities of the major gastrointestinal proteases. The predicted resultant peptides possessing amino acid sequences identical to those of known bioactive peptides were identified. After gastrointestinal digestion (based on the in silico simulation), the total number of bioactive peptides predicted to be released ranged from 1 (gliadin) to 55 (myosin) for the selected dietary proteins and from 1 (secretin) to 39 (mucin-5AC) for the selected gut endogenous proteins. Within the intact proteins and after simulated gastrointestinal digestion, angiotensin converting enzyme (ACE)-inhibitory peptide sequences were the most frequently observed in both the dietary and endogenous proteins. Among the dietary proteins, after in silico simulated gastrointestinal digestion, myosin was found to have the highest number of ACE-inhibitory peptide sequences (49 peptides), while for the gut endogenous proteins, mucin-5AC had the greatest number of ACE-inhibitory peptide sequences (38 peptides). Gut endogenous proteins may be an important source of bioactive peptides in the gut particularly since gut endogenous proteins represent a quantitatively large and consistent source of protein. PMID:24901416
Combining results of multiple search engines in proteomics.

PubMed

Shteynberg, David; Nesvizhskii, Alexey I; Moritz, Robert L; Deutsch, Eric W

2013-09-01

A crucial component of the analysis of shotgun proteomics datasets is the search engine, an algorithm that attempts to identify the peptide sequence from the parent molecular ion that produced each fragment ion spectrum in the dataset. There are many different search engines, both commercial and open source, each employing a somewhat different technique for spectrum identification. The set of high-scoring peptide-spectrum matches for a defined set of input spectra differs markedly among the various search engine results; individual engines each provide unique correct identifications among a core set of correlative identifications. This has led to the approach of combining the results from multiple search engines to achieve improved analysis of each dataset. Here we review the techniques and available software for combining the results of multiple search engines and briefly compare the relative performance of these techniques.
Combining Results of Multiple Search Engines in Proteomics*

PubMed Central

Shteynberg, David; Nesvizhskii, Alexey I.; Moritz, Robert L.; Deutsch, Eric W.

2013-01-01

A crucial component of the analysis of shotgun proteomics datasets is the search engine, an algorithm that attempts to identify the peptide sequence from the parent molecular ion that produced each fragment ion spectrum in the dataset. There are many different search engines, both commercial and open source, each employing a somewhat different technique for spectrum identification. The set of high-scoring peptide-spectrum matches for a defined set of input spectra differs markedly among the various search engine results; individual engines each provide unique correct identifications among a core set of correlative identifications. This has led to the approach of combining the results from multiple search engines to achieve improved analysis of each dataset. Here we review the techniques and available software for combining the results of multiple search engines and briefly compare the relative performance of these techniques. PMID:23720762
PH dependent adhesive peptides

DOEpatents

Tomich, John; Iwamoto, Takeo; Shen, Xinchun; Sun, Xiuzhi Susan

2010-06-29

A novel peptide adhesive motif is described that requires no receptor or cross-links to achieve maximal adhesive strength. Several peptides with different degrees of adhesive strength have been designed and synthesized using solid phase chemistries. All peptides contain a common hydrophobic core sequence flanked by positively or negatively charged amino acids sequences.
Examination of segmental average mass spectra from liquid chromatography-tandem mass spectrometric (LC-MS/MS) data enables screening of multiple types of protein modifications.

PubMed

Liu, Nai-Yu; Lee, Hsiao-Hui; Chang, Zee-Fen; Tsay, Yeou-Guang

2015-09-10

It has been observed that a modified peptide and its non-modified counterpart, when analyzed with reverse phase liquid chromatography, usually share a very similar elution property [1-3]. Inasmuch as this property is common to many different types of protein modifications, we propose an informatics-based approach, featuring the generation of segmental average mass spectra ((sa)MS), that is capable of locating different types of modified peptides in two-dimensional liquid chromatography-mass spectrometric (LC-MS) data collected for regular protease digests from proteins in gels or solutions. To enable the localization of these peptides in the LC-MS map, we have implemented a set of computer programs, or the (sa)MS package, that perform the needed functions, including generating a complete set of segmental average mass spectra, compiling the peptide inventory from the Sequest/TurboSequest results, searching modified peptide candidates and annotating a tandem mass spectrum for final verification. Using ROCK2 as an example, our programs were applied to identify multiple types of modified peptides, such as phosphorylated and hexosylated ones, which particularly include those peptides that could have been ignored due to their peculiar fragmentation patterns and consequent low search scores. Hence, we demonstrate that, when complemented with peptide search algorithms, our approach and the entailed computer programs can add the sequence information needed for bolstering the confidence of data interpretation by the present analytical platforms and facilitate the mining of protein modification information out of complicated LC-MS/MS data. Copyright © 2015 Elsevier B.V. All rights reserved.
ACTG: novel peptide mapping onto gene models.

PubMed

Choi, Seunghyuk; Kim, Hyunwoo; Paek, Eunok

2017-04-15

In many proteogenomic applications, mapping peptide sequences onto genome sequences can be very useful, because it allows us to understand origins of the gene products. Existing software tools either take the genomic position of a peptide start site as an input or assume that the peptide sequence exactly matches the coding sequence of a given gene model. In case of novel peptides resulting from genomic variations, especially structural variations such as alternative splicing, these existing tools cannot be directly applied unless users supply information about the variant, either its genomic position or its transcription model. Mapping potentially novel peptides to genome sequences, while allowing certain genomic variations, requires introducing novel gene models when aligning peptide sequences to gene structures. We have developed a new tool called ACTG (Amino aCids To Genome), which maps peptides to genome, assuming all possible single exon skipping, junction variation allowing three edit distances from the original splice sites, exon extension and frame shift. In addition, it can also consider SNVs (single nucleotide variations) during mapping phase if a user provides the VCF (variant call format) file as an input. Available at http://prix.hanyang.ac.kr/ACTG/search.jsp . eunokpaek@hanyang.ac.kr. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
How Messenger RNA and Nascent Chain Sequences Regulate Translation Elongation.

PubMed

Choi, Junhong; Grosely, Rosslyn; Prabhakar, Arjun; Lapointe, Christopher P; Wang, Jinfan; Puglisi, Joseph D

2018-06-20

Translation elongation is a highly coordinated, multistep, multifactor process that ensures accurate and efficient addition of amino acids to a growing nascent-peptide chain encoded in the sequence of translated messenger RNA (mRNA). Although translation elongation is heavily regulated by external factors, there is clear evidence that mRNA and nascent-peptide sequences control elongation dynamics, determining both the sequence and structure of synthesized proteins. Advances in methods have driven experiments that revealed the basic mechanisms of elongation as well as the mechanisms of regulation by mRNA and nascent-peptide sequences. In this review, we highlight how mRNA and nascent-peptide elements manipulate the translation machinery to alter the dynamics and pathway of elongation.
A peptide sequence on carcinoembryonic antigen binds to a 80kD protein on Kupffer cells.

PubMed

Thomas, P; Petrick, A T; Toth, C A; Fox, E S; Elting, J J; Steele, G

1992-10-30

Clearance of carcinoembryonic antigen (CEA) from the circulation is by binding to Kupffer cells in the liver. We have shown that CEA binding to Kupffer cells occurs via a peptide sequence YPELPK representing amino acids 107-112 of the CEA sequence. This peptide sequence is located in the region between the N-terminal and the first immunoglobulin like loop domain. Using native CEA and peptides containing this sequence complexed with a heterobifunctional crosslinking agent and ligand blotting with biotinylated CEA and NCA we have shown binding to an 80kD protein on the Kupffer cell surface. This binding protein may be important in the development of hepatic metastases.
Improved prediction of MHC class I and class II epitopes using a novel Gibbs sampling approach.

PubMed

Nielsen, Morten; Lundegaard, Claus; Worning, Peder; Hvid, Christina Sylvester; Lamberth, Kasper; Buus, Søren; Brunak, Søren; Lund, Ole

2004-06-12

Prediction of which peptides will bind a specific major histocompatibility complex (MHC) constitutes an important step in identifying potential T-cell epitopes suitable as vaccine candidates. MHC class II binding peptides have a broad length distribution complicating such predictions. Thus, identifying the correct alignment is a crucial part of identifying the core of an MHC class II binding motif. In this context, we wish to describe a novel Gibbs motif sampler method ideally suited for recognizing such weak sequence motifs. The method is based on the Gibbs sampling method, and it incorporates novel features optimized for the task of recognizing the binding motif of MHC classes I and II. The method locates the binding motif in a set of sequences and characterizes the motif in terms of a weight-matrix. Subsequently, the weight-matrix can be applied to identifying effectively potential MHC binding peptides and to guiding the process of rational vaccine design. We apply the motif sampler method to the complex problem of MHC class II binding. The input to the method is amino acid peptide sequences extracted from the public databases of SYFPEITHI and MHCPEP and known to bind to the MHC class II complex HLA-DR4(B1*0401). Prior identification of information-rich (anchor) positions in the binding motif is shown to improve the predictive performance of the Gibbs sampler. Similarly, a consensus solution obtained from an ensemble average over suboptimal solutions is shown to outperform the use of a single optimal solution. In a large-scale benchmark calculation, the performance is quantified using relative operating characteristics curve (ROC) plots and we make a detailed comparison of the performance with that of both the TEPITOPE method and a weight-matrix derived using the conventional alignment algorithm of ClustalW. The calculation demonstrates that the predictive performance of the Gibbs sampler is higher than that of ClustalW and in most cases also higher than that of the TEPITOPE method.
Targeting the atypical chemokine receptor ACKR3/CXCR7 for the treatment of cancer and other diseases

NASA Astrophysics Data System (ADS)

Vestal, Richard D., Jr.

One of the greatest challenges in fighting cancer is cell targeting and biomarker selection. The Atypical Chemokine Receptor ACKR3/CXCR7 is expressed on many cancer cell types, including breast cancer and glioblastoma, and binds the endogenous ligands SDF1/CXCL12 and ITAC/CXCL11. A 20 amino acid region of the ACKR3/CXCR7 N-terminus was synthesized and targeted with the NEB PhD-7 Phage Display Peptide Library. Twenty-nine phages were isolated and heptapeptide inserts sequenced; of these, 23 sequences were unique. A 3D molecular model was created for the ACKR3/CXCR7 N-terminus by mutating the corresponding region of the crystal structure of CXCR4 with bound SDF1/CXCL12. A ClustalW alignment was performed on each peptide sequence using the entire SDF1/CXCL12 sequence as the template. The 23-peptide sequences showed similarity to three distinct regions of the SDF1/CXCL12 molecule. A 3D molecular model was made for each of the phage peptide inserts to visually identify potential areas of steric interference of peptides that simulated CXCL12 regions not in contact with the receptor's N-terminus. An ELISA analysis of the relative binding affinity between the peptides identified 9 peptides with statistically significant results. The candidate pool of 9 peptides was further reduced to 3 peptides based on their affinity for the targeted N-terminus region peptide versus no target peptide present or a scrambled negative control peptide. The results clearly show the Phage Display protocol can be used to target a synthesized region of the ACKR3/CXCR7 N-terminus. The 3 peptides chosen, P20, P3, and P9, showed no effect on the viability or proliferation upon exposure to MCF-7 and U87-MG cells. Membrane binding, colocalization, and cellular uptake were confirmed by whole-cell ELISA and confocal microscopy. The recovered peptides did not activate the receptor as confirmed by a Beta-Arrestin recruitment assay. The data shows that the peptide sequences recovered from the phage display protocol are viable candidates for targeting cancer cells and delivering material to them.
Streptococcal phosphoenolpyruvate-sugar phosphotransferase system: amino acid sequence and site of ATP-dependent phosphorylation of HPr

DOE Office of Scientific and Technical Information (OSTI.GOV)

Deutscher, J.; Pevec, B.; Beyreuther, K.

1986-10-21

The amino acid sequence of histidine-containing protein (HPr) from Streptococcus faecalis has been determined by direct Edman degradation of intact HPr and by amino acid sequence analysis of tryptic peptides, V8 proteolyptic peptides, thermolytic peptides, and cyanogen bromide cleavage products. HPr from S. faecalis was found to contain 89 amino acid residues, corresponding to a molecular weight of 9438. The amino acid sequence of HPr from S. faecalis shows extended homology to the primary structure of HPr proteins from other bacteria. Besides the phosphoenolpyruvate-dependent phosphorylation of a histidyl residue in HPr, catalyzed by enzyme I of the bacterial phosphotransferase system,more » HPr was also found to be phosphorylated at a seryl residue in an ATP-dependent protein kinase catalyzed reaction. The site of ATP-dependent phosphorylation in HPr of S faecalis has now been determined. (/sup 32/P)P-Ser-HPr was digested with three different proteases, and in each case, a single labeled peptide was isolated. Following digestion with subtilisin, they obtained a peptide with the sequence -(P)Ser-Ile-Met-. Using chymotrypsin, they isolated a peptide with the sequence -Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-Gly-Val-Met-. The longest labeled peptide was obtained with V8 staphylococcal protease. According to amino acid analysis, this peptide contained 36 out of the 89 amino acid residues of HPr. The following sequence of 12 amino acid residues of the V8 peptide was determined: -Tyr-Lys-Gly-Lys-Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-. Thus, the site of ATP-dependent phosphorylation was determined to be Ser-46 within the primary structure of HPr.« less
A linear programming model for protein inference problem in shotgun proteomics.

PubMed

Huang, Ting; He, Zengyou

2012-11-15

Assembling peptides identified from tandem mass spectra into a list of proteins, referred to as protein inference, is an important issue in shotgun proteomics. The objective of protein inference is to find a subset of proteins that are truly present in the sample. Although many methods have been proposed for protein inference, several issues such as peptide degeneracy still remain unsolved. In this article, we present a linear programming model for protein inference. In this model, we use a transformation of the joint probability that each peptide/protein pair is present in the sample as the variable. Then, both the peptide probability and protein probability can be expressed as a formula in terms of the linear combination of these variables. Based on this simple fact, the protein inference problem is formulated as an optimization problem: minimize the number of proteins with non-zero probabilities under the constraint that the difference between the calculated peptide probability and the peptide probability generated from peptide identification algorithms should be less than some threshold. This model addresses the peptide degeneracy issue by forcing some joint probability variables involving degenerate peptides to be zero in a rigorous manner. The corresponding inference algorithm is named as ProteinLP. We test the performance of ProteinLP on six datasets. Experimental results show that our method is competitive with the state-of-the-art protein inference algorithms. The source code of our algorithm is available at: https://sourceforge.net/projects/prolp/. zyhe@dlut.edu.cn. Supplementary data are available at Bioinformatics Online.
Enzyme-Assisted Discovery of Antioxidant Peptides from Edible Marine Invertebrates: A Review

PubMed Central

Chai, Tsun-Thai; Law, Yew-Chye; Wong, Fai-Chu; Kim, Se-Kwon

2017-01-01

Marine invertebrates, such as oysters, mussels, clams, scallop, jellyfishes, squids, prawns, sea cucumbers and sea squirts, are consumed as foods. These edible marine invertebrates are sources of potent bioactive peptides. The last two decades have seen a surge of interest in the discovery of antioxidant peptides from edible marine invertebrates. Enzymatic hydrolysis is an efficient strategy commonly used for releasing antioxidant peptides from food proteins. A growing number of antioxidant peptide sequences have been identified from the enzymatic hydrolysates of edible marine invertebrates. Antioxidant peptides have potential applications in food, pharmaceuticals and cosmetics. In this review, we first give a brief overview of the current state of progress of antioxidant peptide research, with special attention to marine antioxidant peptides. We then focus on 22 investigations which identified 32 antioxidant peptides from enzymatic hydrolysates of edible marine invertebrates. Strategies adopted by various research groups in the purification and identification of the antioxidant peptides will be summarized. Structural characteristic of the peptide sequences in relation to their antioxidant activities will be reviewed. Potential applications of the peptide sequences and future research prospects will also be discussed. PMID:28212329
Enzyme-Assisted Discovery of Antioxidant Peptides from Edible Marine Invertebrates: A Review.

PubMed

Chai, Tsun-Thai; Law, Yew-Chye; Wong, Fai-Chu; Kim, Se-Kwon

2017-02-16

Marine invertebrates, such as oysters, mussels, clams, scallop, jellyfishes, squids, prawns, sea cucumbers and sea squirts, are consumed as foods. These edible marine invertebrates are sources of potent bioactive peptides. The last two decades have seen a surge of interest in the discovery of antioxidant peptides from edible marine invertebrates. Enzymatic hydrolysis is an efficient strategy commonly used for releasing antioxidant peptides from food proteins. A growing number of antioxidant peptide sequences have been identified from the enzymatic hydrolysates of edible marine invertebrates. Antioxidant peptides have potential applications in food, pharmaceuticals and cosmetics. In this review, we first give a brief overview of the current state of progress of antioxidant peptide research, with special attention to marine antioxidant peptides. We then focus on 22 investigations which identified 32 antioxidant peptides from enzymatic hydrolysates of edible marine invertebrates. Strategies adopted by various research groups in the purification and identification of the antioxidant peptides will be summarized. Structural characteristic of the peptide sequences in relation to their antioxidant activities will be reviewed. Potential applications of the peptide sequences and future research prospects will also be discussed.
Covalent attachment of TAT peptides and thiolated alkyl molecules on GaAs surfaces.

PubMed

Cho, Youngnam; Ivanisevic, Albena

2005-07-07

Four TAT peptide fragments were used to functionalize GaAs surfaces by adsorption from solution. In addition, two well-studied alkylthiols, mercaptohexadecanoic acid (MHA) and 1-octadecanethiol (ODT) were utilized as references to understand the structure of the TAT peptide monolayer on GaAs. The different sequences of TAT peptides were employed in recognition experiments where a synthetic RNA sequence was tested to verify the specific interaction with the TAT peptide. The modified GaAs surfaces were characterized by atomic force microscopy (AFM), X-ray photoelectron spectroscopy (XPS), and Fourier transform infrared reflection absorption spectroscopy (FT-IRRAS). AFM studies were used to compare the surface roughness before and after functionalization. XPS allowed us to characterize the chemical composition of the GaAs surface and conclude that the monolayers composed of different sequences of peptides have similar surface chemistries. Finally, FT-IRRAS experiments enabled us to deduce that the TAT peptide monolayers have a fairly ordered and densely packed alkyl chain structure. The recognition experiments showed preferred interaction of the RNA sequence toward peptides with high arginine content.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Huang, Yingying; Triscari, Joseph M.; Tseng, George C.

Data mining was performed on 28 330 unique peptide tandem mass spectra for which sequences were assigned with high confidence. By dividing the spectra into different sets based on structural features and charge states of the corresponding peptides, chemical interactions involved in promoting specific cleavage patterns in gas-phase peptides were characterized. Pairwise fragmentation maps describing cleavages at all Xxx-Zzz residue combinations for b and y ions reveal that the difference in basicity between Arg and Lys results in different dissociation patterns for singly charged Arg- and Lys-ending tryptic peptides. While one dominant protonation form (proton localized) exists for Arg-ending peptides,more » a heterogeneous population of different protonated forms or more facile interconversion of protonated forms (proton partially mobile) exists for Lys-ending peptides. Cleavage C-terminal to acidic residues dominates spectra from peptides that have a localized proton and cleavage N-terminal to Pro dominates those that have a mobile or partially mobile proton. When Pro is absent from peptides that have a mobile or partially mobile proton, cleavage at each peptide bond becomes much more prominent. Whether the above patterns can be found in b ions, y ions, or both depends on the location of the proton holder(s). Enhanced cleavages C-terminal to branched aliphatic residues (Ile, Val, Leu) are observed in both b and y ions from peptides that have a mobile proton, as well as in y ions from peptides that have a partially mobile proton; enhanced cleavages N-terminal to these residues are observed in b ions from peptides that have a partially mobile proton. Statistical tools have been designed to visualize the fragmentation maps and measure the similarity between them. The pairwise cleavage patterns observed expand our knowledge of peptide gas-phase fragmentation behaviors and should be useful in algorithm development that employs improved models to predict fragment ion intensities.« less
Cyanine-based probe\\tag-peptide pair fluorescence protein imaging and fluorescence protein imaging methods

DOEpatents

Mayer-Cumblidge, M. Uljana; Cao, Haishi

2013-01-15

A molecular probe comprises two arsenic atoms and at least one cyanine based moiety. A method of producing a molecular probe includes providing a molecule having a first formula, treating the molecule with HgOAc, and subsequently transmetallizing with AsCl.sub.3. The As is liganded to ethanedithiol to produce a probe having a second formula. A method of labeling a peptide includes providing a peptide comprising a tag sequence and contacting the peptide with a biarsenical molecular probe. A complex is formed comprising the tag sequence and the molecular probe. A method of studying a peptide includes providing a mixture containing a peptide comprising a peptide tag sequence, adding a biarsenical probe to the mixture, and monitoring the fluorescence of the mixture.
Cyanine-based probe\\tag-peptide pair for fluorescence protein imaging and fluorescence protein imaging methods

DOEpatents

Mayer-Cumblidge, M Uljana [Richland, WA; Cao, Haishi [Richland, WA

2010-08-17

A molecular probe comprises two arsenic atoms and at least one cyanine based moiety. A method of producing a molecular probe includes providing a molecule having a first formula, treating the molecule with HgOAc, and subsequently transmetallizing with AsCl.sub.3. The As is liganded to ethanedithiol to produce a probe having a second formula. A method of labeling a peptide includes providing a peptide comprising a tag sequence and contacting the peptide with a biarsenical molecular probe. A complex is formed comprising the tag sequence and the molecular probe. A method of studying a peptide includes providing a mixture containing a peptide comprising a peptide tag sequence, adding a biarsenical probe to the mixture, and monitoring the fluorescence of the mixture.
Artificial neural network study on organ-targeting peptides

NASA Astrophysics Data System (ADS)

Jung, Eunkyoung; Kim, Junhyoung; Choi, Seung-Hoon; Kim, Minkyoung; Rhee, Hokyoung; Shin, Jae-Min; Choi, Kihang; Kang, Sang-Kee; Lee, Nam Kyung; Choi, Yun-Jaie; Jung, Dong Hyun

2010-01-01

We report a new approach to studying organ targeting of peptides on the basis of peptide sequence information. The positive control data sets consist of organ-targeting peptide sequences identified by the peroral phage-display technique for four organs, and the negative control data are prepared from random sequences. The capacity of our models to make appropriate predictions is validated by statistical indicators including sensitivity, specificity, enrichment curve, and the area under the receiver operating characteristic (ROC) curve (the ROC score). VHSE descriptor produces statistically significant training models and the models with simple neural network architectures show slightly greater predictive power than those with complex ones. The training and test set statistics indicate that our models could discriminate between organ-targeting and random sequences. We anticipate that our models will be applicable to the selection of organ-targeting peptides for generating peptide drugs or peptidomimetics.
A Peptidic Unconjugated GRP78/BiP Ligand Modulates the Unfolded Protein Response and Induces Prostate Cancer Cell Death

PubMed Central

Maddalo, Danilo; Neeb, Antje; Jehle, Katja; Schmitz, Katja; Muhle-Goll, Claudia; Shatkina, Liubov; Walther, Tamara Vanessa; Bruchmann, Anja; Gopal, Srinivasa M.; Wenzel, Wolfgang; Ulrich, Anne S.; Cato, Andrew C. B.

2012-01-01

The molecular chaperone GRP78/BiP is a key regulator of protein folding in the endoplasmic reticulum, and it plays a pivotal role in cancer cell survival and chemoresistance. Inhibition of its function has therefore been an important strategy for inhibiting tumor cell growth in cancer therapy. Previous efforts to achieve this goal have used peptides that bind to GRP78/BiP conjugated to pro-drugs or cell-death-inducing sequences. Here, we describe a peptide that induces prostate tumor cell death without the need of any conjugating sequences. This peptide is a sequence derived from the cochaperone Bag-1. We have shown that this sequence interacts with and inhibits the refolding activity of GRP78/BiP. Furthermore, we have demonstrated that it modulates the unfolded protein response in ER stress resulting in PARP and caspase-4 cleavage. Prostate cancer cells stably expressing this peptide showed reduced growth and increased apoptosis in in vivo xenograft tumor models. Amino acid substitutions that destroyed binding of the Bag-1 peptide to GRP78/BiP or downregulation of the expression of GRP78 compromised the inhibitory effect of this peptide. This sequence therefore represents a candidate lead peptide for anti-tumor therapy. PMID:23049684
ScanRanker: Quality Assessment of Tandem Mass Spectra via Sequence Tagging

PubMed Central

Ma, Ze-Qiang; Chambers, Matthew C.; Ham, Amy-Joan L.; Cheek, Kristin L.; Whitwell, Corbin W.; Aerni, Hans-Rudolf; Schilling, Birgit; Miller, Aaron W.; Caprioli, Richard M.; Tabb, David L.

2011-01-01

In shotgun proteomics, protein identification by tandem mass spectrometry relies on bioinformatics tools. Despite recent improvements in identification algorithms, a significant number of high quality spectra remain unidentified for various reasons. Here we present ScanRanker, an open-source tool that evaluates the quality of tandem mass spectra via sequence tagging with reliable performance in data from different instruments. The superior performance of ScanRanker enables it not only to find unassigned high quality spectra that evade identification through database search, but also to select spectra for de novo sequencing and cross-linking analysis. In addition, we demonstrate that the distribution of ScanRanker scores predicts the richness of identifiable spectra among multiple LC-MS/MS runs in an experiment, and ScanRanker scores assist the process of peptide assignment validation to increase confident spectrum identifications. The source code and executable versions of ScanRanker are available from http://fenchurch.mc.vanderbilt.edu. PMID:21520941
Computer aided identification of a Hevein-like antimicrobial peptide of bell pepper leaves for biotechnological use.

PubMed

Games, Patrícia Dias; daSilva, Elói Quintas Gonçalves; Barbosa, Meire de Oliveira; Almeida-Souza, Hebréia Oliveira; Fontes, Patrícia Pereira; deMagalhães, Marcos Jorge; Pereira, Paulo Roberto Gomes; Prates, Maura Vianna; Franco, Gloria Regina; Faria-Campos, Alessandra; Campos, Sérgio Vale Aguiar; Baracat-Pereira, Maria Cristina

2016-12-15

Antimicrobial peptides from plants present mechanisms of action that are different from those of conventional defense agents. They are under-explored but have a potential as commercial antimicrobials. Bell pepper leaves ('Magali R') are discarded after harvesting the fruit and are sources of bioactive peptides. This work reports the isolation by peptidomics tools, and the identification and partially characterization by computational tools of an antimicrobial peptide from bell pepper leaves, and evidences the usefulness of records and the in silico analysis for the study of plant peptides aiming biotechnological uses. Aqueous extracts from leaves were enriched in peptide by salt fractionation and ultrafiltration. An antimicrobial peptide was isolated by tandem chromatographic procedures. Mass spectrometry, automated peptide sequencing and bioinformatics tools were used alternately for identification and partial characterization of the Hevein-like peptide, named HEV-CANN. The computational tools that assisted to the identification of the peptide included BlastP, PSI-Blast, ClustalOmega, PeptideCutter, and ProtParam; conventional protein databases (DB) as Mascot, Protein-DB, GenBank-DB, RefSeq, Swiss-Prot, and UniProtKB; specific for peptides DB as Amper, APD2, CAMP, LAMPs, and PhytAMP; other tools included in ExPASy for Proteomics; The Bioactive Peptide Databases, and The Pepper Genome Database. The HEV-CANN sequence presented 40 amino acid residues, 4258.8 Da, theoretical pI-value of 8.78, and four disulfide bonds. It was stable, and it has inhibited the growth of phytopathogenic bacteria and a fungus. HEV-CANN presented a chitin-binding domain in their sequence. There was a high identity and a positive alignment of HEV-CANN sequence in various databases, but there was not a complete identity, suggesting that HEV-CANN may be produced by ribosomal synthesis, which is in accordance with its constitutive nature. Computational tools for proteomics and databases are not adjusted for short sequences, which hampered HEV-CANN identification. The adjustment of statistical tests in large databases for proteins is an alternative to promote the significant identification of peptides. The development of specific DB for plant antimicrobial peptides, with information about peptide sequences, functional genomic data, structural motifs and domains of molecules, functional domains, and peptide-biomolecule interactions are valuable and necessary.
Elucidation of Peptide-Directed Palladium Surface Structure for Biologically Tunable Nanocatalysts

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bedford, Nicholas M.; Ramezani-Dakhel, Hadi; Slocik, Joseph M.

Peptide-enabled synthesis of inorganic nanostructures represents an avenue to access catalytic materials with tunable and optimized properties. This is achieved via peptide complexity and programmability that is missing in traditional ligands for catalytic nanomaterials. Unfortunately, there is limited information available to correlate peptide sequence to particle structure and catalytic activity to date. As such, the application of peptide-enabled nanocatalysts remains limited to trial and error approaches. In this paper, a hybrid experimental and computational approach is introduced to systematically elucidate biomolecule-dependent structure/function relationships for peptide-capped Pd nanocatalysts. Synchrotron X-ray techniques were used to uncover substantial particle surface structural disorder, whichmore » was dependent upon the amino acid sequence of the peptide capping ligand. Nanocatalyst configurations were then determined directly from experimental data using reverse Monte Carlo methods and further refined using molecular dynamics simulation, obtaining thermodynamically stable peptide-Pd nanoparticle configurations. Sequence-dependent catalytic property differences for C-C coupling and olefin hydrogenation were then eluddated by identification of the catalytic active sites at the atomic level and quantitative prediction of relative reaction rates. This hybrid methodology provides a clear route to determine peptide-dependent structure/function relationships, enabling the generation of guidelines for catalyst design through rational tailoring of peptide sequences« less
Elucidation of peptide-directed palladium surface structure for biologically tunable nanocatalysts.

PubMed

Bedford, Nicholas M; Ramezani-Dakhel, Hadi; Slocik, Joseph M; Briggs, Beverly D; Ren, Yang; Frenkel, Anatoly I; Petkov, Valeri; Heinz, Hendrik; Naik, Rajesh R; Knecht, Marc R

2015-05-26

Peptide-enabled synthesis of inorganic nanostructures represents an avenue to access catalytic materials with tunable and optimized properties. This is achieved via peptide complexity and programmability that is missing in traditional ligands for catalytic nanomaterials. Unfortunately, there is limited information available to correlate peptide sequence to particle structure and catalytic activity to date. As such, the application of peptide-enabled nanocatalysts remains limited to trial and error approaches. In this paper, a hybrid experimental and computational approach is introduced to systematically elucidate biomolecule-dependent structure/function relationships for peptide-capped Pd nanocatalysts. Synchrotron X-ray techniques were used to uncover substantial particle surface structural disorder, which was dependent upon the amino acid sequence of the peptide capping ligand. Nanocatalyst configurations were then determined directly from experimental data using reverse Monte Carlo methods and further refined using molecular dynamics simulation, obtaining thermodynamically stable peptide-Pd nanoparticle configurations. Sequence-dependent catalytic property differences for C-C coupling and olefin hydrogenation were then elucidated by identification of the catalytic active sites at the atomic level and quantitative prediction of relative reaction rates. This hybrid methodology provides a clear route to determine peptide-dependent structure/function relationships, enabling the generation of guidelines for catalyst design through rational tailoring of peptide sequences.
Short peptides allowing preferential detection of Candida albicans hyphae.

PubMed

Kaba, Hani E J; Pölderl, Antonia; Bilitewski, Ursula

2015-09-01

Whereas the detection of pathogens via recognition of surface structures by specific antibodies and various types of antibody mimics is frequently described, the applicability of short linear peptides as sensor molecules or diagnostic tools is less well-known. We selected peptides which were previously reported to bind to recombinant S. cerevisiae cells, expressing members of the C. albicans Agglutinin-Like-Sequence (ALS) cell wall protein family. We slightly modified amino acid sequences to evaluate peptide sequence properties influencing binding to C. albicans cells. Among the selected peptides, decamer peptides with an "AP"-N-terminus were superior to shorter peptides. The new decamer peptide FBP4 stained viable C. albicans cells more efficiently in their mature hyphal form than in their yeast form. Moreover, it allowed distinction of C. albicans from other related Candida spp. and could thus be the basis for the development of a useful tool for the diagnosis of invasive candidiasis.
Dynamic peptide libraries for the discovery of supramolecular nanomaterials

NASA Astrophysics Data System (ADS)

Pappas, Charalampos G.; Shafi, Ramim; Sasselli, Ivan R.; Siccardi, Henry; Wang, Tong; Narang, Vishal; Abzalimov, Rinat; Wijerathne, Nadeesha; Ulijn, Rein V.

2016-11-01

Sequence-specific polymers, such as oligonucleotides and peptides, can be used as building blocks for functional supramolecular nanomaterials. The design and selection of suitable self-assembling sequences is, however, challenging because of the vast combinatorial space available. Here we report a methodology that allows the peptide sequence space to be searched for self-assembling structures. In this approach, unprotected homo- and heterodipeptides (including aromatic, aliphatic, polar and charged amino acids) are subjected to continuous enzymatic condensation, hydrolysis and sequence exchange to create a dynamic combinatorial peptide library. The free-energy change associated with the assembly process itself gives rise to selective amplification of self-assembling candidates. By changing the environmental conditions during the selection process, different sequences and consequent nanoscale morphologies are selected.
Dynamic peptide libraries for the discovery of supramolecular nanomaterials.

PubMed

Pappas, Charalampos G; Shafi, Ramim; Sasselli, Ivan R; Siccardi, Henry; Wang, Tong; Narang, Vishal; Abzalimov, Rinat; Wijerathne, Nadeesha; Ulijn, Rein V

2016-11-01

Sequence-specific polymers, such as oligonucleotides and peptides, can be used as building blocks for functional supramolecular nanomaterials. The design and selection of suitable self-assembling sequences is, however, challenging because of the vast combinatorial space available. Here we report a methodology that allows the peptide sequence space to be searched for self-assembling structures. In this approach, unprotected homo- and heterodipeptides (including aromatic, aliphatic, polar and charged amino acids) are subjected to continuous enzymatic condensation, hydrolysis and sequence exchange to create a dynamic combinatorial peptide library. The free-energy change associated with the assembly process itself gives rise to selective amplification of self-assembling candidates. By changing the environmental conditions during the selection process, different sequences and consequent nanoscale morphologies are selected.
Exploring the role of hydration and confinement in the aggregation of amyloidogenic peptides Aβ16-22 and Sup357-13 in AOT reverse micelles

NASA Astrophysics Data System (ADS)

Martinez, Anna Victoria; Małolepsza, Edyta; Rivera, Eva; Lu, Qing; Straub, John E.

2014-12-01

Knowledge of how intermolecular interactions of amyloid-forming proteins cause protein aggregation and how those interactions are affected by sequence and solution conditions is essential to our understanding of the onset of many degenerative diseases. Of particular interest is the aggregation of the amyloid-β (Aβ) peptide, linked to Alzheimer's disease, and the aggregation of the Sup35 yeast prion peptide, which resembles the mammalian prion protein linked to spongiform encephalopathies. To facilitate the study of these important peptides, experimentalists have identified small peptide congeners of the full-length proteins that exhibit amyloidogenic behavior, including the KLVFFAE sub-sequence, Aβ16-22, and the GNNQQNY subsequence, Sup357-13. In this study, molecular dynamics simulations were used to examine these peptide fragments encapsulated in reverse micelles (RMs) in order to identify the fundamental principles that govern how sequence and solution environment influence peptide aggregation. Aβ16-22 and Sup357-13 are observed to organize into anti-parallel and parallel β-sheet arrangements. Confinement in the sodium bis(2-ethylhexyl) sulfosuccinate (AOT) reverse micelles is shown to stabilize extended peptide conformations and enhance peptide aggregation. Substantial fluctuations in the reverse micelle shape are observed, in agreement with earlier studies. Shape fluctuations are found to facilitate peptide solvation through interactions between the peptide and AOT surfactant, including direct interaction between non-polar peptide residues and the aliphatic surfactant tails. Computed amide I IR spectra are compared with experimental spectra and found to reflect changes in the peptide structures induced by confinement in the RM environment. Furthermore, examination of the rotational anisotropy decay of water in the RM demonstrates that the water dynamics are sensitive to the presence of peptide as well as the peptide sequence. Overall, our results demonstrate that the RM is a complex confining environment where substantial direct interaction between the surfactant and peptides plays an important role in determining the resulting ensemble of peptide conformations. By extension the results suggest that similarly complex sequence-dependent interactions may determine conformational ensembles of amyloid-forming peptides in a cellular environment.
Using Power Spectrum Analysis to Evaluate 18O-Water Labeling Data Acquired from Low Resolution Mass Spectrometers

PubMed Central

Sadygov, Rovshan G.; Zhao, Yingxin; Haidacher, Sigmund J.; Starkey, Jonathan M.; Tilton, Ronald G.; Denner, Larry

2010-01-01

We describe a method for ratio estimations in 18O-water labeling experiments acquired from low resolution isotopically resolved data. The method is implemented in a software package specifically designed for use in experiments making use of zoom-scan mode data acquisition. Zoom-scan mode data allows commonly used ion trap mass spectrometers to attain isotopic resolution, which make them amenable to use in labeling schemes such as 18O-water labeling, but algorithms and software developed for high resolution instruments may not be appropriate for the lower resolution data acquired in zoom-scan mode. The use of power spectrum analysis is proposed as a general approach which may be uniquely suited to these data types. The software implementation uses power spectrum to remove high-frequency noise, and band-filter contributions from co-eluting species of differing charge states. From the elemental composition of a peptide sequence we generate theoretical isotope envelopes of heavy-light peptide pairs in five different ratios; these theoretical envelopes are correlated with the filtered experimental zoom scans. To automate peptide quantification in high-throughput experiments, we have implemented our approach in a computer program, MassXplorer. We demonstrate the application of MassXplorer to two model mixtures of known proteins, and to a complex mixture of mouse kidney cortical extract. Comparison with another algorithm for ratio estimations demonstrates the increased precision and automation of MassXplorer. PMID:20568695
A fossil protein chimera; difficulties in discriminating dinosaur peptide sequences from modern cross-contamination.

PubMed

Buckley, Michael; Warwood, Stacey; van Dongen, Bart; Kitchener, Andrew C; Manning, Phillip L

2017-05-31

A decade ago, reports that organic-rich soft tissue survived from dinosaur fossils were apparently supported by proteomics-derived sequence information of exceptionally well-preserved bone. This initial claim to the sequencing of endogenous collagen peptides from an approximately 68 Myr Tyrannosaurus rex fossil was highly controversial, largely on the grounds of potential contamination from either bacterial biofilms or from laboratory practice. In a subsequent study, collagen peptide sequences from an approximately 78 Myr Brachylophosaurus canadensis fossil were reported that have remained largely unchallenged. However, the endogeneity of these sequences relies heavily on a single peptide sequence, apparently unique to both dinosaurs. Given the potential for cross-contamination from modern bone analysed by the same team, here we extract collagen from bone samples of three individuals of ostrich, Struthio camelus The resulting LC-MS/MS data were found to match all of the proposed sequences for both the original Tyrannosaurus and Brachylophosaurus studies. Regardless of the true nature of the dinosaur peptides, our finding highlights the difficulty of differentiating such sequences with confidence. Our results not only imply that cross-contamination cannot be ruled out, but that appropriate measures to test for endogeneity should be further evaluated. © 2017 The Authors.
A fossil protein chimera; difficulties in discriminating dinosaur peptide sequences from modern cross-contamination

PubMed Central

Warwood, Stacey; van Dongen, Bart; Kitchener, Andrew C.; Manning, Phillip L.

2017-01-01

A decade ago, reports that organic-rich soft tissue survived from dinosaur fossils were apparently supported by proteomics-derived sequence information of exceptionally well-preserved bone. This initial claim to the sequencing of endogenous collagen peptides from an approximately 68 Myr Tyrannosaurus rex fossil was highly controversial, largely on the grounds of potential contamination from either bacterial biofilms or from laboratory practice. In a subsequent study, collagen peptide sequences from an approximately 78 Myr Brachylophosaurus canadensis fossil were reported that have remained largely unchallenged. However, the endogeneity of these sequences relies heavily on a single peptide sequence, apparently unique to both dinosaurs. Given the potential for cross-contamination from modern bone analysed by the same team, here we extract collagen from bone samples of three individuals of ostrich, Struthio camelus. The resulting LC–MS/MS data were found to match all of the proposed sequences for both the original Tyrannosaurus and Brachylophosaurus studies. Regardless of the true nature of the dinosaur peptides, our finding highlights the difficulty of differentiating such sequences with confidence. Our results not only imply that cross-contamination cannot be ruled out, but that appropriate measures to test for endogeneity should be further evaluated. PMID:28566488
Bimodal imprint chips for peptide screening: integration of high-throughput sequencing by MS and affinity analyses by surface plasmon resonance imaging.

PubMed

Wang, Weizhi; Li, Menglin; Wei, Zewen; Wang, Zihua; Bu, Xiangli; Lai, Wenjia; Yang, Shu; Gong, He; Zheng, Hui; Wang, Yuqiao; Liu, Ying; Li, Qin; Fang, Qiaojun; Hu, Zhiyuan

2014-04-15

Peptide probes and drugs have widespread applications in disease diagnostics and therapy. The demand for peptides ligands with high affinity and high specificity toward various targets has surged in the biomedical field in recent years. The traditional peptide screening procedure involves selection, sequencing, and characterization steps, and each step is manual and tedious. Herein, we developed a bimodal imprint microarray system to embrace the whole peptide screening process. Silver-sputtered silicon chip fabricated with microwell array can trap and pattern the candidate peptide beads in a one-well-one-bead manner. Peptides on beads were photocleaved in situ. A portion of the peptide in each well was transferred to a gold-coated chip to print the peptide array for high-throughput affinity analyses by surface plasmon resonance imaging (SPRi), and the peptide left in the silver-sputtered chip was ready for in situ single bead sequencing by matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF-MS). Using the bimodal imprint chip system, affinity peptides toward AHA were efficiently screened out from the 7 × 10(4) peptide library. The method provides a solution for high efficiency peptide screening.
Learning a peptide-protein binding affinity predictor with kernel ridge regression

PubMed Central

2013-01-01

Background The cellular function of a vast majority of proteins is performed through physical interactions with other biomolecules, which, most of the time, are other proteins. Peptides represent templates of choice for mimicking a secondary structure in order to modulate protein-protein interaction. They are thus an interesting class of therapeutics since they also display strong activity, high selectivity, low toxicity and few drug-drug interactions. Furthermore, predicting peptides that would bind to a specific MHC alleles would be of tremendous benefit to improve vaccine based therapy and possibly generate antibodies with greater affinity. Modern computational methods have the potential to accelerate and lower the cost of drug and vaccine discovery by selecting potential compounds for testing in silico prior to biological validation. Results We propose a specialized string kernel for small bio-molecules, peptides and pseudo-sequences of binding interfaces. The kernel incorporates physico-chemical properties of amino acids and elegantly generalizes eight kernels, comprised of the Oligo, the Weighted Degree, the Blended Spectrum, and the Radial Basis Function. We provide a low complexity dynamic programming algorithm for the exact computation of the kernel and a linear time algorithm for it’s approximation. Combined with kernel ridge regression and SupCK, a novel binding pocket kernel, the proposed kernel yields biologically relevant and good prediction accuracy on the PepX database. For the first time, a machine learning predictor is capable of predicting the binding affinity of any peptide to any protein with reasonable accuracy. The method was also applied to both single-target and pan-specific Major Histocompatibility Complex class II benchmark datasets and three Quantitative Structure Affinity Model benchmark datasets. Conclusion On all benchmarks, our method significantly (p-value ≤ 0.057) outperforms the current state-of-the-art methods at predicting peptide-protein binding affinities. The proposed approach is flexible and can be applied to predict any quantitative biological activity. Moreover, generating reliable peptide-protein binding affinities will also improve system biology modelling of interaction pathways. Lastly, the method should be of value to a large segment of the research community with the potential to accelerate the discovery of peptide-based drugs and facilitate vaccine development. The proposed kernel is freely available at http://graal.ift.ulaval.ca/downloads/gs-kernel/. PMID:23497081
A novel peptide from the ACEI/BPP-CNP precursor in the venom of Crotalus durissus collilineatus.

PubMed

Higuchi, Shigesada; Murayama, Nobuhiro; Saguchi, Ken-ichi; Ohi, Hiroaki; Fujita, Yoshiaki; da Silva, Nelson Jorge; de Siqueira, Rodrigo José Bezerra; Lahlou, Saad; Aird, Steven D

2006-10-01

In crotaline venoms, angiotensin-converting enzyme inhibitors [ACEIs, also known as bradykinin potentiating peptides (BPPs)], are products of a gene coding for an ACEI/BPP-C-type natriuretic peptide (CNP) precursor. In the genes from Bothrops jararaca and Gloydius blomhoffii, ACEI/BPP sequences are repeated. Sequencing of a cDNA clone from venom glands of Crotalus durissus collilineatus showed that two ACEIs/BPPs are located together at the N-terminus, but without repeats. An additional sequence for CNP was unexpectedly found at the C-terminus. Homologous genes for the ACEI/BPP-CNP precursor suggest that most crotaline venoms contain both ACEIs/BPPs and CNP. The sequence of ACEIs/BPPs is separated from the CNP sequence by a long spacer sequence. Previously, there was no evidence that this spacer actually coded any expressed peptides. Aird and Kaiser (1986, unpublished) previously isolated and sequenced a peptide of 11 residues (TPPAGPDVGPR) from Crotalus viridis viridis venom. In the present study, analysis of the cDNA clone from C. d. collilineatus revealed a nearly identical sequence in the ACEI/BPP-CNP spacer. Fractionation of the crude venom by reverse phase HPLC (C(18)), and analysis of the fractions by mass spectrometry (MS) indicated a component of 1020.5 Da. Amino acid sequencing by MS/MS confirmed that C. d. collilineatus venom contains the peptide TPPAGPDGGPR. Its high proline content and paired proline residues are typical of venom hypotensive peptides, although it lacks the usual N-terminal pyroglutamate. It has no demonstrable hypotensive activity when injected intravenously in rats; however, its occurrence in the venoms of dissimilar species suggests that its presence is not accidental. Evidence suggests that these novel toxins probably activate anaphylatoxin C3a receptors.
An integrative strategy to identify the entire protein coding potential of prokaryotic genomes by proteogenomics.

PubMed

Omasits, Ulrich; Varadarajan, Adithi R; Schmid, Michael; Goetze, Sandra; Melidis, Damianos; Bourqui, Marc; Nikolayeva, Olga; Québatte, Maxime; Patrignani, Andrea; Dehio, Christoph; Frey, Juerg E; Robinson, Mark D; Wollscheid, Bernd; Ahrens, Christian H

2017-12-01

Accurate annotation of all protein-coding sequences (CDSs) is an essential prerequisite to fully exploit the rapidly growing repertoire of completely sequenced prokaryotic genomes. However, large discrepancies among the number of CDSs annotated by different resources, missed functional short open reading frames (sORFs), and overprediction of spurious ORFs represent serious limitations. Our strategy toward accurate and complete genome annotation consolidates CDSs from multiple reference annotation resources, ab initio gene prediction algorithms and in silico ORFs (a modified six-frame translation considering alternative start codons) in an integrated proteogenomics database (iPtgxDB) that covers the entire protein-coding potential of a prokaryotic genome. By extending the PeptideClassifier concept of unambiguous peptides for prokaryotes, close to 95% of the identifiable peptides imply one distinct protein, largely simplifying downstream analysis. Searching a comprehensive Bartonella henselae proteomics data set against such an iPtgxDB allowed us to unambiguously identify novel ORFs uniquely predicted by each resource, including lipoproteins, differentially expressed and membrane-localized proteins, novel start sites and wrongly annotated pseudogenes. Most novelties were confirmed by targeted, parallel reaction monitoring mass spectrometry, including unique ORFs and single amino acid variations (SAAVs) identified in a re-sequenced laboratory strain that are not present in its reference genome. We demonstrate the general applicability of our strategy for genomes with varying GC content and distinct taxonomic origin. We release iPtgxDBs for B. henselae , Bradyrhizobium diazoefficiens and Escherichia coli and the software to generate both proteogenomics search databases and integrated annotation files that can be viewed in a genome browser for any prokaryote. © 2017 Omasits et al.; Published by Cold Spring Harbor Laboratory Press.

Cloning of precursors for two MIH/VIH-related peptides in the prawn, Macrobrachium rosenbergii.

PubMed

Yang, W J; Rao, K R

2001-11-30

Two cDNA clones (634 and 1366 bp) encoding MIH/VIH (molt-inhibiting hormone/vitellogenesis-inhibiting hormone)-related peptides were isolated and sequenced from a Macrobrachium rosenbergii eyestalk ganglia cDNA library. The clones contain a 360 and 339 bp open-reading frame, and their conceptually translated peptides consist of a 41 and 34 amino acid signal peptide, respectively, and a 78 amino acid residue mature peptide hormone. The amino acid sequences of the peptides exhibit higher identities with other known MIHs and VIH (44-69%) than with CHHs (28-33%). This is the first report describing the cloning and sequencing of two MIH/VIH-related peptides in a single crustacean species. Transcription of these mRNAs was detected in the eyestalk ganglia, but not in the thoracic ganglia, hepatopancreas, gut, gill, heart, or muscle.
Expansion for the Brachylophosaurus canadensis Collagen I Sequence and Additional Evidence of the Preservation of Cretaceous Protein.

PubMed

Schroeter, Elena R; DeHart, Caroline J; Cleland, Timothy P; Zheng, Wenxia; Thomas, Paul M; Kelleher, Neil L; Bern, Marshall; Schweitzer, Mary H

2017-02-03

Sequence data from biomolecules such as DNA and proteins, which provide critical information for evolutionary studies, have been assumed to be forever outside the reach of dinosaur paleontology. Proteins, which are predicted to have greater longevity than DNA, have been recovered from two nonavian dinosaurs, but these results remain controversial. For proteomic data derived from extinct Mesozoic organisms to reach their greatest potential for investigating questions of phylogeny and paleobiology, it must be shown that peptide sequences can be reliably and reproducibly obtained from fossils and that fragmentary sequences for ancient proteins can be increasingly expanded. To test the hypothesis that peptides can be repeatedly detected and validated from fossil tissues many millions of years old, we applied updated extraction methodology, high-resolution mass spectrometry, and bioinformatics analyses on a Brachylophosaurus canadensis specimen (MOR 2598) from which collagen I peptides were recovered in 2009. We recovered eight peptide sequences of collagen I: two identical to peptides recovered in 2009 and six new peptides. Phylogenetic analyses place the recovered sequences within basal archosauria. When only the new sequences are considered, B. canadensis is grouped more closely to crocodylians, but when all sequences (current and those reported in 2009) are analyzed, B. canadensis is placed more closely to basal birds. The data robustly support the hypothesis of an endogenous origin for these peptides, confirm the idea that peptides can survive in specimens tens of millions of years old, and bolster the validity of the 2009 study. Furthermore, the new data expand the coverage of B. canadensis collagen I (a 33.6% increase in collagen I alpha 1 and 116.7% in alpha 2). Finally, this study demonstrates the importance of reexamining previously studied specimens with updated methods and instrumentation, as we obtained roughly the same amount of sequence data as the previous study with substantially less sample material. Data are available via ProteomeXchange with identifier PXD005087.
Pathogen-Specific Epitopes as Epidemiological Tools for Defining the Magnitude of Mycobacterium leprae Transmission in Areas Endemic for Leprosy

PubMed Central

Spencer, John S.; Hacker, Mariana A. V. B.; Costa, Luciana S.; Carvalho, Fernanda M.; Geluk, Annemieke; van der Ploeg-van Schip, Jolien J.; Pontes, Maria A. A.; Gonçalves, Heitor S.; de Morais, Janvier P.; Bandeira, Tereza J. P. G.; Pessolani, Maria C. V.; Brennan, Patrick J.; Pereira, Geraldo M. B.

2012-01-01

During recent years, comparative genomic analysis has allowed the identification of Mycobacterium leprae-specific genes with potential application for the diagnosis of leprosy. In a previous study, 58 synthetic peptides derived from these sequences were tested for their ability to induce production of IFN-γ in PBMC from endemic controls (EC) with unknown exposure to M. leprae, household contacts of leprosy patients and patients, indicating the potential of these synthetic peptides for the diagnosis of sub- or preclinical forms of leprosy. In the present study, the patterns of IFN-γ release of the individuals exposed or non-exposed to M. leprae were compared using an Artificial Neural Network algorithm, and the most promising M. leprae peptides for the identification of exposed people were selected. This subset of M. leprae-specific peptides allowed the differentiation of groups of individuals from sites hyperendemic for leprosy versus those from areas with lower level detection rates. A progressive reduction in the IFN-γ levels in response to the peptides was seen when contacts of multibacillary (MB) patients were compared to other less exposed groups, suggesting a down modulation of IFN-γ production with an increase in bacillary load or exposure to M. leprae. The data generated indicate that an IFN-γ assay based on these peptides applied individually or as a pool can be used as a new tool for predicting the magnitude of M. leprae transmission in a given population. PMID:22545169
DOE Office of Scientific and Technical Information (OSTI.GOV)

Ruggles, Kelly V.; Tang, Zuojian; Wang, Xuya

Improvements in mass spectrometry (MS)-based peptide sequencing provide a new opportunity to determine whether polymorphisms, mutations and splice variants identified in cancer cells are translated. Herein we therefore describe a proteogenomic data integration tool (QUILTS) and illustrate its application to whole genome, transcriptome and global MS peptide sequence datasets generated from a pair of luminal and basal-like breast cancer patient derived xenografts (PDX). The sensitivity of proteogenomic analysis for singe nucleotide variant (SNV) expression and novel splice junction (NSJ) detection was probed using multiple MS/MS process replicates. Despite over thirty sample replicates, only about 10% of all SNV (somatic andmore » germline) were detected by both DNA and RNA sequencing were observed as peptides. An even smaller proportion of peptides corresponding to NSJ observed by RNA sequencing were detected (<0.1%). Peptides mapping to DNA-detected SNV without a detectable mRNA transcript were also observed demonstrating the transcriptome coverage was also incomplete (~80%). In contrast to germ-line variants, somatic variants were less likely to be detected at the peptide level in the basal-like tumor than the luminal tumor raising the possibility of differential translation or protein degradation effects. In conclusion, the QUILTS program integrates DNA, RNA and peptide sequencing to assess the degree to which somatic mutations are translated and therefore biologically active. By identifying gaps in sequence coverage QUILTS benchmarks current technology and assesses progress towards whole cancer proteome and transcriptome analysis.« less
A Probabilistic Framework for Peptide and Protein Quantification from Data-Dependent and Data-Independent LC-MS Proteomics Experiments

PubMed Central

Richardson, Keith; Denny, Richard; Hughes, Chris; Skilling, John; Sikora, Jacek; Dadlez, Michał; Manteca, Angel; Jung, Hye Ryung; Jensen, Ole Nørregaard; Redeker, Virginie; Melki, Ronald; Langridge, James I.; Vissers, Johannes P.C.

2013-01-01

A probability-based quantification framework is presented for the calculation of relative peptide and protein abundance in label-free and label-dependent LC-MS proteomics data. The results are accompanied by credible intervals and regulation probabilities. The algorithm takes into account data uncertainties via Poisson statistics modified by a noise contribution that is determined automatically during an initial normalization stage. Protein quantification relies on assignments of component peptides to the acquired data. These assignments are generally of variable reliability and may not be present across all of the experiments comprising an analysis. It is also possible for a peptide to be identified to more than one protein in a given mixture. For these reasons the algorithm accepts a prior probability of peptide assignment for each intensity measurement. The model is constructed in such a way that outliers of any type can be automatically reweighted. Two discrete normalization methods can be employed. The first method is based on a user-defined subset of peptides, while the second method relies on the presence of a dominant background of endogenous peptides for which the concentration is assumed to be unaffected. Normalization is performed using the same computational and statistical procedures employed by the main quantification algorithm. The performance of the algorithm will be illustrated on example data sets, and its utility demonstrated for typical proteomics applications. The quantification algorithm supports relative protein quantification based on precursor and product ion intensities acquired by means of data-dependent methods, originating from all common isotopically-labeled approaches, as well as label-free ion intensity-based data-independent methods. PMID:22871168
Single Molecule Spectroscopy of Amino Acids and Peptides by Recognition Tunneling

PubMed Central

Zhao, Yanan; Ashcroft, Brian; Zhang, Peiming; Liu, Hao; Sen, Suman; Song, Weisi; Im, JongOne; Gyarfas, Brett; Manna, Saikat; Biswas, Sovan; Borges, Chad; Lindsay, Stuart

2014-01-01

The human proteome has millions of protein variants due to alternative RNA splicing and post-translational modifications, and variants that are related to diseases are frequently present in minute concentrations. For DNA and RNA, low concentrations can be amplified using the polymerase chain reaction, but there is no such reaction for proteins. Therefore, the development of single molecule protein sequencing is a critical step in the search for protein biomarkers. Here we show that single amino acids can be identified by trapping the molecules between two electrodes that are coated with a layer of recognition molecules and measuring the electron tunneling current across the junction. A given molecule can bind in more than one way in the junction, and we therefore use a machine-learning algorithm to distinguish between the sets of electronic ‘fingerprints’ associated with each binding motif. With this recognition tunneling technique, we are able to identify D, L enantiomers, a methylated amino acid, isobaric isomers, and short peptides. The results suggest that direct electronic sequencing of single proteins could be possible by sequentially measuring the products of processive exopeptidase digestion, or by using a molecular motor to pull proteins through a tunnel junction integrated with a nanopore. PMID:24705512
Bacterial expression of self-assembling peptide hydrogelators

NASA Astrophysics Data System (ADS)

Sonmez, Cem

For tissue regeneration and drug delivery applications, various architectures are explored to serve as biomaterial tools. Via de novo design, functional peptide hydrogel materials have been developed as scaffolds for biomedical applications. The objective of this study is to investigate bacterial expression as an alternative method to chemical synthesis for the recombinant production of self-assembling peptides that can form rigid hydrogels under physiological conditions. The Schneider and Pochan Labs have designed and characterized a 20 amino acid beta-hairpin forming amphiphilic peptide containing a D-residue in its turn region (MAX1). As a result, this peptide must be prepared chemically. Peptide engineering, using the sequence of MAX1 as a template, afforded a small family of peptides for expression (EX peptides) that have different turn sequences consisting of natural amino acids and amenable to bacterial expression. Each sequence was initially chemically synthesized to quickly assess the material properties of its corresponding gel. One model peptide EX1, was chosen to start the bacterial expression studies. DNA constructs facilitating the expression of EX1 were designed in such that the peptide could be expressed with different fusion partners and subsequently cleaved by enzymatic or chemical means to afford the free peptide. Optimization studies were performed to increase the yield of pure peptide that ultimately allowed 50 mg of pure peptide to be harvested from one liter of culture, providing an alternate means to produce this hydrogel-forming peptide. Recombinant production of other self-assembling hairpins with different turn sequences was also successful using this optimized protocol. The studies demonstrate that new beta-hairpin self-assembling peptides that are amenable to bacterial production and form rigid hydrogels at physiological conditions can be designed and produced by fermentation in good yield at significantly reduced cost when compared to chemical synthesis.
Novel Inhibitor Cystine Knot Peptides from Momordica charantia

PubMed Central

Clark, Richard J.; Tang, Jun; Zeng, Guang-Zhi; Franco, Octavio L.; Cantacessi, Cinzia; Craik, David J.; Daly, Norelle L.; Tan, Ning-Hua

2013-01-01

Two new peptides, MCh-1 and MCh-2, along with three known trypsin inhibitors (MCTI-I, MCTI-II and MCTI-III), were isolated from the seeds of the tropical vine Momordica charantia. The sequences of the peptides were determined using mass spectrometry and NMR spectroscopy. Using a strategy involving partial reduction and stepwise alkylation of the peptides, followed by enzymatic digestion and tandem mass spectrometry sequencing, the disulfide connectivity of MCh-1 was elucidated to be CysI-CysIV, CysII-CysV and CysIII-CysVI. The three-dimensional structures of MCh-1 and MCh-2 were determined using NMR spectroscopy and found to contain the inhibitor cystine knot (ICK) motif. The sequences of the novel peptides differ significantly from peptides previously isolated from this plant. Therefore, this study expands the known peptide diversity in M. charantia and the range of sequences that can be accommodated by the ICK motif. Furthermore, we show that a stable two-disulfide intermediate is involved in the oxidative folding of MCh-1. This disulfide intermediate is structurally homologous to the proposed ancestral fold of ICK peptides, and provides a possible pathway for the evolution of this structural motif, which is highly prevalent in nature. PMID:24116036
Hydrophobic and electrostatic interactions between cell penetrating peptides and plasmid DNA are important for stable non-covalent complexation and intracellular delivery.

PubMed

Upadhya, Archana; Sangave, Preeti C

2016-10-01

Cell penetrating peptides are useful tools for intracellular delivery of nucleic acids. Delivery of plasmid DNA, a large nucleic acid, poses a challenge for peptide mediated transport. The paper investigates and compares efficacy of five novel peptide designs for complexation of plasmid DNA and subsequent delivery into cells. The peptides were designed to contain reported DNA condensing agents and basic cell penetrating sequences, octa-arginine (R 8 ) and CHK 6 HC coupled to cell penetration accelerating peptides such as Bax inhibitory mutant peptide (KLPVM) and a peptide derived from the Kaposi fibroblast growth factor (kFGF) membrane translocating sequence. A tryptophan rich peptide, an analogue of Pep-3, flanked with CH 3 on either ends was also a part of the study. The peptides were analysed for plasmid DNA complexation, protection of peptide-plasmid DNA complexes against DNase I, serum components and competitive ligands by simple agarose gel electrophoresis techniques. Hemolysis of rat red blood corpuscles (RBCs) in the presence of the peptides was used as a measure of peptide cytotoxicity. Plasmid DNA delivery through the designed peptides was evaluated in two cell lines, human cervical cancer cell line (HeLa) and (NIH/3 T3) mouse embryonic fibroblasts via expression of the secreted alkaline phosphatase (SEAP) reporter gene. The importance of hydrophobic sequences in addition to cationic sequences in peptides for non-covalent plasmid DNA complexation and delivery has been illustrated. An alternative to the employment of fatty acid moieties for enhanced gene transfer has been proposed. Comparison of peptides for plasmid DNA complexation and delivery of peptide-plasmid DNA complexes to cells estimated by expression of a reporter gene, SEAP. Copyright © 2016 European Peptide Society and John Wiley & Sons, Ltd. Copyright © 2016 European Peptide Society and John Wiley & Sons, Ltd.
High Specific Selectivity and Membrane-Active Mechanism of Synthetic Cationic Hybrid Antimicrobial Peptides Based on the Peptide FV7

PubMed Central

Tan, Tingting; Wu, Di; Li, Weizhong; Zheng, Xin; Li, Weifen; Shan, Anshan

2017-01-01

Hybrid peptides integrating different functional domains of peptides have many advantages, such as remarkable antimicrobial activity, lower hemolysis and ideal cell selectivity, compared with natural antimicrobial peptides. FV7 (FRIRVRV-NH2), a consensus amphiphilic sequence was identified as being analogous to host defense peptides. In this study, we designed a series of hybrid peptides FV7-LL-37 (17–29) (FV-LL), FV7-magainin 2 (9–21) (FV-MA) and FV7-cecropin A (1–8) (FV-CE) by combining the FV7 sequence with the small functional sequences LL-37 (17–29) (LL), magainin 2 (9–21) (MA) and cecropin A (1–8) (CE) which all come from well-described natural peptides. The results demonstrated that the synthetic hybrid peptides, in particular FV-LL, had potent antibacterial activities over a wide range of Gram-negative and Gram-positive bacteria with lower hemolytic activity than other peptides. Furthermore, fluorescent spectroscopy indicated that the hybrid peptide FV-LL exhibited marked membrane destruction by inducing outer and inner bacterial membrane permeabilization, while scanning electron microscopy (SEM) and transmission electron microscopy (TEM) demonstrated that FV-LL damaged membrane integrity by disrupting the bacterial membrane. Inhibiting biofilm formation assays also showed that FV-LL had similar anti-biofilm activity compared with the functional peptide sequence FV7. Synthetic cationic hybrid peptides based on FV7 could provide new models for combining different functional domains and demonstrate effective avenues to screen for novel antimicrobial agents. PMID:28178190
Peptide library synthesis on spectrally encoded beads for multiplexed protein/peptide bioassays

NASA Astrophysics Data System (ADS)

Nguyen, Huy Q.; Brower, Kara; Harink, Björn; Baxter, Brian; Thorn, Kurt S.; Fordyce, Polly M.

2017-02-01

Protein-peptide interactions are essential for cellular responses. Despite their importance, these interactions remain largely uncharacterized due to experimental challenges associated with their measurement. Current techniques (e.g. surface plasmon resonance, fluorescence polarization, and isothermal calorimetry) either require large amounts of purified material or direct fluorescent labeling, making high-throughput measurements laborious and expensive. In this report, we present a new technology for measuring antibody-peptide interactions in vitro that leverages spectrally encoded beads for biological multiplexing. Specific peptide sequences are synthesized directly on encoded beads with a 1:1 relationship between peptide sequence and embedded code, thereby making it possible to track many peptide sequences throughout the course of an experiment within a single small volume. We demonstrate the potential of these bead-bound peptide libraries by: (1) creating a set of 46 peptides composed of 3 commonly used epitope tags (myc, FLAG, and HA) and single amino-acid scanning mutants; (2) incubating with a mixture of fluorescently-labeled antimyc, anti-FLAG, and anti-HA antibodies; and (3) imaging these bead-bound libraries to simultaneously identify the embedded spectral code (and thus the sequence of the associated peptide) and quantify the amount of each antibody bound. To our knowledge, these data demonstrate the first customized peptide library synthesized directly on spectrally encoded beads. While the implementation of the technology provided here is a high-affinity antibody/protein interaction with a small code space, we believe this platform can be broadly applicable to any range of peptide screening applications, with the capability to multiplex into libraries of hundreds to thousands of peptides in a single assay.
AVP-IC50 Pred: Multiple machine learning techniques-based prediction of peptide antiviral activity in terms of half maximal inhibitory concentration (IC50).

PubMed

Qureshi, Abid; Tandon, Himani; Kumar, Manoj

2015-11-01

Peptide-based antiviral therapeutics has gradually paved their way into mainstream drug discovery research. Experimental determination of peptides' antiviral activity as expressed by their IC50 values involves a lot of effort. Therefore, we have developed "AVP-IC50 Pred," a regression-based algorithm to predict the antiviral activity in terms of IC50 values (μM). A total of 759 non-redundant peptides from AVPdb and HIPdb were divided into a training/test set having 683 peptides (T(683)) and a validation set with 76 independent peptides (V(76)) for evaluation. We utilized important peptide sequence features like amino-acid compositions, binary profile of N8-C8 residues, physicochemical properties and their hybrids. Four different machine learning techniques (MLTs) namely Support vector machine, Random Forest, Instance-based classifier, and K-Star were employed. During 10-fold cross validation, we achieved maximum Pearson correlation coefficients (PCCs) of 0.66, 0.64, 0.56, 0.55, respectively, for the above MLTs using the best combination of feature sets. All the predictive models also performed well on the independent validation dataset and achieved maximum PCCs of 0.74, 0.68, 0.59, 0.57, respectively, on the best combination of feature sets. The AVP-IC50 Pred web server is anticipated to assist the researchers working on antiviral therapeutics by enabling them to computationally screen many compounds and focus experimental validation on the most promising set of peptides, thus reducing cost and time efforts. The server is available at http://crdd.osdd.net/servers/ic50avp. © 2015 Wiley Periodicals, Inc.
Radiolabeled Escherichia coli heat-stable enterotoxin analogs for in vivo imaging of colorectal cancer

NASA Astrophysics Data System (ADS)

Giblin, M. F.; Sieckman, G. L.; Owen, N. K.; Hoffman, T. J.; Forte, L. R.; Volkert, W. A.

2005-12-01

The human Escherichia coli heat-stable enterotoxin (STh, amino acid sequence N1SSNYCCELCCNPACTGCY19) binds specifically to the guanylate cyclase C (GC-C) receptor, which is present in high density on the apical surface of normal intestinal epithelial cells as well as on the surface of human colon cancer cells. In the current study, two STh analogs were synthesized and evaluated in vitro and in vivo. Both analogs shared identical 6-19 core sequences, and had N-terminal pendant DOTA moieties. The analogs differed in the identity of a 6 amino acid peptide sequence intervening between DOTA and the 6-19 core. In one analog, the peptide was an RGD-containing sequence found in human fibronectin (GRGDSP), while in the other this peptide sequence was randomly scrambled (GRDSGP). The results indicated that the presence of the human fibronectin sequence in the hybrid peptide did not affect tumor localization in vivo.
Cationic antimicrobial peptides inactivate Shiga toxin-encoding bacteriophages

NASA Astrophysics Data System (ADS)

Del Cogliano, Manuel E.; Hollmann, Axel; Martinez, Melina; Semorile, Liliana; Ghiringhelli, Pablo D.; Maffía, Paulo C.; Bentancor, Leticia V.

2017-12-01

Shiga toxin (Stx) is the principal virulence factor during Shiga toxin-producing Escherichia coli (STEC) infections. We have previously reported the inactivation of bacteriophage encoding Stx after treatment with chitosan, a linear polysaccharide polymer with cationic properties. Cationic antimicrobial peptides (cAMPs) are short linear aminoacidic sequences, with a positive net charge, which display bactericidal or bacteriostatic activity against a wide range of bacterial species. They are promising novel antibiotics since they have shown bactericidal effects against multiresistant bacteria. To evaluate whether cationic properties are responsible for bacteriophage inactivation, we tested seven cationic peptides with proven antimicrobial activity as anti-bacteriophage agents, and one random sequence cationic peptide with no antimicrobial activity as a control. We observed bacteriophage inactivation after incubation with five cAMPs, but no inactivating activity was observed with the random sequence cationic peptide or with the non alpha helical cAMP Omiganan. Finally, to confirm peptide-bacteriophage interaction, zeta potential was analyzed by following changes on bacteriophage surface charges after peptide incubation. According to our results we could propose that: 1) direct interaction of peptides with phage is a necessary step for bacteriophage inactivation, 2) cationic properties are necessary but not sufficient for bacteriophage inactivation, and 3) inactivation by cationic peptides could be sequence (or structure) specific. Overall our data suggest that these peptides could be considered a new family of molecules potentially useful to decrease bacteriophage replication and Stx expression.
Graphene Nanopores for Protein Sequencing.

PubMed

Wilson, James; Sloman, Leila; He, Zhiren; Aksimentiev, Aleksei

2016-07-19

An inexpensive, reliable method for protein sequencing is essential to unraveling the biological mechanisms governing cellular behavior and disease. Current protein sequencing methods suffer from limitations associated with the size of proteins that can be sequenced, the time, and the cost of the sequencing procedures. Here, we report the results of all-atom molecular dynamics simulations that investigated the feasibility of using graphene nanopores for protein sequencing. We focus our study on the biologically significant phenylalanine-glycine repeat peptides (FG-nups)-parts of the nuclear pore transport machinery. Surprisingly, we found FG-nups to behave similarly to single stranded DNA: the peptides adhere to graphene and exhibit step-wise translocation when subject to a transmembrane bias or a hydrostatic pressure gradient. Reducing the peptide's charge density or increasing the peptide's hydrophobicity was found to decrease the translocation speed. Yet, unidirectional and stepwise translocation driven by a transmembrane bias was observed even when the ratio of charged to hydrophobic amino acids was as low as 1:8. The nanopore transport of the peptides was found to produce stepwise modulations of the nanopore ionic current correlated with the type of amino acids present in the nanopore, suggesting that protein sequencing by measuring ionic current blockades may be possible.
Generation of 2A-linked multicistronic cassettes by recombinant PCR.

PubMed

Szymczak-Workman, Andrea L; Vignali, Kate M; Vignali, Dario A A

2012-02-01

The need for reliable, multicistronic vectors for multigene delivery is at the forefront of biomedical technology. It is now possible to express multiple proteins from a single open reading frame (ORF) using 2A peptide-linked multicistronic vectors. These small sequences, when cloned between genes, allow for efficient, stoichiometric production of discrete protein products within a single vector through a novel "cleavage" event within the 2A peptide sequence. Expression of more than two genes using conventional approaches has several limitations, most notably imbalanced protein expression and large size. The use of 2A peptide sequences alleviates these concerns. They are small (18-22 amino acids) and have divergent amino-terminal sequences, which minimizes the chance for homologous recombination and allows for multiple, different 2A peptide sequences to be used within a single vector. Importantly, separation of genes placed between 2A peptide sequences is nearly 100%, which allows for stoichiometric and concordant expression of the genes, regardless of the order of placement within the vector. This protocol describes the use of recombinant polymerase chain reaction (PCR) to connect multiple 2A-linked protein sequences. The final construct is subcloned into an expression vector.
Two new bradykinin-related peptides from the venom of the social wasp Protopolybia exigua (Saussure).

PubMed

Mendes, Maria Anita; Palma, Mario Sergio

2006-11-01

Two bradykinin-related peptides (Protopolybiakinin-I and Protopolybiakinin-II) were isolated from the venom of the social wasp Protopolybia exigua by RP-HPLC, and sequenced by Edman degradation method. Peptide sequences of Protopolybiakinin-I and Protopolybiakinin-II were DKNKKPIRVGGRRPPGFTR-OH and DKNKKPIWMAGFPGFTPIR-OH, respectively. Synthetic peptides with identical sequences to the bradykinin-related peptides and their biological functions were characterized. Protopolybiakinin-I caused less potent constriction of the isolated rat ileum muscles than bradykinin (BK). In addition, it caused degranulation of mast cells which was seven times more potent than BK. This peptide causes algesic effects due to the direct activation of B(2)-receptors. Protopolybiakinin-II is not an agonist of rat ileum muscle and had no algesic effects. However, Protopolybiakinin-II was found to be 10 times more potent as a mast cell degranulator than BK. The amino acid sequence of Protopolybiakinin-I is the longest among the known wasp kinins.
PHASTpep: Analysis Software for Discovery of Cell-Selective Peptides via Phage Display and Next-Generation Sequencing

PubMed Central

Dasa, Siva Sai Krishna; Kelly, Kimberly A.

2016-01-01

Next-generation sequencing has enhanced the phage display process, allowing for the quantification of millions of sequences resulting from the biopanning process. In response, many valuable analysis programs focused on specificity and finding targeted motifs or consensus sequences were developed. For targeted drug delivery and molecular imaging, it is also necessary to find peptides that are selective—targeting only the cell type or tissue of interest. We present a new analysis strategy and accompanying software, PHage Analysis for Selective Targeted PEPtides (PHASTpep), which identifies highly specific and selective peptides. Using this process, we discovered and validated, both in vitro and in vivo in mice, two sequences (HTTIPKV and APPIMSV) targeted to pancreatic cancer-associated fibroblasts that escaped identification using previously existing software. Our selectivity analysis makes it possible to discover peptides that target a specific cell type and avoid other cell types, enhancing clinical translatability by circumventing complications with systemic use. PMID:27186887
HomoSAR: bridging comparative protein modeling with quantitative structural activity relationship to design new peptides.

PubMed

Borkar, Mahesh R; Pissurlenkar, Raghuvir R S; Coutinho, Evans C

2013-11-15

Peptides play significant roles in the biological world. To optimize activity for a specific therapeutic target, peptide library synthesis is inevitable; which is a time consuming and expensive. Computational approaches provide a promising way to simply elucidate the structural basis in the design of new peptides. Earlier, we proposed a novel methodology termed HomoSAR to gain insight into the structure activity relationships underlying peptides. Based on an integrated approach, HomoSAR uses the principles of homology modeling in conjunction with the quantitative structural activity relationship formalism to predict and design new peptide sequences with the optimum activity. In the present study, we establish that the HomoSAR methodology can be universally applied to all classes of peptides irrespective of sequence length by studying HomoSAR on three peptide datasets viz., angiotensin-converting enzyme inhibitory peptides, CAMEL-s antibiotic peptides, and hAmphiphysin-1 SH3 domain binding peptides, using a set of descriptors related to the hydrophobic, steric, and electronic properties of the 20 natural amino acids. Models generated for all three datasets have statistically significant correlation coefficients (r(2)) and predictive r2 (r(pred)2) and cross validated coefficient ( q(LOO)2). The daintiness of this technique lies in its simplicity and ability to extract all the information contained in the peptides to elucidate the underlying structure activity relationships. The difficulties of correlating both sequence diversity and variation in length of the peptides with their biological activity can be addressed. The study has been able to identify the preferred or detrimental nature of amino acids at specific positions in the peptide sequences. Copyright © 2013 Wiley Periodicals, Inc.
Peptide array-based interaction assay of solid-bound peptides and anchorage-dependant cells and its effectiveness in cell-adhesive peptide design.

PubMed

Kato, Ryuji; Kaga, Chiaki; Kunimatsu, Mitoshi; Kobayashi, Takeshi; Honda, Hiroyuki

2006-06-01

Peptide array, the designable peptide library covalently synthesized on cellulose support, was applied to assay peptide-cell interaction, between solid-bound peptides and anchorage-dependant cells, to study objective peptide design. As a model case, cell-adhesive peptides that could enhance cell growth as tissue engineering scaffold material, was studied. On the peptide array, the relative cell-adhesion ratio of NIH/3T3 cells was 2.5-fold higher on the RGDS (Arg-Gly-Asp-Ser) peptide spot as compared to the spot with no peptide, thus indicating integrin-mediated peptide-cell interaction. Such strong cell adhesion mediated by the RGDS peptide was easily disrupted by single residue substitution on the peptide array, thus indicating that the sequence recognition accuracy of cells was strictly conserved in our optimized scheme. The observed cellular morphological extension with active actin stress-fiber on the RGD motif-containing peptide supported our strategy that peptide array-based interaction assay of solid-bound peptide and anchorage-dependant cells (PIASPAC) could provide quantitative data on biological peptide-cell interaction. The analysis of 180 peptides obtained from fibronectin type III domain (no. 1447-1629) yielded 18 novel cell-adhesive peptides without the RGD motif. Taken together with the novel candidates, representative rules of ineffective amino acid usage were obtained from non-effective candidate sequences for the effective designing of cell-adhesive peptides. On comparing the amino acid usage of the top 20 and last 20 peptides from the 180 peptides, the following four brief design rules were indicated: (i) Arg or Lys of positively charged amino acids (except His) could enhance cell adhesion, (ii) small hydrophilic amino acids are favored in cell-adhesion peptides, (iii) negatively charged amino acids and small amino acids (except Gly) could reduce cell adhesion, and (iv) Cys and Met could be excluded from the sequence combination since they have less influence on the peptide design. Such rules that are indicative of the nature of the functional peptide sequence can be obtained only by the mass comparison analysis of PIASPAC using peptide array. By following such indicative rules, numerous amino acid combinations can be effectively screened for further examination of novel peptide design.

Structural studies of polypeptides: Mechanism of immunoglobin catalysis and helix propagation in hybrid sequence, disulfide containing peptides

DOE Office of Scientific and Technical Information (OSTI.GOV)

Storrs, Richard Wood

1992-08-01

Catalytic immunoglobin fragments were studied Nuclear Magnetic Resonance spectroscopy to identify amino acid residues responsible for the catalytic activity. Small, hybrid sequence peptides were analyzed for helix propagation following covalent initiation and for activity related to the protein from which the helical sequence was derived. Hydrolysis of p-nitrophenyl carbonates and esters by specific immunoglobins is thought to involve charge complementarity. The pK of the transition state analog P-nitrophenyl phosphate bound to the immunoglobin fragment was determined by 31P-NMR to verify the juxtaposition of a positively charged amino acid to the binding/catalytic site. Optical studies of immunoglobin mediated photoreversal of cis,more » syn cyclobutane thymine dimers implicated tryptophan as the photosensitizing chromophore. Research shows the chemical environment of a single tryptophan residue is altered upon binding of the thymine dimer. This tryptophan residue was localized to within 20 Å of the binding site through the use of a nitroxide paramagnetic species covalently attached to the thymine dimer. A hybrid sequence peptide was synthesized based on the bee venom peptide apamin in which the helical residues of apamin were replaced with those from the recognition helix of the bacteriophage 434 repressor protein. Oxidation of the disufide bonds occured uniformly in the proper 1-11, 3-15 orientation, stabilizing the 434 sequence in an α-helix. The glycine residue stopped helix propagation. Helix propagation in 2,2,2-trifluoroethanol mixtures was investigated in a second hybrid sequence peptide using the apamin-derived disulfide scaffold and the S-peptide sequence. The helix-stop signal previously observed was not observed in the NMR NOESY spectrum. Helical connectivities were seen throughout the S-peptide sequence. The apamin/S-peptide hybrid binded to the S-protein (residues 21-166 of ribonuclease A) and reconstituted enzymatic activity.« less
Structural studies of polypeptides: Mechanism of immunoglobin catalysis and helix propagation in hybrid sequence, disulfide containing peptides

DOE Office of Scientific and Technical Information (OSTI.GOV)

Storrs, R.W.

1992-08-01

Catalytic immunoglobin fragments were studied Nuclear Magnetic Resonance spectroscopy to identify amino acid residues responsible for the catalytic activity. Small, hybrid sequence peptides were analyzed for helix propagation following covalent initiation and for activity related to the protein from which the helical sequence was derived. Hydrolysis of p-nitrophenyl carbonates and esters by specific immunoglobins is thought to involve charge complementarity. The pK of the transition state analog P-nitrophenyl phosphate bound to the immunoglobin fragment was determined by [sup 31]P-NMR to verify the juxtaposition of a positively charged amino acid to the binding/catalytic site. Optical studies of immunoglobin mediated photoreversal ofmore » cis, syn cyclobutane thymine dimers implicated tryptophan as the photosensitizing chromophore. Research shows the chemical environment of a single tryptophan residue is altered upon binding of the thymine dimer. This tryptophan residue was localized to within 20 [Angstrom] of the binding site through the use of a nitroxide paramagnetic species covalently attached to the thymine dimer. A hybrid sequence peptide was synthesized based on the bee venom peptide apamin in which the helical residues of apamin were replaced with those from the recognition helix of the bacteriophage 434 repressor protein. Oxidation of the disufide bonds occured uniformly in the proper 1-11, 3-15 orientation, stabilizing the 434 sequence in an [alpha]-helix. The glycine residue stopped helix propagation. Helix propagation in 2,2,2-trifluoroethanol mixtures was investigated in a second hybrid sequence peptide using the apamin-derived disulfide scaffold and the S-peptide sequence. The helix-stop signal previously observed was not observed in the NMR NOESY spectrum. Helical connectivities were seen throughout the S-peptide sequence. The apamin/S-peptide hybrid binded to the S-protein (residues 21-166 of ribonuclease A) and reconstituted enzymatic activity.« less
Predictive Model of Linear Antimicrobial Peptides Active against Gram-Negative Bacteria.

PubMed

Vishnepolsky, Boris; Gabrielian, Andrei; Rosenthal, Alex; Hurt, Darrell E; Tartakovsky, Michael; Managadze, Grigol; Grigolava, Maya; Makhatadze, George I; Pirtskhalava, Malak

2018-05-29

Antimicrobial peptides (AMPs) have been identified as a potential new class of anti-infectives for drug development. There are a lot of computational methods that try to predict AMPs. Most of them can only predict if a peptide will show any antimicrobial potency, but to the best of our knowledge, there are no tools which can predict antimicrobial potency against particular strains. Here we present a predictive model of linear AMPs being active against particular Gram-negative strains relying on a semi-supervised machine-learning approach with a density-based clustering algorithm. The algorithm can well distinguish peptides active against particular strains from others which may also be active but not against the considered strain. The available AMP prediction tools cannot carry out this task. The prediction tool based on the algorithm suggested herein is available on https://dbaasp.org.
Algorithms for database-dependent search of MS/MS data.

PubMed

Matthiesen, Rune

2013-01-01

The frequent used bottom-up strategy for identification of proteins and their associated modifications generate nowadays typically thousands of MS/MS spectra that normally are matched automatically against a protein sequence database. Search engines that take as input MS/MS spectra and a protein sequence database are referred as database-dependent search engines. Many programs both commercial and freely available exist for database-dependent search of MS/MS spectra and most of the programs have excellent user documentation. The aim here is therefore to outline the algorithm strategy behind different search engines rather than providing software user manuals. The process of database-dependent search can be divided into search strategy, peptide scoring, protein scoring, and finally protein inference. Most efforts in the literature have been put in to comparing results from different software rather than discussing the underlining algorithms. Such practical comparisons can be cluttered by suboptimal implementation and the observed differences are frequently caused by software parameters settings which have not been set proper to allow even comparison. In other words an algorithmic idea can still be worth considering even if the software implementation has been demonstrated to be suboptimal. The aim in this chapter is therefore to split the algorithms for database-dependent searching of MS/MS data into the above steps so that the different algorithmic ideas become more transparent and comparable. Most search engines provide good implementations of the first three data analysis steps mentioned above, whereas the final step of protein inference are much less developed for most search engines and is in many cases performed by an external software. The final part of this chapter illustrates how protein inference is built into the VEMS search engine and discusses a stand-alone program SIR for protein inference that can import a Mascot search result.
Complete covalent structure of statherin, a tyrosine-rich acidic peptide which inhibits calcium phosphate precipitation from human parotid saliva.

PubMed

Schlesinger, D H; Hay, D I

1977-03-10

The complete amino acid sequence of human salivary statherin, a peptide which strongly inhibits precipitation from supersaturated calcium phosphate solutions, and therefore stabilizes supersaturated saliva, has been determined. The NH2-terminal half of this Mr=5380 (43 amino acids) polypeptide was determined by automated Edman degradations (liquid phase) on native statherin. The peptide was digested separately with trypsin, chymotrypsin, and Staphylococcus aureus protease, and the resulting peptides were purified by gel filtration. Manual Edman degradations on purified peptide fragments yielded peptides that completed the amino acid sequence through the penultimate COOH-terminal residue. These analyses, together with carboxypeptidase digestion of native statherin and of peptide fragments of statherin, established the complete sequence of the molecule. The 2 serine residues (positions 2 and 3) in statherin were identified as phosphoserine. The amino acid sequence of human salivary statherin is striking in a number of ways. The NH2-terminal one-third is highly polar and includes three polar dipeptides: H2PO3-Ser-Ser-H2PO3-Arg-Arg-, and Glu-Glu-. The COOH-terminal two-thirds of the molecule is hydrophobic, containing several repeating dipeptides: four of -Gn-Pro-, three of -Tyr-Gln-, two of -Gly-Tyr-, two of-Gln-Tyr-, and two of the tetrapeptide sequence -Pro-Tyr-Gln-Pro-. Unusual cleavage sites in the statherin sequence obtained with chymotrypsin and S. aureus protease were also noted.
Hydroxyapatite-binding peptides for bone growth and inhibition

DOEpatents

Bertozzi, Carolyn R [Berkeley, CA; Song, Jie [Shrewsbury, MA; Lee, Seung-Wuk [Walnut Creek, CA

2011-09-20

Hydroxyapatite (HA)-binding peptides are selected using combinatorial phage library display. Pseudo-repetitive consensus amino acid sequences possessing periodic hydroxyl side chains in every two or three amino acid sequences are obtained. These sequences resemble the (Gly-Pro-Hyp).sub.x repeat of human type I collagen, a major component of extracellular matrices of natural bone. A consistent presence of basic amino acid residues is also observed. The peptides are synthesized by the solid-phase synthetic method and then used for template-driven HA-mineralization. Microscopy reveal that the peptides template the growth of polycrystalline HA crystals .about.40 nm in size.
Production of Angiotensin-I-Converting-Enzyme-Inhibitory Peptides in Fermented Milks Started by Lactobacillus delbrueckii subsp. bulgaricus SS1 and Lactococcus lactis subsp. cremoris FT4

PubMed Central

Gobbetti, M.; Ferranti, P.; Smacchi, E.; Goffredi, F.; Addeo, F.

2000-01-01

Two fermented milks containing angiotensin-I-converting-enzyme (ACE)-inhibitory peptides were produced by using selected Lactobacillus delbrueckii subsp. bulgaricus SS1 and L. lactis subsp. cremoris FT4. The pH 4.6-soluble nitrogen fraction of the two fermented milks was fractionated by reversed-phase fast-protein liquid chromatography. The fractions which showed the highest ACE-inhibitory indexes were further purified, and the related peptides were sequenced by tandem fast atom bombardment-mass spectrometry. The most inhibitory fractions of the milk fermented by L. delbrueckii subsp. bulgaricus SS1 contained the sequences of β-casein (β-CN) fragment 6-14 (f6-14), f7-14, f73-82, f74-82, and f75-82. Those from the milk fermented by L. lactis subsp. cremoris FT4 contained the sequences of β-CN f7-14, f47-52, and f169-175 and κ-CN f155-160 and f152-160. Most of these sequences had features in common with other ACE-inhibitory peptides reported in the literature. In particular, the β-CN f47-52 sequence had high homology with that of angiotensin-II. Some of these peptides were chemically synthesized. The 50% inhibitory concentrations (IC50s) of the crude purified fractions containing the peptide mixture were very low (8.0 to 11.2 mg/liter). When the synthesized peptides were used individually, the ACE-inhibitory activity was confirmed but the IC50s increased considerably. A strengthened inhibitory effect of the peptide mixtures with respect to the activity of individual peptides was presumed. Once generated, the inhibitory peptides were resistant to further proteolysis either during dairy processing or by trypsin and chymotrypsin. PMID:10966406
Opposite Electron-Transfer Dissociation and Higher-Energy Collisional Dissociation Fragmentation Characteristics of Proteolytic K/R(X)n and (X)nK/R Peptides Provide Benefits for Peptide Sequencing in Proteomics and Phosphoproteomics.

PubMed

Tsiatsiani, Liana; Giansanti, Piero; Scheltema, Richard A; van den Toorn, Henk; Overall, Christopher M; Altelaar, A F Maarten; Heck, Albert J R

2017-02-03

A key step in shotgun proteomics is the digestion of proteins into peptides amenable for mass spectrometry. Tryptic peptides can be readily sequenced and identified by collision-induced dissociation (CID) or higher-energy collisional dissociation (HCD) because the fragmentation rules are well-understood. Here, we investigate LysargiNase, a perfect trypsin mirror protease, because it cleaves equally specific at arginine and lysine residues, albeit at the N-terminal end. LysargiNase peptides are therefore practically tryptic-like in length and sequence except that following ESI, the two protons are now both positioned at the N-terminus. Here, we compare side-by-side the chromatographic separation properties, gas-phase fragmentation characteristics, and (phospho)proteome sequence coverage of tryptic (i.e., (X) n K/R) and LysargiNase (i.e., K/R(X) n ) peptides using primarily electron-transfer dissociation (ETD) and, for comparison, HCD. We find that tryptic and LysargiNase peptides fragment nearly as mirror images. For LysargiNase predominantly N-terminal peptide ions (c-ions (ETD) and b-ions (HCD)) are formed, whereas for trypsin, C-terminal fragment ions dominate (z-ions (ETD) and y-ions (HCD)) in a homologous mixture of complementary ions. Especially during ETD, LysargiNase peptides fragment into low-complexity but information-rich sequence ladders. Trypsin and LysargiNase chart distinct parts of the proteome, and therefore, the combined use of these enzymes will benefit a more in-depth and reliable analysis of (phospho)proteomes.
Loop propensity of the sequence YKGQP from staphylococcal nuclease: implications for the folding of nuclease.

PubMed

Patel, Sunita; Sasidhar, Yellamraju U

2007-10-01

Recently we performed molecular dynamics (MD) simulations on the folding of the hairpin peptide DTVKLMYKGQPMTFR from staphylococcal nuclease in explicit water. We found that the peptide folds into a hairpin conformation with native and nonnative hydrogen-bonding patterns. In all the folding events observed in the folding of the hairpin peptide, loop formation involving the region YKGQP was an important event. In order to trace the origins of the loop propensity of the sequence YKGQP, we performed MD simulations on the sequence starting from extended, polyproline II and native type I' turn conformations for a total simulation length of 300 ns, using the GROMOS96 force field under constant volume and temperature (NVT) conditions. The free-energy landscape of the peptide YKGQP shows minima corresponding to loop conformation with Tyr and Pro side-chain association, turn and extended conformational forms, with modest free-energy barriers separating the minima. To elucidate the role of Gly in facilitating loop formation, we also performed MD simulations of the mutated peptide YKAQP (Gly --> Ala mutation) under similar conditions starting from polyproline II conformation for 100 ns. Two minima corresponding to bend/turn and extended conformations were observed in the free-energy landscape for the peptide YKAQP. The free-energy barrier between the minima in the free-energy landscape of the peptide YKAQP was also modest. Loop conformation is largely sampled by the YKGQP peptide, while extended conformation is largely sampled by the YKAQP peptide. We also explain why the YKGQP sequence samples type II turn conformation in these simulations, whereas the sequence as part of the hairpin peptide DTVKLMYKGQPMTFR samples type I' turn conformation both in the X-ray crystal structure and in our earlier simulations on the folding of the hairpin peptide. We discuss the implications of our results to the folding of the staphylococcal nuclease. Copyright (c) 2007 European Peptide Society and John Wiley & Sons, Ltd.
Anopheles gambiae genome reannotation through synthesis of ab initio and comparative gene prediction algorithms

PubMed Central

Li, Jun; Riehle, Michelle M; Zhang, Yan; Xu, Jiannong; Oduol, Frederick; Gomez, Shawn M; Eiglmeier, Karin; Ueberheide, Beatrix M; Shabanowitz, Jeffrey; Hunt, Donald F; Ribeiro, José MC; Vernick, Kenneth D

2006-01-01

Background Complete genome annotation is a necessary tool as Anopheles gambiae researchers probe the biology of this potent malaria vector. Results We reannotate the A. gambiae genome by synthesizing comparative and ab initio sets of predicted coding sequences (CDSs) into a single set using an exon-gene-union algorithm followed by an open-reading-frame-selection algorithm. The reannotation predicts 20,970 CDSs supported by at least two lines of evidence, and it lowers the proportion of CDSs lacking start and/or stop codons to only approximately 4%. The reannotated CDS set includes a set of 4,681 novel CDSs not represented in the Ensembl annotation but with EST support, and another set of 4,031 Ensembl-supported genes that undergo major structural and, therefore, probably functional changes in the reannotated set. The quality and accuracy of the reannotation was assessed by comparison with end sequences from 20,249 full-length cDNA clones, and evaluation of mass spectrometry peptide hit rates from an A. gambiae shotgun proteomic dataset confirms that the reannotated CDSs offer a high quality protein database for proteomics. We provide a functional proteomics annotation, ReAnoXcel, obtained by analysis of the new CDSs through the AnoXcel pipeline, which allows functional comparisons of the CDS sets within the same bioinformatic platform. CDS data are available for download. Conclusion Comprehensive A. gambiae genome reannotation is achieved through a combination of comparative and ab initio gene prediction algorithms. PMID:16569258
Correlating low-similarity peptide sequences and allergenic epitopes.

PubMed

Kanduc, D

2008-01-01

Although a high number of allergenic peptide epitopes has been experimentally identified and defined, the molecular basis and the precise mechanisms underlying peptide allergenicity are unknown. This issue was analyzed exploring the relationship between peptide allergenicity and sequence similarity to the human proteome. The structured analysis of the data reported in literature put into evidence that the most part of IgE-binding epitopes are (or harbor) pentapeptide unit(s) with no/low similarity to the human proteome, this way suggesting that no or low sequence similarity to the host proteome might represent a minimum common denominator identifying allergenic peptides. The present literature analysis might be of relevance in devising and designing short amino acid modules to be used for blocking pathogenic IgE.
A target-unrelated peptide in an M13 phage display library traced to an advantageous mutation in the gene II ribosome-binding site.

PubMed

Brammer, Leighanne A; Bolduc, Benjamin; Kass, Jessica L; Felice, Kristin M; Noren, Christopher J; Hall, Marilena Fitzsimons

2008-02-01

Screening of the commercially available Ph.D.-7 phage-displayed heptapeptide library for peptides that bind immobilized Zn2+ resulted in the repeated selection of the peptide HAIYPRH, although binding assays indicated that HAIYPRH is not a zinc-binding peptide. HAIYPRH has also been selected in several other laboratories using completely different targets, and its ubiquity suggests that it is a target-unrelated peptide. We demonstrated that phage displaying HAIYPRH are enriched after serial amplification of the library without exposure to target. The amplification of phage displaying HAIYPRH was found to be dramatically faster than that of the library itself. DNA sequencing uncovered a mutation in the Shine-Dalgarno (SD) sequence for gIIp, a protein involved in phage replication, imparting to the SD sequence better complementarity to the 16S ribosomal RNA (rRNA). Introducing this mutation into phage lacking a displayed peptide resulted in accelerated propagation, whereas phage displaying HAIYPRH with a wild-type SD sequence were found to amplify normally. The SD mutation may alter gIIp expression and, consequently, the rate of propagation of phage. In the Ph.D.-7 library, the mutation is coincident with the displayed peptide HAIYPRH, accounting for the target-unrelated selection of this peptide in multiple reported panning experiments.
NetMHCIIpan-2.0 - Improved pan-specific HLA-DR predictions using a novel concurrent alignment and weight optimization training procedure.

PubMed

Nielsen, Morten; Justesen, Sune; Lund, Ole; Lundegaard, Claus; Buus, Søren

2010-11-13

Binding of peptides to Major Histocompatibility class II (MHC-II) molecules play a central role in governing responses of the adaptive immune system. MHC-II molecules sample peptides from the extracellular space allowing the immune system to detect the presence of foreign microbes from this compartment. Predicting which peptides bind to an MHC-II molecule is therefore of pivotal importance for understanding the immune response and its effect on host-pathogen interactions. The experimental cost associated with characterizing the binding motif of an MHC-II molecule is significant and large efforts have therefore been placed in developing accurate computer methods capable of predicting this binding event. Prediction of peptide binding to MHC-II is complicated by the open binding cleft of the MHC-II molecule, allowing binding of peptides extending out of the binding groove. Moreover, the genes encoding the MHC molecules are immensely diverse leading to a large set of different MHC molecules each potentially binding a unique set of peptides. Characterizing each MHC-II molecule using peptide-screening binding assays is hence not a viable option. Here, we present an MHC-II binding prediction algorithm aiming at dealing with these challenges. The method is a pan-specific version of the earlier published allele-specific NN-align algorithm and does not require any pre-alignment of the input data. This allows the method to benefit also from information from alleles covered by limited binding data. The method is evaluated on a large and diverse set of benchmark data, and is shown to significantly out-perform state-of-the-art MHC-II prediction methods. In particular, the method is found to boost the performance for alleles characterized by limited binding data where conventional allele-specific methods tend to achieve poor prediction accuracy. The method thus shows great potential for efficient boosting the accuracy of MHC-II binding prediction, as accurate predictions can be obtained for novel alleles at highly reduced experimental costs. Pan-specific binding predictions can be obtained for all alleles with know protein sequence and the method can benefit by including data in the training from alleles even where only few binders are known. The method and benchmark data are available at http://www.cbs.dtu.dk/services/NetMHCIIpan-2.0.
Efficient Identification of Murine M2 Macrophage Peptide Targeting Ligands by Phage Display and Next-Generation Sequencing.

PubMed

Liu, Gary W; Livesay, Brynn R; Kacherovsky, Nataly A; Cieslewicz, Maryelise; Lutz, Emi; Waalkes, Adam; Jensen, Michael C; Salipante, Stephen J; Pun, Suzie H

2015-08-19

Peptide ligands are used to increase the specificity of drug carriers to their target cells and to facilitate intracellular delivery. One method to identify such peptide ligands, phage display, enables high-throughput screening of peptide libraries for ligands binding to therapeutic targets of interest. However, conventional methods for identifying target binders in a library by Sanger sequencing are low-throughput, labor-intensive, and provide a limited perspective (<0.01%) of the complete sequence space. Moreover, the small sample space can be dominated by nonspecific, preferentially amplifying "parasitic sequences" and plastic-binding sequences, which may lead to the identification of false positives or exclude the identification of target-binding sequences. To overcome these challenges, we employed next-generation Illumina sequencing to couple high-throughput screening and high-throughput sequencing, enabling more comprehensive access to the phage display library sequence space. In this work, we define the hallmarks of binding sequences in next-generation sequencing data, and develop a method that identifies several target-binding phage clones for murine, alternatively activated M2 macrophages with a high (100%) success rate: sequences and binding motifs were reproducibly present across biological replicates; binding motifs were identified across multiple unique sequences; and an unselected, amplified library accurately filtered out parasitic sequences. In addition, we validate the Multiple Em for Motif Elicitation tool as an efficient and principled means of discovering binding sequences.
Ultrahigh-resolution Fourier transform ion cyclotron resonance mass spectrometry and tandem mass spectrometry for peptide de novo amino acid sequencing for a seven-protein mixture by paired single-residue transposed Lys-N and Lys-C digestion.

PubMed

Guan, Xiaoyan; Brownstein, Naomi C; Young, Nicolas L; Marshall, Alan G

2017-01-30

Bottom-up tandem mass spectrometry (MS/MS) is regularly used in proteomics to identify proteins from a sequence database. De novo sequencing is also available for sequencing peptides with relatively short sequence lengths. We recently showed that paired Lys-C and Lys-N proteases produce peptides of identical mass and similar retention time, but different tandem mass spectra. Such parallel experiments provide complementary information, and allow for up to 100% MS/MS sequence coverage. Here, we report digestion by paired Lys-C and Lys-N proteases of a seven-protein mixture: human hemoglobin alpha, bovine carbonic anhydrase 2, horse skeletal muscle myoglobin, hen egg white lysozyme, bovine pancreatic ribonuclease, bovine rhodanese, and bovine serum albumin, followed by reversed-phase nanoflow liquid chromatography, collision-induced dissociation, and 14.5 T Fourier transform ion cyclotron resonance mass spectrometry. Matched pairs of product peptide ions of equal precursor mass and similar retention times from each digestion are compared, leveraging single-residue transposed information with independent interferences to confidently identify fragment ion types, residues, and peptides. Selected pairs of product ion mass spectra for de novo sequenced protein segments from each member of the mixture are presented. Pairs of the transposed product ions as well as complementary information from the parallel experiments allow for both high MS/MS coverage for long peptide sequences and high confidence in the amino acid identification. Moreover, the parallel experiments in the de novo sequencing reduce false-positive matches of product ions from the single-residue transposed peptides from the same segment, and thereby further improve the confidence in protein identification. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
The identification of disulfides in ricin D using proteolytic cleavage followed by negative-ion nano-electrospray ionization mass spectrometry of the peptide fragments.

PubMed

Tran, T T Nha; Brinkworth, Craig S; Bowie, John H

2015-01-30

To use negative-ion nano-electrospray ionization mass spectrometry of peptides from the tryptic digest of ricin D, to provide sequence information; in particular, to identify disulfide position and connectivity. Negative-ion fragmentations of peptides from the tryptic digest of ricin D was studied using a Waters QTOF2 mass spectrometer operating in MS and MS(2) modes. Twenty-three peptides were obtained following high-performance liquid chromatography and studied by negative-ion mass spectrometry covering 73% of the amino-acid residues of ricin D. Five disulfide-containing peptides were identified, three intermolecular and two intramolecular disulfide-containing peptides. The [M-H](-) anions of the intermolecular disulfides undergo facile cleavage of the disulfide units to produce fragment peptides. In negative-ion collision-induced dissociation (CID) these source-formed anions undergo backbone cleavages, which provide sequencing information. The two intramolecular disulfides were converted proteolytically into intermolecular disulfides, which were identified as outlined above. The positions of the five disulfide groups in ricin D may be determined by characteristic negative-ion cleavage of the disulfide groups, while sequence information may be determined using the standard negative-ion backbone cleavages of the resulting cleaved peptides. Negative-ion mass spectrometry can also be used to provide partial sequencing information for other peptides (i.e. those not containing Cys) using the standard negative-ion backbone cleavages of these peptides. Copyright © 2014 John Wiley & Sons, Ltd.
Effect of amino acid substitution on biological activity of cyanophlyctin-β and brevinin-2R

NASA Astrophysics Data System (ADS)

Ghorani-Azam, Adel; Balali-Mood, Mahdi; Aryan, Ehsan; Karimi, Gholamreza; Riahi-Zanjani, Bamdad

2018-04-01

Antimicrobial peptides (AMPs), as ancient immune components, are found in almost all types of living organisms. They are bioactive components with strong antibacterial, antiviral, and anti-tumor properties. In this study, we designed three sequences of antimicrobial peptides to study the effects of structural changes in biological activity compared with original peptides, cyanophlyctin β, and brevinin-2R. For antibacterial activity, two Gram-positive (Staphylococcus aureus and S. epidermidis) and two Gram-negative bacteria (Escherichia coli and Pseudomonas aeroginosa) were assayed. Unlike cyanophlyctin β and brevinin-2R, the synthesized peptide (brevinin-M1, brevinin-M2 and brevinin-M3) showed no considerable antibacterial properties. Hemolytic activity of these peptides was also ignorable even at very high concentrations of 2 mg/ml. However, after proteolytic digestion by trypsin, the peptides showed antibacterial activity comparable to their original template sequences. Structural prediction suggested that the motif sequence responsible for antibacterial activity may be re-exposed to bacterial cell membrane after proteolytic digestion. Also, findings showed that only a small change in primary sequence and therefore structure of peptides may result in a significant alteration in biological activity.
sNebula, a network-based algorithm to predict binding between human leukocyte antigens and peptides

DOE Office of Scientific and Technical Information (OSTI.GOV)

Luo, Heng; Ye, Hao; Ng, Hui Wen

Understanding the binding between human leukocyte antigens (HLAs) and peptides is important to understand the functioning of the immune system. Since it is time-consuming and costly to measure the binding between large numbers of HLAs and peptides, computational methods including machine learning models and network approaches have been developed to predict HLA-peptide binding. However, there are several limitations for the existing methods. We developed a network-based algorithm called sNebula to address these limitations. We curated qualitative Class I HLA-peptide binding data and demonstrated the prediction performance of sNebula on this dataset using leave-one-out cross-validation and five-fold cross-validations. Furthermore, this algorithmmore » can predict not only peptides of different lengths and different types of HLAs, but also the peptides or HLAs that have no existing binding data. We believe sNebula is an effective method to predict HLA-peptide binding and thus improve our understanding of the immune system.« less
sNebula, a network-based algorithm to predict binding between human leukocyte antigens and peptides

DOE PAGES

Luo, Heng; Ye, Hao; Ng, Hui Wen; ...

2016-08-25

Understanding the binding between human leukocyte antigens (HLAs) and peptides is important to understand the functioning of the immune system. Since it is time-consuming and costly to measure the binding between large numbers of HLAs and peptides, computational methods including machine learning models and network approaches have been developed to predict HLA-peptide binding. However, there are several limitations for the existing methods. We developed a network-based algorithm called sNebula to address these limitations. We curated qualitative Class I HLA-peptide binding data and demonstrated the prediction performance of sNebula on this dataset using leave-one-out cross-validation and five-fold cross-validations. Furthermore, this algorithmmore » can predict not only peptides of different lengths and different types of HLAs, but also the peptides or HLAs that have no existing binding data. We believe sNebula is an effective method to predict HLA-peptide binding and thus improve our understanding of the immune system.« less
A two-step database search method improves sensitivity in peptide sequence matches for metaproteomics and proteogenomics studies.

PubMed

Jagtap, Pratik; Goslinga, Jill; Kooren, Joel A; McGowan, Thomas; Wroblewski, Matthew S; Seymour, Sean L; Griffin, Timothy J

2013-04-01

Large databases (>10(6) sequences) used in metaproteomic and proteogenomic studies present challenges in matching peptide sequences to MS/MS data using database-search programs. Most notably, strict filtering to avoid false-positive matches leads to more false negatives, thus constraining the number of peptide matches. To address this challenge, we developed a two-step method wherein matches derived from a primary search against a large database were used to create a smaller subset database. The second search was performed against a target-decoy version of this subset database merged with a host database. High confidence peptide sequence matches were then used to infer protein identities. Applying our two-step method for both metaproteomic and proteogenomic analysis resulted in twice the number of high confidence peptide sequence matches in each case, as compared to the conventional one-step method. The two-step method captured almost all of the same peptides matched by the one-step method, with a majority of the additional matches being false negatives from the one-step method. Furthermore, the two-step method improved results regardless of the database search program used. Our results show that our two-step method maximizes the peptide matching sensitivity for applications requiring large databases, especially valuable for proteogenomics and metaproteomics studies. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Identification of a Unique Amyloid Sequence in AA Amyloidosis of a Pig Associated With Streptococcus Suis Infection.

PubMed

Kamiie, J; Sugahara, G; Yoshimoto, S; Aihara, N; Mineshige, T; Uetsuka, K; Shirota, K

2017-01-01

Here we report a pig with amyloid A (AA) amyloidosis associated with Streptococcus suis infection and identification of a unique amyloid sequence in the amyloid deposits in the tissue. Tissues from the 180-day-old underdeveloped pig contained foci of necrosis and suppurative inflammation associated with S. suis infection. Congo red stain, immunohistochemistry, and electron microscopy revealed intense AA deposition in the spleen and renal glomeruli. Mass spectrometric analysis of amyloid material extracted from the spleen showed serum AA 2 (SAA2) peptide as well as a unique peptide sequence previously reported in a pig with AA amyloidosis. The common detection of the unique amyloid sequence in the current and past cases of AA amyloidosis in pigs suggests that this amyloid sequence might play a key role in the development of porcine AA amyloidosis. An in vitro fibrillation assay demonstrated that the unique AA peptide formed typically rigid, long amyloid fibrils (10 nm wide) and the N-terminus peptide of SAA2 formed zigzagged, short fibers (7 nm wide). Moreover, the SAA2 peptide formed long, rigid amyloid fibrils in the presence of sonicated amyloid fibrils formed by the unique AA peptide. These findings indicate that the N-terminus of SAA2 as well as the AA peptide mediate the development of AA amyloidosis in pigs via cross-seeding polymerization.
Growth hormone-releasing hormone stimulates and somatostatin inhibits the release of a novel protein by cultured rat pituitary cells.

PubMed

Tachibana, K; Marquardt, H; Yokoya, S; Friesen, H G

1988-10-01

We have reported that the secretion of at least 17 distinct peptides [including rat (rGH)] GH by cultured rat pituitary cells was stimulated by GH-releasing hormone and inhibited by somatostatin, when analyzed by two-dimensional polyacrylamide gel electrophoresis. Three of these peptides (no. 23, 24, and 25) were not rGH immunoreactive. In order to determine whether these three peptides are fragments, degradation products or posttranscriptionally modified forms of rGH, rGH and peptide no. 23 were characterized structurally. From partial peptide maps of rGH and peptide no. 23 by V8 protease or chymotrypsin, it appeared that these peptides were not related to each other. By N-terminal microsequencing of two-dimensional polyacrylamide gel electrophoresis purified peptide, we have obtained the sequence of 24 N-terminal amino acid residues of peptide no. 23. This sequence has no significant homology with rGH or any other reported protein sequence. Antiserum was generated against a synthetic oligopeptide corresponding to amino acid residues 3-24 of peptide no. 23. The antiserum cross-reacted with peptides no. 23, 24, and 25 upon Western blot analysis. These results indicate that peptide no. 23 has a novel structure unrelated to other pituitary hormones. Since its secretion is influenced by GH-releasing hormone and somatostatin, peptide no. 23 may represent a previously unrecognized structurally unique growth factor.
Context-Sensitive Markov Models for Peptide Scoring and Identification from Tandem Mass Spectrometry

PubMed Central

Grover, Himanshu; Wallstrom, Garrick; Wu, Christine C.

2013-01-01

Abstract Peptide and protein identification via tandem mass spectrometry (MS/MS) lies at the heart of proteomic characterization of biological samples. Several algorithms are able to search, score, and assign peptides to large MS/MS datasets. Most popular methods, however, underutilize the intensity information available in the tandem mass spectrum due to the complex nature of the peptide fragmentation process, thus contributing to loss of potential identifications. We present a novel probabilistic scoring algorithm called Context-Sensitive Peptide Identification (CSPI) based on highly flexible Input-Output Hidden Markov Models (IO-HMM) that capture the influence of peptide physicochemical properties on their observed MS/MS spectra. We use several local and global properties of peptides and their fragment ions from literature. Comparison with two popular algorithms, Crux (re-implementation of SEQUEST) and X!Tandem, on multiple datasets of varying complexity, shows that peptide identification scores from our models are able to achieve greater discrimination between true and false peptides, identifying up to ∼25% more peptides at a False Discovery Rate (FDR) of 1%. We evaluated two alternative normalization schemes for fragment ion-intensities, a global rank-based and a local window-based. Our results indicate the importance of appropriate normalization methods for learning superior models. Further, combining our scores with Crux using a state-of-the-art procedure, Percolator, we demonstrate the utility of using scoring features from intensity-based models, identifying ∼4-8 % additional identifications over Percolator at 1% FDR. IO-HMMs offer a scalable and flexible framework with several modeling choices to learn complex patterns embedded in MS/MS data. PMID:23289783
De novo peptide sequencing by deep learning

PubMed Central

Tran, Ngoc Hieu; Zhang, Xianglilan; Xin, Lei; Shan, Baozhen; Li, Ming

2017-01-01

De novo peptide sequencing from tandem MS data is the key technology in proteomics for the characterization of proteins, especially for new sequences, such as mAbs. In this study, we propose a deep neural network model, DeepNovo, for de novo peptide sequencing. DeepNovo architecture combines recent advances in convolutional neural networks and recurrent neural networks to learn features of tandem mass spectra, fragment ions, and sequence patterns of peptides. The networks are further integrated with local dynamic programming to solve the complex optimization task of de novo sequencing. We evaluated the method on a wide variety of species and found that DeepNovo considerably outperformed state of the art methods, achieving 7.7–22.9% higher accuracy at the amino acid level and 38.1–64.0% higher accuracy at the peptide level. We further used DeepNovo to automatically reconstruct the complete sequences of antibody light and heavy chains of mouse, achieving 97.5–100% coverage and 97.2–99.5% accuracy, without assisting databases. Moreover, DeepNovo is retrainable to adapt to any sources of data and provides a complete end-to-end training and prediction solution to the de novo sequencing problem. Not only does our study extend the deep learning revolution to a new field, but it also shows an innovative approach in solving optimization problems by using deep learning and dynamic programming. PMID:28720701
Design of a shear-thinning recoverable peptide hydrogel from native sequences and application for influenza H1N1 vaccine adjuvant

USDA-ARS?s Scientific Manuscript database

Peptide hydrogels are considered injectable materials for drug delivery and tissue engineering applications. Most published hydrogel-forming sequences contain either alternating-charged and noncharged residues or amphiphilic blocks. Here, we report a self-assembling peptide, h9e (FLIVIGSIIGPGGDGPGGD...
Plastid-targeting peptides from the chlorarachniophyte Bigelowiella natans.

PubMed

Rogers, Matthew B; Archibald, John M; Field, Matthew A; Li, Catherine; Striepen, Boris; Keeling, Patrick J

2004-01-01

Chlorarachniophytes are marine amoeboflagellate protists that have acquired their plastid (chloroplast) through secondary endosymbiosis with a green alga. Like other algae, most of the proteins necessary for plastid function are encoded in the nuclear genome of the secondary host. These proteins are targeted to the organelle using a bipartite leader sequence consisting of a signal peptide (allowing entry in to the endomembrane system) and a chloroplast transit peptide (for transport across the chloroplast envelope membranes). We have examined the leader sequences from 45 full-length predicted plastid-targeted proteins from the chlorarachniophyte Bigelowiella natans with the goal of understanding important features of these sequences and possible conserved motifs. The chemical characteristics of these sequences were compared with a set of 10 B. natans endomembrane-targeted proteins and 38 cytosolic or nuclear proteins, which show that the signal peptides are similar to those of most other eukaryotes, while the transit peptides differ from those of other algae in some characteristics. Consistent with this, the leader sequence from one B. natans protein was tested for function in the apicomplexan parasite, Toxoplasma gondii, and shown to direct the secretion of the protein.
Computer-based prediction of mitochondria-targeting peptides.

PubMed

Martelli, Pier Luigi; Savojardo, Castrense; Fariselli, Piero; Tasco, Gianluca; Casadio, Rita

2015-01-01

Computational methods are invaluable when protein sequences, directly derived from genomic data, need functional and structural annotation. Subcellular localization is a feature necessary for understanding the protein role and the compartment where the mature protein is active and very difficult to characterize experimentally. Mitochondrial proteins encoded on the cytosolic ribosomes carry specific patterns in the precursor sequence from where it is possible to recognize a peptide targeting the protein to its final destination. Here we discuss to which extent it is feasible to develop computational methods for detecting mitochondrial targeting peptides in the precursor sequences and benchmark our and other methods on the human mitochondrial proteins endowed with experimentally characterized targeting peptides. Furthermore, we illustrate our newly implemented web server and its usage on the whole human proteome in order to infer mitochondrial targeting peptides, their cleavage sites, and whether the targeting peptide regions contain or not arginine-rich recurrent motifs. By this, we add some other 2,800 human proteins to the 124 ones already experimentally annotated with a mitochondrial targeting peptide.
InverPep: A database of invertebrate antimicrobial peptides.

PubMed

Gómez, Esteban A; Giraldo, Paula; Orduz, Sergio

2017-03-01

The aim of this work was to construct InverPep, a database specialised in experimentally validated antimicrobial peptides (AMPs) from invertebrates. AMP data contained in InverPep were manually curated from other databases and the scientific literature. MySQL was integrated with the development platform Laravel; this framework allows to integrate programming in PHP with HTML and was used to design the InverPep web page's interface. InverPep contains 18 separated fields, including InverPep code, phylum and species source, peptide name, sequence, peptide length, secondary structure, molar mass, charge, isoelectric point, hydrophobicity, Boman index, aliphatic index and percentage of hydrophobic amino acids. CALCAMPI, an algorithm to calculate the physicochemical properties of multiple peptides simultaneously, was programmed in PERL language. To date, InverPep contains 702 experimentally validated AMPs from invertebrate species. All of the peptides contain information associated with their source, physicochemical properties, secondary structure, biological activity and links to external literature. Most AMPs in InverPep have a length between 10 and 50 amino acids, a positive charge, a Boman index between 0 and 2 kcal/mol, and 30-50% hydrophobic amino acids. InverPep includes 33 AMPs not reported in other databases. Besides, CALCAMPI and statistical analysis of InverPep data is presented. The InverPep database is available in English and Spanish. InverPep is a useful database to study invertebrate AMPs and its information could be used for the design of new peptides. The user-friendly interface of InverPep and its information can be freely accessed via a web-based browser at http://ciencias.medellin.unal.edu.co/gruposdeinvestigacion/prospeccionydisenobiomoleculas/InverPep/public/home_en. Copyright © 2016 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.
Identification of conserved and HLA-A*2402-restricted epitopes in Dengue virus serotype 2.

PubMed

Duan, Zhi-Liang; Liu, Hui-Fang; Huang, Xi; Wang, Si-Na; Yang, Jin-Lin; Chen, Xin-Yu; Li, De-Zhou; Zhong, Xiao-Zhi; Chen, Bo-Kun; Wen, Jin-Sheng

2015-01-22

In this study, we set out to identify dengue virus serotype 2 (DENV-2)-specific HLA-A*2402-restricted epitopes and determine the characteristics of T cells generated to these epitopes. We screened the full-length amino-acid sequence of DENV-2 to find potential epitopes using the SYFPEITHI algorithm. Twelve putative HLA-A*2402-binding peptides conserved in hundreds of DENV-2 strains were synthesized, and the HLA restriction of peptides was tested in HLA-A*2402 transgenic mice. Nine peptides (NS4b(228-237), NS2a(73-81), E(298-306), M(141-149), NS4a(96-105), NS4b(159-168), NS5(475-484), NS1(162-171), and NS5(611-620)) induced high levels of peptide-specific IFN-γ-secreting cells in HLA-A*2402 transgenic mice. Apart from IFN-γ, NS4b(228-237-), NS2a(73-81-) and E(298-306)-specific CD8(+) cells produced TNF-α and IL-6 simultaneously, whereas M(141-149-) and NS5(475-484-) CD8(+) cells produced only IL-6. Moreover, splenic mononuclear cells (SMCs) efficiently recognized and killed peptide-pulsed splenocytes. Furthermore, each of nine peptides could be recognized by splenocytes from DENV-2-infected HLA-A*2402 transgenic mice. The SMCs from HLA-A*2402 transgenic mice immunized with nine immunogenic peptides efficiently killed DENV-2-infected splenic monocytes. The present identified epitopes have the potential to be new diagnostic tools for characterization of T-cell immunity in DENV infection and may serve as part of a universal epitope-based vaccine. Copyright © 2014 Elsevier B.V. All rights reserved.
Cell density signal protein suitable for treatment of connective tissue injuries and defects

DOEpatents

Schwarz, Richard I.

2002-08-13

Identification, isolation and partial sequencing of a cell density protein produced by fibroblastic cells. The cell density signal protein comprising a 14 amino acid peptide or a fragment, variant, mutant or analog thereof, the deduced cDNA sequence from the 14 amino acid peptide, a recombinant protein, protein and peptide-specific antibodies, and the use of the peptide and peptide-specific antibodies as therapeutic agents for regulation of cell differentiation and proliferation. A method for treatment and repair of connective tissue and tendon injuries, collagen deficiency, and connective tissue defects.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Enciso, Marta, E-mail: m.enciso@latrobe.edu.au; Schütte, Christof, E-mail: schuette@zib.de; Zuse Institute Berlin, Berlin

We employ a recently developed coarse-grained model for peptides and proteins where the effect of pH is automatically included. We explore the effect of pH in the aggregation process of the amyloidogenic peptide KTVIIE and two related sequences, using three different pH environments. Simulations using large systems (24 peptides chains per box) allow us to describe the formation of realistic peptide aggregates. We evaluate the thermodynamic and kinetic implications of changes in sequence and pH upon peptide aggregation, and we discuss how a minimalistic coarse-grained model can account for these details.
Enhancement of the Enterocin CRL35 Activity by a Synthetic Peptide Derived from the NH2-Terminal Sequence

PubMed Central

Saavedra, Lucila; Minahk, Carlos; de Ruiz Holgado, Aída P.; Sesma, Fernando

2004-01-01

The enterocin CRL35 biosynthetic gene cluster was cloned and sequenced. The sequence was revealed to be highly identical to that of the mundticin KS gene cluster (S. Kawamoto, J. Shima, R. Sato, T. Eguchi, S. Ohmomo, J. Shibato, N. Horikoshi, K. Takeshita, and T. Sameshima, Appl. Environ. Microbiol. 68:3830-3840, 2002). Short synthetic peptides were designed based on the bacteriocin sequence and were evaluated in antimicrobial competitive assays. The peptide KYYGNGVSCNKKGCS produced an enhancement of enterocin CRL35 antimicrobial activity in a buffer system. PMID:15215149
HPEPDOCK: a web server for blind peptide-protein docking based on a hierarchical algorithm.

PubMed

Zhou, Pei; Jin, Bowen; Li, Hao; Huang, Sheng-You

2018-05-09

Protein-peptide interactions are crucial in many cellular functions. Therefore, determining the structure of protein-peptide complexes is important for understanding the molecular mechanism of related biological processes and developing peptide drugs. HPEPDOCK is a novel web server for blind protein-peptide docking through a hierarchical algorithm. Instead of running lengthy simulations to refine peptide conformations, HPEPDOCK considers the peptide flexibility through an ensemble of peptide conformations generated by our MODPEP program. For blind global peptide docking, HPEPDOCK obtained a success rate of 33.3% in binding mode prediction on a benchmark of 57 unbound cases when the top 10 models were considered, compared to 21.1% for pepATTRACT server. HPEPDOCK also performed well in docking against homology models and obtained a success rate of 29.8% within top 10 predictions. For local peptide docking, HPEPDOCK achieved a high success rate of 72.6% on a benchmark of 62 unbound cases within top 10 predictions, compared to 45.2% for HADDOCK peptide protocol. Our HPEPDOCK server is computationally efficient and consumed an average of 29.8 mins for a global peptide docking job and 14.2 mins for a local peptide docking job. The HPEPDOCK web server is available at http://huanglab.phys.hust.edu.cn/hpepdock/.
Identification, Characterization, and Recombinant Expression of Epidermicin NI01, a Novel Unmodified Bacteriocin Produced by Staphylococcus epidermidis That Displays Potent Activity against Staphylococci

PubMed Central

Sandiford, Stephanie

2012-01-01

We describe the discovery, purification, characterization, and expression of an antimicrobial peptide, epidermicin NI01, which is an unmodified bacteriocin produced by Staphylococcus epidermidis strain 224. It is a highly cationic, hydrophobic, plasmid-encoded peptide that exhibits potent antimicrobial activity toward a wide range of pathogenic Gram-positive bacteria including methicillin-resistant Staphylococcus aureus (MRSA), enterococci, and biofilm-forming S. epidermidis strains. Purification of the peptide was achieved using a combination of hydrophobic interaction, cation exchange, and high-performance liquid chromatography (HPLC). Matrix-assisted laser desorption ionization–time of flight (MALDI-TOF) analysis yielded a molecular mass of 6,074 Da, and partial sequence data of the peptide were elucidated using a combination of tandem mass spectrometry (MS/MS) and de novo sequencing. The draft genome sequence of the producing strain was obtained using 454 pyrosequencing technology, thus enabling the identification of the structural gene using the de novo peptide sequence data previously obtained. Epidermicin NI01 contains 51 residues with four tryptophan and nine lysine residues, and the sequence showed approximately 50% identity to peptides lacticin Z, lacticin Q, and aureocin A53, all of which belong to a new family of unmodified type II-like bacteriocins. The peptide is active in the nanomolar range against S. epidermidis, MRSA isolates, and vancomycin-resistant enterococci. Other unique features displayed by epidermicin include a high degree of protease stability and the ability to retain antimicrobial activity over a pH range of 2 to 10, and exposure to the peptide does not result in development of resistance in susceptible isolates. In this study we also show the structural gene alone can be cloned into Escherichia coli strain BL21(DE3), and expression yields active peptide. PMID:22155816
Thermodynamic properties of solvated peptides from selective integrated tempering sampling with a new weighting factor estimation algorithm

NASA Astrophysics Data System (ADS)

Shen, Lin; Xie, Liangxu; Yang, Mingjun

2017-04-01

Conformational sampling under rugged energy landscape is always a challenge in computer simulations. The recently developed integrated tempering sampling, together with its selective variant (SITS), emerges to be a powerful tool in exploring the free energy landscape or functional motions of various systems. The estimation of weighting factors constitutes a critical step in these methods and requires accurate calculation of partition function ratio between different thermodynamic states. In this work, we propose a new adaptive update algorithm to compute the weighting factors based on the weighted histogram analysis method (WHAM). The adaptive-WHAM algorithm with SITS is then applied to study the thermodynamic properties of several representative peptide systems solvated in an explicit water box. The performance of the new algorithm is validated in simulations of these solvated peptide systems. We anticipate more applications of this coupled optimisation and production algorithm to other complicated systems such as the biochemical reactions in solution.
[Cytotoxicity of chimera peptides incorporating sequences of cyclin kinases inhibitors].

PubMed

Kharchenko, V P; Kulinich, V G; Lunin, V G; Filiasova, E I; Shishkin, A M; Sergeenko, O V; Riazanova, E M; Voronina, O L; Bozhenko, V K

2007-01-01

The study is concerned with proapoptotic properties of chimera peptides which incorporate sequences of inhibitors of cyclin kinases p161NK4a and p21CIP/WAF1 as well as internalized sequences (Antp and tat). Sequences of the p16 type appeared to be more cytotoxic than the p21 one. Cytotoxic effect proved dependent on orientation with respect to the C or N terminal point of a polypeptide chain rather than on chimera sequence extent. Although p16 endogenous synthesis did not influence chimera peptide levels, apoptosis did not take place in certain cellular lines. Due to the rather unsophisticated nature of such synthesis, it might be used in designing individually-tailored chemotherapeutic drugs.
Integration of surface-active, periodically sequenced peptides into lipid-based microbubbles.

PubMed

Badami, Joseph V; Desir, Pierre; Tu, Raymond S

2014-07-29

The development of microbubbles toward functional, "theranostic" particles requires the incorporation of constituents with high binding specificity and therapeutic efficacy. Integrating peptides or proteins into the shell of lipid-based microbubbles can provide a means to access both receptor-ligand interactions and therapeutic properties. Simultaneously, peptides or proteins can define the characteristic monolayer mechanics of lipid bubbles and eliminate the need for post-bubble generation modification. The ability to engineer peptide sequences de novo that effectively partition into the bubble monolayer remains parametrically daunting. This work contributes to this effort using two simple amphipathic helical peptides that examine the role of local electrostatics and secondary structure. The two periodically sequenced peptides both have three positive charges, but peptide "K-2.5" spaces those charges 2.5 amino acids apart, while peptide "K-6.0" spaces the charges six amino acids apart. Size populations were determined for bubbles containing each peptide species using light scattering, and a quantitative method was developed to clearly define the fraction of peptides binding onto the microbubble monolayer. The impact of both the initial peptide concentration and the zwitterionic:anionic lipid ratio on peptide binding was also evaluated. Our results indicate that the lipid ratio affected only K-6.0 binding, which appears to be an outcome of the greater ensemble average α-helical population of the K-6.0. These findings provide further insights into the role of charge separation on peptide secondary structure, establishing a simple design metric for peptide binding onto microbubble systems.
A cardioactive peptide from the southern armyworm, Spodoptera eridania.

PubMed

Furuya, K; Hackett, M; Cirelli, M A; Schegg, K M; Wang, H; Shabanowitz, J; Hunt, D F; Schooley, D A

1999-01-01

A cardioactive peptide was isolated from extracts of whole heads of the southern armyworm, Spodoptera eridania. This peptide has the sequence ENFAVGCTPGYQRTADGRCKPTF (Mr = 2516.8), determined from both Edman sequencing and tandem mass spectrometry in combination with off-line micropreparative capillary liquid chromatography. This peptide, termed Spoer-CAP23, has excitatory effects on a semi-isolated heart from larval Manduca sexta, causing an inotropic effect at low concentrations of peptide and chronotropic and inotropic effects at high doses. The threshold concentration for stimulatory effects of the synthetic peptide on the semi-isolated heart was about 1 nM, suggesting a physiological role as a neuropeptide.
Enhanced pulmonary absorption of a macromolecule through coupling to a sequence-specific phage display-derived peptide.

PubMed

Morris, Christopher J; Smith, Mathew W; Griffiths, Peter C; McKeown, Neil B; Gumbleton, Mark

2011-04-10

With the aim of identifying a peptide sequence that promotes pulmonary epithelial transport of macromolecule cargo we used a stringent peptide-phage display library screening protocol against rat lung alveolar epithelial primary cell cultures. We identified a peptide-phage clone (LTP-1) displaying the disulphide-constrained 7-mer peptide sequence, C-TSGTHPR-C, that showed significant pulmonary epithelial translocation across highly restrictive polarised cell monolayers. Cell biological data supported a differential alveolar epithelial cell interaction of the LTP-1 peptide-phage clone and the corresponding free synthetic LTP-1 peptide. Delivering select phage-clones to the intact pulmonary barrier of an isolated perfused rat lung (IPRL) resulted in 8.7% of lung deposited LTP-1 peptide-phage clone transported from the IPRL airways to the vasculature compared (p<0.05) to the cumulative transport of less than 0.004% for control phage-clone groups. To characterise phage-independent activity of LTP-1 peptide, the LTP-1 peptide was conjugated to a 53kDa anionic PAMAM dendrimer. Compared to respective peptide-dendrimer control conjugates, the LTP-1-PAMAM conjugate displayed a two-fold (bioavailability up to 31%) greater extent of absorption in the IPRL. The LTP-1 peptide-mediated enhancement of transport, when LTP-1 was either attached to the phage clone or conjugated to dendrimer, was sequence-dependent and could be competitively inhibited by co-instillation of excess synthetic free LTP-1 peptide. The specific nature of the target receptor or mechanism involved in LTP-1 lung transport remains unclear although the enhanced transport is enabled through a mechanism that is non-disruptive with respect to the pulmonary transport of hydrophilic permeability probes. This study shows proof-of principle that array technologies can be effectively exploited to identify peptides mediating enhanced transmucosal delivery of macromolecule therapeutics across an intact organ. Copyright © 2010 Elsevier B.V. All rights reserved.
Sequence Elucidation of an Unknown Cyclic Peptide of High Doping Potential by ETD and CID Tandem Mass Spectrometry

NASA Astrophysics Data System (ADS)

Guan, Fuyu; Uboh, Cornelius E.; Soma, Lawrence R.; Rudy, Jeffrey

2011-04-01

Identification of an unknown substance without any information remains a daunting challenge despite advances in chemistry and mass spectrometry. However, an unknown cyclic peptide in a sample with very limited volume seized at a Pennsylvania racetrack has been successfully identified. The unknown sample was determined by accurate mass measurements to contain a small unknown peptide as the major component. Collision-induced dissociation (CID) of the unknown peptide revealed the presence of Lys (not Gln, by accurate mass), Phe, and Arg residues, and absence of any y-type product ion. The latter, together with the tryptic digestion results of the unusual deamidation and absence of any tryptic cleavage, suggests a cyclic structure for the peptide. Electron-transfer dissociation (ETD) of the unknown peptide indicated the presence of Gln (not Lys, by the unusual deamidation), Phe, and Arg residues and their connectivity. After all the results were pieced together, a cyclic tetrapeptide, cyclo[Arg-Lys-N(C6H9)Gln-Phe], is proposed for the unknown peptide. Observations of different amino acid residues from CID and ETD experiments for the peptide were interpreted by a fragmentation pathway proposed, as was preferential CID loss of a Lys residue from the peptide. ETD was used for the first time in sequencing of a cyclic peptide; product ions resulting from ETD of the peptide identified were categorized into two types and named pseudo-b and pseudo-z ions that are important for sequencing of cyclic peptides. The ETD product ions were interpreted by fragmentation pathways proposed. Additionally, multi-stage CID mass spectrometry cannot provide complete sequence information for cyclic peptides containing adjacent Arg and Lys residues. The identified cyclic peptide has not been documented in the literature, its pharmacological effects are unknown, but it might be a "designer" drug with athletic performance-enhancing effects.

Xilmass: A New Approach toward the Identification of Cross-Linked Peptides.

PubMed

Yılmaz, Şule; Drepper, Friedel; Hulstaert, Niels; Černič, Maša; Gevaert, Kris; Economou, Anastassios; Warscheid, Bettina; Martens, Lennart; Vandermarliere, Elien

2016-10-18

Chemical cross-linking coupled with mass spectrometry plays an important role in unravelling protein interactions, especially weak and transient ones. Moreover, cross-linking complements several structural determination approaches such as cryo-EM. Although several computational approaches are available for the annotation of spectra obtained from cross-linked peptides, there remains room for improvement. Here, we present Xilmass, a novel algorithm to identify cross-linked peptides that introduces two new concepts: (i) the cross-linked peptides are represented in the search database such that the cross-linking sites are explicitly encoded, and (ii) the scoring function derived from the Andromeda algorithm was adapted to score against a theoretical tandem mass spectrometry (MS/MS) spectrum that contains the peaks from all possible fragment ions of a cross-linked peptide pair. The performance of Xilmass was evaluated against the recently published Kojak and the popular pLink algorithms on a calmodulin-plectin complex data set, as well as three additional, published data sets. The results show that Xilmass typically had the highest number of identified distinct cross-linked sites and also the highest number of predicted cross-linked sites.
Growth-active peptides are produced from alpha-lactalbumin and lysozyme.

PubMed

Kanda, Yoshikazu; Hisayasu, Sanae; Abe, Yasuko; Katsura, Kenichiro; Mashimo, Keico

2007-07-19

We determined the growth-active domains of milk-growth factor (MGF), human alpha-lactalbumin (HMLA) and human lysozyme (HMLZ), and their sequences. Fetal calf serum (FCS) showed inhibitors against proteases. The growth-stimulation of IMR90 cells in CG medium (free-serum) without FCS was induced in a dose-dependent manner up to 400 ng/ml of HMLA, HMLZ or chicken lysozyme (ChLZ), and also in a time-dependent manner until 48 h but was induced gradually until 1000 ng/ml of bovine alpha-lactalbumin (BVLA). The HMLAL6-peptide (HMLAL6), a cleaved product from HMLA by Endpeptidase Lys C, was growth-stimulative. The sequence of HMLAL6 was matched to 35 amino-acid residues (from No. 59 to No. 93 of HMLA), owing to the sequences of HMLAL6R3, HMLAL6R5 and HMLAL6R7 after the reduction of HMLAL6. The sequences of the reduced peptides from MGFL7-peptide (MGFL7: a cleaved product from MGF by Endpeptidase lysine C matched to those of the peptides from HMLAL6, and were similarly identified as the partial sequence of HMLA (59-93, H(2)N-L.W.C.?.K./S.S.Q.V.P.Q.S.R.N.I.?.D.I.S.?.D.K./F.L. D.D.D.I.T.D.D.I.M.?.A.-COOH). The sequence of HMLZ is similar to that of HMLA. HMLZT7-peptide (HMLZT7), a cleaved product of HMLZ by trypsin, was confirmed to have growth-stimulating activity and it's sequence was partially identified as Y. W.?.N.D.G.K.T.P.G.A.V.N.A.?.H.L. -, owing to the results of HMLZT7R1 (reduction of HMLZT7) and HMLZA7R2 (reduction of HMLZA7-peptide (HMLZA7) cleaved product of HMLZ by Endpeptidase Arg C) and is accordingly the sequence from No. 63 to No. 97 of HMLZ. Therefore, the peptides produced from LA and LZ by proteolysis may play a role of growth-stimulation.
Amino acid sequences of peptides from a tryptic digest of a urea-soluble protein fraction (U.S.3) from oxidized wool

PubMed Central

Corfield, M. C.; Fletcher, J. C.; Robson, A.

1967-01-01

1. A tryptic digest of the protein fraction U.S.3 from oxidized wool has been separated into 32 peptide fractions by cation-exchange resin chromatography. 2. Most of these fractions have been resolved into their component peptides by a combination of the techniques of cation-exchange resin chromatography, paper chromatography and paper electrophoresis. 3. The amino acid compositions of 58 of the peptides in the digest present in the largest amounts have been determined. 4. The amino acid sequences of 38 of these have been completely elucidated and those of six others partially derived. 5. These findings indicate that the parent protein in wool from which the protein fraction U.S.3 is derived has a minimum molecular weight of 74000. 6. The structures of wool proteins are discussed in the light of the peptide sequences determined, and, in particular, of those sequences in fraction U.S.3 that could not be elucidated. PMID:16742497
Structure stability of lytic peptides during their interactions with lipid bilayers.

PubMed

Chen, H M; Lee, C H

2001-10-01

In this work, molecular dynamics simulations were used to examine the consequences of a variety of analogs of cecropin A on lipid bilayers. Analog sequences were constructed by replacing either the N- or C-terminal helix with the other helix in native or reverse sequence order, by making palindromic peptides based on both the N- and C-terminal helices, and by deleting the hinge region. The structure of the peptides was monitored throughout the simulation. The hinge region appeared not to assist in maintaining helical structure but help in motion flexibility. In general, the N-terminal helix of peptides was less stable than the C-terminal one during the interaction with anionic lipid bilayers. Sequences with hydrophobic helices tended to regain helical structure after an initial loss while sequences with amphipathic helices were less able to do this. The results suggests that hydrophobic design peptides have a high structural stability in an anionic membrane and are the candidates for experimental investigation.
Draft genome sequence of Streptomyces sp. strain SS, which produces a series of uridyl peptide antibiotic sansanmycins.

PubMed

Wang, Lifei; Xie, Yunying; Li, Qinglian; He, Ning; Yao, Entai; Xu, Hongzhang; Yu, Ying; Chen, Ruxian; Hong, Bin

2012-12-01

Streptomyces sp. SS produces a series of uridyl peptide antibiotic sansanmycins. Here, we present a draft genome sequence of Streptomyces sp. SS containing the biosynthetic gene cluster for the antibiotics. The identification of the biosynthetic gene cluster of sansanmycins may provide further insight into biosynthetic mechanisms for uridyl peptide antibiotics.
Characteristics common to a cytokine family spanning five orders of insects.

PubMed

Matsumoto, Hitoshi; Tsuzuki, Seiji; Date-Ito, Atsuko; Ohnishi, Atsushi; Hayakawa, Yoichi

2012-06-01

Growth-blocking peptide (GBP) is a member of an insect cytokine family with diverse functions including growth and immunity controls. Members of this cytokine family have been reported in 15 species of Lepidoptera, and we have recently identified GBP-like peptides in Diptera such as Lucilia cuprina and Drosophila melanogaster, indicating that this peptide family is not specific to Lepidoptera. In order to extend our knowledge of this peptide family, we purified the same family peptide from one of the tenebrionids, Zophobas atratus,(1) isolated its cDNA, and sequenced it. The Z. atratus GBP sequence together with reported sequence data of peptides from the same family enabled us to perform BLAST searches against EST and genome databases of several insect species including Coleoptera, Diptera, Hymenoptera, and Hemiptera and identify homologous peptide genes. Here we report conserved structural features in these sequence data. They consist of 19-30 amino acid residues encoded at the C terminus of a 73-152 amino acid precursor and contain the motif C-x(2)-G-x(4,6)-G-x(1,2)-C-[KR], which shares a certain similarity with the motif in the mammalian EGF peptide family. These data indicate that these small cytokines belonging to one family are present in at least five insect orders. Copyright © 2012 Elsevier Ltd. All rights reserved.
RNAcode: Robust discrimination of coding and noncoding regions in comparative sequence data

PubMed Central

Washietl, Stefan; Findeiß, Sven; Müller, Stephan A.; Kalkhof, Stefan; von Bergen, Martin; Hofacker, Ivo L.; Stadler, Peter F.; Goldman, Nick

2011-01-01

With the availability of genome-wide transcription data and massive comparative sequencing, the discrimination of coding from noncoding RNAs and the assessment of coding potential in evolutionarily conserved regions arose as a core analysis task. Here we present RNAcode, a program to detect coding regions in multiple sequence alignments that is optimized for emerging applications not covered by current protein gene-finding software. Our algorithm combines information from nucleotide substitution and gap patterns in a unified framework and also deals with real-life issues such as alignment and sequencing errors. It uses an explicit statistical model with no machine learning component and can therefore be applied “out of the box,” without any training, to data from all domains of life. We describe the RNAcode method and apply it in combination with mass spectrometry experiments to predict and confirm seven novel short peptides in Escherichia coli and to analyze the coding potential of RNAs previously annotated as “noncoding.” RNAcode is open source software and available for all major platforms at http://wash.github.com/rnacode. PMID:21357752
RNAcode: robust discrimination of coding and noncoding regions in comparative sequence data.

PubMed

Washietl, Stefan; Findeiss, Sven; Müller, Stephan A; Kalkhof, Stefan; von Bergen, Martin; Hofacker, Ivo L; Stadler, Peter F; Goldman, Nick

2011-04-01

With the availability of genome-wide transcription data and massive comparative sequencing, the discrimination of coding from noncoding RNAs and the assessment of coding potential in evolutionarily conserved regions arose as a core analysis task. Here we present RNAcode, a program to detect coding regions in multiple sequence alignments that is optimized for emerging applications not covered by current protein gene-finding software. Our algorithm combines information from nucleotide substitution and gap patterns in a unified framework and also deals with real-life issues such as alignment and sequencing errors. It uses an explicit statistical model with no machine learning component and can therefore be applied "out of the box," without any training, to data from all domains of life. We describe the RNAcode method and apply it in combination with mass spectrometry experiments to predict and confirm seven novel short peptides in Escherichia coli and to analyze the coding potential of RNAs previously annotated as "noncoding." RNAcode is open source software and available for all major platforms at http://wash.github.com/rnacode.
Comparison and Evaluation of Clustering Algorithms for Tandem Mass Spectra.

PubMed

Rieder, Vera; Schork, Karin U; Kerschke, Laura; Blank-Landeshammer, Bernhard; Sickmann, Albert; Rahnenführer, Jörg

2017-11-03

In proteomics, liquid chromatography-tandem mass spectrometry (LC-MS/MS) is established for identifying peptides and proteins. Duplicated spectra, that is, multiple spectra of the same peptide, occur both in single MS/MS runs and in large spectral libraries. Clustering tandem mass spectra is used to find consensus spectra, with manifold applications. First, it speeds up database searches, as performed for instance by Mascot. Second, it helps to identify novel peptides across species. Third, it is used for quality control to detect wrongly annotated spectra. We compare different clustering algorithms based on the cosine distance between spectra. CAST, MS-Cluster, and PRIDE Cluster are popular algorithms to cluster tandem mass spectra. We add well-known algorithms for large data sets, hierarchical clustering, DBSCAN, and connected components of a graph, as well as the new method N-Cluster. All algorithms are evaluated on real data with varied parameter settings. Cluster results are compared with each other and with peptide annotations based on validation measures such as purity. Quality control, regarding the detection of wrongly (un)annotated spectra, is discussed for exemplary resulting clusters. N-Cluster proves to be highly competitive. All clustering results benefit from the so-called DISMS2 filter that integrates additional information, for example, on precursor mass.
Doubling down on phosphorylation as a variable peptide modification.

PubMed

Cooper, Bret

2016-09-01

Some mass spectrometrists believe that searching for variable PTMs like phosphorylation of serine or threonine when using database-search algorithms to interpret peptide tandem mass spectra will increase false-positive matching. The basis for this is the premise that the algorithm compares a spectrum to both a nonphosphorylated peptide candidate and a phosphorylated candidate, which is double the number of candidates compared to a search with no possible phosphorylation. Hence, if the search space doubles, false-positive matching could increase accordingly as the algorithm considers more candidates to which false matches could be made. In this study, it is shown that the search for variable phosphoserine and phosphothreonine modifications does not always double the search space or unduly impinge upon the FDR. A breakdown of how one popular database-search algorithm deals with variable phosphorylation is presented. Published 2016. This article is a U.S. Government work and is in the public domain in the USA.
PredSTP: a highly accurate SVM based model to predict sequential cystine stabilized peptides.

PubMed

Islam, S M Ashiqul; Sajed, Tanvir; Kearney, Christopher Michel; Baker, Erich J

2015-07-05

Numerous organisms have evolved a wide range of toxic peptides for self-defense and predation. Their effective interstitial and macro-environmental use requires energetic and structural stability. One successful group of these peptides includes a tri-disulfide domain arrangement that offers toxicity and high stability. Sequential tri-disulfide connectivity variants create highly compact disulfide folds capable of withstanding a variety of environmental stresses. Their combination of toxicity and stability make these peptides remarkably valuable for their potential as bio-insecticides, antimicrobial peptides and peptide drug candidates. However, the wide sequence variation, sources and modalities of group members impose serious limitations on our ability to rapidly identify potential members. As a result, there is a need for automated high-throughput member classification approaches that leverage their demonstrated tertiary and functional homology. We developed an SVM-based model to predict sequential tri-disulfide peptide (STP) toxins from peptide sequences. One optimized model, called PredSTP, predicted STPs from training set with sensitivity, specificity, precision, accuracy and a Matthews correlation coefficient of 94.86%, 94.11%, 84.31%, 94.30% and 0.86, respectively, using 200 fold cross validation. The same model outperforms existing prediction approaches in three independent out of sample testsets derived from PDB. PredSTP can accurately identify a wide range of cystine stabilized peptide toxins directly from sequences in a species-agnostic fashion. The ability to rapidly filter sequences for potential bioactive peptides can greatly compress the time between peptide identification and testing structural and functional properties for possible antimicrobial and insecticidal candidates. A web interface is freely available to predict STP toxins from http://crick.ecs.baylor.edu/.
Origin of anti-tumor activity of the cysteine-containing GO peptides and further optimization of their cytotoxic properties

NASA Astrophysics Data System (ADS)

Tyuryaeva, Irina I.; Lyublinskaya, Olga G.; Podkorytov, Ivan S.; Skrynnikov, Nikolai R.

2017-01-01

Antitumor GO peptides have been designed as dimerization inhibitors of prominent oncoprotein mucin 1. In this study we demonstrate that activity of GO peptides is independent of the level of cellular expression of mucin 1. Furthermore, these peptides prove to be broadly cytotoxic, causing cell death also in normal cells such as dermal fibroblasts and endometrial mesenchymal stem cells. To explore molecular mechanism of their cytotoxicity, we have designed and tested a number of new peptide sequences containing the key CxC or CxxC motifs. Of note, these sequences bear no similarity to mucin 1 except that they also contain a pair of proximal cysteines. Several of the new peptides turned out to be significantly more potent than their GO prototypes. The results suggest that cytotoxicity of these peptides stems from their (moderate) activity as disulfide oxidoreductases. It is expected that such peptides, which we have termed DO peptides, are involved in disulfide-dithiol exchange reaction, resulting in formation of adventitious disulfide bridges in cell proteins. In turn, this leads to a partial loss of protein function and rapid onset of apoptosis. We anticipate that coupling DO sequences with tumor-homing transduction domains can create a potentially valuable new class of tumoricidal peptides.
Origin of anti-tumor activity of the cysteine-containing GO peptides and further optimization of their cytotoxic properties

PubMed Central

Tyuryaeva, Irina I.; Lyublinskaya, Olga G.; Podkorytov, Ivan S.; Skrynnikov, Nikolai R.

2017-01-01

Antitumor GO peptides have been designed as dimerization inhibitors of prominent oncoprotein mucin 1. In this study we demonstrate that activity of GO peptides is independent of the level of cellular expression of mucin 1. Furthermore, these peptides prove to be broadly cytotoxic, causing cell death also in normal cells such as dermal fibroblasts and endometrial mesenchymal stem cells. To explore molecular mechanism of their cytotoxicity, we have designed and tested a number of new peptide sequences containing the key CxC or CxxC motifs. Of note, these sequences bear no similarity to mucin 1 except that they also contain a pair of proximal cysteines. Several of the new peptides turned out to be significantly more potent than their GO prototypes. The results suggest that cytotoxicity of these peptides stems from their (moderate) activity as disulfide oxidoreductases. It is expected that such peptides, which we have termed DO peptides, are involved in disulfide-dithiol exchange reaction, resulting in formation of adventitious disulfide bridges in cell proteins. In turn, this leads to a partial loss of protein function and rapid onset of apoptosis. We anticipate that coupling DO sequences with tumor-homing transduction domains can create a potentially valuable new class of tumoricidal peptides. PMID:28091523
Method for predicting peptide detection in mass spectrometry

DOEpatents

Kangas, Lars [West Richland, WA; Smith, Richard D [Richland, WA; Petritis, Konstantinos [Richland, WA

2010-07-13

A method of predicting whether a peptide present in a biological sample will be detected by analysis with a mass spectrometer. The method uses at least one mass spectrometer to perform repeated analysis of a sample containing peptides from proteins with known amino acids. The method then generates a data set of peptides identified as contained within the sample by the repeated analysis. The method then calculates the probability that a specific peptide in the data set was detected in the repeated analysis. The method then creates a plurality of vectors, where each vector has a plurality of dimensions, and each dimension represents a property of one or more of the amino acids present in each peptide and adjacent peptides in the data set. Using these vectors, the method then generates an algorithm from the plurality of vectors and the calculated probabilities that specific peptides in the data set were detected in the repeated analysis. The algorithm is thus capable of calculating the probability that a hypothetical peptide represented as a vector will be detected by a mass spectrometry based proteomic platform, given that the peptide is present in a sample introduced into a mass spectrometer.
Automated selected reaction monitoring software for accurate label-free protein quantification.

PubMed

Teleman, Johan; Karlsson, Christofer; Waldemarson, Sofia; Hansson, Karin; James, Peter; Malmström, Johan; Levander, Fredrik

2012-07-06

Selected reaction monitoring (SRM) is a mass spectrometry method with documented ability to quantify proteins accurately and reproducibly using labeled reference peptides. However, the use of labeled reference peptides becomes impractical if large numbers of peptides are targeted and when high flexibility is desired when selecting peptides. We have developed a label-free quantitative SRM workflow that relies on a new automated algorithm, Anubis, for accurate peak detection. Anubis efficiently removes interfering signals from contaminating peptides to estimate the true signal of the targeted peptides. We evaluated the algorithm on a published multisite data set and achieved results in line with manual data analysis. In complex peptide mixtures from whole proteome digests of Streptococcus pyogenes we achieved a technical variability across the entire proteome abundance range of 6.5-19.2%, which was considerably below the total variation across biological samples. Our results show that the label-free SRM workflow with automated data analysis is feasible for large-scale biological studies, opening up new possibilities for quantitative proteomics and systems biology.
LESSONS IN DE NOVO PEPTIDE SEQUENCING BY TANDEM MASS SPECTROMETRY

PubMed Central

Medzihradszky, Katalin F.; Chalkley, Robert J.

2015-01-01

Mass spectrometry has become the method of choice for the qualitative and quantitative characterization of protein mixtures isolated from all kinds of living organisms. The raw data in these studies are MS/MS spectra, usually of peptides produced by proteolytic digestion of a protein. These spectra are “translated” into peptide sequences, normally with the help of various search engines. Data acquisition and interpretation have both been automated, and most researchers look only at the summary of the identifications without ever viewing the underlying raw data used for assignments. Automated analysis of data is essential due to the volume produced. However, being familiar with the finer intricacies of peptide fragmentation processes, and experiencing the difficulties of manual data interpretation allow a researcher to be able to more critically evaluate key results, particularly because there are many known rules of peptide fragmentation that are not incorporated into search engine scoring. Since the most commonly used MS/MS activation method is collision-induced dissociation (CID), in this article we present a brief review of the history of peptide CID analysis. Next, we provide a detailed tutorial on how to determine peptide sequences from CID data. Although the focus of the tutorial is de novo sequencing, the lessons learned and resources supplied are useful for data interpretation in general. PMID:25667941
De Novo Design of Skin-Penetrating Peptides for Enhanced Transdermal Delivery of Peptide Drugs.

PubMed

Menegatti, Stefano; Zakrewsky, Michael; Kumar, Sunny; De Oliveira, Joshua Sanchez; Muraski, John A; Mitragotri, Samir

2016-03-09

Skin-penetrating peptides (SPPs) are attracting increasing attention as a non-invasive strategy for transdermal delivery of therapeutics. The identification of SPP sequences, however, currently performed by experimental screening of peptide libraries, is very laborious. Recent studies have shown that, to be effective enhancers, SPPs must possess affinity for both skin keratin and the drug of interest. We therefore developed a computational process for generating and screening virtual libraries of disulfide-cyclic peptides against keratin and cyclosporine A (CsA) to identify SPPs capable of enhancing transdermal CsA delivery. The selected sequences were experimentally tested and found to bind both CsA and keratin, as determined by mass spectrometry and affinity chromatography, and enhance transdermal permeation of CsA. Four heptameric sequences that emerged as leading candidates (ACSATLQHSCG, ACSLTVNWNCG, ACTSTGRNACG, and ACSASTNHNCG) were tested and yielded CsA permeation on par with previously identified SPP SPACE (TM) . An octameric peptide (ACNAHQARSTCG) yielded significantly higher delivery of CsA compared to heptameric SPPs. The safety profile of the selected sequences was also validated by incubation with skin keratinocytes. This method thus represents an effective procedure for the de novo design of skin-penetrating peptides for the delivery of desired therapeutic or cosmetic agents. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
GibbsCluster: unsupervised clustering and alignment of peptide sequences.

PubMed

Andreatta, Massimo; Alvarez, Bruno; Nielsen, Morten

2017-07-03

Receptor interactions with short linear peptide fragments (ligands) are at the base of many biological signaling processes. Conserved and information-rich amino acid patterns, commonly called sequence motifs, shape and regulate these interactions. Because of the properties of a receptor-ligand system or of the assay used to interrogate it, experimental data often contain multiple sequence motifs. GibbsCluster is a powerful tool for unsupervised motif discovery because it can simultaneously cluster and align peptide data. The GibbsCluster 2.0 presented here is an improved version incorporating insertion and deletions accounting for variations in motif length in the peptide input. In basic terms, the program takes as input a set of peptide sequences and clusters them into meaningful groups. It returns the optimal number of clusters it identified, together with the sequence alignment and sequence motif characterizing each cluster. Several parameters are available to customize cluster analysis, including adjustable penalties for small clusters and overlapping groups and a trash cluster to remove outliers. As an example application, we used the server to deconvolute multiple specificities in large-scale peptidome data generated by mass spectrometry. The server is available at http://www.cbs.dtu.dk/services/GibbsCluster-2.0. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Systemic Lupus Erythematosus: Molecular Mimicry between Anti-dsDNA CDR3 Idiotype, Microbial and Self Peptides-As Antigens for Th Cells.

PubMed

Aas-Hanssen, Kristin; Thompson, Keith M; Bogen, Bjarne; Munthe, Ludvig A

2015-01-01

Systemic lupus erythematosus (SLE) is marked by a T helper (Th) cell-dependent B cell hyperresponsiveness, with frequent germinal center reactions, and gammaglobulinemia. A feature of SLE is the finding of IgG autoantibodies specific for dsDNA. The specificity of the Th cells that drive the expansion of anti-dsDNA B cells is unresolved. However, anti-microbial, anti-histone, and anti-idiotype Th cell responses have been hypothesized to play a role. It has been entirely unclear if these seemingly disparate Th cell responses and hypotheses could be related or unified. Here, we describe that H chain CDR3 idiotypes from IgG(+) B cells of lupus mice have sequence similarities with both microbial and self peptides. Matched sequences were more frequent within the mutated CDR3 repertoire and when sequences were derived from lupus mice with expanded anti-dsDNA B cells. Analyses of histone sequences showed that particular histone peptides were similar to VDJ junctions. Moreover, lupus mice had Th cell responses toward histone peptides similar to anti-dsDNA CDR3 sequences. The results suggest that Th cells in lupus may have multiple cross-reactive specificities linked to the IgVH CDR3 Id-peptide sequences as well as similar DNA-associated protein motifs.
Genetic algorithm optimized triply compensated pulses in NMR spectroscopy

NASA Astrophysics Data System (ADS)

Manu, V. S.; Veglia, Gianluigi

2015-11-01

Sensitivity and resolution in NMR experiments are affected by magnetic field inhomogeneities (of both external and RF), errors in pulse calibration, and offset effects due to finite length of RF pulses. To remedy these problems, built-in compensation mechanisms for these experimental imperfections are often necessary. Here, we propose a new family of phase-modulated constant-amplitude broadband pulses with high compensation for RF inhomogeneity and heteronuclear coupling evolution. These pulses were optimized using a genetic algorithm (GA), which consists in a global optimization method inspired by Nature's evolutionary processes. The newly designed π and π / 2 pulses belong to the 'type A' (or general rotors) symmetric composite pulses. These GA-optimized pulses are relatively short compared to other general rotors and can be used for excitation and inversion, as well as refocusing pulses in spin-echo experiments. The performance of the GA-optimized pulses was assessed in Magic Angle Spinning (MAS) solid-state NMR experiments using a crystalline U-13C, 15N NAVL peptide as well as U-13C, 15N microcrystalline ubiquitin. GA optimization of NMR pulse sequences opens a window for improving current experiments and designing new robust pulse sequences.

PRISM 3: expanded prediction of natural product chemical structures from microbial genomes

PubMed Central

Skinnider, Michael A.; Merwin, Nishanth J.; Johnston, Chad W.

2017-01-01

Abstract Microbial natural products represent a rich resource of pharmaceutically and industrially important compounds. Genome sequencing has revealed that the majority of natural products remain undiscovered, and computational methods to connect biosynthetic gene clusters to their corresponding natural products therefore have the potential to revitalize natural product discovery. Previously, we described PRediction Informatics for Secondary Metabolomes (PRISM), a combinatorial approach to chemical structure prediction for genetically encoded nonribosomal peptides and type I and II polyketides. Here, we present a ground-up rewrite of the PRISM structure prediction algorithm to derive prediction of natural products arising from non-modular biosynthetic paradigms. Within this new version, PRISM 3, natural product scaffolds are modeled as chemical graphs, permitting structure prediction for aminocoumarins, antimetabolites, bisindoles and phosphonate natural products, and building upon the addition of ribosomally synthesized and post-translationally modified peptides. Further, with the addition of cluster detection for 11 new cluster types, PRISM 3 expands to detect 22 distinct natural product cluster types. Other major modifications to PRISM include improved sequence input and ORF detection, user-friendliness and output. Distribution of PRISM 3 over a 300-core server grid improves the speed and capacity of the web application. PRISM 3 is available at http://magarveylab.ca/prism/. PMID:28460067
Topological disposition of the sequences -QRKIVE- and -KETYY in native (Na sup + + K sup + )-ATPase

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bayer, R.

1990-03-06

The dispositions with respect to the plane of the membrane of lysine-905 in the internal sequence -EQRKIVE- and of lysine-1012 in the carboxy-terminal sequence -RRPGGWVEKETYY of the {alpha}-polypeptide of sodium and potassium ion activated adenosinetriphosphatase have been determined. These lysines are found in peptides released from the intact {alpha}-polypeptide by the extracellular protease from Staphylococcus aureus strain V8 and by trypsin, respectively. Synthetic peptides containing terminal sequences of these were used to prepare polyclonal antibodies, which were then used to prepare immunoadsorbents directed against the respective peptides. Sealed, right-side-out membrane vesicles containing native (Na{sup +} + K{sup +})-ATPase were labeledmore » with pyridoxal phosphate and sodium ({sup 3}H)borohydride in the absence or presence of saponin. The labeled {alpha}-polypeptide was isolated from these vesicles and digested with appropriate proteases. The incorporation of radioactivity into the peptides binding to the immunoadsorbent directed against the sequence pyrERXIVE increased 3-fold int the presence of saponin as a result of the increased accessibility of this portion of the protein to the reagent when the vesicles were breached by saponin; hence, this sequence is located on the cytoplasmic face of the membrane. It was inferred that the carboxy-terminal sequence -KETYY is on the extracytoplasmic face since the incorporation of radioactivity into peptides binding to the immunoadsorbent directed against the sequence -ETYY did not change when the vesicles were breached with saponin.« less
Extracting Both Peptide Sequence and Glycan Structural Information by 157 nm Photodissociation of N-Linked Glycopeptides

PubMed Central

Zhang, Liangyi; Reilly, James P.

2009-01-01

157 nm photodissociation of N-linked glycopeptides was investigated in MALDI tandem time-of-flight (TOF) and linear ion trap mass spectrometers. Singly-charged glycopeptides yielded abundant peptide and glycan fragments. The peptide fragments included a series of x-, y-, v- and w- ions with the glycan remaining intact. These provide information about the peptide sequence and the glycosylation site. In addition to glycosidic fragments, abundant cross-ring glycan fragments that are not observed in low-energy CID were detected. These fragments provide insight into the glycan sequence and linkages. Doubly-charged glycopeptides generated by nanospray in the linear ion trap mass spectrometer also yielded peptide and glycan fragments. However, the former were dominated by low-energy fragments such as b- and y- type ions while glycan was primarily cleaved at glycosidic bonds. PMID:19113943
Amino acid sequences of peptides from a chymotryptic digest of a urea-soluble protein fraction (U.S.3) from oxidized wool

PubMed Central

Corfield, M. C.; Fletcher, J. C.

1969-01-01

1. A chymotryptic digest of the protein fraction U.S.3. from oxidized wool was separated into 51 peptide fractions by chromatography on a column of cation-exchange resin. 2. The less acidic fractions were separated into their component peptides by a combination of cation-exchange-resin chromatography, paper chromatography and paper electrophoresis. 3. The amino acid sequences of 34 of these peptides were elucidated, and those of 14 others partially determined. 4. Overlaps between the tryptic and chymotryptic peptides from fraction U.S.3 have enabled ten extended amino acid sequences to be deduced, the longest containing 20 amino acid residues. 5. The relevance of the results to the structures of the helical and non-helical regions of wool is discussed. PMID:5395876
Studies of the structure-activity relationships of peptides and proteins involved in growth and development based on their three-dimensional structures.

PubMed

Nagata, Koji

2010-01-01

Peptides and proteins with similar amino acid sequences can have different biological functions. Knowledge of their three-dimensional molecular structures is critically important in identifying their functional determinants. In this review, I describe the results of our and other groups' structure-based functional characterization of insect insulin-like peptides, a crustacean hyperglycemic hormone-family peptide, a mammalian epidermal growth factor-family protein, and an intracellular signaling domain that recognizes proline-rich sequence.
Molecular and Cellular Mechanisms for the Interaction between Gold Nanoparticles and Neuroimmune Cells Based on Size, Shape, and Charge

DTIC Science & Technology

2014-04-25

IgG secretion. 2.3 Designing of Synthetic peptide The immunogenic peptides against the foot and mouth disease virus ( FMDV ) were designed and...synthesized based on viral protein 1 of type O FMDV . The amino acid sequence for pFMDV is NGSSKYGDTSTNNVRGDLQVLAQKAERTLC. An extra cysteine was added...peptides were synthesized based on the amino acid sequence of the VP1 coat protein of the FMDV (table 1). The peptide pFMDVD (19 amino acids in length
Improved prediction of peptide detectability for targeted proteomics using a rank-based algorithm and organism-specific data.

PubMed

Qeli, Ermir; Omasits, Ulrich; Goetze, Sandra; Stekhoven, Daniel J; Frey, Juerg E; Basler, Konrad; Wollscheid, Bernd; Brunner, Erich; Ahrens, Christian H

2014-08-28

The in silico prediction of the best-observable "proteotypic" peptides in mass spectrometry-based workflows is a challenging problem. Being able to accurately predict such peptides would enable the informed selection of proteotypic peptides for targeted quantification of previously observed and non-observed proteins for any organism, with a significant impact for clinical proteomics and systems biology studies. Current prediction algorithms rely on physicochemical parameters in combination with positive and negative training sets to identify those peptide properties that most profoundly affect their general detectability. Here we present PeptideRank, an approach that uses learning to rank algorithm for peptide detectability prediction from shotgun proteomics data, and that eliminates the need to select a negative dataset for the training step. A large number of different peptide properties are used to train ranking models in order to predict a ranking of the best-observable peptides within a protein. Empirical evaluation with rank accuracy metrics showed that PeptideRank complements existing prediction algorithms. Our results indicate that the best performance is achieved when it is trained on organism-specific shotgun proteomics data, and that PeptideRank is most accurate for short to medium-sized and abundant proteins, without any loss in prediction accuracy for the important class of membrane proteins. Targeted proteomics approaches have been gaining a lot of momentum and hold immense potential for systems biology studies and clinical proteomics. However, since only very few complete proteomes have been reported to date, for a considerable fraction of a proteome there is no experimental proteomics evidence that would allow to guide the selection of the best-suited proteotypic peptides (PTPs), i.e. peptides that are specific to a given proteoform and that are repeatedly observed in a mass spectrometer. We describe a novel, rank-based approach for the prediction of the best-suited PTPs for targeted proteomics applications. By building on methods developed in the field of information retrieval (e.g. web search engines like Google's PageRank), we circumvent the delicate step of selecting positive and negative training sets and at the same time also more closely reflect the experimentalist´s need for selecting e.g. the 5 most promising peptides for targeting a protein of interest. This approach allows to predict PTPs for not yet observed proteins or for organisms without prior experimental proteomics data such as many non-model organisms. Copyright © 2014 Elsevier B.V. All rights reserved.
Probabilistic arithmetic automata and their applications.

PubMed

Marschall, Tobias; Herms, Inke; Kaltenbach, Hans-Michael; Rahmann, Sven

2012-01-01

We present a comprehensive review on probabilistic arithmetic automata (PAAs), a general model to describe chains of operations whose operands depend on chance, along with two algorithms to numerically compute the distribution of the results of such probabilistic calculations. PAAs provide a unifying framework to approach many problems arising in computational biology and elsewhere. We present five different applications, namely 1) pattern matching statistics on random texts, including the computation of the distribution of occurrence counts, waiting times, and clump sizes under hidden Markov background models; 2) exact analysis of window-based pattern matching algorithms; 3) sensitivity of filtration seeds used to detect candidate sequence alignments; 4) length and mass statistics of peptide fragments resulting from enzymatic cleavage reactions; and 5) read length statistics of 454 and IonTorrent sequencing reads. The diversity of these applications indicates the flexibility and unifying character of the presented framework. While the construction of a PAA depends on the particular application, we single out a frequently applicable construction method: We introduce deterministic arithmetic automata (DAAs) to model deterministic calculations on sequences, and demonstrate how to construct a PAA from a given DAA and a finite-memory random text model. This procedure is used for all five discussed applications and greatly simplifies the construction of PAAs. Implementations are available as part of the MoSDi package. Its application programming interface facilitates the rapid development of new applications based on the PAA framework.
Mapping membrane activity in undiscovered peptide sequence space using machine learning

PubMed Central

Fulan, Benjamin M.; Wong, Gerard C. L.

2016-01-01

There are some ∼1,100 known antimicrobial peptides (AMPs), which permeabilize microbial membranes but have diverse sequences. Here, we develop a support vector machine (SVM)-based classifier to investigate ⍺-helical AMPs and the interrelated nature of their functional commonality and sequence homology. SVM is used to search the undiscovered peptide sequence space and identify Pareto-optimal candidates that simultaneously maximize the distance σ from the SVM hyperplane (thus maximize its “antimicrobialness”) and its ⍺-helicity, but minimize mutational distance to known AMPs. By calibrating SVM machine learning results with killing assays and small-angle X-ray scattering (SAXS), we find that the SVM metric σ correlates not with a peptide’s minimum inhibitory concentration (MIC), but rather its ability to generate negative Gaussian membrane curvature. This surprising result provides a topological basis for membrane activity common to AMPs. Moreover, we highlight an important distinction between the maximal recognizability of a sequence to a trained AMP classifier (its ability to generate membrane curvature) and its maximal antimicrobial efficacy. As mutational distances are increased from known AMPs, we find AMP-like sequences that are increasingly difficult for nature to discover via simple mutation. Using the sequence map as a discovery tool, we find a unexpectedly diverse taxonomy of sequences that are just as membrane-active as known AMPs, but with a broad range of primary functions distinct from AMP functions, including endogenous neuropeptides, viral fusion proteins, topogenic peptides, and amyloids. The SVM classifier is useful as a general detector of membrane activity in peptide sequences. PMID:27849600
Seed Storage Proteins as a System for Teaching Protein Identification by Mass Spectrometry in Biochemistry Laboratory

ERIC Educational Resources Information Center

Wilson, Karl A.; Tan-Wilson, Anna

2013-01-01

Mass spectrometry (MS) has become an important tool in studying biological systems. One application is the identification of proteins and peptides by the matching of peptide and peptide fragment masses to the sequences of proteins in protein sequence databases. Often prior protein separation of complex protein mixtures by 2D-PAGE is needed,…
Identification of trimannoside-recognizing peptide sequences from a T7 phage display screen using a QCM device.

PubMed

Nishiyama, Kazusa; Takakusagi, Yoichi; Kusayanagi, Tomoe; Matsumoto, Yuki; Habu, Shiori; Kuramochi, Kouji; Sugawara, Fumio; Sakaguchi, Kengo; Takahashi, Hideyo; Natsugari, Hideaki; Kobayashi, Susumu

2009-01-01

Here, we report on the identification of trimannoside-recognizing peptide sequences from a T7 phage display screen using a quartz-crystal microbalance (QCM) device. A trimannoside derivative that can form a self-assembled monolayer (SAM) was synthesized and used for immobilization on the gold electrode surface of a QCM sensor chip. After six sets of one-cycle affinity selection, T7 phage particles displaying PSVGLFTH (8-mer) and SVGLGLGFSTVNCF (14-mer) were found to be enriched at a rate of 17/44, 9/44, respectively, suggesting that these peptides specifically recognize trimannoside. Binding checks using the respective single T7 phage and synthetic peptide also confirmed the specific binding of these sequences to the trimannoside-SAM. Subsequent analysis revealed that these sequences correspond to part of the primary amino acid sequence found in many mannose- or hexose-related proteins. Taken together, these results demonstrate the effectiveness of our T7 phage display environment for affinity selection of binding peptides. We anticipate this screening result will also be extremely useful in the development of inhibitors or drug delivery systems targeting polysaccharides as well as further investigations into the function of carbohydrates in vivo.
Overview of the HUPO Plasma Proteome Project: Results from the pilot phase with 35 collaborating laboratories and multiple analytical groups, generating a core dataset of 3020 proteins and a publicly-available database

DOE Office of Scientific and Technical Information (OSTI.GOV)

Omenn, Gilbert; States, David J.; Adamski, Marcin

2005-08-13

HUPO initiated the Plasma Proteome Project (PPP) in 2002. Its pilot phase has (1) evaluated advantages and limitations of many depletion, fractionation, and MS technology platforms; (2) compared PPP reference specimens of human serum and EDTA, heparin, and citrate-anticoagulated plasma; and (3) created a publicly-available knowledge base (www.bioinformatics. med.umich.edu/hupo/ppp; www.ebi.ac.uk/pride). Thirty-five participating laboratories in 13 countries submitted datasets. Working groups addressed (a) specimen stability and protein concentrations; (b) protein identifications from 18 MS/MS datasets; (c) independent analyses from raw MS-MS spectra; (d) search engine performance, subproteome analyses, and biological insights; (e) antibody arrays; and (f) direct MS/SELDI analyses. MS-MS datasetsmore » had 15 710 different International Protein Index (IPI) protein IDs; our integration algorithm applied to multiple matches of peptide sequences yielded 9504 IPI proteins identified with one or more peptides and 3020 proteins identified with two or more peptides (the Core Dataset). These proteins have been characterized with Gene Ontology, InterPro, Novartis Atlas, OMIM, and immunoassay based concentration determinations. The database permits examination of many other subsets, such as 1274 proteins identified with three or more peptides. Reverse protein to DNA matching identified proteins for 118 previously unidentified ORFs. We recommend use of plasma instead of serum, with EDTA (or citrate) for anticoagulation. To improve resolution, sensitivity and reproducibility of peptide identifications and protein matches, we recommend combinations of depletion, fractionation, and MS/MS technologies, with explicit criteria for evaluation of spectra, use of search algorithms, and integration of homologous protein matches. This Special Issue of PROTEOMICS presents papers integral to the collaborative analysis plus many reports of supplementary work on various aspects of the PPP workplan. These PPP results on complexity, dynamic range, incomplete sampling, false-positive matches, and integration of diverse datasets for plasma and serum proteins lay a foundation for development and validation of circulating protein biomarkers in health and disease.« less
Activation of erythropoietin receptor in the absence of hormone by a peptide that binds to a domain different from the hormone binding site

PubMed Central

Naranda, Tatjana; Wong, Kenneth; Kaufman, R. Ilene; Goldstein, Avram; Olsson, Lennart

1999-01-01

Applying a homology search method previously described, we identified a sequence in the extracellular dimerization site of the erythropoietin receptor, distant from the hormone binding site. A peptide identical to that sequence was synthesized. Remarkably, it activated receptor signaling in the absence of erythropoietin. Neither the peptide nor the hormone altered the affinity of the other for the receptor; thus, the peptide does not bind to the hormone binding site. The combined activation of signal transduction by hormone and peptide was strongly synergistic. In mice, the peptide acted like the hormone, protecting against the decrease in hematocrit caused by carboplatin. PMID:10377456
Post-staining electroblotting for efficient and reliable peptide blotting.

PubMed

Lee, Der-Yen; Chang, Geen-Dong

2015-01-01

Post-staining electroblotting has been previously described to transfer Coomassie blue-stained proteins from polyacrylamide gel onto polyvinylidene difluoride (PVDF) membranes. Actually, stained peptides can also be efficiently and reliably transferred. Because of selective staining procedures for peptides and increased retention of stained peptides on the membrane, even peptides with molecular masses less than 2 kDa such as bacitracin and granuliberin R are transferred with satisfactory results. For comparison, post-staining electroblotting is about 16-fold more sensitive than the conventional electroblotting for visualization of insulin on the membrane. Therefore, the peptide blots become practicable and more accessible to further applications, e.g., blot overlay detection or immunoblotting analysis. In addition, the efficiency of peptide transfer is favorable for N-terminal sequence analysis. With this method, peptide blotting can be normalized for further analysis such as blot overlay assay, immunoblotting, and N-terminal sequencing for identification of peptide in crude or partially purified samples.
A new family of cystine knot peptides from the seeds of Momordica cochinchinensis.

PubMed

Chan, Lai Yue; He, Wenjun; Tan, Ninghua; Zeng, Guangzhi; Craik, David J; Daly, Norelle L

2013-01-01

Momordica cochinchinensis, a Cucurbitaceae plant commonly found in Southeast Asia, has the unusual property of containing both acyclic and backbone-cyclized trypsin inhibitors with inhibitor cystine knot (ICK) motifs. In the current study we have shown that M. cochinchinensis also contains another family of acyclic ICK peptides. We recently reported two novel peptides from M. cochinchinensis but have now discovered four additional peptides (MCo-3-MCo-6) with related sequences. Together these peptides form a novel family of M. cochinchinensis ICK peptides (MCo-ICK) that do not have sequence homology with other known peptides and are not potent trypsin inhibitors. Otherwise these new peptides MCo-3 to MCo-6 were evaluated for antimalarial activity against Plasmodium falciparum, and cytotoxic activity against the cancer cell line MDA-MB-231. But these peptides were not active. Copyright © 2012 Elsevier Inc. All rights reserved.
Modeling and prediction of peptide drift times in ion mobility spectrometry using sequence-based and structure-based approaches.

PubMed

Zhang, Yiming; Jin, Quan; Wang, Shuting; Ren, Ren

2011-05-01

The mobile behavior of 1481 peptides in ion mobility spectrometry (IMS), which are generated by protease digestion of the Drosophila melanogaster proteome, is modeled and predicted based on two different types of characterization methods, i.e. sequence-based approach and structure-based approach. In this procedure, the sequence-based approach considers both the amino acid composition of a peptide and the local environment profile of each amino acid in the peptide; the structure-based approach is performed with the CODESSA protocol, which regards a peptide as a common organic compound and generates more than 200 statistically significant variables to characterize the whole structure profile of a peptide molecule. Subsequently, the nonlinear support vector machine (SVM) and Gaussian process (GP) as well as linear partial least squares (PLS) regression is employed to correlate the structural parameters of the characterizations with the IMS drift times of these peptides. The obtained quantitative structure-spectrum relationship (QSSR) models are evaluated rigorously and investigated systematically via both one-deep and two-deep cross-validations as well as the rigorous Monte Carlo cross-validation (MCCV). We also give a comprehensive comparison on the resulting statistics arising from the different combinations of variable types with modeling methods and find that the sequence-based approach can give the QSSR models with better fitting ability and predictive power but worse interpretability than the structure-based approach. In addition, though the QSSR modeling using sequence-based approach is not needed for the preparation of the minimization structures of peptides before the modeling, it would be considerably efficient as compared to that using structure-based approach. Copyright © 2011 Elsevier Ltd. All rights reserved.
Sequence characterization of cDNA sequence of encoding of an antimicrobial Peptide with no disulfide bridge from the Iranian mesobuthus eupeus venomous glands.

PubMed

Farajzadeh-Sheikh, Ahmad; Jolodar, Abbas; Ghaemmaghami, Shamsedin

2013-01-01

Scorpion venom glands produce some antimicrobial peptides (AMP) that can rapidly kill a broad range of microbes and have additional activities that impact on the quality and effectiveness of innate responses and inflammation. In this study, we reported the identification of a cDNA sequence encoding cysteine-free antimicrobial peptides isolated from venomous glands of this species. Total RNA was extracted from the Iranian mesobuthus eupeus venom glands, and cDNA was synthesized by using the modified oligo (dT). The cDNA was used as the template for applying Semi-nested RT- PCR technique. PCR Products were used for direct nucleotide sequencing and the results were compared with Gen Bank database. A 213 BP cDNA fragment encoding the entire coding region of an antimicrobial toxin from the Iranian scorpion M. Eupeus venom glands were isolated. The full-length sequence of the coding region was 210 BP contained an open reading frame of 70 amino with a predicted molecular mass of 7970.48 Da and theoretical Pi of 9.10. The open reading frame consists of 210 BP encoding a precursor of 70 amino acid residues, including a signal peptide of 23 residues a propertied of 7 residues, and a mature peptide of 34 residues with no disulfide bridge. The peptide has detectable sequence identity to the Lesser Asian mesobuthus eupeus MeVAMP-2 (98%), MeVAMP-9 (60%) and several previously described AMPs from other scorpion venoms including mesobuthus martensii (94%) and buthus occitanus Israelis (82%). The secondary structure of the peptide mainly consisted of α-helical structure which was generally conserved by previously reported scorpion counterparts. The phylogenetic analysis showed that the Iranian MeAMP-like toxin was similar but not identical with that of venom antimicrobial peptides from lesser Asian scorpion mesobuthus eupeus.
Web server to identify similarity of amino acid motifs to compounds (SAAMCO).

PubMed

Casey, Fergal P; Davey, Norman E; Baran, Ivan; Varekova, Radka Svobodova; Shields, Denis C

2008-07-01

Protein-protein interactions are fundamental in mediating biological processes including metabolism, cell growth, and signaling. To be able to selectively inhibit or induce protein activity or complex formation is a key feature in controlling disease. For those situations in which protein-protein interactions derive substantial affinity from short linear peptide sequences, or motifs, we can develop search algorithms for peptidomimetic compounds that resemble the short peptide's structure but are not compromised by poor pharmacological properties. SAAMCO is a Web service ( http://bioware.ucd.ie/ approximately saamco) that facilitates the screening of motifs with known structures against bioactive compound databases. It is built on an algorithm that defines compound similarity based on the presence of appropriate amino acid side chain fragments and a favorable Root Mean Squared Deviation (RMSD) between compound and motif structure. The methodology is efficient as the available compound databases are preprocessed and fast regular expression searches filter potential matches before time-intensive 3D superposition is performed. The required input information is minimal, and the compound databases have been selected to maximize the availability of information on biological activity. "Hits" are accompanied with a visualization window and links to source database entries. Motif matching can be defined on partial or full similarity which will increase or reduce respectively the number of potential mimetic compounds. The Web server provides the functionality for rapid screening of known or putative interaction motifs against prepared compound libraries using a novel search algorithm. The tabulated results can be analyzed by linking to appropriate databases and by visualization.
Extensive characterization of peptides from Panax ginseng C. A. Meyer using mass spectrometric approach.

PubMed

Ye, Xueting; Zhao, Nan; Yu, Xi; Han, Xiaoli; Gao, Huiyuan; Zhang, Xiaozhe

2016-11-01

Panax ginseng is an important herb that has clear effects on the treatment of diverse diseases. Until now, the natural peptide constitution of this herb remains unclear. Here, we conduct an extensive characterization of Ginseng peptidome using MS-based data mining and sequencing. The screen on the charge states of precursor ions indicated that Ginseng is a peptide-rich herb in comparison of a number of commonly used herbs. The Ginseng peptides were then extracted and submitted to nano-LC-MS/MS analysis using different fragmentation modes, including CID, high-energy collisional dissociation, and electron transfer dissociation. Further database search and de novo sequencing allowed the identification of total 308 peptides, some of which might have important biological activities. This study illustrates the abundance and sequences of endogenous Ginseng peptides, thus providing the information of more candidates for the screening of active compounds for future biological research and drug discovery studies. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Practical multipeptide synthesis: dedicated software for the definition of multiple, overlapping peptides covering polypeptide sequences.

PubMed

Heegaard, P M; Holm, A; Hagerup, M

1993-01-01

A personal computer program for the conversion of linear amino acid sequences to multiple, small, overlapping peptide sequences has been developed. Peptide lengths and "jumps" (the distance between two consecutive overlapping peptides) are defined by the user. To facilitate the use of the program for parallel solid-phase chemical peptide syntheses for the synchronous production of multiple peptides, amino acids at each acylation step are laid out by the program in a convenient standard multi-well setup. Also, the total number of equivalents, as well as the derived amount in milligrams (depend-ending on user-defined equivalent weights and molar surplus), of each amino acid are given. The program facilitates the implementation of multipeptide synthesis, e.g., for the elucidation of polypeptide structure-function relationships, and greatly reduces the risk of introducing mistakes at the planning step. It is written in Pascal and runs on any DOS-based personal computer. No special graphic display is needed.

Functionalization of peptide nucleolipid bioconjugates and their structure anti-cancer activity relationship studies.

PubMed

Rana, Niki; Cultrara, Christopher; Phillips, Mariana; Sabatino, David

2017-09-01

In the search for more potent peptide-based anti-cancer conjugates the generation of new, functionally diverse nucleolipid derived D-(KLAKLAK) 2 -AK sequences has enabled a structure and anti-cancer activity relationship study. A reductive amination approach was key for the synthesis of alkylamine, diamine and polyamine derived nucleolipids as well as those incorporating heterocyclic functionality. The carboxy-derived nucleolipids were then coupled to the C-terminus of the D-(KLAKLAK) 2 -AK killer peptide sequence and produced with and without the FITC fluorophore for investigating biological activity in cancer cells. The amphiphilic, α-helical peptide-nucleolipid bioconjugates were found to exhibit variable effects on the viability of MM.1S cells, with the histamine derived nucleolipid peptide bioconjugate displaying the most significant anti-cancer effects. Thus, functionally diverse nucleolipids have been developed to fine-tune the structure and anti-cancer properties of killer peptide sequences, such as D-(KLAKLAK) 2 -AK. Copyright © 2017 Elsevier Ltd. All rights reserved.
BiPPred: Combined sequence- and structure-based prediction of peptide binding to the Hsp70 chaperone BiP.

PubMed

Schneider, Markus; Rosam, Mathias; Glaser, Manuel; Patronov, Atanas; Shah, Harpreet; Back, Katrin Christiane; Daake, Marina Angelika; Buchner, Johannes; Antes, Iris

2016-10-01

Substrate binding to Hsp70 chaperones is involved in many biological processes, and the identification of potential substrates is important for a comprehensive understanding of these events. We present a multi-scale pipeline for an accurate, yet efficient prediction of peptides binding to the Hsp70 chaperone BiP by combining sequence-based prediction with molecular docking and MMPBSA calculations. First, we measured the binding of 15mer peptides from known substrate proteins of BiP by peptide array (PA) experiments and performed an accuracy assessment of the PA data by fluorescence anisotropy studies. Several sequence-based prediction models were fitted using this and other peptide binding data. A structure-based position-specific scoring matrix (SB-PSSM) derived solely from structural modeling data forms the core of all models. The matrix elements are based on a combination of binding energy estimations, molecular dynamics simulations, and analysis of the BiP binding site, which led to new insights into the peptide binding specificities of the chaperone. Using this SB-PSSM, peptide binders could be predicted with high selectivity even without training of the model on experimental data. Additional training further increased the prediction accuracies. Subsequent molecular docking (DynaDock) and MMGBSA/MMPBSA-based binding affinity estimations for predicted binders allowed the identification of the correct binding mode of the peptides as well as the calculation of nearly quantitative binding affinities. The general concept behind the developed multi-scale pipeline can readily be applied to other protein-peptide complexes with linearly bound peptides, for which sufficient experimental binding data for the training of classical sequence-based prediction models is not available. Proteins 2016; 84:1390-1407. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Trinucleotide cassettes increase diversity of T7 phage-displayed peptide library.

PubMed

Krumpe, Lauren R H; Schumacher, Kathryn M; McMahon, James B; Makowski, Lee; Mori, Toshiyuki

2007-10-05

Amino acid sequence diversity is introduced into a phage-displayed peptide library by randomizing library oligonucleotide DNA. We recently evaluated the diversity of peptide libraries displayed on T7 lytic phage and M13 filamentous phage and showed that T7 phage can display a more diverse amino acid sequence repertoire due to differing processes of viral morphogenesis. In this study, we evaluated and compared the diversity of a 12-mer T7 phage-displayed peptide library randomized using codon-corrected trinucleotide cassettes with a T7 and an M13 12-mer phage-displayed peptide library constructed using the degenerate codon randomization method. We herein demonstrate that the combination of trinucleotide cassette amino acid codon randomization and T7 phage display construction methods resulted in a significant enhancement to the functional diversity of a 12-mer peptide library. This novel library exhibited superior amino acid uniformity and order-of-magnitude increases in amino acid sequence diversity as compared to degenerate codon randomized peptide libraries. Comparative analyses of the biophysical characteristics of the 12-mer peptide libraries revealed the trinucleotide cassette-randomized library to be a unique resource. The combination of T7 phage display and trinucleotide cassette randomization resulted in a novel resource for the potential isolation of binding peptides for new and previously studied molecular targets.
Confetti: A Multiprotease Map of the HeLa Proteome for Comprehensive Proteomics*

PubMed Central

Guo, Xiaofeng; Trudgian, David C.; Lemoff, Andrew; Yadavalli, Sivaramakrishna; Mirzaei, Hamid

2014-01-01

Bottom-up proteomics largely relies on tryptic peptides for protein identification and quantification. Tryptic digestion often provides limited coverage of protein sequence because of issues such as peptide length, ionization efficiency, and post-translational modification colocalization. Unfortunately, a region of interest in a protein, for example, because of proximity to an active site or the presence of important post-translational modifications, may not be covered by tryptic peptides. Detection limits, quantification accuracy, and isoform differentiation can also be improved with greater sequence coverage. Selected reaction monitoring (SRM) would also greatly benefit from being able to identify additional targetable sequences. In an attempt to improve protein sequence coverage and to target regions of proteins that do not generate useful tryptic peptides, we deployed a multiprotease strategy on the HeLa proteome. First, we used seven commercially available enzymes in single, double, and triple enzyme combinations. A total of 48 digests were performed. 5223 proteins were detected by analyzing the unfractionated cell lysate digest directly; with 42% mean sequence coverage. Additional strong-anion exchange fractionation of the most complementary digests permitted identification of over 3000 more proteins, with improved mean sequence coverage. We then constructed a web application (https://proteomics.swmed.edu/confetti) that allows the community to examine a target protein or protein isoform in order to discover the enzyme or combination of enzymes that would yield peptides spanning a certain region of interest in the sequence. Finally, we examined the use of nontryptic digests for SRM. From our strong-anion exchange fractionation data, we were able to identify three or more proteotypic SRM candidates within a single digest for 6056 genes. Surprisingly, in 25% of these cases the digest producing the most observable proteotypic peptides was neither trypsin nor Lys-C. SRM analysis of Asp-N versus tryptic peptides for eight proteins determined that Asp-N yielded higher signal in five of eight cases. PMID:24696503
Anti-infective activity of apolipoprotein domain derived peptides in vitro: identification of novel antimicrobial peptides related to apolipoprotein B with anti-HIV activity

PubMed Central

2010-01-01

Background Previous reports have shown that peptides derived from the apolipoprotein E receptor binding region and the amphipathic α-helical domains of apolipoprotein AI have broad anti-infective activity and antiviral activity respectively. Lipoproteins and viruses share a similar cell biological niche, being of overlapping size and displaying similar interactions with mammalian cells and receptors, which may have led to other antiviral sequences arising within apolipoproteins, in addition to those previously reported. We therefore designed a series of peptides based around either apolipoprotein receptor binding regions, or amphipathic α-helical domains, and tested these for antiviral and antibacterial activity. Results Of the nineteen new peptides tested, seven showed some anti-infective activity, with two of these being derived from two apolipoproteins not previously used to derive anti-infective sequences. Apolipoprotein J (151-170) - based on a predicted amphipathic alpha-helical domain from apolipoprotein J - had measurable anti-HSV1 activity, as did apolipoprotein B (3359-3367) dp (apoBdp), the latter being derived from the LDL receptor binding domain B of apolipoprotein B. The more active peptide - apoBdp - showed similarity to the previously reported apoE derived anti-infective peptide, and further modification of the apoBdp sequence to align the charge distribution more closely to that of apoEdp or to introduce aromatic residues resulted in increased breadth and potency of activity. The most active peptide of this type showed similar potent anti-HIV activity, comparable to that we previously reported for the apoE derived peptide apoEdpL-W. Conclusions These data suggest that further antimicrobial peptides may be obtained using human apolipoprotein sequences, selecting regions with either amphipathic α-helical structure, or those linked to receptor-binding regions. The finding that an amphipathic α-helical region of apolipoprotein J has antiviral activity comparable with that for the previously reported apolipoprotein AI derived peptide 18A, suggests that full-length apolipoprotein J may also have such activity, as has been reported for full-length apolipoprotein AI. Although the strength of the anti-infective activity of the sequences identified was limited, this could be increased substantially by developing related mutant peptides. Indeed the apolipoprotein B-derived peptide mutants uncovered by the present study may have utility as HIV therapeutics or microbicides. PMID:20298574
Sense-antisense (complementary) peptide interactions and the proteomic code; potential opportunities in biology and pharmaceutical science.

PubMed

Miller, Andrew D

2015-02-01

A sense peptide can be defined as a peptide whose sequence is coded by the nucleotide sequence (read 5' → 3') of the sense (positive) strand of DNA. Conversely, an antisense (complementary) peptide is coded by the corresponding nucleotide sequence (read 5' → 3') of the antisense (negative) strand of DNA. Research has been accumulating steadily to suggest that sense peptides are capable of specific interactions with their corresponding antisense peptides. Unfortunately, although more and more examples of specific sense-antisense peptide interactions are emerging, the very idea of such interactions does not conform to standard biology dogma and so there remains a sizeable challenge to lift this concept from being perceived as a peripheral phenomenon if not worse, into becoming part of the scientific mainstream. Specific interactions have now been exploited for the inhibition of number of widely different protein-protein and protein-receptor interactions in vitro and in vivo. Further, antisense peptides have also been used to induce the production of antibodies targeted to specific receptors or else the production of anti-idiotypic antibodies targeted against auto-antibodies. Such illustrations of utility would seem to suggest that observed sense-antisense peptide interactions are not just the consequence of a sequence of coincidental 'lucky-hits'. Indeed, at the very least, one might conclude that sense-antisense peptide interactions represent a potentially new and different source of leads for drug discovery. But could there be more to come from studies in this area? Studies on the potential mechanism of sense-antisense peptide interactions suggest that interactions may be driven by amino acid residue interactions specified from the genetic code. If so, such specified amino acid residue interactions could form the basis for an even wider amino acid residue interaction code (proteomic code) that links gene sequences to actual protein structure and function, even entire genomes to entire proteomes. The possibility that such a proteomic code should exist is discussed. So too the potential implications for biology and pharmaceutical science are also discussed were such a code to exist.
Anti-tumor activities of peptides corresponding to conserved complementary determining regions from different immunoglobulins.

PubMed

Figueiredo, Carlos R; Matsuo, Alisson L; Massaoka, Mariana H; Polonelli, Luciano; Travassos, Luiz R

2014-09-01

Short synthetic peptides corresponding to sequences of complementarity-determining regions (CDRs) from different immunoglobulin families have been shown to induce antimicrobial, antiviral and antitumor activities regardless of the specificity of the original monoclonal antibody (mAb). Presently, we studied the in vitro and in vivo antitumor activity of synthetic peptides derived from conserved CDR sequences of different immunoglobulins against human tumor cell lines and murine B16F10-Nex2 melanoma aiming at the discovery of candidate molecules for cancer therapy. Four light- and heavy-chain CDR peptide sequences from different antibodies (C36-L1, HA9-H2, 1-H2 and Mg16-H2) showed cytotoxic activity against murine melanoma and a panel of human tumor cell lineages in vitro. Importantly, they also exerted anti-metastatic activity using a syngeneic melanoma model in mice. Other peptides (D07-H3, MN20v1, MS2-H3) were also protective against metastatic melanoma, without showing significant cytotoxicity against tumor cells in vitro. In this case, we suggest that these peptides may act as immune adjuvants in vivo. As observed, peptides induced nitric oxide production in bone-marrow macrophages showing that innate immune cells can also be modulated by these CDR peptides. The present screening supports the search in immunoglobulins of rather frequent CDR sequences that are endowed with specific antitumor properties and may be candidates to be developed as anti-cancer drugs. Copyright © 2014 Elsevier Inc. All rights reserved.
Guiding principles for peptide nanotechnology through directed discovery.

PubMed

Lampel, A; Ulijn, R V; Tuttle, T

2018-05-21

Life's diverse molecular functions are largely based on only a small number of highly conserved building blocks - the twenty canonical amino acids. These building blocks are chemically simple, but when they are organized in three-dimensional structures of tremendous complexity, new properties emerge. This review explores recent efforts in the directed discovery of functional nanoscale systems and materials based on these same amino acids, but that are not guided by copying or editing biological systems. The review summarises insights obtained using three complementary approaches of searching the sequence space to explore sequence-structure relationships for assembly, reactivity and complexation, namely: (i) strategic editing of short peptide sequences; (ii) computational approaches to predicting and comparing assembly behaviours; (iii) dynamic peptide libraries that explore the free energy landscape. These approaches give rise to guiding principles on controlling order/disorder, complexation and reactivity by peptide sequence design.
The CGTCA sequence motif is essential for biological activity of the vasoactive intestinal peptide gene cAMP-regulated enhancer.

PubMed Central

Fink, J S; Verhave, M; Kasper, S; Tsukada, T; Mandel, G; Goodman, R H

1988-01-01

cAMP-regulated transcription of the human vasoactive intestinal peptide gene is dependent upon a 17-base-pair DNA element located 70 base pairs upstream from the transcriptional initiation site. This element is similar to sequences in other genes known to be regulated by cAMP and to sequences in several viral enhancers. We have demonstrated that the vasoactive intestinal peptide regulatory element is an enhancer that depends upon the integrity of two CGTCA sequence motifs for biological activity. Mutations in either of the CGTCA motifs diminish the ability of the element to respond to cAMP. Enhancers containing the CGTCA motif from the somatostatin and adenovirus genes compete for binding of nuclear proteins from C6 glioma and PC12 cells to the vasoactive intestinal peptide enhancer, suggesting that CGTCA-containing enhancers interact with similar transacting factors. Images PMID:2842787
Ocellatin peptides from the skin secretion of the South American frog Leptodactylus labyrinthicus (Leptodactylidae): characterization, antimicrobial activities and membrane interactions.

PubMed

Gusmão, Karla A G; Dos Santos, Daniel M; Santos, Virgílio M; Cortés, María Esperanza; Reis, Pablo V M; Santos, Vera L; Piló-Veloso, Dorila; Verly, Rodrigo M; de Lima, Maria Elena; Resende, Jarbas M

2017-01-01

The availability of antimicrobial peptides from several different natural sources has opened an avenue for the discovery of new biologically active molecules. To the best of our knowledge, only two peptides isolated from the frog Leptodactylus labyrinthicus , namely pentadactylin and ocellatin-F1, have shown antimicrobial activities. Therefore, in order to explore the antimicrobial potential of this species, we have investigated the biological activities and membrane interactions of three peptides isolated from the anuran skin secretion. Three peptide primary structures were determined by automated Edman degradation. These sequences were prepared by solid-phase synthesis and submitted to activity assays against gram-positive and gram-negative bacteria and against two fungal strains. The hemolytic properties of the peptides were also investigated in assays with rabbit blood erythrocytes. The conformational preferences of the peptides and their membrane interactions have been investigated by circular dichroism spectroscopy and liposome dye release assays. The amino acid compositions of three ocellatins were determined and the sequences exhibit 100% homology for the first 22 residues (ocellatin-LB1 sequence). Ocellatin-LB2 carries an extra Asn residue and ocellatin-F1 extra Asn-Lys-Leu residues at C-terminus. Ocellatin-F1 presents a stronger antibiotic potential and a broader spectrum of activities compared to the other peptides. The membrane interactions and pore formation capacities of the peptides correlate directly with their antimicrobial activities, i.e., ocellatin-F1 > ocellatin-LB1 > ocellatin-LB2. All peptides acquire high helical contents in membrane environments. However, ocellatin-F1 shows in average stronger helical propensities. The obtained results indicate that the three extra amino acid residues at the ocellatin-F1 C-terminus play an important role in promoting stronger peptide-membrane interactions and antimicrobial properties. The extra Asn-23 residue present in ocellatin-LB2 sequence seems to decrease its antimicrobial potential and the strength of the peptide-membrane interactions.
Design of Embedded-Hybrid Antimicrobial Peptides with Enhanced Cell Selectivity and Anti-Biofilm Activity

PubMed Central

Xu, Wei; Zhu, Xin; Tan, Tingting; Li, Weizhong; Shan, Anshan

2014-01-01

Antimicrobial peptides have attracted considerable attention because of their broad-spectrum antimicrobial activity and their low prognostic to induce antibiotic resistance which is the most common source of failure in bacterial infection treatment along with biofilms. The method to design hybrid peptide integrating different functional domains of peptides has many advantages. In this study, we designed an embedded-hybrid peptide R-FV-I16 by replacing a functional defective sequence RR7 with the anti-biofilm sequence FV7 embedded in the middle position of peptide RI16. The results demonstrated that the synthetic hybrid the peptide R-FV-I16 had potent antimicrobial activity over a wide range of Gram-negative and Gram-positive bacteria, as well as anti-biofilm activity. More importantly, R-FV-I16 showed lower hemolytic activity and cytotoxicity. Fluorescent assays demonstrated that R-FV-I16 depolarized the outer and the inner bacterial membranes, while scanning electron microscopy and transmission electron microscopy further indicated that this peptide killed bacterial cells by disrupting the cell membrane, thereby damaging membrane integrity. Results from SEM also provided evidence that R-FV-I16 inherited anti-biofilm activity from the functional peptide sequence FV7. Embedded-hybrid peptides could provide a new pattern for combining different functional domains and showing an effective avenue to screen for novel antimicrobial agents. PMID:24945359
Weighing the mass spectrometric evidence for authentic Tyrannosaurus rex collagen

PubMed Central

Buckley, Mike; Walker, Angela; Ho, Simon Y. W.; Yang, Yue; Smith, Colin; Ashton, Peter; Oates, Jane Thomas; Cappellini, Enrico; Koon, Hannah; Penkman, Kirsty; Elsworth, Ben; Ashford, Dave; Solazzo, Caroline; Andrews, Phil; Strahler, John; Shapiro, Beth; Ostrom, Peggy; Gandhi, Hasand; Miller, Webb; Raney, Brian; Zylber, Maria Ines; Gilbert, M. Thomas P.; Prigodich, Richard V.; Ryan, Michael; Rijsdijk, Kenneth F.; Janoo, Anwar; Collins, Matthew J.

2009-01-01

We use authentication tests developed for ancient DNA to evaluate claims by Asara et al. of collagen peptide sequences recovered from mastodon and Tyrannosaurus rex fossils. Although the mastodon passes, absence of amino acid composition data, lack of evidence for peptide deamidation, and association of the α1(I) peptide sequences with amphibians not birds, suggests that T. rex does not. PMID:18174420
Sequence-Dependent Structure/Function Relationships of Catalytic Peptide-Enabled Gold Nanoparticles Generated under Ambient Synthetic Conditions.

PubMed

Bedford, Nicholas M; Hughes, Zak E; Tang, Zhenghua; Li, Yue; Briggs, Beverly D; Ren, Yang; Swihart, Mark T; Petkov, Valeri G; Naik, Rajesh R; Knecht, Marc R; Walsh, Tiffany R

2016-01-20

Peptide-enabled nanoparticle (NP) synthesis routes can create and/or assemble functional nanomaterials under environmentally friendly conditions, with properties dictated by complex interactions at the biotic/abiotic interface. Manipulation of this interface through sequence modification can provide the capability for material properties to be tailored to create enhanced materials for energy, catalysis, and sensing applications. Fully realizing the potential of these materials requires a comprehensive understanding of sequence-dependent structure/function relationships that is presently lacking. In this work, the atomic-scale structures of a series of peptide-capped Au NPs are determined using a combination of atomic pair distribution function analysis of high-energy X-ray diffraction data and advanced molecular dynamics (MD) simulations. The Au NPs produced with different peptide sequences exhibit varying degrees of catalytic activity for the exemplar reaction 4-nitrophenol reduction. The experimentally derived atomic-scale NP configurations reveal sequence-dependent differences in structural order at the NP surface. Replica exchange with solute-tempering MD simulations are then used to predict the morphology of the peptide overlayer on these Au NPs and identify factors determining the structure/catalytic properties relationship. We show that the amount of exposed Au surface, the underlying surface structural disorder, and the interaction strength of the peptide with the Au surface all influence catalytic performance. A simplified computational prediction of catalytic performance is developed that can potentially serve as a screening tool for future studies. Our approach provides a platform for broadening the analysis of catalytic peptide-enabled metallic NP systems, potentially allowing for the development of rational design rules for property enhancement.
P53 Immune Responses in Breast Cancer Patients: Assessment of CTL Recognizing the HLA-A2.1 Restricted, Wild-type Sequence p53 (264-272) Epitope; Frequencies of Tetramer+ T Cells Specific for the Wild-Type Sequence P53 (264-272) Peptide in the Circulation of Patients with Head and Neck Cancer; The Ability of Variant Peptides to Reverse the Nonresponsiveness of T Lymphocytes to the Wild-Type Sequence P53 (264-272) Epitope

DTIC Science & Technology

2002-10-01

This document contains three papers focusing on the analysis of anti-p53 cellular immune responses of breast, head, neck, and oral cancer patients...variants were generated by amino acid exchanges at positions 6 (6T) and 7 (7W) of the peptide. The 7W variant peptide has potential for immunotherapy of nonresponsive oral cancer patients.
The primary structure of aspartate aminotransferase from pig heart muscle. Partial sequences determined by digestion with thermolysin and elastase

PubMed Central

Bossa, Francesco; Barra, Donatella; Carloni, Massimo; Fasella, Paolo; Riva, Francesca; Doonan, Shawn; Doonan, Hilary J.; Hanford, Robin; Vernon, Charles A.; Walker, John M.

1973-01-01

Peptides produced by thermolytic digestion of aminoethylated aspartate aminotransferase and of the oxidized enzyme were isolated and their amino acid sequences determined. Digestion by elastase of the carboxymethylated enzyme gave peptides representing approximately 40% of the primary structure. Fragments from these digests overlapped with previously reported sequences of peptides obtained by peptic and tryptic digestion (Doonan et al., 1972), giving ten composite peptides containing 395 amino acid residues. The amino acid composition of these composite peptides agrees well with that of the intact enzyme. Confirmatory results for some of the present data have been deposited as Supplementary Publication 50018 at the National Lending Library for Science and Technology, Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1973) 131, 5. PMID:4748834
Cross-reactive and strain-specific antipeptide antibodies to Pseudomonas aeruginosa PAK and PAO pili.

PubMed Central

Lee, K K; Paranchych, W; Hodges, R S

1990-01-01

Antipeptide antibodies were raised against synthetic peptides corresponding to the amino acid sequences of eight surface predicted regions of the pilin proteins from Pseudomonas aeruginosa PAK and PAO. Four of the anti-PAK peptide antisera cross-reacted with strain PAO pili, while five anti-PAO peptide antisera cross-reacted with strain PAK pili. Only one region of the two pilin proteins (region 88-97) provided strain-specific antibodies when either strain PAK or strain PAO region 88-97 peptides were used to generate antipeptide antibodies. Our results clearly showed that cross-reactive and strain-specific antibodies cannot be based solely on the degree of homology in the aligned protein sequences. The majority of synthetic peptides bound to their homologous antipilus antiserum, suggesting that linear sequences play a significant role in the immunogenic response of native pili. PMID:1974884
Immunoreactive prohormone atrial natriuretic peptides 1-30 and 31-67 - Existence of a single circulating amino-terminal peptide

NASA Technical Reports Server (NTRS)

Chen, Yu-Ming; Whitson, Peggy A.; Cintron, Nitza M.

1990-01-01

Sep-Pak C18 extraction of human plasma and radioimmunoassay using antibodies which recognize atrial natriuretic peptide (99-128) and the prohormone sequences 1-30 and 31-67 resulted in mean values from 20 normal subjects of 26.2 (+/- 9.2), 362 (+/- 173) and 368 (+/- 160) pg/ml, respectively. A high correlation coefficient between values obtained using antibodies recognizing prohormone sequences 1-30 and 31-67 was observed (R = 0.84). Extracted plasma immunoreactivity of 1-30 and 31-67 both eluted at 46 percent acetonitrile. In contrast, chromatographic elution of synthetic peptides 1-30 and 31-67 was observed at 48 and 39 percent acetonitrile, respectively. Data suggest that the radioimmunoassay of plasma using antibodies recognizing prohormone sequences 1-30 and 31-67 may represent the measurement of a unique larger amino-terminal peptide fragment containing antigenic sites recognized by both antisera.
Multifactorial Understanding of Ion Abundance in Tandem Mass Spectrometry Experiments.

PubMed

Fazal, Zeeshan; Southey, Bruce R; Sweedler, Jonathan V; Rodriguez-Zas, Sandra L

2013-01-29

In a bottom-up shotgun approach, the proteins of a mixture are enzymatically digested, separated, and analyzed via tandem mass spectrometry. The mass spectra relating fragment ion intensities (abundance) to the mass-to-charge are used to deduce the amino acid sequence and identify the peptides and proteins. The variables that influence intensity were characterized using a multi-factorial mixed-effects model, a ten-fold cross-validation, and stepwise feature selection on 6,352,528 fragment ions from 61,543 peptide ions. Intensity was higher in fragment ions that did not have neutral mass loss relative to any mass loss or that had a +1 charge state. Peptide ions classified for proton mobility as non-mobile had lowest intensity of all mobility levels. Higher basic residue (arginine, lysine or histidine) counts in the peptide ion and low counts in the fragment ion were associated with lower fragment ion intensities. Higher counts of proline in peptide and fragment ions were associated with lower intensities. These results are consistent with the mobile proton theory. Opposite trends between peptide and fragment ion counts and intensity may be due to the different impact of factor under consideration at different stages of the MS/MS experiment or to the different distribution of observations across peptide and fragment ion levels. Presence of basic residues at all three positions next to the fragmentation site was associated with lower fragment ion intensity. The presence of proline proximal to the fragmentation site enhanced fragmentation and had the opposite trend when located distant from the site. A positive association between fragment ion intensity and presence of sulfur residues (cysteine and methionine) on the vicinity of the fragmentation site was identified. These results highlight the multi-factorial nature of fragment ion intensity and could improve the algorithms for peptide identification and the simulation in tandem mass spectrometry experiments.
Multifactorial Understanding of Ion Abundance in Tandem Mass Spectrometry Experiments

PubMed Central

Fazal, Zeeshan; Southey, Bruce R; Sweedler, Jonathan V.; Rodriguez-Zas, Sandra L.

2013-01-01

In a bottom-up shotgun approach, the proteins of a mixture are enzymatically digested, separated, and analyzed via tandem mass spectrometry. The mass spectra relating fragment ion intensities (abundance) to the mass-to-charge are used to deduce the amino acid sequence and identify the peptides and proteins. The variables that influence intensity were characterized using a multi-factorial mixed-effects model, a ten-fold cross-validation, and stepwise feature selection on 6,352,528 fragment ions from 61,543 peptide ions. Intensity was higher in fragment ions that did not have neutral mass loss relative to any mass loss or that had a +1 charge state. Peptide ions classified for proton mobility as non-mobile had lowest intensity of all mobility levels. Higher basic residue (arginine, lysine or histidine) counts in the peptide ion and low counts in the fragment ion were associated with lower fragment ion intensities. Higher counts of proline in peptide and fragment ions were associated with lower intensities. These results are consistent with the mobile proton theory. Opposite trends between peptide and fragment ion counts and intensity may be due to the different impact of factor under consideration at different stages of the MS/MS experiment or to the different distribution of observations across peptide and fragment ion levels. Presence of basic residues at all three positions next to the fragmentation site was associated with lower fragment ion intensity. The presence of proline proximal to the fragmentation site enhanced fragmentation and had the opposite trend when located distant from the site. A positive association between fragment ion intensity and presence of sulfur residues (cysteine and methionine) on the vicinity of the fragmentation site was identified. These results highlight the multi-factorial nature of fragment ion intensity and could improve the algorithms for peptide identification and the simulation in tandem mass spectrometry experiments. PMID:24031159
Novel Group of Leaderless Multipeptide Bacteriocins from Gram-Positive Bacteria.

PubMed

Ovchinnikov, Kirill V; Chi, Hai; Mehmeti, Ibrahim; Holo, Helge; Nes, Ingolf F; Diep, Dzung B

2016-09-01

From raw milk we found 10 Lactococcus garvieae isolates that produce a new broad-spectrum bacteriocin. Though the isolates were obtained from different farms, they turned out to possess identical inhibitory spectra, fermentation profiles of sugars, and repetitive sequence-based PCR (rep-PCR) DNA patterns, indicating that they produce the same bacteriocin. One of the isolates (L. garvieae KS1546) was chosen for further assessment. Purification and peptide sequencing combined with genome sequencing revealed that the antimicrobial activity was due to a bacteriocin unit composed of three similar peptides of 32 to 34 amino acids. The three peptides are produced without leader sequences, and their genes are located next to each other in an operon-like structure, adjacent to the genes normally involved in bacteriocin transport (ABC transporter) and self-immunity. The bacteriocin, termed garvicin KS (GarKS), showed sequence homology to four multipeptide bacteriocins in databases: the known staphylococcal aureocin A70, consisting of four peptides, and three unannotated putative multipeptide bacteriocins produced by Bacillus cereus All these multipeptide bacteriocin loci show conserved genetic organization, including being located adjacent to conserved genetic determinants (Cro/cI and integrase) which are normally associated with mobile genetic elements or genome rearrangements. The antimicrobial activity of all multipeptide bacteriocins was confirmed with synthetic peptides, and all were shown to have broad antimicrobial spectra, with GarKS being the most active of them. The inhibitory spectrum of GarKS includes important pathogens belonging to the genera Staphylococcus, Bacillus, Listeria, and Enterococcus Bacterial resistance to antibiotics is a very serious global problem. There are no new antibiotics with novel antimicrobial mechanisms in clinical trials. Bacteriocins use antimicrobial mechanisms different from those of antibiotics and can kill antibiotic-resistant bacteria, but the number of bacteriocins with very broad antimicrobial spectra is very small. In this study, we have found and purified a novel three-peptide bacteriocin, garvicin KS. By homology search, we were able to find one known and three novel sequence-related bacteriocins consisting of 3 or 4 peptides. None of the peptides has modified amino acids in its sequence. Thus, the activity of all bacteriocins was confirmed with chemically synthesized peptides. All of them, especially garvicin KS, have very broad antibacterial spectra, thus representing a great potential in antimicrobial applications in the food industry and medicine. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

Novel Group of Leaderless Multipeptide Bacteriocins from Gram-Positive Bacteria

PubMed Central

Chi, Hai; Mehmeti, Ibrahim; Holo, Helge; Nes, Ingolf F.

2016-01-01

ABSTRACT From raw milk we found 10 Lactococcus garvieae isolates that produce a new broad-spectrum bacteriocin. Though the isolates were obtained from different farms, they turned out to possess identical inhibitory spectra, fermentation profiles of sugars, and repetitive sequence-based PCR (rep-PCR) DNA patterns, indicating that they produce the same bacteriocin. One of the isolates (L. garvieae KS1546) was chosen for further assessment. Purification and peptide sequencing combined with genome sequencing revealed that the antimicrobial activity was due to a bacteriocin unit composed of three similar peptides of 32 to 34 amino acids. The three peptides are produced without leader sequences, and their genes are located next to each other in an operon-like structure, adjacent to the genes normally involved in bacteriocin transport (ABC transporter) and self-immunity. The bacteriocin, termed garvicin KS (GarKS), showed sequence homology to four multipeptide bacteriocins in databases: the known staphylococcal aureocin A70, consisting of four peptides, and three unannotated putative multipeptide bacteriocins produced by Bacillus cereus. All these multipeptide bacteriocin loci show conserved genetic organization, including being located adjacent to conserved genetic determinants (Cro/cI and integrase) which are normally associated with mobile genetic elements or genome rearrangements. The antimicrobial activity of all multipeptide bacteriocins was confirmed with synthetic peptides, and all were shown to have broad antimicrobial spectra, with GarKS being the most active of them. The inhibitory spectrum of GarKS includes important pathogens belonging to the genera Staphylococcus, Bacillus, Listeria, and Enterococcus. IMPORTANCE Bacterial resistance to antibiotics is a very serious global problem. There are no new antibiotics with novel antimicrobial mechanisms in clinical trials. Bacteriocins use antimicrobial mechanisms different from those of antibiotics and can kill antibiotic-resistant bacteria, but the number of bacteriocins with very broad antimicrobial spectra is very small. In this study, we have found and purified a novel three-peptide bacteriocin, garvicin KS. By homology search, we were able to find one known and three novel sequence-related bacteriocins consisting of 3 or 4 peptides. None of the peptides has modified amino acids in its sequence. Thus, the activity of all bacteriocins was confirmed with chemically synthesized peptides. All of them, especially garvicin KS, have very broad antibacterial spectra, thus representing a great potential in antimicrobial applications in the food industry and medicine. PMID:27316965
Venom characterization of the Amazonian scorpion Tityus metuendus.

PubMed

Batista, C V F; Martins, J G; Restano-Cassulini, R; Coronas, F I V; Zamudio, F Z; Procópio, R; Possani, L D

2018-03-01

The soluble venom from the scorpion Tityus metuendus was characterized by various methods. In vivo experiments with mice showed that it is lethal. Extended electrophysiological recordings using seven sub-types of human voltage gated sodium channels (hNav1.1 to 1.7) showed that it contains both α- and β-scorpion toxin types. Fingerprint analysis by mass spectrometry identified over 200 distinct molecular mass components. At least 60 sub-fractions were recovered from HPLC separation. Five purified peptides were sequenced by Edman degradation, and their complete primary structures were determined. Additionally, three other peptides have had their N-terminal amino acid sequences determined by Edman degradation and reported. Mass spectrometry analysis of tryptic digestion of the soluble venom permitted the identification of the amino acid sequence of 111 different peptides. Search for similarities of the sequences found indicated that they probably are: sodium and potassium channel toxins, metalloproteinases, hyaluronidases, endothelin and angiotensin-converting enzymes, bradykinin-potentiating peptide, hypothetical proteins, allergens, other enzymes, other proteins and peptides. Copyright © 2018 Elsevier Ltd. All rights reserved.
Purification and sequence of rat oxyntomodulin.

PubMed Central

Collie, N L; Walsh, J H; Wong, H C; Shively, J E; Davis, M T; Lee, T D; Reeve, J R

1994-01-01

Structural information about rat enteroglucagon, intestinal peptides containing the pancreatic glucagon sequence, has been based previously on cDNA, immunologic, and chromatographic data. Our interests in testing the physiological actions of synthetic enteroglucagon peptides in rats required that we identify precisely the forms present in vivo. From knowledge of the proglucagon gene sequence, we synthesized an enteroglucagon C-terminal octapeptide common to both proposed enteroglucagon forms, glicentin and oxyntomodulin, but sharing no sequence overlap with glucagon. We then developed a radioimmunoassay using antibodies raised against the octapeptide that was specific for enteroglucagon peptides without cross-reacting with glucagon. Rat intestine was extracted, and one presumptive enteroglucagon form was purified by following the enteroglucagon C-terminal octapeptide-like immunoreactivity through several HPLC purification steps. Structural characterization of the material by amino acid composition, microsequence, and mass spectral analyses identified the peptide as rat oxyntomodulin. The 37-residue peptide consists of pancreatic glucagon plus the C-terminal extension, Lys-Arg-Asn-Arg-Asn-Asn-Ile-Ala. This now permits synthesis of an unambiguous duplicate of endogenous rat oxyntomodulin for physiological studies. Images PMID:7937770
Characterization of a molt-inhibiting hormone (MIH) of the crayfish, Orconectes limosus, by cDNA cloning and mass spectrometric analysis.

PubMed

Bulau, Patrick; Okuno, Atsuro; Thome, Elke; Schmitz, Tina; Peter-Katalinic, Jasna; Keller, Rainer

2005-11-01

The structure of the precursor of a molt-inhibiting hormone (MIH) of the American crayfish, Orconectes limosus was determined by cloning of a cDNA based on RNA from the neurosecretory perikarya of the X-organ in the eyestalk ganglia. The open reading frame includes the complete precursor sequence, consisting of a signal peptide of 29, and the MIH sequence of 77 amino acids. In addition, the mature peptide was isolated by HPLC from the neurohemal sinus gland and analyzed by ESI-MS and MALDI-TOF-MS peptide mapping. This showed that the mature peptide (Mass 8664.29 Da) consists of only 75 amino acids, having Ala75-NH2 as C-terminus. Thus, C-terminal Arg77 of the precursor is removed during processing, and Gly76 serves as an amide donor. Sequence comparison confirms this peptide as a novel member of the large family, which includes crustacean hyperglycaemic hormone (CHH), MIH and gonad (vitellogenesis)-inhibiting hormone (GIH/VIH). The lack of a CPRP (CHH-precursor related peptide) in the hormone precursor, the size and specific sequence characteristics show that Orl MIH belongs to the MIH/GIH(VIH) subgroup of this larger family. Comparison with the MIH of Procambarus clarkii, the only other MIH that has thus far been identified in freshwater crayfish, shows extremely high sequence conservation. Both MIHs differ in only one amino acid residue ( approximately 99% identity), whereas the sequence identity to several other known MIHs is between 40 and 46%.
Exploitation of peptide motif sequences and their use in nanobiotechnology.

PubMed

Shiba, Kiyotaka

2010-08-01

Short amino acid sequences extracted from natural proteins or created using in vitro evolution systems are sometimes associated with particular biological functions. These peptides, called peptide motifs, can serve as functional units for the creation of various tools for nanobiotechnology. In particular, peptide motifs that have the ability to specifically recognize the surfaces of solid materials and to mineralize certain inorganic materials have been linking biological science to material science. Here, I review how these peptide motifs have been isolated from natural proteins or created using in vitro evolution systems, and how they have been used in the nanobiotechnology field. Copyright © 2010 Elsevier Ltd. All rights reserved.
SATPdb: a database of structurally annotated therapeutic peptides

PubMed Central

Singh, Sandeep; Chaudhary, Kumardeep; Dhanda, Sandeep Kumar; Bhalla, Sherry; Usmani, Salman Sadullah; Gautam, Ankur; Tuknait, Abhishek; Agrawal, Piyush; Mathur, Deepika; Raghava, Gajendra P.S.

2016-01-01

SATPdb (http://crdd.osdd.net/raghava/satpdb/) is a database of structurally annotated therapeutic peptides, curated from 22 public domain peptide databases/datasets including 9 of our own. The current version holds 19192 unique experimentally validated therapeutic peptide sequences having length between 2 and 50 amino acids. It covers peptides having natural, non-natural and modified residues. These peptides were systematically grouped into 10 categories based on their major function or therapeutic property like 1099 anticancer, 10585 antimicrobial, 1642 drug delivery and 1698 antihypertensive peptides. We assigned or annotated structure of these therapeutic peptides using structural databases (Protein Data Bank) and state-of-the-art structure prediction methods like I-TASSER, HHsearch and PEPstrMOD. In addition, SATPdb facilitates users in performing various tasks that include: (i) structure and sequence similarity search, (ii) peptide browsing based on their function and properties, (iii) identification of moonlighting peptides and (iv) searching of peptides having desired structure and therapeutic activities. We hope this database will be useful for researchers working in the field of peptide-based therapeutics. PMID:26527728
Biodegradable copolymers carrying cell-adhesion peptide sequences.

PubMed

Proks, Vladimír; Machová, Lud'ka; Popelka, Stepán; Rypácek, Frantisek

2003-01-01

Amphiphilic block copolymers are used to create bioactive surfaces on biodegradable polymer scaffolds for tissue engineering. Cell-selective biomaterials can be prepared using copolymers containing peptide sequences derived from extracellular-matrix proteins (ECM). Here we discuss alternative ways for preparation of amphiphilic block copolymers composed of hydrophobic polylactide (PLA) and hydrophilic poly(ethylene oxide) (PEO) blocks with cell-adhesion peptide sequences. Copolymers PLA-b-PEO were prepared by a living polymerisation of lactide in dioxane with tin(II)2-ethylhexanoate as a catalyst. The following approaches for incorporation of peptides into copolymers were elaborated. (a) First, a side-chain protected Gly-Arg-Gly-Asp-Ser-Gly (GRGDSG) peptide was prepared by solid-phase peptide synthesis (SPPS) and then coupled with delta-hydroxy-Z-amino-PEO in solution. In the second step, the PLA block was grafted to it via a controlled polymerisation of lactide initiated by the hydroxy end-groups of PEO in the side-chain-protected GRGDSG-PEO. Deprotection of the peptide yielded a GRGDSG-b-PEO-b-PLA copolymer, with the peptide attached through its C-end. (b) A protected GRGDSG peptide was built up on a polymer resin and coupled with Z-carboxy-PEO using a solid-phase approach. After cleavage of the delta-hydroxy-PEO-GRGDSG copolymer from the resin, polymerisation of lactide followed by deprotection of the peptide yielded a PLA-b-PEO-b-GRGDSG block copolymer, in which the peptide is linked through its N-terminus.
Mass spectrometry analysis and in silico prediction of allergenicity of peptides in tryptic hydrolysates of the proteins from Ruditapes philippinarum.

PubMed

Yu, Yue; Liu, Hongwei; Tu, Maolin; Qiao, Meiling; Wang, Zhenyu; Du, Ming

2017-12-01

Ruditapes philippinarum is nutrient-rich and widely-distributed, but little attention has been paid to the identification and characterization of the bioactive peptides in the bivalve. In the present study, we evaluated the peptides of the R. philippinarum that were enzymolysised by trypsin using a combination of ultra-performance liquid chromatography separation and electrospray ionization quadrupole time-of-flight tandem mass spectrometry, followed by data processing and sequence-similarity database searching. The potential allergenicity of the peptides was assessed in silico. The enzymolysis was performed under the conditions: E:S 3:100 (w/w), pH 9.0, 45 °C for 4 h. After separation and detection, the Swiss-Prot database and a Ruditapes philippinarum sequence database were used: 966 unique peptides were identified by non-error tolerant database searching; 173 peptides matching 55 precursor proteins comprised highly conserved cytoskeleton proteins. The remaining 793 peptides were identified from the R. philippinarum sequence database. The results showed that 510 peptides were labeled as allergens and 31 peptides were potential allergens; 425 peptides were predicted to be nonallergenic. The abundant peptide information contributes to further investigations of the structure and potential function of R. philippinarum. Additional in vitro studies are required to demonstrate and ensure the correct production of the hydrolysates for use in the food industry with respect to R. philippinarum. © 2017 Society of Chemical Industry. © 2017 Society of Chemical Industry.
Molecular classification of liver cirrhosis in a rat model by proteomics and bioinformatics.

PubMed

Xu, Xiu-Qin; Leow, Chon K; Lu, Xin; Zhang, Xuegong; Liu, Jun S; Wong, Wing-Hung; Asperger, Arndt; Deininger, Sören; Eastwood Leung, Hon-Chiu

2004-10-01

Liver cirrhosis is a worldwide health problem. Reliable, noninvasive methods for early detection of liver cirrhosis are not available. Using a three-step approach, we classified sera from rats with liver cirrhosis following different treatment insults. The approach consisted of: (i) protein profiling using surface-enhanced laser desorption/ionization (SELDI) technology; (ii) selection of a statistically significant serum biomarker set using machine learning algorithms; and (iii) identification of selected serum biomarkers by peptide sequencing. We generated serum protein profiles from three groups of rats: (i) normal (n=8), (ii) thioacetamide-induced liver cirrhosis (n=22), and (iii) bile duct ligation-induced liver fibrosis (n=5) using a weak cation exchanger surface. Profiling data were further analyzed by a recursive support vector machine algorithm to select a panel of statistically significant biomarkers for class prediction. Sensitivity and specificity of classification using the selected protein marker set were higher than 92%. A consistently down-regulated 3495 Da protein in cirrhosis samples was one of the selected significant biomarkers. This 3495 Da protein was purified on-chip and trypsin digested. Further structural characterization of this biomarkers candidate was done by using cross-platform matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS) peptide mass fingerprinting (PMF) and matrix-assisted laser desorption/ionization time of flight/time of flight (MALDI-TOF/TOF) tandem mass spectrometry (MS/MS). Combined data from PMF and MS/MS spectra of two tryptic peptides suggested that this 3495 Da protein shared homology to a histidine-rich glycoprotein. These results demonstrated a novel approach to discovery of new biomarkers for early detection of liver cirrhosis and classification of liver diseases.
Recombinant protein secretion in Pseudozyma flocculosa and Pseudozyma antarctica with a novel signal peptide.

PubMed

Cheng, Yali; Avis, Tyler J; Bolduc, Sébastien; Zhao, Yingyi; Anguenot, Raphaël; Neveu, Bertrand; Labbé, Caroline; Belzile, François; Bélanger, Richard R

2008-12-01

Secretion of recombinant proteins aims to reproduce the correct posttranslational modifications of the expressed protein while simplifying its recovery. In this study, secretion signal sequences from an abundantly secreted 34-kDa protein (P34) from Pseudozyma flocculosa were cloned. The efficiency of these sequences in the secretion of recombinant green fluorescent protein (GFP) was investigated in two Pseudozyma species and compared with other secretion signal sequences, from S. cerevisiae and Pseudozyma spp. The results indicate that various secretion signal sequences were functional and that the P34 signal peptide was the most effective secretion signal sequence in both P. flocculosa and P. antarctica. The cells correctly processed the secretion signal sequences, including P34 signal peptide, and mature GFP was recovered from the culture medium. This is the first report of functional secretion signal sequences in P. flocculosa. These sequences can be used to test the secretion of other recombinant proteins and for studying the secretion pathway in P. flocculosa and P. antarctica.
LuciPHOr: Algorithm for Phosphorylation Site Localization with False Localization Rate Estimation Using Modified Target-Decoy Approach*

PubMed Central

Fermin, Damian; Walmsley, Scott J.; Gingras, Anne-Claude; Choi, Hyungwon; Nesvizhskii, Alexey I.

2013-01-01

The localization of phosphorylation sites in peptide sequences is a challenging problem in large-scale phosphoproteomics analysis. The intense neutral loss peaks and the coexistence of multiple serine/threonine and/or tyrosine residues are limiting factors for objectively scoring site patterns across thousands of peptides. Various computational approaches for phosphorylation site localization have been proposed, including Ascore, Mascot Delta score, and ProteinProspector, yet few address direct estimation of the false localization rate (FLR) in each experiment. Here we propose LuciPHOr, a modified target-decoy-based approach that uses mass accuracy and peak intensities for site localization scoring and FLR estimation. Accurate estimation of the FLR is a difficult task at the individual-site level because the degree of uncertainty in localization varies significantly across different peptides. LuciPHOr carries out simultaneous localization on all candidate sites in each peptide and estimates the FLR based on the target-decoy framework, where decoy phosphopeptides generated by placing artificial phosphorylation(s) on non-candidate residues compete with the non-decoy phosphopeptides. LuciPHOr also reports approximate site-level confidence scores for all candidate sites as a means to localize additional sites from multiphosphorylated peptides in which localization can be partially achieved. Unlike the existing tools, LuciPHOr is compatible with any search engine output processed through the Trans-Proteomic Pipeline. We evaluated the performance of LuciPHOr in terms of the sensitivity and accuracy of FLR estimates using two synthetic phosphopeptide libraries and a phosphoproteomic dataset generated from complex mouse brain samples. PMID:23918812
Molecular Diversity and Gene Evolution of the Venom Arsenal of Terebridae Predatory Marine Snails

PubMed Central

Gorson, Juliette; Ramrattan, Girish; Verdes, Aida; Wright, Elizabeth M.; Kantor, Yuri; Rajaram Srinivasan, Ramakrishnan; Musunuri, Raj; Packer, Daniel; Albano, Gabriel; Qiu, Wei-Gang; Holford, Mandë

2015-01-01

Venom peptides from predatory organisms are a resource for investigating evolutionary processes such as adaptive radiation or diversification, and exemplify promising targets for biomedical drug development. Terebridae are an understudied lineage of conoidean snails, which also includes cone snails and turrids. Characterization of cone snail venom peptides, conotoxins, has revealed a cocktail of bioactive compounds used to investigate physiological cellular function, predator-prey interactions, and to develop novel therapeutics. However, venom diversity of other conoidean snails remains poorly understood. The present research applies a venomics approach to characterize novel terebrid venom peptides, teretoxins, from the venom gland transcriptomes of Triplostephanus anilis and Terebra subulata. Next-generation sequencing and de novo assembly identified 139 putative teretoxins that were analyzed for the presence of canonical peptide features as identified in conotoxins. To meet the challenges of de novo assembly, multiple approaches for cross validation of findings were performed to achieve reliable assemblies of venom duct transcriptomes and to obtain a robust portrait of Terebridae venom. Phylogenetic methodology was used to identify 14 teretoxin gene superfamilies for the first time, 13 of which are unique to the Terebridae. Additionally, basic local algorithm search tool homology-based searches to venom-related genes and posttranslational modification enzymes identified a convergence of certain venom proteins, such as actinoporin, commonly found in venoms. This research provides novel insights into venom evolution and recruitment in Conoidean predatory marine snails and identifies a plethora of terebrid venom peptides that can be used to investigate fundamental questions pertaining to gene evolution. PMID:26025559
Uncovering the design rules for peptide synthesis of metal nanoparticles.

PubMed

Tan, Yen Nee; Lee, Jim Yang; Wang, Daniel I C

2010-04-28

Peptides are multifunctional reagents (reducing and capping agents) that can be used for the synthesis of biocompatible metal nanoparticles under relatively mild conditions. However, the progress in peptide synthesis of metal nanoparticles has been slow due to the lack of peptide design rules. It is difficult to establish sequence-reactivity relationships from peptides isolated from biological sources (e.g., biomineralizing organisms) or selected by combinatorial display libraries because of their widely varying compositions and structures. The abundance of random and inactive amino acid sequences in the peptides also increases the difficulty in knowledge extraction. In this study, a "bottom-up" approach was used to formulate a set of rudimentary rules for the size- and shape-controlled peptide synthesis of gold nanoparticles from the properties of the 20 natural alpha-amino acids for AuCl(4)(-) reduction and binding to Au(0). It was discovered that the reduction capability of a peptide depends on the presence of certain reducing amino acid residues, whose activity may be regulated by neighboring residues with different Au(0) binding strengths. Another finding is the effect of peptide net charge on the nucleation and growth of the Au nanoparticles. On the basis of these understandings, several multifunctional peptides were designed to synthesize gold nanoparticles in different morphologies (nanospheres and nanoplates) and with sizes tunable by the strategic placement of selected amino acid residues in the peptide sequence. The methodology presented here and the findings are useful for establishing the scientific basis for the rational design of peptides for the synthesis of metal nanostructures.
Epitaxial Nucleation on Rationally Designed Peptide Functionalized Interface

DTIC Science & Technology

2011-07-19

of 17 amino acid peptides. In this report, we focus on the findings from several variants of these sequences, including the role of charge...separation and histidine-gold coordination. We find that these 17 amino acid peptide sequences behave robustly, where periodicity appears to dominate the...26,27 Secondary structure propensity refers to the intrinsic inclination of individual amino acids to a given secondary structure, where side-group
Crotoxin: Structural Studies, Mechanism of Action and Cloning of Its gene

DTIC Science & Technology

1989-12-01

B-chain. Sequencing of the three peptides present in the acidic subunit, two of which are blocked by pyroglutamate , represents a significant...We have completed the sequence determination of both the basic and acidic subunits of crotoxin. The acidic subunit peptides were difficult, since two...of the three peptides were blocked at the amino-terminus by pyroglutamate . Earlier structural studies on crotoxin and related crotalid dimeric
Antimicrobial Peptides from Plants

PubMed Central

Tam, James P.; Wang, Shujing; Wong, Ka H.; Tan, Wei Liang

2015-01-01

Plant antimicrobial peptides (AMPs) have evolved differently from AMPs from other life forms. They are generally rich in cysteine residues which form multiple disulfides. In turn, the disulfides cross-braced plant AMPs as cystine-rich peptides to confer them with extraordinary high chemical, thermal and proteolytic stability. The cystine-rich or commonly known as cysteine-rich peptides (CRPs) of plant AMPs are classified into families based on their sequence similarity, cysteine motifs that determine their distinctive disulfide bond patterns and tertiary structure fold. Cystine-rich plant AMP families include thionins, defensins, hevein-like peptides, knottin-type peptides (linear and cyclic), lipid transfer proteins, α-hairpinin and snakins family. In addition, there are AMPs which are rich in other amino acids. The ability of plant AMPs to organize into specific families with conserved structural folds that enable sequence variation of non-Cys residues encased in the same scaffold within a particular family to play multiple functions. Furthermore, the ability of plant AMPs to tolerate hypervariable sequences using a conserved scaffold provides diversity to recognize different targets by varying the sequence of the non-cysteine residues. These properties bode well for developing plant AMPs as potential therapeutics and for protection of crops through transgenic methods. This review provides an overview of the major families of plant AMPs, including their structures, functions, and putative mechanisms. PMID:26580629
The Specificity of Trimming of MHC Class I-Presented Peptides in the Endoplasmic Reticulum1

PubMed Central

Hearn, Arron; York, Ian A.; Rock, Kenneth L.

2010-01-01

Aminopeptidases in the endoplasmic reticulum (ER) can cleave antigenic peptides and in so doing either create or destroy MHC class I-presented epitopes. However the specificity of this trimming process overall and of the major ER aminopeptidase ERAP1 in particular is not well understood. This issue is important because peptide trimming influences the magnitude and specificity of CD8 T cell responses. By systematically varying the N-terminal flanking sequences of peptides in a cell free biochemical system and in intact cells, we elucidated the specificity of ERAP1 and of ER trimming overall. ERAP1 can cleave after many amino acids on the N-terminus of epitope precursors but does so at markedly different rates. The specificity seen with purified ERAP1 is similar to that observed for trimming and presentation of epitopes in the ER of intact cells. We define N-terminal sequences that are favorable or unfavorable for antigen presentation in ways that are independent from the epitopes core sequence. When databases of known presented peptides were analyzed, the residues that were preferred for the trimming of model peptide precursors were found to be overrepresented in N-terminal flanking sequences of epitopes generally. These data define key determinants in the specificity of antigen processing. PMID:19828632
Synthetic Molecular Evolution of Membrane-Active Peptides

NASA Astrophysics Data System (ADS)

Wimley, William

The physical chemistry of membrane partitioning largely determines the function of membrane active peptides. Membrane-active peptides have potential utility in many areas, including in the cellular delivery of polar compounds, cancer therapy, biosensor design, and in antibacterial, antiviral and antifungal therapies. Yet, despite decades of research on thousands of known examples, useful sequence-structure-function relationships are essentially unknown. Because peptide-membrane interactions within the highly fluid bilayer are dynamic and heterogeneous, accounts of mechanism are necessarily vague and descriptive, and have little predictive power. This creates a significant roadblock to advances in the field. We are bypassing that roadblock with synthetic molecular evolution: iterative peptide library design and orthogonal high-throughput screening. We start with template sequences that have at least some useful activity, and create small, focused libraries using structural and biophysical principles to design the sequence space around the template. Orthogonal high-throughput screening is used to identify gain-of-function peptides by simultaneously selecting for several different properties (e.g. solubility, activity and toxicity). Multiple generations of iterative library design and screening have enabled the identification of membrane-active sequences with heretofore unknown properties, including clinically relevant, broad-spectrum activity against drug-resistant bacteria and enveloped viruses as well as pH-triggered macromolecular poration.
Smart biomaterials: Surfaces functionalized with proteolytically stable osteoblast-adhesive peptides.

PubMed

Zamuner, Annj; Brun, Paola; Scorzeto, Michele; Sica, Giuseppe; Castagliuolo, Ignazio; Dettin, Monica

2017-09-01

Engineered scaffolds for bone tissue regeneration are designed to promote cell adhesion, growth, proliferation and differentiation. Recently, covalent and selective functionalization of glass and titanium surfaces with an adhesive peptide (HVP) mapped on [351-359] sequence of human Vitronectin allowed to selectively increase osteoblast attachment and adhesion strength in in vitro assays, and to promote osseointegration in in vivo studies. For the first time to our knowledge, in this study we investigated the resistance of adhesion sequences to proteolytic digestion: HVP was completely cleaved after 5 h. In order to overcome the enzymatic degradation of the native peptide under physiological conditions we synthetized three analogues of HVP sequence. A retro-inverted peptide D-2HVP, composed of D amino acids, was completely stable in serum-containing medium. In addition, glass surfaces functionalized with D-2HVP increased human osteoblast adhesion as compared to the native peptide and maintained deposition of calcium. Interestingly, D-2HVP increased expression of IBSP, VTN and SPP1 genes as compared to HVP functionalized surfaces. Total internal reflection fluorescence microscope analysis showed cells with numerous filopodia spread on D-2HVP-functionalized surfaces. Therefore, the D-2HVP sequence is proposed as new osteoblast adhesive peptide with increased bioactivity and high proteolytic resistance.
Isolation and Structural Characterization of Antioxidant Peptides from Degreased Apricot Seed Kernels.

PubMed

Zhang, Haisheng; Xue, Jing; Zhao, Huanxia; Zhao, Xinshuai; Xue, Huanhuan; Sun, Yuhan; Xue, Wanrui

2018-05-03

Background : The composition and sequence of amino acids have a prominent influence on theantioxidant activities of peptides. Objective : A series of isolation and purification experiments was conducted to explore the amino acid sequence of antioxidant peptides, which led to its antioxidation causes. Methods : The degreased apricot seed kernels were hydrolyzed by compound proteases of alkaline protease and flavor protease (3:2, u/u) to prepare apricot seed kernel hydrolysates (ASKH). ASKH were separated into ASKH-A and ASKH-B by dialysis bag. ASKH-B (MW < 3.5 kDa) was further separated into fractions by Sephadex G-25 and G-15 gel-filtration chromatography. Reversed-phase HPLC (RP-HPLC) was performed to separate fraction B4b into two antioxidant peptides (peptide B4b-4 and B4b-6). Results : The amino acid sequences were Val-Leu-Tyr-Ile-Trp and Ser-Val-Pro-Tyr-Glu, respectively. Conclusions : The results suggested that ASKH antioxidant peptides may have potential utility as healthy ingredients and as food preservatives due to their antioxidant activity. Highlights : Materials with regional characteristics were selected to explore, and hydrolysates were identified by RP-HPLC and matrix-assisted laser desorption ionization-time-of-flight-MS to obtain amino acid sequences.

β-Casein(94-123)-derived peptides differently modulate production of mucins in intestinal goblet cells.

PubMed

Plaisancié, Pascale; Boutrou, Rachel; Estienne, Monique; Henry, Gwénaële; Jardin, Julien; Paquet, Armelle; Léonil, Joëlle

2015-02-01

We recently reported the identification of a peptide from yoghurts with promising potential for intestinal health: the sequence (94-123) of bovine β-casein. This peptide, composed of 30 amino acid residues, maintains intestinal homoeostasis through production of the secreted mucin MUC2 and of the transmembrane-associated mucin MUC4. Our study aimed to search for the minimal sequence responsible for the biological activity of β-CN(94-123) by using several strategies based on (i) known bioactive peptides encrypted in β-CN(94-123), (ii) in silico prediction of peptides reactivity and (iii) digestion of β-CN(94-123) by enzymes of intestinal brush border membranes. The revealed sequences were tested in vitro on human intestinal mucus-producing HT29-MTX cells. We demonstrated that β-CN(108-113) (an ACE-inhibitory peptide) and β-CN(114-119) (an opioid peptide named neocasomorphin-6) up-regulated MUC4 expression whereas levels of the secreted mucins MUC2 and MUC5AC remained unchanged. The digestion of β-CN(94-123) by intestinal enzymes showed that the peptides β-CN(94-108) and β-CN(117-123) were present throughout 1·5 to 3 h of digestion, respectively. These two peptides raised MUC5AC expression while β-CN(117-123) also induced a decrease in the level of MUC2 mRNA and protein. In addition, this inhibitory effect was reproduced in airway epithelial cells. In conclusion, β-CN(94-123) is a multifunctional molecule but only the sequence of 30 amino acids has a stimulating effect on the production of MUC2, a crucial factor of intestinal protection.
Evolutionary combinatorial chemistry, a novel tool for SAR studies on peptide transport across the blood-brain barrier. Part 2. Design, synthesis and evaluation of a first generation of peptides.

PubMed

Teixidó, Meritxell; Belda, Ignasi; Zurita, Esther; Llorà, Xavier; Fabre, Myriam; Vilaró, Senén; Albericio, Fernando; Giralt, Ernest

2005-12-01

The use of high-throughput methods in drug discovery allows the generation and testing of a large number of compounds, but at the price of providing redundant information. Evolutionary combinatorial chemistry combines the selection and synthesis of biologically active compounds with artificial intelligence optimization methods, such as genetic algorithms (GA). Drug candidates for the treatment of central nervous system (CNS) disorders must overcome the blood-brain barrier (BBB). This paper reports a new genetic algorithm that searches for the optimal physicochemical properties for peptide transport across the blood-brain barrier. A first generation of peptides has been generated and synthesized. Due to the high content of N-methyl amino acids present in most of these peptides, their syntheses were especially challenging due to over-incorporations, deletions and DKP formations. Distinct fragmentation patterns during peptide cleavage have been identified. The first generation of peptides has been studied by evaluation techniques such as immobilized artificial membrane chromatography (IAMC), a cell-based assay, log Poctanol/water calculations, etc. Finally, a second generation has been proposed. (c) 2005 European Peptide Society and John Wiley & Sons, Ltd.
Inhibition of trypanosomal cysteine proteinases by their propeptides.

PubMed

Lalmanach, G; Lecaille, F; Chagas, J R; Authié, E; Scharfstein, J; Juliano, M A; Gauthier, F

1998-09-25

The ability of the prodomains of trypanosomal cysteine proteinases to inhibit their active form was studied using a set of 23 overlapping 15-mer peptides covering the whole prosequence of congopain, the major cysteine proteinase of Trypanosoma congolense. Three consecutive peptides with a common 5-mer sequence YHNGA were competitive inhibitors of congopain. A shorter synthetic peptide consisting of this 5-mer sequence flanked by two Ala residues (AYHNGAA) also inhibited purified congopain. No residue critical for inhibition was identified in this sequence, but a significant improvement in Ki value was obtained upon N-terminal elongation. Procongopain-derived peptides did not inhibit lysosomal cathepsins B and L but did inhibit native cruzipain (from Dm28c clone epimastigotes), the major cysteine proteinase of Trypanosoma cruzi, the proregion of which also contains the sequence YHNGA. The positioning of the YHNGA inhibitory sequence within the prosegment of trypanosomal proteinases is similar to that covering the active site in the prosegment of cysteine proteinases, the three-dimensional structure of which has been resolved. This strongly suggests that trypanosomal proteinases, despite their long C-terminal extension, have a prosegment that folds similarly to that in related mammal and plant cysteine proteinases, resulting in reverse binding within the active site. Such reverse binding could also occur for short procongopain-derived inhibitory peptides, based on their resistance to proteolysis and their ability to retain inhibitory activity after prolonged incubation. In contrast, homologous peptides in related cysteine proteinases did not inhibit trypanosomal proteinases and were rapidly cleaved by these enzymes.
Exploiting proteomic data for genome annotation and gene model validation in Aspergillus niger.

PubMed

Wright, James C; Sugden, Deana; Francis-McIntyre, Sue; Riba-Garcia, Isabel; Gaskell, Simon J; Grigoriev, Igor V; Baker, Scott E; Beynon, Robert J; Hubbard, Simon J

2009-02-04

Proteomic data is a potentially rich, but arguably unexploited, data source for genome annotation. Peptide identifications from tandem mass spectrometry provide prima facie evidence for gene predictions and can discriminate over a set of candidate gene models. Here we apply this to the recently sequenced Aspergillus niger fungal genome from the Joint Genome Institutes (JGI) and another predicted protein set from another A.niger sequence. Tandem mass spectra (MS/MS) were acquired from 1d gel electrophoresis bands and searched against all available gene models using Average Peptide Scoring (APS) and reverse database searching to produce confident identifications at an acceptable false discovery rate (FDR). 405 identified peptide sequences were mapped to 214 different A.niger genomic loci to which 4093 predicted gene models clustered, 2872 of which contained the mapped peptides. Interestingly, 13 (6%) of these loci either had no preferred predicted gene model or the genome annotators' chosen "best" model for that genomic locus was not found to be the most parsimonious match to the identified peptides. The peptides identified also boosted confidence in predicted gene structures spanning 54 introns from different gene models. This work highlights the potential of integrating experimental proteomics data into genomic annotation pipelines much as expressed sequence tag (EST) data has been. A comparison of the published genome from another strain of A.niger sequenced by DSM showed that a number of the gene models or proteins with proteomics evidence did not occur in both genomes, further highlighting the utility of the method.
Insecticidal components from field pea extracts: sequences of some variants of pea albumin 1b.

PubMed

Taylor, Wesley G; Sutherland, Daniel H; Olson, Douglas J H; Ross, Andrew R S; Fields, Paul G

2004-12-15

Methanol soluble insecticidal peptides with masses of 3752, 3757, and 3805 Da, isolated from crude extracts (C8 extracts) derived from the protein-enriched flour of commercial field peas [Pisum sativum (L.)], were purified by reversed phase chromatography and, after reduction and alkylation, were sequenced by matrix-assisted laser desorption/ionization (MALDI) time-of-flight mass spectrometry with the aid of various peptidases. These major peptides were variants of pea albumin 1b (PA1b) with methionine sulfoxide rather than methionine at position 12. Peptide 3752 showed additional variations at positions 29 (valine for isoleucine) and 34 (histidine for asparagine). A minor, 37 amino acid peptide with a molecular mass of 3788 Da was also sequenced and differed from a known PA1b variant at positions 1, 25, and 31. Sequence variants of PA1b with their molecular masses were compiled, and variants that matched the accurate masses of the experimental peptides were used to narrow the search. MALDI postsource decay experiments on pronase fragments helped to confirm the sequences. Whole and dehulled field peas gave insecticidal C8 extracts in the laboratory that were enriched in peptides with masses of 3736, 3741, and 3789 Da, as determined by high-performance liquid chromatography (HPLC) and electrospray ionization mass spectrometry. It was therefore concluded that oxidation of the methionine residues to methionine sulfoxide occurred primarily during the processing of dehulled peas in a mill.
Discovery of Undefined Protein Crosslinking Chemistry: A Comprehensive Methodology Utilizing 18O-labeling and Mass Spectrometry

PubMed Central

Liu, Min; Zhang, Zhongqi; Zang, Tianzhu; Spahr, Chris; Cheetham, Janet; Ren, Da; Sunny Zhou, Zhaohui

2013-01-01

Characterization of protein crosslinking, particularly without prior knowledge of the chemical nature and site of crosslinking, poses a significant challenge due to their intrinsic structural complexity and the lack of a comprehensive analytical approach. Towards this end, we have developed a generally applicable workflow—XChem-Finder that involves four stages. (1) Detection of crosslinked peptides via 18O-labeling at C-termini. (2) Determination of the putative partial sequences of each crosslinked peptide pair using a fragment ion mass database search against known protein sequences coupled with a de novo sequence tag search. (3) Extension to full sequences based on protease specificity, the unique combination of mass, and other constraints. (4) Deduction of crosslinking chemistry and site. The mass difference between the sum of two putative full-length peptides and the crosslinked peptide provides the formulas (elemental composition analysis) for the functional groups involved in each cross- linking. Combined with sequence restraint from MS/MS data, plausible crosslinking chemistry and site were inferred, and ultimately, confirmed by matching with all data. Applying our approach to a stressed IgG2 antibody, ten cross-linked peptides were discovered and found to be connected via thioether originating from disulfides at locations that had not been previously recognized. Furthermore, once the crosslink chemistry was revealed, a targeted crosslink search yielded four additional crosslinked peptides that all contain the C-terminus of the light chain. PMID:23634697
Delivery of siRNA using ternary complexes containing branched cationic peptides: the role of peptide sequence, branching and targeting.

PubMed

Kudsiova, Laila; Welser, Katharina; Campbell, Frederick; Mohammadi, Atefeh; Dawson, Natalie; Cui, Lili; Hailes, Helen C; Lawrence, M Jayne; Tabor, Alethea B

2016-03-01

Ternary nanocomplexes, composed of bifunctional cationic peptides, lipids and siRNA, as delivery vehicles for siRNA have been investigated. The study is the first to determine the optimal sequence and architecture of the bifunctional cationic peptide used for siRNA packaging and delivery using lipopolyplexes. Specifically three series of cationic peptides of differing sequence, degrees of branching and cell-targeting sequences were co-formulated with siRNA and vesicles prepared from a 1 : 1 molar ratio of the cationic lipid DOTMA and the helper lipid, DOPE. The level of siRNA knockdown achieved in the human alveolar cell line, A549-luc cells, in both reduced serum and in serum supplemented media was evaluated, and the results correlated to the nanocomplex structure (established using a range of physico-chemical tools, namely small angle neutron scattering, transmission electron microscopy, dynamic light scattering and zeta potential measurement); the conformational properties of each component (circular dichroism); the degree of protection of the siRNA in the lipopolyplex (using gel shift assays) and to the cellular uptake, localisation and toxicity of the nanocomplexes (confocal microscopy). Although the size, charge, structure and stability of the various lipopolyplexes were broadly similar, it was clear that lipopolyplexes formulated from branched peptides containing His-Lys sequences perform best as siRNA delivery agents in serum, with protection of the siRNA in serum balanced against efficient release of the siRNA into the cytoplasm of the cell.
From amino acid sequence to bioactivity: The biomedical potential of antitumor peptides.

PubMed

Blanco-Míguez, Aitor; Gutiérrez-Jácome, Alberto; Pérez-Pérez, Martín; Pérez-Rodríguez, Gael; Catalán-García, Sandra; Fdez-Riverola, Florentino; Lourenço, Anália; Sánchez, Borja

2016-06-01

Chemoprevention is the use of natural and/or synthetic substances to block, reverse, or retard the process of carcinogenesis. In this field, the use of antitumor peptides is of interest as, (i) these molecules are small in size, (ii) they show good cell diffusion and permeability, (iii) they affect one or more specific molecular pathways involved in carcinogenesis, and (iv) they are not usually genotoxic. We have checked the Web of Science Database (23/11/2015) in order to collect papers reporting on bioactive peptide (1691 registers), which was further filtered searching terms such as "antiproliferative," "antitumoral," or "apoptosis" among others. Works reporting the amino acid sequence of an antiproliferative peptide were kept (60 registers), and this was complemented with the peptides included in CancerPPD, an extensive resource for antiproliferative peptides and proteins. Peptides were grouped according to one of the following mechanism of action: inhibition of cell migration, inhibition of tumor angiogenesis, antioxidative mechanisms, inhibition of gene transcription/cell proliferation, induction of apoptosis, disorganization of tubulin structure, cytotoxicity, or unknown mechanisms. The main mechanisms of action of those antiproliferative peptides with known amino acid sequences are presented and finally, their potential clinical usefulness and future challenges on their application is discussed. © 2016 The Protein Society.
Antibacterial Activity of Synthetic Peptides Derived from Lactoferricin against Escherichia coli ATCC 25922 and Enterococcus faecalis ATCC 29212

PubMed Central

León-Calvijo, María A.; Leal-Castro, Aura L.; Almanzar-Reina, Giovanni A.; Rosas-Pérez, Jaiver E.; García-Castañeda, Javier E.; Rivera-Monroy, Zuly J.

2015-01-01

Peptides derived from human and bovine lactoferricin were designed, synthesized, purified, and characterized using RP-HPLC and MALDI-TOF-MS. Specific changes in the sequences were designed as (i) the incorporation of unnatural amino acids in the sequence, the (ii) reduction or (iii) elongation of the peptide chain length, and (iv) synthesis of molecules with different number of branches containing the same sequence. For each peptide, the antibacterial activity against Escherichia coli ATCC 25922 and Enterococcus faecalis ATCC 29212 was evaluated. Our results showed that Peptides I.2 (RWQWRWQWR) and I.4 ((RRWQWR)4K2 Ahx 2C2) exhibit bigger or similar activity against E. coli (MIC 4–33 μM) and E. faecalis (MIC 10–33 μM) when they were compared with lactoferricin protein (LF) and some of its derivate peptides as II.1 (FKCRRWQWRMKKLGA) and IV.1 (FKCRRWQWRMKKLGAPSITCVRRAE). It should be pointed out that Peptides I.2 and I.4, containing the RWQWR motif, are short and easy to synthesize; our results demonstrate that it is possible to design and obtain synthetic peptides that exhibit enhanced antibacterial activity using a methodology that is fast and low-cost and that allows obtaining products with a high degree of purity and high yield. PMID:25815317
Antibacterial activity of synthetic peptides derived from lactoferricin against Escherichia coli ATCC 25922 and Enterococcus faecalis ATCC 29212.

PubMed

León-Calvijo, María A; Leal-Castro, Aura L; Almanzar-Reina, Giovanni A; Rosas-Pérez, Jaiver E; García-Castañeda, Javier E; Rivera-Monroy, Zuly J

2015-01-01

Peptides derived from human and bovine lactoferricin were designed, synthesized, purified, and characterized using RP-HPLC and MALDI-TOF-MS. Specific changes in the sequences were designed as (i) the incorporation of unnatural amino acids in the sequence, the (ii) reduction or (iii) elongation of the peptide chain length, and (iv) synthesis of molecules with different number of branches containing the same sequence. For each peptide, the antibacterial activity against Escherichia coli ATCC 25922 and Enterococcus faecalis ATCC 29212 was evaluated. Our results showed that Peptides I.2 (RWQWRWQWR) and I.4 ((RRWQWR)4K2Ahx2C2) exhibit bigger or similar activity against E. coli (MIC 4-33 μM) and E. faecalis (MIC 10-33 μM) when they were compared with lactoferricin protein (LF) and some of its derivate peptides as II.1 (FKCRRWQWRMKKLGA) and IV.1 (FKCRRWQWRMKKLGAPSITCVRRAE). It should be pointed out that Peptides I.2 and I.4, containing the RWQWR motif, are short and easy to synthesize; our results demonstrate that it is possible to design and obtain synthetic peptides that exhibit enhanced antibacterial activity using a methodology that is fast and low-cost and that allows obtaining products with a high degree of purity and high yield.
Design, synthesis and DNA interactions of a chimera between a platinum complex and an IHF mimicking peptide.

PubMed

Rao, Harita; Damian, Mariana S; Alshiekh, Alak; Elmroth, Sofi K C; Diederichsen, Ulf

2015-12-28

Conjugation of metal complexes with peptide scaffolds possessing high DNA binding affinity has shown to modulate their biological activities and to enhance their interaction with DNA. In this work, a platinum complex/peptide chimera was synthesized based on a model of the Integration Host Factor (IHF), an architectural protein possessing sequence specific DNA binding and bending abilities through its interaction with a minor groove. The model peptide consists of a cyclic unit resembling the minor grove binding subdomain of IHF, a positively charged lysine dendrimer for electrostatic interactions with the DNA phosphate backbone and a flexible glycine linker tethering the two units. A norvaline derived artificial amino acid was designed to contain a dimethylethylenediamine as a bidentate platinum chelating unit, and introduced into the IHF mimicking peptides. The interaction of the chimeric peptides with various DNA sequences was studied by utilizing the following experiments: thermal melting studies, agarose gel electrophoresis for plasmid DNA unwinding experiments, and native and denaturing gel electrophoresis to visualize non-covalent and covalent peptide-DNA adducts, respectively. By incorporation of the platinum metal center within the model peptide mimicking IHF we have attempted to improve its specificity and DNA targeting ability, particularly towards those sequences containing adjacent guanine residues.
From amino acid sequence to bioactivity: The biomedical potential of antitumor peptides

PubMed Central

Blanco‐Míguez, Aitor; Gutiérrez‐Jácome, Alberto; Pérez‐Pérez, Martín; Pérez‐Rodríguez, Gael; Catalán‐García, Sandra; Fdez‐Riverola, Florentino; Lourenço, Anália

2016-01-01

Abstract Chemoprevention is the use of natural and/or synthetic substances to block, reverse, or retard the process of carcinogenesis. In this field, the use of antitumor peptides is of interest as, (i) these molecules are small in size, (ii) they show good cell diffusion and permeability, (iii) they affect one or more specific molecular pathways involved in carcinogenesis, and (iv) they are not usually genotoxic. We have checked the Web of Science Database (23/11/2015) in order to collect papers reporting on bioactive peptide (1691 registers), which was further filtered searching terms such as “antiproliferative,” “antitumoral,” or “apoptosis” among others. Works reporting the amino acid sequence of an antiproliferative peptide were kept (60 registers), and this was complemented with the peptides included in CancerPPD, an extensive resource for antiproliferative peptides and proteins. Peptides were grouped according to one of the following mechanism of action: inhibition of cell migration, inhibition of tumor angiogenesis, antioxidative mechanisms, inhibition of gene transcription/cell proliferation, induction of apoptosis, disorganization of tubulin structure, cytotoxicity, or unknown mechanisms. The main mechanisms of action of those antiproliferative peptides with known amino acid sequences are presented and finally, their potential clinical usefulness and future challenges on their application is discussed. PMID:27010507
Mechanical characteristics of beta sheet-forming peptide hydrogels are dependent on peptide sequence, concentration and buffer composition

PubMed Central

Müller, Michael; König, Finja; Meyer, Nina; Gattlen, Jasmin; Pieles, Uwe; Peters, Kirsten; Kreikemeyer, Bernd; Mathes, Stephanie; Saxer, Sina

2018-01-01

Self-assembling peptide hydrogels can be modified regarding their biodegradability, their chemical and mechanical properties and their nanofibrillar structure. Thus, self-assembling peptide hydrogels might be suitable scaffolds for regenerative therapies and tissue engineering. Owing to the use of various peptide concentrations and buffer compositions, the self-assembling peptide hydrogels might be influenced regarding their mechanical characteristics. Therefore, the mechanical properties and stability of a set of self-assembling peptide hydrogels, consisting of 11 amino acids, made from four beta sheet self-assembling peptides in various peptide concentrations and buffer compositions were studied. The formed self-assembling peptide hydrogels exhibited stiffnesses ranging from 0.6 to 205 kPa. The hydrogel stiffness was mostly affected by peptide sequence followed by peptide concentration and buffer composition. All self-assembling peptide hydrogels examined provided a nanofibrillar network formation. A maximum self-assembling peptide hydrogel dissolution of 20% was observed for different buffer solutions after 7 days. The stability regarding enzymatic and bacterial digestion showed less degradation in comparison to the self-assembling peptide hydrogel dissolution rate in buffer. The tested set of self-assembling peptide hydrogels were able to form stable scaffolds and provided a broad spectrum of tissue-specific stiffnesses that are suitable for a regenerative therapy. PMID:29657766
Proglucagons in vertebrates: Expression and processing of multiple genes in a bony fish.

PubMed

Busby, Ellen R; Mommsen, Thomas P

2016-09-01

In contrast to mammals, where a single proglucagon (PG) gene encodes three peptides: glucagon, glucagon-like peptide 1 and glucagon-like peptide 2 (GLP-1; GLP-2), many non-mammalian vertebrates carry multiple PG genes. Here, we investigate proglucagon mRNA sequences, their tissue expression and processing in a diploid bony fish. Copper rockfish (Sebastes caurinus) express two independent genes coding for distinct proglucagon sequences (PG I, PG II), with PG II lacking the GLP-2 sequence. These genes are differentially transcribed in the endocrine pancreas, the brain, and the gastrointestinal tract. Alternative splicing identified in rockfish is only one part of this complex regulation of the PG transcripts: the system has the potential to produce two glucagons, four GLP-1s and a single GLP-2, or any combination of these peptides. Mass spectrometric analysis of partially purified PG-derived peptides in endocrine pancreas confirms translation of both PG transcripts and differential processing of the resulting peptides. The complex differential regulation of the two PG genes and their continued presence in this extant teleostean fish strongly suggests unique and, as yet largely unidentified, roles for the peptide products encoded in each gene. Copyright © 2016 Elsevier Inc. All rights reserved.
Selection of staphylococcal enterotoxin B (SEB)-binding peptide using phage display technology

DOE Office of Scientific and Technical Information (OSTI.GOV)

Soykut, Esra Acar; Dudak, Fahriye Ceyda; Boyaci, Ismail Hakki

In this study, peptides were selected to recognize staphylococcal enterotoxin B (SEB) which cause food intoxication and can be used as a biological war agent. By using commercial M13 phage library, single plaque isolation of 38 phages was done and binding affinities were investigated with phage-ELISA. The specificities of the selected phage clones showing high affinity to SEB were checked by using different protein molecules which can be found in food samples. Furthermore, the affinities of three selected phage clones were determined by using surface plasmon resonance (SPR) sensors. Sequence analysis was realized for three peptides showing high binding affinitymore » to SEB and WWRPLTPESPPA, MNLHDYHRLFWY, and QHPQINQTLYRM amino acid sequences were obtained. The peptide sequence with highest affinity to SEB was synthesized with solid phase peptide synthesis technique and thermodynamic constants of the peptide-SEB interaction were determined by using isothermal titration calorimetry (ITC) and compared with those of antibody-SEB interaction. The binding constant of the peptide was determined as 4.2 {+-} 0.7 x 10{sup 5} M{sup -1} which indicates a strong binding close to that of antibody.« less
Conformational dynamics of a short antigenic peptide in its free and antibody bound forms gives insight into the role of β-turns in peptide immunogenicity.

PubMed

Shukla, Rashmi Tambe; Sasidhar, Yellamraju U

2015-07-01

Earlier immunological experiments with a synthetic 36-residue peptide (75-110) from Influenza hemagglutinin have been shown to elicit anti-peptide antibodies (Ab) which could cross-react with the parent protein. In this article, we have studied the conformational features of a short antigenic (Ag) peptide ((98)YPYDVPDYASLRS(110)) from Influenza hemagglutinin in its free and antibody (Ab) bound forms with molecular dynamics simulations using GROMACS package and OPLS-AA/L all-atom force field at two different temperatures (293 K and 310 K). Multiple simulations for the free Ag peptide show sampling of ordered conformations and suggest different conformational preferences of the peptide at the two temperatures. The free Ag samples a conformation crucial for Ab binding (β-turn formed by "DYAS" sequence) with greater preference at 310 K while, it samples a native-like conformation with relatively greater propensity at 293 K. The sequence "DYAS" samples β-turn conformation with greater propensity at 310 K as part of the hemagglutinin protein also. The bound Ag too samples the β-turn involving "DYAS" sequence and in addition it also samples a β-turn formed by the sequence "YPYD" at its N-terminus, which seems to be induced upon binding to the Ab. Further, the bound Ag displays conformational flexibility at both 293 K and 310 K, particularly at terminal residues. The implications of these results for peptide immunogenicity and Ag-Ab recognition are discussed. © 2015 Wiley Periodicals, Inc.
Identification of peptide sequences that target to the brain using in vivo phage display.

PubMed

Li, Jingwei; Zhang, Qizhi; Pang, Zhiqing; Wang, Yuchen; Liu, Qingfeng; Guo, Liangran; Jiang, Xinguo

2012-06-01

Phage display technology could provide a rapid means for the discovery of novel peptides. To find peptide ligands specific for the brain vascular receptors, we performed a modified phage display method. Phages were recovered from mice brain parenchyma after administrated with a random 7-mer peptide library intravenously. A longer circulation time was arranged according to the biodistributive brain/blood ratios of phage particles. Following sequential rounds of isolation, a number of phages were sequenced and a peptide sequence (CTSTSAPYC, denoted as PepC7) was identified. Clone 7-1, which encodes PepC7, exhibited translocation efficiency about 41-fold higher than the random library phage. Immunofluorescence analysis revealed that Clone 7-1 had a significant superiority on transport efficiency into the brain compared with native M13 phage. Clone 7-1 was inhibited from homing to the brain in a dose-dependent fashion when cyclic peptides of the same sequence were present in a competition assay. Interestingly, the linear peptide (ATSTSAPYA, Pep7) and a scrambled control peptide PepSC7 (CSPATSYTC) did not compete with the phage at the same tested concentration (0.2-200 pg). Labeled by Cy5.5, PepC7 exhibited significant brain-targeting capability in in vivo optical imaging analysis. The cyclic conformation of PepC7 formed by disulfide bond, and the correct structure itself play a critical role in maintaining the selectivity and affinity for the brain. In conclusion, PepC7 is a promising brain-target motif never been reported before and it could be applied to targeted drug delivery into the brain.
Antimicrobial activities of amphiphilic peptides covalently bonded to a water-insoluble resin.

PubMed Central

Haynie, S L; Crum, G A; Doele, B A

1995-01-01

A series of polymer-bound antimicrobial peptides was prepared, and the peptides were tested for their antimicrobial activities. The immobilized peptides were prepared by a strategy that used solid-phase peptide synthesis that linked the carboxy-terminal amino acid with an ethylenediamine-modified polyamide resin (PepsynK). The acid-stable, permanent amide bond between the support and the nascent peptide renders the peptide resistant to cleavage from the support during the final acid-catalyzed deprotection step in the synthesis. Select immobilized peptides containing amino acid sequences that ranged from the naturally occurring magainin to simpler synthetic sequences with idealized secondary structures were excellent antimicrobial agents against several organisms. The immobilized peptides typically reduced the number of viable cells by > or = 5 log units. We show that the reduction in cell numbers cannot be explained by the action of a soluble component. We observed no leached or hydrolyzed peptide from the resin, nor did we observe any antimicrobial activity in soluble extracts from the immobilized peptide. The immobilized peptides were washed and reused for repeated microbial contact and killing. These results suggest that the surface actions by magainins and structurally related antimicrobial peptides are sufficient for their lethal activities. PMID:7726486
Polymeric peptide pigments with sequence-encoded properties

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lampel, Ayala; McPhee, Scott A.; Park, Hang-Ah

Melanins are a family of heterogeneous polymeric pigments that provide ultraviolet (UV) light protection, structural support, coloration, and free radical scavenging. Formed by oxidative oligomerization of catecholic small molecules, the physical properties of melanins are influenced by covalent and noncovalent disorder. We report the use of tyrosine-containing tripeptides as tunable precursors for polymeric pigments. In these structures, phenols are presented in a (supra-)molecular context dictated by the positions of the amino acids in the peptide sequence. Oxidative polymerization can be tuned in a sequence-dependent manner, resulting in peptide sequence–encoded properties such as UV absorbance, morphology, coloration, and electrochemical properties overmore » a considerable range. Short peptides have low barriers to application and can be easily scaled, suggesting near-term applications in cosmetics and biomedicine.« less
Transmembrane insertion of twin-arginine signal peptides is driven by TatC and regulated by TatB

PubMed Central

Fröbel, Julia; Rose, Patrick; Lausberg, Frank; Blümmel, Anne-Sophie; Freudl, Roland; Müller, Matthias

2012-01-01

The twin-arginine translocation (Tat) pathway of bacteria and plant chloroplasts mediates the transmembrane transport of folded proteins, which harbour signal sequences with a conserved twin-arginine motif. Many Tat translocases comprise the three membrane proteins TatA, TatB and TatC. TatC was previously shown to be involved in recognizing twin-arginine signal peptides. Here we show that beyond recognition, TatC mediates the transmembrane insertion of a twin-arginine signal sequence, thereby translocating the signal sequence cleavage site across the bilayer. In the absence of TatB, this can lead to the removal of the signal sequence even from a translocation-incompetent substrate. Hence interaction of twin-arginine signal peptides with TatB counteracts their premature cleavage uncoupled from translocation. This capacity of TatB is not shared by the homologous TatA protein. Collectively our results suggest that TatC is an insertase for twin-arginine signal peptides and that translocation-proficient signal sequence recognition requires the concerted action of TatC and TatB. PMID:23250441

Transmembrane insertion of twin-arginine signal peptides is driven by TatC and regulated by TatB.

PubMed

Fröbel, Julia; Rose, Patrick; Lausberg, Frank; Blümmel, Anne-Sophie; Freudl, Roland; Müller, Matthias

2012-01-01

The twin-arginine translocation (Tat) pathway of bacteria and plant chloroplasts mediates the transmembrane transport of folded proteins, which harbour signal sequences with a conserved twin-arginine motif. Many Tat translocases comprise the three membrane proteins TatA, TatB and TatC. TatC was previously shown to be involved in recognizing twin-arginine signal peptides. Here we show that beyond recognition, TatC mediates the transmembrane insertion of a twin-arginine signal sequence, thereby translocating the signal sequence cleavage site across the bilayer. In the absence of TatB, this can lead to the removal of the signal sequence even from a translocation-incompetent substrate. Hence interaction of twin-arginine signal peptides with TatB counteracts their premature cleavage uncoupled from translocation. This capacity of TatB is not shared by the homologous TatA protein. Collectively our results suggest that TatC is an insertase for twin-arginine signal peptides and that translocation-proficient signal sequence recognition requires the concerted action of TatC and TatB.
Signature Peptide-Enabled Metagenomics (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

ScienceCinema

McMahon, Ben

2018-01-11

Ben McMahon of Los Alamos National Laboratory (LANL) presents "Signature Peptide-Enabled Metagenomics" at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.
Signature Peptide-Enabled Metagenomics (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

DOE Office of Scientific and Technical Information (OSTI.GOV)

McMahon, Ben

2012-06-01

Ben McMahon of Los Alamos National Laboratory (LANL) presents "Signature Peptide-Enabled Metagenomics" at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.
Sequence-Dependent Structure/Function Relationships of Catalytic Peptide-Enabled Gold Nanoparticles Generated under Ambient Synthetic Conditions

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bedford, Nicholas M.; Hughes, Zak E.; Tang, Zhenghua

Peptide-enabled nanoparticle (NP) synthesis routes can create and/or assemble functional nanomaterials under environmentally friendly conditions, with properties dictated by complex interactions at the biotic/abiotic interface. Manipulation of this interface through sequence modification can provide the capability for material properties to be tailored to create enhanced materials for energy, catalysis, and sensing applications. Fully realizing the potential of these materials requires a comprehensive understanding of sequence-dependent structure/function relationships that is presently lacking. In this work, the atomic-scale structures of a series of peptide-capped Au NPs are determined using a combination of atomic pair distribution function analysis of high-energy X-ray diffraction datamore » and advanced molecular dynamics (MD) simulations. The Au NPs produced with different peptide sequences exhibit varying degrees of catalytic activity for the exemplar reaction 4-nitrophenol reduction. The experimentally derived atomic-scale NP configurations reveal sequence-dependent differences in structural order at the NP surface. Replica exchange with solute-tempering MD simulations are then used to predict the morphology of the peptide overlayer on these Au NPs and identify factors determining the structure/catalytic properties relationship. We show that the amount of exposed Au surface, the underlying surface structural disorder, and the interaction strength of the peptide with the Au surface all influence catalytic performance. A simplified computational prediction of catalytic performance is developed that can potentially serve as a screening tool for future studies. Our approach provides a platform for broadening the analysis of catalytic peptide-enabled metallic NP systems, potentially allowing for the development of rational design rules for property enhancement.« less
Novel proline-hydroxyproline glycopeptides from the dandelion (Taraxacum officinale Wigg.) flowers: de novo sequencing and biological activity.

PubMed

Astafieva, Alexandra A; Enyenihi, Atim A; Rogozhin, Eugene A; Kozlov, Sergey A; Grishin, Eugene V; Odintsova, Tatyana I; Zubarev, Roman A; Egorov, Tsezi A

2015-09-01

Two novel homologous peptides named ToHyp1 and ToHyp2 that show no similarity to any known proteins were isolated from Taraxacum officinale Wigg. flowers by multidimensional liquid chromatography. Amino acid and mass spectrometry analyses demonstrated that the peptides have unusual structure: they are cysteine-free, proline-hydroxyproline-rich and post-translationally glycosylated by pentoses, with 5 carbohydrates in ToHyp2 and 10 in ToHyp1. The ToHyp2 peptide with a monoisotopic molecular mass of 4350.3Da was completely sequenced by a combination of Edman degradation and de novo sequencing via top down multistage collision induced dissociation (CID) and higher energy dissociation (HCD) tandem mass spectrometry (MS(n)). ToHyp2 consists of 35 amino acids, contains eighteen proline residues, of which 8 prolines are hydroxylated. The peptide displays antifungal activity and inhibits growth of Gram-positive and Gram-negative bacteria. We further showed that carbohydrate moieties have no significant impact on the peptide structure, but are important for antifungal activity although not absolutely necessary. The deglycosylated ToHyp2 peptide was less active against the susceptible fungus Bipolaris sorokiniana than the native peptide. Unique structural features of the ToHyp2 peptide place it into a new family of plant defense peptides. The discovery of ToHyp peptides in T. officinale flowers expands the repertoire of molecules of plant origin with practical applications. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Engineering Amyloid-Like Assemblies from Unstructured Peptides via Site-Specific Lipid Conjugation

PubMed Central

López Deber, María Pilar; Hickman, David T.; Nand, Deepak; Baldus, Marc; Pfeifer, Andrea; Muhs, Andreas

2014-01-01

Aggregation of amyloid beta (Aβ) into oligomers and fibrils is believed to play an important role in the development of Alzheimer’s disease (AD). To gain further insight into the principles of aggregation, we have investigated the induction of β-sheet secondary conformation from disordered native peptide sequences through lipidation, in 1–2% hexafluoroisopropanol (HFIP) in phosphate buffered saline (PBS). Several parameters, such as type and number of lipid chains, peptide sequence, peptide length and net charge, were explored keeping the ratio peptide/HFIP constant. The resulting lipoconjugates were characterized by several physico-chemical techniques: Circular Dichroism (CD), Attenuated Total Reflection InfraRed (ATR-IR), Thioflavin T (ThT) fluorescence, Dynamic Light Scattering (DLS), solid-state Nuclear Magnetic Resonance (ssNMR) spectroscopy and Electron Microscopy (EM). Our data demonstrate the generation of β-sheet aggregates from numerous unstructured peptides under physiological pH, independent of the amino acid sequence. The amphiphilicity pattern and hydrophobicity of the scaffold were found to be key factors for their assembly into amyloid-like structures. PMID:25207975
Purification and characterisation of a new hypothalamic satiety peptide, cocaine and amphetamine regulated transcript (CART), produced in yeast.

PubMed

Thim, L; Nielsen, P F; Judge, M E; Andersen, A S; Diers, I; Egel-Mitani, M; Hastrup, S

1998-05-29

Cocaine and amphetamine regulated transcript (CART) is a newly discovered hypothalamic peptide with a potent appetite suppressing activity following intracerebroventricular administration. When the mature rat CART sequence encoding CART(1-102) was inserted in the yeast expression plasmid three CART peptides could be purified from the fermentation broth reflecting processing at dibasic sequences. None of these corresponded to the naturally occurring CART(55-102). In order to obtain CART(55-102) the precursor Glu-Glu-Ile-Asp-CART(55-102) has been produced and CART(55-102) was generated by digestion of the precursor with dipeptidylaminopeptidase-1. All four generated CART peptides have been characterised by N-terminal amino acid sequencing and mass spectrometry. The CART peptides contain six cysteine residues and using the yeast expressed CART(62-102) the disulphide bond configuration was found to be I-III, II-V and IV-VI. When the four CART peptides were intracerebroventricularly injected in fasted mice (0.1 to 2.0 microg) they all produced a dose dependent inhibition of food intake.
2-Aminobenzamide and 2-Aminobenzoic Acid as New MALDI Matrices Inducing Radical Mediated In-Source Decay of Peptides and Proteins

NASA Astrophysics Data System (ADS)

Smargiasso, Nicolas; Quinton, Loic; de Pauw, Edwin

2012-03-01

One of the mechanisms leading to MALDI in-source decay (MALDI ISD) is the transfer of hydrogen radicals to analytes upon laser irradiation. Analytes such as peptides or proteins may undergo ISD and this method can therefore be exploited for top-down sequencing. When performed on peptides, radical-induced ISD results in production of c- and z-ions, as also found in ETD and ECD activation. Here, we describe two new compounds which, when used as MALDI matrices, are able to efficiently induce ISD of peptides and proteins: 2-aminobenzamide and 2-aminobenzoic acid. In-source reduction of the disulfide bridge containing peptide Calcitonin further confirmed the radicalar mechanism of the ISD process. ISD of peptides led, in addition to c- and z-ions, to the generation of a-, x-, and y-ions both in positive and in negative ion modes. Finally, good sequence coverage was obtained for the sequencing of myoglobin (17 kDa protein), confirming the effectiveness of both 2-aminobenzamide and 2-aminobenzoic acid as MALDI ISD matrices.
2-Aminobenzamide and 2-aminobenzoic acid as new MALDI matrices inducing radical mediated in-source decay of peptides and proteins.

PubMed

Smargiasso, Nicolas; Quinton, Loic; De Pauw, Edwin

2012-03-01

One of the mechanisms leading to MALDI in-source decay (MALDI ISD) is the transfer of hydrogen radicals to analytes upon laser irradiation. Analytes such as peptides or proteins may undergo ISD and this method can therefore be exploited for top-down sequencing. When performed on peptides, radical-induced ISD results in production of c- and z-ions, as also found in ETD and ECD activation. Here, we describe two new compounds which, when used as MALDI matrices, are able to efficiently induce ISD of peptides and proteins: 2-aminobenzamide and 2-aminobenzoic acid. In-source reduction of the disulfide bridge containing peptide Calcitonin further confirmed the radicalar mechanism of the ISD process. ISD of peptides led, in addition to c- and z-ions, to the generation of a-, x-, and y-ions both in positive and in negative ion modes. Finally, good sequence coverage was obtained for the sequencing of myoglobin (17 kDa protein), confirming the effectiveness of both 2-aminobenzamide and 2-aminobenzoic acid as MALDI ISD matrices.
Evaluation of peptides release using a natural rubber latex biomembrane as a carrier.

PubMed

Miranda, M C R; Borges, F A; Barros, N R; Santos Filho, N A; Mendonça, R J; Herculano, R D; Cilli, E M

2018-05-01

The biomembrane natural (NRL-Natural Rubber Latex), manipulated from the latex obtained from the rubber tree Hevea brasiliensis, has shown great potential for application in biomedicine and biomaterials. Reflecting the biocompatibility and low bounce rate of this material, NRL has been used as a physical barrier to infectious agents and for the controlled release of drugs and extracts. The aim of the present study was to evaluate the incorporation and release of peptides using a latex biomembrane carrier. After incorporation, the release of material from the membrane was observed using spectrophotometry. Analyses using HPLC and mass spectroscopy did not confirm the release of the antimicrobial peptide [W 6 ]Hylin a1 after 24 h. In addition, analysis of the release solution showed new compounds, indicating the degradation of the peptide by enzymes contained in the latex. Additionally, the release of a peptide with a shorter sequence (Ac-WAAAA) was evaluated, and degradation was not observed. These results showed that the use of NRL as solid matrices as delivery systems of peptide are sequence dependent and could to be evaluated for each sequence.
Engineering RNA phage MS2 virus-like particles for peptide display

NASA Astrophysics Data System (ADS)

Jordan, Sheldon Keith

Phage display is a powerful and versatile technology that enables the selection of novel binding functions from large populations of randomly generated peptide sequences. Random sequences are genetically fused to a viral structural protein to produce complex peptide libraries. From a sufficiently complex library, phage bearing peptides with practically any desired binding activity can be physically isolated by affinity selection, and, since each particle carries in its genome the genetic information for its own replication, the selectants can be amplified by infection of bacteria. For certain applications however, existing phage display platforms have limitations. One such area is in the field of vaccine development, where the goal is to identify relevant epitopes by affinity-selection against an antibody target, and then to utilize them as immunogens to elicit a desired antibody response. Today, affinity selection is usually conducted using display on filamentous phages like M13. This technology provides an efficient means for epitope identification, but, because filamentous phages do not display peptides in the high-density, multivalent arrays the immune system prefers to recognize, they generally make poor immunogens and are typically useless as vaccines. This makes it necessary to confer immunogenicity by conjugating synthetic versions of the peptides to more immunogenic carriers. Unfortunately, when introduced into these new structural environments, the epitopes often fail to elicit relevant antibody responses. Thus, it would be advantageous to combine the epitope selection and immunogen functions into a single platform where the structural constraints present during affinity selection can be preserved during immunization. This dissertation describes efforts to develop a peptide display system based on the virus-like particles (VLPs) of bacteriophage MS2. Phage display technologies rely on (1) the identification of a site in a viral structural protein that is present on the surface of the virus particle and can accept foreign sequence insertions without disruption of protein folding and viral particle assembly, and (2) on the encapsidation of nucleic acid sequences encoding both the VLP and the peptide it displays. The experiments described here are aimed at satisfying the first of these two requirements by engineering efficient peptide display at two different sites in MS2 coat protein. First, we evaluated the suitability of the N-terminus of MS2 coat for peptide insertions. It was observed that random N-terminal 10-mer fusions generally disrupted protein folding and VLP assembly, but by bracketing the foreign sequences with certain specific dipeptides, these defects could be suppressed. Next, the suitability of a coat protein surface loop for foreign sequence insertion was tested. Specifically, random sequence peptides were inserted into the N-terminal-most AB-loop of a coat protein single-chain dimer. Again we found that efficient display required the presence of appropriate dipeptides bracketing the peptide insertion. Finally, it was shown that an N-terminal fusion that tended to interfere specifically with capsid assembly could be efficiently incorporated into mosaic particles when co-expressed with wild-type coat protein.
Concepts and applications of "natural computing" techniques in de novo drug and peptide design.

PubMed

Hiss, Jan A; Hartenfeller, Markus; Schneider, Gisbert

2010-05-01

Evolutionary algorithms, particle swarm optimization, and ant colony optimization have emerged as robust optimization methods for molecular modeling and peptide design. Such algorithms mimic combinatorial molecule assembly by using molecular fragments as building-blocks for compound construction, and relying on adaptation and emergence of desired pharmacological properties in a population of virtual molecules. Nature-inspired algorithms might be particularly suited for bioisosteric replacement or scaffold-hopping from complex natural products to synthetically more easily accessible compounds that are amenable to optimization by medicinal chemistry. The theory and applications of selected nature-inspired algorithms for drug design are reviewed, together with practical applications and a discussion of their advantages and limitations.
Purification and sequencing of the active site tryptic peptide from penicillin-binding protein 1b of Escherichia coli

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nicholas, R.A.; Suzuki, H.; Hirota, Y.

This paper reports the sequence of the active site peptide of penicillin-binding protein 1b from Escherichia coli. Purified penicillin-binding protein 1b was labeled with (/sup 14/C)penicillin G, digested with trypsin, and partially purified by gel filtration. Upon further purification by high-pressure liquid chromatography, two radioactive peaks were observed, and the major peak, representing over 75% of the applied radioactivity, was submitted to amino acid analysis and sequencing. The sequence Ser-Ile-Gly-Ser-Leu-Ala-Lys was obtained. The active site nucleophile was identified by digesting the purified peptide with aminopeptidase M and separating the radioactive products on high-pressure liquid chromatography. Amino acid analysis confirmed thatmore » the serine residue in the middle of the sequence was covalently bonded to the (/sup 14/C)penicilloyl moiety. A comparison of this sequence to active site sequences of other penicillin-binding proteins and beta-lactamases is presented.« less
Lewis Y Antigen as a Target for Breast Cancer Therapy

DTIC Science & Technology

1996-09-01

have shown that a synthetic peptide can mimic the capsular polysaccharide of N. meningitis serogroup C (MCP) in that it induces an anti-MCP immune...intervening residue. All these sequences resemble the peptide we have identified as a mimic of the group C meningococcal polysaccharide . The immunological...Group C Polysaccharide ct(2-9)sialic acid The sequence similarities among the putative motifs suggest that antibodies raised to this peptide set might
Viral morphogenesis is the dominant source of sequence censorship in M13 combinatorial peptide phage display.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rodi, D. J.; Soares, A. S.; Makowski, L.

Novel statistical methods have been developed and used to quantitate and annotate the sequence diversity within combinatorial peptide libraries on the basis of small numbers (1-200) of sequences selected at random from commercially available M13 p3-based phage display libraries. These libraries behave statistically as though they correspond to populations containing roughly 4.0{+-}1.6% of the random dodecapeptides and 7.9{+-}2.6% of the random constrained heptapeptides that are theoretically possible within the phage populations. Analysis of amino acid residue occurrence patterns shows no demonstrable influence on sequence censorship by Escherichia coli tRNA isoacceptor profiles or either overall codon or Class II codon usagemore » patterns, suggesting no metabolic constraints on recombinant p3 synthesis. There is an overall depression in the occurrence of cysteine, arginine and glycine residues and an overabundance of proline, threonine and histidine residues. The majority of position-dependent amino acid sequence bias is clustered at three positions within the inserted peptides of the dodecapeptide library, +1, +3 and +12 downstream from the signal peptidase cleavage site. Conformational tendency measures of the peptides indicate a significant preference for inserts favoring a {beta}-turn conformation. The observed protein sequence limitations can primarily be attributed to genetic codon degeneracy and signal peptidase cleavage preferences. These data suggest that for applications in which maximal sequence diversity is essential, such as epitope mapping or novel receptor identification, combinatorial peptide libraries should be constructed using codon-corrected trinucleotide cassettes within vector-host systems designed to minimize morphogenesis-related censorship.« less
Deriving Heterospecific Self-Assembling Protein-Protein Interactions Using a Computational Interactome Screen.

PubMed

Crooks, Richard O; Baxter, Daniel; Panek, Anna S; Lubben, Anneke T; Mason, Jody M

2016-01-29

Interactions between naturally occurring proteins are highly specific, with protein-network imbalances associated with numerous diseases. For designed protein-protein interactions (PPIs), required specificity can be notoriously difficult to engineer. To accelerate this process, we have derived peptides that form heterospecific PPIs when combined. This is achieved using software that generates large virtual libraries of peptide sequences and searches within the resulting interactome for preferentially interacting peptides. To demonstrate feasibility, we have (i) generated 1536 peptide sequences based on the parallel dimeric coiled-coil motif and varied residues known to be important for stability and specificity, (ii) screened the 1,180,416 member interactome for predicted Tm values and (iii) used predicted Tm cutoff points to isolate eight peptides that form four heterospecific PPIs when combined. This required that all 32 hypothetical off-target interactions within the eight-peptide interactome be disfavoured and that the four desired interactions pair correctly. Lastly, we have verified the approach by characterising all 36 pairs within the interactome. In analysing the output, we hypothesised that several sequences are capable of adopting antiparallel orientations. We subsequently improved the software by removing sequences where doing so led to fully complementary electrostatic pairings. Our approach can be used to derive increasingly large and therefore complex sets of heterospecific PPIs with a wide range of potential downstream applications from disease modulation to the design of biomaterials and peptides in synthetic biology. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.
Ab-initio conformational epitope structure prediction using genetic algorithm and SVM for vaccine design.

PubMed

Moghram, Basem Ameen; Nabil, Emad; Badr, Amr

2018-01-01

T-cell epitope structure identification is a significant challenging immunoinformatic problem within epitope-based vaccine design. Epitopes or antigenic peptides are a set of amino acids that bind with the Major Histocompatibility Complex (MHC) molecules. The aim of this process is presented by Antigen Presenting Cells to be inspected by T-cells. MHC-molecule-binding epitopes are responsible for triggering the immune response to antigens. The epitope's three-dimensional (3D) molecular structure (i.e., tertiary structure) reflects its proper function. Therefore, the identification of MHC class-II epitopes structure is a significant step towards epitope-based vaccine design and understanding of the immune system. In this paper, we propose a new technique using a Genetic Algorithm for Predicting the Epitope Structure (GAPES), to predict the structure of MHC class-II epitopes based on their sequence. The proposed Elitist-based genetic algorithm for predicting the epitope's tertiary structure is based on Ab-Initio Empirical Conformational Energy Program for Peptides (ECEPP) Force Field Model. The developed secondary structure prediction technique relies on Ramachandran Plot. We used two alignment algorithms: the ROSS alignment and TM-Score alignment. We applied four different alignment approaches to calculate the similarity scores of the dataset under test. We utilized the support vector machine (SVM) classifier as an evaluation of the prediction performance. The prediction accuracy and the Area Under Receiver Operating Characteristic (ROC) Curve (AUC) were calculated as measures of performance. The calculations are performed on twelve similarity-reduced datasets of the Immune Epitope Data Base (IEDB) and a large dataset of peptide-binding affinities to HLA-DRB1*0101. The results showed that GAPES was reliable and very accurate. We achieved an average prediction accuracy of 93.50% and an average AUC of 0.974 in the IEDB dataset. Also, we achieved an accuracy of 95.125% and an AUC of 0.987 on the HLA-DRB1*0101 allele of the Wang benchmark dataset. The results indicate that the proposed prediction technique "GAPES" is a promising technique that will help researchers and scientists to predict the protein structure and it will assist them in the intelligent design of new epitope-based vaccines. Copyright © 2017 Elsevier B.V. All rights reserved.
Toxin structures as evolutionary tools: Using conserved 3D folds to study the evolution of rapidly evolving peptides.

PubMed

Undheim, Eivind A B; Mobli, Mehdi; King, Glenn F

2016-06-01

Three-dimensional (3D) structures have been used to explore the evolution of proteins for decades, yet they have rarely been utilized to study the molecular evolution of peptides. Here, we highlight areas in which 3D structures can be particularly useful for studying the molecular evolution of peptide toxins. Although we focus our discussion on animal toxins, including one of the most widespread disulfide-rich peptide folds known, the inhibitor cystine knot, our conclusions should be widely applicable to studies of the evolution of disulfide-constrained peptides. We show that conserved 3D folds can be used to identify evolutionary links and test hypotheses regarding the evolutionary origin of peptides with extremely low sequence identity; construct accurate multiple sequence alignments; and better understand the evolutionary forces that drive the molecular evolution of peptides. Also watch the video abstract. © 2016 WILEY Periodicals, Inc.
Discovery of novel antimicrobial peptides with unusual cysteine motifs in dandelion Taraxacum officinale Wigg. flowers.

PubMed

Astafieva, A A; Rogozhin, E A; Odintsova, T I; Khadeeva, N V; Grishin, E V; Egorov, Ts A

2012-08-01

Three novel antimicrobial peptides designated ToAMP1, ToAMP2 and ToAMP3 were purified from Taraxacum officinale flowers. Their amino acid sequences were determined. The peptides are cationic and cysteine-rich and consist of 38, 44 and 42 amino acid residues for ToAMP1, ToAMP2 and ToAMP3, respectively. Importantly, according to cysteine motifs, the peptides are representatives of two novel previously unknown families of plant antimicrobial peptides. ToAMP1 and ToAMP2 share high sequence identity and belong to 6-Cys-containing antimicrobial peptides, while ToAMP3 is a member of a distinct 8-Cys family. The peptides were shown to display high antimicrobial activity both against fungal and bacterial pathogens, and therefore represent new promising molecules for biotechnological and medicinal applications. Crown Copyright © 2012. Published by Elsevier Inc. All rights reserved.
Immunologically active peptides capable of inducing immunization against malaria and genes encoding therefor

DOE Office of Scientific and Technical Information (OSTI.GOV)

Dame, J.B.; Williams, J.L.; McCutchan, T.F.

An antimalarial immunogenic stimulant is described comprising an immunogenic carrier and a peptide sequence of between 2 and 1000 consecutive repeats of a sequence Asn-X-Y-Pro, wherein X is Ala or Val and Y is Asn or Asp.

Fluorescence self-quenching assay for the detection of target collagen sequences using a short probe peptide.

PubMed

Nian, Linge; Hu, Yue; Fu, Caihong; Song, Chen; Wang, Jie; Xiao, Jianxi

2018-01-01

The development of novel assays to detect collagen fragments is of utmost importance for diagnostic, prognostic and therapeutic decisions in various collagen-related diseases, and one essential question is to discover probe peptides that can specifically recognize target collagen sequences. Herein we have developed the fluorescence self-quenching assay as a convenient tool to screen the capability of a series of fluorescent probe peptides of variable lengths to bind with target collagen peptides. We have revealed that the targeting ability of probe peptides is length-dependent, and have discovered a relatively short probe peptide FAM-G(POG) 8 capable to identify the target peptide. We have further demonstrated that fluorescence self-quenching assay together with this short probe peptide can be applied to specifically detect the desired collagen fragment in complex biological media. Fluorescence self-quenching assay provides a powerful new tool to discover effective peptides for the recognition of collagen biomarkers, and it may have great potential to identify probe peptides for various protein biomarkers involved in pathological conditions. Copyright © 2017 Elsevier B.V. All rights reserved.
Aggregation of peptides in the tube model with correlated sidechain orientations

NASA Astrophysics Data System (ADS)

Hung, Nguyen Ba; Hoang, Trinh Xuan

2015-06-01

The ability of proteins and peptides to aggregate and form toxic amyloid fibrils is associated with a range of diseases including BSE (or mad cow), Alzheimer's and Parkinson's Diseases. In this study, we investigate the the role of amino acid sequence in the aggregation propensity by using a modified tube model with a new procedure for hydrophobic interaction. In this model, the amino acid sidechains are not considered explicitly, but their orientations are taken into account in the formation of hydrophobic contact. Extensive Monte Carlo simulations for systems of short peptides are carried out with the use of parallel tempering technique. Our results show that the propensity to form and the structures of the aggregates strongly depend on the amino acid sequence and the number of peptides. Some sequences may not aggregate at all at a presumable physiological temperature while other can easily form fibril-like, β-sheet struture. Our study provides an insight into the principles of how the formation of amyloid can be governed by amino acid sequence.
Antipeptide antibodies that can distinguish specific subunit polypeptides of glutamine synthetase from bean (Phaseolus vulgaris L.)

NASA Technical Reports Server (NTRS)

Cai, X.; Henry, R. L.; Takemoto, L. J.; Guikema, J. A.; Wong, P. P.; Spooner, B. S. (Principal Investigator)

1992-01-01

The amino acid sequences of the beta and gamma subunit polypeptides of glutamine synthetase from bean (Phaseolus vulgaris L.) root nodules are very similar. However, there are small regions within the sequences that are significantly different between the two polypeptides. The sequences between amino acids 2 and 9 and between 264 and 274 are examples. Three peptides (gamma 2-9, gamma 264-274, and beta 264-274) corresponding to these sequences were synthesized. Antibodies against these peptides were raised in rabbits and purified with corresponding peptide-Sepharose affinity chromatography. Western blot analysis of polyacrylamide gel electrophoresis of bean nodule proteins demonstrated that the anti-beta 264-274 antibodies reacted specifically with the beta polypeptide and the anti-gamma 264-274 and anti-gamma 2-9 antibodies reacted specifically with the gamma polypeptide of the native and denatured glutamine synthetase. These results showed the feasibility of using synthetic peptides in developing antibodies that are capable of distinguishing proteins with similar primary structures.
Hepatitis C Virus Antigenic Convergence

PubMed Central

Campo, David S.; Dimitrova, Zoya; Yokosawa, Jonny; Hoang, Duc; Perez, Nestor O.; Ramachandran, Sumathi; Khudyakov, Yury

2012-01-01

Vaccine development against hepatitis C virus (HCV) is hindered by poor understanding of factors defining cross-immunoreactivity among heterogeneous epitopes. Using synthetic peptides and mouse immunization as a model, we conducted a quantitative analysis of cross-immunoreactivity among variants of the HCV hypervariable region 1 (HVR1). Analysis of 26,883 immunological reactions among pairs of peptides showed that the distribution of cross-immunoreactivity among HVR1 variants was skewed, with antibodies against a few variants reacting with all tested peptides. The HVR1 cross-immunoreactivity was accurately modeled based on amino acid sequence alone. The tested peptides were mapped in the HVR1 sequence space, which was visualized as a network of 11,319 sequences. The HVR1 variants with a greater network centrality showed a broader cross-immunoreactivity. The entire sequence space is explored by each HCV genotype and subtype. These findings indicate that HVR1 antigenic diversity is extensively convergent and effectively limited, suggesting significant implications for vaccine development. PMID:22355779
Identification of Potent ACE Inhibitory Peptides from Wild Almond Proteins.

PubMed

Mirzapour, Mozhgan; Rezaei, Karamatollah; Sentandreu, Miguel Angel

2017-10-01

In this study, the production, fractionation, purification and identification of ACE (angiotensin-I-converting enzyme) inhibitory peptides from wild almond (Amygdalus scoparia) proteins were investigated. Wild almond proteins were hydrolyzed using 5 different enzymes (pepsin, trypsin, chymotrypsin, alcalase and flavourzyme) and assayed for their ACE inhibitory activities. The degree of ACE inhibiting activity obtained after hydrolysis was found to be in the following order: alcalase > chymotrypsin > trypsin/pepsin > flavourzyme. The hydrolysates obtained from alcalase (IC 50 = 0.8 mg/mL) were fractionated by sequential ultrafiltration at 10 and 3 kDa cutoff values and the most active fraction (<3 kDa) was further separated using reversed phase high-performance liquid chromatography (RP-HPLC). Peptide sequence identifications were carried out on highly potential fractions obtained from RP-HPLC by means of liquid chromatography coupled to electrospray ionization and tandem mass spectrometry (LC-ESI-MS/MS). Sequencing of ACE inhibitory peptides present in the fraction 26 of RP-HPLC resulted in the identification of 3 peptide sequences (VVNE, VVTR, and VVGVD) not reported previously in the literature. Sequence identification of fractions 40 and 42 from RP-HPLC, which showed the highest ACE inhibitory activities (84.1% and 86.9%, respectively), resulted in the identification of more than 40 potential ACE inhibitory sequences. The results indicate that wild almond protein is a rich source of potential antihypertensive peptides and can be suggested for applications in functional foods and drinks with respect to hindrance and mitigation of hypertension after in vivo assessment. This study has shown the potential of wild almond proteins as good sources for producing ACE-inhibitory active peptides. According to this finding, peptides with higher ACE inhibitory activities could be released during the gastrointestinal digestion and contribute to the health- promoting activities of this natural protein source. © 2017 Institute of Food Technologists®.
Neuropeptidomics of the Mosquito Aedes Aegypti

DTIC Science & Technology

2010-01-01

translational processing ( pyroglutamate formation) was detected for AST-C and CAPA-PVK-2. For the first time in insects, we succeeded in the direct...hormones, trace DNA sequences generated by TIGR and the Broad Institute were first searched by TBLASTN24 using amino acid sequences of candidate peptides...previously described.1 TBLASTN searches, using the amino acid sequences of putative Ae. aegypti neuropeptide and peptide hormone orthologs identified in
Applying the Concept of Peptide Uniqueness to Anti-Polio Vaccination.

PubMed

Kanduc, Darja; Fasano, Candida; Capone, Giovanni; Pesce Delfino, Antonella; Calabrò, Michele; Polimeno, Lorenzo

2015-01-01

Although rare, adverse events may associate with anti-poliovirus vaccination thus possibly hampering global polio eradication worldwide. To design peptide-based anti-polio vaccines exempt from potential cross-reactivity risks and possibly able to reduce rare potential adverse events such as the postvaccine paralytic poliomyelitis due to the tendency of the poliovirus genome to mutate. Proteins from poliovirus type 1, strain Mahoney, were analyzed for amino acid sequence identity to the human proteome at the pentapeptide level, searching for sequences that (1) have zero percent of identity to human proteins, (2) are potentially endowed with an immunologic potential, and (3) are highly conserved among poliovirus strains. Sequence analyses produced a set of consensus epitopic peptides potentially able to generate specific anti-polio immune responses exempt from cross-reactivity with the human host. Peptide sequences unique to poliovirus proteins and conserved among polio strains might help formulate a specific and universal anti-polio vaccine able to react with multiple viral strains and exempt from the burden of possible cross-reactions with human proteins. As an additional advantage, using a peptide-based vaccine instead of current anti-polio DNA vaccines would eliminate the rare post-polio poliomyelitis cases and other disabling symptoms that may appear following vaccination.
Modification of the N-Terminus of a Calcium Carbonate Precipitating Peptide Affects Calcium Carbonate Mineralization.

PubMed

Usui, Kenji; Yokota, Shin-Ichiro; Ozaki, Makoto; Sakashita, Shungo; Imai, Takahito; Tomizaki, Kin-Ya

2018-01-01

A core sequence (the 9 C-terminal residues) of calcification-associated peptide (CAP- 1) isolated from the exoskeleton of the red swamp crayfish was previously shown to control calcium carbonate precipitation with chitin. In addition, a modified core sequence in which the phosphorylated serine at the N terminus is replaced with serine exhibits was also previously shown to alter precipitation characteristics with chitin. We focused on calcium carbonate precipitation and attempted to elucidate aspects of the mechanism underlying mineralization. We attempted to evaluate in detail the effects of modifying the N-terminus in the core sequence on calcium carbonate mineralization without chitin. The peptide modifications included phosphorylation, dephosphorylation, and a free or acetylated Nterminus. The peptides were synthesized manually on Wang resin using the DIPCI-DMAP method for the first residue, and Fmoc solid phase peptide synthesis with HBTU-HOBt for the subsequent residues. Prior to calcium carbonate precipitation, calcium carbonate was suspended in MilliQ water. Carbon dioxide gas was bubbled into the stirred suspension, then the remaining solid CaCO3 was removed by filtration. The concentration of calcium ions in the solution was determined by standard titration with ethylenediaminetetraacetate. Calcium carbonate precipitation was conducted in a micro tube for 3 h at 37°C. We used the micro-scale techniques AFM (atomic force microscopy) and TEM (transmission electron microscopy), and the macro-scale techniques chelate titration, HPLC, gel filtration, CD (circular dichroism) and DLS (dynamic light scattering). We determined the morphologies of the calcium carbonate deposits using AFM and TEM. The pS peptide provided the best control of the shape and size of the calcium carbonate round particles. The acetylated peptides (Ac-S and Ac-pS) provided bigger particles with various shapes. S peptide provided a mixture of bigger particles and amorphous particles. We verified these findings using DLS. All the peptide samples produced nanostructures of the expected size in agreement with the AFM and TEM results. We estimated the abilities of these peptides to precipitate calcium carbonate by determining the residual calcium hydrogen carbonate concentration by standard titration with ethylenediaminetetraacetate after calcium carbonate precipitation. The Ac-pS peptide showed the lowest residual calcium hydrogen carbonate concentration whereas the S peptide showed the highest, suggesting that the precipitating activities of these peptides towards calcium carbonate correlated with peptide net charge. Then the gel filtration results showed a large oligomer peak and a small oligomer/monomer peak for all peptide samples in agreement with the AFM, TEM and DLS results. CD measurements showed that all the peptides formed random-coil-like structures. Thus, we used both macro- and micro-observation techniques such as chelate titration, DLS, AFM and TEM to show that the calcium carbonate precipitating activities of four derivatives of the core sequence of CAP-1 may correlate with the peptide net charge. These peptides mainly act as a catalyst rather than as a binder or component of the calcium carbonate deposits (as a template). On the other hand, the morphologies of the calcium carbonate deposits appeared to be dependent on the ability of the peptide to assemble and act as a template. Consequently, elucidating the relationship between peptide sequence and the ability of the peptide to assemble would be indispensable for controlling precipitate morphologies in the near future. This knowledge would provide important clues for elucidating the relationship between peptide sequence and mineralization ability, including deposit morphology and precipitating activity, for use in nanobiochemistry and materials chemistry research. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Sequence-Specific Model for Peptide Retention Time Prediction in Strong Cation Exchange Chromatography.

PubMed

Gussakovsky, Daniel; Neustaeter, Haley; Spicer, Victor; Krokhin, Oleg V

2017-11-07

The development of a peptide retention prediction model for strong cation exchange (SCX) separation on a Polysulfoethyl A column is reported. Off-line 2D LC-MS/MS analysis (SCX-RPLC) of S. cerevisiae whole cell lysate was used to generate a retention dataset of ∼30 000 peptides, sufficient for identifying the major sequence-specific features of peptide retention mechanisms in SCX. In contrast to RPLC/hydrophilic interaction liquid chromatography (HILIC) separation modes, where retention is driven by hydrophobic/hydrophilic contributions of all individual residues, SCX interactions depend mainly on peptide charge (number of basic residues at acidic pH) and size. An additive model (incorporating the contributions of all 20 residues into the peptide retention) combined with a peptide length correction produces a 0.976 R 2 value prediction accuracy, significantly higher than the additive models for either HILIC or RPLC. Position-dependent effects on peptide retention for different residues were driven by the spatial orientation of tryptic peptides upon interaction with the negatively charged surface functional groups. The positively charged N-termini serve as a primary point of interaction. For example, basic residues (Arg, His, Lys) increase peptide retention when located closer to the N-terminus. We also found that hydrophobic interactions, which could lead to a mixed-mode separation mechanism, are largely suppressed at 20-30% of acetonitrile in the eluent. The accuracy of the final Sequence-Specific Retention Calculator (SSRCalc) SCX model (∼0.99 R 2 value) exceeds all previously reported predictors for peptide LC separations. This also provides a solid platform for method development in 2D LC-MS protocols in proteomics and peptide retention prediction filtering of false positive identifications.
Computational Framework for Prediction of Peptide Sequences That May Mediate Multiple Protein Interactions in Cancer-Associated Hub Proteins.

PubMed

Sarkar, Debasree; Patra, Piya; Ghosh, Abhirupa; Saha, Sudipto

2016-01-01

A considerable proportion of protein-protein interactions (PPIs) in the cell are estimated to be mediated by very short peptide segments that approximately conform to specific sequence patterns known as linear motifs (LMs), often present in the disordered regions in the eukaryotic proteins. These peptides have been found to interact with low affinity and are able bind to multiple interactors, thus playing an important role in the PPI networks involving date hubs. In this work, PPI data and de novo motif identification based method (MEME) were used to identify such peptides in three cancer-associated hub proteins-MYC, APC and MDM2. The peptides corresponding to the significant LMs identified for each hub protein were aligned, the overlapping regions across these peptides being termed as overlapping linear peptides (OLPs). These OLPs were thus predicted to be responsible for multiple PPIs of the corresponding hub proteins and a scoring system was developed to rank them. We predicted six OLPs in MYC and five OLPs in MDM2 that scored higher than OLP predictions from randomly generated protein sets. Two OLP sequences from the C-terminal of MYC were predicted to bind with FBXW7, component of an E3 ubiquitin-protein ligase complex involved in proteasomal degradation of MYC. Similarly, we identified peptides in the C-terminal of MDM2 interacting with FKBP3, which has a specific role in auto-ubiquitinylation of MDM2. The peptide sequences predicted in MYC and MDM2 look promising for designing orthosteric inhibitors against possible disease-associated PPIs. Since these OLPs can interact with other proteins as well, these inhibitors should be specific to the targeted interactor to prevent undesired side-effects. This computational framework has been designed to predict and rank the peptide regions that may mediate multiple PPIs and can be applied to other disease-associated date hub proteins for prediction of novel therapeutic targets of small molecule PPI modulators.
Impact of commercial precooking of common bean (Phaseolus vulgaris) on the generation of peptides, after pepsin-pancreatin hydrolysis, capable to inhibit dipeptidyl peptidase-IV.

PubMed

Mojica, Luis; Chen, Karen; de Mejía, Elvira González

2015-01-01

The objective of this research was to determine the bioactive properties of the released peptides from commercially available precook common beans (Phaseolus vulgaris). Bioactive properties and peptide profiles were evaluated in protein hydrolysates of raw and commercially precooked common beans. Five varieties (Black, Pinto, Red, Navy, and Great Northern) were selected for protein extraction, protein and peptide molecular mass profiles, and peptide sequences. Potential bioactivities of hydrolysates, including antioxidant capacity and inhibition of α-amylase, α-glucosidase, dipeptidyl peptidase-IV (DPP-IV), and angiotensin converting enzyme I (ACE) were analyzed after digestion with pepsin/pancreatin. Hydrolysates from Navy beans were the most potent inhibitors of DPP-IV with no statistical differences between precooked and raw (IC50 = 0.093 and 0.095 mg protein/mL, respectively). α-Amylase inhibition was higher for raw Red, Navy and Great Northern beans (36%, 31%, 27% relative to acarbose (rel ac)/mg protein, respectively). α-Glucosidase inhibition among all bean hydrolysates did not show significant differences; however, inhibition values were above 40% rel ac/mg protein. IC50 values for ACE were not significantly different among all bean hydrolysates (range 0.20 to 0.34 mg protein/mL), except for Red bean that presented higher IC50 values. Peptide molecular mass profile ranged from 500 to 3000 Da. A total of 11 and 17 biologically active peptide sequences were identified in raw and precooked beans, respectively. Peptide sequences YAGGS and YAAGS from raw Great Northern and precooked Pinto showed similar amino acid sequences and same potential ACE inhibition activity. Processing did not affect the bioactive properties of released peptides from precooked beans. Commercially precooked beans could contribute to the intake of bioactive peptides and promote health. © 2014 Institute of Food Technologists®
Peptidomic approach identifies cruzioseptins, a new family of potent antimicrobial peptides in the splendid leaf frog, Cruziohyla calcarifer.

PubMed

Proaño-Bolaños, Carolina; Zhou, Mei; Wang, Lei; Coloma, Luis A; Chen, Tianbao; Shaw, Chris

2016-09-02

Phyllomedusine frogs are an extraordinary source of biologically active peptides. At least 8 families of antimicrobial peptides have been reported in this frog clade, the dermaseptins being the most diverse. By a peptidomic approach, integrating molecular cloning, Edman degradation sequencing and tandem mass spectrometry, a new family of antimicrobial peptides has been identified in Cruziohyla calcarifer. These 15 novel antimicrobial peptides of 20-32 residues in length are named cruzioseptins. They are characterized by having a unique shared N-terminal sequence GFLD- and the sequence motifs -VALGAVSK- or -GKAAL(N/G/S) (V/A)V- in the middle of the peptide. Cruzioseptins have a broad spectrum of antimicrobial activity and low haemolytic effect. The most potent cruzioseptin was CZS-1 that had a MIC of 3.77μM against the Gram positive bacterium, Staphylococcus aureus and the yeast Candida albicans. In contrast, CZS-1 was 3-fold less potent against the Gram negative bacterium, Escherichia coli (MIC 15.11μM). CZS-1 reached 100% haemolysis at 120.87μM. Skin secretions from unexplored species such as C. calcarifer continue to demonstrate the enormous molecular diversity hidden in the amphibian skin. Some of these novel peptides may provide lead structures for the development of a new class of antibiotics and antifungals of therapeutic use. Through the combination of molecular cloning, Edman degradation sequencing, tandem mass spectrometry and MALDI-TOF MS we have identified a new family of 15 antimicrobial peptides in the skin secretion of Cruziohyla calcarifer. The novel family is named "Cruzioseptins" and contains cationic amphipathic peptides of 20-32 residues. They have a broad range of antimicrobial activity that also includes effective antifungals with low haemolytic activity. Therefore, C. calcarifer has proven to be a rich source of novel peptides, which could become leading structures for the development of novel antibiotics and antifungals of clinical application. Copyright © 2016 Elsevier B.V. All rights reserved.
Novel Peptide Sequence (“IQ-tag”) with High Affinity for NIR Fluorochromes Allows Protein and Cell Specific Labeling for In Vivo Imaging

PubMed Central

McCarthy, Jason R.; Weissleder, Ralph

2007-01-01

Background Probes that allow site-specific protein labeling have become critical tools for visualizing biological processes. Methods Here we used phage display to identify a novel peptide sequence with nanomolar affinity for near infrared (NIR) (benz)indolium fluorochromes. The developed peptide sequence (“IQ-tag”) allows detection of NIR dyes in a wide range of assays including ELISA, flow cytometry, high throughput screens, microscopy, and optical in vivo imaging. Significance The described method is expected to have broad utility in numerous applications, namely site-specific protein imaging, target identification, cell tracking, and drug development. PMID:17653285
Availability of MudPIT data for classification of biological samples.

PubMed

Silvestre, Dario Di; Zoppis, Italo; Brambilla, Francesca; Bellettato, Valeria; Mauri, Giancarlo; Mauri, Pierluigi

2013-01-14

Mass spectrometry is an important analytical tool for clinical proteomics. Primarily employed for biomarker discovery, it is increasingly used for developing methods which may help to provide unambiguous diagnosis of biological samples. In this context, we investigated the classification of phenotypes by applying support vector machine (SVM) on experimental data obtained by MudPIT approach. In particular, we compared the performance capabilities of SVM by using two independent collection of complex samples and different data-types, such as mass spectra (m/z), peptides and proteins. Globally, protein and peptide data allowed a better discriminant informative content than experimental mass spectra (overall accuracy higher than 87% in both collection 1 and 2). These results indicate that sequencing of peptides and proteins reduces the experimental noise affecting the raw mass spectra, and allows the extraction of more informative features available for the effective classification of samples. In addition, proteins and peptides features selected by SVM matched for 80% with the differentially expressed proteins identified by the MAProMa software. These findings confirm the availability of the most label-free quantitative methods based on processing of spectral count and SEQUEST-based SCORE values. On the other hand, it stresses the usefulness of MudPIT data for a correct grouping of sample phenotypes, by applying both supervised and unsupervised learning algorithms. This capacity permit the evaluation of actual samples and it is a good starting point to translate proteomic methodology to clinical application.
PRISM 3: expanded prediction of natural product chemical structures from microbial genomes.

PubMed

Skinnider, Michael A; Merwin, Nishanth J; Johnston, Chad W; Magarvey, Nathan A

2017-07-03

Microbial natural products represent a rich resource of pharmaceutically and industrially important compounds. Genome sequencing has revealed that the majority of natural products remain undiscovered, and computational methods to connect biosynthetic gene clusters to their corresponding natural products therefore have the potential to revitalize natural product discovery. Previously, we described PRediction Informatics for Secondary Metabolomes (PRISM), a combinatorial approach to chemical structure prediction for genetically encoded nonribosomal peptides and type I and II polyketides. Here, we present a ground-up rewrite of the PRISM structure prediction algorithm to derive prediction of natural products arising from non-modular biosynthetic paradigms. Within this new version, PRISM 3, natural product scaffolds are modeled as chemical graphs, permitting structure prediction for aminocoumarins, antimetabolites, bisindoles and phosphonate natural products, and building upon the addition of ribosomally synthesized and post-translationally modified peptides. Further, with the addition of cluster detection for 11 new cluster types, PRISM 3 expands to detect 22 distinct natural product cluster types. Other major modifications to PRISM include improved sequence input and ORF detection, user-friendliness and output. Distribution of PRISM 3 over a 300-core server grid improves the speed and capacity of the web application. PRISM 3 is available at http://magarveylab.ca/prism/. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Noncanonical expression of a murine cytomegalovirus early protein CD8 T-cell epitope as an immediate early epitope based on transcription from an upstream gene.

PubMed

Fink, Annette; Büttner, Julia K; Thomas, Doris; Holtappels, Rafaela; Reddehase, Matthias J; Lemmermann, Niels A W

2014-02-14

Viral CD8 T-cell epitopes, represented by viral peptides bound to major histocompatibility complex class-I (MHC-I) glycoproteins, are often identified by "reverse immunology", a strategy not requiring biochemical and structural knowledge of the actual viral protein from which they are derived by antigen processing. Instead, bioinformatic algorithms predicting the probability of C-terminal cleavage in the proteasome, as well as binding affinity to the presenting MHC-I molecules, are applied to amino acid sequences deduced from predicted open reading frames (ORFs) based on the genomic sequence. If the protein corresponding to an antigenic ORF is known, it is usually inferred that the kinetic class of the protein also defines the phase in the viral replicative cycle during which the respective antigenic peptide is presented for recognition by CD8 T cells. We have previously identified a nonapeptide from the predicted ORFm164 of murine cytomegalovirus that is presented by the MHC-I allomorph H-2 Dd and that is immunodominant in BALB/c (H-2d haplotype) mice. Surprisingly, although the ORFm164 protein gp36.5 is expressed as an Early (E) phase protein, the m164 epitope is presented already during the Immediate Early (IE) phase, based on the expression of an upstream mRNA starting within ORFm167 and encompassing ORFm164.
Mapping a nucleolar targeting sequence of an RNA binding nucleolar protein, Nop25

DOE Office of Scientific and Technical Information (OSTI.GOV)

Fujiwara, Takashi; Suzuki, Shunji; Kanno, Motoko

2006-06-10

Nop25 is a putative RNA binding nucleolar protein associated with rRNA transcription. The present study was undertaken to determine the mechanism of Nop25 localization in the nucleolus. Deletion experiments of Nop25 amino acid sequence showed Nop25 to contain a nuclear targeting sequence in the N-terminal and a nucleolar targeting sequence in the C-terminal. By expressing derivative peptides from the C-terminal as GFP-fusion proteins in the cells, a lysine and arginine residue-enriched peptide (KRKHPRRAQDSTKKPPSATRTSKTQRRRR) allowed a GFP-fusion protein to be transported and fully retained in the nucleolus. When the peptide was fused with cMyc epitope and expressed in the cells, amore » cMyc epitope was then detected in the nucleolus. Nop25 did not localize in the nucleolus by deletion of the peptide from Nop25. Furthermore, deletion of a subdomain (KRKHPRRAQ) in the peptide or amino acid substitution of lysine and arginine residues in the subdomain resulted in the loss of Nop25 nucleolar localization. These results suggest that the lysine and arginine residue-enriched peptide is the most prominent nucleolar targeting sequence of Nop25 and that the long stretch of basic residues might play an important role in the nucleolar localization of Nop25. Although Nop25 contained putative SUMOylation, phosphorylation and glycosylation sites, the amino acid substitution in these sites had no effect on the nucleolar localization, thus suggesting that these post-translational modifications did not contribute to the localization of Nop25 in the nucleolus. The treatment of the cells, which expressed a GFP-fusion protein with a nucleolar targeting sequence of Nop25, with RNase A resulted in a complete dislocation of the protein from the nucleolus. These data suggested that the nucleolar targeting sequence might therefore play an important role in the binding of Nop25 to RNA molecules and that the RNA binding of Nop25 might be essential for the nucleolar localization of Nop25.« less
Spectra library assisted de novo peptide sequencing for HCD and ETD spectra pairs.

PubMed

Yan, Yan; Zhang, Kaizhong

2016-12-23

De novo peptide sequencing via tandem mass spectrometry (MS/MS) has been developed rapidly in recent years. With the use of spectra pairs from the same peptide under different fragmentation modes, performance of de novo sequencing is greatly improved. Currently, with large amount of spectra sequenced everyday, spectra libraries containing tens of thousands of annotated experimental MS/MS spectra become available. These libraries provide information of the spectra properties, thus have the potential to be used with de novo sequencing to improve its performance. In this study, an improved de novo sequencing method assisted with spectra library is proposed. It uses spectra libraries as training datasets and introduces significant scores of the features used in our previous de novo sequencing method for HCD and ETD spectra pairs. Two pairs of HCD and ETD spectral datasets were used to test the performance of the proposed method and our previous method. The results show that this proposed method achieves better sequencing accuracy with higher ranked correct sequences and less computational time. This paper proposed an advanced de novo sequencing method for HCD and ETD spectra pair and used information from spectra libraries and significant improved previous similar methods.
Dynamic covalent chemistry enables formation of antimicrobial peptide quaternary assemblies in a completely abiotic manner

NASA Astrophysics Data System (ADS)

Reuther, James F.; Dees, Justine L.; Kolesnichenko, Igor V.; Hernandez, Erik T.; Ukraintsev, Dmitri V.; Guduru, Rusheel; Whiteley, Marvin; Anslyn, Eric V.

2018-01-01

Naturally occurring peptides and proteins often use dynamic disulfide bonds to impart defined tertiary/quaternary structures for the formation of binding pockets with uniform size and function. Although peptide synthesis and modification are well established, controlling quaternary structure formation remains a significant challenge. Here, we report the facile incorporation of aryl aldehyde and acyl hydrazide functionalities into peptide oligomers via solid-phase copper-catalysed azide-alkyne cycloaddition (SP-CuAAC) click reactions. When mixed, these complementary functional groups rapidly react in aqueous media at neutral pH to form peptide-peptide intermolecular macrocycles with highly tunable ring sizes. Moreover, sequence-specific figure-of-eight, dumbbell-shaped, zipper-like and multi-loop quaternary structures were formed selectively. Controlling the proportions of reacting peptides with mismatched numbers of complementary reactive groups results in the formation of higher-molecular-weight sequence-defined ladder polymers. This also amplified antimicrobial effectiveness in select cases. This strategy represents a general approach to the creation of complex abiotic peptide quaternary structures.
Self-assembling peptide nanofiber hydrogels in tissue engineering and regenerative medicine: Progress, design guidelines, and applications.

PubMed

Koutsopoulos, Sotirios

2016-04-01

Until the mid-1980s, mainly biologists were conducting peptide research. This changed with discoveries that opened new paths of research involving the use of peptides in bioengineering, biotechnology, biomedicine, nanotechnology, and bioelectronics. Peptide engineering and rational design of novel peptide sequences with unique and tailor-made properties further expanded the field. The discovery of short self-assembling peptides, which upon association form well-defined supramolecular architectures, created new and exciting areas of research. Depending on the amino acid sequence, the pH, and the type of the electrolyte in the medium, peptide self-assembly leads to the formation of nanofibers, which are further organized to form a hydrogel. In this review, the application of ionic complementary peptides which self-assemble to form nanofiber hydrogels for tissue engineering and regenerative medicine will be discussed through a selective presentation of the most important work performed during the last 25 years. © 2016 Wiley Periodicals, Inc.

Unusual reactivity of a silver mineralizing peptide.

PubMed

Carter, Carly Jo; Ackerson, Christopher J; Feldheim, Daniel L

2010-07-27

The ability of peptides selected via phage display to mediate the formation of inorganic nanoparticles is now well established. The atomic-level interactions between the selected peptides and the metal ion precursors are in most instances, however, largely obscure. We identified a new peptide sequence that is capable of mediating the formation of Ag nanoparticles. Surprisingly, nanoparticle formation requires the presence of peptide, HEPES buffer, and light; the absence of any one of these compromises nanoparticle formation. Electrochemical experiments revealed that the peptide binds Ag+ in a 3 Ag+:1 peptide ratio and significantly alters the Ag+ reduction potential. Alanine replacement studies yielded insight into the sequence-function relationships of Ag nanoparticle formation, including the Ag+ coordination sites and the residues necessary for Ag synthesis. In addition, the peptide was found to function when immobilized onto surfaces, and the specific immobilizing concentration could be adjusted to yield either spherical Ag nanoparticles or high aspect ratio nanowires. These studies further illustrate the range of interesting new solid-state chemistries possible using biomolecules.
Unusual Reactivity of a Silver Mineralizing Peptide

PubMed Central

Carter, Carly Jo; Ackerson, Christopher J.; Feldheim, Daniel L.

2010-01-01

The ability of peptides selected via phage display to mediate the formation of inorganic nanoparticles is now well established. The atomic-level interactions between the selected peptides and the metal ion precursors are in most instances, however, largely obscure. We identified a new peptide sequence that is capable of mediating the formation of Ag nanoparticles. Surprisingly, nanoparticle formation requires the presence of peptide, HEPES buffer, and light; the absence of any one of these compromises nanoparticle formation. Electrochemical experiments revealed that the peptide binds Ag+ in a 3 Ag+:1 peptide ratio and significantly alters the Ag+ reduction potential. Alanine replacement studies yielded insight into the sequence-function relationships of Ag nanoparticle formation, including the Ag+ coordination sites and the residues necessary for Ag synthesis. In addition, the peptide was found to function when immobilized onto surfaces, and the specific immobilizing concentration could be adjusted to yield either spherical Ag nanoparticles or high aspect ratio nanowires. These studies further illustrate the range of interesting new solid-state chemistries possible using biomolecules. PMID:20552994
TEMPO-Assisted Free Radical-Initiated Peptide Sequencing Mass Spectrometry (FRIPS MS) in Q-TOF and Orbitrap Mass Spectrometers: Single-Step Peptide Backbone Dissociations in Positive Ion Mode

NASA Astrophysics Data System (ADS)

Jang, Inae; Lee, Sun Young; Hwangbo, Song; Kang, Dukjin; Lee, Hookeun; Kim, Hugh I.; Moon, Bongjin; Oh, Han Bin

2017-01-01

The present study demonstrates that one-step peptide backbone fragmentations can be achieved using the TEMPO [2-(2,2,6,6-tetramethyl piperidine-1-oxyl)]-assisted free radical-initiated peptide sequencing (FRIPS) mass spectrometry in a hybrid quadrupole time-of-flight (Q-TOF) mass spectrometer and a Q-Exactive Orbitrap instrument in positive ion mode, in contrast to two-step peptide fragmentation in an ion-trap mass spectrometer (reference Anal. Chem. 85, 7044-7051 (30)). In the hybrid Q-TOF and Q-Exactive instruments, higher collisional energies can be applied to the target peptides, compared with the low collisional energies applied by the ion-trap instrument. The higher energy deposition and the additional multiple collisions in the collision cell in both instruments appear to result in one-step peptide backbone dissociations in positive ion mode. This new finding clearly demonstrates that the TEMPO-assisted FRIPS approach is a very useful tool in peptide mass spectrometry research.
Methods and materials for deconstruction of biomass for biofuels production

DOEpatents

Schoeniger, Joseph S; Hadi, Masood Zia

2015-05-05

The present invention relates to nucleic acids, peptides, vectors, cells, and plants useful in the production of biofuels. In certain embodiments, the invention relates to nucleic acid sequences and peptides from extremophile organisms, such as SSO1949 and Ce1A, that are useful for hydrolyzing plant cell wall materials. In further embodiments, the invention relates to modified versions of such sequences that have been optimized for production in one or both of monocot and dicot plants. In other embodiments, the invention provides for targeting peptide production or activity to a certain location within the cell or organism, such as the apoplast. In further embodiments, the invention relates to transformed cells or plants. In additional embodiments, the invention relates to methods of producing biofuel utilizing such nucleic acids, peptides, targeting sequences, vectors, cells, and/or plants.
Leptoglycin: a new Glycine/Leucine-rich antimicrobial peptide isolated from the skin secretion of the South American frog Leptodactylus pentadactylus (Leptodactylidae).

PubMed

Sousa, Juliana C; Berto, Raquel F; Gois, Elicélia A; Fontenele-Cardi, Nauíla C; Honório, José E R; Konno, Katsuhiro; Richardson, Michael; Rocha, Marcos F G; Camargo, Antônio A C M; Pimenta, Daniel C; Cardi, Bruno A; Carvalho, Krishnamurti M

2009-07-01

Antimicrobial peptides are components of innate immunity that is the first-line defense against invading pathogens for a wide range of organisms. Here, we describe the isolation, biological characterization and amino acid sequencing of a novel neutral Glycine/Leucine-rich antimicrobial peptide from skin secretion of Leptodactylus pentadactylus named leptoglycin. The amino acid sequence of the peptide purified by RP-HPLC (C(18) column) was deduced by mass spectrometric de novo sequencing and confirmed by Edman degradation: GLLGGLLGPLLGGGGGGGGGLL. Leptoglycin was able to inhibit the growth of Gram-negative bacteria Pseudomonas aeruginosa, Escherichia coli and Citrobacter freundii with minimal inhibitory concentrations (MICs) of 8 microM, 50 microM, and 75 microM respectively, but it did not show antimicrobial activity against Gram-positive bacteria (Staphylococcus aureus, Micrococcus luteus and Enterococcus faecalis), yeasts (Candida albicans and Candida tropicalis) and dermatophytes fungi (Microsporum canis and Trichophyton rubrum). No hemolytic activity was observed at the 2-200 microM range concentration. The amino acid sequence of leptoglycin with high level of glycine (59.1%) and leucine (36.4%) containing an unusual central proline suggests the existence of a new class of Gly/Leu-rich antimicrobial peptides. Taken together, these results suggest that this natural antimicrobial peptide could be a tool to develop new antibiotics.
Venom proteomic and venomous glands transcriptomic analysis of the Egyptian scorpion Scorpio maurus palmatus (Arachnida: Scorpionidae).

PubMed

Abdel-Rahman, Mohamed A; Quintero-Hernandez, Veronica; Possani, Lourival D

2013-11-01

Proteomic analysis of the scorpion venom Scorpio maurus palmatus was performed using reverse-phase HPLC separation followed by mass spectrometry determination. Sixty five components were identified with molecular masses varying from 413 to 14,009 Da. The high percentage of peptides (41.5%) was from 3 to 5 KDa which may represent linear antimicrobial peptides and KScTxs. Also, 155 expressed sequence tags (ESTs) were analyzed through construction the cDNA library prepared from a pair of venomous gland. About 77% of the ESTs correspond to toxin-like peptides and proteins with definite open reading frames. The cDNA sequencing results also show the presence of sequences whose putative products have sequence similarity with antimicrobial peptides (24%), insecticidal toxins, β-NaScTxs, κ-KScTxs, α-KScTxs, calcines and La1-like peptides. Also, we have obtained 23 atypical types of venom molecules not recorded in other scorpion species. Moreover, 9% of the total ESTs revealed significant similarities with proteins involved in the cellular processes of these scorpion venomous glands. This is the first set of molecular masses and transcripts described from this species, in which various venom molecules have been identified. They belong to either known or unassigned types of scorpion venom peptides and proteins, and provide valuable information for evolutionary analysis and venomics. Copyright © 2013 Elsevier Ltd. All rights reserved.
Molecular dynamics simulations of certain RGD-based peptides from Kistrin provide insight into the higher activity of REI-RGD34 protein at higher temperature.

PubMed

Upadhyay, Sanjay K

2014-05-01

To determine the bioactive conformation required to bind with receptor aIIbb3, the peptide sequence RIPRGDMP from Kistrin was inserted into CDR 1 loop region of REI protein, resulting in REI-RGD34. The activity of REI-RGD34 was observed to increase at higher temperature towards the receptor aIIbb3. It could be justified in either way: the modified complex forces the restricted peptide to adapt bioactive conformation or it unfolds the peptide in a way that opens its binding surface with high affinity for receptor. Here, we model the conformational preference of RGD sequence in RIPRGDMP at 25 and 42 °C using multiple MD simulations. Further, we model the peptide sequence RGD, PRGD and PRGDMP from kistrin to observe the effect of flanking residues on conformational sampling of RGD. The presence of flanking residues around RGD peptide greatly influenced the conformational sampling. A transition from bend to turn conformation was observed for RGD sequence at 42 °C. The turn conformation shows pharmacophoric parameters required to recognize the receptor aIIbb3. Thus, the temperaturedependent activity of RIPRGDMP when inserted into the loop region of REI can be explained by the presence of the turn conformation. This study will help in designing potential antagonist for the receptor aIIbb3.
Exploiting proteomic data for genome annotation and gene model validation in Aspergillus niger

PubMed Central

Wright, James C; Sugden, Deana; Francis-McIntyre, Sue; Riba-Garcia, Isabel; Gaskell, Simon J; Grigoriev, Igor V; Baker, Scott E; Beynon, Robert J; Hubbard, Simon J

2009-01-01

Background Proteomic data is a potentially rich, but arguably unexploited, data source for genome annotation. Peptide identifications from tandem mass spectrometry provide prima facie evidence for gene predictions and can discriminate over a set of candidate gene models. Here we apply this to the recently sequenced Aspergillus niger fungal genome from the Joint Genome Institutes (JGI) and another predicted protein set from another A.niger sequence. Tandem mass spectra (MS/MS) were acquired from 1d gel electrophoresis bands and searched against all available gene models using Average Peptide Scoring (APS) and reverse database searching to produce confident identifications at an acceptable false discovery rate (FDR). Results 405 identified peptide sequences were mapped to 214 different A.niger genomic loci to which 4093 predicted gene models clustered, 2872 of which contained the mapped peptides. Interestingly, 13 (6%) of these loci either had no preferred predicted gene model or the genome annotators' chosen "best" model for that genomic locus was not found to be the most parsimonious match to the identified peptides. The peptides identified also boosted confidence in predicted gene structures spanning 54 introns from different gene models. Conclusion This work highlights the potential of integrating experimental proteomics data into genomic annotation pipelines much as expressed sequence tag (EST) data has been. A comparison of the published genome from another strain of A.niger sequenced by DSM showed that a number of the gene models or proteins with proteomics evidence did not occur in both genomes, further highlighting the utility of the method. PMID:19193216
The property distance index PD predicts peptides that cross-react with IgE antibodies

PubMed Central

Ivanciuc, Ovidiu; Midoro-Horiuti, Terumi; Schein, Catherine H.; Xie, Liping; Hillman, Gilbert R.; Goldblum, Randall M.; Braun, Werner

2009-01-01

Similarities in the sequence and structure of allergens can explain clinically observed cross-reactivities. Distinguishing sequences that bind IgE in patient sera can be used to identify potentially allergenic protein sequences and aid in the design of hypo-allergenic proteins. The property distance index PD, incorporated in our Structural Database of Allergenic Proteins (SDAP, http://fermi.utmb.edu/SDAP/), may identify potentially cross-reactive segments of proteins, based on their similarity to known IgE epitopes. We sought to obtain experimental validation of the PD index as a quantitative predictor of IgE cross-reactivity, by designing peptide variants with predetermined PD scores relative to three linear IgE epitopes of Jun a 1, the dominant allergen from mountain cedar pollen. For each of the three epitopes, 60 peptides were designed with increasing PD values (decreasing physicochemical similarity) to the starting sequence. The peptides synthesized on a derivatized cellulose membrane were probed with sera from patients who were allergic to Jun a 1, and the experimental data were interpreted with a PD classification method. Peptides with low PD values relative to a given epitope were more likely to bind IgE from the sera than were those with PD values larger than 6. Control sequences, with PD values between 18 and 20 to all the three epitopes, did not bind patient IgE, thus validating our procedure for identifying negative control peptides. The PD index is a statistically validated method to detect discrete regions of proteins that have a high probability of cross-reacting with IgE from allergic patients. PMID:18950868
Graph-based optimization of epitope coverage for vaccine antigen design

DOE PAGES

Theiler, James Patrick; Korber, Bette Tina Marie

2017-01-29

Epigraph is a recently developed algorithm that enables the computationally efficient design of single or multi-antigen vaccines to maximize the potential epitope coverage for a diverse pathogen population. Potential epitopes are defined as short contiguous stretches of proteins, comparable in length to T-cell epitopes. This optimal coverage problem can be formulated in terms of a directed graph, with candidate antigens represented as paths that traverse this graph. Epigraph protein sequences can also be used as the basis for designing peptides for experimental evaluation of immune responses in natural infections to highly variable proteins. The epigraph tool suite also enables rapidmore » characterization of populations of diverse sequences from an immunological perspective. Fundamental distance measures are based on immunologically relevant shared potential epitope frequencies, rather than simple Hamming or phylogenetic distances. Here, we provide a mathematical description of the epigraph algorithm, include a comparison of different heuristics that can be used when graphs are not acyclic, and we describe an additional tool we have added to the web-based epigraph tool suite that provides frequency summaries of all distinct potential epitopes in a population. Lastly, we also show examples of the graphical output and summary tables that can be generated using the epigraph tool suite and explain their content and applications.« less
Graph-based optimization of epitope coverage for vaccine antigen design

DOE Office of Scientific and Technical Information (OSTI.GOV)

Theiler, James Patrick; Korber, Bette Tina Marie

Epigraph is a recently developed algorithm that enables the computationally efficient design of single or multi-antigen vaccines to maximize the potential epitope coverage for a diverse pathogen population. Potential epitopes are defined as short contiguous stretches of proteins, comparable in length to T-cell epitopes. This optimal coverage problem can be formulated in terms of a directed graph, with candidate antigens represented as paths that traverse this graph. Epigraph protein sequences can also be used as the basis for designing peptides for experimental evaluation of immune responses in natural infections to highly variable proteins. The epigraph tool suite also enables rapidmore » characterization of populations of diverse sequences from an immunological perspective. Fundamental distance measures are based on immunologically relevant shared potential epitope frequencies, rather than simple Hamming or phylogenetic distances. Here, we provide a mathematical description of the epigraph algorithm, include a comparison of different heuristics that can be used when graphs are not acyclic, and we describe an additional tool we have added to the web-based epigraph tool suite that provides frequency summaries of all distinct potential epitopes in a population. Lastly, we also show examples of the graphical output and summary tables that can be generated using the epigraph tool suite and explain their content and applications.« less
Ranalexin. A novel antimicrobial peptide from bullfrog (Rana catesbeiana) skin, structurally related to the bacterial antibiotic, polymyxin.

PubMed

Clark, D P; Durell, S; Maloy, W L; Zasloff, M

1994-04-08

Antimicrobial peptides comprise a diverse class of molecules used in host defense by plants, insects, and animals. In this study we have isolated a novel antimicrobial peptide from the skin of the bullfrog, Rana catesbeiana. This 20 amino acid peptide, which we have termed Ranalexin, has the amino acid sequence: NH2-Phe-Leu-Gly-Gly-Leu-Ile-Lys-Ile-Val-Pro-Ala-Met-Ile-Cys-Ala-Val-Thr- Lys-Lys - Cys-COOH, and it contains a single intramolecular disulfide bond which forms a heptapeptide ring within the molecule. Structurally, Ranalexin resembles the bacterial antibiotic, polymyxin, which contains a similar heptapeptide ring. We have also cloned the cDNA for Ranalexin from a metamorphic R. catesbeiana tadpole cDNA library. Based on the cDNA sequence, it appears that Ranalexin is initially synthesized as a propeptide with a putative signal sequence and an acidic amino acid-rich region at its amino-terminal end. Interestingly, the putative signal sequence of the Ranalexin cDNA is strikingly similar to the signal sequence of opioid peptide precursors isolated from the skin of the South American frogs Phyllomedusa sauvagei and Phyllomedusa bicolor. Northern blot analysis and in situ hybridization experiments demonstrated that Ranalexin mRNA is first expressed in R. catesbeiana skin at metamorphosis and continues to be expressed into adulthood.
Puromycin-sensitive aminopeptidase is the major peptidase responsible for digesting polyglutamine sequences released by proteasomes during protein degradation

PubMed Central

Bhutani, N; Venkatraman, P; Goldberg, A L

2007-01-01

Long stretches of glutamine (Q) residues are found in many cellular proteins. Expansion of these polyglutamine (polyQ) sequences is the underlying cause of several neurodegenerative diseases (e.g. Huntington's disease). Eukaryotic proteasomes have been found to digest polyQ sequences in proteins very slowly, or not at all, and to release such potentially toxic sequences for degradation by other peptidases. To identify these key peptidases, we investigated the degradation in cell extracts of model Q-rich fluorescent substrates and peptides containing 10–30 Q's. Their degradation at neutral pH was due to a single aminopeptidase, the puromycin-sensitive aminopeptidase (PSA, cytosol alanyl aminopeptidase). No other known cytosolic aminopeptidase or endopeptidase was found to digest these polyQ peptides. Although tripeptidyl peptidase II (TPPII) exhibited limited activity, studies with specific inhibitors, pure enzymes and extracts of cells treated with siRNA for TPPII or PSA showed PSA to be the rate-limiting activity against polyQ peptides up to 30 residues long. (PSA digests such Q sequences, shorter ones and typical (non-repeating) peptides at similar rates.) Thus, PSA, which is induced in neurons expressing mutant huntingtin, appears critical in preventing the accumulation of polyQ peptides in normal cells, and its activity may influence susceptibility to polyQ diseases. PMID:17318184
Dual-functioning peptides discovered by phage display increase the magnitude and specificity of BMSC attachment to mineralized biomaterials.

PubMed

Ramaraju, Harsha; Miller, Sharon J; Kohn, David H

2017-07-01

Design of biomaterials for cell-based therapies requires presentation of specific physical and chemical cues to cells, analogous to cues provided by native extracellular matrices (ECM). We previously identified a peptide sequence with high affinity towards apatite (VTKHLNQISQSY, VTK) using phage display. The aims of this study were to identify a human MSC-specific peptide sequence through phage display, combine it with the apatite-specific sequence, and verify the specificity of the combined dual-functioning peptide to both apatite and human bone marrow stromal cells. In this study, a combinatorial phage display identified the cell binding sequence (DPIYALSWSGMA, DPI) which was combined with the mineral binding sequence to generate the dual peptide DPI-VTK. DPI-VTK demonstrated significantly greater binding affinity (1/K D ) to apatite surfaces compared to VTK, phosphorylated VTK (VTK phos ), DPI-VTK phos , RGD-VTK, and peptide-free apatite surfaces (p < 0.01), while significantly increasing hBMSC adhesion strength (τ 50 , p < 0.01). MSCs demonstrated significantly greater adhesion strength to DPI-VTK compared to other cell types, while attachment of MC3T3 pre-osteoblasts and murine fibroblasts was limited (p < 0.01). MSCs on DPI-VTK coated surfaces also demonstrated increased spreading compared to pre-osteoblasts and fibroblasts. MSCs cultured on DPI-VTK coated apatite films exhibited significantly greater proliferation compared to controls (p < 0.001). Moreover, early and late stage osteogenic differentiation markers were elevated on DPI-VTK coated apatite films compared to controls. Taken together, phage display can identify non-obvious cell and material specific peptides to increase human MSC adhesion strength to specific biomaterial surfaces and subsequently increase cell proliferation and differentiation. These new peptides expand biomaterial design methodology for cell-based regeneration of bone defects. This strategy of combining cell and material binding phage display derived peptides is broadly applicable to a variety of systems requiring targeted adhesion of specific cell populations, and may be generalized to the engineering of any adhesion surface. Copyright © 2017 Elsevier Ltd. All rights reserved.
ArrayPitope: Automated Analysis of Amino Acid Substitutions for Peptide Microarray-Based Antibody Epitope Mapping.

PubMed

Hansen, Christian Skjødt; Østerbye, Thomas; Marcatili, Paolo; Lund, Ole; Buus, Søren; Nielsen, Morten

2017-01-01

Identification of epitopes targeted by antibodies (B cell epitopes) is of critical importance for the development of many diagnostic and therapeutic tools. For clinical usage, such epitopes must be extensively characterized in order to validate specificity and to document potential cross-reactivity. B cell epitopes are typically classified as either linear epitopes, i.e. short consecutive segments from the protein sequence or conformational epitopes adapted through native protein folding. Recent advances in high-density peptide microarrays enable high-throughput, high-resolution identification and characterization of linear B cell epitopes. Using exhaustive amino acid substitution analysis of peptides originating from target antigens, these microarrays can be used to address the specificity of polyclonal antibodies raised against such antigens containing hundreds of epitopes. However, the interpretation of the data provided in such large-scale screenings is far from trivial and in most cases it requires advanced computational and statistical skills. Here, we present an online application for automated identification of linear B cell epitopes, allowing the non-expert user to analyse peptide microarray data. The application takes as input quantitative peptide data of fully or partially substituted overlapping peptides from a given antigen sequence and identifies epitope residues (residues that are significantly affected by substitutions) and visualize the selectivity towards each residue by sequence logo plots. Demonstrating utility, the application was used to identify and address the antibody specificity of 18 linear epitope regions in Human Serum Albumin (HSA), using peptide microarray data consisting of fully substituted peptides spanning the entire sequence of HSA and incubated with polyclonal rabbit anti-HSA (and mouse anti-rabbit-Cy3). The application is made available at: www.cbs.dtu.dk/services/ArrayPitope.
Structure of genes for dermaseptins B, antimicrobial peptides from frog skin. Exon 1-encoded prepropeptide is conserved in genes for peptides of highly different structures and activities.

PubMed

Vouille, V; Amiche, M; Nicolas, P

1997-09-01

We cloned the genes of two members of the dermaseptin family, broad-spectrum antimicrobial peptides isolated from the skin of the arboreal frog Phyllomedusa bicolor. The dermaseptin gene Drg2 has a 2-exon coding structure interrupted by a small 137-bp intron, wherein exon 1 encoded a 22-residue hydrophobic signal peptide and the first three amino acids of the acidic propiece; exon 2 contained the 18 additional acidic residues of the propiece plus a typical prohormone processing signal Lys-Arg and a 32-residue dermaseptin progenitor sequence. The dermaseptin genes Drg2 and Drg1g2 have conserved sequences at both untranslated ends and in the first and second coding exons. In contrast, Drg1g2 comprises a third coding exon for a short version of the acidic propiece and a second dermaseptin progenitor sequence. Structural conservation between the two genes suggests that Drg1g2 arose recently from an ancestral Drg2-like gene through amplification of part of the second coding exon and 3'-untranslated region. Analysis of the cDNAs coding precursors for several frog skin peptides of highly different structures and activities demonstrates that the signal peptides and part of the acidic propieces are encoded by conserved nucleotides encompassed by the first coding exon of the dermaseptin genes. The organization of the genes that belong to this family, with the signal peptide and the progenitor sequence on separate exons, permits strikingly different peptides to be directed into the secretory pathway. The recruitment of such a homologous 'secretory' exon by otherwise non-homologous genes may have been an early event in the evolution of amphibian.
A structural basis for antigen presentation by the MHC class Ib molecule, Qa-1b.

PubMed

Zeng, Li; Sullivan, Lucy C; Vivian, Julian P; Walpole, Nicholas G; Harpur, Christopher M; Rossjohn, Jamie; Clements, Craig S; Brooks, Andrew G

2012-01-01

The primary function of the monomorphic MHC class Ib molecule Qa-1(b) is to present peptides derived from the leader sequences of other MHC class I molecules for recognition by the CD94-NKG2 receptors expressed by NK and T cells. Whereas the mode of peptide presentation by its ortholog HLA-E, and subsequent recognition by CD94-NKG2A, is known, the molecular basis of Qa-1(b) function is unclear. We have assessed the interaction between Qa-1(b) and CD94-NKG2A and shown that they interact with an affinity of 17 μM. Furthermore, we have determined the structure of Qa-1(b) bound to the leader sequence peptide, Qdm (AMAPRTLLL), to a resolution of 1.9 Å and compared it with that of HLA-E. The crystal structure provided a basis for understanding the restricted peptide repertoire of Qa-1(b). Whereas the Qa-1(b-AMAPRTLLL) complex was similar to that of HLA-E, significant sequence and structural differences were observed between the respective Ag-binding clefts. However, the conformation of the Qdm peptide bound by Qa-1(b) was very similar to that of peptide bound to HLA-E. Although a number of conserved innate receptors can recognize heterologous ligands from other species, the structural differences between Qa-1(b) and HLA-E manifested in CD94-NKG2A ligand recognition being species specific despite similarities in peptide sequence and conformation. Collectively, our data illustrate the structural homology between Qa-1(b) and HLA-E and provide a structural basis for understanding peptide repertoire selection and the specificity of the interaction of Qa-1(b) with CD94-NKG2 receptors.
Contribution of Peptide Backbone to Anti-Citrullinated Peptide Antibody Reactivity

PubMed Central

Trier, Nicole Hartwig; Dam, Catharina Essendrup; Olsen, Dorthe Tange; Hansen, Paul Robert; Houen, Gunnar

2015-01-01

Rheumatoid arthritis (RA) is one of the most common autoimmune diseases, affecting approximately 1–2% of the world population. One of the characteristic features of RA is the presence of autoantibodies. Especially the highly specific anti-citrullinated peptide antibodies (ACPAs), which have been found in up to 70% of RA patients’ sera, have received much attention. Several citrullinated proteins are associated with RA, suggesting that ACPAs may react with different sequence patterns, separating them from traditional antibodies, whose reactivity usually is specific towards a single target. As ACPAs have been suggested to be involved in the development of RA, knowledge about these antibodies may be crucial. In this study, we examined the influence of peptide backbone for ACPA reactivity in immunoassays. The antibodies were found to be reactive with a central Cit-Gly motif being essential for ACPA reactivity and to be cross-reactive between the selected citrullinated peptides. The remaining amino acids within the citrullinated peptides were found to be of less importance for antibody reactivity. Moreover, these findings indicated that the Cit-Gly motif in combination with peptide backbone is essential for antibody reactivity. Based on these findings it was speculated that any amino acid sequence, which brings the peptide into a properly folded structure for antibody recognition is sufficient for antibody reactivity. These findings are in accordance with the current hypothesis that structural homology rather than sequence homology are favored between citrullinated epitopes. These findings are important in relation to clarifying the etiology of RA and to determine the nature of ACPAs, e.g. why some Cit-Gly-containing sequences are not targeted by ACPAs. PMID:26657009
Activity of human kallikrein-related peptidase 6 (KLK6) on substrates containing sequences of basic amino acids. Is it a processing protease?

PubMed

Silva, Roberta N; Oliveira, Lilian C G; Parise, Carolina B; Oliveira, Juliana R; Severino, Beatrice; Corvino, Angela; di Vaio, Paola; Temussi, Piero A; Caliendo, Giuseppe; Santagada, Vincenzo; Juliano, Luiz; Juliano, Maria A

2017-05-01

Human kallikrein 6 (KLK6) is highly expressed in the central nervous system and with elevated level in demyelinating disease. KLK6 has a very restricted specificity for arginine (R) and hydrolyses myelin basic protein, protein activator receptors and human ionotropic glutamate receptor subunits. Here we report a previously unreported activity of KLK6 on peptides containing clusters of basic amino acids, as in synthetic fluorogenic peptidyl-Arg-7-amino-4-carbamoylmethylcoumarin (peptidyl-ACC) peptides and FRET peptides in the format of Abz-peptidyl-Q-EDDnp (where Abz=ortho-aminobenzoic acid and Q-EDDnp=glutaminyl-N-(2,4-dinitrophenyl) ethylenediamine), in which pairs or sequences of basic amino acids (R or K) were introduced. Surprisingly, KLK6 hydrolyzed the fluorogenic peptides Bz-A-R ↓ R-ACC and Z-R ↓ R-MCA between the two R groups, resulting in non-fluorescent products. FRET peptides containing furin processing sequences of human MMP-14, nerve growth factor (NGF), Neurotrophin-3 (NT-3) and Neurotrophin-4 (NT-4) were cleaved by KLK6 at the same position expected by furin. Finally, KLK6 cleaved FRET peptides derived from human proenkephalin after the KR, the more frequent basic residues flanking enkephalins in human proenkephalin sequence. This result suggests the ability of KLK6 to release enkephalin from proenkephalin precursors and resembles furin a canonical processing proteolytic enzyme. Molecular models of peptides were built into the KLK6 structure and the marked preference of the cut between the two R of the examined peptides was related to the extended conformation of the substrates. Copyright © 2017 Elsevier B.V. All rights reserved.
ArrayPitope: Automated Analysis of Amino Acid Substitutions for Peptide Microarray-Based Antibody Epitope Mapping

PubMed Central

Hansen, Christian Skjødt; Østerbye, Thomas; Marcatili, Paolo; Lund, Ole; Buus, Søren

2017-01-01

Identification of epitopes targeted by antibodies (B cell epitopes) is of critical importance for the development of many diagnostic and therapeutic tools. For clinical usage, such epitopes must be extensively characterized in order to validate specificity and to document potential cross-reactivity. B cell epitopes are typically classified as either linear epitopes, i.e. short consecutive segments from the protein sequence or conformational epitopes adapted through native protein folding. Recent advances in high-density peptide microarrays enable high-throughput, high-resolution identification and characterization of linear B cell epitopes. Using exhaustive amino acid substitution analysis of peptides originating from target antigens, these microarrays can be used to address the specificity of polyclonal antibodies raised against such antigens containing hundreds of epitopes. However, the interpretation of the data provided in such large-scale screenings is far from trivial and in most cases it requires advanced computational and statistical skills. Here, we present an online application for automated identification of linear B cell epitopes, allowing the non-expert user to analyse peptide microarray data. The application takes as input quantitative peptide data of fully or partially substituted overlapping peptides from a given antigen sequence and identifies epitope residues (residues that are significantly affected by substitutions) and visualize the selectivity towards each residue by sequence logo plots. Demonstrating utility, the application was used to identify and address the antibody specificity of 18 linear epitope regions in Human Serum Albumin (HSA), using peptide microarray data consisting of fully substituted peptides spanning the entire sequence of HSA and incubated with polyclonal rabbit anti-HSA (and mouse anti-rabbit-Cy3). The application is made available at: www.cbs.dtu.dk/services/ArrayPitope. PMID:28095436

Inadequate Reference Datasets Biased toward Short Non-epitopes Confound B-cell Epitope Prediction*

PubMed Central

Rahman, Kh. Shamsur; Chowdhury, Erfan Ullah; Sachse, Konrad; Kaltenboeck, Bernhard

2016-01-01

X-ray crystallography has shown that an antibody paratope typically binds 15–22 amino acids (aa) of an epitope, of which 2–5 randomly distributed amino acids contribute most of the binding energy. In contrast, researchers typically choose for B-cell epitope mapping short peptide antigens in antibody binding assays. Furthermore, short 6–11-aa epitopes, and in particular non-epitopes, are over-represented in published B-cell epitope datasets that are commonly used for development of B-cell epitope prediction approaches from protein antigen sequences. We hypothesized that such suboptimal length peptides result in weak antibody binding and cause false-negative results. We tested the influence of peptide antigen length on antibody binding by analyzing data on more than 900 peptides used for B-cell epitope mapping of immunodominant proteins of Chlamydia spp. We demonstrate that short 7–12-aa peptides of B-cell epitopes bind antibodies poorly; thus, epitope mapping with short peptide antigens falsely classifies many B-cell epitopes as non-epitopes. We also show in published datasets of confirmed epitopes and non-epitopes a direct correlation between length of peptide antigens and antibody binding. Elimination of short, ≤11-aa epitope/non-epitope sequences improved datasets for evaluation of in silico B-cell epitope prediction. Achieving up to 86% accuracy, protein disorder tendency is the best indicator of B-cell epitope regions for chlamydial and published datasets. For B-cell epitope prediction, the most effective approach is plotting disorder of protein sequences with the IUPred-L scale, followed by antibody reactivity testing of 16–30-aa peptides from peak regions. This strategy overcomes the well known inaccuracy of in silico B-cell epitope prediction from primary protein sequences. PMID:27189949
Peptide vaccine against canine parvovirus: identification of two neutralization subsites in the N terminus of VP2 and optimization of the amino acid sequence.

PubMed Central

Casal, J I; Langeveld, J P; Cortés, E; Schaaper, W W; van Dijk, E; Vela, C; Kamstrup, S; Meloen, R H

1995-01-01

The N-terminal domain of the major capsid protein VP2 of canine parvovirus was shown to be an excellent target for development of a synthetic peptide vaccine, but detailed information about number of epitopes, optimal length, sequence choice, and site of coupling to the carrier protein was lacking. Therefore, several overlapping peptides based on this N terminus were synthesized to establish conditions for optimal and reproducible induction of neutralizing antibodies in rabbits. The specificity and neutralizing ability of the antibody response for these peptides were determined. Within the N-terminal 23 residues of VP2, two subsites able to induce neutralizing antibodies and which overlapped by only two glycine residues at positions 10 and 11 could be discriminated. The shortest sequence sufficient for neutralization induction was nine residues. Peptides longer than 13 residues consistently induced neutralization, provided that their N termini were located between positions 1 and 11 of VP2. The orientation of the peptides at the carrier protein was also of importance, being more effective when coupled through the N terminus than through the C terminus to keyhole limpet hemocyanin. The results suggest that the presence of amino acid residues 2 to 21 (and probably 3 to 17) of VP2 in a single peptide is preferable for a synthetic peptide vaccine. PMID:7474152
Peptide vaccine against canine parvovirus: identification of two neutralization subsites in the N terminus of VP2 and optimization of the amino acid sequence.

PubMed

Casal, J I; Langeveld, J P; Cortés, E; Schaaper, W W; van Dijk, E; Vela, C; Kamstrup, S; Meloen, R H

1995-11-01

The N-terminal domain of the major capsid protein VP2 of canine parvovirus was shown to be an excellent target for development of a synthetic peptide vaccine, but detailed information about number of epitopes, optimal length, sequence choice, and site of coupling to the carrier protein was lacking. Therefore, several overlapping peptides based on this N terminus were synthesized to establish conditions for optimal and reproducible induction of neutralizing antibodies in rabbits. The specificity and neutralizing ability of the antibody response for these peptides were determined. Within the N-terminal 23 residues of VP2, two subsites able to induce neutralizing antibodies and which overlapped by only two glycine residues at positions 10 and 11 could be discriminated. The shortest sequence sufficient for neutralization induction was nine residues. Peptides longer than 13 residues consistently induced neutralization, provided that their N termini were located between positions 1 and 11 of VP2. The orientation of the peptides at the carrier protein was also of importance, being more effective when coupled through the N terminus than through the C terminus to keyhole limpet hemocyanin. The results suggest that the presence of amino acid residues 2 to 21 (and probably 3 to 17) of VP2 in a single peptide is preferable for a synthetic peptide vaccine.
Ferrate oxidation of murine leukemia virus reverse transcriptase: identification of the template-primer binding domain.

PubMed

Reddy, G; Nanduri, V B; Basu, A; Modak, M J

1991-08-20

Treatment of murine leukemia virus reverse transcriptase (MuLV RT) with potassium ferrate, an oxidizing agent known to oxidize amino acids involved in phosphate binding domains of proteins, results in the irreversible inactivation of both the DNA polymerase and the RNase H activities. Significant protection from ferrate-mediated inactivation is observed in the presence of template-primer but not in the presence of substrate deoxynucleoside triphosphates. Furthermore, ferrate-treated enzyme loses template-primer binding activity as judged by UV-mediated cross-linking of radiolabeled DNA. Comparative tryptic peptide mapping by reverse-phase HPLC of native and ferrate-oxidized enzyme indicated the presence of two new peptides eluting at 38 and 57 min and a significant loss of a peptide eluting at 74 min. Purification, amino acid composition, and sequencing of these affected peptides revealed that they correspond to amino acid residues 285-295, 630-640, and 586-599, respectively, in the primary amino acid sequence of MuLV RT. These results indicate that the domains constituted by the above peptides are important for the template-primer binding function in MuLV RT. Peptide I is located in the polymerase domain whereas peptides II and III are located in the RNase H domain. Amino acid sequence analysis of peptides I and II suggested Lys-285 and Cys-635 as the probable sites of ferrate action.
Novel ZnO-binding peptides obtained by the screening of a phage display peptide library

NASA Astrophysics Data System (ADS)

Golec, Piotr; Karczewska-Golec, Joanna; Łoś, Marcin; Węgrzyn, Grzegorz

2012-11-01

Zinc oxide (ZnO) is a semiconductor compound with a potential for wide use in various applications, including biomaterials and biosensors, particularly as nanoparticles (the size range of ZnO nanoparticles is from 2 to 100 nm, with an average of about 35 nm). Here, we report isolation of novel ZnO-binding peptides, by screening of a phage display library. Interestingly, amino acid sequences of the ZnO-binding peptides reported in this paper and those described previously are significantly different. This suggests that there is a high variability in sequences of peptides which can bind particular inorganic molecules, indicating that different approaches may lead to discovery of different peptides of generally the same activity (e.g., binding of ZnO) but having various detailed properties, perhaps crucial under specific conditions of different applications.
Use of synthetic peptide libraries for the H-2Kd binding motif identification.

PubMed

Quesnel, A; Casrouge, A; Kourilsky, P; Abastado, J P; Trudelle, Y

1995-01-01

To identify Kd-binding peptides, an approach based on small peptide libraries has been developed. These peptide libraries correspond to all possible single-amino acid variants of a particular Kd-binding peptide, SYIPSAEYI, an analog of the Plasmodium berghei 252-260 antigenic peptide SYIPSAEKI. In the parent sequence, each position is replaced by all the genetically encoded amino acids (except cysteine). The multiple analog syntheses are performed either by the Divide Couple and Recombine method or by the Single Resin method and generate mixtures containing 19 peptides. The present report deals with the synthesis, the purification, the chemical characterization by amino acid analysis and electrospray mass spectrometry (ES-MS), and the application of such mixtures in binding tests with a soluble, functionally empty, single-chain H-2Kd molecule denoted SC-Kd. For each mixture, bound peptides were eluted and analyzed by sequencing. Since the binding tests were realized in noncompetitive conditions, our results show that a much broader set of peptides bind to Kd than expected from previous studies. This may be of practical importance when looking for low affinity peptides such as tumor peptides capable of eliciting protective immune response.
Food-derived immunomodulatory peptides.

PubMed

Santiago-López, Lourdes; Hernández-Mendoza, Adrián; Vallejo-Cordoba, Belinda; Mata-Haro, Verónica; González-Córdova, Aarón F

2016-08-01

Food proteins contain specific amino acid sequences within their structures that may positively impact bodily functions and have multiple immunomodulatory effects. The functional properties of these specific sequences, also referred to as bioactive peptides, are revealed only after the degradation of native proteins during digestion processes. Currently, milk proteins have been the most explored source of bioactive peptides, which presents an interesting opportunity for the dairy industry. However, plant- and animal-derived proteins have also been shown to be important sources of bioactive peptides. This review summarizes the in vitro and in vivo evidence of the role of various food proteins as sources of immunomodulatory peptides and discusses the possible pathways involving these properties. © 2016 Society of Chemical Industry. © 2016 Society of Chemical Industry.
Penetration of short fluorescence-labeled peptides into the nucleus in HeLa cells and in vitro specific interaction of the peptides with deoxyribooligonucleotides and DNA.

PubMed

Fedoreyeva, L I; Kireev, I I; Khavinson, V Kh; Vanyushin, B F

2011-11-01

Marked fluorescence in cytoplasm, nucleus, and nucleolus was observed in HeLa cells after incubation with each of several fluorescein isothiocyanate-labeled peptides (epithalon, Ala-Glu-Asp-Gly; pinealon, Glu-Asp-Arg; testagen, Lys-Glu-Asp-Gly). This means that short biologically active peptides are able to penetrate into an animal cell and its nucleus and, in principle they may interact with various components of cytoplasm and nucleus including DNA and RNA. It was established that various initial (intact) peptides differently affect the fluorescence of the 5,6-carboxyfluorescein-labeled deoxyribooligonucleotides and DNA-ethidium bromide complexes. The Stern-Volmer constants characterizing the degree of fluorescence quenching of various single- and double-stranded fluorescence-labeled deoxyribooligonucleotides with short peptides used were different depending on the peptide primary structures. This indicates the specific interaction between short biologically active peptides and nucleic acid structures. On binding to them, the peptides discriminate between different nucleotide sequences and recognize even their cytosine methylation status. Judging from corresponding constants of the fluorescence quenching, the epithalon, pinealon, and bronchogen (Ala-Glu-Asp-Leu) bind preferentially with deoxyribooligonucleotides containing CNG sequence (CNG sites are targets for cytosine DNA methylation in eukaryotes). Epithalon, testagen, and pinealon seem to preferentially bind with CAG- but bronchogen with CTG-containing sequences. The site-specific interactions of peptides with DNA can control epigenetically the cell genetic functions, and they seem to play an important role in regulation of gene activity even at the earliest stages of life origin and in evolution.
Photodissociative Cross-Linking of Non-covalent Peptide-Peptide Ion Complexes in the Gas Phase

NASA Astrophysics Data System (ADS)

Nguyen, Huong T. H.; Andrikopoulos, Prokopis C.; Rulíšek, Lubomír; Shaffer, Christopher J.; Tureček, František

2018-05-01

We report a gas-phase UV photodissociation study investigating non-covalent interactions between neutral hydrophobic pentapeptides and peptide ions incorporating a diazirine-tagged photoleucine residue. Phenylalanine (Phe) and proline (Pro) were chosen as the conformation-affecting residues that were incorporated into a small library of neutral pentapeptides. Gas-phase ion-molecule complexes of these peptides with photo-labeled pentapeptides were subjected to photodissociation. Selective photocleavage of the diazirine ring at 355 nm formed short-lived carbene intermediates that underwent cross-linking by insertion into H-X bonds of the target peptide. The cross-link positions were established from collision-induced dissociation tandem mass spectra (CID-MS3) providing sequence information on the covalent adducts. Effects of the amino acid residue (Pro or Phe) and its position in the target peptide sequence were evaluated. For proline-containing peptides, interactions resulting in covalent cross-links in these complexes became more prominent as proline was moved towards the C-terminus of the target peptide sequence. The photocross-linking yields of phenylalanine-containing peptides depended on the position of both phenylalanine and photoleucine. Density functional theory calculations were used to assign structures of low-energy conformers of the (GLPMG + GLL*LK + H)+ complex. Born-Oppenheimer molecular dynamics trajectory calculations were used to capture the thermal motion in the complexes within 100 ps and determine close contacts between the incipient carbene and the H-X bonds in the target peptide. This provided atomic-level resolution of potential cross-links that aided spectra interpretation and was in agreement with experimental data. [Figure not available: see fulltext.
Discovery of 12-mer peptides that bind to wood lignin

PubMed Central

Yamaguchi, Asako; Isozaki, Katsuhiro; Nakamura, Masaharu; Takaya, Hikaru; Watanabe, Takashi

2016-01-01

Lignin, an abundant terrestrial polymer, is the only large-volume renewable feedstock composed of an aromatic skeleton. Lignin has been used mostly as an energy source during paper production; however, recent interest in replacing fossil fuels with renewable resources has highlighted its potential value in providing aromatic chemicals. Highly selective degradation of lignin is pivotal for industrial production of paper, biofuels, chemicals, and materials. However, few studies have examined natural and synthetic molecular components recognizing the heterogeneous aromatic polymer. Here, we report the first identification of lignin-binding peptides possessing characteristic sequences using a phage display technique. The consensus sequence HFPSP was found in several lignin-binding peptides, and the outer amino acid sequence affected the binding affinity of the peptides. Substitution of phenylalanine7 with Ile in the lignin-binding peptide C416 (HFPSPIFQRHSH) decreased the affinity of the peptide for softwood lignin without changing its affinity for hardwood lignin, indicating that C416 recognised structural differences between the lignins. Circular dichroism spectroscopy demonstrated that this peptide adopted a highly flexible random coil structure, allowing key residues to be appropriately arranged in relation to the binding site in lignin. These results provide a useful platform for designing synthetic and biological catalysts selectively bind to lignin. PMID:26903196
On the Split Personality of Penultimate Proline

PubMed Central

Glover, Matthew S.; Shi, Liuqing; Fuller, Daniel R.; Arnold, Randy J.; Radivojac, Predrag; Clemmer, David E.

2014-01-01

The influence of the position of the amino acid proline in polypeptide sequences is examined by a combination of ion mobility spectrometry-mass spectrometry (IMS-MS), amino acid substitutions, and molecular modeling. The results suggest that when proline exists as the second residue from the N-terminus (i.e., penultimate proline), two families of conformers are formed. We demonstrate the existence of these families by a study of a series of truncated and mutated peptides derived from the 11-residue peptide Ser1-Pro2-Glu3-Leu4-Pro5-Ser6-Pro7-Gln8-Ala9-Glu10-Lys11. We find that every peptide from this sequence with a penultimate proline residue has multiple conformations. Substitution of Ala for Pro residues indicates that multiple conformers arise from the cis- trans isomerization of Xaa1–Pro2 peptide bonds as Xaa–Ala peptide bonds are unlikely to adopt the cis isomer, and examination of spectra from a library of 58 peptides indicates that ~80% of sequences show this effect. A simple mechanism suggesting that the barrier between the cis-and trans-proline forms is lowered because of low steric impedance is proposed. This observation may have interesting biological implications as well, and we note that a number of biologically active peptides have penultimate proline residues. PMID:25503299
Calibration of mass spectrometric peptide mass fingerprint data without specific external or internal calibrants

PubMed Central

Wolski, Witold E; Lalowski, Maciej; Jungblut, Peter; Reinert, Knut

2005-01-01

Background Peptide Mass Fingerprinting (PMF) is a widely used mass spectrometry (MS) method of analysis of proteins and peptides. It relies on the comparison between experimentally determined and theoretical mass spectra. The PMF process requires calibration, usually performed with external or internal calibrants of known molecular masses. Results We have introduced two novel MS calibration methods. The first method utilises the local similarity of peptide maps generated after separation of complex protein samples by two-dimensional gel electrophoresis. It computes a multiple peak-list alignment of the data set using a modified Minimum Spanning Tree (MST) algorithm. The second method exploits the idea that hundreds of MS samples are measured in parallel on one sample support. It improves the calibration coefficients by applying a two-dimensional Thin Plate Splines (TPS) smoothing algorithm. We studied the novel calibration methods utilising data generated by three different MALDI-TOF-MS instruments. We demonstrate that a PMF data set can be calibrated without resorting to external or relying on widely occurring internal calibrants. The methods developed here were implemented in R and are part of the BioConductor package mscalib available from . Conclusion The MST calibration algorithm is well suited to calibrate MS spectra of protein samples resulting from two-dimensional gel electrophoretic separation. The TPS based calibration algorithm might be used to correct systematic mass measurement errors observed for large MS sample supports. As compared to other methods, our combined MS spectra calibration strategy increases the peptide/protein identification rate by an additional 5 – 15%. PMID:16102175
Screening and Identification of Peptides Specifically Targeted to Gastric Cancer Cells from a Phage Display Peptide Library

PubMed

Sahin, Deniz; Taflan, Sevket Onur; Yartas, Gizem; Ashktorab, Hassan; Smoot, Duane T

2018-04-25

Background: Gastric cancer is the second most common cancer among the malign cancer types. Inefficiency of traditional techniques both in diagnosis and therapy of the disease makes the development of alternative and novel techniques indispensable. As an alternative to traditional methods, tumor specific targeting small peptides can be used to increase the efficiency of the treatment and reduce the side effects related to traditional techniques. The aim of this study is screening and identification of individual peptides specifically targeted to human gastric cancer cells using a phage-displayed peptide library and designing specific peptide sequences by using experimentally-eluted peptide sequences. Methods: Here, MKN-45 human gastric cancer cells and HFE-145 human normal gastric epithelial cells were used as the target and control cells, respectively. 5 rounds of biopannning with a phage display 12-peptide library were applied following subtraction biopanning with HFE-145 control cells. The selected phage clones were established by enzyme-linked immunosorbent assay and immunofluorescence detection. We first obtain random phage clones after five biopanning rounds, determine the binding levels of each individual clone. Then, we analyze the frequencies of each amino acid in best binding clones to determine positively overexpressed amino acids for designing novel peptide sequences. Results: DE532 (VETSQYFRGTLS) phage clone was screened positive, showing specific binding on MKN-45 gastric cancer cells. DE-Obs (HNDLFPSWYHNY) peptide, which was designed by using amino acid frequencies of experimentally selected peptides in the 5th round of biopanning, showed specific binding in MKN-45 cells. Conclusion: Selection and characterization of individual clones may give us specifically binding peptides, but more importantly, data extracted from eluted phage clones may be used to design theoretical peptides with better binding properties than even experimentally selected ones. Both peptides, experimental and designed, may be potential candidates to be developed as useful diagnostic or therapeutic ligand molecules in gastric cancer research. Creative Commons Attribution License
Proteome-wide inference of human endophilin 1-binding peptides.

PubMed

Wu, Gang; Zhang, Zeng-Li; Fu, Chun-Jiang; Lv, Feng-Lin; Tian, Fei-Fei

2012-10-01

Human endophilin 1 (hEndo1) is a multifunctional protein that was found to bind a wide spectrum of prolinerich endocytic proteins through its Src homology 3 (SH3) domain. In order to elucidate the unknown biological functions of hEndo1, it is essential to find out the cytoplasmic components that hEndo1 recognizes and binds. However, it is too time-consuming and expensive to synthesize all peptide candidates found in the human proteome and to perform hEndo1 SH3-peptide affinity assay to identify the hEndo1-binding partners. In the present work, we describe a structure/ sequence-hybrid approach to perform proteome-wide inference of human hEndo1-binding peptides using the information gained from both the primary sequence of affinity-known peptides and the interaction profile involved in hEndo1 SH3-peptide complex three-dimensional structures. Modeling results show that (i) different residue positions contribute distinctly to peptide affinity and specificity; P-1, P2 and P4 are most important, P1 and P3 are also effective, and P-3, P-2, P0, P5 and P6 are relatively insignificant, (ii) the consensus core PXXP motif is necessary but not sufficient for determining high affinity of peptides, and some other positions must be also essential in the hEndo1 SH3-peptide binding, and (iii) the alternating arrangement of polar and nonpolar amino acids along peptide sequence is critical for the high specificity of peptide recognition by hEndo1 SH3 domain. In addition, we also find that the residue type at a specific position of hEndo1-binding peptides is not stringently invariable; amino acids that possess similar polarity could replace each other without substantial influence on peptide affinity. In this way, hEndo1 presents a broad specificity in the peptide ligands that it binds.
Identification of novel bacteriophage peptides using a combination of gene sequence LC-MS-MS analysis and BLASTP

USDA-ARS?s Scientific Manuscript database

Introduction: In an effort to characterize novel bacteriophage with lytic activity against pathogenic E.coli associated with foodborne illness, gene sequencing and mass spectrometry have been used to identify expressed peptides which differentiate isolated bacteriophage from other known phage. Here,...
PeptideNavigator: An interactive tool for exploring large and complex data sets generated during peptide-based drug design projects.

PubMed

Diller, Kyle I; Bayden, Alexander S; Audie, Joseph; Diller, David J

2018-01-01

There is growing interest in peptide-based drug design and discovery. Due to their relatively large size, polymeric nature, and chemical complexity, the design of peptide-based drugs presents an interesting "big data" challenge. Here, we describe an interactive computational environment, PeptideNavigator, for naturally exploring the tremendous amount of information generated during a peptide drug design project. The purpose of PeptideNavigator is the presentation of large and complex experimental and computational data sets, particularly 3D data, so as to enable multidisciplinary scientists to make optimal decisions during a peptide drug discovery project. PeptideNavigator provides users with numerous viewing options, such as scatter plots, sequence views, and sequence frequency diagrams. These views allow for the collective visualization and exploration of many peptides and their properties, ultimately enabling the user to focus on a small number of peptides of interest. To drill down into the details of individual peptides, PeptideNavigator provides users with a Ramachandran plot viewer and a fully featured 3D visualization tool. Each view is linked, allowing the user to seamlessly navigate from collective views of large peptide data sets to the details of individual peptides with promising property profiles. Two case studies, based on MHC-1A activating peptides and MDM2 scaffold design, are presented to demonstrate the utility of PeptideNavigator in the context of disparate peptide-design projects. Copyright © 2017 Elsevier Ltd. All rights reserved.
Identification of chondrocyte-binding peptides by phage display.

PubMed

Cheung, Crystal S F; Lui, Julian C; Baron, Jeffrey

2013-07-01

As an initial step toward targeting cartilage tissue for potential therapeutic applications, we sought cartilage-binding peptides using phage display, a powerful technology for selection of peptides that bind to molecules of interest. A library of phage displaying random 12-amino acid peptides was iteratively incubated with cultured chondrocytes to select phage that bind cartilage. The resulting phage clones demonstrated increased affinity to chondrocytes by ELISA, when compared to a wild-type, insertless phage. Furthermore, the selected phage showed little preferential binding to other cell types, including primary skin fibroblast, myocyte and hepatocyte cultures, suggesting a tissue-specific interaction. Immunohistochemical staining revealed that the selected phage bound chondrocytes themselves and the surrounding extracellular matrix. FITC-tagged peptides were synthesized based on the sequence of cartilage-binding phage clones. These peptides, but not a random peptide, bound cultured chondrocytes, and extracelluar matrix. In conclusion, using phage display, we identified peptide sequences that specifically target chondrocytes. We anticipate that such peptides may be coupled to therapeutic molecules to provide targeted treatment for cartilage disorders. Copyright © 2013 Orthopaedic Research Society.
Pharmacokinetic properties of tandem d-peptides designed for treatment of Alzheimer's disease.

PubMed

Leithold, Leonie H E; Jiang, Nan; Post, Julia; Niemietz, Nicole; Schartmann, Elena; Ziehm, Tamar; Kutzsche, Janine; Shah, N Jon; Breitkreutz, Jörg; Langen, Karl-Josef; Willuweit, Antje; Willbold, Dieter

2016-06-30

Peptides are more and more considered for the development of drug candidates. However, they frequently exhibit severe disadvantages such as instability and unfavourable pharmacokinetic properties. Many peptides are rapidly cleared from the organism and oral bioavailabilities as well as in vivo half-lives often remain low. In contrast, some peptides consisting solely of d-enantiomeric amino acid residues were shown to combine promising therapeutic properties with high proteolytic stability and enhanced pharmacokinetic parameters. Recently, we have shown that D3 and RD2 have highly advantageous pharmacokinetic properties. Especially D3 has already proven promising properties suitable for treatment of Alzheimer's disease. Here, we analyse the pharmacokinetic profiles of D3D3 and RD2D3, which are head-to-tail tandem d-peptides built of D3 and its derivative RD2. Both D3D3 and RD2D3 show proteolytic stability in mouse plasma and organ homogenates for at least 24h and in murine and human liver microsomes for 4h. Notwithstanding their high affinity to plasma proteins, both peptides are taken up into the brain following i.v. as well as i.p. administration. Although both peptides contain identical d-amino acid residues, they are arranged in a different sequence order and the peptides show differences in pharmacokinetic properties. After i.p. administration RD2D3 exhibits lower plasma clearance and higher bioavailability than D3D3. We therefore concluded that the amino acid sequence of RD2 leads to more favourable pharmacokinetic properties within the tandem peptide, which underlines the importance of particular sequence motifs, even in short peptides, for the design of further therapeutic d-peptides. Copyright © 2016 Elsevier B.V. All rights reserved.
Human lactoferricin derived di-peptides deploying loop structures induce apoptosis specifically in cancer cells through targeting membranous phosphatidylserine.

PubMed

Riedl, Sabrina; Leber, Regina; Rinner, Beate; Schaider, Helmut; Lohner, Karl; Zweytick, Dagmar

2015-11-01

Host defense-derived peptides have emerged as a novel strategy for the development of alternative anticancer therapies. In this study we report on characteristic features of human lactoferricin (hLFcin) derivatives which facilitate specific killing of cancer cells of melanoma, glioblastoma and rhabdomyosarcoma compared with non-specific derivatives and the synthetic peptide RW-AH. Changes in amino acid sequence of hLFcin providing 9-11 amino acids stretched derivatives LF11-316, -318 and -322 only yielded low antitumor activity. However, the addition of the repeat (di-peptide) and the retro-repeat (di-retro-peptide) sequences highly improved cancer cell toxicity up to 100% at 20 μM peptide concentration. Compared to the complete parent sequence hLFcin the derivatives showed toxicity on the melanoma cell line A375 increased by 10-fold and on the glioblastoma cell line U-87mg by 2-3-fold. Reduced killing velocity, apoptotic blebbing, activation of caspase 3/7 and formation of apoptotic DNA fragments proved that the active and cancer selective peptides, e.g. R-DIM-P-LF11-322, trigger apoptosis, whereas highly active, though non-selective peptides, such as DIM-LF11-318 and RW-AH seem to kill rapidly via necrosis inducing membrane lyses. Structural studies revealed specific toxicity on cancer cells by peptide derivatives with loop structures, whereas non-specific peptides comprised α-helical structures without loop. Model studies with the cancer membrane mimic phosphatidylserine (PS) gave strong evidence that PS only exposed by cancer cells is an important target for specific hLFcin derivatives. Other negatively charged membrane exposed molecules as sialic acid, heparan and chondroitin sulfate were shown to have minor impact on peptide activity. Copyright © 2015. Published by Elsevier B.V.
Functional analysis of Pacific oyster (Crassostrea gigas) β-thymosin: Focus on antimicrobial activity.

PubMed

Nam, Bo-Hye; Seo, Jung-Kil; Lee, Min Jeong; Kim, Young-Ok; Kim, Dong-Gyun; An, Cheul Min; Park, Nam Gyu

2015-07-01

An antimicrobial peptide, ∼5 kDa in size, was isolated and purified in its active form from the mantle of the Pacific oyster Crassostrea gigas by C18 reversed-phase high-performance liquid chromatography. Matrix-assisted laser desorption ionisation time-of-flight analysis revealed 4656.4 Da of the purified and unreduced peptide. A comparison of the N-terminal amino acid sequence of oyster antimicrobial peptide with deduced amino acid sequences in our local expressed sequence tag (EST) database of C. gigas (unpublished data) revealed that the oyster antimicrobial peptide sequence entirely matched the deduced amino acid sequence of an EST clone (HM-8_A04), which was highly homologous with the β-thymosin of other species. The cDNA possessed a 126-bp open reading frame that encoded a protein of 41 amino acids. To confirm the antimicrobial activity of C. gigas β-thymosin, we overexpressed a recombinant β-thymosin (rcgTβ) using a pET22 expression plasmid in an Escherichia coli system. The antimicrobial activity of rcgTβ was evaluated and demonstrated using a bacterial growth inhibition test in both liquid and solid cultures. Copyright © 2015 Elsevier Ltd. All rights reserved.

Subtle Changes in Peptide Conformation Profoundly Affect Recognition of the Non-Classical MHC Class I Molecule HLA-E by the CD94-NKG2 Natural Killer Cell Receptors

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hoare, Hilary L; Sullivan, Lucy C; Clements, Craig S

2008-03-31

Human leukocyte antigen (HLA)-E is a non-classical major histocompatibility complex class I molecule that binds peptides derived from the leader sequences of other HLA class I molecules. Natural killer cell recognition of these HLA-E molecules, via the CD94-NKG2 natural killer family, represents a central innate mechanism for monitoring major histocompatibility complex expression levels within a cell. The leader sequence-derived peptides bound to HLA-E exhibit very limited polymorphism, yet subtle differences affect the recognition of HLA-E by the CD94-NKG2 receptors. To better understand the basis for this peptide-specific recognition, we determined the structure of HLA-E in complex with two leader peptides,more » namely, HLA-Cw*07 (VMAPRALLL), which is poorly recognised by CD94-NKG2 receptors, and HLA-G*01 (VMAPRTLFL), a high-affinity ligand of CD94-NKG2 receptors. A comparison of these structures, both of which were determined to 2.5-Å resolution, revealed that allotypic variations in the bound leader sequences do not result in conformational changes in the HLA-E heavy chain, although subtle changes in the conformation of the peptide within the binding groove of HLA-E were evident. Accordingly, our data indicate that the CD94-NKG2 receptors interact with HLA-E in a manner that maximises the ability of the receptors to discriminate between subtle changes in both the sequence and conformation of peptides bound to HLA-E.« less
Development of new antiatherosclerotic and antithrombotic drugs utilizing F11 receptor (F11R/JAM-A) peptides.

PubMed

Babinska, A; Clement, C C; Swiatkowska, M; Szymanski, J; Shon, A; Ehrlich, Y H; Kornecki, E; Salifu, M O

2014-07-01

Peptides with enhanced resistance to proteolysis, based on the amino acid sequence of the F11 receptor molecule (F11R, aka JAM-A/Junctional adhesion molecule-A), were designed, prepared, and examined as potential candidates for the development of anti-atherosclerotic and anti-thrombotic therapeutic drugs. A sequence at the N-terminal of F11R together with another sequence located in the first Ig-loop of this protein, were identified to form a steric active-site operating in the F11R-dependent adhesion between cells that express F11R molecules on their external surface. In silico modeling of the complex between two polypeptide chains with the sequences positioned in the active-site was used to generate peptide-candidates designed to inhibit homophilic interactions between surface-located F11R molecules. The two lead F11R peptides were modified with D-Arg and D-Lys at selective sites, for attaining higher stability to proteolysis in vivo. Using molecular docking experiments we tested different conformational states and the putative binding affinity between two selected D-Arg and D-Lys-modified F11R peptides and the proposed binding pocket. The inhibitory effects of the F11R peptide 2HN-(dK)-SVT-(dR)-EDTGTYTC-CONH2 on antibody-induced platelet aggregation and on the adhesion of platelets to cytokine-inflammed endothelial cells are reported in detail, and the results point out the significant potential utilization of F11R peptides for the prevention and treatment of atherosclerotic plaques and associated thrombotic events. © 2014 Wiley Periodicals, Inc.
Protein sequences from mastodon and Tyrannosaurus rex revealed by mass spectrometry.

PubMed

Asara, John M; Schweitzer, Mary H; Freimark, Lisa M; Phillips, Matthew; Cantley, Lewis C

2007-04-13

Fossilized bones from extinct taxa harbor the potential for obtaining protein or DNA sequences that could reveal evolutionary links to extant species. We used mass spectrometry to obtain protein sequences from bones of a 160,000- to 600,000-year-old extinct mastodon (Mammut americanum) and a 68-million-year-old dinosaur (Tyrannosaurus rex). The presence of T. rex sequences indicates that their peptide bonds were remarkably stable. Mass spectrometry can thus be used to determine unique sequences from ancient organisms from peptide fragmentation patterns, a valuable tool to study the evolution and adaptation of ancient taxa from which genomic sequences are unlikely to be obtained.
Template Based Design of Anti-Metastatic Drugs from the Active Conformation of Laminin Peptide II

DTIC Science & Technology

2001-01-01

p40 (LBP/p40) gene Maeda, M., Kawasaki, K., Mu, Y., Kamada, H., during sea urchin development. Exp. Cell Res. 221, Tsutsumi, Y., Smith, T. J. & Mayumi...represents the average of six replicates + SEM . minance of putative heparin-binding phage recov- ered from elution with peptide 11. Putative heparin...scrambled sequence peptide, WAQADSTPE, was used as a sequence specificity control. The data shown is the average of six replicate wells ± SEM . Statistics were
Uncoupling GP1 and GP2 Expression in the Lassa Virus Glycoprotein Complex: Implications for GP1 Ectodomain Shedding

DTIC Science & Technology

2008-12-23

glycoprotein precursor (GPC) signal peptide (SP) or human IgG signal sequences (s.s.). GP2 was secreted from cells only when (1) the transmembrane (TM) domain... peptide (SP) or human IgG signal sequences (s.s.). GP2 was secreted from cells only when (1) the transmembrane (TM) domain was deleted, the...terminal signal peptide (SP), which directs the precursor to the endoplasmic retic- ulum (ER) for further processing [11]. The SP, which has been
Stable isotope, site-specific mass tagging for protein identification

DOEpatents

Chen, Xian

2006-10-24

Proteolytic peptide mass mapping as measured by mass spectrometry provides an important method for the identification of proteins, which are usually identified by matching the measured and calculated m/z values of the proteolytic peptides. A unique identification is, however, heavily dependent upon the mass accuracy and sequence coverage of the fragment ions generated by peptide ionization. The present invention describes a method for increasing the specificity, accuracy and efficiency of the assignments of particular proteolytic peptides and consequent protein identification, by the incorporation of selected amino acid residue(s) enriched with stable isotope(s) into the protein sequence without the need for ultrahigh instrumental accuracy. Selected amino acid(s) are labeled with .sup.13C/.sup.15N/.sup.2H and incorporated into proteins in a sequence-specific manner during cell culturing. Each of these labeled amino acids carries a defined mass change encoded in its monoisotopic distribution pattern. Through their characteristic patterns, the peptides with mass tag(s) can then be readily distinguished from other peptides in mass spectra. The present method of identifying unique proteins can also be extended to protein complexes and will significantly increase data search specificity, efficiency and accuracy for protein identifications.
Predicting PDZ domain mediated protein interactions from structure

PubMed Central

2013-01-01

Background PDZ domains are structural protein domains that recognize simple linear amino acid motifs, often at protein C-termini, and mediate protein-protein interactions (PPIs) in important biological processes, such as ion channel regulation, cell polarity and neural development. PDZ domain-peptide interaction predictors have been developed based on domain and peptide sequence information. Since domain structure is known to influence binding specificity, we hypothesized that structural information could be used to predict new interactions compared to sequence-based predictors. Results We developed a novel computational predictor of PDZ domain and C-terminal peptide interactions using a support vector machine trained with PDZ domain structure and peptide sequence information. Performance was estimated using extensive cross validation testing. We used the structure-based predictor to scan the human proteome for ligands of 218 PDZ domains and show that the predictions correspond to known PDZ domain-peptide interactions and PPIs in curated databases. The structure-based predictor is complementary to the sequence-based predictor, finding unique known and novel PPIs, and is less dependent on training–testing domain sequence similarity. We used a functional enrichment analysis of our hits to create a predicted map of PDZ domain biology. This map highlights PDZ domain involvement in diverse biological processes, some only found by the structure-based predictor. Based on this analysis, we predict novel PDZ domain involvement in xenobiotic metabolism and suggest new interactions for other processes including wound healing and Wnt signalling. Conclusions We built a structure-based predictor of PDZ domain-peptide interactions, which can be used to scan C-terminal proteomes for PDZ interactions. We also show that the structure-based predictor finds many known PDZ mediated PPIs in human that were not found by our previous sequence-based predictor and is less dependent on training–testing domain sequence similarity. Using both predictors, we defined a functional map of human PDZ domain biology and predict novel PDZ domain function. Users may access our structure-based and previous sequence-based predictors at http://webservice.baderlab.org/domains/POW. PMID:23336252
Genomewide Analysis of the Antimicrobial Peptides in Python bivittatus and Characterization of Cathelicidins with Potent Antimicrobial Activity and Low Cytotoxicity.

PubMed

Kim, Dayeong; Soundrarajan, Nagasundarapandian; Lee, Juyeon; Cho, Hye-Sun; Choi, Minkyeung; Cha, Se-Yeoun; Ahn, Byeongyong; Jeon, Hyoim; Le, Minh Thong; Song, Hyuk; Kim, Jin-Hoi; Park, Chankyu

2017-09-01

In this study, we sought to identify novel antimicrobial peptides (AMPs) in Python bivittatus through bioinformatic analyses of publicly available genome information and experimental validation. In our analysis of the python genome, we identified 29 AMP-related candidate sequences. Of these, we selected five cathelicidin-like sequences and subjected them to further in silico analyses. The results showed that these sequences likely have antimicrobial activity. The sequences were named Pb-CATH1 to Pb-CATH5 according to their sequence similarity to previously reported snake cathelicidins. We predicted their molecular structure and then chemically synthesized the mature peptide for three putative cathelicidins and subjected them to biological activity tests. Interestingly, all three peptides showed potent antimicrobial effects against Gram-negative bacteria but very weak activity against Gram-positive bacteria. Remarkably, ΔPb-CATH4 showed potent activity against antibiotic-resistant clinical isolates and also was observed to possess very low hemolytic activity and cytotoxicity. ΔPb-CATH4 also showed considerable serum stability. Electron microscopic analysis indicated that ΔPb-CATH4 exerts its effects via toroidal pore preformation. Structural comparison of the cathelicidins identified in this study to previously reported ones revealed that these Pb-CATHs are representatives of a new group of reptilian cathelicidins lacking the acidic connecting domain. Furthermore, Pb-CATH4 possesses a completely different mature peptide sequence from those of previously described reptilian cathelicidins. These new AMPs may be candidates for the development of alternatives to or complements of antibiotics to control multidrug-resistant pathogens. Copyright © 2017 American Society for Microbiology.
Genomewide Analysis of the Antimicrobial Peptides in Python bivittatus and Characterization of Cathelicidins with Potent Antimicrobial Activity and Low Cytotoxicity

PubMed Central

Kim, Dayeong; Soundrarajan, Nagasundarapandian; Lee, Juyeon; Cho, Hye-sun; Choi, Minkyeung; Cha, Se-Yeoun; Ahn, Byeongyong; Jeon, Hyoim; Le, Minh Thong; Song, Hyuk; Kim, Jin-Hoi

2017-01-01

ABSTRACT In this study, we sought to identify novel antimicrobial peptides (AMPs) in Python bivittatus through bioinformatic analyses of publicly available genome information and experimental validation. In our analysis of the python genome, we identified 29 AMP-related candidate sequences. Of these, we selected five cathelicidin-like sequences and subjected them to further in silico analyses. The results showed that these sequences likely have antimicrobial activity. The sequences were named Pb-CATH1 to Pb-CATH5 according to their sequence similarity to previously reported snake cathelicidins. We predicted their molecular structure and then chemically synthesized the mature peptide for three putative cathelicidins and subjected them to biological activity tests. Interestingly, all three peptides showed potent antimicrobial effects against Gram-negative bacteria but very weak activity against Gram-positive bacteria. Remarkably, ΔPb-CATH4 showed potent activity against antibiotic-resistant clinical isolates and also was observed to possess very low hemolytic activity and cytotoxicity. ΔPb-CATH4 also showed considerable serum stability. Electron microscopic analysis indicated that ΔPb-CATH4 exerts its effects via toroidal pore preformation. Structural comparison of the cathelicidins identified in this study to previously reported ones revealed that these Pb-CATHs are representatives of a new group of reptilian cathelicidins lacking the acidic connecting domain. Furthermore, Pb-CATH4 possesses a completely different mature peptide sequence from those of previously described reptilian cathelicidins. These new AMPs may be candidates for the development of alternatives to or complements of antibiotics to control multidrug-resistant pathogens. PMID:28630199
Proteomic Identification of Monoclonal Antibodies from Serum

PubMed Central

2015-01-01

Characterizing the in vivo dynamics of the polyclonal antibody repertoire in serum, such as that which might arise in response to stimulation with an antigen, is difficult due to the presence of many highly similar immunoglobulin proteins, each specified by distinct B lymphocytes. These challenges have precluded the use of conventional mass spectrometry for antibody identification based on peptide mass spectral matches to a genomic reference database. Recently, progress has been made using bottom-up analysis of serum antibodies by nanoflow liquid chromatography/high-resolution tandem mass spectrometry combined with a sample-specific antibody sequence database generated by high-throughput sequencing of individual B cell immunoglobulin variable domains (V genes). Here, we describe how intrinsic features of antibody primary structure, most notably the interspersed segments of variable and conserved amino acid sequences, generate recurring patterns in the corresponding peptide mass spectra of V gene peptides, greatly complicating the assignment of correct sequences to mass spectral data. We show that the standard method of decoy-based error modeling fails to account for the error introduced by these highly similar sequences, leading to a significant underestimation of the false discovery rate. Because of these effects, antibody-derived peptide mass spectra require increased stringency in their interpretation. The use of filters based on the mean precursor ion mass accuracy of peptide-spectrum matches is shown to be particularly effective in distinguishing between “true” and “false” identifications. These findings highlight important caveats associated with the use of standard database search and error-modeling methods with nonstandard data sets and custom sequence databases. PMID:24684310
Practical and Efficient Searching in Proteomics: A Cross Engine Comparison

PubMed Central

Paulo, Joao A.

2014-01-01

Background Analysis of large datasets produced by mass spectrometry-based proteomics relies on database search algorithms to sequence peptides and identify proteins. Several such scoring methods are available, each based on different statistical foundations and thereby not producing identical results. Here, the aim is to compare peptide and protein identifications using multiple search engines and examine the additional proteins gained by increasing the number of technical replicate analyses. Methods A HeLa whole cell lysate was analyzed on an Orbitrap mass spectrometer for 10 technical replicates. The data were combined and searched using Mascot, SEQUEST, and Andromeda. Comparisons were made of peptide and protein identifications among the search engines. In addition, searches using each engine were performed with incrementing number of technical replicates. Results The number and identity of peptides and proteins differed across search engines. For all three search engines, the differences in proteins identifications were greater than the differences in peptide identifications indicating that the major source of the disparity may be at the protein inference grouping level. The data also revealed that analysis of 2 technical replicates can increase protein identifications by up to 10-15%, while a third replicate results in an additional 4-5%. Conclusions The data emphasize two practical methods of increasing the robustness of mass spectrometry data analysis. The data show that 1) using multiple search engines can expand the number of identified proteins (union) and validate protein identifications (intersection), and 2) analysis of 2 or 3 technical replicates can substantially expand protein identifications. Moreover, information can be extracted from a dataset by performing database searching with different engines and performing technical repeats, which requires no additional sample preparation and effectively utilizes research time and effort. PMID:25346847
Analysis of Glycoproteins in Human Serum by Means of Glycospecific Magnetic Bead Separation and LC-MALDI-TOF/TOF Analysis with Automated Glycopeptide Detection

PubMed Central

Sparbier, Katrin; Asperger, Arndt; Resemann, Anja; Kessler, Irina; Koch, Sonja; Wenzel, Thomas; Stein, Günter; Vorwerg, Lars; Suckau, Detlev; Kostrzewa, Markus

2007-01-01

Comprehensive proteomic analyses require efficient and selective pre-fractionation to facilitate analysis of post-translationally modified peptides and proteins, and automated analysis workflows enabling the detection, identification, and structural characterization of the corresponding peptide modifications. Human serum contains a high number of glycoproteins, comprising several orders of magnitude in concentration. Thereby, isolation and subsequent identification of low-abundant glycoproteins from serum is a challenging task. selective capturing of glycopeptides and -proteins was attained by means of magnetic particles specifically functionalized with lectins or boronic acids that bind to various structural motifs. Human serum was incubated with differentially functionalized magnetic micro-particles (lectins or boronic acids), and isolated proteins were digested with trypsin. Subsequently, the resulting complex mixture of peptides and glycopeptides was subjected to LC-MALDI analysis and database searching. In parallel, a second magnetic bead capturing was performed on the peptide level to separate and analyze by LC-MALDI intact glycopeptides, both peptide sequence and glycan structure. Detection of glycopeptides was achieved by means of a software algorithm that allows extraction and characterization of potential glycopeptide candidates from large LC-MALDI-MS/MS data sets, based on N-glycopeptide-specific fragmentation patterns and characteristic fragment mass peaks, respectively. By means of fast and simple glycospecific capturing applied in conjunction with extensive LC-MALDI-MS/MS analysis and novel data analysis tools, a high number of low-abundant proteins were identified, comprising known or predicted glycosylation sites. According to the specific binding preferences of the different types of beads, complementary results were obtained from the experiments using either magnetic ConA-, LCA-, WGA-, and boronic acid beads, respectively. PMID:17916798
Practical and Efficient Searching in Proteomics: A Cross Engine Comparison.

PubMed

Paulo, Joao A

2013-10-01

Analysis of large datasets produced by mass spectrometry-based proteomics relies on database search algorithms to sequence peptides and identify proteins. Several such scoring methods are available, each based on different statistical foundations and thereby not producing identical results. Here, the aim is to compare peptide and protein identifications using multiple search engines and examine the additional proteins gained by increasing the number of technical replicate analyses. A HeLa whole cell lysate was analyzed on an Orbitrap mass spectrometer for 10 technical replicates. The data were combined and searched using Mascot, SEQUEST, and Andromeda. Comparisons were made of peptide and protein identifications among the search engines. In addition, searches using each engine were performed with incrementing number of technical replicates. The number and identity of peptides and proteins differed across search engines. For all three search engines, the differences in proteins identifications were greater than the differences in peptide identifications indicating that the major source of the disparity may be at the protein inference grouping level. The data also revealed that analysis of 2 technical replicates can increase protein identifications by up to 10-15%, while a third replicate results in an additional 4-5%. The data emphasize two practical methods of increasing the robustness of mass spectrometry data analysis. The data show that 1) using multiple search engines can expand the number of identified proteins (union) and validate protein identifications (intersection), and 2) analysis of 2 or 3 technical replicates can substantially expand protein identifications. Moreover, information can be extracted from a dataset by performing database searching with different engines and performing technical repeats, which requires no additional sample preparation and effectively utilizes research time and effort.
New Potent Membrane-Targeting Antibacterial Peptides from Viral Capsid Proteins

PubMed Central

Dias, Susana A.; Freire, João M.; Pérez-Peinado, Clara; Domingues, Marco M.; Gaspar, Diana; Vale, Nuno; Gomes, Paula; Andreu, David; Henriques, Sónia T.; Castanho, Miguel A. R. B.; Veiga, Ana S.

2017-01-01

The increasing prevalence of multidrug-resistant bacteria urges the development of new antibacterial agents. With a broad spectrum activity, antimicrobial peptides have been considered potential antibacterial drug leads. Using bioinformatic tools we have previously shown that viral structural proteins are a rich source for new bioactive peptide sequences, namely antimicrobial and cell-penetrating peptides. Here, we test the efficacy and mechanism of action of the most promising peptides among those previously identified against both Gram-positive and Gram-negative bacteria. Two cell-penetrating peptides, vCPP 0769 and vCPP 2319, have high antibacterial activity against Staphylococcus aureus, MRSA, Escherichia coli, and Pseudomonas aeruginosa, being thus multifunctional. The antibacterial mechanism of action of the two most active viral protein-derived peptides, vAMP 059 and vCPP 2319, was studied in detail. Both peptides act on both Gram-positive S. aureus and Gram-negative P. aeruginosa, with bacterial cell death occurring within minutes. Also, these peptides cause bacterial membrane permeabilization and damage of the bacterial envelope of P. aeruginosa cells. Overall, the results show that structural viral proteins are an abundant source for membrane-active peptides sequences with strong antibacterial properties. PMID:28522994
T7 lytic phage-displayed peptide libraries exhibit less sequence bias than M13 filamentous phage-displayed peptide libraries.

PubMed

Krumpe, Lauren R H; Atkinson, Andrew J; Smythers, Gary W; Kandel, Andrea; Schumacher, Kathryn M; McMahon, James B; Makowski, Lee; Mori, Toshiyuki

2006-08-01

We investigated whether the T7 system of phage display could produce peptide libraries of greater diversity than the M13 system of phage display due to the differing processes of lytic and filamentous phage morphogenesis. Using a bioinformatics-assisted computational approach, collections of random peptide sequences obtained from a T7 12-mer library (X(12)) and a T7 7-mer disulfide-constrained library (CX(7)C) were analyzed and compared with peptide populations obtained from New England BioLabs' M13 Ph.D.-12 and Ph.D.-C7C libraries. Based on this analysis, peptide libraries constructed with the T7 system have fewer amino acid biases, increased peptide diversity, and more normal distributions of peptide net charge and hydropathy than the M13 libraries. The greater diversity of T7-displayed libraries provides a potential resource of novel binding peptides for new as well as previously studied molecular targets. To demonstrate their utility, several of the T7-displayed peptide libraries were screened for streptavidin- and neutravidin-binding phage. Novel binding motifs were identified for each protein.
Comment on "Protein sequences from mastodon and Tyrannosaurus rex revealed by mass spectrometry".

PubMed

Buckley, Mike; Walker, Angela; Ho, Simon Y W; Yang, Yue; Smith, Colin; Ashton, Peter; Oates, Jane Thomas; Cappellini, Enrico; Koon, Hannah; Penkman, Kirsty; Elsworth, Ben; Ashford, Dave; Solazzo, Caroline; Andrews, Phillip; Strahler, John; Shapiro, Beth; Ostrom, Peggy; Gandhi, Hasand; Miller, Webb; Raney, Brian; Zylber, Maria Ines; Gilbert, M Thomas P; Prigodich, Richard V; Ryan, Michael; Rijsdijk, Kenneth F; Janoo, Anwar; Collins, Matthew J

2008-01-04

We used authentication tests developed for ancient DNA to evaluate claims by Asara et al. (Reports, 13 April 2007, p. 280) of collagen peptide sequences recovered from mastodon and Tyrannosaurus rex fossils. Although the mastodon samples pass these tests, absence of amino acid composition data, lack of evidence for peptide deamidation, and association of alpha1(I) collagen sequences with amphibians rather than birds suggest that T. rex does not.
Identification of Cell Adhesive Sequences in the N-terminal Region of the Laminin α2 Chain*

PubMed Central

Hozumi, Kentaro; Ishikawa, Masaya; Hayashi, Takemitsu; Yamada, Yuji; Katagiri, Fumihiko; Kikkawa, Yamato; Nomizu, Motoyoshi

2012-01-01

The laminin α2 chain is specifically expressed in the basement membrane surrounding muscle and nerve. We screened biologically active sequences in the mouse laminin N-terminal region of α2 chain using 216 soluble peptides and three recombinant proteins (rec-a2LN, rec-a2LN+, and rec-a2N) by both the peptide- or protein-coated plate and the peptide-conjugated Sepharose bead assays. Ten peptides showed cell attachment activity in the plate assay, and 8 peptides were active in the bead assay. Seven peptides were active in the both assays. Five peptides promoted neurite outgrowth with PC12 cells. To clarify the cellular receptors, we examined the effects of heparin and EDTA on cell attachment to 11 active peptides. Heparin inhibited cell attachment to 10 peptides, and EDTA significantly affected only A2-8 peptide (YHYVTITLDLQQ, mouse laminin α2 chain, 117–128)-mediated cell attachment. Cell attachment to A2-8 was also specifically inhibited by anti-integrin β1 and anti-integrin α2β1 antibodies. These results suggest that A2-8 promotes an integrin α2β1-mediated cell attachment. The rec-a2LN protein, containing the A2-8 sequence, bound to integrin α2β1 and cell attachment to rec-a2LN was inhibited by A2-8 peptide. Further, alanine substitution analysis of both the A2-8 peptide and the rec-a2LN+ protein revealed that the amino acids Ile-122, Leu-124, and Asp-125 were involved in integrin α2β1-mediated cell attachment, suggesting that the A2-8 site plays a functional role as an integrin α2β1 binding site in the LN module. These active peptides may provide new insights on the molecular mechanism of laminin-receptor interactions. PMID:22654118
Bioinspired second harmonic generation

NASA Astrophysics Data System (ADS)

Sonay, Ali Y.; Pantazis, Periklis

2017-07-01

Second harmonic generation (SHG) is a microscopic technique applicable to a broad spectrum of biological and medical imaging due to its excellent photostability, high signal-to-noise ratio (SNR) and narrow emission profile. Current SHG microscopy techniques rely on two main contrast modalities. These are endogenous SHG generated by tissue structures, which is clinically relevant but cannot be targeted to another location, or SHG nanoprobes, inorganic nanocrystals that can be directed to proteins and cells of interest, but cannot be applied for clinical imaging due to their chemical composition. Here we analyzed SHG signal generated by large-scale peptide assemblies. Our results show the sequence of peptides play an important role on both the morphology and SHG signal of the peptide assemblies. Changing peptide sequence allows confinement of large number of peptides to smaller voxels, generating intense SHG signal. With miniaturization of these peptides and their proper functionalization strategies, such bioinspired nanoparticles would emerge as valuable tools for clinical imaging.
Identification and characterization of mutant clones with enhanced propagation rates from phage-displayed peptide libraries.

PubMed

Nguyen, Kieu T H; Adamkiewicz, Marta A; Hebert, Lauren E; Zygiel, Emily M; Boyle, Holly R; Martone, Christina M; Meléndez-Ríos, Carola B; Noren, Karen A; Noren, Christopher J; Hall, Marilena Fitzsimons

2014-10-01

A target-unrelated peptide (TUP) can arise in phage display selection experiments as a result of a propagation advantage exhibited by the phage clone displaying the peptide. We previously characterized HAIYPRH, from the M13-based Ph.D.-7 phage display library, as a propagation-related TUP resulting from a G→A mutation in the Shine-Dalgarno sequence of gene II. This mutant was shown to propagate in Escherichia coli at a dramatically faster rate than phage bearing the wild-type Shine-Dalgarno sequence. We now report 27 additional fast-propagating clones displaying 24 different peptides and carrying 14 unique mutations. Most of these mutations are found either in or upstream of the gene II Shine-Dalgarno sequence, but still within the mRNA transcript of gene II. All 27 clones propagate at significantly higher rates than normal library phage, most within experimental error of wild-type M13 propagation, suggesting that mutations arise to compensate for the reduced virulence caused by the insertion of a lacZα cassette proximal to the replication origin of the phage used to construct the library. We also describe an efficient and convenient assay to diagnose propagation-related TUPS among peptide sequences selected by phage display. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Applying the Concept of Peptide Uniqueness to Anti-Polio Vaccination

PubMed Central

Kanduc, Darja; Fasano, Candida; Capone, Giovanni; Pesce Delfino, Antonella; Calabrò, Michele; Polimeno, Lorenzo

2015-01-01

Background. Although rare, adverse events may associate with anti-poliovirus vaccination thus possibly hampering global polio eradication worldwide. Objective. To design peptide-based anti-polio vaccines exempt from potential cross-reactivity risks and possibly able to reduce rare potential adverse events such as the postvaccine paralytic poliomyelitis due to the tendency of the poliovirus genome to mutate. Methods. Proteins from poliovirus type 1, strain Mahoney, were analyzed for amino acid sequence identity to the human proteome at the pentapeptide level, searching for sequences that (1) have zero percent of identity to human proteins, (2) are potentially endowed with an immunologic potential, and (3) are highly conserved among poliovirus strains. Results. Sequence analyses produced a set of consensus epitopic peptides potentially able to generate specific anti-polio immune responses exempt from cross-reactivity with the human host. Conclusion. Peptide sequences unique to poliovirus proteins and conserved among polio strains might help formulate a specific and universal anti-polio vaccine able to react with multiple viral strains and exempt from the burden of possible cross-reactions with human proteins. As an additional advantage, using a peptide-based vaccine instead of current anti-polio DNA vaccines would eliminate the rare post-polio poliomyelitis cases and other disabling symptoms that may appear following vaccination. PMID:26568962

Automated detection of inaccurate and imprecise transitions in peptide quantification by multiple reaction monitoring mass spectrometry.

PubMed

Abbatiello, Susan E; Mani, D R; Keshishian, Hasmik; Carr, Steven A

2010-02-01

Multiple reaction monitoring mass spectrometry (MRM-MS) of peptides with stable isotope-labeled internal standards (SISs) is increasingly being used to develop quantitative assays for proteins in complex biological matrices. These assays can be highly precise and quantitative, but the frequent occurrence of interferences requires that MRM-MS data be manually reviewed, a time-intensive process subject to human error. We developed an algorithm that identifies inaccurate transition data based on the presence of interfering signal or inconsistent recovery among replicate samples. The algorithm objectively evaluates MRM-MS data with 2 orthogonal approaches. First, it compares the relative product ion intensities of the analyte peptide to those of the SIS peptide and uses a t-test to determine if they are significantly different. A CV is then calculated from the ratio of the analyte peak area to the SIS peak area from the sample replicates. The algorithm identified problematic transitions and achieved accuracies of 94%-100%, with a sensitivity and specificity of 83%-100% for correct identification of errant transitions. The algorithm was robust when challenged with multiple types of interferences and problematic transitions. This algorithm for automated detection of inaccurate and imprecise transitions (AuDIT) in MRM-MS data reduces the time required for manual and subjective inspection of data, improves the overall accuracy of data analysis, and is easily implemented into the standard data-analysis work flow. AuDIT currently works with results exported from MRM-MS data-processing software packages and may be implemented as an analysis tool within such software.
Automated Detection of Inaccurate and Imprecise Transitions in Peptide Quantification by Multiple Reaction Monitoring Mass Spectrometry

PubMed Central

Abbatiello, Susan E.; Mani, D. R.; Keshishian, Hasmik; Carr, Steven A.

2010-01-01

BACKGROUND Multiple reaction monitoring mass spectrometry (MRM-MS) of peptides with stable isotope–labeled internal standards (SISs) is increasingly being used to develop quantitative assays for proteins in complex biological matrices. These assays can be highly precise and quantitative, but the frequent occurrence of interferences requires that MRM-MS data be manually reviewed, a time-intensive process subject to human error. We developed an algorithm that identifies inaccurate transition data based on the presence of interfering signal or inconsistent recovery among replicate samples. METHODS The algorithm objectively evaluates MRM-MS data with 2 orthogonal approaches. First, it compares the relative product ion intensities of the analyte peptide to those of the SIS peptide and uses a t-test to determine if they are significantly different. A CV is then calculated from the ratio of the analyte peak area to the SIS peak area from the sample replicates. RESULTS The algorithm identified problematic transitions and achieved accuracies of 94%–100%, with a sensitivity and specificity of 83%–100% for correct identification of errant transitions. The algorithm was robust when challenged with multiple types of interferences and problematic transitions. CONCLUSIONS This algorithm for automated detection of inaccurate and imprecise transitions (AuDIT) in MRM-MS data reduces the time required for manual and subjective inspection of data, improves the overall accuracy of data analysis, and is easily implemented into the standard data-analysis work flow. AuDIT currently works with results exported from MRM-MS data-processing software packages and may be implemented as an analysis tool within such software. PMID:20022980
freeQuant: A Mass Spectrometry Label-Free Quantification Software Tool for Complex Proteome Analysis.

PubMed

Deng, Ning; Li, Zhenye; Pan, Chao; Duan, Huilong

2015-01-01

Study of complex proteome brings forward higher request for the quantification method using mass spectrometry technology. In this paper, we present a mass spectrometry label-free quantification tool for complex proteomes, called freeQuant, which integrated quantification with functional analysis effectively. freeQuant consists of two well-integrated modules: label-free quantification and functional analysis with biomedical knowledge. freeQuant supports label-free quantitative analysis which makes full use of tandem mass spectrometry (MS/MS) spectral count, protein sequence length, shared peptides, and ion intensity. It adopts spectral count for quantitative analysis and builds a new method for shared peptides to accurately evaluate abundance of isoforms. For proteins with low abundance, MS/MS total ion count coupled with spectral count is included to ensure accurate protein quantification. Furthermore, freeQuant supports the large-scale functional annotations for complex proteomes. Mitochondrial proteomes from the mouse heart, the mouse liver, and the human heart were used to evaluate the usability and performance of freeQuant. The evaluation showed that the quantitative algorithms implemented in freeQuant can improve accuracy of quantification with better dynamic range.
Effective modification of cell death-inducing intracellular peptides by means of a photo-cleavable peptide array-based screening system.

PubMed

Kozaki, Ikko; Shimizu, Kazunori; Honda, Hiroyuki

2017-08-01

Intracellular functional peptides that play a significant role inside cells have been receiving a lot of attention as regulators of cellular activity. Previously, we proposed a novel screening system for intracellular functional peptides; it combined a photo-cleavable peptide array system with cell-penetrating peptides (CPPs). Various peptides can be delivered into cells and intracellular functions of the peptides can be assayed by means of our system. The aim of the present study was to demonstrate that the proposed screening system can be used for assessing the intracellular activity of peptides. The cell death-inducing peptide (LNLISKLF) identified in a mitochondria-targeting domain (MTD) of the Noxa protein served as an original peptide sequence for screening of peptides with higher activity via modification of the peptide sequence. We obtained 4 peptides with higher activity, in which we substituted serine (S) at the fifth position with phenylalanine (F), valine (V), tryptophan (W), or tyrosine (Y). During analysis of the mechanism of action, the modified peptides induced an increase in intracellular calcium concentration, which was caused by the treatment with the original peptide. Higher capacity for cell death induction by the modified peptides may be caused by increased hydrophobicity or an increased number of aromatic residues. Thus, the present work suggests that the intracellular activity of peptides can be assessed using the proposed screening system. It could be used for identifying intracellular functional peptides with higher activity through comprehensive screening. Copyright © 2017 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
ClusterMine360: a database of microbial PKS/NRPS biosynthesis

PubMed Central

Conway, Kyle R.; Boddy, Christopher N.

2013-01-01

ClusterMine360 (http://www.clustermine360.ca/) is a database of microbial polyketide and non-ribosomal peptide gene clusters. It takes advantage of crowd-sourcing by allowing members of the community to make contributions while automation is used to help achieve high data consistency and quality. The database currently has >200 gene clusters from >185 compound families. It also features a unique sequence repository containing >10 000 polyketide synthase/non-ribosomal peptide synthetase domains. The sequences are filterable and downloadable as individual or multiple sequence FASTA files. We are confident that this database will be a useful resource for members of the polyketide synthases/non-ribosomal peptide synthetases research community, enabling them to keep up with the growing number of sequenced gene clusters and rapidly mine these clusters for functional information. PMID:23104377
The neXtProt peptide uniqueness checker: a tool for the proteomics community.

PubMed

Schaeffer, Mathieu; Gateau, Alain; Teixeira, Daniel; Michel, Pierre-André; Zahn-Zabal, Monique; Lane, Lydie

2017-11-01

The neXtProt peptide uniqueness checker allows scientists to define which peptides can be used to validate the existence of human proteins, i.e. map uniquely versus multiply to human protein sequences taking into account isobaric substitutions, alternative splicing and single amino acid variants. The pepx program is available at https://github.com/calipho-sib/pepx and can be launched from the command line or through a cgi web interface. Indexing requires a sequence file in FASTA format. The peptide uniqueness checker tool is freely available on the web at https://www.nextprot.org/tools/peptide-uniqueness-checker and from the neXtProt API at https://api.nextprot.org/. lydie.lane@sib.swiss. © The Author(s) 2017. Published by Oxford University Press.
Dynamics at a Peptide-TiO2 Anatase (101) Interface.

PubMed

Polimeni, Marco; Petridis, Loukas; Smith, Jeremy C; Arcangeli, Caterina

2017-09-28

The interface between biological matter and inorganic materials is a widely investigated research topic due to possible applications in biomedicine and nanotechnology. In this context, the molecular level adsorption mechanism that drives specific recognition between small peptide sequences and inorganic surfaces represents an important topic likely to provide much information useful for designing bioderived materials. Here, we investigate the dynamics at the interface between a Ti-binding peptide sequence (AMRKLPDAPGMHC) and a TiO 2 anatase surface by using molecular dynamics (MD) simulations. In the simulations the adsorption mechanism is characterized by diffusion of the peptide from the bulk water phase toward the TiO 2 surface, followed by the anchoring of the peptide to the surface. The anchoring is mediated by the interfacial water layers by means of the charged groups of the side chains of the peptide. The peptide samples anchored and dissociated states from the surface and its conformation is not affected by the surface when anchored.
Slowing down single-molecule trafficking through a protein nanopore reveals intermediates for peptide translocation

NASA Astrophysics Data System (ADS)

Mereuta, Loredana; Roy, Mahua; Asandei, Alina; Lee, Jong Kook; Park, Yoonkyung; Andricioaei, Ioan; Luchian, Tudor

2014-01-01

The microscopic details of how peptides translocate one at a time through nanopores are crucial determinants for transport through membrane pores and important in developing nano-technologies. To date, the translocation process has been too fast relative to the resolution of the single molecule techniques that sought to detect its milestones. Using pH-tuned single-molecule electrophysiology and molecular dynamics simulations, we demonstrate how peptide passage through the α-hemolysin protein can be sufficiently slowed down to observe intermediate single-peptide sub-states associated to distinct structural milestones along the pore, and how to control residence time, direction and the sequence of spatio-temporal state-to-state dynamics of a single peptide. Molecular dynamics simulations of peptide translocation reveal the time- dependent ordering of intermediate structures of the translocating peptide inside the pore at atomic resolution. Calculations of the expected current ratios of the different pore-blocking microstates and their time sequencing are in accord with the recorded current traces.
Sample limited characterization of a novel disulfide-rich venom peptide toxin from terebrid marine snail Terebra variegata.

PubMed

Anand, Prachi; Grigoryan, Alexandre; Bhuiyan, Mohammed H; Ueberheide, Beatrix; Russell, Victoria; Quinoñez, Jose; Moy, Patrick; Chait, Brian T; Poget, Sébastien F; Holford, Mandë

2014-01-01

Disulfide-rich peptide toxins found in the secretions of venomous organisms such as snakes, spiders, scorpions, leeches, and marine snails are highly efficient and effective tools for novel therapeutic drug development. Venom peptide toxins have been used extensively to characterize ion channels in the nervous system and platelet aggregation in haemostatic systems. A significant hurdle in characterizing disulfide-rich peptide toxins from venomous animals is obtaining significant quantities needed for sequence and structural analyses. Presented here is a strategy for the structural characterization of venom peptide toxins from sample limited (4 ng) specimens via direct mass spectrometry sequencing, chemical synthesis and NMR structure elucidation. Using this integrated approach, venom peptide Tv1 from Terebra variegata was discovered. Tv1 displays a unique fold not witnessed in prior snail neuropeptides. The novel structural features found for Tv1 suggest that the terebrid pool of peptide toxins may target different neuronal agents with varying specificities compared to previously characterized snail neuropeptides.
Mass spectrometric survey of peptides in cephalopods with an emphasis on the FMRFamide-related peptides.

PubMed

Sweedler, J V; Li, L; Floyd, P; Gilly, W

2000-12-01

A matrix-assisted laser desorption/ionization (MALDI) mass spectrometric (MS) survey of the major peptides in the stellar, fin and pallial nerves and the posterior chromatophore lobe of the cephalopods Sepia officinalis, Loligo opalescens and Dosidicus gigas has been performed. Although a large number of putative peptides are distinct among the three species, several molecular masses are conserved. In addition to peptides, characterization of the lipid content of the nerves is reported, and these lipid peaks account for many of the lower molecular masses observed. One conserved set of peaks corresponds to the FMRFamide-related peptides (FRPs). The Loligo opalescens FMRFa gene has been sequenced. It encodes a 331 amino acid residue prohormone that is processed into 14 FRPs, which are both predicted by the nucleotide sequence and confirmed by MALDI MS. The FRPs predicted by this gene (FMRFa, FLRFa/FIRFa and ALSGDAFLRFa) are observed in all three species, indicating that members of this peptide family are highly conserved across cephalopods.
Designing Anticancer Peptides by Constructive Machine Learning.

PubMed

Grisoni, Francesca; Neuhaus, Claudia S; Gabernet, Gisela; Müller, Alex T; Hiss, Jan A; Schneider, Gisbert

2018-04-21

Constructive (generative) machine learning enables the automated generation of novel chemical structures without the need for explicit molecular design rules. This study presents the experimental application of such a deep machine learning model to design membranolytic anticancer peptides (ACPs) de novo. A recurrent neural network with long short-term memory cells was trained on α-helical cationic amphipathic peptide sequences and then fine-tuned with 26 known ACPs by transfer learning. This optimized model was used to generate unique and novel amino acid sequences. Twelve of the peptides were synthesized and tested for their activity on MCF7 human breast adenocarcinoma cells and selectivity against human erythrocytes. Ten of these peptides were active against cancer cells. Six of the active peptides killed MCF7 cancer cells without affecting human erythrocytes with at least threefold selectivity. These results advocate constructive machine learning for the automated design of peptides with desired biological activities. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Cloning and heterologous expression of the antibiotic peptide (ABP) genes from Rhizopus oligosporus NBRC 8631.

PubMed

Yamada, Osamu; Sakamoto, Kazutoshi; Tominaga, Mihoko; Nakayama, Tasuku; Koseki, Takuya; Fujita, Akiko; Akita, Osamu

2005-03-01

We carried out protein sequencing of purified Antibiotic Peptide (ABP), and cloned two genes encoding this peptide as abp1 and abp2, from Rhizopus oligosporus NBRC 8631. Both genes contain an almost identical 231-bp segment, with only 3 nucleotide substitutions, encoding a 77 amino acid peptide. The abp gene product comprises a 28 amino acid signal sequence and a 49 amino acid mature peptide. Northern blot analysis showed that at least one of the abp genes is transcribed in R. oligosporus NBRC 8631. A truncated form of abp1 encoding only the mature peptide was fused with the alpha-factor signal peptide and engineered for expression in Pichia pastoris SMD1168H. Culture broth of the recombinant Pichia displayed ABP activity against Bacillus subtilis NBRC 3335 after induction of heterologous gene expression. This result indicates that mature ABP formed the active structure without the aid of other factors from R. oligosporus, and was secreted.
Functional characterization of a synthetic hydrophilic antifungal peptide derived from the marine snail Cenchritis muricatus.

PubMed

López-Abarrategui, Carlos; Alba, Annia; Silva, Osmar N; Reyes-Acosta, Osvaldo; Vasconcelos, Ilka M; Oliveira, Jose T A; Migliolo, Ludovico; Costa, Maysa P; Costa, Carolina R; Silva, Maria R R; Garay, Hilda E; Dias, Simoni C; Franco, Octávio L; Otero-González, Anselmo J

2012-04-01

Antimicrobial peptides have been found in mollusks and other sea animals. In this report, a crude extract of the marine snail Cenchritis muricatus was evaluated against human pathogens responsible for multiple deleterious effects and diseases. A peptide of 1485.26 Da was purified by reversed-phase HPLC and functionally characterized. This trypsinized peptide was sequenced by MS/MS technology, and a sequence (SRSELIVHQR), named Cm-p1 was recovered, chemically synthesized and functionally characterized. This peptide demonstrated the capacity to prevent the development of yeasts and filamentous fungi. Otherwise, Cm-p1 displayed no toxic effects against mammalian cells. Molecular modeling analyses showed that this peptide possible forms a single hydrophilic α-helix and the probable cationic residue involved in antifungal activity action is proposed. The data reported here demonstrate the importance of sea animals peptide discovery for biotechnological tools development that could be useful in solving human health and agribusiness problems. Copyright © 2011 Elsevier Masson SAS. All rights reserved.
Investigating the microstructure of keratin extracted from wool: peptide sequence (MALDI-TOF/TOF) and protein conformation (FTIR)

USDA-ARS?s Scientific Manuscript database

Keratin was extracted from wool by reduction with 2-mercaptoethanol. It was isolated as intact keratin and characterized by its similar molecular weight, protein composition, and secondary structure to native keratin. Gel electrophoresis patterns and MALDI-TOF/TOF peptide sequences provided the ide...
A complementarity-determining region synthetic peptide acts as a miniantibody and neutralizes human immunodeficiency virus type 1 in vitro.

PubMed Central

Levi, M; Sällberg, M; Rudén, U; Herlyn, D; Maruyama, H; Wigzell, H; Marks, J; Wahren, B

1993-01-01

A complementarity-determining region (CDR) of the mouse monoclonal antibody (mAb) F58 was constructed with specificity to a neutralization-inducing region of human immunodeficiency virus type 1 (HIV-1). The mAb has its major reactivity to the amino acid sequence I--GPGRA in the V3 viral envelope region. All CDRs including several framework amino acids were synthesized from the sequence deduced by cloning and sequencing mAb F58 heavy- and light-chain variable domains. Peptides derived from the third heavy-chain domain (CDR-H3) alone or in combination with the other CDR sequences competed with F58 mAb for the V3 region. The CDR-H3 peptide was chemically modified by cyclization and then inhibited HIV-1 replication as well as syncytium formation by infected cells. Both the homologous IIIB viral strain to which the F58 mAb was induced and the heterologous SF2 strain were inhibited. This synthetic peptide had unexpectedly potent antiviral activity and may be a potential tool for treatment of HIV-infected persons. PMID:7685100
Characterization of Histone H2A Derived Antimicrobial Peptides, Harriottins, from Sicklefin Chimaera Neoharriotta pinnata (Schnakenbeck, 1931) and Its Evolutionary Divergence with respect to CO1 and Histone H2A.

PubMed

Sathyan, Naveen; Philip, Rosamma; Chaithanya, E R; Anil Kumar, P R; Sanjeevan, V N; Singh, I S Bright

2013-01-01

Antimicrobial peptides (AMPs) are humoral innate immune components of fishes that provide protection against pathogenic infections. Histone derived antimicrobial peptides are reported to actively participate in the immune defenses of fishes. Present study deals with identification of putative antimicrobial sequences from the histone H2A of sicklefin chimaera, Neoharriotta pinnata. A 52 amino acid residue termed Harriottin-1, a 40 amino acid Harriottin-2, and a 21 mer Harriottin-3 were identified to possess antimicrobial sequence motif. Physicochemical properties and molecular structure of Harriottins are in agreement with the characteristic features of antimicrobial peptides, indicating its potential role in innate immunity of sicklefin chimaera. The histone H2A sequence of sicklefin chimera was found to differ from previously reported histone H2A sequences. Phylogenetic analysis based on histone H2A and cytochrome oxidase subunit-1 (CO1) gene revealed N. pinnata to occupy an intermediate position with respect to invertebrates and vertebrates.
Characterization of Histone H2A Derived Antimicrobial Peptides, Harriottins, from Sicklefin Chimaera Neoharriotta pinnata (Schnakenbeck, 1931) and Its Evolutionary Divergence with respect to CO1 and Histone H2A

PubMed Central

Sathyan, Naveen; Philip, Rosamma; Chaithanya, E. R.; Anil Kumar, P. R.; Sanjeevan, V. N.; Singh, I. S. Bright

2013-01-01

Antimicrobial peptides (AMPs) are humoral innate immune components of fishes that provide protection against pathogenic infections. Histone derived antimicrobial peptides are reported to actively participate in the immune defenses of fishes. Present study deals with identification of putative antimicrobial sequences from the histone H2A of sicklefin chimaera, Neoharriotta pinnata. A 52 amino acid residue termed Harriottin-1, a 40 amino acid Harriottin-2, and a 21 mer Harriottin-3 were identified to possess antimicrobial sequence motif. Physicochemical properties and molecular structure of Harriottins are in agreement with the characteristic features of antimicrobial peptides, indicating its potential role in innate immunity of sicklefin chimaera. The histone H2A sequence of sicklefin chimera was found to differ from previously reported histone H2A sequences. Phylogenetic analysis based on histone H2A and cytochrome oxidase subunit-1 (CO1) gene revealed N. pinnata to occupy an intermediate position with respect to invertebrates and vertebrates. PMID:27398241
Sequence of the structural gene for granule-bound starch synthase of potato (Solanum tuberosum L.) and evidence for a single point deletion in the amf allele.

PubMed

van der Leij, F R; Visser, R G; Ponstein, A S; Jacobsen, E; Feenstra, W J

1991-08-01

The genomic sequence of the potato gene for starch granule-bound starch synthase (GBSS; "waxy protein") has been determined for the wild-type allele of a monoploid genotype from which an amylose-free (amf) mutant was derived, and for the mutant part of the amf allele. Comparison of the wild-type sequence with a cDNA sequence from the literature and a newly isolated cDNA revealed the presence of 13 introns, the first of which is located in the untranslated leader. The promoter contains a G-box-like sequence. The deduced amino acid sequence of the precursor of GBSS shows a high degree of identity with monocot waxy protein sequences in the region corresponding to the mature form of the enzyme. The transit peptide of 77 amino acids, required for routing of the precursor to the plastids, shows much less identity with the transit peptides of the other waxy preproteins, but resembles the hydropathic distributions of these peptides. Alignment of the amino acid sequences of the four mature starch synthases with the Escherichia coli glgA gene product revealed the presence of at least three conserved boxes; there is no homology with previously proposed starch-binding domains of other enzymes involved in starch metabolism. We report the use of chimeric constructs with wild-type and amf sequences to localize, via complementation experiments, the region of the amf allele in which the mutation resides. Direct sequencing of polymerase chain reaction products confirmed that the amf mutation is a deletion of a single AT basepair in the region coding for the transit peptide.(ABSTRACT TRUNCATED AT 250 WORDS)
Diversity of Secondary Structure in Catalytic Peptides with β-Turn-Biased Sequences

PubMed Central

2016-01-01

X-ray crystallography has been applied to the structural analysis of a series of tetrapeptides that were previously assessed for catalytic activity in an atroposelective bromination reaction. Common to the series is a central Pro-Xaa sequence, where Pro is either l- or d-proline, which was chosen to favor nucleation of canonical β-turn secondary structures. Crystallographic analysis of 35 different peptide sequences revealed a range of conformational states. The observed differences appear not only in cases where the Pro-Xaa loop-region is altered, but also when seemingly subtle alterations to the flanking residues are introduced. In many instances, distinct conformers of the same sequence were observed, either as symmetry-independent molecules within the same unit cell or as polymorphs. Computational studies using DFT provided additional insight into the analysis of solid-state structural features. Select X-ray crystal structures were compared to the corresponding solution structures derived from measured proton chemical shifts, 3J-values, and 1H–1H-NOESY contacts. These findings imply that the conformational space available to simple peptide-based catalysts is more diverse than precedent might suggest. The direct observation of multiple ground state conformations for peptides of this family, as well as the dynamic processes associated with conformational equilibria, underscore not only the challenge of designing peptide-based catalysts, but also the difficulty in predicting their accessible transition states. These findings implicate the advantages of low-barrier interconversions between conformations of peptide-based catalysts for multistep, enantioselective reactions. PMID:28029251
Precursors of vertebrate peptide antibiotics dermaseptin b and adenoregulin have extensive sequence identities with precursors of opioid peptides dermorphin, dermenkephalin, and deltorphins.

PubMed

Amiche, M; Ducancel, F; Mor, A; Boulain, J C; Menez, A; Nicolas, P

1994-07-08

The dermaseptins are a family of broad spectrum antimicrobial peptides, 27-34 amino acids long, involved in the defense of the naked skin of frogs against microbial invasion. They are the first vertebrate peptides to show lethal effects against the filamentous fungi responsible for severe opportunistic infections accompanying immunodeficiency syndrome and the use of immunosuppressive agents. A cDNA library was constructed from skin poly(A+) RNA of the arboreal frog Phyllomedusa bicolor and screened with an oligonucleotide probe complementary to the COOH terminus of dermaseptin b. Several clones contained a full-length DNA copy of a 443-nucleotide mRNA that encoded a 78-residue dermaseptin b precursor protein. The deduced precursor contained a putative signal sequence at the NH2 terminus, a 20-residue spacer sequence extremely rich (60%) in glutamic and aspartic acids, and a single copy of a dermaseptin b progenitor sequence at the COOH terminus. One clone contained a complete copy of adenoregulin, a 33-residue peptide reported to enhance the binding of agonists to the A1 adenosine receptor. The mRNAs encoding adenoregulin and dermaseptin b were very similar: 70 and 75% nucleotide identities between the 5'- and 3'-untranslated regions, respectively; 91% amino acid identity between the signal peptides; 82% identity between the acidic spacer sequences; and 38% identity between adenoregulin and dermaseptin b. Because adenoregulin and dermaseptin b have similar precursor designs and antimicrobial spectra, adenoregulin should be considered as a new member of the dermaseptin family and alternatively named dermaseptin b II. Preprodermaseptin b and preproadenoregulin have considerable sequence identities to the precursors encoding the opioid heptapeptides dermorphin, dermenkephalin, and deltorphins. This similarity extended into the 5'-untranslated regions of the mRNAs. These findings suggest that the genes encoding the four preproproteins are all members of the same family despite the fact that they encode end products having very different biological activities. These genes might contain a homologous export exon comprising the 5'-untranslated region, the 22-residue signal peptide, the 20-24-residue acidic spacer, and the basic pair Lys-Arg.

De novo design of peptide immunogens that mimic the coiled coil region of human T-cell leukemia virus type-1 glycoprotein 21 transmembrane subunit for induction of native protein reactive neutralizing antibodies.

PubMed

Sundaram, Roshni; Lynch, Marcus P; Rawale, Sharad V; Sun, Yiping; Kazanji, Mirdad; Kaumaya, Pravin T P

2004-06-04

Peptide vaccines able to induce high affinity and protective neutralizing antibodies must rely in part on the design of antigenic epitopes that mimic the three-dimensional structure of the corresponding region in the native protein. We describe the design, structural characterization, immunogenicity, and neutralizing potential of antibodies elicited by conformational peptides derived from the human T-cell leukemia virus type 1 (HTLV-1) gp21 envelope glycoprotein spanning residues 347-374. We used a novel template design and a unique synthetic approach to construct two peptides (WCCR2T and CCR2T) that would each assemble into a triple helical coiled coil conformation mimicking the gp21 crystal structure. The peptide B-cell epitopes were grafted onto the epsilon side chains of three lysyl residues on a template backbone construct consisting of the sequence acetyl-XGKGKGKGCONH2 (where X represents the tetanus toxoid promiscuous T cell epitope (TT) sequence 580-599). Leucine substitutions were introduced at the a and d positions of the CCR2T sequence to maximize helical character and stability as shown by circular dichroism and guanidinium hydrochloride studies. Serum from an HTLV-1-infected patient was able to recognize the selected epitopes by enzyme-linked immunosorbent assay (ELISA). Mice immunized with the wild-type sequence (WCCR2T) and the mutant sequence (CCR2T) elicited high antibody titers that were capable of recognizing the native protein as shown by flow cytometry and whole virus ELISA. Sera and purified antibodies from immunized mice were able to reduce the formation of syncytia induced by the envelope glycoprotein of HTLV-1, suggesting that antibodies directed against the coiled coil region of gp21 are capable of disrupting cell-cell fusion. Our results indicate that these peptides represent potential candidates for use in a peptide vaccine against HTLV-1.
Peptides design based on transmembrane Escherichia coli's OmpA protein through molecular dynamics simulations in water-dodecane interfaces.

PubMed

Aguilera-Segura, Sonia M; Núñez Vélez, Vanessa; Achenie, Luke; Álvarez Solano, Oscar; Torres, Rodrigo; González Barrios, Andrés Fernando

2016-07-01

Recent research efforts have focused on the production of environmentally nonthreatening products, including identifying biosurfactants that can replace conventional surfactants. In order to utilize biosurfactants in different industries such as cosmetic, food or petroleum, it is necessary to understand the underpinnings behind the interactions that could take place for biosurfactants which display potential for interface activity. This work aimed to use molecular dynamics simulations to understand the interactions of rationally obtained peptide sequences from the original sequence of the OmpA gene in Escherichia coli, based on the free energy change (ΔG) during peptide insertion at the water-dodecane interface. Seventeen OmpA-based peptide sequences were selected and analyzed based on their hydropathy index profiles. We found that free energy change due to Columbic interactions and SASA (ΔGCoul/SASA), total free energy change and MW (ΔG/MW), and free energy change due to Coulombic and van der Waals interactions (ΔGCoul/ΔGvdW) ratios could provide a better understating in the contribution of the free energy decrease at the interface. The results indicated that the peptide sequences GKNHDTGVSPVFA and THENQLGAGAFG display biosurfactant potential based on low ΔG per square nanometer, high ΔGCoul/ΔGvdW ratio, clearly defined moieties along its hydrophobic surface and sequence, and the presence of charged residues in the polar head. Clearly defined moieties and SASA were determinant for electrostatic interactions between oil-water interfaces. Experimental validations exhibited that the emulsions prepared remained stable between 3 and 27h, respectively. Even though the peptide GKNHDTGVSPVFA displays strong interactions at the interface, stabilization times showed that the peptide THENQLGAGAFG exhibited the best performance suggesting that the stability can be better described by kinetic rather than thermodynamic criteria once the emulsion is formed. Copyright © 2016 Elsevier Inc. All rights reserved.
SeqCompress: an algorithm for biological sequence compression.

PubMed

Sardaraz, Muhammad; Tahir, Muhammad; Ikram, Ataul Aziz; Bajwa, Hassan

2014-10-01

The growth of Next Generation Sequencing technologies presents significant research challenges, specifically to design bioinformatics tools that handle massive amount of data efficiently. Biological sequence data storage cost has become a noticeable proportion of total cost in the generation and analysis. Particularly increase in DNA sequencing rate is significantly outstripping the rate of increase in disk storage capacity, which may go beyond the limit of storage capacity. It is essential to develop algorithms that handle large data sets via better memory management. This article presents a DNA sequence compression algorithm SeqCompress that copes with the space complexity of biological sequences. The algorithm is based on lossless data compression and uses statistical model as well as arithmetic coding to compress DNA sequences. The proposed algorithm is compared with recent specialized compression tools for biological sequences. Experimental results show that proposed algorithm has better compression gain as compared to other existing algorithms. Copyright © 2014 Elsevier Inc. All rights reserved.
The adsorption of preferential binding peptides to apatite-based materials

PubMed Central

Segvich, Sharon J.; Smith, Hayes C.; Kohn, David H.

2009-01-01

The objective of this work was to identify peptide sequences with high affinity to bone-like mineral (BLM) to provide alternative design methods for functional bone regeneration peptides. Adsorption of preferential binding peptide sequences on four apatite-based substrates [BLM and three sintered apatite disks pressed from powders containing 0% CO32− (HA), 5.6% CO32− (CA5), 10.5% CO32− (CA10)] with varied compositions and morphologies was investigated. A combination of phage display, ELISA, and computational modeling was used to elucidate three 12-mer peptide sequences APWHLSSQYSRT (A), STLPI-PHEFSRE (S), and VTKHLNQISQSY (V), from 243 candidates with preferential adsorption on BLM and HA. Overall, peptides S and V have a significantly higher adsorption to the apatite-based materials in comparison to peptide A (for S vs. A, BLM p = 0.001, CA5 p < 0.001, CA10 p < 0.001, HA p = 0.038; for V vs. A, BLM p = 0.006, CA5 p = 0.033, CA10 p = 0.029). FT-IR analysis displayed carbonate levels in CA5 and CA10 dropped to approximately 1.1–2.2% after sintering, whereas SEM imaging displayed CA5 and CA10 possess distinct morphologies. Adsorption results normalized to surface area indicate that small changes in carbonate percentage at a similar morphological scale did not provide enough carbonate incorporation to show statistical differences in peptide adsorption. Because the identified peptides (S and V) have preferential binding to apatite, their use can now be investigated in bone and dentin tissue engineering, tendon and ligament repair, and enamel formation. PMID:19095299
Flanking signal and mature peptide residues influence signal peptide cleavage

PubMed Central

Choo, Khar Heng; Ranganathan, Shoba

2008-01-01

Background Signal peptides (SPs) mediate the targeting of secretory precursor proteins to the correct subcellular compartments in prokaryotes and eukaryotes. Identifying these transient peptides is crucial to the medical, food and beverage and biotechnology industries yet our understanding of these peptides remains limited. This paper examines the most common type of signal peptides cleavable by the endoprotease signal peptidase I (SPase I), and the residues flanking the cleavage sites of three groups of signal peptide sequences, namely (i) eukaryotes (Euk) (ii) Gram-positive (Gram+) bacteria, and (iii) Gram-negative (Gram-) bacteria. Results In this study, 2352 secretory peptide sequences from a variety of organisms with amino-terminal SPs are extracted from the manually curated SPdb database for analysis based on physicochemical properties such as pI, aliphatic index, GRAVY score, hydrophobicity, net charge and position-specific residue preferences. Our findings show that the three groups share several similarities in general, but they display distinctive features upon examination in terms of their amino acid compositions and frequencies, and various physico-chemical properties. Thus, analysis or prediction of their sequences should be separated and treated as distinct groups. Conclusion We conclude that the peptide segment recognized by SPase I extends to the start of the mature protein to a limited extent, upon our survey of the amino acid residues surrounding the cleavage processing site. These flanking residues possibly influence the cleavage processing and contribute to non-canonical cleavage sites. Our findings are applicable in defining more accurate prediction tools for recognition and identification of cleavage site of SPs. PMID:19091014
Reduction of Blood Pressure by AT1 Receptor Decoy Peptides.

PubMed

Re, Richard N; Chen, Ben; Alam, Jawed; Cook, Julia L

2013-01-01

We previously identified the binding of the chaperone protein gamma-aminobutyric acid receptor-associated protein (GABARAP) to a sequence on the carboxy-terminus of the angiotensin II AT1 receptor (AT1R) and showed that this binding enhances AT1R trafficking to the cell surface as well as angiotensin signaling. In this study, we treated sodium-depleted mice with decoy peptides consisting either of a fusion of the cell-penetrating peptide penetratin and the GABARAP/AT1R binding sequence or penetratin fused to a mutated AT1R sequence. We used telemetry to measure blood pressure. Systolic and diastolic pressure fell during the 24 hours following decoy peptide injection but not after control peptide injection. Active cell-penetrating decoy peptide decreased 24-hour average systolic blood pressure from 129.8 ± 4.7 mmHg to 125.0 ± 6.0 mmHg (mean ± standard deviation). Diastolic blood pressure fell from 99.0 ± 7.1 mmHg to 95.0 ± 9.2 mmHg (n=5). Administration of the control peptide raised systolic blood pressure from 128.7 ± 1.3 mmHg to 131.7 ± 2.9 mmHg and diastolic pressure from 93.9 ± 4.5 mmHg to 95.9 ± 4.2 mmHg (n=5). The decreases in both systolic and diastolic blood pressure after active peptide administration were statistically significant compared to control peptide administration (P<0.05, two-tailed Wilcoxon rank-sum test). These results indicate the physiological and potentially therapeutic relevance of inhibitors of GABARAP/AT1R binding.
Shotgun protein sequencing: assembly of peptide tandem mass spectra from mixtures of modified proteins.

PubMed

Bandeira, Nuno; Clauser, Karl R; Pevzner, Pavel A

2007-07-01

Despite significant advances in the identification of known proteins, the analysis of unknown proteins by MS/MS still remains a challenging open problem. Although Klaus Biemann recognized the potential of MS/MS for sequencing of unknown proteins in the 1980s, low throughput Edman degradation followed by cloning still remains the main method to sequence unknown proteins. The automated interpretation of MS/MS spectra has been limited by a focus on individual spectra and has not capitalized on the information contained in spectra of overlapping peptides. Indeed the powerful shotgun DNA sequencing strategies have not been extended to automated protein sequencing. We demonstrate, for the first time, the feasibility of automated shotgun protein sequencing of protein mixtures by utilizing MS/MS spectra of overlapping and possibly modified peptides generated via multiple proteases of different specificities. We validate this approach by generating highly accurate de novo reconstructions of multiple regions of various proteins in western diamondback rattlesnake venom. We further argue that shotgun protein sequencing has the potential to overcome the limitations of current protein sequencing approaches and thus catalyze the otherwise impractical applications of proteomics methodologies in studies of unknown proteins.
Identification of a preferred substrate peptide for transglutaminase 3 and detection of in situ activity in skin and hair follicles.

PubMed

Yamane, Asaka; Fukui, Mina; Sugimura, Yoshiaki; Itoh, Miho; Alea, Mileidys Perez; Thomas, Vincent; El Alaoui, Said; Akiyama, Masashi; Hitomi, Kiyotaka

2010-09-01

Transglutaminases (TGases) are a family of enzymes that catalyze cross-linking reactions between proteins. During epidermal differentiation, these enzymatic reactions are essential for formation of the cornified envelope, which consists of cross-linked structural proteins. Two main transglutaminases isoforms, epidermal-type (TGase 3) and keratinocyte-type (TGase 1), are cooperatively involved in this process of differentiating keratinocytes. Information regarding their substrate preference is of great importance to determine the functional role of these isozymes and clarify their possible co-operative action. Thus far, we have identified highly reactive peptide sequences specifically recognized by TGases isozymes such as TGase 1, TGase 2 (tissue-type isozyme) and the blood coagulation isozyme, Factor XIII. In this study, several substrate peptide sequences for human TGase 3 were screened from a phage-displayed peptide library. The preferred substrate sequences for TGase 3 were selected and evaluated as fusion proteins with mutated glutathione S-transferase. From these studies, a highly reactive and isozyme-specific sequence (E51) was identified. Furthermore, this sequence was found to be a prominent substrate in the peptide form and was suitable for detection of in situ TGase 3 activity in the mouse epidermis. TGase 3 enzymatic activity was detected in the layers of differentiating keratinocytes and hair follicles with patterns distinct from those of TGase 1. Our findings provide new information on the specific distribution of TGase 3 and constitute a useful tool to clarify its functional role in the epidermis.
Characterization, production, and purification of leucocin H, a two-peptide bacteriocin from Leuconostoc MF215B.

PubMed

Blom, H; Katla, T; Holck, A; Sletten, K; Axelsson, L; Holo, H

1999-07-01

Leuconostoc MF215B was found to produce a two-peptide bacteriocin referred to as leucocin H. The two peptides were termed leucocin Halpha and leucocin Hbeta. When acting together, they inhibit, among others, Listeria monocytogenes, Bacillus cereus, and Clostridium perfringens. Production of leucocin H in growth medium takes place at temperatures down to 6 degrees C and at pH below 7. The highest activity of leucocin H in growth medium was demonstrated in the late exponential growth phase. The bacteriocin was purified by precipitation with ammonium sulfate, ion-exchange (SP Sepharose) and reverse phase chromatography. Upon purification, specific activity increased 10(5)-fold, and the final specific activity was 2 x 10(7) BU/OD280. Amino acid composition analyses of leucocin Halpha and leucocin Hbeta indicated that both peptides consisted of around 40 amino acid residues. Their N-termini were blocked for Edman degradation, and the methionin residues of leucocin Hbeta did not respond to Cyanogen Bromide (CNBr) cleavage. Absorbance at 280 nm indicated the presence of tryptophan residues and tryptophan-fracturing opened for partial sequencing by Edman degradation. From leucocin Halpha, the sequence of 20 amino acids was obtained; from leucocin Hbeta the sequence of 28 amino acid residues was obtained. No sequence homology to other known bacteriocins could be demonstrated. It also appeared that the two peptides themselves shared little or no sequence homology. The presence of soy oil did not affect the activity of leucocin H in agar.
Array-Based Rational Design of Short Peptide Probe-Derived from an Anti-TNT Monoclonal Antibody.

PubMed

Okochi, Mina; Muto, Masaki; Yanai, Kentaro; Tanaka, Masayoshi; Onodera, Takeshi; Wang, Jin; Ueda, Hiroshi; Toko, Kiyoshi

2017-10-09

Complementarity-determining regions (CDRs) are sites on the variable chains of antibodies responsible for binding to specific antigens. In this study, a short peptide probe for recognition of 2,4,6-trinitrotoluene (TNT), was identified by testing sequences derived from the CDRs of an anti-TNT monoclonal antibody. The major TNT-binding site in this antibody was identified in the heavy chain CDR3 by antigen docking simulation and confirmed by an immunoassay using a spot-synthesis based peptide array comprising amino acid sequences of six CDRs in the variable region. A peptide derived from heavy chain CDR3 (RGYSSFIYWF) bound to TNT with a dissociation constant of 1.3 μM measured by surface plasmon resonance. Substitution of selected amino acids with basic residues increased TNT binding while substitution with acidic amino acids decreased affinity, an isoleucine to arginine change showed the greatest improvement of 1.8-fold. The ability to create simple peptide binders of volatile organic compounds from sequence information provided by the immune system in the creation of an immune response will be beneficial for sensor developments in the future.
Molecular cloning of a cDNA encoding the precursor of adenoregulin from frog skin. Relationships with the vertebrate defensive peptides, dermaseptins.

PubMed

Amiche, M; Ducancel, F; Lajeunesse, E; Boulain, J C; Ménez, A; Nicolas, P

1993-03-31

Adenoregulin has recently been isolated from Phyllomedusa skin as a 33 amino acid residues peptide which enhanced binding of agonists to the A1 adenosine receptor. In order to study the structure of the precursor of adenoregulin we constructed a cDNA library from mRNAs extracted from the skin of Phyllomedusa bicolor. We detected the complete nucleotide sequence of a cDNA encoding the adenoregulin biosynthetic precursor. The deduced sequence of the precursor is 81 amino acids long, exhibits a putative signal sequence at the NH2 terminus and contains a single copy of the biologically active peptide at the COOH terminus. Structural and conformational homologies that are observed between adenoregulin and the dermaseptins, antimicrobial peptides exhibiting strong membranolytic activities against various pathogenic agents, suggest that adenoregulin is an additional member of the growing family of cytotropic antimicrobial peptides that allow vertebrate animals to defend themselves against microorganisms. As such, the adenosine receptor regulating activity of adenoregulin could be due to its ability to interact with and disrupt membranes lipid bilayers.
Characterizing the Specificity and Co-operation of Aminopeptidases in the Cytosol and ER During MHC Class I antigen Presentation1

PubMed Central

Hearn, Arron; York, Ian A.; Bishop, Courtney; Rock, Kenneth L.

2010-01-01

Many MHC class I binding peptides are generated as N-extended precursors during protein degradation by the proteasome. These peptides can be subsequently trimmed by aminopeptidases in the cytosol and/or the ER to produce mature epitope. However, the contribution and specificity of each of these subcellular compartments in removing N-terminal amino acids for antigen presentation is not well defined. Here we investigate this issue for antigenic precursors that are expressed in the cytosol. By systematically varying the N-terminal flanking sequences of peptides we show that the amino acids upstream of an epitope precursor are a major determinant of the amount of antigen presentation. In many cases MHC class I binding peptides are produced through sequential trimming in both the cytosol and ER. Trimming of flanking residues in the cytosol contributes most to sequences that are poorly trimmed in the ER. Since N-terminal trimming has different specificity in the cytosol and ER, the cleavage of peptides in both of these compartments serves to broaden the repertoire of sequences that are presented. PMID:20351195
Photoaffinity Labeling of Ras Converting Enzyme using Peptide Substrates that Incorporate Benzoylphenylalanine (Bpa) Residues: Improved Labeling and Structural Implications

PubMed Central

Kyro, Kelly; Manandhar, Surya P.; Mullen, Daniel; Schmidt, Walter K.; Distefano, Mark D.

2012-01-01

Rce1p catalyzes the proteolytic trimming of C-terminal tripeptides from isoprenylated proteins containing CAAX-box sequences. Because Rce1p processing is a necessary component in the Ras pathway of oncogenic signal transduction, Rce1p holds promise as a potential target for therapeutic intervention. However, its mechanism of proteolysis and active site have yet to be defined. Here, we describe synthetic peptide analogues that mimic the natural lipidated Rce1p substrate and incorporate photolabile groups for photoaffinity-labeling applications. These photoactive peptides are designed to crosslink to residues in or near the Rce1p active site. By incorporating the photoactive group via p-benzoyl-L-phenylalanine (Bpa) residues directly into the peptide substrate sequence, the labeling efficiency was substantially increased relative to a previously-synthesized compound. Incorporation of biotin on the N-terminus of the peptides permitted photolabeled Rce1p to be isolated via streptavidin affinity capture. Our findings further suggest that residues outside the CAAX-box sequence are in contact with Rce1p, which has implications for future inhibitor design. PMID:22079863
Biosynthesis and processing of the somatostatin family of peptide hormones.

PubMed

Andrews, P C; Dixon, J E

1986-01-01

Understanding of the biosynthesis of the somatostatin family of peptide hormones has greatly increased in recent years. Isolation and sequencing of the rat somatostatin gene indicates that it contains a single intron located between the codons for Gn(-57) and Glu(-56) of pre-prosomatostatin. The gene contains three repetitive sequences, one at the 5' end of the gene and two of them 3' to the coding portion. Two of the sequences consist of alternating purine-pyrimidine bases and have been shown to adopt Z-DNA structures in vitro. The cDNA for rat somatostatin codes for a 116-residue peptide structurally similar to the anglerfish and catfish precursors to the 14-residue somatostatin (SST-14). In addition to SST-14, the catfish and the anglerfish both contain an additional pancreatic somatostatin, each derived from a different gene. The catfish contains a 22-residue somatostatin, which is O-glycosylated at Thr-5. The second somatostatin gene from anglerfish encodes a prosomatostatin that is processed to a 28-residue peptide. The mature peptide contains a hydroxylated lysine at position 23.
Controlling the Surface Chemistry of Graphite by Engineered Self-Assembled Peptides

PubMed Central

Khatayevich, Dmitriy; So, Christopher R.; Hayamizu, Yuhei; Gresswell, Carolyn; Sarikaya, Mehmet

2012-01-01

The systematic control over surface chemistry is a long-standing challenge in biomedical and nanotechnological applications for graphitic materials. As a novel approach, we utilize graphite-binding dodecapeptides that self-assemble into dense domains to form monolayer thick long-range ordered films on graphite. Specifically, the peptides are rationally designed through their amino acid sequences to predictably display hydrophilic and hydrophobic characteristics while maintaining their self-assembly capabilities on the solid substrate. The peptides are observed to maintain a high tolerance for sequence modification, allowing the control over surface chemistry via their amino acid sequence. Furthermore, through a single step co-assembly of two different designed peptides, we predictably and precisely tune the wettability of the resulting functionalized graphite surfaces from 44 to 83 degrees. The modular molecular structures and predictable behavior of short peptides demonstrated here give rise to a novel platform for functionalizing graphitic materials that offers numerous advantages, including non-invasive modification of the substrate, bio-compatible processing in an aqueous environment, and simple fusion with other functional biological molecules. PMID:22428620
RAPID AND AUTOMATED PROCESSING OF MALDI-FTICR/MS DATA FOR N-METABOLIC LABELING IN A SHOTGUN PROTEOMICS ANALYSIS.

PubMed

Jing, Li; Amster, I Jonathan

2009-10-15

Offline high performance liquid chromatography combined with matrix assisted laser desorption and Fourier transform ion cyclotron resonance mass spectrometry (HPLC-MALDI-FTICR/MS) provides the means to rapidly analyze complex mixtures of peptides, such as those produced by proteolytic digestion of a proteome. This method is particularly useful for making quantitative measurements of changes in protein expression by using (15)N-metabolic labeling. Proteolytic digestion of combined labeled and unlabeled proteomes produces complex mixtures that with many mass overlaps when analyzed by HPLC-MALDI-FTICR/MS. A significant challenge to data analysis is the matching of pairs of peaks which represent an unlabeled peptide and its labeled counterpart. We have developed an algorithm and incorporated it into a compute program which significantly accelerates the interpretation of (15)N metabolic labeling data by automating the process of identifying unlabeled/labeled peak pairs. The algorithm takes advantage of the high resolution and mass accuracy of FTICR mass spectrometry. The algorithm is shown to be able to successfully identify the (15)N/(14)N peptide pairs and calculate peptide relative abundance ratios in highly complex mixtures from the proteolytic digest of a whole organism protein extract.
Isolation, Purification and Molecular Mechanism of a Peanut Protein-Derived ACE-Inhibitory Peptide

PubMed Central

Shi, Aimin; Liu, Hongzhi; Liu, Li; Hu, Hui; Wang, Qiang; Adhikari, Benu

2014-01-01

Although a number of bioactive peptides are capable of angiotensin I-converting enzyme (ACE) inhibitory effects, little is known regarding the mechanism of peanut peptides using molecular simulation. The aim of this study was to obtain ACE inhibiting peptide from peanut protein and provide insight on the molecular mechanism of its ACE inhibiting action. Peanut peptides having ACE inhibitory activity were isolated through enzymatic hydrolysis and ultrafiltration. Further chromatographic fractionation was conducted to isolate a more potent peanut peptide and its antihypertensive activity was analyzed through in vitro ACE inhibitory tests and in vivo animal experiments. MALDI-TOF/TOF-MS was used to identify its amino acid sequence. Mechanism of ACE inhibition of P8 was analyzed using molecular docking and molecular dynamics simulation. A peanut peptide (P8) having Lys-Leu-Tyr-Met-Arg-Pro amino acid sequence was obtained which had the highest ACE inhibiting activity of 85.77% (half maximal inhibitory concentration (IC50): 0.0052 mg/ml). This peanut peptide is a competitive inhibitor and show significant short term (12 h) and long term (28 days) antihypertensive activity. Dynamic tests illustrated that P8 can be successfully docked into the active pocket of ACE and can be combined with several amino acid residues. Hydrogen bond, electrostatic bond and Pi-bond were found to be the three main interaction contributing to the structural stability of ACE-peptide complex. In addition, zinc atom could form metal-carboxylic coordination bond with Tyr, Met residues of P8, resulting into its high ACE inhibiting activity. Our finding indicated that the peanut peptide (P8) having a Lys-Leu-Tyr-Met-Arg-Pro amino acid sequence can be a promising candidate for functional foods and prescription drug aimed at control of hypertension. PMID:25347076
Analysis of Endogenous D-Amino Acid-Containing Peptides in Metazoa

PubMed Central

Bai, Lu; Sheeley, Sarah; Sweedler, Jonathan V.

2010-01-01

Peptides are chiral molecules with their structure determined by the composition and configuration of their amino acid building blocks. The naturally occurring amino acids, except glycine, possess two chiral forms. This allows the formation of multiple peptide diastereomers that have the same sequence. Although living organisms use L-amino acids to make proteins, a group of D-amino acid-containing peptides (DAACPs) has been discovered in animals that have at least one of their residues isomerized to the D-form via an enzyme-catalyzed process. In many cases, the biological functions of these peptides are enhanced due to this structural conversion. These DAACPs are different from those known to occur in bacterial cell wall and antibiotic peptides, the latter of which are synthesized in a ribosome-independent manner. DAACPs have now also been identified in a number of distinct groups throughout the Metazoa. Their serendipitous discovery has often resulted from discrepancies observed in bioassays or in chromatographic behavior between natural peptide fractions and peptides synthesized according to a presumed all-L sequence. Because this L-to-D post-translational modification is subtle and not detectable by most sequence determination approaches, it is reasonable to suspect that many studies have overlooked this change; accordingly, DAACPs may be more prevalent than currently thought. Although diastereomer separation techniques developed with synthetic peptides in recent years have greatly aided in the discovery of natural DAACPs, there is a need for new, more robust methods for naturally complex samples. In this review, a brief history of DAACPs in animals is presented, followed by discussion of a variety of analytical methods that have been used for diastereomeric separation and detection of peptides. PMID:20490347
Genome-Wide Prediction and Validation of Peptides That Bind Human Prosurvival Bcl-2 Proteins

PubMed Central

DeBartolo, Joe; Taipale, Mikko; Keating, Amy E.

2014-01-01

Programmed cell death is regulated by interactions between pro-apoptotic and prosurvival members of the Bcl-2 family. Pro-apoptotic family members contain a weakly conserved BH3 motif that can adopt an alpha-helical structure and bind to a groove on prosurvival partners Bcl-xL, Bcl-w, Bcl-2, Mcl-1 and Bfl-1. Peptides corresponding to roughly 13 reported BH3 motifs have been verified to bind in this manner. Due to their short lengths and low sequence conservation, BH3 motifs are not detected using standard sequence-based bioinformatics approaches. Thus, it is possible that many additional proteins harbor BH3-like sequences that can mediate interactions with the Bcl-2 family. In this work, we used structure-based and data-based Bcl-2 interaction models to find new BH3-like peptides in the human proteome. We used peptide SPOT arrays to test candidate peptides for interaction with one or more of the prosurvival proteins Bcl-xL, Bcl-w, Bcl-2, Mcl-1 and Bfl-1. For the 36 most promising array candidates, we quantified binding to all five human receptors using direct and competition binding assays in solution. All 36 peptides showed evidence of interaction with at least one prosurvival protein, and 22 peptides bound at least one prosurvival protein with a dissociation constant between 1 and 500 nM; many peptides had specificity profiles not previously observed. We also screened the full-length parent proteins of a subset of array-tested peptides for binding to Bcl-xL and Mcl-1. Finally, we used the peptide binding data, in conjunction with previously reported interactions, to assess the affinity and specificity prediction performance of different models. PMID:24967846
DOE Office of Scientific and Technical Information (OSTI.GOV)

Fox, J.W.; Elzinga, M.; Tu, A.T.

The primary structure of myotoxin a, a myotoxin protein from the venom of the North American rattlesnake Crotalus viridis viridis, was determined and the position of the disulfide bonds assigned. The toxin was isolated, carboxymethylated, and cleaved by cyanogen bromide, and the resultant peptides were isolated. The cyanogen bromide peptides were subjected to amino acid sequence analysis. In order to assign the positions of the three disulfide bonds, the native toxin was cleaved sequentially with cyanogen bromide and trypsin. A two peptide unit connected by one disulfide bond was isolated and characterized, and a three-peptide unit connected by two disulfidemore » bonds was isolated. One peptide in the three-peptide unit was identified as Cys-Cys-Lys. In order to establish the linkages between the peptides and Cys-Cys-Lys, one cycle of Edman degradation was carried out such that the Cys-Cys bond was cleaved. Upon isolation and analysis of the cleavage products, the disulfide bonds connecting the three peptides were determined. The positions of the disulfide bridges of myotoxin a were determined to be totally different from those of neurotoxins isolated from snake venoms. The sequence of myotoxin a was compared with the sequences of other snake venom toxins using the computer program RELATE to determine whether myotoxin a is similar to any other types of toxins. From the computer analysis, myotoxin a did not show any close relationship to other toxins except crotamine from the South American rattlesnake Crotalus durissus terrificus.« less

Some links on this page may take you to non-federal websites. Their policies may differ from this site.