Proteome of Caulobacter crescentus cell cycle publicly accessible on SWICZ server.
Vohradsky, Jiri; Janda, Ivan; Grünenfelder, Björn; Berndt, Peter; Röder, Daniel; Langen, Hanno; Weiser, Jaroslav; Jenal, Urs
2003-10-01
Here we present the Swiss-Czech Proteomics Server (SWICZ), which hosts the proteomic database summarizing information about the cell cycle of the aquatic bacterium Caulobacter crescentus. The database provides a searchable tool for easy access of global protein synthesis and protein stability data as examined during the C. crescentus cell cycle. Protein synthesis data collected from five different cell cycle stages were determined for each protein spot as a relative value of the total amount of [(35)S]methionine incorporation. Protein stability of pulse-labeled extracts were measured during a chase period equivalent to one cell cycle unit. Quantitative information for individual proteins together with descriptive data such as protein identities, apparent molecular masses and isoelectric points, were combined with information on protein function, genomic context, and the cell cycle stage, and were then assembled in a relational database with a world wide web interface (http://proteom.biomed.cas.cz), which allows the database records to be searched and displays the recovered information. A total of 1250 protein spots were reproducibly detected on two-dimensional gel electropherograms, 295 of which were identified by mass spectroscopy. The database is accessible either through clickable two-dimensional gel electrophoretic maps or by means of a set of dedicated search engines. Basic characterization of the experimental procedures, data processing, and a comprehensive description of the web site are presented. In its current state, the SWICZ proteome database provides a platform for the incorporation of new data emerging from extended functional studies on the C. crescentus proteome.
PACSY, a relational database management system for protein structure and chemical shift analysis.
Lee, Woonghee; Yu, Wookyung; Kim, Suhkmann; Chang, Iksoo; Lee, Weontae; Markley, John L
2012-10-01
PACSY (Protein structure And Chemical Shift NMR spectroscopY) is a relational database management system that integrates information from the Protein Data Bank, the Biological Magnetic Resonance Data Bank, and the Structural Classification of Proteins database. PACSY provides three-dimensional coordinates and chemical shifts of atoms along with derived information such as torsion angles, solvent accessible surface areas, and hydrophobicity scales. PACSY consists of six relational table types linked to one another for coherence by key identification numbers. Database queries are enabled by advanced search functions supported by an RDBMS server such as MySQL or PostgreSQL. PACSY enables users to search for combinations of information from different database sources in support of their research. Two software packages, PACSY Maker for database creation and PACSY Analyzer for database analysis, are available from http://pacsy.nmrfam.wisc.edu.
Real-time ligand binding pocket database search using local surface descriptors.
Chikhi, Rayan; Sael, Lee; Kihara, Daisuke
2010-07-01
Because of the increasing number of structures of unknown function accumulated by ongoing structural genomics projects, there is an urgent need for computational methods for characterizing protein tertiary structures. As functions of many of these proteins are not easily predicted by conventional sequence database searches, a legitimate strategy is to utilize structure information in function characterization. Of particular interest is prediction of ligand binding to a protein, as ligand molecule recognition is a major part of molecular function of proteins. Predicting whether a ligand molecule binds a protein is a complex problem due to the physical nature of protein-ligand interactions and the flexibility of both binding sites and ligand molecules. However, geometric and physicochemical complementarity is observed between the ligand and its binding site in many cases. Therefore, ligand molecules which bind to a local surface site in a protein can be predicted by finding similar local pockets of known binding ligands in the structure database. Here, we present two representations of ligand binding pockets and utilize them for ligand binding prediction by pocket shape comparison. These representations are based on mapping of surface properties of binding pockets, which are compactly described either by the two-dimensional pseudo-Zernike moments or the three-dimensional Zernike descriptors. These compact representations allow a fast real-time pocket searching against a database. Thorough benchmark studies employing two different datasets show that our representations are competitive with the other existing methods. Limitations and potentials of the shape-based methods as well as possible improvements are discussed.
PACSY, a relational database management system for protein structure and chemical shift analysis
Lee, Woonghee; Yu, Wookyung; Kim, Suhkmann; Chang, Iksoo
2012-01-01
PACSY (Protein structure And Chemical Shift NMR spectroscopY) is a relational database management system that integrates information from the Protein Data Bank, the Biological Magnetic Resonance Data Bank, and the Structural Classification of Proteins database. PACSY provides three-dimensional coordinates and chemical shifts of atoms along with derived information such as torsion angles, solvent accessible surface areas, and hydrophobicity scales. PACSY consists of six relational table types linked to one another for coherence by key identification numbers. Database queries are enabled by advanced search functions supported by an RDBMS server such as MySQL or PostgreSQL. PACSY enables users to search for combinations of information from different database sources in support of their research. Two software packages, PACSY Maker for database creation and PACSY Analyzer for database analysis, are available from http://pacsy.nmrfam.wisc.edu. PMID:22903636
Wong, Diane K.; Lee, Bai-Yu; Horwitz, Marcus A.; Gibson, Bradford W.
1999-01-01
Iron plays a critical role in the pathophysiology of Mycobacterium tuberculosis. To gain a better understanding of iron regulation by this organism, we have used two-dimensional (2-D) gel electrophoresis, mass spectrometry, and database searching to study protein expression in M. tuberculosis under conditions of high and low iron concentration. Proteins in cellular extracts from M. tuberculosis Erdman strain grown under low-iron (1 μM) and high-iron (70 μM) conditions were separated by 2-D polyacrylamide gel electrophoresis, which allowed high-resolution separation of several hundred proteins, as visualized by Coomassie staining. The expression of at least 15 proteins was induced, and the expression of at least 12 proteins was decreased under low-iron conditions. In-gel trypsin digestion was performed on these differentially expressed proteins, and the digestion mixtures were analyzed by matrix-assisted laser desorption ionization time-of-flight mass spectrometry to determine the molecular masses of the resulting tryptic peptides. Partial sequence data on some of the peptides were obtained by using after source decay and/or collision-induced dissociation. The fragmentation data were used to search computerized peptide mass and protein sequence databases for known proteins. Ten iron-regulated proteins were identified, including Fur and aconitase proteins, both of which are known to be regulated by iron in other bacterial systems. Our study shows that, where large protein sequence databases are available from genomic studies, the combined use of 2-D gel electrophoresis, mass spectrometry, and database searching to analyze proteins expressed under defined environmental conditions is a powerful tool for identifying expressed proteins and their physiologic relevance. PMID:9864233
Gutiérrez-Sánchez, Gerardo; Atwood, James; Kolli, V S Kumar; Roussos, Sévastianos; Augur, Christopher
2012-04-01
Caffeine is toxic to most microorganisms. However, some filamentous fungi, such as Aspergillus tamarii, are able to metabolize this alkaloid when fed caffeine as the sole nitrogen source. The aim of the present work was to identify intracellular A. tamarii proteins, regulated by caffeine, using fluorescence difference two-dimensional gel electrophoresis. Specific proteins from two culture media of A. tamarii grown either on ammonium sulfate or caffeine as the sole nitrogen source were analysed by mass spectrometry. Thirteen out of a total of 85 differentially expressed spots were identified after database search. Identified up-regulated proteins include phosphoglycerate kinase, malate dehydrogenase, dyp-type peroxidase family protein, heat shock protein, Cu, Zn superoxidase dismutase and xanthine dehydrogenase. Some of the proteins identified in this study are involved in the caffeine degradation pathway as well as in stress response, suggesting that stress proteins could be involved in caffeine metabolism in filamentous fungi.
Rice proteome database: a step toward functional analysis of the rice genome.
Komatsu, Setsuko
2005-09-01
The technique of proteome analysis using two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) has the power to monitor global changes that occur in the protein complement of tissues and subcellular compartments. In this study, the proteins of rice were cataloged, a rice proteome database was constructed, and a functional characterization of some of the identified proteins was undertaken. Proteins extracted from various tissues and subcellular compartments in rice were separated by 2D-PAGE and an image analyzer was used to construct a display of the proteins. The Rice Proteome Database contains 23 reference maps based on 2D-PAGE of proteins from various rice tissues and subcellular compartments. These reference maps comprise 13129 identified proteins, and the amino acid sequences of 5092 proteins are entered in the database. Major proteins involved in growth or stress responses were identified using the proteome approach. Some of these proteins, including a beta-tubulin, calreticulin, and ribulose-1,5-bisphosphate carboxylase/oxygenase activase in rice, have unexpected functions. The information obtained from the Rice Proteome Database will aid in cloning the genes for and predicting the function of unknown proteins.
The Protein Disease Database of human body fluids: II. Computer methods and data issues.
Lemkin, P F; Orr, G A; Goldstein, M P; Creed, G J; Myrick, J E; Merril, C R
1995-01-01
The Protein Disease Database (PDD) is a relational database of proteins and diseases. With this database it is possible to screen for quantitative protein abnormalities associated with disease states. These quantitative relationships use data drawn from the peer-reviewed biomedical literature. Assays may also include those observed in high-resolution electrophoretic gels that offer the potential to quantitate many proteins in a single test as well as data gathered by enzymatic or immunologic assays. We are using the Internet World Wide Web (WWW) and the Web browser paradigm as an access method for wide distribution and querying of the Protein Disease Database. The WWW hypertext transfer protocol and its Common Gateway Interface make it possible to build powerful graphical user interfaces that can support easy-to-use data retrieval using query specification forms or images. The details of these interactions are totally transparent to the users of these forms. Using a client-server SQL relational database, user query access, initial data entry and database maintenance are all performed over the Internet with a Web browser. We discuss the underlying design issues, mapping mechanisms and assumptions that we used in constructing the system, data entry, access to the database server, security, and synthesis of derived two-dimensional gel image maps and hypertext documents resulting from SQL database searches.
Lu, Y; Qi, Y X; Zhang, H; Zhang, H Q; Pu, J J; Xie, Y X
2013-12-19
To establish a proteomic reference map of Musa acuminate Colla (banana) leaf, we separated and identified leaf proteins using two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) and mass spectrometry (MS). Tryptic digests of 44 spots were subjected to peptide mass fingerprinting (PMF) by matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) MS. Three spots that were not identified by MALDI-TOF MS analysis were identified by searching against the NCBInr, SwissProt, and expressed sequence tag (EST) databases. We identified 41 unique proteins. The majority of the identified leaf proteins were found to be involved in energy metabolism. The results indicate that 2D-PAGE is a sensitive and powerful technique for the separation and identification of Musa leaf proteins. A summary of the identified proteins and their putative functions is discussed.
Tyan, Yu-Chang; Wu, Hsin-Yi; Lai, Wu-Wei; Su, Wu-Chou; Liao, Pao-Chi
2005-01-01
Pleural effusion, an accumulation of pleural fluid, contains proteins originated from plasma filtrate and, especially when tissues are damaged, parenchyma interstitial spaces of lungs and/or other organs. This study details protein profiles in human pleural effusion from 43 lung adenocarcinoma patients by a two-dimensional nano-high performance liquid chromatography electrospray ionization tandem mass spectrometry (2D nano-HPLC-ESI-MS/MS) system. The experimental results revealed the identification of 1415 unique proteins from human pleural effusion. Among these 124 proteins identified with higher confidence levels, some proteins have not been reported in plasma and may represent proteins specifically present in pleural effusion. These proteins are valuable for mass identification of differentially expressed proteins involved in proteomics database and screening biomarker to further study in human lung adenocarcinoma. The significance of the use of proteomics analysis of human pleural fluid for the search of new lung cancer marker proteins, and for their simultaneous display and analysis in patients suffering from lung disorders has been examined.
Guais, Olivier; Borderies, Gisèle; Pichereaux, Carole; Maestracci, Marc; Neugnot, Virginie; Rossignol, Michel; François, Jean Marie
2008-12-01
MS/MS techniques are well customized now for proteomic analysis, even for non-sequenced organisms, since peptide sequences obtained by these methods can be matched with those found in databases from closely related sequenced organisms. We used this approach to characterize the protein content of the "Rovabio Excel", an enzymatic cocktail produced by Penicillium funiculosum that is used as feed additive in animal nutrition. Protein separation by bi-dimensional electrophoresis yielded more than 100 spots, from which 37 proteins were unambiguously assigned from peptide sequences. By one-dimensional SDS-gel electrophoresis, 34 proteins were identified among which 8 were not found in the 2-DE analysis. A third method, termed 'peptidic shotgun', which consists in a direct treatment of the cocktail by trypsin followed by separation of the peptides on two-dimensional liquid chromatography, resulted in the identification of two additional proteins not found by the two other methods. Altogether, more than 50 proteins, among which several glycosylhydrolytic, hemicellulolytic and proteolytic enzymes, were identified by combining three separation methods in this enzymatic cocktail. This work confirmed the power of proteome analysis to explore the genome expression of a non-sequenced fungus by taking advantage of sequences from phylogenetically related filamentous fungi and pave the way for further functional analysis of P. funiculosum.
Proteome reference map and regulation network of neonatal rat cardiomyocyte
Li, Zi-jian; Liu, Ning; Han, Qi-de; Zhang, You-yi
2011-01-01
Aim: To study and establish a proteome reference map and regulation network of neonatal rat cardiomyocyte. Methods: Cultured cardiomyocytes of neonatal rats were used. All proteins expressed in the cardiomyocytes were separated and identified by two-dimensional polyacrylamide gel electrophoresis (2-DE) and matrix-assisted laser desorption/ionization-time of flight mass spectrometry (MALDI-TOF MS). Biological networks and pathways of the neonatal rat cardiomyocytes were analyzed using the Ingenuity Pathway Analysis (IPA) program (www.ingenuity.com). A 2-DE database was made accessible on-line by Make2ddb package on a web server. Results: More than 1000 proteins were separated on 2D gels, and 148 proteins were identified. The identified proteins were used for the construction of an extensible markup language-based database. Biological networks and pathways were constructed to analyze the functions associate with cardiomyocyte proteins in the database. The 2-DE database of rat cardiomyocyte proteins can be accessed at http://2d.bjmu.edu.cn. Conclusion: A proteome reference map and regulation network of the neonatal rat cardiomyocytes have been established, which may serve as an international platform for storage, analysis and visualization of cardiomyocyte proteomic data. PMID:21841810
Real-Time Ligand Binding Pocket Database Search Using Local Surface Descriptors
Chikhi, Rayan; Sael, Lee; Kihara, Daisuke
2010-01-01
Due to the increasing number of structures of unknown function accumulated by ongoing structural genomics projects, there is an urgent need for computational methods for characterizing protein tertiary structures. As functions of many of these proteins are not easily predicted by conventional sequence database searches, a legitimate strategy is to utilize structure information in function characterization. Of a particular interest is prediction of ligand binding to a protein, as ligand molecule recognition is a major part of molecular function of proteins. Predicting whether a ligand molecule binds a protein is a complex problem due to the physical nature of protein-ligand interactions and the flexibility of both binding sites and ligand molecules. However, geometric and physicochemical complementarity is observed between the ligand and its binding site in many cases. Therefore, ligand molecules which bind to a local surface site in a protein can be predicted by finding similar local pockets of known binding ligands in the structure database. Here, we present two representations of ligand binding pockets and utilize them for ligand binding prediction by pocket shape comparison. These representations are based on mapping of surface properties of binding pockets, which are compactly described either by the two dimensional pseudo-Zernike moments or the 3D Zernike descriptors. These compact representations allow a fast real-time pocket searching against a database. Thorough benchmark study employing two different datasets show that our representations are competitive with the other existing methods. Limitations and potentials of the shape-based methods as well as possible improvements are discussed. PMID:20455259
MultitaskProtDB-II: an update of a database of multitasking/moonlighting proteins
Franco-Serrano, Luís; Hernández, Sergio; Calvo, Alejandra; Severi, María A; Ferragut, Gabriela; Pérez-Pons, JosepAntoni; Piñol, Jaume; Pich, Òscar; Mozo-Villarias, Ángel; Amela, Isaac
2018-01-01
Abstract Multitasking, or moonlighting, is the capability of some proteins to execute two or more biological functions. MultitaskProtDB-II is a database of multifunctional proteins that has been updated. In the previous version, the information contained was: NCBI and UniProt accession numbers, canonical and additional biological functions, organism, monomeric/oligomeric states, PDB codes and bibliographic references. In the present update, the number of entries has been increased from 288 to 694 moonlighting proteins. MultitaskProtDB-II is continually being curated and updated. The new database also contains the following information: GO descriptors for the canonical and moonlighting functions, three-dimensional structure (for those proteins lacking PDB structure, a model was made using Itasser and Phyre), the involvement of the proteins in human diseases (78% of human moonlighting proteins) and whether the protein is a target of a current drug (48% of human moonlighting proteins). These numbers highlight the importance of these proteins for the analysis and explanation of human diseases and target-directed drug design. Moreover, 25% of the proteins of the database are involved in virulence of pathogenic microorganisms, largely in the mechanism of adhesion to the host. This highlights their importance for the mechanism of microorganism infection and vaccine design. MultitaskProtDB-II is available at http://wallace.uab.es/multitaskII. PMID:29136215
Li, Bing; Sheng, Minjie; Xie, Liqi; Liu, Feng; Yan, Guoquan; Wang, Weifang; Lin, Anjuan; Zhao, Fei; Chen, Yihui
2014-01-09
Diabetes mellitus has been shown to be associated with and complicated by dry eye syndrome. We sought to examine and compare the tear film proteome of type 2 diabetic patients with or without dry eye syndrome and normal subjects using two-dimensional nano-liquid chromatography coupled with tandem mass spectrometry (MS)-based proteomics. Tears were collected from eight type 2 diabetes patients with dry eye syndrome, eight type 2 diabetes patients without dry eye syndrome, and eight normal subjects. Tear breakup time (BUT) was determined, and tear proteins were prepared and analyzed using two-dimensional strong cation-exchange/reversed-phase nano-scale liquid chromatography MS. All MS/MS spectra were identified by using SEQUEST against the human International Protein Index (IPI) database and the relative abundance of individual proteins was assessed by spectral counting. Tear BUT was significantly lower in patients with diabetes and dry eye syndrome than in patients with diabetes only and normal subjects. Analysis of spectral counts of tear proteins showed that, compared to healthy controls, patients with diabetes and dry eye syndrome had increased expression of apoptosis-related proteins, like annexin A1, and immunity- and inflammation-related proteins, including neutrophil elastase 2 and clusterin, and glycometabolism-related proteins, like apolipoprotein A-II. Dry eye syndrome in diabetic patients is associated with aberrant expression of tear proteins, and the findings could lead to identification of novel pathways for therapeutic targeting and new diagnostic markers.
Southan, Christopher; Cutler, Paul; Birrell, Helen; Connell, John; Fantom, Kenneth G M; Sims, Matthew; Shaikh, Narjis; Schneider, Klaus
2002-02-01
A proteomic study of rat urine was undertaken using two-dimensional gel electrophoresis, microbore high performance liquid chromatography, mass spectrometry and N-terminal sequencing. Five known urinary proteins were identified but two novel peptide fragments matched a large number of rat expressed sequence tags (ESTs) from a liver library. By combining protein chemical and nucleotide data, two 101-residue open reading frames with 90% amino acid identity were determined, rat urinary protein 1 (RUP-1) and RUP-2. The data established signal peptide removal and provided evidence for N-glycosylation. A third related sequence, rat spleen protein (RSP-1) was confirmed from EST searches. These three proteins have been submitted to SWISS-PROT as P81827, P81828 and Q9QXN2, respectively. A fourth novel homologue was found in porcine and bovine ESTs from embryo libraries. Alignment with known homologues showed conserved cysteine positions characteristic of a secreted subfamily of Ly-6 proteins. In two cases, antineoplastic urinary protein and caltrin, these homologues have unverified functional annotations. The RUP sequences showed high scoring matches to three unrelated rat mRNAs subsequently established to be chimeric. Two of these share extended sectional identity to RUP-1 but the third may represent another novel Ly-6 homologue. These chimeras have caused serious annotation errors in secondary databases.
NASA Astrophysics Data System (ADS)
Nur Afifah, Diana; Rustanti, Ninik; Anjani, Gemala; Syah, Dahrul; Yanti; Suhartono, Maggy T.
2017-02-01
This paper presents the proteomics study which includes separation, identification and characterization of proteins. The experiment on Indonesian fermented food such as extracellular fibrinolytic protease from Bacillus licheniformis RO3 and Bacillus pumilus 2.g isolated from red oncom and tempeh gembus was conducted. The experimental works comprise the following steps: (1) a combination of one- and two-dimensional electrophoresis analysis, (2) mass spectrometry analysis using MALDI-TOF-MS and (3) investigation using protein database. The result suggested that there were new two protein fractions of B. licheniformis RO3 and three protein fractions of B. pumilus 2.g. These result has not been previously reported.
Komatsu, Setsuko; Wang, Xin; Yin, Xiaojian; Nanjo, Yohei; Ohyanagi, Hajime; Sakata, Katsumi
2017-06-23
The Soybean Proteome Database (SPD) stores data on soybean proteins obtained with gel-based and gel-free proteomic techniques. The database was constructed to provide information on proteins for functional analyses. The majority of the data is focused on soybean (Glycine max 'Enrei'). The growth and yield of soybean are strongly affected by environmental stresses such as flooding. The database was originally constructed using data on soybean proteins separated by two-dimensional polyacrylamide gel electrophoresis, which is a gel-based proteomic technique. Since 2015, the database has been expanded to incorporate data obtained by label-free mass spectrometry-based quantitative proteomics, which is a gel-free proteomic technique. Here, the portions of the database consisting of gel-free proteomic data are described. The gel-free proteomic database contains 39,212 proteins identified in 63 sample sets, such as temporal and organ-specific samples of soybean plants grown under flooding stress or non-stressed conditions. In addition, data on organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored. Furthermore, the database integrates multiple omics data such as genomics, transcriptomics, metabolomics, and proteomics. The SPD database is accessible at http://proteome.dc.affrc.go.jp/Soybean/. The Soybean Proteome Database stores data obtained from both gel-based and gel-free proteomic techniques. The gel-free proteomic database comprises 39,212 proteins identified in 63 sample sets, such as different organs of soybean plants grown under flooding stress or non-stressed conditions in a time-dependent manner. In addition, organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored in the gel-free proteomics database. A total of 44,704 proteins, including 5490 proteins identified using a gel-based proteomic technique, are stored in the SPD. It accounts for approximately 80% of all predicted proteins from genome sequences, though there are over lapped proteins. Based on the demonstrated application of data stored in the database for functional analyses, it is suggested that these data will be useful for analyses of biological mechanisms in soybean. Furthermore, coupled with recent advances in information and communication technology, the usefulness of this database would increase in the analyses of biological mechanisms. Copyright © 2017 Elsevier B.V. All rights reserved.
Identification of species- and tissue-specific proteins using proteomic strategy
NASA Astrophysics Data System (ADS)
Chernukha, I. M.; Vostrikova, N. L.; Kovalev, L. I.; Shishkin, S. S.; Kovaleva, M. A.; Manukhin, Y. S.
2017-09-01
Proteomic technologies have proven to be very effective for detecting biochemical changes in meat products, such as changes in tissue- and species-specific proteins. In the tissues of cattle, pig, horse and camel M. longissimus dorsi both tissue- and species specific proteins were detected using two dimensional electrophoresis. Species-specific isoforms of several muscle proteins were also identified. The identified and described proteins of cattle, pig, horse and camel skeletal muscles (including mass spectra of the tryptic peptides) were added to the national free access database “Muscle organ proteomics”. This research has enabled the development of new highly sensitive technologies for meat product quality control against food fraud.
Allmer, Jens; Kuhlgert, Sebastian; Hippler, Michael
2008-07-07
The amount of information stemming from proteomics experiments involving (multi dimensional) separation techniques, mass spectrometric analysis, and computational analysis is ever-increasing. Data from such an experimental workflow needs to be captured, related and analyzed. Biological experiments within this scope produce heterogenic data ranging from pictures of one or two-dimensional protein maps and spectra recorded by tandem mass spectrometry to text-based identifications made by algorithms which analyze these spectra. Additionally, peptide and corresponding protein information needs to be displayed. In order to handle the large amount of data from computational processing of mass spectrometric experiments, automatic import scripts are available and the necessity for manual input to the database has been minimized. Information is in a generic format which abstracts from specific software tools typically used in such an experimental workflow. The software is therefore capable of storing and cross analysing results from many algorithms. A novel feature and a focus of this database is to facilitate protein identification by using peptides identified from mass spectrometry and link this information directly to respective protein maps. Additionally, our application employs spectral counting for quantitative presentation of the data. All information can be linked to hot spots on images to place the results into an experimental context. A summary of identified proteins, containing all relevant information per hot spot, is automatically generated, usually upon either a change in the underlying protein models or due to newly imported identifications. The supporting information for this report can be accessed in multiple ways using the user interface provided by the application. We present a proteomics database which aims to greatly reduce evaluation time of results from mass spectrometric experiments and enhance result quality by allowing consistent data handling. Import functionality, automatic protein detection, and summary creation act together to facilitate data analysis. In addition, supporting information for these findings is readily accessible via the graphical user interface provided. The database schema and the implementation, which can easily be installed on virtually any server, can be downloaded in the form of a compressed file from our project webpage.
Protein Simulation Data in the Relational Model.
Simms, Andrew M; Daggett, Valerie
2012-10-01
High performance computing is leading to unprecedented volumes of data. Relational databases offer a robust and scalable model for storing and analyzing scientific data. However, these features do not come without a cost-significant design effort is required to build a functional and efficient repository. Modeling protein simulation data in a relational database presents several challenges: the data captured from individual simulations are large, multi-dimensional, and must integrate with both simulation software and external data sites. Here we present the dimensional design and relational implementation of a comprehensive data warehouse for storing and analyzing molecular dynamics simulations using SQL Server.
Protein Simulation Data in the Relational Model
Simms, Andrew M.; Daggett, Valerie
2011-01-01
High performance computing is leading to unprecedented volumes of data. Relational databases offer a robust and scalable model for storing and analyzing scientific data. However, these features do not come without a cost—significant design effort is required to build a functional and efficient repository. Modeling protein simulation data in a relational database presents several challenges: the data captured from individual simulations are large, multi-dimensional, and must integrate with both simulation software and external data sites. Here we present the dimensional design and relational implementation of a comprehensive data warehouse for storing and analyzing molecular dynamics simulations using SQL Server. PMID:23204646
Separation and characterization of needle and xylem maritime pine proteins.
Costa, P; Pionneau, C; Bauw, G; Dubos, C; Bahrmann, N; Kremer, A; Frigerio, J M; Plomion, C
1999-01-01
Two-dimensional gel electrophoresis (2-DE) and image analysis are currently used for proteome analysis in maritime pine (Pinus pinaster Ait.). This study presents a database of expressed proteins extracted from needles and xylem, two important tissues for growth and wood formation. Electrophoresis was carried out by isoelectric focusing (IEF) in the first dimension and sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) in the second. Silver staining made it possible to detect an average of 900 and 600 spots on 2-DE gels from needles and xylem, respectively. A total of 28 xylem and 35 needle proteins were characterized by internal peptide microsequencing. Out of these 63 proteins, 57 (90%) could be identified based on amino acid similarity with known proteins, of which 24 (42%) have already been described in conifers. Overall comparison of both tissues indicated that 29% and 36% of the spots were specific to xylem and needles, respectively, while the other spots were of identical molecular weight and isoelectric point. The homology of spot location in 2-DE patterns was further validated by sequence analysis of proteins present in both tissues. A proteomic database of maritime pine is accessible on the internet (http://www.pierroton.inra.fr/genetics/2D/).
Motivated Proteins: A web application for studying small three-dimensional protein motifs
Leader, David P; Milner-White, E James
2009-01-01
Background Small loop-shaped motifs are common constituents of the three-dimensional structure of proteins. Typically they comprise between three and seven amino acid residues, and are defined by a combination of dihedral angles and hydrogen bonding partners. The most abundant of these are αβ-motifs, asx-motifs, asx-turns, β-bulges, β-bulge loops, β-turns, nests, niches, Schellmann loops, ST-motifs, ST-staples and ST-turns. We have constructed a database of such motifs from a range of high-quality protein structures and built a web application as a visual interface to this. Description The web application, Motivated Proteins, provides access to these 12 motifs (with 48 sub-categories) in a database of over 400 representative proteins. Queries can be made for specific categories or sub-categories of motif, motifs in the vicinity of ligands, motifs which include part of an enzyme active site, overlapping motifs, or motifs which include a particular amino acid sequence. Individual proteins can be specified, or, where appropriate, motifs for all proteins listed. The results of queries are presented in textual form as an (X)HTML table, and may be saved as parsable plain text or XML. Motifs can be viewed and manipulated either individually or in the context of the protein in the Jmol applet structural viewer. Cartoons of the motifs imposed on a linear representation of protein secondary structure are also provided. Summary information for the motifs is available, as are histograms of amino acid distribution, and graphs of dihedral angles at individual positions in the motifs. Conclusion Motivated Proteins is a publicly and freely accessible web application that enables protein scientists to study small three-dimensional motifs without requiring knowledge of either Structured Query Language or the underlying database schema. PMID:19210785
The Proteome of Seed Development in the Model Legume Lotus japonicus1[C][W
Dam, Svend; Laursen, Brian S.; Ørnfelt, Jane H.; Jochimsen, Bjarne; Stærfeldt, Hans Henrik; Friis, Carsten; Nielsen, Kasper; Goffard, Nicolas; Besenbacher, Søren; Krusell, Lene; Sato, Shusei; Tabata, Satoshi; Thøgersen, Ida B.; Enghild, Jan J.; Stougaard, Jens
2009-01-01
We have characterized the development of seeds in the model legume Lotus japonicus. Like soybean (Glycine max) and pea (Pisum sativum), Lotus develops straight seed pods and each pod contains approximately 20 seeds that reach maturity within 40 days. Histological sections show the characteristic three developmental phases of legume seeds and the presence of embryo, endosperm, and seed coat in desiccated seeds. Furthermore, protein, oil, starch, phytic acid, and ash contents were determined, and this indicates that the composition of mature Lotus seed is more similar to soybean than to pea. In a first attempt to determine the seed proteome, both a two-dimensional polyacrylamide gel electrophoresis approach and a gel-based liquid chromatography-mass spectrometry approach were used. Globulins were analyzed by two-dimensional polyacrylamide gel electrophoresis, and five legumins, LLP1 to LLP5, and two convicilins, LCP1 and LCP2, were identified by matrix-assisted laser desorption ionization quadrupole/time-of-flight mass spectrometry. For two distinct developmental phases, seed filling and desiccation, a gel-based liquid chromatography-mass spectrometry approach was used, and 665 and 181 unique proteins corresponding to gene accession numbers were identified for the two phases, respectively. All of the proteome data, including the experimental data and mass spectrometry spectra peaks, were collected in a database that is available to the scientific community via a Web interface (http://www.cbs.dtu.dk/cgi-bin/lotus/db.cgi). This database establishes the basis for relating physiology, biochemistry, and regulation of seed development in Lotus. Together with a new Web interface (http://bioinfoserver.rsbs.anu.edu.au/utils/PathExpress4legumes/) collecting all protein identifications for Lotus, Medicago, and soybean seed proteomes, this database is a valuable resource for comparative seed proteomics and pathway analysis within and beyond the legume family. PMID:19129418
Accounting for host cell protein behavior in anion-exchange chromatography.
Swanson, Ryan K; Xu, Ruo; Nettleton, Daniel S; Glatz, Charles E
2016-11-01
Host cell proteins (HCP) are a problematic set of impurities in downstream processing (DSP) as they behave most similarly to the target protein during separation. Approaching DSP with the knowledge of HCP separation behavior would be beneficial for the production of high purity recombinant biologics. Therefore, this work was aimed at characterizing the separation behavior of complex mixtures of HCP during a commonly used method: anion-exchange chromatography (AEX). An additional goal was to evaluate the performance of a statistical methodology, based on the characterization data, as a tool for predicting protein separation behavior. Aqueous two-phase partitioning followed by two-dimensional electrophoresis provided data on the three physicochemical properties most commonly exploited during DSP for each HCP: pI (isoelectric point), molecular weight, and surface hydrophobicity. The protein separation behaviors of two alternative expression host extracts (corn germ and E. coli) were characterized. A multivariate random forest (MVRF) statistical methodology was then applied to the database of characterized proteins creating a tool for predicting the AEX behavior of a mixture of proteins. The accuracy of the MVRF method was determined by calculating a root mean squared error value for each database. This measure never exceeded a value of 0.045 (fraction of protein populating each of the multiple separation fractions) for AEX. © 2016 American Institute of Chemical Engineers Biotechnol. Prog., 32:1453-1463, 2016. © 2016 American Institute of Chemical Engineers.
Rachinsky, Anna; Guerrero, Felix D; Scoles, Glen A
2007-12-01
We used gel electrophoresis and mass spectrometry to investigate differences in protein expression in ovarian tissues from Babesia bovis-infected and uninfected southern cattle tick, Rhipicephalus (Boophilus) microplus. Soluble and membrane proteins were extracted from ovaries of adult female ticks, and analyzed by isoelectric focusing (IEF) and one-dimensional or two-dimensional (2-D) gel electrophoresis. Protein patterns were analyzed for differences in expression between infected and uninfected ticks. 2-D separation of proteins revealed a number of proteins that appeared to be up- or down-regulated in response to infection with Babesia, in particular membrane/membrane-associated proteins and proteins in a low molecular mass range between 6 and 36kDa. A selection of differentially expressed proteins was subjected to analysis by capillary-HPLC-electrospray tandem mass spectrometry (HPLC-ESI-MS/MS). Among the ovarian proteins that were up-regulated in infected ticks were calreticulin, two myosin subunits, an endoplasmic reticulum protein, a peptidyl-prolyl cis-trans isomerase (PPIase), a cytochrome c oxidase subunit, a glutamine synthetase, and a family of Kunitz-type serine protease inhibitors. Among the down-regulated ovarian proteins were another PPIase, a hemoglobin subunit, and a lysozyme. This study is part of an ongoing effort to establish a proteome database that can be utilized to investigate specific proteins involved in successful pathogen transmission.
Yoshida, Yutaka; Miyazaki, Kenji; Kamiie, Junichi; Sato, Masao; Okuizumi, Seiji; Kenmochi, Akihisa; Kamijo, Ken'ichi; Nabetani, Takuji; Tsugita, Akira; Xu, Bo; Zhang, Ying; Yaoita, Eishin; Osawa, Tetsuo; Yamamoto, Tadashi
2005-03-01
To contribute to physiology and pathophysiology of the glomerulus of human kidney, we have launched a proteomic study of human glomerulus, and compiled a profile of proteins expressed in the glomerulus of normal human kidney by two-dimensional gel electrophoresis (2-DE) and identification with matrix-assisted laser desorption/ionization-time of flight mass spectrometry (MALDI-TOF MS) and/or liquid chromatography-tandem mass spectrometry (LC-MS/MS). Kidney cortices with normal appearance were obtained from patients under surgical nephrectomy due to renal tumor, and glomeruli were highly purified by a standard sieving method followed by picking-up under a phase-contrast microscope. The glomerular proteins were separated by 2-DE with 24 cm immobilized pH gradient strips in the 3-10 range in the first dimension and 26 x 20 cm sodium dodecyl sulfate polyacrylamide electrophoresis gels of 12.5% in the second dimension. Gels were silver-stained, and valid spots were processed for identification through an integrated robotic system that consisted of a spot picker, an in-gel digester, and a MALDI-TOF MS and / or a LC-MS/MS. From 2-DE gel images of glomeruli of four subjects with no apparent pathologic manifestations, a synthetic gel image of normal glomerular proteins was created. The synthetic gel image contained 1713 valid spots, of which 1559 spots were commonly observed in the respective 2-DE gels. Among the 1559 spots, 347 protein spots, representing 212 proteins, have so far been identified, and used for the construction of an extensible markup language (XML)-based database. The database is deposited on a web site (http://www.sw.nec.co.jp/bio/rd/hgldb/index.html) in a form accessible to researchers to contribute to proteomic studies of human glomerulus in health and disease.
Hayashi, Takanori; Matsuzaki, Yuri; Yanagisawa, Keisuke; Ohue, Masahito; Akiyama, Yutaka
2018-05-08
Protein-protein interactions (PPIs) play several roles in living cells, and computational PPI prediction is a major focus of many researchers. The three-dimensional (3D) structure and binding surface are important for the design of PPI inhibitors. Therefore, rigid body protein-protein docking calculations for two protein structures are expected to allow elucidation of PPIs different from known complexes in terms of 3D structures because known PPI information is not explicitly required. We have developed rapid PPI prediction software based on protein-protein docking, called MEGADOCK. In order to fully utilize the benefits of computational PPI predictions, it is necessary to construct a comprehensive database to gather prediction results and their predicted 3D complex structures and to make them easily accessible. Although several databases exist that provide predicted PPIs, the previous databases do not contain a sufficient number of entries for the purpose of discovering novel PPIs. In this study, we constructed an integrated database of MEGADOCK PPI predictions, named MEGADOCK-Web. MEGADOCK-Web provides more than 10 times the number of PPI predictions than previous databases and enables users to conduct PPI predictions that cannot be found in conventional PPI prediction databases. In MEGADOCK-Web, there are 7528 protein chains and 28,331,628 predicted PPIs from all possible combinations of those proteins. Each protein structure is annotated with PDB ID, chain ID, UniProt AC, related KEGG pathway IDs, and known PPI pairs. Additionally, MEGADOCK-Web provides four powerful functions: 1) searching precalculated PPI predictions, 2) providing annotations for each predicted protein pair with an experimentally known PPI, 3) visualizing candidates that may interact with the query protein on biochemical pathways, and 4) visualizing predicted complex structures through a 3D molecular viewer. MEGADOCK-Web provides a huge amount of comprehensive PPI predictions based on docking calculations with biochemical pathways and enables users to easily and quickly assess PPI feasibilities by archiving PPI predictions. MEGADOCK-Web also promotes the discovery of new PPIs and protein functions and is freely available for use at http://www.bi.cs.titech.ac.jp/megadock-web/ .
Pineda, M; Sajnani, C; Barón, M
2010-01-01
We have analyzed the chloroplast proteome of Nicotiana benthamiana using two-dimensional gel electrophoresis and mass spectrometry followed by a database search. In order to improve the resolution of the two-dimensional electrophoresis gels, we have made separate maps for the low and the high pH range. At least 200 spots were detected. We identified 72 polypeptides, some being isoforms of different multiprotein families. In addition, changes in this chloroplast proteome induced by the infection with the Spanish strain of the Pepper mild mottle virus were investigated. Viral infection induced the down-regulation of several chloroplastidic proteins involved in both the photosynthetic electron-transport chain and the Benson-Calvin cycle.
A Reference Proteomic Database of Lactobacillus plantarum CMCC-P0002
Tian, Wanhong; Yu, Gang; Liu, Xiankai; Wang, Jie; Feng, Erling; Zhang, Xuemin; Chen, Bei; Zeng, Ming; Wang, Hengliang
2011-01-01
Lactobacillus plantarum is a widespread probiotic bacteria found in many fermented food products. In this study, the whole-cell proteins and secretory proteins of L. plantarum were separated by two-dimensional electrophoresis method. A total of 434 proteins were identified by tandem mass spectrometry, including a plasmid-encoded hypothetical protein pLP9000_05. The information of first 20 highest abundance proteins was listed for the further genetic manipulation of L. plantarum, such as construction of high-level expressions system. Furthermore, the first interaction map of L. plantarum was established by Blue-Native/SDS-PAGE technique. A heterodimeric complex composed of maltose phosphorylase Map3 and Map2, and two homodimeric complexes composed of Map3 and Map2 respectively, were identified at the same time, indicating the important roles of these proteins. These findings provided valuable information for the further proteomic researches of L. plantarum. PMID:21998671
A reference proteomic database of Lactobacillus plantarum CMCC-P0002.
Zhu, Li; Hu, Wei; Liu, Datao; Tian, Wanhong; Yu, Gang; Liu, Xiankai; Wang, Jie; Feng, Erling; Zhang, Xuemin; Chen, Bei; Zeng, Ming; Wang, Hengliang
2011-01-01
Lactobacillus plantarum is a widespread probiotic bacteria found in many fermented food products. In this study, the whole-cell proteins and secretory proteins of L. plantarum were separated by two-dimensional electrophoresis method. A total of 434 proteins were identified by tandem mass spectrometry, including a plasmid-encoded hypothetical protein pLP9000_05. The information of first 20 highest abundance proteins was listed for the further genetic manipulation of L. plantarum, such as construction of high-level expressions system. Furthermore, the first interaction map of L. plantarum was established by Blue-Native/SDS-PAGE technique. A heterodimeric complex composed of maltose phosphorylase Map3 and Map2, and two homodimeric complexes composed of Map3 and Map2 respectively, were identified at the same time, indicating the important roles of these proteins. These findings provided valuable information for the further proteomic researches of L. plantarum.
Shin, Jae-Min; Cho, Doo-Ho
2005-01-01
PDB-Ligand (http://www.idrtech.com/PDB-Ligand/) is a three-dimensional structure database of small molecular ligands that are bound to larger biomolecules deposited in the Protein Data Bank (PDB). It is also a database tool that allows one to browse, classify, superimpose and visualize these structures. As of May 2004, there are about 4870 types of small molecular ligands, experimentally determined as a complex with protein or DNA in the PDB. The proteins that a given ligand binds are often homologous and present the same binding structure to the ligand. However, there are also many instances wherein a given ligand binds to two or more unrelated proteins, or to the same or homologous protein in different binding environments. PDB-Ligand serves as an interactive structural analysis and clustering tool for all the ligand-binding structures in the PDB. PDB-Ligand also provides an easier way to obtain a number of different structure alignments of many related ligand-binding structures based on a simple and flexible ligand clustering method. PDB-Ligand will be a good resource for both a better interpretation of ligand-binding structures and the development of better scoring functions to be used in many drug discovery applications.
Compound activity prediction using models of binding pockets or ligand properties in 3D
Kufareva, Irina; Chen, Yu-Chen; Ilatovskiy, Andrey V.; Abagyan, Ruben
2014-01-01
Transient interactions of endogenous and exogenous small molecules with flexible binding sites in proteins or macromolecular assemblies play a critical role in all biological processes. Current advances in high-resolution protein structure determination, database development, and docking methodology make it possible to design three-dimensional models for prediction of such interactions with increasing accuracy and specificity. Using the data collected in the Pocketome encyclopedia, we here provide an overview of two types of the three-dimensional ligand activity models, pocket-based and ligand property-based, for two important classes of proteins, nuclear and G-protein coupled receptors. For half the targets, the pocket models discriminate actives from property matched decoys with acceptable accuracy (the area under ROC curve, AUC, exceeding 84%) and for about one fifth of the targets with high accuracy (AUC > 95%). The 3D ligand property field models performed better than 95% in half of the cases. The high performance models can already become a basis of activity predictions for new chemicals. Family-wide benchmarking of the models highlights strengths of both approaches and helps identify their inherent bottlenecks and challenges. PMID:23116466
Sprenger, Richard R; Speijer, Dave; Back, Jaap Willem; De Koster, Chris G; Pannekoek, Hans; Horrevoets, Anton J G
2004-01-01
The human endothelial cell plasma membrane harbors two subdomains of similar lipid composition, caveolae and rafts, both crucially involved in various essential cellular processes like transcytosis, signal transduction and cholesterol homeostasis. Caveolin-enriched membranes, isolated by either cationic silica or buoyant density methods, were explored by comparing large series of two-dimensional (2-D) maps and subsequent identification of over 100 protein spots by matrix-assisted laser desorption/ionization (MALDI) peptide mass fingerprinting. Improved representation and identification of membrane proteins and valuable information on various post-translational modifications was achieved by the presented optimized procedures for solubilization, destaining and database searching/computing. Whereas the cationic silica purification yielded predominantly known endoplasmic reticulum residents, the cold-detergent method yielded a large number of known caveolae residents, including caveolin-1. Thus, a large part of this subproteome was established, including known (trans-)membrane, signal transduction and glycosyl phosphatidylinositol (GPI)-anchored proteins. Several predicted proteins from the human genome were isolated for the first time from biological samples, including SGRP58, SLP-2, C8ORF2, and XRP-2. These findings and various optimized procedures can serve as a reference to study the differential composition of endothelial cell caveolae and rafts, known to be involved in pathologies like cancer and cardiovascular disease.
Histoplasma capsulatum proteome response to decreased iron availability
Winters, Michael S; Spellman, Daniel S; Chan, Qilin; Gomez, Francisco J; Hernandez, Margarita; Catron, Brittany; Smulian, Alan G; Neubert, Thomas A; Deepe, George S
2008-01-01
Background A fundamental pathogenic feature of the fungus Histoplasma capsulatum is its ability to evade innate and adaptive immune defenses. Once ingested by macrophages the organism is faced with several hostile environmental conditions including iron limitation. H. capsulatum can establish a persistent state within the macrophage. A gap in knowledge exists because the identities and number of proteins regulated by the organism under host conditions has yet to be defined. Lack of such knowledge is an important problem because until these proteins are identified it is unlikely that they can be targeted as new and innovative treatment for histoplasmosis. Results To investigate the proteomic response by H. capsulatum to decreasing iron availability we have created H. capsulatum protein/genomic databases compatible with current mass spectrometric (MS) search engines. Databases were assembled from the H. capsulatum G217B strain genome using gene prediction programs and expressed sequence tag (EST) libraries. Searching these databases with MS data generated from two dimensional (2D) in-gel digestions of proteins resulted in over 50% more proteins identified compared to searching the publicly available fungal databases alone. Using 2D gel electrophoresis combined with statistical analysis we discovered 42 H. capsulatum proteins whose abundance was significantly modulated when iron concentrations were lowered. Altered proteins were identified by mass spectrometry and database searching to be involved in glycolysis, the tricarboxylic acid cycle, lysine metabolism, protein synthesis, and one protein sequence whose function was unknown. Conclusion We have created a bioinformatics platform for H. capsulatum and demonstrated the utility of a proteomic approach by identifying a shift in metabolism the organism utilizes to cope with the hostile conditions provided by the host. We have shown that enzyme transcripts regulated by other fungal pathogens in response to lowering iron availability are also regulated in H. capsulatum at the protein level. We also identified H. capsulatum proteins sensitive to iron level reductions which have yet to be connected to iron availability in other pathogens. These data also indicate the complexity of the response by H. capsulatum to nutritional deprivation. Finally, we demonstrate the importance of a strain specific gene/protein database for H. capsulatum proteomic analysis. PMID:19108728
Protein Expression Profile using Two-Dimensional Gel Analysis in Squamous Cervical Cancer Patients
Bae, Su-Mi; Min, Hyun-Jin; Ding, Guo Hua; Kwak, Sun-Young; Cho, Young-Lae; Nam, Kye-Hyun; Park, Choong Hak; Kim, Yong-Wan; Kim, Chong-Kook; Han, Byoung-Don; Lee, Young-Joo; Kim, Do Kang
2006-01-01
Purpose Screening in cervical cancer is now progressing to discover candidate genes and proteins that may serve as biological markers and that play a role in tumor progression. We examined the protein expression patterns of the squamous cell carcinoma (SCC) tissues from Korean women with using two- dimensional polyacrylamide gel electrophoresis (2-DE) and matrix assisted laser desorption/ionization-time of flight (MALDI- TOF) mass spectrometer. Materials and Methods Normal cervix and SCC tissues were solubilized and 2-DE was performed using pH 3~10 linear IPG strips of 17 cm length. The protein expression was evaluated using PDQuest 2-D software™. The differentially expressed protein spots were identified with a MALDI-TOF mass spectrometer, and the peptide mass spectra identifications were performed using the Mascot program and by searching the Swiss-prot or NCBInr databases. Results A total of 35 proteins were detected in SCC. 17 proteins were up-regulated and 18 proteins weredown-regulated. Among the proteins that were identified, 12 proteins (pigment epithelium derived factor, annexin A2 and A5, keratin 19 and 20, heat shock protein 27, smooth muscle protein 22 alpha, α-enolase, squamous cell carcinoma antigen 1 and 2, glutathione S-transferase and apolipoprotein a1) were protein previously known to be involved in tumor, and 21 proteins were newly identified in this study. Conclusion 2-DE offers the total protein expression profiles of SCC tissues; further characterization of these differentially expressed proteins will give a chance to identify the badly needed tumor-specific diagnostic markers for SCC. PMID:19771267
Hsing, Michael; Cherkasov, Artem
2008-06-25
Insertions and deletions (indels) represent a common type of sequence variations, which are less studied and pose many important biological questions. Recent research has shown that the presence of sizable indels in protein sequences may be indicative of protein essentiality and their role in protein interaction networks. Examples of utilization of indels for structure-based drug design have also been recently demonstrated. Nonetheless many structural and functional characteristics of indels remain less researched or unknown. We have created a web-based resource, Indel PDB, representing a structural database of insertions/deletions identified from the sequence alignments of highly similar proteins found in the Protein Data Bank (PDB). Indel PDB utilized large amounts of available structural information to characterize 1-, 2- and 3-dimensional features of indel sites. Indel PDB contains 117,266 non-redundant indel sites extracted from 11,294 indel-containing proteins. Unlike loop databases, Indel PDB features more indel sequences with secondary structures including alpha-helices and beta-sheets in addition to loops. The insertion fragments have been characterized by their sequences, lengths, locations, secondary structure composition, solvent accessibility, protein domain association and three dimensional structures. By utilizing the data available in Indel PDB, we have studied and presented here several sequence and structural features of indels. We anticipate that Indel PDB will not only enable future functional studies of indels, but will also assist protein modeling efforts and identification of indel-directed drug binding sites.
Hu, Ben; Kuang, Zheng-Kun; Feng, Shi-Yu; Wang, Dong; He, Song-Bing; Kong, De-Xin
2016-11-17
The crystallized ligands in the Protein Data Bank (PDB) can be treated as the inverse shapes of the active sites of corresponding proteins. Therefore, the shape similarity between a molecule and PDB ligands indicated the possibility of the molecule to bind with the targets. In this paper, we proposed a shape similarity profile that can be used as a molecular descriptor for ligand-based virtual screening. First, through three-dimensional (3D) structural clustering, 300 diverse ligands were extracted from the druggable protein-ligand database, sc-PDB. Then, each of the molecules under scrutiny was flexibly superimposed onto the 300 ligands. Superimpositions were scored by shape overlap and property similarity, producing a 300 dimensional similarity array termed the "Three-Dimensional Biologically Relevant Spectrum (BRS-3D)". Finally, quantitative or discriminant models were developed with the 300 dimensional descriptor using machine learning methods (support vector machine). The effectiveness of this approach was evaluated using 42 benchmark data sets from the G protein-coupled receptor (GPCR) ligand library and the GPCR decoy database (GLL/GDD). We compared the performance of BRS-3D with other 2D and 3D state-of-the-art molecular descriptors. The results showed that models built with BRS-3D performed best for most GLL/GDD data sets. We also applied BRS-3D in histone deacetylase 1 inhibitors screening and GPCR subtype selectivity prediction. The advantages and disadvantages of this approach are discussed.
Kang, Beom Sik; Pugalendhi, GaneshKumar; Kim, Ku-Jin
2017-10-13
Interactions between protein molecules are essential for the assembly, function, and regulation of proteins. The contact region between two protein molecules in a protein complex is usually complementary in shape for both molecules and the area of the contact region can be used to estimate the binding strength between two molecules. Although the area is a value calculated from the three-dimensional surface, it cannot represent the three-dimensional shape of the surface. Therefore, we propose an original concept of two-dimensional contact area which provides further information such as the ruggedness of the contact region. We present a novel algorithm for calculating the binding direction between two molecules in a protein complex, and then suggest a method to compute the two-dimensional flattened area of the contact region between two molecules based on the binding direction.
TIPdb-3D: the three-dimensional structure database of phytochemicals from Taiwan indigenous plants.
Tung, Chun-Wei; Lin, Ying-Chi; Chang, Hsun-Shuo; Wang, Chia-Chi; Chen, Ih-Sheng; Jheng, Jhao-Liang; Li, Jih-Heng
2014-01-01
The rich indigenous and endemic plants in Taiwan serve as a resourceful bank for biologically active phytochemicals. Based on our TIPdb database curating bioactive phytochemicals from Taiwan indigenous plants, this study presents a three-dimensional (3D) chemical structure database named TIPdb-3D to support the discovery of novel pharmacologically active compounds. The Merck Molecular Force Field (MMFF94) was used to generate 3D structures of phytochemicals in TIPdb. The 3D structures could facilitate the analysis of 3D quantitative structure-activity relationship, the exploration of chemical space and the identification of potential pharmacologically active compounds using protein-ligand docking. Database URL: http://cwtung.kmu.edu.tw/tipdb. © The Author(s) 2014. Published by Oxford University Press.
Schokraie, Elham; Hotz-Wagenblatt, Agnes; Warnken, Uwe; Mali, Brahim; Frohme, Marcus; Förster, Frank; Dandekar, Thomas; Hengherr, Steffen; Schill, Ralph O; Schnölzer, Martina
2010-03-03
Tardigrades are small, multicellular invertebrates which are able to survive times of unfavourable environmental conditions using their well-known capability to undergo cryptobiosis at any stage of their life cycle. Milnesium tardigradum has become a powerful model system for the analysis of cryptobiosis. While some genetic information is already available for Milnesium tardigradum the proteome is still to be discovered. Here we present to the best of our knowledge the first comprehensive study of Milnesium tardigradum on the protein level. To establish a proteome reference map we developed optimized protocols for protein extraction from tardigrades in the active state and for separation of proteins by high resolution two-dimensional gel electrophoresis. Since only limited sequence information of M. tardigradum on the genome and gene expression level is available to date in public databases we initiated in parallel a tardigrade EST sequencing project to allow for protein identification by electrospray ionization tandem mass spectrometry. 271 out of 606 analyzed protein spots could be identified by searching against the publicly available NCBInr database as well as our newly established tardigrade protein database corresponding to 144 unique proteins. Another 150 spots could be identified in the tardigrade clustered EST database corresponding to 36 unique contigs and ESTs. Proteins with annotated function were further categorized in more detail by their molecular function, biological process and cellular component. For the proteins of unknown function more information could be obtained by performing a protein domain annotation analysis. Our results include proteins like protein member of different heat shock protein families and LEA group 3, which might play important roles in surviving extreme conditions. The proteome reference map of Milnesium tardigradum provides the basis for further studies in order to identify and characterize the biochemical mechanisms of tolerance to extreme desiccation. The optimized proteomics workflow will enable application of sensitive quantification techniques to detect differences in protein expression, which are characteristic of the active and anhydrobiotic states of tardigrades.
Schokraie, Elham; Hotz-Wagenblatt, Agnes; Warnken, Uwe; Mali, Brahim; Frohme, Marcus; Förster, Frank; Dandekar, Thomas; Hengherr, Steffen; Schill, Ralph O.; Schnölzer, Martina
2010-01-01
Background Tardigrades are small, multicellular invertebrates which are able to survive times of unfavourable environmental conditions using their well-known capability to undergo cryptobiosis at any stage of their life cycle. Milnesium tardigradum has become a powerful model system for the analysis of cryptobiosis. While some genetic information is already available for Milnesium tardigradum the proteome is still to be discovered. Principal Findings Here we present to the best of our knowledge the first comprehensive study of Milnesium tardigradum on the protein level. To establish a proteome reference map we developed optimized protocols for protein extraction from tardigrades in the active state and for separation of proteins by high resolution two-dimensional gel electrophoresis. Since only limited sequence information of M. tardigradum on the genome and gene expression level is available to date in public databases we initiated in parallel a tardigrade EST sequencing project to allow for protein identification by electrospray ionization tandem mass spectrometry. 271 out of 606 analyzed protein spots could be identified by searching against the publicly available NCBInr database as well as our newly established tardigrade protein database corresponding to 144 unique proteins. Another 150 spots could be identified in the tardigrade clustered EST database corresponding to 36 unique contigs and ESTs. Proteins with annotated function were further categorized in more detail by their molecular function, biological process and cellular component. For the proteins of unknown function more information could be obtained by performing a protein domain annotation analysis. Our results include proteins like protein member of different heat shock protein families and LEA group 3, which might play important roles in surviving extreme conditions. Conclusions The proteome reference map of Milnesium tardigradum provides the basis for further studies in order to identify and characterize the biochemical mechanisms of tolerance to extreme desiccation. The optimized proteomics workflow will enable application of sensitive quantification techniques to detect differences in protein expression, which are characteristic of the active and anhydrobiotic states of tardigrades. PMID:20224743
Zhou, Y-J; Xue, J-G; Wang, X-G; Zhang, X-Q
2012-11-12
Buffalograss, Buchloe dactyloides, is a dioecious species native to the Great Plains of North America. The florets at the early stages of development possess both gynoecium and androecium organ primordia but later become unisexual. Very little is known about the proteomic changes that occur when the florets change from hermaphroditism to unisexuality. We compared the protein composition of florets at the hermaphroditic stage with that at the unisexual stage. The development stage of the floret was determined by stereomicroscopic observation. Two-dimensional gel electrophoresis was used to separate the proteins extracted from female and male inflorescences. Stage- specific protein maps, with an average of about 400 spots per map, were analyzed with the protein analysis software. Eighteen spots were found to be differentially expressed between the hermaphrodite and unisexual stages. Of these, 12 were present at both stages but with a different expression value. Four specific spots appeared at the hermaphrodite stage and disappeared at the unisexual stage. Two specific protein spots were associated with female and male floret differentiation. One appears to be associated with contabescence in the female floret and the final protein appears to lead to the abortion of gynoecium in the male floret. The MALDI TOF/TOF technique was used for peptide mass fingerprinting of the differentially expressed proteins and the MASCOT software was used to search the protein database. However, only two protein spots were identified from the database. These were aldolase1 and Os05g0574400 (similar to malate dehydrogenase). This type of proteomic study can help to identify novel protein products and determine the mechanisms involved in the floral sex differentiation process in buffalo grass.
[Proteome analysis on interaction between Anoectochilus roxburghii and Mycorrhizal fungus].
Gao, Chuan; Guo, Shun-Xing; Zhang, Jing; Chen, Juan; Zhang, Li-Chun
2012-12-01
To study the mechanism of plant growing promoted by Mycorrhizal fungus through the difference of proteomes. The differential proteomes between uninoculated and inoculated endophytic fungi, Epulorhiza sp. on Anoectochilus roxburghii were analyzed by two-dimensional gel electrophoresis and MALDI-TOF/TOF mass spectrum. Twenty-seven protein spots were analyzed by matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF-MS). Twenty-two candidate proteins were identified by database comparisons. The function of these proteins mostly involved in signal transduction, metabolic regulation, as well as photosynthesis and substance metabolism. The results indicate that the regulator control system of plant is influenced by fungi action, and the positive regulation improves substance metabolism and photosynthesis, which results in strong plant and higher resistance. It is also deduced that silent genes may exist in endosymbiosis plants.
von Grotthuss, Marcin; Plewczynski, Dariusz; Ginalski, Krzysztof; Rychlewski, Leszek; Shakhnovich, Eugene I
2006-02-06
The number of protein structures from structural genomics centers dramatically increases in the Protein Data Bank (PDB). Many of these structures are functionally unannotated because they have no sequence similarity to proteins of known function. However, it is possible to successfully infer function using only structural similarity. Here we present the PDB-UF database, a web-accessible collection of predictions of enzymatic properties using structure-function relationship. The assignments were conducted for three-dimensional protein structures of unknown function that come from structural genomics initiatives. We show that 4 hypothetical proteins (with PDB accession codes: 1VH0, 1NS5, 1O6D, and 1TO0), for which standard BLAST tools such as PSI-BLAST or RPS-BLAST failed to assign any function, are probably methyltransferase enzymes. We suggest that the structure-based prediction of an EC number should be conducted having the different similarity score cutoff for different protein folds. Moreover, performing the annotation using two different algorithms can reduce the rate of false positive assignments. We believe, that the presented web-based repository will help to decrease the number of protein structures that have functions marked as "unknown" in the PDB file. http://paradox.harvard.edu/PDB-UF and http://bioinfo.pl/PDB-UF.
The role of stabilization centers in protein thermal stability
DOE Office of Scientific and Technical Information (OSTI.GOV)
Magyar, Csaba; Gromiha, M. Michael; Sávoly, Zoltán
2016-02-26
The definition of stabilization centers was introduced almost two decades ago. They are centers of noncovalent long range interaction clusters, believed to have a role in maintaining the three-dimensional structure of proteins by preventing their decay due to their cooperative long range interactions. Here, this hypothesis is investigated from the viewpoint of thermal stability for the first time, using a large protein thermodynamics database. The positions of amino acids belonging to stabilization centers are correlated with available experimental thermodynamic data on protein thermal stability. Our analysis suggests that stabilization centers, especially solvent exposed ones, do contribute to the thermal stabilizationmore » of proteins. - Highlights: • Stabilization centers contribute to thermal stabilization of protein structures. • Stabilization center content correlates with melting temperature of proteins. • Exposed stabilization center content correlates with stability even in hyperthermophiles. • Stability changing mutations are frequently found at stabilization centers.« less
Structure and function of seed storage proteins in faba bean (Vicia faba L.).
Liu, Yujiao; Wu, Xuexia; Hou, Wanwei; Li, Ping; Sha, Weichao; Tian, Yingying
2017-05-01
The protein subunit is the most important basic unit of protein, and its study can unravel the structure and function of seed storage proteins in faba bean. In this study, we identified six specific protein subunits in Faba bean (cv. Qinghai 13) combining liquid chromatography (LC), liquid chromatography-electronic spray ionization mass (LC-ESI-MS/MS) and bio-information technology. The results suggested a diversity of seed storage proteins in faba bean, and a total of 16 proteins (four GroEL molecular chaperones and 12 plant-specific proteins) were identified from 97-, 96-, 64-, 47-, 42-, and 38-kD-specific protein subunits in faba bean based on the peptide sequence. We also analyzed the composition and abundance of the amino acids, the physicochemical characteristics, secondary structure, three-dimensional structure, transmembrane domain, and possible subcellular localization of these identified proteins in faba bean seed, and finally predicted function and structure. The three-dimensional structures were generated based on homologous modeling, and the protein function was analyzed based on the annotation from the non-redundant protein database (NR database, NCBI) and function analysis of optimal modeling. The objective of this study was to identify the seed storage proteins in faba bean and confirm the structure and function of these proteins. Our results can be useful for the study of protein nutrition and achieve breeding goals for optimal protein quality in faba bean.
VaProS: a database-integration approach for protein/genome information retrieval.
Gojobori, Takashi; Ikeo, Kazuho; Katayama, Yukie; Kawabata, Takeshi; Kinjo, Akira R; Kinoshita, Kengo; Kwon, Yeondae; Migita, Ohsuke; Mizutani, Hisashi; Muraoka, Masafumi; Nagata, Koji; Omori, Satoshi; Sugawara, Hideaki; Yamada, Daichi; Yura, Kei
2016-12-01
Life science research now heavily relies on all sorts of databases for genome sequences, transcription, protein three-dimensional (3D) structures, protein-protein interactions, phenotypes and so forth. The knowledge accumulated by all the omics research is so vast that a computer-aided search of data is now a prerequisite for starting a new study. In addition, a combinatory search throughout these databases has a chance to extract new ideas and new hypotheses that can be examined by wet-lab experiments. By virtually integrating the related databases on the Internet, we have built a new web application that facilitates life science researchers for retrieving experts' knowledge stored in the databases and for building a new hypothesis of the research target. This web application, named VaProS, puts stress on the interconnection between the functional information of genome sequences and protein 3D structures, such as structural effect of the gene mutation. In this manuscript, we present the notion of VaProS, the databases and tools that can be accessed without any knowledge of database locations and data formats, and the power of search exemplified in quest of the molecular mechanisms of lysosomal storage disease. VaProS can be freely accessed at http://p4d-info.nig.ac.jp/vapros/ .
Veluraja, Kasinadar; Selvin, Jeyasigamani F A; Venkateshwari, Selvakumar; Priyadarzini, Thanu R K
2010-09-23
The inherent flexibility and lack of strong intramolecular interactions of oligosaccharides demand the use of theoretical methods for their structural elucidation. In spite of the developments of theoretical methods, not much research on glycoinformatics is done so far when compared to bioinformatics research on proteins and nucleic acids. We have developed three dimensional structural database for a sialic acid-containing carbohydrates (3DSDSCAR). This is an open-access database that provides 3D structural models of a given sialic acid-containing carbohydrate. At present, 3DSDSCAR contains 60 conformational models, belonging to 14 different sialic acid-containing carbohydrates, deduced through 10 ns molecular dynamics (MD) simulations. The database is available at the URL: http://www.3dsdscar.org. Copyright 2010 Elsevier Ltd. All rights reserved.
Friso, Giulia; Giacomelli, Lisa; Ytterberg, A Jimmy; Peltier, Jean-Benoit; Rudella, Andrea; Sun, Qi; Wijk, Klaas J van
2004-02-01
An extensive analysis of the Arabidopsis thaliana peripheral and integral thylakoid membrane proteome was performed by sequential extractions with salt, detergent, and organic solvents, followed by multidimensional protein separation steps (reverse-phase HPLC and one- and two-dimensional electrophoresis gels), different enzymatic and nonenzymatic protein cleavage techniques, mass spectrometry, and bioinformatics. Altogether, 154 proteins were identified, of which 76 (49%) were alpha-helical integral membrane proteins. Twenty-seven new proteins without known function but with predicted chloroplast transit peptides were identified, of which 17 (63%) are integral membrane proteins. These new proteins, likely important in thylakoid biogenesis, include two rubredoxins, a potential metallochaperone, and a new DnaJ-like protein. The data were integrated with our analysis of the lumenal-enriched proteome. We identified 83 out of 100 known proteins of the thylakoid localized photosynthetic apparatus, including several new paralogues and some 20 proteins involved in protein insertion, assembly, folding, or proteolysis. An additional 16 proteins are involved in translation, demonstrating that the thylakoid membrane surface is an important site for protein synthesis. The high coverage of the photosynthetic apparatus and the identification of known hydrophobic proteins with low expression levels, such as cpSecE, Ohp1, and Ohp2, indicate an excellent dynamic resolution of the analysis. The sequential extraction process proved very helpful to validate transmembrane prediction. Our data also were cross-correlated to chloroplast subproteome analyses by other laboratories. All data are deposited in a new curated plastid proteome database (PPDB) with multiple search functions (http://cbsusrv01.tc.cornell.edu/users/ppdb/). This PPDB will serve as an expandable resource for the plant community.
An integrated approach to reservoir modeling
DOE Office of Scientific and Technical Information (OSTI.GOV)
Donaldson, K.
1993-08-01
The purpose of this research is to evaluate the usefulness of the following procedural and analytical methods in investigating the heterogeneity of the oil reserve for the Mississipian Big Injun Sandstone of the Granny Creek field, Clay and Roane counties, West Virginia: (1) relational database, (2) two-dimensional cross sections, (3) true three-dimensional modeling, (4) geohistory analysis, (5) a rule-based expert system, and (6) geographical information systems. The large data set could not be effectively integrated and interpreted without this approach. A relational database was designed to fully integrate three- and four-dimensional data. The database provides an effective means for maintainingmore » and manipulating the data. A two-dimensional cross section program was designed to correlate stratigraphy, depositional environments, porosity, permeability, and petrographic data. This flexible design allows for additional four-dimensional data. Dynamic Graphics[sup [trademark
Maeda, Kenji; Finnie, Christine; Svensson, Birte
2004-01-01
Barley thioredoxin h isoforms HvTrxh1 and HvTrxh2 differ in temporal and spatial distribution and in kinetic properties. Target proteins of HvTrxh1 and HvTrxh2 were identified in mature seeds and in seeds after 72 h of germination. Improvement of the established method for identification of thioredoxin-targeted proteins based on two-dimensional electrophoresis and fluorescence labelling of thiol groups was achieved by application of a highly sensitive Cy5 maleimide dye and large-format two-dimensional gels, resulting in a 10-fold increase in the observed number of labelled protein spots. The technique also provided information about accessible thiol groups in the proteins identified in the barley seed proteome. In total, 16 different putative target proteins were identified from 26 spots using tryptic in-gel digestion, matrix-assisted laser-desorption ionization-time-of-flight MS and database search. HvTrxh1 and HvTrxh2 were shown to have similar target specificity. Barley alpha-amylase/subtilisin inhibitor, previously demonstrated to be reduced by both HvTrxh1 and HvTrxh2, was among the identified target proteins, confirming the suitability of the method. Several alpha-amylase/trypsin inhibitors, some of which are already known as target proteins of thioredoxin h, and cyclophilin known as a target protein of m-type thioredoxin were also identified. Lipid transfer protein, embryospecific protein, three chitinase isoenzymes, a single-domain glyoxalase-like protein and superoxide dismutase were novel identifications of putative target proteins, suggesting new physiological roles of thioredoxin h in barley seeds. PMID:14636158
Sequence and structural analyses of nuclear export signals in the NESdb database
Xu, Darui; Farmer, Alicia; Collett, Garen; Grishin, Nick V.; Chook, Yuh Min
2012-01-01
We compiled >200 nuclear export signal (NES)–containing CRM1 cargoes in a database named NESdb. We analyzed the sequences and three-dimensional structures of natural, experimentally identified NESs and of false-positive NESs that were generated from the database in order to identify properties that might distinguish the two groups of sequences. Analyses of amino acid frequencies, sequence logos, and agreement with existing NES consensus sequences revealed strong preferences for the Φ1-X3-Φ2-X2-Φ3-X-Φ4 pattern and for negatively charged amino acids in the nonhydrophobic positions of experimentally identified NESs but not of false positives. Strong preferences against certain hydrophobic amino acids in the hydrophobic positions were also revealed. These findings led to a new and more precise NES consensus. More important, three-dimensional structures are now available for 68 NESs within 56 different cargo proteins. Analyses of these structures showed that experimentally identified NESs are more likely than the false positives to adopt α-helical conformations that transition to loops at their C-termini and more likely to be surface accessible within their protein domains or be present in disordered or unobserved parts of the structures. Such distinguishing features for real NESs might be useful in future NES prediction efforts. Finally, we also tested CRM1-binding of 40 NESs that were found in the 56 structures. We found that 16 of the NES peptides did not bind CRM1, hence illustrating how NESs are easily misidentified. PMID:22833565
Merchant, Mark; Kinney, Clint; Sanders, Paige
2009-12-01
Blood was collected from three juvenile alligators (Alligator mississippiensis) before, and again 24h after, injection with bacterial lipopolysaccharide (LPS). The leukocytes were collected from both samples, and the proteins were extracted. Each group of proteins was labeled with a different fluorescent dye and the differences in protein expression were analyzed by two dimensional differential in-gel expressions (2D-DIGE). The proteins which appeared to be increased or decreased by treatment with LPS were selected and analyzed by MALDI-TOF to determine mass and LC-MS/MS to acquire the partial protein sequences. The peptide sequences were compared to the NCBI protein sequence database to determine homology with other sequences from other species. Several proteins of interest appeared to be increased upon LPS stimulation. Proteins with homology to human transgelin-2, fish glucose-6-phosphate dehydrogenase, amphibian α-enolase, alligator lactate dehydrogenase, fish ubiquitin-activating enzyme, and fungal β-tubulin were also increased after LPS injection. Proteins with homology to fish vimentin 4, murine heterogeneous nuclear ribonucleoprotein A3, and avian calreticulin were found to be decreased in response to LPS. In addition, five proteins, four of which were up-regulated (827, 560, 512, and 650%) and one that exhibited repressed expression (307%), did not show homology to any protein in the database, and thus may represent newly discovered proteins. We are using this biochemical approach to isolate and characterize alligator proteins with potential relevant immune function.
Smolka, Marcus Bustamante; Martins-de-Souza, Daniel; Martins, Daniel; Winck, Flavia Vischi; Santoro, Carlos Eduardo; Castellari, Rafael Ramos; Ferrari, Fernanda; Brum, Itaraju Junior; Galembeck, Eduardo; Della Coletta Filho, Helvécio; Machado, Marcos Antonio; Marangoni, Sergio; Novello, Jose Camillo
2003-02-01
The bacteria Xylella fastidiosa is the causative agent of a number of economically important crop diseases, including citrus variegated chlorosis. Although its complete genome is already sequenced, X. fastidiosa is very poorly characterized by biochemical approaches at the protein level. In an initial effort to characterize protein expression in X. fastidiosa we used one- and two-dimensional gel electrophoresis and mass spectrometry to identify the products of 142 genes present in a whole cell extract and in an extracellular fraction of the citrus isolated strain 9a5c. Of particular interest for the study of pathogenesis are adhesion and secreted proteins. Homologs to proteins from three different adhesion systems (type IV fimbriae, mrk pili and hsf surface fibrils) were found to be coexpressed, the last two being detected only as multimeric complexes in the high molecular weight region of one-dimensional electrophoresis gels. Using a procedure to extract secreted proteins as well as proteins weakly attached to the cell surface we identified 30 different proteins including toxins, adhesion related proteins, antioxidant enzymes, different types of proteases and 16 hypothetical proteins. These data suggest that the intercellular space of X. fastidiosa colonies is a multifunctional microenvironment containing proteins related to in vivo bacterial survival and pathogenesis. A codon usage analysis of the most expressed proteins from the whole cell extract revealed a low biased distribution, which we propose is related to the slow growing nature of X. fastidiosa. A database of the X. fastidiosa proteome was developed and can be accessed via the internet (URL: www.proteome.ibi.unicamp.br).
The electric dipole moment of DNA-binding HU protein calculated by the use of an NMR database.
Takashima, S; Yamaoka, K
1999-08-30
Electric birefringence measurements indicated the presence of a large permanent dipole moment in HU protein-DNA complex. In order to substantiate this observation, numerical computation of the dipole moment of HU protein homodimer was carried out by using NMR protein databases. The dipole moments of globular proteins have hitherto been calculated with X-ray databases and NMR data have never been used before. The advantages of NMR databases are: (a) NMR data are obtained, unlike X-ray databases, using protein solutions. Accordingly, this method eliminates the bothersome question as to the possible alteration of the protein structure due to the transition from the crystalline state to the solution state. This question is particularly important for proteins such as HU protein which has some degree of internal flexibility; (b) the three-dimensional coordinates of hydrogen atoms in protein molecules can be determined with a sufficient resolution and this enables the N-H as well as C = O bond moments to be calculated. Since the NMR database of HU protein from Bacillus stearothermophilus consists of 25 models, the surface charge as well as the core dipole moments were computed for each of these structures. The results of these calculations show that the net permanent dipole moments of HU protein homodimer is approximately 500-530 D (1 D = 3.33 x 10(-30) Cm) at pH 7.5 and 600-630 D at the isoelectric point (pH 10.5). These permanent dipole moments are unusually large for a small protein of the size of 19.5 kDa. Nevertheless, the result of numerical calculations is compatible with the electro-optical observation, confirming a very large dipole moment in this protein.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Williams, K.; Chubb, C.; Huberman, E.
High resolution two dimensional get electrophoresis (2DE) and database analysis was used to establish protein expression patterns for cultured normal human mammary epithelial cells and thirteen breast cancer cell lines. The Human Breast Epithelial Cell database contains the 2DE protein patterns, including relative protein abundances, for each cell line, plus a composite pattern that contains all the common and specifically expressed proteins from all the cell lines. Significant differences in protein expression, both qualitative and quantitative, were observed not only between normal cells and tumor cells, but also among the tumor cell lines. Eight percent of the consistently detected proteinsmore » were found in significantly (P < 0.001) variable levels among the cell lines. Using a combination of immunostaining, comigration with purified protein, subcellular fractionation, and amino-terminal protein sequencing, we identified a subset of the differentially expressed proteins. These identified proteins include the cytoskeletal proteins actin, tubulin, vimentin, and cytokeratins. The cell lines can be classified into four distinct groups based on their intermediate filament protein profile. We also identified heat shock proteins; hsp27, hsp60, and hsp70 varied in abundance and in some cases in the relative phosphorylation levels among the cell lines. Finally, we identified IMP dehydrogenase in each of the cell lines, and found the levels of this enzyme in the tumor cell lines elevated 2- to 20-fold relative to the levels in normal cells.« less
Gromiha, M Michael; Anoosha, P; Huang, Liang-Tsung
2016-01-01
Protein stability is the free energy difference between unfolded and folded states of a protein, which lies in the range of 5-25 kcal/mol. Experimentally, protein stability is measured with circular dichroism, differential scanning calorimetry, and fluorescence spectroscopy using thermal and denaturant denaturation methods. These experimental data have been accumulated in the form of a database, ProTherm, thermodynamic database for proteins and mutants. It also contains sequence and structure information of a protein, experimental methods and conditions, and literature information. Different features such as search, display, and sorting options and visualization tools have been incorporated in the database. ProTherm is a valuable resource for understanding/predicting the stability of proteins and it can be accessed at http://www.abren.net/protherm/ . ProTherm has been effectively used to examine the relationship among thermodynamics, structure, and function of proteins. We describe the recent progress on the development of methods for understanding/predicting protein stability, such as (1) general trends on mutational effects on stability, (2) relationship between the stability of protein mutants and amino acid properties, (3) applications of protein three-dimensional structures for predicting their stability upon point mutations, (4) prediction of protein stability upon single mutations from amino acid sequence, and (5) prediction methods for addressing double mutants. A list of online resources for predicting has also been provided.
SSEP: secondary structural elements of proteins
Shanthi, V.; Selvarani, P.; Kiran Kumar, Ch.; Mohire, C. S.; Sekar, K.
2003-01-01
SSEP is a comprehensive resource for accessing information related to the secondary structural elements present in the 25 and 90% non-redundant protein chains. The database contains 1771 protein chains from 1670 protein structures and 6182 protein chains from 5425 protein structures in 25 and 90% non-redundant protein chains, respectively. The current version provides information about the α-helical segments and β-strand fragments of varying lengths. In addition, it also contains the information about 310-helix, β- and ν-turns and hairpin loops. The free graphics program RASMOL has been interfaced with the search engine to visualize the three-dimensional structures of the user queried secondary structural fragment. The database is updated regularly and is available through Bioinformatics web server at http://cluster.physics.iisc.ernet.in/ssep/ or http://144.16.71.148/ssep/. PMID:12824336
Cloning and characterization of two novel DNases from Streptococcus pyogenes.
Hasegawa, Tadao; Torii, Keizo; Hashikawa, Shinnosuke; Iinuma, Yoshitsugu; Ohta, Michio
2002-06-01
The proteins in the culture supernatant (exoproteins) from Streptococcus pyogenes serotype M1 were separated by two-dimensional gel electrophoresis, and their N-terminal amino acid sequences were determined. The amino acid sequences were compared to sequences in the S. pyogenes genome database. The coding sequence showed similarity to sequences of two genes, mf2-v ( mf2 variant) and mf3, which had sequence similarity to genes encoding mitogenic factor (MF); MF has DNase activity. The recombinant genes were expressed in Escherichia coli and the proteins were synthesized. Mf2-v and Mf3 had DNase activity. The activity of Mf2-v was localized to the C-terminal half of the protein. The mf3 gene was shown to be present in most clinically isolated strains of S. pyogenes tested, and the mf2gene was detected in 20% of the isolates. The products of the mf2 and mf3 genes in clinically isolated S. pyogenes strains were thus shown to be DNases.
Purification and proteomic analysis of plant plasma membranes.
Alexandersson, Erik; Gustavsson, Niklas; Bernfur, Katja; Karlsson, Adine; Kjellbom, Per; Larsson, Christer
2008-01-01
All techniques needed for proteomic analyses of plant plasma membranes are described in detail, from isolation of plasma membranes to protein identification by mass spectrometry (MS). Plasma membranes are isolated by aqueous two-phase partitioning yielding vesicles with a cytoplasmic side-in orientation and a purity of about 95%. These vesicles are turned inside-out by treatment with Brij 58, which removes soluble contaminating proteins enclosed in the vesicles as well as loosely attached proteins. The final plasma membrane preparation thus retains all integral proteins and many peripheral proteins. Proteins are separated by one-dimensional sodium dodecyl sulphate-polyacrylamide gel electrophoresis (SDS-PAGE), and protein bands are excised and digested with trypsin. Peptides in tryptic digests are separated by nanoflow liquid chromatography and either fed directly into an ESI-MS or spotted onto matrix-assisted laser desorption ionization (MALDI) plates for analysis with MALDI-MS. Finally, data processing and database searching are used for protein identification to define a plasma membrane proteome.
NASA Technical Reports Server (NTRS)
Ellis, S.; Giometti, C. S.; Riley, D. A.
1985-01-01
Using 320 g rats, a two-dimensional electrophoretic analysis of muscle proteins in the soleus and EDL muscles from hindlimbs maintained load-free for 10 days is performed. Statistical analysis of the two-dimensional patterns of control and suspended groups reveals more protein alteration in the soleus muscle, with 25 protein differences, than the EDL muscle, with 9 protein differences, as a result of atrophy. Most of the soleus differences reside in minor components. It is suggested that the EDL may also show alteration in its two-dimensional protein map, even though no significant atrophy occurred in muscle wet weight. It is cautioned that strict interpretation of data must take into account possible endocrine perturbations.
Ge, Jia-Jia; Huang, Yu-Sen
2017-01-01
AIM To analyze and identify the proteomic differences between liquefied after-cataracts and normal lenses by means of liquefied chromatography-tandem mass spectrometry (LC-MS/MS). METHODS Three normal lenses and three liquefied after-cataracts were exposed to depolymerizing reagents to extract the total proteins. Protein concentrations were separated using two-dimensional gel electrophoresis (2-DE). The digitized images obtained with a GS-800 scanner were then analyzed with PDQuest7.0 software to detect the differentially-expressed protein spots. These protein spots were cut from the gel using a proteome work spot cutter and subjected to in-gel digestion with trypsin. The digested peptide separation was conducted by LC-MS/MS. RESULTS The 2-DE maps showed that lens proteins were in a pH range of 3-10 with a relative molecular weight of 21-70 kD. The relative molecular weight of the more abundant proteins was localized at 25-50 kD, and the isoelectric points were found to lie between PI 4-9. The maps also showed that the protein level within the liquefied after-cataracts was at 29 points and significantly lower than in normal lenses. The 29 points were identified by LC-MS/MS, and ten of these proteins were identified by mass spectrometry and database queries: beta-crystallin B1, glyceraldehyde-3-phosphate dehydrogenase, carbonyl reductase (NADPH) 1, cDNA FLJ55253, gamma-crystallin D, GAS2-like protein 3, sorbitol dehydrogenase, DNA FLJ60282, phosphoglycerate kinase, and filensin. CONCLUSION The level of the ten proteins may play an important role in the development of liquefied after-cataracts. PMID:28944190
Wilkins, Joanna C.; Homer, Karen A.; Beighton, David
2001-01-01
Streptococcus oralis is the predominant aciduric nonmutans streptococcus isolated from the human dentition, but the role of this organism in the initiation and progression of dental caries has yet to be established. To identify proteins that are differentially expressed by S. oralis growing under conditions of low pH, soluble cellular proteins extracted from bacteria grown in batch culture at pH 5.2 or 7.0 were analyzed by two-dimensional (2-D) gel electrophoresis. Thirty-nine proteins had altered expression at low pH; these were excised, digested with trypsin using an in-gel protocol, and further analyzed by peptide mass fingerprinting using matrix-assisted laser desorption ionization mass spectrometry. The resulting fingerprints were compared with the genomic database for Streptococcus pneumoniae, an organism that is phylogenetically closely related to S. oralis, and putative functions for the majority of these proteins were determined on the basis of functional homology. Twenty-eight proteins were up-regulated following growth at pH 5.2; these included enzymes of the glycolytic pathway (glyceraldehyde-3-phosphate dehydrogenase and lactate dehydrogenase), the polypeptide chains comprising ATP synthase, and proteins that are considered to play a role in the general stress response of bacteria, including the 60-kDa chaperone, Hsp33, and superoxide dismutase, and three distinct ABC transporters. These data identify, for the first time, gene products that may be important in the survival and proliferation of nonmutans aciduric S. oralis under conditions of low pH that are likely to be encountered by this organism in vivo. PMID:11472910
Mostaguir, Khaled; Hoogland, Christine; Binz, Pierre-Alain; Appel, Ron D
2003-08-01
The Make 2D-DB tool has been previously developed to help build federated two-dimensional gel electrophoresis (2-DE) databases on one's own web site. The purpose of our work is to extend the strength of the first package and to build a more efficient environment. Such an environment should be able to fulfill the different needs and requirements arising from both the growing use of 2-DE techniques and the increasing amount of distributed experimental data.
Back to the future: the human protein index (HPI) and the agenda for post-proteomic biology.
Anderson, N G; Matheson, A; Anderson, N L
2001-01-01
The effort to produce an index of all human proteins (the human protein index, or HPI) began twenty years ago, before the initiation of the human genome program. Because DNA sequencing technology is inherently simpler and more scalable than protein analytical technology, and because the finiteness of genomes invited a spirit of rapid conquest, the notion of genome sequencing has displaced that of protein databases in the minds of most molecular biologists for the last decade. However, now that the human genome sequence is nearing completion, a major realignment is under way that brings proteins back to the center of biological thinking. Using an influx of new and improved protein technologies--from mass spectrometry to re-engineered two-dimensional (2-D) gel systems, the original objectives of the HPI have been expanded and the time frame for its execution radically shortened. Several additional large scale technology efforts flowing from the HPI are also described.
Rodrigues, Silas Pessini; Ventura, José Aires; Zingali, R B; Fernandes, P M B
2009-01-01
A variety of sample preparation protocols for plant proteomic analysis using two-dimensional gel electrophoresis (2-DE) have been reported. However, they usually have to be adapted and further optimised for the analysis of plant species not previously studied. This work aimed to evaluate different sample preparation protocols for analysing Carica papaya L. leaf proteins through 2-DE. Four sample preparation methods were tested: (1) phenol extraction and methanol-ammonium acetate precipitation; (2) no precipitation fractionation; and the traditional trichloroacetic acid-acetone precipitation either (3) with or (4) without protein fractionation. The samples were analysed for their compatibility with SDS-PAGE (1-DE) and 2-DE. Fifteen selected protein spots were trypsinised and analysed by matrix-assisted laser desorption/ionisation time-of-flight tandem mass spectrometry (MALDI-TOF-MS/MS), followed by a protein search using the NCBInr database to accurately identify all proteins. Methods number 3 and 4 resulted in large quantities of protein with good 1-DE separation and were chosen for 2-DE analysis. However, only the TCA method without fractionation (no. 4) proved to be useful. Spot number and resolution advances were achieved, which included having an additional solubilisation step in the conventional TCA method. Moreover, most of the theoretical and experimental protein molecular weight and pI data had similar values, suggesting good focusing and, most importantly, limited protein degradation. The described sample preparation method allows the proteomic analysis of papaya leaves by 2-DE and mass spectrometry (MALDI-TOF-MS/MS). The methods presented can be a starting point for the optimisation of sample preparation protocols for other plant species.
DDRprot: a database of DNA damage response-related proteins.
Andrés-León, Eduardo; Cases, Ildefonso; Arcas, Aida; Rojas, Ana M
2016-01-01
The DNA Damage Response (DDR) signalling network is an essential system that protects the genome's integrity. The DDRprot database presented here is a resource that integrates manually curated information on the human DDR network and its sub-pathways. For each particular DDR protein, we present detailed information about its function. If involved in post-translational modifications (PTMs) with each other, we depict the position of the modified residue/s in the three-dimensional structures, when resolved structures are available for the proteins. All this information is linked to the original publication from where it was obtained. Phylogenetic information is also shown, including time of emergence and conservation across 47 selected species, family trees and sequence alignments of homologues. The DDRprot database can be queried by different criteria: pathways, species, evolutionary age or involvement in (PTM). Sequence searches using hidden Markov models can be also used.Database URL: http://ddr.cbbio.es. © The Author(s) 2016. Published by Oxford University Press.
Topology-Scaling Identification of Layered Solids and Stable Exfoliated 2D Materials.
Ashton, Michael; Paul, Joshua; Sinnott, Susan B; Hennig, Richard G
2017-03-10
The Materials Project crystal structure database has been searched for materials possessing layered motifs in their crystal structures using a topology-scaling algorithm. The algorithm identifies and measures the sizes of bonded atomic clusters in a structure's unit cell, and determines their scaling with cell size. The search yielded 826 stable layered materials that are considered as candidates for the formation of two-dimensional monolayers via exfoliation. Density-functional theory was used to calculate the exfoliation energy of each material and 680 monolayers emerge with exfoliation energies below those of already-existent two-dimensional materials. The crystal structures of these two-dimensional materials provide templates for future theoretical searches of stable two-dimensional materials. The optimized structures and other calculated data for all 826 monolayers are provided at our database (https://materialsweb.org).
Raftery, Mark J; Saldanha, Rohit G; Geczy, Carolyn L; Kumar, Rakesh K
2003-01-01
Background Pollens are important triggers for allergic asthma and seasonal rhinitis, and proteases released by major allergenic pollens can injure airway epithelial cells in vitro. Disruption of mucosal epithelial integrity by proteases released by inhaled pollens could promote allergic sensitisation. Methods Pollen diffusates from Kentucky blue grass (Poa pratensis), rye grass (Lolium perenne) and Bermuda grass (Cynodon dactylon) were assessed for peptidase activity using a fluorogenic substrate, as well as by gelatin zymography. Following one- or two-dimensional gel electrophoresis, Coomassie-stained individual bands/spots were excised, subjected to tryptic digestion and analysed by mass spectrometry, either MALDI reflectron TOF or microcapillary liquid chromatography MS-MS. Database searches were used to identify allergens and other plant proteins in pollen diffusates. Results All pollen diffusates tested exhibited peptidase activity. Gelatin zymography revealed high Mr proteolytic activity at ~ 95,000 in all diffusates and additional proteolytic bands in rye and Bermuda grass diffusates, which appeared to be serine proteases on the basis of inhibition studies. A proteolytic band at Mr ~ 35,000 in Bermuda grass diffusate, which corresponded to an intense band detected by Western blotting using a monoclonal antibody to the timothy grass (Phleum pratense) group 1 allergen Phl p 1, was identified by mass spectrometric analysis as the group 1 allergen Cyn d 1. Two-dimensional analysis similarly demonstrated proteolytic activity corresponding to protein spots identified as Cyn d 1. Conclusion One- and two-dimensional electrophoretic separation, combined with analysis by mass spectrometry, is useful for rapid determination of the identities of pollen proteins. A component of the proteolytic activity in Bermuda grass diffusate is likely to be related to the allergen Cyn d 1. PMID:14577842
Proteomic analysis of pollination-induced corolla senescence in petunia.
Bai, Shuangyi; Willard, Belinda; Chapin, Laura J; Kinter, Michael T; Francis, David M; Stead, Anthony D; Jones, Michelle L
2010-02-01
Senescence represents the last phase of petal development during which macromolecules and organelles are degraded and nutrients are recycled to developing tissues. To understand better the post-transcriptional changes regulating petal senescence, a proteomic approach was used to profile protein changes during the senescence of Petuniaxhybrida 'Mitchell Diploid' corollas. Total soluble proteins were extracted from unpollinated petunia corollas at 0, 24, 48, and 72 h after flower opening and at 24, 48, and 72 h after pollination. Two-dimensional gel electrophoresis (2-DE) was used to identify proteins that were differentially expressed in non-senescing (unpollinated) and senescing (pollinated) corollas, and image analysis was used to determine which proteins were up- or down-regulated by the experimentally determined cut-off of 2.1-fold for P <0.05. One hundred and thirty-three differentially expressed protein spots were selected for sequencing. Liquid chromatography-tandem mass spectrometry (LC-MS/MS) was used to determine the identity of these proteins. Searching translated EST databases and the NCBI non-redundant protein database, it was possible to assign a putative identification to greater than 90% of these proteins. Many of the senescence up-regulated proteins were putatively involved in defence and stress responses or macromolecule catabolism. Some proteins, not previously characterized during flower senescence, were identified, including an orthologue of the tomato abscisic acid stress ripening protein 4 (ASR4). Gene expression patterns did not always correlate with protein expression, confirming that both proteomic and genomic approaches will be required to obtain a detailed understanding of the regulation of petal senescence.
WebCSD: the online portal to the Cambridge Structural Database
Thomas, Ian R.; Bruno, Ian J.; Cole, Jason C.; Macrae, Clare F.; Pidcock, Elna; Wood, Peter A.
2010-01-01
WebCSD, a new web-based application developed by the Cambridge Crystallographic Data Centre, offers fast searching of the Cambridge Structural Database using only a standard internet browser. Search facilities include two-dimensional substructure, molecular similarity, text/numeric and reduced cell searching. Text, chemical diagrams and three-dimensional structural information can all be studied in the results browser using the efficient entry summaries and embedded three-dimensional viewer. PMID:22477776
Li, JianYuan; Liu, FuJun; Wang, HaiYan; Liu, Xin; Liu, Juan; Li, Ning; Wan, FengChun; Wang, WenTing; Zhang, ChengLin; Jin, ShaoHua; Liu, Jie; Zhu, Peng; Liu, YunXiang
2010-01-01
The mammalian spermatozoon has many cellular compartments, such as head and tail, permitting it to interact with the female reproductive tract and fertilize the egg. It acquires this fertilizing potential during transit through the epididymis, which secretes proteins that coat different sperm domains. Optimal levels of these proteins provide the spermatozoon with its ability to move to, bind to, fuse with, and penetrate the egg; otherwise male infertility results. As few human epididymal proteins have been characterized, this work was performed to generate a database of human epididymal sperm-located proteins involved in maturation. Two-dimensional gel electrophoresis of epididymal tissue and luminal fluid proteins, followed by identification using MALDI-TOF/MS or MALDI-TOF/TOF, revealed over a thousand spots in gels comprising 745 abundant nonstructural proteins, 408 in luminal fluids, of which 207 were present on spermatozoa. Antibodies raised to 619 recombinant or synthetic peptides, used in Western blots, histological sections, and washed sperm preparations to confirm antibody quality and protein expression, indicated their regional location in the epididymal epithelium and highly specific locations on washed functional spermatozoa. Sperm function tests suggested the role of some proteins in motility and protection against oxidative attack. A large database of these proteins, characterized by size, pI, chromosomal location, and function, was given a unified terminology reflecting their sperm domain location. These novel, secreted human epididymal proteins are potential targets for a posttesticular contraceptive acting to provide rapid, reversible, functional sterility in men and they are also biomarkers that could be used in noninvasive assessments of male fertility. PMID:20736409
Sano, D; Myojo, K; Omura, T
2006-01-01
Water pollution with toxic heavy metals is of growing concern because heavy metals could bring about serious problems for not only ecosystems in the water environment but also human health. Some metal removal technologies have been in practical use, but much energy and troublesome treatments for chemical wastes are required to operate these conventional technologies. In this study, heavy metal-binding proteins (HMBPs) were obtained from metal-stimulated activated sludge culture with affinity chromatography using copper ion as a ligand. Two-dimensional electrophoresis revealed that a number of proteins in activated sludge culture were recovered as HMBPs for copper ion. N-termini of five HMBPs were determined, and two of them were found to be newly discovered proteins for which no amino acid sequences in protein databases were retrieved at more than 80% identities. Metal-coordinating amino acids occupied 38% of residues in one of the N-terminal sequences of the newly discovered HMBPs. Since these HMBPs were expected to be stable under conditions of water and wastewater treatments, it would be possible to utilize HMBPs as novel adsorbents for heavy metal removal if mass volume of HMBPs can be obtained with protein cloning techniques.
Della Corte, Anna; Maugeri, Norma; Pampuch, Agnieszka; Cerletti, Chiara; de Gaetano, Giovanni; Rotilio, Domenico
2008-02-01
Thrombin is an agonist inducing platelet activation. We combined two-dimensional difference gel electrophoresis (2D-DIGE) and mass spectrometry (MALDI-TOF MS) to analyse differentially expressed proteins secreted from thrombin-stimulated platelets. Human washed platelets, from healthy volunteers, were stimulated with thrombin 0.5 U/ml at 37 degrees C without stirring and the secreted proteins were resolved by 2D-DIGE. By image analysis, 1094 spots were detected in the 2D gel. The spots whose mean intensity showed at least a five-fold change intensity increase or decrease in the thrombin-activated platelet gel in comparison with unstimulated control were digested by trypsin and subjected to MALDI-TOF MS analysis. Peptides from mass spectra of in-gel digest samples were matched against available databases, using the Mascot search engine (Matrix Science) for peptide mass fingerprint. In the activated platelet secretome, transferrin, glutathione-transferase, WD repeat protein, ER-60, thrombospondin-1 precursor and thrombospondin were the most abundant. Also lamin A, a nuclear protein, not previously identified in platelets, appeared to be released. The novel strategy to combine 2D-DIGE with MALDI-TOF MS is a useful approach for a quantitative analysis of the effect of thrombin on the secretome profile of human platelets.
2012-01-01
Background Proteins that are associated with hypertension may be identified by comparing the 2-dimensional gel electrophoresis (2-DE) profiles of the sera of spontaneously hypertensive rats (SHR) with those generated from normotensive Spraque-Dawley rats (SDR). Results Five proteins of high abundance were found to be significantly altered when the 2-DE serum profiles of the SHR were compared to those that were similarly generated from the SDR. Analysis by mass spectrometry and database search identified the proteins as retinol binding protein 4, complement C3, albumin (19.9 kDa fragment), alpha1 macroglobulin and alpha1 antiproteinase, which are all known to be associated with hypertension. The altered expression of the two latter proteins was found to be abrogated when similar analysis was performed on sera of the SHR that were treated with captopril. Conclusion Our data suggests that serum alpha1 macroglobulin and alpha1 antiproteinase are potentially useful complementary biomolecular indicators for monitoring of hypertension. PMID:22416803
Proteomic analysis of the Theileria annulata schizont
Witschi, M.; Xia, D.; Sanderson, S.; Baumgartner, M.; Wastling, J.M.; Dobbelaere, D.A.E.
2013-01-01
The apicomplexan parasite, Theileria annulata, is the causative agent of tropical theileriosis, a devastating lymphoproliferative disease of cattle. The schizont stage transforms bovine leukocytes and provides an intriguing model to study host/pathogen interactions. The genome of T. annulata has been sequenced and transcriptomic data are rapidly accumulating. In contrast, little is known about the proteome of the schizont, the pathogenic, transforming life cycle stage of the parasite. Using one-dimensional (1-D) gel LC-MS/MS, a proteomic analysis of purified T. annulata schizonts was carried out. In whole parasite lysates, 645 proteins were identified. Proteins with transmembrane domains (TMDs) were under-represented and no proteins with more than four TMDs could be detected. To tackle this problem, Triton X-114 treatment was applied, which facilitates the extraction of membrane proteins, followed by 1-D gel LC-MS/MS. This resulted in the identification of an additional 153 proteins. Half of those had one or more TMD and 30 proteins with more than four TMDs were identified. This demonstrates that Triton X-114 treatment can provide a valuable additional tool for the identification of new membrane proteins in proteomic studies. With two exceptions, all proteins involved in glycolysis and the citric acid cycle were identified. For at least 29% of identified proteins, the corresponding transcripts were not present in the existing expressed sequence tag databases. The proteomics data were integrated into the publicly accessible database resource at EuPathDB (www.eupathdb.org) so that mass spectrometry-based protein expression evidence for T. annulata can be queried alongside transcriptional and other genomics data available for these parasites. PMID:23178997
Rigidity of transmembrane proteins determines their cluster shape
NASA Astrophysics Data System (ADS)
Jafarinia, Hamidreza; Khoshnood, Atefeh; Jalali, Mir Abbas
2016-01-01
Protein aggregation in cell membrane is vital for the majority of biological functions. Recent experimental results suggest that transmembrane domains of proteins such as α -helices and β -sheets have different structural rigidities. We use molecular dynamics simulation of a coarse-grained model of protein-embedded lipid membranes to investigate the mechanisms of protein clustering. For a variety of protein concentrations, our simulations under thermal equilibrium conditions reveal that the structural rigidity of transmembrane domains dramatically affects interactions and changes the shape of the cluster. We have observed stable large aggregates even in the absence of hydrophobic mismatch, which has been previously proposed as the mechanism of protein aggregation. According to our results, semiflexible proteins aggregate to form two-dimensional clusters, while rigid proteins, by contrast, form one-dimensional string-like structures. By assuming two probable scenarios for the formation of a two-dimensional triangular structure, we calculate the lipid density around protein clusters and find that the difference in lipid distribution around rigid and semiflexible proteins determines the one- or two-dimensional nature of aggregates. It is found that lipids move faster around semiflexible proteins than rigid ones. The aggregation mechanism suggested in this paper can be tested by current state-of-the-art experimental facilities.
Martínez-Castilla, León P.; Rodríguez-Sotres, Rogelio
2010-01-01
Background Despite the remarkable progress of bioinformatics, how the primary structure of a protein leads to a three-dimensional fold, and in turn determines its function remains an elusive question. Alignments of sequences with known function can be used to identify proteins with the same or similar function with high success. However, identification of function-related and structure-related amino acid positions is only possible after a detailed study of every protein. Folding pattern diversity seems to be much narrower than sequence diversity, and the amino acid sequences of natural proteins have evolved under a selective pressure comprising structural and functional requirements acting in parallel. Principal Findings The approach described in this work begins by generating a large number of amino acid sequences using ROSETTA [Dantas G et al. (2003) J Mol Biol 332:449–460], a program with notable robustness in the assignment of amino acids to a known three-dimensional structure. The resulting sequence-sets showed no conservation of amino acids at active sites, or protein-protein interfaces. Hidden Markov models built from the resulting sequence sets were used to search sequence databases. Surprisingly, the models retrieved from the database sequences belonged to proteins with the same or a very similar function. Given an appropriate cutoff, the rate of false positives was zero. According to our results, this protocol, here referred to as Rd.HMM, detects fine structural details on the folding patterns, that seem to be tightly linked to the fitness of a structural framework for a specific biological function. Conclusion Because the sequence of the native protein used to create the Rd.HMM model was always amongst the top hits, the procedure is a reliable tool to score, very accurately, the quality and appropriateness of computer-modeled 3D-structures, without the need for spectroscopy data. However, Rd.HMM is very sensitive to the conformational features of the models' backbone. PMID:20830209
Resource for structure related information on transmembrane proteins
NASA Astrophysics Data System (ADS)
Tusnády, Gábor E.; Simon, István
Transmembrane proteins are involved in a wide variety of vital biological processes including transport of water-soluble molecules, flow of information and energy production. Despite significant efforts to determine the structures of these proteins, only a few thousand solved structures are known so far. Here, we review the various resources for structure-related information on these types of proteins ranging from the 3D structure to the topology and from the up-to-date databases to the various Internet sites and servers dealing with structure prediction and structure analysis. Abbreviations: 3D, three dimensional; PDB, Protein Data Bank; TMP, transmembrane protein.
Simple Logic for Big Problems: An Inside Look at Relational Databases.
ERIC Educational Resources Information Center
Seba, Douglas B.; Smith, Pat
1982-01-01
Discusses database design concept termed "normalization" (process replacing associations between data with associations in two-dimensional tabular form) which results in formation of relational databases (they are to computers what dictionaries are to spoken languages). Applications of the database in serials control and complex systems…
Proteomic analysis of early phase of conidia germination in Aspergillus nidulans.
Oh, Young Taek; Ahn, Chun-Seob; Kim, Jeong Geun; Ro, Hyeon-Su; Lee, Chang-Won; Kim, Jae Won
2010-03-01
In order to investigate proteins involved in early phase of conidia germination, proteomic analysis was performed using two-dimensional gel electrophoresis (2D-GE) in conjunction with MALDI-TOF mass spectrometry (MS). The expression levels of 241 proteins varied quantitatively with statistical significance (P<0.05) at the early phase of the germination stage. Out of these 57 were identified by MALDI-TOF MS. Through classification of physiological functions from Conserved Domain Database analysis, among the identified proteins, 21, 13, and 6 proteins were associated with energy metabolism, protein synthesis, and protein folding process, respectively. Interestingly, eight proteins, which are involved in detoxification of reactive oxygen species (ROS) including catalase A, thioredoxin reductase, and mitochondrial peroxiredoxin, were also identified. The expression levels of the genes were further confirmed using Northern blot and reverse transcriptase (RT)-PCR analyses. This study represents the first proteomic analysis of early phase of conidia germination and will contribute to a better understanding of the molecular events involved in conidia germination process. Copyright (c) 2009 Elsevier Inc. All rights reserved.
Kihara, Daisuke; Sael, Lee; Chikhi, Rayan; Esquivel-Rodriguez, Juan
2011-09-01
The tertiary structures of proteins have been solved in an increasing pace in recent years. To capitalize the enormous efforts paid for accumulating the structure data, efficient and effective computational methods need to be developed for comparing, searching, and investigating interactions of protein structures. We introduce the 3D Zernike descriptor (3DZD), an emerging technique to describe molecular surfaces. The 3DZD is a series expansion of mathematical three-dimensional function, and thus a tertiary structure is represented compactly by a vector of coefficients of terms in the series. A strong advantage of the 3DZD is that it is invariant to rotation of target object to be represented. These two characteristics of the 3DZD allow rapid comparison of surface shapes, which is sufficient for real-time structure database screening. In this article, we review various applications of the 3DZD, which have been recently proposed.
Towards building a team of intelligent robots
NASA Technical Reports Server (NTRS)
Varanasi, Murali R.; Mehrotra, R.
1987-01-01
Topics addressed include: collision-free motion planning of multiple robot arms; two-dimensional object recognition; and pictorial databases (storage and sharing of the representations of three-dimensional objects).
Dynamic changes in proteins during apple (Malus x domestica) fruit ripening and storage
Shi, Yun; Jiang, Li; Zhang, Li; Kang, Ruoyi; Yu, Zhifang
2014-01-01
A proteomic study, using two-dimensional polyacrylamide gel electrophoresis and matrix-assisted laser desorption/ionization time-of-flight/time-of-flight, was conducted in apple fruit (cv. ‘Golden Delicious’) starting at 10 days prior to harvest through 50 days in storage. Total protein was extracted using a phenol/sodium dodecyl sulfate protocol. More than 400 protein spots were detected in each gel and 55 differentially expressed proteins (p<0.05) were subjected to matrix-assisted laser desorption/ionization time-of-flight/time-of-flight analysis. Fifty-three of these proteins were finally identified using an apple expressed sequence tag database downloaded from Genome Database for Rosaceae and placed into six categories. The categories and the percentage of proteins placed in each category were stress response and defense (49.0%), energy and metabolism (34.0%), fruit ripening and senescence (5.6%), signal transduction (3.8%), cell structure (3.8%) and protein synthesis (3.8%). Proteins involved in several multiple metabolic pathways, including glycolysis, pentose–phosphate pathway, anti-oxidative systems, photosynthesis and cell wall synthesis, were downregulated, especially during the climacteric burst in respiration and during the senescent stages of fruit development. Proteins classified as allergens or involved in cell wall degradation were upregulated during the ripening process. Some protein spots exhibited a mixed pattern (increasing to maximal abundance followed by a decrease), such as 1-aminocyclopropane-1-carboxylate oxidase, L-ascorbate peroxidase and abscisic acid response proteins. The identification of differentially expressed proteins associated with physiological processes identified in the current study provides a baseline of information for understanding the metabolic processes and regulatory mechanisms that occur in climacteric apple fruit during ripening and senescence. PMID:26504530
Swetha, Rayapadi G; Kala Sekar, Dinesh Kumar; Ramaiah, Sudha; Anbarasu, Anand; Sekar, Kanagaraj
2014-12-01
Haemophilus influenzae (H. Influenzae) is the causative agent of pneumonia, bacteraemia and meningitis. The organism is responsible for large number of deaths in both developed and developing countries. Even-though the first bacterial genome to be sequenced was that of H. Influenzae, there is no exclusive database dedicated for H. Influenzae. This prompted us to develop the Haemophilus influenzae Genome Database (HIGDB). All data of HIGDB are stored and managed in MySQL database. The HIGDB is hosted on Solaris server and developed using PERL modules. Ajax and JavaScript are used for the interface development. The HIGDB contains detailed information on 42,741 proteins, 18,077 genes including 10 whole genome sequences and also 284 three dimensional structures of proteins of H. influenzae. In addition, the database provides "Motif search" and "GBrowse". The HIGDB is freely accessible through the URL: http://bioserver1.physics.iisc.ernet.in/HIGDB/. The HIGDB will be a single point access for bacteriological, clinical, genomic and proteomic information of H. influenzae. The database can also be used to identify DNA motifs within H. influenzae genomes and to compare gene or protein sequences of a particular strain with other strains of H. influenzae. Copyright © 2014 Elsevier Ltd. All rights reserved.
USDA-ARS?s Scientific Manuscript database
Flour proteins from the US bread wheat Butte 86 were extracted in 0.5% SDS using a two-step procedure with and without sonication and further separated by size exclusion chromatography into monomeric and polymeric fractions. Proteins in each fraction were analyzed by quantitative two-dimensional gel...
Takashima, S
2001-04-05
The large dipole moment of globular proteins has been well known because of the detailed studies using dielectric relaxation and electro-optical methods. The search for the origin of these dipolemoments, however, must be based on the detailed knowledge on protein structure with atomic resolutions. At present, we have two sources of information on the structure of protein molecules: (1) x-ray databases obtained in crystalline state; (2) NMR databases obtained in solution state. While x-ray databases consist of only one model, NMR databases, because of the fluctuation of the protein folding in solution, consist of a number of models, thus enabling the computation of dipole moment repeated for all these models. The aim of this work, using these databases, is the detailed investigation on the interdependence between the structure and dipole moment of protein molecules. The dipole moment of protein molecules has roughly two components: one dipole moment is due to surface charges and the other, core dipole moment, is due to polar groups such as N--H and C==O bonds. The computation of surface charge dipole moment consists of two steps: (A) calculation of the pK shifts of charged groups for electrostatic interactions and (B) calculation of the dipole moment using the pK corrected for electrostatic shifts. The dipole moments of several proteins were computed using both NMR and x-ray databases. The dipole moments of these two sets of calculations are, with a few exceptions, in good agreement with one another and also with measured dipole moments.
Proteome analysis of the fungus Aspergillus carbonarius under ochratoxin A producing conditions.
Crespo-Sempere, A; Gil, J V; Martínez-Culebras, P V
2011-06-30
Aspergillus carbonarius is an important ochratoxin A producing fungus that is responsible for mycotoxin contamination of grapes and wine. In this study, the proteomes of highly (W04-40) and weakly (W04-46) OTA-producing A. carbonarius strains were compared to identify proteins that may be involved in OTA biosynthesis. Protein samples were extracted from two biological replicates and subjected to two dimensional gel electrophoresis analysis and mass spectrometry. Expression profile comparison (PDQuest software), revealed 21 differential spots that were statistically significant and showed a two-fold change in expression, or greater. Among these, nine protein spots were identified by MALDI-MS/MS and MASCOT database and twelve remain unidentified. Of the identified proteins, seven showed a higher expression in strain W04-40 (high OTA producer) and two in strain W04-46 (low OTA producer). Some of the identified amino acid sequences shared homology with proteins involved in regulation, amino acid metabolism, oxidative stress and sporulation. It is worth noting the presence of a protein with 126.5 fold higher abundance in strain W04-40 showing homology with protein CipC, a protein with unknown function related with pathogenesis and mycotoxin production by some authors. Variations in protein expression were also further investigated at the mRNA level by real-time PCR analysis. The mRNA expression levels from three identified proteins including CipC showed correlation with protein expression levels. This study represents the first proteomic analysis for a comparison of two A. carbonarius strains with different OTA production and will contribute to a better understanding of the molecular events involved in OTA biosynthesis. Copyright © 2011 Elsevier B.V. All rights reserved.
Knowledge Discovery in Variant Databases Using Inductive Logic Programming
Nguyen, Hoan; Luu, Tien-Dao; Poch, Olivier; Thompson, Julie D.
2013-01-01
Understanding the effects of genetic variation on the phenotype of an individual is a major goal of biomedical research, especially for the development of diagnostics and effective therapeutic solutions. In this work, we describe the use of a recent knowledge discovery from database (KDD) approach using inductive logic programming (ILP) to automatically extract knowledge about human monogenic diseases. We extracted background knowledge from MSV3d, a database of all human missense variants mapped to 3D protein structure. In this study, we identified 8,117 mutations in 805 proteins with known three-dimensional structures that were known to be involved in human monogenic disease. Our results help to improve our understanding of the relationships between structural, functional or evolutionary features and deleterious mutations. Our inferred rules can also be applied to predict the impact of any single amino acid replacement on the function of a protein. The interpretable rules are available at http://decrypthon.igbmc.fr/kd4v/. PMID:23589683
Knowledge discovery in variant databases using inductive logic programming.
Nguyen, Hoan; Luu, Tien-Dao; Poch, Olivier; Thompson, Julie D
2013-01-01
Understanding the effects of genetic variation on the phenotype of an individual is a major goal of biomedical research, especially for the development of diagnostics and effective therapeutic solutions. In this work, we describe the use of a recent knowledge discovery from database (KDD) approach using inductive logic programming (ILP) to automatically extract knowledge about human monogenic diseases. We extracted background knowledge from MSV3d, a database of all human missense variants mapped to 3D protein structure. In this study, we identified 8,117 mutations in 805 proteins with known three-dimensional structures that were known to be involved in human monogenic disease. Our results help to improve our understanding of the relationships between structural, functional or evolutionary features and deleterious mutations. Our inferred rules can also be applied to predict the impact of any single amino acid replacement on the function of a protein. The interpretable rules are available at http://decrypthon.igbmc.fr/kd4v/.
MMDB: Entrez’s 3D-structure database
Wang, Yanli; Anderson, John B.; Chen, Jie; Geer, Lewis Y.; He, Siqian; Hurwitz, David I.; Liebert, Cynthia A.; Madej, Thomas; Marchler, Gabriele H.; Marchler-Bauer, Aron; Panchenko, Anna R.; Shoemaker, Benjamin A.; Song, James S.; Thiessen, Paul A.; Yamashita, Roxanne A.; Bryant, Stephen H.
2002-01-01
Three-dimensional structures are now known within many protein families and it is quite likely, in searching a sequence database, that one will encounter a homolog with known structure. The goal of Entrez’s 3D-structure database is to make this information, and the functional annotation it can provide, easily accessible to molecular biologists. To this end Entrez’s search engine provides three powerful features. (i) Sequence and structure neighbors; one may select all sequences similar to one of interest, for example, and link to any known 3D structures. (ii) Links between databases; one may search by term matching in MEDLINE, for example, and link to 3D structures reported in these articles. (iii) Sequence and structure visualization; identifying a homolog with known structure, one may view molecular-graphic and alignment displays, to infer approximate 3D structure. In this article we focus on two features of Entrez’s Molecular Modeling Database (MMDB) not described previously: links from individual biopolymer chains within 3D structures to a systematic taxonomy of organisms represented in molecular databases, and links from individual chains (and compact 3D domains within them) to structure neighbors, other chains (and 3D domains) with similar 3D structure. MMDB may be accessed at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Structure. PMID:11752307
Mining and Indexing Graph Databases
ERIC Educational Resources Information Center
Yuan, Dayu
2013-01-01
Graphs are widely used to model structures and relationships of objects in various scientific and commercial fields. Chemical molecules, proteins, malware system-call dependencies and three-dimensional mechanical parts are all modeled as graphs. In this dissertation, we propose to mine and index those graph data to enable fast and scalable search.…
Li, Chun-Hong; Zuo, Hua-Li; Zhang, Qian; Wang, Feng-Qin; Hu, Yuan-Jia; Qian, Zheng-Ming; Li, Wen-Jia; Xia, Zhi-Ning; Yang, Feng-Qing
2017-01-01
Background: As one of the bioactive components in Cordyceps sinensis (CS), proteins were rarely used as index components to study the correlation between the protein components and producing areas of natural CS. Objective: Protein components of 26 natural CS samples produced in Qinghai, Tibet, and Sichuan provinces were analyzed and compared to investigate the relationship among 26 different producing areas. Materials and Methods: Proteins from 26 different producing areas were extracted by Tris-HCl buffer with Triton X-100, and separated using sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) and two-dimensional electrophoresis (2-DE). Results: The SDS-PAGE results indicated that the number of protein bands and optical density curves of proteins in 26 CS samples was a bit different. However, the 2-DE results showed that the numbers and abundance of protein spots in protein profiles of 26 samples were obviously different and showed certain association with producing areas. Conclusions: Based on the expression values of matched protein spots, 26 batches of CS samples can be divided into two main categories (Tibet and Qinghai) by hierarchical cluster analysis. SUMMARY The number of protein bands and optical density curves of proteins in 26 Cordyceps sinensis samples were a bit different on the sodium dodecyl sulfate-polyacrylamide gel electrophoresis protein profilesNumbers and abundance of protein spots in protein profiles of 26 samples were obvious different on two-dimensional electrophoresis mapsTwenty-six different producing areas of natural Cordyceps sinensis samples were divided into two main categories (Tibet and Qinghai) by Hierarchical cluster analysis based on the values of matched protein spots. Abbreviations Used: SDS-PAGE: Sodium dodecyl sulfate polyacrylamide gel electrophoresis, 2-DE: Two-dimensional electrophoresis, Cordyceps sinensis: CS, TCMs: Traditional Chinese medicines PMID:28250651
Li, Chun-Hong; Zuo, Hua-Li; Zhang, Qian; Wang, Feng-Qin; Hu, Yuan-Jia; Qian, Zheng-Ming; Li, Wen-Jia; Xia, Zhi-Ning; Yang, Feng-Qing
2017-01-01
As one of the bioactive components in Cordyceps sinensis (CS), proteins were rarely used as index components to study the correlation between the protein components and producing areas of natural CS. Protein components of 26 natural CS samples produced in Qinghai, Tibet, and Sichuan provinces were analyzed and compared to investigate the relationship among 26 different producing areas. Proteins from 26 different producing areas were extracted by Tris-HCl buffer with Triton X-100, and separated using sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) and two-dimensional electrophoresis (2-DE). The SDS-PAGE results indicated that the number of protein bands and optical density curves of proteins in 26 CS samples was a bit different. However, the 2-DE results showed that the numbers and abundance of protein spots in protein profiles of 26 samples were obviously different and showed certain association with producing areas. Based on the expression values of matched protein spots, 26 batches of CS samples can be divided into two main categories (Tibet and Qinghai) by hierarchical cluster analysis. The number of protein bands and optical density curves of proteins in 26 Cordyceps sinensis samples were a bit different on the sodium dodecyl sulfate-polyacrylamide gel electrophoresis protein profilesNumbers and abundance of protein spots in protein profiles of 26 samples were obvious different on two-dimensional electrophoresis mapsTwenty-six different producing areas of natural Cordyceps sinensis samples were divided into two main categories (Tibet and Qinghai) by Hierarchical cluster analysis based on the values of matched protein spots. Abbreviations Used : SDS-PAGE: Sodium dodecyl sulfate polyacrylamide gel electrophoresis, 2-DE: Two-dimensional electrophoresis, Cordyceps sinensis : CS, TCMs: Traditional Chinese medicines.
Jian, Jhih-Wei; Elumalai, Pavadai; Pitti, Thejkiran; Wu, Chih Yuan; Tsai, Keng-Chang; Chang, Jeng-Yih; Peng, Hung-Pin; Yang, An-Suei
2016-01-01
Predicting ligand binding sites (LBSs) on protein structures, which are obtained either from experimental or computational methods, is a useful first step in functional annotation or structure-based drug design for the protein structures. In this work, the structure-based machine learning algorithm ISMBLab-LIG was developed to predict LBSs on protein surfaces with input attributes derived from the three-dimensional probability density maps of interacting atoms, which were reconstructed on the query protein surfaces and were relatively insensitive to local conformational variations of the tentative ligand binding sites. The prediction accuracy of the ISMBLab-LIG predictors is comparable to that of the best LBS predictors benchmarked on several well-established testing datasets. More importantly, the ISMBLab-LIG algorithm has substantial tolerance to the prediction uncertainties of computationally derived protein structure models. As such, the method is particularly useful for predicting LBSs not only on experimental protein structures without known LBS templates in the database but also on computationally predicted model protein structures with structural uncertainties in the tentative ligand binding sites. PMID:27513851
PATRIC, the bacterial bioinformatics database and analysis resource.
Wattam, Alice R; Abraham, David; Dalay, Oral; Disz, Terry L; Driscoll, Timothy; Gabbard, Joseph L; Gillespie, Joseph J; Gough, Roger; Hix, Deborah; Kenyon, Ronald; Machi, Dustin; Mao, Chunhong; Nordberg, Eric K; Olson, Robert; Overbeek, Ross; Pusch, Gordon D; Shukla, Maulik; Schulman, Julie; Stevens, Rick L; Sullivan, Daniel E; Vonstein, Veronika; Warren, Andrew; Will, Rebecca; Wilson, Meredith J C; Yoo, Hyun Seung; Zhang, Chengdong; Zhang, Yan; Sobral, Bruno W
2014-01-01
The Pathosystems Resource Integration Center (PATRIC) is the all-bacterial Bioinformatics Resource Center (BRC) (http://www.patricbrc.org). A joint effort by two of the original National Institute of Allergy and Infectious Diseases-funded BRCs, PATRIC provides researchers with an online resource that stores and integrates a variety of data types [e.g. genomics, transcriptomics, protein-protein interactions (PPIs), three-dimensional protein structures and sequence typing data] and associated metadata. Datatypes are summarized for individual genomes and across taxonomic levels. All genomes in PATRIC, currently more than 10,000, are consistently annotated using RAST, the Rapid Annotations using Subsystems Technology. Summaries of different data types are also provided for individual genes, where comparisons of different annotations are available, and also include available transcriptomic data. PATRIC provides a variety of ways for researchers to find data of interest and a private workspace where they can store both genomic and gene associations, and their own private data. Both private and public data can be analyzed together using a suite of tools to perform comparative genomic or transcriptomic analysis. PATRIC also includes integrated information related to disease and PPIs. All the data and integrated analysis and visualization tools are freely available. This manuscript describes updates to the PATRIC since its initial report in the 2007 NAR Database Issue.
Comparative proteomics of leaves found at different stem positions of maize seedlings.
Chen, Yi-Bo; Wang, Dan; Ge, Xuan-Liang; Zhao, Biligen-Gaowa; Wang, Xu-Chu; Wang, Bai-Chen
2016-07-01
To better understand the roles of leaves at different stem positions during plant development, we measured the physiological properties of leaves 1-4 on maize seedling stems, and performed a proteomics study to investigate the differences in protein expression in the four leaves using two-dimensional difference gel electrophoresis and tandem mass spectrometry in conjunction with database searching. A total of 167 significantly differentially expressed protein spots were found and identified. Of these, 35% are involved in photosynthesis. By further analysis of the data, we speculated that in leaf 1 the seedling has started to transition from a heterotroph to an autotroph, development of leaf 2 is the time at which the seedling fully transitions from a heterotroph to an autotroph, and leaf maturity was reached only with fully expanded leaves 3 and 4, although there were still some protein expression differences in the two leaves. These results suggest that the different leaves make different contributions to maize seedling growth via modulation of the expression of the photosynthetic proteins. Together, these results provide insight into the roles of the different maize leaves as the plant develops from a heterotroph to an autotroph. Copyright © 2016 Elsevier GmbH. All rights reserved.
PASS2: an automated database of protein alignments organised as structural superfamilies.
Bhaduri, Anirban; Pugalenthi, Ganesan; Sowdhamini, Ramanathan
2004-04-02
The functional selection and three-dimensional structural constraints of proteins in nature often relates to the retention of significant sequence similarity between proteins of similar fold and function despite poor sequence identity. Organization of structure-based sequence alignments for distantly related proteins, provides a map of the conserved and critical regions of the protein universe that is useful for the analysis of folding principles, for the evolutionary unification of protein families and for maximizing the information return from experimental structure determination. The Protein Alignment organised as Structural Superfamily (PASS2) database represents continuously updated, structural alignments for evolutionary related, sequentially distant proteins. An automated and updated version of PASS2 is, in direct correspondence with SCOP 1.63, consisting of sequences having identity below 40% among themselves. Protein domains have been grouped into 628 multi-member superfamilies and 566 single member superfamilies. Structure-based sequence alignments for the superfamilies have been obtained using COMPARER, while initial equivalencies have been derived from a preliminary superposition using LSQMAN or STAMP 4.0. The final sequence alignments have been annotated for structural features using JOY4.0. The database is supplemented with sequence relatives belonging to different genomes, conserved spatially interacting and structural motifs, probabilistic hidden markov models of superfamilies based on the alignments and useful links to other databases. Probabilistic models and sensitive position specific profiles obtained from reliable superfamily alignments aid annotation of remote homologues and are useful tools in structural and functional genomics. PASS2 presents the phylogeny of its members both based on sequence and structural dissimilarities. Clustering of members allows us to understand diversification of the family members. The search engine has been improved for simpler browsing of the database. The database resolves alignments among the structural domains consisting of evolutionarily diverged set of sequences. Availability of reliable sequence alignments of distantly related proteins despite poor sequence identity and single-member superfamilies permit better sampling of structures in libraries for fold recognition of new sequences and for the understanding of protein structure-function relationships of individual superfamilies. PASS2 is accessible at http://www.ncbs.res.in/~faculty/mini/campass/pass2.html
Tan, Wei Miao; Lau, Seng Fong; Ajat, Mokrish; Mansor, Rozaihan; Abd Rani, Puteri Azaziah Megat; Rahmad, Norasfaliza Binti
2017-03-01
This case study is to report the proteins detected by proteomic analysis of synovial fluid from a dog diagnosed with idiopathic immune-mediated polyarthritis, and to compare it with healthy dogs. Synovial fluid was collected via arthrocentesis from a dog diagnosed with immune-mediated polyarthritis. Protein precipitation was performed on the synovial fluid, followed by isoelectric focusing and 2-dimensional gel electrophoresis. The spots on the 2-dimensional gels were analyzed using MALDI-TOF/MS. The results were then analyzed against the MASCOT database. The results from the proteomic analysis revealed an abundance of several types of immunoglobulins together with the presence of complement C4b-binding protein alpha chain. Actin and keratin were also among the proteins detected. Proteomic studies, facilitate a better understanding of the different levels of proteins expressed during disease activity. Potential disease biomarkers can aid in the diagnosis of disease, as well as help in monitoring treatment efficacy and providing prognosis for the patient. Copyright © 2017 Elsevier Inc. All rights reserved.
Representing and comparing protein structures as paths in three-dimensional space
Zhi, Degui; Krishna, S Sri; Cao, Haibo; Pevzner, Pavel; Godzik, Adam
2006-01-01
Background Most existing formulations of protein structure comparison are based on detailed atomic level descriptions of protein structures and bypass potential insights that arise from a higher-level abstraction. Results We propose a structure comparison approach based on a simplified representation of proteins that describes its three-dimensional path by local curvature along the generalized backbone of the polypeptide. We have implemented a dynamic programming procedure that aligns curvatures of proteins by optimizing a defined sum turning angle deviation measure. Conclusion Although our procedure does not directly optimize global structural similarity as measured by RMSD, our benchmarking results indicate that it can surprisingly well recover the structural similarity defined by structure classification databases and traditional structure alignment programs. In addition, our program can recognize similarities between structures with extensive conformation changes that are beyond the ability of traditional structure alignment programs. We demonstrate the applications of procedure to several contexts of structure comparison. An implementation of our procedure, CURVE, is available as a public webserver. PMID:17052359
Imin, Nijat; De Jong, Femke; Mathesius, Ulrike; van Noorden, Giel; Saeed, Nasir A; Wang, Xin-Ding; Rose, Ray J; Rolfe, Barry G
2004-07-01
Using a combination of two-dimensional gel electrophoresis (2-DE) protein mapping and mass spectrometry (MS) analysis, we have established proteome reference maps of Medicago truncatula embryogenic tissue culture cells. The cultures were generated from single protoplasts, which provided a relatively homogeneous cell population. We used these to analyze protein expression at the globular stages of somatic embryogenesis, which is the earliest morphogenetic embryonic stage. Over 3000 proteins could reproducibly be resolved over a pI range of 4-11. Three hundred and twelve protein spots were extracted from colloidal Coomassie Blue-stained 2-DE gels and analyzed by matrix-assisted laser desorption/ionization-time of flight MS analysis and tandem MS sequencing. This enabled the identification of 169 protein spots representing 128 unique gene products using a publicly available expressed sequence tag database and the MASCOT search engine. These reference maps will be valuable for the investigation of the molecular events which occur during somatic embryogenesis in M. truncatula. The proteome reference maps and supplementary materials will be available and updated for public access at http://semele.anu.edu.au/.
Casadei, Cecilia M.; Tsai, Ching-Ju; Barty, Anton; ...
2018-01-01
Previous proof-of-concept measurements on single-layer two-dimensional membrane-protein crystals performed at X-ray free-electron lasers (FELs) have demonstrated that the collection of meaningful diffraction patterns, which is not possible at synchrotrons because of radiation-damage issues, is feasible. Here, the results obtained from the analysis of a thousand single-shot, room-temperature X-ray FEL diffraction images from two-dimensional crystals of a bacteriorhodopsin mutant are reported in detail. The high redundancy in the measurements boosts the intensity signal-to-noise ratio, so that the values of the diffracted intensities can be reliably determined down to the detector-edge resolution of 4 Å. The results show that two-dimensional serial crystallography atmore » X-ray FELs is a suitable method to study membrane proteins to near-atomic length scales at ambient temperature. The method presented here can be extended to pump–probe studies of optically triggered structural changes on submillisecond timescales in two-dimensional crystals, which allow functionally relevant large-scale motions that may be quenched in three-dimensional crystals.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Casadei, Cecilia M.; Tsai, Ching-Ju; Barty, Anton
Previous proof-of-concept measurements on single-layer two-dimensional membrane-protein crystals performed at X-ray free-electron lasers (FELs) have demonstrated that the collection of meaningful diffraction patterns, which is not possible at synchrotrons because of radiation-damage issues, is feasible. Here, the results obtained from the analysis of a thousand single-shot, room-temperature X-ray FEL diffraction images from two-dimensional crystals of a bacteriorhodopsin mutant are reported in detail. The high redundancy in the measurements boosts the intensity signal-to-noise ratio, so that the values of the diffracted intensities can be reliably determined down to the detector-edge resolution of 4 Å. The results show that two-dimensional serial crystallography atmore » X-ray FELs is a suitable method to study membrane proteins to near-atomic length scales at ambient temperature. The method presented here can be extended to pump–probe studies of optically triggered structural changes on submillisecond timescales in two-dimensional crystals, which allow functionally relevant large-scale motions that may be quenched in three-dimensional crystals.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Willard, K.E.
1982-04-01
Two-dimensional gel electrophoretic patterns of human peripheral blood leukocytes from 12 patients with infectious mononucleosis were prepared by use of the ISO-DALT system. Before the two-dimensional separation, the leukocytes were purified by Ficoll-Paque gradient centrifugation and labeled overnight with (/sup 35/S) methionine. Quantitative increases in two proteins were detected in the patterns of infected leukocytes from the patients as compared with controls. Fluorescence-activated cell sorting of leukocytes from normal human peripheral blood before subsequent two-dimensional gel analysis revealed that the dramatic increase in one of these proteins (Inmono:2) could be due to shifts in the population ratios of lymphocytes, monocytes,more » and granulocytes. In contrast, the appearance in the infected leukocytes of a second protein, Inmono:1, could not be accounted for by cell-population shifts. Increased amounts of these two proteins have been found in every patient studied who had clinically detectable infectious mononucleosis. In addition, a patient who displayed symptoms of infectious mononucleosis but who did not have a positive result in the MONOSPOT test (Ortho) until three weeks after our analysis also demonstrated increased relative amounts of these proteins in his leukocyte pattern.« less
EDCs DataBank: 3D-Structure database of endocrine disrupting chemicals.
Montes-Grajales, Diana; Olivero-Verbel, Jesus
2015-01-02
Endocrine disrupting chemicals (EDCs) are a group of compounds that affect the endocrine system, frequently found in everyday products and epidemiologically associated with several diseases. The purpose of this work was to develop EDCs DataBank, the only database of EDCs with three-dimensional structures. This database was built on MySQL using the EU list of potential endocrine disruptors and TEDX list. It contains the three-dimensional structures available on PubChem, as well as a wide variety of information from different databases and text mining tools, useful for almost any kind of research regarding EDCs. The web platform was developed employing HTML, CSS and PHP languages, with dynamic contents in a graphic environment, facilitating information analysis. Currently EDCs DataBank has 615 molecules, including pesticides, natural and industrial products, cosmetics, drugs and food additives, among other low molecular weight xenobiotics. Therefore, this database can be used to study the toxicological effects of these molecules, or to develop pharmaceuticals targeting hormone receptors, through docking studies, high-throughput virtual screening and ligand-protein interaction analysis. EDCs DataBank is totally user-friendly and the 3D-structures of the molecules can be downloaded in several formats. This database is freely available at http://edcs.unicartagena.edu.co. Copyright © 2014. Published by Elsevier Ireland Ltd.
Li, Dong-dong; He, Shao-heng
2004-07-01
To analyse the total proteins in the seeds of almond (Prunus dulcis), one of the popular ingestent allergens in China, by two-dimensional electrophoresis. The total proteins of the seeds were extracted by trichloracetic acid (TCA) method, and then separated by isoelectric focusing as first dimension and SDS-PAGE as the second dimension. The spots of proteins were visualized by staining with Coomassie Brilliant Blue R-250. After analysis with software (ImageMaster 2D), 188 different proteins were detected. The isoelectric points (pI) for approximately 28% of total proteins were between 4.5-5.5, and the relative molecular mass (M(r)) of approximately 62% total proteins were between (20-25)x10(3). This was the first high-resolution, two-dimensional protein map of the seed of almond (Prunus dulcis) in China. Our finding has laid a solid foundation for further identification, characterization, gene cloning and standardization of allergenic proteins in the seed of almond (Prunus dulcis).
DIGE Analysis of Human Tissues.
Gelfi, Cecilia; Capitanio, Daniele
2018-01-01
Two-dimensional difference gel electrophoresis (2-D DIGE) is an advanced and elegant gel electrophoretic analytical tool for comparative protein assessment. It is based on two-dimensional gel electrophoresis (2-DE) separation of fluorescently labeled protein extracts. The tagging procedures are designed to not interfere with the chemical properties of proteins with respect to their pI and electrophoretic mobility, once a proper labeling protocol is followed. The two-dye or three-dye systems can be adopted and their choice depends on specific applications. Furthermore, the use of an internal pooled standard makes 2-D DIGE a highly accurate quantitative method enabling multiple protein samples to be separated on the same two-dimensional gel. The image matching and cross-gel statistical analysis generates robust quantitative results making data validation by independent technologies successful.
Protein structural similarity search by Ramachandran codes
Lo, Wei-Cheng; Huang, Po-Jung; Chang, Chih-Hung; Lyu, Ping-Chiang
2007-01-01
Background Protein structural data has increased exponentially, such that fast and accurate tools are necessary to access structure similarity search. To improve the search speed, several methods have been designed to reduce three-dimensional protein structures to one-dimensional text strings that are then analyzed by traditional sequence alignment methods; however, the accuracy is usually sacrificed and the speed is still unable to match sequence similarity search tools. Here, we aimed to improve the linear encoding methodology and develop efficient search tools that can rapidly retrieve structural homologs from large protein databases. Results We propose a new linear encoding method, SARST (Structural similarity search Aided by Ramachandran Sequential Transformation). SARST transforms protein structures into text strings through a Ramachandran map organized by nearest-neighbor clustering and uses a regenerative approach to produce substitution matrices. Then, classical sequence similarity search methods can be applied to the structural similarity search. Its accuracy is similar to Combinatorial Extension (CE) and works over 243,000 times faster, searching 34,000 proteins in 0.34 sec with a 3.2-GHz CPU. SARST provides statistically meaningful expectation values to assess the retrieved information. It has been implemented into a web service and a stand-alone Java program that is able to run on many different platforms. Conclusion As a database search method, SARST can rapidly distinguish high from low similarities and efficiently retrieve homologous structures. It demonstrates that the easily accessible linear encoding methodology has the potential to serve as a foundation for efficient protein structural similarity search tools. These search tools are supposed applicable to automated and high-throughput functional annotations or predictions for the ever increasing number of published protein structures in this post-genomic era. PMID:17716377
Proteomic analysis of differentially expressed proteins in kidneys of brain dead rabbits
Li, Ling; Li, Ning; He, Chongxiang; Huang, Wei; Fan, Xiaoli; Zhong, Zibiao; Wang, Yanfeng; Ye, Qifa
2017-01-01
A large number of previous clinical studies have reported a delayed graft function for brain dead donors, when compared with living relatives or cadaveric organ transplantations. However, there is no accurate method for the quality evaluation of kidneys from brain-dead donors. In the present study, two-dimensional gel electrophoresis and MALDI-TOF MS-based comparative proteomic analysis were conducted to profile the differentially-expressed proteins between brain death and the control group renal tissues. A total of 40 age- and sex-matched rabbits were randomly divided into donation following brain death (DBD) and control groups. Following the induction of brain death via intracranial progressive pressure, the renal function and the morphological alterations were measured 2, 6 and 8 h afterwards. The differentially expressed proteins were detected from renal histological evidence at 6 h following brain death. Although 904±19 protein spots in control groups and 916±25 in DBD groups were identified in the two-dimensional gel electrophoresis, >2-fold alterations were identified by MALDI-TOF MS and searched by NCBI database. The authors successfully acquired five downregulated proteins, these were: Prohibitin (isoform CRA_b), beta-1,3-N-acetylgalactosaminyltransferase 1, Annexin A5, superoxide dismutase (mitochondrial) and cytochrome b-c1 complex subunit 1 (mitochondrial precursor). Conversely, the other five upregulated proteins were: PRP38 pre-mRNA processing factor 38 (yeast) domain containing A, calcineurin subunit B type 1, V-type proton ATPase subunit G 1, NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 10 and peroxiredoxin-3 (mitochondrial). Immunohistochemical results revealed that the expressions of prohibitin (PHB) were gradually increased in a time-dependent manner. The results indicated that there were alterations in levels of several proteins in the kidneys of those with brain death, even if the primary function and the morphological changes were not obvious. PHB may therefore be a novel biomarker for primary quality evaluation of kidneys from brain-dead donors. PMID:28534953
Proteome analysis of Aspergillus ochraceus.
Rizwan, Muhammad; Miller, Ingrid; Tasneem, Fareeha; Böhm, Josef; Gemeiner, Manfred; Razzazi-Fazeli, Ebrahim
2010-08-01
Genome sequencing for many important fungi has begun during recent years; however, there is still some deficiency in proteome profiling of aspergilli. To obtain a comprehensive overview of proteins and their expression, a proteomic approach based on 2D gel electrophoresis and MALDI-TOF/TOF mass spectrometry was used to investigate A. ochraceus. The cell walls of fungi are exceptionally resistant to destruction, therefore two lysis protocols were tested: (1) lysis via manual grinding using liquid nitrogen, and (2) mechanical lysis via rapid agitation with glass beads using MagNalyser. Mechanical grinding with mortar and pestle using liquid nitrogen was found to be a more efficient extraction method for our purpose, resulting in extracts with higher protein content and a clear band pattern in SDS-PAGE. Two-dimensional electrophoresis gave a complex spot pattern comprising proteins of a broad range of isoelectric points and molecular masses. The most abundant spots were subjected to mass spectrometric analysis. We could identify 31 spots representing 26 proteins, most of them involved in metabolic processes and response to stress. Seventeen spots were identified by de novo sequencing due to a lack of DNA and protein database sequences of A. ochraceus. The proteins identified in our study have been reported for the first time in A. ochraceus and this represents the first proteomic approach with identification of major proteins, when the fungus was grown under submerged culture.
Genes Involved in Anaerobic Metabolism of Phenol in the Bacterium Thauera aromatica
Breinig, Sabine; Schiltz, Emile; Fuchs, Georg
2000-01-01
Genes involved in the anaerobic metabolism of phenol in the denitrifying bacterium Thauera aromatica have been studied. The first two committed steps in this metabolism appear to be phosphorylation of phenol to phenylphosphate by an unknown phosphoryl donor (“phenylphosphate synthase”) and subsequent carboxylation of phenylphosphate to 4-hydroxybenzoate under release of phosphate (“phenylphosphate carboxylase”). Both enzyme activities are strictly phenol induced. Two-dimensional gel electrophoresis allowed identification of several phenol-induced proteins. Based on N-terminal and internal amino acid sequences of such proteins, degenerate oligonucleotides were designed to identify the corresponding genes. A chromosomal DNA segment of about 14 kbp was sequenced which contained 10 genes transcribed in the same direction. These are organized in two adjacent gene clusters and include the genes coding for five identified phenol-induced proteins. Comparison with sequences in the databases revealed the following similarities: the gene products of two open reading frames (ORFs) are each similar to either the central part and N-terminal part of phosphoenolpyruvate synthases. We propose that these ORFs are components of the phenylphosphate synthase system. Three ORFs showed similarity to the ubiD gene product, 3-octaprenyl-4-hydroxybenzoate carboxy lyase; UbiD catalyzes the decarboxylation of a 4-hydroxybenzoate analogue in ubiquinone biosynthesis. Another ORF was similar to the ubiX gene product, an isoenzyme of UbiD. We propose that (some of) these four proteins are involved in the carboxylation of phenylphosphate. A 700-bp PCR product derived from one of these ORFs cross-hybridized with DNA from different Thauera and Azoarcus strains, even from those which have not been reported to grow with phenol. One ORF showed similarity to the mutT gene product, and three ORFs showed no strong similarities to sequences in the databases. Upstream of the first gene cluster, an ORF which is transcribed in the opposite direction codes for a protein highly similar to the DmpR regulatory protein of Pseudomonas putida. DmpR controls transcription of the genes of aerobic phenol metabolism, suggesting a similar regulation of anaerobic phenol metabolism by the putative regulator. PMID:11004186
PROTICdb: a web-based application to store, track, query, and compare plant proteome data.
Ferry-Dumazet, Hélène; Houel, Gwenn; Montalent, Pierre; Moreau, Luc; Langella, Olivier; Negroni, Luc; Vincent, Delphine; Lalanne, Céline; de Daruvar, Antoine; Plomion, Christophe; Zivy, Michel; Joets, Johann
2005-05-01
PROTICdb is a web-based application, mainly designed to store and analyze plant proteome data obtained by two-dimensional polyacrylamide gel electrophoresis (2-D PAGE) and mass spectrometry (MS). The purposes of PROTICdb are (i) to store, track, and query information related to proteomic experiments, i.e., from tissue sampling to protein identification and quantitative measurements, and (ii) to integrate information from the user's own expertise and other sources into a knowledge base, used to support data interpretation (e.g., for the determination of allelic variants or products of post-translational modifications). Data insertion into the relational database of PROTICdb is achieved either by uploading outputs of image analysis and MS identification software, or by filling web forms. 2-D PAGE annotated maps can be displayed, queried, and compared through a graphical interface. Links to external databases are also available. Quantitative data can be easily exported in a tabulated format for statistical analyses. PROTICdb is based on the Oracle or the PostgreSQL Database Management System and is freely available upon request at the following URL: http://moulon.inra.fr/ bioinfo/PROTICdb.
Proteomic sensitivity to dietary manipulations in rainbow trout.
Martin, S A M; Vilhelmsson, O; Médale, F; Watt, P; Kaushik, S; Houlihan, D F
2003-09-23
Changes in dietary protein sources due to substitution of fish meal by other protein sources can have metabolic consequences in farmed fish. A proteomics approach was used to study the protein profiles of livers of rainbow trout that have been fed two diets containing different proportions of plant ingredients. Both diets control (C) and soy (S) contained fish meal and plant ingredients and synthetic amino acids, but diet S had a greater proportion of soybean meal. A feeding trial was performed for 12 weeks at the end of which, growth and protein metabolism parameters were measured. Protein growth rates were not different in fish fed different diets; however, protein consumption and protein synthesis rates were higher in the fish fed the diet S. Fish fed diet S had lower efficiency of retention of synthesised protein. Ammonia excretion was increased as well as the activities of hepatic glutamate dehydrogenase and aspartate amino transferase (ASAT). No differences were found in free amino acid pools in either liver or muscle between diets. Protein extraction followed by high-resolution two-dimensional electrophoresis, coupled with gel image analysis, allowed identification and expression of hundreds of protein. Individual proteins of interest were then subjected to further analysis leading to protein identification by trypsin digest fingerprinting. During this study, approximately 800 liver proteins were analysed for expression pattern, of which 33 were found to be differentially expressed between diets C and S. Seventeen proteins were positively identified after database searching. Proteins were identified from diverse metabolic pathways, demonstrating the complex nature of gene expression responses to dietary manipulation revealed by proteomic characterisation.
Wang, Li; Cui, Jing; Hu, Dan Dan; Liu, Ruo Dan; Wang, Zhong Quan
2014-01-22
The excretory-secretory (ES) proteins of Trichinella spiralis muscle larvae (ML) come mainly from the excretory granules of the stichosome and the cuticles (membrane proteins), are directly exposed to the host's immune system, and are the main target antigens, which induce the immune responses. Although the ES proteins are the most commonly used diagnostic antigens for trichinellosis, their main disadvantage are the false negative results during the early stage of infection. The aim of this study was to identify early specific diagnostic antigens from the main components of T. spiralis muscle larval ES proteins. Two-dimensional electrophoresis (2-DE) combined with Western blot were used to screen the early diagnostic antigens from the main components of T. spiralis muscle larval ES proteins. The protein spots recognized by the sera from BALB/c mice infected with T. spiralis at 18 days post-infection (dpi) were identified by MALDI-TOF/TOF-MS and putatively annotated using GO terms obtained from the InterPro databases. The ES proteins were analyzed by 2-DE, and more than 33 protein spots were detected with molecular weight varying from 40 to 60 kDa and isoelectric point (pI) from 4 to 7. When probed with the sera from infected mice at 18 dpi, 21 protein spots were recognized and then identified, and they were characterized to correlate with five different proteins of T. spiralis, including two serine proteases, one deoxyribonuclease (DNase) II, and two kinds of trypsin. The five proteins were functionally categorized into molecular function and biological process according to GO hierarchy. 2-DE and Western blot combined with MALDI-TOF/TOF-MS were used to screen the diagnostic antigens from the main components of T. spiralis muscle larval ES proteins. The five proteins of T. spiralis identified (two serine proteases, DNase II and two kinds of trypsin) might be the early specific diagnostic antigens of trichinellosis.
Isolation and identification of peanut leaf proteins regulated by water stress.
Akkasaeng, Chutipong; Tantisuwichwong, Napaporn; Chairam, Issariya; Prakrongrak, Narumon; Jogloy, Sanun; Pathanothai, Aran
2007-05-15
Water deficits trigger signaling cascades leading to modulation of protein expression in plant tissues. Identification of peanut leaf proteins regulated by water stress provides some insights of cellular and molecular response of peanut plants to drought stress. Peanut variety Khon Kaen 4, a water-stress sensitive variety, was grown in a growth chamber under controlled environment. Water stress was imposed on day 30 after seedling emergence by withholding watering peanut plants for 6 days as compared to plants adequately supplied with water. Total protein were prepared from a leaflet of fully expanded leaf on the main stem. Proteins were separated in duplicated gels using two-dimensional gel electrophoresis and visualized by silver nitrate staining. Image analysis was performed using ImageMaster 2D Platinum 5.0 to determine proteins regulated by water stress. Molecular mass and isoelectric point of each regulated protein were used in database queries for protein identification. One protein was induced under water stress and the homologous protein was identified as Serine/threonine-protein phosphatase PP 1. Five proteins were down-regulated by water deficit. The homologous proteins were chaperone protein DNAJ, auxin-responsive protein IAA29, peroxidase 43, caffeoyl-CoA O-methyltransferase and SNF1-related protein kinase regulatory subunit beta-2. Down-regulated proteins may be associated with sensitivity of the peanut variety to water stress.
Chen, Xingguo; Fazal, Md. Abul; Dovichi, Norman J.
2007-01-01
Two-dimensional capillary electrophoresis was used for the separation of proteins and biogenic amines from the mouse AtT-20 cell line. The first-dimension capillary contained a TRIS-CHES-SDS-dextran buffer to perform capillary sieving electrophoresis, which is based on molecular weight of proteins. The second-dimension capillary contained a TRIS-CHES-SDS buffer for micel1ar electrokinetic capillary chromatography. After a 61 seconds preliminary separation, fractions from the first-dimension capillary were successively transferred to the second-dimension capillary, where they further separated by MECC. The two-dimensional separation required 60 minutes. PMID:17637850
Femtosecond X-ray Diffraction From Two-Dimensional Protein Crystals
DOE Office of Scientific and Technical Information (OSTI.GOV)
Frank, Matthias; Carlson, David B.; Hunter, Mark
2014-02-28
Here we present femtosecond x-ray diffraction patterns from two-dimensional (2-D) protein crystals using an x-ray free electron laser (XFEL). To date it has not been possible to acquire x-ray diffraction from individual 2-D protein crystals due to radiation damage. However, the intense and ultrafast pulses generated by an XFEL permits a new method of collecting diffraction data before the sample is destroyed. Utilizing a diffract-before-destroy methodology at the Linac Coherent Light Source, we observed Bragg diffraction to better than 8.5 Å resolution for two different 2-D protein crystal samples that were maintained at room temperature. These proof-of-principle results show promisemore » for structural analysis of both soluble and membrane proteins arranged as 2-D crystals without requiring cryogenic conditions or the formation of three-dimensional crystals.« less
TFBSshape: a motif database for DNA shape features of transcription factor binding sites.
Yang, Lin; Zhou, Tianyin; Dror, Iris; Mathelier, Anthony; Wasserman, Wyeth W; Gordân, Raluca; Rohs, Remo
2014-01-01
Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein-DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone.
Two-dimensional proteome reference maps for the soybean cyst nematode Heterodera glycines
USDA-ARS?s Scientific Manuscript database
Two-dimensional electrophoresis (2-DE) reference maps of Heterodera glycines were constructed. After in-gel digestion with trypsin, 803 spots representing 426 proteins were subsequently identified by LC-MS/MS. Proteins with annotated function were further categorized by Gene Ontology. Results showed...
Quantitative Analysis of Cell Nucleus Organisation
Shiels, Carol; Adams, Niall M; Islam, Suhail A; Stephens, David A; Freemont, Paul S
2007-01-01
There are almost 1,300 entries for higher eukaryotes in the Nuclear Protein Database. The proteins' subcellular distribution patterns within interphase nuclei can be complex, ranging from diffuse to punctate or microspeckled, yet they all work together in a coordinated and controlled manner within the three-dimensional confines of the nuclear volume. In this review we describe recent advances in the use of quantitative methods to understand nuclear spatial organisation and discuss some of the practical applications resulting from this work. PMID:17676980
Comparative Protein Structure Modeling Using MODELLER
Webb, Benjamin; Sali, Andrej
2016-01-01
Comparative protein structure modeling predicts the three-dimensional structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and how to use the ModBase database of such models, and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described. PMID:27322406
Mandal, Md Siddikun Nabi; Fu, Ying; Zhang, Sheng; Ji, Wanquan
2014-12-01
Powdery mildew of wheat is caused by Blumeria graminis f. sp. tritici (Bgt). Although many wheat cultivars resistant to this disease have been developed, little is known about their resistance mechanisms. The aim of this study was to identify proteins showing changes in abundance during the resistance response of the wheat line N0308 infected by Bgt. In two-dimensional electrophoresis analyses, 45 spots on the gels showed significant changes in abundance at 24, 48, and 72 h after inoculation, as compared to non-inoculated plants. Of these 45 proteins, 44 were identified by mass spectrometry analysis using the NCBInr database of Triticum aestivum (26 spots) and closely related species in the Triticum genus (18 spots). These proteins were associated with the defense response, photosynthesis, metabolism, and other cellular processes in wheat. Most of the up-regulated proteins were identified as stress- and defense-related proteins. In particular, the product of a specific powdery mildew resistance gene (Pm3b and its homolog) and some other defense- and pathogenesis-related proteins were overexpressed. The resistance gene product mediates the immune response and coordinates other cellular processes during the resistance response to Bgt.
Glusman, Gustavo; Rose, Peter W; Prlić, Andreas; Dougherty, Jennifer; Duarte, José M; Hoffman, Andrew S; Barton, Geoffrey J; Bendixen, Emøke; Bergquist, Timothy; Bock, Christian; Brunk, Elizabeth; Buljan, Marija; Burley, Stephen K; Cai, Binghuang; Carter, Hannah; Gao, JianJiong; Godzik, Adam; Heuer, Michael; Hicks, Michael; Hrabe, Thomas; Karchin, Rachel; Leman, Julia Koehler; Lane, Lydie; Masica, David L; Mooney, Sean D; Moult, John; Omenn, Gilbert S; Pearl, Frances; Pejaver, Vikas; Reynolds, Sheila M; Rokem, Ariel; Schwede, Torsten; Song, Sicheng; Tilgner, Hagen; Valasatava, Yana; Zhang, Yang; Deutsch, Eric W
2017-12-18
The translation of personal genomics to precision medicine depends on the accurate interpretation of the multitude of genetic variants observed for each individual. However, even when genetic variants are predicted to modify a protein, their functional implications may be unclear. Many diseases are caused by genetic variants affecting important protein features, such as enzyme active sites or interaction interfaces. The scientific community has catalogued millions of genetic variants in genomic databases and thousands of protein structures in the Protein Data Bank. Mapping mutations onto three-dimensional (3D) structures enables atomic-level analyses of protein positions that may be important for the stability or formation of interactions; these may explain the effect of mutations and in some cases even open a path for targeted drug development. To accelerate progress in the integration of these data types, we held a two-day Gene Variation to 3D (GVto3D) workshop to report on the latest advances and to discuss unmet needs. The overarching goal of the workshop was to address the question: what can be done together as a community to advance the integration of genetic variants and 3D protein structures that could not be done by a single investigator or laboratory? Here we describe the workshop outcomes, review the state of the field, and propose the development of a framework with which to promote progress in this arena. The framework will include a set of standard formats, common ontologies, a common application programming interface to enable interoperation of the resources, and a Tool Registry to make it easy to find and apply the tools to specific analysis problems. Interoperability will enable integration of diverse data sources and tools and collaborative development of variant effect prediction methods.
Ripoche, Hugues; Laine, Elodie; Ceres, Nicoletta; Carbone, Alessandra
2017-01-04
The database JET2 Viewer, openly accessible at http://www.jet2viewer.upmc.fr/, reports putative protein binding sites for all three-dimensional (3D) structures available in the Protein Data Bank (PDB). This knowledge base was generated by applying the computational method JET 2 at large-scale on more than 20 000 chains. JET 2 strategy yields very precise predictions of interacting surfaces and unravels their evolutionary process and complexity. JET2 Viewer provides an online intelligent display, including interactive 3D visualization of the binding sites mapped onto PDB structures and suitable files recording JET 2 analyses. Predictions were evaluated on more than 15 000 experimentally characterized protein interfaces. This is, to our knowledge, the largest evaluation of a protein binding site prediction method. The overall performance of JET 2 on all interfaces are: Sen = 52.52, PPV = 51.24, Spe = 80.05, Acc = 75.89. The data can be used to foster new strategies for protein-protein interactions modulation and interaction surface redesign. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
NASA Astrophysics Data System (ADS)
Millan, Jaime; McMillan, Janet; Brodin, Jeff; Lee, Byeongdu; Mirkin, Chad; Olvera de La Cruz, Monica
Programmable DNA interactions represent a robust scheme to self-assemble a rich variety of tunable superlattices, where intrinsic and in some cases non-desirable nano-scale building blocks interactions are substituted for DNA hybridization events. Recent advances in synthesis has allowed the extension of this successful scheme to proteins, where DNA distribution can be tuned independently of protein shape by selectively addressing surface residues, giving rise to assembly properties in three dimensional protein-nanoparticle superlattices dependent on DNA distribution. In parallel to this advances, we introduced a scalable coarse-grained model that faithfully reproduces the previously observed co-assemblies from nanoparticles and proteins conjugates. Herein, we implement this numerical model to explain the stability of complex protein-nanoparticle binary superlattices and to elucidate experimentally inaccessible features such as protein orientation. Also, we will discuss systematic studies that highlight the role of DNA distribution and sequence on two-dimensional protein-protein and protein-nanoparticle superlattices.
sc-PDB: an annotated database of druggable binding sites from the Protein Data Bank.
Kellenberger, Esther; Muller, Pascal; Schalon, Claire; Bret, Guillaume; Foata, Nicolas; Rognan, Didier
2006-01-01
The sc-PDB is a collection of 6 415 three-dimensional structures of binding sites found in the Protein Data Bank (PDB). Binding sites were extracted from all high-resolution crystal structures in which a complex between a protein cavity and a small-molecular-weight ligand could be identified. Importantly, ligands are considered from a pharmacological and not a structural point of view. Therefore, solvents, detergents, and most metal ions are not stored in the sc-PDB. Ligands are classified into four main categories: nucleotides (< 4-mer), peptides (< 9-mer), cofactors, and organic compounds. The corresponding binding site is formed by all protein residues (including amino acids, cofactors, and important metal ions) with at least one atom within 6.5 angstroms of any ligand atom. The database was carefully annotated by browsing several protein databases (PDB, UniProt, and GO) and storing, for every sc-PDB entry, the following features: protein name, function, source, domain and mutations, ligand name, and structure. The repository of ligands has also been archived by diversity analysis of molecular scaffolds, and several chemoinformatics descriptors were computed to better understand the chemical space covered by stored ligands. The sc-PDB may be used for several purposes: (i) screening a collection of binding sites for predicting the most likely target(s) of any ligand, (ii) analyzing the molecular similarity between different cavities, and (iii) deriving rules that describe the relationship between ligand pharmacophoric points and active-site properties. The database is periodically updated and accessible on the web at http://bioinfo-pharma.u-strasbg.fr/scPDB/.
A double stain for total and oxidized proteins from two-dimensional fingerprints.
Talent, J M; Kong, Y; Gracy, R W
1998-10-01
Oxidative modification of proteins plays a major role in the etiology of aging and age-related diseases. For example, in Alzheimer's disease, although evidence points to oxidation of proteins as a causative factor in loss of cognitive abilities, it is not known which specific proteins of the brain are most susceptible to these modifications. Thus, it is of interest to identify the specific proteins which are susceptible to oxidation in vivo. Two-dimensional protein fingerprint methods offer the analytical potential for resolution of thousands of individual proteins from tissues, and the oxidized proteins can be visualized with immunological probes. Sensitive methods permit recovery and sufficient amino acid sequencing to identify these proteins. However, for such analyses it is essential to simultaneously analyze both protein content and level of oxidation. We have evaluated several approaches, identified the sources of artifacts and interferences, and developed a double-staining procedure that allows visualization and quantitation of total protein patterns as well as the specific oxidized proteins from two-dimensional protein fingerprints. The method has been applied to cells grown in culture and to tissue extracts from young and old animals. Copyright 1998 Academic Press.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gaponov, Yu.A.; Igarashi, N.; Hiraki, M.
2004-05-12
An integrated controlling system and a unified database for high throughput protein crystallography experiments have been developed. Main features of protein crystallography experiments (purification, crystallization, crystal harvesting, data collection, data processing) were integrated into the software under development. All information necessary to perform protein crystallography experiments is stored (except raw X-ray data that are stored in a central data server) in a MySQL relational database. The database contains four mutually linked hierarchical trees describing protein crystals, data collection of protein crystal and experimental data processing. A database editor was designed and developed. The editor supports basic database functions to view,more » create, modify and delete user records in the database. Two search engines were realized: direct search of necessary information in the database and object oriented search. The system is based on TCP/IP secure UNIX sockets with four predefined sending and receiving behaviors, which support communications between all connected servers and clients with remote control functions (creating and modifying data for experimental conditions, data acquisition, viewing experimental data, and performing data processing). Two secure login schemes were designed and developed: a direct method (using the developed Linux clients with secure connection) and an indirect method (using the secure SSL connection using secure X11 support from any operating system with X-terminal and SSH support). A part of the system has been implemented on a new MAD beam line, NW12, at the Photon Factory Advanced Ring for general user experiments.« less
Proteome analysis during pod, zygotic and somatic embryo maturation of Theobroma cacao.
Niemenak, Nicolas; Kaiser, Edward; Maximova, Siela N; Laremore, Tatiana; Guiltinan, Mark J
2015-05-15
Two dimensional electrophoresis and nano-LC-MS were performed in order to identify alterations in protein abundance that correlate with maturation of cacao zygotic and somatic embryos. The cacao pod proteome was also characterized during development. The recently published cacao genome sequence was used to create a predicted proteolytic fragment database. Several hundred protein spots were resolved on each tissue analysis, of which 72 variable spots were subjected to MS analysis, resulting in 49 identifications. The identified proteins represent an array of functional categories, including seed storage, stress response, photosynthesis and translation factors. The seed storage protein was strongly accumulated in cacao zygotic embryos compared to their somatic counterpart. However, sucrose treatment (60 g L(-1)) allows up-regulation of storage protein in SE. A high similarity in the profiles of acidic proteins was observed in mature zygotic and somatic embryos. Differential expression in both tissues was observed in proteins having high pI. Several proteins were detected exclusively in fruit tissues, including a chitinase and a 14-3-3 protein. We also identified a novel cacao protein related to known mabinlin type sweet storage proteins. Moreover, the specific presence of thaumatin-like protein, another sweet protein, was also detected in fruit tissue. We discuss our observed correlations between protein expression profiles, developmental stage and stress responses. Copyright © 2015 Elsevier GmbH. All rights reserved.
Merlino, Marielle; Leroy, Philippe; Chambon, Christophe; Branlard, Gérard
2009-05-01
Albumins and globulins of wheat endosperm represent 20% of total kernel protein. They are soluble proteins, mainly enzymes and proteins involved in cell functions. Two-dimensional gel immobiline electrophoresis (2DE) (pH 4-7) x SDS-Page revealed around 2,250 spots. Ninety percent of the spots were common between the very distantly related cultivars 'Opata 85' and 'Synthetic W7984', the two parents of the International Triticeae Mapping Initiative (ITMI) progeny. 'Opata' had 130 specific spots while 'Synthetic' had 96. 2DE and image analysis of the soluble proteins present in 112 recombinant inbred lines of the F9-mapped ITMI progeny enabled 120 unbiased segregating spots to be mapped on 21 wheat (Triticum aestivum L. em. Thell) chromosomes. After trypsic digestion, mapped spots were subjected to MALDI-Tof or tandem mass spectrometry for protein identification by database mining. Among the 'Opata' and 'Synthetic' spots identified, many enzymes have already been mapped in the barley and rice genomes. Multigene families of Heat Shock Proteins, beta-amylases, UDP-glucose pyrophosphorylases, peroxydases and thioredoxins were successfully identified. Although other proteins remain to be identified, some differences were found in the number of segregating proteins involved in response to stress: 11 proteins found in the modern selected cultivar 'Opata 85' as compared to 4 in the new hexaploid ;Synthetic W7984'. In addition, 'Opata' and 'Synthetic' differed in the number of proteins involved in protein folding (2 and 10, respectively). The usefulness of the mapped enzymes for future research on seed composition and characteristics is discussed.
Iimure, Takashi; Kihara, Makoto; Sato, Kazuhiro; Ogushi, Kensuke
2015-04-01
Foam stability is a key factor of beer quality for consumers and brewers. Recent beer proteome analyses have suggested that barley dimeric α-amylase inhibitor-1 (BDAI-1) and avenin-like protein-a (ALP) derived from barley are important for beer foam stability. In this study, BDAI-1 and ALP were purified from a Japanese commercial beer sample using salt precipitation and column chromatography. The purification level was verified using two-dimensional gel electrophoresis, mass spectrometry, and database searches. Purified BDAI-1 and ALP were added to a beer sample to compare the foam stability to that of a control beer sample. As a result, beer foam stability was significantly improved by BDAI-1 but not by ALP, thereby suggesting that BDAI-1 affects beer foam stability whereas ALP does not. Copyright © 2014 Elsevier Ltd. All rights reserved.
Investigations of photosynthetic light harvesting by two-dimensional electronic spectroscopy
NASA Astrophysics Data System (ADS)
Read, Elizabeth Louise
Photosynthesis begins with the harvesting of sunlight by antenna pigments, organized in a network of pigment-protein complexes that rapidly funnel energy to photochemical reaction centers. The intricate design of these systems---the widely varying structural motifs of pigment organization within proteins and protein organization within a larger, cooperative network---underlies the remarkable speed and efficiency of light harvesting. Advances in femtosecond laser spectroscopy have enabled researchers to follow light energy on its course through the energetic levels of photosynthetic systems. Now, newly-developed femtosecond two-dimensional electronic spectroscopy reveals deeper insight into the fundamental molecular interactions and dynamics that emerge in these structures. The following chapters present investigations of a number of natural light-harvesting complexes using two-dimensional electronic spectroscopy. These studies demonstrate the various types of information contained in experimental two-dimensional spectra, and they show that the technique makes it possible to probe pigment-protein complexes on the length- and time-scales relevant to their functioning. New methods are described that further extend the capabilities of two-dimensional electronic spectroscopy, for example, by independently controlling the excitation laser pulse polarizations. The experiments, coupled with theoretical simulation, elucidate spatial pathways of energy flow, unravel molecular and electronic structures, and point to potential new quantum mechanical mechanisms of light harvesting.
Protein 3D Structure and Electron Microscopy Map Retrieval Using 3D-SURFER2.0 and EM-SURFER.
Han, Xusi; Wei, Qing; Kihara, Daisuke
2017-12-08
With the rapid growth in the number of solved protein structures stored in the Protein Data Bank (PDB) and the Electron Microscopy Data Bank (EMDB), it is essential to develop tools to perform real-time structure similarity searches against the entire structure database. Since conventional structure alignment methods need to sample different orientations of proteins in the three-dimensional space, they are time consuming and unsuitable for rapid, real-time database searches. To this end, we have developed 3D-SURFER and EM-SURFER, which utilize 3D Zernike descriptors (3DZD) to conduct high-throughput protein structure comparison, visualization, and analysis. Taking an atomic structure or an electron microscopy map of a protein or a protein complex as input, the 3DZD of a query protein is computed and compared with the 3DZD of all other proteins in PDB or EMDB. In addition, local geometrical characteristics of a query protein can be analyzed using VisGrid and LIGSITE CSC in 3D-SURFER. This article describes how to use 3D-SURFER and EM-SURFER to carry out protein surface shape similarity searches, local geometric feature analysis, and interpretation of the search results. © 2017 by John Wiley & Sons, Inc. Copyright © 2017 John Wiley & Sons, Inc.
Projections for fast protein structure retrieval
Bhattacharya, Sourangshu; Bhattacharyya, Chiranjib; Chandra, Nagasuma R
2006-01-01
Background In recent times, there has been an exponential rise in the number of protein structures in databases e.g. PDB. So, design of fast algorithms capable of querying such databases is becoming an increasingly important research issue. This paper reports an algorithm, motivated from spectral graph matching techniques, for retrieving protein structures similar to a query structure from a large protein structure database. Each protein structure is specified by the 3D coordinates of residues of the protein. The algorithm is based on a novel characterization of the residues, called projections, leading to a similarity measure between the residues of the two proteins. This measure is exploited to efficiently compute the optimal equivalences. Results Experimental results show that, the current algorithm outperforms the state of the art on benchmark datasets in terms of speed without losing accuracy. Search results on SCOP 95% nonredundant database, for fold similarity with 5 proteins from different SCOP classes show that the current method performs competitively with the standard algorithm CE. The algorithm is also capable of detecting non-topological similarities between two proteins which is not possible with most of the state of the art tools like Dali. PMID:17254310
Pennington, Kyla; McGregor, Emma; Beasley, Clare L; Everall, Ian; Cotter, David; Dunn, Michael J
2004-01-01
A major cause of poor resolution in the alkaline pH range of two-dimensional electrophoresis (2-DE) gels is unsatisfactory separation of basic proteins in the first dimension. We have compared methods for the separation of basic proteins in the isoelectric focusing dimension of human brain proteins. The combined use of anodic cup-loading and the hydroxyethyldisulphide containing solution (DeStreak) produced better resolution in both analytical and micropreparative protein loaded 2-DE gels than the other methods investigated.
Yeates, Todd O.; Padilla, Jennifer; Colovos, Chris
2004-06-29
Novel fusion proteins capable of self-assembling into regular structures, as well as nucleic acids encoding the same, are provided. The subject fusion proteins comprise at least two oligomerization domains rigidly linked together, e.g. through an alpha helical linking group. Also provided are regular structures comprising a plurality of self-assembled fusion proteins of the subject invention, and methods for producing the same. The subject fusion proteins find use in the preparation of a variety of nanostructures, where such structures include: cages, shells, double-layer rings, two-dimensional layers, three-dimensional crystals, filaments, and tubes.
Xia, Chang-Ge; Zhang, Dijun; Ma, Chengnv; Zhou, Jun; He, Shan; Su, Xiu-Rong
2016-04-01
Sea cucumber is a commercially important marine organism in China. Of the different colored varieties sold in China, albino sea cucumber has the greatest appeal among consumers. Identification of factors contributing to albinism in sea cucumber is therefore likely to provide a scientific basis for improving the cultivability of these strains. In this study, two-dimensional liquid chromatography-tandem mass spectrometry coupled with isobaric tags for relative and absolute quantification labeling was used for the first time to quantitatively define the proteome of sea cucumbers and reveal proteomic characteristics unique to albino sea cucumbers. A total of 549 proteins were identified and quantified in albino sea cucumber and the functional annotations of 485 proteins have been exhibited based on COG database. Compared with green sea cucumber, 12 proteins were identified as differentially expressed in the intestine and 16 proteins in the body wall of albino sea cucumber. Among them, 5 proteins were up-regulated in the intestine and 8 proteins were down-regulated in body wall. Gene ontology annotations of these differentially expressed proteins consisted mostly of 'biological process'. The large number of differentially expressed proteins identified here should be highly useful in further elucidating the mechanisms underlying albinism in sea cucumber. Copyright © 2015 Elsevier Ltd. All rights reserved.
Griebel, Anja; Obermaier, Christian; Westermeier, Reiner; Moche, Martin; Büttner, Knut
2013-07-01
A new fluorescent amino-reactive dye has been tested for both labelling proteins prior to electrophoretic separations and between the two steps of two-dimensional electrophoresis. A series of experiments showed, that the labelling of lysines with this dye is compatible with all standard additives used for sample preparation, including reducing substances and carrier ampholytes. Using this dye for pre-labelling considerably simplifies the electrophoresis and detection workflow and provides highly sensitive and quantitative visualisation of proteins.
Brasier, Allan R.; Spratt, Heidi; Wu, Zheng; Boldogh, Istvan; Zhang, Yuhong; Garofalo, Roberto P.; Casola, Antonella; Pashmi, Jawad; Haag, Anthony; Luxon, Bruce; Kurosky, Alexander
2004-01-01
The pneumovirus respiratory syncytial virus (RSV) is a leading cause of epidemic respiratory tract infection. Upon entry, RSV replicates in the epithelial cytoplasm, initiating compensatory changes in cellular gene expression. In this study, we have investigated RSV-induced changes in the nuclear proteome of A549 alveolar type II-like epithelial cells by high-resolution two-dimensional gel electrophoresis (2DE). Replicate 2D gels from uninfected and RSV-infected nuclei were compared for changes in protein expression. We identified 24 different proteins by peptide mass fingerprinting after matrix-assisted laser desorption ionization-time of flight mass spectrometry (MS), whose average normalized spot intensity was statistically significant and differed by ±2-fold. Notable among the proteins identified were the cytoskeletal cytokeratins, RNA helicases, oxidant-antioxidant enzymes, the TAR DNA binding protein (a protein that associates with nuclear domain 10 [ND10] structures), and heat shock protein 70- and 60-kDa isoforms (Hsp70 and Hsp60, respectively). The identification of Hsp70 was also validated by liquid chromatography quadropole-TOF tandem MS (LC-MS/MS). Separate experiments using immunofluorescence microscopy revealed that RSV induced cytoplasmic Hsp70 aggregation and nuclear accumulation. Data mining of a genomic database showed that RSV replication induced coordinate changes in Hsp family proteins, including the 70, 70-2, 90, 40, and 40-3 isoforms. Because the TAR DNA binding protein associates with ND10s, we examined the effect of RSV infection on ND10 organization. RSV induced a striking dissolution of ND10 structures with redistribution of the component promyelocytic leukemia (PML) and speckled 100-kDa (Sp100) proteins into the cytoplasm, as well as inducing their synthesis. Our findings suggest that cytoplasmic RSV replication induces a nuclear heat shock response, causes ND10 disruption, and redistributes PML and Sp100 to the cytoplasm. Thus, a high-resolution proteomics approach, combined with immunofluorescence localization and coupled with genomic response data, yielded unexpected novel insights into compensatory nuclear responses to RSV infection. PMID:15479789
Nannenga, Brent L; Iadanza, Matthew G; Vollmar, Breanna S; Gonen, Tamir
2013-01-01
Electron cryomicroscopy, or cryoEM, is an emerging technique for studying the three-dimensional structures of proteins and large macromolecular machines. Electron crystallography is a branch of cryoEM in which structures of proteins can be studied at resolutions that rival those achieved by X-ray crystallography. Electron crystallography employs two-dimensional crystals of a membrane protein embedded within a lipid bilayer. The key to a successful electron crystallographic experiment is the crystallization, or reconstitution, of the protein of interest. This unit describes ways in which protein can be expressed, purified, and reconstituted into well-ordered two-dimensional crystals. A protocol is also provided for negative stain electron microscopy as a tool for screening crystallization trials. When large and well-ordered crystals are obtained, the structures of both protein and its surrounding membrane can be determined to atomic resolution.
CADB: Conformation Angles DataBase of proteins
Sheik, S. S.; Ananthalakshmi, P.; Bhargavi, G. Ramya; Sekar, K.
2003-01-01
Conformation Angles DataBase (CADB) provides an online resource to access data on conformation angles (both main-chain and side-chain) of protein structures in two data sets corresponding to 25% and 90% sequence identity between any two proteins, available in the Protein Data Bank. In addition, the database contains the necessary crystallographic parameters. The package has several flexible options and display facilities to visualize the main-chain and side-chain conformation angles for a particular amino acid residue. The package can also be used to study the interrelationship between the main-chain and side-chain conformation angles. A web based JAVA graphics interface has been deployed to display the user interested information on the client machine. The database is being updated at regular intervals and can be accessed over the World Wide Web interface at the following URL: http://144.16.71.148/cadb/. PMID:12520049
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, Boo-Ja; Kwon, Sun Jae; Kim, Sung-Kyu
Two-dimensional gel electrophoresis (2-DE) was applied for the screening of Tobacco mosaic virus (TMV)-induced hot pepper (Capsicum annuum cv. Bugang) nuclear proteins. From differentially expressed protein spots, we acquired the matched peptide mass fingerprint (PMF) data, analyzed by MALDI-TOF MS, from the non-redundant hot pepper EST protein FASTA database using the VEMS 2.0 software. Among six identified nuclear proteins, the hot pepper 26S proteasome subunit RPN7 (CaRPN7) was subjected to further study. The level of CaRPN7 mRNA was specifically increased during incompatible TMV-P{sub 0} interaction, but not during compatible TMV-P{sub 1.2} interaction. When CaRPN7::GFP fusion protein was targeted in onionmore » cells, the nuclei had been broken into pieces. In the hot pepper leaves, cell death was exacerbated and genomic DNA laddering was induced by Agrobacterium-mediated transient overexpression of CaPRN7. Thus, this report presents that the TMV-induced CaRPN7 may be involved in programmed cell death (PCD) in the hot pepper plant.« less
Graphical classification of DNA sequences of HLA alleles by deep learning.
Miyake, Jun; Kaneshita, Yuhei; Asatani, Satoshi; Tagawa, Seiichi; Niioka, Hirohiko; Hirano, Takashi
2018-04-01
Alleles of human leukocyte antigen (HLA)-A DNAs are classified and expressed graphically by using artificial intelligence "Deep Learning (Stacked autoencoder)". Nucleotide sequence data corresponding to the length of 822 bp, collected from the Immuno Polymorphism Database, were compressed to 2-dimensional representation and were plotted. Profiles of the two-dimensional plots indicate that the alleles can be classified as clusters are formed. The two-dimensional plot of HLA-A DNAs gives a clear outlook for characterizing the various alleles.
DBSecSys 2.0: a database of Burkholderia mallei and Burkholderia pseudomallei secretion systems.
Memišević, Vesna; Kumar, Kamal; Zavaljevski, Nela; DeShazer, David; Wallqvist, Anders; Reifman, Jaques
2016-09-20
Burkholderia mallei and B. pseudomallei are the causative agents of glanders and melioidosis, respectively, diseases with high morbidity and mortality rates. B. mallei and B. pseudomallei are closely related genetically; B. mallei evolved from an ancestral strain of B. pseudomallei by genome reduction and adaptation to an obligate intracellular lifestyle. Although these two bacteria cause different diseases, they share multiple virulence factors, including bacterial secretion systems, which represent key components of bacterial pathogenicity. Despite recent progress, the secretion system proteins for B. mallei and B. pseudomallei, their pathogenic mechanisms of action, and host factors are not well characterized. We previously developed a manually curated database, DBSecSys, of bacterial secretion system proteins for B. mallei. Here, we report an expansion of the database with corresponding information about B. pseudomallei. DBSecSys 2.0 contains comprehensive literature-based and computationally derived information about B. mallei ATCC 23344 and literature-based and computationally derived information about B. pseudomallei K96243. The database contains updated information for 163 B. mallei proteins from the previous database and 61 additional B. mallei proteins, and new information for 281 B. pseudomallei proteins associated with 5 secretion systems, their 1,633 human- and murine-interacting targets, and 2,400 host-B. mallei interactions and 2,286 host-B. pseudomallei interactions. The database also includes information about 13 pathogenic mechanisms of action for B. mallei and B. pseudomallei secretion system proteins inferred from the available literature or computationally. Additionally, DBSecSys 2.0 provides details about 82 virulence attenuation experiments for 52 B. mallei secretion system proteins and 98 virulence attenuation experiments for 61 B. pseudomallei secretion system proteins. We updated the Web interface and data access layer to speed-up users' search of detailed information for orthologous proteins related to secretion systems of the two pathogens. The updates of DBSecSys 2.0 provide unique capabilities to access comprehensive information about secretion systems of B. mallei and B. pseudomallei. They enable studies and comparisons of corresponding proteins of these two closely related pathogens and their host-interacting partners. The database is available at http://dbsecsys.bhsai.org .
AlQuraishi, Mohammed; Tang, Shengdong; Xia, Xide
2015-11-19
Molecular interactions between proteins and DNA molecules underlie many cellular processes, including transcriptional regulation, chromosome replication, and nucleosome positioning. Computational analyses of protein-DNA interactions rely on experimental data characterizing known protein-DNA interactions structurally and biochemically. While many databases exist that contain either structural or biochemical data, few integrate these two data sources in a unified fashion. Such integration is becoming increasingly critical with the rapid growth of structural and biochemical data, and the emergence of algorithms that rely on the synthesis of multiple data types to derive computational models of molecular interactions. We have developed an integrated affinity-structure database in which the experimental and quantitative DNA binding affinities of helix-turn-helix proteins are mapped onto the crystal structures of the corresponding protein-DNA complexes. This database provides access to: (i) protein-DNA structures, (ii) quantitative summaries of protein-DNA binding affinities using position weight matrices, and (iii) raw experimental data of protein-DNA binding instances. Critically, this database establishes a correspondence between experimental structural data and quantitative binding affinity data at the single basepair level. Furthermore, we present a novel alignment algorithm that structurally aligns the protein-DNA complexes in the database and creates a unified residue-level coordinate system for comparing the physico-chemical environments at the interface between complexes. Using this unified coordinate system, we compute the statistics of atomic interactions at the protein-DNA interface of helix-turn-helix proteins. We provide an interactive website for visualization, querying, and analyzing this database, and a downloadable version to facilitate programmatic analysis. This database will facilitate the analysis of protein-DNA interactions and the development of programmatic computational methods that capitalize on integration of structural and biochemical datasets. The database can be accessed at http://ProteinDNA.hms.harvard.edu.
Ha, Moon Kyung; Chung, Kee Yang; Lee, Ju Hee; Bang, Dongsik; Park, Yoon Kee; Lee, Kwang Hoon
2004-09-01
Aging is associated with the progressive pathophysiologic modification of endothelial cells. In vitro endothelial cell senescence is accompanied by proliferative activity failure and by perturbations in gene and protein expressions. Moreover, this cellular senescence in culture has been proposed to reflect processes that occur in aging organisms. In order to observe the changing patterns of protein expression in senescent human dermal microvascular endothelial cells (HDMECs), proteins obtained from both early- and late-passaged HDMECs were separated by two-dimensional electrophoresis, visualized by silver staining, and quantified by image processing. Proteins of interest were extracted by in-gel digestion with trypsin and quantified by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS), by searching the National Center for Biotechnology Information protein-sequence database. More than 2000 spots were detected by 2D electrophoresis within a linear pH range of 3-10. Twenty-two major differentially expressed spots were observed in serially passaged HDMECs and identified with high confidence by MALDI-TOF-MS. One of these spots was found to be a 14-15 kDa psoriasis-associated fatty acid-binding protein (PA-FABP) with high affinity for long-chain fatty acids. The expression of PA-FABP was confirmed to be elevated in senescent HDMECs (passage 20) by fluorescence-activated cell sorting (FACS), confocal laser microscopy, and by immunohistochemistry in aged human skin tissue. Our results suggest that the overexpression of FABP in cultured senescent HDMECs is closely related to skin aging.
SAFE Software and FED Database to Uncover Protein-Protein Interactions using Gene Fusion Analysis.
Tsagrasoulis, Dimosthenis; Danos, Vasilis; Kissa, Maria; Trimpalis, Philip; Koumandou, V Lila; Karagouni, Amalia D; Tsakalidis, Athanasios; Kossida, Sophia
2012-01-01
Domain Fusion Analysis takes advantage of the fact that certain proteins in a given proteome A, are found to have statistically significant similarity with two separate proteins in another proteome B. In other words, the result of a fusion event between two separate proteins in proteome B is a specific full-length protein in proteome A. In such a case, it can be safely concluded that the protein pair has a common biological function or even interacts physically. In this paper, we present the Fusion Events Database (FED), a database for the maintenance and retrieval of fusion data both in prokaryotic and eukaryotic organisms and the Software for the Analysis of Fusion Events (SAFE), a computational platform implemented for the automated detection, filtering and visualization of fusion events (both available at: http://www.bioacademy.gr/bioinformatics/projects/ProteinFusion/index.htm). Finally, we analyze the proteomes of three microorganisms using these tools in order to demonstrate their functionality.
SAFE Software and FED Database to Uncover Protein-Protein Interactions using Gene Fusion Analysis
Tsagrasoulis, Dimosthenis; Danos, Vasilis; Kissa, Maria; Trimpalis, Philip; Koumandou, V. Lila; Karagouni, Amalia D.; Tsakalidis, Athanasios; Kossida, Sophia
2012-01-01
Domain Fusion Analysis takes advantage of the fact that certain proteins in a given proteome A, are found to have statistically significant similarity with two separate proteins in another proteome B. In other words, the result of a fusion event between two separate proteins in proteome B is a specific full-length protein in proteome A. In such a case, it can be safely concluded that the protein pair has a common biological function or even interacts physically. In this paper, we present the Fusion Events Database (FED), a database for the maintenance and retrieval of fusion data both in prokaryotic and eukaryotic organisms and the Software for the Analysis of Fusion Events (SAFE), a computational platform implemented for the automated detection, filtering and visualization of fusion events (both available at: http://www.bioacademy.gr/bioinformatics/projects/ProteinFusion/index.htm). Finally, we analyze the proteomes of three microorganisms using these tools in order to demonstrate their functionality. PMID:22267904
Chen, De-Ju; Xu, Yan-Ming; Zheng, Wei; Huang, Dong-Yang; Wong, Wing-Yan; Tai, William Chi-Shing; Cho, Yong-Yeon; Lau, Andy T Y
2015-09-01
For years, many studies have been conducted to investigate the intracellular response of cells challenged with toxic metal(s), yet, the corresponding secretome responses, especially in human lung cells, are largely unexplored. Here, we provide a secretome analysis of human bronchial epithelial cells (BEAS-2B) treated with cadmium chloride (CdCl2 ), with the aim of identifying secreted proteins in response to Cd toxicity. Proteins from control and spent media were separated by two-dimensional electrophoresis and visualized by silver staining. Differentially-secreted proteins were identified by MALDI-TOF-MS analysis and database searching. We characterized, for the first time, the extracellular proteome changes of BEAS-2B dosed with Cd. Our results unveiled that Cd treatment led to the marked upregulation of molecular chaperones, antioxidant enzymes, enzymes associated with glutathione metabolic process, proteins involved in cellular energy metabolism, as well as tumor-suppressors. Pretreatment of cells with the thiol antioxidant glutathione before Cd treatment effectively abrogated the secretion of these proteins and prevented cell death. Taken together, our results demonstrate that Cd causes oxidative stress-induced cytotoxicity; and the differentially-secreted protein signatures could be considered as targets for potential use as extracellular biomarkers upon Cd exposure. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Metagenomic Taxonomy-Guided Database-Searching Strategy for Improving Metaproteomic Analysis.
Xiao, Jinqiu; Tanca, Alessandro; Jia, Ben; Yang, Runqing; Wang, Bo; Zhang, Yu; Li, Jing
2018-04-06
Metaproteomics provides a direct measure of the functional information by investigating all proteins expressed by a microbiota. However, due to the complexity and heterogeneity of microbial communities, it is very hard to construct a sequence database suitable for a metaproteomic study. Using a public database, researchers might not be able to identify proteins from poorly characterized microbial species, while a sequencing-based metagenomic database may not provide adequate coverage for all potentially expressed protein sequences. To address this challenge, we propose a metagenomic taxonomy-guided database-search strategy (MT), in which a merged database is employed, consisting of both taxonomy-guided reference protein sequences from public databases and proteins from metagenome assembly. By applying our MT strategy to a mock microbial mixture, about two times as many peptides were detected as with the metagenomic database only. According to the evaluation of the reliability of taxonomic attribution, the rate of misassignments was comparable to that obtained using an a priori matched database. We also evaluated the MT strategy with a human gut microbial sample, and we found 1.7 times as many peptides as using a standard metagenomic database. In conclusion, our MT strategy allows the construction of databases able to provide high sensitivity and precision in peptide identification in metaproteomic studies, enabling the detection of proteins from poorly characterized species within the microbiota.
Advancements of two dimensional correlation spectroscopy in protein researches
NASA Astrophysics Data System (ADS)
Tao, Yanchun; Wu, Yuqing; Zhang, Liping
2018-05-01
The developments of two-dimensional correlation spectroscopy (2DCOS) applications in protein studies are discussed, especially for the past two decades. The powerful utilities of 2DCOS combined with various analytical techniques in protein studies are summarized. The emphasis is on the vibration spectroscopic techniques including IR, NIR, Raman and optical activity (ROA), as well as vibration circular dichroism (VCD) and fluorescence spectroscopy. In addition, some new developments, such as hetero-spectral 2DCOS, moving-window correlation, and model based correlation, are also reviewed for their utility in the investigation of the secondary structure, denaturation, folding and unfolding changes of protein. Finally, the new possibility and challenges of 2DCOS in protein research are highlighted as well.
7 Å resolution in protein two-dimensional-crystal X-ray diffraction at Linac Coherent Light Source
Pedrini, Bill; Tsai, Ching-Ju; Capitani, Guido; Padeste, Celestino; Hunter, Mark S.; Zatsepin, Nadia A.; Barty, Anton; Benner, W. Henry; Boutet, Sébastien; Feld, Geoffrey K.; Hau-Riege, Stefan P.; Kirian, Richard A.; Kupitz, Christopher; Messerschmitt, Marc; Ogren, John I.; Pardini, Tommaso; Segelke, Brent; Williams, Garth J.; Spence, John C. H.; Abela, Rafael; Coleman, Matthew; Evans, James E.; Schertler, Gebhard F. X.; Frank, Matthias; Li, Xiao-Dan
2014-01-01
Membrane proteins arranged as two-dimensional crystals in the lipid environment provide close-to-physiological structural information, which is essential for understanding the molecular mechanisms of protein function. Previously, X-ray diffraction from individual two-dimensional crystals did not represent a suitable investigational tool because of radiation damage. The recent availability of ultrashort pulses from X-ray free-electron lasers (XFELs) has now provided a means to outrun the damage. Here, we report on measurements performed at the Linac Coherent Light Source XFEL on bacteriorhodopsin two-dimensional crystals mounted on a solid support and kept at room temperature. By merging data from about a dozen single crystal diffraction images, we unambiguously identified the diffraction peaks to a resolution of 7 Å, thus improving the observable resolution with respect to that achievable from a single pattern alone. This indicates that a larger dataset will allow for reliable quantification of peak intensities, and in turn a corresponding increase in the resolution. The presented results pave the way for further XFEL studies on two-dimensional crystals, which may include pump–probe experiments at subpicosecond time resolution. PMID:24914166
You, Xiangrong; Wang, Lingxia; Liang, Wenyu; Gai, Yonghong; Wang, Xiaoyan; Chen, Wei
2012-02-02
Two-dimensional gel electrophoresis (2-DE) was used to analyze the proteins related to floral reversion in Dimocarpus longan Lour. Proteins were extracted from buds undergoing the normal process of flowering and from those undergoing floral reversion in three developing stages in D. longan. Differentially expressed proteins were identified from the gels after 2-DE analysis, which were confirmed using matrix-assisted laser desorption/ionization-time of flying-mass spectroscopy and protein database search. A total of 39 proteins, including 18 up-regulated and 21 down-regulated proteins, were classified into different categories, such as energy and substance metabolism, protein translation, secondary metabolism, phytohormone, cytoskeleton structure, regulation, and stress tolerance. Among these, the largest functional class was associated with primary metabolism. Down-regulated proteins were involved in photosynthesis, transcription, and translation, whereas up-regulated proteins were involved in respiration. Decreased flavonoid synthesis and up-regulated GA20ox might be involved in the floral reversion process. Up-regulated 14-3-3 proteins played a role in the regulation of floral reversion in D. longan by responding to abiotic stress. Observations via transmission electron microscopy revealed the ultrastructure changes in shedding buds undergoing floral reversion. Overall, the results provided insights into the molecular basis for the floral reversion mechanism in D. longan. Copyright © 2011 Elsevier B.V. All rights reserved.
Shindo, N; Fujimura, T; Nojima-Kazuno, S; Mineki, R; Furusawa, S; Sasaki, K; Murayama, K
1998-11-15
Multidrug resistant protein 1 (MDR1) in a doxorubicin-resistant mouse leukemia cell line (P388/DOX) was identified using its amino acid composition combined with protein database searching (ExPASy and EMBL PROPSEARCH) via the World Wide Web. The proteins were separated by one-dimensional SDS-polyacrylamide gel electrophoresis, blotted onto a polyvinylidene fluoride membrane, and stained with Coomassie brilliant blue. A 160-kDa protein band was acid-hydrolyzed in the vapor phase (6 N HC1) and converted to 6-aminoquinolyl-carbamyl (AQC)-amino acids without extraction of the amino acids from the membrane. The amino acid composition of the protein was determined using the sensitive AQC-amino acid analysis method, improving our previously described method. The improved method involved using a Cosmosil 5C8-MS column instead of a Pegasil C8; replacement of the mobile phase A, constituent, 75 mM ammonium phosphate (pH 7.5), with 30 mM sodium phosphate buffer (pH 7.2); and slight modification of the separation program (9). All manipulations for protein hydrolysis and AQC derivatization were carried out in a hood using clean tools. This minimized contamination of amino acids at the low femtomolar level. A database search was carried out with bovine serum albumin as a calibration protein. MDR1 in P388/DOX was ranked first by both databases with high reliability (score 14 for ExPASy, distance 1.34 for EMBL).
Saitoh, Eiichi; Sega, Takuya; Imai, Akane; Isemura, Satoko; Kato, Tetsuo; Ochiai, Akihito; Taniguchi, Masayuki
2018-04-01
The NCBI gene database and human-transcriptome database for alternative splicing were used to determine the expression of mRNAs for P-B (SMR3B) and variant form of P-B. The translational product from the former mRNA was identified as the protein named P-B, whereas that from the latter has not yet been elucidated. In the present study, we investigated the expression of P-B and its variant form at the protein level. To identify the variant protein of P-B, (1) cationic proteins with a higher isoelectric point in human pooled whole saliva were purified by a two dimensional liquid chromatography; (2) the peptide fragments generated from the in-solution of all proteins digested with trypsin separated and analyzed by MALDI-TOF-MS; and (3) the presence or absence of P-B in individual saliva was examined by 15% SDS-PAGE. The peptide sequences (I 37 PPPYSCTPNMNNCSR 52 , C 53 HHHHKRHHYPCNYCFCYPK 72 , R 59 HHYPCNYCFCYPK 72 and H 60 HYPCNYCFCYPK 72 ) present in the variant protein of P-B were identified. The peptide sequence (G 6 PYPPGPLAPPQPFGPGFVPPPPPPPYGPGR 36 ) in P-B (or the variant) and sequence (I 37 PPPPPAPYGPGIFPPPPPQP 57 ) in P-B were identified. The sum of the sequences identified indicated a 91.23% sequence identity for P-B and 79.76% for the variant. There were cases in which P-B existed in individual saliva, but there were cases in which it did not exist in individual saliva. The variant protein is produced by excising a non-canonical intron (CC-AC pair) from the 3'-noncoding sequence of the PBII gene. Both P-B and the variant are subject to proteolysis in the oral cavity. Copyright © 2018 Elsevier Ltd. All rights reserved.
Hadizadeh Tasbiti, Alireza; Yari, Shamsi; Siadat, Seyed Davar; Tabarsi, Payam; Saeedfar, Kayvan; Yari, Fatemeh
2018-02-01
Tuberculosis (TB) is a crucial public health problem with prevalence of multidrug resistant (MDR) rising. An accurate TB biomarker is urgently needed to monitor the response to treatment in patients with MDR tuberculosis. To analyze interaction between selected MDR-TB purified protein and immune cells, dendritic cells from MDR-TB patients and healthy subjects were stimulated by 55KDa protein fractions (Rv0147). The purified proteins identified by proteomic techniques (two-dimensional gel electrophoresis, mass spectrometry) and peptide sequences are known to bind a MHC class I alleles which are extracted from the Immune Epitope Database and Analysis Resource database ( www.iedb.org ). T cells were isolated from PBMC by negative selection and cells were cultured in RPMI-1640 at 37 °C and 5% CO 2 . Cell culture was assayed for cytokine IL-10 and INF-γ by ELISA. We found that INF-γ production was significantly (335 ± 35.5 pg/ml, P ˂ 0.05) upregulated after protein candidate (Rv0147) stimulation by dendritic cells from MDR-TB patients, whereas IL-10 production was greatly reduced compared with production in healthy subjects (212 ± 9.94 pg/ml, P ˂ 0.05). In fact, the purified protein, Rv0147, stimulated dendritic cells from MDR-TB patients, failed to produce IL-10 and directly stimulates INF-γ production by T cells. These results suggest that the purified protein, Rv0147, may stimulate Th1 type protective cytokine response in MDR-TB patients but not in normal subjects. The production of INF-γ but not IL-10 in the presence of purified protein, Rv0147, may be shifted to Th1 responses in MDR-TB patients and supports its potential as protein vaccine candidates against TB.
Cole, Jason C.
2017-01-01
The Cambridge Structural Database (CSD) is the worldwide resource for the dissemination of all published three-dimensional structures of small-molecule organic and metal–organic compounds. This paper briefly describes how this collection of crystal structures can be used en masse in the context of macromolecular crystallography. Examples highlight how the CSD and associated software aid protein–ligand complex validation, and show how the CSD could be further used in the generation of geometrical restraints for protein structure refinement. PMID:28291758
Beaver, John E; Bourne, Philip E; Ponomarenko, Julia V
2007-02-21
Structural information about epitopes, particularly the three-dimensional (3D) structures of antigens in complex with immune receptors, presents a valuable source of data for immunology. This information is available in the Protein Data Bank (PDB) and provided in curated form by the Immune Epitope Database and Analysis Resource (IEDB). With continued growth in these data and the importance in understanding molecular level interactions of immunological interest there is a need for new specialized molecular visualization and analysis tools. The EpitopeViewer is a platform-independent Java application for the visualization of the three-dimensional structure and sequence of epitopes and analyses of their interactions with antigen-specific receptors of the immune system (antibodies, T cell receptors and MHC molecules). The viewer renders both 3D views and two-dimensional plots of intermolecular interactions between the antigen and receptor(s) by reading curated data from the IEDB and/or calculated on-the-fly from atom coordinates from the PDB. The 3D views and associated interactions can be saved for future use and publication. The EpitopeViewer can be accessed from the IEDB Web site http://www.immuneepitope.org through the quick link 'Browse Records by 3D Structure.' The EpitopeViewer is designed and been tested for use by immunologists with little or no training in molecular graphics. The EpitopeViewer can be launched from most popular Web browsers without user intervention. A Java Runtime Environment (RJE) 1.4.2 or higher is required.
Comparative proteome analysis of monolayer and spheroid culture of canine osteosarcoma cells.
Gebhard, Christiane; Miller, Ingrid; Hummel, Karin; Neschi Née Ondrovics, Martina; Schlosser, Sarah; Walter, Ingrid
2018-04-15
Osteosarcoma is an aggressive bone tumor with high metastasis rate in the lungs and affects both humans and dogs in a similar way. Three-dimensional tumor cell cultures mimic the in vivo situation of micro-tumors and metastases and are therefore better experimental in vitro models than the often applied two-dimensional monolayer cultures. The aim of the present study was to perform comparative proteomics of standard monolayer cultures of canine osteosarcoma cells (D17) and three-dimensional spheroid cultures, to better characterize the 3D model before starting with experiments like migration assays. Using DIGE in combination with MALDI-TOF/TOF we found 27 unique canine proteins differently represented between these two culture systems, most of them being part of a functional network including mainly chaperones, structural proteins, stress-related proteins, proteins of the glycolysis/gluconeogenesis pathway and oxidoreductases. In monolayer cells, a noticeable shift to more acidic pI values was noticed for several proteins of medium to high abundance; two proteins (protein disulfide isomerase A3, stress-induced-phosphoprotein 1) showed an increase of phosphorylated protein species. Protein distribution within the cells, as detected by immunohistochemistry, displayed a switch of stress-induced-phosphoprotein 1 from the cytoplasm (in monolayer cultures) to the nucleus (in spheroid cultures). Additionally, Western blot testing revealed upregulated concentrations of metastasin (S100A4), triosephosphate isomerase 1 and septin 2 in spheroid cultures, in contrast to decreased concentrations of CCT2, a subunit of the T-complex. Results indicate regulation of stress proteins in the process of three-dimensional organization characterized by a hypoxic and nutrient-deficient environment comparable to tumor micro-metastases. Osteosarcoma is an aggressive bone tumor that early spreads to the lungs. Three-dimensional tumor cell cultures represent the avascular stage of micro-tumors and metastases, and should therefore represent a better experimental in vitro model compared to two-dimensional monolayer cultures. Significant differences have been reported in response to drug and radiation treatment between these two culture systems. A gel-based proteomic investigation was performed to compare protein patterns of a canine osteosarcoma cell line cultivated under those two conditions, to learn more about altered cell composition and its impact on cell behaviour. Due to the fact that the canine osteosarcoma is an accepted model for the human disease, results will be relevant for the human species as well. Copyright © 2018 Elsevier B.V. All rights reserved.
Nonlinear dimensionality reduction methods for synthetic biology biobricks' visualization.
Yang, Jiaoyun; Wang, Haipeng; Ding, Huitong; An, Ning; Alterovitz, Gil
2017-01-19
Visualizing data by dimensionality reduction is an important strategy in Bioinformatics, which could help to discover hidden data properties and detect data quality issues, e.g. data noise, inappropriately labeled data, etc. As crowdsourcing-based synthetic biology databases face similar data quality issues, we propose to visualize biobricks to tackle them. However, existing dimensionality reduction methods could not be directly applied on biobricks datasets. Hereby, we use normalized edit distance to enhance dimensionality reduction methods, including Isomap and Laplacian Eigenmaps. By extracting biobricks from synthetic biology database Registry of Standard Biological Parts, six combinations of various types of biobricks are tested. The visualization graphs illustrate discriminated biobricks and inappropriately labeled biobricks. Clustering algorithm K-means is adopted to quantify the reduction results. The average clustering accuracy for Isomap and Laplacian Eigenmaps are 0.857 and 0.844, respectively. Besides, Laplacian Eigenmaps is 5 times faster than Isomap, and its visualization graph is more concentrated to discriminate biobricks. By combining normalized edit distance with Isomap and Laplacian Eigenmaps, synthetic biology biobircks are successfully visualized in two dimensional space. Various types of biobricks could be discriminated and inappropriately labeled biobricks could be determined, which could help to assess crowdsourcing-based synthetic biology databases' quality, and make biobricks selection.
Fast protein tertiary structure retrieval based on global surface shape similarity.
Sael, Lee; Li, Bin; La, David; Fang, Yi; Ramani, Karthik; Rustamov, Raif; Kihara, Daisuke
2008-09-01
Characterization and identification of similar tertiary structure of proteins provides rich information for investigating function and evolution. The importance of structure similarity searches is increasing as structure databases continue to expand, partly due to the structural genomics projects. A crucial drawback of conventional protein structure comparison methods, which compare structures by their main-chain orientation or the spatial arrangement of secondary structure, is that a database search is too slow to be done in real-time. Here we introduce a global surface shape representation by three-dimensional (3D) Zernike descriptors, which represent a protein structure compactly as a series expansion of 3D functions. With this simplified representation, the search speed against a few thousand structures takes less than a minute. To investigate the agreement between surface representation defined by 3D Zernike descriptor and conventional main-chain based representation, a benchmark was performed against a protein classification generated by the combinatorial extension algorithm. Despite the different representation, 3D Zernike descriptor retrieved proteins of the same conformation defined by combinatorial extension in 89.6% of the cases within the top five closest structures. The real-time protein structure search by 3D Zernike descriptor will open up new possibility of large-scale global and local protein surface shape comparison. 2008 Wiley-Liss, Inc.
The Efficacy of Multidimensional Constraint Keys in Database Query Performance
ERIC Educational Resources Information Center
Cardwell, Leslie K.
2012-01-01
This work is intended to introduce a database design method to resolve the two-dimensional complexities inherent in the relational data model and its resulting performance challenges through abstract multidimensional constructs. A multidimensional constraint is derived and utilized to implement an indexed Multidimensional Key (MK) to abstract a…
Kim, Young-Ha; slam, Mohammad Saiful; You, Myung-Jo
2015-01-01
Proteomic tools allow large-scale, high-throughput analyses for the detection, identification, and functional investigation of proteome. For detection of antigens from Haemaphysalis longicornis, 1-dimensional electrophoresis (1-DE) quantitative immunoblotting technique combined with 2-dimensional electrophoresis (2-DE) immunoblotting was used for whole body proteins from unfed and partially fed female ticks. Reactivity bands and 2-DE immunoblotting were performed following 2-DE electrophoresis to identify protein spots. The proteome of the partially fed female had a larger number of lower molecular weight proteins than that of the unfed female tick. The total number of detected spots was 818 for unfed and 670 for partially fed female ticks. The 2-DE immunoblotting identified 10 antigenic spots from unfed females and 8 antigenic spots from partially fed females. Matrix Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry (MALDI-TOF) of relevant spots identified calreticulin, putative secreted WC salivary protein, and a conserved hypothetical protein from the National Center for Biotechnology Information and Swiss Prot protein sequence databases. These findings indicate that most of the whole body components of these ticks are non-immunogenic. The data reported here will provide guidance in the identification of antigenic proteins to prevent infestation and diseases transmitted by H. longicornis. PMID:25748713
PATRIC, the bacterial bioinformatics database and analysis resource
Wattam, Alice R.; Abraham, David; Dalay, Oral; Disz, Terry L.; Driscoll, Timothy; Gabbard, Joseph L.; Gillespie, Joseph J.; Gough, Roger; Hix, Deborah; Kenyon, Ronald; Machi, Dustin; Mao, Chunhong; Nordberg, Eric K.; Olson, Robert; Overbeek, Ross; Pusch, Gordon D.; Shukla, Maulik; Schulman, Julie; Stevens, Rick L.; Sullivan, Daniel E.; Vonstein, Veronika; Warren, Andrew; Will, Rebecca; Wilson, Meredith J.C.; Yoo, Hyun Seung; Zhang, Chengdong; Zhang, Yan; Sobral, Bruno W.
2014-01-01
The Pathosystems Resource Integration Center (PATRIC) is the all-bacterial Bioinformatics Resource Center (BRC) (http://www.patricbrc.org). A joint effort by two of the original National Institute of Allergy and Infectious Diseases-funded BRCs, PATRIC provides researchers with an online resource that stores and integrates a variety of data types [e.g. genomics, transcriptomics, protein–protein interactions (PPIs), three-dimensional protein structures and sequence typing data] and associated metadata. Datatypes are summarized for individual genomes and across taxonomic levels. All genomes in PATRIC, currently more than 10 000, are consistently annotated using RAST, the Rapid Annotations using Subsystems Technology. Summaries of different data types are also provided for individual genes, where comparisons of different annotations are available, and also include available transcriptomic data. PATRIC provides a variety of ways for researchers to find data of interest and a private workspace where they can store both genomic and gene associations, and their own private data. Both private and public data can be analyzed together using a suite of tools to perform comparative genomic or transcriptomic analysis. PATRIC also includes integrated information related to disease and PPIs. All the data and integrated analysis and visualization tools are freely available. This manuscript describes updates to the PATRIC since its initial report in the 2007 NAR Database Issue. PMID:24225323
Application of Tandem Two-Dimensional Mass Spectrometry for Top-Down Deep Sequencing of Calmodulin.
Floris, Federico; Chiron, Lionel; Lynch, Alice M; Barrow, Mark P; Delsuc, Marc-André; O'Connor, Peter B
2018-06-04
Two-dimensional mass spectrometry (2DMS) involves simultaneous acquisition of the fragmentation patterns of all the analytes in a mixture by correlating their precursor and fragment ions by modulating precursor ions systematically through a fragmentation zone. Tandem two-dimensional mass spectrometry (MS/2DMS) unites the ultra-high accuracy of Fourier transform ion cyclotron resonance (FT-ICR) MS/MS and the simultaneous data-independent fragmentation of 2DMS to achieve extensive inter-residue fragmentation of entire proteins. 2DMS was recently developed for top-down proteomics (TDP), and applied to the analysis of calmodulin (CaM), reporting a cleavage coverage of about ~23% using infrared multiphoton dissociation (IRMPD) as fragmentation technique. The goal of this work is to expand the utility of top-down protein analysis using MS/2DMS in order to extend the cleavage coverage in top-down proteomics further into the interior regions of the protein. In this case, using MS/2DMS, the cleavage coverage of CaM increased from ~23% to ~42%. Graphical Abstract Two-dimensional mass spectrometry, when applied to primary fragment ions from the source, allows deep-sequencing of the protein calmodulin.
Proteomic analysis of 'Zaosu' pear (Pyrus bretschneideri Rehd.) and its early-maturing bud sport.
Liu, Xueting; Zhai, Rui; Feng, Wenting; Zhang, Shiwei; Wang, Zhigang; Qiu, Zonghao; Zhang, Junke; Ma, Fengwang; Xu, Lingfei
2014-07-01
Maturation of fruits involves a series of physiological, biochemical, and organoleptic changes that eventually make fleshy fruits attractive, palatable, and nutritional. In order to understand the mature mechanism of the early-maturing bud sport of 'Zaosu' pear, we analyzed the differences of proteome expression between the both pears in different mature stages by the methods of a combination of two-dimensional electrophoresis (2-DE) and matrix assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) analysis. Seventy-five differential expressed protein spots (p<0.05) were obtained between 'Zaosu' pear and its early-maturing bud sport, but only sixty-eight were demonstratively identified in the database of NCBI and uniprot. The majority of proteins were linked to metabolism, energy, stress response/defense and cell structure. Additionally, our data confirmed an increase of proteins related to cell-wall modification, oxidative stress and pentose phosphate metabolism and a decrease of proteins related to photosynthesis and glycolysis during the development process of both pears, but all these proteins increased or decreased faster in the early-maturing bud sport. This comparative analysis between both pears showed that these proteins were closely associated with maturation and could provide more detailed characteristics of the maturation process of both pears. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Núñez-Vivanco, Gabriel; Valdés-Jiménez, Alejandro; Besoaín, Felipe; Reyes-Parada, Miguel
2016-01-01
Since the structure of proteins is more conserved than the sequence, the identification of conserved three-dimensional (3D) patterns among a set of proteins, can be important for protein function prediction, protein clustering, drug discovery and the establishment of evolutionary relationships. Thus, several computational applications to identify, describe and compare 3D patterns (or motifs) have been developed. Often, these tools consider a 3D pattern as that described by the residues surrounding co-crystallized/docked ligands available from X-ray crystal structures or homology models. Nevertheless, many of the protein structures stored in public databases do not provide information about the location and characteristics of ligand binding sites and/or other important 3D patterns such as allosteric sites, enzyme-cofactor interaction motifs, etc. This makes necessary the development of new ligand-independent methods to search and compare 3D patterns in all available protein structures. Here we introduce Geomfinder, an intuitive, flexible, alignment-free and ligand-independent web server for detailed estimation of similarities between all pairs of 3D patterns detected in any two given protein structures. We used around 1100 protein structures to form pairs of proteins which were assessed with Geomfinder. In these analyses each protein was considered in only one pair (e.g. in a subset of 100 different proteins, 50 pairs of proteins can be defined). Thus: (a) Geomfinder detected identical pairs of 3D patterns in a series of monoamine oxidase-B structures, which corresponded to the effectively similar ligand binding sites at these proteins; (b) we identified structural similarities among pairs of protein structures which are targets of compounds such as acarbose, benzamidine, adenosine triphosphate and pyridoxal phosphate; these similar 3D patterns are not detected using sequence-based methods; (c) the detailed evaluation of three specific cases showed the versatility of Geomfinder, which was able to discriminate between similar and different 3D patterns related to binding sites of common substrates in a range of diverse proteins. Geomfinder allows detecting similar 3D patterns between any two pair of protein structures, regardless of the divergency among their amino acids sequences. Although the software is not intended for simultaneous multiple comparisons in a large number of proteins, it can be particularly useful in cases such as the structure-based design of multitarget drugs, where a detailed analysis of 3D patterns similarities between a few selected protein targets is essential.
Exploiting genomic data to identify proteins involved in abalone reproduction.
Mendoza-Porras, Omar; Botwright, Natasha A; McWilliam, Sean M; Cook, Mathew T; Harris, James O; Wijffels, Gene; Colgrave, Michelle L
2014-08-28
Aside from their critical role in reproduction, abalone gonads serve as an indicator of sexual maturity and energy balance, two key considerations for effective abalone culture. Temperate abalone farmers face issues with tank restocking with highly marketable abalone owing to inefficient spawning induction methods. The identification of key proteins in sexually mature abalone will serve as the foundation for a greater understanding of reproductive biology. Addressing this knowledge gap is the first step towards improving abalone aquaculture methods. Proteomic profiling of female and male gonads of greenlip abalone, Haliotis laevigata, was undertaken using liquid chromatography-mass spectrometry. Owing to the incomplete nature of abalone protein databases, in addition to searching against two publicly available databases, a custom database comprising genomic data was used. Overall, 162 and 110 proteins were identified in females and males respectively with 40 proteins common to both sexes. For proteins involved in sexual maturation, sperm and egg structure, motility, acrosomal reaction and fertilization, 23 were identified only in females, 18 only in males and 6 were common. Gene ontology analysis revealed clear differences between the female and male protein profiles reflecting a higher rate of protein synthesis in the ovary and higher metabolic activity in the testis. A comprehensive mass spectrometry-based analysis was performed to profile the abalone gonad proteome providing the foundation for future studies of reproduction in abalone. Key proteins involved in both reproduction and energy balance were identified. Genomic resources were utilised to build a database of molluscan proteins yielding >60% more protein identifications than in a standard workflow employing public protein databases. Copyright © 2014 Elsevier B.V. All rights reserved.
Comparative Skeletal Muscle Proteomics Using Two-Dimensional Gel Electrophoresis
Murphy, Sandra; Dowling, Paul; Ohlendieck, Kay
2016-01-01
The pioneering work by Patrick H. O’Farrell established two-dimensional gel electrophoresis as one of the most important high-resolution protein separation techniques of modern biochemistry (Journal of Biological Chemistry 1975, 250, 4007–4021). The application of two-dimensional gel electrophoresis has played a key role in the systematic identification and detailed characterization of the protein constituents of skeletal muscles. Protein changes during myogenesis, muscle maturation, fibre type specification, physiological muscle adaptations and natural muscle aging were studied in depth by the original O’Farrell method or slightly modified gel electrophoretic techniques. Over the last 40 years, the combined usage of isoelectric focusing in the first dimension and sodium dodecyl sulfate polyacrylamide slab gel electrophoresis in the second dimension has been successfully employed in several hundred published studies on gel-based skeletal muscle biochemistry. This review focuses on normal and physiologically challenged skeletal muscle tissues and outlines key findings from mass spectrometry-based muscle proteomics, which was instrumental in the identification of several thousand individual protein isoforms following gel electrophoretic separation. These muscle-associated protein species belong to the diverse group of regulatory and contractile proteins of the acto-myosin apparatus that forms the sarcomere, cytoskeletal proteins, metabolic enzymes and transporters, signaling proteins, ion-handling proteins, molecular chaperones and extracellular matrix proteins. PMID:28248237
ARCPHdb: A comprehensive protein database for SF1 and SF2 helicase from archaea.
Moukhtar, Mirna; Chaar, Wafi; Abdel-Razzak, Ziad; Khalil, Mohamad; Taha, Samir; Chamieh, Hala
2017-01-01
Superfamily 1 and Superfamily 2 helicases, two of the largest helicase protein families, play vital roles in many biological processes including replication, transcription and translation. Study of helicase proteins in the model microorganisms of archaea have largely contributed to the understanding of their function, architecture and assembly. Based on a large phylogenomics approach, we have identified and classified all SF1 and SF2 protein families in ninety five sequenced archaea genomes. Here we developed an online webserver linked to a specialized protein database named ARCPHdb to provide access for SF1 and SF2 helicase families from archaea. ARCPHdb was implemented using MySQL relational database. Web interfaces were developed using Netbeans. Data were stored according to UniProt accession numbers, NCBI Ref Seq ID, PDB IDs and Entrez Databases. A user-friendly interactive web interface has been developed to browse, search and download archaeal helicase protein sequences, their available 3D structure models, and related documentation available in the literature provided by ARCPHdb. The database provides direct links to matching external databases. The ARCPHdb is the first online database to compile all protein information on SF1 and SF2 helicase from archaea in one platform. This database provides essential resource information for all researchers interested in the field. Copyright © 2016 Elsevier Ltd. All rights reserved.
High-Order Methods for Computational Physics
1999-03-01
computation is running in 278 Ronald D. Henderson parallel. Instead we use the concept of a voxel database (VDB) of geometric positions in the mesh [85...processor 0 Fig. 4.19. Connectivity and communications axe established by building a voxel database (VDB) of positions. A VDB maps each position to a...studies such as the highly accurate stability computations considered help expand the database for this benchmark problem. The two-dimensional linear
An affinity-structure database of helix-turn-helix: DNA complexes with a universal coordinate system
DOE Office of Scientific and Technical Information (OSTI.GOV)
AlQuraishi, Mohammed; Tang, Shengdong; Xia, Xide
Molecular interactions between proteins and DNA molecules underlie many cellular processes, including transcriptional regulation, chromosome replication, and nucleosome positioning. Computational analyses of protein-DNA interactions rely on experimental data characterizing known protein-DNA interactions structurally and biochemically. While many databases exist that contain either structural or biochemical data, few integrate these two data sources in a unified fashion. Such integration is becoming increasingly critical with the rapid growth of structural and biochemical data, and the emergence of algorithms that rely on the synthesis of multiple data types to derive computational models of molecular interactions. We have developed an integrated affinity-structure database inmore » which the experimental and quantitative DNA binding affinities of helix-turn-helix proteins are mapped onto the crystal structures of the corresponding protein-DNA complexes. This database provides access to: (i) protein-DNA structures, (ii) quantitative summaries of protein-DNA binding affinities using position weight matrices, and (iii) raw experimental data of protein-DNA binding instances. Critically, this database establishes a correspondence between experimental structural data and quantitative binding affinity data at the single basepair level. Furthermore, we present a novel alignment algorithm that structurally aligns the protein-DNA complexes in the database and creates a unified residue-level coordinate system for comparing the physico-chemical environments at the interface between complexes. Using this unified coordinate system, we compute the statistics of atomic interactions at the protein-DNA interface of helix-turn-helix proteins. We provide an interactive website for visualization, querying, and analyzing this database, and a downloadable version to facilitate programmatic analysis. Lastly, this database will facilitate the analysis of protein-DNA interactions and the development of programmatic computational methods that capitalize on integration of structural and biochemical datasets. The database can be accessed at http://ProteinDNA.hms.harvard.edu.« less
An affinity-structure database of helix-turn-helix: DNA complexes with a universal coordinate system
AlQuraishi, Mohammed; Tang, Shengdong; Xia, Xide
2015-11-19
Molecular interactions between proteins and DNA molecules underlie many cellular processes, including transcriptional regulation, chromosome replication, and nucleosome positioning. Computational analyses of protein-DNA interactions rely on experimental data characterizing known protein-DNA interactions structurally and biochemically. While many databases exist that contain either structural or biochemical data, few integrate these two data sources in a unified fashion. Such integration is becoming increasingly critical with the rapid growth of structural and biochemical data, and the emergence of algorithms that rely on the synthesis of multiple data types to derive computational models of molecular interactions. We have developed an integrated affinity-structure database inmore » which the experimental and quantitative DNA binding affinities of helix-turn-helix proteins are mapped onto the crystal structures of the corresponding protein-DNA complexes. This database provides access to: (i) protein-DNA structures, (ii) quantitative summaries of protein-DNA binding affinities using position weight matrices, and (iii) raw experimental data of protein-DNA binding instances. Critically, this database establishes a correspondence between experimental structural data and quantitative binding affinity data at the single basepair level. Furthermore, we present a novel alignment algorithm that structurally aligns the protein-DNA complexes in the database and creates a unified residue-level coordinate system for comparing the physico-chemical environments at the interface between complexes. Using this unified coordinate system, we compute the statistics of atomic interactions at the protein-DNA interface of helix-turn-helix proteins. We provide an interactive website for visualization, querying, and analyzing this database, and a downloadable version to facilitate programmatic analysis. Lastly, this database will facilitate the analysis of protein-DNA interactions and the development of programmatic computational methods that capitalize on integration of structural and biochemical datasets. The database can be accessed at http://ProteinDNA.hms.harvard.edu.« less
Chekmarev, Sergei F
2013-10-14
Using the Helmholtz decomposition of the vector field of folding fluxes in a two-dimensional space of collective variables, a potential of the driving force for protein folding is introduced. The potential has two components. One component is responsible for the source and sink of the folding flows, which represent respectively, the unfolded states and the native state of the protein, and the other, which accounts for the flow vorticity inherently generated at the periphery of the flow field, is responsible for the canalization of the flow between the source and sink. The theoretical consideration is illustrated by calculations for a model β-hairpin protein.
Gao, Mingxia; Zhang, Peng; Hong, Guangfeng; Guan, Xia; Yan, Guoquan; Deng, Chunhui; Zhang, Xiangmin
2009-10-30
In this work, a novel and facile monolithic enzymatic microreactor was prepared in the fused-silica capillary via a two-step procedure including surface acryloylation and in situ aqueous polymerization/immobilization to encapsulate a single enzyme, and its application to fast protein digestion through a direct matrix-assisted laser desorption/ionization time-of-flight mass spectrometer (MALDI-TOF-MS) analysis was demonstrated. At first, vinyl groups on the protein surface were generated by a mild acryloylation with N-acryloxysuccinimide in alkali buffer. Then, acryloylated enzyme was encapsulated into polyacrylates by free-radical copolymerization with acrylamide as the monomer, N,N'-methylenebisacrylamide as the cross-linker, and N,N,N',N'-tetramethylethylenediamine/ammonium persulfate as the initiator. Finally, polymers were immobilized onto the activated inner wall of capillaries via the reaction of vinyl groups. Capability of the enzyme-immobilized monolithic microreactor was demonstrated by myoglobin and bovine serum albumin as model proteins. The digestion products were characterized using MALDI-TOF-MS with sequence coverage of 94% and 29% observed. This microreactor was also applied to the analysis of fractions through two-dimensional separation of weak anion exchange/reversed-phase liquid chromatography of human liver extract. After a database search, 16 unique peptides corresponding to 3 proteins were identified when two RPLC fractions of human liver extract were digested by the microreactor. This opens a route for its future application in top-down proteomic analysis.
Frey, W; Brink, J; Schief, W R; Chiu, W; Vogel, V
1998-01-01
Coordination of individual histidine residues located on a protein surface to metal-chelated lipid monolayers is a potentially general method for crystallizing proteins in two dimensions. It was shown recently by Brewster angle microscopy (BAM) that the model protein streptavidin binds via its surface histidines to Cu-DOIDA lipid monolayers, and aggregates into regularly shaped domains that have the appearance of crystals. We have used electron microscopy to confirm that the domains are indeed crystalline with lattice parameters similar to those of the same protein crystallized beneath biotinylated lipid monolayers. Although BAM demonstrates that the two-dimensional protein crystals grown via metal chelation are distinct from the biotin-bound crystals in both microscopic shape and thermodynamic behavior, the two crystal types show similar density projections and the same plane group symmetry. PMID:9591691
Petti, Megan K; Lomont, Justin P; Maj, Michał; Zanni, Martin T
2018-02-15
Two-dimensional spectroscopy is a powerful tool for extracting structural and dynamic information from a wide range of chemical systems. We provide a brief overview of the ways in which two-dimensional visible and infrared spectroscopies are being applied to elucidate fundamental details of important processes in biological and materials science. The topics covered include amyloid proteins, photosynthetic complexes, ion channels, photovoltaics, batteries, as well as a variety of promising new methods in two-dimensional spectroscopy.
Identifying relevant data for a biological database: handcrafted rules versus machine learning.
Sehgal, Aditya Kumar; Das, Sanmay; Noto, Keith; Saier, Milton H; Elkan, Charles
2011-01-01
With well over 1,000 specialized biological databases in use today, the task of automatically identifying novel, relevant data for such databases is increasingly important. In this paper, we describe practical machine learning approaches for identifying MEDLINE documents and Swiss-Prot/TrEMBL protein records, for incorporation into a specialized biological database of transport proteins named TCDB. We show that both learning approaches outperform rules created by hand by a human expert. As one of the first case studies involving two different approaches to updating a deployed database, both the methods compared and the results will be of interest to curators of many specialized databases.
The 2015 Nucleic Acids Research Database Issue and molecular biology database collection.
Galperin, Michael Y; Rigden, Daniel J; Fernández-Suárez, Xosé M
2015-01-01
The 2015 Nucleic Acids Research Database Issue contains 172 papers that include descriptions of 56 new molecular biology databases, and updates on 115 databases whose descriptions have been previously published in NAR or other journals. Following the classification that has been introduced last year in order to simplify navigation of the entire issue, these articles are divided into eight subject categories. This year's highlights include RNAcentral, an international community portal to various databases on noncoding RNA; ValidatorDB, a validation database for protein structures and their ligands; SASBDB, a primary repository for small-angle scattering data of various macromolecular complexes; MoonProt, a database of 'moonlighting' proteins, and two new databases of protein-protein and other macromolecular complexes, ComPPI and the Complex Portal. This issue also includes an unusually high number of cancer-related databases and other databases dedicated to genomic basics of disease and potential drugs and drug targets. The size of NAR online Molecular Biology Database Collection, http://www.oxfordjournals.org/nar/database/a/, remained approximately the same, following the addition of 74 new resources and removal of 77 obsolete web sites. The entire Database Issue is freely available online on the Nucleic Acids Research web site (http://nar.oxfordjournals.org/). Published by Oxford University Press on behalf of Nucleic Acids Research 2014. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Li, Lei; Nelson, Clark J.; Solheim, Cory; Whelan, James; Millar, A. Harvey
2012-01-01
The growth and development of plant tissues is associated with an ordered succession of cellular processes that are reflected in the appearance and disappearance of proteins. The control of the kinetics of protein turnover is central to how plants can rapidly and specifically alter protein abundance and thus molecular function in response to environmental or developmental cues. However, the processes of turnover are largely hidden during periods of apparent steady-state protein abundance, and even when proteins accumulate it is unclear whether enhanced synthesis or decreased degradation is responsible. We have used a 15N labeling strategy with inorganic nitrogen sources coupled to a two-dimensional fluorescence difference gel electrophoresis and mass spectrometry analysis of two-dimensional IEF/SDS-PAGE gel spots to define the rate of protein synthesis (KS) and degradation (KD) of Arabidopsis cell culture proteins. Through analysis of MALDI-TOF/TOF mass spectra from 120 protein spots, we were able to quantify KS and KD for 84 proteins across six functional groups and observe over 65-fold variation in protein degradation rates. KS and KD correlate with functional roles of the proteins in the cell and the time in the cell culture cycle. This approach is based on progressive 15N labeling that is innocuous for the plant cells and, because it can be used to target analysis of proteins through the use of specific gel spots, it has broad applicability. PMID:22215636
Tamilvanan, Thangaraju; Hopper, Waheeta
2014-01-01
Yersinia pestis, a Gram negative bacillus, spreads via lymphatic to lymph nodes and to all organs through the bloodstream, causing plague. Yersinia outer protein H (YopH) is one of the important effector proteins, which paralyzes lymphocytes and macrophages by dephosphorylating critical tyrosine kinases and signal transduction molecules. The purpose of the study is to generate a three-dimensional (3D) pharmacophore model by using diverse sets of YopH inhibitors, which would be useful for designing of potential antitoxin. In this study, we have selected 60 biologically active inhibitors of YopH to perform Ligand based pharmacophore study to elucidate the important structural features responsible for biological activity. Pharmacophore model demonstrated the importance of two acceptors, one hydrophobic and two aromatic features toward the biological activity. Based on these features, different databases were screened to identify novel compounds and these ligands were subjected for docking, ADME properties and Binding energy prediction. Post docking validation was performed using molecular dynamics simulation for selected ligands to calculate the Root Mean Square Deviation (RMSD) and Root Mean Square Fluctuation (RMSF). The ligands, ASN03270114, Mol_252138, Mol_31073 and ZINC04237078 may act as inhibitors against YopH of Y. pestis.
The fate of b-ions in the two worlds of collision-induced dissociation.
Waldera-Lupa, Daniel M; Stefanski, Anja; Meyer, Helmut E; Stühler, Kai
2013-12-01
Fragment analysis of proteins and peptides by mass spectrometry using collision-induced dissociation (CID) revealed that the pairwise generated N-terminal b- and C-terminal y-ions have different stabilities resulting in underrepresentation of b-ions. Detailed analyses of large-scale spectra databases and synthetic peptides underlined these observations and additionally showed that the fragmentation pattern depends on utilized CID regime. To investigate this underrepresentation further we systematically compared resonant excitation energy and beam-type CID facilitated on different mass spectrometer platforms: (i) quadrupole time-of-flight, (ii) linear ion trap and (iii) three-dimensional ion trap. Detailed analysis of MS/MS data from a standard tryptic protein digest revealed that b-ions are significantly underrepresented on all investigated mass spectrometers. By N-terminal acetylation of tryptic peptides we show for the first time that b-ion cyclization reaction significantly contributes to b-ion underrepresentation even on ion trap instruments and accounts for at most 16% of b-ion loss. © 2013.
Huang, Hung-Jen; Chen, Wei-Yu; Wu, Jer-Horng
2014-01-01
Protein recovery is crucial for shotgun metaproteomics to study the in situ functionality of microbial populations from complex biofilms but still poorly addressed by far. To fill this knowledge gap, we systematically evaluated the sample preparation with extraction buffers comprising four detergents for the metaproteomics analysis of a terephthalate-degrading methanogenic biofilm using an on-line two-dimensional liquid chromatography tandem mass spectrometry (2D-LC-MS/MS) system. Totally, 1018 non-repeated proteins were identified with the four treatments. On the whole, each treatment could recover the biofilm proteins with specific distributions of molecular weight, hydrophobicity, and isoelectric point. The extraction buffers containing zwitterionic and anionic detergents were found to harvest the proteins with better efficiency and quality, allowing identification up to 76.2% of total identified proteins with the LC-MS/MS analysis. According to the annotation with a relevant metagenomic database, we further observed different taxonomic profiles of bacterial and archaeal members and discriminable patterns of the functional expression among the extraction buffers used. Overall, the finding of the present study provides first insight to the effect of the detergents on the characteristics of extractable proteins from biofilm and the developed protocol combined with nano 2D-LC/MS/MS analysis can improve the metaproteomics studies on microbial functionality of biofilms in the wastewater treatment systems. PMID:24914765
Cassava root membrane proteome reveals activities during storage root maturation.
Naconsie, Maliwan; Lertpanyasampatha, Manassawe; Viboonjun, Unchera; Netrphan, Supatcharee; Kuwano, Masayoshi; Ogasawara, Naotake; Narangajavana, Jarunya
2016-01-01
Cassava (Manihot esculenta Crantz) is one of the most important crops of Thailand. Its storage roots are used as food, feed, starch production, and be the important source for biofuel and biodegradable plastic production. Despite the importance of cassava storage roots, little is known about the mechanisms involved in their formation. This present study has focused on comparison of the expression profiles of cassava root proteome at various developmental stages using two-dimensional gel electrophoresis and LC-MS/MS. Based on an anatomical study using Toluidine Blue, the secondary growth was confirmed to be essential during the development of cassava storage root. To investigate biochemical processes occurring during storage root maturation, soluble and membrane proteins were isolated from storage roots harvested from 3-, 6-, 9-, and 12-month-old cassava plants. The proteins with differential expression pattern were analysed and identified to be associated with 8 functional groups: protein folding and degradation, energy, metabolism, secondary metabolism, stress response, transport facilitation, cytoskeleton, and unclassified function. The expression profiling of membrane proteins revealed the proteins involved in protein folding and degradation, energy, and cell structure were highly expressed during early stages of development. Integration of these data along with the information available in genome and transcriptome databases is critical to expand knowledge obtained solely from the field of proteomics. Possible role of identified proteins were discussed in relation with the activities during storage root maturation in cassava.
Ekman, Martin; Tollbäck, Petter; Bergman, Birgitta
2008-01-01
Cyanobacteria are able to form stable nitrogen-fixing symbioses with diverse eukaryotes. To extend our understanding of adaptations imposed by plant hosts, two-dimensional gel electrophoresis and mass spectrometry (MS) were used for comparative protein expression profiling of a cyanobacterium (cyanobiont) dwelling in leaf cavities of the water-fern Azolla filiculoides. Homology-based protein identification using peptide mass fingerprinting [matrix-assisted laser desorption ionization-time of flight (MALDI-TOF-MS)], tandem MS analyses, and sequence homology searches resulted in an identification success rate of 79% of proteins analysed in the unsequenced cyanobiont. Compared with a free-living strain, processes related to energy production, nitrogen and carbon metabolism, and stress-related functions were up-regulated in the cyanobiont while photosynthesis and metabolic turnover rates were down-regulated, stressing a slow heterotrophic mode of growth, as well as high heterocyst frequencies and nitrogen-fixing capacities. The first molecular data set on the nature of the NifH post-translational modification in cyanobacteria was also obtained: peptide mass spectra of the protein demonstrated the presence of a 300-400 Da protein modification localized to a specific 13 amino acid sequence, within the part of the protein that is ADP-ribosylated in other bacteria and close to the active site of nitrogenase. Furthermore, the distribution of the highest scoring database hits for the identified proteins points to the possibility of using proteomic data in taxonomy.
Trisiriroj, Arunee; Jeyachok, Narumon; Chen, Shui-Tein
2004-07-01
Proteomic approach is applied for the analysis of seed brans of 14 rice varieties (Oryza sativa L. ssp. indica) which can classify to five aromatic rice and nine nonaromatic rice. The two-dimensional electrophoresis (2-DE) protein patterns for 14 rice varieties were similar within pH ranges of 3-10 and 4-7. To characterize aromatic group-specific proteins, we compared 2-D gels of aromatic rice to nonaromatic rice using PDQUEST image analysis. Four out of six differential spots were identified as hypothetical proteins, but one (SSP 7003) was identified by matrix assisted laser desoption/ionization-quardrupole-time of fight (MALDI-Q-TOF) as prolamin with three matching peptides based on NCBI database. Prolamin is a class of storage proteins with three different polypeptides of 10, 13, and 16 kDa. Spot SSP7003 was identified as a 13 kDa polypeptide of prolamin by combination of mass spectroscopy and N-terminal sequence analyses. In contrast, one sulfur-rich 16 kDa polypeptide of prolamin was found in extremely high intensity in brans of deep-water rice compared to nondeep-water rice. Our results suggest that proteomics is a powerful step to open the way for the identification of rice varieties.
A database of whole-body action videos for the study of action, emotion, and untrustworthiness.
Keefe, Bruce D; Villing, Matthias; Racey, Chris; Strong, Samantha L; Wincenciak, Joanna; Barraclough, Nick E
2014-12-01
We present a database of high-definition (HD) videos for the study of traits inferred from whole-body actions. Twenty-nine actors (19 female) were filmed performing different actions-walking, picking up a box, putting down a box, jumping, sitting down, and standing and acting-while conveying different traits, including four emotions (anger, fear, happiness, sadness), untrustworthiness, and neutral, where no specific trait was conveyed. For the actions conveying the four emotions and untrustworthiness, the actions were filmed multiple times, with the actor conveying the traits with different levels of intensity. In total, we made 2,783 action videos (in both two-dimensional and three-dimensional format), each lasting 7 s with a frame rate of 50 fps. All videos were filmed in a green-screen studio in order to isolate the action information from all contextual detail and to provide a flexible stimulus set for future use. In order to validate the traits conveyed by each action, we asked participants to rate each of the actions corresponding to the trait that the actor portrayed in the two-dimensional videos. To provide a useful database of stimuli of multiple actions conveying multiple traits, each video name contains information on the gender of the actor, the action executed, the trait conveyed, and the rating of its perceived intensity. All videos can be downloaded free at the following address: http://www-users.york.ac.uk/~neb506/databases.html. We discuss potential uses for the database in the analysis of the perception of whole-body actions.
Rabilloud, Thierry; Adessi, C.; Giraudel, A.; Lunardi, J.
2007-01-01
Summary We have carried out the separation of sparingly-soluble (membrane and nuclear) proteins by high resolution two-dimensional electrophoresis. IEF with immobilized pH gradients leads to severe quantitative losses of proteins in the resulting 2-D map, although the resolution is usually kept high. We therefore tried to improve the solubility of proteins in this technique, by using denaturing cocktails containing various detergents and chaotropes. Best results were obtained by using a denaturing solution containing urea, thiourea, and detergents (both nonionic and zwitterionic). The usefulness of thiourea-containing denaturing mixtures are shown in this article on several models including microsomal and nuclear proteins and on tubulin, a protein highly prone to aggregation. PMID:9150907
Simultaneous nano-tracking of multiple motor proteins via spectral discrimination of quantum dots.
Kakizuka, Taishi; Ikezaki, Keigo; Kaneshiro, Junichi; Fujita, Hideaki; Watanabe, Tomonobu M; Ichimura, Taro
2016-07-01
Simultaneous nanometric tracking of multiple motor proteins was achieved by combining multicolor fluorescent labeling of target proteins and imaging spectroscopy, revealing dynamic behaviors of multiple motor proteins at the sub-diffraction-limit scale. Using quantum dot probes of distinct colors, we experimentally verified the localization precision to be a few nanometers at temporal resolution of 30 ms or faster. One-dimensional processive movement of two heads of a single myosin molecule and multiple myosin molecules was successfully traced. Furthermore, the system was modified for two-dimensional measurement and applied to tracking of multiple myosin molecules. Our approach is useful for investigating cooperative movement of proteins in supramolecular nanomachinery.
Simultaneous nano-tracking of multiple motor proteins via spectral discrimination of quantum dots
Kakizuka, Taishi; Ikezaki, Keigo; Kaneshiro, Junichi; Fujita, Hideaki; Watanabe, Tomonobu M.; Ichimura, Taro
2016-01-01
Simultaneous nanometric tracking of multiple motor proteins was achieved by combining multicolor fluorescent labeling of target proteins and imaging spectroscopy, revealing dynamic behaviors of multiple motor proteins at the sub-diffraction-limit scale. Using quantum dot probes of distinct colors, we experimentally verified the localization precision to be a few nanometers at temporal resolution of 30 ms or faster. One-dimensional processive movement of two heads of a single myosin molecule and multiple myosin molecules was successfully traced. Furthermore, the system was modified for two-dimensional measurement and applied to tracking of multiple myosin molecules. Our approach is useful for investigating cooperative movement of proteins in supramolecular nanomachinery. PMID:27446684
NASA Astrophysics Data System (ADS)
Thanh Tran, The; Phan, Van Chi
2010-03-01
In this work, we present results of membrane proteome profiling from mouse liver tissues using a gel-based approach in combination with 2DnanoLC-Q-TOF-MS/MS. Following purification of the membrane fraction, SDS-PAGE was carried out as a useful separation step. After staining, gels with protein bands were cut, reduced, alkylated and trypsin-digested. The peptide mixtures extracted from each gel slice were fractionated by two-dimensional nano liquid chromatography (2DnanoLC) coupled online with tandem mass spectrometry analysis (NanoESI-Q-TOF-MS/MS). The proteins were identified by MASCOT search against a mouse protein database using a peptide and fragment mass tolerance of ±0.5 Da. Protein identification was carried out using a Mowse scoring algorithm with a confidence level of 95% and processed by MSQuant v1.5 software for further validation. In total, 318 verified membrane proteins from mouse liver tissues were identified; 66.67% of them (212 proteins) contained at least one or more transmembrane domains predicted by the SOSUI program and 43 were found to be unique microsome membranes. Furthermore, GRAVY values of membrane proteins varied in the range -1.1276 to 0.9016 and only 31 (9.76%) membrane proteins had positive values. The functions and subcellular locations of the identified proteins were categorized as well, according to universal GO annotations.
Functional Interaction Network Construction and Analysis for Disease Discovery.
Wu, Guanming; Haw, Robin
2017-01-01
Network-based approaches project seemingly unrelated genes or proteins onto a large-scale network context, therefore providing a holistic visualization and analysis platform for genomic data generated from high-throughput experiments, reducing the dimensionality of data via using network modules and increasing the statistic analysis power. Based on the Reactome database, the most popular and comprehensive open-source biological pathway knowledgebase, we have developed a highly reliable protein functional interaction network covering around 60 % of total human genes and an app called ReactomeFIViz for Cytoscape, the most popular biological network visualization and analysis platform. In this chapter, we describe the detailed procedures on how this functional interaction network is constructed by integrating multiple external data sources, extracting functional interactions from human curated pathway databases, building a machine learning classifier called a Naïve Bayesian Classifier, predicting interactions based on the trained Naïve Bayesian Classifier, and finally constructing the functional interaction database. We also provide an example on how to use ReactomeFIViz for performing network-based data analysis for a list of genes.
Neumann, Sindy; Hartmann, Holger; Martin-Galiano, Antonio J; Fuchs, Angelika; Frishman, Dmitrij
2012-03-01
Structural bioinformatics of membrane proteins is still in its infancy, and the picture of their fold space is only beginning to emerge. Because only a handful of three-dimensional structures are available, sequence comparison and structure prediction remain the main tools for investigating sequence-structure relationships in membrane protein families. Here we present a comprehensive analysis of the structural families corresponding to α-helical membrane proteins with at least three transmembrane helices. The new version of our CAMPS database (CAMPS 2.0) covers nearly 1300 eukaryotic, prokaryotic, and viral genomes. Using an advanced classification procedure, which is based on high-order hidden Markov models and considers both sequence similarity as well as the number of transmembrane helices and loop lengths, we identified 1353 structurally homogeneous clusters roughly corresponding to membrane protein folds. Only 53 clusters are associated with experimentally determined three-dimensional structures, and for these clusters CAMPS is in reasonable agreement with structure-based classification approaches such as SCOP and CATH. We therefore estimate that ∼1300 structures would need to be determined to provide a sufficient structural coverage of polytopic membrane proteins. CAMPS 2.0 is available at http://webclu.bio.wzw.tum.de/CAMPS2.0/. Copyright © 2011 Wiley Periodicals, Inc.
Schuemie, Martijn J; Mons, Barend; Weeber, Marc; Kors, Jan A
2007-06-01
Gene and protein name identification in text requires a dictionary approach to relate synonyms to the same gene or protein, and to link names to external databases. However, existing dictionaries are incomplete. We investigate two complementary methods for automatic generation of a comprehensive dictionary: combination of information from existing gene and protein databases and rule-based generation of spelling variations. Both methods have been reported in literature before, but have hitherto not been combined and evaluated systematically. We combined gene and protein names from several existing databases of four different organisms. The combined dictionaries showed a substantial increase in recall on three different test sets, as compared to any single database. Application of 23 spelling variation rules to the combined dictionaries further increased recall. However, many rules appeared to have no effect and some appear to have a detrimental effect on precision.
Hufnagel, P.; Glandorf, J.; Körting, G.; Jabs, W.; Schweiger-Hufnagel, U.; Hahner, S.; Lubeck, M.; Suckau, D.
2007-01-01
Analysis of complex proteomes often results in long protein lists, but falls short in measuring the validity of identification and quantification results on a greater number of proteins. Biological and technical replicates are mandatory, as is the combination of the MS data from various workflows (gels, 1D-LC, 2D-LC), instruments (TOF/TOF, trap, qTOF or FTMS), and search engines. We describe a database-driven study that combines two workflows, two mass spectrometers, and four search engines with protein identification following a decoy database strategy. The sample was a tryptically digested lysate (10,000 cells) of a human colorectal cancer cell line. Data from two LC-MALDI-TOF/TOF runs and a 2D-LC-ESI-trap run using capillary and nano-LC columns were submitted to the proteomics software platform ProteinScape. The combined MALDI data and the ESI data were searched using Mascot (Matrix Science), Phenyx (GeneBio), ProteinSolver (Bruker and Protagen), and Sequest (Thermo) against a decoy database generated from IPI-human in order to obtain one protein list across all workflows and search engines at a defined maximum false-positive rate of 5%. ProteinScape combined the data to one LC-MALDI and one LC-ESI dataset. The initial separate searches from the two combined datasets generated eight independent peptide lists. These were compiled into an integrated protein list using the ProteinExtractor algorithm. An initial evaluation of the generated data led to the identification of approximately 1200 proteins. Result integration on a peptide level allowed discrimination of protein isoforms that would not have been possible with a mere combination of protein lists.
A brief review of other notable protein detection methods on acrylamide gels.
Kurien, Biji T; Scofield, R Hal
2012-01-01
Several methods have been described to stain proteins analyzed on acrylamide gels. These include ultrasensitive protein detection in one-dimensional and two-dimensional gel electrophoresis using a fluorescent product from the fungus Epicoccum nigrum; a fluorescence-based Coomassie Blue protein staining; visualization of proteins in acrylamide gels using ultraviolet illumination; fluorescence visualization of proteins in sodium dodecyl sulfate-polyacrylamide gels using environmentally benign, nonfixative, saline solution; and increasing the sensitivity four- to sixfold for detecting trace proteins in dye or silver stained polyacrylamide gels using polyethylene glycol 6000. All these methods are reviewed briefly in this chapter.
MultitaskProtDB: a database of multitasking proteins.
Hernández, Sergio; Ferragut, Gabriela; Amela, Isaac; Perez-Pons, JosepAntoni; Piñol, Jaume; Mozo-Villarias, Angel; Cedano, Juan; Querol, Enrique
2014-01-01
We have compiled MultitaskProtDB, available online at http://wallace.uab.es/multitask, to provide a repository where the many multitasking proteins found in the literature can be stored. Multitasking or moonlighting is the capability of some proteins to execute two or more biological functions. Usually, multitasking proteins are experimentally revealed by serendipity. This ability of proteins to perform multitasking functions helps us to understand one of the ways used by cells to perform many complex functions with a limited number of genes. Even so, the study of this phenomenon is complex because, among other things, there is no database of moonlighting proteins. The existence of such a tool facilitates the collection and dissemination of these important data. This work reports the database, MultitaskProtDB, which is designed as a friendly user web page containing >288 multitasking proteins with their NCBI and UniProt accession numbers, canonical and additional biological functions, monomeric/oligomeric states, PDB codes when available and bibliographic references. This database also serves to gain insight into some characteristics of multitasking proteins such as frequencies of the different pairs of functions, phylogenetic conservation and so forth.
Towards an understanding of wheat chloroplasts: a methodical investigation of thylakoid proteome.
Kamal, Abu Hena Mostafa; Cho, Kun; Komatsu, Setsuko; Uozumi, Nobuyuki; Choi, Jong-Soon; Woo, Sun Hee
2012-05-01
We utilized Percoll density gradient centrifugation to isolate and fractionate chloroplasts of Korean winter wheat cultivar cv. Kumgang (Triticum aestivum L.). The resulting protein fractions were separated by one dimensional polyacrylamide gel electrophoresis (1D-PAGE) coupled with LTQ-FTICR mass spectrometry. This enabled us to detect and identify 767 unique proteins. Our findings represent the most comprehensive exploration of a proteome to date. Based on annotation information from the UniProtKB/Swiss-Prot database and our analyses via WoLF PSORT and PSORT, these proteins are localized in the chloroplast (607 proteins), chloroplast stroma (145), thylakoid membrane (342), lumens (163), and integral membranes (166). In all, 67% were confirmed as chloroplast thylakoid proteins. Although nearly complete protein coverage (89% proteins) has been accomplished for the key chloroplast pathways in wheat, such as for photosynthesis, many other proteins are involved in regulating carbon metabolism. The identified proteins were assigned to 103 functional categories according to a classification system developed by the iProClass database and provided through Protein Information Resources. Those functions include electron transport, energy, cellular organization and biogenesis, transport, stress responses, and other metabolic processes. Whereas most of these proteins are associated with known complexes and metabolic pathways, about 13% of the proteins have unknown functions. The chloroplast proteome contains many proteins that are localized to the thylakoids but as yet have no known function. We propose that some of these familiar proteins participate in the photosynthetic pathway. Thus, our new and comprehensive protein profile may provide clues for better understanding that photosynthetic process in wheat.
Proteomic analysis of Toxocara canis excretory and secretory (TES) proteins.
Sperotto, Rita Leal; Kremer, Frederico Schmitt; Aires Berne, Maria Elisabeth; Costa de Avila, Luciana F; da Silva Pinto, Luciano; Monteiro, Karina Mariante; Caumo, Karin Silva; Ferreira, Henrique Bunselmeyer; Berne, Natália; Borsuk, Sibele
2017-01-01
Toxocariasis is a neglected disease, and its main etiological agent is the nematode Toxocara canis. Serological diagnosis is performed by an enzyme-linked immunosorbent assay using T. canis excretory and secretory (TES) antigens produced by in vitro cultivation of larvae. Identification of TES proteins can be useful for the development of new diagnostic strategies since few TES components have been described so far. Herein, we report the results obtained by proteomic analysis of TES proteins using a liquid chromatography-tandem mass spectrometry (LC-MS/MS) approach. TES fractions were separated by one-dimensional SDS-PAGE and analyzed by LC-MS/MS. The MS/MS spectra were compared with a database of protein sequences deduced from the genome sequence of T. canis, and a total of 19 proteins were identified. Classification according to the signal peptide prediction using the SignalP server showed that seven of the identified proteins were extracellular, 10 had cytoplasmic or nuclear localization, while the subcellular localization of two proteins was unknown. Analysis of molecular functions by BLAST2GO showed that the majority of the gene ontology (GO) terms associated with the proteins present in the TES sample were associated with binding functions, including but not limited to protein binding (GO:0005515), inorganic ion binding (GO:0043167), and organic cyclic compound binding (GO:0097159). This study provides additional information about the exoproteome of T. canis, which can lead to the development of new strategies for diagnostics or vaccination. Copyright © 2016 Elsevier B.V. All rights reserved.
Jobim, M I M; Trein, C; Zirkler, H; Gregory, R M; Sieme, H; Mattos, R C
2011-09-01
The objective was to evaluate protein profiles of equine seminal plasma using two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) and to determine whether any of these proteins were related to semen freezability. Seminal plasma was collected from 10 stallions, of high and low semen freezability, housed at the State Stud of Lower Saxony, and routinely used in AI programs. Twenty-five protein spots were identified from the two-dimensional gel (12%), seven of which were present in all samples (all proteins were identified by MALDI-MS). Matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS) has been used to generate ion images of samples in one or more mass-to-charge (m/z) values, providing the capability of mapping specific molecules to two-dimensional coordinates of the original sample. Of the 25 proteins identified, two spots had greater relative content (P < 0.05) in seminal plasma samples collected from stallions with high semen freezability: spot 5 (80-85 kDa, isoelectric point [pI] 7.54), identified as CRISP-3; and spot 45 (18.2 kDa, pI 5.0-5.2), identified as HSP-2. Conversely, protein content was greater (P < 0.05) in seminal plasma samples from stallions with low semen freezability: spot 7 (75.4 kDa, pI 6.9-7.4), identified as lactoferrin; spot 15 (26.7 kDa, pI 5.51), identified as kallikrein; spot 25 (25 kDa, pI 7.54), identified as CRISP-3; and spot 35 (13.9 kDa, pI 3.8-4.2), identified as HSP-1. In conclusion, there were differences in the seminal plasma protein profile from stallions with high and low semen freezability. Furthermore, CRISP-3 and HSP-2 were potential seminal plasma markers of high semen freezability. Copyright © 2011 Elsevier Inc. All rights reserved.
Dufoo-Hurtado, Miguel D.; Huerta-Ocampo, José Á.; Barrera-Pacheco, Alberto; Barba de la Rosa, Ana P.; Mercado-Silva, Edmundo M.
2015-01-01
Low-temperature conditioning of garlic “seed” cloves substitutes the initial climatic requirements of the crop and accelerates the cycle. We have reported that “seed” bulbs from “Coreano” variety conditioned at 5°C for 5 weeks reduces growth and plant weight as well as the crop yields and increases the synthesis of phenolic compounds and anthocyanins. Therefore, this treatment suggests a cold stress. Plant acclimation to stress is associated with deep changes in proteome composition. Since proteins are directly involved in plant stress response, proteomics studies can significantly contribute to unravel the possible relationships between protein abundance and plant stress acclimation. The aim of this work was to study the changes in the protein profiles of garlic “seed” cloves subjected to conditioning at low-temperature using proteomics approach. Two sets of garlic bulbs were used, one set was stored at room temperature (23°C), and the other was conditioned at low temperature (5°C) for 5 weeks. Total soluble proteins were extracted from sprouts of cloves and separated by two-dimensional gel electrophoresis. Protein spots showing statistically significant changes in abundance were analyzed by LC-ESI-MS/MS and identified by database search analysis using the Mascot search engine. The results revealed that low-temperature conditioning of garlic “seed” cloves causes alterations in the accumulation of proteins involved in different physiological processes such as cellular growth, antioxidative/oxidative state, macromolecules transport, protein folding and transcription regulation process. The metabolic pathways affected include protein biosynthesis and quality control system, photosynthesis, photorespiration, energy production, and carbohydrate and nucleotide metabolism. These processes can work cooperatively to establish a new cellular homeostasis that might be related with the physiological and biochemical changes observed in previous studies. PMID:26029231
Dufoo-Hurtado, Miguel D; Huerta-Ocampo, José Á; Barrera-Pacheco, Alberto; Barba de la Rosa, Ana P; Mercado-Silva, Edmundo M
2015-01-01
Low-temperature conditioning of garlic "seed" cloves substitutes the initial climatic requirements of the crop and accelerates the cycle. We have reported that "seed" bulbs from "Coreano" variety conditioned at 5°C for 5 weeks reduces growth and plant weight as well as the crop yields and increases the synthesis of phenolic compounds and anthocyanins. Therefore, this treatment suggests a cold stress. Plant acclimation to stress is associated with deep changes in proteome composition. Since proteins are directly involved in plant stress response, proteomics studies can significantly contribute to unravel the possible relationships between protein abundance and plant stress acclimation. The aim of this work was to study the changes in the protein profiles of garlic "seed" cloves subjected to conditioning at low-temperature using proteomics approach. Two sets of garlic bulbs were used, one set was stored at room temperature (23°C), and the other was conditioned at low temperature (5°C) for 5 weeks. Total soluble proteins were extracted from sprouts of cloves and separated by two-dimensional gel electrophoresis. Protein spots showing statistically significant changes in abundance were analyzed by LC-ESI-MS/MS and identified by database search analysis using the Mascot search engine. The results revealed that low-temperature conditioning of garlic "seed" cloves causes alterations in the accumulation of proteins involved in different physiological processes such as cellular growth, antioxidative/oxidative state, macromolecules transport, protein folding and transcription regulation process. The metabolic pathways affected include protein biosynthesis and quality control system, photosynthesis, photorespiration, energy production, and carbohydrate and nucleotide metabolism. These processes can work cooperatively to establish a new cellular homeostasis that might be related with the physiological and biochemical changes observed in previous studies.
Zhang, Y X; Yu, D; Tian, X L; Liu, C Y; Gai, S P; Zheng, G S
2015-01-01
Endo-dormant flower buds of tree peony must have sufficient chilling duration to reinitiate growth, which is a major obstacle to the forcing culture of tree peony in winter. We used a combination of two-dimensional gel electrophoresis (2-DE) and matrix-assisted laser desorption/ionisation time of flight/time of flight mass spectrometry (MALDI-TOF/TOF MS) to identify the differentially expressed proteins of tree peony after three different chilling treatments: endo-dormancy, endo-dormancy release and eco-dormancy stages. More than 200 highly reproducible protein spots were detected, and 31 differentially expressed spots (P < 0.05) were selected for further analysis. Finally, 20 protein spots were confidently identified from databases, which were annotated and classified into seven functional categories: response to abiotic or biotic stimulus (four), metabolic processes (four), other binding (three), transcription or transcription regulation (two), biological processes (one), cell biogenesis (one) and unclassified (five). The results of qPCR of five genes were mainly consistent with that of the protein accumulation analysis as determined by 2-DE. This indicated that most of these genes were mainly regulated at transcriptional level. The activity of nitrate reductase and pyruvate dehydrogenase E1 was consistent with the 2-DE results. The proteomic profiles indicated activation of citrate cycle, amino acid metabolism, lipid metabolism, energy production, calcium signalling and cell growth processes by chilling fulfilment to facilitate dormancy release in tree peony. Analysis of functions of identified proteins will increase our knowledge of endo-dormancy release in tree peony. © 2014 German Botanical Society and The Royal Botanical Society of the Netherlands.
Identification of proteins of altered abundance in oil palm infected with Ganoderma boninense.
Al-Obaidi, Jameel R; Mohd-Yusuf, Yusmin; Razali, Nurhanani; Jayapalan, Jaime Jacqueline; Tey, Chin-Chong; Md-Noh, Normahnani; Junit, Sarni Mat; Othman, Rofina Yasmin; Hashim, Onn Haji
2014-03-24
Basal stem rot is a common disease that affects oil palm, causing loss of yield and finally killing the trees. The disease, caused by fungus Ganoderma boninense, devastates thousands of hectares of oil palm plantings in Southeast Asia every year. In the present study, root proteins of healthy oil palm seedlings, and those infected with G. boninense, were analyzed by 2-dimensional gel electrophoresis (2-DE). When the 2-DE profiles were analyzed for proteins, which exhibit consistent significant change of abundance upon infection with G. boninense, 21 passed our screening criteria. Subsequent analyses by mass spectrometry and database search identified caffeoyl-CoA O-methyltransferase, caffeic acid O-methyltransferase, enolase, fructokinase, cysteine synthase, malate dehydrogenase, and ATP synthase as among proteins of which abundances were markedly altered.
Identification of Proteins of Altered Abundance in Oil Palm Infected with Ganoderma boninense
Al-Obaidi, Jameel R.; Mohd-Yusuf, Yusmin; Razali, Nurhanani; Jayapalan, Jaime Jacqueline; Tey, Chin-Chong; Md-Noh, Normahnani; Junit, Sarni Mat; Othman, Rofina Yasmin; Hashim, Onn Haji
2014-01-01
Basal stem rot is a common disease that affects oil palm, causing loss of yield and finally killing the trees. The disease, caused by fungus Ganoderma boninense, devastates thousands of hectares of oil palm plantings in Southeast Asia every year. In the present study, root proteins of healthy oil palm seedlings, and those infected with G. boninense, were analyzed by 2-dimensional gel electrophoresis (2-DE). When the 2-DE profiles were analyzed for proteins, which exhibit consistent significant change of abundance upon infection with G. boninense, 21 passed our screening criteria. Subsequent analyses by mass spectrometry and database search identified caffeoyl-CoA O-methyltransferase, caffeic acid O-methyltransferase, enolase, fructokinase, cysteine synthase, malate dehydrogenase, and ATP synthase as among proteins of which abundances were markedly altered. PMID:24663087
Sixty-five years of the long march in protein secondary structure prediction: the final stretch?
Yang, Yuedong; Gao, Jianzhao; Wang, Jihua; Heffernan, Rhys; Hanson, Jack; Paliwal, Kuldip; Zhou, Yaoqi
2018-01-01
Abstract Protein secondary structure prediction began in 1951 when Pauling and Corey predicted helical and sheet conformations for protein polypeptide backbone even before the first protein structure was determined. Sixty-five years later, powerful new methods breathe new life into this field. The highest three-state accuracy without relying on structure templates is now at 82–84%, a number unthinkable just a few years ago. These improvements came from increasingly larger databases of protein sequences and structures for training, the use of template secondary structure information and more powerful deep learning techniques. As we are approaching to the theoretical limit of three-state prediction (88–90%), alternative to secondary structure prediction (prediction of backbone torsion angles and Cα-atom-based angles and torsion angles) not only has more room for further improvement but also allows direct prediction of three-dimensional fragment structures with constantly improved accuracy. About 20% of all 40-residue fragments in a database of 1199 non-redundant proteins have <6 Å root-mean-squared distance from the native conformations by SPIDER2. More powerful deep learning methods with improved capability of capturing long-range interactions begin to emerge as the next generation of techniques for secondary structure prediction. The time has come to finish off the final stretch of the long march towards protein secondary structure prediction. PMID:28040746
Exploration of Uncharted Regions of the Protein Universe
Jaroszewski, Lukasz; Li, Zhanwen; Krishna, S. Sri; Bakolitsa, Constantina; Wooley, John; Deacon, Ashley M.; Wilson, Ian A.; Godzik, Adam
2009-01-01
The genome projects have unearthed an enormous diversity of genes of unknown function that are still awaiting biological and biochemical characterization. These genes, as most others, can be grouped into families based on sequence similarity. The PFAM database currently contains over 2,200 such families, referred to as domains of unknown function (DUF). In a coordinated effort, the four large-scale centers of the NIH Protein Structure Initiative have determined the first three-dimensional structures for more than 250 of these DUF families. Analysis of the first 248 reveals that about two thirds of the DUF families likely represent very divergent branches of already known and well-characterized families, which allows hypotheses to be formulated about their biological function. The remainder can be formally categorized as new folds, although about one third of these show significant substructure similarity to previously characterized folds. These results infer that, despite the enormous increase in the number and the diversity of new genes being uncovered, the fold space of the proteins they encode is gradually becoming saturated. The previously unexplored sectors of the protein universe appear to be primarily shaped by extreme diversification of known protein families, which then enables organisms to evolve new functions and adapt to particular niches and habitats. Notwithstanding, these DUF families still constitute the richest source for discovery of the remaining protein folds and topologies. PMID:19787035
He, Qianru; Man, Lili; Ji, Yuhua; Zhang, Shuqiang; Jiang, Maorong; Ding, Fei; Gu, Xiaosong
2012-06-01
Peripheral sensory and motor nerves have different functions and different approaches to regeneration, especially their distinct ability to accurately reinervate terminal nerve pathways. To understand the molecular aspects underlying these differences, the proteomics technique by coupling isobaric tags for relative and absolute quantitation (iTRAQ) with online two-dimensional liquid chromatography tandem mass spectrometry (2D LC-MS/MS) was used to investigate the protein profile of sensory and motor nerve samples from rats. A total of 1472 proteins were identified in either sensory or motor nerve. Of them, 100 proteins showed differential expressions between both nerves, and some of them were validated by quantitative real time RT-PCR, Western blot analysis, and immunohistochemistry. In the light of functional categorization, the differentially expressed proteins in sensory and motor nerves, belonging to a broad range of classes, were related to a diverse array of biological functions, which included cell adhesion, cytoskeleton, neuronal plasticity, neurotrophic activity, calcium-binding, signal transduction, transport, enzyme catalysis, lipid metabolism, DNA-binding, synaptosome function, actin-binding, ATP-binding, extracellular matrix, and commitment to other lineages. The relatively higher expressed proteins in either sensory or motor nerve were tentatively discussed in combination with their specific molecular characteristics. It is anticipated that the database generated in this study will provide a solid foundation for further comprehensive investigation of functional differences between sensory and motor nerves, including the specificity of their regeneration.
Wimmer, Helge; Gundacker, Nina C; Griss, Johannes; Haudek, Verena J; Stättner, Stefan; Mohr, Thomas; Zwickl, Hannes; Paulitschke, Verena; Baron, David M; Trittner, Wolfgang; Kubicek, Markus; Bayer, Editha; Slany, Astrid; Gerner, Christopher
2009-06-01
Interpretation of proteome data with a focus on biomarker discovery largely relies on comparative proteome analyses. Here, we introduce a database-assisted interpretation strategy based on proteome profiles of primary cells. Both 2-D-PAGE and shotgun proteomics are applied. We obtain high data concordance with these two different techniques. When applying mass analysis of tryptic spot digests from 2-D gels of cytoplasmic fractions, we typically identify several hundred proteins. Using the same protein fractions, we usually identify more than thousand proteins by shotgun proteomics. The data consistency obtained when comparing these independent data sets exceeds 99% of the proteins identified in the 2-D gels. Many characteristic differences in protein expression of different cells can thus be independently confirmed. Our self-designed SQL database (CPL/MUW - database of the Clinical Proteomics Laboratories at the Medical University of Vienna accessible via www.meduniwien.ac.at/proteomics/database) facilitates (i) quality management of protein identification data, which are based on MS, (ii) the detection of cell type-specific proteins and (iii) of molecular signatures of specific functional cell states. Here, we demonstrate, how the interpretation of proteome profiles obtained from human liver tissue and hepatocellular carcinoma tissue is assisted by the Clinical Proteomics Laboratories at the Medical University of Vienna-database. Therefore, we suggest that the use of reference experiments supported by a tailored database may substantially facilitate data interpretation of proteome profiling experiments.
Chen, Josephine; Zhao, Po; Massaro, Donald; Clerch, Linda B; Almon, Richard R; DuBois, Debra C; Jusko, William J; Hoffman, Eric P
2004-01-01
Publicly accessible DNA databases (genome browsers) are rapidly accelerating post-genomic research (see http://www.genome.ucsc.edu/), with integrated genomic DNA, gene structure, EST/ splicing and cross-species ortholog data. DNA databases have relatively low dimensionality; the genome is a linear code that anchors all associated data. In contrast, RNA expression and protein databases need to be able to handle very high dimensional data, with time, tissue, cell type and genes, as interrelated variables. The high dimensionality of microarray expression profile data, and the lack of a standard experimental platform have complicated the development of web-accessible databases and analytical tools. We have designed and implemented a public resource of expression profile data containing 1024 human, mouse and rat Affymetrix GeneChip expression profiles, generated in the same laboratory, and subject to the same quality and procedural controls (Public Expression Profiling Resource; PEPR). Our Oracle-based PEPR data warehouse includes a novel time series query analysis tool (SGQT), enabling dynamic generation of graphs and spreadsheets showing the action of any transcript of interest over time. In this report, we demonstrate the utility of this tool using a 27 time point, in vivo muscle regeneration series. This data warehouse and associated analysis tools provides access to multidimensional microarray data through web-based interfaces, both for download of all types of raw data for independent analysis, and also for straightforward gene-based queries. Planned implementations of PEPR will include web-based remote entry of projects adhering to quality control and standard operating procedure (QC/SOP) criteria, and automated output of alternative probe set algorithms for each project (see http://microarray.cnmcresearch.org/pgadatatable.asp).
Chen, Josephine; Zhao, Po; Massaro, Donald; Clerch, Linda B.; Almon, Richard R.; DuBois, Debra C.; Jusko, William J.; Hoffman, Eric P.
2004-01-01
Publicly accessible DNA databases (genome browsers) are rapidly accelerating post-genomic research (see http://www.genome.ucsc.edu/), with integrated genomic DNA, gene structure, EST/ splicing and cross-species ortholog data. DNA databases have relatively low dimensionality; the genome is a linear code that anchors all associated data. In contrast, RNA expression and protein databases need to be able to handle very high dimensional data, with time, tissue, cell type and genes, as interrelated variables. The high dimensionality of microarray expression profile data, and the lack of a standard experimental platform have complicated the development of web-accessible databases and analytical tools. We have designed and implemented a public resource of expression profile data containing 1024 human, mouse and rat Affymetrix GeneChip expression profiles, generated in the same laboratory, and subject to the same quality and procedural controls (Public Expression Profiling Resource; PEPR). Our Oracle-based PEPR data warehouse includes a novel time series query analysis tool (SGQT), enabling dynamic generation of graphs and spreadsheets showing the action of any transcript of interest over time. In this report, we demonstrate the utility of this tool using a 27 time point, in vivo muscle regeneration series. This data warehouse and associated analysis tools provides access to multidimensional microarray data through web-based interfaces, both for download of all types of raw data for independent analysis, and also for straightforward gene-based queries. Planned implementations of PEPR will include web-based remote entry of projects adhering to quality control and standard operating procedure (QC/SOP) criteria, and automated output of alternative probe set algorithms for each project (see http://microarray.cnmcresearch.org/pgadatatable.asp). PMID:14681485
Dong, Runze; Pan, Shuo; Peng, Zhenling; Zhang, Yang; Yang, Jianyi
2018-05-21
With the rapid increase of the number of protein structures in the Protein Data Bank, it becomes urgent to develop algorithms for efficient protein structure comparisons. In this article, we present the mTM-align server, which consists of two closely related modules: one for structure database search and the other for multiple structure alignment. The database search is speeded up based on a heuristic algorithm and a hierarchical organization of the structures in the database. The multiple structure alignment is performed using the recently developed algorithm mTM-align. Benchmark tests demonstrate that our algorithms outperform other peering methods for both modules, in terms of speed and accuracy. One of the unique features for the server is the interplay between database search and multiple structure alignment. The server provides service not only for performing fast database search, but also for making accurate multiple structure alignment with the structures found by the search. For the database search, it takes about 2-5 min for a structure of a medium size (∼300 residues). For the multiple structure alignment, it takes a few seconds for ∼10 structures of medium sizes. The server is freely available at: http://yanglab.nankai.edu.cn/mTM-align/.
2010-01-01
Background Papaver somniferum (opium poppy) is the source for several pharmaceutical benzylisoquinoline alkaloids including morphine, the codeine and sanguinarine. In response to treatment with a fungal elicitor, the biosynthesis and accumulation of sanguinarine is induced along with other plant defense responses in opium poppy cell cultures. The transcriptional induction of alkaloid metabolism in cultured cells provides an opportunity to identify components of this process via the integration of deep transcriptome and proteome databases generated using next-generation technologies. Results A cDNA library was prepared for opium poppy cell cultures treated with a fungal elicitor for 10 h. Using 454 GS-FLX Titanium pyrosequencing, 427,369 expressed sequence tags (ESTs) with an average length of 462 bp were generated. Assembly of these sequences yielded 93,723 unigenes, of which 23,753 were assigned Gene Ontology annotations. Transcripts encoding all known sanguinarine biosynthetic enzymes were identified in the EST database, 5 of which were represented among the 50 most abundant transcripts. Liquid chromatography-tandem mass spectrometry (LC-MS/MS) of total protein extracts from cell cultures treated with a fungal elicitor for 50 h facilitated the identification of 1,004 proteins. Proteins were fractionated by one-dimensional SDS-PAGE and digested with trypsin prior to LC-MS/MS analysis. Query of an opium poppy-specific EST database substantially enhanced peptide identification. Eight out of 10 known sanguinarine biosynthetic enzymes and many relevant primary metabolic enzymes were represented in the peptide database. Conclusions The integration of deep transcriptome and proteome analyses provides an effective platform to catalogue the components of secondary metabolism, and to identify genes encoding uncharacterized enzymes. The establishment of corresponding transcript and protein databases generated by next-generation technologies in a system with a well-defined metabolite profile facilitates an improved linkage between genes, enzymes, and pathway components. The proteome database represents the most relevant alkaloid-producing enzymes, compared with the much deeper and more complete transcriptome library. The transcript database contained full-length mRNAs encoding most alkaloid biosynthetic enzymes, which is a key requirement for the functional characterization of novel gene candidates. PMID:21083930
NCAD, a database integrating the intrinsic conformational preferences of non-coded amino acids
Revilla-López, Guillem; Torras, Juan; Curcó, David; Casanovas, Jordi; Calaza, M. Isabel; Zanuy, David; Jiménez, Ana I.; Cativiela, Carlos; Nussinov, Ruth; Grodzinski, Piotr; Alemán, Carlos
2010-01-01
Peptides and proteins find an ever-increasing number of applications in the biomedical and materials engineering fields. The use of non-proteinogenic amino acids endowed with diverse physicochemical and structural features opens the possibility to design proteins and peptides with novel properties and functions. Moreover, non-proteinogenic residues are particularly useful to control the three-dimensional arrangement of peptidic chains, which is a crucial issue for most applications. However, information regarding such amino acids –also called non-coded, non-canonical or non-standard– is usually scattered among publications specialized in quite diverse fields as well as in patents. Making all these data useful to the scientific community requires new tools and a framework for their assembly and coherent organization. We have successfully compiled, organized and built a database (NCAD, Non-Coded Amino acids Database) containing information about the intrinsic conformational preferences of non-proteinogenic residues determined by quantum mechanical calculations, as well as bibliographic information about their synthesis, physical and spectroscopic characterization, conformational propensities established experimentally, and applications. The architecture of the database is presented in this work together with the first family of non-coded residues included, namely, α-tetrasubstituted α-amino acids. Furthermore, the NCAD usefulness is demonstrated through a test-case application example. PMID:20455555
FARE-CAFE: a database of functional and regulatory elements of cancer-associated fusion events.
Korla, Praveen Kumar; Cheng, Jack; Huang, Chien-Hung; Tsai, Jeffrey J P; Liu, Yu-Hsuan; Kurubanjerdjit, Nilubon; Hsieh, Wen-Tsong; Chen, Huey-Yi; Ng, Ka-Lok
2015-01-01
Chromosomal translocation (CT) is of enormous clinical interest because this disorder is associated with various major solid tumors and leukemia. A tumor-specific fusion gene event may occur when a translocation joins two separate genes. Currently, various CT databases provide information about fusion genes and their genomic elements. However, no database of the roles of fusion genes, in terms of essential functional and regulatory elements in oncogenesis, is available. FARE-CAFE is a unique combination of CTs, fusion proteins, protein domains, domain-domain interactions, protein-protein interactions, transcription factors and microRNAs, with subsequent experimental information, which cannot be found in any other CT database. Genomic DNA information including, for example, manually collected exact locations of the first and second break points, sequences and karyotypes of fusion genes are included. FARE-CAFE will substantially facilitate the cancer biologist's mission of elucidating the pathogenesis of various types of cancer. This database will ultimately help to develop 'novel' therapeutic approaches. Database URL: http://ppi.bioinfo.asia.edu.tw/FARE-CAFE. © The Author(s) 2015. Published by Oxford University Press.
Evaluation of several two-dimensional gel electrophoresis techniques in cardiac proteomics.
Li, Zhao Bo; Flint, Paul W; Boluyt, Marvin O
2005-09-01
Two-dimensional gel electrophoresis (2-DE) is currently the best method for separating complex mixtures of proteins, and its use is gradually becoming more common in cardiac proteome analysis. A number of variations in basic 2-DE have emerged, but their usefulness in analyzing cardiac tissue has not been evaluated. The purpose of the present study was to systematically evaluate the capabilities and limitations of several 2-DE techniques for separating proteins from rat heart tissue. Immobilized pH gradient strips of various pH ranges, parameters of protein loading and staining, subcellular fractionation, and detection of phosphorylated proteins were studied. The results provide guidance for proteome analysis of cardiac and other tissues in terms of selection of the isoelectric point separating window for cardiac proteins, accurate quantitation of cardiac protein abundance, stabilization of technical variation, reduction of sample complexity, enrichment of low-abundant proteins, and detection of phosphorylated proteins.
1993-01-27
Considerable effect was expended in investigating shifts in intercellular calcium of one particular cell line, Jurket, using flow cytometry methods. No...culture. The following analysis were used to characterize the immortalized cell lines: flow cytometry , electron microscopy, two-dimensional protein gel...further characterized by flow cytometry , electron microscopy, two dimensional protein electrophoresis and nuclear run-off assay. Flow cytometric analysis of
EF-2DE Analysis and Protein Identification
USDA-ARS?s Scientific Manuscript database
Isoelectric focusing followed by SDS-PAGE (IEF-2DE) separates proteins in a two-dimensional matrix of protein pI (Protein Isoelectric Point) and molecular weight (MW). The technique is particularly useful to distinguish protein isoforms (Radwan et al., 2012) and proteins that contain post-translatio...
Serrano, Pedro; Dutta, Samit K; Proudfoot, Andrew; Mohanty, Biswaranjan; Susac, Lukas; Martin, Bryan; Geralt, Michael; Jaroszewski, Lukasz; Godzik, Adam; Elsliger, Marc; Wilson, Ian A; Wüthrich, Kurt
2016-11-01
For more than a decade, the Joint Center for Structural Genomics (JCSG; www.jcsg.org) worked toward increased three-dimensional structure coverage of the protein universe. This coordinated quest was one of the main goals of the four high-throughput (HT) structure determination centers of the Protein Structure Initiative (PSI; www.nigms.nih.gov/Research/specificareas/PSI). To achieve the goals of the PSI, the JCSG made use of the complementarity of structure determination by X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy to increase and diversify the range of targets entering the HT structure determination pipeline. The overall strategy, for both techniques, was to determine atomic resolution structures for representatives of large protein families, as defined by the Pfam database, which had no structural coverage and could make significant contributions to biological and biomedical research. Furthermore, the experimental structures could be leveraged by homology modeling to further expand the structural coverage of the protein universe and increase biological insights. Here, we describe what could be achieved by this structural genomics approach, using as an illustration the contributions from 20 NMR structure determinations out of a total of 98 JCSG NMR structures, which were selected because they are the first three-dimensional structure representations of the respective Pfam protein families. The information from this small sample is representative for the overall results from crystal and NMR structure determination in the JCSG. There are five new folds, which were classified as domains of unknown functions (DUF), three of the proteins could be functionally annotated based on three-dimensional structure similarity with previously characterized proteins, and 12 proteins showed only limited similarity with previous deposits in the Protein Data Bank (PDB) and were classified as DUFs. © 2016 Federation of European Biochemical Societies.
NASA Astrophysics Data System (ADS)
Becker, J. Susanne; Zoriy, Miroslav; Pickhardt, Carola; Przybylski, Michael; Becker, J. Sabine
2005-04-01
Identification of metal-containing proteins and determination of Cu, Fe, Zn concentration in very small protein volumes is of increasing importance in protein research. Proteins containing metal ions were analyzed directly and simultaneously in separated protein spots in two-dimensional gels (2D gels) by laser ablation inductively coupled plasma mass spectrometry (LA-ICP-MS) as an element mass spectrometric technique. In order to study the formation of proteins containing Cu, Zn and Fe in a human brain sample, isotopic-enriched tracers (54Fe, 65Cu and 67Zn) were doped to two-dimensional gels of separated Alzheimer-diseased brain proteins after two-dimensional (2D) gel electrophoresis. The protein spots were screened systematically by LA-ICP-MS with respect to these metal ion intensities. 54Fe/56Fe, 65Cu/63Cu and 67Zn/64Zn isotope ratios in metal-containing proteins were measured directly by LA-ICP-MS. The isotope ratio measurements obtained by LA-ICP-MS indicate certain protein spots with a natural isotope composition of Cu, Zn and/or Fe. These proteins already contained the metal investigated in the original proteins and are stable enough to survive the reducing conditions during gel electrophoresis. On the other hand, proteins with a changed isotope ratio of metals in comparison to the isotope ratio in nature demonstrate the accumulation of tracers within the protein complexes during the tracer experiments in 2D gels. The identification of singular protein spots from Alzheimer-diseased brain separated by 2D gel electrophoresis was attempted by biopolymer mass spectrometry using MALDI-FTICR-MS after excision from the 2D gel and tryptic digestion.
Zhao, Y; Miriyala, S; Miao, L; Mitov, M; Schnell, D; Dhar, S K; Cai, J; Klein, J B; Sultana, R; Butterfield, D A; Vore, M; Batinic-Haberle, I; Bondada, S; St Clair, D K
2014-07-01
Doxorubicin (DOX), one of the most effective anticancer drugs, is known to generate progressive cardiac damage, which is due, in part, to DOX-induced reactive oxygen species (ROS). The elevated ROS often induce oxidative protein modifications that result in alteration of protein functions. This study demonstrates that the level of proteins adducted by 4-hydroxy-2-nonenal (HNE), a lipid peroxidation product, is significantly increased in mouse heart mitochondria after DOX treatment. A redox proteomics method involving two-dimensional electrophoresis followed by mass spectrometry and investigation of protein databases identified several HNE-modified mitochondrial proteins, which were verified by HNE-specific immunoprecipitation in cardiac mitochondria from the DOX-treated mice. The majority of the identified proteins are related to mitochondrial energy metabolism. These include proteins in the citric acid cycle and electron transport chain. The enzymatic activities of the HNE-adducted proteins were significantly reduced in DOX-treated mice. Consistent with the decline in the function of the HNE-adducted proteins, the respiratory function of cardiac mitochondria as determined by oxygen consumption rate was also significantly reduced after DOX treatment. Treatment with Mn(III) meso-tetrakis(N-n-butoxyethylpyridinium-2-yl)porphyrin, an SOD mimic, averted the doxorubicin-induced mitochondrial dysfunctions as well as the HNE-protein adductions. Together, the results demonstrate that free radical-mediated alteration of energy metabolism is an important mechanism mediating DOX-induced cardiac injury, suggesting that metabolic intervention may represent a novel approach to preventing cardiac injury after chemotherapy. Copyright © 2014 Elsevier Inc. All rights reserved.
Synthesis of a select group of proteins by Neisseria gonorrhoeae in response to thermal stress.
Woods, M L; Bonfiglioli, R; McGee, Z A; Georgopoulos, C
1990-03-01
We report the thermal conditions that induce the heat shock response in Neisseria gonorrhoeae. Under conditions of thermal stress, Neisseria gonorrhoeae synthesizes heat shock proteins (hsps), which differ quantitatively from conventionally studied gonococcal proteins. Gonococci accelerate the rate of synthesis of the hsps as early as 5 min after the appropriate stimulus is applied, with synthesis continuing for 30 min, as demonstrated by in vivo labeling experiments with L-[35S]methionine. Two of the gonococcal hsps are immunologically cross-reactive with the hsps of Escherichia coli, DnaK and GroEL, as demonstrated by Western blot (immunoblot) analysis. Ten hsps can be identified on two-dimensional autoradiograms of whole gonococci (total protein). Four hsps can be identified on two-dimensional autoradiograms of 1% N-lauroylsarcosine (sodium salt) (Sarkosyl)-insoluble membrane fractions. Two of the hsps from the 1% Sarkosyl-insoluble fraction are found exclusively in this fraction, suggesting that they are membrane proteins. The identification of this group of proteins will facilitate further study of the function of these proteins and provide insight into the possible role of hsps in disease pathogenesis.
2011-01-01
Background Wheat flour is one of the world's major food ingredients, in part because of the unique end-use qualities conferred by the abundant glutamine- and proline-rich gluten proteins. Many wheat flour proteins also present dietary problems for consumers with celiac disease or wheat allergies. Despite the importance of these proteins it has been particularly challenging to use MS/MS to distinguish the many proteins in a flour sample and relate them to gene sequences. Results Grain from the extensively characterized spring wheat cultivar Triticum aestivum 'Butte 86' was milled to white flour from which proteins were extracted, then separated and quantified by 2-DE. Protein spots were identified by separate digestions with three proteases, followed by tandem mass spectrometry analysis of the peptides. The spectra were used to interrogate an improved protein sequence database and results were integrated using the Scaffold program. Inclusion of cultivar specific sequences in the database greatly improved the results, and 233 spots were identified, accounting for 93.1% of normalized spot volume. Identified proteins were assigned to 157 wheat sequences, many for proteins unique to wheat and nearly 40% from Butte 86. Alpha-gliadins accounted for 20.4% of flour protein, low molecular weight glutenin subunits 18.0%, high molecular weight glutenin subunits 17.1%, gamma-gliadins 12.2%, omega-gliadins 10.5%, amylase/protease inhibitors 4.1%, triticins 1.6%, serpins 1.6%, purinins 0.9%, farinins 0.8%, beta-amylase 0.5%, globulins 0.4%, other enzymes and factors 1.9%, and all other 3%. Conclusions This is the first successful effort to identify the majority of abundant flour proteins for a single wheat cultivar, relate them to individual gene sequences and estimate their relative levels. Many genes for wheat flour proteins are not expressed, so this study represents further progress in describing the expressed wheat genome. Use of cultivar-specific contigs helped to overcome the difficulties of matching peptides to gene sequences for members of highly similar, rapidly evolving storage protein families. Prospects for simplifying this process for routine analyses are discussed. The ability to measure expression levels for individual flour protein genes complements information gained from efforts to sequence the wheat genome and is essential for studies of effects of environment on gene expression. PMID:21314956
Performance Evaluation of Multimodal Multifeature Authentication System Using KNN Classification.
Rajagopal, Gayathri; Palaniswamy, Ramamoorthy
2015-01-01
This research proposes a multimodal multifeature biometric system for human recognition using two traits, that is, palmprint and iris. The purpose of this research is to analyse integration of multimodal and multifeature biometric system using feature level fusion to achieve better performance. The main aim of the proposed system is to increase the recognition accuracy using feature level fusion. The features at the feature level fusion are raw biometric data which contains rich information when compared to decision and matching score level fusion. Hence information fused at the feature level is expected to obtain improved recognition accuracy. However, information fused at feature level has the problem of curse in dimensionality; here PCA (principal component analysis) is used to diminish the dimensionality of the feature sets as they are high dimensional. The proposed multimodal results were compared with other multimodal and monomodal approaches. Out of these comparisons, the multimodal multifeature palmprint iris fusion offers significant improvements in the accuracy of the suggested multimodal biometric system. The proposed algorithm is tested using created virtual multimodal database using UPOL iris database and PolyU palmprint database.
Performance Evaluation of Multimodal Multifeature Authentication System Using KNN Classification
Rajagopal, Gayathri; Palaniswamy, Ramamoorthy
2015-01-01
This research proposes a multimodal multifeature biometric system for human recognition using two traits, that is, palmprint and iris. The purpose of this research is to analyse integration of multimodal and multifeature biometric system using feature level fusion to achieve better performance. The main aim of the proposed system is to increase the recognition accuracy using feature level fusion. The features at the feature level fusion are raw biometric data which contains rich information when compared to decision and matching score level fusion. Hence information fused at the feature level is expected to obtain improved recognition accuracy. However, information fused at feature level has the problem of curse in dimensionality; here PCA (principal component analysis) is used to diminish the dimensionality of the feature sets as they are high dimensional. The proposed multimodal results were compared with other multimodal and monomodal approaches. Out of these comparisons, the multimodal multifeature palmprint iris fusion offers significant improvements in the accuracy of the suggested multimodal biometric system. The proposed algorithm is tested using created virtual multimodal database using UPOL iris database and PolyU palmprint database. PMID:26640813
MultitaskProtDB: a database of multitasking proteins
Hernández, Sergio; Ferragut, Gabriela; Amela, Isaac; Perez-Pons, JosepAntoni; Piñol, Jaume; Mozo-Villarias, Angel; Cedano, Juan; Querol, Enrique
2014-01-01
We have compiled MultitaskProtDB, available online at http://wallace.uab.es/multitask, to provide a repository where the many multitasking proteins found in the literature can be stored. Multitasking or moonlighting is the capability of some proteins to execute two or more biological functions. Usually, multitasking proteins are experimentally revealed by serendipity. This ability of proteins to perform multitasking functions helps us to understand one of the ways used by cells to perform many complex functions with a limited number of genes. Even so, the study of this phenomenon is complex because, among other things, there is no database of moonlighting proteins. The existence of such a tool facilitates the collection and dissemination of these important data. This work reports the database, MultitaskProtDB, which is designed as a friendly user web page containing >288 multitasking proteins with their NCBI and UniProt accession numbers, canonical and additional biological functions, monomeric/oligomeric states, PDB codes when available and bibliographic references. This database also serves to gain insight into some characteristics of multitasking proteins such as frequencies of the different pairs of functions, phylogenetic conservation and so forth. PMID:24253302
3D-SURFER 2.0: web platform for real-time search and characterization of protein surfaces.
Xiong, Yi; Esquivel-Rodriguez, Juan; Sael, Lee; Kihara, Daisuke
2014-01-01
The increasing number of uncharacterized protein structures necessitates the development of computational approaches for function annotation using the protein tertiary structures. Protein structure database search is the basis of any structure-based functional elucidation of proteins. 3D-SURFER is a web platform for real-time protein surface comparison of a given protein structure against the entire PDB using 3D Zernike descriptors. It can smoothly navigate the protein structure space in real-time from one query structure to another. A major new feature of Release 2.0 is the ability to compare the protein surface of a single chain, a single domain, or a single complex against databases of protein chains, domains, complexes, or a combination of all three in the latest PDB. Additionally, two types of protein structures can now be compared: all-atom-surface and backbone-atom-surface. The server can also accept a batch job for a large number of database searches. Pockets in protein surfaces can be identified by VisGrid and LIGSITE (csc) . The server is available at http://kiharalab.org/3d-surfer/.
Proteomic identification of fat-browning markers in cultured white adipocytes treated with curcumin.
Kim, Sang Woo; Choi, Jae Heon; Mukherjee, Rajib; Hwang, Ki-Chul; Yun, Jong Won
2016-04-01
We previously reported that curcumin induces browning of primary white adipocytes via enhanced expression of brown adipocyte-specific genes. In this study, we attempted to identify target proteins responsible for this fat-browning effect by analyzing proteomic changes in cultured white adipocytes in response to curcumin treatment. To elucidate the role of curcumin in fat-browning, we conducted comparative proteomic analysis of primary adipocytes between control and curcumin-treated cells using two-dimensional electrophoresis combined with MALDI-TOF-MS. We also investigated fatty acid metabolic targets, mitochondrial biogenesis, and fat-browning-associated proteins using combined proteomic and network analyses. Proteomic analysis revealed that 58 protein spots from a total of 325 matched spots showed differential expression between control and curcumin-treated adipocytes. Using network analysis, most of the identified proteins were proven to be involved in various metabolic and cellular processes based on the PANTHER classification system. One of the most striking findings is that hormone-sensitive lipase (HSL) was highly correlated with main browning markers based on the STRING database. HSL and two browning markers (UCP1, PGC-1α) were co-immunoprecipitated with these markers, suggesting that HSL possibly plays a role in fat-browning of white adipocytes. Our results suggest that curcumin increased HSL levels and other browning-specific markers, suggesting its possible role in augmentation of lipolysis and suppression of lipogenesis by trans-differentiation from white adipocytes into brown adipocytes (beige).
The COG database: new developments in phylogenetic classification of proteins from complete genomes
Tatusov, Roman L.; Natale, Darren A.; Garkavtsev, Igor V.; Tatusova, Tatiana A.; Shankavaram, Uma T.; Rao, Bachoti S.; Kiryutin, Boris; Galperin, Michael Y.; Fedorova, Natalie D.; Koonin, Eugene V.
2001-01-01
The database of Clusters of Orthologous Groups of proteins (COGs), which represents an attempt on a phylogenetic classification of the proteins encoded in complete genomes, currently consists of 2791 COGs including 45 350 proteins from 30 genomes of bacteria, archaea and the yeast Saccharomyces cerevisiae (http://www.ncbi.nlm.nih.gov/COG). In addition, a supplement to the COGs is available, in which proteins encoded in the genomes of two multicellular eukaryotes, the nematode Caenorhabditis elegans and the fruit fly Drosophila melanogaster, and shared with bacteria and/or archaea were included. The new features added to the COG database include information pages with structural and functional details on each COG and literature references, improvements of the COGNITOR program that is used to fit new proteins into the COGs, and classification of genomes and COGs constructed by using principal component analysis. PMID:11125040
BLAST and FASTA similarity searching for multiple sequence alignment.
Pearson, William R
2014-01-01
BLAST, FASTA, and other similarity searching programs seek to identify homologous proteins and DNA sequences based on excess sequence similarity. If two sequences share much more similarity than expected by chance, the simplest explanation for the excess similarity is common ancestry-homology. The most effective similarity searches compare protein sequences, rather than DNA sequences, for sequences that encode proteins, and use expectation values, rather than percent identity, to infer homology. The BLAST and FASTA packages of sequence comparison programs provide programs for comparing protein and DNA sequences to protein databases (the most sensitive searches). Protein and translated-DNA comparisons to protein databases routinely allow evolutionary look back times from 1 to 2 billion years; DNA:DNA searches are 5-10-fold less sensitive. BLAST and FASTA can be run on popular web sites, but can also be downloaded and installed on local computers. With local installation, target databases can be customized for the sequence data being characterized. With today's very large protein databases, search sensitivity can also be improved by searching smaller comprehensive databases, for example, a complete protein set from an evolutionarily neighboring model organism. By default, BLAST and FASTA use scoring strategies target for distant evolutionary relationships; for comparisons involving short domains or queries, or searches that seek relatively close homologs (e.g. mouse-human), shallower scoring matrices will be more effective. Both BLAST and FASTA provide very accurate statistical estimates, which can be used to reliably identify protein sequences that diverged more than 2 billion years ago.
Kuang, Xingyan; Dhroso, Andi; Han, Jing Ginger; Shyu, Chi-Ren; Korkin, Dmitry
2016-01-01
Macromolecular interactions are formed between proteins, DNA and RNA molecules. Being a principle building block in macromolecular assemblies and pathways, the interactions underlie most of cellular functions. Malfunctioning of macromolecular interactions is also linked to a number of diseases. Structural knowledge of the macromolecular interaction allows one to understand the interaction’s mechanism, determine its functional implications and characterize the effects of genetic variations, such as single nucleotide polymorphisms, on the interaction. Unfortunately, until now the interactions mediated by different types of macromolecules, e.g. protein–protein interactions or protein–DNA interactions, are collected into individual and unrelated structural databases. This presents a significant obstacle in the analysis of macromolecular interactions. For instance, the homogeneous structural interaction databases prevent scientists from studying structural interactions of different types but occurring in the same macromolecular complex. Here, we introduce DOMMINO 2.0, a structural Database Of Macro-Molecular INteractiOns. Compared to DOMMINO 1.0, a comprehensive database on protein-protein interactions, DOMMINO 2.0 includes the interactions between all three basic types of macromolecules extracted from PDB files. DOMMINO 2.0 is automatically updated on a weekly basis. It currently includes ∼1 040 000 interactions between two polypeptide subunits (e.g. domains, peptides, termini and interdomain linkers), ∼43 000 RNA-mediated interactions, and ∼12 000 DNA-mediated interactions. All protein structures in the database are annotated using SCOP and SUPERFAMILY family annotation. As a result, protein-mediated interactions involving protein domains, interdomain linkers, C- and N- termini, and peptides are identified. Our database provides an intuitive web interface, allowing one to investigate interactions at three different resolution levels: whole subunit network, binary interaction and interaction interface. Database URL: http://dommino.org PMID:26827237
Structural changes of malt proteins during boiling.
Jin, Bei; Li, Lin; Liu, Guo-Qin; Li, Bing; Zhu, Yu-Kui; Liao, Liao-Ning
2009-03-09
Changes in the physicochemical properties and structure of proteins derived from two malt varieties (Baudin and Guangmai) during wort boiling were investigated by differential scanning calorimetry, SDS-PAGE, two-dimensional electrophoresis, gel filtration chromatography and circular dichroism spectroscopy. The results showed that both protein content and amino acid composition changed only slightly during boiling, and that boiling might cause a gradual unfolding of protein structures, as indicated by the decrease in surface hydrophobicity and free sulfhydryl content and enthalpy value, as well as reduced alpha-helix contents and markedly increased random coil contents. It was also found that major component of both worts was a boiling-resistant protein with a molecular mass of 40 kDa, and that according to the two-dimensional electrophoresis and SE-HPLC analyses, a small amount of soluble aggregates might be formed via hydrophobic interactions. It was thus concluded that changes of protein structure caused by boiling that might influence beer quality are largely independent of malt variety.
One-dimensional arrangement of nanoparticles utilizing the V-groove and cage shaped proteins
NASA Astrophysics Data System (ADS)
Ban, Takahiko; Uenuma, Mutsunori; Migita, Shinji; Okamoto, Naofumi; Ishikawa, Yasuaki; Uraoka, Yukiharu; Yamashita, Ichiro; Yamamoto, Shin-ichi
2017-06-01
The one-dimensional arrangement of nanoparticles (NPs) was performed using a V-groove and ferritins as spherical shell proteins. The V-groove was synthesized by lithography and anisotropic etching of a Si substrate. Ferritin has an outer diameter of 12 nm and an inner diameter of 6 nm, and various inorganic substances can be formed into the cavity. In this study, iron oxide, cobalt oxide, and indium oxide cores were used. The surface potential of ferritin can be changed by genetic modification. Particularly, by using Fer8-K98E, NPs could be arranged one-dimensionally onto the bottom of the V-groove. In addition, we succeeded in selectively forming a one-dimensional array of one layer, two layers, and three layers by changing the protein concentration. This experiment is expected to be applicable to various one-dimensional devices.
Application of Tandem Two-Dimensional Mass Spectrometry for Top-Down Deep Sequencing of Calmodulin
NASA Astrophysics Data System (ADS)
Floris, Federico; Chiron, Lionel; Lynch, Alice M.; Barrow, Mark P.; Delsuc, Marc-André; O'Connor, Peter B.
2018-06-01
Two-dimensional mass spectrometry (2DMS) involves simultaneous acquisition of the fragmentation patterns of all the analytes in a mixture by correlating their precursor and fragment ions by modulating precursor ions systematically through a fragmentation zone. Tandem two-dimensional mass spectrometry (MS/2DMS) unites the ultra-high accuracy of Fourier transform ion cyclotron resonance (FT-ICR) MS/MS and the simultaneous data-independent fragmentation of 2DMS to achieve extensive inter-residue fragmentation of entire proteins. 2DMS was recently developed for top-down proteomics (TDP), and applied to the analysis of calmodulin (CaM), reporting a cleavage coverage of about 23% using infrared multiphoton dissociation (IRMPD) as fragmentation technique. The goal of this work is to expand the utility of top-down protein analysis using MS/2DMS in order to extend the cleavage coverage in top-down proteomics further into the interior regions of the protein. In this case, using MS/2DMS, the cleavage coverage of CaM increased from 23% to 42%.
Yadav, Manoj Kumar; Singh, Amisha; Swati, D
2014-08-01
Malaria is one of the most infectious diseases in the world. Plasmodium vivax, the pathogen causing endemic malaria in humans worldwide, is responsible for extensive disease morbidity. Due to the emergence of resistance to common anti-malarial drugs, there is a continuous need to develop a new class of drugs for this pathogen. P. vivax cysteine protease, also known as vivapain-2, plays an important role in haemoglobin hydrolysis and is considered essential for the survival of the parasite. The three-dimensional (3D) structure of vivapain-2 is not predicted experimentally, so its structure is modelled by using comparative modelling approach and further validated by Qualitative Model Energy Analysis (QMEAN) and RAMPAGE tools. The potential binding site of selected vivapain-2 structure has been detected by grid-based function prediction method. Drug targets and their respective drugs similar to vivapain-2 have been identified using three publicly available databases: STITCH 3.1, DrugBank and Therapeutic Target Database (TTD). The second approach of this work focuses on docking study of selected drug E-64 against vivapain-2 protein. Docking reveals crucial information about key residues (Asn281, Cys283, Val396 and Asp398) that are responsible for holding the ligand in the active site. The similarity-search criterion is used for the preparation of our in-house database of drugs, obtained from filtering the drugs from the DrugBank database. A five-point 3D pharmacophore model is generated for the docked complex of vivapain-2 with E-64. This study of 3D pharmacophore-based virtual screening results in identifying three new drugs, amongst which one is approved and the other two are experimentally proved. The ADMET properties of these drugs are found to be in the desired range. These drugs with novel scaffolds may act as potent drugs for treating malaria caused by P. vivax.
Li, Zhengyu; Min, Wenjiao; Huang, Canhua; Bai, Shujun; Tang, Minghai; Zhao, Xia
2010-01-01
We used proteomic approaches to identify altered expressed proteins in endometrial carcinoma, with the aim of discovering potential biomarkers or therapeutic targets for endometrial carcinoma. The global proteins extracted from endometrial carcinoma and normal endometrial tissues were separated by 2-dimensional electrophoresis and analyzed with PDQuest (Bio-Rad, Hercules, Calif) software. The differentially expressed spots were identified by mass spectrometry and searched against NCBInr protein database. Those proteins with potential roles were confirmed by Western blotting and immunohistochemical assays. Ninety-nine proteins were identified by mass spectrometry, and a cluster diagram analysis indicated that these proteins were involved in metabolism, cell transformation, protein folding, translation and modification, proliferation and apoptosis, signal transduction, cytoskeleton, and so on. In confirmatory immunoblotting and immunohistochemical analyses, overexpressions of epidermal fatty acid-binding protein, calcyphosine, and cyclophilin A were also observed in endometrial carcinoma tissues, which were consistent with the proteomic results. Our results suggested that these identified proteins, including epidermal fatty acid-binding protein, calcyphosine, and cyclophilin A, might be of potential values in the studies of endometrial carcinogenesis or investigations of diagnostic biomarkers or treatment targets for endometrial carcinoma.
Yasui, Yutaka; McLerran, Dale; Adam, Bao-Ling; Winget, Marcy; Thornquist, Mark; Feng, Ziding
2003-01-01
Discovery of "signature" protein profiles that distinguish disease states (eg, malignant, benign, and normal) is a key step towards translating recent advancements in proteomic technologies into clinical utilities. Protein data generated from mass spectrometers are, however, large in size and have complex features due to complexities in both biological specimens and interfering biochemical/physical processes of the measurement procedure. Making sense out of such high-dimensional complex data is challenging and necessitates the use of a systematic data analytic strategy. We propose here a data processing strategy for two major issues in the analysis of such mass-spectrometry-generated proteomic data: (1) separation of protein "signals" from background "noise" in protein intensity measurements and (2) calibration of protein mass/charge measurements across samples. We illustrate the two issues and the utility of the proposed strategy using data from a prostate cancer biomarker discovery project as an example.
Protein structure-structure alignment with discrete Fréchet distance.
Jiang, Minghui; Xu, Ying; Zhu, Binhai
2008-02-01
Matching two geometric objects in two-dimensional (2D) and three-dimensional (3D) spaces is a central problem in computer vision, pattern recognition, and protein structure prediction. In particular, the problem of aligning two polygonal chains under translation and rotation to minimize their distance has been studied using various distance measures. It is well known that the Hausdorff distance is useful for matching two point sets, and that the Fréchet distance is a superior measure for matching two polygonal chains. The discrete Fréchet distance closely approximates the (continuous) Fréchet distance, and is a natural measure for the geometric similarity of the folded 3D structures of biomolecules such as proteins. In this paper, we present new algorithms for matching two polygonal chains in two dimensions to minimize their discrete Fréchet distance under translation and rotation, and an effective heuristic for matching two polygonal chains in three dimensions. We also describe our empirical results on the application of the discrete Fréchet distance to protein structure-structure alignment.
Modeling the effect of mood on dimensional attention during categorization.
Zivot, Matthew T; Cohen, Andrew L; Kapucu, Aycan
2013-08-01
Classification is a flexible process that can be affected by mood. The goal of this paper is to evaluate the idea that mood may modulate categorization behavior through an attentional weighting mechanism in which mood changes the attention afforded to different stimulus dimensions. In two experiments, participants learn and are tested on categories while in a calm or sad mood. In Experiment 1, sad participants are faster to learn one- and two-dimensional category structures, but show no advantage on a three-dimensional category structure. In Experiment 2, the generalized context model of categorization is used to measure dimensional weighting. The results suggest that sad participants have a narrower focus of attention, but that the narrowing tends to be on diagnostic dimensions. PsycINFO Database Record (c) 2013 APA, all rights reserved.
Hanson, Jack; Paliwal, Kuldip; Litfin, Thomas; Yang, Yuedong; Zhou, Yaoqi
2018-06-19
Accurate prediction of a protein contact map depends greatly on capturing as much contextual information as possible from surrounding residues for a target residue pair. Recently, ultra-deep residual convolutional networks were found to be state-of-the-art in the latest Critical Assessment of Structure Prediction techniques (CASP12, (Schaarschmidt et al., 2018)) for protein contact map prediction by attempting to provide a protein-wide context at each residue pair. Recurrent neural networks have seen great success in recent protein residue classification problems due to their ability to propagate information through long protein sequences, especially Long Short-Term Memory (LSTM) cells. Here we propose a novel protein contact map prediction method by stacking residual convolutional networks with two-dimensional residual bidirectional recurrent LSTM networks, and using both one-dimensional sequence-based and two-dimensional evolutionary coupling-based information. We show that the proposed method achieves a robust performance over validation and independent test sets with the Area Under the receiver operating characteristic Curve (AUC)>0.95 in all tests. When compared to several state-of-the-art methods for independent testing of 228 proteins, the method yields an AUC value of 0.958, whereas the next-best method obtains an AUC of 0.909. More importantly, the improvement is over contacts at all sequence-position separations. Specifically, a 8.95%, 5.65% and 2.84% increase in precision were observed for the top L∕10 predictions over the next best for short, medium and long-range contacts, respectively. This confirms the usefulness of ResNets to congregate the short-range relations and 2D-BRLSTM to propagate the long-range dependencies throughout the entire protein contact map 'image'. SPOT-Contact server url: http://sparks-lab.org/jack/server/SPOT-Contact/. Supplementary data is available at Bioinformatics online.
High resolution approach to the native state ensemble kinetics and thermodynamics.
Wu, Sangwook; Zhuravlev, Pavel I; Papoian, Garegin A
2008-12-15
Many biologically interesting functions such as allosteric switching or protein-ligand binding are determined by the kinetics and mechanisms of transitions between various conformational substates of the native basin of globular proteins. To advance our understanding of these processes, we constructed a two-dimensional free energy surface (FES) of the native basin of a small globular protein, Trp-cage. The corresponding order parameters were defined using two native substructures of Trp-cage. These calculations were based on extensive explicit water all-atom molecular dynamics simulations. Using the obtained two-dimensional FES, we studied the transition kinetics between two Trp-cage conformations, finding that switching process shows a borderline behavior between diffusive and weakly-activated dynamics. The transition is well-characterized kinetically as a biexponential process. We also introduced a new one-dimensional reaction coordinate for the conformational transition, finding reasonable qualitative agreement with the two-dimensional kinetics results. We investigated the distribution of all the 38 native nuclear magnetic resonance structures on the obtained FES, analyzing interactions that stabilize specific low-energy conformations. Finally, we constructed a FES for the same system but with simple dielectric model of water instead of explicit water, finding that the results were surprisingly similar in a small region centered on the native conformations. The dissimilarities between the explicit and implicit model on the larger-scale point to the important role of water in mediating interactions between amino acid residues.
Al Kindi, Mahmood A; Colella, Alex D; Chataway, Tim K; Jackson, Michael W; Wang, Jing J; Gordon, Tom P
2016-04-01
The structures of epitopes bound by autoantibodies against RNA-protein complexes have been well-defined over several decades, but little is known of the clonality, immunoglobulin (Ig) variable (V) gene usage and mutational status of the autoantibodies themselves at the level of the secreted (serum) proteome. A novel proteomic workflow is presented based on affinity purification of specific Igs from serum, high-resolution two-dimensional gel electrophoresis, and de novo and database-driven sequencing of V-region proteins by mass spectrometry. Analysis of anti-Ro52/Ro60/La proteomes in primary Sjögren's syndrome (SS) and anti-Sm and anti-ribosomal P proteomes in systemic lupus erythematosus (SLE) has revealed that these antibody responses are dominated by restricted sets of public (shared) clonotypes, consistent with common pathways of production across unrelated individuals. The discovery of shared sets of specific V-region peptides can be exploited for diagnostic biomarkers in targeted mass spectrometry platforms and for tracking and removal of pathogenic clones. Copyright © 2016 Elsevier B.V. All rights reserved.
Encounter complexes and dimensionality reduction in protein-protein association.
Kozakov, Dima; Li, Keyong; Hall, David R; Beglov, Dmitri; Zheng, Jiefu; Vakili, Pirooz; Schueler-Furman, Ora; Paschalidis, Ioannis Ch; Clore, G Marius; Vajda, Sandor
2014-04-08
An outstanding challenge has been to understand the mechanism whereby proteins associate. We report here the results of exhaustively sampling the conformational space in protein-protein association using a physics-based energy function. The agreement between experimental intermolecular paramagnetic relaxation enhancement (PRE) data and the PRE profiles calculated from the docked structures shows that the method captures both specific and non-specific encounter complexes. To explore the energy landscape in the vicinity of the native structure, the nonlinear manifold describing the relative orientation of two solid bodies is projected onto a Euclidean space in which the shape of low energy regions is studied by principal component analysis. Results show that the energy surface is canyon-like, with a smooth funnel within a two dimensional subspace capturing over 75% of the total motion. Thus, proteins tend to associate along preferred pathways, similar to sliding of a protein along DNA in the process of protein-DNA recognition. DOI: http://dx.doi.org/10.7554/eLife.01370.001.
Ren, Jiangtao; Beckner, Matthew A; Lynch, Kyle B; Chen, Huang; Zhu, Zaifang; Yang, Yu; Chen, Apeng; Qiao, Zhenzhen; Liu, Shaorong; Lu, Joann J
2018-05-15
A comprehensive two-dimensional liquid chromatography (LCxLC) system consisting of twelve columns in the second dimension was developed for comprehensive analysis of intact proteins in complex biological samples. The system consisted of an ion-exchange column in the first dimension and the twelve reverse-phase columns in the second dimension; all thirteen columns were monolithic and prepared inside 250 µm i.d. capillaries. These columns were assembled together through the use of three valves and an innovative configuration. The effluent from the first dimension was continuously fractionated and sequentially transferred into the twelve second-dimension columns, while the second-dimension separations were carried out in a series of batches (six columns per batch). This LCxLC system was tested first using standard proteins followed by real-world samples from E. coli. Baseline separation was observed for eleven standard proteins and hundreds of peaks were observed for the real-world sample analysis. Two-dimensional liquid chromatography, often considered as an effective tool for mapping proteins, is seen as laborious and time-consuming when configured offline. Our online LCxLC system with increased second-dimension columns promises to provide a solution to overcome these hindrances. Copyright © 2018 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Coughlan, H. D.; Darmanin, C.; Kirkwood, H. J.
2016-03-14
Three-dimensional imaging of protein crystals during X-ray diffraction experiments opens up a range of possibilities for optimising crystal quality and gaining new insights into the fundamental processes that drive radiation damage. Obtaining this information at the appropriate lengthscales however is extremely challenging. One approach that has been recently demonstrated as a promising avenue for charactering the size and shape of protein crystals at nanometre lengthscales is Bragg Coherent Diffractive Imaging (BCDI). BCDI is a recently developed technique that is able to recover the phase of the continuous diffraction intensity signal around individual Bragg peaks. When data is collected at multiplemore » points on a rocking curve a Reciprocal Space Map (RSM) can be assembled and then inverted using BCDI to obtain a three-dimensional image of the crystal. The first demonstration of two-dimensional BCDI of protein crystals was reported by Boutet at al., recently this work was extended to the study of radiation damage of micron-sized crystals. Here we present the first three-dimensional reconstructions of a Lysozyme protein crystal using BDI. The results are validated against RSM and TEM data and have implications for both radiation damage studies and for developing new approaches to structure retrieval from micron-sized protein crystals.« less
A glimpse into the proteome of phototrophic bacterium Rhodobacter capsulatus.
Onder, Ozlem; Aygun-Sunar, Semra; Selamoglu, Nur; Daldal, Fevzi
2010-01-01
A first glimpse into the proteome of Rhodobacter capsulatus revealed more than 450 (with over 210 cytoplasmic and 185 extracytoplasmic known as well as 55 unknown) proteins that are identified with high degree of confidence using nLC-MS/MS analyses. The accumulated data provide a solid platform for ongoing efforts to establish the proteome of this species and the cellular locations of its constituents. They also indicate that at least 40 of the identified proteins, which were annotated in genome databases as unknown hypothetical proteins, correspond to predicted translation products that are indeed present in cells under the growth conditions used in this work. In addition, matching the identification labels of the proteins reported between the two available R. capsulatus genome databases (ERGO-light with RRCxxxxx and NT05 with NT05RCxxxx numbers) indicated that 11 such proteins are listed only in the latter database.
Face recognition based on two-dimensional discriminant sparse preserving projection
NASA Astrophysics Data System (ADS)
Zhang, Dawei; Zhu, Shanan
2018-04-01
In this paper, a supervised dimensionality reduction algorithm named two-dimensional discriminant sparse preserving projection (2DDSPP) is proposed for face recognition. In order to accurately model manifold structure of data, 2DDSPP constructs within-class affinity graph and between-class affinity graph by the constrained least squares (LS) and l1 norm minimization problem, respectively. Based on directly operating on image matrix, 2DDSPP integrates graph embedding (GE) with Fisher criterion. The obtained projection subspace preserves within-class neighborhood geometry structure of samples, while keeping away samples from different classes. The experimental results on the PIE and AR face databases show that 2DDSPP can achieve better recognition performance.
Synthesis of a select group of proteins by Neisseria gonorrhoeae in response to thermal stress.
Woods, M L; Bonfiglioli, R; McGee, Z A; Georgopoulos, C
1990-01-01
We report the thermal conditions that induce the heat shock response in Neisseria gonorrhoeae. Under conditions of thermal stress, Neisseria gonorrhoeae synthesizes heat shock proteins (hsps), which differ quantitatively from conventionally studied gonococcal proteins. Gonococci accelerate the rate of synthesis of the hsps as early as 5 min after the appropriate stimulus is applied, with synthesis continuing for 30 min, as demonstrated by in vivo labeling experiments with L-[35S]methionine. Two of the gonococcal hsps are immunologically cross-reactive with the hsps of Escherichia coli, DnaK and GroEL, as demonstrated by Western blot (immunoblot) analysis. Ten hsps can be identified on two-dimensional autoradiograms of whole gonococci (total protein). Four hsps can be identified on two-dimensional autoradiograms of 1% N-lauroylsarcosine (sodium salt) (Sarkosyl)-insoluble membrane fractions. Two of the hsps from the 1% Sarkosyl-insoluble fraction are found exclusively in this fraction, suggesting that they are membrane proteins. The identification of this group of proteins will facilitate further study of the function of these proteins and provide insight into the possible role of hsps in disease pathogenesis. Images PMID:2106493
PDB explorer -- a web based algorithm for protein annotation viewer and 3D visualization.
Nayarisseri, Anuraj; Shardiwal, Rakesh Kumar; Yadav, Mukesh; Kanungo, Neha; Singh, Pooja; Shah, Pratik; Ahmed, Sheaza
2014-12-01
The PDB file format, is a text format characterizing the three dimensional structures of macro molecules available in the Protein Data Bank (PDB). Determined protein structure are found in coalition with other molecules or ions such as nucleic acids, water, ions, Drug molecules and so on, which therefore can be described in the PDB format and have been deposited in PDB database. PDB is a machine generated file, it's not human readable format, to read this file we need any computational tool to understand it. The objective of our present study is to develop a free online software for retrieval, visualization and reading of annotation of a protein 3D structure which is available in PDB database. Main aim is to create PDB file in human readable format, i.e., the information in PDB file is converted in readable sentences. It displays all possible information from a PDB file including 3D structure of that file. Programming languages and scripting languages like Perl, CSS, Javascript, Ajax, and HTML have been used for the development of PDB Explorer. The PDB Explorer directly parses the PDB file, calling methods for parsed element secondary structure element, atoms, coordinates etc. PDB Explorer is freely available at http://www.pdbexplorer.eminentbio.com/home with no requirement of log-in.
Pattin, Kristine A.; Moore, Jason H.
2009-01-01
One of the central goals of human genetics is the identification of loci with alleles or genotypes that confer increased susceptibility. The availability of dense maps of single-nucleotide polymorphisms (SNPs) along with high-throughput genotyping technologies has set the stage for routine genome-wide association studies that are expected to significantly improve our ability to identify susceptibility loci. Before this promise can be realized, there are some significant challenges that need to be addressed. We address here the challenge of detecting epistasis or gene-gene interactions in genome-wide association studies. Discovering epistatic interactions in high dimensional datasets remains a challenge due to the computational complexity resulting from the analysis of all possible combinations of SNPs. One potential way to overcome the computational burden of a genome-wide epistasis analysis would be to devise a logical way to prioritize the many SNPs in a dataset so that the data may be analyzed more efficiently and yet still retain important biological information. One of the strongest demonstrations of the functional relationship between genes is protein-protein interaction. Thus, it is plausible that the expert knowledge extracted from protein interaction databases may allow for a more efficient analysis of genome-wide studies as well as facilitate the biological interpretation of the data. In this review we will discuss the challenges of detecting epistasis in genome-wide genetic studies and the means by which we propose to apply expert knowledge extracted from protein interaction databases to facilitate this process. We explore some of the fundamentals of protein interactions and the databases that are publicly available. PMID:18551320
SASD: the Synthetic Alternative Splicing Database for identifying novel isoform from proteomics
2013-01-01
Background Alternative splicing is an important and widespread mechanism for generating protein diversity and regulating protein expression. High-throughput identification and analysis of alternative splicing in the protein level has more advantages than in the mRNA level. The combination of alternative splicing database and tandem mass spectrometry provides a powerful technique for identification, analysis and characterization of potential novel alternative splicing protein isoforms from proteomics. Therefore, based on the peptidomic database of human protein isoforms for proteomics experiments, our objective is to design a new alternative splicing database to 1) provide more coverage of genes, transcripts and alternative splicing, 2) exclusively focus on the alternative splicing, and 3) perform context-specific alternative splicing analysis. Results We used a three-step pipeline to create a synthetic alternative splicing database (SASD) to identify novel alternative splicing isoforms and interpret them at the context of pathway, disease, drug and organ specificity or custom gene set with maximum coverage and exclusive focus on alternative splicing. First, we extracted information on gene structures of all genes in the Ensembl Genes 71 database and incorporated the Integrated Pathway Analysis Database. Then, we compiled artificial splicing transcripts. Lastly, we translated the artificial transcripts into alternative splicing peptides. The SASD is a comprehensive database containing 56,630 genes (Ensembl gene IDs), 95,260 transcripts (Ensembl transcript IDs), and 11,919,779 Alternative Splicing peptides, and also covering about 1,956 pathways, 6,704 diseases, 5,615 drugs, and 52 organs. The database has a web-based user interface that allows users to search, display and download a single gene/transcript/protein, custom gene set, pathway, disease, drug, organ related alternative splicing. Moreover, the quality of the database was validated with comparison to other known databases and two case studies: 1) in liver cancer and 2) in breast cancer. Conclusions The SASD provides the scientific community with an efficient means to identify, analyze, and characterize novel Exon Skipping and Intron Retention protein isoforms from mass spectrometry and interpret them at the context of pathway, disease, drug and organ specificity or custom gene set with maximum coverage and exclusive focus on alternative splicing. PMID:24267658
NASA Astrophysics Data System (ADS)
Chai, Jun; Tian, Bo; Zhen, Hui-Ling; Sun, Wen-Rong
2015-11-01
Energy transfer through a (2+1)-dimensional α-helical protein can be described by a (2+1)-dimensional fourth-order nonlinear Schrödinger equation. For such an equation, a Lax pair and the infinitely-many conservation laws are derived. Using an auxiliary function and a bilinear formulation, we get the one-, two-, three- and N-soliton solutions via the Hirota method. The soliton velocity is linearly related to the lattice parameter γ, while the soliton' direction and amplitude do not depend on γ. Interactions between the two solitons are elastic, while those among the three solitons are pairwise elastic. Oblique, head-on and overtaking interactions between the two solitons are displayed. Oblique interaction among the three solitons and interactions among the two parallel solitons and a single one are presented as well.
Yang, Xiaolong; Thannhauser, T. W.; Burrows, Mary; Cox-Foster, Diana; Gildow, Fred E.; Gray, Stewart M.
2008-01-01
Cereal yellow dwarf virus-RPV (CYDV-RPV) is transmitted specifically by the aphids Rhopalosiphum padi and Schizaphis graminum in a circulative nonpropagative manner. The high level of vector specificity results from the vector aphids having the functional components of the receptor-mediated endocytotic pathways to allow virus to transverse the gut and salivary tissues. Studies of F2 progeny from crosses of vector and nonvector genotypes of S. graminum showed that virus transmission efficiency is a heritable trait regulated by multiple genes acting in an additive fashion and that gut- and salivary gland-associated factors are not genetically linked. Utilizing two-dimensional difference gel electrophoresis to compare the proteomes of vector and nonvector parental and F2 genotypes, four aphid proteins (S4, S8, S29, and S405) were specifically associated with the ability of S. graminum to transmit CYDV-RPV. The four proteins were coimmunoprecipitated with purified RPV, indicating that the aphid proteins are capable of binding to virus. Analysis by mass spectrometry identified S4 as a luciferase and S29 as a cyclophilin, both of which have been implicated in macromolecular transport. Proteins S8 and S405 were not identified from available databases. Study of this unique genetic system coupled with proteomic analysis indicated that these four virus-binding aphid proteins were specifically inherited and conserved in different generations of vector genotypes and suggests that they play a major role in regulating polerovirus transmission. PMID:17959668
Yang, Xiaolong; Thannhauser, T W; Burrows, Mary; Cox-Foster, Diana; Gildow, Fred E; Gray, Stewart M
2008-01-01
Cereal yellow dwarf virus-RPV (CYDV-RPV) is transmitted specifically by the aphids Rhopalosiphum padi and Schizaphis graminum in a circulative nonpropagative manner. The high level of vector specificity results from the vector aphids having the functional components of the receptor-mediated endocytotic pathways to allow virus to transverse the gut and salivary tissues. Studies of F(2) progeny from crosses of vector and nonvector genotypes of S. graminum showed that virus transmission efficiency is a heritable trait regulated by multiple genes acting in an additive fashion and that gut- and salivary gland-associated factors are not genetically linked. Utilizing two-dimensional difference gel electrophoresis to compare the proteomes of vector and nonvector parental and F(2) genotypes, four aphid proteins (S4, S8, S29, and S405) were specifically associated with the ability of S. graminum to transmit CYDV-RPV. The four proteins were coimmunoprecipitated with purified RPV, indicating that the aphid proteins are capable of binding to virus. Analysis by mass spectrometry identified S4 as a luciferase and S29 as a cyclophilin, both of which have been implicated in macromolecular transport. Proteins S8 and S405 were not identified from available databases. Study of this unique genetic system coupled with proteomic analysis indicated that these four virus-binding aphid proteins were specifically inherited and conserved in different generations of vector genotypes and suggests that they play a major role in regulating polerovirus transmission.
Li, J-Y; Chen, X; Hosseini Moghaddam, S H; Chen, M; Wei, H; Zhong, B-X
2009-10-01
The shotgun approach has gained considerable acknowledgement in recent years as a dominant strategy in proteomics. We observed a dramatic increase of specific protein spots in two-dimensional electrophoresis (2-DE) gels of the silkworm (Bombyx mori) embryo at labrum appearance, a characteristic stage during embryonic development of silkworm which is involved with temperature increase by silkworm raiser. We employed shotgun liquid chromatography tandem mass spectrometry (LC-MS/MS) technology to analyse the proteome of B. mori embryos at this stage. A total of 2168 proteins were identified with an in-house database. Approximately 47% of them had isoelectric point (pI) values distributed theoretically in the range pI 5-7 and approximately 60% of them had molecular weights of 15-45 kDa. Furthermore, 111 proteins had an pI greater than 10 and were difficult to separate by 2-DE. Many important functional proteins related to embryonic development, stress response, DNA transcription/translation, cell growth, proliferation and differentiation, organogenesis and reproduction were identified. Among them proteins related to nervous system development were noticeable. All known heat shock proteins (HSPs) were detected in this developmental stage of B. mori embryo. In addition, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis showed energetic metabolism at this stage. These results were expected to provide more information for proteomic monitoring of the insect embryo and better understanding of the spatiotemporal expression of genes during embryonic developmental processes.
Vaganan, M Mayil; Sarumathi, S; Nandakumar, A; Ravi, I; Mustaffa, M M
2015-02-01
Four protocols viz., the trichloroacetic acid-acetone (TCA), phenol-ammonium acetate (PAA), phenol/SDS-ammonium acetate (PSA) and trisbase-acetone (TBA) were evaluated with modifications for protein extraction from banana (Grand Naine) roots, considered as recalcitrant tissues for proteomic analysis. The two-dimensional electrophoresis (2-DE) separated proteins were compared based on protein yield, number of resolved proteins, sum of spot quantity, average spot intensity and proteins resolved in 4-7 pI range. The PAA protocol yielded more proteins (0.89 mg/g of tissues) and protein spots (584) in 2-DE gel than TCA and other protocols. Also, the PAA protocol was superior in terms of sum of total spot quantity and average spot intensity than TCA and other protocols, suggesting phenol as extractant and ammonium acetate as precipitant of proteins were the most suitable for banana rooteomics analysis by 2-DE. In addition, 1:3 ratios of root tissue to extraction buffer and overnight protein precipitation were most efficient to obtain maximum protein yield.
Kobayashi, Tetsuo; Yokoyama, Tomoko; Ito, Satoshi; Kobayashi, Daisuke; Yamagata, Akira; Okada, Moe; Oofusa, Ken; Narita, Ichiei; Murasawa, Akira; Nakazono, Kiyoshi; Yoshie, Hiromasa
2014-11-01
Tumor necrosis factor (TNF)-α inhibitor has been shown to affect the periodontal condition of patients with rheumatoid arthritis (RA). The aim of the present study is to assess the effect of a fully humanized anti-TNF-α monoclonal antibody, adalimumab (ADA), on the periodontal condition of patients with RA and to compare serum protein profiles before and after ADA therapy. The study participants consisted of 20 patients with RA treated with ADA. Clinical periodontal and rheumatologic parameters and serum cytokine levels were evaluated at baseline and 3 months later. Serum protein spot volume was examined with two-dimensional sodium dodecyl sulfate polyacrylamide gel electrophoresis. Proteins with significant difference in abundance before and after ADA therapy were found and identified using mass spectrometry and protein databases. The patients showed a significant decrease in gingival index (P = 0.002), bleeding on probing (P = 0.003), probing depth (P = 0.002), disease activity score including 28 joints using C-reactive protein (P <0.001), and serum levels of TNF-α (P <0.001) and interleukin-6 (P <0.001) after ADA medication, although plaque levels were comparable. Among a total of 495 protein spots obtained, nine spots were significantly decreased in abundance at reassessment, corresponding to complement factor H, phospholipase D, serum amyloid A, complement component 4, and α-1-acid glycoprotein (P <0.01). These results suggest a beneficial effect of ADA therapy on the periodontal condition of patients with RA, which might be related to differences in serum protein profiles before and after ADA therapy.
Hong, Mei
2016-01-01
We have determined refined multidimensional chemical shift ranges for intra-residue correlations (13C–13C, 15N–13C, etc.) in proteins, which can be used to gain type-assignment and/or secondary-structure information from experimental NMR spectra. The chemical-shift ranges are the result of a statistical analysis of the PACSY database of >3000 proteins with 3D structures (1,200,207 13C chemical shifts and >3 million chemical shifts in total); these data were originally derived from the Biological Magnetic Resonance Data Bank. Using relatively simple non-parametric statistics to find peak maxima in the distributions of helix, sheet, coil and turn chemical shifts, and without the use of limited “hand-picked” data sets, we show that ~94 % of the 13C NMR data and almost all 15N data are quite accurately referenced and assigned, with smaller standard deviations (0.2 and 0.8 ppm, respectively) than recognized previously. On the other hand, approximately 6 % of the 13C chemical shift data in the PACSY database are shown to be clearly misreferenced, mostly by ca. −2.4 ppm. The removal of the misreferenced data and other outliers by this purging by intrinsic quality criteria (PIQC) allows for reliable identification of secondary maxima in the two-dimensional chemical-shift distributions already pre-separated by secondary structure. We demonstrate that some of these correspond to specific regions in the Ramachandran plot, including left-handed helix dihedral angles, reflect unusual hydrogen bonding, or are due to the influence of a following proline residue. With appropriate smoothing, significantly more tightly defined chemical shift ranges are obtained for each amino acid type in the different secondary structures. These chemical shift ranges, which may be defined at any statistical threshold, can be used for amino-acid type assignment and secondary-structure analysis of chemical shifts from intra-residue cross peaks by inspection or by using a provided command-line Python script (PLUQin), which should be useful in protein structure determination. The refined chemical shift distributions are utilized in a simple quality test (SQAT) that should be applied to new protein NMR data before deposition in a databank, and they could benefit many other chemical-shift based tools. PMID:26787537
Protein Identification Using Top-Down Spectra*
Liu, Xiaowen; Sirotkin, Yakov; Shen, Yufeng; Anderson, Gordon; Tsai, Yihsuan S.; Ting, Ying S.; Goodlett, David R.; Smith, Richard D.; Bafna, Vineet; Pevzner, Pavel A.
2012-01-01
In the last two years, because of advances in protein separation and mass spectrometry, top-down mass spectrometry moved from analyzing single proteins to analyzing complex samples and identifying hundreds and even thousands of proteins. However, computational tools for database search of top-down spectra against protein databases are still in their infancy. We describe MS-Align+, a fast algorithm for top-down protein identification based on spectral alignment that enables searches for unexpected post-translational modifications. We also propose a method for evaluating statistical significance of top-down protein identifications and further benchmark various software tools on two top-down data sets from Saccharomyces cerevisiae and Salmonella typhimurium. We demonstrate that MS-Align+ significantly increases the number of identified spectra as compared with MASCOT and OMSSA on both data sets. Although MS-Align+ and ProSightPC have similar performance on the Salmonella typhimurium data set, MS-Align+ outperforms ProSightPC on the (more complex) Saccharomyces cerevisiae data set. PMID:22027200
Protein Drug Targets of Lavandula angustifolia on treatment of Rat Alzheimer's Disease
Zali, Hakimeh; Zamanian-Azodi, Mona; Rezaei Tavirani, Mostafa; Akbar-zadeh Baghban, Alireza
2015-01-01
Different treatment strategies of Alzheimer's disease (AD) are being studied for treating or slowing the progression of AD. Many pharmaceutically important regulation systems operate through proteins as drug targets. Here, we investigate the drug target proteins in beta-amyloid (Aβ) injected rat hippocampus treated with Lavandula angustifolia (LA) by proteomics techniques. The reported study showed that lavender extract (LE) improves the spatial performance in AD animal model by diminishing Aβ production in histopathology of hippocampus, so in this study neuroprotective proteins expressed in Aβ injected rats treated with LE were scrutinized. Rats were divided into three groups including normal, Aβ injected, and Aβ injected that was treated with LE. Protein expression profiles of hippocampus tissue were determined by two-dimensional electrophoresis (2DE) method and dysregulated proteins such as Snca, NF-L, Hspa5, Prdx2, Apoa1, and Atp5a1were identified by MALDI-TOF/TOF. KEGG pathway and gene ontology (GO) categories were used by searching DAVID Bioinformatics Resources. All detected protein spots were used to determine predictedinteractions with other proteins in STRING online database. Different isoforms of important protein, Snca that exhibited neuroprotective effects by anti-apoptotic properties were expressed. NF-L involved in the maintenance of neuronal caliber. Hspa5 likewise Prdx2 displays as anti-apoptotic protein that Prdx2 also involved in the neurotrophic effects. Apoa1 has anti-inflammatory activity and Atp5a1, produces ATP from ADP. To sum up, these proteins as potential drug targets were expressed in hippocampus in response to effective components in LA may have therapeutic properties for the treatment of AD and other neurodegenerative diseases. PMID:25561935
Proteomic analysis of plasma membrane proteins in wheat roots exposed to phenanthrene.
Shen, Yu; Du, Jiangxue; Yue, Le; Zhan, Xinhua
2016-06-01
Polycyclic aromatic hydrocarbons (PAHs) are potentially carcinogenic and toxic to humans through ingestion of contaminated food crops. PAHs can enter crop roots through proton/PAH symporters; however, to date, the symporter remains unclear. Here we reveal, for the first time, the plasma membrane proteome of Triticum aestivum seedling roots in response to phenanthrene (a model PAH) exposure. Two-dimensional gel electrophoresis (2-DE) coupled with MALDI-TOF/TOF-MS and protein database search engines were employed to analyze and identify phenanthrene-responsive proteins. Over 192 protein spots are reproducibly detected in each gel, while 8 spots are differentially expressed under phenanthrene treatment. Phenanthrene induces five up-regulated proteins distinguished as 5-methyltetrahydropteroyltriglutamate-homocysteine methyltransferase 2, enolase, heat shock protein 80-2, probable mediator of RNA polymerase II transcription subunit 37e (heat shock 70-kDa protein 1), and lactoylglutathione lyase. Three proteins identified as adenosine kinase 2, 4-hydroxy-7-methoxy-3-oxo-3,4-dihydro-2H-1,4-benzoxazin-2-yl glucoside beta-D-glucosidase 1c, and glyceraldehyde-3-phosphate dehydrogenase 3 are down-regulated under exposure to phenanthrene. The up-regulated proteins are related to plant defense response, antioxidant system, and glycolysis. The down-regulated proteins involve the metabolism of high-energy compounds and plant growth. Magnesium, which is able to bind to enolase, can enhance the transport of phenanthrene into wheat roots. Therefore, it is concluded that phenanthrene can induce differential expression of proteins in relation to carbohydrate metabolism, self-defense, and plant growth on wheat root plasma membrane. This study not only provides novel insights into PAH uptake by plant roots and PAH stress responses, but is also a good starting point for further determination and analyses of their functions using genetic and other approaches.
Cong, Wei-Tao; Wang, Xu; Hwang, Sun-Young; Jin, Li-Tai; Choi, Jung-Kap
2012-01-01
A fast and matrix-assisted laser desorption/ionization mass spectrometry compatible protein staining method in one- and two-dimensional sodium dodecyl sulfate-polyacrylamide gel electrophoresis is described. It is based on the counterion dye staining method that employs oppositely charged two dyes, zincon and ethyl violet, to form an ion-pair complex. The protocol, including fixing, staining, and quick washing steps, can be completed in 1-1.5 h, depending upon gel thickness. It has the sensitivity comparable to the colloidal Coomassie Brilliant Blue G stain using phosphoric acid as a component of staining solution (4-8 ng). The counterion dye stain does not induce protein modifications that complicate interpretation of peptide mapping data from mass spectrometry. Considering the speed, sensitivity, and compatibility with mass spectrometry, the counterion dye stain may be more practical than any other dye-based protein stains for routine proteomic researches.
VIEWCACHE: An incremental pointer-based access method for autonomous interoperable databases
NASA Technical Reports Server (NTRS)
Roussopoulos, N.; Sellis, Timos
1992-01-01
One of biggest problems facing NASA today is to provide scientists efficient access to a large number of distributed databases. Our pointer-based incremental database access method, VIEWCACHE, provides such an interface for accessing distributed data sets and directories. VIEWCACHE allows database browsing and search performing inter-database cross-referencing with no actual data movement between database sites. This organization and processing is especially suitable for managing Astrophysics databases which are physically distributed all over the world. Once the search is complete, the set of collected pointers pointing to the desired data are cached. VIEWCACHE includes spatial access methods for accessing image data sets, which provide much easier query formulation by referring directly to the image and very efficient search for objects contained within a two-dimensional window. We will develop and optimize a VIEWCACHE External Gateway Access to database management systems to facilitate distributed database search.
Toward two-dimensional search engines
NASA Astrophysics Data System (ADS)
Ermann, L.; Chepelianskii, A. D.; Shepelyansky, D. L.
2012-07-01
We study the statistical properties of various directed networks using ranking of their nodes based on the dominant vectors of the Google matrix known as PageRank and CheiRank. On average PageRank orders nodes proportionally to a number of ingoing links, while CheiRank orders nodes proportionally to a number of outgoing links. In this way, the ranking of nodes becomes two dimensional which paves the way for the development of two-dimensional search engines of a new type. Statistical properties of information flow on the PageRank-CheiRank plane are analyzed for networks of British, French and Italian universities, Wikipedia, Linux Kernel, gene regulation and other networks. A special emphasis is done for British universities networks using the large database publicly available in the UK. Methods of spam links control are also analyzed.
ERIC Educational Resources Information Center
Battle, Gary M.; Allen, Frank H.; Ferrence, Gregory M.
2011-01-01
Parts 1 and 2 of this series described the educational value of experimental three-dimensional (3D) chemical structures determined by X-ray crystallography and retrieved from the crystallographic databases. In part 1, we described the information content of the Cambridge Structural Database (CSD) and discussed a representative teaching subset of…
ERIC Educational Resources Information Center
Battle, Gary M.; Allen, Frank H.; Ferrence, Gregory M.
2011-01-01
Parts 1 and 2 of this series described the educational value of experimental three-dimensional (3D) chemical structures determined by X-ray crystallography and retrieved from the crystallographic databases. In part 1, we described the information content of the Cambridge Structural Database (CSD) and discussed a representative teaching subset of…
Genome-Wide SNP Genotyping to Infer the Effects on Gene Functions in Tomato
Hirakawa, Hideki; Shirasawa, Kenta; Ohyama, Akio; Fukuoka, Hiroyuki; Aoki, Koh; Rothan, Christophe; Sato, Shusei; Isobe, Sachiko; Tabata, Satoshi
2013-01-01
The genotype data of 7054 single nucleotide polymorphism (SNP) loci in 40 tomato lines, including inbred lines, F1 hybrids, and wild relatives, were collected using Illumina's Infinium and GoldenGate assay platforms, the latter of which was utilized in our previous study. The dendrogram based on the genotype data corresponded well to the breeding types of tomato and wild relatives. The SNPs were classified into six categories according to their positions in the genes predicted on the tomato genome sequence. The genes with SNPs were annotated by homology searches against the nucleotide and protein databases, as well as by domain searches, and they were classified into the functional categories defined by the NCBI's eukaryotic orthologous groups (KOG). To infer the SNPs' effects on the gene functions, the three-dimensional structures of the 843 proteins that were encoded by the genes with SNPs causing missense mutations were constructed by homology modelling, and 200 of these proteins were considered to carry non-synonymous amino acid substitutions in the predicted functional sites. The SNP information obtained in this study is available at the Kazusa Tomato Genomics Database (http://plant1.kazusa.or.jp/tomato/). PMID:23482505
Proteomic platform for the identification of proteins in olive (Olea europaea) pulp.
Capriotti, Anna Laura; Cavaliere, Chiara; Foglia, Patrizia; Piovesana, Susy; Samperi, Roberto; Stampachiacchiere, Serena; Laganà, Aldo
2013-10-24
The nutritional and cancer-protective properties of the oil extracted mechanically from the ripe fruits of Olea europaea trees are attracting constantly more attention worldwide. The preparation of high-quality protein samples from plant tissues for proteomic analysis poses many challenging problems. In this study we employed a proteomic platform based on two different extraction methods, SDS and CHAPS based protocols, followed by two precipitation protocols, TCA/acetone and MeOH precipitation, in order to increase the final number of identified proteins. The use of advanced MS techniques in combination with the Swissprot and NCBI Viridiplantae databases and TAIR10 Arabidopsis database allowed us to identify 1265 proteins, of which 22 belong to O. europaea. The application of this proteomic platform for protein extraction and identification will be useful also for other proteomic studies on recalcitrant plant/fruit tissues. Copyright © 2013. Published by Elsevier B.V.
Functional Evolution of PLP-dependent Enzymes based on Active-Site Structural Similarities
Catazaro, Jonathan; Caprez, Adam; Guru, Ashu; Swanson, David; Powers, Robert
2014-01-01
Families of distantly related proteins typically have very low sequence identity, which hinders evolutionary analysis and functional annotation. Slowly evolving features of proteins, such as an active site, are therefore valuable for annotating putative and distantly related proteins. To date, a complete evolutionary analysis of the functional relationship of an entire enzyme family based on active-site structural similarities has not yet been undertaken. Pyridoxal-5’-phosphate (PLP) dependent enzymes are primordial enzymes that diversified in the last universal ancestor. Using the Comparison of Protein Active Site Structures (CPASS) software and database, we show that the active site structures of PLP-dependent enzymes can be used to infer evolutionary relationships based on functional similarity. The enzymes successfully clustered together based on substrate specificity, function, and three-dimensional fold. This study demonstrates the value of using active site structures for functional evolutionary analysis and the effectiveness of CPASS. PMID:24920327
Functional evolution of PLP-dependent enzymes based on active-site structural similarities.
Catazaro, Jonathan; Caprez, Adam; Guru, Ashu; Swanson, David; Powers, Robert
2014-10-01
Families of distantly related proteins typically have very low sequence identity, which hinders evolutionary analysis and functional annotation. Slowly evolving features of proteins, such as an active site, are therefore valuable for annotating putative and distantly related proteins. To date, a complete evolutionary analysis of the functional relationship of an entire enzyme family based on active-site structural similarities has not yet been undertaken. Pyridoxal-5'-phosphate (PLP) dependent enzymes are primordial enzymes that diversified in the last universal ancestor. Using the comparison of protein active site structures (CPASS) software and database, we show that the active site structures of PLP-dependent enzymes can be used to infer evolutionary relationships based on functional similarity. The enzymes successfully clustered together based on substrate specificity, function, and three-dimensional-fold. This study demonstrates the value of using active site structures for functional evolutionary analysis and the effectiveness of CPASS. © 2014 Wiley Periodicals, Inc.
A resource for benchmarking the usefulness of protein structure models.
Carbajo, Daniel; Tramontano, Anna
2012-08-02
Increasingly, biologists and biochemists use computational tools to design experiments to probe the function of proteins and/or to engineer them for a variety of different purposes. The most effective strategies rely on the knowledge of the three-dimensional structure of the protein of interest. However it is often the case that an experimental structure is not available and that models of different quality are used instead. On the other hand, the relationship between the quality of a model and its appropriate use is not easy to derive in general, and so far it has been analyzed in detail only for specific application. This paper describes a database and related software tools that allow testing of a given structure based method on models of a protein representing different levels of accuracy. The comparison of the results of a computational experiment on the experimental structure and on a set of its decoy models will allow developers and users to assess which is the specific threshold of accuracy required to perform the task effectively. The ModelDB server automatically builds decoy models of different accuracy for a given protein of known structure and provides a set of useful tools for their analysis. Pre-computed data for a non-redundant set of deposited protein structures are available for analysis and download in the ModelDB database. IMPLEMENTATION, AVAILABILITY AND REQUIREMENTS: Project name: A resource for benchmarking the usefulness of protein structure models. Project home page: http://bl210.caspur.it/MODEL-DB/MODEL-DB_web/MODindex.php.Operating system(s): Platform independent. Programming language: Perl-BioPerl (program); mySQL, Perl DBI and DBD modules (database); php, JavaScript, Jmol scripting (web server). Other requirements: Java Runtime Environment v1.4 or later, Perl, BioPerl, CPAN modules, HHsearch, Modeller, LGA, NCBI Blast package, DSSP, Speedfill (Surfnet) and PSAIA. License: Free. Any restrictions to use by non-academics: No.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Karpinets, Tatiana V; Park, Byung; Syed, Mustafa H
2010-01-01
The Carbohydrate-Active Enzyme (CAZy) database provides a rich set of manually annotated enzymes that degrade, modify, or create glycosidic bonds. Despite rich and invaluable information stored in the database, software tools utilizing this information for annotation of newly sequenced genomes by CAZy families are limited. We have employed two annotation approaches to fill the gap between manually curated high-quality protein sequences collected in the CAZy database and the growing number of other protein sequences produced by genome or metagenome sequencing projects. The first approach is based on a similarity search against the entire non-redundant sequences of the CAZy database. Themore » second approach performs annotation using links or correspondences between the CAZy families and protein family domains. The links were discovered using the association rule learning algorithm applied to sequences from the CAZy database. The approaches complement each other and in combination achieved high specificity and sensitivity when cross-evaluated with the manually curated genomes of Clostridium thermocellum ATCC 27405 and Saccharophagus degradans 2-40. The capability of the proposed framework to predict the function of unknown protein domains (DUF) and of hypothetical proteins in the genome of Neurospora crassa is demonstrated. The framework is implemented as a Web service, the CAZymes Analysis Toolkit (CAT), and is available at http://cricket.ornl.gov/cgi-bin/cat.cgi.« less
Park, Byung H; Karpinets, Tatiana V; Syed, Mustafa H; Leuze, Michael R; Uberbacher, Edward C
2010-12-01
The Carbohydrate-Active Enzyme (CAZy) database provides a rich set of manually annotated enzymes that degrade, modify, or create glycosidic bonds. Despite rich and invaluable information stored in the database, software tools utilizing this information for annotation of newly sequenced genomes by CAZy families are limited. We have employed two annotation approaches to fill the gap between manually curated high-quality protein sequences collected in the CAZy database and the growing number of other protein sequences produced by genome or metagenome sequencing projects. The first approach is based on a similarity search against the entire nonredundant sequences of the CAZy database. The second approach performs annotation using links or correspondences between the CAZy families and protein family domains. The links were discovered using the association rule learning algorithm applied to sequences from the CAZy database. The approaches complement each other and in combination achieved high specificity and sensitivity when cross-evaluated with the manually curated genomes of Clostridium thermocellum ATCC 27405 and Saccharophagus degradans 2-40. The capability of the proposed framework to predict the function of unknown protein domains and of hypothetical proteins in the genome of Neurospora crassa is demonstrated. The framework is implemented as a Web service, the CAZymes Analysis Toolkit, and is available at http://cricket.ornl.gov/cgi-bin/cat.cgi.
Facilitating the Hyphenation of CIEF and MALDI-MS for Two-Dimensional Separation of Proteins
Cheng, Chang; Lu, Joann J.; Wang, Xiayan; Roberts, Jonathan; Liu, Shaorong
2011-01-01
Both CIEF and MALDI-MS are frequently used in protein analysis, but hyphenation of the two is not investigated proportionally. One of the major reasons is that the additives (such as carrier ampholytes and detergent) in CIEF severely suppress the MALDI-MS signal, which hampers the hyphenation of the two. In this paper, we develop a simple means to alleviate the above signal-suppressing effect. We first deposit 1 µL of water onto a MALDI-MS target, deliver a fraction of CIEF-separated protein (~0.1 µL) to the water droplet, evaporate the solvent, add 0.5 µL of MALDI matrix to the sample spot, dry the matrix, and move the target plate to a MALDI-TOF-MS for mass spectrum measurement. We optimize the droplet volume and the laser-ablation region. Under the optimized conditions, we improve the signal to noise ratio by 2–10 fold. We also apply this method for two-dimensional separations of standard proteins and Apolipoprotein A-I, a membrane protein expressed in E. Coli cells. PMID:20603827
NASA Technical Reports Server (NTRS)
Stan-Lotter, Helga; Lang, Frank J., Jr.; Hochstein, Lawrence I.
1989-01-01
The subunits from two purified halobacterial membrane enzymes (ATPase and nitrate reductase) behaved differently with respect to isoelectric focusing, silver staining and interaction with ampholytes. Differential behavior was also observed in whole cell proteins from Halobacterium saccharovorum regarding resolution in two-dimensional gels and silver staining. It is proposed that these differences reflect the existence of two classes of halobacterial proteins.
Bhaskara, Ramachandra M; Padhi, Amrita; Srinivasan, Narayanaswamy
2014-07-01
With the preponderance of multidomain proteins in eukaryotic genomes, it is essential to recognize the constituent domains and their functions. Often function involves communications across the domain interfaces, and the knowledge of the interacting sites is essential to our understanding of the structure-function relationship. Using evolutionary information extracted from homologous domains in at least two diverse domain architectures (single and multidomain), we predict the interface residues corresponding to domains from the two-domain proteins. We also use information from the three-dimensional structures of individual domains of two-domain proteins to train naïve Bayes classifier model to predict the interfacial residues. Our predictions are highly accurate (∼85%) and specific (∼95%) to the domain-domain interfaces. This method is specific to multidomain proteins which contain domains in at least more than one protein architectural context. Using predicted residues to constrain domain-domain interaction, rigid-body docking was able to provide us with accurate full-length protein structures with correct orientation of domains. We believe that these results can be of considerable interest toward rational protein and interaction design, apart from providing us with valuable information on the nature of interactions. © 2013 Wiley Periodicals, Inc.
FARE-CAFE: a database of functional and regulatory elements of cancer-associated fusion events
Korla, Praveen Kumar; Cheng, Jack; Huang, Chien-Hung; Tsai, Jeffrey J. P.; Liu, Yu-Hsuan; Kurubanjerdjit, Nilubon; Hsieh, Wen-Tsong; Chen, Huey-Yi; Ng, Ka-Lok
2015-01-01
Chromosomal translocation (CT) is of enormous clinical interest because this disorder is associated with various major solid tumors and leukemia. A tumor-specific fusion gene event may occur when a translocation joins two separate genes. Currently, various CT databases provide information about fusion genes and their genomic elements. However, no database of the roles of fusion genes, in terms of essential functional and regulatory elements in oncogenesis, is available. FARE-CAFE is a unique combination of CTs, fusion proteins, protein domains, domain–domain interactions, protein–protein interactions, transcription factors and microRNAs, with subsequent experimental information, which cannot be found in any other CT database. Genomic DNA information including, for example, manually collected exact locations of the first and second break points, sequences and karyotypes of fusion genes are included. FARE-CAFE will substantially facilitate the cancer biologist’s mission of elucidating the pathogenesis of various types of cancer. This database will ultimately help to develop ‘novel’ therapeutic approaches. Database URL: http://ppi.bioinfo.asia.edu.tw/FARE-CAFE PMID:26384373
TFBSshape: a motif database for DNA shape features of transcription factor binding sites
Yang, Lin; Zhou, Tianyin; Dror, Iris; Mathelier, Anthony; Wasserman, Wyeth W.; Gordân, Raluca; Rohs, Remo
2014-01-01
Transcription factor binding sites (TFBSs) are most commonly characterized by the nucleotide preferences at each position of the DNA target. Whereas these sequence motifs are quite accurate descriptions of DNA binding specificities of transcription factors (TFs), proteins recognize DNA as a three-dimensional object. DNA structural features refine the description of TF binding specificities and provide mechanistic insights into protein–DNA recognition. Existing motif databases contain extensive nucleotide sequences identified in binding experiments based on their selection by a TF. To utilize DNA shape information when analysing the DNA binding specificities of TFs, we developed a new tool, the TFBSshape database (available at http://rohslab.cmb.usc.edu/TFBSshape/), for calculating DNA structural features from nucleotide sequences provided by motif databases. The TFBSshape database can be used to generate heat maps and quantitative data for DNA structural features (i.e., minor groove width, roll, propeller twist and helix twist) for 739 TF datasets from 23 different species derived from the motif databases JASPAR and UniPROBE. As demonstrated for the basic helix-loop-helix and homeodomain TF families, our TFBSshape database can be used to compare, qualitatively and quantitatively, the DNA binding specificities of closely related TFs and, thus, uncover differential DNA binding specificities that are not apparent from nucleotide sequence alone. PMID:24214955
DOE Office of Scientific and Technical Information (OSTI.GOV)
Willard, K.E.; Thorsrud, A.K.; Munthe, E.
Human leukocyte proteins from more than 150 patients with rheumatoid arthritis, together with age- and sex-matched controls, were analyzed by use of the ISO-DALT technique of two-dimensional polyacrylamide gel electrophoresis. Patients with ankylosing spondylitis, polymyalgia rheumatica, psoriatic arthritis, calcium tendinitis, post-infectious arthritis, and asymmetrical seronegative arthritis were also included as positive controls. Synthesis of several proteins, referred to by number as members of the Rheuma set, is shown to increase in the leukocyte preparations from patients with classical rheumatoid arthritis. Several of these proteins are specific to monocytes or granulocytes; others are of unknown cellular origin, but appear to bemore » unique to rheumatoid arthritis. The Rheuma proteins appear to be indicators of disease activity, because their increased synthesis can be correlated with sedimentation rate and other clinical indices of rheumatoid disease activity.« less
Guo, Jun-Hui; Xing, Guo-Lan; Fang, Xin-Hui; Wu, Hui-Fang; Zhang, Bo; Yu, Jin-Zhong; Fan, Zong-Min; Wang, Li-Dong
2017-01-01
AIM To understand the molecular mechanism of esophageal cancer development and provide molecular markers for screening high-risk populations and early diagnosis. METHODS Two-dimensional electrophoresis combined with mass spectrometry were adopted to screen differentially expressed proteins in nine cases of fetal esophageal epithelium, eight cases of esophageal cancer, and eight cases of tumor-adjacent normal esophageal epithelium collected from fetuses of different gestational age, or esophageal cancer patients from a high-risk area of esophageal cancer in China. Immunohistochemistry (avidin-biotin-horseradish peroxidase complex method) was used to detect the expression of peroxiredoxin (PRX)6 in 91 cases of esophageal cancer, tumor-adjacent normal esophageal tissue, basal cell hyperplasia, dysplasia, and carcinoma in situ, as well as 65 cases of esophageal epithelium from fetuses at a gestational age of 3-9 mo. RESULTS After peptide mass fingerprint analysis and search of protein databases, 21 differential proteins were identified; some of which represent a protein isoform. Varying degrees of expression of PRX6 protein, which was localized mainly in the cytoplasm, were detected in adult and fetal normal esophageal tissues, precancerous lesions, and esophageal cancer. With the progression of esophageal lesions, PRX6 protein expression showed a declining trend (P < 0.05). In fetal epithelium from fetuses at gestational age 3-6 mo, PRX6 protein expression showed a declining trend with age (P < 0.05). PRX6 protein expression was significantly higher in well-differentiated esophageal cancer tissues than in poorly differentiated esophageal cancer tissues (P < 0.05). CONCLUSION Development and progression of esophageal cancer result from interactions of genetic changes (accumulation or superposition). PRX6 protein is associated with fetal esophageal development and cancer differentiation. PMID:28293090
Guo, Jun-Hui; Xing, Guo-Lan; Fang, Xin-Hui; Wu, Hui-Fang; Zhang, Bo; Yu, Jin-Zhong; Fan, Zong-Min; Wang, Li-Dong
2017-02-28
To understand the molecular mechanism of esophageal cancer development and provide molecular markers for screening high-risk populations and early diagnosis. Two-dimensional electrophoresis combined with mass spectrometry were adopted to screen differentially expressed proteins in nine cases of fetal esophageal epithelium, eight cases of esophageal cancer, and eight cases of tumor-adjacent normal esophageal epithelium collected from fetuses of different gestational age, or esophageal cancer patients from a high-risk area of esophageal cancer in China. Immunohistochemistry (avidin-biotin-horseradish peroxidase complex method) was used to detect the expression of peroxiredoxin (PRX)6 in 91 cases of esophageal cancer, tumor-adjacent normal esophageal tissue, basal cell hyperplasia, dysplasia, and carcinoma in situ , as well as 65 cases of esophageal epithelium from fetuses at a gestational age of 3-9 mo. After peptide mass fingerprint analysis and search of protein databases, 21 differential proteins were identified; some of which represent a protein isoform. Varying degrees of expression of PRX6 protein, which was localized mainly in the cytoplasm, were detected in adult and fetal normal esophageal tissues, precancerous lesions, and esophageal cancer. With the progression of esophageal lesions, PRX6 protein expression showed a declining trend ( P < 0.05). In fetal epithelium from fetuses at gestational age 3-6 mo, PRX6 protein expression showed a declining trend with age ( P < 0.05). PRX6 protein expression was significantly higher in well-differentiated esophageal cancer tissues than in poorly differentiated esophageal cancer tissues ( P < 0.05). Development and progression of esophageal cancer result from interactions of genetic changes (accumulation or superposition). PRX6 protein is associated with fetal esophageal development and cancer differentiation.
Nabian, S; Taheri, M; Fard, R Mazaheri Nezhad; Aramoon, M
2013-01-01
Background Boophilus annulatus is an obligate blood feeder tick that can cause great losses in animals due to anemia and its ability to injure its host skin directly. The aim of this study was identification of cattle humoral immune response to some tick proteins during experimental infestation. Methods Immune sera against tick were collected from experimentally infested cattle with ticks. One and two-dimensional electrophoresis and Western blotting methods were used for the detection of immunogenic proteins in larval tick extract and eight of these proteins were identified by MALDITOF and MALDI-TOF-TOF mass spectrometry. Results In non-reducing one-dimensional SDS-PAGE, some bounds between 12 to more than 250-kDa appeared. In two-dimensional SDS-PAGE, numerous spot appeared and the identified immunogenic proteins by parallel immunoblotting weighted between 14 and 97 kDa. Amino acid sequences of protein spot with 37-kDa molecular weight had identity to tropomyosin based on Mascot search in NCBI. Conclusion Anti tropomyosin antibodies can be induced in experimentally infested hosts with ticks and it seems that tropomyosin can be useful for the development of anti tick vaccines. PMID:23914237
Database-Guided Discovery of Potent Peptides to Combat HIV-1 or Superbugs
Wang, Guangshun
2013-01-01
Antimicrobial peptides (AMPs), small host defense proteins, are indispensable for the protection of multicellular organisms such as plants and animals from infection. The number of AMPs discovered per year increased steadily since the 1980s. Over 2,000 natural AMPs from bacteria, protozoa, fungi, plants, and animals have been registered into the antimicrobial peptide database (APD). The majority of these AMPs (>86%) possess 11–50 amino acids with a net charge from 0 to +7 and hydrophobic percentages between 31–70%. This article summarizes peptide discovery on the basis of the APD. The major methods are the linguistic model, database screening, de novo design, and template-based design. Using these methods, we identified various potent peptides against human immunodeficiency virus type 1 (HIV-1) or methicillin-resistant Staphylococcus aureus (MRSA). While the stepwise designed anti-HIV peptide is disulfide-linked and rich in arginines, the ab initio designed anti-MRSA peptide is linear and rich in leucines. Thus, there are different requirements for antiviral and antibacterial peptides, which could kill pathogens via different molecular targets. The biased amino acid composition in the database-designed peptides, or natural peptides such as θ-defensins, requires the use of the improved two-dimensional NMR method for structural determination to avoid the publication of misleading structure and dynamics. In the case of human cathelicidin LL-37, structural determination requires 3D NMR techniques. The high-quality structure of LL-37 provides a solid basis for understanding its interactions with membranes of bacteria and other pathogens. In conclusion, the APD database is a comprehensive platform for storing, classifying, searching, predicting, and designing potent peptides against pathogenic bacteria, viruses, fungi, parasites, and cancer cells. PMID:24276259
NASA Astrophysics Data System (ADS)
De Marco, Luigi; Haky, Andrew; Tokmakoff, Andrei
Two-dimensional infrared (2D IR) spectroscopy has proven itself an indispensable tool for studying molecular dynamics and intermolecular interactions on ultrafast timescales. Using a novel source of broadband mid-IR pulses, we have collected 2D IR spectra of protein films at varying levels of hydration. With 2D IR, we can directly observe coupling between water's motions and the protein's. Protein films provide us with the ability to discriminate hydration waters from bulk water and thus give us access to studying water dynamics along the protein backbone, fluctuations in the protein structure, and the interplay between the molecular dynamics of the two. We present two representative protein films: poly-L-proline (PLP) and hen egg-white lysozyme (HEWL). Having no N-H groups, PLP allows us to look at water dynamics without interference from resonant energy transfer between the protein N-H stretch and the water O-H stretch. We conclude that at low hydration levels water-protein interactions dominate, and the water's dynamics are tied to those of the protein. In HEWL films, we take advantage of the robust secondary structure to partially deuterate the film, allowing us to spectrally distinguish the protein core from the exterior. From this, we show that resonant energy transfer to water provides an effective means of dissipating excess energy within the protein, while maintaining the structure. These methods are general and can easily be extended to studying specific protein-water interactions.
The three-dimensional structure of aquaporin-1
NASA Astrophysics Data System (ADS)
Walz, Thomas; Hirai, Teruhisa; Murata, Kazuyoshi; Heymann, J. Bernard; Mitsuoka, Kaoru; Fujiyoshi, Yoshinori; Smith, Barbara L.; Agre, Peter; Engel, Andreas
1997-06-01
The entry and exit of water from cells is a fundamental process of life. Recognition of the high water permeability of red blood cells led to the proposal that specialized water pores exist in the plasma membrane. Expression in Xenopus oocytes and functional studies of an erythrocyte integral membrane protein of relative molecular mass 28,000, identified it as the mercury-sensitive water channel, aquaporin-1 (AQP1). Many related proteins, all belonging to the major intrinsic protein (MIP) family, are found throughout nature. AQP1 is a homotetramer containing four independent aqueous channels. When reconstituted into lipid bilayers, the protein forms two-dimensional lattices with a unit cell containing two tetramers in opposite orientation. Here we present the three-dimensional structure of AQP1 determined at 6Å resolution by cryo-electron microscopy. Each AQP1 monomer has six tilted, bilayer-spanning α-helices which form a right-handed bundle surrounding a central density. These results, together with functional studies, provide a model that identifies the aqueous pore in the AQP1 molecule and indicates the organization of the tetrameric complex in the membrane.
Rapid and efficient proteolysis through laser-assisted immobilized enzyme reactors.
Zhang, Peng; Gao, Mingxia; Zhu, Shaochun; Lei, Jie; Zhang, Xiangmin
2011-11-25
In this report, laser radiation (808nm) for the first time was employed to enhance the efficiency of proteolysis through immobilized enzyme reactor (IMER). IMER based monolithic support was prepared in the fused-silica capillary via a simple two-step procedure including acryloylation on trypsin surface and in situ aqueous polymerization/immobilization. The feasibility and high efficiency of the laser-assisted IMER were demonstrated by the digestion of bovine serum albumin (BSA), cytochrome c (Cyt-c) and β-casein. The digestion process was achieved in 60s. The peptides were identified by MALDI-TOF-MS, yielding the sequence coverage of 33% for BSA, 73% for Cyt-c and 22% for β-casein. The comparisons between the in-solution digestion and on IMER reaction with/without laser assistance were made. To further confirm its efficiency in proteome analysis, the laser-assisted IMER was also applied to the analysis of one fraction of human serum sample through two-dimensional (2-D) separation of strong anion exchange/reversed-phase liquid chromatography (SAX/RPLC). After a database search, 49 unique peptides corresponding to 5 proteins were identified. The results showed that the laser-assisted IMER provides a promising platform for the high-throughput protein identification. Copyright © 2011 Elsevier B.V. All rights reserved.
VIEWCACHE: An incremental pointer-based access method for autonomous interoperable databases
NASA Technical Reports Server (NTRS)
Roussopoulos, N.; Sellis, Timos
1993-01-01
One of the biggest problems facing NASA today is to provide scientists efficient access to a large number of distributed databases. Our pointer-based incremental data base access method, VIEWCACHE, provides such an interface for accessing distributed datasets and directories. VIEWCACHE allows database browsing and search performing inter-database cross-referencing with no actual data movement between database sites. This organization and processing is especially suitable for managing Astrophysics databases which are physically distributed all over the world. Once the search is complete, the set of collected pointers pointing to the desired data are cached. VIEWCACHE includes spatial access methods for accessing image datasets, which provide much easier query formulation by referring directly to the image and very efficient search for objects contained within a two-dimensional window. We will develop and optimize a VIEWCACHE External Gateway Access to database management systems to facilitate database search.
Rasool, Khawaja Ghulam; Khan, Muhammad Altaf; Aldawood, Abdulrahman Saad; Tufail, Muhammad; Mukhtar, Muhammad; Takeda, Makio
2015-01-01
A state of the art proteomic methodology using Matrix Assisted Laser Desorption/Ionization-Time of Flight (MALDI TOF) has been employed to characterize peptides modulated in the date palm stem subsequent to infestation with red palm weevil (RPW). Our analyses revealed 32 differentially expressed peptides associated with RPW infestation in date palm stem. To identify RPW infestation associated peptides (I), artificially wounded plants (W) were used as additional control beside uninfested plants, a conventional control (C). A constant unique pattern of differential expression in infested (I), wounded (W) stem samples compared to control (C) was observed. The upregulated proteins showed relative fold intensity in order of I > W and downregulated spots trend as W > I, a quite interesting pattern. This study also reveals that artificially wounding of date palm stem affects almost the same proteins as infestation; however, relative intensity is quite lower than in infested samples both in up and downregulated spots. All 32 differentially expressed spots were subjected to MALDI-TOF analysis for their identification and we were able to match 21 proteins in the already existing databases. Relatively significant modulated expression pattern of a number of peptides in infested plants predicts the possibility of developing a quick and reliable molecular methodology for detecting plants infested with date palm. PMID:26287180
Rasool, Khawaja Ghulam; Khan, Muhammad Altaf; Aldawood, Abdulrahman Saad; Tufail, Muhammad; Mukhtar, Muhammad; Takeda, Makio
2015-08-17
A state of the art proteomic methodology using Matrix Assisted Laser Desorption/Ionization-Time of Flight (MALDI TOF) has been employed to characterize peptides modulated in the date palm stem subsequent to infestation with red palm weevil (RPW). Our analyses revealed 32 differentially expressed peptides associated with RPW infestation in date palm stem. To identify RPW infestation associated peptides (I), artificially wounded plants (W) were used as additional control beside uninfested plants, a conventional control (C). A constant unique pattern of differential expression in infested (I), wounded (W) stem samples compared to control (C) was observed. The upregulated proteins showed relative fold intensity in order of I > W and downregulated spots trend as W > I, a quite interesting pattern. This study also reveals that artificially wounding of date palm stem affects almost the same proteins as infestation; however, relative intensity is quite lower than in infested samples both in up and downregulated spots. All 32 differentially expressed spots were subjected to MALDI-TOF analysis for their identification and we were able to match 21 proteins in the already existing databases. Relatively significant modulated expression pattern of a number of peptides in infested plants predicts the possibility of developing a quick and reliable molecular methodology for detecting plants infested with date palm.
PrionScan: an online database of predicted prion domains in complete proteomes.
Espinosa Angarica, Vladimir; Angulo, Alfonso; Giner, Arturo; Losilla, Guillermo; Ventura, Salvador; Sancho, Javier
2014-02-05
Prions are a particular type of amyloids related to a large variety of important processes in cells, but also responsible for serious diseases in mammals and humans. The number of experimentally characterized prions is still low and corresponds to a handful of examples in microorganisms and mammals. Prion aggregation is mediated by specific protein domains with a remarkable compositional bias towards glutamine/asparagine and against charged residues and prolines. These compositional features have been used to predict new prion proteins in the genomes of different organisms. Despite these efforts, there are only a few available data sources containing prion predictions at a genomic scale. Here we present PrionScan, a new database of predicted prion-like domains in complete proteomes. We have previously developed a predictive methodology to identify and score prionogenic stretches in protein sequences. In the present work, we exploit this approach to scan all the protein sequences in public databases and compile a repository containing relevant information of proteins bearing prion-like domains. The database is updated regularly alongside UniprotKB and in its present version contains approximately 28000 predictions in proteins from different functional categories in more than 3200 organisms from all the taxonomic subdivisions. PrionScan can be used in two different ways: database query and analysis of protein sequences submitted by the users. In the first mode, simple queries allow to retrieve a detailed description of the properties of a defined protein. Queries can also be combined to generate more complex and specific searching patterns. In the second mode, users can submit and analyze their own sequences. It is expected that this database would provide relevant insights on prion functions and regulation from a genome-wide perspective, allowing researches performing cross-species prion biology studies. Our database might also be useful for guiding experimentalists in the identification of new candidates for further experimental characterization.
Kaszycki, Paweł; Dubicka-Lisowska, Aleksandra; Augustynowicz, Joanna; Piwowarczyk, Barbara; Wesołowski, Wojciech
2018-03-01
Chromate-induced physiological stress in a water-submerged macrophyte Callitriche cophocarpa Sendtn. (water starwort) was tested at the proteomic level. The oxidative stress status of the plant treated with 1 mM Cr(VI) for 3 days revealed stimulation of peroxidases whereas catalase and superoxide dismutase activities were similar to the control levels. Employing two-dimensional electrophoresis, comparative proteomics enabled to detect five differentiating proteins subjected to identification with mass spectrometry followed by an NCBI database search. Cr(VI) incubation led to induction of light harvesting chlorophyll a/b binding protein with a concomitant decrease of accumulation of ribulose bisphosphate carboxylase (RuBisCO). The main finding was, however, the identification of an NAD(P)H-dependent dehydrogenase FQR1, detectable only in Cr(VI)-treated plants. The FQR1 flavoenzyme is known to be responsive to oxidative stress and to act as a detoxification protein by protecting the cells against oxidative damage. It exhibits the in vitro quinone reductase activity and is capable of catalyzing two-electron transfer from NAD(P)H to several substrates, presumably including Cr(VI). The enhanced accumulation of FQR1 was chromate-specific since other stressful conditions, such as salt, temperature, and oxidative stresses, all failed to induce the protein. Zymographic analysis of chromate-treated Callitriche shoots showed a novel enzymatic protein band whose activity was attributed to the newly identified enzyme. We suggest that Cr(VI) phytoremediation with C. cophocarpa can be promoted by chromate reductase activity produced by the induced quinone oxidoreductase which might take part in Cr(VI) → Cr(III) bioreduction process and thus enable the plant to cope with the chromate-generated oxidative stress.
Proteomic Analysis and Functional Studies of Baicalin on Proteins Associated with Skin Cancer.
Li, Dan; Lin, Bingjiang; Yusuf, Nabiha; Burns, Erin M; Yu, Xiuqin; Luo, Dan; Min, Wei
2017-01-01
Abundant evidence supports the key role of ultraviolet radiation (UVR) in skin cancer development. The human skin, especially the epidermal layer, is the main defense against UV radiation. Baicalin is a major bioactive component of Scutellaria baicalensis Georgi, a plant which has been found to exhibit antitumor activity. The anticarcinogenic mechanism of baicalin is not completely understood. We have reported that baicalin inhibited UVB-induced photo-damage and apoptosis in HaCaT cells (human skin keratinocytes). The aim of the present study is to investigate the cellular gene targets responsible for baicalin's antitumor activity by performing two-dimensional electrophoresis liquid chromatography-mass spectrometry/mass spectrometry (2-DE LC-MS/MS) with HaCaT cells following UVB and baicalin exposure. Two-DE for protein separation was performed, followed by matrix-assisted laser desorption/ionization mass spectrometry and database searches. Nucleophosmin (NPM)-specific siRNA was designed and synthesized, and the small interfering RNA was transfected into skin squamous cancer A431 cells to knockdown the NPM expression. Proliferation and cell cycle status were assessed by CCK8 and flow cytometric analyses, respectively. We have identified 38 protein spots that are differentially expressed in HaCaT cells exposed to baicalin and/or UVB irradiation These proteins are involved in detoxification, proliferation, metabolism, cytoskeleton and motility. In particular, we found several proteins that have been linked to tumor progression and resistance, such as NPM. Baicalin treatment reduced the cellular proliferation rate and induced arrest during the S-phase of the cell cycle in A431 cells. NPM1 silencing significantly enhanced the effect of baicalin. Our data indicated that baicalin results in the significant inhibition of tumor growth in the A431 cell line, which may be associated with the regulation of the NPM gene expression.
Adherence is a multi-dimensional construct in the POUNDS LOST trial
Williamson, Donald A.; Anton, Stephen D.; Han, Hongmei; Champagne, Catherine M.; Allen, Ray; LeBlanc, Eric; Ryan, Donna H.; McManus, Katherine; Laranjo, Nancy; Carey, Vincent J.; Loria, Catherine M.; Bray, George A.; Sacks, Frank M.
2011-01-01
Research on the conceptualization of adherence to treatment has not addressed a key question: Is adherence best defined as being a uni-dimensional or multi-dimensional behavioral construct? The primary aim of this study was to test which of these conceptual models best described adherence to a weight management program. This ancillary study was conducted as a part of the POUNDS LOST trial that tested the efficacy of four dietary macro-nutrient compositions for promoting weight loss. A sample of 811 overweight/obese adults was recruited across two clinical sites, and each participant was randomly assigned to one of four macronutrient prescriptions: (1) Low fat (20% of energy), average protein (15% of energy); (2) High fat (40%), average protein (15%); (3) Low fat (20%), high protein (25%); (4) High fat (40%), high protein (25%). Throughout the first 6 months of the study, a computer tracking system collected data on eight indicators of adherence. Computer tracking data from the initial 6 months of the intervention were analyzed using exploratory and confirmatory analyses. Two factors (accounting for 66% of the variance) were identified and confirmed: (1) behavioral adherence and (2) dietary adherence. Behavioral adherence did not differ across the four interventions, but prescription of a high fat diet (vs. a low fat diet) was found to be associated with higher levels of dietary adherence. The findings of this study indicated that adherence to a weight management program was best conceptualized as being multi-dimensional, with two dimensions: behavioral and dietary adherence. PMID:19856202
Protein space: a natural method for realizing the nature of protein universe.
Yu, Chenglong; Deng, Mo; Cheng, Shiu-Yuen; Yau, Shek-Chung; He, Rong L; Yau, Stephen S-T
2013-02-07
Current methods cannot tell us what the nature of the protein universe is concretely. They are based on different models of amino acid substitution and multiple sequence alignment which is an NP-hard problem and requires manual intervention. Protein structural analysis also gives a direction for mapping the protein universe. Unfortunately, now only a minuscule fraction of proteins' 3-dimensional structures are known. Furthermore, the phylogenetic tree representations are not unique for any existing tree construction methods. Here we develop a novel method to realize the nature of protein universe. We show the protein universe can be realized as a protein space in 60-dimensional Euclidean space using a distance based on a normalized distribution of amino acids. Every protein is in one-to-one correspondence with a point in protein space, where proteins with similar properties stay close together. Thus the distance between two points in protein space represents the biological distance of the corresponding two proteins. We also propose a natural graphical representation for inferring phylogenies. The representation is natural and unique based on the biological distances of proteins in protein space. This will solve the fundamental question of how proteins are distributed in the protein universe. Copyright © 2012 Elsevier Ltd. All rights reserved.
Multiple brain atlas database and atlas-based neuroimaging system.
Nowinski, W L; Fang, A; Nguyen, B T; Raphel, J K; Jagannathan, L; Raghavan, R; Bryan, R N; Miller, G A
1997-01-01
For the purpose of developing multiple, complementary, fully labeled electronic brain atlases and an atlas-based neuroimaging system for analysis, quantification, and real-time manipulation of cerebral structures in two and three dimensions, we have digitized, enhanced, segmented, and labeled the following print brain atlases: Co-Planar Stereotaxic Atlas of the Human Brain by Talairach and Tournoux, Atlas for Stereotaxy of the Human Brain by Schaltenbrand and Wahren, Referentially Oriented Cerebral MRI Anatomy by Talairach and Tournoux, and Atlas of the Cerebral Sulci by Ono, Kubik, and Abernathey. Three-dimensional extensions of these atlases have been developed as well. All two- and three-dimensional atlases are mutually preregistered and may be interactively registered with an actual patient's data. An atlas-based neuroimaging system has been developed that provides support for reformatting, registration, visualization, navigation, image processing, and quantification of clinical data. The anatomical index contains about 1,000 structures and over 400 sulcal patterns. Several new applications of the brain atlas database also have been developed, supported by various technologies such as virtual reality, the Internet, and electronic publishing. Fusion of information from multiple atlases assists the user in comprehensively understanding brain structures and identifying and quantifying anatomical regions in clinical data. The multiple brain atlas database and atlas-based neuroimaging system have substantial potential impact in stereotactic neurosurgery and radiotherapy by assisting in visualization and real-time manipulation in three dimensions of anatomical structures, in quantitative neuroradiology by allowing interactive analysis of clinical data, in three-dimensional neuroeducation, and in brain function studies.
Fernández-Suárez, Xosé M; Rigden, Daniel J; Galperin, Michael Y
2014-01-01
The 2014 Nucleic Acids Research Database Issue includes descriptions of 58 new molecular biology databases and recent updates to 123 databases previously featured in NAR or other journals. For convenience, the issue is now divided into eight sections that reflect major subject categories. Among the highlights of this issue are six databases of the transcription factor binding sites in various organisms and updates on such popular databases as CAZy, Database of Genomic Variants (DGV), dbGaP, DrugBank, KEGG, miRBase, Pfam, Reactome, SEED, TCDB and UniProt. There is a strong block of structural databases, which includes, among others, the new RNA Bricks database, updates on PDBe, PDBsum, ArchDB, Gene3D, ModBase, Nucleic Acid Database and the recently revived iPfam database. An update on the NCBI's MMDB describes VAST+, an improved tool for protein structure comparison. Two articles highlight the development of the Structural Classification of Proteins (SCOP) database: one describes SCOPe, which automates assignment of new structures to the existing SCOP hierarchy; the other one describes the first version of SCOP2, with its more flexible approach to classifying protein structures. This issue also includes a collection of articles on bacterial taxonomy and metagenomics, which includes updates on the List of Prokaryotic Names with Standing in Nomenclature (LPSN), Ribosomal Database Project (RDP), the Silva/LTP project and several new metagenomics resources. The NAR online Molecular Biology Database Collection, http://www.oxfordjournals.org/nar/database/c/, has been expanded to 1552 databases. The entire Database Issue is freely available online on the Nucleic Acids Research website (http://nar.oxfordjournals.org/).
Pané-Farré, Jan; Kusch, Harald; Wolf, Carmen; Reiß, Swantje; Binh, Le Thi Nguyen; Albrecht, Dirk; Riedel, Katharina; Hecker, Michael; Engelmann, Susanne
2013-01-01
Gel-based proteomics is a powerful approach to study the physiology of Staphylococcus aureus under various growth restricting conditions. We analyzed 679 protein spots from a reference 2-dimensional gel of cytosolic proteins of S. aureus COL by mass spectrometry resulting in 521 different proteins. 4,692 time dependent protein synthesis profiles were generated by exposing S. aureus to nine infection-related stress and starvation stimuli (H2O2, diamide, paraquat, NO, fermentation, nitrate respiration, heat shock, puromycin, mupirocin). These expression profiles are stored in an online resource called Aureolib (http://www.aureolib.de). Moreover, information on target genes of 75 regulators and regulatory elements were included in the database. Cross-comparisons of this extensive data collection of protein synthesis profiles using the tools implemented in Aureolib lead to the identification of stress and starvation specific marker proteins. Altogether, 226 protein synthesis profiles showed induction ratios of 2.5-fold or higher under at least one of the tested conditions with 157 protein synthesis profiles specifically induced in response to a single stimulus. The respective proteins might serve as marker proteins for the corresponding stimulus. By contrast, proteins whose synthesis was increased or repressed in response to more than four stimuli are rather exceptional. The only protein that was induced by six stimuli is the universal stress protein SACOL1759. Most strikingly, cluster analyses of synthesis profiles of proteins differentially synthesized under at least one condition revealed only in rare cases a grouping that correlated with known regulon structures. The most prominent examples are the GapR, Rex, and CtsR regulon. In contrast, protein synthesis profiles of proteins belonging to the CodY and σB regulon are widely distributed. In summary, Aureolib is by far the most comprehensive protein expression database for S. aureus and provides an essential tool to decipher more complex adaptation processes in S. aureus during host pathogen interaction. PMID:23967085
Proteomic analysis of PSD-93 knockout mice following the induction of ischemic cerebral injury.
Rong, Rong; Yang, Hui; Rong, Liangqun; Wei, Xiue; Li, Qingjie; Liu, Xiaomei; Gao, Hong; Xu, Yun; Zhang, Qingxiu
2016-03-01
Postsynaptic density protein-93 (PSD-93) is enriched in the postsynaptic density and is involved in N-methyl-d-aspartate receptor (NMDAR) triggered neurotoxicity through PSD-93/NMDAR/nNOS signaling pathway. In the present study, we found that PSD-93 deficiency reduced infarcted volume and neurological deficits induced by transient middle cerebral artery occlusion (tMCAO) in the mice. To identify novel targets of PSD-93 related neurotoxicity, we applied isobaric tags for relative and absolute quantitative (iTRAQ) labeling and combined this labeling with on-line two-dimensional LC/MS/MS technology to elucidate the changes in protein expression in PSD-93 knockout mice following tMCAO. The proteomic data set consisted of 1892 proteins. Compared to control group, differences in expression levels in ischemic group >1.5-fold and <0.66-fold were considered as differential expression. A total of 104 unique proteins with differential abundance levels were identified, among which 17 proteins were selected for further validation. Gene ontology analysis using UniProt database revealed that these differentially expressed proteins are involved in diverse function such as synaptic transmission, neuronal neurotransmitter and ion transport, modification of organelle membrane components. Moreover, network analysis revealed that the interacting proteins were involved in the transport of synaptic vesicles, the integrity of synaptic membranes and the activation of the ionotropic glutamate receptors NMDAR1 and NMDAR2B. Finally, RT-PCR and Western blot analysis showed that SynGAP, syntaxin-1A, protein kinase C β, and voltage-dependent L-type calcium channels were inhibited by ischemia-reperfusion. Identification of these proteins provides valuable clues to elucidate the mechanisms underlying the actions of PSD-93 in ischemia-reperfusion induced neurotoxicity. Copyright © 2015 Elsevier Inc. All rights reserved.
A rudimentary database for three-dimensional objects using structural representation
NASA Technical Reports Server (NTRS)
Sowers, James P.
1987-01-01
A database which enables users to store and share the description of three-dimensional objects in a research environment is presented. The main objective of the design is to make it a compact structure that holds sufficient information to reconstruct the object. The database design is based on an object representation scheme which is information preserving, reasonably efficient, and yet economical in terms of the storage requirement. The determination of the needed data for the reconstruction process is guided by the belief that it is faster to do simple computations to generate needed data/information for construction than to retrieve everything from memory. Some recent techniques of three-dimensional representation that influenced the design of the database are discussed. The schema for the database and the structural definition used to define an object are given. The user manual for the software developed to create and maintain the contents of the database is included.
Berggren, K; Chernokalskaya, E; Steinberg, T H; Kemper, C; Lopez, M F; Diwu, Z; Haugland, R P; Patton, W F
2000-07-01
SYPRO Ruby dye is a permanent stain comprised of ruthenium as part of an organic complex that interacts noncovalently with proteins. SYPRO Ruby Protein Gel Stain provides a sensitive, gentle, fluorescence-based method for detecting proteins in one-dimensional and two-dimensional sodium dodecyl sulfate-polyacrylamide gels. Proteins are fixed, stained from 3h to overnight and then rinsed in deionized water or dilute methanol/acetic acid solution for 30 min. The stain can be visualized using a wide range of excitation sources commonly used in image analysis systems including a 302 nm UV-B transilluminator, 473 nm second harmonic generation (SHG) laser, 488 nm argon-ion laser, 532 nm yttrium-aluminum-garnet (YAG) laser, xenon arc lamp, blue fluorescent light bulb or blue light-emitting diode (LED). The sensitivity of SYPRO Ruby Protein Gel Stain is superior to colloidal Coomassie Brilliant Blue (CBB) stain or monobromobimane labeling and comparable with the highest sensitivity silver or zinc-imidazole staining procedures available. The linear dynamic range of SYPRO Ruby Protein Gel stain extends over three orders of magnitude, which is vastly superior to silver, zinc-imidazole, monobromobimane and CBB stain. The fluorescent stain does not contain superfluous chemicals (formaldehyde, glutaraldehyde, Tween-20) that frequently interfere with peptide identification in mass spectrometry. While peptide mass profiles are severely altered in protein samples prelabeled with monobromobimane, successful identification of proteins by peptide mass profiling using matrix-assisted laser desorption/ionization mass spectrometry was easily performed after protein detection with SYPRO Ruby Protein Gel stain.
Encounter complexes and dimensionality reduction in protein–protein association
Kozakov, Dima; Li, Keyong; Hall, David R; Beglov, Dmitri; Zheng, Jiefu; Vakili, Pirooz; Schueler-Furman, Ora; Paschalidis, Ioannis Ch; Clore, G Marius; Vajda, Sandor
2014-01-01
An outstanding challenge has been to understand the mechanism whereby proteins associate. We report here the results of exhaustively sampling the conformational space in protein–protein association using a physics-based energy function. The agreement between experimental intermolecular paramagnetic relaxation enhancement (PRE) data and the PRE profiles calculated from the docked structures shows that the method captures both specific and non-specific encounter complexes. To explore the energy landscape in the vicinity of the native structure, the nonlinear manifold describing the relative orientation of two solid bodies is projected onto a Euclidean space in which the shape of low energy regions is studied by principal component analysis. Results show that the energy surface is canyon-like, with a smooth funnel within a two dimensional subspace capturing over 75% of the total motion. Thus, proteins tend to associate along preferred pathways, similar to sliding of a protein along DNA in the process of protein-DNA recognition. DOI: http://dx.doi.org/10.7554/eLife.01370.001 PMID:24714491
Haebel, S.; Jensen, C.; Andersen, S. O.; Roepstorff, P.
1995-01-01
Simultaneous sequencing, using a combination of mass spectrometry and Edman degradation, of three approximately 15-kDa variants of a cuticular protein extracted from the meal beetle Tenebrio molitor larva is demonstrated. The information obtained by matrix-assisted laser desorption ionization mass spectrometry (MALDI MS) time-course monitoring of enzymatic digests was found essential to identify the differences among the three variants and for alignment of the peptides in the sequence. To determine whether each individual insect larva contains all three protein variants, proteins extracted from single animals were separated by two-dimensional gel electrophoresis, electroeluted from the gel spots, and analyzed by MALDI MS. Molecular weights of the proteins present in each sample could be obtained, and mass spectrometric mapping of the peptides after digestion with trypsin gave additional information. The protein isoforms were found to be allelic variants. PMID:7795523
Haebel, S; Jensen, C; Andersen, S O; Roepstorff, P
1995-03-01
Simultaneous sequencing, using a combination of mass spectrometry and Edman degradation, of three approximately 15-kDa variants of a cuticular protein extracted from the meal beetle Tenebrio molitor larva is demonstrated. The information obtained by matrix-assisted laser desorption ionization mass spectrometry (MALDI MS) time-course monitoring of enzymatic digests was found essential to identify the differences among the three variants and for alignment of the peptides in the sequence. To determine whether each individual insect larva contains all three protein variants, proteins extracted from single animals were separated by two-dimensional gel electrophoresis, electroeluted from the gel spots, and analyzed by MALDI MS. Molecular weights of the proteins present in each sample could be obtained, and mass spectrometric mapping of the peptides after digestion with trypsin gave additional information. The protein isoforms were found to be allelic variants.
Liska, Adam J; Shevchenko, Andrej; Pick, Uri; Katz, Adriana
2004-09-01
Salinity is a major limiting factor for the proliferation of plants and inhibits central metabolic activities such as photosynthesis. The halotolerant green alga Dunaliella can adapt to hypersaline environments and is considered a model photosynthetic organism for salinity tolerance. To clarify the molecular basis for salinity tolerance, a proteomic approach has been applied for identification of salt-induced proteins in Dunaliella. Seventy-six salt-induced proteins were selected from two-dimensional gel separations of different subcellular fractions and analyzed by mass spectrometry (MS). Application of nanoelectrospray mass spectrometry, combined with sequence-similarity database-searching algorithms, MS BLAST and MultiTag, enabled identification of 80% of the salt-induced proteins. Salinity stress up-regulated key enzymes in the Calvin cycle, starch mobilization, and redox energy production; regulatory factors in protein biosynthesis and degradation; and a homolog of a bacterial Na(+)-redox transporters. The results indicate that Dunaliella responds to high salinity by enhancement of photosynthetic CO(2) assimilation and by diversion of carbon and energy resources for synthesis of glycerol, the osmotic element in Dunaliella. The ability of Dunaliella to enhance photosynthetic activity at high salinity is remarkable because, in most plants and cyanobacteria, salt stress inhibits photosynthesis. The results demonstrated the power of MS BLAST searches for the identification of proteins in organisms whose genomes are not known and paved the way for dissecting molecular mechanisms of salinity tolerance in algae and higher plants.
Web3DMol: interactive protein structure visualization based on WebGL.
Shi, Maoxiang; Gao, Juntao; Zhang, Michael Q
2017-07-03
A growing number of web-based databases and tools for protein research are being developed. There is now a widespread need for visualization tools to present the three-dimensional (3D) structure of proteins in web browsers. Here, we introduce our 3D modeling program-Web3DMol-a web application focusing on protein structure visualization in modern web browsers. Users submit a PDB identification code or select a PDB archive from their local disk, and Web3DMol will display and allow interactive manipulation of the 3D structure. Featured functions, such as sequence plot, fragment segmentation, measure tool and meta-information display, are offered for users to gain a better understanding of protein structure. Easy-to-use APIs are available for developers to reuse and extend Web3DMol. Web3DMol can be freely accessed at http://web3dmol.duapp.com/, and the source code is distributed under the MIT license. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Specific identification of Bacillus anthracis strains
NASA Astrophysics Data System (ADS)
Krishnamurthy, Thaiya; Deshpande, Samir; Hewel, Johannes; Liu, Hongbin; Wick, Charles H.; Yates, John R., III
2007-01-01
Accurate identification of human pathogens is the initial vital step in treating the civilian terrorism victims and military personnel afflicted in biological threat situations. We have applied a powerful multi-dimensional protein identification technology (MudPIT) along with newly generated software termed Profiler to identify the sequences of specific proteins observed for few strains of Bacillus anthracis, a human pathogen. Software termed Profiler was created to initially screen the MudPIT data of B. anthracis strains and establish the observed proteins specific for its strains. A database was also generated using Profiler containing marker proteins of B. anthracis and its strains, which in turn could be used for detecting the organism and its corresponding strains in samples. Analysis of the unknowns by our methodology, combining MudPIT and Profiler, led to the accurate identification of the anthracis strains present in samples. Thus, a new approach for the identification of B. anthracis strains in unknown samples, based on the molecular mass and sequences of marker proteins, has been ascertained.
Crystallization of SHARPIN using an automated two-dimensional grid screen for optimization.
Stieglitz, Benjamin; Rittinger, Katrin; Haire, Lesley F
2012-07-01
An N-terminal fragment of human SHARPIN was recombinantly expressed in Escherichia coli, purified and crystallized. Crystals suitable for X-ray diffraction were obtained by a one-step optimization of seed dilution and protein concentration using a two-dimensional grid screen. The crystals belonged to the primitive tetragonal space group P4(3)2(1)2, with unit-cell parameters a = b = 61.55, c = 222.81 Å. Complete data sets were collected from native and selenomethionine-substituted protein crystals at 100 K to 2.6 and 2.0 Å resolution, respectively.
Two dimensional Blue Native-/SDS-PAGE analysis of SLP family adaptor protein complexes.
Swamy, Mahima; Kulathu, Yogesh; Ernst, Sandra; Reth, Michael; Schamel, Wolfgang W A
2006-04-15
SH2 domain containing leukocyte protein (SLP) adaptor proteins serve a central role in the antigen-mediated activation of lymphocytes by organizing multiprotein signaling complexes. Here, we use two dimensional native-/SDS-gel electrophoresis to study the number, size and relative abundance of protein complexes containing SLP family proteins. In non-stimulated T cells all SLP-76 proteins are in a approximately 400 kDa complex with the small adaptor protein Grb2-like adaptor protein downstream of Shc (Gads), whereas half of Gads is monomeric. This constitutive SLP-76/Gads complex could be reconstituted in Drosophila S2 cells expressing both components, suggesting that it might not contain additional subunits. In contrast, in B cells SLP-65 exists in a 180 kDa complex as well as in monomeric form. Since the complex was not found in S2 cells expressing only SLP-65, it was not di/trimeric SLP-65. Upon antigen-stimulation only the complexed SLP-65 was phosphorylated. Surprisingly, stimulation-induced alteration of SLP complexes could not be detected, suggesting that active signaling complexes form only transiently, and are of low abundance.
Human Mitochondrial Protein Database
National Institute of Standards and Technology Data Gateway
SRD 131 Human Mitochondrial Protein Database (Web, free access) The Human Mitochondrial Protein Database (HMPDb) provides comprehensive data on mitochondrial and human nuclear encoded proteins involved in mitochondrial biogenesis and function. This database consolidates information from SwissProt, LocusLink, Protein Data Bank (PDB), GenBank, Genome Database (GDB), Online Mendelian Inheritance in Man (OMIM), Human Mitochondrial Genome Database (mtDB), MITOMAP, Neuromuscular Disease Center and Human 2-D PAGE Databases. This database is intended as a tool not only to aid in studying the mitochondrion but in studying the associated diseases.
Kimura, Kosei; Wada, Akira; Ueta, Masami; Ogata, Akihiko; Tanaka, Satoru; Sakai, Akiko; Yoshida, Hideji; Fushitani, Hideo; Miyamoto, Akiko; Fukushima, Masakazu; Uchiumi, Toshio; Tanigawa, Nobuhiko
2010-11-01
Many auxiliary functions of ribosomal proteins (r-proteins) have received considerable attention in recent years. However, human r-proteins have hardly been examined by proteomic analysis. In this study, we isolated ribosomal particles and subsequently compared the proteome of r-proteins between the DLD-1 human colon cancer cell line and its 5-fluorouracil (5-FU)-resistant sub-line, DLD-1/5-FU, using the radical-free and highly reducing method of two-dimensional polyacrylamide gel electrophoresis, which has a superior ability to separate basic proteins, and we discuss the role of r-proteins in 5-FU resistance. Densitometric analysis was performed to quantify modulated proteins, and protein spots showing significant changes were identified by employing matrix-assisted laser desorption/ionization time-of-flight/time-of-flight mass spectrometry. Three basic proteins (L15, L37 and prohibitin) which were significantly modulated between DLD-1 and DLD-1/5-FU were identified. Two proteins, L15 and L37, showed down-regulated expression in DLD-1/5-FU in comparison to DLD-1. Prohibitin, which is not an r-protein and is known to be localized in the mitochondria, showed up-regulated expression in DLD-1/5-FU. These 3 proteins may be related to 5-FU resistance.
NASA Astrophysics Data System (ADS)
Schultz, Christian P.; Bârzu, Octavian; Mantsch, Henry H.
2000-03-01
The functional role of CMP kinases is to regenerate mono-phosphate nucleotides in cells by transferring phosphate residues from tri-phosphorylated nucleotides to monophosphorylated nucleotides. These enzymes possess two binding sites and maintain a highly conserved secondary structure. They are essential for cell survival. Herein we compare the infrared spectra of two similar, but not identical enzymes, the CMP kinases from Escherichia coli and Bacillus subtilis. A two-dimensional cross correlation analysis of the infrared spectra reveals differences in the denaturation behavior of the two proteins. Different secondary structure elements show different time-delayed or advanced unfolding events in the two enzymes. When bound to the active sites, the two nucleotide-substrates CMP and ATP exert a stabilizing effect on the structure of both proteins. The changes observed upon thermal denaturation are different for the two enzymes. Model 2D correlations are used to simulate the different denaturation of the two enzymes. Thermal denaturation and aggregation can be distinguished as two processes separated in time.
Ambroggio, Xavier I; Dommer, Jennifer; Gopalan, Vivek; Dunham, Eleca J; Taubenberger, Jeffery K; Hurt, Darrell E
2013-06-18
Influenza A viruses possess RNA genomes that mutate frequently in response to immune pressures. The mutations in the hemagglutinin genes are particularly significant, as the hemagglutinin proteins mediate attachment and fusion to host cells, thereby influencing viral pathogenicity and species specificity. Large-scale influenza A genome sequencing efforts have been ongoing to understand past epidemics and pandemics and anticipate future outbreaks. Sequencing efforts thus far have generated nearly 9,000 distinct hemagglutinin amino acid sequences. Comparative models for all publicly available influenza A hemagglutinin protein sequences (8,769 to date) were generated using the Rosetta modeling suite. The C-alpha root mean square deviations between a randomly chosen test set of models and their crystallographic templates were less than 2 Å, suggesting that the modeling protocols yielded high-quality results. The models were compiled into an online resource, the Hemagglutinin Structure Prediction (HASP) server. The HASP server was designed as a scientific tool for researchers to visualize hemagglutinin protein sequences of interest in a three-dimensional context. With a built-in molecular viewer, hemagglutinin models can be compared side-by-side and navigated by a corresponding sequence alignment. The models and alignments can be downloaded for offline use and further analysis. The modeling protocols used in the HASP server scale well for large amounts of sequences and will keep pace with expanded sequencing efforts. The conservative approach to modeling and the intuitive search and visualization interfaces allow researchers to quickly analyze hemagglutinin sequences of interest in the context of the most highly related experimental structures, and allow them to directly compare hemagglutinin sequences to each other simultaneously in their two- and three-dimensional contexts. The models and methodology have shown utility in current research efforts and the ongoing aim of the HASP server is to continue to accelerate influenza A research and have a positive impact on global public health.
NASA Astrophysics Data System (ADS)
Methogo, Ruth Menque; Dansette, Patrick M.; Klarskov, Klaus
2007-12-01
A combined approach based on two-dimensional electrophoresis-immuno-blotting and nanoliquid chromatography coupled on-line with electrospray ionization mass spectrometry (nLC-MS/MS) was used to identify proteins modified by a reactive intermediate of tienilic acid (TA). Liver homogenates from rats exposed to TA were fractionated using ultra centrifugation; four fractions were obtained and subjected to 2D electrophoresis. Following transfer to PVDF membranes, modified proteins were visualized after India ink staining, using an anti-serum raised against TA and ECL detection. Immuno-reactive spots were localized on the PVDF membrane by superposition of the ECL image, protein spots of interest were excised, digested on the membrane with trypsin followed by nLC-MS/MS analysis and protein identification. A total of 15 proteins were identified as likely targets modified by a TA reactive metabolite. These include selenium binding protein 2, senescence marker protein SMP-30, adenosine kinase, Acy1 protein, adenosylhomocysteinase, capping protein (actin filament), protein disulfide isomerase, fumarylacetoacetase, arginase chain A, ketohexokinase, proteasome endopeptidase complex, triosephosphate isomerase, superoxide dismutase, dna-type molecular chaperone hsc73 and malate dehydrogenase.
The Protein Information Resource: an integrated public resource of functional annotation of proteins
Wu, Cathy H.; Huang, Hongzhan; Arminski, Leslie; Castro-Alvear, Jorge; Chen, Yongxing; Hu, Zhang-Zhi; Ledley, Robert S.; Lewis, Kali C.; Mewes, Hans-Werner; Orcutt, Bruce C.; Suzek, Baris E.; Tsugita, Akira; Vinayaka, C. R.; Yeh, Lai-Su L.; Zhang, Jian; Barker, Winona C.
2002-01-01
The Protein Information Resource (PIR) serves as an integrated public resource of functional annotation of protein data to support genomic/proteomic research and scientific discovery. The PIR, in collaboration with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), produces the PIR-International Protein Sequence Database (PSD), the major annotated protein sequence database in the public domain, containing about 250 000 proteins. To improve protein annotation and the coverage of experimentally validated data, a bibliography submission system is developed for scientists to submit, categorize and retrieve literature information. Comprehensive protein information is available from iProClass, which includes family classification at the superfamily, domain and motif levels, structural and functional features of proteins, as well as cross-references to over 40 biological databases. To provide timely and comprehensive protein data with source attribution, we have introduced a non-redundant reference protein database, PIR-NREF. The database consists of about 800 000 proteins collected from PIR-PSD, SWISS-PROT, TrEMBL, GenPept, RefSeq and PDB, with composite protein names and literature data. To promote database interoperability, we provide XML data distribution and open database schema, and adopt common ontologies. The PIR web site (http://pir.georgetown.edu/) features data mining and sequence analysis tools for information retrieval and functional identification of proteins based on both sequence and annotation information. The PIR databases and other files are also available by FTP (ftp://nbrfa.georgetown.edu/pir_databases). PMID:11752247
A Brief Review of RNA–Protein Interaction Database Resources
Yi, Ying; Zhao, Yue; Huang, Yan; Wang, Dong
2017-01-01
RNA–Protein interactions play critical roles in various biological processes. By collecting and analyzing the RNA–Protein interactions and binding sites from experiments and predictions, RNA–Protein interaction databases have become an essential resource for the exploration of the transcriptional and post-transcriptional regulatory network. Here, we briefly review several widely used RNA–Protein interaction database resources developed in recent years to provide a guide of these databases. The content and major functions in databases are presented. The brief description of database helps users to quickly choose the database containing information they interested. In short, these RNA–Protein interaction database resources are continually updated, but the current state shows the efforts to identify and analyze the large amount of RNA–Protein interactions. PMID:29657278
Liu, Jiangang; Wang, Dapeng; Li, Yanyan; Yao, Hui; Zhang, Nan; Zhang, Xuewen; Zhong, Fangping; Huang, Yulun
2018-06-01
The human pituitary tumor-transforming gene is an oncogenic protein which serves as a central hub in the cellular signaling network of medulloblastoma. The protein contains two vicinal PxxP motifs at its C terminus that are potential binding sites of peptide-recognition SH3 domains. Here, a synthetic protocol that integrated in silico analysis and in vitro assay was described to identify the SH3-binding partners of pituitary tumor-transforming gene in the gene expression profile of medulloblastoma. In the procedure, a variety of structurally diverse, non-redundant SH3 domains with high gene expression in medulloblastoma were compiled, and their three-dimensional structures were either manually retrieved from the protein data bank database or computationally modeled through bioinformatics technique. The binding capability of these domains towards the two PxxP-containing peptides m1p: 161 LGPPSPVK 168 and m2p: 168 KMPSPPWE 175 of pituitary tumor-transforming gene were ranked by structure-based scoring and fluorescence-based assay. Consequently, a number of SH3 domains, including MAP3K and PI3K, were found to have moderate or high affinity for m1p and/or m2p. Interestingly, the two overlapping peptides exhibits a distinct binding profile to these identified domain partners, suggesting that the binding selectivity of m1p and m2p is optimized across the medulloblastoma expression spectrum by competing for domain candidates. In addition, two redesigned versions of m1p peptide ware obtained via a structure-based rational mutation approach, which exhibited an increased affinity for the domain as compared to native peptide.
Park, Gun Wook; Hwang, Heeyoun; Kim, Kwang Hoe; Lee, Ju Yeon; Lee, Hyun Kyoung; Park, Ji Yeong; Ji, Eun Sun; Park, Sung-Kyu Robin; Yates, John R; Kwon, Kyung-Hoon; Park, Young Mok; Lee, Hyoung-Joo; Paik, Young-Ki; Kim, Jin Young; Yoo, Jong Shin
2016-11-04
In the Chromosome-Centric Human Proteome Project (C-HPP), false-positive identification by peptide spectrum matches (PSMs) after database searches is a major issue for proteogenomic studies using liquid-chromatography and mass-spectrometry-based large proteomic profiling. Here we developed a simple strategy for protein identification, with a controlled false discovery rate (FDR) at the protein level, using an integrated proteomic pipeline (IPP) that consists of four engrailed steps as follows. First, using three different search engines, SEQUEST, MASCOT, and MS-GF+, individual proteomic searches were performed against the neXtProt database. Second, the search results from the PSMs were combined using statistical evaluation tools including DTASelect and Percolator. Third, the peptide search scores were converted into E-scores normalized using an in-house program. Last, ProteinInferencer was used to filter the proteins containing two or more peptides with a controlled FDR of 1.0% at the protein level. Finally, we compared the performance of the IPP to a conventional proteomic pipeline (CPP) for protein identification using a controlled FDR of <1% at the protein level. Using the IPP, a total of 5756 proteins (vs 4453 using the CPP) including 477 alternative splicing variants (vs 182 using the CPP) were identified from human hippocampal tissue. In addition, a total of 10 missing proteins (vs 7 using the CPP) were identified with two or more unique peptides, and their tryptic peptides were validated using MS/MS spectral pattern from a repository database or their corresponding synthetic peptides. This study shows that the IPP effectively improved the identification of proteins, including alternative splicing variants and missing proteins, in human hippocampal tissues for the C-HPP. All RAW files used in this study were deposited in ProteomeXchange (PXD000395).
Electrophoretic extraction of proteins from two-dimensional electrophoresis gel spots
Zhang, Jian-Shi; Giometti, Carol S.; Tollaksen, Sandra L.
1989-01-01
After two-dimensional electrophoresis of proteins or the like, resulting in a polyacrylamide gel slab having a pattern of protein gel spots thereon, an individual protein gel spot is cored out from the slab, to form a gel spot core which is placed in an extraction tube, with a dialysis membrane across the lower end of the tube. Replicate gel spots can be cored out from replicate gel slabs and placed in the extraction tube. Molten agarose gel is poured into the extraction tube where the agarose gel hardens to form an immobilizing gel, covering the gel spot cores. The upper end portion of the extraction tube is filled with a volume of buffer solution, and the upper end is closed by another dialysis membrane. Upper and lower bodies of a buffer solution are brought into contact with the upper and lower membranes and are provided with electrodes connected to the positive and negative terminals of a DC power supply, thereby producing an electrical current which flows through the upper membrane, the volume of buffer solution, the agarose, the gel spot cores and the lower membrane. The current causes the proteins to be extracted electrophoretically from the gel spot cores, so that the extracted proteins accumulate and are contained in the space between the agarose gel and the upper membrane. A high percentage extraction of proteins is achieved. The extracted proteins can be removed and subjected to partial digestion by trypsin or the like, followed by two-dimensional electrophoresis, resulting in a gel slab having a pattern of peptide gel spots which can be cored out and subjected to electrophoretic extraction to extract individual peptides.
Hashemi, Amenehsadat; Gharechahi, Javad; Nematzadeh, Ghorbanali; Shekari, Faezeh; Hosseini, Seyed Abdollah; Salekdeh, Ghasem Hosseini
2016-08-01
To understand the biology of a plant in response to stress, insight into protein-protein interactions, which almost define cell behavior, is thought to be crucial. Here, we provide a comparative complexomics analysis of leaf whole cell lysate of two rice genotypes with contrasting responses to salt using two-dimensional blue native/SDS-PAGE (2D-BN/SDS-PAGE). We aimed to identify changes in subunit composition and stoichiometry of protein complexes elicited by salt. Using mild detergent for protein complex solubilization, we were able to identify 9 protein assemblies as hetero-oligomeric and 30 as homo-oligomeric complexes. A total of 20 proteins were identified as monomers in the 2D-BN/SDS-PAGE gels. In addition to identifying known protein complexes that confirm the technical validity of our analysis, we were also able to discover novel protein-protein interactions. Interestingly, an interaction was detected for glycolytic enzymes enolase (ENO1) and triosephosphate isomerase (TPI) and also for a chlorophyll a-b binding protein and RuBisCo small subunit. To show changes in subunit composition and stoichiometry of protein assemblies during salt stress, the differential abundance of interacting proteins was compared between salt-treated and control plants. A detailed exploration of some of the protein complexes provided novel insight into the function, composition, stoichiometry and dynamics of known and previously uncharacterized protein complexes in response to salt stress. Copyright © 2016 Elsevier GmbH. All rights reserved.
Alaiya, A A; Franzén, B; Moberger, B; Silfverswärd, C; Linder, S; Auer, G
1999-01-01
The process of tumor progression leads to the emergence of multiple clones, and to the development of tumor heterogeneity. One approach to the study of the extent of such heterogeneity is to examine the expression of marker proteins in different tumor areas. Two-dimensional gel electrophoresis (2-DE) is a powerful tool for such studies, since the expression of a large number of polypeptide markers can be evaluated. In the present study, tumor cells were prepared from human ovarian tumors and analyzed by 2-DE and PDQUEST. As judged from the analysis of two different areas in each of nine ovarian tumors, the intratumoral variation in protein expression was low. In contrast, large differences were observed when the protein profiles of different tumors were compared. The differences in gene expression between pairs of malignant carcinomas were slightly larger than the differences observed between pairs of benign tumors. We conclude that 2-DE analysis of intratumoral heterogeneity in ovarian cancer tissue indicates a low degree of heterogeneity.
DeitY-TU face database: its design, multiple camera capturing, characteristics, and evaluation
NASA Astrophysics Data System (ADS)
Bhowmik, Mrinal Kanti; Saha, Kankan; Saha, Priya; Bhattacharjee, Debotosh
2014-10-01
The development of the latest face databases is providing researchers different and realistic problems that play an important role in the development of efficient algorithms for solving the difficulties during automatic recognition of human faces. This paper presents the creation of a new visual face database, named the Department of Electronics and Information Technology-Tripura University (DeitY-TU) face database. It contains face images of 524 persons belonging to different nontribes and Mongolian tribes of north-east India, with their anthropometric measurements for identification. Database images are captured within a room with controlled variations in illumination, expression, and pose along with variability in age, gender, accessories, make-up, and partial occlusion. Each image contains the combined primary challenges of face recognition, i.e., illumination, expression, and pose. This database also represents some new features: soft biometric traits such as mole, freckle, scar, etc., and facial anthropometric variations that may be helpful for researchers for biometric recognition. It also gives an equivalent study of the existing two-dimensional face image databases. The database has been tested using two baseline algorithms: linear discriminant analysis and principal component analysis, which may be used by other researchers as the control algorithm performance score.
FunShift: a database of function shift analysis on protein subfamilies
Abhiman, Saraswathi; Sonnhammer, Erik L. L.
2005-01-01
Members of a protein family normally have a general biochemical function in common, but frequently one or more subgroups have evolved a slightly different function, such as different substrate specificity. It is important to detect such function shifts for a more accurate functional annotation. The FunShift database described here is a compilation of function shift analysis performed between subfamilies in protein families. It consists of two main components: (i) subfamilies derived from protein domain families and (ii) pairwise subfamily comparisons analyzed for function shift. The present release, FunShift 12, was derived from Pfam 12 and consists of 151 934 subfamilies derived from 7300 families. We carried out function shift analysis by two complementary methods on families with up to 500 members. From a total of 179 210 subfamily pairs, 62 384 were predicted to be functionally shifted in 2881 families. Each subfamily pair is provided with a markup of probable functional specificity-determining sites. Tools for searching and exploring the data are provided to make this database a valuable resource for protein function annotation. Knowledge of these functionally important sites will be useful for experimental biologists performing functional mutation studies. FunShift is available at http://FunShift.cgb.ki.se. PMID:15608176
Neural/Bayes network predictor for inheritable cardiac disease pathogenicity and phenotype.
Burghardt, Thomas P; Ajtai, Katalin
2018-04-11
The cardiac muscle sarcomere contains multiple proteins contributing to contraction energy transduction and its regulation during a heartbeat. Inheritable heart disease mutants affect most of them but none more frequently than the ventricular myosin motor and cardiac myosin binding protein c (mybpc3). These co-localizing proteins have mybpc3 playing a regulatory role to the energy transducing motor. Residue substitution and functional domain assignment of each mutation in the protein sequence decides, under the direction of a sensible disease model, phenotype and pathogenicity. The unknown model mechanism is decided here using a method combing neural and Bayes networks. Missense single nucleotide polymorphisms (SNPs) are clues for the disease mechanism summarized in an extensive database collecting mutant sequence location and residue substitution as independent variables that imply the dependent disease phenotype and pathogenicity characteristics in 4 dimensional data points (4ddps). The SNP database contains entries with the majority having one or both dependent data entries unfulfilled. A neural network relating causes (mutant residue location and substitution) and effects (phenotype and pathogenicity) is trained, validated, and optimized using fulfilled 4ddps. It then predicts unfulfilled 4ddps providing the implicit disease model. A discrete Bayes network interprets fulfilled and predicted 4ddps with conditional probabilities for phenotype and pathogenicity given mutation location and residue substitution thus relating the neural network implicit model to explicit features of the motor and mybpc3 sequence and structural domains. Neural/Bayes network forecasting automates disease mechanism modeling by leveraging the world wide human missense SNP database that is in place and expanding. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Guan, Wenzhu; Ferry, Natalie; Edwards, Martin G; Bell, Howard A; Othman, Hamizah; Gatehouse, John A; Gatehouse, Angharad M R
2015-01-01
The grain aphid Sitobion avenae (F.) is a major pest of wheat, acting as a virus vector as well as causing direct plant damage. Commonly grown wheat varieties in the UK have only limited resistance to this pest. The present study was carried out to investigate the potential of a diploid wheat line (ACC20 PGR1755), reported as exhibiting resistance to S. avenae, to serve as a source of resistance genes. The diploid wheat line was confirmed as partially resistant, substantially reducing the fecundity, longevity and growth rate of the aphid. Proteomic analysis showed that approximately 200 protein spots were reproducibly detected in leaf extracts from both the resistant line and a comparable susceptible line (ACC5 PGR1735) using two-dimensional gel electrophoresis and image comparison software. Twenty-four spots were significantly up-regulated (>2-fold) in the resistant line after 24 h of aphid feeding (13 and 11 involved in local and systemic responses, respectively). Approximately 50 % of all differentially expressed protein spots were identified by a combination of database searching with MS and MS/MS data, revealing that the majority of proteins up-regulated by aphid infestation were involved in metabolic processes (including photosynthesis) and transcriptional regulation. However, in the resistant line only, several stress response proteins (including NBS-LRR-like proteins) and oxidative stress response proteins were identified as up-regulated in response to aphid feeding, as well as proteins involved in DNA synthesis/replication/repair. This study indicates that the resistant diploid line ACC20 PGR1755 may provide a valuable resource in breeding wheat for resistance to aphids.
Akashi, Kinya; Yoshida, Kazuo; Kuwano, Masayoshi; Kajikawa, Masataka; Yoshimura, Kazuya; Hoshiyasu, Saki; Inagaki, Naoyuki; Yokota, Akiho
2011-05-01
Wild watermelon (Citrullus lanatus) is a xerophyte native to the Kalahari Desert, Africa. To better understand the molecular mechanisms of drought resistance in this plant, we examined changes in the proteome in response to water deficit. Wild watermelon leaves showed decreased transpiration and a concomitant increase in leaf temperature under water deficit conditions. Comparison of the proteome of stressed plants with that of unstressed plants by two-dimensional gel electrophoresis revealed that the intensity of 40 spots increased in response to the stress, and the intensity of 11 spots decreased. We positively identified 23 stress-induced and 6 stress-repressed proteins by mass spectrometry and database analyses. Interestingly, 15 out of the 23 up-regulated proteins (65% of annotated up-regulated proteins) were heat shock proteins (HSPs). Especially, 10 out of the 15 up-regulated HSPs belonged to the small heat shock protein (sHSP) family. Other stress-induced proteins included those related to antioxidative defense and carbohydrate metabolism. Fifteen distinct cDNA sequences encoding the sHSP were characterized from wild watermelon. Quantitative real-time PCR analysis of the representative sHSP genes revealed strong transcriptional up-regulation in the leaves under water deficit. Moreover, immunoblot analysis confirmed that protein abundance of sHSPs was massively increased under water deficit. Overall, these observations suggest that the defense response of wild watermelon may involve orchestrated regulation of a diverse array of functional proteins related to cellular defense and metabolism, of which HSPs may play a pivotal role on the protection of the plant under water deficit in the presence of strong light.
Liu, Jianhua; Li, Wenyu; Ji, Yihong; Tian, Di; Tian, Lu; Yang, Xinchao; Xu, Lixin; Yan, Ruofeng; Li, Xiangrui; Song, Xiaokai
2017-01-01
Clinical chicken coccidiosis is mostly caused by simultaneous infection of several Eimeria species, and host immunity against Eimeria is species-specific. It is urgent to identify common immunodominant antigen of Eimeria for developing multivalent anticoccidial vaccines. In this study, sporozoite proteins of Eimeria tenella, Eimeria acervulina and Eimeria maxima were analyzed by two-dimensional electrophoresis (2DE). Western bot analysis was performed on the yielded 2DE gel using antisera of E. tenella E. acervulina and E. maxima respectively. Next, the detected immunodominant spots were identified by comparing the data from MALDI-TOF-MS/MS with available databases. Finally, Eimeria common antigens were identified by comparing amino acid sequence between the three Eimeria species. The results showed that analysis by 2DE of sporozoite proteins detected 629, 626 and 632 protein spots from E. tenella, E. acervulina and E. maxima respectively. Western bot analysis revealed 50 (E. tenella), 64 (E. acervulina) and 57 (E. maxima) immunodominant spots from the sporozoite 2DE gels of the three Eimeria species. The immunodominant spots were identified as 33, 27 and 25 immunodominant antigens of E. tenella, E. acervulina and E. maxima respectively. Fifty-four immunodominant proteins were identified as 18 ortholog proteins among the three Eimeria species. Finally, 5 of the 18 ortholog proteins were identified as common immunodominant antigens including elongation factor 2 (EF-2), 14-3-3 protein, ubiquitin-conjugating enzyme domain-containing protein (UCE) and glyceraldehyde-3-phosphate dehydrogenase (GAPDH). In conclusion, our results not only provide Eimeria sporozoite immunodominant antigen map and additional immunodominant antigens, but also common immunodominant antigens for developing multivalent anticoccidial vaccines. PMID:28432276
Liu, Lianrui; Huang, Xinmei; Liu, Jianhua; Li, Wenyu; Ji, Yihong; Tian, Di; Tian, Lu; Yang, Xinchao; Xu, Lixin; Yan, Ruofeng; Li, Xiangrui; Song, Xiaokai
2017-05-23
Clinical chicken coccidiosis is mostly caused by simultaneous infection of several Eimeria species, and host immunity against Eimeria is species-specific. It is urgent to identify common immunodominant antigen of Eimeria for developing multivalent anticoccidial vaccines. In this study, sporozoite proteins of Eimeria tenella, Eimeria acervulina and Eimeria maxima were analyzed by two-dimensional electrophoresis (2DE). Western bot analysis was performed on the yielded 2DE gel using antisera of E. tenella E. acervulina and E. maxima respectively. Next, the detected immunodominant spots were identified by comparing the data from MALDI-TOF-MS/MS with available databases. Finally, Eimeria common antigens were identified by comparing amino acid sequence between the three Eimeria species. The results showed that analysis by 2DE of sporozoite proteins detected 629, 626 and 632 protein spots from E. tenella, E. acervulina and E. maxima respectively. Western bot analysis revealed 50 (E. tenella), 64 (E. acervulina) and 57 (E. maxima) immunodominant spots from the sporozoite 2DE gels of the three Eimeria species. The immunodominant spots were identified as 33, 27 and 25 immunodominant antigens of E. tenella, E. acervulina and E. maxima respectively. Fifty-four immunodominant proteins were identified as 18 ortholog proteins among the three Eimeria species. Finally, 5 of the 18 ortholog proteins were identified as common immunodominant antigens including elongation factor 2 (EF-2), 14-3-3 protein, ubiquitin-conjugating enzyme domain-containing protein (UCE) and glyceraldehyde-3-phosphate dehydrogenase (GAPDH). In conclusion, our results not only provide Eimeria sporozoite immunodominant antigen map and additional immunodominant antigens, but also common immunodominant antigens for developing multivalent anticoccidial vaccines.
Single-particle tracking of quantum dot-conjugated prion proteins inside yeast cells
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tsuji, Toshikazu; Kawai-Noma, Shigeko; Pack, Chan-Gi
2011-02-25
Research highlights: {yields} We develop a method to track a quantum dot-conjugated protein in yeast cells. {yields} We incorporate the conjugated quantum dot proteins into yeast spheroplasts. {yields} We track the motions by conventional or 3D tracking microscopy. -- Abstract: Yeast is a model eukaryote with a variety of biological resources. Here we developed a method to track a quantum dot (QD)-conjugated protein in the budding yeast Saccharomyces cerevisiae. We chemically conjugated QDs with the yeast prion Sup35, incorporated them into yeast spheroplasts, and tracked the motions by conventional two-dimensional or three-dimensional tracking microscopy. The method paves the way towardmore » the individual tracking of proteins of interest inside living yeast cells.« less
Structural modeling of G-protein coupled receptors: An overview on automatic web-servers.
Busato, Mirko; Giorgetti, Alejandro
2016-08-01
Despite the significant efforts and discoveries during the last few years in G protein-coupled receptor (GPCR) expression and crystallization, the receptors with known structures to date are limited only to a small fraction of human GPCRs. The lack of experimental three-dimensional structures of the receptors represents a strong limitation that hampers a deep understanding of their function. Computational techniques are thus a valid alternative strategy to model three-dimensional structures. Indeed, recent advances in the field, together with extraordinary developments in crystallography, in particular due to its ability to capture GPCRs in different activation states, have led to encouraging results in the generation of accurate models. This, prompted the community of modelers to render their methods publicly available through dedicated databases and web-servers. Here, we present an extensive overview on these services, focusing on their advantages, drawbacks and their role in successful applications. Future challenges in the field of GPCR modeling, such as the predictions of long loop regions and the modeling of receptor activation states are presented as well. Copyright © 2016 Elsevier Ltd. All rights reserved.
Genetic analysis of the cytoplasmic dynein subunit families.
Pfister, K Kevin; Shah, Paresh R; Hummerich, Holger; Russ, Andreas; Cotton, James; Annuar, Azlina Ahmad; King, Stephen M; Fisher, Elizabeth M C
2006-01-01
Cytoplasmic dyneins, the principal microtubule minus-end-directed motor proteins of the cell, are involved in many essential cellular processes. The major form of this enzyme is a complex of at least six protein subunits, and in mammals all but one of the subunits are encoded by at least two genes. Here we review current knowledge concerning the subunits, their interactions, and their functional roles as derived from biochemical and genetic analyses. We also carried out extensive database searches to look for new genes and to clarify anomalies in the databases. Our analysis documents evolutionary relationships among the dynein subunits of mammals and other model organisms, and sheds new light on the role of this diverse group of proteins, highlighting the existence of two cytoplasmic dynein complexes with distinct cellular roles.
Genetic Analysis of the Cytoplasmic Dynein Subunit Families
Pfister, K. Kevin; Shah, Paresh R; Hummerich, Holger; Russ, Andreas; Cotton, James; Annuar, Azlina Ahmad; King, Stephen M; Fisher, Elizabeth M. C
2006-01-01
Cytoplasmic dyneins, the principal microtubule minus-end-directed motor proteins of the cell, are involved in many essential cellular processes. The major form of this enzyme is a complex of at least six protein subunits, and in mammals all but one of the subunits are encoded by at least two genes. Here we review current knowledge concerning the subunits, their interactions, and their functional roles as derived from biochemical and genetic analyses. We also carried out extensive database searches to look for new genes and to clarify anomalies in the databases. Our analysis documents evolutionary relationships among the dynein subunits of mammals and other model organisms, and sheds new light on the role of this diverse group of proteins, highlighting the existence of two cytoplasmic dynein complexes with distinct cellular roles. PMID:16440056
How Adequate are One- and Two-Dimensional Free Energy Landscapes for Protein Folding Dynamics?
NASA Astrophysics Data System (ADS)
Maisuradze, Gia G.; Liwo, Adam; Scheraga, Harold A.
2009-06-01
The molecular dynamics trajectories of protein folding or unfolding, generated with the coarse-grained united-residue force field for the B domain of staphylococcal protein A, were analyzed by principal component analysis (PCA). The folding or unfolding process was examined by using free-energy landscapes (FELs) in PC space. By introducing a novel multidimensional FEL, it was shown that the low-dimensional FELs are not always sufficient for the description of folding or unfolding processes. Similarities between the topographies of FELs along low- and high-indexed principal components were observed.
Horiuchi, Takayuki; Taoka, Masato; Isobe, Toshiaki; Komano, Teruya; Inouye, Sumiko
2002-07-26
Two genes, fruA and csgA, encoding a putative transcription factor and C-factor, respectively, are essential for fruiting body formation of Myxococcus xanthus. To investigate the role of fruA and csgA genes in developmental gene expression, developing cells as well as vegetative cells of M. xanthus wild-type, fruA::Tc, and csgA731 strains were pulse-labeled with [(35)S]methionine, and the whole cell proteins were analyzed using two-dimensional immobilized pH gradient/SDS-PAGE. Differences in protein synthesis patterns among more than 700 protein spots were detected during development of the three strains. Fourteen proteins showing distinctly different expression patterns in mutant cells were analyzed in more detail. Five of the 14 proteins were identified as elongation factor Tu (EF-Tu), Dru, DofA, FruA, and protein S by immunoblot analysis and mass spectroscopy. A gene encoding DofA was cloned and sequenced. Although both fruA and csgA genes regulate early development of M. xanthus, they were found to differently regulate expression of several developmental genes. The production of six proteins, including DofA and protein S, was dependent on fruA, whereas the production of two proteins was dependent on csgA, and one protein was dependent on both fruA and csgA. To explain the present findings, a new model was presented in which different levels of FruA phosphorylation may distinctively regulate the expression of two groups of developmental genes.
Poppe, Leszek; Jordan, John B; Rogers, Gary; Schnier, Paul D
2015-06-02
An important aspect in the analytical characterization of protein therapeutics is the comprehensive characterization of higher order structure (HOS). Nuclear magnetic resonance (NMR) is arguably the most sensitive method for fingerprinting HOS of a protein in solution. Traditionally, (1)H-(15)N or (1)H-(13)C correlation spectra are used as a "structural fingerprint" of HOS. Here, we demonstrate that protein fingerprint by line shape enhancement (PROFILE), a 1D (1)H NMR spectroscopy fingerprinting approach, is superior to traditional two-dimensional methods using monoclonal antibody samples and a heavily glycosylated protein therapeutic (Epoetin Alfa). PROFILE generates a high resolution structural fingerprint of a therapeutic protein in a fraction of the time required for a 2D NMR experiment. The cross-correlation analysis of PROFILE spectra allows one to distinguish contributions from HOS vs protein heterogeneity, which is difficult to accomplish by 2D NMR. We demonstrate that the major analytical limitation of two-dimensional methods is poor selectivity, which renders these approaches problematic for the purpose of fingerprinting large biological macromolecules.
Protein Information Resource: a community resource for expert annotation of protein data
Barker, Winona C.; Garavelli, John S.; Hou, Zhenglin; Huang, Hongzhan; Ledley, Robert S.; McGarvey, Peter B.; Mewes, Hans-Werner; Orcutt, Bruce C.; Pfeiffer, Friedhelm; Tsugita, Akira; Vinayaka, C. R.; Xiao, Chunlin; Yeh, Lai-Su L.; Wu, Cathy
2001-01-01
The Protein Information Resource, in collaboration with the Munich Information Center for Protein Sequences (MIPS) and the Japan International Protein Information Database (JIPID), produces the most comprehensive and expertly annotated protein sequence database in the public domain, the PIR-International Protein Sequence Database. To provide timely and high quality annotation and promote database interoperability, the PIR-International employs rule-based and classification-driven procedures based on controlled vocabulary and standard nomenclature and includes status tags to distinguish experimentally determined from predicted protein features. The database contains about 200 000 non-redundant protein sequences, which are classified into families and superfamilies and their domains and motifs identified. Entries are extensively cross-referenced to other sequence, classification, genome, structure and activity databases. The PIR web site features search engines that use sequence similarity and database annotation to facilitate the analysis and functional identification of proteins. The PIR-International databases and search tools are accessible on the PIR web site at http://pir.georgetown.edu/ and at the MIPS web site at http://www.mips.biochem.mpg.de. The PIR-International Protein Sequence Database and other files are also available by FTP. PMID:11125041
Lu, Joann J.; Wang, Shili; Li, Guanbin; Wang, Wei; Pu, Qiaosheng; Liu, Shaorong
2012-01-01
In this report, we introduce a chip-capillary hybrid device to integrate capillary isoelectric focusing (CIEF) with parallel capillary sodium dodecyl sulfate – polyacrylamide gel electrophoresis (SDS-PAGE) or capillary gel electrophoresis (CGE) toward automating two-dimensional (2D) protein separations. The hybrid device consists of three chips that are butted together. The middle chip can be moved between two positions to re-route the fluidic paths, which enables the performance of CIEF and injection of proteins partially resolved by CIEF to CGE capillaries for parallel CGE separations in a continuous and automated fashion. Capillaries are attached to the other two chips to facilitate CIEF and CGE separations and to extend the effective lengths of CGE columns. Specifically, we illustrate the working principle of the hybrid device, develop protocols for producing and preparing the hybrid device, and demonstrate the feasibility of using this hybrid device for automated injection of CIEF-separated sample to parallel CGE for 2D protein separations. Potentials and problems associated with the hybrid device are also discussed. PMID:22830584
Dickerson, Jane A.; Ramsay, Lauren M.; Dada, Oluwatosin O.; Cermak, Nathan
2011-01-01
Capillary isoelectric focusing and capillary zone electrophoresis are coupled with laser-induced fluorescence detection to create an ultrasensitive two-dimensional separation method for proteins. In this method, two capillaries are joined through a buffer filled interface. Separate power supplies control the potential at the injection end of the first capillary and at the interface; the detector is held at ground potential. Proteins are labeled with the fluorogenic reagent Chromeo P503, which preserves the isoelectric point of the labeled protein. The labeled proteins were mixed with ampholytes and injected into the first dimension capillary. A focusing step was performed with the injection end of the capillary at high pH and the interface at low pH. To mobilize components, the interface was filled with a high pH buffer, which was compatible with the second dimension separation. A fraction was transferred to the second dimension capillary for separation. The process of fraction transfer and second dimension separation was repeated two dozen times. The separation produced a spot capacity of 125. PMID:20603830
ERIC Educational Resources Information Center
Journal of Chemical Education, 1988
1988-01-01
Reviews two computer programs: "Molecular Graphics," which allows molecule manipulation in three-dimensional space (requiring IBM PC with 512K, EGA monitor, and math coprocessor); and "Periodic Law," a database which contains up to 20 items of information on each of the first 103 elements (Apple II or IBM PC). (MVL)
Johann Deisenhofer, Crystallography, and Proteins
research using X-ray crystallography to elucidate for the first time the three-dimensional structure of a large membrane-bound protein molecule. This structure helped explain the process of photosynthesis, by a protein structure determination that relied on complementary features of two different beam lines
NASA Astrophysics Data System (ADS)
Parviainen, Ville; Joenväärä, Sakari; Peltoniemi, Hannu; Mattila, Pirkko; Renkonen, Risto
2009-04-01
Mass spectrometry-based proteomic research has become one of the main methods in protein-protein interaction research. Several high throughput studies have established an interaction landscape of exponentially growing Baker's yeast culture. However, many of the protein-protein interactions are likely to change in different environmental conditions. In order to examine the dynamic nature of the protein interactions we isolated the protein complexes of mannose-1-phosphate guanyltransferase PSA1 from Saccharomyces cerevisiae at four different time points during batch cultivation. We used the tandem affinity purification (TAP)-method to purify the complexes and subjected the tryptic peptides to LC-MS/MS. The resulting peak lists were analyzed with two different methods: the database related protein identification program X!Tandem and the de novo sequencing program Lutefisk. We observed significant changes in the interactome of PSA1 during the batch cultivation and identified altogether 74 proteins interacting with PSA1 of which only six were found to interact during all time points. All the other proteins showed a more dynamic nature of binding activity. In this study we also demonstrate the benefit of using both database related and de novo methods in the protein interaction research to enhance both the quality and the quantity of observations.
BtoxDB: a comprehensive database of protein structural data on toxin-antitoxin systems.
Barbosa, Luiz Carlos Bertucci; Garrido, Saulo Santesso; Marchetto, Reinaldo
2015-03-01
Toxin-antitoxin (TA) systems are diverse and abundant genetic modules in prokaryotic cells that are typically formed by two genes encoding a stable toxin and a labile antitoxin. Because TA systems are able to repress growth or kill cells and are considered to be important actors in cell persistence (multidrug resistance without genetic change), these modules are considered potential targets for alternative drug design. In this scenario, structural information for the proteins in these systems is highly valuable. In this report, we describe the development of a web-based system, named BtoxDB, that stores all protein structural data on TA systems. The BtoxDB database was implemented as a MySQL relational database using PHP scripting language. Web interfaces were developed using HTML, CSS and JavaScript. The data were collected from the PDB, UniProt and Entrez databases. These data were appropriately filtered using specialized literature and our previous knowledge about toxin-antitoxin systems. The database provides three modules ("Search", "Browse" and "Statistics") that enable searches, acquisition of contents and access to statistical data. Direct links to matching external databases are also available. The compilation of all protein structural data on TA systems in one platform is highly useful for researchers interested in this content. BtoxDB is publicly available at http://www.gurupi.uft.edu.br/btoxdb. Copyright © 2015 Elsevier Ltd. All rights reserved.
A PDB-wide, evolution-based assessment of protein-protein interfaces.
Baskaran, Kumaran; Duarte, Jose M; Biyani, Nikhil; Bliven, Spencer; Capitani, Guido
2014-10-18
Thanks to the growth in sequence and structure databases, more than 50 million sequences are now available in UniProt and 100,000 structures in the PDB. Rich information about protein-protein interfaces can be obtained by a comprehensive study of protein contacts in the PDB, their sequence conservation and geometric features. An automated computational pipeline was developed to run our Evolutionary Protein-Protein Interface Classifier (EPPIC) software on the entire PDB and store the results in a relational database, currently containing > 800,000 interfaces. This allows the analysis of interface data on a PDB-wide scale. Two large benchmark datasets of biological interfaces and crystal contacts, each containing about 3000 entries, were automatically generated based on criteria thought to be strong indicators of interface type. The BioMany set of biological interfaces includes NMR dimers solved as crystal structures and interfaces that are preserved across diverse crystal forms, as catalogued by the Protein Common Interface Database (ProtCID) from Xu and Dunbrack. The second dataset, XtalMany, is derived from interfaces that would lead to infinite assemblies and are therefore crystal contacts. BioMany and XtalMany were used to benchmark the EPPIC approach. The performance of EPPIC was also compared to classifications from the Protein Interfaces, Surfaces, and Assemblies (PISA) program on a PDB-wide scale, finding that the two approaches give the same call in about 88% of PDB interfaces. By comparing our safest predictions to the PDB author annotations, we provide a lower-bound estimate of the error rate of biological unit annotations in the PDB. Additionally, we developed a PyMOL plugin for direct download and easy visualization of EPPIC interfaces for any PDB entry. Both the datasets and the PyMOL plugin are available at http://www.eppic-web.org/ewui/\\#downloads. Our computational pipeline allows us to analyze protein-protein contacts and their sequence conservation across the entire PDB. Two new benchmark datasets are provided, which are over an order of magnitude larger than existing manually curated ones. These tools enable the comprehensive study of several aspects of protein-protein contacts in the PDB and represent a basis for future, even larger scale studies of protein-protein interactions.
NASA Astrophysics Data System (ADS)
Wang, Kai; Wei, Ming; Zhang, Lijun; Du, Yong
2016-04-01
We realized a three-dimensional visualization of the morphology evolution and the growth behavior of the octahedral primary silicon in hypereutectic Al-20wtpctSi alloy during solidification in a real length scale by utilizing the phase-field simulation coupled with CALPHAD databases, and supported by key experiments. Moreover, through two-dimensional cut of the octahedral primary silicon at random angles, different morphologies observed in experiments, including triangle, square, trapezoid, rhombic, pentagon, and hexagon, were well reproduced.
Konc, Janez; Cesnik, Tomo; Konc, Joanna Trykowska; Penca, Matej; Janežič, Dušanka
2012-02-27
ProBiS-Database is a searchable repository of precalculated local structural alignments in proteins detected by the ProBiS algorithm in the Protein Data Bank. Identification of functionally important binding regions of the protein is facilitated by structural similarity scores mapped to the query protein structure. PDB structures that have been aligned with a query protein may be rapidly retrieved from the ProBiS-Database, which is thus able to generate hypotheses concerning the roles of uncharacterized proteins. Presented with uncharacterized protein structure, ProBiS-Database can discern relationships between such a query protein and other better known proteins in the PDB. Fast access and a user-friendly graphical interface promote easy exploration of this database of over 420 million local structural alignments. The ProBiS-Database is updated weekly and is freely available online at http://probis.cmm.ki.si/database.
Li, Yihan; Wojcik, Roza; Dovichi, Norman J.
2010-01-01
We describe a two-dimensional capillary electrophoresis system that incorporates a replaceable enzymatic microreactor for on-line protein digestion. In this system, trypsin is immobilized on magnetic beads. At the start of each experiment, old beads are flushed to waste and replaced with a fresh plug of beads, which is captured by a pair of magnets at the distal tip of the first capillary. For analysis, proteins are separated in the first capillary. A fraction is then parked in the reactor to create peptides. Digested peptides are periodically transferred to the second capillary for separation; a fresh protein fraction is simultaneously moved to the reactor for digestion. An electrospray interface is used to introduce peptides into a mass spectrometer for analysis. This procedure is repeated for several dozen fractions under computer control. The system was demonstrated by the separation and digestion of insulin chain b oxidized and β-casein as model proteins. PMID:21030030
NASA Astrophysics Data System (ADS)
Zhou, Qun; Sun, Su-Qin; Yu, Lu; Xu, Chang-Hua; Noda, Isao; Zhang, Xin-Rong
2006-11-01
Infrared (IR) spectroscopy and two-dimensional (2D) correlation IR spectroscopy are shown to offer some information about stability and shelf life of milk powders without separation and extraction of individual components in this paper. Temperature has been chosen as the perturbation to monitor the infrared behavior of various milk powders, namely, whole milk powder (WMP), sweet whole milk powder (Sweet WMP), low-fat milk powder (LFMP), and skim milk powder (SMP). The sequential order of changes in protein, fat and carbohydrates (mainly lactose) in milk powders is studied for the first time. The protein changes before the sucrose in WMP, whereas the sucrose changes before the protein in Sweet WMP under temperature perturbation. It is also found that in SMP, carbohydrate changes prior to protein whereas in LFMP and WMP protein changes first as the temperature is increased. The conclusion can provide some useful reference to understand the thermal stability of milk powders.
López-Pedrouso, María; Bernal, Javier; Franco, Daniel; Zapata, Carlos
2014-07-23
High-resolution two-dimensional electrophoresis (2-DE) profiles of the protein phaseolin, the major seed storage protein of common bean, display great number of spots with differentially glycosylated and phosphorylated α- and β-type polypeptides. This work aims to test whether these complex profiles can be useful markers of genetic differentiation and seed protein quality in bean populations. The 2-DE phaseolin profile and the amino acid composition were examined in bean seeds from 18 domesticated and wild accessions belonging to the Mesoamerican and Andean gene pools. We found that proteomic distances based on 2-DE profiles were successful in identifying the accessions belonging to each gene pool and outliers distantly related. In addition, accessions identified as outliers from proteomic distances showed the highest levels of methionine content, an essential amino acid deficient in bean seeds. These findings suggest that 2-DE phaseolin profiles provide valuable information with potential of being used in common bean genetic improvement.
NASA Astrophysics Data System (ADS)
Debusschere, Bert J.; Najm, Habib N.; Matta, Alain; Knio, Omar M.; Ghanem, Roger G.; Le Maître, Olivier P.
2003-08-01
This paper presents a model for two-dimensional electrochemical microchannel flow including the propagation of uncertainty from model parameters to the simulation results. For a detailed representation of electroosmotic and pressure-driven microchannel flow, the model considers the coupled momentum, species transport, and electrostatic field equations, including variable zeta potential. The chemistry model accounts for pH-dependent protein labeling reactions as well as detailed buffer electrochemistry in a mixed finite-rate/equilibrium formulation. Uncertainty from the model parameters and boundary conditions is propagated to the model predictions using a pseudo-spectral stochastic formulation with polynomial chaos (PC) representations for parameters and field quantities. Using a Galerkin approach, the governing equations are reformulated into equations for the coefficients in the PC expansion. The implementation of the physical model with the stochastic uncertainty propagation is applied to protein-labeling in a homogeneous buffer, as well as in two-dimensional electrochemical microchannel flow. The results for the two-dimensional channel show strong distortion of sample profiles due to ion movement and consequent buffer disturbances. The uncertainty in these results is dominated by the uncertainty in the applied voltage across the channel.
Two-dimensional electrophoretic analysis of nuclear matrix proteins in human colon adenocarcinoma.
Toumpanaki, A; Baltatzis, G E; Gaitanarou, E; Seretis, E; Toumpanakis, C; Aroni, K; Kittas, Christos; Voloudakis-Baltatzis, I E
2009-01-01
The aim of the present study was to observe possible qualitative and quantitative expression differences between nuclear matrix proteins (NMPs) of human colon adenocarcinoma and their mirror biopsies, using the technique of two-dimensional gel electrophoresis, in order to identify the existence of specific NMP fingerprints for colon cancer. Colon tissues were examined ultrastructurally and NMPs were isolated biochemically, by serial extraction of lipids, soluble proteins, DNA, RNA, and intermediate filaments and were separated according to their isoelectric point (pI) and their molecular weight (MW) by high-resolution two-dimensional electrophoresis (2D). By comparing the 2D electropherograms of colon cancer tissues and mirror biopsy tissues we observed qualitative and quantitative expression differences between their NMPs but also a differentiation of NMP composition between the stages of malignancy. Moreover, despite the similarities between mirror biopsy samples, a highlight percentage of exception was observed. Electrophoretic results provided in this study demonstrated that the examined NMPs could be further investigated as potential markers for detection of colorectal cancer in an early stage, for the assessment of the disease progression, as well as useful tools for individual therapy and for preventing a possible recurrence of cancer and metastasis.
FragFit: a web-application for interactive modeling of protein segments into cryo-EM density maps.
Tiemann, Johanna K S; Rose, Alexander S; Ismer, Jochen; Darvish, Mitra D; Hilal, Tarek; Spahn, Christian M T; Hildebrand, Peter W
2018-05-21
Cryo-electron microscopy (cryo-EM) is a standard method to determine the three-dimensional structures of molecular complexes. However, easy to use tools for modeling of protein segments into cryo-EM maps are sparse. Here, we present the FragFit web-application, a web server for interactive modeling of segments of up to 35 amino acids length into cryo-EM density maps. The fragments are provided by a regularly updated database containing at the moment about 1 billion entries extracted from PDB structures and can be readily integrated into a protein structure. Fragments are selected based on geometric criteria, sequence similarity and fit into a given cryo-EM density map. Web-based molecular visualization with the NGL Viewer allows interactive selection of fragments. The FragFit web-application, accessible at http://proteinformatics.de/FragFit, is free and open to all users, without any login requirements.
New generic indexing technology
NASA Technical Reports Server (NTRS)
Freeston, Michael
1996-01-01
There has been no fundamental change in the dynamic indexing methods supporting database systems since the invention of the B-tree twenty-five years ago. And yet the whole classical approach to dynamic database indexing has long since become inappropriate and increasingly inadequate. We are moving rapidly from the conventional one-dimensional world of fixed-structure text and numbers to a multi-dimensional world of variable structures, objects and images, in space and time. But, even before leaving the confines of conventional database indexing, the situation is highly unsatisfactory. In fact, our research has led us to question the basic assumptions of conventional database indexing. We have spent the past ten years studying the properties of multi-dimensional indexing methods, and in this paper we draw the strands of a number of developments together - some quite old, some very new, to show how we now have the basis for a new generic indexing technology for the next generation of database systems.
An emerging cyberinfrastructure for biodefense pathogen and pathogen-host data.
Zhang, C; Crasta, O; Cammer, S; Will, R; Kenyon, R; Sullivan, D; Yu, Q; Sun, W; Jha, R; Liu, D; Xue, T; Zhang, Y; Moore, M; McGarvey, P; Huang, H; Chen, Y; Zhang, J; Mazumder, R; Wu, C; Sobral, B
2008-01-01
The NIAID-funded Biodefense Proteomics Resource Center (RC) provides storage, dissemination, visualization and analysis capabilities for the experimental data deposited by seven Proteomics Research Centers (PRCs). The data and its publication is to support researchers working to discover candidates for the next generation of vaccines, therapeutics and diagnostics against NIAID's Category A, B and C priority pathogens. The data includes transcriptional profiles, protein profiles, protein structural data and host-pathogen protein interactions, in the context of the pathogen life cycle in vivo and in vitro. The database has stored and supported host or pathogen data derived from Bacillus, Brucella, Cryptosporidium, Salmonella, SARS, Toxoplasma, Vibrio and Yersinia, human tissue libraries, and mouse macrophages. These publicly available data cover diverse data types such as mass spectrometry, yeast two-hybrid (Y2H), gene expression profiles, X-ray and NMR determined protein structures and protein expression clones. The growing database covers over 23 000 unique genes/proteins from different experiments and organisms. All of the genes/proteins are annotated and integrated across experiments using UniProt Knowledgebase (UniProtKB) accession numbers. The web-interface for the database enables searching, querying and downloading at the level of experiment, group and individual gene(s)/protein(s) via UniProtKB accession numbers or protein function keywords. The system is accessible at http://www.proteomicsresource.org/.
Krishnan, Hari B; Natarajan, Savithiry S; Mahmoud, Ahmed A; Bennett, John O; Krishnan, Ammulu Hari; Prasad, Braj Nandan
2006-07-26
Soybeans contain approximately 40% protein and 20% oil and represents an important source of protein in animal rations and human diets. Attempts are being made to increase further the overall protein content of soybeans by utilization of exotic germplasms. In this study, soybean cultivars from Nepal have been characterized and their potential as a germplasm resource for improvement of the protein content and quality of North American cultivars assessed. Soybean cultivars 'Sathia', 'Seti', 'Kavre', and 'Soida Chiny', indigenous to various regions of Nepal, contained 42-45% protein, which is significantly higher in comparison to that of the North American cultivar 'Williams 82' (39%). Fractionation of seed protein by high-resolution two-dimensional gel electrophoresis revealed differences in the protein profiles of these cultivars. Various isoelectric forms of glycinin and beta-conglycinin were identified by comparing the matrix-assisted laser desorption ionization time-of-flight mass fingerprinting data against the National Center for Biotechnology Information nonredundant database. Nepalese cultivar Sathia was distinct, lacking some isoelectric forms of acidic and basic glycinin subunits while expressing other unique forms. The contribution of these unique protein spots present in either Sathia or Williams 82 to the total protein content was quantified using scanning laser densitometry. Distinct restriction fragment length polymorphisms (RFLP) for group 1 glycinin genes were observed among the tested Nepalese genotypes, indicating sequence variation among the cultivars. Conversely, evaluation of RFLP for the genes encoding group 2 glycinins, beta-conglycinin, and Bowman-Birk proteinase inhibitors indicated a high degree of conservation in these genes. Determination of amino acid composition, a reflection of protein quality, indicated that the arginine content of the Nepalese soybeans ranged from 7.7 to 8.1%, which was 5-10% higher than the 7.4% expressed in Williams 82. Additionally, Karve and Seti contained significantly more cysteine than Williams 82. Nepalese high-protein soybeans having a desirable amino acid composition hold potential to increase the protein quality and diversity of North American cultivars.
Rani, Lekha; Minz, Ranjana W; Arora, Amit; Kannan, Monica; Sharma, Aman; Anand, Shashi; Gupta, Dheeraj; Panda, Naresh K; Sakhuja, Vinay K
2014-11-01
The present study is a proteomic approach to find differentially expressed proteins in sera of limited and systemic subsets of active disease versus their remitting state in patients with granulomatosis with polyangiitis (GPA) and their correlation with disease activity. Eighteen patients with GPA in active as well as in remitting state and four healthy controls (HC) were included in the study. For proteomics analysis, two-dimensional gel electrophoresis along with matrix-assisted laser desorption ionization time-of-flight mass spectrometry were performed. A total of 14 gels were run from pooled patients' sera from active GPA and remission as well as pooled HC serum. There was significant differential expression of proteins in limited versus systemic GPA and between active systemic versus remitting patients of systemic disease. We identified nine maximally differentially expressed and five proteins which were not detected in HC. Among nine proteins, one (Prolow density lipoprotein receptor-related protein 1) was downregulated and four proteins (haptoglobin Hp, Hp2, vitamin D binding protein, killer cell lectin-like receptor subfamily F member 2), were up-regulated in both limited and systemic active disease, two proteins like Ig gamma-4 chain C region protein and serum albumin were up-regulated in limited active GPA and two proteins, that is, cysteine rich secretory protein LCCL domain-containing 2 precursor and serine-threonine-protein kinase A-Raf were up-regulated in systemic active disease. Levels of interleukin-17 and vitamin-D binding protein (VDBP) by enzyme-linked immunosorbent assay could distinctly demarcate active disease versus remission. Our study provides potential protein markers of active disease versus remission in GPA. © 2014 Asia Pacific League of Associations for Rheumatology and Wiley Publishing Asia Pty Ltd.
Wilson, Karl A; Tan-Wilson, Anna
2013-01-01
Mass spectrometry (MS) has become an important tool in studying biological systems. One application is the identification of proteins and peptides by the matching of peptide and peptide fragment masses to the sequences of proteins in protein sequence databases. Often prior protein separation of complex protein mixtures by 2D-PAGE is needed, requiring more time and expertise than instructors of large laboratory classes can devote. We have developed an experimental module for our Biochemistry Laboratory course that engages students in MS-based protein identification following protein separation by one-dimensional SDS-PAGE, a technique that is usually taught in this type of course. The module is based on soybean seed storage proteins, a relatively simple mixture of proteins present in high levels in the seed, allowing the identification of the main protein bands by MS/MS and in some cases, even by peptide mass fingerprinting. Students can identify their protein bands using software available on the Internet, and are challenged to deduce post-translational modifications that have occurred upon germination. A collection of mass spectral data and tutorials that can be used as a stand-alone computer-based laboratory module were also assembled. Copyright © 2013 International Union of Biochemistry and Molecular Biology, Inc.
Choosing an Optimal Database for Protein Identification from Tandem Mass Spectrometry Data.
Kumar, Dhirendra; Yadav, Amit Kumar; Dash, Debasis
2017-01-01
Database searching is the preferred method for protein identification from digital spectra of mass to charge ratios (m/z) detected for protein samples through mass spectrometers. The search database is one of the major influencing factors in discovering proteins present in the sample and thus in deriving biological conclusions. In most cases the choice of search database is arbitrary. Here we describe common search databases used in proteomic studies and their impact on final list of identified proteins. We also elaborate upon factors like composition and size of the search database that can influence the protein identification process. In conclusion, we suggest that choice of the database depends on the type of inferences to be derived from proteomics data. However, making additional efforts to build a compact and concise database for a targeted question should generally be rewarding in achieving confident protein identifications.
ZifBASE: a database of zinc finger proteins and associated resources.
Jayakanthan, Mannu; Muthukumaran, Jayaraman; Chandrasekar, Sanniyasi; Chawla, Konika; Punetha, Ankita; Sundar, Durai
2009-09-09
Information on the occurrence of zinc finger protein motifs in genomes is crucial to the developing field of molecular genome engineering. The knowledge of their target DNA-binding sequences is vital to develop chimeric proteins for targeted genome engineering and site-specific gene correction. There is a need to develop a computational resource of zinc finger proteins (ZFP) to identify the potential binding sites and its location, which reduce the time of in vivo task, and overcome the difficulties in selecting the specific type of zinc finger protein and the target site in the DNA sequence. ZifBASE provides an extensive collection of various natural and engineered ZFP. It uses standard names and a genetic and structural classification scheme to present data retrieved from UniProtKB, GenBank, Protein Data Bank, ModBase, Protein Model Portal and the literature. It also incorporates specialized features of ZFP including finger sequences and positions, number of fingers, physiochemical properties, classes, framework, PubMed citations with links to experimental structures (PDB, if available) and modeled structures of natural zinc finger proteins. ZifBASE provides information on zinc finger proteins (both natural and engineered ones), the number of finger units in each of the zinc finger proteins (with multiple fingers), the synergy between the adjacent fingers and their positions. Additionally, it gives the individual finger sequence and their target DNA site to which it binds for better and clear understanding on the interactions of adjacent fingers. The current version of ZifBASE contains 139 entries of which 89 are engineered ZFPs, containing 3-7F totaling to 296 fingers. There are 50 natural zinc finger protein entries ranging from 2-13F, totaling to 307 fingers. It has sequences and structures from literature, Protein Data Bank, ModBase and Protein Model Portal. The interface is cross linked to other public databases like UniprotKB, PDB, ModBase and Protein Model Portal and PubMed for making it more informative. A database is established to maintain the information of the sequence features, including the class, framework, number of fingers, residues, position, recognition site and physio-chemical properties (molecular weight, isoelectric point) of both natural and engineered zinc finger proteins and dissociation constant of few. ZifBASE can provide more effective and efficient way of accessing the zinc finger protein sequences and their target binding sites with the links to their three-dimensional structures. All the data and functions are available at the advanced web-based search interface http://web.iitd.ac.in/~sundar/zifbase.
The Role of High-Dimensional Diffusive Search, Stabilization, and Frustration in Protein Folding
Rimratchada, Supreecha; McLeish, Tom C.B.; Radford, Sheena E.; Paci, Emanuele
2014-01-01
Proteins are polymeric molecules with many degrees of conformational freedom whose internal energetic interactions are typically screened to small distances. Therefore, in the high-dimensional conformation space of a protein, the energy landscape is locally relatively flat, in contrast to low-dimensional representations, where, because of the induced entropic contribution to the full free energy, it appears funnel-like. Proteins explore the conformation space by searching these flat subspaces to find a narrow energetic alley that we call a hypergutter and then explore the next, lower-dimensional, subspace. Such a framework provides an effective representation of the energy landscape and folding kinetics that does justice to the essential characteristic of high-dimensionality of the search-space. It also illuminates the important role of nonnative interactions in defining folding pathways. This principle is here illustrated using a coarse-grained model of a family of three-helix bundle proteins whose conformations, once secondary structure has formed, can be defined by six rotational degrees of freedom. Two folding mechanisms are possible, one of which involves an intermediate. The stabilization of intermediate subspaces (or states in low-dimensional projection) in protein folding can either speed up or slow down the folding rate depending on the amount of native and nonnative contacts made in those subspaces. The folding rate increases due to reduced-dimension pathways arising from the mere presence of intermediate states, but decreases if the contacts in the intermediate are very stable and introduce sizeable topological or energetic frustration that needs to be overcome. Remarkably, the hypergutter framework, although depending on just a few physically meaningful parameters, can reproduce all the types of experimentally observed curvature in chevron plots for realizations of this fold. PMID:24739172
Combining 1D and 2D linear discriminant analysis for palmprint recognition
NASA Astrophysics Data System (ADS)
Zhang, Jian; Ji, Hongbing; Wang, Lei; Lin, Lin
2011-11-01
In this paper, a novel feature extraction method for palmprint recognition termed as Two-dimensional Combined Discriminant Analysis (2DCDA) is proposed. By connecting the adjacent rows of a image sequentially, the obtained new covariance matrices contain the useful information among local geometry structures in the image, which is eliminated by 2DLDA. In this way, 2DCDA combines LDA and 2DLDA for a promising recognition accuracy, but the number of coefficients of its projection matrix is lower than that of other two-dimensional methods. Experimental results on the CASIA palmprint database demonstrate the effectiveness of the proposed method.
Database constraints applied to metabolic pathway reconstruction tools.
Vilaplana, Jordi; Solsona, Francesc; Teixido, Ivan; Usié, Anabel; Karathia, Hiren; Alves, Rui; Mateo, Jordi
2014-01-01
Our group developed two biological applications, Biblio-MetReS and Homol-MetReS, accessing the same database of organisms with annotated genes. Biblio-MetReS is a data-mining application that facilitates the reconstruction of molecular networks based on automated text-mining analysis of published scientific literature. Homol-MetReS allows functional (re)annotation of proteomes, to properly identify both the individual proteins involved in the process(es) of interest and their function. It also enables the sets of proteins involved in the process(es) in different organisms to be compared directly. The efficiency of these biological applications is directly related to the design of the shared database. We classified and analyzed the different kinds of access to the database. Based on this study, we tried to adjust and tune the configurable parameters of the database server to reach the best performance of the communication data link to/from the database system. Different database technologies were analyzed. We started the study with a public relational SQL database, MySQL. Then, the same database was implemented by a MapReduce-based database named HBase. The results indicated that the standard configuration of MySQL gives an acceptable performance for low or medium size databases. Nevertheless, tuning database parameters can greatly improve the performance and lead to very competitive runtimes.
Identification and correction of abnormal, incomplete and mispredicted proteins in public databases.
Nagy, Alinda; Hegyi, Hédi; Farkas, Krisztina; Tordai, Hedvig; Kozma, Evelin; Bányai, László; Patthy, László
2008-08-27
Despite significant improvements in computational annotation of genomes, sequences of abnormal, incomplete or incorrectly predicted genes and proteins remain abundant in public databases. Since the majority of incomplete, abnormal or mispredicted entries are not annotated as such, these errors seriously affect the reliability of these databases. Here we describe the MisPred approach that may provide an efficient means for the quality control of databases. The current version of the MisPred approach uses five distinct routines for identifying abnormal, incomplete or mispredicted entries based on the principle that a sequence is likely to be incorrect if some of its features conflict with our current knowledge about protein-coding genes and proteins: (i) conflict between the predicted subcellular localization of proteins and the absence of the corresponding sequence signals; (ii) presence of extracellular and cytoplasmic domains and the absence of transmembrane segments; (iii) co-occurrence of extracellular and nuclear domains; (iv) violation of domain integrity; (v) chimeras encoded by two or more genes located on different chromosomes. Analyses of predicted EnsEMBL protein sequences of nine deuterostome (Homo sapiens, Mus musculus, Rattus norvegicus, Monodelphis domestica, Gallus gallus, Xenopus tropicalis, Fugu rubripes, Danio rerio and Ciona intestinalis) and two protostome species (Caenorhabditis elegans and Drosophila melanogaster) have revealed that the absence of expected signal peptides and violation of domain integrity account for the majority of mispredictions. Analyses of sequences predicted by NCBI's GNOMON annotation pipeline show that the rates of mispredictions are comparable to those of EnsEMBL. Interestingly, even the manually curated UniProtKB/Swiss-Prot dataset is contaminated with mispredicted or abnormal proteins, although to a much lesser extent than UniProtKB/TrEMBL or the EnsEMBL or GNOMON-predicted entries. MisPred works efficiently in identifying errors in predictions generated by the most reliable gene prediction tools such as the EnsEMBL and NCBI's GNOMON pipelines and also guides the correction of errors. We suggest that application of the MisPred approach will significantly improve the quality of gene predictions and the associated databases.
MIPS: analysis and annotation of proteins from whole genomes.
Mewes, H W; Amid, C; Arnold, R; Frishman, D; Güldener, U; Mannhaupt, G; Münsterkötter, M; Pagel, P; Strack, N; Stümpflen, V; Warfsmann, J; Ruepp, A
2004-01-01
The Munich Information Center for Protein Sequences (MIPS-GSF), Neuherberg, Germany, provides protein sequence-related information based on whole-genome analysis. The main focus of the work is directed toward the systematic organization of sequence-related attributes as gathered by a variety of algorithms, primary information from experimental data together with information compiled from the scientific literature. MIPS maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the database of complete cDNAs (German Human Genome Project, NGFN), the database of mammalian protein-protein interactions (MPPI), the database of FASTA homologies (SIMAP), and the interface for the fast retrieval of protein-associated information (QUIPOS). The Arabidopsis thaliana database, the rice database, the plant EST databases (MATDB, MOsDB, SPUTNIK), as well as the databases for the comprehensive set of genomes (PEDANT genomes) are described elsewhere in the 2003 and 2004 NAR database issues, respectively. All databases described, and the detailed descriptions of our projects can be accessed through the MIPS web server (http://mips.gsf.de).
ERIC Educational Resources Information Center
Battle, Gary M.; Allen, Frank H.; Ferrence, Gregory M.
2010-01-01
A series of online interactive teaching units have been developed that illustrate the use of experimentally measured three-dimensional (3D) structures to teach fundamental chemistry concepts. The units integrate a 500-structure subset of the Cambridge Structural Database specially chosen for their pedagogical value. The units span a number of key…
Jauchem, James R; Cerna, Cesario Z; Lim, Tiffany Y; Seaman, Ronald L
2014-12-01
In an earlier study, we found significant changes in red-blood-cell, leukocyte, and platelet counts, and in red-blood-cell membrane proteins, following exposures of anesthetized pigs to a conducted electrical weapon. In the current study, we examined potential changes in plasma proteins [analyzed via two-dimensional gel electrophoresis (2-DGE)] following two 30 s exposures of anesthetized pigs (Sus scrofa) to a TASER (®) C2 conducted electrical weapon. Patterns of proteins, separated by 2-DGE, were consistent and reproducible between animals and between times of sampling. We determined that the blood plasma collection, handling, storage, and processing techniques we used are suitable for swine blood. There were no statistically significant changes in plasma proteins following the conducted-electrical-weapon exposures. Overall gel patterns of fibrinogen were similar to results of other studies of both pigs and humans (in control settings, not exposed to conducted electrical weapons). The lack of significant changes in plasma proteins may be added to the body of evidence regarding relative safety of TASER C2 device exposures.
Niu, Hai Tao; Zhang, Yi Bing; Jiang, Hai Ping; Cheng, Bo; Sun, Guang; Wang, Yi; E, Ya Jun; Pang, De Quan; Chang, Ji Wu
2009-01-01
This study was undertaken to identify differences in protein expression profiles between superficial bladder transitional cell carcinoma (BTCC) and normal urothelial cells. We used laser capture microdissection (LCM) to harvest purified cells, and used two-dimensional liquid chromatography (2D-LC) followed by electrospray ionization-tandem mass spectrometry (ESI-MS/MS) to separate and identify the peptide mixture. A total of 440/438 proteins commonly appeared in 4 paired specimens. Multi-step bioinformatic procedures were used for the analysis of identified proteins; 175/179 of the 293/287 proteins that were specific expressed in tumor/normal cells own gene ontology (GO) biological process annotation. Compared with the entire list of the international protein index (IPI), there are 52/46 GO terms exhibited as enriched and 6/10 exhibited as depleted, respectively. Significantly altered pathways between tumor and normal cells mainly include oxidative phosphorylation, focal adhesion, etc. Finally, descriptive statistics show that the shotgun proteomics strategy has practice directive significance for biomarker discovery by two-dimensional electrophoresis (2-DE) technology.
Conformations of the HIV-1 protease: A crystal structure data set analysis.
Palese, Luigi Leonardo
2017-11-01
The HIV protease is an important drug target for HIV/AIDS therapy, and its structure and function have been extensively investigated. This enzyme performs an essential role in viral maturation by processing specific cleavage sites in the Gag and Gag-Pol precursor polyproteins so as to release their mature forms. This 99 amino acid aspartic protease works as a homodimer, with the active site localized in a central cavity capped by two flexible flap regions. The dimer presents closed or open conformations, which are involved in the substrate binding and release. Here the results of the analysis of a HIV-1 protease data set containing 552 dimer structures are reported. Different dimensionality reduction methods have been used in order to get information from this multidimensional database. Most of the structures in the data set belong to two conformational clusters. An interesting observation that comes from the analysis of these data is that some protease sequences are localized preferentially in specific areas of the conformational landscape of this protein. Copyright © 2017 Elsevier B.V. All rights reserved.
Two-Dimensional Grammars And Their Applications To Artificial Intelligence
NASA Astrophysics Data System (ADS)
Lee, Edward T.
1987-05-01
During the past several years, the concepts and techniques of two-dimensional grammars1,2 have attracted growing attention as promising avenues of approach to problems in picture generation as well as in picture description3 representation, recognition, transformation and manipulation. Two-dimensional grammar techniques serve the purpose of exploiting the structure or underlying relationships in a picture. This approach attempts to describe a complex picture in terms of their components and their relative positions. This resembles the way a sentence is described in terms of its words and phrases, and the terms structural picture recognition, linguistic picture recognition, or syntactic picture recognition are often used. By using this approach, the problem of picture recognition becomes similar to that of phrase recognition in a language. However, describing pictures using a string grammar (one-dimensional grammar), the only relation between sub-pictures and/or primitives is the concatenation; that is each picture or primitive can be connected only at the left or right. This one-dimensional relation has not been very effective in describing two-dimensional pictures. A natural generaliza-tion is to use two-dimensional grammars. In this paper, two-dimensional grammars and their applications to artificial intelligence are presented. Picture grammars and two-dimensional grammars are introduced and illustrated by examples. In particular, two-dimensional grammars for generating all possible squares and all possible rhombuses are presented. The applications of two-dimensional grammars to solving region filling problems are discussed. An algorithm for region filling using two-dimensional grammars is presented together with illustrative examples. The advantages of using this algorithm in terms of computation time are also stated. A high-level description of a two-level picture generation system is proposed. The first level is the picture primitive generation using two-dimensional grammars. The second level is picture generation using either string description or entity-relationship (ER) diagram description. Illustrative examples are also given. The advantages of ER diagram description together with its comparison to string description are also presented. The results obtained in this paper may have useful applications in artificial intelligence, robotics, expert systems, picture processing, pattern recognition, knowledge engineering and pictorial database design. Furthermore, examples related to satellite surveillance and identifications are also included.
NASA Technical Reports Server (NTRS)
Trouve, A.; Veynante, D.; Bray, K. N. C.; Mantel, T.
1994-01-01
Current flamelot models based on a description of the flame surface dynamics require the closure of two inter-related equations: a transport equation for the mean reaction progress variable, (tilde)c, and a transport equation for the flame surface density, Sigma. The coupling between these two equations is investigated using direct numerical simulations (DNS) with emphasis on the correlation between the turbulent fluxes of (tilde)c, bar(pu''c''), and Sigma, (u'')(sub S)Sigma. Two different DNS databases are used in the present work: a database developed at CTR by A. Trouve and a database developed by C. J. Rutland using a different code. Both databases correspond to statistically one-dimensional premixed flames in isotropic turbulent flow. The run parameters, however, are significantly different, and the two databases correspond to different combustion regimes. It is found that in all simulated flames, the correlation between bar(pu''c'') and (u'')(sub S)Sigma is always strong. The sign, however, of the turbulent flux of (tilde)c or Sigma with respect to the mean gradients, delta(tilde)c/delta(x) or delta(Sigma)/delta(x), is case-dependent. The CTR database is found to exhibit gradient turbulent transport of (tilde)c and Sigma, whereas the Rutland DNS features counter-gradient diffusion. The two databases are analyzed and compared using various tools (a local analysis of the flow field near the flame, a classical analysis of the conservation equation for (tilde)(u''c''), and a thin flame theoretical analysis). A mechanism is then proposed to explain the discrepancies between the two databases and a preliminary simple criterion is derived to predict the occurrence of gradient/counter-gradient turbulent diffusion.
Cheon, Gowoon; Duerloo, Karel-Alexander N; Sendek, Austin D; Porter, Chase; Chen, Yuan; Reed, Evan J
2017-03-08
Layered materials held together by weak interactions including van der Waals forces, such as graphite, have attracted interest for both technological applications and fundamental physics in their layered form and as an isolated single-layer. Only a few dozen single-layer van der Waals solids have been subject to considerable research focus, although there are likely to be many more that could have superior properties. To identify a broad spectrum of layered materials, we present a novel data mining algorithm that determines the dimensionality of weakly bonded subcomponents based on the atomic positions of bulk, three-dimensional crystal structures. By applying this algorithm to the Materials Project database of over 50,000 inorganic crystals, we identify 1173 two-dimensional layered materials and 487 materials that consist of weakly bonded one-dimensional molecular chains. This is an order of magnitude increase in the number of identified materials with most materials not known as two- or one-dimensional materials. Moreover, we discover 98 weakly bonded heterostructures of two-dimensional and one-dimensional subcomponents that are found within bulk materials, opening new possibilities for much-studied assembly of van der Waals heterostructures. Chemical families of materials, band gaps, and point groups for the materials identified in this work are presented. Point group and piezoelectricity in layered materials are also evaluated in single-layer forms. Three hundred and twenty-five of these materials are expected to have piezoelectric monolayers with a variety of forms of the piezoelectric tensor. This work significantly extends the scope of potential low-dimensional weakly bonded solids to be investigated.
Aslebagh, Roshanak; Channaveerappa, Devika; Arcaro, Kathleen F; Darie, Costel C
2018-05-13
Breast cancer (BC) remains a major cause of mortality, and early detection is considered important for reducing BC-associated deaths. Early detection of BC is challenging in young women, due to the limitations of mammography on the dense breast tissue of young women. We recently reported results of a pilot proteomics study, using one-dimensional polyacrylamide gel electrophoresis (1D-PAGE) and mass spectrometry (MS) to investigate differences in milk proteins from women with and without BC. Here, we applied two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) and MS to compare the protein pattern in milk from the breasts of a single woman who was diagnosed with BC in one breast 24 months after donating her milk. Statistically different gel spots were picked for protein digestion followed by nanoliquid chromatography tandem MS (nanoLC-MS/MS) analysis. The upregulated proteins in BC versus control are alpha-amylase, gelsolin isoform a precursor, alpha-2-glycoprotein 1 zinc isoform CRA_b partial, apoptosis-inducing factor 2 and vitronectin. Several proteins were downregulated in the milk of the breast later diagnosed with cancer as compared to the milk from the healthy breast, including different isoforms of albumin, cholesterol esterase, different isoforms of lactoferrin, different proteins from the casein family and different isoforms of lysozyme. Results warrant further studies to determine the usefulness of these milk proteins for assessing risk and detecting occult disease. MS data is available via ProteomeXchange with identifier PXD009860. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Accelerating simulation for the multiple-point statistics algorithm using vector quantization
NASA Astrophysics Data System (ADS)
Zuo, Chen; Pan, Zhibin; Liang, Hao
2018-03-01
Multiple-point statistics (MPS) is a prominent algorithm to simulate categorical variables based on a sequential simulation procedure. Assuming training images (TIs) as prior conceptual models, MPS extracts patterns from TIs using a template and records their occurrences in a database. However, complex patterns increase the size of the database and require considerable time to retrieve the desired elements. In order to speed up simulation and improve simulation quality over state-of-the-art MPS methods, we propose an accelerating simulation for MPS using vector quantization (VQ), called VQ-MPS. First, a variable representation is presented to make categorical variables applicable for vector quantization. Second, we adopt a tree-structured VQ to compress the database so that stationary simulations are realized. Finally, a transformed template and classified VQ are used to address nonstationarity. A two-dimensional (2D) stationary channelized reservoir image is used to validate the proposed VQ-MPS. In comparison with several existing MPS programs, our method exhibits significantly better performance in terms of computational time, pattern reproductions, and spatial uncertainty. Further demonstrations consist of a 2D four facies simulation, two 2D nonstationary channel simulations, and a three-dimensional (3D) rock simulation. The results reveal that our proposed method is also capable of solving multifacies, nonstationarity, and 3D simulations based on 2D TIs.
NASA Astrophysics Data System (ADS)
Wang, Lusheng; Yang, Yong; Lin, Guohui
Finding the closest object for a query in a database is a classical problem in computer science. For some modern biological applications, computing the similarity between two objects might be very time consuming. For example, it takes a long time to compute the edit distance between two whole chromosomes and the alignment cost of two 3D protein structures. In this paper, we study the nearest neighbor search problem in metric space, where the pair-wise distance between two objects in the database is known and we want to minimize the number of distances computed on-line between the query and objects in the database in order to find the closest object. We have designed two randomized approaches for indexing metric space databases, where objects are purely described by their distances with each other. Analysis and experiments show that our approaches only need to compute O(logn) objects in order to find the closest object, where n is the total number of objects in the database.
Shih, Wenting; Yamada, Soichiro
2011-12-22
Traditionally, cell migration has been studied on two-dimensional, stiff plastic surfaces. However, during important biological processes such as wound healing, tissue regeneration, and cancer metastasis, cells must navigate through complex, three-dimensional extracellular tissue. To better understand the mechanisms behind these biological processes, it is important to examine the roles of the proteins responsible for driving cell migration. Here, we outline a protocol to study the mechanisms of cell migration using the epithelial cell line (MDCK), and a three-dimensional, fibrous, self-polymerizing matrix as a model system. This optically clear extracellular matrix is easily amenable to live-cell imaging studies and better mimics the physiological, soft tissue environment. This report demonstrates a technique for directly visualizing protein localization and dynamics, and deformation of the surrounding three-dimensional matrix. Examination of protein localization and dynamics during cellular processes provides key insight into protein functions. Genetically encoded fluorescent tags provide a unique method for observing protein localization and dynamics. Using this technique, we can analyze the subcellular accumulation of key, force-generating cytoskeletal components in real-time as the cell maneuvers through the matrix. In addition, using multiple fluorescent tags with different wavelengths, we can examine the localization of multiple proteins simultaneously, thus allowing us to test, for example, whether different proteins have similar or divergent roles. Furthermore, the dynamics of fluorescently tagged proteins can be quantified using Fluorescent Recovery After Photobleaching (FRAP) analysis. This measurement assays the protein mobility and how stably bound the proteins are to the cytoskeletal network. By combining live-cell imaging with the treatment of protein function inhibitors, we can examine in real-time the changes in the distribution of proteins and morphology of migrating cells. Furthermore, we also combine live-cell imaging with the use of fluorescent tracer particles embedded within the matrix to visualize the matrix deformation during cell migration. Thus, we can visualize how a migrating cell distributes force-generating proteins, and where the traction forces are exerted to the surrounding matrix. Through these techniques, we can gain valuable insight into the roles of specific proteins and their contributions to the mechanisms of cell migration.
NASA Technical Reports Server (NTRS)
Luo, Ming (Inventor); Sha, Bingdong (Inventor)
2000-01-01
The matrix protein, M1, of influenza virus strain A/PR/8/34 has been purified from virions and crystallized. The crystals consist of a stable fragment (18 Kd) of the M1 protein. X-ray diffraction studies indicated that the crystals have a space group of P3.sub.t 21 or P3.sub.2 21. Vm calculations showed that there are two monomers in an asymmetric unit. A crystallized N-terminal domain of M1, wherein the N-terminal domain of M1 is crystallized such that the three dimensional structure of the crystallized N-terminal domain of M1 can be determined to a resolution of about 2.1 .ANG. or better, and wherein the three dimensional structure of the uncrystallized N-terminal domain of M1 cannot be determined to a resolution of about 2.1 .ANG. or better. A method of purifying M1 and a method of crystallizing M1. A method of using the three-dimensional crystal structure of M1 to screen for antiviral, influenza virus treating or preventing compounds. A method of using the three-dimensional crystal structure of M1 to screen for improved binding to or inhibition of influenza virus M1. The use of the three-dimensional crystal structure of the M1 protein of influenza virus in the manufacture of an inhibitor of influenza virus M1. The use of the three-dimensional crystal structure of the M1 protein of influenza virus in the screening of candidates for inhibition of influenza virus M1.
NASA Astrophysics Data System (ADS)
Sheynkman, Gloria M.; Shortreed, Michael R.; Cesnik, Anthony J.; Smith, Lloyd M.
2016-06-01
Mass spectrometry-based proteomics has emerged as the leading method for detection, quantification, and characterization of proteins. Nearly all proteomic workflows rely on proteomic databases to identify peptides and proteins, but these databases typically contain a generic set of proteins that lack variations unique to a given sample, precluding their detection. Fortunately, proteogenomics enables the detection of such proteomic variations and can be defined, broadly, as the use of nucleotide sequences to generate candidate protein sequences for mass spectrometry database searching. Proteogenomics is experiencing heightened significance due to two developments: (a) advances in DNA sequencing technologies that have made complete sequencing of human genomes and transcriptomes routine, and (b) the unveiling of the tremendous complexity of the human proteome as expressed at the levels of genes, cells, tissues, individuals, and populations. We review here the field of human proteogenomics, with an emphasis on its history, current implementations, the types of proteomic variations it reveals, and several important applications.
Sheynkman, Gloria M.; Shortreed, Michael R.; Cesnik, Anthony J.; Smith, Lloyd M.
2016-01-01
Mass spectrometry–based proteomics has emerged as the leading method for detection, quantification, and characterization of proteins. Nearly all proteomic workflows rely on proteomic databases to identify peptides and proteins, but these databases typically contain a generic set of proteins that lack variations unique to a given sample, precluding their detection. Fortunately, proteogenomics enables the detection of such proteomic variations and can be defined, broadly, as the use of nucleotide sequences to generate candidate protein sequences for mass spectrometry database searching. Proteogenomics is experiencing heightened significance due to two developments: (a) advances in DNA sequencing technologies that have made complete sequencing of human genomes and transcriptomes routine, and (b) the unveiling of the tremendous complexity of the human proteome as expressed at the levels of genes, cells, tissues, individuals, and populations. We review here the field of human proteogenomics, with an emphasis on its history, current implementations, the types of proteomic variations it reveals, and several important applications. PMID:27049631
Human Variome Project Quality Assessment Criteria for Variation Databases.
Vihinen, Mauno; Hancock, John M; Maglott, Donna R; Landrum, Melissa J; Schaafsma, Gerard C P; Taschner, Peter
2016-06-01
Numerous databases containing information about DNA, RNA, and protein variations are available. Gene-specific variant databases (locus-specific variation databases, LSDBs) are typically curated and maintained for single genes or groups of genes for a certain disease(s). These databases are widely considered as the most reliable information source for a particular gene/protein/disease, but it should also be made clear they may have widely varying contents, infrastructure, and quality. Quality is very important to evaluate because these databases may affect health decision-making, research, and clinical practice. The Human Variome Project (HVP) established a Working Group for Variant Database Quality Assessment. The basic principle was to develop a simple system that nevertheless provides a good overview of the quality of a database. The HVP quality evaluation criteria that resulted are divided into four main components: data quality, technical quality, accessibility, and timeliness. This report elaborates on the developed quality criteria and how implementation of the quality scheme can be achieved. Examples are provided for the current status of the quality items in two different databases, BTKbase, an LSDB, and ClinVar, a central archive of submissions about variants and their clinical significance. © 2016 WILEY PERIODICALS, INC.
Evaluating Assessment Using N-Dimensional Filtering.
ERIC Educational Resources Information Center
Dron, Jon; Boyne, Chris; Mitchell, Richard
This paper describes the use of the CoFIND (Collaborative Filter in N Dimensions) system to evaluate two assessment styles. CoFIND is a resource database that organizes itself around its users' needs. Learners enter resources, categorize, then rate them using "qualities," aspects of resources which learners find worthwhile, the n…
Wang, Edina; Chinni, Suresh; Bhore, Subhash Janardhan
2014-01-01
Background: The fatty-acid profile of the vegetable oils determines its properties and nutritional value. Palm-oil obtained from the African oil-palm [Elaeis guineensis Jacq. (Tenera)] contains 44% palmitic acid (C16:0), but, palm-oil obtained from the American oilpalm [Elaeis oleifera] contains only 25% C16:0. In part, the b-ketoacyl-[ACP] synthase II (KASII) [EC: 2.3.1.179] protein is responsible for the high level of C16:0 in palm-oil derived from the African oil-palm. To understand more about E. guineensis KASII (EgKASII) and E. oleifera KASII (EoKASII) proteins, it is essential to know its structures. Hence, this study was undertaken. Objective: The objective of this study was to predict three-dimensional (3D) structure of EgKASII and EoKASII proteins using molecular modelling tools. Materials and Methods: The amino-acid sequences for KASII proteins were retrieved from the protein database of National Center for Biotechnology Information (NCBI), USA. The 3D structures were predicted for both proteins using homology modelling and ab-initio technique approach of protein structure prediction. The molecular dynamics (MD) simulation was performed to refine the predicted structures. The predicted structure models were evaluated and root mean square deviation (RMSD) and root mean square fluctuation (RMSF) values were calculated. Results: The homology modelling showed that EgKASII and EoKASII proteins are 78% and 74% similar with Streptococcus pneumonia KASII and Brucella melitensis KASII, respectively. The EgKASII and EoKASII structures predicted by using ab-initio technique approach shows 6% and 9% deviation to its structures predicted by homology modelling, respectively. The structure refinement and validation confirmed that the predicted structures are accurate. Conclusion: The 3D structures for EgKASII and EoKASII proteins were predicted. However, further research is essential to understand the interaction of EgKASII and EoKASII proteins with its substrates. PMID:24748752
Wang, Edina; Chinni, Suresh; Bhore, Subhash Janardhan
2014-01-01
The fatty-acid profile of the vegetable oils determines its properties and nutritional value. Palm-oil obtained from the African oil-palm [Elaeis guineensis Jacq. (Tenera)] contains 44% palmitic acid (C16:0), but, palm-oil obtained from the American oilpalm [Elaeis oleifera] contains only 25% C16:0. In part, the b-ketoacyl-[ACP] synthase II (KASII) [EC: 2.3.1.179] protein is responsible for the high level of C16:0 in palm-oil derived from the African oil-palm. To understand more about E. guineensis KASII (EgKASII) and E. oleifera KASII (EoKASII) proteins, it is essential to know its structures. Hence, this study was undertaken. The objective of this study was to predict three-dimensional (3D) structure of EgKASII and EoKASII proteins using molecular modelling tools. The amino-acid sequences for KASII proteins were retrieved from the protein database of National Center for Biotechnology Information (NCBI), USA. The 3D structures were predicted for both proteins using homology modelling and ab-initio technique approach of protein structure prediction. The molecular dynamics (MD) simulation was performed to refine the predicted structures. The predicted structure models were evaluated and root mean square deviation (RMSD) and root mean square fluctuation (RMSF) values were calculated. The homology modelling showed that EgKASII and EoKASII proteins are 78% and 74% similar with Streptococcus pneumonia KASII and Brucella melitensis KASII, respectively. The EgKASII and EoKASII structures predicted by using ab-initio technique approach shows 6% and 9% deviation to its structures predicted by homology modelling, respectively. The structure refinement and validation confirmed that the predicted structures are accurate. The 3D structures for EgKASII and EoKASII proteins were predicted. However, further research is essential to understand the interaction of EgKASII and EoKASII proteins with its substrates.
Saitô, Hazime
2004-11-01
We have compared site-directed 13C solid-state NMR spectra of [3-13C]Ala- and/or [1-13C]Val-labeled membrane proteins, including bacteriorhodopsin (bR), pharaonis phoborhodopin (ppR), its cognate transducer (pHtrII) and Escherichia coli diacylglycerol kinase (DGK), in two-dimensional (2D) crystal, lipid bilayers, and detergent. Restricted fluctuation motions of these membrane proteins due to oligomerization of bR by specific protein-protein interactions in the 2D crystalline lattice or protein complex between ppR and pHtrII provide the most favorable environment to yield well-resolved, fully visible 13C NMR signals for [3-13C]Ala-labeled proteins. In contrast, several signals from such membrane proteins were broadened or lost owing to interference of inherent fluctuation frequencies (10(4)-10(5)Hz) with frequency of either proton decoupling or magic angle spinning, if their 13C NMR spectra were recorded as a monomer in lipid bilayers at ambient temperature. The presence of such protein dynamics is essential for the respective proteins to achieve their own biological functions. Finally, spectral broadening found for bR and DGK in detergents were discussed.
Zeng, L. W.; Singh, R. S.
1993-01-01
We have attempted to estimate the number of genes involved in postzygotic reproductive isolation between two closely related species, Drosophila simulans and Drosophila sechellia, by a novel approach that involves the use of high resolution two-dimensional gel electrophoresis (2DE) to examine testis proteins in parents, hybrids and fertile and sterile backcross progenies. The important results that have emerged from this study are as follows: (1) about 8% of about 1000 proteins examined showed divergence (presence/absence) between the two species; (2) by tracing individual proteins in parental, hybrid and backcross males, we were able to associate the divergent proteins with different chromosomes and found that most divergent proteins are associated with autosomes and very few with X chromosome, Y chromosome and cytoplasm; (3) when proteins showing both quantitative and qualitative differences between the two species were examined in F(1) hybrid males, most (97.4%) proteins were expressed at levels between the two parents and no sign of large scale changes in spot density was observed. All the proteins observed in the two parental species were present in F(1) hybrid males except two species-specific proteins that may be encoded (or regulated) by sex chromosomes; (4) when different fertile and sterile backcross male testes were compared, a few D. sechellia-specific proteins were identified to be consistently associated with male sterility. These results along with the observation that a large proportion (23.6%) of first generation backcross males were fertile show that hybrid male sterility between D. simulans and D. sechellia involves a relatively small number of genes. Role of large scale genetic changes due to general genome incompatibility is not supported. The results also suggest that the large effect of X chromosome on hybrid male sterility is not due to higher divergence of X chromosome than autosomes. PMID:8224814
Zeng, L W; Singh, R S
1993-09-01
We have attempted to estimate the number of genes involved in postzygotic reproductive isolation between two closely related species, Drosophila simulans and Drosophila sechellia, by a novel approach that involves the use of high resolution two-dimensional gel electrophoresis (2DE) to examine testis proteins in parents, hybrids and fertile and sterile backcross progenies. The important results that have emerged from this study are as follows: (1) about 8% of about 1000 proteins examined showed divergence (presence/absence) between the two species; (2) by tracing individual proteins in parental, hybrid and backcross males, we were able to associate the divergent proteins with different chromosomes and found that most divergent proteins are associated with autosomes and very few with X chromosome, Y chromosome and cytoplasm; (3) when proteins showing both quantitative and qualitative differences between the two species were examined in F1 hybrid males, most (97.4%) proteins were expressed at levels between the two parents and no sign of large scale changes in spot density was observed. All the proteins observed in the two parental species were present in F1 hybrid males except two species-specific proteins that may be encoded (or regulated) by sex chromosomes; (4) when different fertile and sterile backcross male testes were compared, a few D. sechellia-specific proteins were identified to be consistently associated with male sterility. These results along with the observation that a large proportion (23.6%) of first generation backcross males were fertile show that hybrid male sterility between D. simulans and D. sechellia involves a relatively small number of genes. Role of large scale genetic changes due to general genome incompatibility is not supported. The results also suggest that the large effect of X chromosome on hybrid male sterility is not due to higher divergence of X chromosome than autosomes.
IMGT, the international ImMunoGeneTics information system®
Lefranc, Marie-Paule; Giudicelli, Véronique; Kaas, Quentin; Duprat, Elodie; Jabado-Michaloud, Joumana; Scaviner, Dominique; Ginestoux, Chantal; Clément, Oliver; Chaume, Denys; Lefranc, Gérard
2005-01-01
The international ImMunoGeneTics information system® (IMGT) (http://imgt.cines.fr), created in 1989, by the Laboratoire d'ImmunoGénétique Moléculaire LIGM (Université Montpellier II and CNRS) at Montpellier, France, is a high-quality integrated knowledge resource specializing in the immunoglobulins (IGs), T cell receptors (TRs), major histocompatibility complex (MHC) of human and other vertebrates, and related proteins of the immune systems (RPI) that belong to the immunoglobulin superfamily (IgSF) and to the MHC superfamily (MhcSF). IMGT includes several sequence databases (IMGT/LIGM-DB, IMGT/PRIMER-DB, IMGT/PROTEIN-DB and IMGT/MHC-DB), one genome database (IMGT/GENE-DB) and one three-dimensional (3D) structure database (IMGT/3Dstructure-DB), Web resources comprising 8000 HTML pages (IMGT Marie-Paule page), and interactive tools. IMGT data are expertly annotated according to the rules of the IMGT Scientific chart, based on the IMGT-ONTOLOGY concepts. IMGT tools are particularly useful for the analysis of the IG and TR repertoires in normal physiological and pathological situations. IMGT is used in medical research (autoimmune diseases, infectious diseases, AIDS, leukemias, lymphomas, myelomas), veterinary research, biotechnology related to antibody engineering (phage displays, combinatorial libraries, chimeric, humanized and human antibodies), diagnostics (clonalities, detection and follow up of residual diseases) and therapeutical approaches (graft, immunotherapy and vaccinology). IMGT is freely available at http://imgt.cines.fr. PMID:15608269
Li, Yushuang; Yang, Jiasheng; Zhang, Yi
2016-01-01
In this paper, we have proposed a novel alignment-free method for comparing the similarity of protein sequences. We first encode a protein sequence into a 440 dimensional feature vector consisting of a 400 dimensional Pseudo-Markov transition probability vector among the 20 amino acids, a 20 dimensional content ratio vector, and a 20 dimensional position ratio vector of the amino acids in the sequence. By evaluating the Euclidean distances among the representing vectors, we compare the similarity of protein sequences. We then apply this method into the ND5 dataset consisting of the ND5 protein sequences of 9 species, and the F10 and G11 datasets representing two of the xylanases containing glycoside hydrolase families, i.e., families 10 and 11. As a result, our method achieves a correlation coefficient of 0.962 with the canonical protein sequence aligner ClustalW in the ND5 dataset, much higher than those of other 5 popular alignment-free methods. In addition, we successfully separate the xylanases sequences in the F10 family and the G11 family and illustrate that the F10 family is more heat stable than the G11 family, consistent with a few previous studies. Moreover, we prove mathematically an identity equation involving the Pseudo-Markov transition probability vector and the amino acids content ratio vector. PMID:27918587
Pro-angiogenic capacities of microvesicles produced by skin wound myofibroblasts.
Merjaneh, Mays; Langlois, Amélie; Larochelle, Sébastien; Cloutier, Chanel Beaudoin; Ricard-Blum, Sylvie; Moulin, Véronique J
2017-08-01
Wound healing is a very highly organized process where numerous cell types are tightly regulated to restore injured tissue. Myofibroblasts are cells that produce new extracellular matrix and contract wound edges. We previously reported that the human myofibroblasts isolated from normal wound (WMyos) produced microvesicles (MVs) in the presence of the serum. In this study, MVs were further characterized using a proteomic strategy and potential functions of the MVs were determined. MV proteins isolated from six WMyo populations were separated using two-dimensional differential gel electrophoresis. Highly conserved spots were selected and analyzed using mass spectrometry resulting in the identification of 381 different human proteins. Using the DAVID database, clusters of proteins involved in cell motion, apoptosis and adhesion, but also in extracellular matrix production (21 proteins, enrichment score: 3.32) and in blood vessel development/angiogenesis (19 proteins, enrichment score: 2.66) were identified. Another analysis using the functional enrichment analysis tool FunRich was consistent with these results. While the action of the myofibroblasts on extracellular matrix formation is well known, their angiogenic potential is less studied. To further characterize the angiogenic activity of the MVs, they were added to cultured microvascular endothelial cells to evaluate their influence on cell growth and migration using scratch test and capillary-like structure formation in Matrigel ® . The addition of a MV-enriched preparation significantly increased endothelial cell growth, migration and capillary formation compared with controls. The release of microvesicles by the wound myofibroblasts brings new perspectives to the field of communication between cells during the normal healing process.
Jagtap, Pratik; Goslinga, Jill; Kooren, Joel A; McGowan, Thomas; Wroblewski, Matthew S; Seymour, Sean L; Griffin, Timothy J
2013-04-01
Large databases (>10(6) sequences) used in metaproteomic and proteogenomic studies present challenges in matching peptide sequences to MS/MS data using database-search programs. Most notably, strict filtering to avoid false-positive matches leads to more false negatives, thus constraining the number of peptide matches. To address this challenge, we developed a two-step method wherein matches derived from a primary search against a large database were used to create a smaller subset database. The second search was performed against a target-decoy version of this subset database merged with a host database. High confidence peptide sequence matches were then used to infer protein identities. Applying our two-step method for both metaproteomic and proteogenomic analysis resulted in twice the number of high confidence peptide sequence matches in each case, as compared to the conventional one-step method. The two-step method captured almost all of the same peptides matched by the one-step method, with a majority of the additional matches being false negatives from the one-step method. Furthermore, the two-step method improved results regardless of the database search program used. Our results show that our two-step method maximizes the peptide matching sensitivity for applications requiring large databases, especially valuable for proteogenomics and metaproteomics studies. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
MIPS: a database for genomes and protein sequences.
Mewes, H W; Heumann, K; Kaps, A; Mayer, K; Pfeiffer, F; Stocker, S; Frishman, D
1999-01-01
The Munich Information Center for Protein Sequences (MIPS-GSF), Martinsried near Munich, Germany, develops and maintains genome oriented databases. It is commonplace that the amount of sequence data available increases rapidly, but not the capacity of qualified manual annotation at the sequence databases. Therefore, our strategy aims to cope with the data stream by the comprehensive application of analysis tools to sequences of complete genomes, the systematic classification of protein sequences and the active support of sequence analysis and functional genomics projects. This report describes the systematic and up-to-date analysis of genomes (PEDANT), a comprehensive database of the yeast genome (MYGD), a database reflecting the progress in sequencing the Arabidopsis thaliana genome (MATD), the database of assembled, annotated human EST clusters (MEST), and the collection of protein sequence data within the framework of the PIR-International Protein Sequence Database (described elsewhere in this volume). MIPS provides access through its WWW server (http://www.mips.biochem.mpg.de) to a spectrum of generic databases, including the above mentioned as well as a database of protein families (PROTFAM), the MITOP database, and the all-against-all FASTA database. PMID:9847138
Gopal, Sandeep; Pocock, Roger
2018-04-19
The Caenorhabditis elegans (C. elegans) germline is used to study several biologically important processes including stem cell development, apoptosis, and chromosome dynamics. While the germline is an excellent model, the analysis is often two dimensional due to the time and labor required for three-dimensional analysis. Major readouts in such studies are the number/position of nuclei and protein distribution within the germline. Here, we present a method to perform automated analysis of the germline using confocal microscopy and computational approaches to determine the number and position of nuclei in each region of the germline. Our method also analyzes germline protein distribution that enables the three-dimensional examination of protein expression in different genetic backgrounds. Further, our study shows variations in cytoskeletal architecture in distinct regions of the germline that may accommodate specific spatial developmental requirements. Finally, our method enables automated counting of the sperm in the spermatheca of each germline. Taken together, our method enables rapid and reproducible phenotypic analysis of the C. elegans germline.
Modeling the formation of cell-matrix adhesions on a single 3D matrix fiber.
Escribano, J; Sánchez, M T; García-Aznar, J M
2015-11-07
Cell-matrix adhesions are crucial in different biological processes like tissue morphogenesis, cell motility, and extracellular matrix remodeling. These interactions that link cell cytoskeleton and matrix fibers are built through protein clutches, generally known as adhesion complexes. The adhesion formation process has been deeply studied in two-dimensional (2D) cases; however, the knowledge is limited for three-dimensional (3D) cases. In this work, we simulate different local extracellular matrix properties in order to unravel the fundamental mechanisms that regulate the formation of cell-matrix adhesions in 3D. We aim to study the mechanical interaction of these biological structures through a three dimensional discrete approach, reproducing the transmission pattern force between the cytoskeleton and a single extracellular matrix fiber. This numerical model provides a discrete analysis of the proteins involved including spatial distribution, interaction between them, and study of the different phenomena, such as protein clutches unbinding or protein unfolding. Copyright © 2015 Elsevier Ltd. All rights reserved.
The Halophile protein database.
Sharma, Naveen; Farooqi, Mohammad Samir; Chaturvedi, Krishna Kumar; Lal, Shashi Bhushan; Grover, Monendra; Rai, Anil; Pandey, Pankaj
2014-01-01
Halophilic archaea/bacteria adapt to different salt concentration, namely extreme, moderate and low. These type of adaptations may occur as a result of modification of protein structure and other changes in different cell organelles. Thus proteins may play an important role in the adaptation of halophilic archaea/bacteria to saline conditions. The Halophile protein database (HProtDB) is a systematic attempt to document the biochemical and biophysical properties of proteins from halophilic archaea/bacteria which may be involved in adaptation of these organisms to saline conditions. In this database, various physicochemical properties such as molecular weight, theoretical pI, amino acid composition, atomic composition, estimated half-life, instability index, aliphatic index and grand average of hydropathicity (Gravy) have been listed. These physicochemical properties play an important role in identifying the protein structure, bonding pattern and function of the specific proteins. This database is comprehensive, manually curated, non-redundant catalogue of proteins. The database currently contains 59 897 proteins properties extracted from 21 different strains of halophilic archaea/bacteria. The database can be accessed through link. Database URL: http://webapp.cabgrid.res.in/protein/ © The Author(s) 2014. Published by Oxford University Press.
Subramani, Suresh; Kalpana, Raja; Monickaraj, Pankaj Moses; Natarajan, Jeyakumar
2015-04-01
The knowledge on protein-protein interactions (PPI) and their related pathways are equally important to understand the biological functions of the living cell. Such information on human proteins is highly desirable to understand the mechanism of several diseases such as cancer, diabetes, and Alzheimer's disease. Because much of that information is buried in biomedical literature, an automated text mining system for visualizing human PPI and pathways is highly desirable. In this paper, we present HPIminer, a text mining system for visualizing human protein interactions and pathways from biomedical literature. HPIminer extracts human PPI information and PPI pairs from biomedical literature, and visualize their associated interactions, networks and pathways using two curated databases HPRD and KEGG. To our knowledge, HPIminer is the first system to build interaction networks from literature as well as curated databases. Further, the new interactions mined only from literature and not reported earlier in databases are highlighted as new. A comparative study with other similar tools shows that the resultant network is more informative and provides additional information on interacting proteins and their associated networks. Copyright © 2015 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Raschka, Sebastian; Scott, Anne M.; Liu, Nan; Gunturu, Santosh; Huertas, Mar; Li, Weiming; Kuhn, Leslie A.
2018-03-01
While the advantage of screening vast databases of molecules to cover greater molecular diversity is often mentioned, in reality, only a few studies have been published demonstrating inhibitor discovery by screening more than a million compounds for features that mimic a known three-dimensional (3D) ligand. Two factors contribute: the general difficulty of discovering potent inhibitors, and the lack of free, user-friendly software to incorporate project-specific knowledge and user hypotheses into 3D ligand-based screening. The Screenlamp modular toolkit presented here was developed with these needs in mind. We show Screenlamp's ability to screen more than 12 million commercially available molecules and identify potent in vivo inhibitors of a G protein-coupled bile acid receptor within the first year of a discovery project. This pheromone receptor governs sea lamprey reproductive behavior, and to our knowledge, this project is the first to establish the efficacy of computational screening in discovering lead compounds for aquatic invasive species control. Significant enhancement in activity came from selecting compounds based on one of the hypotheses: that matching two distal oxygen groups in the 3D structure of the pheromone is crucial for activity. Six of the 15 most active compounds met these criteria. A second hypothesis—that presence of an alkyl sulfate side chain results in high activity—identified another 6 compounds in the top 10, demonstrating the significant benefits of hypothesis-driven screening.
Proteomics Analysis of the Nucleolus in Adenovirus-infected Cells
Lam, Yun W.; Evans, Vanessa C.; Heesom, Kate J.; Lamond, Angus I.; Matthews, David A.
2010-01-01
Adenoviruses replicate primarily in the host cell nucleus, and it is well established that adenovirus infection affects the structure and function of host cell nucleoli in addition to coding for a number of nucleolar targeted viral proteins. Here we used unbiased proteomics methods, including high throughput mass spectrometry coupled with stable isotope labeling by amino acids in cell culture (SILAC) and traditional two-dimensional gel electrophoresis, to identify quantitative changes in the protein composition of the nucleolus during adenovirus infection. Two-dimensional gel analysis revealed changes in six proteins. By contrast, SILAC-based approaches identified 351 proteins with 24 proteins showing at least a 2-fold change after infection. Of those, four were previously reported to have aberrant localization and/or functional relevance during adenovirus infection. In total, 15 proteins identified as changing in amount by proteomics methods were examined in infected cells using confocal microscopy. Eleven of these proteins showed altered patterns of localization in adenovirus-infected cells. Comparing our data with the effects of actinomycin D on the nucleolar proteome revealed that adenovirus infection apparently specifically targets a relatively small subset of nucleolar antigens at the time point examined. PMID:19812395
Proteomics analysis of the nucleolus in adenovirus-infected cells.
Lam, Yun W; Evans, Vanessa C; Heesom, Kate J; Lamond, Angus I; Matthews, David A
2010-01-01
Adenoviruses replicate primarily in the host cell nucleus, and it is well established that adenovirus infection affects the structure and function of host cell nucleoli in addition to coding for a number of nucleolar targeted viral proteins. Here we used unbiased proteomics methods, including high throughput mass spectrometry coupled with stable isotope labeling by amino acids in cell culture (SILAC) and traditional two-dimensional gel electrophoresis, to identify quantitative changes in the protein composition of the nucleolus during adenovirus infection. Two-dimensional gel analysis revealed changes in six proteins. By contrast, SILAC-based approaches identified 351 proteins with 24 proteins showing at least a 2-fold change after infection. Of those, four were previously reported to have aberrant localization and/or functional relevance during adenovirus infection. In total, 15 proteins identified as changing in amount by proteomics methods were examined in infected cells using confocal microscopy. Eleven of these proteins showed altered patterns of localization in adenovirus-infected cells. Comparing our data with the effects of actinomycin D on the nucleolar proteome revealed that adenovirus infection apparently specifically targets a relatively small subset of nucleolar antigens at the time point examined.
Digital Earth system based river basin data integration
NASA Astrophysics Data System (ADS)
Zhang, Xin; Li, Wanqing; Lin, Chao
2014-12-01
Digital Earth is an integrated approach to build scientific infrastructure. The Digital Earth systems provide a three-dimensional visualization and integration platform for river basin data which include the management data, in situ observation data, remote sensing observation data and model output data. This paper studies the Digital Earth system based river basin data integration technology. Firstly, the construction of the Digital Earth based three-dimensional river basin data integration environment is discussed. Then the river basin management data integration technology is presented which is realized by general database access interface, web service and ActiveX control. Thirdly, the in situ data stored in database tables as records integration is realized with three-dimensional model of the corresponding observation apparatus display in the Digital Earth system by a same ID code. In the next two parts, the remote sensing data and the model output data integration technologies are discussed in detail. The application in the Digital Zhang River basin System of China shows that the method can effectively improve the using efficiency and visualization effect of the data.
Tiered Human Integrated Sequence Search Databases for Shotgun Proteomics.
Deutsch, Eric W; Sun, Zhi; Campbell, David S; Binz, Pierre-Alain; Farrah, Terry; Shteynberg, David; Mendoza, Luis; Omenn, Gilbert S; Moritz, Robert L
2016-11-04
The results of analysis of shotgun proteomics mass spectrometry data can be greatly affected by the selection of the reference protein sequence database against which the spectra are matched. For many species there are multiple sources from which somewhat different sequence sets can be obtained. This can lead to confusion about which database is best in which circumstances-a problem especially acute in human sample analysis. All sequence databases are genome-based, with sequences for the predicted gene and their protein translation products compiled. Our goal is to create a set of primary sequence databases that comprise the union of sequences from many of the different available sources and make the result easily available to the community. We have compiled a set of four sequence databases of varying sizes, from a small database consisting of only the ∼20,000 primary isoforms plus contaminants to a very large database that includes almost all nonredundant protein sequences from several sources. This set of tiered, increasingly complete human protein sequence databases suitable for mass spectrometry proteomics sequence database searching is called the Tiered Human Integrated Search Proteome set. In order to evaluate the utility of these databases, we have analyzed two different data sets, one from the HeLa cell line and the other from normal human liver tissue, with each of the four tiers of database complexity. The result is that approximately 0.8%, 1.1%, and 1.5% additional peptides can be identified for Tiers 2, 3, and 4, respectively, as compared with the Tier 1 database, at substantially increasing computational cost. This increase in computational cost may be worth bearing if the identification of sequence variants or the discovery of sequences that are not present in the reviewed knowledge base entries is an important goal of the study. We find that it is useful to search a data set against a simpler database, and then check the uniqueness of the discovered peptides against a more complex database. We have set up an automated system that downloads all the source databases on the first of each month and automatically generates a new set of search databases and makes them available for download at http://www.peptideatlas.org/thisp/ .
Tiered Human Integrated Sequence Search Databases for Shotgun Proteomics
Deutsch, Eric W.; Sun, Zhi; Campbell, David S.; Binz, Pierre-Alain; Farrah, Terry; Shteynberg, David; Mendoza, Luis; Omenn, Gilbert S.; Moritz, Robert L.
2016-01-01
The results of analysis of shotgun proteomics mass spectrometry data can be greatly affected by the selection of the reference protein sequence database against which the spectra are matched. For many species there are multiple sources from which somewhat different sequence sets can be obtained. This can lead to confusion about which database is best in which circumstances – a problem especially acute in human sample analysis. All sequence databases are genome-based, with sequences for the predicted gene and their protein translation products compiled. Our goal is to create a set of primary sequence databases that comprise the union of sequences from many of the different available sources and make the result easily available to the community. We have compiled a set of four sequence databases of varying sizes, from a small database consisting of only the ~20,000 primary isoforms plus contaminants to a very large database that includes almost all non-redundant protein sequences from several sources. This set of tiered, increasingly complete human protein sequence databases suitable for mass spectrometry proteomics sequence database searching is called the Tiered Human Integrated Search Proteome set. In order to evaluate the utility of these databases, we have analyzed two different data sets, one from the HeLa cell line and the other from normal human liver tissue, with each of the four tiers of database complexity. The result is that approximately 0.8%, 1.1%, and 1.5% additional peptides can be identified for Tiers 2, 3, and 4, respectively, as compared with the Tier 1 database, at substantially increasing computational cost. This increase in computational cost may be worth bearing if the identification of sequence variants or the discovery of sequences that are not present in the reviewed knowledge base entries is an important goal of the study. We find that it is useful to search a data set against a simpler database, and then check the uniqueness of the discovered peptides against a more complex database. We have set up an automated system that downloads all the source databases on the first of each month and automatically generates a new set of search databases and makes them available for download at http://www.peptideatlas.org/thisp/. PMID:27577934
Trezza, Alfonso; Bernini, Andrea; Langella, Andrea; Ascher, David B; Pires, Douglas E V; Sodi, Andrea; Passerini, Ilaria; Pelo, Elisabetta; Rizzo, Stanislao; Niccolai, Neri; Spiga, Ottavia
2017-10-01
The aim of this article is to report the investigation of the structural features of ABCA4, a protein associated with a genetic retinal disease. A new database collecting knowledge of ABCA4 structure may facilitate predictions about the possible functional consequences of gene mutations observed in clinical practice. In order to correlate structural and functional effects of the observed mutations, the structure of mouse P-glycoprotein was used as a template for homology modeling. The obtained structural information and genetic data are the basis of our relational database (ABCA4Database). Sequence variability among all ABCA4-deposited entries was calculated and reported as Shannon entropy score at the residue level. The three-dimensional model of ABCA4 structure was used to locate the spatial distribution of the observed variable regions. Our predictions from structural in silico tools were able to accurately link the functional effects of mutations to phenotype. The development of the ABCA4Database gathers all the available genetic and structural information, yielding a global view of the molecular basis of some retinal diseases. ABCA4 modeled structure provides a molecular basis on which to analyze protein sequence mutations related to genetic retinal disease in order to predict the risk of retinal disease across all possible ABCA4 mutations. Additionally, our ABCA4 predicted structure is a good starting point for the creation of a new data analysis model, appropriate for precision medicine, in order to develop a deeper knowledge network of the disease and to improve the management of patients.
MIPS: analysis and annotation of proteins from whole genomes
Mewes, H. W.; Amid, C.; Arnold, R.; Frishman, D.; Güldener, U.; Mannhaupt, G.; Münsterkötter, M.; Pagel, P.; Strack, N.; Stümpflen, V.; Warfsmann, J.; Ruepp, A.
2004-01-01
The Munich Information Center for Protein Sequences (MIPS-GSF), Neuherberg, Germany, provides protein sequence-related information based on whole-genome analysis. The main focus of the work is directed toward the systematic organization of sequence-related attributes as gathered by a variety of algorithms, primary information from experimental data together with information compiled from the scientific literature. MIPS maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the database of complete cDNAs (German Human Genome Project, NGFN), the database of mammalian protein–protein interactions (MPPI), the database of FASTA homologies (SIMAP), and the interface for the fast retrieval of protein-associated information (QUIPOS). The Arabidopsis thaliana database, the rice database, the plant EST databases (MATDB, MOsDB, SPUTNIK), as well as the databases for the comprehensive set of genomes (PEDANT genomes) are described elsewhere in the 2003 and 2004 NAR database issues, respectively. All databases described, and the detailed descriptions of our projects can be accessed through the MIPS web server (http://mips.gsf.de). PMID:14681354
NASA Technical Reports Server (NTRS)
Bertelrud, Arild; Johnson, Sherylene; Anders, J. B. (Technical Monitor)
2002-01-01
A 2-D (two dimensional) high-lift system experiment was conducted in August of 1996 in the Low Turbulence Pressure Tunnel at NASA Langley Research Center, Hampton, VA. The purpose of the experiment was to obtain transition measurements on a three element high-lift system for CFD (computational fluid dynamics) code validation studies. A transition database has been created using the data from this experiment. The present report details how the hot-film data and the related pressure data are organized in the database. Data processing codes to access the data in an efficient and reliable manner are described and limited examples are given on how to access the database and store acquired information.
iDBPs: a web server for the identification of DNA binding proteins.
Nimrod, Guy; Schushan, Maya; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir
2010-03-01
The iDBPs server uses the three-dimensional (3D) structure of a query protein to predict whether it binds DNA. First, the algorithm predicts the functional region of the protein based on its evolutionary profile; the assumption is that large clusters of conserved residues are good markers of functional regions. Next, various characteristics of the predicted functional region as well as global features of the protein are calculated, such as the average surface electrostatic potential, the dipole moment and cluster-based amino acid conservation patterns. Finally, a random forests classifier is used to predict whether the query protein is likely to bind DNA and to estimate the prediction confidence. We have trained and tested the classifier on various datasets and shown that it outperformed related methods. On a dataset that reflects the fraction of DNA binding proteins (DBPs) in a proteome, the area under the ROC curve was 0.90. The application of the server to an updated version of the N-Func database, which contains proteins of unknown function with solved 3D-structure, suggested new putative DBPs for experimental studies. http://idbps.tau.ac.il/
Identification of immunodominant proteins of the microalgae Prototheca by proteomic analysis
Irrgang, A.; Weise, C.; Murugaiyan, J.; Roesler, U.
2014-01-01
Prototheca zopfii associated with bovine mastitis and human protothecosis exists as two genotypes, of which genotype 1 is considered as non-infectious and genotype 2 as infectious. The mechanism of infection has not yet been described. The present study was aimed to identify genotype 2-specific immunodominant proteins. Prototheca proteins were separated using two-dimensional gel electrophoresis. Subsequent western blotting with rabbit hyperimmune serum revealed 28 protein spots. Matrix-assisted laser desorption ionization time-of-flight mass spectrometry analysis resulted in the identification of 15 proteins including malate dehydrogenase, elongation factor 1-alpha, heat shock protein 70, and 14-3-3 protein, which were previously described as immunogenic proteins of other eukaryotic pathogens. PMID:25755891
Tran, Diem Hong; Shishido, Yuji; Chung, Seong Pil; Trinh, Huong Thi Thanh; Yorita, Kazuko; Sakai, Takashi; Fukui, Kiyoshi
2015-12-10
D-Amino acid oxidase (DAO) is a flavoenzyme that metabolizes D-amino acids and is expected to be a promising therapeutic target of schizophrenia and glioblastoma. The study of DNA-binding proteins has yielded much information in the regulation of transcription and other biological processes. However, proteins interacting with DAO gene have not been elucidated. Our assessment of human DAO promoter activity using luciferase reporter system indicated the 5'-flanking region of this gene (-4289 bp from transcription initiation site) has a regulatory sequence for gene expression, which is regulated by multi-protein complexes interacting with this region. By using pull-down assay coupled with two-dimensional gel electrophoresis and mass spectrometry, we identified six proteins binding to the 5'-flanking region of the human DAO gene (zinc finger C2HC domain-containing protein 1A; histidine-tRNA ligase, cytoplasmic; molybdenum cofactor biosynthesis protein; 60S ribosomal protein L37; calponin-1; calmodulin binding protein and heterogeneous nuclear ribonucleoprotein A2/B1). These preliminary results will contribute to the advance in the understanding of the potential factors associated with the regulatory mechanism of DAO expression. Copyright © 2015 Elsevier B.V. All rights reserved.
Induction of polypeptides in Saccharomyces cerevisiae after ultraviolet irradiation.
Angulo, J F; Schwencke, J; Fernandez, I; Moustacchi, E
1986-07-31
Alterations in the synthesis of proteins following exposure of Saccharomyces cerevisiae to UV light were investigated using radioactive labelling and two dimensional electrophoresis. UV-irradiation induced the synthesis of various proteins. Among them the analogue of the RecA protein of Escherichia coli (Angulo et al. 1985) and two other polypeptides (34 Kd and 35 Kd, pI 5.8) were observed in all four strains analyzed namely two DNA-repair deficient (rad-) strains: (rad6-1 and pso2-1) and their isogenic wild type RAD+ strains.
Levy, Gary; Malik, Minnie; Britten, Joy; Gilden, Melissa; Segars, James; Catherino, William H.
2014-01-01
Objective To investigate the impact of liarozole on transforming growth factor-β3 (TGF-β3) expression, TGF-β3 controlled profibrotic cytokines, and extracellular matrix formation in a three-dimensional (3D) leiomyoma model system. Design Molecular and immunohistochemical analysis in a cell line evaluated in a three-dimensional culture. Setting Laboratory study. Patient(s) None. Intervention(s) Treatment of leiomyoma and myometrial cells with liarozole and TGF-β3 in a three-dimensional culture system. Main Outcome Measure(s) Quantitative real-time reverse-transcriptase polymerase chain reaction and Western blotting to assess fold gene and protein expression of TGF-β3 and TGF-β3 regulated fibrotic cytokines: collagen 1A1 (COL1A1), fibronectin, and versican before and after treatment with liarozole, and confirmatory immunohistochemical stains of treated three-dimensional cultures. Result(s) Both TGF-β3 gene and protein expression were elevated in leiomyoma cells compared with myometrium in two-dimensional and 3D cultures. Treatment with liarozole decreased TGF-β3 gene and protein expression. Extracellular matrix components versican, COL1A1, and fibronectin were also decreased by liarozole treatment in 3D cultures. Treatment of 3D cultures with TGF-β3 increased gene expression and protein production of COL1A1, fibronectin, and versican. Conclusion(s) Liarozole decreased TGF-β3 and TGF-β3–mediated extracellular matrix expression in a 3D uterine leiomyoma culture system. PMID:24825427
NASA Astrophysics Data System (ADS)
Kumar, Dinesh
2013-12-01
Two novel reduced dimensionality (RD) tailored HN(C)N [S.C. Panchal, N.S. Bhavesh, R.V. Hosur, Improved 3D triple resonance experiments, HNN and HN(C)N, for HN and 15N sequential correlations in (13C, 15N) labeled proteins: application to unfolded proteins, J. Biomol. NMR 20 (2001) 135-147] experiments are proposed to facilitate the backbone resonance assignment of proteins both in terms of its accuracy and speed. These experiments - referred here as (4,3)D-hNCOcaNH and (4,3)D-hNcoCANH - exploit the linear combination of backbone 15N and 13C‧/13Cα chemical shifts simultaneously to achieve higher peak dispersion and randomness along their respective F1 dimensions. Simply, this has been achieved by modulating the backbone 15N(i) chemical shifts with that of 13C‧ (i - 1)/13Cα (i - 1) spins following the established reduced dimensionality NMR approach [T. Szyperski, D.C. Yeh, D.K. Sukumaran, H.N. Moseley, G.T. Montelione, Reduced-dimensionality NMR spectroscopy for high-throughput protein resonance assignment, Proc. Natl. Acad. Sci. USA 99 (2002) 8009-8014]. Though the modification is simple it has resulted an ingenious improvement of HN(C)N both in terms of peak dispersion and easiness of establishing the sequential connectivities. The increased dispersion along F1 dimension solves two purposes here: (i) resolves the ambiguities arising because of degenerate 15N chemical shifts and (ii) reduces the signal overlap in F2(15N)-F3(1H) planes (an important requisite in HN(C)N based assignment protocol for facile and unambiguous identification of sequentially connected HSQC peaks). The performance of both these experiments and the assignment protocol has been demonstrated using bovine apo Calbindin-d9k (75 aa) and urea denatured UNC60B (a 152 amino acid ADF/cofilin family protein of Caenorhabditis elegans), as representatives of folded and unfolded protein systems, respectively.
A photoreversible protein-patterning approach for guiding stem cell fate in three-dimensional gels
NASA Astrophysics Data System (ADS)
Deforest, Cole A.; Tirrell, David A.
2015-05-01
Although biochemically patterned hydrogels are capable of recapitulating many critical aspects of the heterogeneous cellular niche, exercising spatial and temporal control of the presentation and removal of biomolecular signalling cues in such systems has proved difficult. Here, we demonstrate a synthetic strategy that exploits two bioorthogonal photochemistries to achieve reversible immobilization of bioactive full-length proteins with good spatial and temporal control within synthetic, cell-laden biomimetic scaffolds. A photodeprotection-oxime-ligation sequence permits user-defined quantities of proteins to be anchored within distinct subvolumes of a three-dimensional matrix, and an ortho-nitrobenzyl ester photoscission reaction facilitates subsequent protein removal. By using this approach to pattern the presentation of the extracellular matrix protein vitronectin, we accomplished reversible differentiation of human mesenchymal stem cells to osteoblasts in a spatially defined manner. Our protein-patterning approach should provide further avenues to probe and direct changes in cell physiology in response to dynamic biochemical signalling.
Dellaire, G.; Farrall, R.; Bickmore, W.A.
2003-01-01
The Nuclear Protein Database (NPD) is a curated database that contains information on more than 1300 vertebrate proteins that are thought, or are known, to localise to the cell nucleus. Each entry is annotated with information on predicted protein size and isoelectric point, as well as any repeats, motifs or domains within the protein sequence. In addition, information on the sub-nuclear localisation of each protein is provided and the biological and molecular functions are described using Gene Ontology (GO) terms. The database is searchable by keyword, protein name, sub-nuclear compartment and protein domain/motif. Links to other databases are provided (e.g. Entrez, SWISS-PROT, OMIM, PubMed, PubMed Central). Thus, NPD provides a gateway through which the nuclear proteome may be explored. The database can be accessed at http://npd.hgu.mrc.ac.uk and is updated monthly. PMID:12520015
An emerging cyberinfrastructure for biodefense pathogen and pathogen–host data
Zhang, C.; Crasta, O.; Cammer, S.; Will, R.; Kenyon, R.; Sullivan, D.; Yu, Q.; Sun, W.; Jha, R.; Liu, D.; Xue, T.; Zhang, Y.; Moore, M.; McGarvey, P.; Huang, H.; Chen, Y.; Zhang, J.; Mazumder, R.; Wu, C.; Sobral, B.
2008-01-01
The NIAID-funded Biodefense Proteomics Resource Center (RC) provides storage, dissemination, visualization and analysis capabilities for the experimental data deposited by seven Proteomics Research Centers (PRCs). The data and its publication is to support researchers working to discover candidates for the next generation of vaccines, therapeutics and diagnostics against NIAID's Category A, B and C priority pathogens. The data includes transcriptional profiles, protein profiles, protein structural data and host–pathogen protein interactions, in the context of the pathogen life cycle in vivo and in vitro. The database has stored and supported host or pathogen data derived from Bacillus, Brucella, Cryptosporidium, Salmonella, SARS, Toxoplasma, Vibrio and Yersinia, human tissue libraries, and mouse macrophages. These publicly available data cover diverse data types such as mass spectrometry, yeast two-hybrid (Y2H), gene expression profiles, X-ray and NMR determined protein structures and protein expression clones. The growing database covers over 23 000 unique genes/proteins from different experiments and organisms. All of the genes/proteins are annotated and integrated across experiments using UniProt Knowledgebase (UniProtKB) accession numbers. The web-interface for the database enables searching, querying and downloading at the level of experiment, group and individual gene(s)/protein(s) via UniProtKB accession numbers or protein function keywords. The system is accessible at http://www.proteomicsresource.org/. PMID:17984082
Phylogenetic reconstruction of South American felids defined by protein electrophoresis.
Slattery, J P; Johnson, W E; Goldman, D; O'Brien, S J
1994-09-01
Phylogenetic associations among six closely related South American felid species were defined by changes in protein-encoding gene loci. We analyzed proteins isolated from skin fibroblasts using two-dimensional electrophoresis and allozymes extracted from blood cells. Genotypes were determined for multiple individuals of ocelot, margay, tigrina, Geoffroy's cat, kodkod, and pampas cat at 548 loci resolved by two-dimensional electrophoresis and 44 allozyme loci. Phenograms were constructed using the methods of Fitch-Margoliash and neighbor-joining on a matrix of Nei's unbiased genetic distances for all pairs of species. Results of a relative-rate test indicate changes in two-dimensional electrophoresis data are constant among all South American felids with respect to a hyena outgroup. Allelic frequencies were transformed to discrete character states for maximum parsimony analysis. Phylogenetic reconstruction indicates a major split occurred approximately 5-6 million years ago, leading to three groups within the ocelot lineage. The earliest divergence led to Leopardus tigrina, followed by a split between an ancestor of an unresolved trichotomy of three species (Oncifelis guigna, O. geoffroyi, and Lynchailuris colocolo) and a recent common ancestor of Leopardus pardalis and L. wiedii. The results suggest that modern South American felids are monophyletic and evolved rapidly after the formation of the Panama land bridge between North and South America.
DOE Office of Scientific and Technical Information (OSTI.GOV)
van der Graaf, M.; van Mierlo, C.P.M.; Hemminga, M.A.
1991-06-11
The first 25 amino acids of the coat protein of cowpea chlorotic mottle virus are essential for binding the encapsidated RNA. Although an {alpha}-helical conformation has been predicted for this highly positively charged N-terminal region. No experimental evidence for this conformation has been presented so far. In this study, two-dimensional proton NMR experiments were performed on a chemically synthesized pentacosapeptide containing the first 25 amino acids of this coat protein. All resonances could be assigned by a combined use of two-dimensional correlated spectroscopy and nuclear Overhauser enhancement spectroscopy carried out at four different temperatures. Various NMR parameters indicate the presencemore » of a conformational ensemble consisting of helical structures rapidly converting into more extended states. Differences in chemical shifts and nuclear Overhauser effects indicate that lowering the temperature induces a shift of the dynamic equilibrium toward more helical structures. At 10{degrees}C, a perceptible fraction of the conformational ensemble consists of structures with an {alpha}-helical conformation between residues 9 and 17, likely starting with a turnlike structure around Thr9 and Arg10. Both the conformation and the position of this helical region agree well with the secondary structure predictions mentioned above.« less
Combinatorial ligand libraries as a two-dimensional method for proteome analysis.
Santucci, Laura; Candiano, Giovanni; Petretto, Andrea; Lavarello, Chiara; Bruschi, Maurizio; Ghiggeri, Gian Marco; Citterio, Attilio; Righetti, Pier Giorgio
2013-07-05
The present report tries to assess the possibility of performing capture of proteomes via combinatorial peptide ligand libraries (CPLL) in a two-dimensional (2D) mode, i.e. via orthogonal complementarity in the capture phase. To that aim, serum proteins are captured at physiological pH either at low ionic strength (25mM NaCl) or at high concentrations of lyotropic salts of the Hofmeister series (1M ammonium sulphate) favouring hydrophobic interaction. Indeed such 2D mechanisms seems to be operative, since 52% of the captured proteins are common to the two capture modes, 20% are specific only of the "ionic" interaction mode and 28% are found only in the "hydrophobically" driven interaction. As an additional bonus, losses of protein species from the initial sample, one of the major drawbacks of CPLLs, are diminished to about 5% and are found only in the ionic capture, whereas the hydrophobically engendered capture is loss-free. Copyright © 2013 Elsevier B.V. All rights reserved.
Wang, Wei; Xia, Minxuan; Chen, Jie; Deng, Fenni; Yuan, Rui; Zhang, Xiaopei; Shen, Fafu
2016-12-01
The data presented in this paper is supporting the research article "Genome-Wide Analysis of Superoxide Dismutase Gene Family in Gossypium raimondii and G. arboreum" [1]. In this data article, we present phylogenetic tree showing dichotomy with two different clusters of SODs inferred by the Bayesian method of MrBayes (version 3.2.4), "Bayesian phylogenetic inference under mixed models" [2], Ramachandran plots of G. raimondii and G. arboreum SODs, the protein sequence used to generate 3D sructure of proteins and the template accession via SWISS-MODEL server, "SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information." [3] and motif sequences of SODs identified by InterProScan (version 4.8) with the Pfam database, "Pfam: the protein families database" [4].
Database Constraints Applied to Metabolic Pathway Reconstruction Tools
Vilaplana, Jordi; Solsona, Francesc; Teixido, Ivan; Usié, Anabel; Karathia, Hiren; Alves, Rui; Mateo, Jordi
2014-01-01
Our group developed two biological applications, Biblio-MetReS and Homol-MetReS, accessing the same database of organisms with annotated genes. Biblio-MetReS is a data-mining application that facilitates the reconstruction of molecular networks based on automated text-mining analysis of published scientific literature. Homol-MetReS allows functional (re)annotation of proteomes, to properly identify both the individual proteins involved in the process(es) of interest and their function. It also enables the sets of proteins involved in the process(es) in different organisms to be compared directly. The efficiency of these biological applications is directly related to the design of the shared database. We classified and analyzed the different kinds of access to the database. Based on this study, we tried to adjust and tune the configurable parameters of the database server to reach the best performance of the communication data link to/from the database system. Different database technologies were analyzed. We started the study with a public relational SQL database, MySQL. Then, the same database was implemented by a MapReduce-based database named HBase. The results indicated that the standard configuration of MySQL gives an acceptable performance for low or medium size databases. Nevertheless, tuning database parameters can greatly improve the performance and lead to very competitive runtimes. PMID:25202745
Wang, Shunfang; Liu, Shuhui
2015-12-19
An effective representation of a protein sequence plays a crucial role in protein sub-nuclear localization. The existing representations, such as dipeptide composition (DipC), pseudo-amino acid composition (PseAAC) and position specific scoring matrix (PSSM), are insufficient to represent protein sequence due to their single perspectives. Thus, this paper proposes two fusion feature representations of DipPSSM and PseAAPSSM to integrate PSSM with DipC and PseAAC, respectively. When constructing each fusion representation, we introduce the balance factors to value the importance of its components. The optimal values of the balance factors are sought by genetic algorithm. Due to the high dimensionality of the proposed representations, linear discriminant analysis (LDA) is used to find its important low dimensional structure, which is essential for classification and location prediction. The numerical experiments on two public datasets with KNN classifier and cross-validation tests showed that in terms of the common indexes of sensitivity, specificity, accuracy and MCC, the proposed fusing representations outperform the traditional representations in protein sub-nuclear localization, and the representation treated by LDA outperforms the untreated one.
Wang, Shunfang; Liu, Shuhui
2015-01-01
An effective representation of a protein sequence plays a crucial role in protein sub-nuclear localization. The existing representations, such as dipeptide composition (DipC), pseudo-amino acid composition (PseAAC) and position specific scoring matrix (PSSM), are insufficient to represent protein sequence due to their single perspectives. Thus, this paper proposes two fusion feature representations of DipPSSM and PseAAPSSM to integrate PSSM with DipC and PseAAC, respectively. When constructing each fusion representation, we introduce the balance factors to value the importance of its components. The optimal values of the balance factors are sought by genetic algorithm. Due to the high dimensionality of the proposed representations, linear discriminant analysis (LDA) is used to find its important low dimensional structure, which is essential for classification and location prediction. The numerical experiments on two public datasets with KNN classifier and cross-validation tests showed that in terms of the common indexes of sensitivity, specificity, accuracy and MCC, the proposed fusing representations outperform the traditional representations in protein sub-nuclear localization, and the representation treated by LDA outperforms the untreated one. PMID:26703574
Wolski, Witold E; Lalowski, Maciej; Jungblut, Peter; Reinert, Knut
2005-01-01
Background Peptide Mass Fingerprinting (PMF) is a widely used mass spectrometry (MS) method of analysis of proteins and peptides. It relies on the comparison between experimentally determined and theoretical mass spectra. The PMF process requires calibration, usually performed with external or internal calibrants of known molecular masses. Results We have introduced two novel MS calibration methods. The first method utilises the local similarity of peptide maps generated after separation of complex protein samples by two-dimensional gel electrophoresis. It computes a multiple peak-list alignment of the data set using a modified Minimum Spanning Tree (MST) algorithm. The second method exploits the idea that hundreds of MS samples are measured in parallel on one sample support. It improves the calibration coefficients by applying a two-dimensional Thin Plate Splines (TPS) smoothing algorithm. We studied the novel calibration methods utilising data generated by three different MALDI-TOF-MS instruments. We demonstrate that a PMF data set can be calibrated without resorting to external or relying on widely occurring internal calibrants. The methods developed here were implemented in R and are part of the BioConductor package mscalib available from . Conclusion The MST calibration algorithm is well suited to calibrate MS spectra of protein samples resulting from two-dimensional gel electrophoretic separation. The TPS based calibration algorithm might be used to correct systematic mass measurement errors observed for large MS sample supports. As compared to other methods, our combined MS spectra calibration strategy increases the peptide/protein identification rate by an additional 5 – 15%. PMID:16102175
A Two-Layer Least Squares Support Vector Machine Approach to Credit Risk Assessment
NASA Astrophysics Data System (ADS)
Liu, Jingli; Li, Jianping; Xu, Weixuan; Shi, Yong
Least squares support vector machine (LS-SVM) is a revised version of support vector machine (SVM) and has been proved to be a useful tool for pattern recognition. LS-SVM had excellent generalization performance and low computational cost. In this paper, we propose a new method called two-layer least squares support vector machine which combines kernel principle component analysis (KPCA) and linear programming form of least square support vector machine. With this method sparseness and robustness is obtained while solving large dimensional and large scale database. A U.S. commercial credit card database is used to test the efficiency of our method and the result proved to be a satisfactory one.
An effective placental cotyledons proteins extraction method for 2D gel electrophoresis.
Tan, Niu J; Daim, Leona D J; Jamil, Amilia A M; Mohtarrudin, Norhafizah; Thilakavathy, Karuppiah
2017-03-01
Effective protein extraction is essential especially in producing a well-resolved proteome on 2D gels. A well-resolved placental cotyledon proteome, with good reproducibility, have allowed researchers to study the proteins underlying the physiology and pathophysiology of pregnancy. The aim of this study is to determine the best protein extraction protocol for the extraction of protein from placental cotyledons tissues for a two-dimensional gel electrophoresis (2D-GE). Based on widely used protein extraction strategies, 12 different extraction methodologies were carefully selected, which included one chemical extraction, two mechanical extraction coupled protein precipitations, and nine chemical extraction coupled protein precipitations. Extracted proteins were resolved in a one-dimensional gel electrophoresis and 2D-GE; then, it was compared with set criteria: extraction efficacy, protein resolution, reproducibility, and recovery efficiency. Our results revealed that a better profile was obtained by chemical extraction in comparison to mechanical extraction. We further compared chemical extraction coupled protein precipitation methodologies, where the DNase/lithium chloride-dense sucrose homogenization coupled dichloromethane-methanol precipitation (DNase/LiCl-DSH-D/MPE) method showed good protein extraction efficiency. This, however, was carried out with the best protein resolution and proteome reproducibility on 2D-gels. DNase/LiCl-DSH-D/MPE was efficient in the extraction of proteins from placental cotyledons tissues. In addition, this methodology could hypothetically allow the protein extraction of any tissue that contains highly abundant lipid and glycogen. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
NASA Astrophysics Data System (ADS)
Muench, Philip James
Rates in frozen glycerin/water solutions at temperatures between 1.4 K and 20 K are reported for a copper-containing protein, azurin, and a cobalt-containing biomolecular complex, vitamin B_{rm 12r}, the paramagnetic product of the photolysis of coenzyme B_{12}. Results are interpreted in terms of a spectral dimensionality. Rates are also reported for nitrosyl ferrous myoglobin in frozen water solution, which exhibits a dominant one-phonon relaxation process up to 20 K and thus does not reveal spectral dimensionality. The anomalous variation of rate with temperature observed in several iron-containing proteins is not conspicuous here. In a model two-phonon mechanism of relaxation, temperature dependence is fixed by a spectral dimensionality, m, which specifies the variation of vibrational density of states with frequency rho(nu ) ~ nu ^{rm m-1} and is named in analogy with the Debye density of states in 1-, 2-, and 3-dimensional crystals. At sufficiently high temperatures, a non-resonant two-phonon process (Raman) should dominate the relaxation of a paramagnetic ion unless low-lying (under ^{~}70 cm^ {-1}) electronic states are present, as in many rare earths and in high spin ferric complexes, including many ferric proteins. The temperature dependence of the Raman rate for a Kramers ion (odd number of electrons) is T^{rm 3+2m} if temperature is sufficiently lower than Theta = hnu_{rm max} /k, the Debye temperature. The values of m from relaxation data on frozen solutions of a protein have sometimes been dependent upon solvent conditions. The maximum values of m for heme proteins, iron-sulfur proteins, and one copper -and-iron-containing protein, have ranged from about 1.3 to 1.8. Pulse saturation/recovery was used. The recoveries were not exponential, but rates were estimated from semilogarithmic displays of signals or from numerical fitting. The temperature dependence of the rates for azurin between 1.5 K and 22 K can be fit with a spectral dimensionality of 3 and a rather low Debye temperature near 69 K, in contrast to iron proteins. Relaxation of vitamin B_{ rm 12r} differed between samples, indicating varied photolysis or freezing. The Raman relaxation was well fit by a simple power law in temperature, but the values of m varied from 1.14 to 1.48 between samples.
The Evaluation of Forms of Assessment Using N-Dimensional Filtering
ERIC Educational Resources Information Center
Dron, Jon; Boyne, Chris; Mitchell, Richard
2004-01-01
This paper describes the use of the CoFIND (Collaborative Filter in N Dimensions) system to evaluate two assessment styles. CoFIND is a resource database which organizes itself around its users' needs. Learners enter resources, categorize, then rate them using "qualities," aspects of resources which learners find worthwhile, the n dimensions of…
Bu, Gui-jun; Yu, Jing; Di, Hui-hui; Luo, Shi-jia; Zhou, Da-zhai; Xiao, Qiang
2015-02-01
The composition and structure of humic acids formed during composting play an important influence on the quality and mature of compost. In order to explore the composition and evolution mechanism, municipal solid wastes were collected to compost and humic and fulvic acids were obtained from these composted municipal solid wastes. Furthermore, fourier transform infrared spectra and two-dimensional correlation analysis were applied to study the composition and transformation of humic and fulvic acids during composting. The results from fourier transform infrared spectra showed that, the composition of humic acids was complex, and several absorbance peaks were observed at 2917-2924, 2844-2852, 2549, 1662, 1622, 1566, 1454, 1398, 1351, 990-1063, 839 and 711 cm(-1). Compared to humic acids, the composition of fulvci acids was simple, and only three peaks were detected at 1725, 1637 and 990 cm(-1). The appearance of these peaks showed that both humic and fulvic acids comprised the benzene originated from lignin and the polysaccharide. In addition, humic acids comprised a large number of aliphatic and protein which were hardly detected in fulvic acids. Aliphatic, polysaccharide, protein and lignin all were degraded during composting, however, the order of degradation was different between humic and fulvci acids. The result from two-dimensional correlation analysis showed that, organic compounds in humic acids were degraded in the following sequence: aliphatic> protein> polysaccharide and lignin, while that in fulvic acids was as following: protein> polysaccharide and aliphatic. A large number of carboxyl, alcohols and ethers were formed during the degradation process, and the carboxyl was transformed into carbonates. It can be concluded that, fourier transform infrared spectra coupled with two-dimensional correlation analysis not only can analyze the function group composition of humic substances, but also can characterize effectively the degradation sequence of these groups and identified the formation mechanism and dynamics of humic substances during composting.
ProteinShader: illustrative rendering of macromolecules
Weber, Joseph R
2009-01-01
Background Cartoon-style illustrative renderings of proteins can help clarify structural features that are obscured by space filling or balls and sticks style models, and recent advances in programmable graphics cards offer many new opportunities for improving illustrative renderings. Results The ProteinShader program, a new tool for macromolecular visualization, uses information from Protein Data Bank files to produce illustrative renderings of proteins that approximate what an artist might create by hand using pen and ink. A combination of Hermite and spherical linear interpolation is used to draw smooth, gradually rotating three-dimensional tubes and ribbons with a repeating pattern of texture coordinates, which allows the application of texture mapping, real-time halftoning, and smooth edge lines. This free platform-independent open-source program is written primarily in Java, but also makes extensive use of the OpenGL Shading Language to modify the graphics pipeline. Conclusion By programming to the graphics processor unit, ProteinShader is able to produce high quality images and illustrative rendering effects in real-time. The main feature that distinguishes ProteinShader from other free molecular visualization tools is its use of texture mapping techniques that allow two-dimensional images to be mapped onto the curved three-dimensional surfaces of ribbons and tubes with minimum distortion of the images. PMID:19331660
An Evaluation of Database Solutions to Spatial Object Association
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kumar, V S; Kurc, T; Saltz, J
2008-06-24
Object association is a common problem encountered in many applications. Spatial object association, also referred to as crossmatch of spatial datasets, is the problem of identifying and comparing objects in two datasets based on their positions in a common spatial coordinate system--one of the datasets may correspond to a catalog of objects observed over time in a multi-dimensional domain; the other dataset may consist of objects observed in a snapshot of the domain at a time point. The use of database management systems to the solve the object association problem provides portability across different platforms and also greater flexibility. Increasingmore » dataset sizes in today's applications, however, have made object association a data/compute-intensive problem that requires targeted optimizations for efficient execution. In this work, we investigate how database-based crossmatch algorithms can be deployed on different database system architectures and evaluate the deployments to understand the impact of architectural choices on crossmatch performance and associated trade-offs. We investigate the execution of two crossmatch algorithms on (1) a parallel database system with active disk style processing capabilities, (2) a high-throughput network database (MySQL Cluster), and (3) shared-nothing databases with replication. We have conducted our study in the context of a large-scale astronomy application with real use-case scenarios.« less
MIPS: a database for genomes and protein sequences
Mewes, H. W.; Frishman, D.; Güldener, U.; Mannhaupt, G.; Mayer, K.; Mokrejs, M.; Morgenstern, B.; Münsterkötter, M.; Rudd, S.; Weil, B.
2002-01-01
The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) continues to provide genome-related information in a systematic way. MIPS supports both national and European sequencing and functional analysis projects, develops and maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences, and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the databases for the comprehensive set of genomes (PEDANT genomes), the database of annotated human EST clusters (HIB), the database of complete cDNAs from the DHGP (German Human Genome Project), as well as the project specific databases for the GABI (Genome Analysis in Plants) and HNB (Helmholtz–Netzwerk Bioinformatik) networks. The Arabidospsis thaliana database (MATDB), the database of mitochondrial proteins (MITOP) and our contribution to the PIR International Protein Sequence Database have been described elsewhere [Schoof et al. (2002) Nucleic Acids Res., 30, 91–93; Scharfe et al. (2000) Nucleic Acids Res., 28, 155–158; Barker et al. (2001) Nucleic Acids Res., 29, 29–32]. All databases described, the protein analysis tools provided and the detailed descriptions of our projects can be accessed through the MIPS World Wide Web server (http://mips.gsf.de). PMID:11752246
MIPS: a database for genomes and protein sequences.
Mewes, H W; Frishman, D; Güldener, U; Mannhaupt, G; Mayer, K; Mokrejs, M; Morgenstern, B; Münsterkötter, M; Rudd, S; Weil, B
2002-01-01
The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) continues to provide genome-related information in a systematic way. MIPS supports both national and European sequencing and functional analysis projects, develops and maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences, and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the databases for the comprehensive set of genomes (PEDANT genomes), the database of annotated human EST clusters (HIB), the database of complete cDNAs from the DHGP (German Human Genome Project), as well as the project specific databases for the GABI (Genome Analysis in Plants) and HNB (Helmholtz-Netzwerk Bioinformatik) networks. The Arabidospsis thaliana database (MATDB), the database of mitochondrial proteins (MITOP) and our contribution to the PIR International Protein Sequence Database have been described elsewhere [Schoof et al. (2002) Nucleic Acids Res., 30, 91-93; Scharfe et al. (2000) Nucleic Acids Res., 28, 155-158; Barker et al. (2001) Nucleic Acids Res., 29, 29-32]. All databases described, the protein analysis tools provided and the detailed descriptions of our projects can be accessed through the MIPS World Wide Web server (http://mips.gsf.de).
Lecomte, Marie M J; Atkinson, Kelly R; Kay, Daniel P; Simons, Joanne L; Ingram, John R
2013-02-01
The use of biomarkers in skin is a novel diagnostic tool. Interstitial fluid (ISF) from skin provides a snapshot of proteins secreted at the time of sampling giving insights into the patient's health status. A minimally invasive technique for the transdermal collection of human ISF proteins. A low frequency ultrasonic skin permeation device (SonoPrep ultrasonic skin permeation system) was used to produce micropores in the stratum corneum through which ISF was extracted using a portable pulsed vacuum ISF collection device. On average, protein concentrations recovered ranged between 0.064 and 4.792 μg/μL (mean 1.258 μg/μL). Two-dimensional gel electrophoresis revealed that this sample type was amenable to this type of analysis. Gel images indicated that both highly abundant proteins and lower abundance proteins were isolated from the skin. Western blot analysis confirmed the presence of proteins commonly found in plasma and the epidermis. A minimally invasive method for the transdermal recovery of ISF proteins has been developed. We have demonstrated that ISF samples obtained using this approach can be analysed with proteomic techniques, such as two-dimensional gel electrophoresis and western blots, providing another tool for the identification of disease specific protein biomarkers. © 2012 John Wiley & Sons A/S.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kalbitzer, H.R.; Neidig, K.P.; Hengstenberg, W.
1991-11-19
Complete sequence-specific assignments of the {sup 1}H NMR spectrum of HPr protein from Staphylococcus aureus were obtained by two-dimensional NMR methods. Important secondary structure elements that can be derived from the observed nuclear Overhauser effects are a large antiparallel {beta}-pleated sheet consisting of four strands, A, B, C, D, a segment S{sub AB} consisting of an extended region around the active-center histidine (His-15) and an {alpha}-helix, a half-turn between strands B and C, a segment S{sub CD} which shows no typical secondary structure, and the {alpha}-helical, C-terminal segment S{sub term}. These general structural features are similar to those found earliermore » in HPr proteins from different microorganisms such as Escherichia coli, Bacillus subtilis, and Streptococcus faecalis.« less
Analysis of proteins using DIGE and MALDI mass spectrometry
In this work the sensitivity of the quantitative proteomics approach 2D-DIGE/MS (twoDimensional Difference Gel Electrophoresis / Mass Spectrometry) was tested by detecting decreasing amounts of a specific protein at the low picomole and sub-picomole range. Sensitivity of the 2D-D...
Proteome Analysis of the Plasma Membrane of Mycobacterium Tuberculosis
Arora, Shalini; Kosalai, K.; Namane, Abdelkader; Pym, Alex S.; Cole, Stewart T.
2002-01-01
The plasma membrane of Mycobacterium tuberculosis is likely to contain proteins that could serve as novel drug targets, diagnostic probes or even components of a vaccine against tuberculosis. With this in mind, we have undertaken proteome analysis of the membrane of M. tuberculosis H37Rv. Isolated membrane vesicles were extracted with either a detergent (Triton X114) or an alkaline buffer (carbonate) following two of the protocols recommended for membrane protein enrichment. Proteins were resolved by 2D-GE using immobilized pH gradient (IPG) strips, and identified by peptide mass mapping utilizing the M. tuberculosis genome database. The two extraction procedures yielded patterns with minimal overlap. Only two proteins, both HSPs, showed a common presence. MALDI–MS analysis of 61 spots led to the identification of 32 proteins, 17 of which were new to the M. tuberculosis proteome database. We classified 19 of the identified proteins as ‘membrane-associated’; 14 of these were further classified as ‘membrane-bound’, three of which were lipoproteins. The remaining proteins included four heat-shock proteins and several enzymes involved in energy or lipid metabolism. Extraction with Triton X114 was found to be more effective than carbonate for detecting ‘putative’ M. tuberculosis membrane proteins. The protocol was also found to be suitable for comparing BCG and M. tuberculosis membranes, identifying ESAT-6 as being expressed selectively in M. tuberculosis. While this study demonstrates for the first time some of the membrane proteins of M. tuberculosis, it also underscores the problems associated with proteomic analysis of a complex membrane such as that of a mycobacterium. PMID:18629250
Negative Example Selection for Protein Function Prediction: The NoGO Database
Youngs, Noah; Penfold-Brown, Duncan; Bonneau, Richard; Shasha, Dennis
2014-01-01
Negative examples – genes that are known not to carry out a given protein function – are rarely recorded in genome and proteome annotation databases, such as the Gene Ontology database. Negative examples are required, however, for several of the most powerful machine learning methods for integrative protein function prediction. Most protein function prediction efforts have relied on a variety of heuristics for the choice of negative examples. Determining the accuracy of methods for negative example prediction is itself a non-trivial task, given that the Open World Assumption as applied to gene annotations rules out many traditional validation metrics. We present a rigorous comparison of these heuristics, utilizing a temporal holdout, and a novel evaluation strategy for negative examples. We add to this comparison several algorithms adapted from Positive-Unlabeled learning scenarios in text-classification, which are the current state of the art methods for generating negative examples in low-density annotation contexts. Lastly, we present two novel algorithms of our own construction, one based on empirical conditional probability, and the other using topic modeling applied to genes and annotations. We demonstrate that our algorithms achieve significantly fewer incorrect negative example predictions than the current state of the art, using multiple benchmarks covering multiple organisms. Our methods may be applied to generate negative examples for any type of method that deals with protein function, and to this end we provide a database of negative examples in several well-studied organisms, for general use (The NoGO database, available at: bonneaulab.bio.nyu.edu/nogo.html). PMID:24922051
Kleinau, Gunnar; Kreuchwig, Annika; Worth, Catherine L; Krause, Gerd
2010-06-01
The collection, description and molecular analysis of naturally occurring (pathogenic) mutations are important for understanding the functional mechanisms and malfunctions of biological units such as proteins. Numerous databases collate a huge amount of functional data or descriptions of mutations, but tools to analyse the molecular effects of genetic variations are as yet poorly provided. The goal of this work was therefore to develop a translational web-application that facilitates the interactive linkage of functional and structural data and which helps improve our understanding of the molecular basis of naturally occurring gain- or loss- of function mutations. Here we focus on the human glycoprotein hormone receptors (GPHRs), for which a huge number of mutations are known to cause diseases. We describe new options for interactive data analyses within three-dimensional structures, which enable the assignment of molecular relationships between structure and function. Strikingly, as the functional data are converted into relational percentage values, the system allows the comparison and classification of data from different GPHR subtypes and different experimental approaches. Our new application has been incorporated into a freely available database and website for the GPHRs (http://www.ssfa-gphr.de), but the principle development would also be applicable to other macromolecules.
The Movable Type Method Applied to Protein-Ligand Binding.
Zheng, Zheng; Ucisik, Melek N; Merz, Kenneth M
2013-12-10
Accurately computing the free energy for biological processes like protein folding or protein-ligand association remains a challenging problem. Both describing the complex intermolecular forces involved and sampling the requisite configuration space make understanding these processes innately difficult. Herein, we address the sampling problem using a novel methodology we term "movable type". Conceptually it can be understood by analogy with the evolution of printing and, hence, the name movable type. For example, a common approach to the study of protein-ligand complexation involves taking a database of intact drug-like molecules and exhaustively docking them into a binding pocket. This is reminiscent of early woodblock printing where each page had to be laboriously created prior to printing a book. However, printing evolved to an approach where a database of symbols (letters, numerals, etc.) was created and then assembled using a movable type system, which allowed for the creation of all possible combinations of symbols on a given page, thereby, revolutionizing the dissemination of knowledge. Our movable type (MT) method involves the identification of all atom pairs seen in protein-ligand complexes and then creating two databases: one with their associated pairwise distant dependent energies and another associated with the probability of how these pairs can combine in terms of bonds, angles, dihedrals and non-bonded interactions. Combining these two databases coupled with the principles of statistical mechanics allows us to accurately estimate binding free energies as well as the pose of a ligand in a receptor. This method, by its mathematical construction, samples all of configuration space of a selected region (the protein active site here) in one shot without resorting to brute force sampling schemes involving Monte Carlo, genetic algorithms or molecular dynamics simulations making the methodology extremely efficient. Importantly, this method explores the free energy surface eliminating the need to estimate the enthalpy and entropy components individually. Finally, low free energy structures can be obtained via a free energy minimization procedure yielding all low free energy poses on a given free energy surface. Besides revolutionizing the protein-ligand docking and scoring problem this approach can be utilized in a wide range of applications in computational biology which involve the computation of free energies for systems with extensive phase spaces including protein folding, protein-protein docking and protein design.
DOE Office of Scientific and Technical Information (OSTI.GOV)
De Re, Eleonora; Schlau-Cohen, Gabriela S.; Leverenz, Ryan L.
Carotenoids play an essential role in photoprotection, interacting with other pigments to safely dissipate excess absorbed energy as heat. In cyanobacteria, the short time scale photoprotective mechanisms involve the photoactive orange carotenoid protein (OCP), which binds a single carbonyl carotenoid. Blue-green light induces the photoswitching of OCP from its ground state form (OCPO) to a metastable photoproduct (OCPR). OCPR can bind to the phycobilisome antenna and induce fluorescence quenching. The photoswitching is accompanied by structural and functional changes at the level of the protein and of the bound carotenoid. In this study, we use broadband two-dimensional electronic spectroscopy to lookmore » at the differences in excited state dynamics of the carotenoid in the two forms of OCP. Our results provide insight into the origin of the pronounced vibrational lineshape and oscillatory dynamics observed in linear absorption and 2D electronic spectroscopy of OCPO and the large inhomogeneous broadening in OCPR, with consequences for the chemical function of the two forms.« less
Quantify spatial relations to discover handwritten graphical symbols
NASA Astrophysics Data System (ADS)
Li, Jinpeng; Mouchère, Harold; Viard-Gaudin, Christian
2012-01-01
To model a handwritten graphical language, spatial relations describe how the strokes are positioned in the 2-dimensional space. Most of existing handwriting recognition systems make use of some predefined spatial relations. However, considering a complex graphical language, it is hard to express manually all the spatial relations. Another possibility would be to use a clustering technique to discover the spatial relations. In this paper, we discuss how to create a relational graph between strokes (nodes) labeled with graphemes in a graphical language. Then we vectorize spatial relations (edges) for clustering and quantization. As the targeted application, we extract the repetitive sub-graphs (graphical symbols) composed of graphemes and learned spatial relations. On two handwriting databases, a simple mathematical expression database and a complex flowchart database, the unsupervised spatial relations outperform the predefined spatial relations. In addition, we visualize the frequent patterns on two text-lines containing Chinese characters.
Wang, Jingjing; Sun, Tao; Gao, Ni; Menon, Desmond Dev; Luo, Yanxia; Gao, Qi; Li, Xia; Wang, Wei; Zhu, Huiping; Lv, Pingxin; Liang, Zhigang; Tao, Lixin; Liu, Xiangtong; Guo, Xiuhua
2014-01-01
To determine the value of contourlet textural features obtained from solitary pulmonary nodules in two dimensional CT images used in diagnoses of lung cancer. A total of 6,299 CT images were acquired from 336 patients, with 1,454 benign pulmonary nodule images from 84 patients (50 male, 34 female) and 4,845 malignant from 252 patients (150 male, 102 female). Further to this, nineteen patient information categories, which included seven demographic parameters and twelve morphological features, were also collected. A contourlet was used to extract fourteen types of textural features. These were then used to establish three support vector machine models. One comprised a database constructed of nineteen collected patient information categories, another included contourlet textural features and the third one contained both sets of information. Ten-fold cross-validation was used to evaluate the diagnosis results for the three databases, with sensitivity, specificity, accuracy, the area under the curve (AUC), precision, Youden index, and F-measure were used as the assessment criteria. In addition, the synthetic minority over-sampling technique (SMOTE) was used to preprocess the unbalanced data. Using a database containing textural features and patient information, sensitivity, specificity, accuracy, AUC, precision, Youden index, and F-measure were: 0.95, 0.71, 0.89, 0.89, 0.92, 0.66, and 0.93 respectively. These results were higher than results derived using the database without textural features (0.82, 0.47, 0.74, 0.67, 0.84, 0.29, and 0.83 respectively) as well as the database comprising only textural features (0.81, 0.64, 0.67, 0.72, 0.88, 0.44, and 0.85 respectively). Using the SMOTE as a pre-processing procedure, new balanced database generated, including observations of 5,816 benign ROIs and 5,815 malignant ROIs, and accuracy was 0.93. Our results indicate that the combined contourlet textural features of solitary pulmonary nodules in CT images with patient profile information could potentially improve the diagnosis of lung cancer.
Basco, Davide; Nicchia, Grazia Paola; Desaphy, Jean-François; Camerino, Diana Conte; Frigeri, Antonio; Svelto, Maria
2010-12-01
Muscle atrophy occurring in several pathophysiological conditions determines decreases in muscle protein synthesis, increases in the rate of proteolysis and changes in muscle fiber composition. To determine the effect of muscle atrophy induced by hindlimb unloading (HU) on membrane proteins from rat soleus, a proteomic approach based on two-dimensional Blue Native/SDS-PAGE was performed. Proteomic analysis of normal and HU soleus muscle demonstrates statistically significant changes in the relative level of 36 proteins. Among the proteins identified by mass spectrometry, most are involved in pathways associated with muscle fuel utilization, indicating a shift in metabolism from oxidative to glycolytic. Moreover, immunoblotting analysis revealed an increase in aquaporin-4 (AQP4) water channel and an alteration of proteins belonging to the dystrophin-glycoprotein complex (DGC). AQP4 and DGC are regulated in soleus muscle subjected to simulated microgravity in response to compensatory mechanisms induced by muscle atrophy, and they parallel the slow-to-fast twitch conversion that occurs in soleus fibers during HU. In conclusion, the alterations of soleus muscle membrane proteome may play a pivotal role in the mechanisms involved in disuse-induced muscle atrophy.
7A projection map of the S-layer protein sbpA obtained with trehalose-embedded monolayer crystals.
Norville, Julie E; Kelly, Deborah F; Knight, Thomas F; Belcher, Angela M; Walz, Thomas
2007-12-01
Two-dimensional crystallization on lipid monolayers is a versatile tool to obtain structural information of proteins by electron microscopy. An inherent problem with this approach is to prepare samples in a way that preserves the crystalline order of the protein array and produces specimens that are sufficiently flat for high-resolution data collection at high tilt angles. As a test specimen to optimize the preparation of lipid monolayer crystals for electron microscopy imaging, we used the S-layer protein sbpA, a protein with potential for designing arrays of both biological and inorganic materials with engineered properties for a variety of nanotechnology applications. Sugar embedding is currently considered the best method to prepare two-dimensional crystals of membrane proteins reconstituted into lipid bilayers. We found that using a loop to transfer lipid monolayer crystals to an electron microscopy grid followed by embedding in trehalose and quick-freezing in liquid ethane also yielded the highest resolution images for sbpA lipid monolayer crystals. Using images of specimens prepared in this way we could calculate a projection map of sbpA at 7A resolution, one of the highest resolution projection structures obtained with lipid monolayer crystals to date.
NASA Astrophysics Data System (ADS)
Iyer, Ganesh Hariharan
The first part of this research involved a study of the nature and extent of nonbonded interactions at crystal and oligomer interfaces. A survey was compiled of several characteristics of intersubunit contacts in 58 different oligomeric proteins, and of the intermolecular contacts in 223 protein crystal structures. Routines written in "S" language were utilized for the generation of the observed and expected contacts. The information in the Protein Data Bank (PDB) was extracted using the database management system, Protein Knowledge Base (PKB). Potentials of mean force for atom-atom contacts and residue-residue contacts were derived by comparison of the number of observed interactions with the number expected by mass action. Preference association matrices and log-linear analyses were applied to determine the different factors that could contribute to the overall interactions at the interfaces of oligomers and crystals. Surface patches at oligomer and crystal interfaces were also studied to further investigate the origin of the differences in their stabilities. Total number of atoms in contact and the secondary structure elements involved are similar in the two types of interfaces. Crystal contacts result from more numerous interactions by polar residues, compared with a tendency toward nonpolar amino acid prominent in oligomer interfaces. Contact potentials indicate that hydrophobic interactions at oligomer interfaces favor aromatic amino acids and methionine over aliphatic amino acids; and that crystal contacts form in such a way as to avoid inclusion of hydrophobic interactions. The second part involved the development of a new class of biomaterials from two-dimensional arrays of ordered proteins. Point mutations were planned to introduce cysteine residues at appropriate locations to enable cross-linking at the molecular interface within given crystallographic planes. Crystallization and subsequent cross-linking of the modified protein would lead to the formation of arrays on subsequent dissociation of the crystal. Novel protein architectures can be generated from these cross-linked nanostructures. Experiments with model protein, maltose-binding protein (MBP) were performed to develop purification, cross-linking and crystallization techniques. The long-term goal of this project is to apply the experience gained with MBP to the fabrication of nanomaterials from other, application-specific proteins for ultrafiltration and microelectronic devices.
Navigating through the Jungle of Allergens: Features and Applications of Allergen Databases.
Radauer, Christian
2017-01-01
The increasing number of available data on allergenic proteins demanded the establishment of structured, freely accessible allergen databases. In this review article, features and applications of 6 of the most widely used allergen databases are discussed. The WHO/IUIS Allergen Nomenclature Database is the official resource of allergen designations. Allergome is the most comprehensive collection of data on allergens and allergen sources. AllergenOnline is aimed at providing a peer-reviewed database of allergen sequences for prediction of allergenicity of proteins, such as those planned to be inserted into genetically modified crops. The Structural Database of Allergenic Proteins (SDAP) provides a database of allergen sequences, structures, and epitopes linked to bioinformatics tools for sequence analysis and comparison. The Immune Epitope Database (IEDB) is the largest repository of T-cell, B-cell, and major histocompatibility complex protein epitopes including epitopes of allergens. AllFam classifies allergens into families of evolutionarily related proteins using definitions from the Pfam protein family database. These databases contain mostly overlapping data, but also show differences in terms of their targeted users, the criteria for including allergens, data shown for each allergen, and the availability of bioinformatics tools. © 2017 S. Karger AG, Basel.
Detection of alternative splice variants at the proteome level in Aspergillus flavus.
Chang, Kung-Yen; Georgianna, D Ryan; Heber, Steffen; Payne, Gary A; Muddiman, David C
2010-03-05
Identification of proteins from proteolytic peptides or intact proteins plays an essential role in proteomics. Researchers use search engines to match the acquired peptide sequences to the target proteins. However, search engines depend on protein databases to provide candidates for consideration. Alternative splicing (AS), the mechanism where the exon of pre-mRNAs can be spliced and rearranged to generate distinct mRNA and therefore protein variants, enable higher eukaryotic organisms, with only a limited number of genes, to have the requisite complexity and diversity at the proteome level. Multiple alternative isoforms from one gene often share common segments of sequences. However, many protein databases only include a limited number of isoforms to keep minimal redundancy. As a result, the database search might not identify a target protein even with high quality tandem MS data and accurate intact precursor ion mass. We computationally predicted an exhaustive list of putative isoforms of Aspergillus flavus proteins from 20 371 expressed sequence tags to investigate whether an alternative splicing protein database can assign a greater proportion of mass spectrometry data. The newly constructed AS database provided 9807 new alternatively spliced variants in addition to 12 832 previously annotated proteins. The searches of the existing tandem MS spectra data set using the AS database identified 29 new proteins encoded by 26 genes. Nine fungal genes appeared to have multiple protein isoforms. In addition to the discovery of splice variants, AS database also showed potential to improve genome annotation. In summary, the introduction of an alternative splicing database helps identify more proteins and unveils more information about a proteome.
Singh, Raghvendra Pratap; Singh, Ram Nageena; Srivastava, Manish K; Srivastava, Alok Kumar; Kumar, Sudheer; Dubey, Ramesh Chandra; Sharma, Arun Kumar
2012-01-01
Methylobacteria are ubiquitous in the biosphere which are capable of growing on C1 compounds such as formate, formaldehyde, methanol and methylamine as well as on a wide range of multi-carbon growth substrates such as C2, C3 and C4 compounds due to the methylotrophic enzymes methanol dehydrogenase (MDH). MDH is performing these functions with the help of a key protein mxaF. Unfortunately, detailed structural analysis and homology modeling of mxaF is remains undefined. Hence, the objective of this research is the characterization and three dimensional modeling of mxaF protein from three different methylotrophs by using I-TASSER server. The predicted model were further optimize and validate by Profile 3D, Errat, Verifiy3-D and PROCHECK server. Predicted and best evaluated models have been successfully deposited to PMDB database with PMDB ID PM0077505, PM0077506 and PM0077507. Active site identification revealed 11, 13 and 14 putative functional site residues in respected models. It may play a major role during protein-protein, and protein-cofactor interactions. This study can provide us an ab-initio and detail information to understand the structure, mechanism of action and regulation of mxaF protein.
Singh, Raghvendra Pratap; Singh, Ram Nageena; Srivastava, Manish K; Srivastava, Alok Kumar; Kumar, Sudheer; Dubey, Ramesh Chandra; Sharma, Arun Kumar
2012-01-01
Methylobacteria are ubiquitous in the biosphere which are capable of growing on C1 compounds such as formate, formaldehyde, methanol and methylamine as well as on a wide range of multi-carbon growth substrates such as C2, C3 and C4 compounds due to the methylotrophic enzymes methanol dehydrogenase (MDH). MDH is performing these functions with the help of a key protein mxaF. Unfortunately, detailed structural analysis and homology modeling of mxaF is remains undefined. Hence, the objective of this research is the characterization and three dimensional modeling of mxaF protein from three different methylotrophs by using I-TASSER server. The predicted model were further optimize and validate by Profile 3D, Errat, Verifiy3-D and PROCHECK server. Predicted and best evaluated models have been successfully deposited to PMDB database with PMDB ID PM0077505, PM0077506 and PM0077507. Active site identification revealed 11, 13 and 14 putative functional site residues in respected models. It may play a major role during protein-protein, and protein-cofactor interactions. This study can provide us an ab-initio and detail information to understand the structure, mechanism of action and regulation of mxaF protein. PMID:23275704
Xie, Hai-Yang; Xia, Wei-Liang; Zhang, Chun-Chao; Wu, Li-Ming; Ji, Hao-Feng; Cheng, Yu; Zheng, Shu-Sen
2007-07-01
The effect of cyclosporine A (CsA) on hepatitis B virus (HBV) replication was investigated, and proteomics expression differentiation after CsA treatment was studied in order to provide clues to explore the effect of CsA on HBV replication. Methyl thiazolyl tetrazolium (MTT) assay was used to evaluate the cytotoxicity of CsA. The HBV replication level in the HBV genomic DNA transfected HepG2.2.15 cell line was determined by an ELISA analysis of hepatitis B surface antigens (HBsAg) and Hepatitis B e antigens (HBeAg) in culture supernatant, while the intracellular HBV DNA replication level was analyzed by slot blot hybridization. Two-dimensional electrophoresis was used to investigate the alteration of protein expression in HepG2.2.15 after CsA treatment in vitro. The differentially-expressed proteins were identified by Matrix-assisted laser desorption/ionization-time of flight mass spectrometry combined with an online database search. CsA was able to inhibit the expression of HBsAg, HBeAg, and HBV DNA replication in vitro in a dose-dependent manner. A proteomics analysis indicated that the expression of 17 proteins changed significantly in the CsA treatment group compared to the control group. Eleven of the 17 proteins were identified, including the overexpression of eukaryotic translation initiation factors (eIF) 3k, otubain 1, 14.3.3 protein, eIF2-1 alpha, eIF5A, and the tyrosine 3/tryptophan 5-mono-oxygenase activation protein in CsA-treated HepG2.2.15 cells. The downregulation of the ferritin light subunit, erythrocyte cytosolic protein of 51 kDa (ECP-51), stathmin 1/oncoprotein, adenine phosphoribosyl-transferase, and the position of a tumor protein, translationally controlled 1, was shifted, suggesting it had undergone posttranslational modifications. Our study identified the inhibitory effect of CsA on HBV replication, and found that a group of proteins may be responsible for this inhibitory effect.
Bertolde, F Z; Almeida, A-A F; Silva, F A C; Oliveira, T M; Pirovani, C P
2014-07-04
Theobroma cacao is a woody and recalcitrant plant with a very high level of interfering compounds. Standard protocols for protein extraction were proposed for various types of samples, but the presence of interfering compounds in many samples prevented the isolation of proteins suitable for two-dimensional gel electrophoresis (2-DE). An efficient method to extract root proteins for 2-DE was established to overcome these problems. The main features of this protocol are: i) precipitation with trichloroacetic acid/acetone overnight to prepare the acetone dry powder (ADP), ii) several additional steps of sonication in the ADP preparation and extractions with dense sodium dodecyl sulfate and phenol, and iii) adding two stages of phenol extractions. Proteins were extracted from roots using this new protocol (Method B) and a protocol described in the literature for T. cacao leaves and meristems (Method A). Using these methods, we obtained a protein yield of about 0.7 and 2.5 mg per 1.0 g lyophilized root, and a total of 60 and 400 spots could be separated, respectively. Through Method B, it was possible to isolate high-quality protein and a high yield of roots from T. cacao for high-quality 2-DE gels. To demonstrate the quality of the extracted proteins from roots of T. cacao using Method B, several protein spots were cut from the 2-DE gels, analyzed by tandem mass spectrometry, and identified. Method B was further tested on Citrus roots, with a protein yield of about 2.7 mg per 1.0 g lyophilized root and 800 detected spots.
Côté, Richard G; Jones, Philip; Martens, Lennart; Kerrien, Samuel; Reisinger, Florian; Lin, Quan; Leinonen, Rasko; Apweiler, Rolf; Hermjakob, Henning
2007-10-18
Each major protein database uses its own conventions when assigning protein identifiers. Resolving the various, potentially unstable, identifiers that refer to identical proteins is a major challenge. This is a common problem when attempting to unify datasets that have been annotated with proteins from multiple data sources or querying data providers with one flavour of protein identifiers when the source database uses another. Partial solutions for protein identifier mapping exist but they are limited to specific species or techniques and to a very small number of databases. As a result, we have not found a solution that is generic enough and broad enough in mapping scope to suit our needs. We have created the Protein Identifier Cross-Reference (PICR) service, a web application that provides interactive and programmatic (SOAP and REST) access to a mapping algorithm that uses the UniProt Archive (UniParc) as a data warehouse to offer protein cross-references based on 100% sequence identity to proteins from over 70 distinct source databases loaded into UniParc. Mappings can be limited by source database, taxonomic ID and activity status in the source database. Users can copy/paste or upload files containing protein identifiers or sequences in FASTA format to obtain mappings using the interactive interface. Search results can be viewed in simple or detailed HTML tables or downloaded as comma-separated values (CSV) or Microsoft Excel (XLS) files suitable for use in a local database or a spreadsheet. Alternatively, a SOAP interface is available to integrate PICR functionality in other applications, as is a lightweight REST interface. We offer a publicly available service that can interactively map protein identifiers and protein sequences to the majority of commonly used protein databases. Programmatic access is available through a standards-compliant SOAP interface or a lightweight REST interface. The PICR interface, documentation and code examples are available at http://www.ebi.ac.uk/Tools/picr.
Côté, Richard G; Jones, Philip; Martens, Lennart; Kerrien, Samuel; Reisinger, Florian; Lin, Quan; Leinonen, Rasko; Apweiler, Rolf; Hermjakob, Henning
2007-01-01
Background Each major protein database uses its own conventions when assigning protein identifiers. Resolving the various, potentially unstable, identifiers that refer to identical proteins is a major challenge. This is a common problem when attempting to unify datasets that have been annotated with proteins from multiple data sources or querying data providers with one flavour of protein identifiers when the source database uses another. Partial solutions for protein identifier mapping exist but they are limited to specific species or techniques and to a very small number of databases. As a result, we have not found a solution that is generic enough and broad enough in mapping scope to suit our needs. Results We have created the Protein Identifier Cross-Reference (PICR) service, a web application that provides interactive and programmatic (SOAP and REST) access to a mapping algorithm that uses the UniProt Archive (UniParc) as a data warehouse to offer protein cross-references based on 100% sequence identity to proteins from over 70 distinct source databases loaded into UniParc. Mappings can be limited by source database, taxonomic ID and activity status in the source database. Users can copy/paste or upload files containing protein identifiers or sequences in FASTA format to obtain mappings using the interactive interface. Search results can be viewed in simple or detailed HTML tables or downloaded as comma-separated values (CSV) or Microsoft Excel (XLS) files suitable for use in a local database or a spreadsheet. Alternatively, a SOAP interface is available to integrate PICR functionality in other applications, as is a lightweight REST interface. Conclusion We offer a publicly available service that can interactively map protein identifiers and protein sequences to the majority of commonly used protein databases. Programmatic access is available through a standards-compliant SOAP interface or a lightweight REST interface. The PICR interface, documentation and code examples are available at . PMID:17945017
Transterm: a database to aid the analysis of regulatory sequences in mRNAs
Jacobs, Grant H.; Chen, Augustine; Stevens, Stewart G.; Stockwell, Peter A.; Black, Michael A.; Tate, Warren P.; Brown, Chris M.
2009-01-01
Messenger RNAs, in addition to coding for proteins, may contain regulatory elements that affect how the protein is translated. These include protein and microRNA-binding sites. Transterm (http://mRNA.otago.ac.nz/Transterm.html) is a database of regions and elements that affect translation with two major unique components. The first is integrated results of analysis of general features that affect translation (initiation, elongation, termination) for species or strains in Genbank, processed through a standard pipeline. The second is curated descriptions of experimentally determined regulatory elements that function as translational control elements in mRNAs. Transterm focuses on protein binding sites, particularly those in 3′-untranslated regions (3′-UTR). For this release the interface has been extensively updated based on user feedback. The data is now accessible by strain rather than species, for example there are 10 Escherichia coli strains (genomes) analysed separately. In addition to providing a repository of data, the database also provides tools for users to query their own mRNA sequences. Users can search sequences for Transterm or user defined regulatory elements, including protein or miRNA targets. Transterm also provides a central core of links to related resources for complementary analyses. PMID:18984623
3DProIN: Protein-Protein Interaction Networks and Structure Visualization.
Li, Hui; Liu, Chunmei
2014-06-14
3DProIN is a computational tool to visualize protein-protein interaction networks in both two dimensional (2D) and three dimensional (3D) view. It models protein-protein interactions in a graph and explores the biologically relevant features of the tertiary structures of each protein in the network. Properties such as color, shape and name of each node (protein) of the network can be edited in either 2D or 3D views. 3DProIN is implemented using 3D Java and C programming languages. The internet crawl technique is also used to parse dynamically grasped protein interactions from protein data bank (PDB). It is a java applet component that is embedded in the web page and it can be used on different platforms including Linux, Mac and Window using web browsers such as Firefox, Internet Explorer, Chrome and Safari. It also was converted into a mac app and submitted to the App store as a free app. Mac users can also download the app from our website. 3DProIN is available for academic research at http://bicompute.appspot.com.
Three-dimensional electron diffraction of plant light-harvesting complex
Wang, Da Neng; Kühlbrandt, Werner
1992-01-01
Electron diffraction patterns of two-dimensional crystals of light-harvesting chlorophyll a/b-protein complex (LHC-II) from photosynthetic membranes of pea chloroplasts, tilted at different angles up to 60°, were collected to 3.2 Å resolution at -125°C. The reflection intensities were merged into a three-dimensional data set. The Friedel R-factor and the merging R-factor were 21.8 and 27.6%, respectively. Specimen flatness and crystal size were critical for recording electron diffraction patterns from crystals at high tilts. The principal sources of experimental error were attributed to limitations of the number of unit cells contributing to an electron diffraction pattern, and to the critical electron dose. The distribution of strong diffraction spots indicated that the three-dimensional structure of LHC-II is less regular than that of other known membrane proteins and is not dominated by a particular feature of secondary structure. ImagesFIGURE 1FIGURE 2 PMID:19431817
The Brain Database: A Multimedia Neuroscience Database for Research and Teaching
Wertheim, Steven L.
1989-01-01
The Brain Database is an information tool designed to aid in the integration of clinical and research results in neuroanatomy and regional biochemistry. It can handle a wide range of data types including natural images, 2 and 3-dimensional graphics, video, numeric data and text. It is organized around three main entities: structures, substances and processes. The database will support a wide variety of graphical interfaces. Two sample interfaces have been made. This tool is intended to serve as one component of a system that would allow neuroscientists and clinicians 1) to represent clinical and experimental data within a common framework 2) to compare results precisely between experiments and among laboratories, 3) to use computing tools as an aid in collaborative work and 4) to contribute to a shared and accessible body of knowledge about the nervous system.
Robasky, Kimberly; Bulyk, Martha L
2011-01-01
The Universal PBM Resource for Oligonucleotide-Binding Evaluation (UniPROBE) database is a centralized repository of information on the DNA-binding preferences of proteins as determined by universal protein-binding microarray (PBM) technology. Each entry for a protein (or protein complex) in UniPROBE provides the quantitative preferences for all possible nucleotide sequence variants ('words') of length k ('k-mers'), as well as position weight matrix (PWM) and graphical sequence logo representations of the k-mer data. In this update, we describe >130% expansion of the database content, incorporation of a protein BLAST (blastp) tool for finding protein sequence matches in UniPROBE, the introduction of UniPROBE accession numbers and additional database enhancements. The UniPROBE database is available at http://uniprobe.org.
Hexadecameric structure of an invertebrate gap junction channel.
Oshima, Atsunori; Matsuzawa, Tomohiro; Murata, Kazuyoshi; Tani, Kazutoshi; Fujiyoshi, Yoshinori
2016-03-27
Innexins are invertebrate-specific gap junction proteins with four transmembrane helices. These proteins oligomerize to constitute intercellular channels that allow for the passage of small signaling molecules associated with neural and muscular electrical activity. In contrast to the large number of structural and functional studies of connexin gap junction channels, few structural studies of recombinant innexin channels are reported. Here we show the three-dimensional structure of two-dimensionally crystallized Caenorhabditis elegans innexin-6 (INX-6) gap junction channels. The N-terminal deleted INX-6 proteins are crystallized in lipid bilayers. The three-dimensional reconstruction determined by cryo-electron crystallography reveals that a single INX-6 gap junction channel comprises 16 subunits, a hexadecamer, in contrast to chordate connexin channels, which comprise 12 subunits. The channel pore diameters at the cytoplasmic entrance and extracellular gap region are larger than those of connexin26. Two bulb densities are observed in each hemichannel, one in the pore and the other at the cytoplasmic side of the hemichannel in the channel pore pathway. These findings imply a structural diversity of gap junction channels among multicellular organisms. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
NASA Astrophysics Data System (ADS)
Verlinde, Christophe L. M. J.; Rudenko, Gabrielle; Hol, Wim G. J.
1992-04-01
A modular method for pursuing structure-based inhibitor design in the framework of a design cycle is presented. The approach entails four stages: (1) a design pathway is defined in the three-dimensional structure of a target protein; (2) this pathway is divided into subregions; (3) complementary building blocks, also called fragments, are designed in each subregion; complementarity is defined in terms of shape, hydrophobicity, hydrogen bond properties and electrostatics; and (4) fragments from different subregions are linked into potential lead compounds. Stages (3) and (4) are qualitatively guided by force-field calculations. In addition, the designed fragments serve as entries for retrieving existing compounds from chemical databases. This linked-fragment approach has been applied in the design of potentially selective inhibitors of triosephosphate isomerase from Trypanosoma brucei, the causative agent of sleeping sickness.
Elguoshy, Amr; Hirao, Yoshitoshi; Xu, Bo; Saito, Suguru; Quadery, Ali F; Yamamoto, Keiko; Mitsui, Toshiaki; Yamamoto, Tadashi
2017-12-01
In an attempt to complete human proteome project (HPP), Chromosome-Centric Human Proteome Project (C-HPP) launched the journey of missing protein (MP) investigation in 2012. However, 2579 and 572 protein entries in the neXtProt (2017-1) are still considered as missing and uncertain proteins, respectively. Thus, in this study, we proposed a pipeline to analyze, identify, and validate human missing and uncertain proteins in open-access transcriptomics and proteomics databases. Analysis of RNA expression pattern for missing proteins in Human protein Atlas showed that 28% of them, such as Olfactory receptor 1I1 ( O60431 ), had no RNA expression, suggesting the necessity to consider uncommon tissues for transcriptomic and proteomic studies. Interestingly, 21% had elevated expression level in a particular tissue (tissue-enriched proteins), indicating the importance of targeting such proteins in their elevated tissues. Additionally, the analysis of RNA expression level for missing proteins showed that 95% had no or low expression level (0-10 transcripts per million), indicating that low abundance is one of the major obstacles facing the detection of missing proteins. Moreover, missing proteins are predicted to generate fewer predicted unique tryptic peptides than the identified proteins. Searching for these predicted unique tryptic peptides that correspond to missing and uncertain proteins in the experimental peptide list of open-access MS-based databases (PA, GPM) resulted in the detection of 402 missing and 19 uncertain proteins with at least two unique peptides (≥9 aa) at <(5 × 10 -4 )% FDR. Finally, matching the native spectra for the experimentally detected peptides with their SRMAtlas synthetic counterparts at three transition sources (QQQ, QTOF, QTRAP) gave us an opportunity to validate 41 missing proteins by ≥2 proteotypic peptides.
Yamaguchi, Akihiro; Go, Mitiko
2006-01-01
We have been developing FAMSBASE, a protein homology-modeling database of whole ORFs predicted from genome sequences. The latest update of FAMSBASE (http://daisy.nagahama-i-bio.ac.jp/Famsbase/), which is based on the protein three-dimensional (3D) structures released by November 2003, contains modeled 3D structures for 368,724 open reading frames (ORFs) derived from genomes of 276 species, namely 17 archaebacterial, 130 eubacterial, 18 eukaryotic and 111 phage genomes. Those 276 genomes are predicted to have 734,193 ORFs in total and the current FAMSBASE contains protein 3D structure of approximately 50% of the ORF products. However, cases that a modeled 3D structure covers the whole part of an ORF product are rare. When portion of an ORF with 3D structure is compared in three kingdoms of life, in archaebacteria and eubacteria, approximately 60% of the ORFs have modeled 3D structures covering almost the entire amino acid sequences, however, the percentage falls to about 30% in eukaryotes. When annual differences in the number of ORFs with modeled 3D structure are calculated, the fraction of modeled 3D structures of soluble protein for archaebacteria is increased by 5%, and that for eubacteria by 7% in the last 3 years. Assuming that this rate would be maintained and that determination of 3D structures for predicted disordered regions is unattainable, whole soluble protein model structures of prokaryotes without the putative disordered regions will be in hand within 15 years. For eukaryotic proteins, they will be in hand within 25 years. The 3D structures we will have at those times are not the 3D structure of the entire proteins encoded in single ORFs, but the 3D structures of separate structural domains. Measuring or predicting spatial arrangements of structural domains in an ORF will then be a coming issue of structural genomics. PMID:17146617
Song, Wen Jun; Qin, Qi Wei; Qiu, Jin; Huang, Can Hua; Wang, Fan; Hew, Choy Leong
2004-01-01
Here we report the complete genome sequence of Singapore grouper iridovirus (SGIV). Sequencing of the random shotgun and restriction endonuclease genomic libraries showed that the entire SGIV genome consists of 140,131 nucleotide bp. One hundred sixty-two open reading frames (ORFs) from the sense and antisense DNA strands, coding for lengths varying from 41 to 1,268 amino acids, were identified. Computer-assisted analyses of the deduced amino acid sequences revealed that 77 of the ORFs exhibited homologies to known virus genes, 23 of which matched functional iridovirus proteins. Forty-two putative conserved domains or signatures were detected in the National Center for Biotechnology Information CD-Search database and PROSITE database. An assortment of enzyme activities involved in DNA replication, transcription, nucleotide metabolism, cell signaling, etc., were identified. Viruses were cultured on a cell line derived from the embryonated egg of the grouper Epinephelus tauvina, isolated, and purified by sucrose gradient ultracentrifugation. The protein extract from the purified virions was analyzed by polyacrylamide gel electrophoresis followed by in-gel digestion of protein bands. Matrix-assisted laser desorption ionization-time of flight mass spectrometry and database searching led to identification of 26 proteins. Twenty of these represented novel or previously unidentified genes, which were further confirmed by reverse transcription-PCR (RT-PCR) and DNA sequencing of their respective RT-PCR products. PMID:15507645
MIPS: a database for protein sequences, homology data and yeast genome information.
Mewes, H W; Albermann, K; Heumann, K; Liebl, S; Pfeiffer, F
1997-01-01
The MIPS group (Martinsried Institute for Protein Sequences) at the Max-Planck-Institute for Biochemistry, Martinsried near Munich, Germany, collects, processes and distributes protein sequence data within the framework of the tripartite association of the PIR-International Protein Sequence Database (,). MIPS contributes nearly 50% of the data input to the PIR-International Protein Sequence Database. The database is distributed on CD-ROM together with PATCHX, an exhaustive supplement of unique, unverified protein sequences from external sources compiled by MIPS. Through its WWW server (http://www.mips.biochem.mpg.de/ ) MIPS permits internet access to sequence databases, homology data and to yeast genome information. (i) Sequence similarity results from the FASTA program () are stored in the FASTA database for all proteins from PIR-International and PATCHX. The database is dynamically maintained and permits instant access to FASTA results. (ii) Starting with FASTA database queries, proteins have been classified into families and superfamilies (PROT-FAM). (iii) The HPT (hashed position tree) data structure () developed at MIPS is a new approach for rapid sequence and pattern searching. (iv) MIPS provides access to the sequence and annotation of the complete yeast genome (), the functional classification of yeast genes (FunCat) and its graphical display, the 'Genome Browser' (). A CD-ROM based on the JAVA programming language providing dynamic interactive access to the yeast genome and the related protein sequences has been compiled and is available on request. PMID:9016498
Hao, Ruijie; Adoligbe, Camus; Jiang, Bijie; Zhao, Xianlin; Gui, Linsheng; Qu, Kaixing; Wu, Sen; Zan, Linsen
2015-01-01
Longissimus dorsi muscle (LD) proteomics provides a novel opportunity to reveal the molecular mechanism behind intramuscular fat deposition. Unfortunately, the vast amounts of lipids and nucleic acids in this tissue hampered LD proteomics analysis. Trichloroacetic acid (TCA)/acetone precipitation is a widely used method to remove contaminants from protein samples. However, the high speed centrifugation employed in this method produces hard precipitates, which restrict contaminant elimination and protein re-dissolution. To address the problem, the centrifugation precipitates were first grinded with a glass tissue grinder and then washed with 90% acetone (TCA/acetone-G-W) in the present study. According to our result, the treatment for solid precipitate facilitated non-protein contaminant removal and protein re-dissolution, ultimately improving two-dimensional gel electrophoresis (2-DE) analysis. Additionally, we also evaluated the effect of sample drying on 2-DE profile as well as protein yield. It was found that 30 min air-drying did not result in significant protein loss, but reduced horizontal streaking and smearing on 2-DE gel compared to 10 min. In summary, we developed an optimized TCA/acetone precipitation method for protein extraction of LD, in which the modifications improved the effectiveness of TCA/acetone method.
Hao, Ruijie; Adoligbe, Camus; Jiang, Bijie; Zhao, Xianlin; Gui, Linsheng; Qu, Kaixing; Wu, Sen; Zan, Linsen
2015-01-01
Longissimus dorsi muscle (LD) proteomics provides a novel opportunity to reveal the molecular mechanism behind intramuscular fat deposition. Unfortunately, the vast amounts of lipids and nucleic acids in this tissue hampered LD proteomics analysis. Trichloroacetic acid (TCA)/acetone precipitation is a widely used method to remove contaminants from protein samples. However, the high speed centrifugation employed in this method produces hard precipitates, which restrict contaminant elimination and protein re-dissolution. To address the problem, the centrifugation precipitates were first grinded with a glass tissue grinder and then washed with 90% acetone (TCA/acetone-G-W) in the present study. According to our result, the treatment for solid precipitate facilitated non-protein contaminant removal and protein re-dissolution, ultimately improving two-dimensional gel electrophoresis (2-DE) analysis. Additionally, we also evaluated the effect of sample drying on 2-DE profile as well as protein yield. It was found that 30 min air-drying did not result in significant protein loss, but reduced horizontal streaking and smearing on 2-DE gel compared to 10 min. In summary, we developed an optimized TCA/acetone precipitation method for protein extraction of LD, in which the modifications improved the effectiveness of TCA/acetone method. PMID:25893432
Contaminant screening of wastewater with HPLC-IM-qTOF-MS and LC+LC-IM-qTOF-MS using a CCS database.
Stephan, Susanne; Hippler, Joerg; Köhler, Timo; Deeb, Ahmad A; Schmidt, Torsten C; Schmitz, Oliver J
2016-09-01
Non-target analysis has become an important tool in the field of water analysis since a broad variety of pollutants from different sources are released to the water cycle. For identification of compounds in such complex samples, liquid chromatography coupled to high resolution mass spectrometry are often used. The introduction of ion mobility spectrometry provides an additional separation dimension and allows determining collision cross sections (CCS) of the analytes as a further physicochemical constant supporting the identification. A CCS database with more than 500 standard substances including drug-like compounds and pesticides was used for CCS data base search in this work. A non-target analysis of a wastewater sample was initially performed with high performance liquid chromatography (HPLC) coupled to an ion mobility-quadrupole-time of flight mass spectrometer (IM-qTOF-MS). A database search including exact mass (±5 ppm) and CCS (±1 %) delivered 22 different compounds. Furthermore, the same sample was analyzed with a two-dimensional LC method, called LC+LC, developed in our group for the coupling to IM-qTOF-MS. This four dimensional separation platform revealed 53 different compounds, identified over exact mass and CCS, in the examined wastewater sample. It is demonstrated that the CCS database can also help to distinguish between isobaric structures exemplified for cyclophosphamide and ifosfamide. Graphical Abstract Scheme of sample analysis and database screening.
A global optimization algorithm for protein surface alignment
2010-01-01
Background A relevant problem in drug design is the comparison and recognition of protein binding sites. Binding sites recognition is generally based on geometry often combined with physico-chemical properties of the site since the conformation, size and chemical composition of the protein surface are all relevant for the interaction with a specific ligand. Several matching strategies have been designed for the recognition of protein-ligand binding sites and of protein-protein interfaces but the problem cannot be considered solved. Results In this paper we propose a new method for local structural alignment of protein surfaces based on continuous global optimization techniques. Given the three-dimensional structures of two proteins, the method finds the isometric transformation (rotation plus translation) that best superimposes active regions of two structures. We draw our inspiration from the well-known Iterative Closest Point (ICP) method for three-dimensional (3D) shapes registration. Our main contribution is in the adoption of a controlled random search as a more efficient global optimization approach along with a new dissimilarity measure. The reported computational experience and comparison show viability of the proposed approach. Conclusions Our method performs well to detect similarity in binding sites when this in fact exists. In the future we plan to do a more comprehensive evaluation of the method by considering large datasets of non-redundant proteins and applying a clustering technique to the results of all comparisons to classify binding sites. PMID:20920230
NPIDB: Nucleic acid-Protein Interaction DataBase.
Kirsanov, Dmitry D; Zanegina, Olga N; Aksianov, Evgeniy A; Spirin, Sergei A; Karyagina, Anna S; Alexeevski, Andrei V
2013-01-01
The Nucleic acid-Protein Interaction DataBase (http://npidb.belozersky.msu.ru/) contains information derived from structures of DNA-protein and RNA-protein complexes extracted from the Protein Data Bank (3846 complexes in October 2012). It provides a web interface and a set of tools for extracting biologically meaningful characteristics of nucleoprotein complexes. The content of the database is updated weekly. The current version of the Nucleic acid-Protein Interaction DataBase is an upgrade of the version published in 2007. The improvements include a new web interface, new tools for calculation of intermolecular interactions, a classification of SCOP families that contains DNA-binding protein domains and data on conserved water molecules on the DNA-protein interface.
The Ramachandran Number: An Order Parameter for Protein Geometry
Mannige, Ranjan V.; Kundu, Joyjit; Whitelam, Stephen; ...
2016-08-04
Three-dimensional protein structures usually contain regions of local order, called secondary structure, such as α-helices and β-sheets. Secondary structure is characterized by the local rotational state of the protein backbone, quantified by two dihedral angles called Øand Ψ. Particular types of secondary structure can generally be described by a single (diffuse) location on a two-dimensional plot drawn in the space of the angles Ø andΨ, called a Ramachandran plot. By contrast, a recently-discovered nanomaterial made from peptoids, structural isomers of peptides, displays a secondary-structure motif corresponding to two regions on the Ramachandran plot [Mannige et al., Nature 526, 415 (2015)].more » In order to describe such 'higher-order' secondary structure in a compact way we introduce here a means of describing regions on the Ramachandran plot in terms of a single Ramachandran number, R, which is a structurally meaningful combination of Ø andΨ. We show that the potential applications of R are numerous: it can be used to describe the geometric content of protein structures, and can be used to draw diagrams that reveal, at a glance, the frequency of occurrence of regular secondary structures and disordered regions in large protein datasets. We propose that R might be used as an order parameter for protein geometry for a wide range of applications.« less
Paquet, M J; Laviolette, M; Pézolet, M; Auger, M
2001-01-01
Two-dimensional infrared correlation spectroscopy (2D-IR) was used in this study to investigate the aggregation of cytochrome c in the presence of dimyristoylphosphatidylglycerol. The influence of temperature on the aggregation has been evaluated by monitoring the intensity of a band at 1616 cm(-1), which is characteristic of aggregated proteins, and the 2D-IR analysis has been used to determine the various secondary structure components of cytochrome c involved before and during its aggregation. The 2D-IR correlation analysis clearly reveals for the first time that aggregation starts to occur between nearly native proteins, which then unfold, yielding to further aggregation of the protein. Later in the aggregation process, the formation of intermolecular bonds and unfolding of the alpha-helices appear to be simultaneous. These results lead us to propose a two-step aggregation process. Finally, the results obtained during the heating period clearly indicate that before the protein starts to aggregate, there is a loosening of the tertiary structure of cytochrome c, resulting in a decrease of the beta-sheet content and an increase of the amount of beta-turns. This study clearly demonstrates the potential of 2D-IR spectroscopy to investigate the aggregation of proteins and this technique could therefore be applied to other proteins such as those involved in fibrilogenesis. PMID:11423415
Clement, Cristina C.; Aphkhazava, David; Nieves, Edward; Callaway, Myrasol; Olszewski, Waldemar; Rotzschke, Olaf; Santambrogio, Laura
2013-01-01
In this study a proteomic approach was used to define the protein content of matched samples of afferent prenodal lymph and plasma derived from healthy volunteers. The analysis was performed using two analytical methodologies coupled with nanoliquid chromatography-tandem mass spectrometry: one-dimensional gel electrophoresis (1DEF nanoLC Orbitrap–ESI–MS/MS), and two-dimensional fluorescence difference-in-gel electrophoresis (2D-DIGE nanoLC–ESI–MS/MS). The 253 significantly identified proteins (p<0.05), obtained from the tandem mass spectrometry data, were further analyzed with pathway analysis (IPA) to define the functional signature of prenodal lymph and matched plasma. The 1DEF coupled with nanoLC–MS–MS revealed that the common proteome between the two biological fluids (144 out of 253 proteins) was dominated by complement activation and blood coagulation components, transporters and protease inhibitors. The enriched proteome of human lymph (72 proteins) consisted of products derived from the extracellular matrix, apoptosis and cellular catabolism. In contrast, the enriched proteome of human plasma (37 proteins) consisted of soluble molecules of the coagulation system and cell–cell signaling factors. The functional networks associated with both common and source-distinctive proteomes highlight the principal biological activity of these immunologically relevant body fluids. PMID:23202415
The value of protein structure classification information—Surveying the scientific literature
Fox, Naomi K.; Brenner, Steven E.
2015-01-01
ABSTRACT The Structural Classification of Proteins (SCOP) and Class, Architecture, Topology, Homology (CATH) databases have been valuable resources for protein structure classification for over 20 years. Development of SCOP (version 1) concluded in June 2009 with SCOP 1.75. The SCOPe (SCOP–extended) database offers continued development of the classic SCOP hierarchy, adding over 33,000 structures. We have attempted to assess the impact of these two decade old resources and guide future development. To this end, we surveyed recent articles to learn how structure classification data are used. Of 571 articles published in 2012–2013 that cite SCOP, 439 actually use data from the resource. We found that the type of use was fairly evenly distributed among four top categories: A) study protein structure or evolution (27% of articles), B) train and/or benchmark algorithms (28% of articles), C) augment non‐SCOP datasets with SCOP classification (21% of articles), and D) examine the classification of one protein/a small set of proteins (22% of articles). Most articles described computational research, although 11% described purely experimental research, and a further 9% included both. We examined how CATH and SCOP were used in 158 articles that cited both databases: while some studies used only one dataset, the majority used data from both resources. Protein structure classification remains highly relevant for a diverse range of problems and settings. Proteins 2015; 83:2025–2038. © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc. PMID:26313554
Góral, Tomasz; Kwiatek, Michał; Majka, Maciej; Kosmala, Arkadiusz
2014-01-01
Numerous potential components involved in the resistance to Fusarium head blight (FHB) in cereals have been indicated, however, our knowledge regarding this process is still limited and further work is required. Two winter wheat (Triticum aestivum L.) lines differing in their levels of resistance to FHB were analyzed to identify the most crucial proteins associated with resistance in this species. The presented work involved analysis of protein abundance in the kernel bulks of more resistant and more susceptible wheat lines using two-dimensional gel electrophoresis and mass spectrometry identification of proteins, which were differentially accumulated between the analyzed lines, after inoculation with F. culmorum under field conditions. All the obtained two-dimensional patterns were demonstrated to be well-resolved protein maps of kernel proteomes. Although, 11 proteins were shown to have significantly different abundance between these two groups of plants, only two are likely to be crucial and have a potential role in resistance to FHB. Monomeric alpha-amylase and dimeric alpha-amylase inhibitors, both highly accumulated in the more resistant line, after inoculation and in the control conditions. Fusarium pathogens can use hydrolytic enzymes, including amylases to colonize kernels and acquire nitrogen and carbon from the endosperm and we suggest that the inhibition of pathogen amylase activity could be one of the most crucial mechanisms to prevent infection progress in the analyzed wheat line with a higher resistance. Alpha-amylase activity assays confirmed this suggestion as it revealed the highest level of enzyme activity, after F. culmorum infection, in the line more susceptible to FHB. PMID:25340555
Perlikowski, Dawid; Wiśniewska, Halina; Góral, Tomasz; Kwiatek, Michał; Majka, Maciej; Kosmala, Arkadiusz
2014-01-01
Numerous potential components involved in the resistance to Fusarium head blight (FHB) in cereals have been indicated, however, our knowledge regarding this process is still limited and further work is required. Two winter wheat (Triticum aestivum L.) lines differing in their levels of resistance to FHB were analyzed to identify the most crucial proteins associated with resistance in this species. The presented work involved analysis of protein abundance in the kernel bulks of more resistant and more susceptible wheat lines using two-dimensional gel electrophoresis and mass spectrometry identification of proteins, which were differentially accumulated between the analyzed lines, after inoculation with F. culmorum under field conditions. All the obtained two-dimensional patterns were demonstrated to be well-resolved protein maps of kernel proteomes. Although, 11 proteins were shown to have significantly different abundance between these two groups of plants, only two are likely to be crucial and have a potential role in resistance to FHB. Monomeric alpha-amylase and dimeric alpha-amylase inhibitors, both highly accumulated in the more resistant line, after inoculation and in the control conditions. Fusarium pathogens can use hydrolytic enzymes, including amylases to colonize kernels and acquire nitrogen and carbon from the endosperm and we suggest that the inhibition of pathogen amylase activity could be one of the most crucial mechanisms to prevent infection progress in the analyzed wheat line with a higher resistance. Alpha-amylase activity assays confirmed this suggestion as it revealed the highest level of enzyme activity, after F. culmorum infection, in the line more susceptible to FHB.
The PMDB Protein Model Database
Castrignanò, Tiziana; De Meo, Paolo D'Onorio; Cozzetto, Domenico; Talamo, Ivano Giuseppe; Tramontano, Anna
2006-01-01
The Protein Model Database (PMDB) is a public resource aimed at storing manually built 3D models of proteins. The database is designed to provide access to models published in the scientific literature, together with validating experimental data. It is a relational database and it currently contains >74 000 models for ∼240 proteins. The system is accessible at and allows predictors to submit models along with related supporting evidence and users to download them through a simple and intuitive interface. Users can navigate in the database and retrieve models referring to the same target protein or to different regions of the same protein. Each model is assigned a unique identifier that allows interested users to directly access the data. PMID:16381873
PDB_TM: selection and membrane localization of transmembrane proteins in the protein data bank.
Tusnády, Gábor E; Dosztányi, Zsuzsanna; Simon, István
2005-01-01
PDB_TM is a database for transmembrane proteins with known structures. It aims to collect all transmembrane proteins that are deposited in the protein structure database (PDB) and to determine their membrane-spanning regions. These assignments are based on the TMDET algorithm, which uses only structural information to locate the most likely position of the lipid bilayer and to distinguish between transmembrane and globular proteins. This algorithm was applied to all PDB entries and the results were collected in the PDB_TM database. By using TMDET algorithm, the PDB_TM database can be automatically updated every week, keeping it synchronized with the latest PDB updates. The PDB_TM database is available at http://www.enzim.hu/PDB_TM.
Worldwide Protein Data Bank validation information: usage and trends.
Smart, Oliver S; Horský, Vladimír; Gore, Swanand; Svobodová Vařeková, Radka; Bendová, Veronika; Kleywegt, Gerard J; Velankar, Sameer
2018-03-01
Realising the importance of assessing the quality of the biomolecular structures deposited in the Protein Data Bank (PDB), the Worldwide Protein Data Bank (wwPDB) partners established Validation Task Forces to obtain advice on the methods and standards to be used to validate structures determined by X-ray crystallography, nuclear magnetic resonance spectroscopy and three-dimensional electron cryo-microscopy. The resulting wwPDB validation pipeline is an integral part of the wwPDB OneDep deposition, biocuration and validation system. The wwPDB Validation Service webserver (https://validate.wwpdb.org) can be used to perform checks prior to deposition. Here, it is shown how validation metrics can be combined to produce an overall score that allows the ranking of macromolecular structures and domains in search results. The ValTrends DB database provides users with a convenient way to access and analyse validation information and other properties of X-ray crystal structures in the PDB, including investigating trends in and correlations between different structure properties and validation metrics.
Worldwide Protein Data Bank validation information: usage and trends
Horský, Vladimír; Gore, Swanand; Svobodová Vařeková, Radka; Bendová, Veronika
2018-01-01
Realising the importance of assessing the quality of the biomolecular structures deposited in the Protein Data Bank (PDB), the Worldwide Protein Data Bank (wwPDB) partners established Validation Task Forces to obtain advice on the methods and standards to be used to validate structures determined by X-ray crystallography, nuclear magnetic resonance spectroscopy and three-dimensional electron cryo-microscopy. The resulting wwPDB validation pipeline is an integral part of the wwPDB OneDep deposition, biocuration and validation system. The wwPDB Validation Service webserver (https://validate.wwpdb.org) can be used to perform checks prior to deposition. Here, it is shown how validation metrics can be combined to produce an overall score that allows the ranking of macromolecular structures and domains in search results. The ValTrendsDB database provides users with a convenient way to access and analyse validation information and other properties of X-ray crystal structures in the PDB, including investigating trends in and correlations between different structure properties and validation metrics. PMID:29533231
NASA Astrophysics Data System (ADS)
Trombley, N.; Weber, E.; Moehl, J.
2017-12-01
Many studies invoke dasymetric mapping to make more accurate depictions of population distribution by spatially restricting populations to inhabited/inhabitable portions of observational units (e.g., census blocks) and/or by varying population density among different land classes. LandScan USA uses this approach by restricting particular population components (such as residents or workers) to building area detected from remotely sensed imagery, but also goes a step further by classifying each cell of building area in accordance with ancillary land use information from national parcel data (CoreLogic, Inc.'s ParcelPoint database). Modeling population density according to land use is critical. For instance, office buildings would have a higher density of workers than warehouses even though the latter would likely have more cells of detection. This paper presents a modeling approach by which different land uses are assigned different densities to more accurately distribute populations within them. For parts of the country where the parcel data is insufficient, an alternate methodology is developed that uses National Land Cover Database (NLCD) data to define the land use type of building detection. Furthermore, LiDAR data is incorporated for many of the largest cities across the US, allowing the independent variables to be updated from two-dimensional building detection area to total building floor space. In the end, four different regression models are created to explain the effect of different land uses on worker distribution: A two-dimensional model using land use types from the parcel data A three-dimensional model using land use types from the parcel data A two-dimensional model using land use types from the NLCD data, and A three-dimensional model using land use types from the NLCD data. By and large, the resultant coefficients followed intuition, but importantly allow the relationships between different land uses to be quantified. For instance, in the model using two-dimensional building area, commercial building area had a density 2.5 times greater than public building area and 4 times greater than industrial building area. These coefficients can be applied to define the ratios at which population is distributed to building cells. Finally, possible avenues for refining the methodology are presented.
Kinase Pathway Database: An Integrated Protein-Kinase and NLP-Based Protein-Interaction Resource
Koike, Asako; Kobayashi, Yoshiyuki; Takagi, Toshihisa
2003-01-01
Protein kinases play a crucial role in the regulation of cellular functions. Various kinds of information about these molecules are important for understanding signaling pathways and organism characteristics. We have developed the Kinase Pathway Database, an integrated database involving major completely sequenced eukaryotes. It contains the classification of protein kinases and their functional conservation, ortholog tables among species, protein–protein, protein–gene, and protein–compound interaction data, domain information, and structural information. It also provides an automatic pathway graphic image interface. The protein, gene, and compound interactions are automatically extracted from abstracts for all genes and proteins by natural-language processing (NLP).The method of automatic extraction uses phrase patterns and the GENA protein, gene, and compound name dictionary, which was developed by our group. With this database, pathways are easily compared among species using data with more than 47,000 protein interactions and protein kinase ortholog tables. The database is available for querying and browsing at http://kinasedb.ontology.ims.u-tokyo.ac.jp/. PMID:12799355
A Survey of Protein Structures from Archaeal Viruses
Dellas, Nikki; Lawrence, C. Martin; Young, Mark J.
2013-01-01
Viruses that infect the third domain of life, Archaea, are a newly emerging field of interest. To date, all characterized archaeal viruses infect archaea that thrive in extreme conditions, such as halophilic, hyperthermophilic, and methanogenic environments. Viruses in general, especially those replicating in extreme environments, contain highly mosaic genomes with open reading frames (ORFs) whose sequences are often dissimilar to all other known ORFs. It has been estimated that approximately 85% of virally encoded ORFs do not match known sequences in the nucleic acid databases, and this percentage is even higher for archaeal viruses (typically 90%–100%). This statistic suggests that either virus genomes represent a larger segment of sequence space and/or that viruses encode genes of novel fold and/or function. Because the overall three-dimensional fold of a protein evolves more slowly than its sequence, efforts have been geared toward structural characterization of proteins encoded by archaeal viruses in order to gain insight into their potential functions. In this short review, we provide multiple examples where structural characterization of archaeal viral proteins has indeed provided significant functional and evolutionary insight. PMID:25371334
Relevance feedback-based building recognition
NASA Astrophysics Data System (ADS)
Li, Jing; Allinson, Nigel M.
2010-07-01
Building recognition is a nontrivial task in computer vision research which can be utilized in robot localization, mobile navigation, etc. However, existing building recognition systems usually encounter the following two problems: 1) extracted low level features cannot reveal the true semantic concepts; and 2) they usually involve high dimensional data which require heavy computational costs and memory. Relevance feedback (RF), widely applied in multimedia information retrieval, is able to bridge the gap between the low level visual features and high level concepts; while dimensionality reduction methods can mitigate the high-dimensional problem. In this paper, we propose a building recognition scheme which integrates the RF and subspace learning algorithms. Experimental results undertaken on our own building database show that the newly proposed scheme appreciably enhances the recognition accuracy.
Monoclonal antibodies against trophectoderm-specific markers during mouse blastocyst formation.
Brûlet, P; Babinet, C; Kemler, R; Jacob, F
1980-01-01
Two-dimensional gel electrophoresis has allowed the detection of proteins characteristic of inner cell mass and trophectoderm in mouse blastocyst. Certain of the proteins characterizing trophectoderm copurify with intermediate filaments from trophectoderm and a trophoblastoma cell line. A monoclonal antibody prepared against proteins of these intermediate filaments labels a filament network in trophectoderm but not in inner cell mass cells. Images PMID:6933460
Tabaqchali, S; Silman, R; Holland, D
1987-01-01
A new rapid automated method for the identification and classification of microorganisms is described. It is based on the incorporation of 35S-methionine into cellular proteins and subsequent separation of the radiolabelled proteins by sodium dodecyl sulphate-polyacrylamide gel electrophoresis (SDS-PAGE). The protein patterns produced were species specific and reproducible, permitting discrimination between the species. A large number of Gram negative and Gram positive aerobic and anaerobic organisms were successfully tested. Furthermore, there were sufficient differences within species between the protein profiles to permit subdivision of the species. New typing schemes for Clostridium difficile, coagulase negative staphylococci, and Staphylococcus aureus, including the methicillin resistant strains, could thus be introduced; this has provided the basis for useful epidemiological studies. To standardise and automate the procedure an automated electrophoresis system and a two dimensional scanner were developed to scan the dried gels directly. The scanner is operated by a computer which also stores and analyses the scan data. Specific histograms are produced for each bacterial species. Pattern recognition software is used to construct databases and to compare data obtained from different gels: in this way duplicate "unknowns" can be identified. Specific small areas showing differences between various histograms can also be isolated and expanded to maximise the differences, thus providing differentiation between closely related bacterial species and the identification of differences within the species to provide new typing schemes. This system should be widely applied in clinical microbiology laboratories in the near future. Images Fig 1 Fig 2 Fig 3 Fig 4 Fig 5 Fig 6 Fig 7 Fig 8 PMID:3312300
LIU, YU; PATEL, SANJAY; NIBBE, ROD; MAXWELL, SEAN; CHOWDHURY, SALIM A.; KOYUTURK, MEHMET; ZHU, XIAOFENG; LARKIN, EMMA K.; BUXBAUM, SARAH G; PUNJABI, NARESH M.; GHARIB, SINA A.; REDLINE, SUSAN; CHANCE, MARK R.
2015-01-01
The precise molecular etiology of obstructive sleep apnea (OSA) is unknown; however recent research indicates that several interconnected aberrant pathways and molecular abnormalities are contributors to OSA. Identifying the genes and pathways associated with OSA can help to expand our understanding of the risk factors for the disease as well as provide new avenues for potential treatment. Towards these goals, we have integrated relevant high dimensional data from various sources, such as genome-wide expression data (microarray), protein-protein interaction (PPI) data and results from genome-wide association studies (GWAS) in order to define sub-network elements that connect some of the known pathways related to the disease as well as define novel regulatory modules related to OSA. Two distinct approaches are applied to identify sub-networks significantly associated with OSA. In the first case we used a biased approach based on sixty genes/proteins with known associations with sleep disorders and/or metabolic disease to seed a search using commercial software to discover networks associated with disease followed by information theoretic (mutual information) scoring of the sub-networks. In the second case we used an unbiased approach and generated an interactome constructed from publicly available gene expression profiles and PPI databases, followed by scoring of the network with p-values from GWAS data derived from OSA patients to uncover sub-networks significant for the disease phenotype. A comparison of the approaches reveals a number of proteins that have been previously known to be associated with OSA or sleep. In addition, our results indicate a novel association of Phosphoinositide 3-kinase, the STAT family of proteins and its related pathways with OSA. PMID:21121029
Kim, So W.; Gupta, Ravi; Lee, Seo H.; Min, Cheol W.; Agrawal, Ganesh K.; Rakwal, Randeep; Kim, Jong B.; Jo, Ick H.; Park, Soo-Yun; Kim, Jae K.; Kim, Young-Chang; Bang, Kyong H.; Kim, Sun T.
2016-01-01
Panax ginseng roots are well known for their medicinal properties and have been used in Korean and Chinese traditional medicines for 1000s of years. However, the medicinal value of P. ginseng fruits remain poorly characterized. In this study, we used an integrated biochemical, proteomics, and metabolomics approach to look into the medicinal properties of ginseng fruits. DPPH (1,1-diphenyl-2-picrylhydrazyl) and ABTS [2,2′-azino-bis (3-ethylbenzothiazoline-6-sulphonic acid)] assays showed higher antioxidant activities in ginseng fruits than leaves or roots. Two-dimensional gel electrophoresis (2-DE) profiling of ginseng fruit proteins (cv. Cheongsun) showed more than 400 spots wherein a total of 81 protein spots were identified by mass spectrometry using NCBInr, UniRef, and an in-house developed RNAseq (59,251 protein sequences)-based databases. Gene ontology analysis showed that most of the identified proteins were related to the hydrolase (18%), oxidoreductase (16%), and ATP binding (15%) activities. Further, a comparative proteome analysis of four cultivars of ginseng fruits (cvs. Yunpoong, Gumpoong, Chunpoong, and Cheongsun) led to the identification of 22 differentially modulated protein spots. Using gas chromatography-time of flight mass spectrometry (GC-TOF MS), 66 metabolites including amino acids, sugars, organic acids, phenolic acids, phytosterols, tocopherols, and policosanols were identified and quantified. Some of these are well known medicinal compounds and were not previously identified in ginseng. Interestingly, the concentration of almost all metabolites was higher in the Chunpoong and Gumpoong cultivars. Parallel comparison of the four cultivars also revealed higher amounts of the medicinal metabolites in Chunpoong and Gumpoong cultivars. Taken together, our results demonstrate that ginseng fruits are a rich source of medicinal compounds with potential beneficial health effects. PMID:27458475
The value of protein structure classification information-Surveying the scientific literature
Fox, Naomi K.; Brenner, Steven E.; Chandonia, John -Marc
2015-08-27
The Structural Classification of Proteins (SCOP) and Class, Architecture, Topology, Homology (CATH) databases have been valuable resources for protein structure classification for over 20 years. Development of SCOP (version 1) concluded in June 2009 with SCOP 1.75. The SCOPe (SCOP-extended) database offers continued development of the classic SCOP hierarchy, adding over 33,000 structures. We have attempted to assess the impact of these two decade old resources and guide future development. To this end, we surveyed recent articles to learn how structure classification data are used. Of 571 articles published in 2012-2013 that cite SCOP, 439 actually use data from themore » resource. We found that the type of use was fairly evenly distributed among four top categories: A) study protein structure or evolution (27% of articles), B) train and/or benchmark algorithms (28% of articles), C) augment non-SCOP datasets with SCOP classification (21% of articles), and D) examine the classification of one protein/a small set of proteins (22% of articles). Most articles described computational research, although 11% described purely experimental research, and a further 9% included both. We examined how CATH and SCOP were used in 158 articles that cited both databases: while some studies used only one dataset, the majority used data from both resources. Protein structure classification remains highly relevant for a diverse range of problems and settings.« less
The value of protein structure classification information-Surveying the scientific literature
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fox, Naomi K.; Brenner, Steven E.; Chandonia, John -Marc
The Structural Classification of Proteins (SCOP) and Class, Architecture, Topology, Homology (CATH) databases have been valuable resources for protein structure classification for over 20 years. Development of SCOP (version 1) concluded in June 2009 with SCOP 1.75. The SCOPe (SCOP-extended) database offers continued development of the classic SCOP hierarchy, adding over 33,000 structures. We have attempted to assess the impact of these two decade old resources and guide future development. To this end, we surveyed recent articles to learn how structure classification data are used. Of 571 articles published in 2012-2013 that cite SCOP, 439 actually use data from themore » resource. We found that the type of use was fairly evenly distributed among four top categories: A) study protein structure or evolution (27% of articles), B) train and/or benchmark algorithms (28% of articles), C) augment non-SCOP datasets with SCOP classification (21% of articles), and D) examine the classification of one protein/a small set of proteins (22% of articles). Most articles described computational research, although 11% described purely experimental research, and a further 9% included both. We examined how CATH and SCOP were used in 158 articles that cited both databases: while some studies used only one dataset, the majority used data from both resources. Protein structure classification remains highly relevant for a diverse range of problems and settings.« less
2013-01-01
Background SNPs&GO is a method for the prediction of deleterious Single Amino acid Polymorphisms (SAPs) using protein functional annotation. In this work, we present the web server implementation of SNPs&GO (WS-SNPs&GO). The server is based on Support Vector Machines (SVM) and for a given protein, its input comprises: the sequence and/or its three-dimensional structure (when available), a set of target variations and its functional Gene Ontology (GO) terms. The output of the server provides, for each protein variation, the probabilities to be associated to human diseases. Results The server consists of two main components, including updated versions of the sequence-based SNPs&GO (recently scored as one of the best algorithms for predicting deleterious SAPs) and of the structure-based SNPs&GO3d programs. Sequence and structure based algorithms are extensively tested on a large set of annotated variations extracted from the SwissVar database. Selecting a balanced dataset with more than 38,000 SAPs, the sequence-based approach achieves 81% overall accuracy, 0.61 correlation coefficient and an Area Under the Curve (AUC) of the Receiver Operating Characteristic (ROC) curve of 0.88. For the subset of ~6,600 variations mapped on protein structures available at the Protein Data Bank (PDB), the structure-based method scores with 84% overall accuracy, 0.68 correlation coefficient, and 0.91 AUC. When tested on a new blind set of variations, the results of the server are 79% and 83% overall accuracy for the sequence-based and structure-based inputs, respectively. Conclusions WS-SNPs&GO is a valuable tool that includes in a unique framework information derived from protein sequence, structure, evolutionary profile, and protein function. WS-SNPs&GO is freely available at http://snps.biofold.org/snps-and-go. PMID:23819482
Savidor, Alon; Barzilay, Rotem; Elinger, Dalia; Yarden, Yosef; Lindzen, Moshit; Gabashvili, Alexandra; Adiv Tal, Ophir; Levin, Yishai
2017-06-01
Traditional "bottom-up" proteomic approaches use proteolytic digestion, LC-MS/MS, and database searching to elucidate peptide identities and their parent proteins. Protein sequences absent from the database cannot be identified, and even if present in the database, complete sequence coverage is rarely achieved even for the most abundant proteins in the sample. Thus, sequencing of unknown proteins such as antibodies or constituents of metaproteomes remains a challenging problem. To date, there is no available method for full-length protein sequencing, independent of a reference database, in high throughput. Here, we present Database-independent Protein Sequencing, a method for unambiguous, rapid, database-independent, full-length protein sequencing. The method is a novel combination of non-enzymatic, semi-random cleavage of the protein, LC-MS/MS analysis, peptide de novo sequencing, extraction of peptide tags, and their assembly into a consensus sequence using an algorithm named "Peptide Tag Assembler." As proof-of-concept, the method was applied to samples of three known proteins representing three size classes and to a previously un-sequenced, clinically relevant monoclonal antibody. Excluding leucine/isoleucine and glutamic acid/deamidated glutamine ambiguities, end-to-end full-length de novo sequencing was achieved with 99-100% accuracy for all benchmarking proteins and the antibody light chain. Accuracy of the sequenced antibody heavy chain, including the entire variable region, was also 100%, but there was a 23-residue gap in the constant region sequence. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Sang Beom; Dsilva, Carmeline J.; Debenedetti, Pablo G., E-mail: pdebene@princeton.edu
Understanding the mechanisms by which proteins fold from disordered amino-acid chains to spatially ordered structures remains an area of active inquiry. Molecular simulations can provide atomistic details of the folding dynamics which complement experimental findings. Conventional order parameters, such as root-mean-square deviation and radius of gyration, provide structural information but fail to capture the underlying dynamics of the protein folding process. It is therefore advantageous to adopt a method that can systematically analyze simulation data to extract relevant structural as well as dynamical information. The nonlinear dimensionality reduction technique known as diffusion maps automatically embeds the high-dimensional folding trajectories inmore » a lower-dimensional space from which one can more easily visualize folding pathways, assuming the data lie approximately on a lower-dimensional manifold. The eigenvectors that parametrize the low-dimensional space, furthermore, are determined systematically, rather than chosen heuristically, as is done with phenomenological order parameters. We demonstrate that diffusion maps can effectively characterize the folding process of a Trp-cage miniprotein. By embedding molecular dynamics simulation trajectories of Trp-cage folding in diffusion maps space, we identify two folding pathways and intermediate structures that are consistent with the previous studies, demonstrating that this technique can be employed as an effective way of analyzing and constructing protein folding pathways from molecular simulations.« less
Protein structure based prediction of catalytic residues.
Fajardo, J Eduardo; Fiser, Andras
2013-02-22
Worldwide structural genomics projects continue to release new protein structures at an unprecedented pace, so far nearly 6000, but only about 60% of these proteins have any sort of functional annotation. We explored a range of features that can be used for the prediction of functional residues given a known three-dimensional structure. These features include various centrality measures of nodes in graphs of interacting residues: closeness, betweenness and page-rank centrality. We also analyzed the distance of functional amino acids to the general center of mass (GCM) of the structure, relative solvent accessibility (RSA), and the use of relative entropy as a measure of sequence conservation. From the selected features, neural networks were trained to identify catalytic residues. We found that using distance to the GCM together with amino acid type provide a good discriminant function, when combined independently with sequence conservation. Using an independent test set of 29 annotated protein structures, the method returned 411 of the initial 9262 residues as the most likely to be involved in function. The output 411 residues contain 70 of the annotated 111 catalytic residues. This represents an approximately 14-fold enrichment of catalytic residues on the entire input set (corresponding to a sensitivity of 63% and a precision of 17%), a performance competitive with that of other state-of-the-art methods. We found that several of the graph based measures utilize the same underlying feature of protein structures, which can be simply and more effectively captured with the distance to GCM definition. This also has the added the advantage of simplicity and easy implementation. Meanwhile sequence conservation remains by far the most influential feature in identifying functional residues. We also found that due the rapid changes in size and composition of sequence databases, conservation calculations must be recalibrated for specific reference databases.
Frozza, Caroline Olivieri da Silva; Ribeiro, Tanara da Silva; Gambato, Gabriela; Menti, Caroline; Moura, Sidnei; Pinto, Paulo Marcos; Staats, Charley Christian; Padilha, Francine Ferreira; Begnini, Karine Rech; de Leon, Priscila Marques Moura; Borsuk, Sibele; Savegnago, Lucielli; Dellagostin, Odir; Collares, Tiago; Seixas, Fabiana Kömmling; Henriques, João Antonio Pêgas; Roesch-Ely, Mariana
2014-01-01
Here we investigated alterations in the protein profile of Hep-2 treated with red propolis using two-dimensional electrophoresis associated to mass spectrometry and apoptotic rates of cells treated with and without red propolis extracts through TUNEL and Annexin-V assays. A total of 325 spots were manually excised from the two-dimensional gel electrophoresis and 177 proteins were identified using LC-MS-MS. Among all proteins identified that presented differential expression, most were down-regulated in presence of red propolis extract at a concentration of 120 μg/mL (IC50): GRP78, PRDX2, LDHB, VIM and TUBA1A. Only two up-regulated proteins were identified in this study in the non-cytotoxic (6 μg/mL) red propolis treated group: RPLP0 and RAD23B. TUNEL staining assay showed a markedly increase in the mid- to late-stage apoptosis of Hep-2 cells induced by red propolis at concentrations of 60 and 120 μg/mL when compared with non-treated cells. The increase of late apoptosis was confirmed by in situ Annexin-V analysis in which red propolis extract induced late apoptosis in a dose-dependent manner. The differences in tumor cell protein profiles warrant further investigations including isolation of major bioactive compounds of red propolis in different cell lines using proteomics and molecular tests to validate the protein expression here observed. Copyright © 2013 Elsevier Ltd. All rights reserved.
Crosara, Karla Tonelli Bicalho; Moffa, Eduardo Buozi; Xiao, Yizhi; Siqueira, Walter Luiz
2018-01-16
Protein-protein interaction is a common physiological mechanism for protection and actions of proteins in an organism. The identification and characterization of protein-protein interactions in different organisms is necessary to better understand their physiology and to determine their efficacy. In a previous in vitro study using mass spectrometry, we identified 43 proteins that interact with histatin 1. Six previously documented interactors were confirmed and 37 novel partners were identified. In this tutorial, we aimed to demonstrate the usefulness of the STRING database for studying protein-protein interactions. We used an in-silico approach along with the STRING database (http://string-db.org/) and successfully performed a fast simulation of a novel constructed histatin 1 protein-protein network, including both the previously known and the predicted interactors, along with our newly identified interactors. Our study highlights the advantages and importance of applying bioinformatics tools to merge in-silico tactics with experimental in vitro findings for rapid advancement of our knowledge about protein-protein interactions. Our findings also indicate that bioinformatics tools such as the STRING protein network database can help predict potential interactions between proteins and thus serve as a guide for future steps in our exploration of the Human Interactome. Our study highlights the usefulness of the STRING protein database for studying protein-protein interactions. The STRING database can collect and integrate data about known and predicted protein-protein associations from many organisms, including both direct (physical) and indirect (functional) interactions, in an easy-to-use interface. Copyright © 2017 Elsevier B.V. All rights reserved.
Sweetness determinant sites of brazzein, a small, heat-stable, sweet-tasting protein.
Assadi-Porter, F M; Aceti, D J; Markley, J L
2000-04-15
Brazzein, originally isolated from the fruit of the African plant Pentadiplandra brazzeana Baillon, is the smallest, most heat-stable and pH-stable member of the set of proteins known to have intrinsic sweetness. These properties make brazzein an ideal system for investigating the chemical and structural requirements of a sweet-tasting protein. We have used the three-dimensional structure of the protein (J. E. Caldwell et al. (1998) Nat. Struct. Biol. 5, 427-431) as a guide in designing 15 synthetic genes in expression constructs aimed at delineating the sweetness determinants of brazzein. Protein was produced heterologously in Escherichia coli, isolated, and purified as described in the companion paper (Assadi-Porter, F. M., Aceti, D., Cheng, H., and Markley, J. L., this issue). Analysis by one-dimensional (1)H NMR spectroscopy indicated that all but one of these variants had folded properly under the conditions used. A taste panel compared the gustatory properties of solutions of these proteins to those of sucrose and brazzein isolated from fruit. Of the 14 mutations in the des-pGlu1-brazzein background, four exhibited almost no sweetness, six had significantly reduced sweetness, two had taste properties equivalent to des-pGlu1-brazzein (two times as sweet as the major form of brazzein isolated from fruit which contains pGlu1), and two were about twice as sweet as des-pGlu1-brazzein. Overall, the results suggest that two regions of the protein are critical for the sweetness of brazzein: a region that includes the N- and C-termini of the protein, which are located close to one another, and a region that includes the flexible loop around Arg43. Copyright 2000 Academic Press.
Using Symmetry to Design Self-Assembling Protein Cages and Nanomaterials on the Mid-Nanometer Scale
NASA Astrophysics Data System (ADS)
Yeates, Todd
Self-assembling molecular structures having diverse cellular functions are widespread in nature. Some of the largest and most sophisticated types are built from many copies of the same or similar protein molecules arranged following principles of symmetry. A long-standing engineering goal has been to design novel protein molecules to self-assemble into geometrically specific structures similar to the extraordinary structures that have evolved in Nature. Practical routes to this goal have been developed by using ideas in symmetry to articulate the minimum design requirements for achieving various types of symmetric architectures, including cages, extended two-dimensional layers, and three-dimensional crystalline materials. The key requirement is that two distinct self-associating interfaces, each conferring one element of rotational symmetry, have to be engineered into the protein molecule (or molecules), following particular geometric specifications. The main principle is that combining two separate symmetry elements into a single molecular entity produces a molecule that necessarily assembles into an architecture dictated by a symmetry group that is the product of the two simpler contributing symmetries. Recent experiments have demonstrated success using a variety of symmetry-based strategies. Strategic variations are emerging that differ from each other with respect to biophysical features such as flexibility vs rigidity in the assembled structures, and with respect to design aspects such as whether the protein interfaces are inherited from natural oligomeric proteins or are designed de novo by advanced computational methods. The success of these strategies has been proven by determining crystal structures of several giant, self-assembling protein cages and clusters (10-25 nm in diameter), created by design. The ability to create sophisticated supramolecular structures from designed protein subunits opens the way to broad applications in synthetic biology and nanotechnology.
Cloning and characterization of an autonomous replication sequence from Coxiella burnetii.
Suhan, M; Chen, S Y; Thompson, H A; Hoover, T A; Hill, A; Williams, J C
1994-01-01
A Coxiella burnetii chromosomal fragment capable of functioning as an origin for the replication of a kanamycin resistance (Kanr) plasmid was isolated by use of origin search methods utilizing an Escherichia coli host. The 5.8-kb fragment was subcloned into phagemid vectors and was deleted progressively by an exonuclease III-S1 technique. Plasmids containing progressively shorter DNA fragments were then tested for their capability to support replication by transformation of an E. coli polA strain. A minimal autonomous replication sequence (ARS) was delimited to 403 bp. Sequencing of the entire 5.8-kb region revealed that the minimal ARS contained two consensus DnaA boxes, three A + T-rich 21-mers, a transcriptional promoter leading rightwards, and potential integration host factor and factor of inversion stimulation binding sites. Database comparisons of deduced amino acid sequences revealed that open reading frames located around the ARS were homologous to genes often, but not always, found near bacterial chromosomal origins; these included identities with rpmH and rnpA in E. coli and identities with the 9K protein and 60K membrane protein in E. coli and Pseudomonas species. These and direct hybridization data suggested that the ARS was chromosomal and not associated with the resident plasmid QpH1. Two-dimensional agarose gel electrophoresis did not reveal the presence of initiating intermediates, indicating that the ARS did not initiate chromosome replication during laboratory growth of C. burnetii. Images PMID:8071197
Jini service to reconstruct tomographic data
NASA Astrophysics Data System (ADS)
Knoll, Peter; Mirzaei, S.; Koriska, K.; Koehn, H.
2002-06-01
A number of imaging systems rely on the reconstruction of a 3- dimensional model from its projections through the process of computed tomography (CT). In medical imaging, for example magnetic resonance imaging (MRI), positron emission tomography (PET), and Single Computer Tomography (SPECT) acquire two-dimensional projections of a three dimensional projections of a three dimensional object. In order to calculate the 3-dimensional representation of the object, i.e. its voxel distribution, several reconstruction algorithms have been developed. Currently, mainly two reconstruct use: the filtered back projection(FBP) and iterative methods. Although the quality of iterative reconstructed SPECT slices is better than that of FBP slices, such iterative algorithms are rarely used for clinical routine studies because of their low availability and increased reconstruction time. We used Jini and a self-developed iterative reconstructions algorithm to design and implement a Jini reconstruction service. With this service, the physician selects the patient study from a database and a Jini client automatically discovers the registered Jini reconstruction services in the department's Intranet. After downloading the proxy object the this Jini service, the SPECT acquisition data are reconstructed. The resulting transaxial slices are visualized using a Jini slice viewer, which can be used for various imaging modalities.
Self-organized neural maps of human protein sequences.
Ferrán, E. A.; Pflugfelder, B.; Ferrara, P.
1994-01-01
We have recently described a method based on artificial neural networks to cluster protein sequences into families. The network was trained with Kohonen's unsupervised learning algorithm using, as inputs, the matrix patterns derived from the dipeptide composition of the proteins. We present here a large-scale application of that method to classify the 1,758 human protein sequences stored in the SwissProt database (release 19.0), whose lengths are greater than 50 amino acids. In the final 2-dimensional topologically ordered map of 15 x 15 neurons, proteins belonging to known families were associated with the same neuron or with neighboring ones. Also, as an attempt to reduce the time-consuming learning procedure, we compared 2 learning protocols: one of 500 epochs (100 SUN CPU-hours [CPU-h]), and another one of 30 epochs (6.7 CPU-h). A further reduction of learning-computing time, by a factor of about 3.3, with similar protein clustering results, was achieved using a matrix of 11 x 11 components to represent the sequences. Although network training is time consuming, the classification of a new protein in the final ordered map is very fast (14.6 CPU-seconds). We also show a comparison between the artificial neural network approach and conventional methods of biosequence analysis. PMID:8019421
iDBPs: a web server for the identification of DNA binding proteins
Nimrod, Guy; Schushan, Maya; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir
2010-01-01
Summary: The iDBPs server uses the three-dimensional (3D) structure of a query protein to predict whether it binds DNA. First, the algorithm predicts the functional region of the protein based on its evolutionary profile; the assumption is that large clusters of conserved residues are good markers of functional regions. Next, various characteristics of the predicted functional region as well as global features of the protein are calculated, such as the average surface electrostatic potential, the dipole moment and cluster-based amino acid conservation patterns. Finally, a random forests classifier is used to predict whether the query protein is likely to bind DNA and to estimate the prediction confidence. We have trained and tested the classifier on various datasets and shown that it outperformed related methods. On a dataset that reflects the fraction of DNA binding proteins (DBPs) in a proteome, the area under the ROC curve was 0.90. The application of the server to an updated version of the N-Func database, which contains proteins of unknown function with solved 3D-structure, suggested new putative DBPs for experimental studies. Availability: http://idbps.tau.ac.il/ Contact: NirB@tauex.tau.ac.il Supplementary information: Supplementary data are available at Bioinformatics online. PMID:20089514
Optimization of Protein Extraction and Two-Dimensional Electrophoresis Protocols for Oil Palm Leaf.
Daim, Leona Daniela Jeffery; Ooi, Tony Eng Keong; Yusof, Hirzun Mohd; Majid, Nazia Abdul; Karsani, Saiful Anuar Bin
2015-08-01
Oil palm (Elaeis guineensis) is an important economic crop cultivated for its nutritional palm oil. A significant amount of effort has been undertaken to understand oil palm growth and physiology at the molecular level, particularly in genomics and transcriptomics. Recently, proteomics studies have begun to garner interest. However, this effort is impeded by technical challenges. Plant sample preparation for proteomics analysis is plagued with technical challenges due to the presence of polysaccharides, secondary metabolites and other interfering compounds. Although protein extraction methods for plant tissues exist, none work universally on all sample types. Therefore, this study aims to compare and optimize different protein extraction protocols for use with two-dimensional gel electrophoresis of young and mature leaves from the oil palm. Four protein extraction methods were evaluated: phenol-guanidine isothiocyanate, trichloroacetic acid-acetone precipitation, sucrose and trichloroacetic acid-acetone-phenol. Of these four protocols, the trichloroacetic acid-acetone-phenol method was found to give the highest resolution and most reproducible gel. The results from this study can be used in sample preparations of oil palm tissue for proteomics work.
Sandovall, A.O.; Andrews, K.; Wahab, A.; Choudhary, M.I.; Ahmed, A.
2014-01-01
The RI-INBRE Centralized Core Facility was established in 2003 and participates annually in Undergraduate Summer Research Program. It provides students hands on research experience in key technologies in biomedical sciences. We present here the isolation and purification of water soluble proteins from ginger, a rhizome of the plant, Zingiber officinale. It is an important ingredient of species used in traditional South Asian cuisines. In Indian, Pakistani and Chinese folk medicine, ginger is used for gastro-intestinal disorders, nausea, vomiting, inflammatory diseases, muscle and joint pain. Limited studies have been reported on the bioactive proteins from ginger extract. The water soluble proteins were extracted from ginger root and successfully purified to homogeneity by using two-dimensional liquid chromatography (FPLC/RP-HPLC) approach. The ginger root was washed with distilled water; skin removed and then emulsified using an electric blender. Sample was stirred for four days at 4°C with and without protease inhibitor. Purification of a 42kDa protein was achieved by employing gel filtration, ion-exchange and reversed phase HPLC. The homogeneity of the protein was confirmed by SDS-PAGE gel electrophoresis and MALDI-TOF mass spectrometry. Future work will be conducted on the protein characterization using mass spectrometry and Edman protein sequencing. Supported by grant 5P20GM103430 from the National Institute of General Medical Sciences, NIH, USA.