Detection of protein complex from protein-protein interaction network using Markov clustering
NASA Astrophysics Data System (ADS)
Ochieng, P. J.; Kusuma, W. A.; Haryanto, T.
2017-05-01
Detecting complexes, or groups of functionally related proteins, is an important challenge in the analysis of biological networks. However, existing algorithms for identifying protein complexes are insufficient when applied to dense networks of experimentally derived interaction data. We therefore introduce a graph clustering method based on the Markov clustering (MCL) algorithm to identify protein complexes within highly interconnected protein-protein interaction (PPI) networks. A PPI network was first constructed to develop a geometrical network, which was then partitioned using Markov clustering to detect protein complexes. The method is illustrated by its application to human proteins associated with type II diabetes mellitus. Flow simulation of the MCL algorithm was performed first, and the topological properties of the resulting network were analysed to detect protein complexes. The results indicate that the proposed method successfully detected a total of 34 complexes, with 11 complexes consisting of overlapping modules and 20 of non-overlapping modules. The major complex consisted of 102 proteins and 521 interactions, with a cluster modularity and density of 0.745 and 0.101, respectively. The comparison revealed that MCL outperformed the AP, MCODE and SCPS algorithms, with a high clustering coefficient (0.751), network density and modularity index (0.630). This demonstrates that MCL was the most reliable and efficient of the compared graph clustering algorithms for detecting protein complexes from PPI networks.
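The flow simulation mentioned in the abstract above can be sketched as follows. This is a minimal illustration of generic Markov clustering (alternating expansion and inflation), not the authors' implementation; the function names, the inflation value of 2.0, the added self-loops, and the toy network are all assumptions made for the example. The density helper uses the standard undirected-graph formula, which, assuming that is the definition used, reproduces the reported 0.101 for 102 proteins and 521 interactions.

```python
import numpy as np

def markov_cluster(adjacency, inflation=2.0, max_iter=100, tol=1e-6):
    """Generic MCL sketch: alternate expansion and inflation until the flow settles."""
    # Self-loops are a common MCL convention (an assumption here); columns then sum to 1.
    m = adjacency.astype(float) + np.eye(adjacency.shape[0])
    m /= m.sum(axis=0, keepdims=True)
    for _ in range(max_iter):
        previous = m
        m = m @ m                           # expansion: flow spreads along two-step walks
        m = m ** inflation                  # inflation: strong flows grow, weak flows shrink
        m /= m.sum(axis=0, keepdims=True)   # renormalize each column to a distribution
        if np.abs(m - previous).max() < tol:
            break
    # Rows that still carry flow are attractors; the columns they attract form one cluster.
    clusters = {
        frozenset(np.flatnonzero(row > 1e-6).tolist())
        for row in m
        if row.max() > 1e-6
    }
    return sorted(sorted(c) for c in clusters)

def density(n_nodes, n_edges):
    """Density of an undirected simple graph: 2E / (N * (N - 1))."""
    return 2.0 * n_edges / (n_nodes * (n_nodes - 1))

# Hypothetical toy network: two triangles (0-1-2 and 3-4-5) joined by the edge 2-3.
bridge = np.array([
    [0, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [1, 1, 0, 1, 0, 0],
    [0, 0, 1, 0, 1, 1],
    [0, 0, 0, 1, 0, 1],
    [0, 0, 0, 1, 1, 0],
])
print(markov_cluster(bridge))
print(round(density(102, 521), 3))
```

A larger inflation value generally yields more, smaller clusters; the value used in the study is not stated in the abstract.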
Le, Duc-Hau
2015-01-01
Protein complexes formed by non-covalent interaction among proteins play important roles in cellular functions. Computational and purification methods have been used to identify many protein complexes and their cellular functions. However, their roles in causing disease have not yet been well characterized. Only a few studies have addressed the identification of disease-associated protein complexes, and they mostly rely on complicated heterogeneous networks constructed from an out-of-date phenotype similarity network collected from the literature. In addition, they apply only to diseases for which tissue-specific data exist. In this study, we propose a method to identify novel disease-protein complex associations. First, we introduce a framework to construct functional similarity protein complex networks, in which two protein complexes are functionally connected by shared protein elements, by shared annotating GO terms, or by protein interactions between elements of each protein complex. Second, we propose a simple but effective neighborhood-based algorithm, which yields a local similarity measure, to rank disease candidate protein complexes. Comparing the predictive performance of our proposed algorithm with that of two state-of-the-art network propagation algorithms, including one we used in our previous study, we found that it performed statistically significantly better than both for all the constructed functional similarity protein complex networks. In addition, it ran about 32 times faster than these two algorithms. Moreover, our proposed method always achieved high performance in terms of AUC values irrespective of how the functional similarity protein complex networks were constructed and which algorithms were used. The performance of our method was also higher than that reported for some existing methods based on complicated heterogeneous networks.
Finally, we also tested our method on prostate cancer and selected the top 100 highly ranked candidate protein complexes. Interestingly, 69 of them were supported by evidence, since at least one of their protein elements is known to be associated with prostate cancer. Our proposed method, including the framework to construct functional similarity protein complex networks and the neighborhood-based algorithm on these networks, could be used for identification of novel disease-protein complex associations.
Protein complex prediction in large ontology attributed protein-protein interaction networks.
Zhang, Yijia; Lin, Hongfei; Yang, Zhihao; Wang, Jian; Li, Yanpeng; Xu, Bo
2013-01-01
Protein complexes are important for unraveling the secrets of cellular organization and function. Many computational approaches have been developed to predict protein complexes in protein-protein interaction (PPI) networks. However, most existing approaches focus mainly on the topological structure of PPI networks, and largely ignore the gene ontology (GO) annotation information. In this paper, we constructed ontology attributed PPI networks with PPI data and GO resource. After constructing ontology attributed networks, we proposed a novel approach called CSO (clustering based on network structure and ontology attribute similarity). Structural information and GO attribute information are complementary in ontology attributed networks. CSO can effectively take advantage of the correlation between frequent GO annotation sets and the dense subgraph for protein complex prediction. Our proposed CSO approach was applied to four different yeast PPI data sets and predicted many well-known protein complexes. The experimental results showed that CSO was valuable in predicting protein complexes and achieved state-of-the-art performance.
Investigation of a protein complex network
NASA Astrophysics Data System (ADS)
Mashaghi, A. R.; Ramezanpour, A.; Karimipour, V.
2004-09-01
The budding yeast Saccharomyces cerevisiae is the first eukaryote whose genome has been completely sequenced. It is also the first eukaryotic cell whose proteome (the set of all proteins) and interactome (the network of all mutual interactions between proteins) have been analyzed. In this paper we study the structure of the yeast protein complex network, in which weighted edges between complexes represent the number of shared proteins. We find that the network of protein complexes is a small-world network with scale-free behavior for many of its distributions. However, we find no strong correlations between the weights and degrees of neighboring complexes. To reveal non-random features of the network, we also compare it with a null model in which the complexes randomly select their proteins. Finally, we propose a simple evolutionary model based on duplication and divergence of proteins.
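The network construction described in this abstract, where an edge between two complexes is weighted by the number of proteins they share, can be illustrated with a short sketch. The function name and the toy complexes below are hypothetical and are not data from the paper.

```python
from itertools import combinations

def complex_network(complexes):
    """Return weighted edges between complexes; weight = number of shared proteins."""
    edges = {}
    for a, b in combinations(sorted(complexes), 2):
        shared = len(set(complexes[a]) & set(complexes[b]))
        if shared:                      # complexes with no shared protein stay unconnected
            edges[(a, b)] = shared
    return edges

# Hypothetical toy data: four complexes over six proteins.
toy_complexes = {
    "C1": {"P1", "P2", "P3"},
    "C2": {"P2", "P3", "P4"},
    "C3": {"P4", "P5"},
    "C4": {"P6"},
}
print(complex_network(toy_complexes))
```

Here C1 and C2 share two proteins, C2 and C3 share one, and C4 remains isolated.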
Construction of ontology augmented networks for protein complex prediction.
Zhang, Yijia; Lin, Hongfei; Yang, Zhihao; Wang, Jian
2013-01-01
Protein complexes are of great importance in understanding the principles of cellular organization and function. The increase in available protein-protein interaction data, gene ontology and other resources makes it possible to develop computational methods for protein complex prediction. Most existing methods focus mainly on the topological structure of protein-protein interaction networks, and largely ignore the gene ontology annotation information. In this article, we constructed ontology augmented networks with protein-protein interaction data and gene ontology, which effectively combined the topological structure of protein-protein interaction networks and the similarity of gene ontology annotations into unified distance measures. After constructing ontology augmented networks, a novel method (clustering based on ontology augmented networks) was proposed to predict protein complexes, which was capable of taking into account the topological structure of the protein-protein interaction network, as well as the similarity of gene ontology annotations. Our method was applied to two different yeast protein-protein interaction datasets and predicted many well-known complexes. The experimental results showed that (i) ontology augmented networks and the unified distance measure can effectively combine structural closeness and gene ontology annotation similarity; (ii) our method is valuable in predicting protein complexes and has higher F1 and accuracy than competing methods.
Wang, Jian; Xie, Dong; Lin, Hongfei; Yang, Zhihao; Zhang, Yijia
2012-06-21
Protein complexes are of particular importance in many biological processes, and various computational approaches have been developed to identify complexes from protein-protein interaction (PPI) networks. However, the high false-positive rate of PPI data makes identification challenging. A protein semantic similarity measure is proposed in this study, based on the ontology structure of Gene Ontology (GO) terms and GO annotations, to estimate the reliability of interactions in PPI networks. Interaction pairs with low GO semantic similarity are removed from the network as unreliable interactions. Then, a cluster-expanding algorithm is used to detect complexes with core-attachment structure on the filtered network. Our method is applied to three different yeast PPI networks. The effectiveness of our method is examined on two benchmark complex datasets. Experimental results show that our method performed better than other state-of-the-art approaches in most evaluation metrics. The method detects protein complexes from large-scale PPI networks by filtering interactions on GO semantic similarity. Removing interactions with low GO similarity significantly improves the performance of complex identification. The expanding strategy is also effective in identifying attachment proteins of complexes.
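The filtering step in this abstract (dropping interactions whose GO semantic similarity is low) can be sketched as below. The paper's similarity measure is based on the GO ontology structure and is not specified in the abstract, so this example substitutes a plain Jaccard index over annotation sets as a stand-in. The threshold of 0.3, the function names, and the placeholder term labels are all assumptions.

```python
def jaccard(a, b):
    """Jaccard index of two annotation sets (a stand-in for a GO semantic similarity)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def filter_interactions(edges, annotations, threshold=0.3):
    """Keep only interactions whose endpoint annotations are similar enough."""
    kept = []
    for u, v in edges:
        sim = jaccard(annotations.get(u, set()), annotations.get(v, set()))
        if sim >= threshold:            # low-similarity pairs are treated as unreliable
            kept.append((u, v))
    return kept

# Hypothetical toy data; "term1" etc. are placeholders, not real GO identifiers.
toy_annotations = {
    "A": {"term1", "term2"},
    "B": {"term1", "term2", "term3"},
    "C": {"term9"},
}
toy_edges = [("A", "B"), ("B", "C"), ("A", "C")]
print(filter_interactions(toy_edges, toy_annotations))
```

Only the A-B interaction survives in this toy case, since A and B share two of three terms while C shares none with either.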
Lam, Winnie W M; Chan, Keith C C
2012-04-01
Protein molecules interact with each other in protein complexes to perform many vital functions, and different computational techniques have been developed to identify protein complexes in protein-protein interaction (PPI) networks. These techniques search for subgraphs of high connectivity in PPI networks under the assumption that the proteins in a protein complex are highly interconnected. While these techniques have been shown to be quite effective, the matching rate between the protein complexes they discover and those previously determined experimentally may be relatively low, and the "false-alarm" rate may be relatively high. This is especially the case when the assumption that proteins in protein complexes are more highly interconnected does not hold. To increase the matching rate and reduce the false-alarm rate, we have developed a technique that can work effectively without having to make this assumption. The technique, called protein complex identification by discovering functional interdependence (PCIFI), searches for protein complexes in PPI networks by taking into consideration both the functional interdependence relationship between protein molecules and the topology of the network. PCIFI works in several steps. The first step is to construct a multiple-function protein network graph by labeling each vertex with one or more of the molecular functions it performs. The second step is to filter out interactions between protein pairs that are not functionally interdependent in the statistical sense. The third step is to use an information-theoretic measure to determine the strength of the functional interdependence between all remaining interacting protein pairs. The last step is to form protein complexes based on the strength of functional interdependence and the connectivity between proteins.
For performance evaluation, PCIFI was used to identify protein complexes in real PPI network data and the protein complexes it found were matched against those that were previously known in MIPS. The results show that PCIFI can be an effective technique for the identification of protein complexes. The protein complexes it found can match more known protein complexes with a smaller false-alarm rate and can provide useful insights into the understanding of the functional interdependence relationships between proteins in protein complexes.
Liu, Lizhen; Sun, Xiaowu; Song, Wei; Du, Chao
2018-06-01
Predicting protein complexes from protein-protein interaction (PPI) networks is of great significance for understanding the structure and function of cells. A protein may interact with different proteins at different times or under different conditions. Existing approaches utilize only static PPI network data, which may lose much temporal biological information. First, this article proposes a novel method that combines gene expression data at different time points with a traditional static PPI network to construct different dynamic subnetworks. Second, to further filter out data noise, semantic similarity based on gene ontology is used as the network weight, together with principal component analysis, which is introduced to handle the weights computed by three traditional methods. Third, after building a dynamic PPI network, a protein complex prediction algorithm based on the "core-attachment" structural feature is applied to detect complexes from each dynamic subnetwork. Finally, the experimental results show that the proposed method performs well in detecting protein complexes from dynamic weighted PPI networks.
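The first step in this abstract, combining time-course expression with a static PPI network, can be illustrated as follows. The abstract does not say how a protein is judged active at a time point, so this sketch assumes a single fixed expression threshold, which is a simplification; the function name and toy values are hypothetical.

```python
def dynamic_subnetworks(edges, expression, threshold):
    """Build one subnetwork per time point from a static edge list.

    An edge is kept at time t only if both endpoints are 'active', here
    assumed to mean expression >= threshold (an illustrative rule only).
    """
    n_points = len(next(iter(expression.values())))
    subnetworks = []
    for t in range(n_points):
        active = {p for p, series in expression.items() if series[t] >= threshold}
        subnetworks.append([(u, v) for u, v in edges if u in active and v in active])
    return subnetworks

# Hypothetical toy data: three proteins measured at two time points.
static_edges = [("A", "B"), ("B", "C")]
toy_expression = {"A": [1.0, 0.1], "B": [0.9, 0.8], "C": [0.2, 0.7]}
print(dynamic_subnetworks(static_edges, toy_expression, threshold=0.5))
```

In the toy data, A-B is present only at the first time point and B-C only at the second, which is the kind of temporal information a static network cannot represent.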
Identifying protein complexes in PPI network using non-cooperative sequential game.
Maulik, Ujjwal; Basu, Srinka; Ray, Sumanta
2017-08-21
Identifying protein complexes from protein-protein interaction (PPI) networks is an important and challenging task in computational biology, as it helps in better understanding cellular mechanisms in various organisms. In this paper we propose a noncooperative sequential game based model for protein complex detection from PPI networks. The key hypothesis is that protein complex formation is driven by a mechanism that eventually optimizes the number of interactions within the complex, leading to a dense subgraph. The hypothesis is drawn from the observed network property named small world. The proposed multi-player game model translates the hypothesis into the game strategies. The Nash equilibrium of the game corresponds to a network partition where each protein either belongs to a complex or forms a singleton cluster. We further propose an algorithm to find the Nash equilibrium of the sequential game. Exhaustive experiments on synthetic benchmark and real-life yeast networks evaluate the structural as well as biological significance of the network partitions.
Cao, Buwen; Deng, Shuguang; Qin, Hua; Ding, Pingjian; Chen, Shaopeng; Li, Guanghui
2018-06-15
High-throughput technology has generated large-scale protein interaction data, which is crucial to our understanding of biological organisms. Many complex identification algorithms have been developed to determine protein complexes. However, these methods are only suitable for dense protein interaction networks, because their capabilities decrease rapidly when applied to sparse protein-protein interaction (PPI) networks. In this study, based on penalized matrix decomposition (PMD), a novel method of penalized matrix decomposition for the identification of protein complexes (i.e., PMDpc) was developed to detect protein complexes in the human protein interaction network. This method mainly consists of three steps. First, the adjacency matrix of the protein interaction network is normalized. Second, the normalized matrix is decomposed into three factor matrices. The PMDpc method can detect protein complexes in sparse PPI networks by imposing appropriate constraints on factor matrices. Finally, the results of our method are compared with those of other methods on the human PPI network. Experimental results show that our method can not only outperform classical algorithms, such as CFinder, ClusterONE, RRW, HC-PIN, and PCE-FR, but can also achieve an ideal overall performance in terms of a composite score consisting of F-measure, accuracy (ACC), and the maximum matching ratio (MMR).
Predicting protein complex geometries with a neural network.
Chae, Myong-Ho; Krull, Florian; Lorenzen, Stephan; Knapp, Ernst-Walter
2010-03-01
A major challenge of the protein docking problem is to define scoring functions that can distinguish near-native protein complex geometries from a large number of non-native geometries (decoys) generated with noncomplexed protein structures (unbound docking). In this study, we have constructed a neural network that employs the information from atom-pair distance distributions of a large number of decoys to predict protein complex geometries. We found that docking prediction can be significantly improved using two different types of polar hydrogen atoms. To train the neural network, 2000 near-native decoys of even distance distribution were used for each of the 185 considered protein complexes. The neural network normalizes the information from different protein complexes using an additional protein complex identity input neuron for each complex. The parameters of the neural network were determined such that they mimic a scoring funnel in the neighborhood of the native complex structure. The neural network approach avoids the reference state problem, which occurs in deriving knowledge-based energy functions for scoring. We show that a distance-dependent atom pair potential performs much better than a simple atom-pair contact potential. We have compared the performance of our scoring function with other empirical and knowledge-based scoring functions such as ZDOCK 3.0, ZRANK, ITScore-PP, EMPIRE, and RosettaDock. In spite of the simplicity of the method and its functional form, our neural network-based scoring function achieves a reasonable performance in rigid-body unbound docking of proteins. Proteins 2010. (c) 2009 Wiley-Liss, Inc.
From pull-down data to protein interaction networks and complexes with biological relevance.
Zhang, Bing; Park, Byung-Hoon; Karpinets, Tatiana; Samatova, Nagiza F
2008-04-01
Recent improvements in high-throughput Mass Spectrometry (MS) technology have expedited genome-wide discovery of protein-protein interactions by providing a capability of detecting protein complexes in a physiological setting. Computational inference of protein interaction networks and protein complexes from MS data is challenging. Advances are required in developing robust and seamlessly integrated procedures for assessment of protein-protein interaction affinities, mathematical representation of protein interaction networks, discovery of protein complexes and evaluation of their biological relevance. A multi-step but easy-to-follow framework for identifying protein complexes from MS pull-down data is introduced. It assesses interaction affinity between two proteins based on similarity of their co-purification patterns derived from MS data. It constructs a protein interaction network by adopting a knowledge-guided threshold selection method. Based on the network, it identifies protein complexes and infers their core components using a graph-theoretical approach. It deploys a statistical evaluation procedure to assess the biological relevance of each complex found. On Saccharomyces cerevisiae pull-down data, the framework outperformed other more complicated schemes by at least 10% in F(1)-measure and identified 610 protein complexes with high functional homogeneity based on enrichment in Gene Ontology (GO) annotation. Manual examination of the complexes suggested hypotheses on the causes of false identifications. Namely, co-purification of different protein complexes mediated by a common non-protein molecule, such as DNA, might be a source of false positives. Protein identification bias in pull-down technology, such as the hydrophilic bias, could result in false negatives.
Discovering protein complexes in protein interaction networks via exploring the weak ties effect
2012-01-01
Background: Studying protein complexes is very important in biological processes since it helps reveal the structure-functionality relationships in biological networks, and much attention has been paid to accurately predicting protein complexes from the increasing amount of protein-protein interaction (PPI) data. Most of the available algorithms are based on the assumption that dense subgraphs correspond to complexes, failing to take into account the inherent organization within protein complexes and the roles of edges. Thus, there is a critical need to investigate the possibility of discovering protein complexes using the topological information hidden in edges. Results: To investigate the roles of edges in PPI networks, we show that the edges connecting topologically less similar vertices are more significant in maintaining global connectivity, indicating the weak ties phenomenon in PPI networks. We further demonstrate that there is a negative relation between the weak tie strength and the topological similarity. By using the bridges, a reliable virtual network is constructed, in which each maximal clique corresponds to the core of a complex. By this notion, the detection of protein complexes is transformed into a classic all-clique problem. A novel core-attachment based method is developed, which detects the cores and attachments, respectively. A comprehensive comparison between the existing algorithms and our algorithm has been made by comparing the predicted complexes against benchmark complexes. Conclusions: We proved that the weak tie effect exists in the PPI network and demonstrated that density is insufficient to characterize the topological structure of protein complexes. Furthermore, the experimental results on the yeast PPI network show that the proposed method outperforms the state-of-the-art algorithms. The analysis of the modules detected by the present algorithm suggests that most of these modules are biologically significant in the context of complexes, suggesting that the roles of edges are critical in discovering protein complexes. PMID:23046740
Shen, Xianjun; Yi, Li; Jiang, Xingpeng; He, Tingting; Yang, Jincai; Xie, Wei; Hu, Po; Hu, Xiaohua
2017-01-01
Identifying protein complexes is an important and challenging task in proteomics. It would contribute greatly to our knowledge of molecular mechanisms in cell life activities. However, the inherent organization and dynamic characteristics of the cell system have rarely been incorporated into existing algorithms for detecting protein complexes because of the limitations of protein-protein interaction (PPI) data produced by high-throughput techniques. The availability of time-course gene expression profiles enables us to uncover the dynamics of molecular networks and improve the detection of protein complexes. To achieve this goal, this paper proposes a novel algorithm, DCA (Dynamic Core-Attachment). It detects a protein-complex core comprising continually expressed and highly connected proteins in the dynamic PPI network, and the protein complex is then formed by adding attachments with high adhesion to the core. The integration of the core-attachment feature into the dynamic PPI network is responsible for the superiority of our algorithm. DCA has been applied to two different yeast dynamic PPI networks, and the experimental results show that it performs significantly better than the state-of-the-art techniques in terms of prediction accuracy, hF-measure and statistical significance in biology. In addition, the identified complexes with strong biological significance provide potential candidate complexes for biologists to validate.
Ren, Jun; Zhou, Wei; Wang, Jianxin
2014-01-01
Much evidence has demonstrated that protein complexes are overlapping and hierarchically organized in PPI networks. Meanwhile, the large size of PPI networks requires complex detection methods to have low time complexity. Up to now, few methods can identify overlapping and hierarchical protein complexes in a PPI network quickly. In this paper, a novel method, called MCSE, is proposed based on λ-module and “seed-expanding.” First, it chooses seeds as essential PPIs or edges with high edge clustering values. Then, it identifies protein complexes by expanding each seed to a λ-module. MCSE is suitable for large PPI networks because of its low time complexity. MCSE can identify overlapping protein complexes naturally because a protein can be visited by different seeds. MCSE uses the parameter λ_th to control the range of seed expanding and can detect a hierarchical organization of protein complexes by tuning the value of λ_th. Experimental results on S. cerevisiae show that this hierarchical organization is similar to that of known complexes in the MIPS database. The experimental results also show that MCSE outperforms previous competing algorithms, such as CPM, CMC, Core-Attachment, Dpclus, HC-PIN, MCL, and NFC, in terms of functional enrichment and matching with known protein complexes. PMID:25143945
Liu, Zhiming; Luo, Jiawei
2017-08-01
Associating protein complexes with human inherited diseases is critical for better understanding of the biological processes and functional mechanisms of disease. Many protein complexes have been identified and functionally annotated by computational and purification methods so far; however, the particular roles they play in causing disease have not yet been well determined. In this study, we present a novel method to identify associations between protein complexes and diseases. First, we construct a disease-protein heterogeneous network based on data integration and Laplacian normalization. Second, we apply a random walk with restart on heterogeneous network (RWRH) algorithm to this network to quantify the strength of the association between proteins and the query disease. Third, we sum the scores of member proteins to obtain a summary score for each candidate protein complex, and then rank all candidate protein complexes according to their scores. With a series of leave-one-out cross-validation experiments, we found that our method not only achieves high performance but is also robust with respect to the parameters and the network structure. We tested our approach on breast cancer and selected the top 20 highly ranked protein complexes; 17 of the selected protein complexes are evidenced to be connected with breast cancer. Our proposed method is effective in identifying disease-related protein complexes based on data integration and Laplacian normalization. Copyright © 2017. Published by Elsevier Ltd.
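The second and third steps of this abstract (scoring proteins by a random walk with restart, then summing member scores per complex) can be sketched as below. This is a simplified illustration on a single protein network rather than the heterogeneous disease-protein network used in the paper, and it uses plain column normalization instead of Laplacian normalization. The restart probability of 0.5, the function names, and the toy graph are assumptions.

```python
import numpy as np

def random_walk_with_restart(adjacency, seeds, restart=0.5, tol=1e-10, max_iter=1000):
    """Iterate p <- (1 - r) * W p + r * p0 until the scores stop changing.

    Assumes every node has at least one edge, so each column can be normalized.
    """
    w = adjacency / adjacency.sum(axis=0, keepdims=True)   # column-stochastic matrix
    p0 = np.zeros(adjacency.shape[0])
    p0[seeds] = 1.0 / len(seeds)                           # walker restarts at the seeds
    p = p0.copy()
    for _ in range(max_iter):
        nxt = (1.0 - restart) * (w @ p) + restart * p0
        if np.abs(nxt - p).sum() < tol:
            return nxt
        p = nxt
    return p

def rank_complexes(scores, complexes):
    """Sum member-protein scores per complex and sort from highest to lowest."""
    totals = {name: float(sum(scores[i] for i in members))
              for name, members in complexes.items()}
    return sorted(totals, key=totals.get, reverse=True)

# Hypothetical toy network: a path 0-1-2-3 with protein 0 as the only seed.
path = np.array([
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
], dtype=float)
toy_scores = random_walk_with_restart(path, seeds=[0])
print(rank_complexes(toy_scores, {"near": [0, 1], "far": [2, 3]}))
```

The complex containing the seed protein ranks first, as expected, because the restart term keeps more than half of the total probability at the seed.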
CytoCluster: A Cytoscape Plugin for Cluster Analysis and Visualization of Biological Networks.
Li, Min; Li, Dongyan; Tang, Yu; Wu, Fangxiang; Wang, Jianxin
2017-08-31
Nowadays, cluster analysis of biological networks has become one of the most important approaches to identifying functional modules as well as predicting protein complexes and network biomarkers. Furthermore, the visualization of clustering results is crucial to display the structure of biological networks. Here we present CytoCluster, a cytoscape plugin integrating six clustering algorithms, HC-PIN (Hierarchical Clustering algorithm in Protein Interaction Networks), OH-PIN (identifying Overlapping and Hierarchical modules in Protein Interaction Networks), IPCA (Identifying Protein Complex Algorithm), ClusterONE (Clustering with Overlapping Neighborhood Expansion), DCU (Detecting Complexes based on Uncertain graph model), IPC-MCE (Identifying Protein Complexes based on Maximal Complex Extension), and BinGO (the Biological networks Gene Ontology) function. Users can select different clustering algorithms according to their requirements. The main function of these six clustering algorithms is to detect protein complexes or functional modules. In addition, BinGO is used to determine which Gene Ontology (GO) categories are statistically overrepresented in a set of genes or a subgraph of a biological network. CytoCluster can be easily expanded, so that more clustering algorithms and functions can be added to this plugin. Since it was created in July 2013, CytoCluster has been downloaded more than 9700 times in the Cytoscape App store and has already been applied to the analysis of different biological networks. CytoCluster is available from http://apps.cytoscape.org/apps/cytocluster.
CytoCluster: A Cytoscape Plugin for Cluster Analysis and Visualization of Biological Networks
Li, Min; Li, Dongyan; Tang, Yu; Wang, Jianxin
2017-01-01
Nowadays, cluster analysis of biological networks has become one of the most important approaches to identifying functional modules as well as predicting protein complexes and network biomarkers. Furthermore, the visualization of clustering results is crucial to display the structure of biological networks. Here we present CytoCluster, a cytoscape plugin integrating six clustering algorithms, HC-PIN (Hierarchical Clustering algorithm in Protein Interaction Networks), OH-PIN (identifying Overlapping and Hierarchical modules in Protein Interaction Networks), IPCA (Identifying Protein Complex Algorithm), ClusterONE (Clustering with Overlapping Neighborhood Expansion), DCU (Detecting Complexes based on Uncertain graph model), IPC-MCE (Identifying Protein Complexes based on Maximal Complex Extension), and BinGO (the Biological networks Gene Ontology) function. Users can select different clustering algorithms according to their requirements. The main function of these six clustering algorithms is to detect protein complexes or functional modules. In addition, BinGO is used to determine which Gene Ontology (GO) categories are statistically overrepresented in a set of genes or a subgraph of a biological network. CytoCluster can be easily expanded, so that more clustering algorithms and functions can be added to this plugin. Since it was created in July 2013, CytoCluster has been downloaded more than 9700 times in the Cytoscape App store and has already been applied to the analysis of different biological networks. CytoCluster is available from http://apps.cytoscape.org/apps/cytocluster. PMID:28858211
Identifying protein complexes based on brainstorming strategy.
Shen, Xianjun; Zhou, Jin; Yi, Li; Hu, Xiaohua; He, Tingting; Yang, Jincai
2016-11-01
Protein complexes comprising interacting proteins in a protein-protein interaction network (PPI network) play a central role in driving biological processes within cells. Recently, more and more swarm intelligence based algorithms for detecting protein complexes have emerged, and they have become a research hotspot in the field of proteomics. In this paper, we propose a novel algorithm for identifying protein complexes based on brainstorming strategy (IPC-BSS), which integrates the main idea of swarm intelligence optimization with an improved K-means algorithm. The distance between nodes in the PPI network is defined by combining the network topology and gene ontology (GO) information. Inspired by the human brainstorming process, the IPC-BSS algorithm first selects the clustering center nodes, which are then separately consolidated with other nodes at short distance to form initial clusters. Finally, we put forward two ways of updating the initial clusters to search for optimal results. Experimental results show that our IPC-BSS algorithm outperforms other classic algorithms on yeast and human PPI networks, and that it obtains many predicted protein complexes with biological significance. Copyright © 2016 Elsevier Inc. All rights reserved.
Lewis, Brian A
2010-01-15
The regulation of transcription and of many other cellular processes involves large multi-subunit protein complexes. In the context of transcription, it is known that these complexes serve as regulatory platforms that connect activator DNA-binding proteins to a target promoter. However, there is still a lack of understanding regarding the function of these complexes. Why do multi-subunit complexes exist? What is the molecular basis of the function of their constituent subunits, and how are these subunits organized within a complex? What is the reason for physical connections between certain subunits and not others? In this article, I address these issues through a model of network allostery and its application to the eukaryotic RNA polymerase II Mediator transcription complex. The multiple allosteric networks model (MANM) suggests that protein complexes such as Mediator exist not only as physical but also as functional networks of interconnected proteins through which information is transferred from subunit to subunit by the propagation of an allosteric state known as conformational spread. Additionally, there are multiple distinct sub-networks within the Mediator complex that can be defined by their connections to different subunits; these sub-networks have discrete functions that are activated when specific subunits interact with other activator proteins.
Nariai, N; Kim, S; Imoto, S; Miyano, S
2004-01-01
We propose a statistical method to estimate gene networks from DNA microarray data and protein-protein interactions. Because physical interactions between proteins or multiprotein complexes are likely to regulate biological processes, using only mRNA expression data is not sufficient for estimating a gene network accurately. Our method adds knowledge about protein-protein interactions to the estimation method of gene networks under a Bayesian statistical framework. In the estimated gene network, a protein complex is modeled as a virtual node based on principal component analysis. We show the effectiveness of the proposed method through the analysis of Saccharomyces cerevisiae cell cycle data. The proposed method improves the accuracy of the estimated gene networks, and successfully identifies some biological facts.
Protein-Protein Interface and Disease: Perspective from Biomolecular Networks.
Hu, Guang; Xiao, Fei; Li, Yuqian; Li, Yuan; Vongsangnak, Wanwipa
Protein-protein interactions are involved in many important biological processes and molecular mechanisms of disease association. Structural studies of interfacial residues in protein complexes provide information on protein-protein interactions. Characterizing protein-protein interfaces, including binding sites and allosteric changes, thus poses an imminent challenge. With a special focus on protein complexes, approaches based on network theory are proposed to meet this challenge. In this review we examine protein-protein interfaces from the perspective of biomolecular networks and their roles in disease. We first describe the different roles of protein complexes in disease through several structural aspects of interfaces. We then discuss some recent advances in predicting hot spots and in communication pathway analysis in terms of amino acid networks. Finally, we highlight possible future aspects of this area with respect to both methodology development and applications for disease treatment.
Identifying Dynamic Protein Complexes Based on Gene Expression Profiles and PPI Networks
Li, Min; Chen, Weijie; Wang, Jianxin; Pan, Yi
2014-01-01
Identification of protein complexes from protein-protein interaction networks has become a key problem for understanding cellular life in the postgenomic era. Many computational methods have been proposed for identifying protein complexes. Up to now, the existing computational methods have mostly been applied to static PPI networks. However, proteins and their interactions are dynamic in reality. Identifying dynamic protein complexes is more meaningful and more challenging. In this paper, a novel algorithm, named DPC, is proposed to identify dynamic protein complexes by integrating PPI data and gene expression profiles. According to the Core-Attachment assumption, proteins that are always active in the molecular cycle are regarded as core proteins. The protein-complex cores are identified from these always-active proteins by detecting dense subgraphs. Final protein complexes are extended from the protein-complex cores by adding attachments based on a topological character of “closeness” and dynamic meaning. The protein complexes produced by our algorithm DPC contain two parts: a static core expressed throughout the molecular cycle and short-lived dynamic attachments. The proposed algorithm DPC was applied to the data of Saccharomyces cerevisiae, and the experimental results show that DPC outperforms CMC, MCL, SPICi, HC-PIN, COACH, and Core-Attachment based on the validation of matching with known complexes and hF-measures. PMID:24963481
Protein-protein interaction networks (PPI) and complex diseases
Safari-Alighiarloo, Nahid; Taghizadeh, Mohammad; Rezaei-Tavirani, Mostafa; Goliaei, Bahram
2014-01-01
The physical interactions of proteins, which assemble them into large, densely connected networks, are a notable subject of investigation. Protein interaction networks are useful for making basic scientific abstractions and for improving biological and biomedical applications. Given the principal roles of proteins in biological function, their interactions determine molecular and cellular mechanisms, which control healthy and diseased states in organisms. Therefore, such networks facilitate the understanding of pathogenic (and physiologic) mechanisms that trigger the onset and progression of diseases. Consequently, this knowledge can be translated into effective diagnostic and therapeutic strategies. Furthermore, the results of several studies have proved that the structure and dynamics of protein networks are disturbed in complex diseases such as cancer and autoimmune disorders. Based on this relationship, a novel paradigm is suggested in which protein interaction networks, rather than individual molecules considered without regard to the network, can be the target of therapy for complex multi-genic diseases. PMID:25436094
Modeling and simulating networks of interdependent protein interactions.
Stöcker, Bianca K; Köster, Johannes; Zamir, Eli; Rahmann, Sven
2018-05-21
Protein interactions are fundamental building blocks of biochemical reaction systems underlying cellular functions. The complexity and functionality of these systems emerge not only from the protein interactions themselves but also from the dependencies between these interactions, as generated by allosteric effects or mutual exclusion due to steric hindrance. Therefore, formal models for integrating and utilizing information about interaction dependencies are of high interest. Here, we describe an approach for endowing protein networks with interaction dependencies using propositional logic, thereby obtaining constrained protein interaction networks ("constrained networks"). The construction of these networks is based on public interaction databases as well as text-mined information about interaction dependencies. We present an efficient data structure and algorithm to simulate protein complex formation in constrained networks. The efficiency of the model allows fast simulation and facilitates the analysis of many proteins in large networks. In addition, this approach enables the simulation of perturbation effects, such as knockout of single or multiple proteins and changes of protein concentrations. We illustrate how our model can be used to analyze a constrained human adhesome protein network, which is responsible for the formation of diverse and dynamic cell-matrix adhesion sites. By comparing protein complex formation under known interaction dependencies versus without dependencies, we investigate how these dependencies shape the resulting repertoire of protein complexes. Furthermore, our model enables investigating how the interplay of network topology with interaction dependencies influences the propagation of perturbation effects across a large biochemical system. 
Our simulation software CPINSim (for Constrained Protein Interaction Network Simulator) is available under the MIT license at http://github.com/BiancaStoecker/cpinsim and as a Bioconda package (https://bioconda.github.io).
The BioPlex Network: A Systematic Exploration of the Human Interactome.
Huttlin, Edward L; Ting, Lily; Bruckner, Raphael J; Gebreab, Fana; Gygi, Melanie P; Szpyt, John; Tam, Stanley; Zarraga, Gabriela; Colby, Greg; Baltier, Kurt; Dong, Rui; Guarani, Virginia; Vaites, Laura Pontano; Ordureau, Alban; Rad, Ramin; Erickson, Brian K; Wühr, Martin; Chick, Joel; Zhai, Bo; Kolippakkam, Deepak; Mintseris, Julian; Obar, Robert A; Harris, Tim; Artavanis-Tsakonas, Spyros; Sowa, Mathew E; De Camilli, Pietro; Paulo, Joao A; Harper, J Wade; Gygi, Steven P
2015-07-16
Protein interactions form a network whose structure drives cellular function and whose organization informs biological inquiry. Using high-throughput affinity-purification mass spectrometry, we identify interacting partners for 2,594 human proteins in HEK293T cells. The resulting network (BioPlex) contains 23,744 interactions among 7,668 proteins with 86% previously undocumented. BioPlex accurately depicts known complexes, attaining 80%-100% coverage for most CORUM complexes. The network readily subdivides into communities that correspond to complexes or clusters of functionally related proteins. More generally, network architecture reflects cellular localization, biological process, and molecular function, enabling functional characterization of thousands of proteins. Network structure also reveals associations among thousands of protein domains, suggesting a basis for examining structurally related proteins. Finally, BioPlex, in combination with other approaches, can be used to reveal interactions of biological or clinical significance. For example, mutations in the membrane protein VAPB implicated in familial amyotrophic lateral sclerosis perturb a defined community of interactors. Copyright © 2015 Elsevier Inc. All rights reserved.
The BioPlex Network: A Systematic Exploration of the Human Interactome
Huttlin, Edward L.; Ting, Lily; Bruckner, Raphael J.; Gebreab, Fana; Gygi, Melanie P.; Szpyt, John; Tam, Stanley; Zarraga, Gabriela; Colby, Greg; Baltier, Kurt; Dong, Rui; Guarani, Virginia; Vaites, Laura Pontano; Ordureau, Alban; Rad, Ramin; Erickson, Brian K.; Wühr, Martin; Chick, Joel; Zhai, Bo; Kolippakkam, Deepak; Mintseris, Julian; Obar, Robert A.; Harris, Tim; Artavanis-Tsakonas, Spyros; Sowa, Mathew E.; DeCamilli, Pietro; Paulo, Joao A.; Harper, J. Wade; Gygi, Steven P.
2015-01-01
SUMMARY Protein interactions form a network whose structure drives cellular function and whose organization informs biological inquiry. Using high-throughput affinity-purification mass spectrometry, we identify interacting partners for 2,594 human proteins in HEK293T cells. The resulting network (BioPlex) contains 23,744 interactions among 7,668 proteins with 86% previously undocumented. BioPlex accurately depicts known complexes, attaining 80-100% coverage for most CORUM complexes. The network readily subdivides into communities that correspond to complexes or clusters of functionally related proteins. More generally, network architecture reflects cellular localization, biological process, and molecular function, enabling functional characterization of thousands of proteins. Network structure also reveals associations among thousands of protein domains, suggesting a basis for examining structurally-related proteins. Finally, BioPlex, in combination with other approaches can be used to reveal interactions of biological or clinical significance. For example, mutations in the membrane protein VAPB implicated in familial Amyotrophic Lateral Sclerosis perturb a defined community of interactors. PMID:26186194
Networking at the Protein Society symposium.
McKnight, C James; Cordes, Matthew H J
2005-10-01
From the complex behavior of multicomponent signaling networks to the structures of large protein complexes and aggregates, questions once viewed as daunting are now being tackled fearlessly by protein scientists. The 19th Annual Symposium of the Protein Society in Boston highlighted the maturation of systems biology as applied to proteins.
Fluctuations in Mass-Action Equilibrium of Protein Binding Networks
NASA Astrophysics Data System (ADS)
Yan, Koon-Kiu; Walker, Dylan; Maslov, Sergei
2008-12-01
We consider two types of fluctuations in the mass-action equilibrium in protein binding networks. The first type is driven by slow changes in total concentrations of interacting proteins. The second type (spontaneous) is caused by quickly decaying thermodynamic deviations away from equilibrium. We investigate the effects of network connectivity on fluctuations by comparing them to scenarios in which the interacting pair is isolated from the network, and we analytically derive bounds on fluctuations. Collective effects are shown to sometimes lead to large amplification of spontaneous fluctuations. The strength of both types of fluctuations is positively correlated with the complex connectivity and negatively correlated with complex concentration. Our general findings are illustrated using a curated network of protein interactions and multiprotein complexes in baker’s yeast, with empirical protein concentrations.
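As an illustration of the isolated-pair baseline mentioned in this abstract, the following minimal sketch computes the mass-action equilibrium of a single reversible binding reaction A + B ⇌ AB. It is not the authors' code: the function name `dimer_concentration`, the parameter names, and the example numbers are assumptions chosen for illustration. The closed form follows from the law of mass action, [A][B]/[AB] = K_d, together with conservation of total A and total B.

```python
import math


def dimer_concentration(a_total, b_total, kd):
    """Equilibrium concentration of the complex AB for an isolated pair.

    Solves [A][B] = kd * [AB] with [A] = a_total - [AB] and
    [B] = b_total - [AB]; the smaller root of the quadratic is the
    physically valid one (it never exceeds min(a_total, b_total)).
    """
    s = a_total + b_total + kd
    return (s - math.sqrt(s * s - 4.0 * a_total * b_total)) / 2.0


# Hypothetical example values (arbitrary concentration units).
ab = dimer_concentration(1.0, 1.0, 0.5)
free_a = 1.0 - ab
free_b = 1.0 - ab

# A slow change in a total concentration (the first fluctuation type in
# the abstract) shifts the equilibrium; here total A is raised by 10%.
ab_shifted = dimer_concentration(1.1, 1.0, 0.5)
```

In a full binding network the free concentrations of A and B are also depleted by their other partners, which is why the abstract compares the network case against this isolated-pair case.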
Lautz, Jonathan D; Brown, Emily A; VanSchoiack, Alison A Williams; Smith, Stephen E P
2018-05-27
Cells utilize dynamic, network level rearrangements in highly interconnected protein interaction networks to transmit and integrate information from distinct signaling inputs. Despite the importance of protein interaction network dynamics, the organizational logic underlying information flow through these networks is not well understood. Previously, we developed the quantitative multiplex co-immunoprecipitation platform, which allows for the simultaneous and quantitative measurement of the amount of co-association between large numbers of proteins in shared complexes. Here, we adapt quantitative multiplex co-immunoprecipitation to define the activity dependent dynamics of an 18-member protein interaction network in order to better understand the underlying principles governing glutamatergic signal transduction. We first establish that immunoprecipitation detected by flow cytometry can detect activity dependent changes in two known protein-protein interactions (Homer1-mGluR5 and PSD-95-SynGAP). We next demonstrate that neuronal stimulation elicits a coordinated change in our targeted protein interaction network, characterized by the initial dissociation of Homer1 and SynGAP-containing complexes followed by increased associations among glutamate receptors and PSD-95. Finally, we show that stimulation of distinct glutamate receptor types results in different modular sets of protein interaction network rearrangements, and that cells activate both modules in order to integrate complex inputs. This analysis demonstrates that cells respond to distinct types of glutamatergic input by modulating different combinations of protein co-associations among a targeted network of proteins. Our data support a model of synaptic plasticity in which synaptic stimulation elicits dissociation of preexisting multiprotein complexes, opening binding slots in scaffold proteins and allowing for the recruitment of additional glutamatergic receptors. This article is protected by copyright. 
All rights reserved.
Pan, Joshua; Meyers, Robin M; Michel, Brittany C; Mashtalir, Nazar; Sizemore, Ann E; Wells, Jonathan N; Cassel, Seth H; Vazquez, Francisca; Weir, Barbara A; Hahn, William C; Marsh, Joseph A; Tsherniak, Aviad; Kadoch, Cigall
2018-05-23
Protein complexes are assemblies of subunits that have co-evolved to execute one or many coordinated functions in the cellular environment. Functional annotation of mammalian protein complexes is critical to understanding biological processes, as well as disease mechanisms. Here, we used genetic co-essentiality derived from genome-scale RNAi- and CRISPR-Cas9-based fitness screens performed across hundreds of human cancer cell lines to assign measures of functional similarity. From these measures, we systematically built and characterized functional similarity networks that recapitulate known structural and functional features of well-studied protein complexes and resolve novel functional modules within complexes lacking structural resolution, such as the mammalian SWI/SNF complex. Finally, by integrating functional networks with large protein-protein interaction networks, we discovered novel protein complexes involving recently evolved genes of unknown function. Taken together, these findings demonstrate the utility of genetic perturbation screens alone, and in combination with large-scale biophysical data, to enhance our understanding of mammalian protein complexes in normal and disease states. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
The Correlation Fractal Dimension of Complex Networks
NASA Astrophysics Data System (ADS)
Wang, Xingyuan; Liu, Zhenzhen; Wang, Mogei
2013-05-01
The fractality of complex networks is studied by estimating the correlation dimensions of the networks. Compared with previous algorithms for estimating the box dimension, our algorithm achieves a significant reduction in time complexity. Experiments on four benchmark cases, namely the Escherichia coli (E. coli) metabolic network, the Homo sapiens protein interaction network (H. sapiens PIN), the Saccharomyces cerevisiae protein interaction network (S. cerevisiae PIN) and the World Wide Web (WWW), demonstrate the validity of our algorithm.
Centralities in simplicial complexes. Applications to protein interaction networks.
Estrada, Ernesto; Ross, Grant J
2018-02-07
Complex networks can be used to represent complex systems which originate in the real world. Here we study a transformation of these complex networks into simplicial complexes, where cliques represent the simplices of the complex. We extend the concept of node centrality to that of simplicial centrality and study several mathematical properties of degree, closeness, betweenness, eigenvector, Katz, and subgraph centrality for simplicial complexes. We study the degree distributions of these centralities at the different levels. We also compare and describe the differences between the centralities at the different levels. Using these centralities we study a method for detecting essential proteins in PPI networks of cells and explain the varying abilities of the centrality measures at the different levels in identifying these essential proteins. Copyright © 2017 Elsevier Ltd. All rights reserved.
An automated method for finding molecular complexes in large protein interaction networks
Bader, Gary D; Hogue, Christopher WV
2003-01-01
Background Recent advances in proteomics technologies such as two-hybrid, phage display and mass spectrometry have enabled us to create a detailed map of biomolecular interaction networks. Initial mapping efforts have already produced a wealth of data. As the size of the interaction set increases, databases and computational methods will be required to store, visualize and analyze the information in order to effectively aid in knowledge discovery. Results This paper describes a novel graph theoretic clustering algorithm, "Molecular Complex Detection" (MCODE), that detects densely connected regions in large protein-protein interaction networks that may represent molecular complexes. The method is based on vertex weighting by local neighborhood density and outward traversal from a locally dense seed protein to isolate the dense regions according to given parameters. The algorithm has the advantage over other graph clustering methods of having a directed mode that allows fine-tuning of clusters of interest without considering the rest of the network and allows examination of cluster interconnectivity, which is relevant for protein networks. Protein interaction and complex information from the yeast Saccharomyces cerevisiae was used for evaluation. Conclusion Dense regions of protein interaction networks can be found, based solely on connectivity data, many of which correspond to known protein complexes. The algorithm is not affected by a known high rate of false positives in data from high-throughput interaction techniques. The program is available from . PMID:12525261
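The two stages described in this abstract (vertex weighting by local neighborhood density, then outward traversal from a dense seed) can be sketched as follows. This is a simplified illustration, not the MCODE program: the weight used here is the density of the highest k-core of a vertex's closed neighborhood multiplied by k, and the traversal admits neighbors whose weight lies within a fixed fraction of the seed's weight. The function names, the `fraction` parameter, and the toy graph are assumptions made for this example.

```python
from itertools import combinations


def density(nodes, graph):
    """Fraction of possible edges present among `nodes` (no self-loops)."""
    n = len(nodes)
    if n < 2:
        return 0.0
    edges = sum(1 for a, b in combinations(nodes, 2) if b in graph[a])
    return 2.0 * edges / (n * (n - 1))


def highest_k_core(nodes, graph):
    """Return (k, core) for the highest non-empty k-core induced on `nodes`."""
    current, best_k, best = set(nodes), 0, set(nodes)
    k = 1
    while current:
        # Repeatedly peel vertices whose degree inside `current` is below k.
        low = {v for v in current if len(graph[v] & current) < k}
        while low:
            current -= low
            low = {v for v in current if len(graph[v] & current) < k}
        if current:
            best_k, best = k, set(current)
            k += 1
    return best_k, best


def vertex_weights(graph):
    """Weight each vertex by k * density of the highest k-core around it."""
    weights = {}
    for v in graph:
        k, core = highest_k_core(graph[v] | {v}, graph)
        weights[v] = k * density(core, graph)
    return weights


def grow_cluster(graph, weights, seed, fraction=0.2):
    """Traverse outward from `seed`, keeping sufficiently heavy neighbors."""
    threshold = (1.0 - fraction) * weights[seed]
    cluster, frontier = {seed}, [seed]
    while frontier:
        v = frontier.pop()
        for u in graph[v]:
            if u not in cluster and weights[u] >= threshold:
                cluster.add(u)
                frontier.append(u)
    return cluster


# Toy network (hypothetical): a 4-clique a-b-c-d with a tail d-e-f.
edges = [("a", "b"), ("a", "c"), ("a", "d"), ("b", "c"),
         ("b", "d"), ("c", "d"), ("d", "e"), ("e", "f")]
graph = {}
for x, y in edges:
    graph.setdefault(x, set()).add(y)
    graph.setdefault(y, set()).add(x)

weights = vertex_weights(graph)
seed = max(weights, key=weights.get)
cluster = grow_cluster(graph, weights, seed)
```

On the toy network the dense 4-clique is recovered and the sparse tail is left out, which mirrors the abstract's point that dense regions can be isolated from connectivity data alone.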
2010-01-01
Background The reconstruction of protein complexes from the physical interactome of organisms serves as a building block towards understanding the higher level organization of the cell. Over the past few years, several independent high-throughput experiments have helped to catalogue an enormous amount of physical protein interaction data from organisms such as yeast. However, these individual datasets correlate poorly with each other and also contain a substantial number of false positives (noise). Over these years, several affinity scoring schemes have also been devised to improve the quality of these datasets. Therefore, the challenge now is to detect meaningful as well as novel complexes from protein interaction (PPI) networks derived by combining datasets from multiple sources and by making use of these affinity scoring schemes. In an attempt to tackle this challenge, the Markov Clustering algorithm (MCL) has proved to be a popular and reasonably successful method, mainly due to its scalability, robustness, and ability to work on scored (weighted) networks. However, MCL produces many noisy clusters, which either do not match known complexes or have additional proteins that reduce the accuracies of correctly predicted complexes. Results Inspired by recent experimental observations by Gavin and colleagues on the modularity structure in yeast complexes and the distinctive properties of "core" and "attachment" proteins, we develop a core-attachment based refinement method coupled to MCL for reconstruction of yeast complexes from scored (weighted) PPI networks. We combine physical interactions from two recent "pull-down" experiments to generate an unscored PPI network. We then score this network using available affinity scoring schemes to generate multiple scored PPI networks. 
The evaluation of our method (called MCL-CAw) on these networks shows that: (i) MCL-CAw derives a larger number of yeast complexes, with better accuracies, than MCL, particularly in the presence of natural noise; (ii) Affinity scoring can effectively reduce the impact of noise on MCL-CAw and thereby improve the quality (precision and recall) of its predicted complexes; (iii) MCL-CAw responds well to most available scoring schemes. We discuss several instances where MCL-CAw was successful in deriving meaningful complexes, and where it missed a few proteins or whole complexes due to affinity scoring of the networks. We compare MCL-CAw with several recent complex detection algorithms on unscored and scored networks, and assess the relative performance of the algorithms on these networks. Further, we study the impact of augmenting physical datasets with computationally inferred interactions for complex detection. Finally, we analyse the essentiality of proteins within predicted complexes to understand a possible correlation between protein essentiality and their ability to form complexes. Conclusions We demonstrate that core-attachment based refinement in MCL-CAw improves the predictions of MCL on yeast PPI networks. We show that affinity scoring improves the performance of MCL-CAw. PMID:20939868
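For readers unfamiliar with the plain Markov Clustering step that MCL-CAw refines, the sketch below shows the standard expansion and inflation loop on a small weighted adjacency matrix. It is a minimal pure-Python illustration, not the MCL-CAw method and not the reference MCL implementation; the function names, the default inflation value of 2.0, the readout threshold, and the toy network are assumptions made for this example, and the core-attachment refinement from the abstract is not included.

```python
def normalize_columns(m):
    """Scale every column to sum to 1 (column-stochastic matrix)."""
    n = len(m)
    sums = [sum(m[i][j] for i in range(n)) for j in range(n)]
    return [[m[i][j] / sums[j] if sums[j] else 0.0 for j in range(n)]
            for i in range(n)]


def matmul(a, b):
    """Plain square-matrix product."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mcl(adjacency, inflation=2.0, iterations=50, tol=1e-9):
    """Cluster a (weighted) adjacency matrix by flow simulation."""
    n = len(adjacency)
    # Add self-loops, then turn edge weights into transition probabilities.
    m = [[float(adjacency[i][j]) + (1.0 if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    m = normalize_columns(m)
    for _ in range(iterations):
        expanded = matmul(m, m)                      # expansion: two-step flow
        inflated = normalize_columns(                # inflation: favor strong flow
            [[x ** inflation for x in row] for row in expanded])
        delta = max(abs(inflated[i][j] - m[i][j])
                    for i in range(n) for j in range(n))
        m = inflated
        if delta < tol:
            break
    # Each row with remaining mass is an attractor; its non-zero columns
    # are the members of that attractor's cluster.
    clusters = set()
    for i in range(n):
        members = frozenset(j for j in range(n) if m[i][j] > 1e-6)
        if members:
            clusters.add(members)
    return sorted(sorted(c) for c in clusters)


# Toy network (hypothetical): two triangles joined by one bridging edge.
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
adjacency = [[0.0] * 6 for _ in range(6)]
for i, j in edges:
    adjacency[i][j] = adjacency[j][i] = 1.0  # affinity scores would go here

clusters = mcl(adjacency)
```

Replacing the unit edge weights with affinity scores gives the scored (weighted) setting discussed in the abstract; higher inflation values generally produce finer clusters.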
2018-01-01
Stoichiometric balance, or dosage balance, implies that proteins that are subunits of obligate complexes (e.g. the ribosome) should have copy numbers expressed to match their stoichiometry in that complex. Establishing balance (or imbalance) is an important tool for inferring subunit function and assembly bottlenecks. We show here that these correlations in protein copy numbers can extend beyond complex subunits to larger protein-protein interaction networks (PPINs) involving a range of reversible binding interactions. We develop a simple method for quantifying balance in any interface-resolved PPIN based on network structure and experimentally observed protein copy numbers. By analyzing such a network for the clathrin-mediated endocytosis (CME) system in yeast, we found that the real protein copy numbers were significantly more balanced in relation to their binding partners compared to randomly sampled sets of yeast copy numbers. The observed balance is not perfect, highlighting both under- and overexpressed proteins. We evaluate the potential costs and benefits of imbalance using two criteria. First, a potential cost to imbalance is that ‘leftover’ proteins without remaining functional partners are free to misinteract. We systematically quantify how this misinteraction cost is most dangerous for strong-binding protein interactions and for network topologies observed in biological PPINs. Second, a more direct consequence of imbalance is that the formation of specific functional complexes depends on relative copy numbers. We therefore construct simple kinetic models of two sub-networks in the CME network to assess multi-protein assembly of the ARP2/3 complex and a minimal, nine-protein clathrin-coated vesicle forming module. We find that the observed, imperfectly balanced copy numbers are less effective than balanced copy numbers in producing fast and complete multi-protein assemblies. 
However, we speculate that strategic imbalance in the vesicle forming module allows cells to tune where endocytosis occurs, providing sensitive control over cargo uptake via clathrin-coated vesicles. PMID:29518071
Holland, David O; Johnson, Margaret E
2018-03-01
Stoichiometric balance, or dosage balance, implies that proteins that are subunits of obligate complexes (e.g. the ribosome) should have copy numbers expressed to match their stoichiometry in that complex. Establishing balance (or imbalance) is an important tool for inferring subunit function and assembly bottlenecks. We show here that these correlations in protein copy numbers can extend beyond complex subunits to larger protein-protein interaction networks (PPINs) involving a range of reversible binding interactions. We develop a simple method for quantifying balance in any interface-resolved PPIN based on network structure and experimentally observed protein copy numbers. By analyzing such a network for the clathrin-mediated endocytosis (CME) system in yeast, we found that the real protein copy numbers were significantly more balanced in relation to their binding partners compared to randomly sampled sets of yeast copy numbers. The observed balance is not perfect, highlighting both under- and overexpressed proteins. We evaluate the potential costs and benefits of imbalance using two criteria. First, a potential cost to imbalance is that 'leftover' proteins without remaining functional partners are free to misinteract. We systematically quantify how this misinteraction cost is most dangerous for strong-binding protein interactions and for network topologies observed in biological PPINs. Second, a more direct consequence of imbalance is that the formation of specific functional complexes depends on relative copy numbers. We therefore construct simple kinetic models of two sub-networks in the CME network to assess multi-protein assembly of the ARP2/3 complex and a minimal, nine-protein clathrin-coated vesicle forming module. We find that the observed, imperfectly balanced copy numbers are less effective than balanced copy numbers in producing fast and complete multi-protein assemblies. 
However, we speculate that strategic imbalance in the vesicle forming module allows cells to tune where endocytosis occurs, providing sensitive control over cargo uptake via clathrin-coated vesicles.
Evolution of an intricate J-protein network driving protein disaggregation in eukaryotes.
Nillegoda, Nadinath B; Stank, Antonia; Malinverni, Duccio; Alberts, Niels; Szlachcic, Anna; Barducci, Alessandro; De Los Rios, Paolo; Wade, Rebecca C; Bukau, Bernd
2017-05-15
Hsp70 participates in a broad spectrum of protein folding processes extending from nascent chain folding to protein disaggregation. This versatility in function is achieved through a diverse family of J-protein cochaperones that select substrates for Hsp70. Substrate selection is further tuned by transient complexation between different classes of J-proteins, which expands the range of protein aggregates targeted by metazoan Hsp70 for disaggregation. We assessed the prevalence and evolutionary conservation of J-protein complexation and cooperation in disaggregation. We find the emergence of a eukaryote-specific signature for interclass complexation of canonical J-proteins. Consistently, complexes exist in yeast and human cells, but not in bacteria, and correlate with cooperative action in disaggregation in vitro. Signature alterations exclude some J-proteins from networking, which ensures correct J-protein pairing, functional network integrity and J-protein specialization. This fundamental change in J-protein biology during the prokaryote-to-eukaryote transition allows for increased fine-tuning and broadening of Hsp70 function in eukaryotes.
Mehranfar, Adele; Ghadiri, Nasser; Kouhsar, Morteza; Golshani, Ashkan
2017-09-01
Detecting protein complexes is an important task in analyzing protein interaction networks. Although many algorithms predict protein complexes in different ways, surveys on the interaction networks indicate that about 50% of detected interactions are false positives. Consequently, the accuracy of existing methods needs to be improved. In this paper we propose a novel algorithm to detect protein complexes in 'noisy' protein interaction data. First, we integrate several biological data sources to determine the reliability of each interaction and to assign more accurate weights to the interactions. A data fusion component is used for this step, based on the interval type-2 fuzzy voter, which provides an efficient combination of the information sources. This fusion component detects the errors and diminishes their effect on the detection of protein complexes. Thus, in the first step, a reliability score is assigned to every interaction in the network. In the second step, we propose a general protein complex detection algorithm by exploiting and adopting the strong points of other algorithms and existing hypotheses regarding real complexes. Finally, the proposed method has been applied to yeast interaction datasets for predicting the interactions. The results show that our framework performs better in terms of precision and F-measure than the existing approaches. Copyright © 2017 Elsevier Ltd. All rights reserved.
Jalili, Mahdi; Gebhardt, Tom; Wolkenhauer, Olaf; Salehzadeh-Yazdi, Ali
2018-06-01
Decoding health and disease phenotypes is one of the fundamental objectives in biomedicine. Whereas high-throughput omics approaches are available, it is evident that any single omics approach might not be adequate to capture the complexity of phenotypes. Therefore, integrated multi-omics approaches have been used to unravel genotype-phenotype relationships such as global regulatory mechanisms and complex metabolic networks in different eukaryotic organisms. Some of the progress and challenges associated with integrated omics studies have been reviewed previously in comprehensive studies. In this work, we highlight and review the progress, challenges and advantages associated with emerging approaches that integrate gene expression and protein-protein interaction networks to unravel network-based functional features. This includes identifying disease-related genes, gene prioritization, clustering protein interactions, developing modules, and extracting active subnetworks and static or dynamic/temporal protein complexes. We also discuss how these approaches contribute to our understanding of the biology of complex traits and diseases. This article is part of a Special Issue entitled: Cardiac adaptations to obesity, diabetes and insulin resistance, edited by Professors Jan F.C. Glatz, Jason R.B. Dyck and Christine Des Rosiers. Copyright © 2018 Elsevier B.V. All rights reserved.
Kim, Inhae; Lee, Heetak; Han, Seong Kyu; Kim, Sanguk
2014-10-01
The modular architecture of protein-protein interaction (PPI) networks is evident in diverse species with a wide range of complexity. However, the molecular components that lead to the evolution of modularity in PPI networks have not been clearly identified. Here, we show that weak domain-linear motif interactions (DLIs) are more likely to connect different biological modules than strong domain-domain interactions (DDIs). This molecular division of labor is essential for the evolution of modularity in the complex PPI networks of diverse eukaryotic species. In particular, DLIs may compensate for the reduction in module boundaries that originate from increased connections between different modules in complex PPI networks. In addition, we show that the identification of biological modules can be greatly improved by including molecular characteristics of protein interactions. Our findings suggest that transient interactions have played a unique role in shaping the architecture and modularity of biological networks over the course of evolution.
Predicting Physical Interactions between Protein Complexes
Clancy, Trevor; Rødland, Einar Andreas; Nygard, Ståle; Hovig, Eivind
2013-01-01
Protein complexes enact most biochemical functions in the cell. Dynamic interactions between protein complexes are frequent in many cellular processes. As they are often of a transient nature, they may be difficult to detect using current genome-wide screens. Here, we describe a method to computationally predict physical interactions between protein complexes, applied to both humans and yeast. We integrated manually curated protein complexes and physical protein interaction networks, and we designed a statistical method to identify pairs of protein complexes where the number of protein interactions between a complex pair is due to an actual physical interaction between the complexes. An evaluation against manually curated physical complex-complex interactions in yeast revealed that 50% of these interactions could be predicted in this manner. A community network analysis of the highest scoring pairs revealed a biologically sensible organization of physical complex-complex interactions in the cell. Such analyses of proteomes may serve as a guide to the discovery of novel functional cellular relationships. PMID:23438732
RRW: repeated random walks on genome-scale protein networks for local cluster discovery
Macropol, Kathy; Can, Tolga; Singh, Ambuj K
2009-01-01
Background: We propose an efficient and biologically sensitive algorithm based on repeated random walks (RRW) for discovering functional modules, e.g., complexes and pathways, within large-scale protein networks. Compared to existing cluster identification techniques, RRW implicitly makes use of network topology, edge weights, and long range interactions between proteins. Results: We apply the proposed technique on a functional network of yeast genes and accurately identify statistically significant clusters of proteins. We validate the biological significance of the results using known complexes in the MIPS complex catalogue database and well-characterized biological processes. We find that 90% of the created clusters have the majority of their catalogued proteins belonging to the same MIPS complex, and about 80% have the majority of their proteins involved in the same biological process. We compare our method to various other clustering techniques, such as the Markov Clustering Algorithm (MCL), and find a significant improvement in the RRW clusters' precision and accuracy values. Conclusion: RRW, which is a technique that exploits the topology of the network, is more precise and robust in finding local clusters. In addition, it has the added flexibility of being able to find multi-functional proteins by allowing overlapping clusters. PMID:19740439
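The random-walk idea behind RRW can be illustrated with a random walk with restart from a single seed protein. This is a minimal sketch of that building block only, not the authors' full RRW procedure: the function name, the restart probability of 0.3 and the dense weight-matrix input are all assumptions made for illustration.

```python
def random_walk_with_restart(weights, seed, restart=0.3, tol=1e-10, max_iter=1000):
    """Return the stationary visiting probabilities of a walk restarted at `seed`.

    `weights` is a symmetric matrix (list of lists) of non-negative edge weights.
    All parameter names and defaults are illustrative assumptions.
    """
    n = len(weights)
    col_sums = [sum(weights[i][j] for i in range(n)) for j in range(n)]
    p = [0.0] * n
    p[seed] = 1.0
    for _ in range(max_iter):
        nxt = [0.0] * n
        for j in range(n):
            if col_sums[j] == 0 or p[j] == 0.0:
                continue
            # Probability mass leaving node j, split in proportion to edge weight.
            share = (1.0 - restart) * p[j] / col_sums[j]
            for i in range(n):
                nxt[i] += share * weights[i][j]
        # The restart mass (and any mass stranded on isolated nodes) returns to the seed.
        nxt[seed] += 1.0 - sum(nxt)
        change = max(abs(a - b) for a, b in zip(nxt, p))
        p = nxt
        if change < tol:
            break
    return p


# Toy path graph 0 - 1 - 2; proteins closer to the seed receive higher affinity.
path = [[0, 1, 0],
        [1, 0, 1],
        [0, 1, 0]]
affinity = random_walk_with_restart(path, seed=0)
ranked = sorted(range(len(affinity)), key=lambda v: -affinity[v])
```

A local cluster could then be grown by repeatedly adding the highest-affinity proteins and re-running the walk from the enlarged cluster, which is roughly what "repeated" refers to in the abstract.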
Characterization of known protein complexes using k-connectivity and other topological measures
Gallagher, Suzanne R; Goldberg, Debra S
2015-01-01
Many protein complexes are densely packed, so proteins within complexes often interact with several other proteins in the complex. Steric constraints prevent most proteins from simultaneously binding more than a handful of other proteins, regardless of the number of proteins in the complex. Because of this, as complex size increases, several measures of the complex decrease within protein-protein interaction networks. However, k-connectivity, the number of vertices or edges that need to be removed in order to disconnect a graph, may be consistently high for protein complexes. The property of k-connectivity has been little used previously in the investigation of protein-protein interactions. To understand the discriminative power of k-connectivity and other topological measures for identifying unknown protein complexes, we characterized these properties in known Saccharomyces cerevisiae protein complexes in networks generated both from highly accurate X-ray crystallography experiments, which give an accurate model of each complex, and from high-throughput yeast 2-hybrid studies, in which new complexes may be discovered. We also computed these properties for appropriate random subgraphs. We found that clustering coefficient, mutual clustering coefficient, and k-connectivity are better indicators of known protein complexes than edge density, degree, or betweenness. This suggests new directions for future protein complex-finding algorithms. PMID:26913183
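Three of the measures named above (edge density, clustering coefficient and vertex k-connectivity) can be computed for a small candidate complex as follows. This is a hedged sketch: the graph is assumed to be an undirected adjacency dictionary of sets, the function names are illustrative, and the connectivity routine is a brute-force search that is only practical for small subgraphs.

```python
from itertools import combinations


def edge_density(adj):
    """Fraction of possible edges that are present."""
    n = len(adj)
    if n < 2:
        return 0.0
    edges = sum(len(nbrs) for nbrs in adj.values()) / 2
    return edges / (n * (n - 1) / 2)


def clustering_coefficient(adj, node):
    """Fraction of pairs of neighbours of `node` that are themselves linked."""
    nbrs = adj[node]
    k = len(nbrs)
    if k < 2:
        return 0.0
    links = sum(1 for u, v in combinations(nbrs, 2) if v in adj[u])
    return links / (k * (k - 1) / 2)


def is_connected(adj, removed=frozenset()):
    """Depth-first search over the graph with `removed` vertices deleted."""
    nodes = [v for v in adj if v not in removed]
    if not nodes:
        return True
    seen = {nodes[0]}
    stack = [nodes[0]]
    while stack:
        u = stack.pop()
        for v in adj[u]:
            if v not in removed and v not in seen:
                seen.add(v)
                stack.append(v)
    return len(seen) == len(nodes)


def vertex_connectivity(adj):
    """Smallest number of vertices whose removal disconnects the graph.

    Brute force over vertex subsets; a complete graph on n vertices gets n - 1.
    """
    n = len(adj)
    for k in range(n - 1):
        for cut in combinations(adj, k):
            if not is_connected(adj, frozenset(cut)):
                return k
    return n - 1


# A 4-cycle and a 4-clique: same size, very different topology.
cycle = {0: {1, 3}, 1: {0, 2}, 2: {1, 3}, 3: {2, 0}}
clique = {v: {u for u in range(4) if u != v} for v in range(4)}
```

On these toy graphs the clique scores higher than the cycle on all three measures, which mirrors the intuition that tightly bound complexes are hard to disconnect.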
A new multi-scale method to reveal hierarchical modular structures in biological networks.
Jiao, Qing-Ju; Huang, Yan; Shen, Hong-Bin
2016-11-15
Biological networks are effective tools for studying molecular interactions. Modular structure, in which genes or proteins may tend to be associated with functional modules or protein complexes, is a remarkable feature of biological networks. Mining modular structure from biological networks enables us to focus on a set of potentially important nodes, which provides a reliable guide to future biological experiments. The first fundamental challenge in mining modular structure from biological networks is that the quality of the observed network data is usually low owing to noise and incompleteness in the obtained networks. The second problem that poses a challenge to existing approaches to the mining of modular structure is that the organization of both functional modules and protein complexes in networks is far more complicated than was ever thought. For instance, the sizes of different modules vary considerably from each other and they often form multi-scale hierarchical structures. To solve these problems, we propose a new multi-scale protocol for mining modular structure (named ISIMB) driven by a node similarity metric, which works in an iteratively converged space to reduce the effects of the low data quality of the observed network data. The multi-scale node similarity metric couples both the local and the global topology of the network with a resolution regulator. By varying this resolution regulator to give different weightings to the local and global terms in the metric, the ISIMB method is able to fit the shape of modules and to detect them on different scales. Experiments on protein-protein interaction and genetic interaction networks show that our method can not only mine functional modules and protein complexes successfully, but can also predict functional modules from specific to general and reveal the hierarchical organization of protein complexes.
Uhart, Marina; Flores, Gabriel; Bustos, Diego M.
2016-01-01
Posttranslational regulation of protein function is a ubiquitous mechanism in eukaryotic cells. Here, we analyzed biological properties of nodes and edges of a human protein-protein interaction phosphorylation-based network, especially of those nodes critical for the network controllability. We found that the minimal fraction of critical nodes needed to control the whole network is 29%, which is considerably lower than in other real networks. These critical nodes are more regulated by posttranslational modifications and contain more binding domains to these modifications than other kinds of nodes in the network, suggesting an intra-group fast regulation. Also, when we analyzed the characteristics of the edges that connect critical and non-critical nodes, we found that the former are enriched in domain-to-eukaryotic linear motif interactions, whereas the latter are enriched in domain-domain interactions. Our findings suggest a possible structure for protein-protein interaction networks with a densely interconnected and self-regulated central core, composed of critical nodes with a high participation in the controllability of the full network, and less regulated peripheral nodes. Our study offers a deeper understanding of complex network control and bridges the controllability theorems for complex networks and biological protein-protein interaction phosphorylation-based networked systems. PMID:27195976
Methods for the Analysis of Protein Phosphorylation-Mediated Cellular Signaling Networks
NASA Astrophysics Data System (ADS)
White, Forest M.; Wolf-Yadlin, Alejandro
2016-06-01
Protein phosphorylation-mediated cellular signaling networks regulate almost all aspects of cell biology, including the responses to cellular stimulation and environmental alterations. These networks are highly complex and comprise hundreds of proteins and potentially thousands of phosphorylation sites. Multiple analytical methods have been developed over the past several decades to identify proteins and protein phosphorylation sites regulating cellular signaling, and to quantify the dynamic response of these sites to different cellular stimulation. Here we provide an overview of these methods, including the fundamental principles governing each method, their relative strengths and weaknesses, and some examples of how each method has been applied to the analysis of complex signaling networks. When applied correctly, each of these techniques can provide insight into the topology, dynamics, and regulation of protein phosphorylation signaling networks.
Architecture of the human interactome defines protein communities and disease networks
Huttlin, Edward L.; Bruckner, Raphael J.; Paulo, Joao A.; Cannon, Joe R.; Ting, Lily; Baltier, Kurt; Colby, Greg; Gebreab, Fana; Gygi, Melanie P.; Parzen, Hannah; Szpyt, John; Tam, Stanley; Zarraga, Gabriela; Pontano-Vaites, Laura; Swarup, Sharan; White, Anne E.; Schweppe, Devin K.; Rad, Ramin; Erickson, Brian K.; Obar, Robert A.; Guruharsha, K.G.; Li, Kejie; Artavanis-Tsakonas, Spyros; Gygi, Steven P.; Harper, J. Wade
2017-01-01
The physiology of a cell can be viewed as the product of thousands of proteins acting in concert to shape the cellular response. Coordination is achieved in part through networks of protein-protein interactions that assemble functionally related proteins into complexes, organelles, and signal transduction pathways. Understanding the architecture of the human proteome has the potential to inform cellular, structural, and evolutionary mechanisms and is critical to elucidation of how genome variation contributes to disease [1–3]. Here, we present BioPlex 2.0 (Biophysical Interactions of ORFEOME-derived complexes), which employs robust affinity purification-mass spectrometry (AP-MS) methodology [4] to elucidate protein interaction networks and co-complexes nucleated by more than 25% of protein coding genes from the human genome, and constitutes the largest such network to date. With >56,000 candidate interactions, BioPlex 2.0 contains >29,000 previously unknown co-associations and provides functional insights into hundreds of poorly characterized proteins while enhancing network-based analyses of domain associations, subcellular localization, and co-complex formation. Unsupervised Markov clustering (MCL) [5] of interacting proteins identified more than 1300 protein communities representing diverse cellular activities. Genes essential for cell fitness [6,7] are enriched within 53 communities representing central cellular functions. Moreover, we identified 442 communities associated with more than 2000 disease annotations, placing numerous candidate disease genes into a cellular framework. BioPlex 2.0 exceeds previous experimentally derived interaction networks in depth and breadth, and will be a valuable resource for exploring the biology of incompletely characterized proteins and for elucidating larger-scale patterns of proteome organization. PMID:28514442
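Markov clustering alternates two matrix operations, expansion (squaring the transition matrix) and inflation (raising entries to a power and renormalising), until the matrix stops changing. The sketch below is a minimal dense-matrix illustration of that loop, not the implementation used for BioPlex 2.0; the function names, the inflation value of 2.0 and the 1e-6 read-out threshold are assumptions.

```python
def normalise_columns(m):
    """Scale every column so that it sums to 1 (column-stochastic matrix)."""
    n = len(m)
    sums = [sum(m[i][j] for i in range(n)) for j in range(n)]
    return [[m[i][j] / sums[j] for j in range(n)] for i in range(n)]


def mcl(adjacency, inflation=2.0, max_iter=100, tol=1e-9):
    """Cluster an undirected graph given as a dense adjacency matrix."""
    n = len(adjacency)
    # Self-loops are added before normalising, as is customary for MCL.
    m = normalise_columns(
        [[float(adjacency[i][j]) + (1.0 if i == j else 0.0) for j in range(n)]
         for i in range(n)]
    )
    for _ in range(max_iter):
        # Expansion: flow spreads along two-step random walks.
        expanded = [[sum(m[i][k] * m[k][j] for k in range(n)) for j in range(n)]
                    for i in range(n)]
        # Inflation: strong flows are boosted, weak flows are damped.
        inflated = normalise_columns(
            [[v ** inflation for v in row] for row in expanded]
        )
        change = max(abs(inflated[i][j] - m[i][j])
                     for i in range(n) for j in range(n))
        m = inflated
        if change < tol:
            break
    # Each non-empty row is an attractor; its non-zero columns form one cluster.
    clusters = {frozenset(j for j in range(n) if m[i][j] > 1e-6) for i in range(n)}
    clusters.discard(frozenset())
    return sorted(sorted(c) for c in clusters)


# Toy network: two triangles (0,1,2) and (3,4,5) joined by the edge 2-3.
toy = [[0, 1, 1, 0, 0, 0],
       [1, 0, 1, 0, 0, 0],
       [1, 1, 0, 1, 0, 0],
       [0, 0, 1, 0, 1, 1],
       [0, 0, 0, 1, 0, 1],
       [0, 0, 0, 1, 1, 0]]

if __name__ == "__main__":
    print(mcl(toy))
```

Higher inflation values generally yield more, smaller clusters. Production use on networks of this size would rely on a sparse-matrix implementation rather than nested lists.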
Sardiu, Mihaela E; Gilmore, Joshua M; Carrozza, Michael J; Li, Bing; Workman, Jerry L; Florens, Laurence; Washburn, Michael P
2009-10-06
Protein complexes are key molecular machines executing a variety of essential cellular processes. Despite the availability of genome-wide protein-protein interaction studies, determining the connectivity between proteins within a complex remains a major challenge. Here we demonstrate a method that is able to predict the relationship of proteins within a stable protein complex. We employed a combination of computational approaches and a systematic collection of quantitative proteomics data from wild-type and deletion strain purifications to build a quantitative deletion-interaction network map and subsequently convert the resulting data into an interdependency-interaction model of a complex. We applied this approach to a data set generated from components of the Saccharomyces cerevisiae Rpd3 histone deacetylase complexes, which consist of two distinct complexes, one small and one large, that are held together by a module consisting of Rpd3, Sin3 and Ume1. The resulting representation reveals new protein-protein interactions and new submodule relationships, providing novel information for mapping the functional organization of a complex.
Identification of hybrid node and link communities in complex networks
He, Dongxiao; Jin, Di; Chen, Zheng; Zhang, Weixiong
2015-01-01
Identifying communities in complex networks is an effective means for analyzing complex systems, with applications in diverse areas such as social science, engineering, biology and medicine. Finding communities of nodes and finding communities of links are two popular schemes for network analysis. These schemes, however, have inherent drawbacks and are inadequate to capture complex organizational structures in real networks. We introduce a new scheme and an effective approach for identifying complex mixture structures of node and link communities, called hybrid node-link communities. A central piece of our approach is a probabilistic model that accommodates node, link and hybrid node-link communities. Our extensive experiments on various real-world networks, including a large protein-protein interaction network and a large network of semantically associated words, illustrated that the scheme for hybrid communities is superior in revealing network characteristics. Moreover, the new approach outperformed the existing methods for finding node or link communities separately. PMID:25728010
A Novel Algorithm for Detecting Protein Complexes with the Breadth First Search
Tang, Xiwei; Wang, Jianxin; Li, Min; He, Yiming; Pan, Yi
2014-01-01
Most biological processes are carried out by protein complexes. A substantial number of false positives in protein-protein interaction (PPI) data can compromise the utility of the datasets for complex reconstruction. In order to reduce the impact of such discrepancies, a number of data integration and affinity scoring schemes have been devised. These methods encode the reliability (confidence) of physical interactions between pairs of proteins. The challenge now is to identify novel and meaningful protein complexes from the weighted PPI network. To address this problem, a novel protein complex mining algorithm, ClusterBFS (Cluster with Breadth-First Search), is proposed. Based on the weighted density, ClusterBFS detects protein complexes in the weighted network by a breadth-first search that starts from a given seed protein. The experimental results show that ClusterBFS performs significantly better than the other computational approaches in terms of the identification of protein complexes. PMID:24818139
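The seed-and-grow idea described above can be sketched as a breadth-first search that admits a neighbouring protein only while the cluster stays dense enough. The abstract does not give the exact density formula or acceptance rule, so everything here is an assumption: weighted density is taken as the sum of internal edge weights divided by the number of possible pairs, and `min_density` is a hypothetical threshold.

```python
from collections import deque


def weighted_density(cluster, weights):
    """Sum of internal edge weights divided by the number of possible pairs."""
    n = len(cluster)
    if n < 2:
        return 0.0
    members = sorted(cluster)
    total = sum(weights.get(frozenset((u, v)), 0.0)
                for i, u in enumerate(members) for v in members[i + 1:])
    return total / (n * (n - 1) / 2)


def grow_cluster(seed, neighbours, weights, min_density=0.5):
    """Grow a cluster from `seed` by breadth-first search.

    A candidate is added only if the enlarged cluster keeps a weighted
    density of at least `min_density` (a hypothetical threshold).
    """
    cluster = {seed}
    visited = {seed}
    queue = deque([seed])
    while queue:
        u = queue.popleft()
        for v in sorted(neighbours[u]):
            if v in visited:
                continue
            visited.add(v)
            if weighted_density(cluster | {v}, weights) >= min_density:
                cluster.add(v)
                queue.append(v)
    return cluster


# Toy weighted network: a reliable triangle A-B-C plus a weak link C-D.
neighbours = {"A": {"B", "C"}, "B": {"A", "C"}, "C": {"A", "B", "D"}, "D": {"C"}}
weights = {
    frozenset(("A", "B")): 0.9,
    frozenset(("A", "C")): 0.9,
    frozenset(("B", "C")): 0.9,
    frozenset(("C", "D")): 0.2,
}
complex_from_a = grow_cluster("A", neighbours, weights)
```

In this toy case the weakly attached protein D is rejected because adding it would pull the weighted density below the threshold.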
Evolution of SH2 domains and phosphotyrosine signalling networks
Liu, Bernard A.; Nash, Piers D.
2012-01-01
Src homology 2 (SH2) domains mediate selective protein–protein interactions with tyrosine phosphorylated proteins, and in doing so define specificity of phosphotyrosine (pTyr) signalling networks. SH2 domains and protein-tyrosine phosphatases expand alongside protein-tyrosine kinases (PTKs) to coordinate cellular and organismal complexity in the evolution of the unikont branch of the eukaryotes. Examination of conserved families of PTKs and SH2 domain proteins provides fiduciary marks that trace the evolutionary landscape for the development of complex cellular systems in the proto-metazoan and metazoan lineages. The evolutionary provenance of conserved SH2 and PTK families reveals the mechanisms by which diversity is achieved through adaptations in tissue-specific gene transcription, altered ligand binding, insertions of linear motifs and the gain or loss of domains following gene duplication. We discuss mechanisms by which pTyr-mediated signalling networks evolve through the development of novel and expanded families of SH2 domain proteins and the elaboration of connections between pTyr-signalling proteins. These changes underlie the variety of general and specific signalling networks that give rise to tissue-specific functions and increasingly complex developmental programmes. Examination of SH2 domains from an evolutionary perspective provides insight into the process by which evolutionary expansion and modification of molecular protein interaction domain proteins permits the development of novel protein-interaction networks and accommodates adaptation of signalling networks. PMID:22889907
Protein intrinsic disorder in plants
Pazos, Florencio; Pietrosemoli, Natalia; García-Martín, Juan A.; Solano, Roberto
2013-01-01
To some extent contradicting the classical paradigm of the relationship between protein 3D structure and function, now it is clear that large portions of the proteomes, especially in higher organisms, lack a fixed structure and still perform very important functions. Proteins completely or partially unstructured in their native (functional) form are involved in key cellular processes underlain by complex networks of protein interactions. The intrinsic conformational flexibility of these disordered proteins allows them to bind multiple partners in transient interactions of high specificity and low affinity. In concordance, in plants this type of proteins has been found in processes requiring these complex and versatile interaction networks. These include transcription factor networks, where disordered proteins act as integrators of different signals or link different transcription factor subnetworks due to their ability to interact (in many cases simultaneously) with different partners. Similarly, they also serve as signal integrators in signaling cascades, such as those related to response to external stimuli. Disordered proteins have also been found in plants in many stress-response processes, acting as protein chaperones or protecting other cellular components and structures. In plants, it is especially important to have complex and versatile networks able to quickly and efficiently respond to changing environmental conditions since these organisms cannot escape and have no other choice than adapting to them. Consequently, protein disorder can play an especially important role in plants, providing them with a fast mechanism to obtain complex, interconnected and versatile molecular networks. PMID:24062761
Ultrastable cellulosome-adhesion complex tightens under load
Schoeler, Constantin; Malinowska, Klara H.; Bernardi, Rafael C.; Milles, Lukas F.; Jobst, Markus A.; Durner, Ellis; Ott, Wolfgang; Fried, Daniel B.; Bayer, Edward A.; Schulten, Klaus; Gaub, Hermann E.; Nash, Michael A.
2014-01-01
Challenging environments have guided nature in the development of ultrastable protein complexes. Specialized bacteria produce discrete multi-component protein networks called cellulosomes to effectively digest lignocellulosic biomass. While network assembly is enabled by protein interactions with commonplace affinities, we show that certain cellulosomal ligand–receptor interactions exhibit extreme resistance to applied force. Here, we characterize the ligand–receptor complex responsible for substrate anchoring in the Ruminococcus flavefaciens cellulosome using single-molecule force spectroscopy and steered molecular dynamics simulations. The complex withstands forces of 600–750 pN, making it one of the strongest bimolecular interactions reported, equivalent to half the mechanical strength of a covalent bond. Our findings demonstrate force activation and inter-domain stabilization of the complex, and suggest that certain network components serve as mechanical effectors for maintaining network integrity. This detailed understanding of cellulosomal network components may help in the development of biocatalysts for production of fuels and chemicals from renewable plant-derived biomass. PMID:25482395
Rapid Sampling of Hydrogen Bond Networks for Computational Protein Design.
Maguire, Jack B; Boyken, Scott E; Baker, David; Kuhlman, Brian
2018-05-08
Hydrogen bond networks play a critical role in determining the stability and specificity of biomolecular complexes, and the ability to design such networks is important for engineering novel structures, interactions, and enzymes. One key feature of hydrogen bond networks that makes them difficult to rationally engineer is that they are highly cooperative and are not energetically favorable until the hydrogen bonding potential has been satisfied for all buried polar groups in the network. Existing computational methods for protein design are ill-equipped for creating these highly cooperative networks because they rely on energy functions and sampling strategies that are focused on pairwise interactions. To enable the design of complex hydrogen bond networks, we have developed a new sampling protocol in the molecular modeling program Rosetta that explicitly searches for sets of amino acid mutations that can form self-contained hydrogen bond networks. For a given set of designable residues, the protocol often identifies many alternative sets of mutations/networks, and we show that it can readily be applied to large sets of residues at protein-protein interfaces or in the interior of proteins. The protocol builds on a recently developed method in Rosetta for designing hydrogen bond networks that has been experimentally validated for small symmetric systems but was not extensible to many larger protein structures and complexes. The sampling protocol we describe here not only recapitulates previously validated designs with performance improvements but also yields viable hydrogen bond networks for cases where the previous method fails, such as the design of large, asymmetric interfaces relevant to engineering protein-based therapeutics.
NASA Astrophysics Data System (ADS)
Keane, Harriet; Ryan, Brent J.; Jackson, Brendan; Whitmore, Alan; Wade-Martins, Richard
2015-11-01
Neurodegenerative diseases are complex multifactorial disorders characterised by the interplay of many dysregulated physiological processes. As an exemplar, Parkinson’s disease (PD) involves multiple perturbed cellular functions, including mitochondrial dysfunction and autophagic dysregulation in preferentially-sensitive dopamine neurons, a selective pathophysiology recapitulated in vitro using the neurotoxin MPP+. Here we explore a network science approach for the selection of therapeutic protein targets in the cellular MPP+ model. We hypothesised that analysis of protein-protein interaction networks modelling MPP+ toxicity could identify proteins critical for mediating MPP+ toxicity. Analysis of protein-protein interaction networks constructed to model the interplay of mitochondrial dysfunction and autophagic dysregulation (key aspects of MPP+ toxicity) enabled us to identify four proteins predicted to be key for MPP+ toxicity (P62, GABARAP, GBRL1 and GBRL2). Combined, but not individual, knockdown of these proteins increased cellular susceptibility to MPP+ toxicity. Conversely, combined, but not individual, over-expression of the network targets provided rescue of MPP+ toxicity associated with the formation of autophagosome-like structures. We also found that modulation of two distinct proteins in the protein-protein interaction network was necessary and sufficient to mitigate neurotoxicity. Together, these findings validate our network science approach to multi-target identification in complex neurological diseases.
Srihari, Sriganesh; Yong, Chern Han; Patil, Ashwini; Wong, Limsoon
2015-09-14
Complexes of physically interacting proteins constitute fundamental functional units responsible for driving biological processes within cells. A faithful reconstruction of the entire set of complexes is therefore essential to understand the functional organisation of cells. In this review, we discuss the key contributions of computational methods developed to date (approximately between 2003 and 2015) for identifying complexes from the network of interacting proteins (PPI network). We evaluate in depth the performance of these methods on PPI datasets from yeast, and highlight their limitations and challenges, in particular at detecting sparse and small or sub-complexes and discerning overlapping complexes. We describe methods for integrating diverse information including expression profiles and 3D structures of proteins with PPI networks to understand the dynamics of complex formation, for instance, of time-based assembly of complex subunits and formation of fuzzy complexes from intrinsically disordered proteins. Finally, we discuss methods for identifying dysfunctional complexes in human diseases, an application that is proving invaluable to understand disease mechanisms and to discover novel therapeutic targets. We hope this review aptly commemorates a decade of research on computational prediction of complexes and constitutes a valuable reference for further advancements in this exciting area.
Characterization of the ternary Usher syndrome SANS/ush2a/whirlin protein complex.
Sorusch, Nasrin; Bauß, Katharina; Plutniok, Janet; Samanta, Ananya; Knapp, Barbara; Nagel-Wolfrum, Kerstin; Wolfrum, Uwe
2017-03-15
Usher syndrome (USH) is the most common form of inherited deaf-blindness, accompanied by vestibular dysfunction. Due to the heterogeneous manifestation of the clinical symptoms, three USH types (USH1-3) and additional atypical forms are distinguished. USH1 and USH2 proteins have been shown to function together in multiprotein networks in photoreceptor cells and hair cells. Mutations in USH proteins are considered to disrupt distinct USH protein networks and finally lead to the development of USH. To get novel insights into the molecular pathomechanisms underlying USH, we further characterize the periciliary USH protein network in photoreceptor cells. We show the direct interaction between the scaffold protein SANS (USH1G) and the transmembrane adhesion protein ush2a and that both assemble into a ternary USH1/USH2 complex together with the PDZ-domain protein whirlin (USH2D) via mutual interactions. Immunohistochemistry and proximity ligation assays demonstrate co-localization of complex partners and complex formation, respectively, in the periciliary region, the inner segment and at the synapses of rodent and human photoreceptor cells. Protein-protein interaction assays and co-expression of complex partners reveal that pathogenic mutations in USH1G severely affect formation of the SANS/ush2a/whirlin complex. Translational read-through drug treatment, targeting the c.728C > A (p.S243X) nonsense mutation, restored SANS scaffold function. We conclude that USH1 and USH2 proteins function together in higher order protein complexes. The maintenance of USH1/USH2 protein complexes depends on multiple USH1/USH2 protein interactions, which are disrupted by pathogenic mutations in USH1G protein SANS.
Benzekry, Sebastian; Tuszynski, Jack A; Rietman, Edward A; Lakka Klement, Giannoula
2015-05-28
The ever-increasing expanse of online bioinformatics data is enabling new ways not only to explore the visualization of these data, but also to apply novel mathematical methods to extract meaningful information for clinically relevant analysis of pathways and treatment decisions. One of the methods used for computing topological characteristics of a space at different spatial resolutions is persistent homology. This concept can also be applied to network theory, and more specifically to protein-protein interaction networks, where the number of rings in an individual cancer network represents a measure of complexity. We observed a linear correlation of R = -0.55 between persistent homology and 5-year survival of patients with a variety of cancers. This relationship was used to predict the proteins within a protein-protein interaction network with the most impact on cancer progression. By re-computing the persistent homology after computationally removing an individual node (protein) from the protein-protein interaction network, we were able to evaluate whether such an inhibition would lead to improvement in patient survival. The power of this approach lay in its ability to identify the effects of inhibition of multiple proteins and in the ability to expose whether the effect of a single inhibition may be amplified by inhibition of other proteins. More importantly, we illustrate specific examples of persistent homology calculations, which correctly predict the survival benefit observed in clinical trials using inhibitors of the identified molecular target. We propose that computational approaches such as persistent homology may be used in the future for selection of molecular therapies in clinic. The technique uses a mathematical algorithm to evaluate the node (protein) whose inhibition has the highest potential to reduce network complexity.
The greater the drop in persistent homology, the greater reduction in network complexity, and thus a larger potential for survival benefit. We hope that the use of advanced mathematics in medicine will provide timely information about the best drug combination for patients, and avoid the expense associated with an unsuccessful clinical trial, where drug(s) did not show a survival benefit.
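The node-removal idea in this abstract can be illustrated with a small sketch. It is not the authors' persistent-homology pipeline, which the abstract does not specify. It assumes that the "number of rings" can be approximated by the cycle rank (first Betti number) of the graph, computed as edges - nodes + connected components. The toy network and the function names are hypothetical.

```python
def count_components(nodes, edges):
    """Count connected components with a simple union-find."""
    parent = {v: v for v in nodes}

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    for a, b in edges:
        parent[find(a)] = find(b)
    return len({find(v) for v in nodes})


def cycle_rank(nodes, edges):
    """First Betti number of a graph: edges - nodes + components."""
    return len(edges) - len(nodes) + count_components(nodes, edges)


def drop_after_removal(nodes, edges, target):
    """Reduction in cycle rank when `target` and its edges are deleted."""
    kept_nodes = [v for v in nodes if v != target]
    kept_edges = [(a, b) for a, b in edges if target not in (a, b)]
    return cycle_rank(nodes, edges) - cycle_rank(kept_nodes, kept_edges)


# Hypothetical toy network: two triangles sharing protein "C".
nodes = ["A", "B", "C", "D", "E"]
edges = [("A", "B"), ("B", "C"), ("A", "C"),
         ("C", "D"), ("D", "E"), ("C", "E")]

# Rank proteins by how much their removal reduces the number of rings.
ranking = sorted(nodes,
                 key=lambda v: drop_after_removal(nodes, edges, v),
                 reverse=True)
```

In this toy case the shared protein "C" ranks first, because deleting it destroys both rings at once.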
Holden, Brian J; Pinney, John W; Lovell, Simon C; Amoutzias, Grigoris D; Robertson, David L
2007-01-01
Background Alternative representations of biochemical networks emphasise different aspects of the data and contribute to the understanding of complex biological systems. In this study we present a variety of automated methods for visualisation of a protein-protein interaction network, using the basic helix-loop-helix (bHLH) family of transcription factors as an example. Results Network representations that arrange nodes (proteins) according to either continuous or discrete information are investigated, revealing the existence of protein sub-families and the retention of interactions following gene duplication events. Methods of network visualisation in conjunction with a phylogenetic tree are presented, highlighting the evolutionary relationships between proteins, and clarifying the context of network hubs and interaction clusters. Finally, an optimisation technique is used to create a three-dimensional layout of the phylogenetic tree upon which the protein-protein interactions may be projected. Conclusion We show that by incorporating secondary genomic, functional or phylogenetic information into network visualisation, it is possible to move beyond simple layout algorithms based on network topology towards more biologically meaningful representations. These new visualisations can give structure to complex networks and will greatly help in interpreting their evolutionary origins and functional implications. Three open source software packages (InterView, TVi and OptiMage) implementing our methods are available. PMID:17683601
2013-01-01
Despite its prominence for characterization of complex mixtures, LC–MS/MS frequently fails to identify many proteins. Network-based analysis methods, based on protein–protein interaction networks (PPINs), biological pathways, and protein complexes, are useful for recovering non-detected proteins, thereby enhancing analytical resolution. However, network-based analysis methods do come in varied flavors for which the respective efficacies are largely unknown. We compare the recovery performance and functional insights from three distinct instances of PPIN-based approaches, viz., Proteomics Expansion Pipeline (PEP), Functional Class Scoring (FCS), and Maxlink, in a test scenario of valproic acid (VPA)-treated mice. We find that the most comprehensive functional insights, as well as best non-detected protein recovery performance, are derived from FCS utilizing real biological complexes. This outstrips other network-based methods such as Maxlink or Proteomics Expansion Pipeline (PEP). From FCS, we identified known biological complexes involved in epigenetic modifications, neuronal system development, and cytoskeletal rearrangements. This is congruent with the observed phenotype where adult mice showed an increase in dendritic branching to allow the rewiring of visual cortical circuitry and an improvement in their visual acuity when tested behaviorally. In addition, PEP also identified a novel complex, comprising YWHAB, NR1, NR2B, ACTB, and TJP1, which is functionally related to the observed phenotype. Although our results suggest different network analysis methods can produce different results, on the whole, the findings are mutually supportive. More critically, the non-overlapping information each provides can provide greater holistic understanding of complex phenotypes. PMID:23557376
Protein complex prediction for large protein protein interaction networks with the Core&Peel method.
Pellegrini, Marco; Baglioni, Miriam; Geraci, Filippo
2016-11-08
Biological networks play an increasingly important role in the exploration of functional modularity and cellular organization at a systemic level. Quite often the first tools used to analyze these networks are clustering algorithms. We concentrate here on the specific task of predicting protein complexes (PC) in large protein-protein interaction networks (PPIN). Currently, many state-of-the-art algorithms work well for networks of small or moderate size. However, their performance on much larger networks, which are becoming increasingly common in modern proteome-wide studies, needs to be re-assessed. We present a new fast algorithm for clustering large sparse networks: Core&Peel, which runs essentially in time and storage O(a(G)m+n) for a network G of n nodes and m arcs, where a(G) is the arboricity of G (which is roughly proportional to the maximum average degree of any induced subgraph in G). We evaluated Core&Peel on five PPI networks of large size and one of medium size from both yeast and Homo sapiens, comparing its performance against those of ten state-of-the-art methods. We demonstrate that Core&Peel consistently outperforms the ten competitors in its ability to identify known protein complexes and in the functional coherence of its predictions. Our method is remarkably robust, being quite insensitive to the injection of random interactions. Core&Peel is also empirically efficient, attaining the second best running time over large networks among the tested algorithms. Our algorithm Core&Peel pushes forward the state-of-the-art in PPIN clustering, providing an algorithmic solution with polynomial running time that attains experimentally demonstrable good output quality and speed on challenging large real networks.
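The abstract does not spell out the steps of Core&Peel. As background only, and assuming that the "core" in the name refers to the standard k-core decomposition (an assumption, not something the abstract states), the sketch below computes core numbers by repeatedly peeling the node of lowest remaining degree. This simple version is quadratic in the number of nodes and makes no attempt to match the running time quoted above. The toy graph is hypothetical.

```python
def core_numbers(adj):
    """Return the core number of every node by repeatedly peeling
    the node with the smallest remaining degree."""
    degree = {v: len(nbrs) for v, nbrs in adj.items()}
    remaining = set(adj)
    core = {}
    k = 0
    while remaining:
        v = min(remaining, key=lambda u: degree[u])
        # The core number never decreases along the peeling order.
        k = max(k, degree[v])
        core[v] = k
        remaining.remove(v)
        for u in adj[v]:
            if u in remaining:
                degree[u] -= 1
    return core


# Hypothetical toy graph: a 4-clique {a, b, c, d} with a pendant node e.
adj = {
    "a": {"b", "c", "d", "e"},
    "b": {"a", "c", "d"},
    "c": {"a", "b", "d"},
    "d": {"a", "b", "c"},
    "e": {"a"},
}
core = core_numbers(adj)
```

Nodes with a high core number sit in densely connected regions, which is why core numbers are a common starting point when searching for dense subgraphs.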
Wang, Jingwen; Zhao, Yuqi; Wang, Yanjie; Huang, Jingfei
2013-01-16
Coevolution between proteins is crucial for understanding protein-protein interaction. Simultaneous changes allow a protein complex to maintain its overall structural-functional integrity. In this study, we combined statistical coupling analysis (SCA) and molecular dynamics simulations on the CDK6-CDKN2A protein complex to evaluate coevolution between proteins. We reconstructed an inter-protein residue coevolution network, consisting of 37 residues and 37 interactions. It shows that most of the coevolved residue pairs are spatially proximal. When the mutations happened, the stable local structures were broken up and thus the protein interaction was decreased or inhibited, with a following increased risk of melanoma. The identification of inter-protein coevolved residues in the CDK6-CDKN2A complex can be helpful for designing protein engineering experiments. Copyright © 2012 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Network Analysis of Protein Adaptation: Modeling the Functional Impact of Multiple Mutations
Beleva Guthrie, Violeta; Masica, David L; Fraser, Andrew; Federico, Joseph; Fan, Yunfan; Camps, Manel; Karchin, Rachel
2018-01-01
Abstract The evolution of new biochemical activities frequently involves complex dependencies between mutations and rapid evolutionary radiation. Mutation co-occurrence and covariation have previously been used to identify compensating mutations that are the result of physical contacts and preserve protein function and fold. Here, we model pairwise functional dependencies and higher order interactions that enable evolution of new protein functions. We use a network model to find complex dependencies between mutations resulting from evolutionary trade-offs and pleiotropic effects. We present a method to construct these networks and to identify functionally interacting mutations in both extant and reconstructed ancestral sequences (Network Analysis of Protein Adaptation). The time ordering of mutations can be incorporated into the networks through phylogenetic reconstruction. We apply NAPA to three distantly homologous β-lactamase protein clusters (TEM, CTX-M-3, and OXA-51), each of which has experienced recent evolutionary radiation under substantially different selective pressures. By analyzing the network properties of each protein cluster, we identify key adaptive mutations, positive pairwise interactions, different adaptive solutions to the same selective pressure, and complex evolutionary trajectories likely to increase protein fitness. We also present evidence that incorporating information from phylogenetic reconstruction and ancestral sequence inference can reduce the number of spurious links in the network, whereas preserving overall network community structure. The analysis does not require structural or biochemical data. In contrast to function-preserving mutation dependencies, which are frequently from structural contacts, gain-of-function mutation dependencies are most commonly between residues distal in protein structure. PMID:29522102
The Robustness of a Signaling Complex to Domain Rearrangements Facilitates Network Evolution
Sato, Paloma M.; Yoganathan, Kogulan; Jung, Jae H.; Peisajovich, Sergio G.
2014-01-01
The rearrangement of protein domains is known to have key roles in the evolution of signaling networks and, consequently, is a major tool used to synthetically rewire networks. However, natural mutational events leading to the creation of proteins with novel domain combinations, such as in frame fusions followed by domain loss, retrotranspositions, or translocations, to name a few, often simultaneously replace pre-existing genes. Thus, while proteins with new domain combinations may establish novel network connections, it is not clear how the concomitant deletions are tolerated. We investigated the mechanisms that enable signaling networks to tolerate domain rearrangement-mediated gene replacements. Using as a model system the yeast mitogen activated protein kinase (MAPK)-mediated mating pathway, we analyzed 92 domain-rearrangement events affecting 11 genes. Our results indicate that, while domain rearrangement events that result in the loss of catalytic activities within the signaling complex are not tolerated, domain rearrangements can drastically alter protein interactions without impairing function. This suggests that signaling complexes can maintain function even when some components are recruited to alternative sites within the complex. Furthermore, we also found that the ability of the complex to tolerate changes in interaction partners does not depend on long disordered linkers that often connect domains. Taken together, our results suggest that some signaling complexes are dynamic ensembles with loose spatial constraints that could be easily re-shaped by evolution and, therefore, are ideal targets for cellular engineering. PMID:25490747
Pires, Mathias M.; Cantor, Maurício; Guimarães, Paulo R.; de Aguiar, Marcus A. M.; dos Reis, Sérgio F.; Coltri, Patricia P.
2015-01-01
The network structure of biological systems provides information on the underlying processes shaping their organization and dynamics. Here we examined the structure of the network depicting protein interactions within the spliceosome, the macromolecular complex responsible for splicing in eukaryotic cells. We show the interactions of less connected spliceosome proteins are nested subsets of the connections of the highly connected proteins. At the same time, the network has a modular structure with groups of proteins sharing similar interaction patterns. We then investigated the role of affinity and specificity in shaping the spliceosome network by adapting a probabilistic model originally designed to reproduce food webs. This food-web model was as successful in reproducing the structure of protein interactions as it is in reproducing interactions among species. The good performance of the model suggests affinity and specificity, partially determined by protein size and the timing of association to the complex, may be determining network structure. Moreover, because network models allow building ensembles of realistic networks while encompassing uncertainty, they can be useful to examine the dynamics and vulnerability of intracellular processes. Unraveling the mechanisms organizing the spliceosome interactions is important to characterize the role of individual proteins on splicing catalysis and regulation. PMID:26443080
Detecting complexes from edge-weighted PPI networks via genes expression analysis.
Zhang, Zehua; Song, Jian; Tang, Jijun; Xu, Xinying; Guo, Fei
2018-04-24
Identifying complexes from PPI networks has become a key problem in elucidating protein functions and identifying signal and biological processes in a cell. Proteins that bind as complexes play important roles in life activity. Accurate determination of complexes in PPI networks is crucial for understanding principles of cellular organization. We propose a novel method to identify complexes on PPI networks, based on different co-expression information. First, we use Markov Cluster Algorithm with an edge-weighting scheme to calculate complexes on PPI networks. Then, we propose some significant features, such as graph information and gene expression analysis, to filter and modify complexes predicted by Markov Cluster Algorithm. To evaluate our method, we test on two experimental yeast PPI networks. On the DIP network, our method has Precision and F-Measure values of 0.6004 and 0.5528. On the MIPS network, our method has F-Measure and Sn values of 0.3774 and 0.3453. Compared with existing methods, our method improves the Precision value by at least 0.1752, the F-Measure value by at least 0.0448, and the Sn value by at least 0.0771. Experiments show that our method achieves better results than some state-of-the-art methods for identifying complexes on PPI networks, with the prediction quality improved in terms of evaluation criteria.
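The first step described above, Markov clustering on an edge-weighted network, can be sketched as follows. This is a minimal textbook version of MCL (alternating expansion and inflation on a column-stochastic matrix), not the authors' implementation. The edge weights, the inflation value of 2.0, the fixed iteration count and the threshold are all illustrative assumptions, and the co-expression weighting and filtering steps from the abstract are not reproduced.

```python
def normalize_columns(matrix):
    """Scale every column so that it sums to 1 (column-stochastic)."""
    n = len(matrix)
    sums = [sum(matrix[i][j] for i in range(n)) for j in range(n)]
    return [[matrix[i][j] / sums[j] if sums[j] else 0.0 for j in range(n)]
            for i in range(n)]


def mcl_clusters(weights, inflation=2.0, iterations=50, threshold=1e-6):
    """Cluster a symmetric weighted adjacency matrix with basic MCL."""
    n = len(weights)
    # Add self-loops, then turn edge weights into transition probabilities.
    flow = normalize_columns(
        [[weights[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
         for i in range(n)])
    for _ in range(iterations):
        # Expansion: square the matrix (two-step random walks).
        flow = [[sum(flow[i][k] * flow[k][j] for k in range(n))
                 for j in range(n)] for i in range(n)]
        # Inflation: raise entries to a power and renormalize columns.
        flow = normalize_columns([[x ** inflation for x in row]
                                  for row in flow])
    # Nodes joined by surviving flow belong to the same cluster.
    parent = list(range(n))

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    for i in range(n):
        for j in range(n):
            if flow[i][j] > threshold:
                parent[find(i)] = find(j)
    groups = {}
    for v in range(n):
        groups.setdefault(find(v), []).append(v)
    return sorted(groups.values())


# Hypothetical network: two triangles joined by one weak edge (2-3).
weights = [
    [0.0, 1.0, 1.0, 0.0, 0.0, 0.0],
    [1.0, 0.0, 1.0, 0.0, 0.0, 0.0],
    [1.0, 1.0, 0.0, 0.2, 0.0, 0.0],
    [0.0, 0.0, 0.2, 0.0, 1.0, 1.0],
    [0.0, 0.0, 0.0, 1.0, 0.0, 1.0],
    [0.0, 0.0, 0.0, 1.0, 1.0, 0.0],
]
clusters = mcl_clusters(weights)
```

Raising the inflation parameter generally produces smaller, tighter clusters, so in practice it is the main value to tune.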
Structural reducibility of multilayer networks
NASA Astrophysics Data System (ADS)
de Domenico, Manlio; Nicosia, Vincenzo; Arenas, Alexandre; Latora, Vito
2015-04-01
Many complex systems can be represented as networks consisting of distinct types of interactions, which can be categorized as links belonging to different layers. For example, a good description of the full protein-protein interactome requires, for some organisms, up to seven distinct network layers, accounting for different genetic and physical interactions, each containing thousands of protein-protein relationships. A fundamental open question is then how many layers are indeed necessary to accurately represent the structure of a multilayered complex system. Here we introduce a method based on quantum theory to reduce the number of layers to a minimum while maximizing the distinguishability between the multilayer network and the corresponding aggregated graph. We validate our approach on synthetic benchmarks and we show that the number of informative layers in some real multilayer networks of protein-genetic interactions, social, economical and transportation systems can be reduced by up to 75%.
Chang, Dong W; Hayashi, Shinichi; Gharib, Sina A; Vaisar, Tomas; King, S Trevor; Tsuchiya, Mitsuhiro; Ruzinski, John T; Park, David R; Matute-Bello, Gustavo; Wurfel, Mark M; Bumgarner, Roger; Heinecke, Jay W; Martin, Thomas R
2008-10-01
Acute lung injury causes complex changes in protein expression in the lungs. Whereas most prior studies focused on single proteins, newer methods allowing the simultaneous study of many proteins could lead to a better understanding of pathogenesis and new targets for treatment. The purpose of this study was to examine the changes in protein expression in the bronchoalveolar lavage fluid (BALF) of patients during the course of the acute respiratory distress syndrome (ARDS). Using two-dimensional difference gel electrophoresis (DIGE), the expression of proteins in the BALF from patients on Days 1 (n = 7), 3 (n = 8), and 7 (n = 5) of ARDS were compared with findings in normal volunteers (n = 9). The patterns of protein expression were analyzed using principal component analysis (PCA). Biological processes that were enriched in the BALF proteins of patients with ARDS were identified using Gene Ontology (GO) analysis. Protein networks that model the protein interactions in the BALF were generated using Ingenuity Pathway Analysis. An average of 991 protein spots were detected using DIGE. Of these, 80 protein spots, representing 37 unique proteins in all of the fluids, were identified using mass spectrometry. PCA confirmed important differences between the proteins in the ARDS and normal samples. GO analysis showed that these differences are due to the enrichment of proteins involved in inflammation, infection, and injury. The protein network analysis showed that the protein interactions in ARDS are complex and redundant, and revealed unexpected central components in the protein networks. Proteomics and protein network analysis reveals the complex nature of lung protein interactions in ARDS. The results provide new insights about protein networks in injured lungs, and identify novel mediators that are likely to be involved in the pathogenesis and progression of acute lung injury.
Dissortativity and duplications in oral cancer
NASA Astrophysics Data System (ADS)
Shinde, Pramod; Yadav, Alok; Rai, Aparna; Jalan, Sarika
2015-08-01
More than 300 000 new cases of oral cancer are diagnosed worldwide annually. The complexity of oral cancer renders designing drug targets very difficult. We analyse the protein-protein interaction networks for normal and oral cancer tissue and detect crucial changes in the structural properties of the networks in terms of the interactions of the hub proteins and the degree-degree correlations. Further analysis shows that the spectra of both networks, while exhibiting universal statistical behaviour, differ in terms of the zero degeneracy, providing insight into the complexity of the underlying system.
Evolution of the Max and Mlx networks in animals.
McFerrin, Lisa G; Atchley, William R
2011-01-01
Transcription factors (TFs) are essential for the regulation of gene expression and often form emergent complexes to perform vital roles in cellular processes. In this paper, we focus on the parallel Max and Mlx networks of TFs because of their critical involvement in cell cycle regulation, proliferation, growth, metabolism, and apoptosis. A basic-helix-loop-helix-zipper (bHLHZ) domain mediates the competitive protein dimerization and DNA binding among Max and Mlx network members to form a complex system of cell regulation. To understand the importance of these network interactions, we identified the bHLHZ domain of Max and Mlx network proteins across the animal kingdom and carried out several multivariate statistical analyses. The presence and conservation of Max and Mlx network proteins in animal lineages stemming from the divergence of Metazoa indicate that these networks have ancient and essential functions. Phylogenetic analysis of the bHLHZ domain identified clear relationships among protein families with distinct points of radiation and divergence. Multivariate discriminant analysis further isolated specific amino acid changes within the bHLHZ domain that classify proteins, families, and network configurations. These analyses on Max and Mlx network members provide a model for characterizing the evolution of TFs involved in essential networks.
Rule-based modeling and simulations of the inner kinetochore structure.
Tschernyschkow, Sergej; Herda, Sabine; Gruenert, Gerd; Döring, Volker; Görlich, Dennis; Hofmeister, Antje; Hoischen, Christian; Dittrich, Peter; Diekmann, Stephan; Ibrahim, Bashar
2013-09-01
Combinatorial complexity is a central problem when modeling biochemical reaction networks, since the association of a few components can give rise to a large variation of protein complexes. Available classical modeling approaches are often insufficient for the analysis of very large and complex networks in detail. Recently, we developed a new rule-based modeling approach that facilitates the analysis of spatial and combinatorially complex problems. Here, we explore for the first time how this approach can be applied to a specific biological system, the human kinetochore, which is a multi-protein complex involving over 100 proteins. Applying our freely available SRSim software to a large data set on kinetochore proteins in human cells, we construct a spatial rule-based simulation model of the human inner kinetochore. The model generates an estimation of the probability distribution of the inner kinetochore 3D architecture and we show how to analyze this distribution using information theory. In our model, the formation of a bridge between CenpA and an H3 containing nucleosome only occurs efficiently for the higher protein concentration realized during S-phase, but may not occur in G1. Above a certain nucleosome distance, the protein bridge barely formed, pointing towards the importance of chromatin structure for kinetochore complex formation. We define a metric for the distance between structures that allows us to identify structural clusters. Using this modeling technique, we explore different hypothetical chromatin layouts. Applying a rule-based network analysis to the spatial kinetochore complex geometry allowed us to integrate experimental data on kinetochore proteins, suggesting a 3D model of the human inner kinetochore architecture that is governed by a combinatorial algebraic reaction network. This reaction network can serve as a bridge between multiple scales of modeling. Our approach can be applied to other systems beyond kinetochores. Copyright © 2013 Elsevier Ltd.
A generalized approach to complex networks
NASA Astrophysics Data System (ADS)
Costa, L. Da F.; da Rocha, L. E. C.
2006-03-01
This work describes how the formalization of complex network concepts in terms of discrete mathematics, especially mathematical morphology, allows a series of generalizations and important results ranging from new measurements of the network topology to new network growth models. First, the concepts of node degree and clustering coefficient are extended in order to characterize not only specific nodes, but any generic subnetwork. Second, the consideration of distance transform and rings are used to further extend those concepts in order to obtain a signature, instead of a single scalar measurement, ranging from the single node to whole graph scales. The enhanced discriminative potential of such extended measurements is illustrated with respect to the identification of correspondence between nodes in two complex networks, namely a protein-protein interaction network and a perturbed version of it.
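For reference, the two baseline measurements that this abstract generalizes can be computed per node as in the sketch below. It shows only the standard definitions of node degree and local clustering coefficient. The extensions to generic subnetworks and to ring-based signatures are not given in the abstract and are not attempted here. The toy graph is hypothetical.

```python
def degree(adj, v):
    """Number of neighbours of node v."""
    return len(adj[v])


def clustering_coefficient(adj, v):
    """Fraction of pairs of neighbours of v that are themselves linked."""
    nbrs = adj[v]
    k = len(nbrs)
    if k < 2:
        return 0.0
    # Count each linked pair of neighbours once (a < b).
    links = sum(1 for a in nbrs for b in nbrs if a < b and b in adj[a])
    return 2.0 * links / (k * (k - 1))


# Hypothetical toy graph: triangle A-B-C plus a pendant node D on A.
adj = {
    "A": {"B", "C", "D"},
    "B": {"A", "C"},
    "C": {"A", "B"},
    "D": {"A"},
}
coefficients = {v: clustering_coefficient(adj, v) for v in adj}
```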
AlignNemo: a local network alignment method to integrate homology and topology.
Ciriello, Giovanni; Mina, Marco; Guzzi, Pietro H; Cannataro, Mario; Guerra, Concettina
2012-01-01
Local network alignment is an important component of the analysis of protein-protein interaction networks that may lead to the identification of evolutionarily related complexes. We present AlignNemo, a new algorithm that, given the networks of two organisms, uncovers subnetworks of proteins that relate in biological function and topology of interactions. The discovered conserved subnetworks have a general topology and need not correspond to specific interaction patterns, so that they more closely fit the models of functional complexes proposed in the literature. The algorithm is able to handle sparse interaction data with an expansion process that at each step explores the local topology of the networks beyond the proteins directly interacting with the current solution. To assess the performance of AlignNemo, we ran a series of benchmarks using statistical measures as well as biological knowledge. Based on reference datasets of protein complexes, AlignNemo shows better performance than other methods in terms of both precision and recall. We show our solutions to be biologically sound using the concept of semantic similarity applied to Gene Ontology vocabularies. The binaries of AlignNemo and supplementary details about the algorithms and the experiments are available at: sourceforge.net/p/alignnemo.
The Curriculum Prerequisite Network: Modeling the Curriculum as a Complex System
ERIC Educational Resources Information Center
Aldrich, Preston R.
2015-01-01
This article advances the prerequisite network as a means to visualize the hidden structure in an academic curriculum. Networks have been used to represent a variety of complex systems ranging from social systems to biochemical pathways and protein interactions. Here, I treat the academic curriculum as a complex system with nodes representing…
Genes2Networks: connecting lists of gene symbols using mammalian protein interactions databases.
Berger, Seth I; Posner, Jeremy M; Ma'ayan, Avi
2007-10-04
In recent years, mammalian protein-protein interaction network databases have been developed. The interactions in these databases are either extracted manually from low-throughput experimental biomedical research literature, extracted automatically from literature using techniques such as natural language processing (NLP), generated experimentally using high-throughput methods such as yeast-2-hybrid screens, or interactions are predicted using an assortment of computational approaches. Genes or proteins identified as significantly changing in proteomic experiments, or identified as susceptibility disease genes in genomic studies, can be placed in the context of protein interaction networks in order to assign these genes and proteins to pathways and protein complexes. Genes2Networks is a software system that integrates the content of ten mammalian interaction network datasets. Filtering techniques to prune low-confidence interactions were implemented. Genes2Networks is delivered as a web-based service using AJAX. The system can be used to extract relevant subnetworks created from "seed" lists of human Entrez gene symbols. The output includes a dynamic linkable three color web-based network map, with a statistical analysis report that identifies significant intermediate nodes used to connect the seed list. Genes2Networks is powerful web-based software that can help experimental biologists to interpret lists of genes and proteins such as those commonly produced through genomic and proteomic experiments, as well as lists of genes and proteins associated with disease processes. This system can be used to find relationships between genes and proteins from seed lists, and predict additional genes or proteins that may play key roles in common pathways or protein complexes.
2012-01-01
Background The use of biological molecular network information for diagnostic and prognostic purposes and elucidation of molecular disease mechanism is a key objective in systems biomedicine. The network of regulatory miRNA-target and functional protein interactions is a rich source of information to elucidate the function and the prognostic value of miRNAs in cancer. The objective of this study is to identify miRNAs that have high influence on target protein complexes in prostate cancer as a case study. This could provide biomarkers or therapeutic targets relevant for prostate cancer treatment. Results Our findings demonstrate that a miRNA’s functional role can be explained by its target protein connectivity within a physical and functional interaction network. To detect miRNAs with high influence on target protein modules, we integrated miRNA and mRNA expression profiles with a sequence based miRNA-target network and human functional and physical protein interactions (FPI). miRNAs with high influence on target protein complexes play a role in prostate cancer progression and are promising diagnostic or prognostic biomarkers. We uncovered several miRNA-regulated protein modules which were enriched in focal adhesion and prostate cancer genes. Several miRNAs such as miR-96, miR-182, and miR-143 demonstrated high influence on their target protein complexes and could explain most of the gene expression changes in our analyzed prostate cancer data set. Conclusions We describe a novel method to identify active miRNA-target modules relevant to prostate cancer progression and outcome. miRNAs with high influence on protein networks are valuable biomarkers that can be used in clinical investigations for prostate cancer treatment. PMID:22929553
Folding energy landscape and network dynamics of small globular proteins
Hori, Naoto; Chikenji, George; Berry, R. Stephen; Takada, Shoji
2009-01-01
The folding energy landscape of proteins has been suggested to be funnel-like with some degree of ruggedness on the slope. How complex the landscape is, however, remains rather unclear. Many experiments for globular proteins suggested relative simplicity, whereas molecular simulations of shorter peptides implied more complexity. Here, by using complete conformational sampling of 2 globular proteins, protein G and src SH3 domain, and 2 related random peptides, we investigated their energy landscapes, topological properties of folding networks, and folding dynamics. The projected energy surfaces of globular proteins were funneled in the vicinity of the native state but also had other quite deep, accessible minima, whereas the randomized peptides had many local basins, including some leading to seriously misfolded forms. Dynamics in the denatured part of the network exhibited basin-hopping itinerancy among many conformations, whereas the proteins reached relatively well-defined final stages that led to their native states. We also found that the folding network has a hierarchic nature characterized by the scale-free and the small-world properties. PMID:19114654
NASA Astrophysics Data System (ADS)
Palla, Gergely; Derenyi, Imre; Farkas, Illes J.; Vicsek, Tamas
2006-03-01
Most tasks in a cell are performed not by individual proteins, but by functional groups of proteins (either physically interacting with each other or associated in other ways). In gene (protein) association networks these groups show up as sets of densely connected nodes. In the yeast, Saccharomyces cerevisiae, known physically interacting groups of proteins (called protein complexes) strongly overlap: the total number of proteins contained by these complexes is far smaller than the sum of their sizes (2750 vs. 8932). Thus, most functional groups of proteins, both physically interacting and other, are likely to share many of their members with other groups. However, current algorithms searching for dense groups of nodes in networks usually exclude overlaps. With the aim of discovering both novel functions of individual proteins and novel protein functional groups, we combine in protein association networks (i) a search for overlapping dense subgraphs based on the Clique Percolation Method (CPM) (Palla, G., et al. Nature 435, 814-818 (2005), http://angel.elte.hu/clustering), which explicitly allows for overlaps among the groups, and (ii) a verification and characterization of the identified groups of nodes (proteins) with the help of standard annotation databases listing known functions.
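As an illustration of the overlap-permitting search described in the abstract above, the following is a minimal sketch of clique percolation on a toy graph. It is not the authors' implementation: the function name `k_clique_communities`, the brute-force clique enumeration, and the protein labels are assumptions made for this example, and the approach only scales to very small networks.

```python
from itertools import combinations

def k_clique_communities(edges, k=3):
    """Return overlapping communities (list of frozensets) via clique percolation.

    Illustrative sketch only: k-cliques are enumerated by brute force.
    """
    # Build an undirected adjacency map.
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)

    # A k-clique is a set of k nodes in which every pair is connected.
    cliques = [
        frozenset(c)
        for c in combinations(sorted(adj), k)
        if all(b in adj[a] for a, b in combinations(c, 2))
    ]

    # Union-find over cliques: two k-cliques are adjacent if they share k-1 nodes.
    parent = list(range(len(cliques)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(cliques)), 2):
        if len(cliques[i] & cliques[j]) == k - 1:
            parent[find(i)] = find(j)

    # A community is the union of all nodes in one connected group of cliques.
    groups = {}
    for i, c in enumerate(cliques):
        groups.setdefault(find(i), set()).update(c)
    return [frozenset(g) for g in groups.values()]

# Toy network (hypothetical protein labels): two triangles that share protein "C".
edges = [("A", "B"), ("A", "C"), ("B", "C"),
         ("C", "D"), ("C", "E"), ("D", "E")]
communities = k_clique_communities(edges, k=3)
```

In this toy case the two triangles share only one node, so they are not merged, and "C" belongs to both communities, which is the kind of overlap the method is designed to allow.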
Predicting disease-related proteins based on clique backbone in protein-protein interaction network.
Yang, Lei; Zhao, Xudong; Tang, Xianglong
2014-01-01
Network biology integrates different kinds of data, including physical or functional networks and disease gene sets, to interpret human disease. A clique (maximal complete subgraph) in a protein-protein interaction network is a topological module and possesses inherent biological significance. A disease-related clique may be associated with complex diseases. Fully identifying the disease components in a clique is conducive to uncovering disease mechanisms. This paper proposes an approach for predicting disease proteins based on cliques in a protein-protein interaction network. To tolerate false positive and false negative interactions in protein networks, the clique-based method is extended with two steps: extending cliques and scoring predicted disease proteins with gene ontology terms. The precision of the predicted disease proteins is verified against disease phenotypes and remains steadily above 95%. The predicted disease proteins associated with cliques can partly complement the mapping between genotype and phenotype, and provide clues for understanding the pathogenesis of serious diseases.
Verma, Amit K; Diwan, Danish; Raut, Sandeep; Dobriyal, Neha; Brown, Rebecca E; Gowda, Vinita; Hines, Justin K; Sahi, Chandan
2017-06-07
Heat shock proteins of 70 kDa (Hsp70s) partner with structurally diverse Hsp40s (J proteins), generating distinct chaperone networks in various cellular compartments that perform myriad housekeeping and stress-associated functions in all organisms. Plants, being sessile, need to constantly maintain their cellular proteostasis in response to external environmental cues. In these situations, the Hsp70:J protein machines may play an important role in fine-tuning cellular protein quality control. Although ubiquitous, the functional specificity and complexity of the plant Hsp70:J protein network has not been studied. Here, we analyzed the J protein network in the cytosol of Arabidopsis thaliana and, using yeast genetics, show that the functional specificities of most plant J proteins in fundamental chaperone functions are conserved across long evolutionary timescales. Detailed phylogenetic and functional analysis revealed that increased number, regulatory differences, and neofunctionalization in J proteins together contribute to the emerging functional diversity and complexity in the Hsp70:J protein network in higher plants. Based on the data presented, we propose that higher plants have orchestrated their "chaperome," especially their J protein complement, according to their specialized cellular and physiological stipulations. Copyright © 2017 Verma et al.
The physics of complex systems in information and biology
NASA Astrophysics Data System (ADS)
Walker, Dylan
Citation networks have re-emerged as a topic of intense interest in the complex networks community with the recent availability of large-scale data sets. The ranking of citation networks is a necessary practice as a means to improve information navigability and search. Unlike many information networks, the aging characteristics of citation networks require the development of new ranking methods. To account for strong aging characteristics of citation networks, we modify the PageRank algorithm by initially distributing random surfers exponentially with age, in favor of more recent publications. The output of this algorithm, which we call CiteRank, is interpreted as approximate traffic to individual publications in a simple model of how researchers find new information. We optimize parameters of our algorithm to achieve the best performance. The results are compared for two rather different citation networks: all American Physical Society publications between 1893 and 2003 and the set of high-energy physics theory (hep-th) preprints. Despite major differences between these two networks, we find that their optimal parameters for the CiteRank algorithm are remarkably similar. The advantages and performance of CiteRank over more conventional methods of ranking publications are discussed. Collaborative voting systems have emerged as an abundant form of real-world, complex information systems that exist in a variety of online applications. These systems are comprised of large populations of users that collectively submit and vote on objects. While the specific properties of these systems vary widely, many of them share a core set of features and dynamical behaviors that govern their evolution. We study a subset of these systems that involve material of a time-critical nature as in the popular example of news items. We consider a general model system in which articles are introduced, voted on by a population of users, and subsequently expire after a prescribed period of time. 
To study the interaction between popularity and quality, we introduce simple stochastic models of user behavior that approximate differing user quality and susceptibility to the common notion of popularity. We define a metric to quantify user reputation in a manner that is self-consistent, adaptable and content-blind and shows good correlation with the probability that a user behaves in an optimal fashion. We further construct a mechanism for ranking documents that takes into account user reputation and provides substantial improvement in the time-critical performance of the system. The structure of complex systems has been well studied in the context of both information and biological systems. More recently, dynamics in complex systems that occur over the background of the underlying network have received a great deal of attention. In particular, the study of fluctuations in complex systems has emerged as an issue central to understanding dynamical behavior. We approach the problem of collective effects of the underlying network on dynamical fluctuations by considering the protein-protein interaction networks for the system of the living cell. We consider two types of fluctuations in the mass-action equilibrium in protein binding networks. The first type is driven by relatively slow changes in total concentrations (copy numbers) of interacting proteins. The second type, which we refer to as spontaneous, is caused by quickly decaying thermodynamic deviations away from the mass-action equilibrium of the system. As such they are amenable to methods of equilibrium statistical mechanics used in our study. We investigate the effects of network connectivity on these fluctuations by comparing them to different scenarios in which the interacting pair is isolated from the rest of the network. Such comparison allows us to analytically derive upper and lower bounds on network fluctuations. 
The collective effects are shown to sometimes lead to relatively large amplification of spontaneous fluctuations as compared to the expectation for isolated dimers. As a consequence of this, the strength of both types of fluctuations is positively correlated with the overall network connectivity of proteins forming the complex. On the other hand, the relative amplitude of fluctuations is negatively correlated with the equilibrium concentration of the complex. Our general findings are illustrated using a curated network of protein-protein interactions and multi-protein complexes in baker's yeast with experimentally determined protein concentrations.
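The age-biased PageRank variant mentioned at the start of this abstract (CiteRank) can be sketched roughly as follows. This is only an illustration of the idea that random surfers start at publications with probability decaying exponentially in age and then follow citations. The parameter names `tau` (decay time) and `alpha` (probability of following a reference at each step), the stopping rule, and the toy data are assumptions for this example and are not taken from the abstract.

```python
import math

def citerank(citations, ages, tau=2.0, alpha=0.5, n_steps=50):
    """Estimate traffic to each paper under an age-biased random-surfer model.

    citations: dict mapping a paper to the list of papers it cites
               (every cited paper must also appear in `ages`).
    ages:      dict mapping a paper to its age (e.g. in years).
    """
    papers = list(ages)

    # Initial distribution: start probability proportional to exp(-age / tau),
    # which favors recent publications.
    weights = {p: math.exp(-ages[p] / tau) for p in papers}
    total = sum(weights.values())
    rho = {p: w / total for p, w in weights.items()}

    traffic = dict(rho)   # accumulated visits per paper
    current = dict(rho)   # surfers arriving at this step
    for _ in range(n_steps):
        nxt = {p: 0.0 for p in papers}
        for p, mass in current.items():
            refs = citations.get(p, [])
            if not refs:
                continue  # no references to follow: these surfers stop
            share = alpha * mass / len(refs)
            for q in refs:
                nxt[q] += share
        for p in papers:
            traffic[p] += nxt[p]
        current = nxt
    return traffic

# Toy citation network (hypothetical papers): "new" cites both older papers.
ages = {"old": 10.0, "mid": 5.0, "new": 1.0}
citations = {"new": ["mid", "old"], "mid": ["old"]}
traffic = citerank(citations, ages)
```

In this toy example the newest paper keeps the largest share of traffic because of the exponential start bias, while the oldest paper still collects traffic through incoming citations.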
Evidence for network evolution in an arabidopsis interactome map
USDA-ARS's Scientific Manuscript database
Plants have unique features that evolved in response to their environments and ecosystems. A full account of the complex cellular networks that underlie plant-specific functions is still missing. We describe a proteome-wide binary protein-protein interaction map for the interactome network of the pl...
DeBlasio, Stacy L; Johnson, Richard; Sweeney, Michelle M; Karasev, Alexander; Gray, Stewart M; MacCoss, Michael J; Cilia, Michelle
2015-06-01
Potato leafroll virus (PLRV) produces a readthrough protein (RTP) via translational readthrough of the coat protein amber stop codon. The RTP functions as a structural component of the virion and as a nonincorporated protein in concert with numerous insect and plant proteins to regulate virus movement/transmission and tissue tropism. Affinity purification coupled to quantitative MS was used to generate protein interaction networks for a PLRV mutant that is unable to produce the readthrough domain (RTD), and these were compared to the known wild-type PLRV protein interaction network. By quantifying differences in the protein interaction networks, we identified four distinct classes of PLRV-plant interactions: those plant and nonstructural viral proteins interacting with assembled coat protein (category I); plant proteins in complex with both coat protein and RTD (category II); plant proteins in complex with the RTD (category III); and plant proteins that had higher affinity for virions lacking the RTD (category IV). Proteins identified as interacting with the RTD are potential candidates for regulating viral processes that are mediated by the RTP such as phloem retention and systemic movement and can potentially be useful targets for the development of strategies to prevent infection and/or viral transmission of Luteoviridae species that infect important crop species. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Dimitrakopoulos, Christos; Theofilatos, Konstantinos; Pegkas, Andreas; Likothanassis, Spiros; Mavroudi, Seferina
2016-07-01
Proteins are vital biological molecules driving many fundamental cellular processes. They rarely act alone, but form interacting groups called protein complexes. The study of protein complexes is a key goal in systems biology. Recently, large protein-protein interaction (PPI) datasets have been published and a plethora of computational methods that provide new ideas for the prediction of protein complexes have been implemented. However, most of the methods suffer from two major limitations: First, they do not account for proteins participating in multiple functions and second, they are unable to handle weighted PPI graphs. Moreover, the problem remains open as existing algorithms and tools are insufficient in terms of predictive metrics. In the present paper, we propose gradually expanding neighborhoods with adjustment (GENA), a new algorithm that gradually expands neighborhoods in a graph starting from highly informative "seed" nodes. GENA considers proteins as multifunctional molecules allowing them to participate in more than one protein complex. In addition, GENA accepts weighted PPI graphs by using a weighted evaluation function for each cluster. In experiments with datasets from Saccharomyces cerevisiae and human, GENA outperformed Markov clustering, restricted neighborhood search and clustering with overlapping neighborhood expansion, three state-of-the-art methods for computationally predicting protein complexes. Seven PPI networks and seven evaluation datasets were used in total. GENA outperformed existing methods in 16 out of 18 experiments achieving an average improvement of 5.5% when the maximum matching ratio metric was used. Our method was able to discover functionally homogeneous protein clusters and uncover important network modules in a Parkinson expression dataset. When used on the human networks, around 47% of the detected clusters were enriched in gene ontology (GO) terms with depth higher than five in the GO hierarchy. 
In the present manuscript, we introduce a new method for the computational prediction of protein complexes by making the realistic assumption that proteins participate in multiple protein complexes and cellular functions. Our method can detect accurate and functionally homogeneous clusters. Copyright © 2016 Elsevier B.V. All rights reserved.
Nam, Hyun-Jun; Kim, Inhae; Bowie, James U.; Kim, Sanguk
2015-01-01
A central question in animal evolution is how multicellular animals evolved from unicellular ancestors. We hypothesize that membrane proteins must be key players in the development of multicellularity because they are well positioned to form the cell-cell contacts and to provide the intercellular communication required for the creation of complex organisms. Here we find that a major mechanism for the necessary increase in membrane protein complexity in the transition from non-metazoan to metazoan life was the new incorporation of domains from soluble proteins. The membrane proteins that have incorporated soluble domains in metazoans are enriched in many of the functions unique to multicellular organisms such as cell-cell adhesion, signaling, immune defense and developmental processes. They also show enhanced protein-protein interaction (PPI) network complexity and centrality, suggesting an important role in the cellular diversification found in complex organisms. Our results expose an evolutionary mechanism that contributed to the development of higher life forms. PMID:25923201
Wang, W; Zhang, W; Jiang, R; Luan, Y
2010-05-01
It is of vital importance to find genetic variants that underlie human complex diseases and locate genes that are responsible for these diseases. Since proteins are typically composed of several structural domains, it is reasonable to assume that harmful genetic variants may alter structures of protein domains, affect functions of proteins and eventually cause disorders. With this understanding, the authors explore the possibility of recovering associations between protein domains and complex diseases. The authors define associations between protein domains and disease families on the basis of associations between non-synonymous single nucleotide polymorphisms (nsSNPs) and complex diseases, similarities between diseases, and relations between proteins and domains. Based on a domain-domain interaction network, the authors propose a 'guilt-by-proximity' principle to rank candidate domains according to their average distance to a set of seed domains in the domain-domain interaction network. The authors validate the method through large-scale cross-validation experiments on simulated linkage intervals, random controls and the whole genome. Results show that areas under receiver operating characteristic curves (AUC scores) can be as high as 77.90%, and the mean rank ratios can be as low as 21.82%. The authors further offer a freely accessible web interface for a genome-wide landscape of associations between domains and disease families.
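The 'guilt-by-proximity' ranking described above, in which candidates are ordered by their average distance to a set of seed domains, can be sketched with breadth-first search. This is a minimal illustration under assumptions: the function names, the toy domain labels, and the choice to average only over seeds that can reach a candidate are hypothetical and not taken from the paper.

```python
from collections import deque

def bfs_distances(adj, source):
    """Shortest-path (hop) distances from `source` in an unweighted graph."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for nb in adj.get(node, ()):
            if nb not in dist:
                dist[nb] = dist[node] + 1
                queue.append(nb)
    return dist

def rank_by_proximity(edges, seeds, candidates):
    """Rank candidates by average distance to the seeds (smaller is better)."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)

    seed_dists = [bfs_distances(adj, s) for s in seeds]
    scores = {}
    for c in candidates:
        ds = [d[c] for d in seed_dists if c in d]
        # Assumption: candidates unreachable from every seed get an infinite score.
        scores[c] = sum(ds) / len(ds) if ds else float("inf")
    return sorted(candidates, key=lambda c: scores[c]), scores

# Toy domain-domain interaction network (hypothetical labels): a simple path.
edges = [("d1", "d2"), ("d2", "d3"), ("d3", "d4"), ("d4", "d5")]
ranking, scores = rank_by_proximity(edges, seeds=["d1", "d2"],
                                    candidates=["d5", "d3", "d4"])
```

Here `d3` is ranked first because it lies closest on average to the two seed domains.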
A Global Protein Kinase and Phosphatase Interaction Network in Yeast
Breitkreutz, Ashton; Choi, Hyungwon; Sharom, Jeffrey R.; Boucher, Lorrie; Neduva, Victor; Larsen, Brett; Lin, Zhen-Yuan; Breitkreutz, Bobby-Joe; Stark, Chris; Liu, Guomin; Ahn, Jessica; Dewar-Darch, Danielle; Reguly, Teresa; Tang, Xiaojing; Almeida, Ricardo; Qin, Zhaohui Steve; Pawson, Tony; Gingras, Anne-Claude; Nesvizhskii, Alexey I.; Tyers, Mike
2011-01-01
The interactions of protein kinases and phosphatases with their regulatory subunits and substrates underpin cellular regulation. We identified a kinase and phosphatase interaction (KPI) network of 1844 interactions in budding yeast by mass spectrometric analysis of protein complexes. The KPI network contained many dense local regions of interactions that suggested new functions. Notably, the cell cycle phosphatase Cdc14 associated with multiple kinases that revealed roles for Cdc14 in mitogen-activated protein kinase signaling, the DNA damage response, and metabolism, whereas interactions of the target of rapamycin complex 1 (TORC1) uncovered new effector kinases in nitrogen and carbon metabolism. An extensive backbone of kinase-kinase interactions cross-connects the proteome and may serve to coordinate diverse cellular responses. PMID:20489023
Discovering disease-associated genes in weighted protein-protein interaction networks
NASA Astrophysics Data System (ADS)
Cui, Ying; Cai, Meng; Stanley, H. Eugene
2018-04-01
Although there have been many network-based attempts to discover disease-associated genes, most of them have not taken edge weights, which quantify the relative strength of interactions, into consideration. We use connection weights in a protein-protein interaction (PPI) network to locate disease-related genes. We analyze the topological properties of both weighted and unweighted PPI networks and design an improved random forest classifier to distinguish disease genes from non-disease genes. We use a cross-validation test to confirm that weighted networks are better able to discover disease-associated genes than unweighted networks, which indicates that including link weight in the analysis of network properties provides a better model of complex genotype-phenotype associations.
NASA Astrophysics Data System (ADS)
Champeimont, Raphaël; Laine, Elodie; Hu, Shuang-Wei; Penin, Francois; Carbone, Alessandra
2016-05-01
A novel computational approach of coevolution analysis allowed us to reconstruct the protein-protein interaction network of the Hepatitis C Virus (HCV) at the residue resolution. For the first time, coevolution analysis of an entire viral genome was realized, based on a limited set of protein sequences with high sequence identity within genotypes. The identified coevolving residues constitute highly relevant predictions of protein-protein interactions for further experimental identification of HCV protein complexes. The method can be used to analyse other viral genomes and to predict the associated protein interaction networks.
Protein complexes and functional modules in molecular networks
NASA Astrophysics Data System (ADS)
Spirin, Victor; Mirny, Leonid A.
2003-10-01
Proteins, nucleic acids, and small molecules form a dense network of molecular interactions in a cell. Molecules are nodes of this network, and the interactions between them are edges. The architecture of molecular networks can reveal important principles of cellular organization and function, similarly to the way that protein structure tells us about the function and organization of a protein. Computational analysis of molecular networks has been primarily concerned with node degree [Wagner, A. & Fell, D. A. (2001) Proc. R. Soc. London Ser. B 268, 1803-1810; Jeong, H., Tombor, B., Albert, R., Oltvai, Z. N. & Barabasi, A. L. (2000) Nature 407, 651-654] or degree correlation [Maslov, S. & Sneppen, K. (2002) Science 296, 910-913], and hence focused on single/two-body properties of these networks. Here, by analyzing the multibody structure of the network of protein-protein interactions, we discovered molecular modules that are densely connected within themselves but sparsely connected with the rest of the network. Comparison with experimental data and functional annotation of genes showed two types of modules: (i) protein complexes (splicing machinery, transcription factors, etc.) and (ii) dynamic functional units (signaling cascades, cell-cycle regulation, etc.). Discovered modules are highly statistically significant, as is evident from comparison with random graphs, and are robust to noise in the data. Our results provide strong support for the network modularity principle introduced by Hartwell et al. [Hartwell, L. H., Hopfield, J. J., Leibler, S. & Murray, A. W. (1999) Nature 402, C47-C52], suggesting that found modules constitute the "building blocks" of molecular networks.
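The notion of a module that is "densely connected within itself but sparsely connected with the rest of the network" can be made concrete with two simple counts. The sketch below uses the common edge-density definition 2m / (n(n - 1)) for a node set with n nodes and m internal edges. The function name and the toy graph are assumptions for illustration, and the abstract does not specify the exact statistics the authors used.

```python
def module_stats(edges, module):
    """Return (density, internal_edges, boundary_edges) for a set of nodes.

    density = 2m / (n(n - 1)), where m counts edges with both ends in the module.
    boundary_edges counts edges with exactly one end in the module.
    """
    module = set(module)
    internal = sum(1 for u, v in edges if u in module and v in module)
    boundary = sum(1 for u, v in edges if (u in module) != (v in module))
    n = len(module)
    density = 2 * internal / (n * (n - 1)) if n > 1 else 0.0
    return density, internal, boundary

# Toy network (hypothetical labels): a 4-node clique attached to a short tail.
edges = [("A", "B"), ("A", "C"), ("A", "D"),
         ("B", "C"), ("B", "D"), ("C", "D"),
         ("D", "E"), ("E", "F")]
density, internal, boundary = module_stats(edges, {"A", "B", "C", "D"})
```

A candidate module with density near 1 and few boundary edges, as in this toy case, matches the qualitative description in the abstract.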
Toufighi, Kiana; Yang, Jae-Seong; Luis, Nuno Miguel; Aznar Benitah, Salvador; Lehner, Ben; Serrano, Luis; Kiel, Christina
2015-01-01
The molecular details underlying the time-dependent assembly of protein complexes in cellular networks, such as those that occur during differentiation, are largely unexplored. Focusing on the calcium-induced differentiation of primary human keratinocytes as a model system for a major cellular reorganization process, we look at the expression of genes whose products are involved in manually-annotated protein complexes. Clustering analyses revealed only moderate co-expression of functionally related proteins during differentiation. However, when we looked at protein complexes, we found that the majority (55%) are composed of non-dynamic and dynamic gene products (‘di-chromatic’), 19% are non-dynamic, and 26% only dynamic. Considering three-dimensional protein structures to predict steric interactions, we found that proteins encoded by dynamic genes frequently interact with a common non-dynamic protein in a mutually exclusive fashion. This suggests that during differentiation, complex assemblies may also change through variation in the abundance of proteins that compete for binding to common proteins as found in some cases for paralogous proteins. Considering the example of the TNF-α/NFκB signaling complex, we suggest that the same core complex can guide signals into diverse context-specific outputs by addition of time specific expressed subunits, while keeping other cellular functions constant. Thus, our analysis provides evidence that complex assembly with stable core components and competition could contribute to cell differentiation. PMID:25946651
Correlations between Community Structure and Link Formation in Complex Networks
Liu, Zhen; He, Jia-Lin; Kapoor, Komal; Srivastava, Jaideep
2013-01-01
Background: Links in complex networks commonly represent specific ties between pairs of nodes, such as protein-protein interactions in biological networks or friendships in social networks. However, understanding the mechanism of link formation in complex networks is a long-standing challenge for network analysis and data mining. Methodology/Principal Findings: Links in complex networks have a tendency to cluster locally and form so-called communities. This widespread phenomenon reflects some underlying mechanism of link formation. To study the correlations between community structure and link formation, we present a general computational framework including a theory for network partitioning and link probability estimation. Our approach enables us to accurately identify missing links in partially observed networks in an efficient way. The links having high connection likelihoods in the communities reveal that links are formed preferentially to create cliques and accordingly promote the clustering level of the communities. The experimental results verify that such a mechanism can be well captured by our approach. Conclusions/Significance: Our findings provide a new insight into understanding how links are created in the communities. The computational framework opens a wide range of possibilities to develop new approaches and applications, such as community detection and missing link prediction. PMID:24039818
Blacklock, Kristin; Verkhivker, Gennady M.
2014-01-01
The fundamental role of the Hsp90 chaperone in supporting functional activity of diverse protein clients is anchored by specific cochaperones. A family of immune sensing client proteins is delivered to the Hsp90 system with the aid of cochaperones Sgt1 and Rar1 that act cooperatively with Hsp90 to form allosterically regulated dynamic complexes. In this work, functional dynamics and protein structure network modeling are combined to dissect molecular mechanisms of Hsp90 regulation by the client recruiter cochaperones. Dynamic signatures of the Hsp90-cochaperone complexes are manifested in differential modulation of the conformational mobility in the Hsp90 lid motif. Consistent with the experiments, we have determined that targeted reorganization of the lid dynamics is a unifying characteristic of the client recruiter cochaperones. Protein network analysis of the essential conformational space of the Hsp90-cochaperone motions has identified structurally stable interaction communities, interfacial hubs and key mediating residues of allosteric communication pathways that act concertedly with the shifts in conformational equilibrium. The results have shown that client recruiter cochaperones can orchestrate global changes in the dynamics and stability of the interaction networks that could enhance the ATPase activity and assist in the client recruitment. The network analysis has recapitulated a broad range of structural and mutagenesis experiments, particularly clarifying the elusive role of Rar1 as a regulator of the Hsp90 interactions and a stability enhancer of the Hsp90-cochaperone complexes. Small-world organization of the interaction networks in the Hsp90 regulatory complexes gives rise to a strong correspondence between highly connected local interfacial hubs, global mediator residues of allosteric interactions and key functional hot spots of the Hsp90 activity. 
We have found that cochaperone-induced conformational changes in Hsp90 may be determined by specific interaction networks that can inhibit or promote progression of the ATPase cycle and thus control the recruitment of client proteins. PMID:24466147
CORUM: the comprehensive resource of mammalian protein complexes
Ruepp, Andreas; Brauner, Barbara; Dunger-Kaltenbach, Irmtraud; Frishman, Goar; Montrone, Corinna; Stransky, Michael; Waegele, Brigitte; Schmidt, Thorsten; Doudieu, Octave Noubibou; Stümpflen, Volker; Mewes, H. Werner
2008-01-01
Protein complexes are key molecular entities that integrate multiple gene products to perform cellular functions. The CORUM (http://mips.gsf.de/genre/proj/corum/index.html) database is a collection of experimentally verified mammalian protein complexes. Information is manually derived by expert annotators through critical reading of the scientific literature. Information about protein complexes includes protein complex names, subunits, literature references as well as the function of the complexes. For functional annotation, we use the FunCat catalogue, which makes it possible to organize the protein complex space into biologically meaningful subsets. The database contains more than 1750 protein complexes that are built from 2400 different genes, thus representing 12% of the protein-coding genes in human. A web-based system is available to query, view and download the data. CORUM provides a comprehensive dataset of protein complexes for discoveries in systems biology, analyses of protein networks and protein complex-associated diseases. Comparable to the MIPS reference dataset of protein complexes from yeast, CORUM intends to serve as a reference for mammalian protein complexes. PMID:17965090
Finding Correlation between Protein Protein Interaction Modules Using Semantic Web Techniques
NASA Astrophysics Data System (ADS)
Kargar, Mehdi; Moaven, Shahrouz; Abolhassani, Hassan
Many complex networks, such as social networks and computer networks, show modular structures, where edges between nodes are much denser within modules than between modules. It is strongly believed that cellular networks are also modular, reflecting the relative independence and coherence of different functional units in a cell. In this paper we use a human-curated dataset and treat each module in the PPI network as an ontology. Using ontology alignment techniques, we compare each pair of modules in the network to determine whether their structures are correlated or entirely different. Our results show that there is no correlation between proteins in a protein-protein interaction network.
Deconstructing the core dynamics from a complex time-lagged regulatory biological circuit.
Eriksson, O; Brinne, B; Zhou, Y; Björkegren, J; Tegnér, J
2009-03-01
Complex regulatory dynamics is ubiquitous in molecular networks composed of genes and proteins. Recent progress in computational biology and its application to molecular data generate a growing number of complex networks. Yet, it has been difficult to understand the governing principles of these networks beyond graphical analysis or extensive numerical simulations. Here the authors exploit several simplifying biological circumstances that make it possible to directly detect the underlying dynamical regularities driving periodic oscillations in a dynamical nonlinear computational model of a protein-protein network. System analysis is performed using the cell cycle, a mathematically well-described complex regulatory circuit driven by external signals. By introducing an explicit time delay and using a 'tearing-and-zooming' approach the authors reduce the system to a piecewise linear system with two variables that capture the dynamics of this complex network. A key step in the analysis is the identification of functional subsystems by identifying the relations between state-variables within the model. These functional subsystems are referred to as dynamical modules operating as sensitive switches in the original complex model. By using reduced mathematical representations of the subsystems the authors derive explicit conditions on how the cell cycle dynamics depends on system parameters, and can, for the first time, analyse and prove global conditions for system stability. The approach, which includes utilising biological simplifying conditions, identification of dynamical modules and mathematical reduction of the model complexity, may be applicable to other well-characterised biological regulatory circuits. [Includes supplementary material].
Ren, Li-Hong; Ding, Yong-Sheng; Shen, Yi-Zhen; Zhang, Xiang-Feng
2008-10-01
Recently, a collective effort from multiple research areas has been made to understand biological systems at the system level. This research requires the ability to simulate particular biological systems such as cells, organs, organisms, and communities. In this paper, a novel bio-network simulation platform is proposed for systems biology studies by combining agent approaches. We consider a biological system as a set of active computational components interacting with each other and with an external environment. Then, we propose a bio-network platform for simulating the behaviors of biological systems and modelling them in terms of bio-entities and society-entities. As a demonstration, we discuss how a protein-protein interaction (PPI) network can be seen as a society of autonomous interactive components. From interactions among small PPI networks, a large PPI network can emerge that has a remarkable ability to accomplish a complex function or task. We also simulate the evolution of the PPI networks by using the bio-operators of the bio-entities. Based on the proposed approach, various simulators with different functions can be embedded in the simulation platform, and further research can be done from design to development, including complexity validation of the biological system.
How actin network dynamics control the onset of actin-based motility
Kawska, Agnieszka; Carvalho, Kévin; Manzi, John; Boujemaa-Paterski, Rajaa; Blanchoin, Laurent; Martiel, Jean-Louis; Sykes, Cécile
2012-01-01
Cells use their dynamic actin network to control their mechanics and motility. These networks are made of branched actin filaments generated by the Arp2/3 complex. Here we study under which conditions the microscopic organization of branched actin networks builds up a sufficient stress to trigger sustained motility. In our experimental setup, dynamic actin networks or “gels” are grown on a hard bead in a controlled minimal protein system containing actin monomers, profilin, the Arp2/3 complex and capping protein. We vary protein concentrations and follow experimentally and through simulations the shape and mechanical properties of the actin gel growing around beads. Actin gel morphology is controlled by elementary steps including “primer” contact, growth of the network, entanglement, mechanical interaction and force production. We show that varying the biochemical orchestration of these steps can lead to the loss of network cohesion and the lack of effective force production. We propose a predictive phase diagram of actin gel fate as a function of protein concentrations. This work unveils how, in growing actin networks, a tight biochemical and physical coupling smoothens initial primer-caused heterogeneities and governs force buildup and cell motility. PMID:22908255
Genome-wide protein-protein interactions and protein function exploration in cyanobacteria
Lv, Qi; Ma, Weimin; Liu, Hui; Li, Jiang; Wang, Huan; Lu, Fang; Zhao, Chen; Shi, Tieliu
2015-01-01
Genome-wide network analysis is well suited to the study of proteins of unknown function. Here, we explored protein functions and biological mechanisms based on an inferred high-confidence protein-protein interaction (PPI) network in cyanobacteria. We integrated data from seven different sources and predicted 1,997 PPIs, which were evaluated by experiments on molecular mechanism, text mining of the literature for direct or indirect evidence, and conservation of "interologs". Combining the predicted PPIs with known PPIs, we obtained 4,715 non-redundant PPIs (involving 3,231 proteins covering over 90% of the genome) to generate the PPI network. Based on the PPI network, Gene Ontology (GO) terms were assigned to proteins of unknown function. Functional modules were identified by dissecting the PPI network into sub-networks and analyzing pathway enrichment, with which we investigated novel functions of the underlying proteins in protein complexes and pathways. Examples from photosynthesis and DNA repair indicate that the network approach is a powerful tool in protein function analysis. Overall, this systems biology approach provides new insight into posterior functional analysis of PPIs in cyanobacteria. PMID:26490033
DiGiuseppe, Stephen; Bienkowska-Haba, Malgorzata; Hilbig, Lydia; Sapp, Martin
2014-01-01
The human papillomavirus (HPV) capsid is composed of the major and minor capsid proteins, L1 and L2, respectively. Infectious entry requires a complex series of conformational changes in both proteins that lead to uptake and allow uncoating to occur. During entry, the capsid is disassembled and host cyclophilins dissociate L1 protein from the L2/DNA complex. Herein, we describe a mutant HPV16 L2 protein (HPV16 L2-R302/5A) that traffics pseudogenome to the trans-Golgi network (TGN) but fails to egress. Our data provide further evidence that HPV16 traffics through the TGN and demonstrate that L2 is essential for TGN egress. Furthermore, we show that cyclophilin activity is required for the L2/DNA complex to be transported to the TGN, which is accompanied by reduced L1 protein levels. PMID:24928042
Topological properties of complex networks in protein structures
NASA Astrophysics Data System (ADS)
Kim, Kyungsik; Jung, Jae-Won; Min, Seungsik
2014-03-01
We study topological properties of networks in structural classification of proteins. We model the native-state protein structure as a network made of its constituent amino-acids and their interactions. We treat four structural classes of proteins composed predominantly of α helices and β sheets and consider several proteins from each of these classes whose sizes range from amino acids of the Protein Data Bank. Particularly, we simulate and analyze the network metrics such as the mean degree, the probability distribution of degree, the clustering coefficient, the characteristic path length, the local efficiency, and the cost. This work was supported by the KMAR and DP under Grant WISE project (153-3100-3133-302-350).
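The metrics named in this abstract can be illustrated on a toy network. The following is a minimal sketch, not the authors' procedure: the four-node contact list and all function names are hypothetical, and only the mean degree, the average local clustering coefficient, and the characteristic path length are computed.

```python
from collections import deque

def build_adjacency(edges):
    """Build an undirected adjacency map from a list of node pairs."""
    adj = {}
    for a, b in edges:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    return adj

def mean_degree(adj):
    """Average number of neighbours per node."""
    return sum(len(nbrs) for nbrs in adj.values()) / len(adj)

def clustering_coefficient(adj):
    """Average local clustering coefficient (nodes with degree < 2 count as 0)."""
    total = 0.0
    for nbrs in adj.values():
        k = len(nbrs)
        if k < 2:
            continue
        nbr_list = list(nbrs)
        links = sum(
            1
            for i in range(k)
            for j in range(i + 1, k)
            if nbr_list[j] in adj[nbr_list[i]]
        )
        total += 2.0 * links / (k * (k - 1))
    return total / len(adj)

def characteristic_path_length(adj):
    """Mean shortest-path length over all connected node pairs, via BFS."""
    total, pairs = 0, 0
    for source in adj:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs if pairs else 0.0

# Hypothetical residue contact list: a triangle A-B-C with a pendant node D.
contacts = [("A", "B"), ("B", "C"), ("A", "C"), ("C", "D")]
adj = build_adjacency(contacts)
k_mean = mean_degree(adj)
c_mean = clustering_coefficient(adj)
l_mean = characteristic_path_length(adj)
```

In a real analysis the contact list would come from a distance cutoff between amino-acid residues in a structure file; that step is omitted here.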
Verkhivker, Gennady M
2016-01-01
The human protein kinome presents one of the largest protein families that orchestrate functional processes in complex cellular networks, and when perturbed, can cause various cancers. The abundance and diversity of genetic, structural, and biochemical data underlies the complexity of mechanisms by which targeted and personalized drugs can combat mutational profiles in protein kinases. Coupled with the evolution of system biology approaches, genomic and proteomic technologies are rapidly identifying and characterizing novel resistance mechanisms with the goal to inform rational design of personalized kinase drugs. Integration of experimental and computational approaches can help to bring these data into a unified conceptual framework and develop robust models for predicting the clinical drug resistance. In the current study, we employ a battery of synergistic computational approaches that integrate genetic, evolutionary, biochemical, and structural data to characterize the effect of cancer mutations in protein kinases. We provide a detailed structural classification and analysis of genetic signatures associated with oncogenic mutations. By integrating genetic and structural data, we employ network modeling to dissect mechanisms of kinase drug sensitivities to oncogenic EGFR mutations. Using biophysical simulations and analysis of protein structure networks, we show that conformational-specific drug binding of Lapatinib may elicit resistant mutations in the EGFR kinase that are linked with the ligand-mediated changes in the residue interaction networks and global network properties of key residues that are responsible for structural stability of specific functional states. A strong network dependency on high centrality residues in the conformation-specific Lapatinib-EGFR complex may explain vulnerability of drug binding to a broad spectrum of mutations and the emergence of drug resistance. 
Our study offers a systems-based perspective on drug design by unravelling complex relationships between robustness of targeted kinase genes and binding specificity of targeted kinase drugs. We discuss how these approaches can exploit advances in chemical biology and network science to develop novel strategies for rationally tailored and robust personalized drug therapies.
Novel insights into the architecture and protein interaction network of yeast eIF3.
Khoshnevis, Sohail; Hauer, Florian; Milón, Pohl; Stark, Holger; Ficner, Ralf
2012-12-01
Translation initiation in eukaryotes is a multistep process requiring the orchestrated interaction of several eukaryotic initiation factors (eIFs). The largest of these factors, eIF3, forms the scaffold for other initiation factors, promoting their binding to the 40S ribosomal subunit. Biochemical and structural studies on eIF3 need highly pure eIF3. However, natively purified eIF3 comprise complexes containing other proteins such as eIF5. Therefore we have established in vitro reconstitution protocols for Saccharomyces cerevisiae eIF3 using its five recombinantly expressed and purified subunits. This reconstituted eIF3 complex (eIF3(rec)) exhibits the same size and activity as the natively purified eIF3 (eIF3(nat)). The homogeneity and stoichiometry of eIF3(rec) and eIF3(nat) were confirmed by analytical size exclusion chromatography, mass spectrometry, and multi-angle light scattering, demonstrating the presence of one copy of each subunit in the eIF3 complex. The reconstituted and native eIF3 complexes were compared by single-particle electron microscopy showing a high degree of structural conservation. The interaction network between eIF3 proteins was studied by means of limited proteolysis, analytical size exclusion chromatography, in vitro binding assays, and isothermal titration calorimetry, unveiling distinct protein domains and subcomplexes that are critical for the integrity of the protein network in yeast eIF3. Taken together, the data presented here provide a novel procedure to obtain highly pure yeast eIF3, suitable for biochemical and structural analysis, in addition to a detailed picture of the network of protein interactions within this complex.
NASA Astrophysics Data System (ADS)
Li, Yuanyuan; Jin, Suoqin; Lei, Lei; Pan, Zishu; Zou, Xiufen
2015-03-01
The early diagnosis and investigation of the pathogenic mechanisms of complex diseases are the most challenging problems in the fields of biology and medicine. Network-based systems biology is an important technique for the study of complex diseases. The present study constructed dynamic protein-protein interaction (PPI) networks to identify dynamical network biomarkers (DNBs) and analyze the underlying mechanisms of complex diseases from a systems level. We developed a model-based framework for the construction of a series of time-sequenced networks by integrating high-throughput gene expression data into PPI data. By combining the dynamic networks and molecular modules, we identified significant DNBs for four complex diseases, including influenza caused by either H3N2 or H1N1, acute lung injury and type 2 diabetes mellitus, which can serve as warning signals for disease deterioration. Function and pathway analyses revealed that the identified DNBs were significantly enriched during key events in early disease development. Correlation and information flow analyses revealed that DNBs effectively discriminated between different disease processes and that dysfunctional regulation and disproportional information flow may contribute to the increased disease severity. This study provides a general paradigm for revealing the deterioration mechanisms of complex diseases and offers new insights into their early diagnoses.
NASA Astrophysics Data System (ADS)
McMillen, Laura M.; Vavylonis, Dimitrios
2016-12-01
Cell protrusion through polymerization of actin filaments at the leading edge of motile cells may be influenced by spatial gradients of diffuse actin and regulators. Here we study the distribution of two of the most important regulators, capping protein and Arp2/3 complex, which regulate actin polymerization in the lamellipodium through capping and nucleation of free barbed ends. We modeled their kinetics using data from prior single molecule microscopy experiments on XTC cells. These experiments have provided evidence for a broad distribution of diffusion coefficients of both capping protein and Arp2/3 complex. The slowly diffusing proteins appear as extended ‘clouds’ while proteins bound to the actin filament network appear as speckles that undergo retrograde flow. Speckle appearance and disappearance events correspond to assembly and dissociation from the actin filament network and speckle lifetimes correspond to the dissociation rate. The slowly diffusing capping protein could represent severed capped actin filament fragments or membrane-bound capping protein. Prior evidence suggests that slowly diffusing Arp2/3 complex associates with the membrane. We use the measured rates and estimates of diffusion coefficients of capping protein and Arp2/3 complex in a Monte Carlo simulation that includes particles associated with a filament network and particles diffusing in the cytoplasm. We consider two separate pools of diffuse proteins, representing fast and slowly diffusing species. We find a steady state with concentration gradients involving a balance of diffusive flow of fast and slow species with retrograde flow. We show that simulations of FRAP are consistent with prior experiments performed on different cell types. We provide estimates for the ratio of bound to diffuse complexes and calculate conditions where Arp2/3 complex recycling by diffusion may become limiting. 
We discuss the implications of slowly diffusing populations and suggest experiments to distinguish among mechanisms that influence long range transport.
Mudgal, Richa; Srinivasan, Narayanaswamy; Chandra, Nagasuma
2017-07-01
Functional annotation is seldom straightforward with complexities arising due to functional divergence in protein families or functional convergence between non-homologous protein families, leading to mis-annotations. An enzyme may contain multiple domains and not all domains may be involved in a given function, adding to the complexity in function annotation. To address this, we use binding site information from bound cognate ligands and catalytic residues, since it can help in resolving fold-function relationships at a finer level and with higher confidence. A comprehensive database of 2,020 fold-function-binding site relationships has been systematically generated. A network-based approach is employed to capture the complexity in these relationships, from which different types of associations are deciphered, that identify versatile protein folds performing diverse functions, same function associated with multiple folds and one-to-one relationships. Binding site similarity networks integrated with fold, function, and ligand similarity information are generated to understand the depth of these relationships. Apart from the observed continuity in the functional site space, network properties of these revealed versatile families with topologically different or dissimilar binding sites and structural families that perform very similar functions. As a case study, subtle changes in the active site of a set of evolutionarily related superfamilies are studied using these networks. Tracing of such similarities in evolutionarily related proteins provides clues into the transition and evolution of protein functions. Insights from this study will be helpful in accurate and reliable functional annotations of uncharacterized proteins, poly-pharmacology, and designing enzymes with new functional capabilities. Proteins 2017; 85:1319-1335. © 2017 Wiley Periodicals, Inc.
In silico modeling of the yeast protein and protein family interaction network
NASA Astrophysics Data System (ADS)
Goh, K.-I.; Kahng, B.; Kim, D.
2004-03-01
Understanding of how protein interaction networks of living organisms have evolved or are organized can be the first stepping stone in unveiling how life works on a fundamental ground. Here we introduce an in silico "coevolutionary" model for the protein interaction network and the protein family network. The essential ingredient of the model includes the protein family identity and its robustness under evolution, as well as the three previously proposed: gene duplication, divergence, and mutation. This model produces a prototypical feature of complex networks in a wide range of parameter space, following the generalized Pareto distribution in connectivity. Moreover, we investigate other structural properties of our model in detail with some specific values of parameters relevant to the yeast Saccharomyces cerevisiae, showing excellent agreement with the empirical data. Our model indicates that the physical constraints encoded via the domain structure of proteins play a crucial role in protein interactions.
Du, Guixin; Stinski, Mark F.
2013-01-01
Human cytomegalovirus protein IE2-p86 exerts its functions through interaction with other viral and cellular proteins. To further delineate its protein interaction network, we generated a recombinant virus expressing SG-tagged IE2-p86 and used tandem affinity purification coupled with mass spectrometry. A total of 9 viral proteins and 75 cellular proteins were found to associate with IE2-p86 protein during the first 48 hours of infection. The protein profile at 8, 24, and 48 h post infection revealed that UL84 tightly associated with IE2-p86, and more viral and cellular proteins came into association with IE2-p86 with the progression of virus infection. A computational analysis of the protein-protein interaction network indicated that all of the 9 viral proteins and most of the cellular proteins identified in the study are interconnected to varying degrees. Of the cellular proteins that were confirmed to associate with IE2-p86 by immunoprecipitation, C1QBP was further shown to be upregulated by HCMV infection and colocalized with IE2-p86, UL84 and UL44 in the virus replication compartment of the nucleus. The IE2-p86 interactome network demonstrated the temporal development of stable and abundant protein complexes that associate with IE2-p86 and provided a framework to benefit future studies of various protein complexes during HCMV infection. PMID:24358118
Complex network theory for the identification and assessment of candidate protein targets.
McGarry, Ken; McDonald, Sharon
2018-06-01
In this work we use complex network theory to provide a statistical model of the connectivity patterns of human proteins and their interaction partners. Our intention is to identify important proteins that may be predisposed to be potential candidates as drug targets for therapeutic interventions. Target proteins usually have more interaction partners than non-target proteins, but there are no hard-and-fast rules for defining the actual number of interactions. We devise a statistical measure for identifying hub proteins, and we score our target proteins with gene ontology annotations. The important druggable protein targets are likely to have similar biological functions that can be assessed for their potential therapeutic value. Our system provides a statistical analysis of the local and distant neighborhood protein interactions of the potential targets using complex network measures. This approach builds a more accurate model of drug-to-target activity and therefore the likely impact on treating diseases. We integrate high quality protein interaction data from the HINT database and disease associated proteins from the DrugTarget database. Other sources include biological knowledge from Gene Ontology and drug information from DrugBank. The problem is very challenging because the data are highly imbalanced between target proteins and the more numerous non-targets. We use undersampling on the training data and build Random Forest classifier models which are used to identify previously unclassified target proteins. We validate and corroborate these findings from the available literature. Copyright © 2018 Elsevier Ltd. All rights reserved.
Computational Methods to Predict Protein Interaction Partners
NASA Astrophysics Data System (ADS)
Valencia, Alfonso; Pazos, Florencio
In the new paradigm for studying biological phenomena represented by Systems Biology, cellular components are not considered in isolation but as forming complex networks of relationships. Protein interaction networks are among the first objects studied from this new point of view. Deciphering the interactome (the whole network of interactions for a given proteome) has been shown to be a very complex task. Computational techniques for detecting protein interactions have become standard tools for dealing with this problem, helping and complementing their experimental counterparts. Most of these techniques use genomic or sequence features intuitively related with protein interactions and are based on "first principles" in the sense that they do not involve training with examples. There are also other computational techniques that use other sources of information (i.e. structural information or even experimental data) or are based on training with examples.
Neuron-Like Networks Between Ribosomal Proteins Within the Ribosome
NASA Astrophysics Data System (ADS)
Poirot, Olivier; Timsit, Youri
2016-05-01
From the brain to the World Wide Web, information-processing networks share common scale invariant properties. Here, we reveal the existence of neural-like networks at a molecular scale within the ribosome. We show that with their extensions, ribosomal proteins form complex assortative interaction networks through which they communicate through tiny interfaces. The analysis of the crystal structures of 50S eubacterial particles reveals that most of these interfaces involve key phylogenetically conserved residues. The systematic observation of interactions between basic and aromatic amino acids at the interfaces and along the extension provides new structural insights that may contribute to deciphering the molecular mechanisms of signal transmission within or between the ribosomal proteins. Similar to neurons interacting through “molecular synapses”, ribosomal proteins form a network that suggests an analogy with a simple molecular brain in which the “sensory-proteins” innervate the functional ribosomal sites, while the “inter-proteins” interconnect them into circuits suitable to process the information flow that circulates during protein synthesis. It is likely that these circuits have evolved to coordinate both the complex macromolecular motions and the binding of the multiple factors during translation. This opens new perspectives on nanoscale information transfer and processing.
Network based approaches reveal clustering in protein point patterns
NASA Astrophysics Data System (ADS)
Parker, Joshua; Barr, Valarie; Aldridge, Joshua; Samelson, Lawrence E.; Losert, Wolfgang
2014-03-01
Recent advances in super-resolution imaging have allowed for the sub-diffraction measurement of the spatial location of proteins on the surfaces of T-cells. The challenge is to connect these complex point patterns to the internal processes and interactions, both protein-protein and protein-membrane. We begin analyzing these patterns by forming a geometric network amongst the proteins and looking at network measures, such as the degree distribution. This allows us to compare experimentally observed patterns to models. Specifically, we find that the experimental patterns differ from heterogeneous Poisson processes, highlighting an internal clustering structure. Further work will be to compare our results to simulated protein-protein interactions to determine clustering mechanisms.
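One way to picture the geometric-network step is sketched below. It is an illustration under stated assumptions rather than the authors' method: points are linked when they lie within a fixed radius of each other (the abstract does not say how edges are defined), and the coordinates, the radius, and the function names are all hypothetical.

```python
import math
from collections import Counter

def geometric_network(points, radius):
    """Link every pair of points whose Euclidean distance is <= radius."""
    n = len(points)
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(points[i], points[j]) <= radius:
                adj[i].add(j)
                adj[j].add(i)
    return adj

def degree_distribution(adj):
    """Fraction of nodes having each observed degree."""
    counts = Counter(len(nbrs) for nbrs in adj.values())
    n = len(adj)
    return {k: c / n for k, c in sorted(counts.items())}

# Hypothetical localisations: one tight triple, one pair, one isolated point.
points = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (5.0, 5.0), (5.5, 5.0), (10.0, 10.0)]
adj = geometric_network(points, radius=1.5)
dist = degree_distribution(adj)
```

The same `degree_distribution` could then be computed for point sets drawn from a reference model, such as a Poisson process, and compared with the observed one.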
Havugimana, Pierre C; Hu, Pingzhao; Emili, Andrew
2017-10-01
Elucidation of the networks of physical (functional) interactions present in cells and tissues is fundamental for understanding the molecular organization of biological systems, the mechanistic basis of essential and disease-related processes, and for functional annotation of previously uncharacterized proteins (via guilt-by-association or -correlation). After a decade in the field, we felt it timely to document our own experiences in the systematic analysis of protein interaction networks. Areas covered: Researchers worldwide have contributed innovative experimental and computational approaches that have driven the rapidly evolving field of 'functional proteomics'. These include mass spectrometry-based methods to characterize macromolecular complexes on a global-scale and sophisticated data analysis tools - most notably machine learning - that allow for the generation of high-quality protein association maps. Expert commentary: Here, we recount some key lessons learned, with an emphasis on successful workflows, and challenges, arising from our own and other groups' ongoing efforts to generate, interpret and report proteome-scale interaction networks in increasingly diverse biological contexts.
2017-01-01
Although deep learning approaches have had tremendous success in image, video and audio processing, computer vision, and speech recognition, their applications to three-dimensional (3D) biomolecular structural data sets have been hindered by the geometric and biological complexity. To address this problem we introduce the element-specific persistent homology (ESPH) method. ESPH represents 3D complex geometry by one-dimensional (1D) topological invariants and retains important biological information via a multichannel image-like representation. This representation reveals hidden structure-function relationships in biomolecules. We further integrate ESPH and deep convolutional neural networks to construct a multichannel topological neural network (TopologyNet) for the predictions of protein-ligand binding affinities and protein stability changes upon mutation. To overcome the deep learning limitations from small and noisy training sets, we propose a multi-task multichannel topological convolutional neural network (MM-TCNN). We demonstrate that TopologyNet outperforms the latest methods in the prediction of protein-ligand binding affinities, mutation induced globular protein folding free energy changes, and mutation induced membrane protein folding free energy changes. Availability: weilab.math.msu.edu/TDL/ PMID:28749969
Rudling, Axel; Orro, Adolfo; Carlsson, Jens
2018-02-26
Water plays a major role in ligand binding and is attracting increasing attention in structure-based drug design. Water molecules can make large contributions to binding affinity by bridging protein-ligand interactions or by being displaced upon complex formation, but these phenomena are challenging to model at the molecular level. Herein, networks of ordered water molecules in protein binding sites were analyzed by clustering of molecular dynamics (MD) simulation trajectories. Locations of ordered waters (hydration sites) were first identified from simulations of high resolution crystal structures of 13 protein-ligand complexes. The MD-derived hydration sites reproduced 73% of the binding site water molecules observed in the crystal structures. If the simulations were repeated without the cocrystallized ligands, a majority (58%) of the crystal waters in the binding sites were still predicted. In addition, comparison of the hydration sites obtained from simulations carried out in the absence of ligands to those identified for the complexes revealed that the networks of ordered water molecules were preserved to a large extent, suggesting that the locations of waters in a protein-ligand interface are mainly dictated by the protein. Analysis of >1000 crystal structures showed that hydration sites bridged protein-ligand interactions in complexes with different ligands, and those with high MD-derived occupancies were more likely to correspond to experimentally observed ordered water molecules. The results demonstrate that ordered water molecules relevant for modeling of protein-ligand complexes can be identified from MD simulations. Our findings could contribute to development of improved methods for structure-based virtual screening and lead optimization.
Fung, David C Y; Wilkins, Marc R; Hart, David; Hong, Seok-Hee
2010-07-01
The force-directed layout is commonly used in computer-generated visualizations of protein-protein interaction networks. While it is good for providing a visual outline of the protein complexes and their interactions, it has two limitations when used as a visual analysis method. The first is poor reproducibility. Repeated running of the algorithm does not necessarily generate the same layout, therefore, demanding cognitive readaptation on the investigator's part. The second limitation is that it does not explicitly display complementary biological information, e.g. Gene Ontology, other than the protein names or gene symbols. Here, we present an alternative layout called the clustered circular layout. Using the human DNA replication protein-protein interaction network as a case study, we compared the two network layouts for their merits and limitations in supporting visual analysis.
Systems Proteomics for Translational Network Medicine
Arrell, D. Kent; Terzic, Andre
2012-01-01
Universal principles underlying network science, and their ever-increasing applications in biomedicine, underscore the unprecedented capacity of systems biology based strategies to synthesize and resolve massive high throughput generated datasets. Enabling previously unattainable comprehension of biological complexity, systems approaches have accelerated progress in elucidating disease prediction, progression, and outcome. Applied to the spectrum of states spanning health and disease, network proteomics establishes a collation, integration, and prioritization algorithm to guide mapping and decoding of proteome landscapes from large-scale raw data. Providing unparalleled deconvolution of protein lists into global interactomes, integrative systems proteomics enables objective, multi-modal interpretation at molecular, pathway, and network scales, merging individual molecular components, their plurality of interactions, and functional contributions for systems comprehension. As such, network systems approaches are increasingly exploited for objective interpretation of cardiovascular proteomics studies. Here, we highlight network systems proteomic analysis pipelines for integration and biological interpretation through protein cartography, ontological categorization, pathway and functional enrichment and complex network analysis. PMID:22896016
Protein-protein interaction networks: unraveling the wiring of molecular machines within the cell.
De Las Rivas, Javier; Fontanillo, Celia
2012-11-01
Mapping and understanding of the protein interaction networks with their key modules and hubs can provide deeper insights into the molecular machinery underlying complex phenotypes. In this article, we present the basic characteristics and definitions of protein networks, starting with a distinction of the different types of associations between proteins. We focus the review on protein-protein interactions (PPIs), a subset of associations defined as physical contacts between proteins that occur by selective molecular docking in a particular biological context. We present this definition as opposed to other types of protein associations derived from regulatory, genetic, structural or functional relations. To determine PPIs, a variety of binary and co-complex methods exist; however, not all the technologies provide the same information and data quality. A way of increasing confidence in a given protein interaction is to integrate orthogonal experimental evidence. The use of several complementary methods testing each single interaction assesses the accuracy of PPI data and tries to minimize the occurrence of false interactions. Following this approach there have been important efforts to unify primary databases of experimentally proven PPIs into integrated databases. These meta-databases provide a measure of the confidence of interactions based on the number of experimental proofs that report them. As a conclusion, we can state that integrated information allows the building of more reliable interaction networks. Identification of communities, cliques, modules and hubs by analysing the topological parameters and graph properties of the protein networks allows the discovery of central/critical nodes, which are candidates to regulate cellular flux and dynamics.
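The idea of scoring an interaction by the number of experimental proofs that report it can be sketched as follows. This is a simplified illustration, not the scoring scheme of any particular meta-database: the protein identifiers, the record layout, and the threshold of two methods are assumptions made for the example.

```python
from collections import defaultdict

def integrate_interactions(records):
    """Count distinct experimental methods supporting each unordered protein pair.

    records: iterable of (protein_a, protein_b, method) tuples.
    """
    evidence = defaultdict(set)
    for a, b, method in records:
        pair = tuple(sorted((a, b)))  # A-B and B-A are the same interaction
        evidence[pair].add(method)
    return {pair: len(methods) for pair, methods in evidence.items()}

# Hypothetical records pooled from several primary databases.
records = [
    ("P1", "P2", "two-hybrid"),
    ("P2", "P1", "co-immunoprecipitation"),
    ("P1", "P2", "two-hybrid"),  # duplicate report, counted once
    ("P2", "P3", "affinity purification"),
]
scores = integrate_interactions(records)
# Keep only interactions supported by at least two independent methods.
high_confidence = {pair for pair, n in scores.items() if n >= 2}
```

Counting distinct methods rather than raw reports keeps a single experiment that was copied into several databases from inflating the score.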
A Collaboration Network Model Of Cytokine-Protein Network
NASA Astrophysics Data System (ADS)
Zou, Sheng-Rong; Zhou, Ta; Peng, Yu-Jing; Guo, Zhong-Wei; Gu, Chang-Gui; He, Da-Ren
2008-03-01
Complex networks provide a new view for the investigation of immune systems. We collect data from the STRING database and present a network description based on a collaboration network model. The cytokine-protein network model we consider consists of two kinds of nodes: immune cytokine types, which can be regarded as collaboration acts, and proteins, which can be regarded as collaboration actors. From the act degree distribution, which is well described by typical SPL (shifted power law) functions [1], we find that HRAS, TNFRSF13C, S100A8, S100A1, MAPK8, S100A7, LIF, CCL4 and CXCL13 collaborate heavily with other proteins. This indicates that these mediators are important in regulating immune activity in the cytokine-protein network. A dyad in the collaboration network is defined as two proteins that appear in the same cytokine collaboration relationship. The dyad act degree distribution can also be well described by typical SPL functions. [1] Assortativity and act degree distribution of some collaboration networks, Hui Chang, Bei-Bei Su, Yue-Ping Zhou, Daren He, Physica A, 383 (2007) 687-702
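As a rough illustration of the act degree and dyad act degree mentioned above, the following Python sketch counts how many acts each actor takes part in, and how many acts each pair of actors shares. The toy acts and their members are invented for illustration and are not STRING data; the function names `act_degree` and `dyad_act_degree` are our own, and the counting rule is our reading of the abstract.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy data: each act (cytokine type) maps to the
# actors (proteins) that take part in it. Not real STRING data.
acts = {
    "cytokine_A": {"P1", "P2", "P3"},
    "cytokine_B": {"P1", "P2"},
    "cytokine_C": {"P1", "P4"},
}

def act_degree(acts):
    """Number of acts in which each actor participates."""
    counts = Counter()
    for members in acts.values():
        counts.update(members)
    return counts

def dyad_act_degree(acts):
    """Number of acts in which each unordered pair of actors appears together."""
    counts = Counter()
    for members in acts.values():
        counts.update(combinations(sorted(members), 2))
    return counts

degrees = act_degree(acts)
dyads = dyad_act_degree(acts)
```

Sorting the members before pairing keeps each dyad in a single canonical order, so a pair is never counted under two different keys.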
He, Jieyue; Li, Chaojun; Ye, Baoliu; Zhong, Wei
2012-06-25
Most computational algorithms mainly focus on detecting highly connected subgraphs in PPI networks as protein complexes but ignore their inherent organization. Furthermore, many of these algorithms are computationally expensive. However, recent analysis indicates that experimentally detected protein complexes generally contain core/attachment structures. In this paper, a Greedy Search Method based on Core-Attachment structure (GSM-CA) is proposed. The GSM-CA method detects densely connected regions in large protein-protein interaction networks based on the edge weight and two criteria for determining core nodes and attachment nodes. The GSM-CA method improves the prediction accuracy compared to other similar module detection approaches; however, it is computationally expensive. Many module detection approaches are based on traditional hierarchical methods, which are also computationally inefficient because the hierarchical tree structure produced by these approaches cannot provide adequate information to identify whether a network belongs to a module structure or not. In order to speed up the computational process, the Greedy Search Method based on Fast Clustering (GSM-FC) is proposed in this work. The edge weight based GSM-FC method uses a greedy procedure to traverse all edges just once to separate the network into a suitable set of modules. The proposed methods are applied to the protein interaction network of S. cerevisiae. Experimental results indicate that many significant functional modules are detected, most of which match the known complexes. Results also demonstrate that the GSM-FC algorithm is faster and more accurate than other competing algorithms. Based on the new edge weight definition, the proposed algorithm takes advantage of the greedy search procedure to separate the network into a suitable set of modules. Experimental analysis shows that the identified modules are statistically significant.
The algorithm can reduce the computational time significantly while keeping high prediction accuracy.
eQTL networks unveil enriched mRNA master integrators downstream of complex disease-associated SNPs.
Li, Haiquan; Pouladi, Nima; Achour, Ikbel; Gardeux, Vincent; Li, Jianrong; Li, Qike; Zhang, Hao Helen; Martinez, Fernando D; 'Skip' Garcia, Joe G N; Lussier, Yves A
2015-12-01
The causal and interplay mechanisms of Single Nucleotide Polymorphisms (SNPs) associated with complex diseases (complex disease SNPs) investigated in genome-wide association studies (GWAS) at the transcriptional level (mRNA) are poorly understood despite recent advancements such as discoveries reported in the Encyclopedia of DNA Elements (ENCODE) and Genotype-Tissue Expression (GTEx). Protein interaction network analyses have successfully improved our understanding of both single gene diseases (Mendelian diseases) and complex diseases. Whether the mRNAs downstream of complex disease genes are central or peripheral in the genetic information flow relating DNA to mRNA remains unclear and may be disease-specific. Using expression Quantitative Trait Loci (eQTL) that provide DNA to mRNA associations and network centrality metrics, we hypothesize that we can unveil the systems properties of information flow between SNPs and the transcriptomes of complex diseases. We compare different conditions such as naïve SNP assignments and stringent linkage disequilibrium (LD) free assignments for transcripts to remove confounders from LD. Additionally, we compare the results from eQTL networks between lymphoblastoid cell lines and liver tissue. Empirical permutation resampling (p < 0.001) and theoretic Mann-Whitney U test (p < 10^-30) statistics indicate that mRNAs corresponding to complex disease SNPs via eQTL associations are likely to be regulated by a larger number of SNPs than expected. We name this novel property mRNA hubness in eQTL networks, and further term mRNAs with high hubness as master integrators. mRNA master integrators receive and coordinate the perturbation signals from large numbers of polymorphisms and respond to the personal genetic architecture integratively.
This genetic signal integration contrasts with the mechanism underlying some Mendelian diseases, where a genetic polymorphism affecting a single protein hub produces a divergent signal that affects a large number of downstream proteins. Indeed, we verify that this property is independent of the hubness in protein networks for which these mRNAs are transcribed. Our findings provide novel insights into the pleiotropy of mRNAs targeted by complex disease polymorphisms and the architecture of the information flow between the genetic polymorphisms and transcriptomes of complex diseases. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
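The notion of mRNA hubness described above can be sketched as a simple count over SNP-to-mRNA associations. The Python snippet below assumes, as our reading of the abstract rather than the authors' stated procedure, that hubness is the number of distinct SNPs associated with an mRNA; the SNP and gene identifiers are placeholders, and `mrna_hubness` is a hypothetical helper name.

```python
from collections import defaultdict

# Hypothetical eQTL associations as (SNP, mRNA) pairs; identifiers are placeholders.
eqtl_pairs = [
    ("snp1", "GENE_A"), ("snp2", "GENE_A"), ("snp3", "GENE_A"),
    ("snp1", "GENE_B"),
    ("snp4", "GENE_C"), ("snp4", "GENE_C"),  # a repeated association is counted once
]

def mrna_hubness(pairs):
    """Map each mRNA to the number of distinct SNPs associated with it."""
    snps_per_mrna = defaultdict(set)
    for snp, mrna in pairs:
        snps_per_mrna[mrna].add(snp)
    return {mrna: len(snps) for mrna, snps in snps_per_mrna.items()}

hubness = mrna_hubness(eqtl_pairs)
```

In this toy input GENE_A would be the candidate master integrator, since it collects signals from the largest number of SNPs. Deciding whether such a count is larger than expected would still need a null model, such as the permutation resampling mentioned in the abstract.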
Network representation of protein interactions: Theory of graph description and analysis.
Kurzbach, Dennis
2016-09-01
A methodological framework is presented for the graph theoretical interpretation of NMR data of protein interactions. The proposed analysis generalizes the idea of network representations of protein structures by expanding it to protein interactions. This approach is based on regularization of residue-resolved NMR relaxation times and chemical shift data and subsequent construction of an adjacency matrix that represents the underlying protein interaction as a graph or network. The network nodes represent protein residues. Two nodes are connected if two residues are functionally correlated during the protein interaction event. The analysis of the resulting network enables the quantification of the importance of each amino acid of a protein for its interactions. Furthermore, the determination of the pattern of correlations between residues yields insights into the functional architecture of an interaction. This is of special interest for intrinsically disordered proteins, since the structural (three-dimensional) architecture of these proteins and their complexes is difficult to determine. The power of the proposed methodology is demonstrated at the example of the interaction between the intrinsically disordered protein osteopontin and its natural ligand heparin. © 2016 The Protein Society.
A global interaction network maps a wiring diagram of cellular function
Costanzo, Michael; VanderSluis, Benjamin; Koch, Elizabeth N.; Baryshnikova, Anastasia; Pons, Carles; Tan, Guihong; Wang, Wen; Usaj, Matej; Hanchard, Julia; Lee, Susan D.; Pelechano, Vicent; Styles, Erin B.; Billmann, Maximilian; van Leeuwen, Jolanda; van Dyk, Nydia; Lin, Zhen-Yuan; Kuzmin, Elena; Nelson, Justin; Piotrowski, Jeff S.; Srikumar, Tharan; Bahr, Sondra; Chen, Yiqun; Deshpande, Raamesh; Kurat, Christoph F.; Li, Sheena C.; Li, Zhijian; Usaj, Mojca Mattiazzi; Okada, Hiroki; Pascoe, Natasha; Luis, Bryan-Joseph San; Sharifpoor, Sara; Shuteriqi, Emira; Simpkins, Scott W.; Snider, Jamie; Suresh, Harsha Garadi; Tan, Yizhao; Zhu, Hongwei; Malod-Dognin, Noel; Janjic, Vuk; Przulj, Natasa; Troyanskaya, Olga G.; Stagljar, Igor; Xia, Tian; Ohya, Yoshikazu; Gingras, Anne-Claude; Raught, Brian; Boutros, Michael; Steinmetz, Lars M.; Moore, Claire L.; Rosebrock, Adam P.; Caudy, Amy A.; Myers, Chad L.; Andrews, Brenda; Boone, Charles
2017-01-01
We generated a global genetic interaction network for Saccharomyces cerevisiae, constructing over 23 million double mutants, identifying ~550,000 negative and ~350,000 positive genetic interactions. This comprehensive network maps genetic interactions for essential gene pairs, highlighting essential genes as densely connected hubs. Genetic interaction profiles enabled assembly of a hierarchical model of cell function, including modules corresponding to protein complexes and pathways, biological processes, and cellular compartments. Negative interactions connected functionally related genes, mapped core bioprocesses, and identified pleiotropic genes, whereas positive interactions often mapped general regulatory connections among gene pairs, rather than shared functionality. The global network illustrates how coherent sets of genetic interactions connect protein complex and pathway modules to map a functional wiring diagram of the cell. PMID:27708008
Network biology discovers pathogen contact points in host protein-protein interactomes.
Ahmed, Hadia; Howton, T C; Sun, Yali; Weinberger, Natascha; Belkhadir, Youssef; Mukhtar, M Shahid
2018-06-13
In all organisms, major biological processes are controlled by complex protein-protein interaction networks (interactomes), yet their structural complexity presents major analytical challenges. Here, we integrate a compendium of over 4300 phenotypes with the Arabidopsis interactome (AI-1 MAIN). We show that nodes with high connectivity and betweenness are enriched and depleted in conditional and essential phenotypes, respectively. Such nodes are located in the innermost layers of AI-1 MAIN and are preferential targets of pathogen effectors. We extend these network-centric analyses to the Cell Surface Interactome (CSI LRR) and predict its 35 most influential nodes. To determine their biological relevance, we show that these proteins physically interact with pathogen effectors and modulate plant immunity. Overall, our findings contrast with the centrality-lethality rule, discover fast information spreading nodes, and highlight the structural properties of pathogen targets in two different interactomes. Finally, this theoretical framework could possibly be applicable to other inter-species interactomes to reveal pathogen contact points.
Saito, Rintaro; Suzuki, Harukazu; Hayashizaki, Yoshihide
2003-04-12
Recent screening techniques have made large amounts of protein-protein interaction data available, from which biologically important information such as the function of uncharacterized proteins, the existence of novel protein complexes, and novel signal-transduction pathways can be discovered. However, experimental data on protein interactions contain many false positives, making these discoveries difficult. Therefore computational methods of assessing the reliability of each candidate protein-protein interaction are urgently needed. We developed a new 'interaction generality' measure (IG2) to assess the reliability of protein-protein interactions using only the topological properties of their interaction-network structure. Using yeast protein-protein interaction data, we showed that reliable protein-protein interactions had significantly lower IG2 values than less-reliable interactions, suggesting that IG2 values can be used to evaluate and filter interaction data to enable the construction of reliable protein-protein interaction networks.
Naegle, Kristen M.; White, Forest M.; Lauffenburger, Douglas A.; Yaffe, Michael B.
2012-01-01
Cell signaling networks propagate information from extracellular cues via dynamic modulation of protein–protein interactions in a context-dependent manner. Networks based on receptor tyrosine kinases (RTKs), for example, phosphorylate intracellular proteins in response to extracellular ligands, resulting in dynamic protein–protein interactions that drive phenotypic changes. Most commonly used methods for discovering these protein–protein interactions, however, are optimized for detecting stable, longer-lived complexes, rather than the type of transient interactions that are essential components of dynamic signaling networks such as those mediated by RTKs. Substrate phosphorylation downstream of RTK activation modifies substrate activity and induces phospho-specific binding interactions, resulting in the formation of large transient macromolecular signaling complexes. Since protein complex formation should follow the trajectory of events that drive it, we reasoned that mining phosphoproteomic datasets for highly similar dynamic behavior of measured phosphorylation sites on different proteins could be used to predict novel, transient protein–protein interactions that had not been previously identified. We applied this method to explore signaling events downstream of EGFR stimulation. Our computational analysis of robustly co-regulated phosphorylation sites, based on multiple clustering analysis of quantitative time-resolved mass-spectrometry phosphoproteomic data, not only identified known sitewise-specific recruitment of proteins to EGFR, but also predicted novel, a priori interactions. A particularly intriguing prediction of EGFR interaction with the cytoskeleton-associated protein PDLIM1 was verified within cells using co-immunoprecipitation and in situ proximity ligation assays. Our approach thus offers a new way to discover protein–protein interactions in a dynamic context- and phosphorylation site-specific manner. PMID:22851037
Li, Min; Li, Wenkai; Wu, Fang-Xiang; Pan, Yi; Wang, Jianxin
2018-06-14
Essential proteins are important participants in various life activities and play a vital role in the survival and reproduction of living organisms. Identification of essential proteins from protein-protein interaction (PPI) networks has great significance to facilitate the study of human complex diseases, the design of drugs and the development of bioinformatics and computational science. Studies have shown that highly connected proteins in a PPI network tend to be essential. A series of computational methods have been proposed to identify essential proteins by analyzing topological structures of PPI networks. However, the high noise in the PPI data can degrade the accuracy of essential protein prediction. Moreover, proteins must be located in the appropriate subcellular localization to perform their functions, and only when the proteins are located in the same subcellular localization, it is possible that they can interact with each other. In this paper, we propose a new network-based essential protein discovery method based on sub-network partition and prioritization by integrating subcellular localization information, named SPP. The proposed method SPP was tested on two different yeast PPI networks obtained from DIP database and BioGRID database. The experimental results show that SPP can effectively reduce the effect of false positives in PPI networks and predict essential proteins more accurately compared with other existing computational methods DC, BC, CC, SC, EC, IC, NC. Copyright © 2018 Elsevier Ltd. All rights reserved.
Offdiagonal complexity: A computationally quick complexity measure for graphs and networks
NASA Astrophysics Data System (ADS)
Claussen, Jens Christian
2007-02-01
A vast variety of biological, social, and economic networks shows topologies drastically differing from random graphs; yet the quantitative characterization remains unsatisfactory from a conceptual point of view. Motivated by the discussion of small scale-free networks, a biased link distribution entropy is defined, which takes an extremum for a power-law distribution. This approach is extended to the node-node link cross-distribution, whose nondiagonal elements characterize the graph structure beyond link distribution, clustering coefficient and average path length. From here a simple (and computationally cheap) complexity measure can be defined. This offdiagonal complexity (OdC) is proposed as a novel measure to characterize the complexity of an undirected graph, or network. While OdC is zero both for regular lattices and for fully connected networks, it takes a moderately low value for a random graph and shows high values for apparently complex structures such as scale-free networks and hierarchical trees. The OdC approach is applied to the Helicobacter pylori protein interaction network and randomly rewired surrogates.
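One way to make the offdiagonal complexity concrete is the sketch below. It assumes, as our interpretation of the abstract, that edges are binned by the difference between the degrees of their two endpoints (the diagonals of the node-node link cross-distribution) and that OdC is the Shannon entropy of the normalized bin counts; the exact normalization in the paper may differ. The function name and the toy graphs are ours.

```python
import math
from collections import Counter

def offdiagonal_complexity(edges):
    """Entropy of the distribution of degree differences across edges.

    `edges` is a list of (u, v) pairs of a simple undirected graph,
    with each edge listed once.
    """
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    # diag[n] = number of edges whose endpoint degrees differ by n,
    # i.e. the sum over the n-th diagonal of the degree-degree matrix.
    diag = Counter(abs(degree[u] - degree[v]) for u, v in edges)
    total = sum(diag.values())
    return -sum((c / total) * math.log(c / total) for c in diag.values())

# Fully connected graph on 4 nodes: every node has degree 3.
complete4 = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3)]
# Ring lattice on 5 nodes: every node has degree 2.
ring5 = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 0)]
# Path on 4 nodes: degrees 1, 2, 2, 1, so two diagonals are populated.
path4 = [(0, 1), (1, 2), (2, 3)]

odc_complete = offdiagonal_complexity(complete4)
odc_ring = offdiagonal_complexity(ring5)
odc_path = offdiagonal_complexity(path4)
```

Under these assumptions the complete graph and the ring both give zero, consistent with the statement above that OdC vanishes for fully connected networks and regular lattices, because all of their edges fall on the main diagonal.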
The Network Organization of Cancer-associated Protein Complexes in Human Tissues
Zhao, Jing; Lee, Sang Hoon; Huss, Mikael; Holme, Petter
2013-01-01
Differential gene expression profiles for detecting disease genes have been studied intensively in systems biology. However, it is known that various biological functions achieved by proteins follow from the ability of the protein to form complexes by physically binding to each other. In other words, the functional units are often protein complexes rather than individual proteins. Thus, we seek to replace the perspective of disease-related genes by disease-related complexes, exemplifying with data on 39 human solid tissue cancers and their original normal tissues. To obtain the differential abundance levels of protein complexes, we apply an optimization algorithm to genome-wide differential expression data. From the differential abundance of complexes, we extract tissue- and cancer-selective complexes, and investigate their relevance to cancer. The method is supported by a clustering tendency of bipartite cancer-complex relationships, as well as a more concrete and realistic approach to disease-related proteomics. PMID:23567845
Characterizing and controlling the inflammatory network during influenza A virus infection
NASA Astrophysics Data System (ADS)
Jin, Suoqin; Li, Yuanyuan; Pan, Ruangang; Zou, Xiufen
2014-01-01
To gain insights into the pathogenesis of influenza A virus (IAV) infections, this study focused on characterizing the inflammatory network and identifying key proteins by combining high-throughput data and computational techniques. We constructed the cell-specific normal and inflammatory networks for H5N1 and H1N1 infections by integrating high-throughput data. We demonstrated that network entropy discriminates between normal and inflammatory networks better than other topological metrics. Moreover, we identified different dynamical interactions among TLR2, IL-1β, IL10 and NFκB between normal and inflammatory networks using an optimization algorithm. In particular, good robustness and multistability of inflammatory sub-networks were discovered. Furthermore, we identified a complex, TNFSF10/HDAC4/HDAC5, which may play important roles in controlling inflammation, and demonstrated that changes in network entropy of this complex were negatively correlated with those of three proteins: TNFα, NFκB and COX-2. These findings provide significant hypotheses for further exploring the molecular mechanisms of infectious diseases and developing control strategies.
Guerra, Concettina
2015-01-01
Protein complexes are key molecular entities that perform a variety of essential cellular functions. The connectivity of proteins within a complex has been widely investigated with both experimental and computational techniques. We developed a computational approach to identify and characterise proteins that play a role in interconnecting complexes. We computed a measure of inter-complex centrality, the crossroad index, based on disjoint paths connecting proteins in distinct complexes and identified inter-complex hubs as proteins with a high value of the crossroad index. We applied the approach to a set of stable complexes in Saccharomyces cerevisiae and in Homo sapiens. As is done for hubs, we evaluated the topological and biological properties of inter-complex hubs, addressing the following questions. Do inter-complex hubs tend to be evolutionarily conserved? What is the relation between crossroad index and essentiality? We found a good correlation between inter-complex hubs and both evolutionary conservation and essentiality.
Alignment and integration of complex networks by hypergraph-based spectral clustering
NASA Astrophysics Data System (ADS)
Michoel, Tom; Nachtergaele, Bruno
2012-11-01
Complex networks possess a rich, multiscale structure reflecting the dynamical and functional organization of the systems they model. Often there is a need to analyze multiple networks simultaneously, to model a system by more than one type of interaction, or to go beyond simple pairwise interactions, but currently there is a lack of theoretical and computational methods to address these problems. Here we introduce a framework for clustering and community detection in such systems using hypergraph representations. Our main result is a generalization of the Perron-Frobenius theorem from which we derive spectral clustering algorithms for directed and undirected hypergraphs. We illustrate our approach with applications for local and global alignment of protein-protein interaction networks between multiple species, for tripartite community detection in folksonomies, and for detecting clusters of overlapping regulatory pathways in directed networks.
Andreopoulos, Bill; Winter, Christof; Labudde, Dirk; Schroeder, Michael
2009-06-27
A lot of high-throughput studies produce protein-protein interaction networks (PPINs) with many errors and missing information. Even for genome-wide approaches, there is often a low overlap between PPINs produced by different studies. Second-level neighbors separated by two protein-protein interactions (PPIs) were previously used for predicting protein function and finding complexes in high-error PPINs. We retrieve second level neighbors in PPINs, and complement these with structural domain-domain interactions (SDDIs) representing binding evidence on proteins, forming PPI-SDDI-PPI triangles. We find low overlap between PPINs, SDDIs and known complexes, all well below 10%. We evaluate the overlap of PPI-SDDI-PPI triangles with known complexes from Munich Information center for Protein Sequences (MIPS). PPI-SDDI-PPI triangles have ~20 times higher overlap with MIPS complexes than using second-level neighbors in PPINs without SDDIs. The biological interpretation for triangles is that a SDDI causes two proteins to be observed with common interaction partners in high-throughput experiments. The relatively few SDDIs overlapping with PPINs are part of highly connected SDDI components, and are more likely to be detected in experimental studies. We demonstrate the utility of PPI-SDDI-PPI triangles by reconstructing myosin-actin processes in the nucleus, cytoplasm, and cytoskeleton, which were not obvious in the original PPIN. Using other complementary datatypes in place of SDDIs to form triangles, such as PubMed co-occurrences or threading information, results in a similar ability to find protein complexes. Given high-error PPINs with missing information, triangles of mixed datatypes are a promising direction for finding protein complexes. Integrating PPINs with SDDIs improves finding complexes. Structural SDDIs partially explain the high functional similarity of second-level neighbors in PPINs. 
We estimate that relatively little structural information would be sufficient for finding complexes involving most of the proteins and interactions in a typical PPIN. PMID:19558694
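The PPI-SDDI-PPI triangle idea can be sketched as follows: two proteins that share a common PPI partner (second-level neighbors) form a triangle when a structural domain-domain interaction links them directly. The Python code below is a minimal sketch under that reading of the abstract; the protein labels and edge lists are invented, and `ppi_sddi_ppi_triangles` is a hypothetical name rather than the authors' implementation.

```python
from itertools import combinations

def ppi_sddi_ppi_triangles(ppi_edges, sddi_edges):
    """Return (a, middle, c) triples where a-middle and middle-c are PPIs
    and a-c is supported by an SDDI."""
    neighbors = {}
    for a, b in ppi_edges:
        neighbors.setdefault(a, set()).add(b)
        neighbors.setdefault(b, set()).add(a)
    # Store SDDIs as unordered pairs so direction does not matter.
    sddi = {frozenset(edge) for edge in sddi_edges}
    triangles = set()
    for middle, nbrs in neighbors.items():
        # Every pair of neighbors of `middle` is a pair of second-level neighbors.
        for a, c in combinations(sorted(nbrs), 2):
            if frozenset((a, c)) in sddi:
                triangles.add((a, middle, c))
    return triangles

# Hypothetical toy networks.
ppi = [("P1", "P2"), ("P2", "P3"), ("P1", "P4")]
sddi = [("P3", "P1")]
found = ppi_sddi_ppi_triangles(ppi, sddi)
```

Here P1 and P3 are second-level neighbors through P2, and the SDDI between them closes the triangle. The pair P2 and P4, which also shares a partner (P1), is not reported because no SDDI supports it.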
Ruan, Peiying; Hayashida, Morihiro; Maruyama, Osamu; Akutsu, Tatsuya
2013-01-01
Since many proteins express their functional activity by interacting with other proteins and forming protein complexes, it is very useful to identify sets of proteins that form complexes. For that purpose, many prediction methods for protein complexes from protein-protein interactions have been developed such as MCL, MCODE, RNSC, PCP, RRW, and NWE. These methods have dealt with only complexes with size of more than three because the methods often are based on some density of subgraphs. However, heterodimeric protein complexes that consist of two distinct proteins occupy a large part according to several comprehensive databases of known complexes. In this paper, we propose several feature space mappings from protein-protein interaction data, in which each interaction is weighted based on reliability. Furthermore, we make use of prior knowledge on protein domains to develop feature space mappings, domain composition kernel and its combination kernel with our proposed features. We perform ten-fold cross-validation computational experiments. These results suggest that our proposed kernel considerably outperforms the naive Bayes-based method, which is the best existing method for predicting heterodimeric protein complexes. PMID:23776458
Network analysis reveals the recognition mechanism for complex formation of mannose-binding lectins
NASA Astrophysics Data System (ADS)
Jian, Yiren; Zhao, Yunjie; Zeng, Chen
The specific carbohydrate binding of lectin makes the protein a powerful molecular tool for various applications including cancer cell detection due to its glycoprotein profile on the cell surface. Most biologically active lectins are dimeric. To understand the structure-function relation of lectin complex, it is essential to elucidate the short- and long-range driving forces behind the dimer formation. Here we report our molecular dynamics simulations and associated dynamical network analysis on a particular lectin, i.e., the mannose-binding lectin from garlic. Our results, further supported by sequence coevolution analysis, shed light on how different parts of the complex communicate with each other. We propose a general framework for deciphering the recognition mechanism underlying protein-protein interactions that may have potential applications in signaling pathways.
Wang, Rui-Sheng; Loscalzo, Joseph
2018-05-20
Understanding the genetic basis of complex diseases is challenging. Prior work shows that disease-related proteins do not typically function in isolation. Rather, they often interact with each other to form a network module that underlies dysfunctional mechanistic pathways. Identifying such disease modules will provide insights into a systems-level understanding of molecular mechanisms of diseases. Owing to the incompleteness of our knowledge of disease proteins and limited information on the biological mediators of pathobiological processes, the key proteins (seed proteins) for many diseases appear scattered over the human protein-protein interactome and form a few small branches, rather than coherent network modules. In this paper, we develop a network-based algorithm, called the Seed Connector algorithm (SCA), to pinpoint disease modules by adding as few additional linking proteins (seed connectors) to the seed protein pool as possible. Such seed connectors are hidden disease module elements that are critical for interpreting the functional context of disease proteins. The SCA aims to connect seed disease proteins so that disease mechanisms and pathways can be decoded based on predicted coherent network modules. We validate the algorithm using a large corpus of 70 complex diseases and binding targets of over 200 drugs, and demonstrate the biological relevance of the seed connectors. Lastly, as a specific proof of concept, we apply the SCA to a set of seed proteins for coronary artery disease derived from a meta-analysis of large-scale genome-wide association studies and obtain a coronary artery disease module enriched with important disease-related signaling pathways and drug targets not previously recognized. Copyright © 2018 Elsevier Ltd. All rights reserved.
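To illustrate the idea of connecting scattered seed proteins with as few linking proteins as possible, the sketch below uses a simple greedy rule: repeatedly add the neighboring non-seed protein that most enlarges the largest connected component of the current pool, and stop when no single protein merges anything. This is a simplified stand-in written for illustration, not the published Seed Connector algorithm; the graph, the node names, and the stopping rule are assumptions.

```python
def largest_component_size(nodes, adj):
    """Size of the largest connected component of the subgraph induced by `nodes`."""
    nodes = set(nodes)
    seen, best = set(), 0
    for start in nodes:
        if start in seen:
            continue
        seen.add(start)
        stack, size = [start], 0
        while stack:
            u = stack.pop()
            size += 1
            for v in adj.get(u, ()):
                if v in nodes and v not in seen:
                    seen.add(v)
                    stack.append(v)
        best = max(best, size)
    return best

def greedy_seed_connectors(edges, seeds):
    """Greedily pick connectors that merge components of the seed pool."""
    adj = {}
    for a, b in edges:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    pool, connectors = set(seeds), []
    while True:
        current = largest_component_size(pool, adj)
        candidates = {v for u in pool for v in adj.get(u, ())} - pool
        # A gain of 1 only means the candidate attached itself; require more.
        best_gain, best_node = 1, None
        for cand in sorted(candidates):
            gain = largest_component_size(pool | {cand}, adj) - current
            if gain > best_gain:
                best_gain, best_node = gain, cand
        if best_node is None:
            return connectors
        pool.add(best_node)
        connectors.append(best_node)

# Hypothetical interactome: seeds S1, S2, S3 are linked only through X and Y.
edges = [("S1", "X"), ("X", "S2"), ("S2", "Y"), ("Y", "S3"), ("S3", "Z")]
connectors = greedy_seed_connectors(edges, ["S1", "S2", "S3"])
```

A known limitation of this one-step greedy rule is that it cannot bridge two seeds that are separated by a chain of two or more non-seed proteins, since no single addition merges components in that case.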
PROPER: global protein interaction network alignment through percolation matching.
Kazemi, Ehsan; Hassani, Hamed; Grossglauser, Matthias; Pezeshgi Modarres, Hassan
2016-12-12
The alignment of protein-protein interaction (PPI) networks enables us to uncover the relationships between different species, which leads to a deeper understanding of biological systems. Network alignment can be used to transfer biological knowledge between species. Although different PPI-network alignment algorithms were introduced during the last decade, developing an accurate and scalable algorithm that can find alignments with high biological and structural similarities among PPI networks is still challenging. In this paper, we introduce a new global network alignment algorithm for PPI networks called PROPER. Compared to other global network alignment methods, our algorithm shows higher accuracy and speed over real PPI datasets and synthetic networks. We show that the PROPER algorithm can detect large portions of conserved biological pathways between species. Also, using a simple parsimonious evolutionary model, we explain why PROPER performs well based on several different comparison criteria. We highlight that PROPER has high potential in further applications such as detecting biological pathways, finding protein complexes and PPI prediction. The PROPER algorithm is available at http://proper.epfl.ch.
Ichibangase, Tomoko; Sugawara, Yasuhiro; Yamabe, Akio; Koshiyama, Akiyo; Yoshimura, Akari; Enomoto, Takemi; Imai, Kazuhiro
2012-01-01
Systems biology aims to understand biological phenomena in terms of complex biological and molecular interactions, and thus proteomics plays an important role in elucidating protein networks. However, many proteomic methods have suffered from their high variability, resulting in only showing altered protein names. Here, we propose a strategy for elucidating cellular protein networks based on an FD-LC-MS/MS proteomic method. The strategy permits reproducible relative quantitation of differences in protein levels between different cell populations and allows for integration of the data with those obtained through other methods. We demonstrate the validity of the approach through a comparison of differential protein expression in normal and conditional superoxide dismutase 1 gene knockout cells and believe that beginning with an FD-LC-MS/MS proteomic approach will enable researchers to elucidate protein networks more easily and comprehensively. PMID:23029042
A Localized Complex of Two Protein Oligomers Controls the Orientation of Cell Polarity
Lasker, Keren; Ahrens, Daniel G.; Eckart, Michael R.
2017-01-01
Signaling hubs at bacterial cell poles establish cell polarity in the absence of membrane-bound compartments. In the asymmetrically dividing bacterium Caulobacter crescentus, cell polarity stems from the cell cycle-regulated localization and turnover of signaling protein complexes in these hubs, and yet the mechanisms that define the identity of the two cell poles have not been established. Here, we recapitulate the tripartite assembly of a cell fate signaling complex that forms during the G1-S transition. Using in vivo and in vitro analyses of dynamic polar protein complex formation, we show that a polymeric cell polarity protein, SpmX, serves as a direct bridge between the PopZ polymeric network and the cell fate-directing DivJ histidine kinase. We demonstrate the direct binding between these three proteins and show that a polar microdomain spontaneously assembles when the three proteins are coexpressed heterologously in an Escherichia coli test system. The relative copy numbers of these proteins are essential for complex formation, as overexpression of SpmX in Caulobacter reorganizes the polarity of the cell, generating ectopic cell poles containing PopZ and DivJ. Hierarchical formation of higher-order SpmX oligomers nucleates new PopZ microdomain assemblies at the incipient lateral cell poles, driving localized outgrowth. By comparison to self-assembling protein networks and polar cell growth mechanisms in other bacterial species, we suggest that the cooligomeric PopZ-SpmX protein complex in Caulobacter illustrates a paradigm for coupling cell cycle progression to the controlled geometry of cell pole establishment. PMID:28246363
Bi, Dongbin; Ning, Hao; Liu, Shuai; Que, Xinxiang; Ding, Kejia
2015-06-01
To explore molecular mechanisms of bladder cancer (BC), network strategy was used to find biomarkers for early detection and diagnosis. The differentially expressed genes (DEGs) between bladder carcinoma patients and normal subjects were screened using empirical Bayes method of the linear models for microarray data package. Co-expression networks were constructed by differentially co-expressed genes and links. Regulatory impact factors (RIF) metric was used to identify critical transcription factors (TFs). The protein-protein interaction (PPI) networks were constructed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and clusters were obtained through molecular complex detection (MCODE) algorithm. Centralities analyses for complex networks were performed based on degree, stress and betweenness. Enrichment analyses were performed based on Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Co-expression networks and TFs (based on expression data of global DEGs and DEGs in different stages and grades) were identified. Hub genes of complex networks, such as UBE2C, ACTA2, FABP4, CKS2, FN1 and TOP2A, were also obtained according to analysis of degree. In gene enrichment analyses of global DEGs, cell adhesion, proteinaceous extracellular matrix and extracellular matrix structural constituent were top three GO terms. ECM-receptor interaction, focal adhesion, and cell cycle were significant pathways. Our results provide some potential underlying biomarkers of BC. However, further validation is required and deep studies are needed to elucidate the pathogenesis of BC. Copyright © 2015 Elsevier Ltd. All rights reserved.
Suratanee, Apichat; Plaimas, Kitiporn
2017-01-01
The associations between proteins and diseases are crucial information for investigating pathological mechanisms. However, the number of known and reliable protein-disease associations is quite small. In this study, an analysis framework to infer associations between proteins and diseases was developed based on a large data set of a human protein-protein interaction network integrating an effective network search, namely, the reverse k-nearest neighbor (RkNN) search. The RkNN search was used to identify an impact of a protein on other proteins. Then, associations between proteins and diseases were inferred statistically. The method using the RkNN search yielded a much higher precision than a random selection, standard nearest neighbor search, or when applying the method to a random protein-protein interaction network. All protein-disease pair candidates were verified by a literature search. Supporting evidence for 596 pairs was identified. In addition, cluster analysis of these candidates revealed 10 promising groups of diseases to be further investigated experimentally. This method can be used to identify novel associations to better understand complex relationships between proteins and diseases.
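A reverse k-nearest neighbor query asks which nodes count the query node among their own k nearest neighbors. The sketch below shows that idea on an unweighted toy graph. It is a minimal illustration, assuming hop count as the distance and alphabetical tie-breaking; the abstract does not state which distance or tie rule the authors used, and all function names are hypothetical.

```python
from collections import deque


def bfs_distances(adj, source):
    """Hop distance from `source` to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def k_nearest(adj, node, k):
    """The k closest other nodes; ties are broken by node name (assumption)."""
    dist = bfs_distances(adj, node)
    ranked = sorted((d, n) for n, d in dist.items() if n != node)
    return {n for _, n in ranked[:k]}


def reverse_knn(adj, query, k):
    """Nodes that have `query` among their own k nearest neighbors."""
    return {n for n in adj if n != query and query in k_nearest(adj, n, k)}


# Toy network (assumed): chain A - B - C - D with E attached to B.
toy = {
    "A": ["B"],
    "B": ["A", "C", "E"],
    "C": ["B", "D"],
    "D": ["C"],
    "E": ["B"],
}
hub_result = reverse_knn(toy, "B", 1)
leaf_result = reverse_knn(toy, "D", 1)
```

In this toy case the hub B is the nearest neighbor of three nodes while the leaf D is the nearest neighbor of none, which is the kind of asymmetry an RkNN search uses to gauge the influence of one node on others.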
Kaufholdt, David; Baillie, Christin-Kirsty; Meinen, Rieke; Mendel, Ralf R; Hänsch, Robert
2017-01-01
Survival of plants and nearly all organisms depends on the pterin based molybdenum cofactor (Moco) as well as its effective biosynthesis and insertion into apo-enzymes. To this end, both the central Moco biosynthesis enzymes are characterized and the conserved four-step reaction pathway for Moco biosynthesis is well-understood. However, protection mechanisms to prevent degradation during biosynthesis as well as transfer of the highly oxygen sensitive Moco and its intermediates are not fully understood. The formation of protein complexes involving transient protein-protein interactions is an efficient strategy for protected metabolic channelling of sensitive molecules. In this review, the Moco biosynthesis and allocation network is presented and discussed. This network was intensively studied based on two in vivo interaction methods: bimolecular fluorescence complementation (BiFC) and split-luciferase. Whereas BiFC allows localisation of interacting partners, the split-luciferase assay determines interaction strengths in vivo. Results demonstrate (i) interaction of Cnx2 and Cnx3 within the mitochondria and (ii) assembly of a biosynthesis complex including the cytosolic enzymes Cnx5, Cnx6, Cnx7, and Cnx1, which enables a protected transfer of intermediates. The whole complex is associated with actin filaments via Cnx1 as anchor protein. After biosynthesis, Moco needs to be handed over to the specific apo-enzymes. A potential pathway was discovered. Molybdenum-containing enzymes of the sulphite oxidase family interact directly with Cnx1. In contrast, the xanthine oxidoreductase family acquires Moco indirectly via a Moco binding protein (MoBP2) and Moco sulphurase ABA3. In summary, the uncovered interaction matrix enables an efficient transfer for intermediate and product protection via micro-compartmentation.
Maerker, Tina; van Wijk, Erwin; Overlack, Nora; Kersten, Ferry F J; McGee, Joann; Goldmann, Tobias; Sehn, Elisabeth; Roepman, Ronald; Walsh, Edward J; Kremer, Hannie; Wolfrum, Uwe
2008-01-01
The human Usher syndrome (USH) is the most frequent cause of combined deaf-blindness. USH is genetically heterogeneous with at least 12 chromosomal loci assigned to three clinical types, USH1-3. Although these USH types exhibit similar phenotypes in human, the corresponding gene products belong to very different protein classes and families. The scaffold protein harmonin (USH1C) was shown to integrate all identified USH1 and USH2 molecules into protein networks. Here, we analyzed a protein network organized in the absence of harmonin by the scaffold proteins SANS (USH1G) and whirlin (USH2D). Immunoelectron microscopic analyses disclosed the colocalization of all network components in the apical inner segment collar and the ciliary apparatus of mammalian photoreceptor cells. In this complex, whirlin and SANS directly interact. Furthermore, SANS provides a linkage to the microtubule transport machinery, whereas whirlin may anchor USH2A isoform b and VLGR1b (very large G-protein coupled receptor 1b) via binding to their cytodomains at specific membrane domains. The long ectodomains of both transmembrane proteins extend into the gap between the adjacent membranes of the connecting cilium and the apical inner segment. Analyses of Vlgr1/del7TM mice revealed the ectodomain of VLGR1b as a component of fibrous links present in this gap. Comparative analyses of mouse and Xenopus photoreceptors demonstrated that this USH protein network is also part of the periciliary ridge complex in Xenopus. Since this structural specialization in amphibian photoreceptor cells defines a specialized membrane domain for docking and fusion of transport vesicles, we suggest a prominent role of the USH proteins in cargo shipment.
Rossin, Elizabeth J.; Lage, Kasper; Raychaudhuri, Soumya; Xavier, Ramnik J.; Tatar, Diana; Benita, Yair
2011-01-01
Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed by these risk variants. It has previously been observed that different genes harboring causal mutations for the same Mendelian disease often physically interact. We sought to evaluate the degree to which this is true of genes within strongly associated loci in complex disease. Using sets of loci defined in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein–protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between protein products of associated genes. We apply multiple permutation approaches to show that these networks are more densely connected than chance expectation. To confirm biological relevance, we show that the components of the networks tend to be expressed in similar tissues relevant to the phenotypes in question, suggesting the network indicates common underlying processes perturbed by risk loci. Furthermore, we show that the RA and CD networks have predictive power by demonstrating that proteins in these networks, not encoded in the confirmed list of disease associated loci, are significantly enriched for association to the phenotypes in question in extended GWAS analysis. Finally, we test our method in 3 non-immune traits to assess its applicability to complex traits in general. We find that genes in loci associated to height and lipid levels assemble into significantly connected networks but did not detect excess connectivity among Type 2 Diabetes (T2D) loci beyond chance. 
Taken together, our results constitute evidence that, for many of the complex diseases studied here, common genetic associations implicate regions encoding proteins that physically interact in a preferential manner, in line with observations in Mendelian disease. PMID:21249183
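The claim that a set of proteins is "more densely connected than chance expectation" can be made concrete with a permutation test. The sketch below is a simplified illustration: it compares the number of interactions inside a gene set with the counts obtained for random node sets of the same size. The abstract does not specify the permutation schemes used, so uniform random sampling, the function names and the toy network are assumptions; a stricter null model would also preserve node degree.

```python
import random


def edges_within(edges, nodes):
    """Number of edges whose two endpoints both lie in `nodes`."""
    nodes = set(nodes)
    return sum(1 for u, v in edges if u in nodes and v in nodes)


def permutation_p_value(edges, all_nodes, gene_set, n_perm=1000, seed=0):
    """Empirical p-value for the within-set edge count (uniform node sampling)."""
    rng = random.Random(seed)
    observed = edges_within(edges, gene_set)
    size = len(set(gene_set))
    hits = 0
    for _ in range(n_perm):
        sample = rng.sample(all_nodes, size)
        if edges_within(edges, sample) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return observed, (hits + 1) / (n_perm + 1)


# Toy network (assumed): a 4-node clique (nodes 0-3) attached to a sparse chain.
clique = [(i, j) for i in range(4) for j in range(i + 1, 4)]
chain = [(i, i + 1) for i in range(3, 19)]
toy_edges = clique + chain
toy_nodes = list(range(20))

observed, p_value = permutation_p_value(toy_edges, toy_nodes, [0, 1, 2, 3])
```

Here the clique contains six internal edges, a count that random four-node sets almost never reach, so the empirical p-value is small.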
RNA regulatory networks diversified through curvature of the PUF protein scaffold
Wilinski, Daniel; Qiu, Chen; Lapointe, Christopher P.; ...
2015-09-14
Proteins bind and control mRNAs, directing their localization, translation and stability. Members of the PUF family of RNA-binding proteins control multiple mRNAs in a single cell, and play key roles in development, stem cell maintenance and memory formation. Here we identified the mRNA targets of a S. cerevisiae PUF protein, Puf5p, by ultraviolet-crosslinking-affinity purification and high-throughput sequencing (HITS-CLIP). The binding sites recognized by Puf5p are diverse, with variable spacer lengths between two specific sequences. Each length of site correlates with a distinct biological function. Crystal structures of Puf5p–RNA complexes reveal that the protein scaffold presents an exceptionally flat and extended interaction surface relative to other PUF proteins. In complexes with RNAs of different lengths, the protein is unchanged. A single PUF protein repeat is sufficient to induce broadening of specificity. Changes in protein architecture, such as alterations in curvature, may lead to evolution of mRNA regulatory networks.
Weak Links: Stabilizers of Complex Systems from Proteins to Social Networks
NASA Astrophysics Data System (ADS)
Csermely, Peter
Why do women stabilize our societies? Why can we enjoy and understand Shakespeare? Why are fruitflies uniform? Why do omnivorous eating habits aid our survival? Why is Mona Lisa's smile beautiful? -- Is there any answer to these questions? This book shows that the statement: "weak links stabilize complex systems" holds the answers to all of the surprising questions above. The author (recipient of several distinguished science communication prizes) uses weak (low affinity, low probability) interactions as a thread to introduce a vast variety of networks from proteins to ecosystems.
Wu, Min; Kwoh, Chee-Keong; Li, Xiaoli; Zheng, Jie
2014-09-11
The regulatory mechanism of recombination is one of the most fundamental problems in genomics, with wide applications in genome wide association studies (GWAS), birth-defect diseases, molecular evolution, cancer research, etc. Recombination events cluster into short genomic regions called "recombination hotspots". Recently, a zinc finger protein PRDM9 was reported to regulate recombination hotspots in human and mouse genomes. In addition, a 13-mer motif contained in the binding sites of PRDM9 is found to be enriched in human hotspots. However, this 13-mer motif only covers a fraction of hotspots, indicating that PRDM9 is not the only regulator of recombination hotspots. Therefore, the challenge of discovering other regulators of recombination hotspots becomes significant. Furthermore, recombination is a complex process. Hence, multiple proteins acting as machinery, rather than individual proteins, are more likely to carry out this process in a precise and stable manner. Therefore, the extension of the prediction of individual trans-regulators to protein complexes is also highly desired. In this paper, we introduce a pipeline to identify genes and protein complexes associated with recombination hotspots. First, we prioritize proteins associated with hotspots based on their preference of binding to hotspots and coldspots. Second, using the above identified genes as seeds, we apply the Random Walk with Restart algorithm (RWR) to propagate their influences to other proteins in protein-protein interaction (PPI) networks. Hence, many proteins without DNA-binding information will also be assigned a score to implicate their roles in recombination hotspots. Third, we construct sub-PPI networks induced by top genes ranked by RWR for various species (e.g., yeast, human and mouse) and detect protein complexes in those sub-PPI networks. 
The GO term analysis shows that our prioritizing methods and the RWR algorithm are capable of identifying novel genes associated with recombination hotspots. The trans-regulators predicted by our pipeline are enriched with epigenetic functions (e.g., histone modifications), demonstrating the epigenetic regulatory mechanisms of recombination hotspots. The identified protein complexes also provide us with candidates to further investigate the molecular machineries for recombination hotspots. Moreover, the experimental data and results are available on our web site http://www.ntu.edu.sg/home/zhengjie/data/RecombinationHotspot/NetPipe/.
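The propagation step of this pipeline, Random Walk with Restart, iterates p(t+1) = (1 - r) W p(t) + r p(0), where W is the column-normalized adjacency matrix, p(0) spreads probability evenly over the seeds and r is the restart probability. The following is a minimal sketch of that update on a toy network. The restart value of 0.3, the convergence tolerance, the function name and the network itself are assumptions for illustration, and every node is assumed to have at least one neighbor.

```python
def random_walk_with_restart(adj, seeds, restart=0.3, tol=1e-10, max_iter=1000):
    """Steady-state visiting probabilities of a walker that restarts at the seeds."""
    nodes = sorted(adj)
    p0 = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    p = dict(p0)
    for _ in range(max_iter):
        # Restart term: jump back to the seed distribution with probability r.
        nxt = {n: restart * p0[n] for n in nodes}
        # Walk term: each node passes its remaining mass equally to its neighbors.
        for u in nodes:
            share = (1.0 - restart) * p[u] / len(adj[u])
            for v in adj[u]:
                nxt[v] += share
        change = sum(abs(nxt[n] - p[n]) for n in nodes)
        p = nxt
        if change < tol:
            break
    return p


# Toy PPI chain (assumed): seed S - A - B - C.
toy = {"S": ["A"], "A": ["S", "B"], "B": ["A", "C"], "C": ["B"]}
scores = random_walk_with_restart(toy, {"S"})
ranking = sorted(scores, key=scores.get, reverse=True)
```

Proteins with no direct evidence (A, B and C here) still receive a score that decays with network distance from the seed, which is how proteins lacking DNA-binding data can be ranked.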
Efficient prediction of human protein-protein interactions at a global scale.
Schoenrock, Andrew; Samanfar, Bahram; Pitre, Sylvain; Hooshyar, Mohsen; Jin, Ke; Phillips, Charles A; Wang, Hui; Phanse, Sadhna; Omidi, Katayoun; Gui, Yuan; Alamgir, Md; Wong, Alex; Barrenäs, Fredrik; Babu, Mohan; Benson, Mikael; Langston, Michael A; Green, James R; Dehne, Frank; Golshani, Ashkan
2014-12-10
Our knowledge of global protein-protein interaction (PPI) networks in complex organisms such as humans is hindered by technical limitations of current methods. On the basis of short co-occurring polypeptide regions, we developed a tool called MP-PIPE capable of predicting a global human PPI network within 3 months. With a recall of 23% at a precision of 82.1%, we predicted 172,132 putative PPIs. We demonstrate the usefulness of these predictions through a range of experiments. The speed and accuracy associated with MP-PIPE can make this a potential tool to study individual human PPI networks (from genomic sequences alone) for personalized medicine.
Peroxisome protein import: a complex journey.
Baker, Alison; Lanyon-Hogg, Thomas; Warriner, Stuart L
2016-06-15
The import of proteins into peroxisomes possesses many unusual features such as the ability to import folded proteins, and a surprising diversity of targeting signals with differing affinities that can be recognized by the same receptor. As understanding of the structure and function of many components of the protein import machinery has grown, an increasingly complex network of factors affecting each step of the import pathway has emerged. Structural studies have revealed the presence of additional interactions between cargo proteins and the PEX5 receptor that affect import potential, with a subtle network of cargo-induced conformational changes in PEX5 being involved in the import process. Biochemical studies have also indicated an interdependence of receptor-cargo import with release of unloaded receptor from the peroxisome. Here, we provide an update on recent literature concerning mechanisms of protein import into peroxisomes. © 2016 The Author(s).
P-Finder: Reconstruction of Signaling Networks from Protein-Protein Interactions and GO Annotations.
Young-Rae Cho; Yanan Xin; Speegle, Greg
2015-01-01
Because most complex genetic diseases are caused by defects of cell signaling, illuminating a signaling cascade is essential for understanding their mechanisms. We present three novel computational algorithms to reconstruct signaling networks between a starting protein and an ending protein using genome-wide protein-protein interaction (PPI) networks and gene ontology (GO) annotation data. A signaling network is represented as a directed acyclic graph in a merged form of multiple linear pathways. An advanced semantic similarity metric is applied for weighting PPIs as the preprocessing of all three methods. The first algorithm repeatedly extends the list of nodes based on path frequency towards an ending protein. The second algorithm repeatedly appends edges based on the occurrence of network motifs which indicate the link patterns more frequently appearing in a PPI network than in a random graph. The last algorithm uses the information propagation technique which iteratively updates edge orientations based on the path strength and merges the selected directed edges. Our experimental results demonstrate that the proposed algorithms achieve higher accuracy than previous methods when they are tested on well-studied pathways of S. cerevisiae. Furthermore, we introduce an interactive web application tool, called P-Finder, to visualize reconstructed signaling networks.
Sequence co-evolution gives 3D contacts and structures of protein complexes
Hopf, Thomas A; Schärfe, Charlotta P I; Rodrigues, João P G L M; Green, Anna G; Kohlbacher, Oliver; Sander, Chris; Bonvin, Alexandre M J J; Marks, Debora S
2014-01-01
Protein–protein interactions are fundamental to many biological processes. Experimental screens have identified tens of thousands of interactions, and structural biology has provided detailed functional insight for select 3D protein complexes. An alternative rich source of information about protein interactions is the evolutionary sequence record. Building on earlier work, we show that analysis of correlated evolutionary sequence changes across proteins identifies residues that are close in space with sufficient accuracy to determine the three-dimensional structure of the protein complexes. We evaluate prediction performance in blinded tests on 76 complexes of known 3D structure, predict protein–protein contacts in 32 complexes of unknown structure, and demonstrate how evolutionary couplings can be used to distinguish between interacting and non-interacting protein pairs in a large complex. With the current growth of sequences, we expect that the method can be generalized to genome-wide elucidation of protein–protein interaction networks and used for interaction predictions at residue resolution. DOI: http://dx.doi.org/10.7554/eLife.03430.001 PMID:25255213
Aligning Biomolecular Networks Using Modular Graph Kernels
NASA Astrophysics Data System (ADS)
Towfic, Fadi; Greenlee, M. Heather West; Honavar, Vasant
Comparative analysis of biomolecular networks constructed using measurements from different conditions, tissues, and organisms offers a powerful approach to understanding the structure, function, dynamics, and evolution of complex biological systems. We explore a class of algorithms for aligning large biomolecular networks by breaking down such networks into subgraphs and computing the alignment of the networks based on the alignment of their subgraphs. The resulting subnetworks are compared using graph kernels as scoring functions. We provide implementations of the resulting algorithms as part of BiNA, an open source biomolecular network alignment toolkit. Our experiments using Drosophila melanogaster, Saccharomyces cerevisiae, Mus musculus and Homo sapiens protein-protein interaction networks extracted from the DIP repository of protein-protein interaction data demonstrate that the performance of the proposed algorithms (as measured by % GO term enrichment of subnetworks identified by the alignment) is competitive with some of the state-of-the-art algorithms for pair-wise alignment of large protein-protein interaction networks. Our results also show that the inter-species similarity scores computed based on graph kernels can be used to cluster the species into a species tree that is consistent with the known phylogenetic relationships among the species.
Jiang, Li; Edwards, Stefan M; Thomsen, Bo; Workman, Christopher T; Guldbrandtsen, Bernt; Sørensen, Peter
2014-09-24
Prioritizing genetic variants is a challenge because disease susceptibility loci are often located in genes of unknown function or the relationship with the corresponding phenotype is unclear. A global data-mining exercise on the biomedical literature can establish the phenotypic profile of genes with respect to their connection to disease phenotypes. The importance of protein-protein interaction networks in the genetic heterogeneity of common diseases or complex traits is becoming increasingly recognized. Thus, the development of a network-based approach combined with phenotypic profiling would be useful for disease gene prioritization. We developed a random-set scoring model and implemented it to quantify phenotype relevance in a network-based disease gene-prioritization approach. We validated our approach based on different gene phenotypic profiles, which were generated from PubMed abstracts, OMIM, and GeneRIF records. We also investigated the validity of several vocabulary filters and different likelihood thresholds for predicted protein-protein interactions in terms of their effect on the network-based gene-prioritization approach, which relies on text-mining of the phenotype data. Our method demonstrated good precision and sensitivity compared with those of two alternative complex-based prioritization approaches. We then conducted a global ranking of all human genes according to their relevance to a range of human diseases. The resulting accurate ranking of known causal genes supported the reliability of our approach. Moreover, these data suggest many promising novel candidate genes for human disorders that have a complex mode of inheritance. We have implemented and validated a network-based approach to prioritize genes for human diseases based on their phenotypic profile. We have devised a powerful and transparent tool to identify and rank candidate genes. 
Our global gene prioritization provides a unique resource for the biological interpretation of data from genome-wide association studies, and will help in the understanding of how the associated genetic variants influence disease or quantitative phenotypes.
Elastic Network Model of a Nuclear Transport Complex
NASA Astrophysics Data System (ADS)
Ryan, Patrick; Liu, Wing K.; Lee, Dockjin; Seo, Sangjae; Kim, Young-Jin; Kim, Moon K.
2010-05-01
The structure of Kap95p was obtained from the Protein Data Bank (www.pdb.org) and analyzed. RanGTP plays an important role in both nuclear protein import and export cycles. In the nucleus, RanGTP releases macromolecular cargoes from importins and conversely facilitates cargo binding to exportins. Although the crystal structure of the nuclear import complex formed by importin Kap95p and RanGTP was recently identified, its molecular mechanism still remains unclear. To understand the relationship between structure and function of a nuclear transport complex, a structure-based mechanical model of Kap95p:RanGTP complex is introduced. In this model, a protein structure is simply modeled as an elastic network in which a set of coarse-grained point masses are connected by linear springs representing biochemical interactions at atomic level. Harmonic normal mode analysis (NMA) and anharmonic elastic network interpolation (ENI) are performed to predict the modes of vibrations and a feasible pathway between locked and unlocked conformations of Kap95p, respectively. Simulation results imply that the binding of RanGTP to Kap95p induces the release of the cargo in the nucleus as well as prevents any new cargo from attaching to the Kap95p:RanGTP complex.
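The "point masses connected by linear springs" description corresponds to a stiffness (Hessian) matrix whose eigenvectors are the normal modes. The sketch below builds that matrix in the common anisotropic network model form, where every pair of points closer than a cutoff shares a spring of constant gamma. This formulation, the 7 Å cutoff, the unit spring constant and the four made-up coordinates are assumptions; the abstract does not give the authors' parameters, and points are assumed to be distinct.

```python
def anm_hessian(coords, cutoff=7.0, gamma=1.0):
    """3N x 3N stiffness matrix of a spring network over `coords` (list of xyz)."""
    n = len(coords)
    hessian = [[0.0] * (3 * n) for _ in range(3 * n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = [coords[j][a] - coords[i][a] for a in range(3)]
            r2 = sum(x * x for x in d)
            if r2 > cutoff ** 2:
                continue  # no spring between distant points
            for a in range(3):
                for b in range(3):
                    k = gamma * d[a] * d[b] / r2
                    # Off-diagonal blocks couple the two points.
                    hessian[3 * i + a][3 * j + b] -= k
                    hessian[3 * j + b][3 * i + a] -= k
                    # Diagonal blocks accumulate the restoring force on each point.
                    hessian[3 * i + a][3 * i + b] += k
                    hessian[3 * j + a][3 * j + b] += k
    return hessian


# Four hypothetical coarse-grained positions (in angstroms), not real Kap95p data.
toy_coords = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (3.8, 3.8, 0.0), (0.0, 3.8, 3.8)]
H = anm_hessian(toy_coords)

# A rigid shift along x must cost no energy: H applied to (1,0,0,1,0,0,...) is zero.
shift_x = [1.0 if col % 3 == 0 else 0.0 for col in range(len(H))]
force = [sum(row[c] * shift_x[c] for c in range(len(H))) for row in H]
```

Diagonalizing `H` with a numerical library (for example `numpy.linalg.eigh`) would give the vibrational modes; the six near-zero eigenvalues correspond to rigid translations and rotations and are discarded in normal mode analysis.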
Network representations of immune system complexity
Subramanian, Naeha; Torabi-Parizi, Parizad; Gottschalk, Rachel A.; Germain, Ronald N.; Dutta, Bhaskar
2015-01-01
The mammalian immune system is a dynamic multi-scale system composed of a hierarchically organized set of molecular, cellular and organismal networks that act in concert to promote effective host defense. These networks range from those involving gene regulatory and protein-protein interactions underlying intracellular signaling pathways and single cell responses to increasingly complex networks of in vivo cellular interaction, positioning and migration that determine the overall immune response of an organism. Immunity is thus not the product of simple signaling events but rather non-linear behaviors arising from dynamic, feedback-regulated interactions among many components. One of the major goals of systems immunology is to quantitatively measure these complex multi-scale spatial and temporal interactions, permitting development of computational models that can be used to predict responses to perturbation. Recent technological advances permit collection of comprehensive datasets at multiple molecular and cellular levels while advances in network biology support representation of the relationships of components at each level as physical or functional interaction networks. The latter facilitate effective visualization of patterns and recognition of emergent properties arising from the many interactions of genes, molecules, and cells of the immune system. We illustrate the power of integrating ‘omics’ and network modeling approaches for unbiased reconstruction of signaling and transcriptional networks with a focus on applications involving the innate immune system. We further discuss future possibilities for reconstruction of increasingly complex cellular and organism-level networks and development of sophisticated computational tools for prediction of emergent immune behavior arising from the concerted action of these networks. PMID:25625853
Physical properties of mixed dairy food proteins
USDA-ARS's Scientific Manuscript database
Mixed food protein gels are complex systems whose functional behaviors, such as gelling properties and viscosity, change depending on the miscibility of the proteins. We have noted that differences in co-solubility of mixed proteins created unique network structures and gel properties. The effects o...
Modelling protein functional domains in signal transduction using Maude
NASA Technical Reports Server (NTRS)
Sriram, M. G.
2003-01-01
Modelling of protein-protein interactions in signal transduction is receiving increased attention in computational biology. This paper describes recent research in the application of Maude, a symbolic language founded on rewriting logic, to the modelling of functional domains within signalling proteins. Protein functional domains (PFDs) are a critical focus of modern signal transduction research. In general, Maude models can simulate biological signalling networks and produce specific testable hypotheses at various levels of abstraction. Developing symbolic models of signalling proteins containing functional domains is important because of the potential to generate analyses of complex signalling networks based on structure-function relationships.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Baker, Lewis A.; Habershon, Scott, E-mail: S.Habershon@warwick.ac.uk
Pigment-protein complexes (PPCs) play a central role in facilitating excitation energy transfer (EET) from light-harvesting antenna complexes to reaction centres in photosynthetic systems; understanding molecular organisation in these biological networks is key to developing better artificial light-harvesting systems. In this article, we combine quantum-mechanical simulations and a network-based picture of transport to investigate how chromophore organization and protein environment in PPCs impacts on EET efficiency and robustness. In a prototypical PPC model, the Fenna-Matthews-Olson (FMO) complex, we consider the impact on EET efficiency of both disrupting the chromophore network and changing the influence of (local and global) environmental dephasing. Surprisingly,more » we find a large degree of resilience to changes in both chromophore network and protein environmental dephasing, the extent of which is greater than previously observed; for example, FMO maintains EET when 50% of the constituent chromophores are removed, or when environmental dephasing fluctuations vary over two orders-of-magnitude relative to the in vivo system. We also highlight the fact that the influence of local dephasing can be strongly dependent on the characteristics of the EET network and the initial excitation; for example, initial excitations resulting in rapid coherent decay are generally insensitive to the environment, whereas the incoherent population decay observed following excitation at weakly coupled chromophores demonstrates a more pronounced dependence on dephasing rate as a result of the greater possibility of local exciton trapping. Finally, we show that the FMO electronic Hamiltonian is not particularly optimised for EET; instead, it is just one of many possible chromophore organisations which demonstrate a good level of EET transport efficiency following excitation at different chromophores. 
Overall, these robustness and efficiency characteristics are attributed to the highly connected nature of the chromophore network and the presence of multiple EET pathways, features which might easily be built into artificial photosynthetic systems.
Lakshminarasimhan, Mahadevan; Boanca, Gina; Banks, Charles A. S.; Hattem, Gaye L.; Gabriel, Ana E.; Groppe, Brad D.; Smoyer, Christine; Malanowski, Kate E.; Peak, Allison; Florens, Laurence; Washburn, Michael P.
2016-01-01
The highly conserved yeast R2TP complex, consisting of Rvb1, Rvb2, Pih1, and Tah1, participates in diverse cellular processes ranging from assembly of protein complexes to apoptosis. Rvb1 and Rvb2 are closely related proteins belonging to the AAA+ superfamily and are essential for cell survival. Although Rvbs have been shown to be associated with various protein complexes including the Ino80 and Swr1 chromatin remodeling complexes, we performed a systematic quantitative proteomic analysis of their associated proteins and identified two additional complexes that associate with Rvb1 and Rvb2: the chaperonin-containing T-complex and the 19S regulatory particle of the proteasome complex. We also analyzed Rvb1 and Rvb2 purified from yeast strains devoid of PIH1 and TAH1. These analyses revealed that both Rvb1 and Rvb2 still associated with Hsp90 and were highly enriched with RNA polymerase II complex components. Our analyses also revealed that both Rvb1 and Rvb2 were recruited to the Ino80 and Swr1 chromatin remodeling complexes even in the absence of Pih1 and Tah1 proteins. Using further biochemical analysis, we showed that Rvb1 and Rvb2 directly interacted with Hsp90 as well as with the RNA polymerase II complex. RNA-Seq analysis of the deletion strains compared with the wild-type strains revealed an up-regulation of ribosome biogenesis and ribonucleoprotein complex biogenesis genes, down-regulation of response to abiotic stimulus genes, and down-regulation of response to temperature stimulus genes. A Gene Ontology analysis of the 80 proteins whose protein associations were altered in the PIH1 or TAH1 deletion strains found ribonucleoprotein complex proteins to be the most enriched category. This suggests an important function of the R2TP complex in ribonucleoprotein complex biogenesis at both the proteomic and genomic levels. Finally, these results demonstrate that deletion network analyses can provide novel insights into cellular systems. PMID:26831523
Fujiwara, Ikuko; Remmert, Kirsten; Piszczek, Grzegorz; Hammer, John A.
2014-01-01
Although capping protein (CP) terminates actin filament elongation, it promotes Arp2/3-dependent actin network assembly and accelerates actin-based motility both in vitro and in vivo. In vitro, capping protein Arp2/3 myosin I linker (CARMIL) antagonizes CP by reducing its affinity for the barbed end and by uncapping CP-capped filaments, whereas the protein V-1/myotrophin sequesters CP in an inactive complex. Previous work showed that CARMIL can readily retrieve CP from the CP:V-1 complex, thereby converting inactive CP into a version with moderate affinity for the barbed end. Here we further clarify the mechanism of this exchange reaction, and we demonstrate that the CP:CARMIL complex created by complex exchange slows the rate of barbed-end elongation by rapidly associating with, and dissociating from, the barbed end. Importantly, the cellular concentrations of V-1 and CP determined here argue that most CP is sequestered by V-1 at steady state in vivo. Finally, we show that CARMIL is recruited to the plasma membrane and only at cell edges undergoing active protrusion. Assuming that CARMIL is active only at this location, our data argue that a large pool of freely diffusing, inactive CP (CP:V-1) feeds, via CARMIL-driven complex exchange, the formation of weak-capping complexes (CP:CARMIL) at the plasma membrane of protruding edges. In vivo, therefore, CARMIL should promote Arp2/3-dependent actin network assembly at the leading edge by promoting barbed-end capping there. PMID:24778263
Decoding the non-coding RNAs in Alzheimer's disease.
Schonrock, Nicole; Götz, Jürgen
2012-11-01
Non-coding RNAs (ncRNAs) are integral components of biological networks with fundamental roles in regulating gene expression. They can integrate sequence information from the DNA code, epigenetic regulation and functions of multimeric protein complexes to potentially determine the epigenetic status and transcriptional network in any given cell. Humans potentially contain more ncRNAs than any other species, especially in the brain, where they may well play a significant role in human development and cognitive ability. This review discusses their emerging role in Alzheimer's disease (AD), a human pathological condition characterized by the progressive impairment of cognitive functions. We discuss the complexity of the ncRNA world and how this is reflected in the regulation of the amyloid precursor protein and Tau, two proteins with central functions in AD. By understanding this intricate regulatory network, there is hope for a better understanding of disease mechanisms and ultimately developing diagnostic and therapeutic tools.
PubNet: a flexible system for visualizing literature derived networks
Douglas, Shawn M; Montelione, Gaetano T; Gerstein, Mark
2005-01-01
We have developed PubNet, a web-based tool that extracts several types of relationships returned by PubMed queries and maps them into networks, allowing for graphical visualization, textual navigation, and topological analysis. PubNet supports the creation of complex networks derived from the contents of individual citations, such as genes, proteins, Protein Data Bank (PDB) IDs, Medical Subject Headings (MeSH) terms, and authors. This feature allows one to, for example, examine a literature derived network of genes based on functional similarity. PMID:16168087
Rehman, Zia Ur; Idris, Adnan; Khan, Asifullah
2018-06-01
Protein-Protein Interactions (PPI) play a vital role in cellular processes and are formed because of thousands of interactions among proteins. Advancements in proteomics technologies have resulted in huge PPI datasets that need to be systematically analyzed. Protein complexes are the locally dense regions in PPI networks, which play an important role in metabolic pathways and gene regulation. In this work, a novel two-phase protein complex detection and grouping mechanism is proposed. In the first phase, topological and biological features are extracted for each complex, and prediction performance is investigated using a Bagging-based Ensemble classifier (PCD-BEns). Performance evaluation through cross validation shows improvement in comparison to the CDIP, MCode, CFinder and PLSMC methods. The second phase employs Multi-Dimensional Scaling (MDS) for the grouping of known complexes by exploring inter-complex relations. It is experimentally observed that the combination of topological and biological features in the proposed approach has greatly enhanced prediction performance for protein complex detection, which may help to understand various biological processes, whereas application of MDS-based exploration may assist in grouping potentially similar complexes. Copyright © 2018 Elsevier Ltd. All rights reserved.
Diverse Supramolecular Nanofiber Networks Assembled by Functional Low-Complexity Domains.
An, Bolin; Wang, Xinyu; Cui, Mengkui; Gui, Xinrui; Mao, Xiuhai; Liu, Yan; Li, Ke; Chu, Cenfeng; Pu, Jiahua; Ren, Susu; Wang, Yanyi; Zhong, Guisheng; Lu, Timothy K; Liu, Cong; Zhong, Chao
2017-07-25
Self-assembling supramolecular nanofibers, common in the natural world, are of fundamental interest and technical importance to both nanotechnology and materials science. Despite important advances, synthetic nanofibers still lack the structural and functional diversity of biological molecules, and the controlled assembly of one type of molecule into a variety of fibrous structures with wide-ranging functional attributes remains challenging. Here, we harness the low-complexity (LC) sequence domain of fused in sarcoma (FUS) protein, an essential cellular nuclear protein with slow kinetics of amyloid fiber assembly, to construct random copolymer-like, multiblock, and self-sorted supramolecular fibrous networks with distinct structural features and fluorescent functionalities. We demonstrate the utilities of these networks in the templated, spatially controlled assembly of ligand-decorated gold nanoparticles, quantum dots, nanorods, DNA origami, and hybrid structures. Owing to the distinguishable nanoarchitectures of these nanofibers, this assembly is structure-dependent. By coupling a modular genetic strategy with kinetically controlled complex supramolecular self-assembly, we demonstrate that a single type of protein molecule can be used to engineer diverse one-dimensional supramolecular nanostructures with distinct functionalities.
BIND: the Biomolecular Interaction Network Database
Bader, Gary D.; Betel, Doron; Hogue, Christopher W. V.
2003-01-01
The Biomolecular Interaction Network Database (BIND: http://bind.ca) archives biomolecular interaction, complex and pathway information. A web-based system is available to query, view and submit records. BIND continues to grow with the addition of individual submissions as well as interaction data from the PDB and a number of large-scale interaction and complex mapping experiments using yeast two hybrid, mass spectrometry, genetic interactions and phage display. We have developed a new graphical analysis tool that provides users with a view of the domain composition of proteins in interaction and complex records to help relate functional domains to protein interactions. An interaction network clustering tool has also been developed to help focus on regions of interest. Continued input from users has helped further mature the BIND data specification, which now includes the ability to store detailed information about genetic interactions. The BIND data specification is available as ASN.1 and XML DTD. PMID:12519993
MOCASSIN-prot: A multi-objective clustering approach for protein similarity networks
USDA-ARS's Scientific Manuscript database
Motivation: Proteins often include multiple conserved domains. Various evolutionary events including duplication and loss of domains, domain shuffling, as well as sequence divergence contribute to generating complexities in protein structures, and consequently, in their functions. The evolutionary h...
2013-01-01
Background: In recent years, various types of cellular networks have penetrated biology and are nowadays used omnipresently for studying eukaryote and prokaryote organisms. Still, the relation and the biological overlap among phenomenological and inferential gene networks, e.g., between the protein interaction network and the gene regulatory network inferred from large-scale transcriptomic data, is largely unexplored. Results: We provide in this study an in-depth analysis of the structural, functional and chromosomal relationship between a protein-protein network, a transcriptional regulatory network and an inferred gene regulatory network, for S. cerevisiae and E. coli. Further, we study global and local aspects of these networks and their biological information overlap by comparing, e.g., the functional co-occurrence of Gene Ontology terms by exploiting the available interaction structure among the genes. Conclusions: Although the individual networks represent different levels of cellular interactions with global structural and functional dissimilarities, we observe crucial functions of their network interfaces for the assembly of protein complexes, proteolysis, transcription, translation, metabolic and regulatory interactions. Overall, our results shed light on the integrability of these networks and their interfacing biological processes. PMID:23663484
Manning, Brendan D
2012-07-10
In their study published in Science Signaling (Research Article, 27 March 2012, DOI: 10.1126/scisignal.2002469), Dalle Pezze et al. tackle the dynamic and complex wiring of the signaling network involving the protein kinase mTOR, which exists within two distinct protein complexes (mTORC1 and mTORC2) that differ in their regulation and function. The authors use a combination of immunoblotting for specific phosphorylation events and computational modeling. The primary experimental tool employed is to monitor the autophosphorylation of mTOR on Ser(2481) in cell lysates as a surrogate for mTOR activity, which the authors conclude is a specific readout for mTORC2. However, Ser(2481) phosphorylation occurs on both mTORC1 and mTORC2 and will dynamically change as the network through which these two complexes are connected is manipulated. Therefore, models of mTOR network regulation built using this tool are inherently imperfect and open to alternative explanations. Specific issues with the main conclusion made in this study, involving the TSC1-TSC2 (tuberous sclerosis complex 1 and 2) complex and its potential regulation of mTORC2, are discussed here. A broader goal of this Letter is to clarify to other investigators the caveats of using mTOR Ser(2481) phosphorylation in cell lysates as a specific readout for either of the two mTOR complexes.
Mining protein-protein interaction networks: denoising effects
NASA Astrophysics Data System (ADS)
Marras, Elisabetta; Capobianco, Enrico
2009-01-01
A typical instrument in complex network studies is the analysis of statistical distributions. These are usually computed for measures that characterize network topology, and are aimed at capturing both structural and dynamic aspects. Protein-protein interaction networks (PPIN) have also been studied through several measures. In general, a power law is expected to characterize scale-free networks. However, mixing the original noise cover with outlying information and other system-dependent fluctuations makes the empirical detection of the power law a difficult task. As a result, the uncertainty level increases when looking at the observed sample; in particular, one may wonder whether the computed features are sufficient to explain the interactome. We then address noise problems by implementing both decomposition and denoising techniques that reduce the impact of factors known to affect the accuracy of power law detection.
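As a rough illustration of what estimating a power law involves, the sketch below fits the exponent of a continuous power law by maximum likelihood. It is not the decomposition or denoising procedure of the abstract above; the function names, the assumption that the lower cutoff `x_min` is known, and the synthetic noise-free sample are all choices made for this example only.

```python
import math
import random


def powerlaw_alpha_mle(samples, x_min):
    """Maximum-likelihood exponent of a continuous power law p(x) ~ x^(-alpha).

    Only observations with x >= x_min are used; x_min is assumed to be known.
    """
    tail = [x for x in samples if x >= x_min]
    return 1.0 + len(tail) / sum(math.log(x / x_min) for x in tail)


def sample_powerlaw(alpha, x_min, n, rng):
    """Draw n values from a continuous power law by inverse-transform sampling."""
    return [x_min * (1.0 - rng.random()) ** (-1.0 / (alpha - 1.0)) for _ in range(n)]


# Hypothetical, noise-free data with a known exponent of 2.5.
rng = random.Random(0)
clean = sample_powerlaw(alpha=2.5, x_min=1.0, n=20000, rng=rng)
alpha_hat = powerlaw_alpha_mle(clean, x_min=1.0)
print(f"estimated exponent: {alpha_hat:.2f}")
```

Real degree data are discrete, finite, and noisy, and the cutoff is usually unknown, so an estimator of this kind is only a starting point; that gap is part of the difficulty the abstract describes.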
Optimal network alignment with graphlet degree vectors.
Milenković, Tijana; Ng, Weng Leong; Hayes, Wayne; Przulj, Natasa
2010-06-30
Important biological information is encoded in the topology of biological networks. Comparative analyses of biological networks are proving to be valuable, as they can lead to transfer of knowledge between species and give deeper insights into biological function, disease, and evolution. We introduce a new method that uses the Hungarian algorithm to produce optimal global alignment between two networks using any cost function. We design a cost function based solely on network topology and use it in our network alignment. Our method can be applied to any two networks, not just biological ones, since it is based only on network topology. We use our new method to align protein-protein interaction networks of two eukaryotic species and demonstrate that our alignment exposes large and topologically complex regions of network similarity. At the same time, our alignment is biologically valid, since many of the aligned protein pairs perform the same biological function. From the alignment, we predict function of yet unannotated proteins, many of which we validate in the literature. Also, we apply our method to find topological similarities between metabolic networks of different species and build phylogenetic trees based on our network alignment score. The phylogenetic trees obtained in this way bear a striking resemblance to the ones obtained by sequence alignments. Our method detects topologically similar regions in large networks that are statistically significant. It does this independent of protein sequence or any other information external to network topology.
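The assignment step described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the Hungarian routine is a standard textbook formulation written out in plain Python, and the cost function is a deliberately crude stand-in (absolute degree difference) for the graphlet-degree-based cost named in the title. The names `hungarian` and `align_by_degree` and the two toy graphs are hypothetical.

```python
def hungarian(cost):
    """Minimum-cost assignment for an n x m cost matrix with n <= m.

    Returns (assignment, total), where assignment[i] is the column for row i.
    """
    n, m = len(cost), len(cost[0])
    inf = float("inf")
    u = [0.0] * (n + 1)          # row potentials (1-based)
    v = [0.0] * (m + 1)          # column potentials (1-based)
    match = [0] * (m + 1)        # match[j] = row assigned to column j, 0 = free
    way = [0] * (m + 1)          # back-pointers along the augmenting path
    for i in range(1, n + 1):
        match[0] = i
        j0 = 0
        minv = [inf] * (m + 1)
        used = [False] * (m + 1)
        while True:
            used[j0] = True
            i0, delta, j1 = match[j0], inf, 0
            for j in range(1, m + 1):
                if not used[j]:
                    cur = cost[i0 - 1][j - 1] - u[i0] - v[j]
                    if cur < minv[j]:
                        minv[j], way[j] = cur, j0
                    if minv[j] < delta:
                        delta, j1 = minv[j], j
            for j in range(m + 1):
                if used[j]:
                    u[match[j]] += delta
                    v[j] -= delta
                else:
                    minv[j] -= delta
            j0 = j1
            if match[j0] == 0:
                break
        while j0:                # flip the augmenting path
            j1 = way[j0]
            match[j0] = match[j1]
            j0 = j1
    assignment = [-1] * n
    for j in range(1, m + 1):
        if match[j]:
            assignment[match[j] - 1] = j - 1
    return assignment, sum(cost[i][assignment[i]] for i in range(n))


def align_by_degree(g1, g2):
    """Align nodes of g1 to nodes of g2 (len(g1) <= len(g2)) using a toy
    topological cost: the absolute difference of node degrees."""
    rows, cols = sorted(g1), sorted(g2)
    cost = [[abs(len(g1[a]) - len(g2[b])) for b in cols] for a in rows]
    assignment, total = hungarian(cost)
    return {rows[i]: cols[j] for i, j in enumerate(assignment)}, total


# Two hypothetical three-node path networks given as adjacency sets.
net1 = {"a": {"b"}, "b": {"a", "c"}, "c": {"b"}}
net2 = {"x": {"y"}, "y": {"x", "z"}, "z": {"y"}}
mapping, total_cost = align_by_degree(net1, net2)
print(mapping, total_cost)
```

Because the routine accepts any cost matrix, a richer node-similarity measure can be substituted for the degree difference without changing the assignment code.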
Raman, Malavika; Sergeev, Mikhail; Garnaas, Maija; Lydeard, John R.; Huttlin, Edward L.; Goessling, Wolfram; Shah, Jagesh V.; Harper, J. Wade
2015-01-01
The AAA-ATPase VCP (also known as p97 or CDC48) uses ATP hydrolysis to “segregate” ubiquitinated proteins from their binding partners. VCP acts via UBX-domain containing adaptors that provide target specificity, but targets and functions of UBXD proteins remain poorly understood. Through systematic proteomic analysis of UBXD proteins in human cells, we reveal a network of over 195 interacting proteins, implicating VCP in diverse cellular pathways. We have explored one such complex between an unstudied adaptor UBXN10 and the intraflagellar transport B (IFT-B) complex, which regulates anterograde transport into cilia. UBXN10 localizes to cilia in a VCP-dependent manner and both VCP and UBXN10 are required for ciliogenesis. Pharmacological inhibition of VCP destabilized the IFT-B complex and increased trafficking rates. Depletion of UBXN10 in zebrafish embryos causes defects in left-right asymmetry, which depends on functional cilia. This study provides a resource for exploring the landscape of UBXD proteins in biology and identifies an unexpected requirement for VCP-UBXN10 in ciliogenesis. PMID:26389662
Raman, Malavika; Sergeev, Mikhail; Garnaas, Maija; Lydeard, John R; Huttlin, Edward L; Goessling, Wolfram; Shah, Jagesh V; Harper, J Wade
2015-10-01
The AAA-ATPase VCP (also known as p97 or CDC48) uses ATP hydrolysis to 'segregate' ubiquitylated proteins from their binding partners. VCP acts through UBX-domain-containing adaptors that provide target specificity, but the targets and functions of UBXD proteins remain poorly understood. Through systematic proteomic analysis of UBXD proteins in human cells, we reveal a network of over 195 interacting proteins, implicating VCP in diverse cellular pathways. We have explored one such complex between an unstudied adaptor UBXN10 and the intraflagellar transport B (IFT-B) complex, which regulates anterograde transport into cilia. UBXN10 localizes to cilia in a VCP-dependent manner and both VCP and UBXN10 are required for ciliogenesis. Pharmacological inhibition of VCP destabilized the IFT-B complex and increased trafficking rates. Depletion of UBXN10 in zebrafish embryos causes defects in left-right asymmetry, which depends on functional cilia. This study provides a resource for exploring the landscape of UBXD proteins in biology and identifies an unexpected requirement for VCP-UBXN10 in ciliogenesis.
Modes of Interaction between Individuals Dominate the Topologies of Real World Networks
Lee, Insuk; Kim, Eiru; Marcotte, Edward M.
2015-01-01
We find that the topologies of real world networks, such as those formed within human societies, by the Internet, or among cellular proteins, are dominated by the mode of the interactions considered among the individuals. Specifically, a major dichotomy in previously studied networks arises from modeling networks in terms of pairwise versus group tasks. The former often intrinsically give rise to scale-free, disassortative, hierarchical networks, whereas the latter often give rise to single- or broad-scale, assortative, nonhierarchical networks. These dependencies explain contrasting observations among previous topological analyses of real world complex systems. We also observe this trend in systems with natural hierarchies, in which alternate representations of the same networks, but which capture different levels of the hierarchy, manifest these signature topological differences. For example, in both the Internet and cellular proteomes, networks of lower-level system components (routers within domains or proteins within biological processes) are assortative and nonhierarchical, whereas networks of upper-level system components (internet domains or biological processes) are disassortative and hierarchical. Our results demonstrate that network topologies of complex systems must be interpreted in light of their hierarchical natures and interaction types. PMID:25793969
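The contrast between pairwise and group representations can be made concrete with a degree-assortativity calculation. The sketch below is an idealised toy, not the analysis from the abstract: it computes the Pearson correlation of degrees at the two ends of each edge, then compares a hub-and-spoke encoding of one group with a clique encoding of two groups. The function name and both example networks are assumptions made for illustration.

```python
from itertools import combinations


def degree_assortativity(edges):
    """Pearson correlation between the degrees at the two ends of each edge.

    Every undirected edge is counted in both directions. Returns NaN when all
    nodes have the same degree, because the correlation is then undefined.
    """
    edges = list(edges)
    degree = {}
    for a, b in edges:
        degree[a] = degree.get(a, 0) + 1
        degree[b] = degree.get(b, 0) + 1
    xs, ys = [], []
    for a, b in edges:
        xs += [degree[a], degree[b]]
        ys += [degree[b], degree[a]]
    mean = sum(xs) / len(xs)
    var = sum((x - mean) ** 2 for x in xs) / len(xs)
    if var == 0:
        return float("nan")
    cov = sum((x - mean) * (y - mean) for x, y in zip(xs, ys)) / len(xs)
    return cov / var


# Pairwise view: one hub linked to four partners (hub-and-spoke).
spoke = [("hub", leaf) for leaf in ("p1", "p2", "p3", "p4")]

# Group view: every member of a group is linked to every other member.
groups = [("a1", "a2", "a3"), ("b1", "b2", "b3", "b4")]
cliques = [pair for g in groups for pair in combinations(g, 2)]

r_spoke = degree_assortativity(spoke)
r_cliques = degree_assortativity(cliques)
print(r_spoke, r_cliques)
```

In this extreme case the spoke encoding is perfectly disassortative and the clique encoding perfectly assortative; real networks fall between these limits.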
Cellular and synaptic network defects in autism
Peça, João; Feng, Guoping
2012-01-01
Many candidate genes are now thought to confer susceptibility to autism spectrum disorder (ASD). Here we review four interrelated complexes, each composed of multiple families of genes that functionally coalesce on common cellular pathways. We illustrate a common thread in the organization of glutamatergic synapses and suggest a link between genes involved in Tuberous Sclerosis Complex, Fragile X syndrome, Angelman syndrome and several synaptic ASD candidate genes. When viewed in this context, progress in deciphering the molecular architecture of cellular protein-protein interactions together with the unraveling of synaptic dysfunction in neural networks may prove pivotal to advancing our understanding of ASDs. PMID:22440525
Motif structure and cooperation in real-world complex networks
NASA Astrophysics Data System (ADS)
Salehi, Mostafa; Rabiee, Hamid R.; Jalili, Mahdi
2010-12-01
Networks of dynamical nodes serve as generic models for real-world systems in many branches of science ranging from mathematics to physics, technology, sociology and biology. Collective behavior of agents interacting over complex networks is important in many applications. The cooperation between selfish individuals is one of the most interesting collective phenomena. In this paper we address the interplay between the motifs’ cooperation properties and their abundance in a number of real-world networks including yeast protein-protein interaction, human brain, protein structure, email communication, dolphins’ social interaction, Zachary karate club and Net-science coauthorship networks. First, the amount of cooperativity for all possible undirected subgraphs with three to six nodes is calculated. To this end, the evolutionary dynamics of the Prisoner’s Dilemma game is considered and the cooperativity of each subgraph is calculated as the percentage of cooperating agents at the end of the simulation time. Then, the three- to six-node motifs are extracted for each network. The significance of the abundance of a motif, represented by a Z-value, is obtained by comparing them with some properly randomized versions of the original network. We found that there is always a group of motifs showing a significant inverse correlation between their cooperativity amount and Z-value, i.e. the more the Z-value the less the amount of cooperativity. This suggests that networks composed of well-structured units do not have good cooperativity properties.
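The Z-value computation mentioned above can be sketched for the simplest motif, the triangle. This is a minimal illustration under stated assumptions, not the procedure used in the paper: the randomized ensemble is produced by degree-preserving double-edge swaps, only three-node motifs are counted, and the function names, swap counts, and example network are hypothetical.

```python
import random
from statistics import mean, pstdev


def count_triangles(edges):
    """Number of triangles in a simple undirected graph given as an edge list."""
    adj = {}
    for a, b in edges:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    # Each triangle is seen once from each of its three edges.
    return sum(len(adj[a] & adj[b]) for a, b in edges) // 3


def rewire(edges, swaps, rng):
    """Randomize a graph with double-edge swaps, keeping every node's degree."""
    edges = [tuple(e) for e in edges]
    present = {frozenset(e) for e in edges}
    for _ in range(swaps):
        i, j = rng.sample(range(len(edges)), 2)
        (a, b), (c, d) = edges[i], edges[j]
        if len({a, b, c, d}) < 4:
            continue  # would create a self-loop
        if frozenset((a, d)) in present or frozenset((c, b)) in present:
            continue  # would create a duplicate edge
        present -= {frozenset((a, b)), frozenset((c, d))}
        present |= {frozenset((a, d)), frozenset((c, b))}
        edges[i], edges[j] = (a, d), (c, b)
    return edges


def motif_z_score(edges, n_random=200, swaps=100, seed=0):
    """Z-value of the triangle count relative to degree-preserving randomizations."""
    rng = random.Random(seed)
    observed = count_triangles(edges)
    null = [count_triangles(rewire(edges, swaps, rng)) for _ in range(n_random)]
    spread = pstdev(null)
    return (observed - mean(null)) / spread if spread > 0 else float("nan")


# Hypothetical network: two triangles sharing node "c", plus a short tail.
edges = [("a", "b"), ("b", "c"), ("a", "c"), ("c", "d"), ("d", "e"),
         ("c", "e"), ("e", "f"), ("f", "g"), ("g", "h")]
print(count_triangles(edges), motif_z_score(edges))
```

Extending the idea to four- to six-node motifs requires a subgraph enumeration step in place of `count_triangles`; the Z-value calculation itself is unchanged.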
PPI layouts: BioJS components for the display of Protein-Protein Interactions.
Salazar, Gustavo A; Meintjes, Ayton; Mulder, Nicola
2014-01-01
We present two web-based components for the display of Protein-Protein Interaction networks using different self-organizing layout methods: force-directed and circular. These components conform to the BioJS standard and can be rendered in an HTML5-compliant browser without the need for third-party plugins. We provide examples of interaction networks and how the components can be used to visualize them, and refer to a more complex tool that uses these components. http://github.com/biojs/biojs; http://dx.doi.org/10.5281/zenodo.7753.
NASA Astrophysics Data System (ADS)
Morgan, Sarah E.; Cole, Daniel J.; Chin, Alex W.
2016-11-01
Collective protein modes are expected to be important for facilitating energy transfer in the Fenna-Matthews-Olson (FMO) complex of photosynthetic green sulphur bacteria; however, to date little work has focussed on the microscopic details of these vibrations. The nonlinear network model (NNM) provides a computationally inexpensive approach to studying vibrational modes at the microscopic level in large protein structures, whilst incorporating anharmonicity in the inter-residue interactions which can influence protein dynamics. We apply the NNM to the entire trimeric FMO complex and find evidence for the existence of nonlinear discrete breather modes. These modes tend to transfer energy to the highly connected core pigments, potentially opening up alternative excitation energy transfer routes through their influence on pigment properties. Incorporating localised modes based on these discrete breathers in the optical spectra calculations for FMO using ab initio site energies and excitonic couplings can substantially improve their agreement with experimental results.
Yugandhar, K; Gromiha, M Michael
2014-09-01
Protein-protein interactions are intrinsic to virtually every cellular process. Predicting the binding affinity of protein-protein complexes is one of the challenging problems in computational and molecular biology. In this work, we related sequence features of protein-protein complexes with their binding affinities using machine learning approaches. We set up a database of 185 protein-protein complexes for which the interacting pairs are heterodimers and their experimental binding affinities are available. On the other hand, we have developed a set of 610 features from the sequences of protein complexes and utilized Ranker search method, which is the combination of Attribute evaluator and Ranker method for selecting specific features. We have analyzed several machine learning algorithms to discriminate protein-protein complexes into high and low affinity groups based on their Kd values. Our results showed a 10-fold cross-validation accuracy of 76.1% with the combination of nine features using support vector machines. Further, we observed accuracy of 83.3% on an independent test set of 30 complexes. We suggest that our method would serve as an effective tool for identifying the interacting partners in protein-protein interaction networks and human-pathogen interactions based on the strength of interactions. © 2014 Wiley Periodicals, Inc.
Dong, Yadong; Sun, Yongqi; Qin, Chao
2018-01-01
The existing protein complex detection methods can be broadly divided into two categories: unsupervised and supervised learning methods. Most of the unsupervised learning methods assume that protein complexes are in dense regions of protein-protein interaction (PPI) networks even though many true complexes are not dense subgraphs. Supervised learning methods utilize the informative properties of known complexes; they often extract features from existing complexes and then use the features to train a classification model. The trained model is used to guide the search process for new complexes. However, insufficient extracted features, noise in the PPI data and the incompleteness of complex data make the classification model imprecise. Consequently, the classification model is not sufficient for guiding the detection of complexes. Therefore, we propose a new robust score function that combines the classification model with local structural information. Based on the score function, we provide a search method that works both forwards and backwards. The results from experiments on six benchmark PPI datasets and three protein complex datasets show that our approach can achieve better performance compared with the state-of-the-art supervised, semi-supervised and unsupervised methods for protein complex detection, occasionally significantly outperforming such methods.
Fragkostefanakis, Sotirios; Röth, Sascha; Schleiff, Enrico; Scharf, Klaus-Dieter
2015-09-01
Cell survival under high temperature conditions involves the activation of heat stress response (HSR), which in principle is highly conserved among different organisms, but shows remarkable complexity and unique features in plant systems. The transcriptional reprogramming at higher temperatures is controlled by the activity of the heat stress transcription factors (Hsfs). Hsfs allow the transcriptional activation of HSR genes, among which heat shock proteins (Hsps) are best characterized. Hsps belong to multigene families encoding for molecular chaperones involved in various processes including maintenance of protein homeostasis as a requisite for optimal development and survival under stress conditions. Hsfs form complex networks to activate downstream responses, but are concomitantly subjected to cell-type-dependent feedback regulation through factor-specific physical and functional interactions with chaperones belonging to Hsp90, Hsp70 and small Hsp families. There is increasing evidence that the originally assumed specialized function of Hsf/chaperone networks in the HSR turns out to be a complex central stress response system that is involved in the regulation of a broad variety of other stress responses and may also have substantial impact on various developmental processes. Understanding in detail the function of such regulatory networks is prerequisite for sustained improvement of thermotolerance in important agricultural crops. © 2014 John Wiley & Sons Ltd.
Visualisation and graph-theoretic analysis of a large-scale protein structural interactome
Bolser, Dan; Dafas, Panos; Harrington, Richard; Park, Jong; Schroeder, Michael
2003-01-01
Background: Large-scale protein interaction maps provide a new, global perspective with which to analyse protein function. PSIMAP, the Protein Structural Interactome Map, is a database of all the structurally observed interactions between superfamilies of protein domains with known three-dimensional structure in the PDB. PSIMAP incorporates both functional and evolutionary information into a single network. Results: We present a global analysis of PSIMAP using several distinct network measures relating to centrality, interactivity, fault-tolerance, and taxonomic diversity. We found the following results: Centrality: we show that the center and barycenter of PSIMAP do not coincide, and that the superfamilies forming the barycenter relate to very general functions, while those constituting the center relate to enzymatic activity. Interactivity: we identify the P-loop and immunoglobulin superfamilies as the most highly interactive. We successfully use connectivity and cluster index, which characterise the connectivity of a superfamily's neighbourhood, to discover superfamilies of complex I and II. This is particularly significant as the structure of complex I is not yet solved. Taxonomic diversity: we found that highly interactive superfamilies are in general taxonomically very diverse and are thus amongst the oldest. Fault-tolerance: we found that the network is very robust: for the majority of superfamilies, removal from the network will not break up the network. Conclusions: Overall, we can single out the P-loop containing nucleotide triphosphate hydrolases superfamily as it is the most highly connected and has the highest taxonomic diversity. In addition, this superfamily has the highest interaction rank, is the barycenter of the network (it has the shortest average path to every other superfamily in the network), and is an articulation vertex, whose removal will disconnect the network.
More generally, we conclude that the graph-theoretic and taxonomic analysis of PSIMAP is an important step towards the understanding of protein function and could be an important tool for tracing the evolution of life at the molecular level. PMID:14531933
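The graph-theoretic notions used in this abstract (center, barycenter, articulation vertex) can be illustrated on a toy network. The sketch below is not derived from PSIMAP: it assumes a connected, unweighted graph, takes the center as the nodes of minimum eccentricity and the barycenter as the nodes of minimum total (equivalently, average) shortest-path distance, and finds articulation vertices by brute-force removal. The function names and the example graph are hypothetical.

```python
from collections import deque


def bfs_distances(adj, source):
    """Shortest-path lengths from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbour in adj[node]:
            if neighbour not in dist:
                dist[neighbour] = dist[node] + 1
                queue.append(neighbour)
    return dist


def center_and_barycenter(adj):
    """Center (minimum eccentricity) and barycenter (minimum total distance)
    of a connected graph."""
    ecc, total = {}, {}
    for node in adj:
        dist = bfs_distances(adj, node)
        ecc[node] = max(dist.values())
        total[node] = sum(dist.values())
    center = {n for n in adj if ecc[n] == min(ecc.values())}
    barycenter = {n for n in adj if total[n] == min(total.values())}
    return center, barycenter


def articulation_vertices(adj):
    """Nodes whose removal disconnects the graph (brute force, small graphs only)."""
    found = set()
    for node in adj:
        rest = {n: adj[n] - {node} for n in adj if n != node}
        if rest and len(bfs_distances(rest, next(iter(rest)))) < len(rest):
            found.add(node)
    return found


def build(edges):
    adj = {}
    for a, b in edges:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    return adj


# Hypothetical network: a hub "h" with five leaves, attached to a path h-a-b-c-d.
toy = build([("h", f"l{i}") for i in range(1, 6)]
            + [("h", "a"), ("a", "b"), ("b", "c"), ("c", "d")])
center, barycenter = center_and_barycenter(toy)
print(center, barycenter, articulation_vertices(toy))
```

In this toy graph the highly connected hub is the barycenter but not part of the center, which shows how the two notions can point to different nodes.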
s-core network decomposition: A generalization of k-core analysis to weighted networks
NASA Astrophysics Data System (ADS)
Eidsaa, Marius; Almaas, Eivind
2013-12-01
A broad range of systems spanning biology, technology, and social phenomena may be represented and analyzed as complex networks. Recent studies of such networks using k-core decomposition have uncovered groups of nodes that play important roles. Here, we present s-core analysis, a generalization of k-core (or k-shell) analysis to complex networks where the links have different strengths or weights. We demonstrate the s-core decomposition approach on two random networks (ER and configuration model with scale-free degree distribution) where the link weights are (i) random, (ii) correlated, and (iii) anticorrelated with the node degrees. Finally, we apply the s-core decomposition approach to the protein-interaction network of the yeast Saccharomyces cerevisiae in the context of two gene-expression experiments: oxidative stress in response to cumene hydroperoxide (CHP), and fermentation stress response (FSR). We find that the innermost s-cores are (i) different from innermost k-cores, (ii) different for the two stress conditions CHP and FSR, and (iii) enriched with proteins whose biological functions give insight into how yeast manages these specific stresses.
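The pruning idea behind a strength-based core can be sketched as follows. This is a simplified reading of the abstract, not the authors' algorithm: a node's strength is taken to be the sum of the weights of its remaining links, and nodes whose strength falls below a threshold `s` are removed repeatedly until none remain below it. The function name, the use of a non-strict threshold, and the example weights are assumptions; the exact threshold convention in the paper may differ.

```python
def s_core(weights, s):
    """Nodes of the largest subgraph in which every node has strength >= s.

    `weights` maps an undirected edge (a, b) to its positive weight. With all
    weights equal to 1 this reduces to the ordinary k-core with k = s.
    """
    adj = {}
    for (a, b), w in weights.items():
        adj.setdefault(a, {})[b] = w
        adj.setdefault(b, {})[a] = w
    alive = set(adj)
    changed = True
    while changed:
        changed = False
        for node in list(alive):
            # Strength counts only links to nodes that are still in the subgraph.
            strength = sum(w for nb, w in adj[node].items() if nb in alive)
            if strength < s:
                alive.discard(node)
                changed = True
    return alive


# Hypothetical weighted network: a strongly linked triangle with a weak tail.
links = {
    ("a", "b"): 2.0, ("b", "c"): 2.0, ("a", "c"): 2.0,
    ("a", "d"): 0.5, ("d", "e"): 3.0,
}
for threshold in (3.0, 3.5, 4.5):
    print(threshold, sorted(s_core(links, threshold)))
```

Node "d" illustrates the difference from a degree-based view: it has two links, yet it drops out once "e" is pruned because its remaining link to the triangle is weak.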
Cytoscape: a software environment for integrated models of biomolecular interaction networks.
Shannon, Paul; Markiel, Andrew; Ozier, Owen; Baliga, Nitin S; Wang, Jonathan T; Ramage, Daniel; Amin, Nada; Schwikowski, Benno; Ideker, Trey
2003-11-01
Cytoscape is an open source software project for integrating biomolecular interaction networks with high-throughput expression data and other molecular states into a unified conceptual framework. Although applicable to any system of molecular components and interactions, Cytoscape is most powerful when used in conjunction with large databases of protein-protein, protein-DNA, and genetic interactions that are increasingly available for humans and model organisms. Cytoscape's software Core provides basic functionality to layout and query the network; to visually integrate the network with expression profiles, phenotypes, and other molecular states; and to link the network to databases of functional annotations. The Core is extensible through a straightforward plug-in architecture, allowing rapid development of additional computational analyses and features. Several case studies of Cytoscape plug-ins are surveyed, including a search for interaction pathways correlating with changes in gene expression, a study of protein complexes involved in cellular recovery to DNA damage, inference of a combined physical/functional interaction network for Halobacterium, and an interface to detailed stochastic/kinetic gene regulatory models.
Iida, M; Takemoto, K
2018-09-30
Environmental contaminant exposure can pose significant risks to human health. Therefore, evaluating the impact of this exposure is of great importance; however, it is often difficult because both the molecular mechanism of disease and the mode of action of the contaminants are complex. We used network biology techniques to quantitatively assess the impact of environmental contaminants on the human interactome and diseases with a particular focus on seven major contaminant categories: persistent organic pollutants (POPs), dioxins, polycyclic aromatic hydrocarbons (PAHs), pesticides, perfluorochemicals (PFCs), metals, and pharmaceutical and personal care products (PPCPs). We integrated publicly available data on toxicogenomics, the diseasome, protein-protein interactions (PPIs), and gene essentiality and found that a few contaminants were targeted to many genes, and a few genes were targeted by many contaminants. The contaminant targets were hub proteins in the human PPI network, whereas the target proteins in most categories did not contain abundant essential proteins. Generally, contaminant targets and disease-associated proteins were closely associated with the PPI network, and the closeness of the associations depended on the disease type and chemical category. Network biology techniques were used to identify environmental contaminants with broad effects on the human interactome and contaminant-sensitive biomarkers. Moreover, this method enabled us to quantify the relationship between environmental contaminants and human diseases, which was supported by epidemiological and experimental evidence. These methods and findings have facilitated the elucidation of the complex relationship between environmental exposure and adverse health outcomes. Copyright © 2018 Elsevier Inc. All rights reserved.
Semantic integration to identify overlapping functional modules in protein interaction networks
Cho, Young-Rae; Hwang, Woochang; Ramanathan, Murali; Zhang, Aidong
2007-01-01
Background The systematic analysis of protein-protein interactions can enable a better understanding of cellular organization, processes and functions. Functional modules can be identified from the protein interaction networks derived from experimental data sets. However, these analyses are challenging because of the presence of unreliable interactions and the complex connectivity of the network. The integration of protein-protein interactions with the data from other sources can be leveraged for improving the effectiveness of functional module detection algorithms. Results We have developed novel metrics, called semantic similarity and semantic interactivity, which use Gene Ontology (GO) annotations to measure the reliability of protein-protein interactions. The protein interaction networks can be converted into a weighted graph representation by assigning the reliability values to each interaction as a weight. We presented a flow-based modularization algorithm to efficiently identify overlapping modules in the weighted interaction networks. The experimental results show that the semantic similarity and semantic interactivity of interacting pairs were positively correlated with functional co-occurrence. The effectiveness of the algorithm for identifying modules was evaluated using functional categories from the MIPS database. We demonstrated that our algorithm had higher accuracy compared to other competing approaches. Conclusion The integration of protein interaction networks with GO annotation data and the capability of detecting overlapping modules substantially improve the accuracy of module identification. PMID:17650343
Gurung, A B; Bhattacharjee, A; Ajmal Ali, M; Al-Hemaid, F; Lee, Joongku
2017-02-01
Protein-protein interaction is a vital process that drives many important physiological processes in the cell and has also been implicated in several diseases. Although the protein-protein interaction network is quite complex, understanding its interacting partners using both in silico and molecular biology techniques can provide better insights for targeting such interactions. Targeting protein-protein interactions with small molecules is a challenging task because of druggability issues. Nevertheless, several studies on the kinetic and thermodynamic properties of protein-protein interactions have contributed immensely toward a better understanding of the affinity of these complexes. More recent studies on hot spots and interface residues have opened up new avenues in the drug discovery process. This approach has been used in the design of hot-spot-based modulators targeting protein-protein interactions with the objective of normalizing such interactions.
HExpoChem: a systems biology resource to explore human exposure to chemicals.
Taboureau, Olivier; Jacobsen, Ulrik Plesner; Kalhauge, Christian; Edsgärd, Daniel; Rigina, Olga; Gupta, Ramneek; Audouze, Karine
2013-05-01
Humans are exposed to diverse hazardous chemicals daily. Although an exposure to these chemicals is suspected to have adverse effects on human health, mechanistic insights into how they interact with the human body are still limited. Therefore, acquisition of curated data and development of computational biology approaches are needed to assess the health risks of chemical exposure. Here we present HExpoChem, a tool based on environmental chemicals and their bioactivities on human proteins with the objective of aiding the qualitative exploration of human exposure to chemicals. The chemical-protein interactions have been enriched with a quality-scored human protein-protein interaction network, a protein-protein association network and a chemical-chemical interaction network, thus allowing the study of environmental chemicals through formation of protein complexes and phenotypic outcomes enrichment. HExpoChem is available at http://www.cbs.dtu.dk/services/HExpoChem-1.0/.
Taipale, Mikko; Tucker, George; Peng, Jian; Krykbaeva, Irina; Lin, Zhen-Yuan; Larsen, Brett; Choi, Hyungwon; Berger, Bonnie; Gingras, Anne-Claude; Lindquist, Susan
2014-01-01
Chaperones are abundant cellular proteins that promote the folding and function of their substrate proteins (clients). In vivo, chaperones also associate with a large and diverse set of co-factors (co-chaperones) that regulate their specificity and function. However, how these co-chaperones regulate protein folding and whether they have chaperone-independent biological functions is largely unknown. We have combined mass spectrometry and quantitative high-throughput LUMIER assays to systematically characterize the chaperone/co-chaperone/client interaction network in human cells. We uncover hundreds of novel chaperone clients, delineate their participation in specific co-chaperone complexes, and establish a surprisingly distinct network of protein/protein interactions for co-chaperones. As a salient example of the power of such analysis, we establish that NUDC family co-chaperones specifically associate with structurally related but evolutionarily distinct β-propeller folds. We provide a framework for deciphering the proteostasis network, its regulation in development and disease, and expand the use of chaperones as sensors for drug/target engagement. PMID:25036637
System Analysis of LWDH Related Genes Based on Text Mining in Biological Networks
Miao, Yingbo; Zhang, Liangcai; Wang, Yang; Feng, Rennan; Yang, Lei; Zhang, Shihua; Jiang, Yongshuai; Liu, Guiyou
2014-01-01
Liuwei-dihuang (LWDH) is widely used in traditional Chinese medicine (TCM), but the molecular mechanism underlying its gene interactions is unclear. LWDH genes were extracted from the existing literature using text mining technology. To simulate the complex molecular interactions that occur in the whole body, protein-protein interaction networks (PPINs) were constructed and the topological properties of LWDH genes were analyzed. LWDH genes have higher centrality properties and may play important roles in the complex biological network environment. It was also found that the distances among LWDH genes are smaller than expected, which means that communication among LWDH genes during biological processes is rapid and effective. Finally, a comprehensive network of LWDH genes, including the related drugs and regulatory pathways at both the transcriptional and posttranscriptional levels, was constructed and analyzed. The biological network analysis strategy used in this study may be helpful for understanding the molecular mechanisms of TCM. PMID:25243143
deepNF: Deep network fusion for protein function prediction.
Gligorijevic, Vladimir; Barot, Meet; Bonneau, Richard
2018-06-01
The prevalence of high-throughput experimental methods has resulted in an abundance of large-scale molecular and functional interaction networks. The connectivity of these networks provides a rich source of information for inferring functional annotations for genes and proteins. An important challenge has been to develop methods for combining these heterogeneous networks to extract useful protein feature representations for function prediction. Most of the existing approaches for network integration use shallow models that encounter difficulty in capturing complex and highly-nonlinear network structures. Thus, we propose deepNF, a network fusion method based on Multimodal Deep Autoencoders to extract high-level features of proteins from multiple heterogeneous interaction networks. We apply this method to combine STRING networks to construct a common low-dimensional representation containing high-level protein features. We use separate layers for different network types in the early stages of the multimodal autoencoder, later connecting all the layers into a single bottleneck layer from which we extract features to predict protein function. We compare the cross-validation and temporal holdout predictive performance of our method with state-of-the-art methods, including the recently proposed method Mashup. Our results show that our method outperforms previous methods for both human and yeast STRING networks. We also show substantial improvement in the performance of our method in predicting GO terms of varying type and specificity. Availability: deepNF is freely available at https://github.com/VGligorijevic/deepNF. Contact: vgligorijevic@flatironinstitute.org, rb133@nyu.edu. Supplementary data are available at Bioinformatics online.
Structural basis for spectrin recognition by ankyrin.
Ipsaro, Jonathan J; Mondragón, Alfonso
2010-05-20
Maintenance of membrane integrity and organization in the metazoan cell is accomplished through intracellular tethering of membrane proteins to an extensive, flexible protein network. Spectrin, the principal component of this network, is anchored to membrane proteins through the adaptor protein ankyrin. To elucidate the atomic basis for this interaction, we determined a crystal structure of human betaI-spectrin repeats 13 to 15 in complex with the ZU5-ANK domain of human ankyrin R. The structure reveals the role of repeats 14 to 15 in binding, the electrostatic and hydrophobic contributions along the interface, and the necessity for a particular orientation of the spectrin repeats. Using structural and biochemical data as a guide, we characterized the individual proteins and their interactions by binding and thermal stability analyses. In addition to validating the structural model, these data provide insight into the nature of some mutations associated with cell morphology defects, including those found in human diseases such as hereditary spherocytosis and elliptocytosis. Finally, analysis of the ZU5 domain suggests it is a versatile protein-protein interaction module with distinct interaction surfaces. The structure represents not only the first of a spectrin fragment in complex with its binding partner, but also that of an intermolecular complex involving a ZU5 domain.
Liu, Bernard A.; Shah, Eshana; Jablonowski, Karl; Stergachis, Andrew; Engelmann, Brett; Nash, Piers D.
2014-01-01
The Src homology 2 (SH2) domains are participants in metazoan signal transduction, acting as primary mediators for regulated protein-protein interactions with tyrosine-phosphorylated substrates. Here, we describe the origin and evolution of SH2 domain proteins by means of sequence analysis from 21 eukaryotic organisms from the basal unicellular eukaryotes, where SH2 domains first appeared, through the multicellular animals and increasingly complex metazoans. On the basis of our results, SH2 domains and phosphotyrosine signaling emerged in the early Unikonta, and the numbers of SH2 domains expanded in the choanoflagellate and metazoan lineages with the development of tyrosine kinases, leading to rapid elaboration of phosphotyrosine signaling in early multicellular animals. Our results also indicated that SH2 domains coevolved and the number of the domains expanded alongside protein tyrosine kinases and tyrosine phosphatases, thereby coupling phosphotyrosine signaling to downstream signaling networks. Gene duplication combined with domain gain or loss produced novel SH2-containing proteins that function within phosphotyrosine signaling, which likely have contributed to diversity and complexity in metazoans. We found that intra- and intermolecular interactions within and between SH2 domain proteins increased in prevalence along with organismal complexity and may function to generate more highly connected and robust phosphotyrosine signaling networks. PMID:22155787
Saha, Sudipto; Dazard, Jean-Eudes; Xu, Hua; Ewing, Rob M.
2013-01-01
Large-scale protein–protein interaction data sets have been generated for several species including yeast and human and have enabled the identification, quantification, and prediction of cellular molecular networks. Affinity purification-mass spectrometry (AP-MS) is the preeminent methodology for large-scale analysis of protein complexes, performed by immunopurifying a specific “bait” protein and its associated “prey” proteins. The analysis and interpretation of AP-MS data sets is, however, not straightforward. In addition, although yeast AP-MS data sets are relatively comprehensive, current human AP-MS data sets only sparsely cover the human interactome. Here we develop a framework for analysis of AP-MS data sets that addresses the issues of noise, missing data, and sparsity of coverage in the context of a current, real world human AP-MS data set. Our goal is to extend and increase the density of the known human interactome by integrating bait–prey and cocomplexed preys (prey–prey associations) into networks. Our framework incorporates a score for each identified protein, as well as elements of signal processing to improve the confidence of identified protein–protein interactions. We identify many protein networks enriched in known biological processes and functions. In addition, we show that integrated bait–prey and prey–prey interactions can be used to refine network topology and extend known protein networks. PMID:22845868
Bayesian Mixed-Membership Models of Complex and Evolving Networks
2006-12-01
PPI layouts: BioJS components for the display of Protein-Protein Interactions
Salazar, Gustavo A.; Meintjes, Ayton; Mulder, Nicola
2014-01-01
Summary: We present two web-based components for the display of Protein-Protein Interaction networks using different self-organizing layout methods: force-directed and circular. These components conform to the BioJS standard and can be rendered in an HTML5-compliant browser without the need for third-party plugins. We provide examples of interaction networks and how the components can be used to visualize them, and refer to a more complex tool that uses these components. Availability: http://github.com/biojs/biojs; http://dx.doi.org/10.5281/zenodo.7753 PMID:25075288
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Yingchun; Yang, Feng; Fu, Yi
Brain development and spinal cord regeneration require neurite sprouting and growth cone navigation in response to extension and collapsing factors present in the extracellular environment. These external guidance cues control neurite growth cone extension and retraction processes through intracellular protein phosphorylation of numerous cytoskeletal, adhesion, and polarity complex signaling proteins. However, the complex kinase/substrate signaling networks that mediate neuritogenesis have not been investigated. Here, we compare the neurite phosphoproteome under growth and retraction conditions using neurite purification methodology combined with mass spectrometry. More than 4000 non-redundant phosphorylation sites from 1883 proteins have been annotated and mapped to signaling pathways that control kinase/phosphatase networks, cytoskeleton remodeling, and axon/dendrite specification. Comprehensive informatics and functional studies revealed a compartmentalized ERK activation/deactivation cytoskeletal switch that governs neurite growth and retraction, respectively. Our findings provide the first system-wide analysis of the phosphoprotein signaling networks that enable neurite growth and retraction and reveal an important molecular switch that governs neuritogenesis.
Scale-space measures for graph topology link protein network architecture to function.
Hulsman, Marc; Dimitrakopoulos, Christos; de Ridder, Jeroen
2014-06-15
The network architecture of physical protein interactions is an important determinant for the molecular functions that are carried out within each cell. To study this relation, the network architecture can be characterized by graph topological characteristics such as shortest paths and network hubs. These characteristics have an important shortcoming: they do not take into account that interactions occur across different scales. This is important because some cellular functions may involve a single direct protein interaction (small scale), whereas others require more and/or indirect interactions, such as protein complexes (medium scale) and interactions between large modules of proteins (large scale). In this work, we derive generalized scale-aware versions of known graph topological measures based on diffusion kernels. We apply these to characterize the topology of networks across all scales simultaneously, generating a so-called graph topological scale-space. The comprehensive physical interaction network in yeast is used to show that scale-space based measures consistently give superior performance when distinguishing protein functional categories and three major types of functional interactions-genetic interaction, co-expression and perturbation interactions. Moreover, we demonstrate that graph topological scale spaces capture biologically meaningful features that provide new insights into the link between function and protein network architecture. Matlab(TM) code to calculate the scale-aware topological measures (STMs) is available at http://bioinformatics.tudelft.nl/TSSA © The Author 2014. Published by Oxford University Press.
The binary protein-protein interaction landscape of Escherichia coli
Rajagopala, Seesandra V.; Vlasblom, James; Arnold, Roland; Franca-Koh, Jonathan; Pakala, Suman B.; Phanse, Sadhna; Ceol, Arnaud; Häuser, Roman; Siszler, Gabriella; Wuchty, Stefan; Emili, Andrew; Babu, Mohan; Aloy, Patrick; Pieper, Rembert; Uetz, Peter
2014-01-01
Efforts to map the Escherichia coli interactome have identified several hundred macromolecular complexes, but direct binary protein-protein interactions (PPIs) have not been surveyed on a large scale. Here we performed yeast two-hybrid screens of 3,305 baits against 3,606 preys (~70% of the E. coli proteome) in duplicate to generate a map of 2,234 interactions, approximately doubling the number of known binary PPIs in E. coli. Integration of binary PPIs and genetic interactions revealed functional dependencies among components involved in cellular processes, including envelope integrity, flagellum assembly and protein quality control. Many of the binary interactions that could be mapped within multi-protein complexes were informative regarding internal topology and indicated that interactions within complexes are significantly more conserved than those interactions connecting different complexes. This resource will be useful for inferring bacterial gene function and provides a draft reference of the basic physical wiring network of this evolutionarily significant model microbe. PMID:24561554
Quality control methodology for high-throughput protein-protein interaction screening.
Vazquez, Alexei; Rual, Jean-François; Venkatesan, Kavitha
2011-01-01
Protein-protein interactions are key to many aspects of the cell, including its cytoskeletal structure, the signaling processes in which it is involved, or its metabolism. Failure to form protein complexes or signaling cascades may sometimes translate into pathologic conditions such as cancer or neurodegenerative diseases. The set of all protein interactions between the proteins encoded by an organism constitutes its protein interaction network, representing a scaffold for biological function. Knowing the protein interaction network of an organism, combined with other sources of biological information, can unravel fundamental biological circuits and may help better understand the molecular basics of human diseases. The protein interaction network of an organism can be mapped by combining data obtained from both low-throughput screens, i.e., "one gene at a time" experiments and high-throughput screens, i.e., screens designed to interrogate large sets of proteins at once. In either case, quality controls are required to deal with the inherent imperfect nature of experimental assays. In this chapter, we discuss experimental and statistical methodologies to quantify error rates in high-throughput protein-protein interactions screens.
Wang, Jiguang; Sun, Yidan; Zheng, Si; Zhang, Xiang-Sun; Zhou, Huarong; Chen, Luonan
2013-01-01
Synergistic interactions among transcription factors (TFs) and their cofactors collectively determine gene expression in complex biological systems. In this work, we develop a novel graphical model, called Active Protein-Gene (APG) network model, to quantify regulatory signals of transcription in complex biomolecular networks through integrating both TF upstream-regulation and downstream-regulation high-throughput data. Firstly, we theoretically and computationally demonstrate the effectiveness of APG by comparing with the traditional strategy based only on TF downstream-regulation information. We then apply this model to study spontaneous type 2 diabetic Goto-Kakizaki (GK) and Wistar control rats. Our biological experiments validate the theoretical results. In particular, SP1 is found to be a hidden TF with changed regulatory activity, and the loss of SP1 activity contributes to the increased glucose production during diabetes development. APG model provides theoretical basis to quantitatively elucidate transcriptional regulation by modelling TF combinatorial interactions and exploiting multilevel high-throughput information. PMID:23346354
NASA Astrophysics Data System (ADS)
Buchanan, Mark; Caldarelli, Guido; De Los Rios, Paolo; Rao, Francesco; Vendruscolo, Michele
2010-05-01
Introduction; 1. Network views of the cell Paolo De Los Rios and Michele Vendruscolo; 2. Transcriptional regulatory networks Sarath Chandra Janga and M. Madan Babu; 3. Transcription factors and gene regulatory networks Matteo Brilli, Elissa Calistri and Pietro Lió; 4. Experimental methods for protein interaction identification Peter Uetz, Björn Titz, Seesandra V. Rajagopala and Gerard Cagney; 5. Modeling protein interaction networks Francesco Rao; 6. Dynamics and evolution of metabolic networks Daniel Segré; 7. Hierarchical modularity in biological networks: the case of metabolic networks Erzsébet Ravasz Regan; 8. Signalling networks Gian Paolo Rossini; Appendix 1. Complex networks: from local to global properties D. Garlaschelli and G. Caldarelli; Appendix 2. Modelling the local structure of networks D. Garlaschelli and G. Caldarelli; Appendix 3. Higher-order topological properties S. Ahnert, T. Fink and G. Caldarelli; Appendix 4. Elementary mathematical concepts A. Gabrielli and G. Caldarelli; References.
A Synthetic Biology Framework for Programming Eukaryotic Transcription Functions
Khalil, Ahmad S.; Lu, Timothy K.; Bashor, Caleb J.; Ramirez, Cherie L.; Pyenson, Nora C.; Joung, J. Keith; Collins, James J.
2013-01-01
SUMMARY Eukaryotic transcription factors (TFs) perform complex and combinatorial functions within transcriptional networks. Here, we present a synthetic framework for systematically constructing eukaryotic transcription functions using artificial zinc fingers, modular DNA-binding domains found within many eukaryotic TFs. Utilizing this platform, we construct a library of orthogonal synthetic transcription factors (sTFs) and use these to wire synthetic transcriptional circuits in yeast. We engineer complex functions, such as tunable output strength and transcriptional cooperativity, by rationally adjusting a decomposed set of key component properties, e.g., DNA specificity, affinity, promoter design, protein-protein interactions. We show that subtle perturbations to these properties can transform an individual sTF between distinct roles (activator, cooperative factor, inhibitory factor) within a transcriptional complex, thus drastically altering the signal processing behavior of multi-input systems. This platform provides new genetic components for synthetic biology and enables bottom-up approaches to understanding the design principles of eukaryotic transcriptional complexes and networks. PMID:22863014
2012-01-01
Cell membranes represent the “front line” of cellular defense and the interface between a cell and its environment. To determine the range of proteins and protein complexes that are present in the cell membranes of a target organism, we have utilized a “tagless” process for the system-wide isolation and identification of native membrane protein complexes. As an initial subject for study, we have chosen the Gram-negative sulfate-reducing bacterium Desulfovibrio vulgaris. With this tagless methodology, we have identified about two-thirds of the outer membrane- associated proteins anticipated. Approximately three-fourths of these appear to form homomeric complexes. Statistical and machine-learning methods used to analyze data compiled over multiple experiments revealed networks of additional protein–protein interactions providing insight into heteromeric contacts made between proteins across this region of the cell. Taken together, these results establish a D. vulgaris outer membrane protein data set that will be essential for the detection and characterization of environment-driven changes in the outer membrane proteome and in the modeling of stress response pathways. The workflow utilized here should be effective for the global characterization of membrane protein complexes in a wide range of organisms. PMID:23098413
Enhancing the Functional Content of Eukaryotic Protein Interaction Networks
Pandey, Gaurav; Arora, Sonali; Manocha, Sahil; Whalen, Sean
2014-01-01
Protein interaction networks are a promising type of data for studying complex biological systems. However, despite the rich information embedded in these networks, they face important data quality challenges of noise and incompleteness that adversely affect the results obtained from their analysis. Here, we apply a robust measure of local network structure called common neighborhood similarity (CNS) to address these challenges. Although several CNS measures have been proposed in the literature, an understanding of their relative efficacies for the analysis of interaction networks has been lacking. We follow the framework of graph transformation to convert the given interaction network into a transformed network corresponding to each of a variety of CNS measures evaluated. The effectiveness of each measure is then estimated by comparing the quality of protein function predictions obtained from its corresponding transformed network with those from the original network. Using a large set of human and fly protein interactions, and a set of over GO terms for both, we find that several of the transformed networks produce more accurate predictions than those obtained from the original network. In particular, the measure and other continuous CNS measures perform well on this task, especially for large networks. Further investigation reveals that the two major factors contributing to this improvement are the abilities of CNS measures to prune out noisy edges and enhance functional coherence in the transformed networks. PMID:25275489
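The graph-transformation step described above can be sketched with one widely used common-neighbor score. This is a minimal sketch, not the measures evaluated in the study: it assumes the Jaccard index over open neighborhoods as a stand-in CNS measure, scores only protein pairs that share at least one neighbor, and drops pairs below a threshold. The function name `jaccard_transform`, the threshold, and the toy network are all hypothetical.

```python
from itertools import combinations

def jaccard_transform(adj, threshold=0.0):
    """Build a transformed, weighted network from a binary one.

    adj: dict node -> set of neighbours (undirected, no self-loops).
    Returns {(u, v): score} for pairs sharing at least one neighbour,
    where score = |N(u) & N(v)| / |N(u) | N(v)| and score >= threshold.
    """
    scores = {}
    for u, v in combinations(sorted(adj), 2):
        shared = adj[u] & adj[v]
        if not shared:
            continue  # no common neighbour: the pair is pruned
        score = len(shared) / len(adj[u] | adj[v])
        if score >= threshold:
            scores[(u, v)] = score
    return scores

# Hypothetical toy network: a triangle P1-P2-P3 plus a chain P1-P4-P5.
toy_adj = {
    "P1": {"P2", "P3", "P4"},
    "P2": {"P1", "P3"},
    "P3": {"P1", "P2"},
    "P4": {"P1", "P5"},
    "P5": {"P4"},
}
for pair, score in sorted(jaccard_transform(toy_adj, threshold=0.3).items()):
    print(pair, round(score, 3))
```

Edges with no supporting common neighbor, such as P4-P5 in the toy network, disappear from the transformed network, while pairs embedded in a shared neighborhood gain a continuous weight. This mirrors, in a simplified way, the pruning and coherence effects the abstract reports.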
Inferring drug-disease associations based on known protein complexes.
Yu, Liang; Huang, Jianbin; Ma, Zhixin; Zhang, Jing; Zou, Yapeng; Gao, Lin
2015-01-01
Inferring drug-disease associations is critical in unveiling disease mechanisms, as well as in discovering novel functions of available drugs, or drug repositioning. Previous work is primarily based on drug-gene-disease relationships, which discards much important information, since genes execute their functions by interacting with others. To overcome this issue, we propose a novel methodology that discovers drug-disease associations based on protein complexes. First, an integrated heterogeneous network consisting of drugs, protein complexes, and diseases is constructed, where we assign weights to the drug-disease associations by using probability. Then, from the tripartite network, we get the indirect weighted relationships between drugs and diseases. The larger the weight, the higher the reliability of the correlation. We apply our method to mental disorders and hypertension, and validate the results by using the Comparative Toxicogenomics Database. Our ranked results can be directly reinforced by existing biomedical literature, suggesting that our proposed method obtains higher specificity and sensitivity. The proposed method offers new insight into drug-disease discovery. Our method is publicly available at http://1.complexdrug.sinaapp.com/Drug_Complex_Disease/Data_Download.html. PMID:26044949
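One way to picture the indirect drug-disease weights obtained from a tripartite network is sketched below. The abstract does not give the weighting formula, so this sketch assumes the simplest possible rule: a drug-disease score is the sum, over shared protein complexes, of the product of the drug-complex and complex-disease weights. All identifiers and weights (d1, c1, the numeric values) are hypothetical and are not taken from the paper or its data download.

```python
from collections import defaultdict


def indirect_scores(drug_complex, complex_disease):
    """Combine two weighted bipartite layers into drug-disease scores.

    Assumed rule (not from the paper): score(drug, disease) is the sum over
    complexes of w(drug, complex) * w(complex, disease).
    """
    # Index the second layer by complex so each drug-complex edge is joined once.
    by_complex = defaultdict(list)
    for (cx, disease), weight in complex_disease.items():
        by_complex[cx].append((disease, weight))

    scores = defaultdict(float)
    for (drug, cx), w_dc in drug_complex.items():
        for disease, w_cd in by_complex.get(cx, []):
            scores[(drug, disease)] += w_dc * w_cd
    return dict(scores)


# Hypothetical toy layers.
drug_complex = {("d1", "c1"): 0.5, ("d1", "c2"): 0.2, ("d2", "c2"): 0.8}
complex_disease = {
    ("c1", "hypertension"): 0.4,
    ("c2", "hypertension"): 0.5,
    ("c2", "depression"): 0.25,
}

scores = indirect_scores(drug_complex, complex_disease)
# Larger weight is read as a more reliable association, so rank descending.
ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
for (drug, disease), score in ranked:
    print(drug, disease, round(score, 3))
```

A probabilistic combination or a normalization by complex size would be equally plausible choices; the product-sum rule is used here only because it is the shortest to state.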
How Egg Case Proteins Can Protect Cuttlefish Offspring?
Cornet, Valérie; Henry, Joël; Goux, Didier; Duval, Emilie; Bernay, Benoit; Le Corguillé, Gildas; Corre, Erwan; Zatylny-Gaudin, Céline
2015-01-01
Sepia officinalis egg protection is ensured by a complex capsule produced by the female accessory genital glands and the ink bag. Our study is focused on the proteins constituting the main egg case. De novo transcriptomes from female genital glands provided essential databases for protein identification. A proteomic approach in SDS-PAGE coupled with MS unveiled a new egg case protein family: SepECPs, for Sepia officinalis Egg Case Proteins. N-glycosylation was demonstrated by PAS staining of SDS-PAGE gels. These glycoproteins are mainly produced in the main nidamental glands. SepECPs share high sequence homology, especially in the signal peptide and the three cysteine-rich domains. SepECPs have a high number of cysteines, with conserved motifs involved in 3D-structure. SDS-PAGE showed that SepECPs could form dimers; this result was confirmed by TEM observations, which also revealed a protein network. This network is similar to the capsule network, and it associates these structural proteins with polysaccharides, melanin and bacteria to form a tight mesh. Its hardness and elasticity provide physical protection to the embryo. In addition, SepECPs also have bacteriostatic antimicrobial activity on Gram-negative bacteria. By observing the SepECP / Vibrio aestuarianus complex in SEM, we demonstrated the ability of these proteins to agglomerate bacteria and thus inhibit their growth. These original proteins identified from the outer egg case ensure the survival of the species by providing physical and chemical protection to the embryos released in the environment without any maternal protection. PMID:26168161
Kirkwood, Kathryn J.; Ahmad, Yasmeen; Larance, Mark; Lamond, Angus I.
2013-01-01
Proteins form a diverse array of complexes that mediate cellular function and regulation. A largely unexplored feature of such protein complexes is the selective participation of specific protein isoforms and/or post-translationally modified forms. In this study, we combined native size-exclusion chromatography (SEC) with high-throughput proteomic analysis to characterize soluble protein complexes isolated from human osteosarcoma (U2OS) cells. Using this approach, we have identified over 71,500 peptides and 1,600 phosphosites, corresponding to over 8,000 proteins, distributed across 40 SEC fractions. This represents >50% of the predicted U2OS cell proteome, identified with a mean peptide sequence coverage of 27% per protein. Three biological replicates were performed, allowing statistical evaluation of the data and demonstrating a high degree of reproducibility in the SEC fractionation procedure. Specific proteins were detected interacting with multiple independent complexes, as typified by the separation of distinct complexes for the MRFAP1-MORF4L1-MRGBP interaction network. The data also revealed protein isoforms and post-translational modifications that selectively associated with distinct subsets of protein complexes. Surprisingly, there was clear enrichment for specific Gene Ontology terms associated with differential size classes of protein complexes. This study demonstrates that combined SEC/MS analysis can be used for the system-wide annotation of protein complexes and to predict potential isoform-specific interactions. All of these SEC data on the native separation of protein complexes have been integrated within the Encyclopedia of Proteome Dynamics, an online, multidimensional data-sharing resource available to the community. PMID:24043423
Unlocking Proteomic Heterogeneity in Complex Diseases through Visual Analytics
Bhavnani, Suresh K.; Dang, Bryant; Bellala, Gowtham; Divekar, Rohit; Visweswaran, Shyam; Brasier, Allan; Kurosky, Alex
2015-01-01
Despite years of preclinical development, biological interventions designed to treat complex diseases like asthma often fail in phase III clinical trials. These failures suggest that current methods to analyze biomedical data might be missing critical aspects of biological complexity such as the assumption that cases and controls come from homogeneous distributions. Here we discuss why and how methods from the rapidly evolving field of visual analytics can help translational teams (consisting of biologists, clinicians, and bioinformaticians) to address the challenge of modeling and inferring heterogeneity in the proteomic and phenotypic profiles of patients with complex diseases. Because a primary goal of visual analytics is to amplify the cognitive capacities of humans for detecting patterns in complex data, we begin with an overview of the cognitive foundations for the field of visual analytics. Next, we organize the primary ways in which a specific form of visual analytics called networks have been used to model and infer biological mechanisms, which help to identify the properties of networks that are particularly useful for the discovery and analysis of proteomic heterogeneity in complex diseases. We describe one such approach called subject-protein networks, and demonstrate its application on two proteomic datasets. This demonstration provides insights to help translational teams overcome theoretical, practical, and pedagogical hurdles for the widespread use of subject-protein networks for analyzing molecular heterogeneities, with the translational goal of designing biomarker-based clinical trials, and accelerating the development of personalized approaches to medicine. PMID:25684269
Reddy, Vijay S
2017-09-01
Adenoviruses are respiratory, ocular and enteric pathogens that form complex capsids, which are assembled from seven different structural proteins and composed of several core proteins that closely interact with the packaged dsDNA genome. The recent near-atomic resolution structures revealed that the interlacing continuous hexagonal network formed by the protein IX molecules is conserved among different human adenoviruses (HAdVs), but not in non-HAdVs. In this report, we propose a distinct role for the hexon protein as a "molecular mold" in enabling the formation of such hexagonal protein IX network that has been shown to preserve the stability and infectivity of HAdVs.
Dong, Yongbin; Wang, Qilei; Zhang, Long; Du, Chunguang; Xiong, Wenwei; Chen, Xinjian; Deng, Fei; Ma, Zhiyan; Qiao, Dahe; Hu, Chunhui; Ren, Yangliu; Li, Yuling
2015-01-01
The formation and development of the maize kernel is a complex dynamic physiological and biochemical process that involves the temporal and spatial expression of many proteins and the regulation of metabolic pathways. In this study, the protein profiles of the endosperm and pericarp at three important developmental stages were analyzed by isobaric tags for relative and absolute quantification (iTRAQ) labeling coupled with LC-MS/MS in popcorn inbred N04. Comparative quantitative proteomic analyses among developmental stages and between tissues were performed, and the protein networks were integrated. A total of 6,876 proteins were identified, of which 1,396 were nonredundant. Specific proteins and different expression patterns were observed across developmental stages and tissues. The functional annotation of the identified proteins revealed the importance of metabolic and cellular processes, and binding and catalytic activities for the development of the tissues. The whole, endosperm-specific and pericarp-specific protein networks integrated 125, 9 and 77 proteins, respectively, which were involved in 54 KEGG pathways and reflected their complex metabolic interactions. Confirmation of the iTRAQ endosperm proteins by two-dimensional gel electrophoresis showed that 44.44% of the proteins were commonly found. However, the concordance between mRNA level and protein abundance varied across different proteins, stages, tissues and inbred lines, according to the gene cloning and expression analyses of four relevant proteins with important functions and different expression levels. Nevertheless, western blot results showed the same expression tendency for the four proteins as iTRAQ. These results could provide new insights into the developmental mechanisms of endosperm and pericarp, and grain formation in maize. PMID:26587848
Mapping mechanical force propagation through biomolecular complexes
Schoeler, Constantin; Bernardi, Rafael C.; Malinowska, Klara H.; ...
2015-08-11
In this paper, we employ single-molecule force spectroscopy with an atomic force microscope (AFM) and steered molecular dynamics (SMD) simulations to reveal force propagation pathways through a mechanically ultrastable multidomain cellulosome protein complex. We demonstrate a new combination of network-based correlation analysis supported by AFM directional pulling experiments, which allowed us to visualize stiff paths through the protein complex along which force is transmitted. Finally, the results implicate specific force-propagation routes nonparallel to the pulling axis that are advantageous for achieving high dissociation forces.
Liu, Shiwei; Liu, Yihui; Zhao, Jiawei; Cai, Shitao; Qian, Hongmei; Zuo, Kaijing; Zhao, Lingxia; Zhang, Lida
2017-04-01
Rice (Oryza sativa) is one of the most important staple foods for more than half of the global population. Many rice traits are quantitative, complex and controlled by multiple interacting genes. Thus, a full understanding of genetic relationships will be critical to systematically identify genes controlling agronomic traits. We developed a genome-wide rice protein-protein interaction network (RicePPINet, http://netbio.sjtu.edu.cn/riceppinet) using machine learning with structural relationship and functional information. RicePPINet contained 708 819 predicted interactions for 16 895 non-transposable element related proteins. The power of the network for discovering novel protein interactions was demonstrated through comparison with other publicly available protein-protein interaction (PPI) prediction methods, and by experimentally determined PPI data sets. Furthermore, global analysis of domain-mediated interactions revealed RicePPINet accurately reflects PPIs at the domain level. Our studies showed the efficiency of the RicePPINet-based method in prioritizing candidate genes involved in complex agronomic traits, such as disease resistance and drought tolerance, was approximately 2-11 times better than random prediction. RicePPINet provides an expanded landscape of computational interactome for the genetic dissection of agronomically important traits in rice.
Modular synchronization in complex networks.
Oh, E; Rho, K; Hong, H; Kahng, B
2005-10-01
We study the synchronization transition (ST) of a modified Kuramoto model on two different types of modular complex networks. It is found that the ST depends on the type of intermodular connections. For the network with decentralized (centralized) intermodular connections, the ST occurs at finite coupling constant (behaves abnormally). Such distinct features are found in the yeast protein interaction network and the Internet, respectively. Moreover, by applying the finite-size scaling analysis to an artificial network with decentralized intermodular connections, we obtain the exponent associated with the order parameter of the ST to be β ≈ 1, different from the mean-field value β_MF ≈ 1/2 obtained from the scale-free network with the same degree distribution but without modular structure.
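For readers unfamiliar with the setup, the following is a minimal sketch of the standard Kuramoto model on a small modular network, together with the usual order parameter r. It is not the modified model studied in the abstract, whose exact form is not given here; the two-module toy topology, the Euler integration, the step size and all function names are assumptions made for illustration. Each oscillator i evolves as dθ_i/dt = ω_i + K Σ_j A_ij sin(θ_j − θ_i), and r = |(1/N) Σ_j exp(iθ_j)| measures global phase coherence (r near 1 means synchronized).

```python
import math
import random


def two_module_network(module_size=4):
    """Two fully connected modules joined by a single intermodular link."""
    n = 2 * module_size
    adj = [set() for _ in range(n)]
    for start in (0, module_size):
        for i in range(start, start + module_size):
            for j in range(i + 1, start + module_size):
                adj[i].add(j)
                adj[j].add(i)
    # One bridge between the last node of module 1 and the first of module 2.
    adj[module_size - 1].add(module_size)
    adj[module_size].add(module_size - 1)
    return [sorted(neighbors) for neighbors in adj]


def order_parameter(phases):
    """Kuramoto order parameter r in [0, 1]."""
    n = len(phases)
    re = sum(math.cos(p) for p in phases) / n
    im = sum(math.sin(p) for p in phases) / n
    return math.hypot(re, im)


def simulate_kuramoto(adj, omega, theta0, coupling, dt=0.01, steps=5000):
    """Integrate the standard Kuramoto equations with explicit Euler steps."""
    theta = list(theta0)
    for _ in range(steps):
        dtheta = [
            omega[i] + coupling * sum(math.sin(theta[j] - theta[i]) for j in adj[i])
            for i in range(len(theta))
        ]
        theta = [t + dt * d for t, d in zip(theta, dtheta)]
    return theta


rng = random.Random(0)  # fixed seed so the sketch is reproducible
adj = two_module_network()
theta0 = [rng.uniform(0.0, 2.0) for _ in adj]  # initial phases within an arc < pi
omega = [0.0] * len(adj)  # identical natural frequencies, the simplest case

r_start = order_parameter(theta0)
r_end = order_parameter(simulate_kuramoto(adj, omega, theta0, coupling=1.0))
print("r before:", round(r_start, 3), "r after:", round(r_end, 3))
```

Studying the transition itself would require heterogeneous natural frequencies and a sweep over the coupling constant K on much larger networks; the identical-frequency case is chosen here only because its outcome (full synchronization on a connected network) is easy to check.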
Lee, Kenneth K; Sardiu, Mihaela E; Swanson, Selene K; Gilmore, Joshua M; Torok, Michael; Grant, Patrick A; Florens, Laurence; Workman, Jerry L; Washburn, Michael P
2011-07-05
Despite the availability of several large-scale proteomics studies aiming to identify protein interactions on a global scale, little is known about how proteins interact and are organized within macromolecular complexes. Here, we describe a technique that consists of a combination of biochemistry approaches, quantitative proteomics and computational methods using wild-type and deletion strains to investigate the organization of proteins within macromolecular protein complexes. We applied this technique to determine the organization of two well-studied complexes, Spt–Ada–Gcn5 histone acetyltransferase (SAGA) and ADA, for which no comprehensive high-resolution structures exist. This approach revealed that SAGA/ADA is composed of five distinct functional modules, which can persist separately. Furthermore, we identified a novel subunit of the ADA complex, termed Ahc2, and characterized Sgf29 as an ADA family protein present in all Gcn5 histone acetyltransferase complexes. Finally, we propose a model for the architecture of the SAGA and ADA complexes, which predicts novel functional associations within the SAGA complex and provides mechanistic insights into phenotypical observations in SAGA mutants. PMID:21734642
Understanding cancer complexome using networks, spectral graph theory and multilayer framework
NASA Astrophysics Data System (ADS)
Rai, Aparna; Pradhan, Priodyuti; Nagraj, Jyothi; Lohitesh, K.; Chowdhury, Rajdeep; Jalan, Sarika
2017-02-01
Cancer complexome comprises a heterogeneous and multifactorial milieu that varies in cytology, physiology, signaling mechanisms and response to therapy. The combined framework of network theory and spectral graph theory along with the multilayer analysis provides a comprehensive approach to analyze the proteomic data of seven different cancers, namely, breast, oral, ovarian, cervical, lung, colon and prostate. Our analysis demonstrates that the protein-protein interaction networks of the normal and the cancerous tissues associated with the seven cancers have overall similar structural and spectral properties. However, a few of these properties implicate unsystematic changes from the normal to the disease networks, depicting differences in the interactions and highlighting changes in the complexity of different cancers. Importantly, analysis of common proteins of all the cancer networks reveals a few proteins, namely the sensors, which not only occupy significant positions in all the layers but also have direct involvement in causing cancer. The prediction and analysis of miRNAs targeting these sensor proteins hint towards the possible role of these proteins in tumorigenesis. This novel approach helps in understanding cancer at the fundamental level and provides a clue for developing the promising and nascent concept of single drug therapy for multiple diseases as well as personalized medicine.
Proteomic Analysis of Virus-Host Interactions in an Infectious Context Using Recombinant Viruses
Komarova, Anastassia V.; Combredet, Chantal; Meyniel-Schicklin, Laurène; Chapelle, Manuel; Caignard, Grégory; Camadro, Jean-Michel; Lotteau, Vincent; Vidalain, Pierre-Olivier; Tangy, Frédéric
2011-01-01
RNA viruses exhibit small-sized genomes encoding few proteins, but still establish complex networks of interactions with host cell components to achieve replication and spreading. Ideally, these virus-host protein interactions should be mapped directly in infected cell culture, but such a high standard is often difficult to reach when using conventional approaches. We thus developed a new strategy based on recombinant viruses expressing tagged viral proteins to capture both direct and indirect physical binding partners during infection. As a proof of concept, we engineered a recombinant measles virus (MV) expressing one of its virulence factors, the MV-V protein, with a One-STrEP amino-terminal tag. This allowed virus-host protein complex analysis directly from infected cells by combining modified tandem affinity chromatography and mass spectrometry analysis. Using this approach, we established a rich list of 245 cellular proteins interacting either directly or indirectly with MV-V, and including four of the nine already known partners of this viral factor. These interactions were highly specific to MV-V because they were not recovered when the nucleoprotein MV-N, instead of MV-V, was tagged. Besides key components of the antiviral response, cellular proteins from mitochondria, ribosomes, endoplasmic reticulum, protein phosphatase 2A, and histone deacetylase complex were identified for the first time as prominent targets of MV-V, and the critical role of the latter protein family in MV replication was addressed. Most interestingly, MV-V showed some preferential attachment to essential proteins in the human interactome network, as assessed by centrality and interconnectivity measures. Furthermore, the list of MV-V interactors also showed a massive enrichment for well-known targets of other viruses. Altogether, this clearly supports our approach based on reverse genetics of viruses combined with high-throughput proteomics to probe the interaction network that viruses establish in infected cells. PMID:21911578
Diaz-Montana, Juan J.; Diaz-Diaz, Norberto
2014-01-01
Gene networks are one of the main computational models used to study the interaction between different elements during biological processes, and are widely used to represent gene–gene or protein–protein interaction complexes. We present GFD-Net, a Cytoscape app for visualizing and analyzing the functional dissimilarity of gene networks. PMID:25400907
Zhao, Mingzhu; Wei, Dong-Qing
2013-01-01
Traditional Chinese medicine (TCM), which has thousands of years of clinical application in China and other Asian countries, is the pioneer of “multicomponent-multitarget” and network pharmacology. Although there is no doubt about its efficacy, it is difficult to elucidate a convincing underlying mechanism of TCM due to its complex composition and unclear pharmacology. The use of ligand-protein networks has been gaining significant value in the history of drug discovery, while its application in TCM is still in its early stage. This paper first surveys TCM databases for virtual screening, which have been greatly expanded in size and data diversity in recent years. On that basis, different screening methods and strategies for identifying active ingredients and targets of TCM are outlined based on the amount of network information available, on the sides of both ligand bioactivity and protein structures. Furthermore, successful in silico target identification attempts are discussed in detail, along with experiments exploring the ligand-protein networks of TCM. Finally, it is concluded that ligand-protein networks can be used not only to predict protein targets of a small molecule, but also to explore the mode of action of TCM. PMID:23818932
Gloaguen, Pauline; Alban, Claude; Ravanel, Stéphane; Seigneurin-Berny, Daphné; Matringe, Michel; Ferro, Myriam; Bruley, Christophe; Rolland, Norbert; Vandenbrouck, Yves
2017-01-01
Higher plants, as autotrophic organisms, are effective sources of molecules. They hold great promise for metabolic engineering, but the behavior of plant metabolism at the network level is still incompletely described. Although structural models (stoichiometry matrices) and pathway databases are extremely useful, they cannot describe the complexity of the metabolic context, and new tools are required to visually represent integrated biocurated knowledge for use by both humans and computers. Here, we describe ChloroKB, a Web application (http://chlorokb.fr/) for visual exploration and analysis of the Arabidopsis (Arabidopsis thaliana) metabolic network in the chloroplast and related cellular pathways. The network was manually reconstructed through extensive biocuration to provide transparent traceability of experimental data. Proteins and metabolites were placed in their biological context (spatial distribution within cells, connectivity in the network, participation in supramolecular complexes, and regulatory interactions) using CellDesigner software. The network contains 1,147 reviewed proteins (559 localized exclusively in plastids, 68 in at least one additional compartment, and 520 outside the plastid), 122 proteins awaiting biochemical/genetic characterization, and 228 proteins for which genes have not yet been identified. The visual presentation is intuitive and browsing is fluid, providing instant access to the graphical representation of integrated processes and to a wealth of refined qualitative and quantitative data. ChloroKB will be a significant support for structural and quantitative kinetic modeling, for biological reasoning, when comparing novel data with established knowledge, for computer analyses, and for educational purposes. ChloroKB will be enhanced by continuous updates following contributions from plant researchers. PMID:28442501
Benz, Peter M; Merkel, Carla J; Offner, Kristin; Abeßer, Marco; Ullrich, Melanie; Fischer, Tobias; Bayer, Barbara; Wagner, Helga; Gambaryan, Stepan; Ursitti, Jeanine A; Adham, Ibrahim M; Linke, Wolfgang A; Feller, Stephan M; Fleming, Ingrid; Renné, Thomas; Frantz, Stefan; Unger, Andreas; Schuh, Kai
2013-08-12
Background: In the heart, cytoplasmic actin networks are thought to have important roles in mechanical support, myofibrillogenesis, and ion channel function. However, subcellular localization of cytoplasmic actin isoforms and proteins involved in the modulation of the cytoplasmic actin networks are elusive. Mena and VASP are important regulators of actin dynamics. Due to the lethal phenotype of mice with combined deficiency in Mena and VASP, however, distinct cardiac roles of the proteins remain speculative. In the present study, we analyzed the physiological functions of Mena and VASP in the heart and also investigated the role of the proteins in the organization of cytoplasmic actin networks. Results: We generated a mouse model, which simultaneously lacks Mena and VASP in the heart. Mena/VASP double-deficiency induced dilated cardiomyopathy and conduction abnormalities. In wild-type mice, Mena and VASP specifically interacted with a distinct αII-Spectrin splice variant (SH3i), which is in cardiomyocytes exclusively localized at Z- and intercalated discs. At Z- and intercalated discs, Mena and β-actin localized to the edges of the sarcomeres, where the thin filaments are anchored. In Mena/VASP double-deficient mice, β-actin networks were disrupted and the integrity of Z- and intercalated discs was markedly impaired. Conclusions: Together, our data suggest that Mena, VASP, and αII-Spectrin assemble cardiac multi-protein complexes, which regulate cytoplasmic actin networks. Conversely, Mena/VASP deficiency results in disrupted β-actin assembly, Z- and intercalated disc malformation, and induces dilated cardiomyopathy and conduction abnormalities. PMID:23937664
The amyloid interactome: Exploring protein aggregation
Mastrokalou, Chara V.; Hamodrakas, Stavros J.
2017-01-01
Protein-protein interactions are the quintessence of physiological activities, but also participate in pathological conditions. Amyloid formation, an abnormal protein-protein interaction process, is a widespread phenomenon in divergent proteins and peptides, resulting in a variety of aggregation disorders. The complexity of the mechanisms underlying amyloid formation/amyloidogenicity is a matter of great scientific interest, since their revelation will provide important insight on principles governing protein misfolding, self-assembly and aggregation. The implication of more than one protein in the progression of different aggregation disorders, together with the cited synergistic occurrence between amyloidogenic proteins, highlights the necessity for a more universal approach, during the study of these proteins. In an attempt to address this pivotal need we constructed and analyzed the human amyloid interactome, a protein-protein interaction network of amyloidogenic proteins and their experimentally verified interactors. This network assembled known interconnections between well-characterized amyloidogenic proteins and proteins related to amyloid fibril formation. The consecutive extended computational analysis revealed significant topological characteristics and unraveled the functional roles of all constituent elements. This study introduces a detailed protein map of amyloidogenicity that will aid immensely towards separate intervention strategies, specifically targeting sub-networks of significant nodes, in an attempt to design possible novel therapeutics for aggregation disorders. PMID:28249044
Prediction of heterotrimeric protein complexes by two-phase learning using neighboring kernels
2014-01-01
Background Protein complexes play important roles in biological systems such as gene regulatory networks and metabolic pathways. Most methods for predicting protein complexes try to find protein complexes with size more than three. It, however, is known that protein complexes with smaller sizes occupy a large part of whole complexes for several species. In our previous work, we developed a method with several feature space mappings and the domain composition kernel for prediction of heterodimeric protein complexes, which outperforms existing methods. Results We propose methods for prediction of heterotrimeric protein complexes by extending techniques in the previous work on the basis of the idea that most heterotrimeric protein complexes are not likely to share the same protein with each other. We make use of the discriminant function in support vector machines (SVMs), and design novel feature space mappings for the second phase. As the second classifier, we examine SVMs and relevance vector machines (RVMs). We perform 10-fold cross-validation computational experiments. The results suggest that our proposed two-phase methods and SVM with the extended features outperform the existing method NWE, which was reported to outperform other existing methods such as MCL, MCODE, DPClus, CMC, COACH, RRW, and PPSampler for prediction of heterotrimeric protein complexes. Conclusions We propose two-phase prediction methods with the extended features, the domain composition kernel, SVMs and RVMs. The two-phase method with the extended features and the domain composition kernel using SVM as the second classifier is particularly useful for prediction of heterotrimeric protein complexes. PMID:24564744
Revealing the hidden language of complex networks.
Yaveroğlu, Ömer Nebil; Malod-Dognin, Noël; Davis, Darren; Levnajic, Zoran; Janjic, Vuk; Karapandza, Rasa; Stojmirovic, Aleksandar; Pržulj, Nataša
2014-04-01
Sophisticated methods for analysing complex networks promise to be of great benefit to almost all scientific disciplines, yet they elude us. In this work, we make fundamental methodological advances to rectify this. We discover that the interaction between a small number of roles, played by nodes in a network, can characterize a network's structure and also provide a clear real-world interpretation. Given this insight, we develop a framework for analysing and comparing networks, which outperforms all existing ones. We demonstrate its strength by uncovering novel relationships between seemingly unrelated networks, such as Facebook, metabolic, and protein structure networks. We also use it to track the dynamics of the world trade network, showing that a country's role of a broker between non-trading countries indicates economic prosperity, whereas peripheral roles are associated with poverty. This result, though intuitive, has escaped all existing frameworks. Finally, our approach translates network topology into everyday language, bringing network analysis closer to domain scientists.
Protein Network of the Pseudomonas aeruginosa Denitrification Apparatus
Borrero-de Acuña, José Manuel; Rohde, Manfred; Wissing, Josef; Jänsch, Lothar; Schobert, Max; Molinari, Gabriella; Timmis, Kenneth N.
2016-01-01
ABSTRACT Oxidative phosphorylation using multiple-component, membrane-associated protein complexes is the most effective way for a cell to generate energy. Here, we systematically investigated the multiple protein-protein interactions of the denitrification apparatus of the pathogenic bacterium Pseudomonas aeruginosa. During denitrification, nitrate (Nar), nitrite (Nir), nitric oxide (Nor), and nitrous oxide (Nos) reductases catalyze the reaction cascade of NO3− → NO2− → NO → N2O → N2. Genetic experiments suggested that the nitric oxide reductase NorBC and the regulatory protein NosR are the nucleus of the denitrification protein network. We utilized membrane interactomics in combination with electron microscopy colocalization studies to elucidate the corresponding protein-protein interactions. The integral membrane proteins NorC, NorB, and NosR form the core assembly platform that binds the nitrate reductase NarGHI and the periplasmic nitrite reductase NirS via its maturation factor NirF. The periplasmic nitrous oxide reductase NosZ is linked via NosR. The nitrate transporter NarK2, the nitrate regulatory system NarXL, various nitrite reductase maturation proteins, NirEJMNQ, and the Nos assembly lipoproteins NosFL were also found to be attached. A number of proteins associated with energy generation, including electron-donating dehydrogenases, the complete ATP synthase, almost all enzymes of the tricarboxylic acid (TCA) cycle, and the Sec system of protein transport, among many other proteins, were found to interact with the denitrification proteins. This deduced nitrate respirasome is presumably only one part of an extensive cytoplasmic membrane-anchored protein network connecting cytoplasmic, inner membrane, and periplasmic proteins to mediate key activities occurring at the barrier/interface between the cytoplasm and the external environment. IMPORTANCE The processes of cellular energy generation are catalyzed by large multiprotein enzyme complexes. 
The molecular basis for the interaction of these complexes is poorly understood. We employed membrane interactomics and electron microscopy to determine the protein-protein interactions involved. The well-investigated enzyme complexes of denitrification of the pathogenic bacterium Pseudomonas aeruginosa served as a model. Denitrification is one essential step of the universal N cycle and provides the bacterium with an effective alternative to oxygen respiration. This process allows the bacterium to form biofilms, which create low-oxygen habitats and which are a key in the infection mechanism. Our results provide new insights into the molecular basis of respiration, as well as opening a new window into the infection strategies of this pathogen. PMID:26903416
Understanding protein-nanoparticle interaction: a new gateway to disease therapeutics.
Giri, Karuna; Shameer, Khader; Zimmermann, Michael T; Saha, Sounik; Chakraborty, Prabir K; Sharma, Anirudh; Arvizo, Rochelle R; Madden, Benjamin J; Mccormick, Daniel J; Kocher, Jean-Pierre A; Bhattacharya, Resham; Mukherjee, Priyabrata
2014-06-18
Molecular identification of protein molecules surrounding nanoparticles (NPs) may provide useful information that influences NP clearance, biodistribution, and toxicity. Hence, nanoproteomics provides specific information about the environment that NPs interact with and can therefore report on the changes in protein distribution that occurs during tumorigenesis. Therefore, we hypothesized that characterization and identification of protein molecules that interact with 20 nm AuNPs from cancer and noncancer cells may provide mechanistic insights into the biology of tumor growth and metastasis and identify new therapeutic targets in ovarian cancer. Hence, in the present study, we systematically examined the interaction of the protein molecules with 20 nm AuNPs from cancer and noncancerous cell lysates. Time-resolved proteomic profiles of NP-protein complexes demonstrated electrostatic interaction to be the governing factor in the initial time-points which are dominated by further stabilization interaction at longer time-points as determined by ultraviolet-visible spectroscopy (UV-vis), dynamic light scattering (DLS), ζ-potential measurements, transmission electron microscopy (TEM), and tandem mass spectrometry (MS/MS). Reduction in size, charge, and number of bound proteins were observed as the protein-NP complex stabilized over time. Interestingly, proteins related to mRNA processing were overwhelmingly represented on the NP-protein complex at all times. More importantly, comparative proteomic analyses revealed enrichment of a number of cancer-specific proteins on the AuNP surface. Network analyses of these proteins highlighted important hub nodes that could potentially be targeted for maximal therapeutic advantage in the treatment of ovarian cancer. The importance of this methodology and the biological significance of the network proteins were validated by a functional study of three hubs that exhibited variable connectivity, namely, PPA1, SMNDC1, and PI15. 
Western blot analysis revealed overexpression of these proteins in ovarian cancer cells when compared to normal cells. Silencing of PPA1, SMNDC1, and PI15 by the siRNA approach significantly inhibited proliferation of ovarian cancer cells and the effect correlated with the connectivity pattern obtained from our network analyses.
A comparative study of disease genes and drug targets in the human protein interactome
2015-01-01
Background Disease genes cause or contribute genetically to the development of most complex diseases. Drugs are the major approach to treating complex diseases, acting through interaction with their targets. Thus, drug targets are critical for treatment efficacy. However, the interrelationship between the disease genes and drug targets is not clear. Results In this study, we comprehensively compared the network properties of disease genes and drug targets for five major disease categories (cancer, cardiovascular disease, immune system disease, metabolic disease, and nervous system disease). We first collected disease genes from genome-wide association studies (GWAS) for five disease categories and collected their corresponding drugs based on drugs' Anatomical Therapeutic Chemical (ATC) classification. Then, we obtained the drug targets for these five different disease categories. We found that, though the intersections between disease genes and drug targets were small, disease genes were significantly enriched in targets compared to their enrichment in human protein-coding genes. We further compared network properties of the proteins encoded by disease genes and drug targets in human protein-protein interaction networks (interactome). The results showed that the drug targets tended to have higher degree, higher betweenness, and lower clustering coefficient in cancer. Furthermore, we observed a clear fraction increase of disease proteins or drug targets in the near neighborhood compared with the randomized genes. Conclusions The study presents the first comprehensive comparison of the disease genes and drug targets in the context of interactome. The results provide some foundational network characteristics for further designing computational strategies to predict novel drug targets and drug repurposing. PMID:25861037
A comparative study of disease genes and drug targets in the human protein interactome.
Sun, Jingchun; Zhu, Kevin; Zheng, W; Xu, Hua
2015-01-01
Disease genes cause or contribute genetically to the development of most complex diseases. Drugs are the major approach to treating complex diseases, acting through interaction with their targets. Thus, drug targets are critical for treatment efficacy. However, the interrelationship between the disease genes and drug targets is not clear. In this study, we comprehensively compared the network properties of disease genes and drug targets for five major disease categories (cancer, cardiovascular disease, immune system disease, metabolic disease, and nervous system disease). We first collected disease genes from genome-wide association studies (GWAS) for five disease categories and collected their corresponding drugs based on drugs' Anatomical Therapeutic Chemical (ATC) classification. Then, we obtained the drug targets for these five different disease categories. We found that, though the intersections between disease genes and drug targets were small, disease genes were significantly enriched in targets compared to their enrichment in human protein-coding genes. We further compared network properties of the proteins encoded by disease genes and drug targets in human protein-protein interaction networks (interactome). The results showed that the drug targets tended to have higher degree, higher betweenness, and lower clustering coefficient in cancer. Furthermore, we observed a clear fraction increase of disease proteins or drug targets in the near neighborhood compared with the randomized genes. The study presents the first comprehensive comparison of the disease genes and drug targets in the context of interactome. The results provide some foundational network characteristics for further designing computational strategies to predict novel drug targets and drug repurposing.
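The three node-level properties compared in this abstract (degree, betweenness, and clustering coefficient) can be illustrated on a toy graph. The following is a minimal sketch, not the authors' pipeline: the graph, the node names, and the function names are hypothetical, the network is treated as undirected and unweighted, and betweenness is computed with the standard Brandes algorithm as an unnormalized count of node pairs.

```python
from collections import deque


def build_adjacency(edges):
    """Undirected adjacency map {node: set(neighbors)} from an edge list."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def degree(adj):
    """Number of interaction partners of each node."""
    return {v: len(nbrs) for v, nbrs in adj.items()}


def clustering_coefficient(adj):
    """Fraction of a node's neighbor pairs that are themselves connected."""
    result = {}
    for v, nbrs in adj.items():
        k = len(nbrs)
        if k < 2:
            result[v] = 0.0  # convention: undefined cases are reported as 0
            continue
        nbr_list = sorted(nbrs)
        links = sum(
            1
            for i, a in enumerate(nbr_list)
            for b in nbr_list[i + 1:]
            if b in adj[a]
        )
        result[v] = 2.0 * links / (k * (k - 1))
    return result


def betweenness(adj):
    """Unnormalized shortest-path betweenness (Brandes, unweighted graph)."""
    bc = {v: 0.0 for v in adj}
    for s in adj:
        order = []                      # nodes in order of non-decreasing distance
        preds = {v: [] for v in adj}    # shortest-path predecessors
        sigma = {v: 0 for v in adj}     # number of shortest paths from s
        dist = {v: -1 for v in adj}
        sigma[s], dist[s] = 1, 0
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in adj}
        for w in reversed(order):       # accumulate dependencies back toward s
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1.0 + delta[w])
            if w != s:
                bc[w] += delta[w]
    # Each unordered pair was counted from both endpoints.
    return {v: b / 2.0 for v, b in bc.items()}


# Hypothetical toy network: a triangle (A, B, C) attached to a hub H
# that also links two peripheral nodes D and E.
toy_edges = [("A", "B"), ("B", "C"), ("A", "C"),
             ("A", "H"), ("H", "D"), ("H", "E")]
toy_adj = build_adjacency(toy_edges)
```

In this toy graph the hub H and the triangle member A have the same degree, yet H has zero clustering and the higher betweenness. That is the kind of profile the abstract reports for drug targets in cancer (higher degree, higher betweenness, lower clustering coefficient).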
The diminishing role of hubs in dynamical processes on complex networks.
Quax, Rick; Apolloni, Andrea; Sloot, Peter M A
2013-11-06
It is notoriously difficult to predict the behaviour of a complex self-organizing system, where the interactions among dynamical units form a heterogeneous topology. Even if the dynamics of each microscopic unit is known, a real understanding of their contributions to the macroscopic system behaviour is still lacking. Here, we develop information-theoretical methods to distinguish the contribution of each individual unit to the collective out-of-equilibrium dynamics. We show that for a system of units connected by a network of interaction potentials with an arbitrary degree distribution, highly connected units have less impact on the system dynamics when compared with intermediately connected units. In an equilibrium setting, the hubs are often found to dictate the long-term behaviour. However, we find both analytically and experimentally that the instantaneous states of these units have a short-lasting effect on the state trajectory of the entire system. We present qualitative evidence of this phenomenon from empirical findings about a social network of product recommendations, a protein-protein interaction network and a neural network, suggesting that it might indeed be a widespread property in nature.
oGNM: online computation of structural dynamics using the Gaussian Network Model
Yang, Lee-Wei; Rader, A. J.; Liu, Xiong; Jursa, Cristopher Jon; Chen, Shann Ching; Karimi, Hassan A.; Bahar, Ivet
2006-01-01
An assessment of the equilibrium dynamics of biomolecular systems, and in particular their most cooperative fluctuations accessible under native state conditions, is a first step towards understanding molecular mechanisms relevant to biological function. We present a web-based system, oGNM, that enables users to calculate online the shape and dispersion of normal modes of motion for proteins, oligonucleotides and their complexes, or associated biological units, using the Gaussian Network Model (GNM). Computations with the new engine are 5–6 orders of magnitude faster than those using conventional normal mode analyses. Two case studies illustrate the utility of oGNM. The first shows that the thermal fluctuations predicted for 1250 non-homologous proteins correlate well with X-ray crystallographic data over a broad range [7.3–15 Å] of inter-residue interaction cutoff distances and the correlations improve with increasing observation temperatures. The second study, focused on 64 oligonucleotides and oligonucleotide–protein complexes, shows that good agreement with experiments is achieved by representing each nucleotide by three GNM nodes (as opposed to one-node-per-residue in proteins) along with uniform interaction ranges for all components of the complexes. These results open the way to a rapid assessment of the dynamics of DNA/RNA-containing complexes. The server can be accessed at . PMID:16845002
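The core GNM calculation behind such predictions can be sketched in a few lines. This is a generic illustration of the textbook model, not the oGNM server's code: nodes closer than a cutoff are connected in a Kirchhoff (graph Laplacian) matrix, and the predicted mean-square fluctuation of each node is proportional to the corresponding diagonal element of that matrix's pseudo-inverse. The function names and the default cutoff (taken from the lower end of the range quoted above) are assumptions, the physical prefactor is omitted, and the pseudo-inverse is obtained through the identity Γ⁺ = (Γ + J/N)⁻¹ − J/N, which holds only when the contact network is connected.

```python
import math


def kirchhoff_matrix(coords, cutoff):
    """Kirchhoff matrix: -1 for node pairs within the cutoff, degree on the diagonal."""
    n = len(coords)
    gamma = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(coords[i], coords[j]) <= cutoff:
                gamma[i][j] = gamma[j][i] = -1.0
                gamma[i][i] += 1.0
                gamma[j][j] += 1.0
    return gamma


def invert(matrix):
    """Gauss-Jordan inversion with partial pivoting (small dense matrices only)."""
    n = len(matrix)
    aug = [row[:] + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("singular matrix: is the contact network connected?")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [x / scale for x in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0.0:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def gnm_fluctuations(coords, cutoff=7.3):
    """Relative mean-square fluctuations: diagonal of the Kirchhoff pseudo-inverse."""
    n = len(coords)
    gamma = kirchhoff_matrix(coords, cutoff)
    # Shifting by J/N moves the zero eigenvalue to 1 so the matrix becomes invertible.
    shifted = [[gamma[i][j] + 1.0 / n for j in range(n)] for i in range(n)]
    inverse = invert(shifted)
    return [inverse[i][i] - 1.0 / n for i in range(n)]


# Hypothetical example: five nodes on a straight line, 3.8 Å apart.
chain = [(3.8 * i, 0.0, 0.0) for i in range(5)]
chain_fluctuations = gnm_fluctuations(chain, cutoff=4.0)
```

With a 4.0 Å cutoff only consecutive nodes of the example chain are in contact, so the two ends come out as the most mobile nodes and the centre as the least mobile. In practice the values would be rescaled and compared with crystallographic B-factors, as in the first case study described above.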
Jia, Peilin; Chen, Xiangning; Fanous, Ayman H; Zhao, Zhongming
2018-05-24
Genetic components susceptible to complex disease such as schizophrenia include a wide spectrum of variants, including common variants (CVs) and de novo mutations (DNMs). Although CVs and DNMs differ by origin, it remains elusive whether and how they interact at the gene, pathway, and network levels to lead to the disease. In this work, we characterized the genes harboring schizophrenia-associated CVs (CVgenes) and the genes harboring DNMs (DNMgenes) using measures from network, tissue-specific expression profile, and spatiotemporal brain expression profile. We developed an algorithm to link the DNMgenes and CVgenes in spatiotemporal brain co-expression networks. DNMgenes tended to have central roles in the human protein-protein interaction (PPI) network, evidenced in their high degree and high betweenness values. DNMgenes and CVgenes connected with each other significantly more often than with other genes in the networks. However, only CVgenes remained significantly connected after adjusting for their degree. In our gene co-expression PPI network, we found DNMgenes and CVgenes connected in a tissue-specific fashion, and such a pattern was similar to that in GTEx brain but not in other GTEx tissues. Importantly, DNMgene-CVgene subnetworks were enriched with pathways of chromatin remodeling, MHC protein complex binding, and neurotransmitter activities. In summary, our results unveiled that both DNMgenes and CVgenes contributed to a core set of biologically important pathways and networks, and their interactions may contribute to the risk for schizophrenia. Our results also suggested a stronger biological effect of DNMgenes than CVgenes in schizophrenia.
Model identification of signal transduction networks from data using a state regulator problem.
Gadkar, K G; Varner, J; Doyle, F J
2005-03-01
Advances in molecular biology provide an opportunity to develop detailed models of biological processes that can be used to obtain an integrated understanding of the system. However, development of useful models from the available knowledge of the system and experimental observations still remains a daunting task. In this work, a model identification strategy for complex biological networks is proposed. The approach includes a state regulator problem (SRP) that provides estimates of all the component concentrations and the reaction rates of the network using the available measurements. The full set of the estimates is utilised for model parameter identification for the network of known topology. An a priori model complexity test that indicates the feasibility of performance of the proposed algorithm is developed. Fisher information matrix (FIM) theory is used to address model identifiability issues. Two signalling pathway case studies, the caspase function in apoptosis and the MAP kinase cascade system, are considered. The MAP kinase cascade, with measurements restricted to protein complex concentrations, fails the a priori test and the SRP estimates are poor as expected. The apoptosis network structure used in this work has moderate complexity and is suitable for application of the proposed tools. Using a measurement set of seven protein concentrations, accurate estimates for all unknowns are obtained. Furthermore, the effects of measurement sampling frequency and quality of information in the measurement set on the performance of the identified model are described.
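How a Fisher information matrix can flag an identifiability problem is easiest to see on a small example. The sketch below is a generic illustration and not the authors' algorithm or their a priori complexity test: the two toy models, the parameter values, the sampling times, and the assumption of independent measurement noise with unit variance are all hypothetical. The FIM is formed as SᵀS/σ² from finite-difference output sensitivities S; a smallest eigenvalue close to zero means some combination of parameters cannot be estimated from the measurements.

```python
import math


def sensitivities(model, params, times, rel_step=1e-6):
    """Central-difference sensitivities dy/dp_k at each sampling time."""
    rows = []
    for t in times:
        row = []
        for k, p in enumerate(params):
            h = rel_step * max(abs(p), 1.0)
            up = list(params)
            down = list(params)
            up[k] = p + h
            down[k] = p - h
            row.append((model(up, t) - model(down, t)) / (2.0 * h))
        rows.append(row)
    return rows


def fisher_information(sens, sigma=1.0):
    """FIM = S^T S / sigma^2 for independent noise of standard deviation sigma."""
    n_par = len(sens[0])
    return [[sum(r[i] * r[j] for r in sens) / sigma ** 2 for j in range(n_par)]
            for i in range(n_par)]


def eigenvalues_2x2(m):
    """Eigenvalues (smallest, largest) of a symmetric 2x2 matrix."""
    trace = m[0][0] + m[1][1]
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    disc = math.sqrt(max(trace * trace - 4.0 * det, 0.0))
    return (trace - disc) / 2.0, (trace + disc) / 2.0


def identifiability_ratio(model, params, times):
    """Smallest / largest FIM eigenvalue; values near 0 signal non-identifiability."""
    fim = fisher_information(sensitivities(model, params, times))
    lam_min, lam_max = eigenvalues_2x2(fim)
    return lam_min / lam_max


def product_model(params, t):
    """Hypothetical model in which only the product a*b affects the output."""
    a, b = params
    return a * b * t


def decay_model(params, t):
    """Hypothetical model with separately identifiable amplitude and rate."""
    a, b = params
    return a * math.exp(-b * t)


sample_times = [1.0, 2.0, 3.0]
ratio_product = identifiability_ratio(product_model, [2.0, 3.0], sample_times)
ratio_decay = identifiability_ratio(decay_model, [2.0, 0.5], sample_times)
```

For the product model the two sensitivity columns are proportional, so the FIM is singular up to rounding and the ratio is effectively zero. For the decay model the ratio is clearly positive, which indicates that both parameters can in principle be estimated from these three samples.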
Wang, Le; Tan, Nana; Hu, Jiayao; Wang, Huan; Duan, Dongzhu; Ma, Lin; Xiao, Jian; Wang, Xiaoling
2017-12-28
Osmanthus fragrans has been used as folk medicine for thousands of years. The extracts of Osmanthus fragrans flowers were reported to have various bioactivities including free radical scavenging, anti-inflammation, neuroprotection and antitumor effects. However, there is still a lack of knowledge about its essential oil. In this work, we analyzed the chemical composition of the essential oil from Osmanthus fragrans var. thunbergii by GC-MS. A complex network approach was applied to investigate the interrelationships between the ingredients, target proteins, and related pathways for the essential oil. Statistical characteristics of the networks were further studied to explore the main active ingredients and potential bioactivities of O. fragrans var. thunbergii essential oil. A total of 44 ingredients were selected from the chemical composition of O. fragrans var. thunbergii essential oil, and 191 potential target proteins together with 70 pathways were collected for these compounds. An ingredient-target-pathway network was constructed based on these data and showed a scale-free property with a power-law degree distribution. Eugenol and geraniol were screened as main active ingredients with much higher degree values. Potential neuroprotective and anti-tumor effects of the essential oil were also found. A core subnetwork was extracted from the ingredient-target-pathway network, and indicated that eugenol and geraniol contributed most to the neuroprotection of this essential oil. Furthermore, a pathway-based protein association network was built and exhibited a small-world property. MAPK1 and MAPK3 were considered as key proteins with the highest scores of centrality indices, which might play an important role in the anti-tumor effect of the essential oil. This work predicted the main active ingredients and bioactivities of O. fragrans var. thunbergii essential oil, which would benefit the development and utilization of Osmanthus fragrans flowers. 
The application of complex network theory proved effective in bioactivity studies of the essential oil. Moreover, it provides a novel strategy for exploring the molecular mechanisms of traditional medicines.
Ukleja, Marta; Valpuesta, José María; Dziembowski, Andrzej; Cuellar, Jorge
2016-10-01
Large protein assemblies are usually the effectors of major cellular processes. The intricate cell homeostasis network is divided into numerous interconnected pathways, each controlled by a set of protein machines. One of these master regulators is the CCR4-NOT complex, which ultimately controls protein expression levels. This multisubunit complex assembles around a scaffold platform, which enables a wide variety of well-studied functions from mRNA synthesis to transcript decay, as well as other tasks still being identified. Solving the structure of the entire CCR4-NOT complex will help to define the distribution of its functions. The recently published three-dimensional reconstruction of the complex, in combination with the known crystal structures of some of the components, has begun to address this. Methodological improvements in structural biology, especially in cryoelectron microscopy, encourage further structural and protein-protein interaction studies, which will advance our comprehension of the gene expression machinery.
MOCASSIN-prot: a multi-objective clustering approach for protein similarity networks.
Keel, Brittney N; Deng, Bo; Moriyama, Etsuko N
2018-04-15
Proteins often include multiple conserved domains. Various evolutionary events including duplication and loss of domains, domain shuffling, as well as sequence divergence contribute to generating complexities in protein structures, and consequently, in their functions. The evolutionary history of proteins is hence best modeled through networks that incorporate information both from the sequence divergence and the domain content. Here, a game-theoretic approach proposed for protein network construction is adapted into the framework of multi-objective optimization, and extended to incorporate a clustering refinement procedure. The new method, MOCASSIN-prot, was applied to cluster multi-domain proteins from ten genomes. The performance of MOCASSIN-prot was compared against two protein clustering methods, Markov clustering (TRIBE-MCL) and spectral clustering (SCPS). We showed that compared to these two methods, MOCASSIN-prot, which uses both domain composition and quantitative sequence similarity information, generates fewer false positives. It achieves more functionally coherent protein clusters and better differentiates protein families. MOCASSIN-prot, implemented in Perl and Matlab, is freely available at http://bioinfolab.unl.edu/emlab/MOCASSINprot. Contact: emoriyama2@unl.edu. Supplementary data are available at Bioinformatics online.
Liu, Yun; Wang, Huixiang; Liu, Qingping; Qu, Haiyun; Liu, Baohong; Yang, Pengyuan
2010-11-07
A microfluidic reactor has been developed for rapid enhancement of protein digestion by constructing an alumina network within a poly(ethylene terephthalate) (PET) microchannel. Trypsin is stably immobilized in a sol-gel network on the PET channel surface after pretreatment, which produces a protein-resistant interface to reduce memory effects, as characterized by X-ray fluorescence spectrometry and electroosmotic flow. The gel-derived network within a microchannel provides a large surface-to-volume ratio stationary phase for highly efficient proteolysis of proteins existing both at a low level and in complex extracts. The maximum reaction rate of the encapsulated trypsin reactor, measured by kinetic analysis, is much faster than in bulk solution. Due to the microscopic confinement effect, high levels of enzyme entrapment and the biocompatible microenvironment provided by the alumina gel network, the low-level proteins can be efficiently digested using such a microreactor within a very short residence time of a few seconds. The on-chip microreactor is further applied to the identification of a mixture of proteins extracted from normal mouse liver cytoplasm sample via integration with 2D-LC-ESI-MS/MS to show its potential application for large-scale protein identification.
Stetz, Gabrielle; Verkhivker, Gennady M
2016-08-22
Although molecular mechanisms of allosteric regulation in the Hsp70 chaperones have been extensively studied at both structural and functional levels, the current understanding of allosteric inhibition of chaperone activities by small molecules is still lacking. In the current study, using a battery of computational approaches, we probed allosteric inhibition mechanisms of E. coli Hsp70 (DnaK) and human Hsp70 proteins by small molecule inhibitors PET-16 and novolactone. Molecular dynamics simulations and binding free energy analysis were combined with network-based modeling of residue interactions and allosteric communications to systematically characterize and compare molecular signatures of the apo form, substrate-bound, and inhibitor-bound chaperone complexes. The results suggested a mechanism by which the allosteric inhibitors may leverage binding energy hotspots in the interaction networks to stabilize a specific conformational state and impair the interdomain allosteric control. Using the network-based centrality analysis and community detection, we demonstrated that substrate binding may strengthen the connectivity of local interaction communities, leading to a dense interaction network that can promote an efficient allosteric communication. In contrast, binding of PET-16 to DnaK may induce significant dynamic changes and lead to a fractured interaction network and impaired allosteric communications in the DnaK complex. By using a mechanistic-based analysis of distance fluctuation maps and allosteric propensities of protein residues, we determined that the allosteric network in the PET-16 complex may be small and localized due to the reduced communication and low cooperativity of the substrate binding loops, which may promote the higher rates of substrate dissociation and the decreased substrate affinity. 
In comparison with the significant effect of PET-16, binding of novolactone to HSPA1A may cause only moderate network changes and preserve allosteric coupling between the allosteric pocket and the substrate binding region. The impact of novolactone on the conformational dynamics and allosteric communications in the HSPA1A complex was comparable to the substrate effect, which is consistent with the experimental evidence that PET-16, but not novolactone binding, can significantly decrease substrate affinity. We argue that the unique dynamic and network signatures of PET-16 and novolactone may be linked with the experimentally observed functional effects of these inhibitors on allosteric regulation and substrate binding.
Comprehensive curation and analysis of global interaction networks in Saccharomyces cerevisiae
Reguly, Teresa; Breitkreutz, Ashton; Boucher, Lorrie; Breitkreutz, Bobby-Joe; Hon, Gary C; Myers, Chad L; Parsons, Ainslie; Friesen, Helena; Oughtred, Rose; Tong, Amy; Stark, Chris; Ho, Yuen; Botstein, David; Andrews, Brenda; Boone, Charles; Troyanskya, Olga G; Ideker, Trey; Dolinski, Kara; Batada, Nizar N; Tyers, Mike
2006-01-01
Background The study of complex biological networks and prediction of gene function has been enabled by high-throughput (HTP) methods for detection of genetic and protein interactions. Sparse coverage in HTP datasets may, however, distort network properties and confound predictions. Although a vast number of well substantiated interactions are recorded in the scientific literature, these data have not yet been distilled into networks that enable system-level inference. Results We describe here a comprehensive database of genetic and protein interactions, and associated experimental evidence, for the budding yeast Saccharomyces cerevisiae, as manually curated from over 31,793 abstracts and online publications. This literature-curated (LC) dataset contains 33,311 interactions, on the order of all extant HTP datasets combined. Surprisingly, HTP protein-interaction datasets currently achieve only around 14% coverage of the interactions in the literature. The LC network nevertheless shares attributes with HTP networks, including scale-free connectivity and correlations between interactions, abundance, localization, and expression. We find that essential genes or proteins are enriched for interactions with other essential genes or proteins, suggesting that the global network may be functionally unified. This interconnectivity is supported by a substantial overlap of protein and genetic interactions in the LC dataset. We show that the LC dataset considerably improves the predictive power of network-analysis approaches. The full LC dataset is available at the BioGRID and SGD databases. Conclusion Comprehensive datasets of biological interactions derived from the primary literature provide critical benchmarks for HTP methods, augment functional prediction, and reveal system-level attributes of biological networks. PMID:16762047
Proteolytic crosstalk in multi-protease networks
NASA Astrophysics Data System (ADS)
Ogle, Curtis T.; Mather, William H.
2016-04-01
Processive proteases, such as ClpXP in E. coli, are conserved enzyme assemblies that can recognize and rapidly degrade proteins. These proteases are used for a number of purposes, including degrading mistranslated proteins and controlling cellular stress response. However, proteolytic machinery within the cell is limited in capacity and can lead to a bottleneck in protein degradation, whereby many proteins compete (‘queue’) for proteolytic resources. Previous work has demonstrated that such queueing can lead to pronounced statistical relationships between different protein counts when proteins compete for a single common protease. However, real cells contain many different proteases, e.g. ClpXP, ClpAP, and Lon in E. coli, and it is not clear how competition between proteins for multiple classes of protease would influence the dynamics of cellular networks. In the present work, we theoretically demonstrate that a multi-protease proteolytic bottleneck can substantially couple the dynamics for both simple and complex (oscillatory) networks, even between substrates with substantially different affinities for protease. For these networks, queueing often leads to strong positive correlations between protein counts, and these correlations are strongest near the queueing theoretic point of balance. Furthermore, we find that the qualitative behavior of these networks depends on the relative size of the absolute affinity of substrate to protease compared to the cross affinity of substrate to protease, leading in certain regimes to priority queue statistics.
Lin, Che; Lin, Chin-Nan; Wang, Yu-Chao; Liu, Fang-Yu; Chuang, Yung-Jen; Lan, Chung-Yu; Hsieh, Wen-Ping; Chen, Bor-Sen
2014-10-24
The immune system is a key biological system present in vertebrates. Exposure to pathogens elicits various defensive immune mechanisms that protect the host from potential threats and harmful substances derived from pathogens such as parasites, bacteria, and viruses. The complex immune system of humans and many other vertebrates can be divided into two major categories: the innate and the adaptive immune systems. At present, analysis of the complex interactions between the two subsystems that regulate host defense and inflammatory responses remains challenging. Based on time-course microarray data following primary and secondary infection of zebrafish by Candida albicans, we constructed two intracellular protein-protein interaction (PPI) networks for primary and secondary responses of the host. 57 proteins and 341 PPIs were identified for primary infection while 90 proteins and 385 PPIs were identified for secondary infection. There were 20 proteins in common, while 37 and 70 proteins were specific to primary and secondary infection, respectively. By inspecting the hub proteins of each network and comparing significant changes in the number of linkages between the two PPI networks, we identified TGF-β signaling and apoptosis as two of the main functional modules involved in primary and secondary infection. Our initial in silico analyses pave the way for further investigation into the interesting roles played by the TGF-β signaling pathway and apoptosis in innate and adaptive immunity in zebrafish. Such insights could lead to therapeutic advances and improved drug design in the continual battle against infectious diseases.
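The protein counts reported for the two infection networks can be checked with simple set operations. The sketch below is only an illustration: the identifiers are placeholders (the abstract does not list protein names), and only the set sizes are taken from the text.

```python
# Placeholder identifiers (hypothetical); only the set sizes come from the abstract:
# 57 proteins in the primary network, 90 in the secondary, 20 shared.
shared = {f"S{i}" for i in range(20)}
primary = shared | {f"P{i}" for i in range(37)}
secondary = shared | {f"Q{i}" for i in range(70)}


def compare_networks(a, b):
    """Return (common, only_a, only_b) node sets for two networks."""
    return a & b, a - b, b - a


common, only_primary, only_secondary = compare_networks(primary, secondary)
print(len(primary), len(secondary), len(common),
      len(only_primary), len(only_secondary))
# prints: 57 90 20 37 70
```

The same comparison applied to edge sets instead of node sets would give the PPIs specific to each response, although the abstract does not report those numbers.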
The ribonucleoprotein Csr network.
Seyll, Ethel; Van Melderen, Laurence
2013-11-08
Ribonucleoprotein complexes are essential regulatory components in bacteria. In this review, we focus on the carbon storage regulator (Csr) network, which is well conserved in the bacterial world. This regulatory network is composed of the CsrA master regulator, its targets and regulators. CsrA binds to mRNA targets and regulates translation either negatively or positively. Binding to small non-coding RNAs controls activity of this protein. Expression of these regulators is tightly regulated at the level of transcription and stability by various global regulators (RNAses, two-component systems, alarmone). We discuss the implications of these complex regulations in bacterial adaptation.
Flavivirus Replication Complex Assembly Revealed by DNAJC14 Functional Mapping
Yi, Zhigang; Yuan, Zhenghong; Rice, Charles M.
2012-01-01
DNAJC14 is an Hsp40 family member that broadly modulates flavivirus replication. The mechanism by which DNAJC14 stoichiometrically participates in flavivirus replication complex (RC) formation is unknown; both reduced and elevated levels result in replication inhibition. Using yellow fever virus (YFV), we demonstrate that DNAJC14 redistributes and clusters with YFV nonstructural proteins via a transmembrane domain and a newly identified membrane-binding domain (MBD), which both mediate targeting to detergent-resistant membranes. Furthermore, the RC and DNAJC14 reside as part of a protein interaction network that remains after 1% Triton solubilization. Mutagenesis studies demonstrate that entry into this protein interaction network requires the DNAJC14 C-terminal self-interaction domain. Fusion of the DNAJC14 MBD and self-interaction domain with another Hsp40 family protein is sufficient to confer YFV-inhibitory activity. Our findings support a novel model of DNAJC14 action that includes specific membrane targeting of both DNAJC14 and YFV replication proteins, the formation of protein interactions, and a microdomain-specific chaperone event leading to RC formation. This process alters the properties of the RC membrane and results in the formation of a protein scaffold that maintains the RC. PMID:22915803
Horváth, Gergő; Bencsura, Ákos; Simon, Ágnes; Tochtrop, Gregory P; DeKoster, Gregory T; Covey, Douglas F; Cistola, David P; Toke, Orsolya
2016-02-01
Besides aiding digestion, bile salts are important signal molecules exhibiting a regulatory role in metabolic processes. Human ileal bile acid binding protein (I-BABP) is an intracellular carrier of bile salts in the epithelial cells of the distal small intestine and has a key role in the enterohepatic circulation of bile salts. Positive binding cooperativity combined with site selectivity of glycocholate and glycochenodeoxycholate, the two most abundant bile salts in the human body, make human I-BABP a unique member of the family of intracellular lipid binding proteins. Solution NMR structure of the ternary complex of human I-BABP with glycocholate and glycochenodeoxycholate reveals an extensive network of hydrogen bonds and hydrophobic interactions stabilizing the bound bile salts. Conformational changes accompanying bile salt binding affect four major regions in the protein including the C/D, E/F and G/H loops as well as the helical segment. Most of these protein regions coincide with a previously described network of millisecond time scale fluctuations in the apo protein, a motion absent in the bound state. Comparison of the heterotypic doubly ligated complex with the unligated form provides further evidence of a conformation selection mechanism of ligand entry. Structural and dynamic aspects of human I-BABP-bile salt interaction are discussed and compared with characteristics of ligand binding in other members of the intracellular lipid binding protein family. The coordinates of the 10 lowest energy structures of the human I-BABP : GCDA : GCA complex as well as the distance restraints used to calculate the final ensemble have been deposited in the Brookhaven Protein Data Bank with accession number 2MM3. © 2015 FEBS.
Rheological and structural characterization of agar/whey proteins insoluble complexes.
Rocha, Cristina M R; Souza, Hiléia K S; Magalhães, Natália F; Andrade, Cristina T; Gonçalves, Maria Pilar
2014-09-22
Complex coacervation between whey proteins and carboxylated or highly sulphated polysaccharides has been widely studied. The aim of this work was to characterise a slightly sulphated polysaccharide (agar) and whey protein insoluble complexes in terms of yield, composition and physicochemical properties as well as to study their rheological behaviour to better understand their structure. Unlike other sulphated polysaccharides, complexation of agar and whey protein at pH 3 in the absence of a buffering agent resulted in a coacervate that was a gel at 20°C with rheological properties and structure similar to those of simple agar gels, reinforced by proteins electrostatically aggregated to the agar network. The behaviour towards heat treatment was similar to that of agar alone, with a high thermal hysteresis and almost full reversibility. In the presence of citrate buffer, the result was a "flocculated solid", with low water content (75-81%), whose properties were governed by protein behaviour. Copyright © 2014 Elsevier Ltd. All rights reserved.
Jers, Carsten; Soufi, Boumediene; Grangeasse, Christophe; Deutscher, Josef; Mijakovic, Ivan
2008-08-01
Bacteria use protein phosphorylation to regulate all kinds of physiological processes. Protein phosphorylation plays a role in several key steps of the infection process of bacterial pathogens, such as adhesion to the host, triggering and regulation of pathogenic functions as well as biochemical warfare: scrambling the host signaling cascades and impairing its defense mechanisms. Recent phosphoproteomic studies indicate that the bacterial protein phosphorylation networks could be more complex than initially expected, comprising promiscuous kinases that regulate several distinct cellular functions by phosphorylating different protein substrates. Recent advances in protein labeling with stable isotopes in the field of quantitative mass spectrometry phosphoproteomics will enable us to chart the global phosphorylation networks and to understand the implication of protein phosphorylation in cellular regulation on the systems scale. For the study of bacterial pathogens, in particular, this research avenue will enable us to dissect phosphorylation-related events during different stages of infection and stimulate our efforts to find inhibitors for key kinases and phosphatases implicated therein.
Boyanova, Desislava; Nilla, Santosh; Klau, Gunnar W.; Dandekar, Thomas; Müller, Tobias; Dittrich, Marcus
2014-01-01
The continuously evolving field of proteomics produces increasing amounts of data while improving the quality of protein identifications. Although quantitative measurements are becoming more popular, many proteomic studies are still based on non-quantitative methods for protein identification. These studies result in potentially large sets of identified proteins, where the biological interpretation of proteins can be challenging. Systems biology develops innovative network-based methods, which allow an integrated analysis of these data. Here we present a novel approach, which combines prior knowledge of protein-protein interactions (PPI) with proteomics data using functional similarity measurements of interacting proteins. This integrated network analysis exactly identifies network modules with a maximal consistent functional similarity reflecting biological processes of the investigated cells. We validated our approach on small (H9N2 virus-infected gastric cells) and large (blood constituents) proteomic data sets. Using this novel algorithm, we identified characteristic functional modules in virus-infected cells, comprising key signaling proteins (e.g. the stress-related kinase RAF1), and demonstrated that this method allows a module-based functional characterization of cell types. Analysis of a large proteome data set of blood constituents resulted in clear separation of blood cells according to their developmental origin. A detailed investigation of the T-cell proteome further illustrates how the algorithm partitions large networks into functional subnetworks each representing specific cellular functions. These results demonstrate that the integrated network approach not only allows a detailed analysis of proteome networks but also yields a functional decomposition of complex proteomic data sets and thereby provides deeper insights into the underlying cellular processes of the investigated system. PMID:24807868
Nicotine affects protein complex rearrangement in Caenorhabditis elegans cells.
Sobkowiak, Robert; Zielezinski, Andrzej; Karlowski, Wojciech M; Lesicki, Andrzej
2017-10-01
Nicotine may affect cell function by rearranging protein complexes. We aimed to determine nicotine-induced alterations of protein complexes in Caenorhabditis elegans (C. elegans) cells, thereby revealing links between nicotine exposure and protein complex modulation. We compared the proteomic alterations induced by low and high nicotine concentrations (0.01 mM and 1 mM) with the control (no nicotine) in vivo by using mass spectrometry (MS)-based techniques, specifically the cetyltrimethylammonium bromide (CTAB) discontinuous gel electrophoresis coupled with liquid chromatography (LC)-MS/MS and spectral counting. As a result, we identified dozens of C. elegans proteins that are present exclusively or in higher abundance in either nicotine-treated or untreated worms. Based on these results, we report a possible network that captures the key protein components of nicotine-induced protein complexes and speculate how the different protein modules relate to their distinct physiological roles. Using functional annotation of detected proteins, we hypothesize that the identified complexes can modulate the energy metabolism and level of oxidative stress. These proteins can also be involved in modulation of gene expression and may be crucial in Alzheimer's disease. The findings reported in our study reveal putative intracellular interactions of many proteins with the cytoskeleton and may contribute to the understanding of the mechanisms of nicotinic acetylcholine receptor (nAChR) signaling and trafficking in cells.
VEGF Triggers the Activation of Cofilin and the Arp2/3 Complex within the Growth Cone
Schlau, Matthias; Terheyden-Keighley, Daniel; Theis, Verena; Mannherz, Hans Georg; Theiss, Carsten
2018-01-01
A crucial neuronal structure for the development and regeneration of neuronal networks is the axonal growth cone. Affected by different guidance cues, it grows in a predetermined direction to reach its final destination. One of those cues is the vascular endothelial growth factor (VEGF), which was identified as a positive effector for growth cone movement. These positive effects are mainly mediated by a reorganization of the actin network. This study shows that VEGF triggers a tight colocalization of cofilin and the Arp2/3 complex to the actin cytoskeleton within chicken dorsal root ganglia (DRG). Live cell imaging after microinjection of GFP (green fluorescent protein)-cofilin and RFP (red fluorescent protein)-LifeAct revealed that both labeled proteins rapidly redistributed within growth cones, and showed a congruent distribution pattern after VEGF supplementation. Disruption of signaling upstream of cofilin via blocking LIM-kinase (LIMK) activity resulted in growth cones displaying regressive growth behavior. Microinjection of GFP-p16b (a subunit of the Arp2/3 complex) and RFP-LifeAct revealed that both proteins redistributed into lamellipodia of the growth cone within minutes after VEGF stimulation. Disruption of the signaling to the Arp2/3 complex in the presence of VEGF by inhibition of N-WASP (neuronal Wiskott–Aldrich syndrome protein) caused retraction of growth cones. Hence, cofilin and the Arp2/3 complex appear to be downstream effector proteins of VEGF signaling to the actin cytoskeleton of DRG growth cones. Our data suggest that VEGF simultaneously affects different pathways for signaling to the actin cytoskeleton, since activation of cofilin occurs via inhibition of LIMK, whereas activation of Arp2/3 is achieved by stimulation of N-WASP. PMID:29382077
Structural principles within the human-virus protein-protein interaction network
Franzosa, Eric A.; Xia, Yu
2011-01-01
General properties of the antagonistic biomolecular interactions between viruses and their hosts (exogenous interactions) remain poorly understood, and may differ significantly from known principles governing the cooperative interactions within the host (endogenous interactions). Systems biology approaches have been applied to study the combined interaction networks of virus and human proteins, but such efforts have so far revealed only low-resolution patterns of host-virus interaction. Here, we layer curated and predicted 3D structural models of human-virus and human-human protein complexes on top of traditional interaction networks to reconstruct the human-virus structural interaction network. This approach reveals atomic resolution, mechanistic patterns of host-virus interaction, and facilitates systematic comparison with the host’s endogenous interactions. We find that exogenous interfaces tend to overlap with and mimic endogenous interfaces, thereby competing with endogenous binding partners. The endogenous interfaces mimicked by viral proteins tend to participate in multiple endogenous interactions which are transient and regulatory in nature. While interface overlap in the endogenous network results largely from gene duplication followed by divergent evolution, viral proteins frequently achieve interface mimicry without any sequence or structural similarity to an endogenous binding partner. Finally, while endogenous interfaces tend to evolve more slowly than the rest of the protein surface, exogenous interfaces—including many sites of endogenous-exogenous overlap—tend to evolve faster, consistent with an evolutionary “arms race” between host and pathogen. These significant biophysical, functional, and evolutionary differences between host-pathogen and within-host protein-protein interactions highlight the distinct consequences of antagonism versus cooperation in biological networks. PMID:21680884
Inborn errors of metabolism and the human interactome: a systems medicine approach.
Woidy, Mathias; Muntau, Ania C; Gersting, Søren W
2018-02-05
The group of inborn errors of metabolism (IEM) displays a marked heterogeneity, and IEM can affect virtually all functions and organs of the human organism; however, what all IEM have in common is that their associated proteins function in metabolism. Most proteins carry out cellular functions by interacting with other proteins, and thus are organized in biological networks. Therefore, diseases are rarely the consequence of single gene mutations but of the perturbations caused in the related cellular network. Systematic approaches that integrate multi-omics and database information into biological networks have successfully expanded our knowledge of complex disorders, but network-based strategies have rarely been applied to study IEM. We analyzed IEM on a proteome scale and found that IEM-associated proteins are organized as a network of linked modules within the human interactome of protein interactions, the IEM interactome. Certain IEM disease groups formed self-contained disease modules, which were highly interlinked. On the other hand, we observed disease modules consisting of proteins from many different disease groups in the IEM interactome. Moreover, we explored the overlap between IEM and non-IEM disease genes and applied network medicine approaches to investigate shared biological pathways, clinical signs and symptoms, and links to drug targets. The provided resources may help to elucidate the molecular mechanisms underlying new IEM, to uncover the significance of disease-associated mutations, to identify new biomarkers, and to develop novel therapeutic strategies.
Gloaguen, Pauline; Bournais, Sylvain; Alban, Claude; Ravanel, Stéphane; Seigneurin-Berny, Daphné; Matringe, Michel; Tardif, Marianne; Kuntz, Marcel; Ferro, Myriam; Bruley, Christophe; Rolland, Norbert; Vandenbrouck, Yves; Curien, Gilles
2017-06-01
Higher plants, as autotrophic organisms, are effective sources of molecules. They hold great promise for metabolic engineering, but the behavior of plant metabolism at the network level is still incompletely described. Although structural models (stoichiometry matrices) and pathway databases are extremely useful, they cannot describe the complexity of the metabolic context, and new tools are required to visually represent integrated biocurated knowledge for use by both humans and computers. Here, we describe ChloroKB, a Web application (http://chlorokb.fr/) for visual exploration and analysis of the Arabidopsis (Arabidopsis thaliana) metabolic network in the chloroplast and related cellular pathways. The network was manually reconstructed through extensive biocuration to provide transparent traceability of experimental data. Proteins and metabolites were placed in their biological context (spatial distribution within cells, connectivity in the network, participation in supramolecular complexes, and regulatory interactions) using CellDesigner software. The network contains 1,147 reviewed proteins (559 localized exclusively in plastids, 68 in at least one additional compartment, and 520 outside the plastid), 122 proteins awaiting biochemical/genetic characterization, and 228 proteins for which genes have not yet been identified. The visual presentation is intuitive and browsing is fluid, providing instant access to the graphical representation of integrated processes and to a wealth of refined qualitative and quantitative data. ChloroKB will be a significant support for structural and quantitative kinetic modeling, for biological reasoning, when comparing novel data with established knowledge, for computer analyses, and for educational purposes. ChloroKB will be enhanced by continuous updates following contributions from plant researchers. © 2017 American Society of Plant Biologists. All Rights Reserved.
Vaquero, J; Nguyen Ho-Bouldoires, T H; Clapéron, A; Fouassier, L
2017-06-01
The transmission of cellular information requires fine and subtle regulation of proteins that need to interact in a coordinated and specific way to form efficient signaling networks. The spatial and temporal coordination relies on scaffold proteins. Thanks to protein interaction domains such as PDZ domains, scaffold proteins organize multiprotein complexes enabling the proper transmission of cellular information through intracellular networks. NHERF1/EBP50 is a PDZ-scaffold protein that was initially identified as an organizer and regulator of transporters and channels at the apical side of epithelia through actin-binding ezrin-moesin-radixin proteins. Since then, NHERF1/EBP50 has emerged as a major regulator of cancer signaling networks by assembling cancer-related proteins. The PDZ-scaffold EBP50 carries either anti-tumor or pro-tumor functions, two antinomic functions dictated by EBP50 expression or subcellular localization. The dual function of NHERF1/EBP50 encompasses the regulation of several major signaling pathways engaged in cancer, including the receptor tyrosine kinases PDGFR and EGFR, PI3K/PTEN/AKT and Wnt-β-catenin pathways.
Community of protein complexes impacts disease association
Wang, Qianghu; Liu, Weisha; Ning, Shangwei; Ye, Jingrun; Huang, Teng; Li, Yan; Wang, Peng; Shi, Hongbo; Li, Xia
2012-01-01
One important challenge in the post-genomic era is uncovering the relationships among distinct pathophenotypes by using molecular signatures. Given the complex functional interdependencies between cellular components, a disease is seldom the consequence of a defect in a single gene product, instead reflecting the perturbations of a group of closely related gene products that carry out specific functions together. Therefore, it is meaningful to explore how the community of protein complexes impacts disease associations. Here, by integrating a large amount of information from protein complexes and the cellular basis of diseases, we built a human disease network in which two diseases are linked if they share a common disease-related protein complex. A systemic analysis revealed that linked disease pairs exhibit higher comorbidity than those that have no links, and that the stronger the association between two diseases based on protein complexes, the higher the comorbidity they are prone to display. Moreover, more connected diseases tend to be malignant, which have high prevalence. We provide novel disease associations that cannot be identified through previous analysis. These findings will potentially provide biologists and clinicians new insights into the etiology, classification and treatment of diseases. PMID:22549411
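The linking rule described above (two diseases are connected if they share a disease-related protein complex) can be sketched in a few lines. This is a minimal illustration rather than the authors' pipeline: the disease and complex names are hypothetical, and using the number of shared complexes as an edge weight is an assumption, since the abstract does not define how association strength was measured.

```python
from itertools import combinations


def build_disease_network(disease_to_complexes):
    """Link two diseases if they share at least one protein complex.

    Returns {(disease_a, disease_b): number_of_shared_complexes}.
    The shared-complex count as edge weight is an illustrative assumption.
    """
    edges = {}
    for a, b in combinations(sorted(disease_to_complexes), 2):
        shared = disease_to_complexes[a] & disease_to_complexes[b]
        if shared:
            edges[(a, b)] = len(shared)
    return edges


# Hypothetical toy annotation of diseases with protein complexes.
toy = {
    "disease_A": {"complex_1", "complex_2"},
    "disease_B": {"complex_2", "complex_3"},
    "disease_C": {"complex_4"},
    "disease_D": {"complex_1", "complex_2", "complex_3"},
}
network = build_disease_network(toy)
for pair, weight in network.items():
    print(pair, weight)
```

In this toy input, disease_C shares no complex with any other disease and therefore stays unlinked, which is the kind of pair the abstract contrasts with linked pairs when comparing comorbidity.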
Hidden long evolutionary memory in a model biochemical network
NASA Astrophysics Data System (ADS)
Ali, Md. Zulfikar; Wingreen, Ned S.; Mukhopadhyay, Ranjan
2018-04-01
We introduce a minimal model for the evolution of functional protein-interaction networks using a sequence-based mutational algorithm, and apply the model to study neutral drift in networks that yield oscillatory dynamics. Starting with a functional core module, random evolutionary drift increases network complexity even in the absence of specific selective pressures. Surprisingly, we uncover a hidden order in sequence space that gives rise to long-term evolutionary memory, implying strong constraints on network evolution due to the topology of accessible sequence space.
A mathematical model for generating bipartite graphs and its application to protein networks
NASA Astrophysics Data System (ADS)
Nacher, J. C.; Ochiai, T.; Hayashida, M.; Akutsu, T.
2009-12-01
Complex systems arise in many different contexts from large communication systems and transportation infrastructures to molecular biology. Most of these systems can be organized into networks composed of nodes and interacting edges. Here, we present a theoretical model that constructs bipartite networks with the particular feature that the degree distribution can be tuned depending on the probability rate of fundamental processes. We then use this model to investigate protein-domain networks. A protein can be composed of up to hundreds of domains. Each domain represents a conserved sequence segment with specific functional tasks. We analyze the distribution of domains in Homo sapiens and Arabidopsis thaliana, and the statistical analysis shows that while (a) the number of domain types shared by k proteins exhibits a power-law distribution, (b) the number of proteins composed of k types of domains decays as an exponential distribution. The proposed mathematical model generates bipartite graphs and predicts the emergence of this mixing of (a) power-law and (b) exponential distributions. Our theoretical and computational results show that this model requires (1) a growth process and (2) a copy mechanism.
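The two distributions (a) and (b) are the degree histograms of the two node classes in a protein-domain bipartite graph. The sketch below shows how they could be tabulated from a protein-to-domain mapping; it does not implement the authors' generative model, and the protein and domain names are toy placeholders.

```python
from collections import Counter


def bipartite_degree_histograms(protein_domains):
    """protein_domains: {protein: set of domain types}.

    Returns (a, b) where
      a[k] = number of domain types shared by k proteins,
      b[k] = number of proteins composed of k domain types.
    """
    domain_degree = Counter()
    for domains in protein_domains.values():
        domain_degree.update(domains)  # each domain type counted once per protein
    a = Counter(domain_degree.values())
    b = Counter(len(domains) for domains in protein_domains.values())
    return dict(a), dict(b)


# Hypothetical toy mapping; real input would come from a domain annotation resource.
toy = {
    "prot1": {"kinase", "SH2"},
    "prot2": {"kinase"},
    "prot3": {"kinase", "SH3", "PDZ"},
    "prot4": {"SH2"},
}
a, b = bipartite_degree_histograms(toy)
print("a:", a)  # domain-side histogram
print("b:", b)  # protein-side histogram
```

On genome-scale data, plotting `a` on log-log axes and `b` on semi-log axes would be the usual way to look for the power-law and exponential behaviour the abstract reports.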
Mazloom, Amin R.; Dannenfelser, Ruth; Clark, Neil R.; Grigoryan, Arsen V.; Linder, Kathryn M.; Cardozo, Timothy J.; Bond, Julia C.; Boran, Aislyn D. W.; Iyengar, Ravi; Malovannaya, Anna; Lanz, Rainer B.; Ma'ayan, Avi
2011-01-01
Coregulator proteins (CoRegs) are part of multi-protein complexes that transiently assemble with transcription factors and chromatin modifiers to regulate gene expression. In this study we analyzed data from 3,290 immuno-precipitations (IP) followed by mass spectrometry (MS) applied to human cell lines aimed at identifying CoRegs complexes. Using the semi-quantitative spectral counts, we scored binary protein-protein and domain-domain associations with several equations. Unlike previous applications, our methods scored prey-prey protein-protein interactions regardless of the baits used. We also predicted domain-domain interactions underlying predicted protein-protein interactions. The quality of predicted protein-protein and domain-domain interactions was evaluated using known binary interactions from the literature, whereas one protein-protein interaction, between STRN and CTTNBP2NL, was validated experimentally; and one domain-domain interaction, between the HEAT domain of PPP2R1A and the Pkinase domain of STK25, was validated using molecular docking simulations. The scoring schemes presented here recovered known, and predicted many new, complexes, protein-protein, and domain-domain interactions. The networks that resulted from the predictions are provided as a web-based interactive application at http://maayanlab.net/HT-IP-MS-2-PPI-DDI/. PMID:22219718
L-GRAAL: Lagrangian graphlet-based network aligner.
Malod-Dognin, Noël; Pržulj, Nataša
2015-07-01
Discovering and understanding patterns in networks of protein-protein interactions (PPIs) is a central problem in systems biology. Alignments between these networks aid functional understanding as they uncover important information, such as evolutionarily conserved pathways, protein complexes and functional orthologs. A few methods have been proposed for global PPI network alignments, but because of the NP-completeness of the underlying sub-graph isomorphism problem, producing topologically and biologically accurate alignments remains a challenge. We introduce a novel global network alignment tool, Lagrangian GRAphlet-based ALigner (L-GRAAL), which directly optimizes both the protein and the interaction functional conservations, using a novel alignment search heuristic based on integer programming and Lagrangian relaxation. We compare L-GRAAL with the state-of-the-art network aligners on the largest available PPI networks from BioGRID and observe that L-GRAAL uncovers the largest common sub-graphs between the networks, as measured by edge-correctness and symmetric sub-structures scores, which allow transferring more functional information across networks. We assess the biological quality of the protein mappings using the semantic similarity of their Gene Ontology annotations and observe that L-GRAAL best uncovers functionally conserved proteins. Furthermore, we introduce for the first time a measure of the semantic similarity of the mapped interactions and show that L-GRAAL also best uncovers functionally conserved interactions. In addition, we illustrate on the PPI networks of baker's yeast and human the ability of L-GRAAL to predict new PPIs. Finally, L-GRAAL's results are the first to show that topological information is more important than sequence information for uncovering functionally conserved interactions. L-GRAAL is coded in C++. Software is available at: http://bio-nets.doc.ic.ac.uk/L-GRAAL/.
Contact: n.malod-dognin@imperial.ac.uk. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
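The edge-correctness score mentioned in the L-GRAAL abstract can be illustrated with a short sketch. It assumes the commonly used definition (the fraction of edges of the smaller network that the alignment maps onto edges of the larger network); the function name, the toy networks and the mapping are hypothetical, and this is not L-GRAAL's own C++ code.

```python
def edge_correctness(edges_small, edges_large, mapping):
    """Fraction of edges in the smaller network that the alignment
    maps onto an edge of the larger network (assumed definition)."""
    # Undirected edges: store the larger network's edges as unordered pairs.
    large = {frozenset(e) for e in edges_large}
    # Deduplicate the smaller network's edges regardless of orientation.
    small = {tuple(sorted(e)) for e in edges_small}
    if not small:
        return 0.0
    conserved = sum(
        1
        for u, v in small
        if u in mapping and v in mapping
        and frozenset((mapping[u], mapping[v])) in large
    )
    return conserved / len(small)


# Hypothetical toy networks and node mapping, for illustration only.
edges_a = [("a1", "a2"), ("a2", "a3"), ("a1", "a3")]
edges_b = [("b1", "b2"), ("b2", "b3"), ("b3", "b4")]
alignment = {"a1": "b1", "a2": "b2", "a3": "b3"}

# Two of the three edges of network A land on edges of network B.
ec = edge_correctness(edges_a, edges_b, alignment)
print(ec)
```

A higher score means that more of the smaller network's wiring is preserved under the mapping, which is why the abstract uses it as a proxy for the size of the common sub-graph.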
Theory for the Emergence of Modularity in Complex Systems
NASA Astrophysics Data System (ADS)
Deem, Michael; Park, Jeong-Man
2013-03-01
Biological systems are modular, and this modularity evolves over time and in different environments. A number of observations have been made of increased modularity in biological systems under increased environmental pressure. We here develop a theory for the dynamics of modularity in these systems. We find a principle of least action for the evolved modularity at long times. In addition, we find a fluctuation dissipation relation for the rate of change of modularity at short times. We discuss a number of biological and social systems that can be understood with this framework. The modularity of the protein-protein interaction network increases when yeast are exposed to heat shock, and the modularity of the protein-protein networks in both yeast and E. coli appears to have increased over evolutionary time. Food webs in low-energy, stressful environments are more modular than those in plentiful environments, arid ecologies are more modular during droughts, and foraging of sea otters is more modular when food is limiting. The modularity of social networks changes over time: stockbrokers' instant messaging networks are more modular under stressful market conditions, criminal networks are more modular under increased police pressure, and world trade network modularity has decreased
Chen, Langdong; Cao, Yan; Zhang, Hai; Lv, Diya; Zhao, Yahong; Liu, Yanjun; Ye, Guan; Chai, Yifeng
2018-01-31
Yangxinshi tablet (YXST) is an effective treatment for heart failure and myocardial infarction; it consists of 13 herbal medicines formulated according to traditional Chinese Medicine (TCM) practices. It has been used for the treatment of cardiovascular disease for many years in China. In this study, a network pharmacology-based strategy was used to elucidate the mechanism of action of YXST for the treatment of heart failure. Cardiovascular disease-related protein target and compound databases were constructed for YXST. A molecular docking platform was used to predict the protein targets of YXST. The affinity between proteins and ingredients was determined using surface plasmon resonance (SPR) assays. The action modes between targets and representative ingredients were calculated using Glide docking, and the related pathways were predicted using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. A protein target database containing 924 proteins was constructed; 179 compounds in YXST were identified, and 48 compounds with high relevance to the proteins were defined as representative ingredients. Thirty-four protein targets of the 48 representative ingredients were analyzed and classified into two categories: immune and cardiovascular systems. The SPR assay and molecular docking partly validated the interplay between protein targets and representative ingredients. Moreover, 28 pathways related to heart failure were identified, which provided directions for further research on YXST. This study demonstrated that the cardiovascular protective effect of YXST mainly involved the immune and cardiovascular systems. Through the research strategy based on network pharmacology, we analyzed the complex system of YXST and found 48 representative compounds, 34 proteins and 28 related pathways of YXST, which could help us understand the underlying mechanism of YXST's anti-heart failure effect.
The network-based investigation could help researchers simplify the complex system of YXST. It may also offer a feasible approach to decipher the chemical and pharmacological bases of other TCM formulas. Copyright © 2018 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Canino, Lawrence S.; Shen, Tongye; McCammon, J. Andrew
2002-12-01
We extend the self-consistent pair contact probability method to the evaluation of the partition function for a protein complex at thermodynamic equilibrium. Specifically, we adapt the method for multichain models and introduce a parametrization for amino acid-specific pairwise interactions. This method is similar to the Gaussian network model but allows for the adjusting of the strengths of native state contacts. The method is first validated on a high resolution x-ray crystal structure of bovine Pancreatic Phospholipase A2 by comparing calculated B-factors with reported values. We then examine binding-induced changes in flexibility in protein-protein complexes, comparing computed results with those obtained from x-ray crystal structures and molecular dynamics simulations. In particular, we focus on the mouse acetylcholinesterase:fasciculin II and the human α-thrombin:thrombomodulin complexes.
Surface energetics and protein-protein interactions: analysis and mechanistic implications
Peri, Claudio; Morra, Giulia; Colombo, Giorgio
2016-01-01
Understanding protein-protein interactions (PPI) at the molecular level is a fundamental task in the design of new drugs, the prediction of protein function and the clarification of the mechanisms of (dis)regulation of biochemical pathways. In this study, we use a novel computational approach to investigate the energetics of amino acid networks located on the surface of proteins, isolated and in complex with their respective partners. Interestingly, the analysis of individual proteins identifies patches of surface residues that, when mapped on the structure of their respective complexes, reveal regions of residue-pair couplings that extend across the binding interfaces, forming continuous motifs. An enhanced effect is visible across the proteins of the dataset forming larger quaternary assemblies. The method indicates the presence of energetic signatures in the isolated proteins that are retained in the bound form, which we hypothesize to determine binding orientation upon complex formation. We propose our method, BLUEPRINT, as a complement to different approaches ranging from the ab-initio characterization of PPIs, to protein-protein docking algorithms, for the physico-chemical and functional investigation of protein-protein interactions. PMID:27050828
Mezlini, Aziz M; Goldenberg, Anna
2017-10-01
Discovering genetic mechanisms driving complex diseases is a hard problem. Existing methods often lack power to identify the set of responsible genes. Protein-protein interaction networks have been shown to boost power when detecting gene-disease associations. We introduce a Bayesian framework, Conflux, to find disease associated genes from exome sequencing data using networks as a prior. There are two main advantages to using networks within a probabilistic graphical model. First, networks are noisy and incomplete, a substantial impediment to gene discovery. Incorporating networks into the structure of a probabilistic model for gene inference has less impact on the solution than relying on the noisy network structure directly. Second, using a Bayesian framework we can keep track of the uncertainty of each gene being associated with the phenotype rather than returning a fixed list of genes. We first show that using networks clearly improves gene detection compared to individual gene testing. We then show consistently improved performance of Conflux compared to the state-of-the-art diffusion network-based method Hotnet2 and a variety of other network and variant aggregation methods, using randomly generated and literature-reported gene sets. We test Hotnet2 and Conflux on several network configurations to reveal biases and patterns of false positives and false negatives in each case. Our experiments show that our novel Bayesian framework Conflux incorporates many of the advantages of the current state-of-the-art methods, while offering more flexibility and improved power in many gene-disease association scenarios.
ESCRT-dependent degradation of ubiquitylated plasma membrane proteins in plants.
Isono, Erika; Kalinowska, Kamila
2017-12-01
Controlling the abundance of plasma membrane receptors and transporters is crucial for proper perception of and response to extracellular signals from surrounding cells and the environment. Posttranslational modification of plasma membrane proteins, especially ubiquitin conjugation or ubiquitylation, is key for determining the stability of many transmembrane proteins localized on the cell surface. The targeted degradation is ensured by a complex network of proteins among which the endosomal sorting complex required for transport (ESCRT) plays a central role. This review focuses on progress made in recent years in understanding the function of the ESCRT machinery in the degradation of ubiquitylated plasma membrane proteins in plants. Copyright © 2017 Elsevier Ltd. All rights reserved.
Lapek, John D; Greninger, Patricia; Morris, Robert; Amzallag, Arnaud; Pruteanu-Malinici, Iulian; Benes, Cyril H; Haas, Wilhelm
2017-10-01
The formation of protein complexes and the co-regulation of the cellular concentrations of proteins are essential mechanisms for cellular signaling and for maintaining homeostasis. Here we use isobaric-labeling multiplexed proteomics to analyze protein co-regulation and show that this allows the identification of protein-protein associations with high accuracy. We apply this 'interactome mapping by high-throughput quantitative proteome analysis' (IMAHP) method to a panel of 41 breast cancer cell lines and show that deviations of the observed protein co-regulations in specific cell lines from the consensus network affect cellular fitness. Furthermore, these aberrant interactions serve as biomarkers that predict the drug sensitivity of cell lines in screens across 195 drugs. We expect that IMAHP can be broadly used to gain insight into how changing landscapes of protein-protein associations affect the phenotype of biological systems.
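The idea of inferring associations from protein co-regulation can be sketched as follows. The abstract does not state which statistic IMAHP uses, so the Pearson correlation, the 0.9 threshold, the function names and the abundance values below are all assumptions chosen for illustration.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equally long abundance profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    # A constant profile has no defined correlation; report 0.0 instead.
    return cov / (sd_x * sd_y) if sd_x and sd_y else 0.0


def coregulated_pairs(abundance, threshold=0.9):
    """Return (protein, protein, r) for pairs whose profiles across
    cell lines correlate at or above the (assumed) threshold."""
    pairs = []
    for p, q in combinations(sorted(abundance), 2):
        r = pearson(abundance[p], abundance[q])
        if r >= threshold:
            pairs.append((p, q, r))
    return pairs


# Hypothetical relative abundances of three proteins in four cell lines.
abundance = {
    "P1": [1.0, 2.0, 3.0, 4.0],
    "P2": [2.1, 3.9, 6.2, 8.0],
    "P3": [4.0, 1.0, 3.5, 2.0],
}
candidate_pairs = coregulated_pairs(abundance)
print([(p, q) for p, q, _ in candidate_pairs])
```

In this toy input only P1 and P2 rise and fall together across the cell lines, so they form the single candidate association; a cell line whose profile breaks such a correlation would be a deviation from the consensus network in the sense of the abstract.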
Prediction of cassava protein interactome based on interolog method.
Thanasomboon, Ratana; Kalapanulak, Saowalak; Netrphan, Supatcharee; Saithong, Treenut
2017-12-08
Cassava is a starchy root crop whose role in food security is becoming increasingly significant. Together with its versatile industrial uses, demand for cassava starch is continuously growing. However, in-depth study of cellular regulation, especially the interaction between proteins, is lacking. To reduce the knowledge gap in protein-protein interaction (PPI), a genome-scale PPI network of cassava was constructed using an interolog-based method (MePPI-In, available at http://bml.sbi.kmutt.ac.th/ppi). The network was constructed from the information of seven template plants. MePPI-In included 90,173 interactions from 7,209 proteins. At least 39 percent of the total predictions were supported by gene/protein expression data, while further co-expression analysis yielded 16 highly promising PPIs. In addition, domain-domain interaction information was employed to increase the reliability of the network and guide the search for more groups of promising PPIs. Moreover, the topology and functional content of MePPI-In were similar to those of the networks of Arabidopsis and rice. The potential contribution of MePPI-In to various applications, such as protein-complex formation and prediction of protein function, was discussed and exemplified. The insights provided by MePPI-In would hopefully enable us to pursue precise trait improvement in cassava.
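The interolog principle behind such a network can be sketched briefly: if two proteins interact in a template species and each has an ortholog in the target species, the ortholog pair is predicted to interact. The function name, the protein identifiers and the ortholog table below are hypothetical, and the sketch omits the expression-based and domain-based filtering described in the abstract.

```python
def predict_interologs(template_ppis, orthologs):
    """Transfer template interactions to the target species.

    template_ppis: iterable of (protein, protein) pairs in a template species.
    orthologs: dict mapping a template protein to a set of target proteins.
    Returns a set of predicted target pairs, each stored in sorted order.
    """
    predicted = set()
    for p, q in template_ppis:
        # Every combination of orthologs of the two partners is a candidate.
        for a in orthologs.get(p, ()):
            for b in orthologs.get(q, ()):
                predicted.add(tuple(sorted((a, b))))
    return predicted


# Hypothetical template interactions and ortholog assignments.
template_ppis = [("AT1", "AT2"), ("AT2", "AT3")]
orthologs = {"AT1": {"Me1"}, "AT2": {"Me2", "Me3"}}  # AT3 has no ortholog

predicted_ppis = predict_interologs(template_ppis, orthologs)
print(sorted(predicted_ppis))
```

With several template species, the same function could be run once per template and the resulting sets merged, which is one simple way to combine evidence from multiple plants.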
Loris, R; De Greve, H; Dao-Thi, M H; Messens, J; Imberty, A; Wyns, L
2000-08-25
Protein-carbohydrate interactions are the language of choice for intercellular communication. The legume lectins form a large family of homologous proteins that exhibit a wide variety of carbohydrate specificities. The legume lectin family is therefore highly suitable as a model system to study the structural principles of protein-carbohydrate recognition. Until now, structural data are only available for two specificity families: Man/Glc and Gal/GalNAc. No structural data are available for any of the fucose or chitobiose specific lectins. The crystal structure of Ulex europaeus (UEA-II) is the first of a legume lectin belonging to the chitobiose specificity group. The complexes with N-acetylglucosamine, galactose and fucosylgalactose show a promiscuous primary binding site capable of accommodating either N-acetylglucosamine or galactose. The hydrogen bonding network in these complexes can be considered suboptimal, in agreement with the low affinities of these sugars. In the complexes with chitobiose, lactose and fucosyllactose this suboptimal hydrogen bonding network is compensated by extensive hydrophobic interactions in a Glc/GlcNAc binding subsite. UEA-II thus forms the first example of a legume lectin with a promiscuous binding site and illustrates the importance of hydrophobic interactions in protein-carbohydrate complexes. Together with other known legume lectin crystal structures, it shows how different specificities can be grafted upon a conserved structural framework. Copyright 2000 Academic Press.
Rose, Rachel H; Briddon, Stephen J; Holliday, Nicholas D
2010-01-01
There is increasing complexity in the organization of seven transmembrane domain (7TM) receptor signalling pathways, and in the ability of their ligands to modulate and direct this signalling. Underlying these events is a network of protein interactions between the 7TM receptors themselves and associated effectors, such as G proteins and β-arrestins. Bimolecular fluorescence complementation, or BiFC, is a technique capable of detecting these protein–protein events essential for 7TM receptor function. Fluorescent proteins, such as those from Aequorea victoria, are split into two non-fluorescent halves, which then tag the proteins under study. On association, these fragments refold and regenerate a mature fluorescent protein, producing a BiFC signal indicative of complex formation. Here, we review the experimental criteria for successful application of BiFC, considered in the context of 7TM receptor signalling events such as receptor dimerization, G protein and β-arrestin signalling. The advantages and limitations of BiFC imaging are compared with alternative resonance energy transfer techniques. We show that the essential simplicity of the fluorescent BiFC measurement allows high-content and advanced imaging applications, and that it can probe more complex multi-protein interactions alone or in combination with resonance energy transfer. These capabilities suggest that BiFC techniques will become ever more useful in the analysis of ligand and 7TM receptor pharmacology at the molecular level of protein–protein interactions. This article is part of a themed section on Imaging in Pharmacology. To view the editorial for this themed section visit http://dx.doi.org/10.1111/j.1476-5381.2010.00685.x PMID:20015298
Xia, Kai; Dong, Dong; Han, Jing-Dong J
2006-01-01
Background: Although protein-protein interaction (PPI) networks have been explored by various experimental methods, the maps so built are still limited in coverage and accuracy. To further expand the PPI network and to extract more accurate information from existing maps, studies have been carried out to integrate various types of functional relationship data. A frequently updated database of computationally analyzed potential PPIs to provide biological researchers with rapid and easy access to analyze original data as a biological network is still lacking. Results: By applying a probabilistic model, we integrated 27 heterogeneous genomic, proteomic and functional annotation datasets to predict PPI networks in human. In addition to previously studied data types, we show that phenotypic distances and genetic interactions can also be integrated to predict PPIs. We further built an easy-to-use, updatable integrated PPI database, the Integrated Network Database (IntNetDB) online, to provide automatic prediction and visualization of PPI networks among genes of interest. The networks can be visualized in SVG (Scalable Vector Graphics) format for zooming in or out. IntNetDB also provides a tool to extract topologically highly connected network neighborhoods from a specific network for further exploration and research. Using the MCODE (Molecular Complex Detection) algorithm, 190 such neighborhoods were detected among all the predicted interactions. The predicted PPIs can also be mapped to worm, fly and mouse interologs. Conclusion: IntNetDB includes 180,010 predicted protein-protein interactions among 9,901 human proteins and represents a useful resource for the research community. Our study has increased prediction coverage by five-fold. IntNetDB also provides easy-to-use network visualization and analysis tools that allow biological researchers unfamiliar with computational biology to access and analyze data over the internet.
The web interface of IntNetDB is freely accessible at . Visualization requires Mozilla version 1.8 (or higher) or Internet Explorer with installation of SVGviewer. PMID:17112386
Hub Protein Controversy: Taking a Closer Look at Plant Stress Response Hubs
Vandereyken, Katy; Van Leene, Jelle; De Coninck, Barbara; Cammue, Bruno P. A.
2018-01-01
Plant stress responses involve numerous changes at the molecular and cellular level and are regulated by highly complex signaling pathways. Studying protein-protein interactions (PPIs) and the resulting networks is therefore becoming increasingly important in understanding these responses. Crucial in PPI networks are the so-called hubs or hub proteins, commonly defined as the most highly connected central proteins in scale-free PPI networks. However, despite their importance, a growing amount of confusion and controversy seems to exist regarding hub protein identification, characterization and classification. In order to highlight these inconsistencies and stimulate further clarification, this review critically analyses the current knowledge on hub proteins in the plant interactome field. We focus on current hub protein definitions, including the properties generally seen as hub-defining, and the challenges and approaches associated with hub protein identification. Furthermore, we give an overview of the most important large-scale plant PPI studies of the last decade that identified hub proteins, pointing out the lack of overlap between different studies. As such, it appears that although major advances are being made in the plant interactome field, defining hub proteins is still heavily dependent on the quality, origin and interpretation of the acquired PPI data. Nevertheless, many hub proteins seem to have a reported role in the plant stress response, including transcription factors, protein kinases and phosphatases, ubiquitin proteasome system related proteins, (co-)chaperones and redox signaling proteins. A significant number of identified plant stress hubs are however still functionally uncharacterized, making them interesting targets for future research. 
This review clearly shows the ongoing improvements in the plant interactome field but also calls attention to the need for a more comprehensive and precise identification of hub proteins, allowing a more efficient systems biology driven unraveling of complex processes, including those involved in stress responses. PMID:29922309
Li, Yongsheng; Sahni, Nidhi; Yi, Song
2016-11-29
Comprehensive understanding of human cancer mechanisms requires the identification of a thorough list of cancer-associated genes, which could serve as biomarkers for diagnoses and therapies in various types of cancer. Although substantial progress has been made in functional studies to uncover genes involved in cancer, these efforts are often time-consuming and costly. Therefore, it remains challenging to comprehensively identify cancer candidate genes. Network-based methods have accelerated this process through the analysis of complex molecular interactions in the cell. However, the extent to which various interactome networks can contribute to prediction of candidate genes responsible for cancer is still enigmatic. In this study, we evaluated different human protein-protein interactome networks and compared their application to cancer gene prioritization. Our results indicate that network analyses can increase the power to identify novel cancer genes. In particular, such predictive power can be enhanced with the use of unbiased systematic protein interaction maps for cancer gene prioritization. Functional analysis reveals that the top ranked genes from network predictions co-occur often with cancer-related terms in literature, and further, these candidate genes are indeed frequently mutated across cancers. Finally, our study suggests that integrating interactome networks with other omics datasets could provide novel insights into cancer-associated genes and underlying molecular mechanisms.
Colloid Surface Chemistry Critically Affects Multiple Particle Tracking Measurements of Biomaterials
Valentine, M. T.; Perlman, Z. E.; Gardel, M. L.; Shin, J. H.; Matsudaira, P.; Mitchison, T. J.; Weitz, D. A.
2004-01-01
Characterization of the properties of complex biomaterials using microrheological techniques has the promise of providing fundamental insights into their biomechanical functions; however, precise interpretations of such measurements are hindered by inadequate characterization of the interactions between tracers and the networks they probe. We here show that colloid surface chemistry can profoundly affect multiple particle tracking measurements of networks of fibrin, entangled F-actin solutions, and networks of cross-linked F-actin. We present a simple protocol to render the surface of colloidal probe particles protein-resistant by grafting short amine-terminated methoxy-poly(ethylene glycol) to the surface of carboxylated microspheres. We demonstrate that these poly(ethylene glycol)-coated tracers adsorb significantly less protein than particles coated with bovine serum albumin or unmodified probe particles. We establish that varying particle surface chemistry selectively tunes the sensitivity of the particles to different physical properties of their microenvironments. Specifically, particles that are weakly bound to a heterogeneous network are sensitive to changes in network stiffness, whereas protein-resistant tracers measure changes in the viscosity of the fluid and in the network microstructure. We demonstrate experimentally that two-particle microrheology analysis significantly reduces differences arising from tracer surface chemistry, indicating that modifications of network properties near the particle do not introduce large-scale heterogeneities. Our results establish that controlling colloid-protein interactions is crucial to the successful application of multiple particle tracking techniques to reconstituted protein networks, cytoplasm, and cells. PMID:15189896
2014-12-03
The Escherichia coli (E. coli) SOS response is the largest, most complex, and best characterized bacterial network induced by DNA damage. It is controlled by a complex network involving the RecA and LexA proteins. We have previously shown that the SOS response to DNA damage ... Keywords: enteric bacterium E. coli, SOS response, DNA damage.
PROXiMATE: a database of mutant protein-protein complex thermodynamics and kinetics.
Jemimah, Sherlyn; Yugandhar, K; Michael Gromiha, M
2017-09-01
We have developed PROXiMATE, a database of thermodynamic data for more than 6000 missense mutations in 174 heterodimeric protein-protein complexes, supplemented with interaction network data from STRING database, solvent accessibility, sequence, structural and functional information, experimental conditions and literature information. Additional features include complex structure visualization, search and display options, download options and a provision for users to upload their data. The database is freely available at http://www.iitm.ac.in/bioinfo/PROXiMATE/. The website is implemented in Python, and supports recent versions of major browsers such as IE10, Firefox, Chrome and Opera. Contact: gromiha@iitm.ac.in. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved.
A G protein alpha null mutation confers prolificacy potential in maize
Urano, Daisuke; Jackson, David; Jones, Alan M.
2015-05-06
Plasticity in plant development is controlled by environmental signals through largely unknown signalling networks. Signalling coupled by the heterotrimeric G protein complex underlies various developmental pathways in plants. The morphology of two plastic developmental pathways, root system architecture and female inflorescence formation, was quantitatively assessed in a mutant compact plant 2 (ct2) lacking the alpha subunit of the heterotrimeric G protein complex in maize. The ct2 mutant partially compensated for a reduced shoot height by increased total leaf number, and had far more ears, even in the presence of pollination signals. Lastly, the maize heterotrimeric G protein complex is important in some plastic developmental traits in maize. In particular, the maize Gα subunit is required to dampen the overproduction of female inflorescences.
Quantitative interaction proteomics using mass spectrometry.
Wepf, Alexander; Glatter, Timo; Schmidt, Alexander; Aebersold, Ruedi; Gstaiger, Matthias
2009-03-01
We present a mass spectrometry-based strategy for the absolute quantification of protein complex components isolated through affinity purification. We quantified bait proteins via isotope-labeled reference peptides corresponding to an affinity tag sequence and prey proteins by label-free correlational quantification using the precursor ion signal intensities of proteotypic peptides generated in reciprocal purifications. We used this method to quantitatively analyze interaction stoichiometries in the human protein phosphatase 2A network.
2015-01-01
Background: Cellular processes are known to be modular and are realized by groups of proteins implicated in common biological functions. Such groups of proteins are called functional modules, and many community detection methods have been devised for their discovery from protein interaction network (PIN) data. In current agglomerative clustering approaches, vertices with very few neighbors are often classified as separate clusters, which does not make sense biologically. Also, a major limitation of agglomerative techniques is that their computational efficiency does not scale well to large PINs. Finally, PIN data obtained from large-scale experiments generally contain many false positives, and this makes it hard for agglomerative clustering methods to find the correct clusters, since they are known to be sensitive to noisy data. Results: We propose a local similarity premetric, the relative vertex clustering value, as a new criterion for deciding when a node can be added to a given node's cluster; this criterion addresses the above three issues. Based on this criterion, we introduce a novel and very fast agglomerative clustering technique, FAC-PIN, for discovering functional modules and protein complexes from PIN data. Conclusions: Our proposed FAC-PIN algorithm is applied to nine PIN datasets from eight different species including the yeast PIN, and the identified functional modules are validated using Gene Ontology (GO) annotations from DAVID Bioinformatics Resources. Identified protein complexes are also validated using experimentally verified complexes. Computational results show that FAC-PIN can discover functional modules or protein complexes from PINs more accurately and more efficiently than HC-PIN and CNM, the current state-of-the-art approaches for clustering PINs in an agglomerative manner. PMID:25734691
Vranish, James N; Das, Deepika; Barondeau, David P
2016-11-18
Iron-sulfur (Fe-S) clusters are protein cofactors that are required for many essential cellular functions. Fe-S clusters are synthesized and inserted into target proteins by an elaborate biosynthetic process. The insensitivity of most Fe-S assembly and transfer assays requires high concentrations for components and places major limits on reaction complexity. Recently, fluorophore labels were shown to be effective at reporting cluster content for Fe-S proteins. Here, the incorporation of this labeling approach allowed the design and interrogation of complex Fe-S cluster biosynthetic reactions that mimic in vivo conditions. A bacterial Fe-S assembly complex, composed of the cysteine desulfurase IscS and scaffold protein IscU, was used to generate [2Fe-2S] clusters for transfer to mixtures of putative intermediate carrier and acceptor proteins. The focus of this study was to test whether the monothiol glutaredoxin, Grx4, functions as an obligate [2Fe-2S] carrier protein in the Fe-S cluster distribution network. Interestingly, [2Fe-2S] clusters generated by the IscS-IscU complex transferred to Grx4 at rates comparable to previous assays using uncomplexed IscU as a cluster source in chaperone-assisted transfer reactions. Further, we provide evidence that [2Fe-2S]-Grx4 delivers clusters to multiple classes of Fe-S targets via direct ligand exchange in a process that is both dynamic and reversible. Global fits of cluster transfer kinetics support a model in which Grx4 outcompetes terminal target proteins for IscU-bound [2Fe-2S] clusters and functions as an intermediate cluster carrier. Overall, these studies demonstrate the power of chemically conjugated fluorophore reporters for unraveling mechanistic details of biological metal cofactor assembly and distribution networks.
Nuclear pore complex tethers to the cytoskeleton.
Goldberg, Martin W
2017-08-01
The nuclear envelope is tethered to the cytoskeleton. The best known attachments of all elements of the cytoskeleton are via the so-called LINC complex. However, the nuclear pore complexes, which mediate the transport of soluble and membrane bound molecules, are also linked to the microtubule network, primarily via motor proteins (dynein and kinesins) which are linked, most importantly, to the cytoplasmic filament protein of the nuclear pore complex, Nup358, by the adaptor BicD2. The evidence for such linkages and possible roles in nuclear migration, cell cycle control, nuclear transport and cell architecture are discussed. Copyright © 2017. Published by Elsevier Ltd.
Revealing the Hidden Language of Complex Networks
Yaveroğlu, Ömer Nebil; Malod-Dognin, Noël; Davis, Darren; Levnajic, Zoran; Janjic, Vuk; Karapandza, Rasa; Stojmirovic, Aleksandar; Pržulj, Nataša
2014-01-01
Sophisticated methods for analysing complex networks promise to be of great benefit to almost all scientific disciplines, yet they elude us. In this work, we make fundamental methodological advances to rectify this. We discover that the interaction between a small number of roles, played by nodes in a network, can characterize a network's structure and also provide a clear real-world interpretation. Given this insight, we develop a framework for analysing and comparing networks, which outperforms all existing ones. We demonstrate its strength by uncovering novel relationships between seemingly unrelated networks, such as Facebook, metabolic, and protein structure networks. We also use it to track the dynamics of the world trade network, showing that a country's role of a broker between non-trading countries indicates economic prosperity, whereas peripheral roles are associated with poverty. This result, though intuitive, has escaped all existing frameworks. Finally, our approach translates network topology into everyday language, bringing network analysis closer to domain scientists. PMID:24686408
Usher syndrome: molecular links of pathogenesis, proteins and pathways.
Kremer, Hannie; van Wijk, Erwin; Märker, Tina; Wolfrum, Uwe; Roepman, Ronald
2006-10-15
Usher syndrome is the most common form of deaf-blindness. The syndrome is both clinically and genetically heterogeneous, and to date, eight causative genes have been identified. The proteins encoded by these genes are part of a dynamic protein complex that is present in hair cells of the inner ear and in photoreceptor cells of the retina. The localization of the Usher proteins and the phenotype in animal models indicate that the Usher protein complex is essential in the morphogenesis of the stereocilia bundle in hair cells and in the calycal processes of photoreceptor cells. In addition, the Usher proteins are important in the synaptic processes of both cell types. The association of other proteins with the complex indicates functional links to a number of basic cell-biological processes. Prominently present is the connection to the dynamics of the actin cytoskeleton, involved in cellular morphology, cell polarity and cell-cell interactions. The Usher protein complex can also be linked to the cadherins/catenins in the adherens junction-associated protein complexes, suggesting a role in cell polarity and tissue organization. A third link can be established to the integrin transmembrane signaling network. The Usher interactome, as outlined in this review, participates in pathways common in inner ear and retina that are disrupted in the Usher syndrome.
Crucial HSP70 co–chaperone complex unlocks metazoan protein disaggregation
Nillegoda, Nadinath B.; Kirstein, Janine; Szlachcic, Anna; Berynskyy, Mykhaylo; Stank, Antonia; Stengel, Florian; Arnsburg, Kristin; Gao, Xuechao; Scior, Annika; Aebersold, Ruedi; Guilbride, D. Lys; Wade, Rebecca C.; Morimoto, Richard I.; Mayer, Matthias P.; Bukau, Bernd
2016-01-01
Protein aggregates are the hallmark of stressed and ageing cells, and characterize several pathophysiological states. Healthy metazoan cells effectively eliminate intracellular protein aggregates, indicating that efficient disaggregation and/or degradation mechanisms exist. However, metazoans lack the key heat-shock protein disaggregase HSP100 of non-metazoan HSP70-dependent protein disaggregation systems, and the human HSP70 system alone, even with the crucial HSP110 nucleotide exchange factor, has poor disaggregation activity in vitro. This unresolved conundrum is central to protein quality control biology. Here we show that synergic cooperation between complexed J-protein co-chaperones of classes A and B unleashes highly efficient protein disaggregation activity in human and nematode HSP70 systems. Metazoan mixed-class J-protein complexes are transient, involve complementary charged regions conserved in the J-domains and carboxy-terminal domains of each J-protein class, and are flexible with respect to subunit composition. Complex formation allows J-proteins to initiate transient higher order chaperone structures involving HSP70 and interacting nucleotide exchange factors. A network of cooperative class A and B J-protein interactions therefore provides the metazoan HSP70 machinery with powerful, flexible, and finely regulatable disaggregase activity and a further level of regulation crucial for cellular protein quality control. PMID:26245380
Thermostability of In Vitro Evolved Bacillus subtilis Lipase A: A Network and Dynamics Perspective
Srivastava, Ashutosh; Sinha, Somdatta
2014-01-01
Proteins in thermophilic organisms remain stable and function optimally at high temperatures. Owing to their important applicability in many industrial processes, such thermostable proteins have been studied extensively, and several structural factors have been attributed to their enhanced stability. How these factors render the emergent property of thermostability to proteins, even in situations where no significant changes occur in their three-dimensional structures in comparison to their mesophilic counterparts, has remained an intriguing question. In this study we treat Lipase A from Bacillus subtilis and its six thermostable mutants in a unified manner and address the problem with a combined complex network-based analysis and molecular dynamics studies to find commonality in their properties. The Protein Contact Networks (PCN) of the wild-type and six mutant Lipase A structures developed at a mesoscopic scale were analyzed at global network and local node (residue) level using network parameters and community structure analysis. The comparative PCN analysis of all proteins pointed towards an important role of specific residues in the enhanced thermostability. Network analysis results were corroborated with finer-scale molecular dynamics simulations at both room and high temperatures. Our results show that this combined approach at two scales can uncover small but important changes in the local conformations that add up to stabilize the protein structure in thermostable mutants, even when overall conformation differences among them are negligible. Our analysis not only supports the experimentally determined stabilizing factors, but also unveils the important role of contacts, distributed throughout the protein, that lead to thermostability.
We propose that this combined mesoscopic-network and fine-grained molecular dynamics approach is a convenient and useful scheme not only to study allosteric changes leading to protein stability in the face of negligible overall conformational changes due to mutations, but also in other molecular networks where change in function does not accompany significant change in the network structure. PMID:25122499
Hao, Tong; Zeng, Zheng; Wang, Bin; Zhang, Yichen; Liu, Yichen; Geng, Xuyun; Sun, Jinsheng
2014-03-27
The protein-protein interaction network (PIN) is an effective information tool for understanding the complex biological processes inside the cell and solving many biological problems such as signaling pathway identification and prediction of protein functions. Eriocheir sinensis is a highly commercial aquaculture species with an unclear proteome background, which hinders the construction and development of a PIN for E. sinensis. However, in recent years, the development of next-generation deep-sequencing techniques has made it possible to obtain high-throughput data on the E. sinensis transcriptome and subsequently obtain a systematic overview of the protein-protein interaction system. In this work we sequenced the transcriptional RNA of eyestalk, Y-organ and hepatopancreas in E. sinensis and generated a PIN of E. sinensis which included 3,223 proteins and 35,787 interactions. Each protein-protein interaction in the network was scored according to the homology and genetic relationship. The signaling sub-network, representing the signal transduction pathways in E. sinensis, was extracted from the global network, which depicted a global view of the signaling systems in E. sinensis. Seven basic signal transduction pathways were identified in E. sinensis. By investigating the evolution paths of the seven pathways, we found that these pathways matured in different evolutionary stages. Moreover, the functions of unclassified proteins and unigenes in the PIN of E. sinensis were predicted. Specifically, the functions of 549 unclassified proteins related to 864 unclassified unigenes were assigned, which respectively covered 76% and 73% of all the unclassified proteins and unigenes in the network. The PIN generated in this work is the first large-scale PIN of an aquatic crustacean, thereby providing a paradigmatic blueprint of the aquatic crustacean interactome.
The signaling sub-network extracted from the global PIN depicts the interaction of different signaling proteins and the evolutionary paths of the identified signal transduction pathways. Furthermore, the function assignment of unclassified proteins based on the PIN offers a new reference in protein function exploration. More importantly, the construction of the E. sinensis PIN provides necessary experience for the exploration of PINs in other aquatic crustacean species.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang Bo; Huang Bo; School of Public Health, University of South China, Hengyang, Hunan 421001
Mitotic catastrophe, a form of cell death resulting from abnormal mitosis, is a cytotoxic death pathway as well as an appealing mechanistic strategy for the development of anti-cancer drugs. In this study, 6-bromine-5-hydroxy-4-methoxybenzaldehyde was demonstrated to induce DNA double-strand breaks and multipolar spindles, sustain mitotic arrest and generate multinucleated cells, all of which indicate mitotic catastrophe, in human hepatoma HepG2 cells. We used proteomic profiling to identify the differentially expressed proteins underlying mitotic catastrophe. A total of 137 differentially expressed proteins (76 upregulated and 61 downregulated proteins) were identified. Some of the changed proteins have previously been associated with mitotic catastrophe, such as DNA-PKcs, FoxM1, RCC1, cyclin E, PLK1-pT210, 14-3-3σ and HSP70. Multiple isoforms of 14-3-3, heat-shock proteins and tubulin were upregulated. Analysis of functional significance revealed that the 14-3-3-mediated signaling network was the most significantly enriched for the differentially expressed proteins. The modulated proteins were found to be involved in macromolecule complex assembly, cell death, cell cycle, chromatin remodeling and DNA repair, tubulin and cytoskeletal organization. These findings revealed the overall molecular events and functional signaling networks associated with spindle disruption and mitotic catastrophe. Research highlights: 6-bromoisovanillin induced spindle disruption and sustained mitotic arrest, which consequently resulted in mitotic catastrophe. Proteomic profiling identified 137 differentially expressed proteins associated with mitotic catastrophe. The 14-3-3-mediated signaling network was the most significantly enriched for the altered proteins. Macromolecule complex assembly, cell cycle, chromatin remodeling and DNA repair, and tubulin organization were also shown to be involved in mitotic catastrophe.
The protein-protein interface evolution acts in a similar way to antibody affinity maturation.
Li, Bohua; Zhao, Lei; Wang, Chong; Guo, Huaizu; Wu, Lan; Zhang, Xunming; Qian, Weizhu; Wang, Hao; Guo, Yajun
2010-02-05
Understanding the evolutionary mechanism that acts at the interfaces of protein-protein complexes is a fundamental issue with high interest for delineating the macromolecular complexes and networks responsible for regulation and complexity in biological systems. To investigate whether the evolution of protein-protein interfaces acts in a similar way to antibody affinity maturation, we incorporated evolutionary information derived from antibody affinity maturation with common simulation techniques to evaluate prediction success rates of the computational method in affinity improvement in four different systems: antibody-receptor, antibody-peptide, receptor-membrane ligand, and receptor-soluble ligand. It was interesting to find that the same evolutionary information could improve the prediction success rates in all four protein-protein complexes with an exceptionally high accuracy (>57%). One of the most striking findings in our present study is that not only in the antibody-combining site but also in other protein-protein interfaces almost all of the affinity-enhancing mutations are located at the germline hotspot sequences (RGYW or WA), indicating that DNA hot spot mechanisms may be widely used in the evolution of protein-protein interfaces. Our data suggest that the evolution of distinct protein-protein interfaces may use the same basic strategy under selection pressure to maintain interactions. Additionally, our data indicate that classical simulation techniques incorporating the evolutionary information derived from in vivo antibody affinity maturation can be utilized as a powerful tool to improve the binding affinity of protein-protein complexes with a high accuracy.
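The hotspot motifs named in this abstract (RGYW and WA) are written in IUPAC nucleotide codes, where R = A or G, Y = C or T, and W = A or T. As an illustration only, the following Python sketch scans a DNA string for both motifs and checks whether a given position falls inside one. The function names are hypothetical, only the given strand is scanned (reverse-complement motifs are not checked), and this is not the authors' pipeline.

```python
import re

# IUPAC nucleotide codes: R = A or G, Y = C or T, W = A or T.
# Motif names come from the abstract; the regex translation is ours.
HOTSPOT_PATTERNS = {
    "RGYW": "[AG]G[CT][AT]",
    "WA": "[AT]A",
}


def find_hotspots(sequence):
    """Return sorted (start, motif_name, matched_bases) tuples, 0-based starts."""
    seq = sequence.upper()
    hits = []
    for name, pattern in HOTSPOT_PATTERNS.items():
        # A zero-width lookahead reports overlapping occurrences too.
        for match in re.finditer("(?=(" + pattern + "))", seq):
            hits.append((match.start(), name, match.group(1)))
    return sorted(hits)


def in_hotspot(sequence, position):
    """True if the 0-based position is covered by any hotspot match."""
    return any(
        start <= position < start + len(bases)
        for start, _, bases in find_hotspots(sequence)
    )
```

With such a helper, a list of mutated nucleotide positions could be split into those that fall inside a hotspot motif and those that do not, which is the kind of tally the abstract's claim rests on.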
Bayesian module identification from multiple noisy networks.
Zamani Dadaneh, Siamak; Qian, Xiaoning
2016-12-01
Module identification has been studied extensively in order to gain deeper understanding of complex systems, such as social networks as well as biological networks. Modules are often defined as groups of vertices in these networks that are topologically cohesive with similar interaction patterns with the rest of the vertices. Most of the existing module identification algorithms assume that the given networks are faithfully measured without errors. However, in many real-world applications, for example, when analyzing protein-protein interaction networks from high-throughput profiling techniques, there is significant noise with both false positive and missing links between vertices. In this paper, we propose a new model for more robust module identification by taking advantage of multiple observed networks with significant noise so that signals in multiple networks can be strengthened and help improve the solution quality by combining information from various sources. We adopt a hierarchical Bayesian model to integrate multiple noisy snapshots that capture the underlying modular structure of the networks under study. By introducing a latent root assignment matrix and its relations to instantaneous module assignments in all the observed networks to capture the underlying modular structure and combine information across multiple networks, an efficient variational Bayes algorithm can be derived to accurately and robustly identify the underlying modules from multiple noisy networks. Experiments on synthetic and protein-protein interaction data sets show that our proposed model enhances both the accuracy and resolution in detecting cohesive modules, and it is less vulnerable to noise in the observed data. In addition, it shows higher power in predicting missing edges compared to individual-network methods.
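The intuition that several noisy observations of the same network can reinforce true edges can be illustrated with a much simpler stand-in than the hierarchical Bayesian model described above: a majority-style vote over edges. The Python sketch below is an assumption-laden toy (the function name and threshold are hypothetical) and does not implement the variational Bayes algorithm of the paper.

```python
def consensus_edges(networks, min_fraction=0.5):
    """Keep undirected edges seen in at least min_fraction of the networks.

    networks: list of edge lists, each edge a (node_a, node_b) pair.
    Returns a set of frozensets, one per retained edge.
    """
    counts = {}
    for edges in networks:
        # Normalise direction and count each edge at most once per network.
        for edge in {frozenset(e) for e in edges}:
            counts[edge] = counts.get(edge, 0) + 1
    total = len(networks)
    return {edge for edge, seen in counts.items() if seen / total >= min_fraction}
```

An edge reported by only one of three snapshots is dropped as likely noise, while edges supported by two or three snapshots are kept. Module detection would then run on the retained edges.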
Mapping Cellular Polarity Networks Using Mass Spectrometry-based Strategies.
Daulat, Avais M; Puvirajesinghe, Tania M; Camoin, Luc; Borg, Jean-Paul
2018-05-18
Cell polarity is a vital biological process involved in the building, maintenance and normal functioning of tissues in invertebrates and vertebrates. Unsurprisingly, molecular defects affecting polarity organization and functions have a strong impact on tissue homeostasis, embryonic development and adult life, and may directly or indirectly lead to diseases. Genetic studies have demonstrated the causative effect of several polarity genes in diseases; however, much remains to be clarified before a comprehensive view of the molecular organization and regulation of the protein networks associated with polarity proteins is obtained. This challenge can be approached head-on using proteomics to identify protein complexes involved in cell polarity and their modifications in a spatio-temporal manner. We review the fundamental basics of mass spectrometry techniques and provide an in-depth analysis of how mass spectrometry has been instrumental in understanding the complex and dynamic nature of some cell polarity networks at the tissue (apico-basal and planar cell polarities) and cellular (cell migration, ciliogenesis) levels, with the fine dissection of the interconnections between prototypic cell polarity proteins and signal transduction cascades in normal and pathological situations. This review primarily focuses on epithelial structures which are the fundamental building blocks for most metazoan tissues, used as the archetypal model to study cellular polarity. This field offers broad perspectives thanks to the ever-increasing sensitivity of mass spectrometry and its use in combination with recently developed molecular strategies able to probe in situ proteomic networks. Copyright © 2018 Elsevier Ltd. All rights reserved.
The DNA damage response (DDR) is a highly regulated signal transduction network that orchestrates the temporal and spatial organization of protein complexes required to repair (or tolerate) DNA damage (e.g., nucleotide excision repair, base excision repair, homologous recombination, non-homologous end joining, post-replication repair).
Systems Biology Graphical Notation: Entity Relationship language Level 1 Version 2.
Sorokin, Anatoly; Le Novère, Nicolas; Luna, Augustin; Czauderna, Tobias; Demir, Emek; Haw, Robin; Mi, Huaiyu; Moodie, Stuart; Schreiber, Falk; Villéger, Alice
2015-09-04
The Systems Biology Graphical Notation (SBGN) is an international community effort for standardized graphical representations of biological pathways and networks. The goal of SBGN is to provide unambiguous pathway and network maps for readers with different scientific backgrounds as well as to support efficient and accurate exchange of biological knowledge between different research communities, industry, and other players in systems biology. Three SBGN languages, Process Description (PD), Entity Relationship (ER) and Activity Flow (AF), allow for the representation of different aspects of biological and biochemical systems at different levels of detail. The SBGN Entity Relationship language (ER) represents biological entities and their interactions and relationships within a network. SBGN ER focuses on all potential relationships between entities without considering temporal aspects. The nodes (elements) describe biological entities, such as proteins and complexes. The edges (connections) provide descriptions of interactions and relationships (or influences), e.g., complex formation, stimulation and inhibition. Among all three languages of SBGN, ER is the closest to protein interaction networks in biological literature and textbooks, but its well-defined semantics offer a superior precision in expressing biological knowledge.
Role of the Retromer Complex in Neurodegenerative Diseases
Li, Chaosi; Shah, Syed Zahid Ali; Zhao, Deming; Yang, Lifeng
2016-01-01
The retromer complex is a protein complex that plays a central role in endosomal trafficking. Retromer dysfunction has been linked to a growing number of neurological disorders. The process of intracellular trafficking and recycling is crucial for maintaining normal intracellular homeostasis, which is partly achieved through the activity of the retromer complex. The retromer complex plays a primary role in sorting endosomal cargo back to the cell surface for reuse, to the trans-Golgi network (TGN), or alternatively to specialized endomembrane compartments, in which the cargo is not subjected to lysosomal-mediated degradation. In most cases, the retromer acts as a core that interacts with associated proteins, including sorting nexin family member 27 (SNX27), members of the vacuolar protein sorting 10 (VPS10) receptor family, the major endosomal actin polymerization-promoting complex known as Wiskott-Aldrich syndrome protein and scar homolog (WASH), and other proteins. Some of the molecules carried by the retromer complex are risk factors for neurodegenerative diseases. Defects such as haplo-insufficiency or mutations in one or several units of the retromer complex lead to various pathologies. Here, we summarize the molecular architecture of the retromer complex and the roles of this system in intracellular trafficking related to the pathogenesis of neurodegenerative diseases. PMID:26973516
NASA Astrophysics Data System (ADS)
Nussinov, Ruth; Panchenko, Anna R.; Przytycka, Teresa
2011-06-01
Physics approaches focus on uncovering, modeling and quantitating the general principles governing the micro and macro universe. This has always been an important component of biological research; however, recent advances in experimental techniques and the accumulation of unprecedented genome-scale experimental data produced by these novel technologies now allow for addressing fundamental questions on a large scale. These relate to molecular interactions, principles of biomolecular recognition, and mechanisms of signal propagation. The functioning of a cell requires a variety of intermolecular interactions including protein-protein, protein-DNA, protein-RNA, hormones, peptides, small molecules, lipids and more. Biomolecules work together to provide specific functions, and perturbations in intermolecular communication channels often lead to cellular malfunction and disease. A full understanding of the interactome requires an in-depth grasp of the biophysical principles underlying individual interactions as well as their organization in cellular networks. Phenomena can be described at different levels of abstraction. Computational and systems biology strive to model cellular processes by integrating and analyzing complex data from multiple experimental sources using interdisciplinary tools. As a result, both the causal relationships between the variables and the general features of the system can be discovered, which even without knowing the details of the underlying mechanisms allow for putting forth hypotheses and predicting the behavior of the systems in response to perturbation. And here lies the strength of in silico models, which provide control and predictive power. At the same time, the complexity of individual elements and molecules can be addressed by the fields of molecular biophysics, physical biology and structural biology, which focus on the underlying physico-chemical principles and may explain the molecular mechanisms of cellular function.
In this issue we have assembled a representative set of papers written by experts with diverse scientific backgrounds, each offering a unique viewpoint on using computational and physics methods to study biological systems at different levels of organization. We start with studies that aim to decipher the mechanisms of molecular recognition using biophysics methods and then expand our scale, concluding the issue with studies of interaction networks at cellular and population levels. Biomolecules interact with each other in a highly specific manner and selectively recognize their partners among hundreds of thousands of other molecules. As the paper by Zhang et al points out, this recognition process should be fast and guided by long-range electrostatic forces that select and bring the interacting partners together. The authors show that the increase of salt concentration leads to destabilization of protein complexes, suggesting an optimization of the charge-charge interactions across the protein binding interfaces. The following paper by Berezovsky further explores the balance of different interactions in protein complexes and uses physical concepts to explain the entire spectrum of protein structural classes, from intrinsically disordered to hyperthermostable proteins. The author describes highly unstructured viral proteins at one end of the spectrum and discusses the balance of stabilizing interactions in protein complexes from thermophilic organisms at the other. Recently accumulated evidence has indicated that native proteins do not necessarily require a unique structure to be biologically active, and in some cases structural disorder or intrinsic flexibility can be a prerequisite for their function. From the physical point of view, these disordered/flexible proteins exist in dynamic equilibrium between different conformational states, some of which could be selected upon binding to another partner. 
Such a property allows disordered proteins to achieve specific binding and at the same time reversibility and diversity in their interactions. Interestingly, as is shown in the paper by Mészáros et al, even though some disordered regions and proteins have a tendency to fold upon binding, the structures of their complexes still reveal their inherent flexibility. Indeed, disordered proteins and their complexes have certain properties which distinguish them from proteins with well-defined structures. This is evident from the papers by Lobanov and Galzitskaya, and Mészáros et al, which show that such characteristic features of disordered proteins allow their successful computational prediction from the sequence alone. Computational prediction of protein disorder has been used in another study by Takeda et al where the authors investigate the role of disorder in the function of a specific actin capping protein. The paper presents normal mode analysis with the elastic network model to examine the mechanisms of intrinsic flexibility and its biological role in actin function. Analysis of the underlying mechanisms and key factors in protein recognition might be essential for the prediction of protein-protein interactions. The papers by Tuncbag et al and Hashimoto et al demonstrate how incorporating the physico-chemical properties of binding interfaces and their atomic details obtained from protein crystal structures might be used to increase the accuracy of predicted protein-protein interactions and provide data on relative orientations of interacting proteins and on the locations of binding sites. Moreover, analysis of protein-protein interactions might require further fine-tuning for different types of assemblies, like that shown in the example of homooligomers by Hashimoto et al. Studies of protein-protein interactions at the molecular level have contributed considerably to understanding the principles of large-scale organization of the cellular interactome. 
Using graph theory as a unifying language, many characteristic properties of biomolecular networks have been identified, including scale-free distribution of the vertex degree, network motifs, and modularity, to name a few. These studies of network organization require the network to be as complete as possible, which given the limitations of experimental techniques is not currently the case. Therefore, experimental procedures for detecting biomolecular interactions should be complemented by computational approaches. The paper by Lees et al provides a review of computational methods, integrating multiple independent sources of data to infer physical and functional protein-protein interaction networks. One of the important aspects of protein interactions that should be accounted for in the prediction of protein interaction networks is that many proteins are composed of distinct domains. Protein domains may mediate protein interactions while proteins and their interaction networks may gain complexity through gene duplication and expansion of existing domain architectures via domain rearrangements. The latter mechanisms have been explored in detail in the paper by Cohen-Gihon et al. Protein-protein interactions are not the only component of the cell's interactome. Regulation of cell activity can be achieved at the level of transcription and involve transcription factor-DNA binding, which typically requires recognition of a specific DNA sequence motif. ChIP-chip and the more recent ChIP-seq technologies allow in vivo identification of DNA binding sites and, together with novel in vitro approaches, provide data necessary for deciphering the corresponding binding motifs. Such information, complemented by structures of protein-DNA complexes and knowledge of the differences in binding sites among homologs, opens the door to constructing predictive binding models. The paper by Persikov and Singh provides an example of such a model in the Cys2His2 zinc finger family.
Recent studies have indicated that the presence of such binding motifs is, however, neither necessary nor sufficient for transcription factor activity. Transcription regulation is a complex and still not fully understood process involving, in addition to protein-DNA binding, other factors such as epigenetic modifications and three-dimensional DNA organization. In this issue, Levens and Benham discuss another important mechanism which is likely to contribute to overall gene regulation—changes of DNA secondary structure in response to supercoiling-induced stress. Pointing out that DNA is "more than a cipher", they argue that the DNA structural transitions driven by negative supercoiling may have profound consequences for the cell and have to be accounted for in detailed models. There is considerable progress in physical modeling of DNA dynamics in response to stress. Such efforts, supported by experimental data, will bring us closer to an understanding of the role of supercoiling in gene regulation. Large-scale biomolecular interaction networks not only provide a system-level view of cellular processes, but are also increasingly used to model communications between molecules. The lack of sufficient biochemical data and the gigantic scale of the network prevented detailed modeling of network dynamics and have stimulated the development of simplified models such as the information flow approach described by Kim et al in this issue. Importantly, despite their simplicity, such models proved to be extremely useful for identifying network modules, essential nodes, and molecular pathways which are dysregulated in complex diseases such as cancer. Finally, moving from studies of single cells towards populations, one has to recognize the heterogeneity present within a population of cells. 
In the context of protein abundance, such cell-to-cell variation within clonal populations of cells, referred to as expression noise, has recently become a focus of intense cross-disciplinary research. Concerted efforts of experimentalists, physicists and mathematicians have brought us closer to understanding the source, potential drawbacks and benefits of noise for cell function. Differences in protein expression levels are even more pronounced in samples from mixed cell populations. How does such a mixture of cell populations affect the measurements of total gene expression? This question is addressed by Hebenstreit and Teichmann who show that decomposing a signal coming from a mixture of cellular populations requires insights from theoretical modeling. Recent technological advancements permitting genome-wide scale measurements of diverse molecular properties and consequently higher levels of quantitative reasoning are attracting physicists, mathematicians and computer scientists to the study of biological systems. Building on the synergy between these fields, we are entering an exciting era where physics methods are used in conjunction with these disciplines which, combined with statistical methods, provide quantitative descriptions of biology. Acknowledgments This project was funded with federal funds from the National Cancer Institute, National Institutes of Health, under contract number HHSN261200800001E. This research was supported by the Intramural Research Program of the NIH, National Cancer Institute, Center for Cancer Research and the National Library of Medicine at National Institutes of Health/DHHS. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products or organizations imply endorsement by the US Government.
Engin, H. Billur; Guney, Emre; Keskin, Ozlem; Oliva, Baldo; Gursoy, Attila
2013-01-01
Blocking specific protein interactions can lead to human diseases. Accordingly, protein interactions and the structural knowledge on interacting surfaces of proteins (interfaces) have an important role in predicting the genotype-phenotype relationship. We have built the phenotype-specific sub-networks of protein-protein interactions (PPIs) involving the relevant genes responsible for lung and brain metastasis from primary tumor in breast cancer. First, we selected the PPIs most relevant to metastasis-causing genes (seed genes), by using the “guilt-by-association” principle. Then, we modeled structures of the interactions whose complex forms are not available in the Protein Data Bank (PDB). Finally, we mapped mutations to interface structures (real and modeled), in order to spot the interactions that might be manipulated by these mutations. Functional analyses performed on these sub-networks revealed the potential relationship between immune system-infectious diseases and lung metastasis progression, but this connection was not observed significantly in the brain metastasis. In addition, structural analyses showed that some PPI interfaces in both metastasis sub-networks originate from microbial proteins, which in turn were mostly related to cell adhesion. Cell adhesion is a key mechanism in metastasis; therefore, these PPIs may be involved in similar molecular pathways that are shared by infectious disease and metastasis. Finally, by mapping the mutations and amino acid variations on the interface regions of the proteins in the metastasis sub-networks we found evidence for some mutations to be involved in the mechanisms differentiating the type of the metastasis. PMID:24278371
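The "guilt-by-association" step mentioned in this abstract can be pictured with a deliberately simple scoring rule: a protein is considered relevant if a large share of its interaction partners are seed genes. The Python sketch below assumes that rule, a hypothetical threshold, and hypothetical function names. The abstract does not specify the scoring method the authors used, so this is an illustration of the principle only.

```python
from collections import defaultdict


def build_adjacency(interactions):
    """Map each protein to the set of its interaction partners (undirected)."""
    adjacency = defaultdict(set)
    for a, b in interactions:
        adjacency[a].add(b)
        adjacency[b].add(a)
    return adjacency


def seed_neighbor_scores(interactions, seeds):
    """Score each non-seed protein by the fraction of its partners that are seeds."""
    seeds = set(seeds)
    adjacency = build_adjacency(interactions)
    return {
        protein: len(partners & seeds) / len(partners)
        for protein, partners in adjacency.items()
        if protein not in seeds
    }


def select_subnetwork(interactions, seeds, min_score=0.5):
    """Keep interactions whose endpoints are both seeds or high-scoring proteins."""
    scores = seed_neighbor_scores(interactions, seeds)
    keep = set(seeds) | {p for p, s in scores.items() if s >= min_score}
    return [(a, b) for a, b in interactions if a in keep and b in keep]
```

Raising `min_score` yields a smaller sub-network centred more tightly on the seed genes; lowering it admits proteins that are only loosely connected to them.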
Noninvasive imaging of protein-protein interactions in living organisms.
Haberkorn, Uwe; Altmann, Annette
2003-06-01
Genomic research is expected to generate new types of complex observational data, changing the types of experiments as well as our understanding of biological processes. The investigation and definition of relationships among proteins is essential for understanding the function of each gene and the mechanisms of biological processes that specific genes are involved in. Recently, a study by Paulmurugan et al. demonstrated a tool for in vivo noninvasive imaging of protein-protein interactions and intracellular networks.
Large-scale De Novo Prediction of Physical Protein-Protein Association*
Elefsinioti, Antigoni; Saraç, Ömer Sinan; Hegele, Anna; Plake, Conrad; Hubner, Nina C.; Poser, Ina; Sarov, Mihail; Hyman, Anthony; Mann, Matthias; Schroeder, Michael; Stelzl, Ulrich; Beyer, Andreas
2011-01-01
Information about the physical association of proteins is extensively used for studying cellular processes and disease mechanisms. However, complete experimental mapping of the human interactome will remain prohibitively difficult in the near future. Here we present a map of predicted human protein interactions that distinguishes functional association from physical binding. Our network classifies more than 5 million protein pairs predicting 94,009 new interactions with high confidence. We experimentally tested a subset of these predictions using yeast two-hybrid analysis and affinity purification followed by quantitative mass spectrometry. Thus we identified 462 new protein-protein interactions and confirmed the predictive power of the network. These independent experiments address potential issues of circular reasoning and are a distinctive feature of this work. Analysis of the physical interactome unravels subnetworks mediating between different functional and physical subunits of the cell. Finally, we demonstrate the utility of the network for the analysis of molecular mechanisms of complex diseases by applying it to genome-wide association studies of neurodegenerative diseases. This analysis provides new evidence implying TOMM40 as a factor involved in Alzheimer's disease. The network provides a high-quality resource for the analysis of genomic data sets and genetic association studies in particular. Our interactome is available via the hPRINT web server at: www.print-db.org. PMID:21836163
Vaiman, Daniel; Miralles, Francisco
2016-01-01
Preeclampsia (PE) is a pregnancy disorder defined by hypertension and proteinuria. This disease remains a major cause of maternal and fetal morbidity and mortality. Defective placentation is generally described as being at the root of the disease. The characterization of the transcriptome signature of the preeclamptic placenta has allowed the identification of differentially expressed genes (DEGs). However, we still lack detailed knowledge of how these DEGs impact the function of the placenta. The tools of network biology offer a methodology to explore complex diseases at a systems level. In this study we performed a cross-platform meta-analysis of seven publicly available gene expression datasets comparing non-pathological and preeclamptic placentas. Using the rank product algorithm we identified a total of 369 DEGs consistently modified in PE. The DEGs were used as seeds to build both an extended physical protein-protein interaction network and a transcription factor regulatory network. Topological and clustering analyses were conducted to examine the connectivity properties of the networks. Finally, both networks were merged into a composite network which presents an integrated view of the regulatory pathways involved in preeclampsia and the crosstalk between them. This network is a useful tool to explore the relationships between the DEGs and to enable hypothesis generation for functional experimentation. PMID:27802351
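The rank product statistic mentioned in the abstract above can be illustrated with a minimal sketch. It assumes the common definition (the geometric mean of a gene's rank across datasets) and uses hypothetical gene names and fold-change values; the permutation-based significance step of the full method is omitted, and nothing here reproduces the authors' actual pipeline.

```python
from math import prod

def rank_product(fold_changes):
    """Geometric mean of per-dataset ranks.

    fold_changes: dict mapping gene -> list of fold changes, one per dataset.
    Rank 1 is the most strongly up-regulated gene in a dataset.
    """
    genes = list(fold_changes)
    n_datasets = len(next(iter(fold_changes.values())))
    ranks = {g: [] for g in genes}
    for d in range(n_datasets):
        ordered = sorted(genes, key=lambda g: fold_changes[g][d], reverse=True)
        for rank, g in enumerate(ordered, start=1):
            ranks[g].append(rank)
    return {g: prod(r) ** (1.0 / n_datasets) for g, r in ranks.items()}

# Hypothetical toy data: three genes measured in three datasets.
toy = {
    "geneA": [3.0, 2.5, 4.0],
    "geneB": [1.2, 2.6, 1.1],
    "geneC": [0.5, 0.4, 0.9],
}
scores = rank_product(toy)
```

A small rank product indicates a gene that ranks near the top consistently across datasets, which is why the statistic suits cross-platform meta-analysis where raw expression values are not directly comparable.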
A Continuum Model of Actin Waves in Dictyostelium discoideum
Khamviwath, Varunyu; Hu, Jifeng; Othmer, Hans G.
2013-01-01
Actin waves are complex dynamical patterns of the dendritic network of filamentous actin in eukaryotes. We developed a model of actin waves in PTEN-deficient Dictyostelium discoideum by deriving an approximation of the dynamics of discrete actin filaments and combining it with a signaling pathway that controls filament branching. This signaling pathway, together with the actin network, contains a positive feedback loop that drives the actin waves. Our model predicts the structure, composition, and dynamics of waves that are consistent with existing experimental evidence, as well as the biochemical dependence on various protein partners. Simulation suggests that actin waves are initiated when local actin network activity, caused by an independent process, exceeds a certain threshold. Moreover, diffusion of proteins that form a positive feedback loop with the actin network alone is sufficient for propagation of actin waves at the observed speed of . Decay of the wave back can be caused by scarcity of network components, and the shape of actin waves is highly dependent on the filament disassembly rate. The model allows retraction of actin waves and captures formation of new wave fronts in broken waves. Our results demonstrate that a delicate balance between a positive feedback, filament disassembly, and local availability of network components is essential for the complex dynamics of actin waves. PMID:23741312
Tandem Repeat Proteins Inspired By Squid Ring Teeth
NASA Astrophysics Data System (ADS)
Pena-Francesch, Abdon
Proteins are large biomolecules consisting of long chains of amino acids that hierarchically assemble into complex structures, and provide a variety of building blocks for biological materials. The repetition of structural building blocks is a natural evolutionary strategy for increasing the complexity and stability of protein structures. However, the relationship between amino acid sequence, structure, and material properties of protein systems remains unclear due to the lack of control over the protein sequence and the intricacies of the assembly process. In order to investigate the repetition of protein building blocks, a recently discovered protein from squids is examined as an ideal protein system. Squid ring teeth are predatory appendages located inside the suction cups that provide a strong grasp of prey, and are solely composed of a group of proteins with tandem repetition of building blocks. The objective of this thesis is to understand, for the first time, the relationship between sequence, structure and properties in repetitive protein materials inspired by squid ring teeth. Specifically, this work focuses on squid-inspired structural proteins with tandem repeat units in their sequence (i.e., repetition of alternating building blocks) that are physically cross-linked via beta-sheet structures. The research work presented here tests the hypothesis that, in these systems, increasing the number of building blocks in the polypeptide chain decreases the protein network defects and improves the material properties. Hence, the sequence, nanostructure, and properties (thermal, mechanical, and conducting) of tandem repeat squid-inspired protein materials are examined. Spectroscopic structural analysis, advanced materials characterization, and entropic elasticity theory are combined to elucidate the structure and material properties of these repetitive proteins. 
This approach is applied not only to native squid proteins but also to squid-inspired synthetic polypeptides that allow for a fine control of the sequence and network morphology. The results provided in this work establish a clear dependence between the repetitive building blocks, the network morphology, and the properties of squid-inspired repetitive protein materials. Increasing the number of tandem repeat units in SRT-inspired proteins led to more effective protein networks with superior properties. Through increasing tandem repetition and optimization of network morphology, highly efficient protein materials capable of withstanding deformations up to 400% of their original length, with MPa-GPa modulus, high energy absorption (50 MJ m^-3), peak proton conductivity of 3.7 mS cm^-1 (at pH 7, highest reported to date for biological materials), and peak thermal conductivity of 1.4 W m^-1 K^-1 (which exceeds that of most polymer materials) were developed. These findings introduce new design rules in the engineering of proteins based on tandem repetition and morphology control, and provide a novel framework for tailoring and optimizing the properties of protein-based materials.
Trend Motif: A Graph Mining Approach for Analysis of Dynamic Complex Networks
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jin, R; McCallen, S; Almaas, E
2007-05-28
Complex networks have been used successfully in scientific disciplines ranging from sociology to microbiology to describe systems of interacting units. Until recently, studies of complex networks have mainly focused on their network topology. However, in many real world applications, the edges and vertices have associated attributes that are frequently represented as vertex or edge weights. Furthermore, these weights are often not static, instead changing with time and forming a time series. Hence, to fully understand the dynamics of the complex network, we have to consider both network topology and related time series data. In this work, we propose a motif mining approach to identify trend motifs for such purposes. Simply stated, a trend motif describes a recurring subgraph where each of its vertices or edges displays similar dynamics over a user-defined period. Given this, each trend motif occurrence can help reveal significant events in a complex system; frequent trend motifs may aid in uncovering dynamic rules of change for the system, and the distribution of trend motifs may characterize the global dynamics of the system. Here, we have developed efficient mining algorithms to extract trend motifs. Our experimental validation using three disparate empirical datasets, ranging from the stock market, world trade, to a protein interaction network, has demonstrated the efficiency and effectiveness of our approach.
Hierarchy Measure for Complex Networks
Mones, Enys; Vicsek, Lilla; Vicsek, Tamás
2012-01-01
Nature, technology and society are full of complexity arising from the intricate web of the interactions among the units of the related systems (e.g., proteins, computers, people). Consequently, one of the most successful recent approaches to capturing the fundamental features of the structure and dynamics of complex systems has been the investigation of the networks associated with the above units (nodes) together with their relations (edges). Most complex systems have an inherently hierarchical organization and, correspondingly, the networks behind them also exhibit hierarchical features. Indeed, several papers have been devoted to describing this essential aspect of networks, however, without resulting in a widely accepted, converging concept concerning the quantitative characterization of the level of their hierarchy. Here we develop an approach and propose a quantity (measure) which is simple enough to be widely applicable, reveals a number of universal features of the organization of real-world networks and, as we demonstrate, is capable of capturing the essential features of the structure and the degree of hierarchy in a complex network. The measure we introduce is based on a generalization of the m-reach centrality, which we first extend to directed/partially directed graphs. Then, we define the global reaching centrality (GRC), which is the difference between the maximum and the average value of the generalized reach centralities over the network. We investigate the behavior of the GRC considering both a synthetic model with an adjustable level of hierarchy and real networks. Results for real networks show that our hierarchy measure is related to the controllability of the given system. We also propose a visualization procedure for large complex networks that can be used to obtain an overall qualitative picture about the nature of their hierarchical structure. PMID:22470477
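The global reaching centrality described in the abstract above can be sketched for an unweighted directed graph. This sketch assumes the local reaching centrality of a node is the fraction of other nodes reachable from it, and it normalizes the summed differences from the maximum by N - 1, which is the form we understand the published measure to take (it differs from the plain "maximum minus average" wording only by a factor N/(N - 1)). The adjacency-dict representation and function names are illustrative choices.

```python
from collections import deque

def local_reach(adj, source):
    """Fraction of the other nodes reachable from `source` along directed edges."""
    seen = {source}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj.get(u, ()):
            if v not in seen:
                seen.add(v)
                queue.append(v)
    return (len(seen) - 1) / (len(adj) - 1)

def global_reaching_centrality(adj):
    """adj: dict node -> list of successors; every node must appear as a key."""
    reach = [local_reach(adj, n) for n in adj]
    c_max = max(reach)
    return sum(c_max - c for c in reach) / (len(adj) - 1)

# A directed star is maximally hierarchical; a directed cycle has no hierarchy.
star = {0: [1, 2, 3], 1: [], 2: [], 3: []}
cycle = {0: [1], 1: [2], 2: [3], 3: [0]}
grc_star = global_reaching_centrality(star)
grc_cycle = global_reaching_centrality(cycle)
```

In the star, only the hub reaches every other node, so the measure takes its largest value; in the cycle, every node reaches every other node and the measure vanishes.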
Molecular Analysis of Core Kinetochore Composition and Assembly in Drosophila melanogaster
Przewloka, Marcin R.; Archambault, Vincent; D'Avino, Pier Paolo; Lilley, Kathryn S.; Laue, Ernest D.; McAinsh, Andrew D.; Glover, David M.
2007-01-01
Background Kinetochores are large multiprotein complexes indispensable for proper chromosome segregation. Although Drosophila is a classical model organism for studies of chromosome segregation, little is known about the organization of its kinetochores. Methodology/Principal Findings We employed bioinformatics, proteomics and cell biology methods to identify and analyze the interaction network of Drosophila kinetochore proteins. We have shown that three Drosophila proteins highly diverged from human and yeast Ndc80, Nuf2 and Mis12 are indeed their orthologues. Affinity purification of these proteins from cultured Drosophila cells identified a further five interacting proteins with weak similarity to subunits of the SPC105/KNL-1, MIND/MIS12 and NDC80 kinetochore complexes together with known kinetochore associated proteins such as dynein/dynactin, spindle assembly checkpoint components and heterochromatin proteins. All eight kinetochore complex proteins were present at the kinetochore during mitosis and MIND/MIS12 complex proteins were also centromeric during interphase. Their down-regulation led to dramatic defects in chromosome congression/segregation frequently accompanied by mitotic spindle elongation. The systematic depletion of each individual protein allowed us to establish dependency relationships for their recruitment onto the kinetochore. This revealed the sequential recruitment of individual members of first, the MIND/MIS12 and then, NDC80 complex. Conclusions/Significance The Drosophila MIND/MIS12 and NDC80 complexes and the Spc105 protein, like their counterparts from other eukaryotic species, are essential for chromosome congression and segregation, but are highly diverged in sequence. Hierarchical dependence relationships of individual proteins regulate the assembly of Drosophila kinetochore complexes in a manner similar, but not identical, to other organisms. PMID:17534428
Effect of CMC Molecular Weight on Acid-Induced Gelation of Heated WPI-CMC Soluble Complex.
Huan, Yan; Zhang, Sha; Vardhanabhuti, Bongkosh
2016-02-01
Acid-induced gelation properties of heated whey protein isolate (WPI) and carboxymethylcellulose (CMC) soluble complex were investigated as a function of CMC molecular weight (270, 680, and 750 kDa) and concentrations (0% to 0.125%). Heated WPI-CMC soluble complex with 6% protein was made by heating biopolymers together at pH 7.0 and 85 °C for 30 min and diluted to 5% protein before acid-induced gelation. Acid-induced gel formed from heated WPI-CMC complexes exhibited increased hardness and decreased water holding capacity with increasing CMC concentrations but gel strength decreased at higher CMC content. The highest gel strength was observed with CMC 750 k at 0.05%. Gels with low CMC concentration showed homogenous microstructure which was independent of CMC molecular weight, while increasing CMC concentration led to microphase separation with higher CMC molecular weight showing more extensive phase separation. When heated WPI-CMC complexes were prepared at 9% protein the acid gels showed improved gel hardness and water holding capacity, which was supported by the more interconnected protein network with less porosity when compared to complexes heated at 6% protein. It is concluded that protein concentration and biopolymer ratio during complex formation are the major factors affecting gel properties while the effect of CMC molecular weight was less significant. © 2016 Institute of Food Technologists®
Salivary Defense Proteins: Their Network and Role in Innate and Acquired Oral Immunity
Fábián, Tibor Károly; Hermann, Péter; Beck, Anita; Fejérdy, Pál; Fábián, Gábor
2012-01-01
There are numerous defense proteins present in the saliva. Although some of these molecules are present in rather low concentrations, their effects are additive and/or synergistic, resulting in an efficient molecular defense network of the oral cavity. Moreover, local concentrations of these proteins near the mucosal surfaces (mucosal transudate), periodontal sulcus (gingival crevicular fluid) and oral wounds and ulcers (transudate) may be much greater, and in many cases reinforced by immune and/or inflammatory reactions of the oral mucosa. Some defense proteins, like salivary immunoglobulins and salivary chaperokine HSP70/HSPAs (70 kDa heat shock proteins), are involved in both innate and acquired immunity. Cationic peptides and other defense proteins like lysozyme, bactericidal/permeability increasing protein (BPI), BPI-like proteins, PLUNC (palate lung and nasal epithelial clone) proteins, salivary amylase, cystatins, proline-rich proteins, mucins, peroxidases, statherin and others are primarily responsible for innate immunity. In this paper, this complex system and the function of the salivary defense proteins will be reviewed. PMID:22605979
In-vivo detection of binary PKA network interactions upon activation of endogenous GPCRs
Röck, Ruth; Bachmann, Verena; Bhang, Hyo-eun C; Malleshaiah, Mohan; Raffeiner, Philipp; Mayrhofer, Johanna E; Tschaikner, Philipp M; Bister, Klaus; Aanstad, Pia; Pomper, Martin G; Michnick, Stephen W; Stefan, Eduard
2015-01-01
Membrane receptor-sensed input signals affect and modulate intracellular protein-protein interactions (PPIs). Consequent changes occur to the compositions of protein complexes, protein localization and intermolecular binding affinities. Alterations of compartmentalized PPIs emanating from certain deregulated kinases are implicated in the manifestation of diseases such as cancer. Here we describe the application of a genetically encoded Protein-fragment Complementation Assay (PCA) based on the Renilla Luciferase (Rluc) enzyme to compare binary PPIs of the spatially and temporally controlled protein kinase A (PKA) network in diverse eukaryotic model systems. The simplicity and sensitivity of this cell-based reporter allows for real-time recordings of mutually exclusive PPIs of PKA upon activation of selected endogenous G protein-coupled receptors (GPCRs) in cancer cells, xenografts of mice, budding yeast, and zebrafish embryos. This extends the application spectrum of Rluc PCA for the quantification of PPI-based receptor-effector relationships in physiological and pathological model systems. PMID:26099953
Garamszegi, Sara; Franzosa, Eric A.; Xia, Yu
2013-01-01
A central challenge in host-pathogen systems biology is the elucidation of general, systems-level principles that distinguish host-pathogen interactions from within-host interactions. Current analyses of host-pathogen and within-host protein-protein interaction networks are largely limited by their resolution, treating proteins as nodes and interactions as edges. Here, we construct a domain-resolved map of human-virus and within-human protein-protein interaction networks by annotating protein interactions with high-coverage, high-accuracy, domain-centric interaction mechanisms: (1) domain-domain interactions, in which a domain in one protein binds to a domain in a second protein, and (2) domain-motif interactions, in which a domain in one protein binds to a short, linear peptide motif in a second protein. Analysis of these domain-resolved networks reveals, for the first time, significant mechanistic differences between virus-human and within-human interactions at the resolution of single domains. While human proteins tend to compete with each other for domain binding sites by means of sequence similarity, viral proteins tend to compete with human proteins for domain binding sites in the absence of sequence similarity. Independent of their previously established preference for targeting human protein hubs, viral proteins also preferentially target human proteins containing linear motif-binding domains. Compared to human proteins, viral proteins participate in more domain-motif interactions, target more unique linear motif-binding domains per residue, and contain more unique linear motifs per residue. Together, these results suggest that viruses surmount genome size constraints by convergently evolving multiple short linear motifs in order to effectively mimic, hijack, and manipulate complex host processes for their survival. 
Our domain-resolved analyses reveal unique signatures of pleiotropy, economy, and convergent evolution in viral-host interactions that are otherwise hidden in the traditional binary network, highlighting the power and necessity of high-resolution approaches in host-pathogen systems biology. PMID:24339775
Revealing networks from dynamics: an introduction
NASA Astrophysics Data System (ADS)
Timme, Marc; Casadiego, Jose
2014-08-01
What can we learn from the collective dynamics of a complex network about its interaction topology? Taking the perspective from nonlinear dynamics, we briefly review recent progress on how to infer structural connectivity (direct interactions) from accessing the dynamics of the units. Potential applications range from interaction networks in physics, to chemical and metabolic reactions, protein and gene regulatory networks as well as neural circuits in biology and electric power grids or wireless sensor networks in engineering. Moreover, we briefly mention some standard ways of inferring effective or functional connectivity.
Mass-action equilibrium and non-specific interactions in protein binding networks
NASA Astrophysics Data System (ADS)
Maslov, Sergei
2009-03-01
Large-scale protein binding networks serve as a paradigm of complex properties of living cells. These networks are naturally weighted, with edges characterized by binding strength and protein nodes by their concentrations. However, state-of-the-art high-throughput experimental techniques generate only binary (yes or no) information about individual interactions. As a result, most previous research concentrated only on the topology of these networks. In a series of recent publications [1-4] my collaborators and I went beyond purely topological studies and calculated the mass-action equilibrium of a genome-wide binding network using experimentally determined protein concentrations, localizations, and reliable binding interactions in baker's yeast. We then studied how this equilibrium responds to large perturbations [1-2] and noise [3] in concentrations of proteins. We demonstrated that the change in the equilibrium concentration of a protein decays exponentially (and alternates in sign) with its network distance from the perturbed node. This explains why, despite a globally connected topology, individual functional modules in such networks are able to operate fairly independently. In a separate study [4] we quantified the interplay between specific and non-specific binding interactions under crowded conditions inside living cells. We show how the need to limit the waste of resources constrains the number of types and concentrations of proteins that are present at the same time and at the same place in yeast cells. [1] S. Maslov, I. Ispolatov, PNAS 104:13655 (2007). [2] S. Maslov, K. Sneppen, I. Ispolatov, New J. of Phys. 9: 273 (2007). [3] K-K. Yan, D. Walker, S. Maslov, PRL accepted (2008). [4] J. Zhang, S. Maslov, and E. I. Shakhnovich, Mol Syst Biol 4, 210 (2008).
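A mass-action equilibrium of the kind described above can be sketched numerically under simplifying assumptions: only heterodimers form, each pair (i, j) has a dissociation constant K_ij, and the free concentration F_i of each protein satisfies F_i = C_i / (1 + sum_j F_j / K_ij), where C_i is its total concentration. The plain fixed-point iteration, the function name and the toy values below are illustrative assumptions, not the authors' implementation, and convergence of this simple scheme is not guaranteed for every network.

```python
def free_concentrations(total, K, tol=1e-12, max_iter=10_000):
    """Solve F_i = C_i / (1 + sum_j F_j / K_ij) by fixed-point iteration.

    total: dict protein -> total concentration C_i
    K:     dict (i, j) -> dissociation constant of the i-j heterodimer (i != j)
    """
    partners = {p: [] for p in total}
    for (i, j), kd in K.items():
        partners[i].append((j, kd))
        partners[j].append((i, kd))
    free = dict(total)  # start from "nothing bound"
    for _ in range(max_iter):
        new = {
            p: total[p] / (1.0 + sum(free[q] / kd for q, kd in partners[p]))
            for p in total
        }
        if max(abs(new[p] - free[p]) for p in total) < tol:
            return new
        free = new
    return free

# Toy system (hypothetical values): two proteins, equal totals, K = 1.
free = free_concentrations({"A": 1.0, "B": 1.0}, {("A", "B"): 1.0})
dimer = free["A"] * free["B"] / 1.0  # law of mass action
```

For this toy pair the free concentration solves F^2 + F - 1 = 0, and the free plus bound amounts add back up to the total, which is a quick consistency check on any solution.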
Transport According to GARP: Receiving Retrograde Cargo at the Trans-Golgi Network
Bonifacino, Juan S.; Hierro, Aitor
2010-01-01
Tethering factors are large protein complexes that capture transport vesicles and enable their fusion with acceptor organelles at different stages of the endomembrane system. Recent studies have shed new light on the structure and function of a heterotetrameric tethering factor named Golgi-associated retrograde protein (GARP), which promotes fusion of endosome-derived, retrograde transport carriers to the trans-Golgi network (TGN). X-ray crystallography of the Vps53 and Vps54 subunits of GARP has revealed that this complex is structurally related to other tethering factors such as the exocyst, COG and Dsl1, indicating that they all might work by a similar mechanism. Loss of GARP function compromises the growth, fertility and/or viability of the defective organisms, underscoring the essential nature of GARP-mediated retrograde transport. PMID:21183348
Enzyme Sequestration as a Tuning Point in Controlling Response Dynamics of Signalling Networks
Ollivier, Julien F.; Soyer, Orkun S.
2016-01-01
Signalling networks result from combinatorial interactions among many enzymes and scaffolding proteins. These complex systems generate response dynamics that are often essential for correct decision-making in cells. Uncovering biochemical design principles that underpin such response dynamics is a prerequisite to understand evolved signalling networks and to design synthetic ones. Here, we use in silico evolution to explore the possible biochemical design space for signalling networks displaying ultrasensitive and adaptive response dynamics. By running evolutionary simulations mimicking different biochemical scenarios, we find that enzyme sequestration emerges as a key mechanism for enabling such dynamics. Inspired by these findings, and to test the role of sequestration, we design a generic, minimalist model of a signalling cycle, featuring two enzymes and a single scaffolding protein. We show that this simple system is capable of displaying both ultrasensitive and adaptive response dynamics. Furthermore, we find that tuning the concentration or kinetics of the sequestering protein can shift system dynamics between these two response types. These empirical results suggest that enzyme sequestration through scaffolding proteins is exploited by evolution to generate diverse response dynamics in signalling networks and could provide an engineering point in synthetic biology applications. PMID:27163612
Investigations of photosynthetic light harvesting by two-dimensional electronic spectroscopy
NASA Astrophysics Data System (ADS)
Read, Elizabeth Louise
Photosynthesis begins with the harvesting of sunlight by antenna pigments, organized in a network of pigment-protein complexes that rapidly funnel energy to photochemical reaction centers. The intricate design of these systems---the widely varying structural motifs of pigment organization within proteins and protein organization within a larger, cooperative network---underlies the remarkable speed and efficiency of light harvesting. Advances in femtosecond laser spectroscopy have enabled researchers to follow light energy on its course through the energetic levels of photosynthetic systems. Now, newly-developed femtosecond two-dimensional electronic spectroscopy reveals deeper insight into the fundamental molecular interactions and dynamics that emerge in these structures. The following chapters present investigations of a number of natural light-harvesting complexes using two-dimensional electronic spectroscopy. These studies demonstrate the various types of information contained in experimental two-dimensional spectra, and they show that the technique makes it possible to probe pigment-protein complexes on the length- and time-scales relevant to their functioning. New methods are described that further extend the capabilities of two-dimensional electronic spectroscopy, for example, by independently controlling the excitation laser pulse polarizations. The experiments, coupled with theoretical simulation, elucidate spatial pathways of energy flow, unravel molecular and electronic structures, and point to potential new quantum mechanical mechanisms of light harvesting.
Exploring network operations for data and information networks
NASA Astrophysics Data System (ADS)
Yao, Bing; Su, Jing; Ma, Fei; Wang, Xiaomin; Zhao, Xiyang; Yao, Ming
2017-01-01
Barabási and Albert, in 1999, formulated scale-free models based on several real networks: the World-Wide Web, the Internet, metabolic and protein networks, and language or sexual networks. Scale-free networks not only appear around us, but also have high qualities in the world. As is known, high-quality information networks can transfer data feasibly and efficiently, and their topological structures are clearly very important for data safety. We build up network operations for constructing large-scale dynamic networks from smaller-scale network models that have good properties and high quality. We focus on the simplest operators to formulate complex operations, and are interested in the closeness of operations to desired network properties.
Biclustering Protein Complex Interactions with a Biclique Finding Algorithm
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ding, Chris; Zhang, Anne Ya; Holbrook, Stephen
2006-12-01
Biclustering has many applications in text mining, web clickstream mining, and bioinformatics. When data entries are binary, the tightest biclusters become bicliques. We propose a flexible and highly efficient algorithm to compute bicliques. We first generalize the Motzkin-Straus formalism for computing the maximal clique from an L_1 constraint to an L_p constraint, which enables us to provide a generalized Motzkin-Straus formalism for computing maximal-edge bicliques. By adjusting parameters, the algorithm can favor biclusters with more rows and fewer columns, or vice versa, thus increasing the flexibility of the targeted biclusters. We then propose an algorithm to solve the generalized Motzkin-Straus optimization problem. The algorithm is provably convergent and has a computational complexity of O(|E|), where |E| is the number of edges. It relies on a matrix-vector multiplication and runs efficiently on most current computer architectures. Using this algorithm, we bicluster the yeast protein complex interaction network. We find that biclustering protein complexes at the protein level does not clearly reflect the functional linkage among protein complexes in many cases, while biclustering at the protein domain level can reveal many underlying linkages. We show several new biologically significant results.
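For orientation, the classical (L_1-constrained) Motzkin-Straus result that the abstract generalizes states that the maximum of x^T A x over the probability simplex equals 1 - 1/k, where k is the size of the largest clique, and that the maximum is attained by spreading weight uniformly over such a clique. The sketch below only evaluates the quadratic form at uniform clique indicators on a hypothetical four-node graph; it does not implement the generalized L_p biclique algorithm of the paper.

```python
def quadratic_form(adj, x):
    """x^T A x for a symmetric 0/1 adjacency given as dict node -> set of neighbours."""
    return sum(x[i] * x[j] for i in adj for j in adj[i])

def clique_indicator(nodes, clique):
    """Uniform weights 1/|clique| on the clique's nodes, 0 elsewhere (sums to 1)."""
    return {n: (1.0 / len(clique) if n in clique else 0.0) for n in nodes}

# Hypothetical graph: a triangle {0, 1, 2} with a pendant node 3 attached to 2.
graph = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2}}

value_triangle = quadratic_form(graph, clique_indicator(graph, {0, 1, 2}))
value_edge = quadratic_form(graph, clique_indicator(graph, {2, 3}))
```

The triangle gives 1 - 1/3 and the single edge gives 1 - 1/2, so larger cliques score higher, which is what lets a continuous optimizer stand in for a combinatorial clique search.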
OmicsNet: a web-based tool for creation and visual analysis of biological networks in 3D space.
Zhou, Guangyan; Xia, Jianguo
2018-06-07
Biological networks play increasingly important roles in omics data integration and systems biology. Over the past decade, many excellent tools have been developed to support creation, analysis and visualization of biological networks. However, important limitations remain: most tools are standalone programs, the majority of them focus on protein-protein interaction (PPI) or metabolic networks, and visualizations often suffer from 'hairball' effects when networks become large. To help address these limitations, we developed OmicsNet - a novel web-based tool that allows users to easily create different types of molecular interaction networks and visually explore them in a three-dimensional (3D) space. Users can upload one or multiple lists of molecules of interest (genes/proteins, microRNAs, transcription factors or metabolites) to create and merge different types of biological networks. The 3D network visualization system was implemented using the powerful Web Graphics Library (WebGL) technology that works natively in most major browsers. OmicsNet supports force-directed layout, multi-layered perspective layout, as well as spherical layout to help visualize and navigate complex networks. A rich set of functions have been implemented to allow users to perform coloring, shading, topology analysis, and enrichment analysis. OmicsNet is freely available at http://www.omicsnet.ca.
Detection of Significant Pneumococcal Meningitis Biomarkers by Ego Network.
Wang, Qian; Lou, Zhifeng; Zhai, Liansuo; Zhao, Haibin
2017-06-01
To identify significant biomarkers for detection of pneumococcal meningitis based on ego network. Based on the gene expression data of pneumococcal meningitis and global protein-protein interactions (PPIs) data recruited from open access databases, the authors constructed a differential co-expression network (DCN) to identify pneumococcal meningitis biomarkers in a network view. Here EgoNet algorithm was employed to screen the significant ego networks that could accurately distinguish pneumococcal meningitis from healthy controls, by sequentially seeking ego genes, searching candidate ego networks, refinement of candidate ego networks and significance analysis to identify ego networks. Finally, the functional inference of the ego networks was performed to identify significant pathways for pneumococcal meningitis. By differential co-expression analysis, the authors constructed the DCN that covered 1809 genes and 3689 interactions. From the DCN, a total of 90 ego genes were identified. Starting from these ego genes, three significant ego networks (Module 19, Module 70 and Module 71) that could predict clinical outcomes for pneumococcal meningitis were identified by EgoNet algorithm, and the corresponding ego genes were GMNN, MAD2L1 and TPX2, respectively. Pathway analysis showed that these three ego networks were related to CDT1 association with the CDC6:ORC:origin complex, inactivation of APC/C via direct inhibition of the APC/C complex pathway, and DNA strand elongation, respectively. The authors successfully screened three significant ego modules which could accurately predict the clinical outcomes for pneumococcal meningitis and might play important roles in host response to pathogen infection in pneumococcal meningitis.
Improving prediction of heterodimeric protein complexes using combination with pairwise kernel.
Ruan, Peiying; Hayashida, Morihiro; Akutsu, Tatsuya; Vert, Jean-Philippe
2018-02-19
Since many proteins become functional only after they interact with their partner proteins and form protein complexes, it is essential to identify the sets of proteins that form complexes. Therefore, several computational methods have been proposed to predict complexes from the topology and structure of experimental protein-protein interaction (PPI) networks. These methods work well to predict complexes involving at least three proteins, but generally fail at identifying complexes involving only two different proteins, called heterodimeric complexes or heterodimers. There is however an urgent need for efficient methods to predict heterodimers, since the majority of known protein complexes are precisely heterodimers. In this paper, we use three promising kernel functions, Min kernel and two pairwise kernels, which are Metric Learning Pairwise Kernel (MLPK) and Tensor Product Pairwise Kernel (TPPK). We also consider the normalization forms of Min kernel. Then, we combine Min kernel or its normalization form and one of the pairwise kernels by plugging. We applied kernels based on PPI, domain, phylogenetic profile, and subcellular localization properties to predicting heterodimers. Then, we evaluate our method by employing C-Support Vector Classification (C-SVC), carrying out 10-fold cross-validation, and calculating the average F-measures. The results suggest that the combination of normalized-Min-kernel and MLPK leads to the best F-measure and improved the performance of our previous work, which had been the best existing method so far. We propose new methods to predict heterodimers, using a machine learning-based approach. We train a support vector machine (SVM) to discriminate interacting vs non-interacting protein pairs, based on information extracted from PPI, domain, phylogenetic profiles and subcellular localization. We evaluate in detail new kernel functions to encode these data, and report prediction performance that outperforms the state-of-the-art.
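The abstract above names the Min kernel, TPPK and MLPK without giving formulas. The following is a minimal Python sketch of the commonly used textbook forms of these kernels, shown only to illustrate what "plugging" a base kernel into a pairwise kernel means. The feature vectors, the function names, and the choice of these exact formulas are assumptions for illustration; the paper's own definitions and normalization may differ.

```python
from typing import Callable, Sequence

Vector = Sequence[float]
Kernel = Callable[[Vector, Vector], float]


def min_kernel(x: Vector, y: Vector) -> float:
    """Min kernel: sum of element-wise minima of two non-negative vectors."""
    return sum(min(a, b) for a, b in zip(x, y))


def tppk(k: Kernel, a: Vector, b: Vector, c: Vector, d: Vector) -> float:
    """Tensor product pairwise kernel between the pairs (a, b) and (c, d).

    The base kernel k is "plugged in"; the result is symmetric under
    swapping the two members of either pair.
    """
    return k(a, c) * k(b, d) + k(a, d) * k(b, c)


def mlpk(k: Kernel, a: Vector, b: Vector, c: Vector, d: Vector) -> float:
    """Metric learning pairwise kernel between the pairs (a, b) and (c, d)."""
    return (k(a, c) - k(a, d) - k(b, c) + k(b, d)) ** 2


if __name__ == "__main__":
    # Hypothetical feature vectors for four proteins (illustrative values only).
    p1, p2 = [1.0, 0.0, 2.0], [0.0, 1.0, 1.0]
    p3, p4 = [1.0, 1.0, 0.0], [2.0, 0.0, 1.0]
    print(tppk(min_kernel, p1, p2, p3, p4))
    print(mlpk(min_kernel, p1, p2, p3, p4))
```

A pairwise kernel value computed this way could then be supplied to an SVM as a precomputed kernel matrix over protein pairs.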
Knowledge Discovery in Spectral Data by Means of Complex Networks
Zanin, Massimiliano; Papo, David; Solís, José Luis González; Espinosa, Juan Carlos Martínez; Frausto-Reyes, Claudio; Anda, Pascual Palomares; Sevilla-Escoboza, Ricardo; Boccaletti, Stefano; Menasalvas, Ernestina; Sousa, Pedro
2013-01-01
In the last decade, complex networks have widely been applied to the study of many natural and man-made systems, and to the extraction of meaningful information from the interaction structures created by genes and proteins. Nevertheless, less attention has been devoted to metabonomics, due to the lack of a natural network representation of spectral data. Here we define a technique for reconstructing networks from spectral data sets, where nodes represent spectral bins, and pairs of them are connected when their intensities follow a pattern associated with a disease. The structural analysis of the resulting network can then be used to feed standard data-mining algorithms, for instance for the classification of new (unlabeled) subjects. Furthermore, we show how the structure of the network is resilient to the presence of external additive noise, and how it can be used to extract relevant knowledge about the development of the disease. PMID:24957895
Knowledge discovery in spectral data by means of complex networks.
Zanin, Massimiliano; Papo, David; Solís, José Luis González; Espinosa, Juan Carlos Martínez; Frausto-Reyes, Claudio; Anda, Pascual Palomares; Sevilla-Escoboza, Ricardo; Jaimes-Reategui, Rider; Boccaletti, Stefano; Menasalvas, Ernestina; Sousa, Pedro
2013-03-11
In the last decade, complex networks have widely been applied to the study of many natural and man-made systems, and to the extraction of meaningful information from the interaction structures created by genes and proteins. Nevertheless, less attention has been devoted to metabonomics, due to the lack of a natural network representation of spectral data. Here we define a technique for reconstructing networks from spectral data sets, where nodes represent spectral bins, and pairs of them are connected when their intensities follow a pattern associated with a disease. The structural analysis of the resulting network can then be used to feed standard data-mining algorithms, for instance for the classification of new (unlabeled) subjects. Furthermore, we show how the structure of the network is resilient to the presence of external additive noise, and how it can be used to extract relevant knowledge about the development of the disease.
NASA Astrophysics Data System (ADS)
Oświęcimka, Paweł; Livi, Lorenzo; Drożdż, Stanisław
2016-10-01
We investigate the scaling of the cross-correlations calculated for two-variable time series containing vertex properties in the context of complex networks. Time series of such observables are obtained by means of stationary, unbiased random walks. We consider three vertex properties that provide, respectively, short-, medium-, and long-range information regarding the topological role of vertices in a given network. In order to reveal the relation between these quantities, we applied the multifractal cross-correlation analysis technique, which provides information about the nonlinear effects in coupling of time series. We show that the considered network models are characterized by unique multifractal properties of the cross-correlation. In particular, it is possible to distinguish between Erdös-Rényi, Barabási-Albert, and Watts-Strogatz networks on the basis of fractal cross-correlation. Moreover, the analysis of protein contact networks reveals characteristics shared with both scale-free and small-world models.
Systems-level analysis of risk genes reveals the modular nature of schizophrenia.
Liu, Jiewei; Li, Ming; Luo, Xiong-Jian; Su, Bing
2018-05-19
Schizophrenia (SCZ) is a complex mental disorder with high heritability. Genetic studies (especially recent genome-wide association studies) have identified many risk genes for schizophrenia. However, the physical interactions among the proteins encoded by schizophrenia risk genes remain elusive and it is not known whether the identified risk genes converge on common molecular networks or pathways. Here we systematically investigated the network characteristics of schizophrenia risk genes using the high-confidence protein-protein interactions (PPI) from the human interactome. We found that schizophrenia risk genes encode a densely interconnected PPI network (P = 4.15 × 10^-31). Compared with the background genes, the schizophrenia risk genes in the interactome have significantly higher degree (P = 5.39 × 10^-11), closeness centrality (P = 7.56 × 10^-11), betweenness centrality (P = 1.29 × 10^-11), clustering coefficient (P = 2.22 × 10^-2), and shorter average shortest path length (P = 7.56 × 10^-11). Based on the densely interconnected PPI network, we identified 48 hub genes and 4 modules formed by highly interconnected schizophrenia genes. We showed that the proteins encoded by schizophrenia hub genes have significantly more direct physical interactions. Gene ontology (GO) analysis revealed that cell adhesion, cell cycle, immune system response, and GABR-receptor complex categories were enriched in the modules formed by highly interconnected schizophrenia risk genes. Our study reveals that schizophrenia risk genes encode a densely interconnected molecular network and demonstrates the modular nature of schizophrenia. Copyright © 2018 Elsevier B.V. All rights reserved.
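Several of the topological measures compared in the abstract above (degree, closeness centrality, clustering coefficient) have simple standard definitions. The sketch below computes them on a toy undirected graph using only the Python standard library. The graph, the node labels and the function names are illustrative assumptions; it is not the authors' pipeline, and betweenness centrality is omitted for brevity.

```python
from collections import deque


def build_graph(edges):
    """Build an undirected adjacency map {node: set(neighbors)} from edge pairs."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def degree(adj, node):
    """Number of direct interaction partners."""
    return len(adj[node])


def clustering_coefficient(adj, node):
    """Fraction of pairs of neighbors of `node` that are themselves connected."""
    nbrs = list(adj[node])
    k = len(nbrs)
    if k < 2:
        return 0.0
    links = sum(
        1
        for i in range(k)
        for j in range(i + 1, k)
        if nbrs[j] in adj[nbrs[i]]
    )
    return 2.0 * links / (k * (k - 1))


def closeness(adj, node):
    """(Reachable nodes) / (sum of shortest-path lengths), via breadth-first search."""
    dist = {node: 0}
    queue = deque([node])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    total = sum(dist.values())
    return (len(dist) - 1) / total if total else 0.0


if __name__ == "__main__":
    # Toy network: a triangle A-B-C with a pendant node D attached to C.
    g = build_graph([("A", "B"), ("A", "C"), ("B", "C"), ("C", "D")])
    for n in sorted(g):
        print(n, degree(g, n), clustering_coefficient(g, n), closeness(g, n))
```

In a comparison like the one described, such per-node values for the risk genes would be contrasted with those of background genes using a statistical test, which is not shown here.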
Principles of Protein Recognition and Properties of Protein-protein Interfaces
NASA Astrophysics Data System (ADS)
Keskin, Ozlem; Gursoy, Attila; Nussinov, Ruth
In this chapter we address two aspects - the static physical interactions which allow the information transfer for the function to be performed; and the dynamic, i.e. how the information is transmitted between the binding sites in the single protein molecule and in the network. We describe the single protein molecules and their complexes; and the analogy between protein folding and protein binding. Eventually, to fully understand the interactome and how it performs the essential cellular functions, we have to put all together - and hierarchically progress through the system.
Grappling with the HOX network in hematopoiesis and leukemia.
McGonigle, Glenda J; Lappin, Terence R J; Thompson, Alexander
2008-05-01
The mammalian HOX gene network encodes a family of proteins which act as master regulators of developmental processes such as embryogenesis and hematopoiesis. The complex arrangement, regulation and co-factor association of HOX has been an area of intense research, particularly in cancer biology, for over a decade. The concept of redeployment of embryonic regulators in the neoplastic arena has received support from many quarters. Observations of altered HOX gene expression in various solid tumours and leukemia appear to support the thesis that 'oncology recapitulates ontogeny' but the identification of critical HOX subsets and their functional role in cancer onset and maintenance requires further investigation. The application of novel techniques and model systems will continue to enhance our understanding of the HOX network in the years to come. Better understanding of the intricacy of the complex as well as identification of functional pathways and direct targets of the encoded proteins will permit harnessing of this family of genes for clinical application.
Prediction of C. elegans Longevity Genes by Human and Worm Longevity Networks
de Magalhães, João Pedro; Ruvkun, Gary; Fraifeld, Vadim E.; Curran, Sean P.
2012-01-01
Intricate and interconnected pathways modulate longevity, but screens to identify the components of these pathways have not been saturating. Because biological processes are often executed by protein complexes and fine-tuned by regulatory factors, the first-order protein-protein interactors of known longevity genes are likely to participate in the regulation of longevity. Data-rich maps of protein interactions have been established for many cardinal organisms such as yeast, worms, and humans. We propose that these interaction maps could be mined for the identification of new putative regulators of longevity. For this purpose, we have constructed longevity networks in both humans and worms. We reasoned that the essential first-order interactors of known longevity-associated genes in these networks are more likely to have longevity phenotypes than randomly chosen genes. We have used C. elegans to determine whether post-developmental inactivation of these essential genes modulates lifespan. Our results suggest that the worm and human longevity networks are functionally relevant and possess a high predictive power for identifying new longevity regulators. PMID:23144747
Mallik, Moushami; Lakhotia, Subhash C
2010-12-01
Polyglutamine (polyQ) diseases, resulting from a dynamic expansion of glutamine repeats in a polypeptide, are a class of genetically inherited late onset neurodegenerative disorders which, despite expression of the mutated gene widely in brain and other tissues, affect defined subpopulations of neurons in a disease-specific manner. We briefly review the different polyQ-expansion-induced neurodegenerative disorders and the advantages of modelling them in Drosophila. Studies using the fly models have successfully identified a variety of genetic modifiers and have helped in understanding some of the molecular events that follow expression of the abnormal polyQ proteins. Expression of the mutant polyQ proteins causes, as a consequence of intra-cellular and inter-cellular networking, mis-regulation at multiple steps like transcriptional and posttranscriptional regulations, cell signalling, protein quality control systems (protein folding and degradation networks), axonal transport machinery etc., in the sensitive neurons, resulting ultimately in their death. The diversity of genetic modifiers of polyQ toxicity identified through extensive genetic screens in fly and other models clearly reflects a complex network effect of the presence of the mutated protein. Such network effects pose a major challenge for therapeutic applications.
Network-based prediction and knowledge mining of disease genes.
Carson, Matthew B; Lu, Hui
2015-01-01
In recent years, high-throughput protein interaction identification methods have generated a large amount of data. When combined with the results from other in vivo and in vitro experiments, a complex set of relationships between biological molecules emerges. The growing popularity of network analysis and data mining has allowed researchers to recognize indirect connections between these molecules. Due to the interdependent nature of network entities, evaluating proteins in this context can reveal relationships that may not otherwise be evident. We examined the human protein interaction network as it relates to human illness using the Disease Ontology. After calculating several topological metrics, we trained an alternating decision tree (ADTree) classifier to identify disease-associated proteins. Using a bootstrapping method, we created a tree to highlight conserved characteristics shared by many of these proteins. Subsequently, we reviewed a set of non-disease-associated proteins that were misclassified by the algorithm with high confidence and searched for evidence of a disease relationship. Our classifier was able to predict disease-related genes with 79% area under the receiver operating characteristic (ROC) curve (AUC), which indicates the tradeoff between sensitivity and specificity and is a good predictor of how a classifier will perform on future data sets. We found that a combination of several network characteristics including degree centrality, disease neighbor ratio, eccentricity, and neighborhood connectivity help to distinguish between disease- and non-disease-related proteins. Furthermore, the ADTree allowed us to understand which combinations of strongly predictive attributes contributed most to protein-disease classification. In our post-processing evaluation, we found several examples of potential novel disease-related proteins and corresponding literature evidence. 
In addition, we showed that first- and second-order neighbors in the PPI network could be used to identify likely disease associations. We analyzed the human protein interaction network and its relationship to disease and found that both the number of interactions with other proteins and the disease relationship of neighboring proteins helped to determine whether a protein had a relationship to disease. Our classifier predicted many proteins with no annotated disease association to be disease-related, which indicated that these proteins have network characteristics that are similar to disease-related proteins and may therefore have disease associations not previously identified. By performing a post-processing step after the prediction, we were able to identify evidence in literature supporting this possibility. This method could provide a useful filter for experimentalists searching for new candidate protein targets for drug repositioning and could also be extended to include other network and data types in order to refine these predictions.
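The abstract above lists "disease neighbor ratio" and first- and second-order neighbors among its predictive features but does not define them. A minimal sketch under one plausible reading follows: the ratio is taken to be the fraction of a protein's direct interaction partners that carry a disease annotation. That definition, the toy network and the protein identifiers are assumptions for illustration only.

```python
def disease_neighbor_ratio(adj, node, disease_proteins):
    """Fraction of first-order neighbors of `node` annotated as disease-associated."""
    nbrs = adj.get(node, set())
    if not nbrs:
        return 0.0
    return len(nbrs & disease_proteins) / len(nbrs)


def second_order_neighbors(adj, node):
    """Nodes exactly two steps away: neighbors of neighbors, minus closer nodes."""
    first = adj.get(node, set())
    second = set()
    for n in first:
        second |= adj.get(n, set())
    return second - first - {node}


if __name__ == "__main__":
    # Hypothetical interaction network and disease annotations.
    network = {
        "P1": {"P2", "P3", "P4"},
        "P2": {"P1", "P5"},
        "P3": {"P1"},
        "P4": {"P1"},
        "P5": {"P2"},
    }
    disease = {"P2", "P5"}
    print(disease_neighbor_ratio(network, "P1", disease))
    print(second_order_neighbors(network, "P1"))
```

Features of this kind could be placed in a table alongside degree and eccentricity before training a classifier such as the ADTree mentioned in the abstract.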
The topology and dynamics of complex networks
NASA Astrophysics Data System (ADS)
Dezso, Zoltan
We start with a brief introduction about the topological properties of real networks. Most real networks are scale-free, being characterized by a power-law degree distribution. The scale-free nature of real networks leads to unexpected properties such as the vanishing epidemic threshold. Traditional methods aiming to reduce the spreading rate of viruses cannot succeed on eradicating the epidemic on a scale-free network. We demonstrate that policies that discriminate between the nodes, curing mostly the highly connected nodes, can restore a finite epidemic threshold and potentially eradicate the virus. We find that the more biased a policy is towards the hubs, the more chance it has to bring the epidemic threshold above the virus' spreading rate. We continue by studying a large Web portal as a model system for a rapidly evolving network. We find that the visitation pattern of a news document decays as a power law, in contrast with the exponential prediction provided by simple models of site visitation. This is rooted in the inhomogeneous nature of the browsing pattern characterizing individual users: the time interval between consecutive visits by the same user to the site follows a power law distribution, in contrast with the exponential expected for Poisson processes. We show that the exponent characterizing the individual user's browsing patterns determines the power-law decay in a document's visitation. Finally, we turn our attention to biological networks and demonstrate quantitatively that protein complexes in the yeast, Saccharomyces cerevisiae, are comprised of a core in which subunits are highly coexpressed, display the same deletion phenotype (essential or non-essential) and share identical functional classification and cellular localization. The results allow us to define the deletion phenotype and cellular task of most known complexes, and to identify with high confidence the biochemical role of hundreds of proteins with yet unassigned functionality.
Weyhe, Martin; Eschen-Lippold, Lennart; Pecher, Pascal; Scheel, Dierk; Lee, Justin
2014-01-01
Out of the 34 members of the VQ-motif-containing protein (VQP) family, 10 are phosphorylated by the mitogen-activated protein kinases (MAPKs), MPK3 and MPK6. Most of these MPK3/6-targeted VQPs (MVQs) interacted with specific sub-groups of WRKY transcription factors in a VQ-motif-dependent manner. In some cases, the MAPK appears to phosphorylate either the MVQ or the WRKY, while in other cases, both proteins have been reported to act as MAPK substrates. We propose a network of dynamic interactions between members from the MAPK, MVQ and WRKY families - either as binary or as tripartite interactions. The compositions of the WRKY-MVQ transcriptional protein complexes may change - for instance, through MPK3/6-mediated modulation of protein stability - and therefore control defense gene transcription.
Theofilatos, Konstantinos; Pavlopoulou, Niki; Papasavvas, Christoforos; Likothanassis, Spiros; Dimitrakopoulos, Christos; Georgopoulos, Efstratios; Moschopoulos, Charalampos; Mavroudi, Seferina
2015-03-01
Proteins are considered to be the most important individual components of biological systems and they combine to form physical protein complexes which are responsible for certain molecular functions. Despite the large availability of protein-protein interaction (PPI) information, not much information is available about protein complexes. Experimental methods are limited in terms of time, efficiency, cost and performance constraints. Existing computational methods have provided encouraging preliminary results, but they face certain disadvantages as they require parameter tuning, some of them cannot handle weighted PPI data and others do not allow a protein to participate in more than one protein complex. In the present paper, we propose a new fully unsupervised methodology for predicting protein complexes from weighted PPI graphs. The proposed methodology is called evolutionary enhanced Markov clustering (EE-MC) and it is a hybrid combination of an adaptive evolutionary algorithm and a state-of-the-art clustering algorithm named enhanced Markov clustering. EE-MC was compared with state-of-the-art methodologies when applied to datasets from the human and the yeast Saccharomyces cerevisiae organisms. Using publicly available datasets, EE-MC outperformed existing methodologies (in some datasets the separation metric was increased by 10-20%). Moreover, when applied to new human datasets its performance was encouraging in the prediction of protein complexes which consist of proteins with high functional similarity. Specifically, 5737 protein complexes were predicted and 72.58% of them are enriched for at least one gene ontology (GO) function term. EE-MC is by design able to overcome intrinsic limitations of existing methodologies such as their inability to handle weighted PPI networks, their constraint of assigning every protein to exactly one cluster and the difficulties they face concerning parameter tuning. 
This fact was experimentally validated and moreover, new potentially true human protein complexes were suggested as candidates for further validation using experimental techniques. Copyright © 2015 Elsevier B.V. All rights reserved.
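EE-MC, described above, builds on Markov clustering. As background, here is a minimal sketch of the plain Markov clustering (MCL) loop, which alternates expansion (squaring the column-stochastic flow matrix) with inflation (raising entries to a power and renormalizing columns). It does not implement the enhanced or evolutionary parts of EE-MC; the function names, the default inflation of 2.0, the tolerance and the cluster-extraction threshold are all assumptions chosen for illustration.

```python
def normalize_columns(m):
    """Scale each column of a square matrix in place so that it sums to 1."""
    n = len(m)
    for j in range(n):
        s = sum(m[i][j] for i in range(n))
        if s > 0:
            for i in range(n):
                m[i][j] /= s
    return m


def mcl(adjacency, inflation=2.0, iterations=100, tol=1e-9):
    """Cluster an undirected graph given as a (possibly weighted) adjacency matrix."""
    n = len(adjacency)
    # Add self-loops, then turn the matrix into column-stochastic flow.
    m = [
        [float(adjacency[i][j]) + (1.0 if i == j else 0.0) for j in range(n)]
        for i in range(n)
    ]
    normalize_columns(m)
    for _ in range(iterations):
        # Expansion: flow spreads along paths of length two.
        expanded = [
            [sum(m[i][k] * m[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)
        ]
        # Inflation: strong flows are boosted, weak flows are suppressed.
        inflated = normalize_columns(
            [[x ** inflation for x in row] for row in expanded]
        )
        diff = max(
            abs(inflated[i][j] - m[i][j]) for i in range(n) for j in range(n)
        )
        m = inflated
        if diff < tol:
            break
    # Each row that still carries flow is an attractor; its non-zero
    # columns are the members of one cluster.
    clusters = set()
    for i in range(n):
        members = frozenset(j for j in range(n) if m[i][j] > 1e-6)
        if members:
            clusters.add(members)
    return sorted(sorted(c) for c in clusters)


if __name__ == "__main__":
    # Two triangles (0-1-2 and 3-4-5) joined by the single edge 2-3.
    edges = [(0, 1), (0, 2), (1, 2), (2, 3), (3, 4), (3, 5), (4, 5)]
    adj = [[0] * 6 for _ in range(6)]
    for u, v in edges:
        adj[u][v] = adj[v][u] = 1
    print(mcl(adj))
```

Edge weights can be passed in place of the 0/1 entries, which is how a weighted PPI graph would enter this loop. Higher inflation values generally produce smaller, tighter clusters.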
Sato, Masanao; Tsuda, Kenichi; Wang, Lin; Coller, John; Watanabe, Yuichiro; Glazebrook, Jane; Katagiri, Fumiaki
2010-01-01
Biological signaling processes may be mediated by complex networks in which network components and network sectors interact with each other in complex ways. Studies of complex networks benefit from approaches in which the roles of individual components are considered in the context of the network. The plant immune signaling network, which controls inducible responses to pathogen attack, is such a complex network. We studied the Arabidopsis immune signaling network upon challenge with a strain of the bacterial pathogen Pseudomonas syringae expressing the effector protein AvrRpt2 (Pto DC3000 AvrRpt2). This bacterial strain feeds multiple inputs into the signaling network, allowing many parts of the network to be activated at once. mRNA profiles for 571 immune response genes of 22 Arabidopsis immunity mutants and wild type were collected 6 hours after inoculation with Pto DC3000 AvrRpt2. The mRNA profiles were analyzed as detailed descriptions of changes in the network state resulting from the genetic perturbations. Regulatory relationships among the genes corresponding to the mutations were inferred by recursively applying a non-linear dimensionality reduction procedure to the mRNA profile data. The resulting static network model accurately predicted 23 of 25 regulatory relationships reported in the literature, suggesting that predictions of novel regulatory relationships are also accurate. The network model revealed two striking features: (i) the components of the network are highly interconnected; and (ii) negative regulatory relationships are common between signaling sectors. Complex regulatory relationships, including a novel negative regulatory relationship between the early microbe-associated molecular pattern-triggered signaling sectors and the salicylic acid sector, were further validated. 
We propose that prevalent negative regulatory relationships among the signaling sectors make the plant immune signaling network a “sector-switching” network, which effectively balances two apparently conflicting demands, robustness against pathogenic perturbations and moderation of negative impacts of immune responses on plant fitness. PMID:20661428
Transcription initiation complex structures elucidate DNA opening.
Plaschka, C; Hantsche, M; Dienemann, C; Burzinski, C; Plitzko, J; Cramer, P
2016-05-19
Transcription of eukaryotic protein-coding genes begins with assembly of the RNA polymerase (Pol) II initiation complex and promoter DNA opening. Here we report cryo-electron microscopy (cryo-EM) structures of yeast initiation complexes containing closed and open DNA at resolutions of 8.8 Å and 3.6 Å, respectively. DNA is positioned and retained over the Pol II cleft by a network of interactions between the TATA-box-binding protein TBP and transcription factors TFIIA, TFIIB, TFIIE, and TFIIF. DNA opening occurs around the tip of the Pol II clamp and the TFIIE 'extended winged helix' domain, and can occur in the absence of TFIIH. Loading of the DNA template strand into the active centre may be facilitated by movements of obstructing protein elements triggered by allosteric binding of the TFIIE 'E-ribbon' domain. The results suggest a unified model for transcription initiation with a key event, the trapping of open promoter DNA by extended protein-protein and protein-DNA contacts.
Insights into the Specificity of Lysine Acetyltransferases
Tucker, Alex C.; Taylor, Keenan C.; Rank, Katherine C.; ...
2014-11-07
Reversible lysine acetylation by protein acetyltransferases is a conserved regulatory mechanism that controls diverse cellular pathways. Gcn5-related N-acetyltransferases (GNATs), named after their founding member, are found in all domains of life. GNATs are known for their role as histone acetyltransferases, but non-histone bacterial protein acetyltransferases have been identified. Only structures of GNAT complexes with short histone peptide substrates are available in databases. Given the biological importance of this modification and the abundance of lysine in polypeptides, how specificity is attained for larger protein substrates is central to understanding acetyl-lysine-regulated networks. In this paper, we report the structure of a GNAT in complex with a globular protein substrate solved to 1.9 Å. GNAT binds the protein substrate with extensive surface interactions distinct from those reported for GNAT-peptide complexes. Finally, our data reveal determinants needed for the recognition of a protein substrate and provide insight into the specificity of GNATs.
An ensemble framework for clustering protein-protein interaction networks.
Asur, Sitaram; Ucar, Duygu; Parthasarathy, Srinivasan
2007-07-01
Protein-Protein Interaction (PPI) networks are believed to be important sources of information related to biological processes and complex metabolic functions of the cell. The presence of biologically relevant functional modules in these networks has been theorized by many researchers. However, the application of traditional clustering algorithms for extracting these modules has not been successful, largely due to the presence of noisy false positive interactions as well as specific topological challenges in the network. In this article, we propose an ensemble clustering framework to address this problem. For base clustering, we introduce two topology-based distance metrics to counteract the effects of noise. We develop a PCA-based consensus clustering technique, designed to reduce the dimensionality of the consensus problem and yield informative clusters. We also develop a soft consensus clustering variant to assign multifaceted proteins to multiple functional groups. We conduct an empirical evaluation of different consensus techniques using topology-based, information theoretic and domain-specific validation metrics and show that our approaches can provide significant benefits over other state-of-the-art approaches. Our analysis of the consensus clusters obtained demonstrates that ensemble clustering can (a) produce improved biologically significant functional groupings; and (b) facilitate soft clustering by discovering multiple functional associations for proteins. Supplementary data are available at Bioinformatics online.
Cazade, Pierre-André; Berezovska, Ganna; Meuwly, Markus
2015-05-01
The nature of ligand motion in proteins is difficult to characterize directly using experiment. Specifically, it is unclear to what degree these motions are coupled. All-atom simulations are used to sample ligand motion in truncated Hemoglobin N. A transition network analysis including ligand- and protein-degrees of freedom is used to analyze the microscopic dynamics. Clustering of two different subsets of MD trajectories highlights the importance of a diverse and exhaustive description to define the macrostates for a ligand-migration network. Monte Carlo simulations on the transition matrices from one particular clustering are able to faithfully capture the atomistic simulations. Contrary to clustering by ligand positions only, including a protein degree of freedom yields considerably improved coarse grained dynamics. Analysis with and without imposing detailed balance agree closely which suggests that the underlying atomistic simulations are converged with respect to sampling transitions between neighboring sites. Protein and ligand dynamics are not independent from each other and ligand migration through globular proteins is not passive diffusion. Transition network analysis is a powerful tool to analyze and characterize the microscopic dynamics in complex systems. This article is part of a Special Issue entitled Recent developments of molecular dynamics. Copyright © 2014 Elsevier B.V. All rights reserved.
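The abstract above refers to transition matrices built from clustered trajectories, analysed with and without imposing detailed balance. A minimal sketch of how such a matrix can be estimated from a sequence of macrostate labels follows. Symmetrizing the transition counts is used here as one simple way to impose detailed balance; this estimator, the function name and the toy trajectory are assumptions, and the paper's own procedure may differ.

```python
def transition_matrix(traj, n_states, detailed_balance=False):
    """Estimate a row-stochastic transition matrix from a state-label trajectory.

    traj: sequence of integer state labels, one per saved frame.
    detailed_balance: if True, average the forward and backward counts first.
    """
    counts = [[0.0] * n_states for _ in range(n_states)]
    for a, b in zip(traj, traj[1:]):
        counts[a][b] += 1.0
    if detailed_balance:
        counts = [
            [(counts[i][j] + counts[j][i]) / 2.0 for j in range(n_states)]
            for i in range(n_states)
        ]
    matrix = []
    for row in counts:
        total = sum(row)
        matrix.append([c / total if total else 0.0 for c in row])
    return matrix


if __name__ == "__main__":
    # Hypothetical trajectory over three ligand sites labelled 0, 1 and 2.
    traj = [0, 0, 1, 1, 0, 2, 2, 0]
    for row in transition_matrix(traj, 3):
        print(row)
```

Close agreement between the matrices obtained with and without the symmetrization step is the kind of check the abstract describes as evidence that transitions between neighboring sites are well sampled.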
Computational prediction of protein-protein interactions in Leishmania predicted proteomes.
Rezende, Antonio M; Folador, Edson L; Resende, Daniela de M; Ruiz, Jeronimo C
2012-01-01
The trypanosomatid parasites Leishmania braziliensis, Leishmania major and Leishmania infantum are important human pathogens. Despite years of study and the availability of their genomes, no effective vaccine has been developed, and the available chemotherapy is highly toxic. Therefore, only integrated interdisciplinary studies are likely to succeed in finding new targets for vaccine and drug development. An essential part of this rationale is the study of protein-protein interaction (PPI) networks, which can provide a better understanding of complex protein interactions in biological systems. Thus, we modeled PPIs for trypanosomatids through computational methods, using sequence comparison against public databases of protein or domain interactions for interaction prediction (Interolog Mapping), and developed a dedicated combined system score to address the robustness of the predictions. The confidence of the network prediction approach was evaluated using gold standard positive and negative datasets, and the AUC value obtained was 0.94. As a result, 39,420, 43,531 and 45,235 interactions were predicted for L. braziliensis, L. major and L. infantum, respectively. For each predicted network, the top 20 proteins were ranked by the MCC topological index. In addition, information related to immunological potential, the degree of protein sequence conservation among orthologs and the degree of identity compared to proteins of potential parasite hosts was integrated. This information integration provides a better understanding and usefulness of the predicted networks, which can be valuable for selecting new potential biological targets for drug and vaccine development. Network modularity, which is key when one is interested in destabilizing the PPIs for drug or vaccine purposes, was analyzed along with multiple alignments of the predicted PPIs, revealing patterns associated with protein turnover. In addition, around 50% of the hypothetical proteins present in the networks received some degree of functional annotation, which represents an important contribution since approximately 60% of the Leishmania predicted proteomes have no predicted function.
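The AUC evaluation against gold standard positive and negative sets can be sketched as a pairwise ranking statistic: the probability that a randomly chosen positive interaction scores higher than a randomly chosen negative one. The scores below are invented for illustration and do not come from the study.

```python
def auc(pos_scores, neg_scores):
    """Area under the ROC curve as the fraction of (positive, negative)
    pairs ranked correctly; ties count as half."""
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


# Hypothetical combined scores for gold standard interactions.
positives = [0.9, 0.8, 0.4]   # known interacting pairs
negatives = [0.5, 0.3, 0.1]   # pairs assumed not to interact

value = auc(positives, negatives)  # 8 of 9 pairs are ranked correctly
```

A value of 1.0 means every positive outranks every negative, and 0.5 corresponds to random ranking.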
Host-pathogen interaction in Fusarium oxysporum infections: where do we stand?
Husaini, Amjad M; Sakina, Aafreen; Cambay, Souliha R
2018-03-16
Fusarium oxysporum, a ubiquitous soil-borne pathogen, causes devastating vascular wilt in more than 100 plant species and ranks fifth among the top ten fungal plant pathogens. It has emerged as a human pathogen too, causing infections in immunocompromised patients. It is, therefore, important to gain insight into the molecular processes involved in the pathogenesis of this trans-kingdom pathogen. A complex network of interconnected and overlapping signal pathways, including mitogen-activated protein kinase (MAPK) signaling pathways, Ras proteins, G-protein signaling components and their downstream pathways, components of the velvet (LaeA/VeA/VelB) complex and cAMP pathways, is involved in perceiving the host. This network regulates the expression of various pathogenicity genes. Plants have, however, evolved an elaborate protection system to combat this attack. They too possess intricate mechanisms at the molecular level, which, once triggered by pathogen attack, transduce signals to activate the defense response. This review focuses on understanding and presenting a holistic picture of the molecular mechanisms of F. oxysporum-host interactions in plant immunity.
Impact of the Usher syndrome on olfaction.
Jansen, Fabian; Kalbe, Benjamin; Scholz, Paul; Mikosz, Marta; Wunderlich, Kirsten A; Kurtenbach, Stefan; Nagel-Wolfrum, Kerstin; Wolfrum, Uwe; Hatt, Hanns; Osterloh, Sabrina
2016-02-01
Usher syndrome is a genetically and clinically heterogeneous disease in humans, characterized by sensorineural hearing loss, retinitis pigmentosa and vestibular dysfunction. This disease is caused by mutations in genes encoding proteins that form complex networks in different cellular compartments. Currently, it remains unclear whether the Usher proteins also form networks within the olfactory epithelium (OE). Here, we describe Usher gene expression at the mRNA and protein level in the OE of mice and show interactions between these proteins and olfactory signaling proteins. Additionally, we analyzed the odor sensitivity of different Usher syndrome mouse models using electro-olfactogram recordings and detected significant changes in the odor detection capabilities of mice expressing mutant Usher proteins. Furthermore, we observed changes in the expression of signaling proteins that might compensate for the Usher protein deficiency. In summary, this study provides novel insights into the presence and purpose of the Usher proteins in olfactory signal transduction. © The Author 2015. Published by Oxford University Press. All rights reserved.
NASA Astrophysics Data System (ADS)
Christensen, Claire Petra
Across diverse fields ranging from physics to biology, sociology, and economics, the technological advances of the past decade have engendered an unprecedented explosion of data on highly complex systems with thousands, if not millions of interacting components. These systems exist at many scales of size and complexity, and it is becoming ever-more apparent that they are, in fact, universal, arising in every field of study. Moreover, they share fundamental properties---chief among these, that the individual interactions of their constituent parts may be well-understood, but the characteristic behaviour produced by the confluence of these interactions---by these complex networks---is unpredictable; in a nutshell, the whole is more than the sum of its parts. There is, perhaps, no better illustration of this concept than the discoveries being made regarding complex networks in the biological sciences. In particular, though the sequencing of the human genome in 2003 was a remarkable feat, scientists understand that the "cellular-level blueprints" for the human being are cellular-level parts lists, but they say nothing (explicitly) about cellular-level processes. The challenge of modern molecular biology is to understand these processes in terms of the networks of parts---in terms of the interactions among proteins, enzymes, genes, and metabolites---as it is these processes that ultimately differentiate animate from inanimate, giving rise to life! It is the goal of systems biology---an umbrella field encapsulating everything from molecular biology to epidemiology in social systems---to understand processes in terms of fundamental networks of core biological parts, be they proteins or people. By virtue of the fact that there are literally countless complex systems, not to mention tools and techniques used to infer, simulate, analyze, and model these systems, it is impossible to give a truly comprehensive account of the history and study of complex systems. 
The author's own publications have contributed network inference, simulation, modeling, and analysis methods to the much larger body of work in systems biology, and indeed, in network science. The aim of this thesis is therefore twofold: to present this original work in the historical context of network science, but also to provide sufficient review and reference regarding complex systems (with an emphasis on complex networks in systems biology) and tools and techniques for their inference, simulation, analysis, and modeling, such that the reader will be comfortable in seeking out further information on the subject. The review-like Chapters 1, 2, and 4 are intended to convey the co-evolution of network science and the slow but noticeable breakdown of boundaries between disciplines in academia as research and comparison of diverse systems has brought to light the shared properties of these systems. It is the author's hope that these chapters impart some sense of the remarkable and rapid progress in complex systems research that has led to this unprecedented academic synergy. Chapters 3 and 5 detail the author's original work in the context of complex systems research. Chapter 3 presents the methods and results of a two-stage modeling process that generates candidate gene-regulatory networks of the bacterium B. subtilis from experimentally obtained, yet mathematically underdetermined microchip array data. These networks are then analyzed from a graph theoretical perspective, and their biological viability is critiqued by comparing the networks' graph theoretical properties to those of other biological systems. The results of topological perturbation analyses revealing commonalities in behavior at multiple levels of complexity are also presented, and are shown to be an invaluable means by which to ascertain the level of complexity to which the network inference process is robust to noise. 
Chapter 5 outlines a learning algorithm for the development of a realistic, evolving social network (a city) into which a disease is introduced. The results of simulations in populations spanning two orders of magnitude are compared to prevaccine era measles data for England and Wales and demonstrate that the simulations are able to capture the quantitative and qualitative features of epidemics in populations as small as 10,000 people. The work presented in Chapter 5 validates the utility of network simulation in concurrently probing contact network dynamics and disease dynamics.
Dautel, Franziska; Kalkhof, Stefan; Trump, Saskia; Michaelson, Jacob; Beyer, Andreas; Lehmann, Irina; von Bergen, Martin
2011-02-04
Although the effects of high concentrations of the carcinogen benzo[a]pyrene (B[a]P) have been studied extensively, little is known about its effects at subacute toxic concentrations, which are typical for environmental pollutants. We exposed murine Hepa1c1c7 cells to a toxic concentration (5 μM) and a subacute concentration (50 nM) of B[a]P over a period of 2-24 h to differentiate between acute and pseudochronic effects and conducted a time-course analysis of B[a]P-influenced protein expression by DIGE. In total, a set of 120 spots were found to be significantly altered due to B[a]P exposure of which 112 were subsequently identified by mass spectrometry. Clustering and principal component analysis were conducted to identify sets of proteins responding in a concerted manner to the exposure. Our results indicate an immediate response to the contaminant at the protein level and demonstrate that B[a]P exposure alters the cellular response by disturbing proteins involved in oxidative stress, cell cycle regulation, apoptosis, and cytoskeleton organization. Furthermore, network analysis of protein-protein interactions revealed a complex network of interacting, B[a]P-regulated proteins mostly belonging to the cytoskeleton organization and several signal transduction pathways.
Cooperation of TOM and TIM23 complexes during translocation of proteins into mitochondria.
Waegemann, Karin; Popov-Čeleketić, Dušan; Neupert, Walter; Azem, Abdussalam; Mokranjac, Dejana
2015-03-13
Translocation of the majority of mitochondrial proteins from the cytosol into mitochondria requires the cooperation of TOM and TIM23 complexes in the outer and inner mitochondrial membranes. The molecular mechanisms underlying this cooperation remain largely unknown. Here, we present biochemical and genetic evidence that at least two contacts from the side of the TIM23 complex play an important role in TOM-TIM23 cooperation in vivo. Tim50, likely through its very C-terminal segment, interacts with Tom22. This interaction is stimulated by translocating proteins and is independent of any other TOM-TIM23 contact known so far. Furthermore, the exposure of Tim23 on the mitochondrial surface depends not only on its interaction with Tim50 but also on the dynamics of the TOM complex. Destabilization of the individual contacts reduces the efficiency of import of proteins into mitochondria and destabilization of both contacts simultaneously is not tolerated by yeast cells. We conclude that an intricate and coordinated network of protein-protein interactions involving primarily Tim50 and also Tim23 is required for efficient translocation of proteins across both mitochondrial membranes. Copyright © 2014 Elsevier Ltd. All rights reserved.
Data on the association of the nuclear envelope protein Sun1 with nucleoli.
Moujaber, Ossama; Omran, Nawal; Kodiha, Mohamed; Pié, Brigitte; Cooper, Ellis; Presley, John F; Stochaj, Ursula
2017-08-01
SUN proteins participate in diverse cellular activities, many of which are connected to the nuclear envelope. Recently, the family member SUN1 has been linked to novel biological activities. These include the regulation of nucleoli, intranuclear compartments that assemble ribosomal subunits. We show that SUN1 associates with nucleoli in several mammalian epithelial cell lines. This nucleolar localization is not shared by all cell types, as SUN1 concentrates at the nuclear envelope in ganglionic neurons and non-neuronal satellite cells. Database analyses and Western blotting emphasize the complexity of SUN1 protein profiles in different mammalian cells. We constructed a STRING network which identifies SUN1-related proteins as part of a larger network that includes several nucleolar proteins. Taken together, the current data highlight the diversity of SUN1 proteins and emphasize the possible links between SUN1 and nucleoli.
The HER2 Signaling Network in Breast Cancer--Like a Spider in its Web.
Dittrich, A; Gautrey, H; Browell, D; Tyson-Capper, A
2014-12-01
The human epidermal growth factor receptor 2 (HER2) is a major player in the survival and proliferation of tumour cells and is overexpressed in up to 30% of breast cancer cases. A considerable amount of work has been undertaken to unravel the activity and function of HER2 in order to develop effective therapies that impede its action in HER2-positive breast tumours. Research has focused on exploring the HER2-activated phosphoinositide-3-kinase (PI3K)/AKT and rat sarcoma/mitogen-activated protein kinase (RAS/MAPK) pathways for therapies. Despite the advances, drug resistance and recurrence of disease remain a challenge. An important aspect of drug resistance is the complexity of the HER2 signaling network. This includes the crosstalk between HER2 and hormone receptors; its function as a transcription factor; the regulation of HER2 by protein-tyrosine phosphatases; and a complex network of positive and negative feedback loops. This review summarises the current knowledge of many different HER2 interactions to illustrate the complexity of the HER2 network, from the transcription of HER2 to the effect of its downstream targets. Exploring novel avenues of HER2 signaling could yield a better understanding of treatment resistance and give rise to new and more effective therapies.
Smith, Benjamin A; Padrick, Shae B; Doolittle, Lynda K; Daugherty-Clarke, Karen; Corrêa, Ivan R; Xu, Ming-Qun; Goode, Bruce L; Rosen, Michael K; Gelles, Jeff
2013-09-03
During cell locomotion and endocytosis, membrane-tethered WASP proteins stimulate actin filament nucleation by the Arp2/3 complex. This process generates highly branched arrays of filaments that grow toward the membrane to which they are tethered, a conflict that seemingly would restrict filament growth. Using three-color single-molecule imaging in vitro we revealed how the dynamic associations of Arp2/3 complex with mother filament and WASP are temporally coordinated with initiation of daughter filament growth. We found that WASP proteins dissociated from filament-bound Arp2/3 complex prior to new filament growth. Further, mutations that accelerated release of WASP from filament-bound Arp2/3 complex proportionally accelerated branch formation. These data suggest that while WASP promotes formation of pre-nucleation complexes, filament growth cannot occur until it is triggered by WASP release. This provides a mechanism by which membrane-bound WASP proteins can stimulate network growth without restraining it. DOI:http://dx.doi.org/10.7554/eLife.01008.001.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vasan, Neil; Hutagalung, Alex; Novick, Peter
2010-08-13
The Golgi-associated retrograde protein (GARP) complex is a membrane-tethering complex that functions in traffic from endosomes to the trans-Golgi network. Here we present the structure of a C-terminal fragment of the Vps53 subunit, important for binding endosome-derived vesicles, at a resolution of 2.9 Å. We show that the C terminus consists of two α-helical bundles arranged in tandem, and we identify a highly conserved surface patch, which may play a role in vesicle recognition. Mutations of the surface result in defects in membrane traffic. The fold of the Vps53 C terminus is strongly reminiscent of proteins that belong to three other tethering complexes - Dsl1, conserved oligomeric Golgi, and the exocyst - thought to share a common evolutionary origin. Thus, the structure of the Vps53 C terminus suggests that GARP belongs to this family of complexes.
Proteome-scale human interactomics
Luck, Katja; Sheynkman, Gloria M.; Zhang, Ivy; Vidal, Marc
2017-01-01
Cellular functions are mediated by complex interactome networks of physical, biochemical, and functional interactions between DNA sequences, RNA molecules, proteins, lipids, and small metabolites. A thorough understanding of cellular organization requires accurate and relatively complete models of interactome networks at proteome-scale. The recent publication of four human protein-protein interaction (PPI) maps represents a technological breakthrough and an unprecedented resource for the scientific community, heralding a new era of proteome-scale human interactomics. Our knowledge gained from these and complementary studies provides fresh insights into the opportunities and challenges when analyzing systematically generated interactome data, defines a clear roadmap towards the generation of a first reference interactome, and reveals new perspectives on the organization of cellular life. PMID:28284537
Crosstalk and the evolvability of intracellular communication.
Rowland, Michael A; Greenbaum, Joseph M; Deeds, Eric J
2017-07-10
Metazoan signalling networks are complex, with extensive crosstalk between pathways. It is unclear what pressures drove the evolution of this architecture. We explore the hypothesis that crosstalk allows different cell types, each expressing a specific subset of signalling proteins, to activate different outputs when faced with the same inputs, responding differently to the same environment. We find that the pressure to generate diversity leads to the evolution of networks with extensive crosstalk. Using available data, we find that human tissues exhibit higher levels of diversity between cell types than networks with random expression patterns or networks with no crosstalk. We also find that crosstalk and differential expression can influence drug activity: no protein has the same impact on two tissues when inhibited. In addition to providing a possible explanation for the evolution of crosstalk, our work indicates that consideration of cellular context will likely be crucial for targeting signalling networks.
A Bayesian Active Learning Experimental Design for Inferring Signaling Networks.
Ness, Robert O; Sachs, Karen; Mallick, Parag; Vitek, Olga
2018-06-21
Machine learning methods for learning network structure are applied to quantitative proteomics experiments to reverse-engineer intracellular signal transduction networks. They provide insight into the rewiring of signaling within the context of a disease or a phenotype. To learn the causal patterns of influence between proteins in the network, the methods require experiments that include targeted interventions that fix the activity of specific proteins. However, the interventions are costly and add experimental complexity. We describe an active learning strategy for selecting optimal interventions. Our approach takes as inputs pathway databases and historic data sets, expresses them in the form of prior probability distributions on network structures, and selects interventions that maximize their expected contribution to structure learning. Evaluations on simulated and real data show that the strategy reduces the detection error of validated edges as compared with an unguided choice of interventions and avoids redundant interventions, thereby increasing the effectiveness of the experiment.
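One common way to score candidate interventions in Bayesian active learning is the expected information gain about the network structure. The sketch below uses that criterion as a stand-in; it is not claimed to be the scoring rule of the paper, and the two candidate structures, the interventions and all probabilities are hypothetical.

```python
import math


def entropy(probs):
    """Shannon entropy in bits, ignoring zero-probability outcomes."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def expected_information_gain(prior, likelihoods):
    """Mutual information between structure and experimental outcome.

    prior[s] is the prior probability of structure s;
    likelihoods[s][o] is P(outcome o | structure s) under one intervention.
    """
    n_outcomes = len(likelihoods[0])
    marginal = [sum(prior[s] * likelihoods[s][o] for s in range(len(prior)))
                for o in range(n_outcomes)]
    conditional = sum(prior[s] * entropy(likelihoods[s])
                      for s in range(len(prior)))
    return entropy(marginal) - conditional


# Two hypothetical structures: "A activates B" versus "B activates A".
prior = [0.5, 0.5]

# Outcome 0 = B changes, outcome 1 = B does not change (made-up values).
interventions = {
    "inhibit A": [[0.9, 0.1], [0.1, 0.9]],   # informative about the edge
    "inhibit C": [[0.5, 0.5], [0.5, 0.5]],   # unrelated protein, redundant
}

gains = {name: expected_information_gain(prior, lik)
         for name, lik in interventions.items()}
best = max(gains, key=gains.get)
```

An intervention whose outcome distribution is the same under every candidate structure has zero expected gain, which is one way to recognise a redundant experiment.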
An integrative approach to inferring biologically meaningful gene modules.
Cho, Ji-Hoon; Wang, Kai; Galas, David J
2011-07-26
The ability to construct biologically meaningful gene networks and modules is critical for contemporary systems biology. Though recent studies have demonstrated the power of using gene modules to shed light on the functioning of complex biological systems, most modules in these networks have shown little association with meaningful biological function. We have devised a method which directly incorporates gene ontology (GO) annotation in the construction of gene modules in order to gain better functional association. The method, Semantic Similarity-Integrated approach for Modularization (SSIM), integrates various gene-gene pairwise similarity values, including information obtained from gene expression, protein-protein interactions and GO annotations, in the construction of modules using affinity propagation clustering. We demonstrated the performance of the proposed method using data from two complex biological responses: (1) the osmotic shock response in Saccharomyces cerevisiae, and (2) the prion-induced pathogenic mouse model. In comparison with two previously reported algorithms, modules identified by SSIM showed significantly stronger association with biological functions. The incorporation of semantic similarity based on GO annotation with gene expression and protein-protein interaction data can greatly enhance the functional relevance of inferred gene modules. In addition, the SSIM approach can also reveal the hierarchical structure of gene modules to gain a broader functional view of the biological system. Hence, the proposed method can facilitate comprehensive and in-depth analysis of high-throughput experimental data at the gene network level.
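The integration of several pairwise similarity sources can be pictured as a weighted average of similarity matrices, which would then be passed to a clustering step such as affinity propagation. The sketch below covers only the integration step; the weighting scheme, the weights and the toy matrices are assumptions and are not taken from the SSIM paper.

```python
def integrate_similarities(matrices, weights):
    """Weighted average of equally sized gene-gene similarity matrices."""
    total = sum(weights)
    n = len(matrices[0])
    return [[sum(w * m[i][j] for m, w in zip(matrices, weights)) / total
             for j in range(n)]
            for i in range(n)]


# Hypothetical similarities between two genes from three data sources.
expression = [[1.0, 0.2], [0.2, 1.0]]   # e.g. expression correlation
ppi = [[1.0, 1.0], [1.0, 1.0]]          # 1.0 if the proteins interact
go = [[1.0, 0.6], [0.6, 1.0]]           # GO semantic similarity

# Made-up weights that emphasise the GO annotation.
combined = integrate_similarities([expression, ppi, go], weights=[1, 1, 2])
```

With these toy inputs the combined off-diagonal similarity is (0.2 + 1.0 + 2 × 0.6) / 4 = 0.6.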
MacGilvray, Matthew E; Shishkova, Evgenia; Chasman, Deborah; Place, Michael; Gitter, Anthony; Coon, Joshua J; Gasch, Audrey P
2018-05-01
Cells respond to stressful conditions by coordinating a complex, multi-faceted response that spans many levels of physiology. Much of the response is coordinated by changes in protein phosphorylation. Although the regulators of transcriptome changes during stress are well characterized in Saccharomyces cerevisiae, the upstream regulatory network controlling protein phosphorylation is less well dissected. Here, we developed a computational approach to infer the signaling network that regulates phosphorylation changes in response to salt stress. We developed an approach to link predicted regulators to groups of likely co-regulated phospho-peptides responding to stress, thereby creating new edges in a background protein interaction network. We then use integer linear programming (ILP) to integrate wild type and mutant phospho-proteomic data and predict the network controlling stress-activated phospho-proteomic changes. The network we inferred predicted new regulatory connections between stress-activated and growth-regulating pathways and suggested mechanisms coordinating metabolism, cell-cycle progression, and growth during stress. We confirmed several network predictions with co-immunoprecipitations coupled with mass-spectrometry protein identification and mutant phospho-proteomic analysis. Results show that the cAMP-phosphodiesterase Pde2 physically interacts with many stress-regulated transcription factors targeted by PKA, and that reduced phosphorylation of those factors during stress requires the Rck2 kinase that we show physically interacts with Pde2. Together, our work shows how a high-quality computational network model can facilitate discovery of new pathway interactions during osmotic stress.
Yu, Jia-Lu; Song, Qi-Fang; Xie, Zhi-Wei; Jiang, Wen-Hui; Chen, Jia-Hui; Fan, Hui-Feng; Xie, Ya-Ping; Lu, Gen
2017-09-25
Mycoplasma pneumoniae (MP) is a leading cause of community-acquired pneumonia in children and young adults. Although MP pneumonia is usually benign and self-limited, in some cases it can develop into life-threatening refractory MP pneumonia (RMPP). However, the pathogenesis of RMPP is poorly understood. The identification and characterization of proteins related to RMPP could provide a proof of principle to facilitate appropriate diagnostic and therapeutic strategies for treating patients with MP. In this study, we used a quantitative proteomic technique (iTRAQ) to analyze MP-related proteins in serum samples from 5 patients with RMPP, 5 patients with non-refractory MP pneumonia (NRMPP), and 5 healthy children. Functional classification, sub-cellular localization, and protein interaction network analysis were carried out based on protein annotation through evolutionary relationship (PANTHER) and Cytoscape analysis. A total of 260 differentially expressed proteins were identified in the RMPP and NRMPP groups. Compared to the control group, the NRMPP and RMPP groups showed 134 (70 up-regulated and 64 down-regulated) and 126 (63 up-regulated and 63 down-regulated) differentially expressed proteins, respectively. The complex functional classification and protein interaction network of the identified proteins reflected the complex pathogenesis of RMPP. Our study provides the first comprehensive proteome map of RMPP-related proteins from MP pneumonia. These profiles may be useful as part of a diagnostic panel, and the identified proteins provide new insights into the pathological mechanisms underlying RMPP.
[The Usher Syndrome, a Human Ciliopathy].
Wolfrum, Uwe; Nagel-Wolfrum, Kerstin
2018-03-01
The human Usher syndrome (USH) is a complex, rare disease that is the most common form of inherited deaf-blindness. Due to the heterogeneous manifestation of the clinical symptoms, three clinical types (USH1-3) are distinguished according to the severity of the disease pattern. For a correct diagnosis, in addition to the auditory tests in early newborn screening, ophthalmological examinations and molecular genetic analysis are important. Ten known USH genes encode proteins that belong to heterogeneous protein families and interact in functional protein networks. In the ear and in the eye, USH proteins are expressed primarily in the mechano-sensitive hair cells and the rod and cone photoreceptor cells, respectively. In the hair cells, the USH protein networks are essential for the correct differentiation of the hair bundles as well as for the function of the mechano-electrical transduction complex in the matured cell. In the photoreceptor cells, USH proteins are located in the ciliary region and participate in intracellular transport processes. In addition, a USH protein network is present in the so-called calyceal processes. The lack of calyceal processes and the absence of a prominent visual phenotype in the mouse disqualify mice as models for studies on the ophthalmic component of USH. While hearing impairments can be compensated with hearing aids and cochlear implants, there is no practical therapy for USH in the eye. Currently, gene-based therapy concepts, such as gene addition, applications of antisense oligonucleotides and TRIDs ("translational readthrough inducing drugs") for the readthrough of nonsense mutations, are being evaluated preclinically. For USH1B/MYO7A, the UshStat gene therapy clinical trial is ongoing. Georg Thieme Verlag KG Stuttgart · New York.
The evolution and regulation of the mucosal immune complexity in the basal chordate amphioxus.
Huang, Shengfeng; Wang, Xin; Yan, Qingyu; Guo, Lei; Yuan, Shaochun; Huang, Guangrui; Huang, Huiqing; Li, Jun; Dong, Meiling; Chen, Shangwu; Xu, Anlong
2011-02-15
Both amphioxus and the sea urchin encode a complex innate immune gene repertoire in their genomes, but the composition and mechanisms of their innate immune systems, as well as the fundamental differences between two systems, remain largely unexplored. In this study, we dissect the mucosal immune complexity of amphioxus into different evolutionary-functional modes and regulatory patterns by integrating information from phylogenetic inferences, genome-wide digital expression profiles, time course expression dynamics, and functional analyses. With these rich data, we reconstruct several major immune subsystems in amphioxus and analyze their regulation during mucosal infection. These include the TNF/IL-1R network, TLR and NLR networks, complement system, apoptosis network, oxidative pathways, and other effector genes (e.g., peptidoglycan recognition proteins, Gram-negative binding proteins, and chitin-binding proteins). We show that beneath the superficial similarity to that of the sea urchin, the amphioxus innate system, despite preserving critical invertebrate components, is more similar to that of the vertebrates in terms of composition, expression regulation, and functional strategies. For example, major effectors in amphioxus gut mucous tissue are the well-developed complement and oxidative-burst systems, and the signaling network in amphioxus seems to emphasize signal transduction/modulation more than initiation. In conclusion, we suggest that the innate immune systems of amphioxus and the sea urchin are strategically different, possibly representing two successful cases among many expanded immune systems that arose at the age of the Cambrian explosion. We further suggest that the vertebrate innate immune system should be derived from one of these expanded systems, most likely from the same one that was shared by amphioxus.
Liluashvili, Vaja; Kalayci, Selim; Fluder, Eugene; Wilson, Manda; Gabow, Aaron
2017-01-01
Visualizations of biomolecular networks assist in systems-level data exploration in many cellular processes. Data generated from high-throughput experiments increasingly inform these networks, yet current tools do not adequately scale with concomitant increase in their size and complexity. We present an open source software platform, interactome-CAVE (iCAVE), for visualizing large and complex biomolecular interaction networks in 3D. Users can explore networks (i) in 3D using a desktop, (ii) in stereoscopic 3D using 3D-vision glasses and a desktop, or (iii) in immersive 3D within a CAVE environment. iCAVE introduces 3D extensions of known 2D network layout, clustering, and edge-bundling algorithms, as well as new 3D network layout algorithms. Furthermore, users can simultaneously query several built-in databases within iCAVE for network generation or visualize their own networks (e.g., disease, drug, protein, metabolite). iCAVE has modular structure that allows rapid development by addition of algorithms, datasets, or features without affecting other parts of the code. Overall, iCAVE is the first freely available open source tool that enables 3D (optionally stereoscopic or immersive) visualizations of complex, dense, or multi-layered biomolecular networks. While primarily designed for researchers utilizing biomolecular networks, iCAVE can assist researchers in any field. PMID:28814063
Liluashvili, Vaja; Kalayci, Selim; Fluder, Eugene; Wilson, Manda; Gabow, Aaron; Gümüs, Zeynep H
2017-08-01
Visualizations of biomolecular networks assist in systems-level data exploration in many cellular processes. Data generated from high-throughput experiments increasingly inform these networks, yet current tools do not adequately scale with concomitant increase in their size and complexity. We present an open source software platform, interactome-CAVE (iCAVE), for visualizing large and complex biomolecular interaction networks in 3D. Users can explore networks (i) in 3D using a desktop, (ii) in stereoscopic 3D using 3D-vision glasses and a desktop, or (iii) in immersive 3D within a CAVE environment. iCAVE introduces 3D extensions of known 2D network layout, clustering, and edge-bundling algorithms, as well as new 3D network layout algorithms. Furthermore, users can simultaneously query several built-in databases within iCAVE for network generation or visualize their own networks (e.g., disease, drug, protein, metabolite). iCAVE has modular structure that allows rapid development by addition of algorithms, datasets, or features without affecting other parts of the code. Overall, iCAVE is the first freely available open source tool that enables 3D (optionally stereoscopic or immersive) visualizations of complex, dense, or multi-layered biomolecular networks. While primarily designed for researchers utilizing biomolecular networks, iCAVE can assist researchers in any field. © The Authors 2017. Published by Oxford University Press.
Pan, Xiaoliang; Schwartz, Steven D
2015-04-30
It has long been recognized that the structure of a protein creates a hierarchy of conformations interconverting on multiple time scales. The conformational heterogeneity of the Michaelis complex is of particular interest in the context of enzymatic catalysis in which the reactant is usually represented by a single conformation of the enzyme/substrate complex. Lactate dehydrogenase (LDH) catalyzes the interconversion of pyruvate and lactate with concomitant interconversion of two forms of the cofactor nicotinamide adenine dinucleotide (NADH and NAD(+)). Recent experimental results suggest that multiple substates exist within the Michaelis complex of LDH, and they show a strong variance in their propensity toward the on-enzyme chemical step. In this study, microsecond-scale all-atom molecular dynamics simulations were performed on LDH to explore the free energy landscape of the Michaelis complex, and network analysis was used to characterize the distribution of the conformations. Our results provide a detailed view of the kinetic network of the Michaelis complex and the structures of the substates at atomistic scales. They also shed light on the complete picture of the catalytic mechanism of LDH.
mAKAP – A Master Scaffold for Cardiac Remodeling
Passariello, Catherine L.; Li, Jinliang; Dodge-Kafka, Kimberly; Kapiloff, Michael S.
2014-01-01
Cardiac remodeling is regulated by an extensive intracellular signal transduction network. Each of the many signaling pathways in this network contributes uniquely to the control of cellular adaptation. In the last few years, it has become apparent that multimolecular signaling complexes or ‘signalosomes’ are important for fidelity in intracellular signaling and for mediating crosstalk between the different signaling pathways. These complexes integrate upstream signals and control downstream effectors. In the cardiac myocyte, the protein mAKAPβ serves as a scaffold for a large signalosome that is responsive to cAMP, calcium, hypoxia, and mitogen-activated protein kinase signaling. The main function of mAKAPβ signalosomes is to modulate stress-related gene expression regulated by the transcription factors NFATc, MEF2 and HIF-1α and type II histone deacetylases that control pathological cardiac hypertrophy. PMID:25551320
Liu, Youtao; Lacal, Jesus; Firtel, Richard A; Kortholt, Arjan
2018-07-04
The directional movement toward extracellular chemical gradients, a process called chemotaxis, is an important property of cells. Central to eukaryotic chemotaxis is the molecular mechanism by which chemoattractant-mediated activation of G-protein coupled receptors (GPCRs) induces symmetry breaking in the activated downstream signaling pathways. Studies with mainly Dictyostelium and mammalian neutrophils as experimental systems have shown that chemotaxis is mediated by a complex network of signaling pathways. Recently, several labs have used extensive and efficient proteomic approaches to further unravel this dynamic signaling network. Together these studies showed the critical role of the interplay between heterotrimeric G-protein subunits and monomeric G proteins in regulating cytoskeletal rearrangements during chemotaxis. Here we highlight how these proteomic studies have provided greater insight into the mechanisms by which the heterotrimeric G protein cycle is regulated, how heterotrimeric G proteins-induced symmetry breaking is mediated through small G protein signaling, and how symmetry breaking in G protein signaling subsequently induces cytoskeleton rearrangements and cell migration.
Convergent evolution of gene networks by single-gene duplications in higher eukaryotes.
Amoutzias, Gregory D; Robertson, David L; Oliver, Stephen G; Bornberg-Bauer, Erich
2004-03-01
By combining phylogenetic, proteomic and structural information, we have elucidated the evolutionary driving forces for the gene-regulatory interaction networks of basic helix-loop-helix transcription factors. We infer that recurrent events of single-gene duplication and domain rearrangement repeatedly gave rise to distinct networks with almost identical hub-based topologies, and multiple activators and repressors. We thus provide the first empirical evidence for scale-free protein networks emerging through single-gene duplications, the dominant importance of molecular modularity in the bottom-up construction of complex biological entities, and the convergent evolution of networks.
DNA-Directed Assembly of Capture Tools for Constitutional Studies of Large Protein Complexes.
Meyer, Rebecca; Faesen, Alex; Vogel, Katrin; Jeganathan, Sadasivam; Musacchio, Andrea; Niemeyer, Christof M
2015-06-10
Large supramolecular protein complexes, such as the molecular machinery involved in gene regulation, cell signaling, or cell division, are key in all fundamental processes of life. Detailed elucidation of structure and dynamics of such complexes can be achieved by reverse-engineering parts of the complexes in order to probe their interactions with distinctive binding partners in vitro. The exploitation of DNA nanostructures to mimic partially assembled supramolecular protein complexes in which the presence and state of two or more proteins are decisive for binding of additional building blocks is reported here. To this end, four-way DNA Holliday junction motifs bearing a fluorescein and a biotin tag, for tracking and affinity capture, respectively, are site-specifically functionalized with centromeric protein (CENP) C and CENP-T. The latter serves as baits for binding of the so-called KMN component, thereby mimicking early stages of the assembly of kinetochores, structures that mediate and control the attachment of microtubules to chromosomes in the spindle apparatus. Results from pull-down experiments are consistent with the hypothesis that CENP-C and CENP-T may bind cooperatively to the KMN network. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Proteomic snapshot of the EGF-induced ubiquitin network
Argenzio, Elisabetta; Bange, Tanja; Oldrini, Barbara; Bianchi, Fabrizio; Peesari, Raghunath; Mari, Sara; Di Fiore, Pier Paolo; Mann, Matthias; Polo, Simona
2011-01-01
The activity, localization and fate of many cellular proteins are regulated through ubiquitination, a process whereby one or more ubiquitin (Ub) monomers or chains are covalently attached to target proteins. While Ub-conjugated and Ub-associated proteomes have been described, we lack a high-resolution picture of the dynamics of ubiquitination in response to signaling. In this study, we describe the epidermal growth factor (EGF)-regulated Ubiproteome, as obtained by two complementary purification strategies coupled to quantitative proteomics. Our results unveil the complex impact of growth factor signaling on Ub-based intracellular networks to levels that extend well beyond what might have been expected. In addition to endocytic proteins, the EGF-regulated Ubiproteome includes a large number of signaling proteins, ubiquitinating and deubiquitinating enzymes, transporters and proteins involved in translation and transcription. The Ub-based signaling network appears to intersect both housekeeping and regulatory circuitries of cellular physiology. Finally, as proof of principle of the biological relevance of the EGF-Ubiproteome, we demonstrated that EphA2 is a novel, downstream ubiquitinated target of epidermal growth factor receptor (EGFR), critically involved in EGFR biological responses. PMID:21245847
Text mining for metabolic pathways, signaling cascades, and protein networks.
Hoffmann, Robert; Krallinger, Martin; Andres, Eduardo; Tamames, Javier; Blaschke, Christian; Valencia, Alfonso
2005-05-10
The complexity of the information stored in databases and publications on metabolic and signaling pathways, the high throughput of experimental data, and the growing number of publications make it imperative to provide systems to help the researcher navigate through these interrelated information resources. Text-mining methods have started to play a key role in the creation and maintenance of links between the information stored in biological databases and its original sources in the literature. These links will be extremely useful for database updating and curation, especially if a number of technical problems can be solved satisfactorily, including the identification of protein and gene names (entities in general) and the characterization of their types of interactions. The first generation of openly accessible text-mining systems, such as iHOP (Information Hyperlinked over Proteins), provides additional functions to facilitate the reconstruction of protein interaction networks, combine database and text information, and support the scientist in the formulation of novel hypotheses. The next challenge is the generation of comprehensive information regarding the general function of signaling pathways and protein interaction networks.
Faciobrachial dystonic seizures result from fronto-temporo-basalganglial network involvement.
Iyer, Rajesh Shankar; Ramakrishnan, T C R; Karunakaran; Shinto, Ajit; Kamaleshwaran, Koramadai Karuppuswamy
2017-01-01
• Faciobrachial dystonic seizures (FBDS) are caused by autoantibodies to leucine-rich glioma-inactivated 1 protein, a component of the voltage-gated potassium channel complex (VGKC-complex), and precede the clinical presentation of limbic encephalitis.
• The exact pathophysiology of FBDS is not known, and whether they are seizures or a movement disorder is still debated.
• We suggest that the fronto-temporo-basal ganglia network, involving the medial frontal and temporal regions along with the corpus striatum and substantia nigra, is responsible for the clinical phenomenon of FBDS.
• The varied clinical, electrical and imaging features of FBDS in our cases and in the literature are best explained by involvement of this network.
• Entrainment from any part of this network will result in similar clinical expression of FBDS, whereas other electro-clinical associations and duration depend on the extent of involvement of the network.
Dual Coordination of Post Translational Modifications in Human Protein Networks
Woodsmith, Jonathan; Kamburov, Atanas; Stelzl, Ulrich
2013-01-01
Post-translational modifications (PTMs) regulate protein activity, stability and interaction profiles and are critical for cellular functioning. Further regulation is gained through PTM interplay whereby modifications modulate the occurrence of other PTMs or act in combination. Integration of global acetylation, ubiquitination and tyrosine or serine/threonine phosphorylation datasets with protein interaction data identified hundreds of protein complexes that selectively accumulate each PTM, indicating coordinated targeting of specific molecular functions. A second layer of PTM coordination exists in these complexes, mediated by PTM integration (PTMi) spots. PTMi spots represent very dense modification patterns in disordered protein regions and showed an equally high mutation rate as functional protein domains in cancer, inferring equivocal importance for cellular functioning. Systematic PTMi spot identification highlighted more than 300 candidate proteins for combinatorial PTM regulation. This study reveals two global PTM coordination mechanisms and emphasizes dataset integration as requisite in proteomic PTM studies to better predict modification impact on cellular signaling. PMID:23505349
CBL-CIPK network for calcium signaling in higher plants
NASA Astrophysics Data System (ADS)
Luan, Sheng
Plants sense their environment by signaling mechanisms involving calcium. Calcium signals are encoded by a complex set of parameters and decoded by a large number of proteins including the more recently discovered CBL-CIPK network. The calcium-binding CBL proteins specifically interact with a family of protein kinases, the CIPKs, and regulate the activity and subcellular localization of these kinases, leading to the modification of kinase substrates. This represents a paradigm shift as compared to a calcium signaling mechanism from yeast and animals. One example of CBL-CIPK signaling pathways is the low-potassium response of Arabidopsis roots. When grown in low-K medium, plants develop stronger K-uptake capacity adapting to the low-K condition. Recent studies show that the increased K-uptake is caused by activation of a specific K-channel by the CBL-CIPK network. A working model for this regulatory pathway will be discussed in the context of calcium coding and decoding processes.
Category Theoretic Analysis of Hierarchical Protein Materials and Social Networks
Spivak, David I.; Giesa, Tristan; Wood, Elizabeth; Buehler, Markus J.
2011-01-01
Materials in biology span all the scales from Angstroms to meters and typically consist of complex hierarchical assemblies of simple building blocks. Here we describe an application of category theory to describe structural and resulting functional properties of biological protein materials by developing so-called ologs. An olog is like a “concept web” or “semantic network” except that it follows a rigorous mathematical formulation based on category theory. This key difference ensures that an olog is unambiguous, highly adaptable to evolution and change, and suitable for sharing concepts with other olog. We consider simple cases of beta-helical and amyloid-like protein filaments subjected to axial extension and develop an olog representation of their structural and resulting mechanical properties. We also construct a representation of a social network in which people send text-messages to their nearest neighbors and act as a team to perform a task. We show that the olog for the protein and the olog for the social network feature identical category-theoretic representations, and we proceed to precisely explicate the analogy or isomorphism between them. The examples presented here demonstrate that the intrinsic nature of a complex system, which in particular includes a precise relationship between structure and function at different hierarchical levels, can be effectively represented by an olog. This, in turn, allows for comparative studies between disparate materials or fields of application, and results in novel approaches to derive functionality in the design of de novo hierarchical systems. We discuss opportunities and challenges associated with the description of complex biological materials by using ologs as a powerful tool for analysis and design in the context of materiomics, and we present the potential impact of this approach for engineering, life sciences, and medicine. PMID:21931622
Targeted interactomics reveals a complex core cell cycle machinery in Arabidopsis thaliana.
Van Leene, Jelle; Hollunder, Jens; Eeckhout, Dominique; Persiau, Geert; Van De Slijke, Eveline; Stals, Hilde; Van Isterdael, Gert; Verkest, Aurine; Neirynck, Sandy; Buffel, Yelle; De Bodt, Stefanie; Maere, Steven; Laukens, Kris; Pharazyn, Anne; Ferreira, Paulo C G; Eloy, Nubia; Renne, Charlotte; Meyer, Christian; Faure, Jean-Denis; Steinbrenner, Jens; Beynon, Jim; Larkin, John C; Van de Peer, Yves; Hilson, Pierre; Kuiper, Martin; De Veylder, Lieven; Van Onckelen, Harry; Inzé, Dirk; Witters, Erwin; De Jaeger, Geert
2010-08-10
Cell proliferation is the main driving force for plant growth. Although genome sequence analysis revealed a high number of cell cycle genes in plants, little is known about the molecular complexes steering cell division. In a targeted proteomics approach, we mapped the core complex machinery at the heart of the Arabidopsis thaliana cell cycle control. Besides a central regulatory network of core complexes, we distinguished a peripheral network that links the core machinery to up- and downstream pathways. Over 100 new candidate cell cycle proteins were predicted and an in-depth biological interpretation demonstrated the hypothesis-generating power of the interaction data. The data set provided a comprehensive view on heterodimeric cyclin-dependent kinase (CDK)-cyclin complexes in plants. For the first time, inhibitory proteins of plant-specific B-type CDKs were discovered and the anaphase-promoting complex was characterized and extended. Important conclusions were that mitotic A- and B-type cyclins form complexes with the plant-specific B-type CDKs and not with CDKA;1, and that D-type cyclins and S-phase-specific A-type cyclins seem to be associated exclusively with CDKA;1. Furthermore, we could show that plants have evolved a combinatorial toolkit consisting of at least 92 different CDK-cyclin complex variants, which strongly underscores the functional diversification among the large family of cyclins and reflects the pivotal role of cell cycle regulation in the developmental plasticity of plants.
Identifying proteins that bind to specific RNAs - focus on simple repeat expansion diseases
Jazurek, Magdalena; Ciesiolka, Adam; Starega-Roslan, Julia; Bilinska, Katarzyna; Krzyzosiak, Wlodzimierz J.
2016-01-01
RNA–protein complexes play a central role in the regulation of fundamental cellular processes, such as mRNA splicing, localization, translation and degradation. The misregulation of these interactions can cause a variety of human diseases, including cancer and neurodegenerative disorders. Recently, many strategies have been developed to comprehensively analyze these complex and highly dynamic RNA–protein networks. Extensive efforts have been made to purify in vivo-assembled RNA–protein complexes. In this review, we focused on commonly used RNA-centric approaches that involve mass spectrometry, which are powerful tools for identifying proteins bound to a given RNA. We present various RNA capture strategies that primarily depend on whether the RNA of interest is modified. Moreover, we briefly discuss the advantages and limitations of in vitro and in vivo approaches. Furthermore, we describe recent advances in quantitative proteomics as well as the methods that are most commonly used to validate robust mass spectrometry data. Finally, we present approaches that have successfully identified expanded repeat-binding proteins, which present abnormal RNA–protein interactions that result in the development of many neurological diseases. PMID:27625393
Narimani, Zahra; Beigy, Hamid; Ahmad, Ashar; Masoudi-Nejad, Ali; Fröhlich, Holger
2017-01-01
Inferring the structure of molecular networks from time series protein or gene expression data provides valuable information about the complex biological processes of the cell. Causal network structure inference has been approached using different methods in the past. Most causal network inference techniques, such as Dynamic Bayesian Networks and ordinary differential equations, are limited by their computational complexity and thus make large scale inference infeasible. This is specifically true if a Bayesian framework is applied in order to deal with the unavoidable uncertainty about the correct model. We devise a novel Bayesian network reverse engineering approach using ordinary differential equations with the ability to include non-linearity. Besides modeling arbitrary, possibly combinatorial and time dependent perturbations with unknown targets, one of our main contributions is the use of Expectation Propagation, an algorithm for approximate Bayesian inference over large scale network structures in short computation time. We further explore the possibility of integrating prior knowledge into network inference. We evaluate the proposed model on DREAM4 and DREAM8 data and find it competitive against several state-of-the-art existing network inference methods.
Mirzarezaee, Mitra; Araabi, Babak N; Sadeghi, Mehdi
2010-12-19
It has been understood that biological networks have modular organizations which are the sources of their observed complexity. Analysis of networks and motifs has shown that two types of hubs, party hubs and date hubs, are responsible for this complexity. Party hubs are local coordinators because of their high co-expressions with their partners, whereas date hubs display low co-expressions and are assumed to be global connectors. However, there is no general agreement on these concepts in the related literature, with different studies reporting their results on different data sets. We investigated whether there is a relation between the biological features of Saccharomyces cerevisiae proteins and their roles as non-hubs, intermediately connected, party hubs, and date hubs. We propose a classifier that separates these four classes. We extracted different biological characteristics including amino acid sequences, domain contents, repeated domains, functional categories, biological processes, cellular compartments, disordered regions, and position specific scoring matrix from various sources. Several classifiers are examined and the best feature-sets based on average correct classification rate and correlation coefficients of the results are selected. We show that fusion of five feature-sets including domains, Position Specific Scoring Matrix-400, cellular compartments level one, and composition pairs with two and one gaps provides the best discrimination with an average correct classification rate of 77%. We study a variety of known biological feature-sets of the proteins and show that there is a relation between domains, Position Specific Scoring Matrix-400, cellular compartments level one, composition pairs with two and one gaps of Saccharomyces cerevisiae proteins, and their roles in the protein interaction network as non-hubs, intermediately connected, party hubs and date hubs.
This study also confirms the possibility of predicting non-hubs, party hubs and date hubs based on their biological features with acceptable accuracy. If such a hypothesis is correct for other species as well, similar methods can be applied to predict the roles of proteins in those species.
Bioinformatics analysis on molecular mechanism of rheum officinale in treatment of jaundice
NASA Astrophysics Data System (ADS)
Shan, Si; Tu, Jun; Nie, Peng; Yan, Xiaojun
2017-01-01
Objective: To study the molecular mechanism of Rheum officinale in the treatment of jaundice by building molecular networks and comparing canonical pathways. Methods: Target proteins of Rheum officinale and jaundice-related genes were retrieved online from the PubChem and Gene databases, respectively. Molecular network and canonical pathway comparison analyses were performed with Ingenuity Pathway Analysis (IPA). Results: The molecular networks of Rheum officinale and jaundice were complex and multifunctional. In total, 40 target proteins of Rheum officinale and 33 Homo sapiens genes related to jaundice were found in the databases. Nineteen pathways were common to both networks. Rheum officinale could regulate endothelial differentiation, Interleukin-1B (IL-1B) and Tumor Necrosis Factor (TNF) in these pathways. Conclusions: Rheum officinale treats jaundice by regulating many effective nodes of the apoptotic pathway and of pathways related to cellular immunity.
The driving regulators of the connectivity protein network of brain malignancies
NASA Astrophysics Data System (ADS)
Tahmassebi, Amirhessam; Pinker-Domenig, Katja; Wengert, Georg; Lobbes, Marc; Stadlbauer, Andreas; Wildburger, Norelle C.; Romero, Francisco J.; Morales, Diego P.; Castillo, Encarnacion; Garcia, Antonio; Botella, Guillermo; Meyer-Bäse, Anke
2017-05-01
An important problem in modern therapeutics at the proteomic level remains the identification of therapeutic targets in a plentitude of high-throughput data from experiments relevant to a variety of diseases. This paper presents the application of novel modern control concepts, such as pinning controllability and observability, to the glioma cancer stem cells (GSCs) protein graph network with known and novel association to glioblastoma (GBM). The theoretical framework provides us with the minimal number of "driver nodes", and their locations, needed to gain full control over the obtained graph network in order to change the network's dynamics from an initial state (disease) to a desired state (non-disease). The achieved results will provide biochemists with techniques to identify more metabolic regions and biological pathways for complex diseases, and to design and test novel therapeutic solutions.
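The idea of a minimal set of "driver nodes" can be illustrated with a small Python sketch. The abstract does not describe the authors' computation, so this example swaps in a different, well-known criterion: the maximum-matching rule from structural controllability of directed networks, under which every node left unmatched by a maximum matching must be driven directly. The function name and the toy node labels are hypothetical.

```python
def min_driver_nodes(nodes, edges):
    """Return one minimum driver-node set of a directed graph.

    Structural-controllability rule (an assumption, not the paper's
    pinning-control method): compute a maximum matching between the
    'source' and 'target' copies of the nodes; every node whose target
    copy stays unmatched needs its own input signal.
    """
    succ = {u: [] for u in nodes}
    for u, v in edges:
        succ[u].append(v)
    matched_by = {}  # target node -> source node matched to it

    def augment(u, seen):
        # Kuhn's augmenting-path search starting from source node u.
        for v in succ[u]:
            if v in seen:
                continue
            seen.add(v)
            if v not in matched_by or augment(matched_by[v], seen):
                matched_by[v] = u
                return True
        return False

    for u in nodes:
        augment(u, set())
    drivers = [v for v in nodes if v not in matched_by]
    # A perfectly matched network still needs at least one input.
    return drivers or [nodes[0]]


if __name__ == "__main__":
    # Toy hub-and-spoke network with made-up protein labels.
    toy_nodes = ["hub", "p1", "p2", "p3"]
    toy_edges = [("hub", "p1"), ("hub", "p2"), ("hub", "p3")]
    print(min_driver_nodes(toy_nodes, toy_edges))
```

The returned set is one of possibly several minimum sets; which nodes appear depends on the order in which the matching is built. The recursive search is adequate for small graphs, while a network of realistic size would call for an iterative matching algorithm.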
Metabolic networks in motion: 13C-based flux analysis
Sauer, Uwe
2006-01-01
Many properties of complex networks cannot be understood from monitoring the components—not even when comprehensively monitoring all protein or metabolite concentrations—unless such information is connected and integrated through mathematical models. The reason is that static component concentrations, albeit extremely informative, do not contain functional information per se. The functional behavior of a network emerges only through the nonlinear gene, protein, and metabolite interactions across multiple metabolic and regulatory layers. I argue here that intracellular reaction rates are the functional end points of these interactions in metabolic networks, hence are highly relevant for systems biology. Methods for experimental determination of metabolic fluxes differ fundamentally from component concentration measurements; that is, intracellular reaction rates cannot be detected directly, but must be estimated through computer model-based interpretation of stable isotope patterns in products of metabolism. PMID:17102807
Statistical Analysis of Big Data on Pharmacogenomics
Fan, Jianqing; Liu, Han
2013-01-01
This paper discusses statistical methods for estimating complex correlation structure from large pharmacogenomic datasets. We selectively review several prominent statistical methods for estimating large covariance matrix for understanding correlation structure, inverse covariance matrix for network modeling, large-scale simultaneous tests for selecting significantly differently expressed genes and proteins and genetic markers for complex diseases, and high dimensional variable selection for identifying important molecules for understanding molecule mechanisms in pharmacogenomics. Their applications to gene network estimation and biomarker selection are used to illustrate the methodological power. Several new challenges of Big data analysis, including complex data distribution, missing data, measurement error, spurious correlation, endogeneity, and the need for robust statistical methods, are also discussed. PMID:23602905
Straube, Ronny
2017-12-01
Much of the complexity of regulatory networks derives from the necessity to integrate multiple signals and to avoid malfunction due to cross-talk or harmful perturbations. Hence, one may expect that the input-output behavior of larger networks is not necessarily more complex than that of smaller network motifs which suggests that both can, under certain conditions, be described by similar equations. In this review, we illustrate this approach by discussing the similarities that exist in the steady state descriptions of a simple bimolecular reaction, covalent modification cycles and bacterial two-component systems. Interestingly, in all three systems fundamental input-output characteristics such as thresholds, ultrasensitivity or concentration robustness are described by structurally similar equations. Depending on the system the meaning of the parameters can differ ranging from protein concentrations and affinity constants to complex parameter combinations which allows for a quantitative understanding of signal integration in these systems. We argue that this approach may also be extended to larger regulatory networks. Copyright © 2017 Elsevier B.V. All rights reserved.
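As a minimal illustration of the simplest system named in this abstract, consider a reversible bimolecular reaction A + B ⇌ C with dissociation constant K_d and conserved totals A_T and B_T. The abstract itself gives no equations; the quadratic solved below is the standard mass-action result, and the function name and parameter values are assumptions chosen for illustration.

```python
import math


def complex_concentration(a_total, b_total, kd):
    """Steady-state [C] for A + B <-> C under mass action.

    Solves (a_total - c) * (b_total - c) = kd * c and returns the
    physically meaningful root, 0 <= c <= min(a_total, b_total).
    """
    s = a_total + b_total + kd
    return (s - math.sqrt(s * s - 4.0 * a_total * b_total)) / 2.0


if __name__ == "__main__":
    b_total, kd = 1.0, 0.001  # tight binding: kd much smaller than b_total
    for a_total in (0.5, 1.0, 1.5, 2.0):
        c = complex_concentration(a_total, b_total, kd)
        print(f"A_T={a_total:.1f}  C={c:.4f}  free A={a_total - c:.4f}")
```

In the tight-binding regime the free concentration of A stays close to zero until A_T exceeds B_T and then rises almost linearly, which is the kind of threshold behavior the review attributes to such systems.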
Brorsson, C.; Hansen, N. T.; Lage, K.; Bergholdt, R.; Brunak, S.; Pociot, F.
2009-01-01
Aim: To develop novel methods for identifying new genes that contribute to the risk of developing type 1 diabetes within the Major Histocompatibility Complex (MHC) region on chromosome 6, independently of the known linkage disequilibrium (LD) between human leucocyte antigen (HLA)-DRB1, -DQA1, -DQB1 genes. Methods: We have developed a novel method that combines single nucleotide polymorphism (SNP) genotyping data with protein–protein interaction (ppi) networks to identify disease-associated network modules enriched for proteins encoded from the MHC region. Approximately 2500 SNPs located in the 4 Mb MHC region were analysed in 1000 affected offspring trios generated by the Type 1 Diabetes Genetics Consortium (T1DGC). The most associated SNP in each gene was chosen and genes were mapped to ppi networks for identification of interaction partners. The association testing and resulting interacting protein modules were statistically evaluated using permutation. Results: A total of 151 genes could be mapped to nodes within the protein interaction network and their interaction partners were identified. Five protein interaction modules reached statistical significance using this approach. The identified proteins are well known in the pathogenesis of T1D, but the modules also contain additional candidates that have been implicated in β-cell development and diabetic complications. Conclusions: The extensive LD within the MHC region makes it important to develop new methods for analysing genotyping data for identification of additional risk genes for T1D. Combining genetic data with knowledge about functional pathways provides new insight into mechanisms underlying T1D. PMID:19143816
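The permutation-based evaluation of modules mentioned in this abstract can be sketched generically as follows. The abstract does not specify the authors' null model or test statistic, so this Python example assumes a simple one: the mean per-gene association score of a module is compared against gene sets of the same size drawn uniformly at random. The function name, the score dictionary and the gene labels are all hypothetical.

```python
import random


def permutation_p_value(module, scores, n_perm=10000, seed=0):
    """Empirical p-value for the mean score of `module`.

    Null model (an assumption): gene sets of the same size sampled
    uniformly at random, without replacement, from all scored genes.
    """
    rng = random.Random(seed)
    genes = list(scores)
    k = len(module)
    observed = sum(scores[g] for g in module) / k
    hits = 0
    for _ in range(n_perm):
        sample = rng.sample(genes, k)
        if sum(scores[g] for g in sample) / k >= observed:
            hits += 1
    # Add-one correction keeps the estimate away from exactly zero.
    return (hits + 1) / (n_perm + 1)


if __name__ == "__main__":
    # Made-up scores: gene i gets score i, so the last genes score highest.
    toy_scores = {f"g{i}": float(i) for i in range(20)}
    print(permutation_p_value(["g17", "g18", "g19"], toy_scores, n_perm=2000))
```

A null model for real interaction data would usually also preserve network properties such as node degree; uniform sampling is used here only to keep the sketch short.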
Computational gene network study on antibiotic resistance genes of Acinetobacter baumannii.
Anitha, P; Anbarasu, Anand; Ramaiah, Sudha
2014-05-01
Multi Drug Resistance (MDR) in Acinetobacter baumannii is one of the major threats for emerging nosocomial infections in hospital environment. Multidrug-resistance in A. baumannii may be due to the implementation of multi-combination resistance mechanisms such as β-lactamase synthesis, Penicillin-Binding Proteins (PBPs) changes, alteration in porin proteins and in efflux pumps against various existing classes of antibiotics. Multiple antibiotic resistance genes are involved in MDR. These resistance genes are transferred through plasmids, which are responsible for the dissemination of antibiotic resistance among Acinetobacter spp. In addition, these resistance genes may also have a tendency to interact with each other or with their gene products. Therefore, it becomes necessary to understand the impact of these interactions in antibiotic resistance mechanism. Hence, our study focuses on protein and gene network analysis on various resistance genes, to elucidate the role of the interacting proteins and to study their functional contribution towards antibiotic resistance. From the search tool for the retrieval of interacting gene/protein (STRING), a total of 168 functional partners for 15 resistance genes were extracted based on the confidence scoring system. The network study was then followed up with functional clustering of associated partners using molecular complex detection (MCODE). Later, we selected eight efficient clusters based on score. Interestingly, the associated protein we identified from the network possessed greater functional similarity with known resistance genes. This network-based approach on resistance genes of A. baumannii could help in identifying new genes/proteins and provide clues on their association in antibiotic resistance. Copyright © 2014 Elsevier Ltd. All rights reserved.
Force Exertion and Transmission in Cross-Linked Actin Networks
NASA Astrophysics Data System (ADS)
Stam, Samantha
Cells are responsive to external cues in their environment telling them to proliferate or migrate within their surrounding tissue. Sensing of cues that are mechanical in nature, such as the stiffness of a tissue or forces transmitted from other cells, is believed to involve the cytoskeleton of a cell. The cytoskeleton is a complex network of proteins consisting of polymers that provide structural support, motor proteins that remodel these structures, and many others. We do not yet have a complete understanding of how cytoskeletal components respond to either internal or external mechanical force and stiffness. Such an understanding should involve mechanisms by which constituent molecules, such as motor proteins, are responsive to mechanics. Additionally, physical models of how forces are transmitted through biopolymer networks are necessary. My research has focused on networks formed by the cytoskeletal filament actin and the molecular motor protein myosin II. Actin filaments form networks and bundles that form a structural framework of the cell, and myosin II slides actin filaments. In this thesis, we first show that the stiffness of an elastic load that opposes myosin-generated actin sliding has a very sharp effect on the myosin force output in simulations. Second, we show that the stiffness and connectivity of cytoskeletal filaments regulate the contractility and anisotropy of network deformations that transmit force on material length scales. Together, these results have implications for predicting and interpreting the deformations and forces in biopolymeric active materials.
Millius, Arthur; Watanabe, Naoki; Weiner, Orion D
2012-03-01
The SCAR/WAVE complex drives lamellipodium formation by enhancing actin nucleation by the Arp2/3 complex. Phosphoinositides and Rac activate the SCAR/WAVE complex, but how SCAR/WAVE and Arp2/3 complexes converge at sites of nucleation is unknown. We analyzed the single-molecule dynamics of WAVE2 and p40 (subunits of the SCAR/WAVE and Arp2/3 complexes, respectively) in XTC cells. We observed lateral diffusion of both proteins and captured the transition of p40 from diffusion to network incorporation. These results suggest that a diffusive 2D search facilitates binding of the Arp2/3 complex to actin filaments necessary for nucleation. After nucleation, the Arp2/3 complex integrates into the actin network and undergoes retrograde flow, which results in its broad distribution throughout the lamellipodium. By contrast, the SCAR/WAVE complex is more restricted to the cell periphery. However, with single-molecule imaging, we also observed WAVE2 molecules undergoing retrograde motion. WAVE2 and p40 have nearly identical speeds, lifetimes and sites of network incorporation. Inhibition of actin retrograde flow does not prevent WAVE2 association and disassociation with the membrane but does inhibit WAVE2 removal from the actin cortex. Our results suggest that membrane binding and diffusion expedites the recruitment of nucleation factors to a nucleation site independent of actin assembly, but after network incorporation, ongoing actin polymerization facilitates recycling of SCAR/WAVE and Arp2/3 complexes. PMID:22349699
Tutorial on Protein Ontology Resources
Arighi, Cecilia; Drabkin, Harold; Christie, Karen R.; Ross, Karen; Natale, Darren
2017-01-01
The Protein Ontology (PRO) is the reference ontology for proteins in the Open Biomedical Ontologies (OBO) foundry and consists of three sub-ontologies representing protein classes of homologous genes, proteoforms (e.g., splice isoforms, sequence variants, and post-translationally modified forms), and protein complexes. PRO defines classes of proteins and protein complexes, both species-specific and species non-specific, and indicates their relationships in a hierarchical framework, supporting accurate protein annotation at the appropriate level of granularity, analyses of protein conservation across species, and semantic reasoning. In the first section of this chapter, we describe the PRO framework including categories of PRO terms and the relationship of PRO to other ontologies and protein resources. Next, we provide a tutorial about the PRO website (proconsortium.org) where users can browse and search the PRO hierarchy, view reports on individual PRO terms, and visualize relationships among PRO terms in a hierarchical table view, a multiple sequence alignment view, and a Cytoscape network view. Finally, we describe several examples illustrating the unique and rich information available in PRO. PMID:28150233
Network-based prediction and knowledge mining of disease genes
2015-01-01
Background: In recent years, high-throughput protein interaction identification methods have generated a large amount of data. When combined with the results from other in vivo and in vitro experiments, a complex set of relationships between biological molecules emerges. The growing popularity of network analysis and data mining has allowed researchers to recognize indirect connections between these molecules. Due to the interdependent nature of network entities, evaluating proteins in this context can reveal relationships that may not otherwise be evident. Methods: We examined the human protein interaction network as it relates to human illness using the Disease Ontology. After calculating several topological metrics, we trained an alternating decision tree (ADTree) classifier to identify disease-associated proteins. Using a bootstrapping method, we created a tree to highlight conserved characteristics shared by many of these proteins. Subsequently, we reviewed a set of non-disease-associated proteins that were misclassified by the algorithm with high confidence and searched for evidence of a disease relationship. Results: Our classifier was able to predict disease-related genes with 79% area under the receiver operating characteristic (ROC) curve (AUC), which indicates the tradeoff between sensitivity and specificity and is a good predictor of how a classifier will perform on future data sets. We found that a combination of several network characteristics including degree centrality, disease neighbor ratio, eccentricity, and neighborhood connectivity help to distinguish between disease- and non-disease-related proteins. Furthermore, the ADTree allowed us to understand which combinations of strongly predictive attributes contributed most to protein-disease classification. In our post-processing evaluation, we found several examples of potential novel disease-related proteins and corresponding literature evidence. In addition, we showed that first- and second-order neighbors in the PPI network could be used to identify likely disease associations. Conclusions: We analyzed the human protein interaction network and its relationship to disease and found that both the number of interactions with other proteins and the disease relationship of neighboring proteins helped to determine whether a protein had a relationship to disease. Our classifier predicted many proteins with no annotated disease association to be disease-related, which indicated that these proteins have network characteristics that are similar to disease-related proteins and may therefore have disease associations not previously identified. By performing a post-processing step after the prediction, we were able to identify evidence in literature supporting this possibility. This method could provide a useful filter for experimentalists searching for new candidate protein targets for drug repositioning and could also be extended to include other network and data types in order to refine these predictions. PMID:26043920
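Two of the network characteristics named in this abstract, degree centrality and disease neighbor ratio, can be illustrated with a minimal sketch. The abstract does not define the disease neighbor ratio, so the definition used here (the fraction of a protein's direct neighbors that are annotated as disease-associated) is an assumption, and the toy edge list and disease set are hypothetical placeholders rather than data from the study.

```python
from collections import defaultdict

def build_adjacency(edges):
    """Build an undirected adjacency map from (a, b) pairs (no self-loops assumed)."""
    adj = defaultdict(set)
    for a, b in edges:
        adj[a].add(b)
        adj[b].add(a)
    return adj

def degree_centrality(adj):
    """Degree of each node divided by the maximum possible degree (n - 1)."""
    n = len(adj)
    return {node: len(nbrs) / (n - 1) for node, nbrs in adj.items()}

def disease_neighbor_ratio(adj, disease):
    """Assumed definition: share of direct neighbors annotated as disease-associated."""
    return {
        node: sum(1 for v in nbrs if v in disease) / len(nbrs)
        for node, nbrs in adj.items()
    }

# Hypothetical toy network; node names are placeholders, not real proteins.
toy_edges = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "C"), ("D", "E")]
toy_disease = {"B", "D"}

adj = build_adjacency(toy_edges)
centrality = degree_centrality(adj)
ratio = disease_neighbor_ratio(adj, toy_disease)
```

Features of this kind would then be fed to a classifier; the ADTree training itself is not sketched here.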
Blinov, Michael L.; Moraru, Ion I.
2011-01-01
Multi-state molecules and multi-component complexes are commonly involved in cellular signaling. Accounting for molecules that have multiple potential states, such as a protein that may be phosphorylated on multiple residues, and molecules that combine to form heterogeneous complexes located among multiple compartments, generates an effect of combinatorial complexity. Models involving relatively few signaling molecules can include thousands of distinct chemical species. Several software tools (StochSim, BioNetGen) are already available to deal with combinatorial complexity. Such tools need information standards if models are to be shared, jointly evaluated and developed. Here we discuss XML conventions that can be adopted for modeling biochemical reaction networks described by user-specified reaction rules. These could form a basis for possible future extensions of the Systems Biology Markup Language (SBML). PMID:21464833
Genetics Home Reference: familial porencephaly
... one component of a protein called type IV collagen. Type IV collagen molecules attach to each other to form complex ... and support cells in many tissues. Type IV collagen networks play an important role in the basement ...
Complex systems in metabolic engineering.
Winkler, James D; Erickson, Keesha; Choudhury, Alaksh; Halweg-Edwards, Andrea L; Gill, Ryan T
2015-12-01
Metabolic engineers manipulate intricate biological networks to build efficient biological machines. The inherent complexity of this task, derived from the extensive and often unknown interconnectivity between and within these networks, often prevents researchers from achieving desired performance. Other fields have developed methods to tackle the issue of complexity for their unique subset of engineering problems, but to date, there has not been extensive and comprehensive examination of how metabolic engineers use existing tools to ameliorate this effect on their own research projects. In this review, we examine how complexity affects engineering at the protein, pathway, and genome levels within an organism, and the tools for handling these issues to achieve high-performing strain designs. Quantitative complexity metrics and their applications to metabolic engineering versus traditional engineering fields are also discussed. We conclude by predicting how metabolic engineering practices may advance in light of an explicit consideration of design complexity. Copyright © 2015 Elsevier Ltd. All rights reserved.
Le, Nguyen-Quoc-Khanh; Nguyen, Trinh-Trung-Duong; Ou, Yu-Yen
2017-05-01
The electron transport proteins have an important role in storing and transferring electrons in cellular respiration, which is the most proficient process through which cells gather energy from consumed food. According to their molecular functions, the electron transport chain components can be grouped into five complexes with several different electron carriers and functions. Therefore, identifying the molecular functions in the electron transport chain is vital for helping biologists understand the electron transport chain process and energy production in cells. This work includes two phases: discriminating electron transport proteins from transport proteins, and classifying electron transport proteins into the five complexes. In the first phase, the PSSM with AAIndex feature set successfully identified electron transport proteins among transport proteins, achieving a sensitivity of 73.2%, specificity of 94.1%, accuracy of 91.3%, and MCC of 0.64 on the independent data set. In the second phase, our method yields a precise model for identifying the five complexes with different molecular functions in electron transport proteins. The PSSM with AAIndex properties achieved MCC of 0.51, 0.47, 0.42, 0.74, and 1.00 for the five complexes, respectively, on the independent data set. We suggest that our study could be a powerful model for determining which molecular function of electron transport proteins a new protein belongs to. Copyright © 2017 Elsevier Inc. All rights reserved.
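The performance measures quoted in this abstract (sensitivity, specificity, accuracy, and the Matthews correlation coefficient, MCC) all derive from a binary confusion matrix. The sketch below shows the standard formulas; the counts passed in are hypothetical and are not taken from the study.

```python
import math

def classification_metrics(tp, fp, tn, fn):
    """Return sensitivity, specificity, accuracy and MCC from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)          # true positive rate
    specificity = tn / (tn + fp)          # true negative rate
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention MCC is reported as 0 when any marginal total is zero.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "mcc": mcc,
    }

# Hypothetical counts chosen only to make the arithmetic easy to follow.
metrics = classification_metrics(tp=40, fp=10, tn=90, fn=10)
perfect = classification_metrics(tp=5, fp=0, tn=5, fn=0)
```

An MCC of 1.00, as reported for one of the five complexes, corresponds to a confusion matrix with no false positives and no false negatives.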
Barouch-Bentov, Rina; Neveu, Gregory; Xiao, Fei; Beer, Melanie; Bekerman, Elena; Schor, Stanford; Campbell, Joseph; Boonyaratanakornkit, Jim; Lindenbach, Brett; Lu, Albert; Jacob, Yves
2016-01-01
Enveloped viruses commonly utilize late-domain motifs, sometimes cooperatively with ubiquitin, to hijack the endosomal sorting complex required for transport (ESCRT) machinery for budding at the plasma membrane. However, the mechanisms underlying budding of viruses lacking defined late-domain motifs and budding into intracellular compartments are poorly characterized. Here, we map a network of hepatitis C virus (HCV) protein interactions with the ESCRT machinery using a mammalian-cell-based protein interaction screen and reveal nine novel interactions. We identify HRS (hepatocyte growth factor-regulated tyrosine kinase substrate), an ESCRT-0 complex component, as an important entry point for HCV into the ESCRT pathway and validate its interactions with the HCV nonstructural (NS) proteins NS2 and NS5A in HCV-infected cells. Infectivity assays indicate that HRS is an important factor for efficient HCV assembly. Specifically, by integrating capsid oligomerization assays, biophysical analysis of intracellular viral particles by continuous gradient centrifugations, proteolytic digestion protection, and RNase digestion protection assays, we show that HCV co-opts HRS to mediate a late assembly step, namely, envelopment. In the absence of defined late-domain motifs, K63-linked polyubiquitinated lysine residues in the HCV NS2 protein bind the HRS ubiquitin-interacting motif to facilitate assembly. Finally, ESCRT-III and VPS/VTA1 components are also recruited by HCV proteins to mediate assembly. These data uncover involvement of ESCRT proteins in intracellular budding of a virus lacking defined late-domain motifs and a novel mechanism by which HCV gains entry into the ESCRT network, with potential implications for other viruses. PMID:27803188
Glutathione-complexed [2Fe-2S] clusters function in Fe-S cluster storage and trafficking.
Fidai, Insiya; Wachnowsky, Christine; Cowan, J A
2016-10-01
Glutathione-coordinated [2Fe-2S] complex is a non-protein-bound [2Fe-2S] cluster that is capable of reconstituting the human iron-sulfur cluster scaffold protein IscU. This complex demonstrates physiologically relevant solution chemistry and is a viable substrate for iron-sulfur cluster transport by Atm1p exporter protein. Herein, we report on some of the possible functional and physiological roles for this novel [2Fe-2S](GS4) complex in iron-sulfur cluster biosynthesis and quantitatively characterize its role in the broader network of Fe-S cluster transfer reactions. UV-vis and circular dichroism spectroscopy have been used in kinetic studies to determine second-order rate constants for [2Fe-2S] cluster transfer from [2Fe-2S](GS4) complex to acceptor proteins, such as human IscU, Schizosaccharomyces pombe Isa1, human and yeast glutaredoxins (human Grx2 and Saccharomyces cerevisiae Grx3), and human ferredoxins. Second-order rate constants for cluster extraction from these holo proteins were also determined by varying the concentration of glutathione, and a likely common mechanism for cluster uptake was determined by kinetic analysis. The results indicate that the [2Fe-2S](GS4) complex is stable under physiological conditions, and demonstrates reversible cluster exchange with a wide range of Fe-S cluster proteins, thereby supporting a possible physiological role for such centers.
The RING 2.0 web server for high quality residue interaction networks.
Piovesan, Damiano; Minervini, Giovanni; Tosatto, Silvio C E
2016-07-08
Residue interaction networks (RINs) are an alternative way of representing protein structures where nodes are residues and arcs physico-chemical interactions. RINs have been extensively and successfully used for analysing mutation effects, protein folding, domain-domain communication and catalytic activity. Here we present RING 2.0, a new version of the RING software for the identification of covalent and non-covalent bonds in protein structures, including π-π stacking and π-cation interactions. RING 2.0 is extremely fast and generates both intra and inter-chain interactions including solvent and ligand atoms. The generated networks are very accurate and reliable thanks to a complex empirical re-parameterization of distance thresholds performed on the entire Protein Data Bank. By default, RING output is generated with optimal parameters but the web server provides an exhaustive interface to customize the calculation. The network can be visualized directly in the browser or in Cytoscape. Alternatively, the RING-Viz script for Pymol allows visualizing the interactions at atomic level in the structure. The web server and RING-Viz, together with an extensive help and tutorial, are available from URL: http://protein.bio.unipd.it/ring. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Cannistraci, Carlo Vittorio; Alanis-Lobato, Gregorio; Ravasi, Timothy
2013-01-01
Growth and remodelling impact the network topology of complex systems, yet a general theory explaining how new links arise between existing nodes has been lacking, and little is known about the topological properties that facilitate link-prediction. Here we investigate the extent to which the connectivity evolution of a network might be predicted by mere topological features. We show how a link/community-based strategy triggers substantial prediction improvements because it accounts for the singular topology of several real networks organised in multiple local communities - a tendency here named local-community-paradigm (LCP). We observe that LCP networks are mainly formed by weak interactions and characterise heterogeneous and dynamic systems that use self-organisation as a major adaptation strategy. These systems seem designed for global delivery of information and processing via multiple local modules. Conversely, non-LCP networks have steady architectures formed by strong interactions, and seem designed for systems in which information/energy storage is crucial. PMID:23563395
An iterative network partition algorithm for accurate identification of dense network modules
Sun, Siqi; Dong, Xinran; Fu, Yao; Tian, Weidong
2012-01-01
A key step in network analysis is to partition a complex network into dense modules. Currently, modularity is one of the most popular benefit functions used to partition network modules. However, recent studies suggested that it has an inherent limitation in detecting dense network modules. In this study, we observed that despite the limitation, modularity has the advantage of preserving the primary network structure of the undetected modules. Thus, we have developed a simple iterative Network Partition (iNP) algorithm to partition a network. The iNP algorithm provides a general framework in which any modularity-based algorithm can be implemented in the network partition step. Here, we tested iNP with three modularity-based algorithms: multi-step greedy (MSG), spectral clustering and Qcut. Compared with the original three methods, iNP achieved a significant improvement in the quality of network partition in a benchmark study with simulated networks, identified more modules with significantly better enrichment of functionally related genes in both yeast protein complex network and breast cancer gene co-expression network, and discovered more cancer-specific modules in the cancer gene co-expression network. As such, iNP should have a broad application as a general method to assist in the analysis of biological networks. PMID:22121225
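Modularity, the benefit function this abstract refers to, can be computed for a given partition as Q = Σ_c [L_c/m − (d_c/2m)²], where m is the number of edges, L_c the number of edges inside community c, and d_c the summed degree of its nodes. The sketch below implements this standard Newman-Girvan definition for an unweighted, undirected graph; it is not the iNP algorithm itself, and the toy graph is a hypothetical example.

```python
def modularity(edges, communities):
    """Newman-Girvan modularity of a partition of an unweighted, undirected graph."""
    m = len(edges)
    # Map every node to the index of the community that contains it.
    label = {node: i for i, group in enumerate(communities) for node in group}
    internal = [0] * len(communities)    # L_c: edges with both ends in community c
    degree_sum = [0] * len(communities)  # d_c: total degree of nodes in community c
    for a, b in edges:
        degree_sum[label[a]] += 1
        degree_sum[label[b]] += 1
        if label[a] == label[b]:
            internal[label[a]] += 1
    return sum(
        internal[c] / m - (degree_sum[c] / (2 * m)) ** 2
        for c in range(len(communities))
    )

# Hypothetical toy graph: two triangles joined by a single bridge edge (2, 3).
toy_edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
q_two_modules = modularity(toy_edges, [{0, 1, 2}, {3, 4, 5}])
q_one_module = modularity(toy_edges, [{0, 1, 2, 3, 4, 5}])
```

Splitting the toy graph into its two triangles gives Q = 5/14, while placing every node in one module gives Q = 0, which is why modularity-based methods favor the split.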
Franke, Ralf-Peter; Scharnweber, Tim; Fuhrmann, Rosemarie; Wenzel, Folker; Krüger, Anne; Mrowietz, Christof; Jung, Friedrich
2014-01-01
The membrane of red blood cells consists of a phospholipid bilayer with embedded membrane proteins and is associated on the cytoplasmic side with a network of proteins, the membrane skeleton. Band3 has an important role as the centre of functional complexes (e.g. the gas exchange complex) and as an attachment element for the membrane skeleton, maintaining membrane stability and flexibility. Up to now it has been unclear whether band3 is involved in the morphology change of red blood cells after contact with radiographic contrast media (RCM). The study revealed for the first time that Iopromide induced markedly more severe alterations of the membrane skeleton compared to Iodixanol, whose effects were similar to those seen in erythrocytes suspended in autologous plasma. A remarkable clustering of band3 was found, associated with an accumulation of band3 in spicules and also a sequestration of band3 to the extracellular space. This was evidently accompanied by a gross reduction of functional band3 complexes combined with a dissociation of spectrin from band3, leading to a loss of homogeneity of the spectrin network. It could be demonstrated for the first time that RCM not only induced echinocyte formation but also exocytosis of particles at least coated with band3. PMID:24586837
Proteome-Scale Human Interactomics.
Luck, Katja; Sheynkman, Gloria M; Zhang, Ivy; Vidal, Marc
2017-05-01
Cellular functions are mediated by complex interactome networks of physical, biochemical, and functional interactions between DNA sequences, RNA molecules, proteins, lipids, and small metabolites. A thorough understanding of cellular organization requires accurate and relatively complete models of interactome networks at proteome scale. The recent publication of four human protein-protein interaction (PPI) maps represents a technological breakthrough and an unprecedented resource for the scientific community, heralding a new era of proteome-scale human interactomics. Our knowledge gained from these and complementary studies provides fresh insights into the opportunities and challenges when analyzing systematically generated interactome data, defines a clear roadmap towards the generation of a first reference interactome, and reveals new perspectives on the organization of cellular life. Copyright © 2017 Elsevier Ltd. All rights reserved.
Pathway mapping and development of disease-specific biomarkers: protein-based network biomarkers
Chen, Hao; Zhu, Zhitu; Zhu, Yichun; Wang, Jian; Mei, Yunqing; Cheng, Yunfeng
2015-01-01
It is known that a disease is rarely a consequence of an abnormality of a single gene, but reflects the interactions of various processes in a complex network. Annotated molecular networks offer new opportunities to understand diseases within a systems biology framework and provide an excellent substrate for network-based identification of biomarkers. The network biomarkers and dynamic network biomarkers (DNBs) represent new types of biomarkers with protein–protein or gene–gene interactions that can be monitored and evaluated at different stages and time-points during development of disease. Clinical bioinformatics as a new way to combine clinical measurements and signs with human tissue-generated bioinformatics is crucial to translate biomarkers into clinical application, validate the disease specificity, and understand the role of biomarkers in clinical settings. In this article, the recent advances and developments on network biomarkers and DNBs are comprehensively reviewed. How network biomarkers contribute to a better understanding of the molecular mechanisms of diseases, the advantages and constraints of network biomarkers for clinical application, clinical bioinformatics as a bridge to the development of disease-specific, stage-specific, severity-specific and therapy-predictive biomarkers, and the potential of network biomarkers are also discussed. PMID:25560835
Häupl, Björn; Ihling, Christian H; Sinz, Andrea
2017-04-07
We present a novel approach that relies on the affinity capture of protein interaction partners from a complex mixture, followed by covalent fixation via UV-induced activation of incorporated diazirine photo-reactive amino acids (photo-methionine and photo-leucine). The captured protein complexes are enzymatically digested and interacting proteins are identified and quantified by label-free LC/MS analysis. Using HeLa cell lysates with photo-methionine and photo-leucine-labeled proteins, we were able to capture and preserve protein interactions that are otherwise elusive in conventional pull-down experiments. Our approach is exemplified for mapping the protein interaction network of protein kinase D2, but has the potential to be applied to any protein system. Data are available via ProteomeXchange with identifiers PXD005346 (photo-amino acid incorporation) and PXD005349 (enrichment experiments). This article is protected by copyright. All rights reserved.
Evasion Mechanisms Used by Pathogens to Escape the Lectin Complement Pathway.
Rosbjerg, Anne; Genster, Ninette; Pilely, Katrine; Garred, Peter
2017-01-01
The complement system is a crucial defensive network that protects the host against invading pathogens. It is part of the innate immune system and can be initiated via three pathways: the lectin, classical and alternative activation pathways. Overall, the network comprises a group of recognition molecules that bind specific patterns on microbial surfaces, a group of associated proteases that initiate the complement cascade, and a group of proteins that interact in proteolytic complexes or the terminal pore-forming complex. In addition, various regulatory proteins are important for controlling the level of activity. The result is a pro-inflammatory response meant to combat foreign microbes. Microbial elimination is, however, not a straightforward procedure; pathogens have adapted to their environment by evolving a collection of evasion mechanisms that circumvent the human complement system. Complement evasion strategies feature different ways of exploiting human complement proteins and, moreover, different pathogen-derived proteins that interfere with the normal processes. Together, these mechanisms target all three complement activation pathways as well as the final common part of the cascade. This review will cover the currently known lectin pathway evasion mechanisms and give examples of pathogens that use them to increase their chance of invasion, survival and dissemination.
Matsuura, Tomoaki; Tanimura, Naoki; Hosoda, Kazufumi; Yomo, Tetsuya; Shimizu, Yoshihiro
2017-01-01
To elucidate the dynamic features of a biologically relevant large-scale reaction network, we constructed a computational model of minimal protein synthesis consisting of 241 components and 968 reactions that synthesize the Met-Gly-Gly (MGG) peptide based on an Escherichia coli-based reconstituted in vitro protein synthesis system. We performed a simulation using parameters collected primarily from the literature and found that the rate of MGG peptide synthesis becomes nearly constant in minutes, thus achieving a steady state similar to experimental observations. In addition, concentration changes to 70% of the components, including intermediates, reached a plateau in a few minutes. However, the concentration change of each component exhibits several temporal plateaus, or a quasi-stationary state (QSS), before reaching the final plateau. To understand these complex dynamics, we focused on whether the components reached a QSS, mapped the arrangement of components in a QSS in the entire reaction network structure, and investigated time-dependent changes. We found that components in a QSS form clusters that grow over time but not in a linear fashion, and that this process involves the collapse and regrowth of clusters before the formation of a final large single cluster. These observations might commonly occur in other large-scale biological reaction networks. This developed analysis might be useful for understanding large-scale biological reactions by visualizing complex dynamics, thereby extracting the characteristics of the reaction network, including phase transitions. PMID:28167777
High-Confidence Interactome for RNF41 Built on Multiple Orthogonal Assays.
Masschaele, Delphine; Wauman, Joris; Vandemoortele, Giel; De Sutter, Delphine; De Ceuninck, Leentje; Eyckerman, Sven; Tavernier, Jan
2018-04-06
Ring finger protein 41 (RNF41) is an E3 ubiquitin ligase involved in the ubiquitination and degradation of many proteins including ErbB3 receptors, BIRC6, and parkin. Next to this, RNF41 regulates the intracellular trafficking of certain JAK2-associated cytokine receptors by ubiquitinating and suppressing USP8, which, in turn, destabilizes the ESCRT-0 complex. To further elucidate the function of RNF41 we used different orthogonal approaches to reveal the RNF41 protein complex: affinity purification-mass spectrometry, BioID, and Virotrap. We combined these results with known data sets for RNF41 obtained with microarray MAPPIT and Y2H screens. This way, we establish a comprehensive high-resolution interactome network comprising 175 candidate protein partners. To remove potential methodological artifacts from this network, we distilled the data into a high-confidence interactome map by retaining a total of 19 protein hits identified in two or more of the orthogonal methods. AP2S1, a novel RNF41 interaction partner, was selected from this high-confidence interactome for further functional validation. We reveal a role for AP2S1 in leptin and LIF receptor signaling and show that RNF41 stabilizes and relocates AP2S1.
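The filtering step described in this abstract, retaining only hits identified in two or more orthogonal methods, amounts to counting how many methods report each protein. The sketch below is one way to express that rule; the method names follow the abstract, but the hit identifiers (P1 to P5) are hypothetical placeholders and not the study's data.

```python
from collections import Counter

def high_confidence_hits(hits_by_method, min_methods=2):
    """Return hits reported by at least `min_methods` distinct methods, sorted by name."""
    # set(hits) ensures a protein is counted at most once per method.
    counts = Counter(p for hits in hits_by_method.values() for p in set(hits))
    return sorted(p for p, c in counts.items() if c >= min_methods)

# Hypothetical hit lists; protein identifiers are placeholders.
toy_hits = {
    "AP-MS": ["P1", "P2", "P3"],
    "BioID": ["P2", "P4"],
    "Virotrap": ["P3", "P2", "P5"],
}
retained = high_confidence_hits(toy_hits)
```

Raising `min_methods` trades coverage for confidence: a stricter threshold removes more method-specific artifacts but also discards genuine partners seen by only one assay.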
A Non-ATP Competitive Inhibitor of BCR-ABL for the Therapy of Imatinib-Resistant CMLs
2008-05-01
HSP90 members (e.g. HSP70 and HSP27) help maintain the macromolecular complex that assembled the Bcr-Abl/Jak2 signaling Network. Our published papers 4...kinase, Akt and GSK3 beta. Our recent findings suggest that the HSP90/HSP70/HSP27 chaperone proteins are also part of this Network, and may play a
Vincent, Maxence S.; Canestrari, Mickaël J.; Leone, Philippe; Stathopulos, Julien; Ize, Bérengère; Zoued, Abdelrahim; Cambillau, Christian; Kellenberger, Christine; Roussel, Alain
2017-01-01
The transport of proteins at the cell surface of Bacteroidetes depends on a secretory apparatus known as type IX secretion system (T9SS). This machine is responsible for the cell surface exposition of various proteins, such as adhesins, required for gliding motility in Flavobacterium, S-layer components in Tannerella forsythia, and tooth tissue-degrading enzymes in the oral pathogen Porphyromonas gingivalis. Although a number of subunits of the T9SS have been identified, we lack details on the architecture of this secretion apparatus. Here we provide evidence that five of the genes encoding the core complex of the T9SS are co-transcribed and that the gene products are distributed in the cell envelope. Protein-protein interaction studies then revealed that these proteins oligomerize and interact through a dense network of contacts. PMID:28057754
Won, Seoung Youn; Kim, Cha Yeon; Kim, Doyoun; Ko, Jaewon; Um, Ji Won; Lee, Sung Bae; Buck, Matthias; Kim, Eunjoon; Heo, Won Do; Lee, Jie-Oh; Kim, Ho Min
2017-01-01
The leukocyte common antigen-related receptor protein tyrosine phosphatases (LAR-RPTPs) are cellular receptors of heparan sulfate (HS) and chondroitin sulfate (CS) proteoglycans that direct axonal growth and neuronal regeneration. LAR-RPTPs are also synaptic adhesion molecules that form trans-synaptic adhesion complexes by binding to various postsynaptic adhesion ligands, such as Slit- and Trk-like family of proteins (Slitrks), IL-1 receptor accessory protein-like 1 (IL1RAPL1), interleukin-1 receptor accessory protein (IL-1RAcP) and neurotrophin receptor tyrosine kinase C (TrkC), to regulate synaptogenesis. Here, we determined the crystal structure of the human LAR-RPTP/IL1RAPL1 complex and found that lateral interactions between neighboring LAR-RPTP/IL1RAPL1 complexes in crystal lattices are critical for the higher-order assembly and synaptogenic activity of these complexes. Moreover, we found that LAR-RPTP binding to the postsynaptic adhesion ligands, Slitrk3, IL1RAPL1 and IL-1RAcP, but not TrkC, induces reciprocal higher-order clustering of trans-synaptic adhesion complexes. Although LAR-RPTP clustering was induced by either HS or postsynaptic adhesion ligands, the dominant binding of HS to the LAR-RPTP was capable of dismantling pre-established LAR-RPTP-mediated trans-synaptic adhesion complexes. These findings collectively suggest that LAR-RPTP clustering for synaptogenesis is modulated by a complex synapse-organizing protein network. PMID:29081732
A Rich-Club Organization in Brain Ischemia Protein Interaction Network
Alawieh, Ali; Sabra, Zahraa; Sabra, Mohammed; Tomlinson, Stephen; Zaraket, Fadi A.
2015-01-01
Ischemic stroke involves multiple pathophysiological mechanisms with complex interactions. Efforts to decipher those mechanisms and understand the evolution of cerebral injury are key for developing successful interventions. In an innovative approach, we use literature mining, natural language processing and systems biology tools to construct, annotate and curate a brain ischemia interactome. The curated interactome includes proteins that are deregulated after cerebral ischemia in human and experimental stroke. Network analysis of the interactome revealed a rich-club organization indicating the presence of a densely interconnected hub structure of prominent contributors to disease pathogenesis. Functional annotation of the interactome uncovered prominent pathways and highlighted the critical role of the complement and coagulation cascade in the initiation and amplification of injury starting by activation of the rich-club. We performed an in-silico screen for putative interventions that have pleiotropic effects on rich-club components and we identified estrogen as a prominent candidate. Our findings show that complex network analysis of disease related interactomes may lead to a better understanding of pathogenic mechanisms and provide cost-effective and mechanism-based discovery of candidate therapeutics. PMID:26310627
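For readers unfamiliar with the term, a rich-club organization is usually quantified with the rich-club coefficient: among nodes of degree greater than k, the fraction of possible links that actually exist. The sketch below computes that unnormalized coefficient for an undirected graph. It is an illustration only; the abstract does not state how the authors computed or normalized the coefficient (in practice it is commonly compared against randomized networks), and the function name and toy graph are assumptions.

```python
def rich_club_coefficient(edges, k):
    """Unnormalized rich-club coefficient phi(k) of an undirected graph.

    phi(k) = 2 * E_k / (N_k * (N_k - 1)), where N_k is the number of nodes
    with degree > k and E_k is the number of edges among those nodes.
    Returns None when fewer than two nodes exceed degree k.
    """
    neighbors = {}
    for u, v in edges:
        if u == v:
            continue  # ignore self-loops
        neighbors.setdefault(u, set()).add(v)
        neighbors.setdefault(v, set()).add(u)

    rich = {node for node, nbrs in neighbors.items() if len(nbrs) > k}
    n_rich = len(rich)
    if n_rich < 2:
        return None

    # Each edge inside the club is seen from both endpoints, hence the // 2.
    e_rich = sum(1 for u in rich for v in neighbors[u] if v in rich) // 2
    return 2 * e_rich / (n_rich * (n_rich - 1))


# Hypothetical toy graph: three fully connected hubs, each with two leaf nodes.
toy_edges = [
    ("A", "B"), ("B", "C"), ("A", "C"),
    ("A", "a1"), ("A", "a2"),
    ("B", "b1"), ("B", "b2"),
    ("C", "c1"), ("C", "c2"),
]
phi_hubs = rich_club_coefficient(toy_edges, k=1)
phi_all = rich_club_coefficient(toy_edges, k=0)
print(phi_hubs, phi_all)
```

In the toy graph the hubs form a complete triangle, so the coefficient rises sharply once the low-degree leaves are excluded, which is the qualitative signature of a rich club.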
Prokaryotic ancestry of eukaryotic protein networks mediating innate immunity and apoptosis.
Dunin-Horkawicz, Stanislaw; Kopec, Klaus O; Lupas, Andrei N
2014-04-03
Protein domains characteristic of eukaryotic innate immunity and apoptosis have many prokaryotic counterparts of unknown function. By reconstructing interactomes computationally, we found that bacterial proteins containing these domains are part of a network that also includes other domains not hitherto associated with immunity. This network is connected to the network of prokaryotic signal transduction proteins, such as histidine kinases and chemoreceptors. The network varies considerably in domain composition and degree of paralogy, even between strains of the same species, and its repetitive domains are often amplified recently, with individual repeats sharing up to 100% sequence identity. Both phenomena are evidence of considerable evolutionary pressure and thus compatible with a role in the "arms race" between host and pathogen. In order to investigate the relationship of this network to its eukaryotic counterparts, we performed a cluster analysis of organisms based on a census of its constituent domains across all fully sequenced genomes. We obtained a large central cluster of mainly unicellular organisms, from which multicellular organisms radiate out in two main directions. One is taken by multicellular bacteria, primarily cyanobacteria and actinomycetes, and plants form an extension of this direction, connected via the basal, unicellular cyanobacteria. The second main direction is taken by animals and fungi, which form separate branches with a common root in the α-proteobacteria of the central cluster. This analysis supports the notion that the innate immunity networks of eukaryotes originated from their endosymbionts and that increases in the complexity of these networks accompanied the emergence of multicellularity. Copyright © 2014 The Authors. Published by Elsevier Ltd. All rights reserved.
Glazer, Lilah; Roth, Ziv; Weil, Simy; Aflalo, Eliahu D; Khalaila, Isam; Sagi, Amir
2015-10-14
Chitin is a major component of arthropod cuticles, where it forms a three-dimensional network that constitutes the scaffold upon which cuticles form. The chitin fibers that form this network are closely associated with specific structural proteins, while the cuticular matrix contains many additional structural, enzymatic and other proteins. We study the crayfish gastrolith as a simple model for the assembly of calcified cuticular structures, with particular focus on the proteins involved in this process. The present study integrates a gastrolith-forming epithelium transcriptomic library with data from mass spectrometry analysis of proteins extracted from the gastrolith matrix to obtain a near-complete picture of gastrolith protein content. Using native protein separation we identified 24 matrix proteins, of which 14 are novel. Further analysis led to the discovery of three putative protein complexes, all containing GAP 65, the most abundant gastrolith structural protein. Using immunological methods we further studied the role of GAP 65 in the gastrolith matrix and forming epithelium, as well as in the newly identified protein complexes. We propose that gastrolith matrix construction is a sequential process in which protein complexes are dynamically assembled and disassembled around GAP 65, thus changing their functional properties to perform each step in the construction process. The scientific interest on which this study is based arises from three main features of gastroliths: (1) Gastroliths possess partial analogy to cuticles both in structural and molecular properties, and may be regarded, with the appropriate reservations (see Introduction), as simple models for cuticle assembly. At the same time, gastroliths are terminally assembled during a well-defined period, which can be controlled in the laboratory, making them significantly easier to study than cuticles. 
(2) Gastroliths, like the crayfish exoskeleton, contain stable amorphous calcium carbonate (ACC) rather than crystalline calcite. The biological mechanism for the stabilization of a naturally unstable, but at the same time biologically highly available, calcium carbonate polymorph is of great interest from the pharmaceutical point of view. (3) The gastrolith organic matrix is based on a highly structured chitin network that interacts with a variety of substances. This biologically manipulated, biodegradable structure is in itself of biotechnological and pharmaceutical potential. A growing body of evidence indicates that proteins play central roles in all above aspects of gastrolith construction. This study offers the first comprehensive screening of gastrolith proteins, and we believe that the analysis presented in this work can not only help reveal basic biological questions regarding assembly of mineralized and non-mineralized cuticular structures, but may also serve as basis for applied research in the fields of agriculture (e.g. cuticle-based pest management), health (e.g. bioavailable calcium supplements and biodegradable drug carriers) and materials science (e.g. non-toxic scaffolds for water purification). Copyright © 2015. Published by Elsevier B.V.
Riera-Fernández, Pablo; Munteanu, Cristian R; Escobar, Manuel; Prado-Prado, Francisco; Martín-Romalde, Raquel; Pereira, David; Villalba, Karen; Duardo-Sánchez, Aliuska; González-Díaz, Humberto
2012-01-21
Graph and Complex Network theory is expanding its application to different levels of matter organization such as molecular, biological, technological, and social networks. A network is a set of items, usually called nodes, with connections between them, which are called links or edges. There are many different experimental and/or theoretical methods to assign node-node links depending on the type of network we want to construct. Unfortunately, the use of a method for experimental reevaluation of the entire network is very expensive in terms of time and resources; thus the development of cheaper theoretical methods is of major importance. In addition, different methods to link nodes in the same type of network are not totally accurate, so their results do not always coincide. In this sense, the development of computational methods useful to evaluate connectivity quality in complex networks (a posteriori to network assembly) is a goal of major interest. In this work, we report for the first time a new method to calculate numerical quality scores S(L(ij)) for network links L(ij) (connectivity) based on the k-th order Markov-Shannon Entropy indices (θ(k)) for network nodes. The algorithm may be summarized as follows: (i) first, the θ(k)(j) values are calculated for all j-th nodes in a complex network already constructed; (ii) a Linear Discriminant Analysis (LDA) is used to seek a linear equation that discriminates experimentally confirmed connected or linked (L(ij)=1) pairs of nodes from non-linked ones (L(ij)=0); (iii) the new model is validated with external series of pairs of nodes; (iv) the equation obtained is used to re-evaluate the connectivity quality of the network, connecting/disconnecting nodes based on the quality scores calculated with the new connectivity function. This method was used to study different types of large networks. 
The linear models obtained produced the following results in terms of overall accuracy for network reconstruction: Metabolic networks (72.3%), Parasite-Host networks (93.3%), CoCoMac brain cortex co-activation network (89.6%), NW Spain fasciolosis spreading network (97.2%), Spanish financial law network (89.9%) and World trade network for Intelligent & Active Food Packaging (92.8%). In order to seek these models, we studied an average of 55,388 pairs of nodes in each model and a total of 332,326 pairs of nodes in all models. Finally, this method was used to solve a more complicated problem. A model was developed to score the connectivity quality in the Drug-Target network of US FDA approved drugs. In this last model the θ(k) values were calculated for three types of molecular networks representing different levels of organization: drug molecular graphs (atom-atom bonds), protein residue networks (amino acid interactions), and drug-target network (compound-protein binding). The overall accuracy of this model was 76.3%. This work opens a new door to the computational reevaluation of network connectivity quality (collation) for complex systems in molecular, biomedical, technological, and legal-social sciences as well as in world trade and industry. Copyright © 2011 Elsevier Ltd. All rights reserved.
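Step (i) of the algorithm summarized in this abstract can be sketched as below. The abstract does not give the formula for θ(k), so this example rests on a loudly stated assumption: a random walk starts from a uniform distribution over nodes, moves for k steps along a row-normalized adjacency matrix, and the node index is taken as θ(k)(j) = -p(k)(j) ln p(k)(j). The function name and the toy graphs are hypothetical, and the LDA step (ii) is not shown.

```python
import math


def markov_shannon_entropies(adjacency, k):
    """Per-node entropy terms after k random-walk steps (assumed definition).

    adjacency: square list-of-lists with non-negative link weights.
    Returns [-p_j * ln(p_j)] where p is the node distribution after k steps,
    starting from a uniform distribution (an assumption of this sketch).
    """
    n = len(adjacency)
    # Row-normalize the adjacency matrix into a Markov transition matrix.
    transition = []
    for row in adjacency:
        degree = sum(row)
        transition.append([a / degree if degree else 0.0 for a in row])

    p = [1.0 / n] * n
    for _ in range(k):
        p = [sum(p[i] * transition[i][j] for i in range(n)) for j in range(n)]

    return [-(x * math.log(x)) if x > 0 else 0.0 for x in p]


# Hypothetical toy graphs: a triangle and a three-node path.
triangle = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]
path = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
theta_triangle = markov_shannon_entropies(triangle, k=3)
theta_path = markov_shannon_entropies(path, k=1)
print(theta_triangle, theta_path)
```

Under this reading, a pair of nodes (i, j) would be described by their θ(k) values for several k, and those values would serve as the input variables of the discriminant function.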
Shrestha, Kushal; Jakubikova, Elena
2015-08-20
Light-harvesting antennas are protein-pigment complexes that play a crucial role in natural photosynthesis. The antenna complexes absorb light and transfer energy to photosynthetic reaction centers where charge separation occurs. This work focuses on computational studies of the electronic structure of the pigment networks of light-harvesting complex I (LH1), LH1 with the reaction center (RC-LH1), and light-harvesting complex II (LH2) found in purple bacteria. As the pigment networks of LH1, RC-LH1, and LH2 contain thousands of atoms, conventional density functional theory (DFT) and ab initio calculations of these systems are not computationally feasible. Therefore, we utilize DFT in conjunction with the energy-based fragmentation with molecular orbitals method and a semiempirical approach employing the extended Hückel model Hamiltonian to determine the electronic properties of these pigment assemblies. Our calculations provide a deeper understanding of the electronic structure of natural light-harvesting complexes, especially their pigment networks, which could assist in rational design of artificial photosynthetic devices.
NASA Astrophysics Data System (ADS)
Pan, Xiaoliang; Schwartz, Steven
2015-03-01
It has long been recognized that the structure of a protein is a hierarchy of conformations interconverting on multiple time scales. However, the conformational heterogeneity is rarely considered in the context of enzymatic catalysis in which the reactant is usually represented by a single conformation of the enzyme/substrate complex. Lactate dehydrogenase (LDH) catalyzes the interconversion of pyruvate and lactate with concomitant interconversion of two forms of the cofactor nicotinamide adenine dinucleotide (NADH and NAD+). Recent experimental results suggest that multiple substates exist within the Michaelis complex of LDH, and that they are catalytically competent at different reaction rates. In this study, millisecond-scale all-atom molecular dynamics simulations were performed on LDH to explore the free energy landscape of the Michaelis complex, and network analysis was used to characterize the distribution of the conformations. Our results provide a detailed view of the kinetic network of the Michaelis complex and the structures of the substates at atomistic scale. They also shed some light on the complete picture of the catalytic mechanism of LDH.
Interactome disassembly during apoptosis occurs independent of caspase cleavage.
Scott, Nichollas E; Rogers, Lindsay D; Prudova, Anna; Brown, Nat F; Fortelny, Nikolaus; Overall, Christopher M; Foster, Leonard J
2017-01-12
Protein-protein interaction networks (interactomes) define the functionality of all biological systems. In apoptosis, proteolysis by caspases is thought to initiate disassembly of protein complexes and cell death. Here we used a quantitative proteomics approach, protein correlation profiling (PCP), to explore changes in cytoplasmic and mitochondrial interactomes in response to apoptosis initiation as a function of caspase activity. We measured the response to initiation of Fas-mediated apoptosis in 17,991 interactions among 2,779 proteins, comprising the largest dynamic interactome to date. The majority of interactions were unaffected early in apoptosis, but multiple complexes containing known caspase targets were disassembled. Nonetheless, proteome-wide analysis of proteolytic processing by terminal amine isotopic labeling of substrates (TAILS) revealed little correlation between proteolytic and interactome changes. Our findings show that, in apoptosis, significant interactome alterations occur before and independently of caspase activity. Thus, apoptosis initiation includes a tight program of interactome rearrangement, leading to disassembly of relatively few, select complexes. These early interactome alterations occur independently of cleavage of these proteins by caspases. © 2017 The Authors. Published under the terms of the CC BY 4.0 license.
Magnesium degradation as determined by artificial neural networks.
Willumeit, Regine; Feyerabend, Frank; Huber, Norbert
2013-11-01
Magnesium degradation under physiological conditions is a highly complex process in which temperature, the use of cell culture growth medium and the presence of CO2, O2 and proteins can influence the corrosion rate and the composition of the resulting corrosion layer. Due to the complexity of this process it is almost impossible to predict the parameters that are most important and whether some parameters have a synergistic effect on the corrosion rate. Artificial neural networks are a mathematical tool that can be used to approximate and analyse non-linear problems with multiple inputs. In this work we present the first analysis of corrosion data obtained using this method, which reveals that CO2 and the composition of the buffer system play a crucial role in the corrosion of magnesium, whereas O2, proteins and temperature play a less prominent role. Copyright © 2013 Acta Materialia Inc. Published by Elsevier Ltd. All rights reserved.
The LGI1–ADAM22 protein complex directs synapse maturation through regulation of PSD-95 function
Lovero, Kathryn L.; Fukata, Yuko; Granger, Adam J.; Fukata, Masaki; Nicoll, Roger A.
2015-01-01
Synapse development is coordinated by a number of transmembrane and secreted proteins that come together to form synaptic organizing complexes. Whereas a variety of synaptogenic proteins have been characterized, much less is understood about the molecular networks that support the maintenance and functional maturation of nascent synapses. Here, we demonstrate that leucine-rich, glioma-inactivated protein 1 (LGI1), a secreted protein previously shown to modulate synaptic AMPA receptors, is a paracrine signal released from pre- and postsynaptic neurons that acts specifically through a disintegrin and metalloproteinase protein 22 (ADAM22) to set postsynaptic strength. We go on to describe a novel role for ADAM22 in maintaining excitatory synapses through PSD-95/Dlg1/zo-1 (PDZ) domain interactions. Finally, we show that in the absence of LGI1, the mature synapse scaffolding protein PSD-95, but not the immature synapse scaffolding protein SAP102, is unable to modulate synaptic transmission. These results indicate that LGI1 and ADAM22 form an essential synaptic organizing complex that coordinates the maturation of excitatory synapses by regulating the functional incorporation of PSD-95. PMID:26178195
Mallik, Saurav; Basu, Sudipto; Hait, Suman; Kundu, Sudip
2018-04-21
Do coding and regulatory segments of a gene co-evolve with each other? Seeking answers to this question, here we analyze the case of Escherichia coli ribosomal protein S15, which represses its own translation by specifically binding its messenger RNA (rpsO mRNA) and stabilizing a pseudoknot structure at the upstream untranslated region, thus trapping the ribosome into an incomplete translation initiation complex. In the absence of S15, ribosomal protein S1 recognizes rpsO and promotes translation by melting this very pseudoknot. We employ a robust statistical method to detect signatures of positive epistasis between residue site pairs and find that biophysical constraints of translational regulation (S15-rpsO and S1-rpsO recognition, S15-mediated rpsO structural rearrangement, and S1-mediated melting) are strong predictors of positive epistasis. Transforming the epistatic pairs into a network, we find that signatures of two different but interconnected regulatory cascades are imprinted in the sequence-space and can be captured in terms of two dense network modules that are sparsely connected to each other. This network topology further reflects a general principle of how functionally coupled components of biological networks are interconnected. These results depict a model case, where translational regulation drives characteristic residue-level epistasis, not only between a protein and its own mRNA but also between a protein and the mRNA of an entirely different protein. © 2018 Wiley Periodicals, Inc.
Actin-myosin network is required for proper assembly of influenza virus particles
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kumakura, Michiko; Kawaguchi, Atsushi, E-mail: ats-kawaguchi@md.tsukuba.ac.jp; Nagata, Kyosuke, E-mail: knagata@md.tsukuba.ac.jp
Actin filaments are known to play a central role in cellular dynamics. After polymerization of actin, various actin-crosslinking proteins including non-muscle myosin II facilitate the formation of spatially organized actin filament networks. The actin-myosin network is highly expanded beneath the plasma membrane. The genome of influenza virus (vRNA) replicates in the cell nucleus. Then, newly synthesized vRNAs are nuclear-exported to the cytoplasm as ribonucleoprotein complexes (vRNPs), followed by transport to the region beneath the plasma membrane where virus particles assemble. Here, we found that, by inhibiting actin-myosin network formation, the virus titer tends to be reduced and HA viral spike protein is aggregated on the plasma membrane. These results indicate that the actin-myosin network plays an important role in the virus formation. - Highlights: • Actin-myosin network is important for the influenza virus production. • HA forms aggregations at the plasma membrane in the presence of blebbistatin. • M1 is recruited to the budding site through the actin-myosin network.
A novel role for WAVE1 in controlling actin network growth rate and architecture
Sweeney, Meredith O.; Collins, Agnieszka; Padrick, Shae B.; Goode, Bruce L.
2015-01-01
Branched actin filament networks in cells are assembled through the combined activities of Arp2/3 complex and different WASP/WAVE proteins. Here we used TIRF and electron microscopy to directly compare for the first time the assembly kinetics and architectures of actin filament networks produced by Arp2/3 complex and dimerized VCA regions of WAVE1, WAVE2, or N-WASP. WAVE1 produced strikingly different networks from WAVE2 or N-WASP, which comprised unexpectedly short filaments. Further analysis showed that the WAVE1-specific activity stemmed from an inhibitory effect on filament elongation both in the presence and absence of Arp2/3 complex, which was observed even at low stoichiometries of WAVE1 to actin monomers, precluding an effect from monomer sequestration. Using a series of VCA chimeras, we mapped the elongation inhibitory effects of WAVE1 to its WH2 (“V”) domain. Further, mutating a single conserved lysine residue potently disrupted WAVE1's inhibitory effects. Taken together, our results show that WAVE1 has unique activities independent of Arp2/3 complex that can govern both the growth rates and architectures of actin filament networks. Such activities may underlie previously observed differences between the cellular functions of WAVE1 and WAVE2. PMID:25473116
An integrative approach to inferring biologically meaningful gene modules
2011-01-01
Background The ability to construct biologically meaningful gene networks and modules is critical for contemporary systems biology. Though recent studies have demonstrated the power of using gene modules to shed light on the functioning of complex biological systems, most modules in these networks have shown little association with meaningful biological function. We have devised a method which directly incorporates gene ontology (GO) annotation in construction of gene modules in order to gain better functional association. Results We have devised a method, Semantic Similarity-Integrated approach for Modularization (SSIM) that integrates various gene-gene pairwise similarity values, including information obtained from gene expression, protein-protein interactions and GO annotations, in the construction of modules using affinity propagation clustering. We demonstrated the performance of the proposed method using data from two complex biological responses: 1. the osmotic shock response in Saccharomyces cerevisiae, and 2. the prion-induced pathogenic mouse model. In comparison with two previously reported algorithms, modules identified by SSIM showed significantly stronger association with biological functions. Conclusions The incorporation of semantic similarity based on GO annotation with gene expression and protein-protein interaction data can greatly enhance the functional relevance of inferred gene modules. In addition, the SSIM approach can also reveal the hierarchical structure of gene modules to gain a broader functional view of the biological system. Hence, the proposed method can facilitate comprehensive and in-depth analysis of high throughput experimental data at the gene network level. PMID:21791051
Mdm2 mediates FMRP- and Gp1 mGluR-dependent protein translation and neural network activity.
Liu, Dai-Chi; Seimetz, Joseph; Lee, Kwan Young; Kalsotra, Auinash; Chung, Hee Jung; Lu, Hua; Tsai, Nien-Pei
2017-10-15
Activating Group 1 (Gp1) metabotropic glutamate receptors (mGluRs), including mGluR1 and mGluR5, elicits translation-dependent neural plasticity mechanisms that are crucial to animal behavior and circuit development. Dysregulated Gp1 mGluR signaling has been observed in numerous neurological and psychiatric disorders. However, the molecular pathways underlying Gp1 mGluR-dependent plasticity mechanisms are complex and have been elusive. In this study, we identified a novel mechanism through which Gp1 mGluR mediates protein translation and neural plasticity. Using a multi-electrode array (MEA) recording system, we showed that activating Gp1 mGluR elevates neural network activity, as demonstrated by increased spontaneous spike frequency and burst activity. Importantly, we validated that elevating neural network activity requires protein translation and is dependent on fragile X mental retardation protein (FMRP), the protein that is deficient in the most common inherited form of mental retardation and autism, fragile X syndrome (FXS). In an effort to determine the mechanism by which FMRP mediates protein translation and neural network activity, we demonstrated that a ubiquitin E3 ligase, murine double minute-2 (Mdm2), is required for Gp1 mGluR-induced translation and neural network activity. Our data showed that Mdm2 acts as a translation suppressor, and FMRP is required for its ubiquitination and down-regulation upon Gp1 mGluR activation. These data revealed a novel mechanism by which Gp1 mGluR and FMRP mediate protein translation and neural network activity, potentially through de-repressing Mdm2. Our results also introduce an alternative way for understanding altered protein translation and brain circuit excitability associated with Gp1 mGluR in neurological diseases such as FXS. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Reverse and forward engineering of protein pattern formation.
Kretschmer, Simon; Harrington, Leon; Schwille, Petra
2018-05-26
Living systems employ protein pattern formation to regulate important life processes in space and time. Although pattern-forming protein networks have been identified in various prokaryotes and eukaryotes, their systematic experimental characterization is challenging owing to the complex environment of living cells. In turn, cell-free systems are ideally suited for this goal, as they offer defined molecular environments that can be precisely controlled and manipulated. Towards revealing the molecular basis of protein pattern formation, we outline two complementary approaches: the biochemical reverse engineering of reconstituted networks and the de novo design, or forward engineering, of artificial self-organizing systems. We first illustrate the reverse engineering approach by the example of the Escherichia coli Min system, a model system for protein self-organization based on the reversible and energy-dependent interaction of the ATPase MinD and its activating protein MinE with a lipid membrane. By reconstituting MinE mutants impaired in ATPase stimulation, we demonstrate how large-scale Min protein patterns are modulated by MinE activity and concentration. We then provide a perspective on the de novo design of self-organizing protein networks. Tightly integrated reverse and forward engineering approaches will be key to understanding and engineering the intriguing phenomenon of protein pattern formation. This article is part of the theme issue 'Self-organization in cell biology'. © 2018 The Author(s).
Coarse-graining and self-dissimilarity of complex networks
NASA Astrophysics Data System (ADS)
Itzkovitz, Shalev; Levitt, Reuven; Kashtan, Nadav; Milo, Ron; Itzkovitz, Michael; Alon, Uri
2005-01-01
Can complex engineered and biological networks be coarse-grained into smaller and more understandable versions in which each node represents an entire pattern in the original network? To address this, we define coarse-graining units as connectivity patterns which can serve as the nodes of a coarse-grained network and present algorithms to detect them. We use this approach to systematically reverse-engineer electronic circuits, forming understandable high-level maps from incomprehensible transistor wiring: first, a coarse-grained version in which each node is a gate made of several transistors is established. Then the coarse-grained network is itself coarse-grained, resulting in a high-level blueprint in which each node is a circuit module made of many gates. We apply our approach also to a mammalian protein signal-transduction network, to find a simplified coarse-grained network with three main signaling channels that resemble multi-layered perceptrons made of cross-interacting MAP-kinase cascades. We find that both biological and electronic networks are “self-dissimilar,” with different network motifs at each level. The present approach may be used to simplify a variety of directed and nondirected, natural and designed networks.
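The idea of a coarse-grained network, in which each node stands for a whole pattern of the original network, can be illustrated with a small sketch. The example below covers only the final collapsing step, assuming the assignment of nodes to units is already known; it does not implement the authors' algorithms for detecting coarse-graining units. The function name, node labels, and unit labels are hypothetical.

```python
def coarse_grain(edges, unit_of):
    """Collapse a directed edge list onto coarse-graining units.

    edges: iterable of (source, target) node pairs.
    unit_of: dict mapping every node to the unit (pattern) that contains it.
    Returns the sorted list of distinct directed edges between different units.
    """
    coarse_edges = set()
    for source, target in edges:
        a, b = unit_of[source], unit_of[target]
        if a != b:  # wiring internal to a unit disappears at the coarse level
            coarse_edges.add((a, b))
    return sorted(coarse_edges)


# Hypothetical toy wiring: five "transistors" grouped into three "gates".
toy_edges = [("t1", "t2"), ("t2", "t3"), ("t3", "t4"), ("t4", "t5"), ("t1", "t4")]
toy_units = {"t1": "gateA", "t2": "gateA", "t3": "gateB", "t4": "gateB", "t5": "gateC"}
coarse = coarse_grain(toy_edges, toy_units)
print(coarse)
```

Applying the same function again, with a mapping from gates to modules, would mirror the two-level procedure the abstract describes for electronic circuits.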
Tropomyosin modulates erythrocyte membrane stability
An, Xiuli; Salomao, Marcela; Guo, Xinhua; Gratzer, Walter; Mohandas, Narla
2007-01-01
The ternary complex of spectrin, actin, and 4.1R (human erythrocyte protein 4.1) defines the nodes of the erythrocyte membrane skeletal network and is inseparable from membrane stability under mechanical stress. These junctions also contain tropomyosin (TM) and the other actin-binding proteins, adducin, protein 4.9, tropomodulin, and a small proportion of capZ, the functions of which are poorly defined. Here, we have examined the consequences of selective elimination of TM from the membrane. We have shown that the mechanical stability of the membranes of resealed ghosts devoid of TM is grossly, but reversibly, impaired. That the decreased membrane stability of TM-depleted membranes is the result of destabilization of the ternary complex of the network junctions is demonstrated by the strongly facilitated entry into the junctions in situ of a β-spectrin peptide, containing the actin- and 4.1R-binding sites, after extraction of the TM. The stabilizing effect of TM is highly specific, in that it is only the endogenous isotype, and not the slightly longer muscle TM, that can bind to the depleted membranes and restore their mechanical stability. These findings have enabled us to identify a function for TM in elevating the mechanical stability of erythrocyte membranes by stabilizing the spectrin-actin-4.1R junctional complex. PMID:17008534
Multi-omics approach identifies molecular mechanisms of plant-fungus mycorrhizal interaction
Larsen, Peter E.; Sreedasyam, Avinash; Trivedi, Geetika; ...
2016-01-19
In mycorrhizal symbiosis, plant roots form close, mutually beneficial interactions with soil fungi. Before this mycorrhizal interaction can be established, however, plant roots must be capable of detecting potential beneficial fungal partners and initiating the gene expression patterns necessary to begin symbiosis. To predict plant root – mycorrhizal fungi sensor systems, we analyzed in vitro experiments of Populus tremuloides (aspen tree) and Laccaria bicolor (mycorrhizal fungi) interaction and leveraged over 200 previously published transcriptomic experimental data sets, 159 experimentally validated plant transcription factor binding motifs, and more than 120-thousand experimentally validated protein-protein interactions to generate models of pre-mycorrhizal sensor systems in aspen root. These sensor mechanisms link extracellular signaling molecules with gene regulation through a network comprised of membrane receptors, signal cascade proteins, transcription factors, and transcription factor binding DNA motifs. Modeling predicted four pre-mycorrhizal sensor complexes in aspen that interact with fifteen transcription factors to regulate the expression of 1184 genes in response to extracellular signals synthesized by Laccaria. Predicted extracellular signaling molecules include common signaling molecules such as phenylpropanoids, salicylate, and jasmonic acid. Lastly, this multi-omic computational modeling approach for predicting the complex sensory networks yielded specific, testable biological hypotheses for mycorrhizal interaction signaling compounds, sensor complexes, and mechanisms of gene regulation.
Multi-omics approach identifies molecular mechanisms of plant-fungus mycorrhizal interaction
DOE Office of Scientific and Technical Information (OSTI.GOV)
Larsen, Peter E.; Sreedasyam, Avinash; Trivedi, Geetika
In mycorrhizal symbiosis, plant roots form close, mutually beneficial interactions with soil fungi. Before this mycorrhizal interaction can be established, however, plant roots must be capable of detecting potentially beneficial fungal partners and initiating the gene expression patterns necessary to begin symbiosis. To predict plant root – mycorrhizal fungi sensor systems, we analyzed in vitro experiments of Populus tremuloides (aspen tree) and Laccaria bicolor (mycorrhizal fungi) interaction and leveraged over 200 previously published transcriptomic experimental data sets, 159 experimentally validated plant transcription factor binding motifs, and more than 120-thousand experimentally validated protein-protein interactions to generate models of pre-mycorrhizal sensor systems in aspen root. These sensor mechanisms link extracellular signaling molecules with gene regulation through a network comprised of membrane receptors, signal cascade proteins, transcription factors, and transcription factor binding DNA motifs. Modeling predicted four pre-mycorrhizal sensor complexes in aspen that interact with fifteen transcription factors to regulate the expression of 1184 genes in response to extracellular signals synthesized by Laccaria. Predicted extracellular signaling molecules include common signaling molecules such as phenylpropanoids, salicylate, and jasmonic acid. Lastly, this multi-omic computational modeling approach for predicting the complex sensory networks yielded specific, testable biological hypotheses for mycorrhizal interaction signaling compounds, sensor complexes, and mechanisms of gene regulation.
Duardo-Sánchez, Aliuska; Munteanu, Cristian R; Riera-Fernández, Pablo; López-Díaz, Antonio; Pazos, Alejandro; González-Díaz, Humberto
2014-01-27
The use of numerical parameters in Complex Network analysis is expanding to new fields of application. At a molecular level, we can use them to describe the molecular structure of chemical entities, protein interactions, or metabolic networks. However, the applications are not restricted to the world of molecules and can be extended to the study of macroscopic nonliving systems, organisms, or even legal or social networks. On the other hand, the development of the field of Artificial Intelligence has led to the formulation of computational algorithms whose design is based on the structure and functioning of networks of biological neurons. These algorithms, called Artificial Neural Networks (ANNs), can be useful for the study of complex networks, since the numerical parameters that encode information of the network (for example centralities/node descriptors) can be used as inputs for the ANNs. The Wiener index (W) is a graph invariant widely used in chemoinformatics to quantify the molecular structure of drugs and to study complex networks. In this work, we explore for the first time the possibility of using Markov chains to calculate analogues of node distance numbers/W to describe complex networks from the point of view of their nodes. These parameters are called kth-order Markov-Wiener node descriptors (W(k)). Please note that these descriptors are not related to Markov-Wiener stochastic processes. Here, we calculated the W(k)(i) values for a very high number of nodes (>100,000) in more than 100 different complex networks using the software MI-NODES. These networks were grouped according to the field of application. Molecular networks include the Metabolic Reaction Networks (MRNs) of 40 different organisms. In addition, we analyzed other biological, legal, and social networks. These include the Interaction Web Database Biological Networks (IWDBNs), with 75 food webs or ecological systems, and the Spanish Financial Law Network (SFLN).
The calculated W(k)(i) values were used as inputs for different ANNs in order to discriminate correct node connectivity patterns from incorrect random patterns. The MIANN models obtained present good values of Sensitivity/Specificity (%): MRNs (78/78), IWDBNs (90/88), and SFLN (86/84). These preliminary results are very promising from the point of view of a first exploratory study and suggest that the use of these models could be extended to the high-throughput re-evaluation of connectivity in known complex networks (collation).
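As a rough illustration of the Markov-chain idea behind such node descriptors, the sketch below turns an adjacency matrix into a random-walk transition matrix and computes its k-step probabilities. It does not reproduce the W(k)(i) formula implemented in MI-NODES, which the abstract does not state; the function names and the toy path graph are hypothetical.

```python
def transition_matrix(adj):
    """Row-normalise an adjacency matrix into random-walk probabilities."""
    rows = []
    for row in adj:
        degree = sum(row)
        if degree == 0:
            # Assumption: isolated nodes are rejected rather than patched.
            raise ValueError("isolated node: random walk is undefined")
        rows.append([x / degree for x in row])
    return rows


def mat_mul(a, b):
    """Multiply two square matrices given as lists of lists."""
    n = len(a)
    return [[sum(a[i][m] * b[m][j] for m in range(n)) for j in range(n)]
            for i in range(n)]


def k_step_probabilities(adj, k):
    """Probability of moving from node i to node j in exactly k steps."""
    p = transition_matrix(adj)
    n = len(p)
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for _ in range(k):
        result = mat_mul(result, p)
    return result


# Toy path graph 0 - 1 - 2 (hypothetical data).
PATH = [[0, 1, 0],
        [1, 0, 1],
        [0, 1, 0]]
two_step = k_step_probabilities(PATH, 2)
```

Rows of `two_step` could then be combined with topological distances to form per-node numbers, but that final step depends on the descriptor definition and is left out here.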
Smith, Benjamin A; Padrick, Shae B; Doolittle, Lynda K; Daugherty-Clarke, Karen; Corrêa, Ivan R; Xu, Ming-Qun; Goode, Bruce L; Rosen, Michael K; Gelles, Jeff
2013-01-01
During cell locomotion and endocytosis, membrane-tethered WASP proteins stimulate actin filament nucleation by the Arp2/3 complex. This process generates highly branched arrays of filaments that grow toward the membrane to which they are tethered, a conflict that seemingly would restrict filament growth. Using three-color single-molecule imaging in vitro we revealed how the dynamic associations of Arp2/3 complex with mother filament and WASP are temporally coordinated with initiation of daughter filament growth. We found that WASP proteins dissociated from filament-bound Arp2/3 complex prior to new filament growth. Further, mutations that accelerated release of WASP from filament-bound Arp2/3 complex proportionally accelerated branch formation. These data suggest that while WASP promotes formation of pre-nucleation complexes, filament growth cannot occur until it is triggered by WASP release. This provides a mechanism by which membrane-bound WASP proteins can stimulate network growth without restraining it. DOI: http://dx.doi.org/10.7554/eLife.01008.001 PMID:24015360
Ouyang, Hui; Ali, Yousuf O.; Ravichandran, Mani; Dong, Aiping; Qiu, Wei; MacKenzie, Farrell; Dhe-Paganon, Sirano; Arrowsmith, Cheryl H.; Zhai, R. Grace
2012-01-01
The aggresome pathway is activated when proteasomal clearance of misfolded proteins is hindered. Misfolded polyubiquitinated protein aggregates are recruited and transported to the aggresome via the microtubule network by a protein complex consisting of histone deacetylase 6 (HDAC6) and the dynein motor complex. The current model suggests that HDAC6 recognizes protein aggregates by binding directly to polyubiquitinated proteins. Here, we show that there are substantial amounts of unanchored ubiquitin in protein aggregates with solvent-accessible C termini. The ubiquitin-binding domain (ZnF-UBP) of HDAC6 binds exclusively to the unanchored C-terminal diglycine motif of ubiquitin instead of conjugated polyubiquitin. The unanchored ubiquitin C termini in the aggregates are generated in situ by aggregate-associated deubiquitinase ataxin-3. These results provide structural and mechanistic bases for the role of HDAC6 in aggresome formation and further suggest a novel ubiquitin-mediated signaling pathway, where the exposure of ubiquitin C termini within protein aggregates enables HDAC6 recognition and transport to the aggresome. PMID:22069321
A new graph-based method for pairwise global network alignment
Klau, Gunnar W
2009-01-01
Background: In addition to component-based comparative approaches, network alignments provide the means to study conserved network topology such as common pathways and more complex network motifs. Yet, unlike in classical sequence alignment, the comparison of networks becomes computationally more challenging, as most meaningful assumptions instantly lead to NP-hard problems. Most previous algorithmic work on network alignments is heuristic in nature. Results: We introduce the graph-based maximum structural matching formulation for pairwise global network alignment. We relate the formulation to previous work and prove NP-hardness of the problem. Based on the new formulation we build upon recent results in computational structural biology and present a novel Lagrangian relaxation approach that, in combination with a branch-and-bound method, computes provably optimal network alignments. The Lagrangian algorithm alone is a powerful heuristic method, which produces solutions that are often near-optimal and – unlike those computed by pure heuristics – come with a quality guarantee. Conclusion: Computational experiments on the alignment of protein-protein interaction networks and on the classification of metabolic subnetworks demonstrate that the new method is reasonably fast and has advantages over pure heuristics. Our software tool is freely available as part of the LISA library. PMID:19208162
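To make the alignment objective more concrete, one simple way to score a candidate alignment is to count the edges of one network whose aligned endpoints are also connected in the other. The sketch below only evaluates a given node mapping under that assumption; it is not the Lagrangian relaxation or branch-and-bound procedure of the paper, and all names and data are hypothetical.

```python
def conserved_edges(edges_a, edges_b, alignment):
    """Count edges of network A whose aligned endpoints are adjacent in B."""
    # Undirected edges are stored as frozensets so (u, v) == (v, u).
    b_edges = {frozenset(edge) for edge in edges_b}
    count = 0
    for u, v in edges_a:
        if u in alignment and v in alignment:
            if frozenset((alignment[u], alignment[v])) in b_edges:
                count += 1
    return count


# Hypothetical toy networks: a triangle aligned onto a path.
EDGES_A = [("a1", "a2"), ("a2", "a3"), ("a1", "a3")]
EDGES_B = [("b1", "b2"), ("b2", "b3")]
ALIGNMENT = {"a1": "b1", "a2": "b2", "a3": "b3"}

score = conserved_edges(EDGES_A, EDGES_B, ALIGNMENT)
```

An exact method would search over all admissible mappings for the one that maximises such a score, which is where the NP-hardness mentioned in the abstract arises.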
A proactive role of water molecules in acceptor recognition by protein O-fucosyltransferase 2.
Valero-González, Jessika; Leonhard-Melief, Christina; Lira-Navarrete, Erandi; Jiménez-Osés, Gonzalo; Hernández-Ruiz, Cristina; Pallarés, María Carmen; Yruela, Inmaculada; Vasudevan, Deepika; Lostao, Anabel; Corzana, Francisco; Takeuchi, Hideyuki; Haltiwanger, Robert S; Hurtado-Guerrero, Ramon
2016-04-01
Protein O-fucosyltransferase 2 (POFUT2) is an essential enzyme that fucosylates serine and threonine residues of folded thrombospondin type 1 repeats (TSRs). To date, the mechanism by which this enzyme recognizes very dissimilar TSRs has been unclear. By engineering a fusion protein, we report the crystal structure of Caenorhabditis elegans POFUT2 (CePOFUT2) in complex with GDP and human TSR1 that suggests an inverting mechanism for fucose transfer assisted by a catalytic base and shows that nearly half of the TSR1 is embraced by CePOFUT2. A small number of direct interactions and a large network of water molecules maintain the complex. Site-directed mutagenesis demonstrates that POFUT2 fucosylates threonine preferentially over serine and relies on folded TSRs containing the minimal consensus sequence C-X-X-S/T-C. Crystallographic and mutagenesis data, together with atomic-level simulations, uncover a binding mechanism by which POFUT2 promiscuously recognizes the structural fingerprint of poorly homologous TSRs through a dynamic network of water-mediated interactions.
Kinetic models of gene expression including non-coding RNAs
NASA Astrophysics Data System (ADS)
Zhdanov, Vladimir P.
2011-03-01
In cells, genes are transcribed into mRNAs, and the latter are translated into proteins. Due to the feedbacks between these processes, the kinetics of gene expression may be complex even in the simplest genetic networks. The corresponding models have already been reviewed in the literature. A new avenue in this field is related to the recognition that the conventional scenario of gene expression is fully applicable only to prokaryotes whose genomes consist of tightly packed protein-coding sequences. In eukaryotic cells, in contrast, such sequences are relatively rare, and the rest of the genome includes numerous transcript units representing non-coding RNAs (ncRNAs). During the past decade, it has become clear that such RNAs play a crucial role in gene expression and accordingly influence a multitude of cellular processes both in the normal state and during diseases. The numerous biological functions of ncRNAs are based primarily on their abilities to silence genes via pairing with a target mRNA and subsequently preventing its translation or facilitating degradation of the mRNA-ncRNA complex. Many other abilities of ncRNAs have been discovered as well. Our review is focused on the available kinetic models describing the mRNA, ncRNA and protein interplay. In particular, we systematically present the simplest models without kinetic feedbacks, models containing feedbacks and predicting bistability and oscillations in simple genetic networks, and models describing the effect of ncRNAs on complex genetic networks. Mathematically, the presentation is based primarily on temporal mean-field kinetic equations. The stochastic and spatio-temporal effects are also briefly discussed.
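A minimal sketch of what a temporal mean-field model of this kind can look like is given below. The three rate equations (constant synthesis, first-order degradation, and a bilinear mRNA-ncRNA pairing term that removes both species) are a generic textbook-style assumption, not equations taken from the review, and every parameter value is hypothetical.

```python
def simulate(k_m=1.0, k_n=0.5, k_p=2.0, g_m=0.1, g_n=0.1, g_p=0.05,
             r=1.0, dt=0.01, steps=20000):
    """Integrate an assumed mRNA (m), ncRNA (n), protein (p) model with Euler steps.

    dm/dt = k_m - g_m*m - r*m*n   (mRNA synthesis, decay, pairing with ncRNA)
    dn/dt = k_n - g_n*n - r*m*n   (ncRNA synthesis, decay, pairing with mRNA)
    dp/dt = k_p*m - g_p*p         (translation and protein decay)
    """
    m = n = p = 0.0
    for _ in range(steps):
        pairing = r * m * n
        dm = k_m - g_m * m - pairing
        dn = k_n - g_n * n - pairing
        dp = k_p * m - g_p * p
        m, n, p = m + dt * dm, n + dt * dn, p + dt * dp
    return m, n, p


# Compare the long-time mRNA level without and with ncRNA pairing.
m_free, _, _ = simulate(r=0.0)
m_silenced, _, _ = simulate(r=1.0)
```

With these illustrative parameters the mRNA level settles near k_m/g_m when pairing is switched off and at a lower value when it is on, which is the silencing effect the abstract describes. Feedbacks, bistability and oscillations would need additional terms that are not modeled here.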
Vella, Danila; Zoppis, Italo; Mauri, Giancarlo; Mauri, Pierluigi; Di Silvestre, Dario
2017-12-01
The reductionist approach of dissecting biological systems into their constituents was successful in the early stages of molecular biology in elucidating the chemical basis of several biological processes. This knowledge helped biologists to understand the complexity of biological systems, showing that most biological functions do not arise from individual molecules and that the emergent properties of biological systems cannot be explained or predicted by investigating individual molecules without taking their relations into consideration. Thanks to improvements in current -omics technologies and the increasing understanding of molecular relationships, ever more studies are evaluating biological systems through approaches based on graph theory. Genomic and proteomic data are often combined with protein-protein interaction (PPI) networks, whose structure is routinely analyzed by algorithms and tools to characterize hubs/bottlenecks and topological, functional, and disease modules. On the other hand, co-expression networks represent a complementary procedure that gives the opportunity to evaluate data at the system level, including for organisms that lack information on PPIs. Based on these premises, we introduce the reader to PPI and co-expression networks, including aspects of reconstruction and analysis. In particular, the new idea of evaluating large-scale proteomic data by means of co-expression networks will be discussed, with some examples of application. Their use to infer biological knowledge will be shown, and special attention will be devoted to topological and module analysis.
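One common way to build a co-expression network, sketched below, is to connect two proteins whenever the correlation of their abundance profiles across samples exceeds a cut-off. The abstract does not specify which similarity measure or threshold the authors use, so the Pearson correlation, the 0.9 cut-off, and the toy profiles are all assumptions.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length profiles (must not be constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


def coexpression_edges(profiles, threshold=0.9):
    """Return protein pairs whose absolute correlation reaches the threshold."""
    names = sorted(profiles)
    edges = []
    for i, first in enumerate(names):
        for second in names[i + 1:]:
            if abs(pearson(profiles[first], profiles[second])) >= threshold:
                edges.append((first, second))
    return edges


# Hypothetical abundance profiles over four samples.
PROFILES = {
    "P1": [1.0, 2.0, 3.0, 4.0],
    "P2": [2.0, 4.1, 5.9, 8.0],
    "P3": [3.0, 1.0, 4.0, 2.0],
}
edges = coexpression_edges(PROFILES)
```

The resulting edge list can be analysed with the same hub and module tools used for PPI networks, which is why the approach also works for organisms without PPI data.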
Integration of biological networks and gene expression data using Cytoscape
Cline, Melissa S; Smoot, Michael; Cerami, Ethan; Kuchinsky, Allan; Landys, Nerius; Workman, Chris; Christmas, Rowan; Avila-Campilo, Iliana; Creech, Michael; Gross, Benjamin; Hanspers, Kristina; Isserlin, Ruth; Kelley, Ryan; Killcoyne, Sarah; Lotia, Samad; Maere, Steven; Morris, John; Ono, Keiichiro; Pavlovic, Vuk; Pico, Alexander R; Vailaya, Aditya; Wang, Peng-Liang; Adler, Annette; Conklin, Bruce R; Hood, Leroy; Kuiper, Martin; Sander, Chris; Schmulevich, Ilya; Schwikowski, Benno; Warner, Guy J; Ideker, Trey; Bader, Gary D
2013-01-01
Cytoscape is a free software package for visualizing, modeling and analyzing molecular and genetic interaction networks. This protocol explains how to use Cytoscape to analyze the results of mRNA expression profiling, and other functional genomics and proteomics experiments, in the context of an interaction network obtained for genes of interest. Five major steps are described: (i) obtaining a gene or protein network, (ii) displaying the network using layout algorithms, (iii) integrating with gene expression and other functional attributes, (iv) identifying putative complexes and functional modules and (v) identifying enriched Gene Ontology annotations in the network. These steps provide a broad sample of the types of analyses performed by Cytoscape. PMID:17947979
Chimeric Protein Complexes in Hybrid Species Generate Novel Phenotypes
Piatkowska, Elzbieta M.; Naseeb, Samina; Knight, David; Delneri, Daniela
2013-01-01
Hybridization between species is an important mechanism for the origin of novel lineages and adaptation to new environments. Increased allelic variation and modification of the transcriptional network are the two recognized forces currently deemed to be responsible for the phenotypic properties seen in hybrids. However, since the majority of the biological functions in a cell are carried out by protein complexes, inter-specific protein assemblies represent another important source of natural variation upon which evolutionary forces can act. Here we studied the composition of six protein complexes in two different Saccharomyces “sensu stricto” hybrids, to understand whether chimeric interactions can be freely formed in the cell in spite of species-specific co-evolutionary forces, and whether the different types of complexes cause a change in hybrid fitness. The protein assemblies were isolated from the hybrids via affinity chromatography and identified via mass spectrometry. We found evidence of spontaneous chimericity for four of the six protein assemblies tested and we showed that different types of complexes can cause a variety of phenotypes in selected environments. In the case of the TRP2/TRP3 complex, chimeric formation resulted in a fitness advantage for the hybrid in an environment lacking tryptophan, while only one type of parental combination of the MBF complex allowed the hybrid to grow under respiratory conditions. These phenotypes were dependent on both genetic and environmental backgrounds. This study provides empirical evidence that chimeric protein complexes can freely assemble in cells and reveals a new mechanism to generate phenotypic novelty and plasticity in hybrids to complement the genomic innovation resulting from gene duplication. The ability to exchange orthologous members also has important implications for the adaptation and subsequent genome evolution of the hybrids in terms of pattern of gene loss. PMID:24137105
Perederina, Anna; Nevskaya, Natalia; Nikonov, Oleg; Nikulin, Alexei; Dumas, Philippe; Yao, Min; Tanaka, Isao; Garber, Maria; Gongadze, George; Nikonov, Stanislav
2002-12-01
The crystal structure of ribosomal protein L5 from Thermus thermophilus complexed with a 34-nt fragment comprising helix III and loop C of Escherichia coli 5S rRNA has been determined at 2.5 Å resolution. The protein specifically interacts with the bulged nucleotides at the top of loop C of 5S rRNA. The rRNA and protein contact surfaces are strongly stabilized by intramolecular interactions. Charged and polar atoms forming the network of conserved intermolecular hydrogen bonds are located in two narrow planar parallel layers belonging to the protein and rRNA, respectively. The regions, including these atoms conserved in Bacteria and Archaea, can be considered an RNA-protein recognition module. Comparison of the T. thermophilus L5 structure in the RNA-bound form with the isolated Bacillus stearothermophilus L5 structure shows that the RNA-recognition module on the protein surface does not undergo significant changes upon RNA binding. In the crystal of the complex, the protein interacts with another RNA molecule in the asymmetric unit through the beta-sheet concave surface. This protein/RNA interface simulates the interaction of L5 with 23S rRNA observed in the Haloarcula marismortui 50S ribosomal subunit.
Perederina, Anna; Nevskaya, Natalia; Nikonov, Oleg; Nikulin, Alexei; Dumas, Philippe; Yao, Min; Tanaka, Isao; Garber, Maria; Gongadze, George; Nikonov, Stanislav
2002-01-01
The crystal structure of ribosomal protein L5 from Thermus thermophilus complexed with a 34-nt fragment comprising helix III and loop C of Escherichia coli 5S rRNA has been determined at 2.5 Å resolution. The protein specifically interacts with the bulged nucleotides at the top of loop C of 5S rRNA. The rRNA and protein contact surfaces are strongly stabilized by intramolecular interactions. Charged and polar atoms forming the network of conserved intermolecular hydrogen bonds are located in two narrow planar parallel layers belonging to the protein and rRNA, respectively. The regions, including these atoms conserved in Bacteria and Archaea, can be considered an RNA-protein recognition module. Comparison of the T. thermophilus L5 structure in the RNA-bound form with the isolated Bacillus stearothermophilus L5 structure shows that the RNA-recognition module on the protein surface does not undergo significant changes upon RNA binding. In the crystal of the complex, the protein interacts with another RNA molecule in the asymmetric unit through the beta-sheet concave surface. This protein/RNA interface simulates the interaction of L5 with 23S rRNA observed in the Haloarcula marismortui 50S ribosomal subunit. PMID:12515387
MPact: the MIPS protein interaction resource on yeast.
Güldener, Ulrich; Münsterkötter, Martin; Oesterheld, Matthias; Pagel, Philipp; Ruepp, Andreas; Mewes, Hans-Werner; Stümpflen, Volker
2006-01-01
In recent years, the Munich Information Center for Protein Sequences (MIPS) yeast protein-protein interaction (PPI) dataset has been used in numerous analyses of protein networks and has been called a gold standard because of its quality and comprehensiveness [H. Yu, N. M. Luscombe, H. X. Lu, X. Zhu, Y. Xia, J. D. Han, N. Bertin, S. Chung, M. Vidal and M. Gerstein (2004) Genome Res., 14, 1107-1118]. MPact and the yeast protein localization catalog provide information related to the proximity of proteins in yeast. Besides the integration of high-throughput data, information about experimental evidence for PPIs in the literature was compiled by experts, adding up to 4300 distinct PPIs connecting 1500 proteins in yeast. As the interaction data is a complementary part of CYGD, interactive mapping of data on other integrated data types such as the functional classification catalog [A. Ruepp, A. Zollner, D. Maier, K. Albermann, J. Hani, M. Mokrejs, I. Tetko, U. Güldener, G. Mannhaupt, M. Münsterkötter and H. W. Mewes (2004) Nucleic Acids Res., 32, 5539-5545] is possible. A survey of signaling proteins and comparison with pathway data from KEGG demonstrates that, based on these manually annotated data alone, an extensive overview of the complexity of this functional network can be obtained in yeast. The implementation of a web-based PPI-analysis tool allows analysis and visualization of protein interaction networks and facilitates integration of our curated data with high-throughput datasets. The complete dataset as well as user-defined sub-networks can be retrieved easily in the standardized PSI-MI format. The resource can be accessed through http://mips.gsf.de/genre/proj/mpact.
NASA Astrophysics Data System (ADS)
Samanta, Sudipta; Mukherjee, Sanchita
2017-10-01
Activation of the p53 protein protects the organism from propagation of cells with damaged DNA having oncogenic mutations. In normal cells, activity of p53 is controlled by interaction with MDM2. The well understood p53-MDM2 interaction facilitates design of ligands that could potentially disrupt or prevent the complexation, which has emerged as an important objective for cancer therapy. However, thermodynamic quantification of the p53-peptide induced structural changes of the MDM2 protein remains an area to be explored. This study attempts to understand the conformational free energy and entropy costs due to this complex formation from the histograms of dihedral angles generated from molecular dynamics simulations. Residue-specific quantification illustrates that hydrophobic residues of the protein contribute most to the conformational thermodynamic changes. Thermodynamic quantification of structural changes of the protein reveals that p53 binding provides a source of inter-element cooperativity among the protein secondary structural elements, where the most affected structural elements (α2 and α4), found at the binding site of the protein, affect faraway structural elements (β1 and Loop1) of the protein. The communication perhaps involves water mediated hydrogen bonded network formation. Further, we infer that in the inhibitory F19A mutation of p53, though Phe19 is important in the recognition process, it makes a less prominent contribution to the stability of the complex. Collectively, this study provides vivid microscopic understanding of the interaction within the protein complex along with exploring mutation sites, which will contribute further to engineering the protein function and binding affinity.
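The histogram-based estimate mentioned above can be sketched as follows, assuming the usual Gibbs form S = -k_B Σ p_i ln p_i applied to a normalised dihedral-angle histogram. The abstract does not give the authors' exact expressions, so this form, the function name, and the toy bin counts are assumptions for illustration only.

```python
from math import log

# Molar Boltzmann (gas) constant in kcal/(mol*K).
K_B = 0.0019872041


def histogram_entropy(counts):
    """Entropy of a dihedral-angle histogram, S = -k_B * sum(p * ln p)."""
    total = sum(counts)
    # Empty bins contribute nothing, since p * ln p -> 0 as p -> 0.
    return -K_B * sum((c / total) * log(c / total) for c in counts if c > 0)


# Hypothetical histograms of one dihedral angle over four bins.
FREE = [25, 25, 25, 25]   # angle samples all bins equally in the free protein
BOUND = [100, 0, 0, 0]    # angle is locked into one bin in the complex

# Entropy change on binding for this single dihedral (negative = ordering).
delta_s = histogram_entropy(BOUND) - histogram_entropy(FREE)
```

Summing such per-dihedral changes over the residues of a secondary structural element would give a residue-resolved picture of where binding costs or releases conformational entropy, under the additional assumption that dihedrals are treated as independent.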
Text Mining for Protein Docking
Badal, Varsha D.; Kundrotas, Petras J.; Vakser, Ilya A.
2015-01-01
The rapidly growing amount of publicly available information from biomedical research is readily accessible on the Internet, providing a powerful resource for predictive biomolecular modeling. The accumulated data on experimentally determined structures transformed structure prediction of proteins and protein complexes. Instead of exploring the enormous search space, predictive tools can simply proceed to the solution based on similarity to the existing, previously determined structures. A similar major paradigm shift is emerging due to the rapidly expanding amount of information, other than experimentally determined structures, which still can be used as constraints in biomolecular structure prediction. Automated text mining has been widely used in recreating protein interaction networks, as well as in detecting small ligand binding sites on protein structures. Combining and expanding these two well-developed areas of research, we applied text mining to structural modeling of protein-protein complexes (protein docking). Protein docking can be significantly improved when constraints on the docking mode are available. We developed a procedure that retrieves published abstracts on a specific protein-protein interaction and extracts information relevant to docking. The procedure was assessed on protein complexes from Dockground (http://dockground.compbio.ku.edu). The results show that correct information on binding residues can be extracted for about half of the complexes. The amount of irrelevant information was reduced by conceptual analysis of a subset of the retrieved abstracts, based on the bag-of-words (features) approach. Support Vector Machine models were trained and validated on the subset. The remaining abstracts were filtered by the best-performing models, which decreased the irrelevant information for ~25% of the complexes in the dataset.
The extracted constraints were incorporated in the docking protocol and tested on the Dockground unbound benchmark set, significantly increasing the docking success rate. PMID:26650466
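The bag-of-words representation used for filtering abstracts can be illustrated with the short sketch below, which maps a text onto word counts over a fixed vocabulary. The vocabulary, tokenisation rule, and example sentence are hypothetical, and the Support Vector Machine training step that would consume such vectors is not shown.

```python
import re
from collections import Counter


def bag_of_words(text, vocabulary):
    """Count how often each vocabulary word occurs in the text."""
    # Assumed tokenisation: lowercase runs of letters and digits.
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    counts = Counter(tokens)
    return [counts[word] for word in vocabulary]


# Hypothetical vocabulary of docking-relevant and irrelevant terms.
VOCAB = ["binding", "residue", "interface", "expression"]

vector = bag_of_words(
    "Residue Arg45 at the binding interface; binding is lost.", VOCAB
)
```

Each abstract becomes one such feature vector, and a classifier trained on labeled vectors can then decide whether a new abstract is likely to contain docking-relevant information.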
Bhattacharyya, Moitrayee; Vishveshwara, Saraswathi
2011-07-01
In this article, we present a novel application of a quantum clustering (QC) technique to objectively cluster the conformations, sampled by molecular dynamics simulations performed on different ligand bound structures of the protein. We further portray each conformational population in terms of dynamically stable network parameters which beautifully capture the ligand induced variations in the ensemble in atomistic detail. The conformational populations thus identified by the QC method and verified by network parameters are evaluated for different ligand bound states of the protein pyrrolysyl-tRNA synthetase (DhPylRS) from D. hafniense. The ligand/environment induced re-distribution of protein conformational ensembles forms the basis for understanding several important biological phenomena such as allostery and enzyme catalysis. The atomistic level characterization of each population in the conformational ensemble in terms of the re-orchestrated networks of amino acids is a challenging problem, especially when the changes are minimal at the backbone level. Here we demonstrate that the QC method is sensitive to such subtle changes and is able to cluster MD snapshots which are similar at the side-chain interaction level. Although we have applied these methods on simulation trajectories of a modest time scale (20 ns each), we emphasize that our methodology provides a general approach towards an objective clustering of large-scale MD simulation data and may be applied to probe multistate equilibria at higher time scales, and to problems related to protein folding for any protein or protein-protein/RNA/DNA complex of interest with a known structure.
Targeted interactomics reveals a complex core cell cycle machinery in Arabidopsis thaliana
Van Leene, Jelle; Hollunder, Jens; Eeckhout, Dominique; Persiau, Geert; Van De Slijke, Eveline; Stals, Hilde; Van Isterdael, Gert; Verkest, Aurine; Neirynck, Sandy; Buffel, Yelle; De Bodt, Stefanie; Maere, Steven; Laukens, Kris; Pharazyn, Anne; Ferreira, Paulo C G; Eloy, Nubia; Renne, Charlotte; Meyer, Christian; Faure, Jean-Denis; Steinbrenner, Jens; Beynon, Jim; Larkin, John C; Van de Peer, Yves; Hilson, Pierre; Kuiper, Martin; De Veylder, Lieven; Van Onckelen, Harry; Inzé, Dirk; Witters, Erwin; De Jaeger, Geert
2010-01-01
Cell proliferation is the main driving force for plant growth. Although genome sequence analysis revealed a high number of cell cycle genes in plants, little is known about the molecular complexes steering cell division. In a targeted proteomics approach, we mapped the core complex machinery at the heart of the Arabidopsis thaliana cell cycle control. Besides a central regulatory network of core complexes, we distinguished a peripheral network that links the core machinery to up- and downstream pathways. Over 100 new candidate cell cycle proteins were predicted and an in-depth biological interpretation demonstrated the hypothesis-generating power of the interaction data. The data set provided a comprehensive view on heterodimeric cyclin-dependent kinase (CDK)–cyclin complexes in plants. For the first time, inhibitory proteins of plant-specific B-type CDKs were discovered and the anaphase-promoting complex was characterized and extended. Important conclusions were that mitotic A- and B-type cyclins form complexes with the plant-specific B-type CDKs and not with CDKA;1, and that D-type cyclins and S-phase-specific A-type cyclins seem to be associated exclusively with CDKA;1. Furthermore, we could show that plants have evolved a combinatorial toolkit consisting of at least 92 different CDK–cyclin complex variants, which strongly underscores the functional diversification among the large family of cyclins and reflects the pivotal role of cell cycle regulation in the developmental plasticity of plants. PMID:20706207
The GARP Complex Is Involved in Intracellular Cholesterol Transport via Targeting NPC2 to Lysosomes.
Wei, Jian; Zhang, Ying-Yu; Luo, Jie; Wang, Ju-Qiong; Zhou, Yu-Xia; Miao, Hong-Hua; Shi, Xiong-Jie; Qu, Yu-Xiu; Xu, Jie; Li, Bo-Liang; Song, Bao-Liang
2017-06-27
Proper intracellular cholesterol trafficking is critical for cellular function. Two lysosome-resident proteins, NPC1 and NPC2, mediate the egress of low-density lipoprotein-derived cholesterol from lysosomes. However, other proteins involved in this process remain largely unknown. Through amphotericin B-based selection, we isolated two cholesterol transport-defective cell lines. Subsequent whole-transcriptome-sequencing analysis revealed two cell lines bearing the same mutation in the vacuolar protein sorting 53 (Vps53) gene. Depletion of VPS53 or other subunits of the Golgi-associated retrograde protein (GARP) complex impaired NPC2 sorting to lysosomes and caused cholesterol accumulation. GARP deficiency blocked the retrieval of the cation-independent mannose 6-phosphate receptor (CI-MPR) to the trans-Golgi network. Further, Vps54 mutant mice displayed reduced cellular NPC2 protein levels and increased cholesterol accumulation, underscoring the physiological role of the GARP complex in cholesterol transport. We conclude that the GARP complex contributes to intracellular cholesterol transport by targeting NPC2 to lysosomes in a CI-MPR-dependent manner.
Verkhivker, G M
2016-10-20
Protein kinases are central to proper functioning of cellular networks and are an integral part of many signal transduction pathways. The family of protein kinases represents by far the largest and most important class of therapeutic targets in oncology. Dimerization-induced activation has emerged as a common mechanism of allosteric regulation in BRAF kinases, which play an important role in growth factor signalling and human diseases. Recent studies have revealed that most of the BRAF inhibitors can induce dimerization and paradoxically stimulate enzyme transactivation by conferring an active conformation in the second monomer of the kinase dimer. The emerging connections between inhibitor binding and BRAF kinase domain dimerization have suggested a molecular basis of the activation mechanism in which BRAF inhibitors may allosterically modulate the stability of the dimerization interface and affect the organization of residue interaction networks in BRAF kinase dimers. In this work, we integrated structural bioinformatics analysis, molecular dynamics and binding free energy simulations with the protein structure network analysis of the BRAF crystal structures to determine dynamic signatures of BRAF conformations in complexes with different types of inhibitors and probe the mechanisms of the inhibitor-induced dimerization and paradoxical activation. The results of this study highlight previously unexplored relationships between types of BRAF inhibitors, inhibitor-induced changes in the residue interaction networks and allosteric modulation of the kinase activity. This study suggests a mechanism by which BRAF inhibitors could promote or interfere with the paradoxical activation of BRAF kinases, which may be useful in informing discovery efforts to minimize the unanticipated adverse biological consequences of these therapeutic agents.
Proteome complexity and the forces that drive proteome imbalance.
Harper, J Wade; Bennett, Eric J
2016-09-15
The cellular proteome is a complex microcosm of structural and regulatory networks that requires continuous surveillance and modification to meet the dynamic needs of the cell. It is therefore crucial that the protein flux of the cell remains in balance to ensure proper cell function. Genetic alterations that range from chromosome imbalance to oncogene activation can affect the speed, fidelity and capacity of protein biogenesis and degradation systems, which often results in proteome imbalance. An improved understanding of the causes and consequences of proteome imbalance is helping to reveal how these systems can be targeted to treat diseases such as cancer.
Csermely, Peter; Korcsmáros, Tamás; Kiss, Huba J.M.; London, Gábor; Nussinov, Ruth
2013-01-01
Despite considerable progress in genome- and proteome-based high-throughput screening methods and in rational drug design, the increase in approved drugs in the past decade did not match the increase of drug development costs. Network description and analysis not only gives a systems-level understanding of drug action and disease complexity, but can also help to improve the efficiency of drug design. We give a comprehensive assessment of the analytical tools of network topology and dynamics. The state-of-the-art use of chemical similarity, protein structure, protein-protein interaction, signaling, genetic interaction and metabolic networks in the discovery of drug targets is summarized. We propose that network targeting follows two basic strategies. The “central hit strategy” selectively targets central node/edges of the flexible networks of infectious agents or cancer cells to kill them. The “network influence strategy” works against other diseases, where an efficient reconfiguration of rigid networks needs to be achieved. It is shown how network techniques can help in the identification of single-target, edgetic, multi-target and allo-network drug target candidates. We review the recent boom in network methods helping hit identification, lead selection optimizing drug efficacy, as well as minimizing side-effects and drug toxicity. Successful network-based drug development strategies are shown through the examples of infections, cancer, metabolic diseases, neurodegenerative diseases and aging. Summarizing >1200 references we suggest an optimized protocol of network-aided drug development, and provide a list of systems-level hallmarks of drug quality. Finally, we highlight network-related drug development trends helping to achieve these hallmarks by a cohesive, global approach. PMID:23384594
Heix, J; Zomerdijk, J C; Ravanpay, A; Tjian, R; Grummt, I
1997-03-04
Promoter selectivity for all three classes of eukaryotic RNA polymerases is brought about by multimeric protein complexes containing TATA box binding protein (TBP) and specific TBP-associated factors (TAFs). Unlike class II- and III-specific TBP-TAF complexes, the corresponding murine and human class I-specific transcription initiation factor TIF-IB/SL1 exhibits a pronounced selectivity for its homologous promoter. As a first step toward understanding the molecular basis of species-specific promoter recognition, we cloned the cDNAs encoding the three mouse pol I-specific TBP-associated factors (TAFIs) and compared the amino acid sequences of the murine TAFIs with their human counterparts. The four subunits from either species can form stable chimeric complexes that contain stoichiometric amounts of TBP and TAFIs, demonstrating that differences in the primary structure of human and mouse TAFIs do not dramatically alter the network of protein-protein contacts responsible for assembly of the multimeric complex. Thus, primate vs. rodent promoter selectivity mediated by the TBP-TAFI complex is likely to be the result of cumulative subtle differences between individual subunits that lead to species-specific properties of RNA polymerase I transcription.
Roles of NHERF Family of PDZ-Binding Proteins in Regulating GPCR Functions.
Broadbent, David; Ahmadzai, Mohammad M; Kammala, Ananth K; Yang, Canchai; Occhiuto, Christopher; Das, Rupali; Subramanian, Hariharan
2017-01-01
Multicellular organisms are equipped with an array of G-protein-coupled receptors (GPCRs) that mediate cell-cell signaling allowing them to adapt to environmental cues and ultimately survive. This is mechanistically possible through complex intracellular GPCR machinery that encompasses a vast network of proteins. Within this network, there is a group called scaffolding proteins that facilitate proper localization of signaling proteins for a quick and robust GPCR response. One protein family within this scaffolding group is the PSD-95/Dlg/ZO-1 (PDZ) family which is important for GPCR localization, internalization, recycling, and downstream signaling. Although the PDZ family of proteins regulates the functions of several receptors, this chapter focuses on a subfamily within the PDZ protein family called the Na+/H+ exchanger regulatory factors (NHERFs). Here we extensively review the predominantly characterized roles of NHERFs in renal phosphate absorption, intestinal ion regulation, cancer progression, and immune cell functions. Finally, we discuss the future perspectives and possible clinical application of targeting NHERFs in several disorders. © 2017 Elsevier Inc. All rights reserved.
Molecular Interaction Map of the Mammalian Cell Cycle Control and DNA Repair Systems
Kohn, Kurt W.
1999-01-01
To eventually understand the integrated function of the cell cycle regulatory network, we must organize the known interactions in the form of a diagram, map, and/or database. A diagram convention was designed capable of unambiguous representation of networks containing multiprotein complexes, protein modifications, and enzymes that are substrates of other enzymes. To facilitate linkage to a database, each molecular species is symbolically represented only once in each diagram. Molecular species can be located on the map by means of indexed grid coordinates. Each interaction is referenced to an annotation list where pertinent information and references can be found. Parts of the network are grouped into functional subsystems. The map shows how multiprotein complexes could assemble and function at gene promoter sites and at sites of DNA damage. It also portrays the richness of connections between the p53-Mdm2 subsystem and other parts of the network. PMID:10436023
Xie, Wei; Burke, Brian
2017-07-04
Nuclear lamins are intermediate filament proteins that represent important structural components of metazoan nuclear envelopes (NEs). By combining proteomics and superresolution microscopy, we recently reported that both A- and B-type nuclear lamins form spatially distinct filament networks at the nuclear periphery of mouse fibroblasts. In particular, A-type lamins exhibit differential association with nuclear pore complexes (NPCs). Our studies reveal that the nuclear lamina network in mammalian somatic cells is less ordered and more complex than that of amphibian oocytes, the only other system in which the lamina has been visualized at high resolution. In addition, the NPC component Tpr likely links NPCs to the A-type lamin network, an association that appears to be regulated by C-terminal modification of various A-type lamin isoforms. Many questions remain, however, concerning the structure and assembly of lamin filaments, as well as their mode of association with other nuclear components such as peripheral chromatin.
Sigala, Paul A.; Fafarman, Aaron T.; Schwans, Jason P.; Fried, Stephen D.; Fenn, Timothy D.; Caaveiro, Jose M. M.; Pybus, Brandon; Ringe, Dagmar; Petsko, Gregory A.; Boxer, Steven G.; Herschlag, Daniel
2013-01-01
Hydrogen bond networks are key elements of protein structure and function but have been challenging to study within the complex protein environment. We have carried out in-depth interrogations of the proton transfer equilibrium within a hydrogen bond network formed to bound phenols in the active site of ketosteroid isomerase. We systematically varied the proton affinity of the phenol using differing electron-withdrawing substituents and incorporated site-specific NMR and IR probes to quantitatively map the proton and charge rearrangements within the network that accompany incremental increases in phenol proton affinity. The observed ionization changes were accurately described by a simple equilibrium proton transfer model that strongly suggests the intrinsic proton affinity of one of the Tyr residues in the network, Tyr16, does not remain constant but rather systematically increases due to weakening of the phenol–Tyr16 anion hydrogen bond with increasing phenol proton affinity. Using vibrational Stark spectroscopy, we quantified the electrostatic field changes within the surrounding active site that accompany these rearrangements within the network. We were able to model these changes accurately using continuum electrostatic calculations, suggesting a high degree of conformational restriction within the protein matrix. Our study affords direct insight into the physical and energetic properties of a hydrogen bond network within a protein interior and provides an example of a highly controlled system with minimal conformational rearrangements in which the observed physical changes can be accurately modeled by theoretical calculations. PMID:23798390
Wang, Quan; Jia, Peilin; Cuenco, Karen T.; Feingold, Eleanor; Marazita, Mary L.; Wang, Lily; Zhao, Zhongming
2013-01-01
Over the past decade, genetic studies have suggested numerous susceptibility genes for dental caries, with few definite conclusions. The rapid accumulation of relevant information, along with the complex architecture of the disease, provides a challenging but also unique opportunity to review and integrate the heterogeneous data for follow-up validation and exploration. In this study, we collected and curated candidate genes from four major categories: association studies, linkage scans, gene expression analyses, and literature mining. Candidate genes were prioritized according to the magnitude of evidence related to dental caries. We then searched for dense modules enriched with the prioritized candidate genes through their protein-protein interactions (PPIs). We identified 23 modules comprising 53 genes. Functional analyses of these 53 genes revealed three major clusters: cytokine network relevant genes, the matrix metalloproteinase (MMP) family, and the transforming growth factor-beta (TGF-β) family, all of which have previously been implicated in tooth development and carious lesions. Through our extensive data collection and an integrative application of gene prioritization and PPI network analyses, we built a dental caries-specific sub-network for the first time. Our study provided insights into the molecular mechanisms underlying dental caries. The framework we proposed in this work can be applied to other complex diseases. PMID:24146904
Prior knowledge based mining functional modules from Yeast PPI networks with gene ontology
2010-01-01
Background: The literature offers many productive algorithmic approaches for identifying functional modules in protein-protein interaction (PPI) networks. Because large-scale interaction data are accumulating for multiple organisms, and because interaction data are missing from the existing PPI databases, there is still an urgent need for novel computational techniques that can analyze interaction data sets correctly and scalably. Indeed, a number of large-scale biological data sets provide indirect evidence for protein-protein interaction relationships. Results: The main aim of this paper is to present a prior knowledge based mining strategy to identify functional modules from PPI networks with the aid of Gene Ontology. A higher similarity value in Gene Ontology means that two gene products are more functionally related to each other, so it is better to group such gene products into one functional module. We study two approaches: (i) encoding the functional pairs into the existing PPI networks; and (ii) using these functional pairs as pairwise constraints to supervise the existing functional module identification algorithms. A topology-based modularity metric and complex annotation in MIPS are used to evaluate the functional modules identified by these two approaches. Conclusions: The experimental results on yeast PPI networks and GO show that the prior knowledge based learning methods perform better than the existing algorithms. PMID:21172053
Uemura, Tomohiro; Kim, Hyeran; Saito, Chieko; Ebine, Kazuo; Ueda, Takashi; Schulze-Lefert, Paul; Nakano, Akihiko
2012-01-01
In all eukaryotic cells, a membrane-trafficking system connects the post-Golgi organelles, such as the trans-Golgi network (TGN), endosomes, vacuoles, and the plasma membrane. This complex network plays critical roles in several higher-order functions in multicellular organisms. The TGN, one of the important organelles for protein transport in the post-Golgi network, functions as a sorting station, where cargo proteins are directed to the appropriate post-Golgi compartments. Unlike its roles in animal and yeast cells, the TGN has also been reported to function like early endosomal compartments in plant cells. However, the physiological roles of the TGN functions in plants are not understood. Here, we report a study of the SYP4 group (SYP41, SYP42, and SYP43), which represents the plant orthologs of the Tlg2/syntaxin16 Qa-SNARE (soluble N-ethylmaleimide sensitive factor attachment protein receptor) that localizes on the TGN in yeast and animal cells. The SYP4 group regulates the secretory and vacuolar transport pathways in the post-Golgi network and maintains the morphology of the Golgi apparatus and TGN. Consistent with a secretory role, SYP4 proteins are required for extracellular resistance responses to a fungal pathogen. We also reveal a plant cell-specific higher-order role of the SYP4 group in the protection of chloroplasts from salicylic acid-dependent biotic stress. PMID:22307646
Recent coselection in human populations revealed by protein-protein interaction network.
Qian, Wei; Zhou, Hang; Tang, Kun
2014-12-21
Genome-wide scans for signals of natural selection in human populations have identified a large number of candidate loci that underlie local adaptations. This is surprising given the relatively short evolutionary time since the divergence of the human population. One hypothesis that has not been formally examined is whether and how recent human evolution may have been shaped by coselection in the context of the complex molecular interactome. In this study, genome-wide signals of selection were scanned in East Asians, Europeans, and Africans using 1000 Genomes data, and subsequently mapped onto the protein-protein interaction (PPI) network. We found that the candidate genes of recent positive selection localized significantly closer to each other on the PPI network than expected, revealing substantial clustering of selected genes. Furthermore, gene pairs with shorter PPI network distances showed higher similarities of their recent evolutionary paths than those further apart. Last, subnetworks enriched with recent coselection signals were identified, which are substantially overrepresented in biological pathways related to signal transduction, neurogenesis, and immune function. These results provide the first genome-wide evidence for association of recent selection signals with the PPI network, shedding light on the potential mechanisms of recent coselection in the human genome. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
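The test of whether selected genes sit closer together on the PPI network than expected can be illustrated with a small sketch. This is a hedged illustration only, not the authors' pipeline: it assumes a connected, undirected toy network stored as an adjacency dictionary, uses the mean pairwise shortest-path distance as the statistic, and draws random gene sets of equal size as the null model. The gene names, the function names (`bfs_distances`, `mean_pairwise_distance`, `clustering_p_value`) and the null model are all assumptions made for illustration.

```python
import random
from collections import deque
from itertools import combinations


def bfs_distances(graph, source):
    """Shortest-path (hop) distances from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbour in graph[node]:
            if neighbour not in dist:
                dist[neighbour] = dist[node] + 1
                queue.append(neighbour)
    return dist


def mean_pairwise_distance(graph, genes):
    """Mean shortest-path distance over all pairs in genes (graph assumed connected)."""
    genes = list(genes)
    dist_from = {g: bfs_distances(graph, g) for g in genes}
    dists = [dist_from[a][b] for a, b in combinations(genes, 2)]
    return sum(dists) / len(dists)


def clustering_p_value(graph, selected, n_perm=1000, seed=0):
    """Empirical p-value: how often a random gene set of the same size
    is at least as tightly clustered as the selected set."""
    rng = random.Random(seed)
    observed = mean_pairwise_distance(graph, selected)
    nodes = sorted(graph)
    hits = 0
    for _ in range(n_perm):
        sample = rng.sample(nodes, len(selected))
        if mean_pairwise_distance(graph, sample) <= observed:
            hits += 1
    return observed, (hits + 1) / (n_perm + 1)


# Hypothetical toy network: a chain g0 - g1 - ... - g9.
toy_graph = {f"g{i}": set() for i in range(10)}
for i in range(9):
    toy_graph[f"g{i}"].add(f"g{i + 1}")
    toy_graph[f"g{i + 1}"].add(f"g{i}")

observed, p_value = clustering_p_value(toy_graph, ["g0", "g1", "g2"])
```

A real analysis would likely need a null model that controls for node degree and gene length, which random sampling of nodes does not do.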
Feijão, Tália; Afonso, Olga; Maia, André F; Sunkel, Claudio E
2013-10-01
Kinetochores bind spindle microtubules and also act as signaling centers that monitor this interaction. Defects in kinetochore assembly lead to chromosome missegregation and aneuploidy. The interaction between microtubules and chromosomes involves a conserved super-complex of proteins, known as the KNL1-Mis12-Ndc80 (KMN) network, composed of the KNL1 (Spc105), Mis12, and Ndc80 complexes. Previous studies indicate that all components of the network are required for kinetochore-microtubule attachment and all play relevant functions in chromosome congression, biorientation, and segregation. Here, we report a comparative study addressing the role of the different KMN components using dsRNA and in vivo fluorescence microscopy in Drosophila S2 cells, which allows us to suggest that different KMN network components might perform different roles in chromosome segregation and mitotic checkpoint signaling. Depletion of different components results in mostly lateral kinetochore-microtubule attachments that are relatively stable on depletion of Mis12 or Ndc80 but very unstable after Spc105 depletion. In vivo analysis on depletion of Mis12, Ndc80, and to some extent Spc105, shows that lateral kinetochore-microtubule interactions are still functional, allowing poleward kinetochore movement. We also find that different KMN network components differentially affect the localization of spindle assembly checkpoint (SAC) proteins at kinetochores. Depletion of Ndc80 and Spc105 abolishes the mitotic checkpoint, whereas depletion of Mis12 causes a delay in mitotic progression. Taken together, our results suggest that Mis12 and Ndc80 complexes help to properly orient microtubule attachment, whereas Spc105 plays a predominant role in the kinetochore-microtubule attachment as well as in the poleward movement of chromosomes, SAC response, and cell viability. Copyright © 2013 Wiley Periodicals, Inc.
Structure of MyTH4-FERM domains in myosin VIIa tail bound to cargo.
Wu, Lin; Pan, Lifeng; Wei, Zhiyi; Zhang, Mingjie
2011-02-11
The unconventional myosin VIIa (MYO7A) is one of the five proteins that form a network of complexes involved in formation of stereocilia. Defects in these proteins cause syndromic deaf-blindness in humans [Usher syndrome I (USH1)]. Many disease-causing mutations occur in myosin tail homology 4-protein 4.1, ezrin, radixin, moesin (MyTH4-FERM) domains in the myosin tail that binds to another USH1 protein, Sans. We report the crystal structure of MYO7A MyTH4-FERM domains in complex with the central domain (CEN) of Sans at 2.8 angstrom resolution. The MyTH4 and FERM domains form an integral structural and functional supramodule binding to two highly conserved segments (CEN1 and 2) of Sans. The MyTH4-FERM/CEN complex structure provides mechanistic explanations for known deafness-causing mutations in MYO7A MyTH4-FERM. The structure will also facilitate mechanistic and functional studies of MyTH4-FERM domains in other myosins.
Network Approach to Disease Diagnosis
NASA Astrophysics Data System (ADS)
Sharma, Amitabh; Bashan, Amir; Barabasi, Albert-Laszlo
2014-03-01
Human diseases could be viewed as perturbations of the underlying biological system. A thorough understanding of the topological and dynamical properties of the biological system is crucial to explain the mechanisms of many complex diseases. Recently, network-based approaches have provided a framework for integrating multi-dimensional biological data that results in a better understanding of the pathophysiological state of complex diseases. Here we provide a network-based framework to improve the diagnosis of complex diseases. This framework is based on the integration of transcriptomics and the interactome. We analyze the overlap between the differentially expressed (DE) genes and disease genes (DGs) based on their locations in the molecular interaction network ("interactome"). Disease genes and their protein products tend to be much more highly connected than random, hence defining a disease sub-graph (called disease module) in the interactome. DE genes, even though different from the known set of DGs, may be significantly associated with the disease when considering their closeness to the disease module in the interactome. This new network approach holds the promise to improve the diagnosis of patients who cannot be diagnosed using conventional tools. Support was provided by HL066289 and HL105339 grants from the U.S. National Institutes of Health.
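One simple way to quantify the closeness of DE genes to a disease module is sketched below. This is an assumption-laden illustration rather than the authors' measure: closeness is taken to be the mean shortest-path distance from each DE gene to its nearest disease gene, computed with a multi-source breadth-first search. The toy interactome, the gene labels and the function names (`distance_to_module`, `mean_closeness`) are hypothetical.

```python
from collections import defaultdict, deque


def distance_to_module(graph, module):
    """Multi-source BFS: hop distance from every reachable node
    to its nearest gene in the disease module."""
    dist = {gene: 0 for gene in module if gene in graph}
    queue = deque(dist)
    while queue:
        node = queue.popleft()
        for neighbour in graph[node]:
            if neighbour not in dist:
                dist[neighbour] = dist[node] + 1
                queue.append(neighbour)
    return dist


def mean_closeness(graph, de_genes, module):
    """Mean distance from DE genes to the module; unreachable genes are skipped."""
    dist = distance_to_module(graph, module)
    reachable = [dist[g] for g in de_genes if g in dist]
    return sum(reachable) / len(reachable) if reachable else float("inf")


# Hypothetical toy interactome built from an undirected edge list.
edges = [("A", "B"), ("B", "C"), ("C", "D"), ("D", "E"), ("E", "F"), ("B", "G")]
interactome = defaultdict(set)
for u, v in edges:
    interactome[u].add(v)
    interactome[v].add(u)

disease_module = {"A", "B"}   # assumed known disease genes
de_genes = {"C", "G", "F"}    # assumed differentially expressed genes

module_dist = distance_to_module(interactome, disease_module)
closeness = mean_closeness(interactome, de_genes, disease_module)
```

To judge significance, the observed value would be compared against the same statistic for randomly drawn gene sets, ideally matched on node degree.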
Characterization of MRP RNA–protein interactions within the perinucleolar compartment
Pollock, Callie; Daily, Kelly; Nguyen, Van Trung; Wang, Chen; Lewandowska, Marzena Anna; Bensaude, Olivier; Huang, Sui
2011-01-01
The perinucleolar compartment (PNC) forms in cancer cells and is highly enriched with a subset of polymerase III RNAs and RNA-binding proteins. Here we report that PNC components mitochondrial RNA–processing (MRP) RNA, pyrimidine tract–binding protein (PTB), and CUG-binding protein (CUGBP) interact in vivo, as demonstrated by coimmunoprecipitation and RNA pull-down experiments. Glycerol gradient analyses show that this complex is large and sediments at a different fraction from known MRP RNA–containing complexes, the MRP ribonucleoprotein ribozyme and human telomerase reverse transcriptase. Tethering PNC components to a LacO locus recruits other PNC components, further confirming the in vivo interactions. These interactions are present both in PNC-containing and -lacking cells. High-resolution localization analyses demonstrate that MRP RNA, CUGBP, and PTB colocalize at the PNC as a reticulated network, intertwining with newly synthesized RNA. Furthermore, green fluorescent protein (GFP)–PTB and GFP-CUGBP show a slower rate of fluorescence recovery after photobleaching at the PNC than in the nucleoplasm, illustrating the different molecular interaction of the complexes associated with the PNC. These findings support a working model in which the MRP RNA–protein complex becomes nucleated at the PNC in cancer cells and may play a role in gene expression regulation at the DNA locus that associates with the PNC. PMID:21233287
Exploring Biomolecular Recognition by Modeling and Simulation
NASA Astrophysics Data System (ADS)
Wade, Rebecca
2007-12-01
Biomolecular recognition is complex. The balance between the different molecular properties that contribute to molecular recognition, such as shape, electrostatics, dynamics and entropy, varies from case to case. This, along with the extent of experimental characterization, influences the choice of appropriate computational approaches to study biomolecular interactions. I will present computational studies in which we aim to make concerted use of bioinformatics, biochemical network modeling and molecular simulation techniques to study protein-protein and protein-small molecule interactions and to facilitate computer-aided drug design.
2011-01-01
Background: To make sense of gene expression profiles, analyses must be pushed beyond the mere listing of affected genes. For example, if a group of genes persistently displays similar changes in expression levels under particular experimental conditions, and the proteins encoded by these genes interact and function in the same cellular compartments, this could be taken as a very strong indicator of co-regulated protein complexes. One of the key requirements is having appropriate tools to detect such regulatory patterns. Results: We have analyzed the global adaptations in gene expression patterns in budding yeast when the Hsp90 molecular chaperone complex is perturbed either pharmacologically or genetically. We integrated these results with publicly accessible expression, protein-protein interaction and intracellular localization data. Most importantly, all experimental conditions were simultaneously and dynamically visualized with an animation. This critically facilitated the detection of patterns of gene expression changes that suggested underlying regulatory networks that a standard analysis by pairwise comparison and clustering could not have revealed. Conclusions: The results of the animation-assisted detection of changes in gene regulatory patterns make predictions about the potential roles of Hsp90 and its co-chaperone p23 in regulating whole sets of genes. The simultaneous dynamic visualization of microarray experiments, represented in networks built by integrating one's own experimental data with publicly accessible data, represents a powerful discovery tool that allows the generation of new interpretations and hypotheses. PMID:21672238
An overview on the delivery of antitumor drug doxorubicin by carrier proteins.
Agudelo, D; Bérubé, G; Tajmir-Riahi, H A
2016-07-01
Serum proteins play an increasing role as drug carriers in clinical settings. In this review, we have compared the binding modalities of the anticancer drug doxorubicin (DOX) to three model carrier proteins, human serum albumin (HSA), bovine serum albumin (BSA) and milk beta-lactoglobulin (β-LG), in order to determine the potential application of these model proteins in DOX delivery. Molecular modeling studies showed stronger binding of DOX with HSA than with BSA and β-LG, with binding free energies of -10.75 (DOX-HSA), -9.31 (DOX-BSA) and -8.12 kcal/mol (DOX-β-LG). An extensive H-bonding network stabilizes DOX-protein conjugation and plays a major role in drug-protein complex formation. DOX complexation induced major alterations of HSA and BSA conformations but did not alter β-LG secondary structure. The literature review shows that these proteins can potentially be used for delivery of DOX in vitro and in vivo. Copyright © 2016 Elsevier B.V. All rights reserved.
Dynamic protein interaction networks and new structural paradigms in signaling
Csizmok, Veronika; Follis, Ariele Viacava; Kriwacki, Richard W.; Forman-Kay, Julie D.
2017-01-01
Understanding signaling and other complex biological processes requires elucidating the critical roles of intrinsically disordered proteins and regions (IDPs/IDRs), which represent ~30% of the proteome and enable unique regulatory mechanisms. In this review we describe the structural heterogeneity of disordered proteins that underpins these mechanisms and the latest progress in obtaining structural descriptions of ensembles of disordered proteins that are needed for linking structure and dynamics to function. We describe the diverse interactions of IDPs that can have unusual characteristics such as “ultrasensitivity” and “regulated folding and unfolding”. We also summarize the mounting data showing that large-scale assembly and protein phase separation occur within a variety of signaling complexes and cellular structures. In addition, we discuss efforts to therapeutically target disordered proteins with small molecules. Overall, we interpret the remodeling of disordered state ensembles due to binding and post-translational modifications within an expanded framework for allostery that provides significant insights into how disordered proteins transmit biological information. PMID:26922996
[Fanconi anemia: genes and function(s) revisited].
Papadopoulo, Dora; Moustacchi, Ethel
2005-01-01
Fanconi anemia (FA), a rare inherited disorder, exhibits a complex phenotype including progressive bone marrow failure, congenital malformations and increased risk of cancers, mainly acute myeloid leukaemia. At the cellular level, FA is characterized by hypersensitivity to DNA cross-linking agents and by high frequencies of induced chromosomal aberrations, a property used for diagnosis. FA results from mutations in one of the eleven FANC (FANCA to FANCJ) genes. Nine of them have been identified. In addition, the FANCD1 gene has been shown to be identical to BRCA2, one of the two breast cancer susceptibility genes. Seven of the FANC proteins form a complex, which exists in four different forms depending on its subcellular localisation. Four FANC proteins (D1(BRCA2), D2, I and J) are not associated with the complex. The presence of the nuclear form of the FA core complex is necessary for the mono-ubiquitinylation of the FANCD2 protein, a modification required for its re-localization to nuclear foci, likely to be sites of DNA repair. A clue towards understanding the molecular function of the FANC genes comes from the recently identified connection of FANC to the BRCA1, ATM, NBS1 and ATR genes. Two of the FANC proteins (A and D2) directly interact with BRCA1, which in turn interacts with the MRE11/RAD50/NBS1 complex, one of the key components in the mechanisms involved in the cellular response to DNA double strand breaks (DSB). Moreover, ATM, a protein kinase that plays a central role in the network of DSB signalling, phosphorylates FANCD2 in vitro and in vivo in response to ionising radiation. In addition, the NBS1 protein and the monoubiquitinated form of FANCD2 seem to act together in response to DNA crosslinking agents. Taken together with the previously reported impaired repair of DSB and DNA interstrand crosslinks in FA cells, the connection of the FANC genes to ATM, ATR, NBS1 and BRCA1 links the function of the FANC genes to the finely orchestrated network involved in the sensing, signalling and repair of DNA replication-blocking lesions.
Abascal-Palacios, Guillermo; Schindler, Christina; Rojas, Adriana L; Bonifacino, Juan S.; Hierro, Aitor
2016-01-01
Summary: The Golgi-Associated Retrograde Protein (GARP) is a tethering complex involved in the fusion of endosome-derived transport vesicles to the trans-Golgi network through interaction with components of the Syntaxin 6/Syntaxin 16/Vti1a/VAMP4 SNARE complex. The mechanisms by which GARP and other tethering factors engage the SNARE fusion machinery are poorly understood. Herein we report the structural basis for the interaction of the human Ang2 subunit of GARP with Syntaxin 6 and the closely related Syntaxin 10. The crystal structure of Syntaxin 6 Habc domain in complex with a peptide from the N terminus of Ang2 shows a novel binding mode in which a di-tyrosine motif of Ang2 interacts with a highly conserved groove in Syntaxin 6. Structure-based mutational analyses validate the crystal structure and support the phylogenetic conservation of this interaction. The same binding determinants are found in other tethering proteins and syntaxins, suggesting a general interaction mechanism. PMID:23932592
Retriever, a multiprotein complex for retromer-independent endosomal cargo recycling
McNally, Kerrie E.; Faulkner, Rebecca; Steinberg, Florian; Gallon, Matthew; Ghai, Rajesh; Pim, David; Langton, Paul; Pearson, Neil; Danson, Chris M.; Nägele, Heike; Morris, Lindsey M; Singla, Arnika; Overlee, Brittany L; Heesom, Kate J.; Sessions, Richard; Banks, Lawrence; Collins, Brett M; Berger, Imre; Billadeau, Daniel D.; Burstein, Ezra; Cullen, Peter J.
2018-01-01
Following endocytosis and entry into the endosomal network, integral membrane proteins undergo sorting for lysosomal degradation or are alternatively retrieved and recycled back to the cell surface. Here we describe the discovery of an ancient and conserved multi-protein complex which orchestrates cargo retrieval and recycling and importantly, is biochemically and functionally distinct to the established retromer pathway. Composed of a heterotrimer of DSCR3, C16orf62 and VPS29, and bearing striking similarity with retromer, we have called this complex ‘retriever’. We establish that retriever associates with the cargo adaptor sorting nexin 17 (SNX17) and couples to the CCC and WASH complexes to prevent lysosomal degradation and promote cell surface recycling of α5β1-integrin. Through quantitative proteomic analysis we identify over 120 cell surface proteins, including numerous integrins, signalling receptors and solute transporters, which require SNX17-retriever to maintain their surface levels. Our identification of retriever establishes a major new endosomal retrieval and recycling pathway. PMID:28892079
An organelle-specific protein landscape identifies novel diseases and molecular mechanisms
Boldt, Karsten; van Reeuwijk, Jeroen; Lu, Qianhao; Koutroumpas, Konstantinos; Nguyen, Thanh-Minh T.; Texier, Yves; van Beersum, Sylvia E. C.; Horn, Nicola; Willer, Jason R.; Mans, Dorus A.; Dougherty, Gerard; Lamers, Ideke J. C.; Coene, Karlien L. M.; Arts, Heleen H.; Betts, Matthew J.; Beyer, Tina; Bolat, Emine; Gloeckner, Christian Johannes; Haidari, Khatera; Hetterschijt, Lisette; Iaconis, Daniela; Jenkins, Dagan; Klose, Franziska; Knapp, Barbara; Latour, Brooke; Letteboer, Stef J. F.; Marcelis, Carlo L.; Mitic, Dragana; Morleo, Manuela; Oud, Machteld M.; Riemersma, Moniek; Rix, Susan; Terhal, Paulien A.; Toedt, Grischa; van Dam, Teunis J. P.; de Vrieze, Erik; Wissinger, Yasmin; Wu, Ka Man; Apic, Gordana; Beales, Philip L.; Blacque, Oliver E.; Gibson, Toby J.; Huynen, Martijn A.; Katsanis, Nicholas; Kremer, Hannie; Omran, Heymut; van Wijk, Erwin; Wolfrum, Uwe; Kepes, François; Davis, Erica E.; Franco, Brunella; Giles, Rachel H.; Ueffing, Marius; Russell, Robert B.; Roepman, Ronald; Al-Turki, Saeed; Anderson, Carl; Antony, Dinu; Barroso, Inês; Bentham, Jamie; Bhattacharya, Shoumo; Carss, Keren; Chatterjee, Krishna; Cirak, Sebahattin; Cosgrove, Catherine; Danecek, Petr; Durbin, Richard; Fitzpatrick, David; Floyd, Jamie; Reghan Foley, A.; Franklin, Chris; Futema, Marta; Humphries, Steve E.; Hurles, Matt; Joyce, Chris; McCarthy, Shane; Mitchison, Hannah M.; Muddyman, Dawn; Muntoni, Francesco; O'Rahilly, Stephen; Onoufriadis, Alexandros; Payne, Felicity; Plagnol, Vincent; Raymond, Lucy; Savage, David B.; Scambler, Peter; Schmidts, Miriam; Schoenmakers, Nadia; Semple, Robert; Serra, Eva; Stalker, Jim; van Kogelenberg, Margriet; Vijayarangakannan, Parthiban; Walter, Klaudia; Whittall, Ros; Williamson, Kathy
2016-01-01
Cellular organelles provide opportunities to relate biological mechanisms to disease. Here we use affinity proteomics, genetics and cell biology to interrogate cilia: poorly understood organelles, where defects cause genetic diseases. Two hundred and seventeen tagged human ciliary proteins create a final landscape of 1,319 proteins, 4,905 interactions and 52 complexes. Reverse tagging, repetition of purifications and statistical analyses, produce a high-resolution network that reveals organelle-specific interactions and complexes not apparent in larger studies, and links vesicle transport, the cytoskeleton, signalling and ubiquitination to ciliary signalling and proteostasis. We observe sub-complexes in exocyst and intraflagellar transport complexes, which we validate biochemically, and by probing structurally predicted, disruptive, genetic variants from ciliary disease patients. The landscape suggests other genetic diseases could be ciliary including 3M syndrome. We show that 3M genes are involved in ciliogenesis, and that patient fibroblasts lack cilia. Overall, this organelle-specific targeting strategy shows considerable promise for Systems Medicine. PMID:27173435
Evolution of synthetic signaling scaffolds by recombination of modular protein domains.
Lai, Andicus; Sato, Paloma M; Peisajovich, Sergio G
2015-06-19
Signaling scaffolds are proteins that interact via modular domains with multiple partners, regulating signaling networks in space and time and providing an ideal platform from which to alter signaling functions. However, to better exploit scaffolds for signaling engineering, it is necessary to understand the full extent of their modularity. We used a directed evolution approach to identify, from a large library of randomly shuffled protein interaction domains, variants capable of rescuing the signaling defect of a yeast strain in which Ste5, the scaffold in the mating pathway, had been deleted. After a single round of selection, we identified multiple synthetic scaffold variants with diverse domain architectures, able to mediate mating pathway activation in a pheromone-dependent manner. The facility with which this signaling network accommodates changes in scaffold architecture suggests that the mating signaling complex does not possess a single, precisely defined geometry into which the scaffold has to fit. These relaxed geometric constraints may facilitate the evolution of signaling networks, as well as their engineering for applications in synthetic biology.
Bai, Fang; Morcos, Faruck; Cheng, Ryan R; Jiang, Hualiang; Onuchic, José N
2016-12-13
Protein-protein interactions play a central role in cellular function. Improving the understanding of complex formation has many practical applications, including the rational design of new therapeutic agents and the mechanisms governing signal transduction networks. The generally large, flat, and relatively featureless binding sites of protein complexes pose many challenges for drug design. Fragment docking and direct coupling analysis are used in an integrated computational method to estimate druggable protein-protein interfaces. (i) This method explores the binding of fragment-sized molecular probes on the protein surface using a molecular docking-based screen. (ii) The energetically favorable binding sites of the probes, called hot spots, are spatially clustered to map out candidate binding sites on the protein surface. (iii) A coevolution-based interface interaction score is used to discriminate between different candidate binding sites, yielding potential interfacial targets for therapeutic drug design. This approach is validated for important, well-studied disease-related proteins with known pharmaceutical targets, and also identifies targets that have yet to be studied. Moreover, therapeutic agents are proposed by chemically connecting the fragments that are strongly bound to the hot spots.
... one component of a protein called type IV collagen. Type IV collagen molecules attach to each other to form complex ... and support cells in many tissues. Type IV collagen networks play an important role in the basement ...
Pectin/zein beads for potential colon-specific drug delivery: synthesis and in vitro evaluation.
Liu, LinShu; Fishman, Marshall L; Hicks, Kevin B; Kende, Meir; Ruthel, Gordon
2006-01-01
Novel complex hydrogel beads were prepared from two edible polymers: pectin, a carbohydrate from citrus fruits, and zein, a protein from corn. The pectin/zein complex hydrogels did not swell in physiological environments, but hydrolyzed in the presence of pectinases. An in vitro study showed the capacity of the hydrogels to endure protease attack and residence time variation. The physical and biological properties of the new hydrogels were attributed to molecular entanglement of the two polymers. The pectin networks were stabilized by the bound zein molecules. In turn, the pectin networks shielded the bound zein from protease digestion.
Hill, W D; Davies, G; van de Lagemaat, L N; Christoforou, A; Marioni, R E; Fernandes, C P D; Liewald, D C; Croning, M D R; Payton, A; Craig, L C A; Whalley, L J; Horan, M; Ollier, W; Hansell, N K; Wright, M J; Martin, N G; Montgomery, G W; Steen, V M; Le Hellard, S; Espeseth, T; Lundervold, A J; Reinvang, I; Starr, J M; Pendleton, N; Grant, S G N; Bates, T C; Deary, I J
2014-01-01
Differences in general cognitive ability (intelligence) account for approximately half of the variation in any large battery of cognitive tests and are predictive of important life events including health. Genome-wide analyses of common single-nucleotide polymorphisms indicate that they jointly tag between a quarter and a half of the variance in intelligence. However, no single polymorphism has been reliably associated with variation in intelligence. It remains possible that these many small effects might be aggregated in networks of functionally linked genes. Here, we tested a network of 1461 genes in the postsynaptic density and associated complexes for an enriched association with intelligence. These were ascertained in 3511 individuals (the Cognitive Ageing Genetics in England and Scotland (CAGES) consortium) phenotyped for general cognitive ability, fluid cognitive ability, crystallised cognitive ability, memory and speed of processing. By analysing the results of a genome wide association study (GWAS) using Gene Set Enrichment Analysis, a significant enrichment was found for fluid cognitive ability for the proteins found in the complexes of N-methyl-D-aspartate receptor complex; P=0.002. Replication was sought in two additional cohorts (N=670 and 2062). A meta-analytic P-value of 0.003 was found when these were combined with the CAGES consortium. The results suggest that genetic variation in the macromolecular machines formed by membrane-associated guanylate kinase (MAGUK) scaffold proteins and their interaction partners contributes to variation in intelligence. PMID:24399044
Regulation of mTORC1 by PI3K signaling.
Dibble, Christian C; Cantley, Lewis C
2015-09-01
The class I phosphoinositide 3-kinase (PI3K)-mechanistic target of rapamycin (mTOR) complex 1 (mTORC1) signaling network directs cellular metabolism and growth. Activation of mTORC1 [composed of mTOR, regulatory-associated protein of mTOR (Raptor), mammalian lethal with SEC13 protein 8(mLST8), 40-kDa proline-rich Akt substrate (PRAS40), and DEP domain-containing mTOR-interacting protein (DEPTOR)] depends on the Ras-related GTPases (Rags) and Ras homolog enriched in brain (Rheb) GTPase and requires signals from amino acids, glucose, oxygen, energy (ATP), and growth factors (including cytokines and hormones such as insulin). Here we discuss the signal transduction mechanisms through which growth factor-responsive PI3K signaling activates mTORC1. We focus on how PI3K-dependent activation of Akt and spatial regulation of the tuberous sclerosis complex (TSC) complex (TSC complex) [composed of TSC1, TSC2, and Tre2-Bub2-Cdc16-1 domain family member 7 (TBC1D7)] switches on Rheb at the lysosome, where mTORC1 is activated. Integration of PI3K- and amino acid-dependent signals upstream of mTORC1 at the lysosome is detailed in a working model. A coherent understanding of the PI3K-mTORC1 network is imperative as its dysregulation has been implicated in diverse pathologies including cancer, diabetes, autism, and aging. Copyright © 2015 Elsevier Ltd. All rights reserved.
Resemblance of actin-binding protein/actin gels to covalently crosslinked networks
NASA Astrophysics Data System (ADS)
Janmey, Paul A.; Hvidt, Søren; Lamb, Jennifer; Stossel, Thomas P.
1990-05-01
The maintenance of the shape of cells is often due to their surface elasticity, which arises mainly from an actin-rich cytoplasmic cortex1,2. On locomotion, phagocytosis or fission, however, these cells become partially fluid-like. The finding of proteins that can bind to actin and control the assembly of, or crosslink, actin filaments, and of intracellular messages that regulate the activities of some of these actin-binding proteins, indicates that such 'gel-sol' transformations result from the rearrangement of cortical actin-rich networks3. Alternatively, on the basis of a study of the mechanical properties of mixtures of actin filaments and an Acanthamoeba actin-binding protein, α-actinin, it has been proposed that these transformations can be accounted for by rapid exchange of crosslinks between actin filaments4: the cortical network would be solid when the deformation rate is greater than the rate of crosslink exchange, but would deform or 'creep' when deformation is slow enough to permit crosslinker molecules to rearrange. Here we report, however, that mixtures of actin filaments and actin-binding protein (ABP), an actin crosslinking protein of many higher eukaryotes, form gels rheologically equivalent to covalently crosslinked networks. These gels do not creep in response to applied stress on a time scale compatible with most cell-surface movements. These findings support a more complex and controlled mechanism underlying the dynamic mechanical properties of cortical cytoplasm, and can explain why cells do not collapse under the constant shear forces that often exist in tissues.
Bueno, Anibal; Morilla, Ian; Diez, Diego; Moya-Garcia, Aurelio A.; Lozano, José; Ranea, Juan A.G.
2016-01-01
RAS proteins are the founding members of the RAS superfamily of GTPases. They are involved in key signaling pathways regulating essential cellular functions such as cell growth and differentiation. As a result, their deregulation by inactivating mutations often results in aberrant cell proliferation and cancer. With the exception of the relatively well-known KRAS, HRAS and NRAS proteins, little is known about how the interactions of the other RAS human paralogs affect cancer evolution and response to treatment. In this study we performed a comprehensive analysis of the relationship between the phylogeny of RAS proteins and their location in the protein interaction network. This analysis was integrated with the structural analysis of conserved positions in available 3D structures of RAS complexes. Our results show that many RAS proteins with divergent sequences are found close together in the human interactome. We found specific conserved amino acid positions in this group that map to the binding sites of RAS with many of their signaling effectors, suggesting that these pairs could share interacting partners. These results underscore the potential relevance of cross-talking in the RAS signaling network, which should be taken into account when considering the inhibitory activity of drugs targeting specific RAS oncoproteins. This study broadens our understanding of the human RAS signaling network and stresses the importance of considering its potential cross-talk in future therapies. PMID:27713118
Michelin, Adeline; Bittame, Amina; Bordat, Yann; Travier, Laetitia; Mercier, Corinne; Dubremetz, Jean-François; Lebrun, Maryse
2009-02-01
The intracellular protozoan parasite Toxoplasma gondii develops within the parasitophorous vacuole (PV), an intracellular niche in which it secretes proteins from secretory organelles named dense granules and rhoptries. Here, we describe a new dense granule protein that should now be referred to as GRA12, and that displays no homology with other proteins. Immunofluorescence and immuno-electron microscopy showed that GRA12 behaves similarly to both GRA2 and GRA6. It is secreted into the PV from the anterior pole of the parasite soon after the beginning of invasion, transits to the posterior invaginated pocket of the parasite where a membranous tubulovesicular network is first assembled, and finally resides throughout the vacuolar space, associated with the mature membranous nanotubular network. GRA12 fails to localise at the parasite posterior end in the absence of GRA2. Within the vacuolar space, like the other GRA proteins, GRA12 exists in both a soluble and a membrane-associated form. Using affinity chromatography experiments, we showed that in both the parasite and the PV soluble fractions, GRA12 is purified with the complex of GRA proteins associated with a tagged version of GRA2 and that this association is lost in the PV membranous fraction.
Continually emerging mechanistic complexity of the multi-enzyme cellulosome complex.
Smith, Steven P; Bayer, Edward A; Czjzek, Mirjam
2017-06-01
The robust plant cell wall polysaccharide-degrading properties of anaerobic bacteria are harnessed within elegant, macromolecular assemblages called cellulosomes, in which proteins of complementary activities amass on scaffold protein networks. Research efforts have focused and continue to focus on providing detailed mechanistic insights into cellulosomal complex assembly, topology, and function. The accumulated information is expanding our fundamental understanding of the lignocellulosic biomass decomposition process and enhancing the potential of engineered cellulosomal systems for biotechnological purposes. Ongoing biochemical studies continue to reveal unexpected functional diversity within traditional cellulase families. Genomic, proteomic, and functional analyses have uncovered unanticipated cellulosomal proteins that augment the function of the native and designer cellulosomes. In addition, complementary structural and computational methods are continuing to provide much needed insights on the influence of cellulosomal interdomain linker regions on cellulosomal assembly and activity. Copyright © 2017 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Sherman, Eilon
2016-06-01
Signal transduction is mediated by heterogeneous and dynamic protein complexes. Such complexes play a critical role in diverse cell functions, with the important example of T cell activation. Biochemical studies of signalling complexes and their imaging by diffraction limited microscopy have resulted in an intricate network of interactions downstream of the T cell antigen receptor (TCR). However, in spite of their crucial roles in T cell activation, much remains to be learned about these signalling complexes, including their heterogeneous contents and size distribution, their complex arrangements in the plasma membrane (PM), and the molecular requirements for their formation. Here, we review how recent advancements in single molecule localization microscopy (SMLM) have helped to shed new light on the organization of signalling complexes in single molecule detail in intact T cells. From these studies emerges a picture where cells extensively employ hierarchical and dynamic patterns of nano-scale organization to control the local concentration of interacting molecular species. These patterns are suggested to play a critical role in cell decision making. The combination of SMLM with more traditional techniques is expected to continue to contribute critically to our understanding of multimolecular protein complexes and their significance to cell function.
Evolution of insect proteomes: insights into synapse organization and synaptic vesicle life cycle
Yanay, Chava; Morpurgo, Noa; Linial, Michal
2008-01-01
Background: The molecular components in synapses that are essential to the life cycle of synaptic vesicles are well characterized. Nonetheless, many aspects of synaptic processes, in particular how they relate to complex behaviour, remain elusive. The genomes of flies, mosquitoes, the honeybee and the beetle are now fully sequenced and span an evolutionary breadth of about 350 million years; this provides a unique opportunity to conduct a comparative genomics study of the synapse. Results: We compiled a list of 120 gene prototypes that comprise the core of presynaptic structures in insects. Insects lack several scaffolding proteins in the active zone, such as bassoon and piccolo, and the most abundant protein in the mammalian synaptic vesicle, namely synaptophysin. The pattern of evolution of synaptic protein complexes is analyzed. According to this analysis, the components of presynaptic complexes as well as proteins that take part in organelle biogenesis are tightly coordinated. Most synaptic proteins are involved in rich protein interaction networks. Overall, the number of interacting proteins and the degrees of sequence conservation between human and insects are closely correlated. Such a correlation holds for exocytotic but not for endocytotic proteins. Conclusion: This comparative study of human and insect proteomes sheds light on the composition and assembly of protein complexes in the synapse. Specifically, the nature of the protein interaction graphs differentiates exocytotic from endocytotic proteins and suggests unique evolutionary constraints for each set. General principles in the design of proteins of the presynaptic site can be inferred from a comparative study of human and insect genomes. PMID:18257909
Didier, Caroline; Forno, Guillermina; Etcheverrigaray, Marina; Kratje, Ricardo; Goicoechea, Héctor
2009-09-21
The optimal blends of six compounds that should be present in culture media used in recombinant protein production were determined by means of artificial neural networks (ANN) coupled with crossed mixture experimental design. This combination constitutes a novel approach to develop a medium for cultivating genetically engineered mammalian cells. The compounds were collected in two mixtures of three elements each, and the experimental space was determined by a crossed mixture design. Empirical data from 51 experimental units were used in a multiresponse analysis to train artificial neural networks which satisfy different requirements, in order to define two new culture media (Medium 1 and Medium 2) to be used in a continuous biopharmaceutical production process. These media were tested in a bioreactor to produce a recombinant protein in CHO cells. Remarkably, for both predicted media all responses satisfied the predefined goals pursued during the analysis, except in the case of the specific growth rate (mu) observed for Medium 1. ANN analysis proved to be a suitable methodology to be used when dealing with complex experimental designs, as frequently occurs in the optimization of production processes in the biotechnology area. The present work is a new example of the use of ANN for the resolution of a complex, real life system, successfully employed in the context of a biopharmaceutical production process.
Networks of blood proteins in the neuroimmunology of schizophrenia.
Jeffries, Clark D; Perkins, Diana O; Fournier, Margot; Do, Kim Q; Cuenod, Michel; Khadimallah, Ines; Domenici, Enrico; Addington, Jean; Bearden, Carrie E; Cadenhead, Kristin S; Cannon, Tyrone D; Cornblatt, Barbara A; Mathalon, Daniel H; McGlashan, Thomas H; Seidman, Larry J; Tsuang, Ming; Walker, Elaine F; Woods, Scott W
2018-06-06
Levels of certain circulating cytokines and related immune system molecules are consistently altered in schizophrenia and related disorders. In addition to absolute analyte levels, we sought analytes in correlation networks that could be prognostic. We analyzed baseline blood plasma samples with a Luminex platform from 72 subjects meeting criteria for a psychosis clinical high-risk syndrome; 32 subjects converted to a diagnosis of psychotic disorder within two years while 40 other subjects did not. Another comparison group included 35 unaffected subjects. Assays of 141 analytes passed early quality control. We then used an unweighted co-expression network analysis to identify highly correlated modules in each group. Overall, there was a striking loss of network complexity going from unaffected subjects to nonconverters and thence to converters (applying standard, graph-theoretic metrics). Graph differences were largely driven by proteins regulating tissue remodeling (e.g. blood-brain barrier). In more detail, certain sets of antithetical proteins were highly correlated in unaffected subjects (e.g. SERPINE1 vs MMP9), as expected in homeostasis. However, for particular protein pairs this trend was reversed in converters (e.g. SERPINE1 vs TIMP1, being synthetical inhibitors of remodeling of extracellular matrix and vasculature). Thus, some correlation signals strongly predict impending conversion to a psychotic disorder and directly suggest pharmaceutical targets.
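The unweighted co-expression network step described in the abstract above can be illustrated with a minimal sketch. It assumes that an edge is drawn whenever the absolute Pearson correlation between two analytes reaches a cutoff; the cutoff value, the function names and the analyte labels are all hypothetical and are not taken from the study. Edge density stands in for the "standard, graph-theoretic metrics" the abstract mentions.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy)


def coexpression_edges(levels, threshold=0.8):
    """Unweighted edges between analytes whose |r| meets the cutoff.

    `levels` maps an analyte name to its measurements across subjects.
    The 0.8 default is an illustrative assumption, not the study's value.
    """
    return [
        (a, b)
        for a, b in combinations(sorted(levels), 2)
        if abs(pearson(levels[a], levels[b])) >= threshold
    ]


def edge_density(n_nodes, edges):
    """Fraction of possible undirected edges that are present."""
    return 2 * len(edges) / (n_nodes * (n_nodes - 1))


# Hypothetical analyte levels for four subjects.
levels = {
    "A": [1, 2, 3, 4],
    "B": [2, 4, 6, 8.5],
    "C": [4, 3, 2, 1],
    "D": [1, -1, 1, -1],
}
edges = coexpression_edges(levels)
print(edges, edge_density(len(levels), edges))
```

Computing the density separately for each subject group would give one simple way to compare network complexity between groups, under the same assumptions.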
Right-side-stretched multifractal spectra indicate small-worldness in networks
NASA Astrophysics Data System (ADS)
Oświȩcimka, Paweł; Livi, Lorenzo; Drożdż, Stanisław
2018-04-01
Complex network formalism allows one to explain the behavior of systems composed of interacting units. Several prototypical network models have been proposed thus far. The small-world model has been introduced to mimic two important features observed in real-world systems: i) local clustering and ii) the possibility to move across a network by means of long-range links that significantly reduce the characteristic path length. A natural question would be whether there exist several "types" of small-world architectures, giving rise to a continuum of models with properties (partially) shared with other models belonging to different network families. Here, we take advantage of the interplay between network theory and time series analysis and propose to investigate small-world signatures in complex networks by analyzing multifractal characteristics of time series generated from such networks. In particular, we suggest that the degree of right-sided asymmetry of multifractal spectra is linked with the degree of small-worldness present in networks. This claim is supported by numerical simulations performed on several parametric models, including prototypical small-world networks, scale-free, fractal and also real-world networks describing protein molecules. Our results also indicate that right-sided asymmetry emerges with the presence of the following topological properties: low edge density, low average shortest path, and high clustering coefficient.
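The two small-world features named in the abstract above, local clustering and short characteristic path length, can be computed directly on a toy graph. The sketch below is only an illustration under simple assumptions (an undirected, connected graph stored as adjacency sets); it does not reproduce the multifractal analysis, and the function names are hypothetical.

```python
from collections import deque
from itertools import combinations


def ring_lattice(n, k):
    """Ring of n nodes, each linked to its k nearest neighbours (k even)."""
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for step in range(1, k // 2 + 1):
            j = (i + step) % n
            adj[i].add(j)
            adj[j].add(i)
    return adj


def average_clustering(adj):
    """Mean over nodes of the fraction of neighbour pairs that are linked."""
    total = 0.0
    for nbrs in adj.values():
        if len(nbrs) < 2:
            continue  # nodes with fewer than two neighbours contribute 0
        links = sum(1 for a, b in combinations(nbrs, 2) if b in adj[a])
        total += 2 * links / (len(nbrs) * (len(nbrs) - 1))
    return total / len(adj)


def average_shortest_path(adj):
    """Mean hop distance over all reachable ordered pairs (BFS per node)."""
    total, pairs = 0, 0
    for source in adj:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs


lattice = ring_lattice(10, 4)
base_path = average_shortest_path(lattice)

# One long-range shortcut shortens paths while clustering stays fairly high.
lattice[0].add(5)
lattice[5].add(0)
print(base_path, average_shortest_path(lattice), average_clustering(lattice))
```

Adding a single long-range link to the regular ring is the smallest possible version of the rewiring idea behind small-world models.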
Wu, Jianlan; Tang, Zhoufei; Gong, Zhihao; Cao, Jianshu; Mukamel, Shaul
2015-04-02
The energy absorbed in a light-harvesting protein complex is often transferred collectively through aggregated chromophore clusters. For population evolution of chromophores, the time-integrated effective rate matrix allows us to construct quantum kinetic clusters quantitatively and determine the reduced cluster-cluster transfer rates systematically, thus defining a minimal model of energy-transfer kinetics. For Fenna-Matthews-Olson (FMO) and light-harvesting complex II (LHCII) monomers, quantum Markovian kinetics of clusters can accurately reproduce the overall energy-transfer process in the long-time scale. The dominant energy-transfer pathways are identified in the picture of aggregated clusters. The chromophores distributed extensively in various clusters can assist a fast and long-range energy transfer.
Pan, Weiran; Li, Gang; Yang, Xiaoxiao; Miao, Jinming
2015-04-01
This study aims to explore the potential mechanism of glioma through bioinformatic approaches. The gene expression profile (GSE4290) of glioma tumor and non-tumor samples was downloaded from Gene Expression Omnibus database. A total of 180 samples were available, including 23 non-tumor and 157 tumor samples. Then the raw data were preprocessed using robust multiarray analysis, and 8,890 differentially expressed genes (DEGs) were identified by using t-test (false discovery rate < 0.0005). Furthermore, 16 known glioma related genes were abstracted from Genetic Association Database. After mapping 8,890 DEGs and 16 known glioma related genes to Human Protein Reference Database, a glioma associated protein-protein interaction network (GAPN) was constructed. In addition, 51 sub-networks in GAPN were screened out through Molecular Complex Detection (score ≥ 1), and sub-network 1 was found to have the closest interaction (score = 3). What is more, for the top 10 sub-networks, Gene Ontology (GO) enrichment analysis (p value < 0.05) was performed, and DEGs involved in sub-network 1 and 2, such as BRMS1L and CCNA1, were predicted to regulate cell growth, cell cycle, and DNA replication via interacting with known glioma related genes. Finally, the overlaps of DEGs and human essential, housekeeping, tissue-specific genes were calculated (p value = 1.0, 1.0, and 0.00014, respectively) and visualized by Venn Diagram package in R. About 61% of human tissue-specific genes were DEGs as well. This research sheds new light on the pathogenesis of glioma based on DEGs and GAPN, and our findings might provide potential targets for clinical glioma treatment.
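The abstract above selects DEGs by a false discovery rate cutoff applied to t-test results but does not say which FDR procedure was used. As a minimal sketch, assuming the common Benjamini-Hochberg adjustment, the selection step could look like the following; the gene labels and p-values are invented for illustration, and the function names are hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def select_degs(gene_p_values, fdr=0.0005):
    """Genes whose adjusted p-value falls below the FDR cutoff.

    The 0.0005 default mirrors the cutoff quoted in the abstract; using
    Benjamini-Hochberg here is an assumption, not the authors' stated method.
    """
    genes = list(gene_p_values)
    q_values = benjamini_hochberg([gene_p_values[g] for g in genes])
    return [g for g, q in zip(genes, q_values) if q < fdr]


# Hypothetical per-gene t-test p-values.
example = {"geneA": 1e-6, "geneB": 0.2, "geneC": 1e-5}
print(select_degs(example))
```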
Detecting Network Communities: An Application to Phylogenetic Analysis
Andrade, Roberto F. S.; Rocha-Neto, Ivan C.; Santos, Leonardo B. L.; de Santana, Charles N.; Diniz, Marcelo V. C.; Lobão, Thierry Petit; Goés-Neto, Aristóteles; Pinho, Suani T. R.; El-Hani, Charbel N.
2011-01-01
This paper proposes a new method to identify communities in generally weighted complex networks and applies it to phylogenetic analysis. In this case, weights correspond to the similarity indexes among protein sequences, which can be used for network construction so that the network structure can be analyzed to recover phylogenetically useful information from its properties. The analyses discussed here are mainly based on the modular character of protein similarity networks, explored through the Newman-Girvan algorithm, with the help of the neighborhood matrix. The most relevant networks are found when the network topology changes abruptly revealing distinct modules related to the sets of organisms to which the proteins belong. Sound biological information can be retrieved by the computational routines used in the network approach, without using biological assumptions other than those incorporated by BLAST. Usually, all the main bacterial phyla and, in some cases, also some bacterial classes corresponded totally (100%) or to a great extent (>70%) to the modules. We checked for internal consistency in the obtained results, and we scored close to 84% of matches for community pertinence when comparisons between the results were performed. To illustrate how to use the network-based method, we employed data for enzymes involved in the chitin metabolic pathway that are present in more than 100 organisms from an original data set containing 1,695 organisms, downloaded from GenBank on May 19, 2007. A preliminary comparison between the outcomes of the network-based method and the results of methods based on Bayesian, distance, likelihood, and parsimony criteria suggests that the former is as reliable as these commonly used methods. We conclude that the network-based method can be used as a powerful tool for retrieving modularity information from weighted networks, which is useful for phylogenetic analysis. PMID:21573202
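The abstract above relies on the modular character of weighted similarity networks. The sketch below does not implement the Newman-Girvan edge-removal algorithm or the neighborhood matrix; it only computes the related modularity score Q for a given partition of a weighted undirected network, which is one common way to judge how well a set of modules fits. The edge list, node labels and function name are hypothetical.

```python
from collections import defaultdict


def modularity(edges, community):
    """Modularity Q of a partition of a weighted undirected network.

    `edges` lists each edge once as (u, v, weight), with no self-loops;
    `community` maps every node to a community label.
    Q = sum over communities of (internal weight / m) - (strength / 2m)^2,
    where m is the total edge weight.
    """
    m = sum(w for _, _, w in edges)
    internal = defaultdict(float)
    strength = defaultdict(float)
    for u, v, w in edges:
        strength[community[u]] += w
        strength[community[v]] += w
        if community[u] == community[v]:
            internal[community[u]] += w
    return sum(
        internal[c] / m - (strength[c] / (2 * m)) ** 2 for c in strength
    )


# Two triangles joined by one bridge; weights play the role of similarities.
edges = [
    ("a", "b", 1.0), ("b", "c", 1.0), ("a", "c", 1.0),
    ("d", "e", 1.0), ("e", "f", 1.0), ("d", "f", 1.0),
    ("c", "d", 1.0),
]
split = {"a": 0, "b": 0, "c": 0, "d": 1, "e": 1, "f": 1}
print(modularity(edges, split))
```

In this toy case, splitting the graph at the bridge scores higher than keeping all nodes in one community, which scores zero.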
Modeling cytoskeletal traffic: an interplay between passive diffusion and active transport.
Neri, Izaak; Kern, Norbert; Parmeggiani, Andrea
2013-03-01
We introduce the totally asymmetric simple exclusion process with Langmuir kinetics on a network as a microscopic model for active motor protein transport on the cytoskeleton, immersed in the diffusive cytoplasm. We discuss how the interplay between active transport along a network and infinite diffusion in a bulk reservoir leads to a heterogeneous matter distribution on various scales: we find three regimes for steady state transport, corresponding to the scale of the network, of individual segments, or local to sites. At low exchange rates strong density heterogeneities develop between different segments in the network. In this regime one has to consider the topological complexity of the whole network to describe transport. In contrast, at moderate exchange rates the transport through the network decouples, and the physics is determined by single segments and the local topology. At last, for very high exchange rates the homogeneous Langmuir process dominates the stationary state. We introduce effective rate diagrams for the network to identify these different regimes. Based on this method we develop an intuitive but generic picture of how the stationary state of excluded volume processes on complex networks can be understood in terms of the single-segment phase diagram.
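The single-segment building block of the model above, a totally asymmetric simple exclusion process with Langmuir kinetics, can be sketched as a small Monte Carlo simulation. This is an illustrative assumption-laden version: it uses discrete-time random sequential updates on one open segment rather than a network, treats the rates as per-attempt probabilities (reasonable only when attachment and detachment rates are small), and all parameter and function names are hypothetical.

```python
import random


def simulate_tasep_lk(n_sites, alpha, beta, omega_a, omega_d, sweeps, seed=0):
    """Time-averaged site densities for TASEP with Langmuir kinetics.

    alpha / beta: entry and exit probabilities at the two ends.
    omega_a / omega_d: attachment and detachment probabilities in the bulk.
    Densities are averaged over the second half of `sweeps` (sweeps >= 1).
    """
    rng = random.Random(seed)
    lattice = [0] * n_sites
    totals = [0] * n_sites
    measured = 0
    for sweep in range(sweeps):
        for _ in range(n_sites + 1):
            i = rng.randrange(-1, n_sites)  # -1 stands for the entry reservoir
            if i == -1:
                if lattice[0] == 0 and rng.random() < alpha:
                    lattice[0] = 1  # injection at the left end
            elif lattice[i] == 0:
                if rng.random() < omega_a:
                    lattice[i] = 1  # attachment from the bulk reservoir
            elif rng.random() < omega_d:
                lattice[i] = 0  # detachment into the bulk reservoir
            elif i == n_sites - 1:
                if rng.random() < beta:
                    lattice[i] = 0  # exit at the right end
            elif lattice[i + 1] == 0:
                lattice[i], lattice[i + 1] = 0, 1  # directed hop, exclusion
        if sweep >= sweeps // 2:
            measured += 1
            for k in range(n_sites):
                totals[k] += lattice[k]
    return [t / measured for t in totals]


profile = simulate_tasep_lk(50, alpha=0.3, beta=0.3,
                            omega_a=0.002, omega_d=0.002, sweeps=4000)
print(sum(profile) / len(profile))
```

Varying `omega_a` and `omega_d` relative to the hopping rate gives a rough feel for the low, moderate and high exchange regimes the abstract distinguishes, although a network of coupled segments would be needed to reproduce the full model.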
C. elegans network biology: a beginning.
Piano, Fabio; Gunsalus, Kristin C; Hill, David E; Vidal, Marc
2006-01-01
The architecture and dynamics of molecular networks can provide an understanding of complex biological processes complementary to that obtained from the in-depth study of single genes and proteins. With a completely sequenced and well-annotated genome, a fully characterized cell lineage, and powerful tools available to dissect development, Caenorhabditis elegans, among metazoans, provides an optimal system to bridge cellular and organismal biology with the global properties of macromolecular networks. This chapter considers omic technologies available for C. elegans to describe molecular networks--encompassing transcriptional and phenotypic profiling as well as physical interaction mapping--and discusses how their individual and integrated applications are paving the way for a network-level understanding of C. elegans biology. PMID:18050437
BioLayout(Java): versatile network visualisation of structural and functional relationships.
Goldovsky, Leon; Cases, Ildefonso; Enright, Anton J; Ouzounis, Christos A
2005-01-01
Visualisation of biological networks is becoming a common task for the analysis of high-throughput data. These networks correspond to a wide variety of biological relationships, such as sequence similarity, metabolic pathways, gene regulatory cascades and protein interactions. We present a general approach for the representation and analysis of networks of variable type, size and complexity. The application is based on the original BioLayout program (C-language implementation of the Fruchterman-Reingold layout algorithm), entirely re-written in Java to guarantee portability across platforms. BioLayout(Java) provides broader functionality, various analysis techniques, extensions for better visualisation and a new user interface. Examples of analysis of biological networks using BioLayout(Java) are presented.
Introduction to focus issue: quantitative approaches to genetic networks.
Albert, Réka; Collins, James J; Glass, Leon
2013-06-01
All cells of living organisms contain similar genetic instructions encoded in the organism's DNA. In any particular cell, the control of the expression of each different gene is regulated, in part, by binding of molecular complexes to specific regions of the DNA. The molecular complexes are composed of protein molecules, called transcription factors, combined with various other molecules such as hormones and drugs. Since transcription factors are coded by genes, cellular function is partially determined by genetic networks. Recent research is making large strides to understand both the structure and the function of these networks. Further, the emerging discipline of synthetic biology is engineering novel gene circuits with specific dynamic properties to advance both basic science and potential practical applications. Although there is not yet a universally accepted mathematical framework for studying the properties of genetic networks, the strong analogies between the activation and inhibition of gene expression and electric circuits suggest frameworks based on logical switching circuits. This focus issue provides a selection of papers reflecting current research directions in the quantitative analysis of genetic networks. The work extends from molecular models for the binding of proteins, to realistic detailed models of cellular metabolism. Between these extremes are simplified models in which genetic dynamics are modeled using classical methods of systems engineering, Boolean switching networks, differential equations that are continuous analogues of Boolean switching networks, and differential equations in which control is based on power law functions. 
The mathematical techniques are applied to study: (i) naturally occurring gene networks in living organisms including: cyanobacteria, Mycoplasma genitalium, fruit flies, immune cells in mammals; (ii) synthetic gene circuits in Escherichia coli and yeast; and (iii) electronic circuits modeling genetic networks using field-programmable gate arrays. Mathematical analyses will be essential for understanding naturally occurring genetic networks in diverse organisms and for providing a foundation for the improved development of synthetic genetic networks.
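The Boolean switching networks mentioned in the abstract above can be illustrated with a very small example. This is only a sketch under stated assumptions: the two-gene mutual-repression "toggle switch", the gene names `a` and `b`, and the synchronous update rule are hypothetical choices for illustration and are not taken from any paper in the focus issue.

```python
from itertools import product

# Hypothetical two-gene toggle switch: each gene is ON only if the other is OFF.
RULES = {
    "a": lambda state: not state["b"],
    "b": lambda state: not state["a"],
}


def step(state):
    """Synchronously update every gene from the current state."""
    return {gene: rule(state) for gene, rule in RULES.items()}


def fixed_points():
    """Enumerate all states and keep those that the update leaves unchanged."""
    genes = sorted(RULES)
    points = []
    for values in product([False, True], repeat=len(genes)):
        state = dict(zip(genes, values))
        if step(state) == state:
            points.append(state)
    return points


stable_states = fixed_points()
```

The two fixed points (only `a` ON, or only `b` ON) correspond to the two stable expression states of a bistable switch, while the states with both genes OFF or both ON map onto each other under synchronous updating.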
Exploring G protein-coupled receptor signaling networks using SILAC-based phosphoproteomics
Williams, Grace R.; Bethard, Jennifer R.; Berkaw, Mary N.; Nagel, Alexis K.; Luttrell, Louis M.; Ball, Lauren E.
2015-01-01
The type 1 parathyroid hormone receptor (PTH1R) is a key regulator of calcium homeostasis and bone turnover. Here, we employed SILAC-based quantitative mass spectrometry combined with bioinformatic pathways analysis to examine global changes in protein phosphorylation following short-term stimulation of endogenously expressed PTH1R in osteoblastic cells in vitro. Following 5 min exposure to the conventional agonist, PTH(1-34), we detected significant changes in the phosphorylation of 224 distinct proteins. Kinase substrate motif enrichment demonstrated that consensus motifs for PKA and CAMK2 were the most heavily upregulated within the phosphoproteome, while consensus motifs for mitogen-activated protein kinases were strongly downregulated. Signaling pathways analysis identified ERK1/2 and AKT as important nodal kinases in the downstream network and revealed strong regulation of small GTPases involved in cytoskeletal rearrangement, cell motility, and focal adhesion complex signaling. Our data illustrate the utility of quantitative mass spectrometry in measuring dynamic changes in protein phosphorylation following GPCR activation. PMID:26160508