SA-Mot: a web server for the identification of motifs of interest extracted from protein loops
Regad, Leslie; Saladin, Adrien; Maupetit, Julien; Geneix, Colette; Camproux, Anne-Claude
2011-01-01
The detection of functional motifs is an important step for the determination of protein functions. We present here a new web server SA-Mot (Structural Alphabet Motif) for the extraction and location of structural motifs of interest from protein loops. Contrary to other methods, SA-Mot does not focus only on functional motifs, but it extracts recurrent and conserved structural motifs involved in structural redundancy of loops. SA-Mot uses the structural word notion to extract all structural motifs from uni-dimensional sequences corresponding to loop structures. Then, SA-Mot provides a description of these structural motifs using statistics computed in the loop data set and in SCOP superfamily, sequence and structural parameters. SA-Mot results correspond to an interactive table listing all structural motifs extracted from a target structure and their associated descriptors. Using this information, the users can easily locate loop regions that are important for the protein folding and function. The SA-Mot web server is available at http://sa-mot.mti.univ-paris-diderot.fr. PMID:21665924
SA-Mot: a web server for the identification of motifs of interest extracted from protein loops.
Regad, Leslie; Saladin, Adrien; Maupetit, Julien; Geneix, Colette; Camproux, Anne-Claude
2011-07-01
The detection of functional motifs is an important step for the determination of protein functions. We present here a new web server SA-Mot (Structural Alphabet Motif) for the extraction and location of structural motifs of interest from protein loops. Contrary to other methods, SA-Mot does not focus only on functional motifs, but it extracts recurrent and conserved structural motifs involved in structural redundancy of loops. SA-Mot uses the structural word notion to extract all structural motifs from uni-dimensional sequences corresponding to loop structures. Then, SA-Mot provides a description of these structural motifs using statistics computed in the loop data set and in SCOP superfamily, sequence and structural parameters. SA-Mot results correspond to an interactive table listing all structural motifs extracted from a target structure and their associated descriptors. Using this information, the users can easily locate loop regions that are important for the protein folding and function. The SA-Mot web server is available at http://sa-mot.mti.univ-paris-diderot.fr.
Efficacy of function specific 3D-motifs in enzyme classification according to their EC-numbers.
Rahimi, Amir; Madadkar-Sobhani, Armin; Touserkani, Rouzbeh; Goliaei, Bahram
2013-11-07
Due to the increasing number of protein structures with unknown function originated from structural genomics projects, protein function prediction has become an important subject in bioinformatics. Among diverse function prediction methods, exploring known 3D-motifs, which are associated with functional elements in unknown protein structures is one of the most biologically meaningful methods. Homologous enzymes inherit such motifs in their active sites from common ancestors. However, slight differences in the properties of these motifs, results in variation in the reactions and substrates of the enzymes. In this study, we examined the possibility of discriminating highly related active site patterns according to their EC-numbers by 3D-motifs. For each EC-number, the spatial arrangement of an active site, which has minimum average distance to other active sites with the same function, was selected as a representative 3D-motif. In order to characterize the motifs, various points in active site elements were tested. The results demonstrated the possibility of predicting full EC-number of enzymes by 3D-motifs. However, the discriminating power of 3D-motifs varies among different enzyme families and depends on selecting the appropriate points and features. © 2013 Elsevier Ltd. All rights reserved.
Automatic annotation of protein motif function with Gene Ontology terms.
Lu, Xinghua; Zhai, Chengxiang; Gopalakrishnan, Vanathi; Buchanan, Bruce G
2004-09-02
Conserved protein sequence motifs are short stretches of amino acid sequence patterns that potentially encode the function of proteins. Several sequence pattern searching algorithms and programs exist foridentifying candidate protein motifs at the whole genome level. However, a much needed and important task is to determine the functions of the newly identified protein motifs. The Gene Ontology (GO) project is an endeavor to annotate the function of genes or protein sequences with terms from a dynamic, controlled vocabulary and these annotations serve well as a knowledge base. This paper presents methods to mine the GO knowledge base and use the association between the GO terms assigned to a sequence and the motifs matched by the same sequence as evidence for predicting the functions of novel protein motifs automatically. The task of assigning GO terms to protein motifs is viewed as both a binary classification and information retrieval problem, where PROSITE motifs are used as samples for mode training and functional prediction. The mutual information of a motif and aGO term association is found to be a very useful feature. We take advantage of the known motifs to train a logistic regression classifier, which allows us to combine mutual information with other frequency-based features and obtain a probability of correct association. The trained logistic regression model has intuitively meaningful and logically plausible parameter values, and performs very well empirically according to our evaluation criteria. In this research, different methods for automatic annotation of protein motifs have been investigated. Empirical result demonstrated that the methods have a great potential for detecting and augmenting information about the functions of newly discovered candidate protein motifs.
Ahnert, S E; Fink, T M A
2016-07-01
Network motifs have been studied extensively over the past decade, and certain motifs, such as the feed-forward loop, play an important role in regulatory networks. Recent studies have used Boolean network motifs to explore the link between form and function in gene regulatory networks and have found that the structure of a motif does not strongly determine its function, if this is defined in terms of the gene expression patterns the motif can produce. Here, we offer a different, higher-level definition of the 'function' of a motif, in terms of two fundamental properties of its dynamical state space as a Boolean network. One is the basin entropy, which is a complexity measure of the dynamics of Boolean networks. The other is the diversity of cyclic attractor lengths that a given motif can produce. Using these two measures, we examine all 104 topologically distinct three-node motifs and show that the structural properties of a motif, such as the presence of feedback loops and feed-forward loops, predict fundamental characteristics of its dynamical state space, which in turn determine aspects of its functional versatility. We also show that these higher-level properties have a direct bearing on real regulatory networks, as both basin entropy and cycle length diversity show a close correspondence with the prevalence, in neural and genetic regulatory networks, of the 13 connected motifs without self-interactions that have been studied extensively in the literature. © 2016 The Authors.
NASA Astrophysics Data System (ADS)
Fernandez-Chamorro, Javier; Lozano, Gloria; Garcia-Martin, Juan Antonio; Ramajo, Jorge; Dotu, Ivan; Clote, Peter; Martinez-Salas, Encarnacion
2016-04-01
The function of Internal Ribosome Entry Site (IRES) elements is intimately linked to their RNA structure. Viral IRES elements are organized in modular domains consisting of one or more stem-loops that harbor conserved RNA motifs critical for internal initiation of translation. A conserved motif is the pyrimidine-tract located upstream of the functional initiation codon in type I and II picornavirus IRES. By computationally designing synthetic RNAs to fold into a structure that sequesters the polypyrimidine tract in a hairpin, we establish a correlation between predicted inaccessibility of the pyrimidine tract and IRES activity, as determined in both in vitro and in vivo systems. Our data supports the hypothesis that structural sequestration of the pyrimidine-tract within a stable hairpin inactivates IRES activity, since the stronger the stability of the hairpin the higher the inhibition of protein synthesis. Destabilization of the stem-loop immediately upstream of the pyrimidine-tract also decreases IRES activity. Our work introduces a hybrid computational/experimental method to determine the importance of structural motifs for biological function. Specifically, we show the feasibility of using the software RNAiFold to design synthetic RNAs with particular sequence and structural motifs that permit subsequent experimental determination of the importance of such motifs for biological function.
Gorochowski, Thomas E; Grierson, Claire S; di Bernardo, Mario
2018-03-01
Network motifs are significantly overrepresented subgraphs that have been proposed as building blocks for natural and engineered networks. Detailed functional analysis has been performed for many types of motif in isolation, but less is known about how motifs work together to perform complex tasks. To address this issue, we measure the aggregation of network motifs via methods that extract precisely how these structures are connected. Applying this approach to a broad spectrum of networked systems and focusing on the widespread feed-forward loop motif, we uncover striking differences in motif organization. The types of connection are often highly constrained, differ between domains, and clearly capture architectural principles. We show how this information can be used to effectively predict functionally important nodes in the metabolic network of Escherichia coli . Our findings have implications for understanding how networked systems are constructed from motif parts and elucidate constraints that guide their evolution.
Grierson, Claire S.
2018-01-01
Network motifs are significantly overrepresented subgraphs that have been proposed as building blocks for natural and engineered networks. Detailed functional analysis has been performed for many types of motif in isolation, but less is known about how motifs work together to perform complex tasks. To address this issue, we measure the aggregation of network motifs via methods that extract precisely how these structures are connected. Applying this approach to a broad spectrum of networked systems and focusing on the widespread feed-forward loop motif, we uncover striking differences in motif organization. The types of connection are often highly constrained, differ between domains, and clearly capture architectural principles. We show how this information can be used to effectively predict functionally important nodes in the metabolic network of Escherichia coli. Our findings have implications for understanding how networked systems are constructed from motif parts and elucidate constraints that guide their evolution. PMID:29670941
Chilton, Scott S; Falbel, Tanya G; Hromada, Susan; Burton, Briana M
2017-08-01
Genetic competence is a process in which cells are able to take up DNA from their environment, resulting in horizontal gene transfer, a major mechanism for generating diversity in bacteria. Many bacteria carry homologs of the central DNA uptake machinery that has been well characterized in Bacillus subtilis It has been postulated that the B. subtilis competence helicase ComFA belongs to the DEAD box family of helicases/translocases. Here, we made a series of mutants to analyze conserved amino acid motifs in several regions of B. subtilis ComFA. First, we confirmed that ComFA activity requires amino acid residues conserved among the DEAD box helicases, and second, we show that a zinc finger-like motif consisting of four cysteines is required for efficient transformation. Each cysteine in the motif is important, and mutation of at least two of the cysteines dramatically reduces transformation efficiency. Further, combining multiple cysteine mutations with the helicase mutations shows an additive phenotype. Our results suggest that the helicase and metal binding functions are two distinct activities important for ComFA function during transformation. IMPORTANCE ComFA is a highly conserved protein that has a role in DNA uptake during natural competence, a mechanism for horizontal gene transfer observed in many bacteria. Investigation of the details of the DNA uptake mechanism is important for understanding the ways in which bacteria gain new traits from their environment, such as drug resistance. To dissect the role of ComFA in the DNA uptake machinery, we introduced point mutations into several motifs in the protein sequence. We demonstrate that several amino acid motifs conserved among ComFA proteins are important for efficient transformation. This report is the first to demonstrate the functional requirement of an amino-terminal cysteine motif in ComFA. Copyright © 2017 American Society for Microbiology.
Verma, Anjali; Rajagopalan, Pavithra; Lotke, Rishikesh; Varghese, Rebu; Selvam, Deepak; Kundu, Tapas K.
2016-01-01
ABSTRACT Of the various genetic subtypes of human immunodeficiency virus types 1 and 2 (HIV-1 and HIV-2) and simian immunodeficiency virus (SIV), only in subtype C of HIV-1 is a genetically variant NF-κB binding site found at the core of the viral promoter in association with a subtype-specific Sp1III motif. How the subtype-associated variations in the core transcription factor binding sites (TFBS) influence gene expression from the viral promoter has not been examined previously. Using panels of infectious viral molecular clones, we demonstrate that subtype-specific NF-κB and Sp1III motifs have evolved for optimal gene expression, and neither of the motifs can be replaced by a corresponding TFBS variant. The variant NF-κB motif binds NF-κB with an affinity 2-fold higher than that of the generic NF-κB site. Importantly, in the context of an infectious virus, the subtype-specific Sp1III motif demonstrates a profound loss of function in association with the generic NF-κB motif. An additional substitution of the Sp1III motif fully restores viral replication, suggesting that the subtype C-specific Sp1III has evolved to function with the variant, but not generic, NF-κB motif. A change of only two base pairs in the central NF-κB motif completely suppresses viral transcription from the provirus and converts the promoter into heterochromatin refractory to tumor necrosis factor alpha (TNF-α) induction. The present work represents the first demonstration of functional incompatibility between an otherwise functional NF-κB motif and a unique Sp1 site in the context of an HIV-1 promoter. Our work provides important leads as to the evolution of the HIV-1 subtype C viral promoter with relevance for gene expression regulation and viral latency. IMPORTANCE Subtype-specific genetic variations provide a powerful tool to examine how these variations offer a replication advantage to specific viral subtypes, if any. Only in subtype C of HIV-1 are two genetically distinct transcription factor binding sites positioned at the most critical location of the viral promoter. Since a single promoter regulates viral gene expression, the promoter variations can play a critical role in determining the replication fitness of the viral strains. Our work for the first time provides a scientific explanation for the presence of a unique NF-κB binding motif in subtype C, a major HIV-1 genetic family responsible for half of the global HIV-1 infections. The results offer compelling evidence that the subtype C viral promoter not only is stronger but also is endowed with a qualitative gain-of-function advantage. The genetically variant NF-κB and the Sp1III motifs may be respond differently to specific cell signal pathways, and these mechanisms must be examined. PMID:27194770
Yan, Shuo; Wang, Zhongni; Liu, Yuan; Li, Wei; Wu, Feng; Lin, Xuelei; Meng, Zheng
2015-07-01
Late stage pollen-specific promoters are important tools in crop molecular breeding. Several such promoters, and their functional motifs, have been well characterized in dicotyledonous plants such as tomato and tobacco. However, knowledge about the functional architecture of such promoters is limited in the monocotyledonous plant rice. Here, pollen-late-stage-promoter 1 (PLP1) and pollen-late-stage-promoter 2 (PLP2) were characterized using a stable transformation system in rice. Histochemical staining showed that the two promoters exclusively drive GUS expression in late-stage pollen grains in rice. 5' deletion analysis revealed that four regions, including the -1159 to -720 and the -352 to -156 regions of PLP1 and the -740 to -557 and the -557 to -339 regions of PLP2, are important in maintaining the activity and specificity of these promoters. Motif mutation analysis indicated that 'AGAAA' and 'CAAT' motifs in the -740 to -557 region of PLP2 act as enhancers in the promoter. Gain of function experiments indicated that the novel TA-rich motif 'TACATAA' and 'TATTCAT' in the core region of the PLP1 and PLP2 promoters is necessary, but not sufficient, for pollen-specific expression in rice. Our results provide evidence that the enhancer motif 'AGAAA' is conserved in the pollen-specific promoters of both monocots and eudicots, but that some functional architecture characteristics are different.
Busk, Peter Kamp; Lange, Lene
2013-06-01
Functional prediction of carbohydrate-active enzymes is difficult due to low sequence identity. However, similar enzymes often share a few short motifs, e.g., around the active site, even when the overall sequences are very different. To exploit this notion for functional prediction of carbohydrate-active enzymes, we developed a simple algorithm, peptide pattern recognition (PPR), that can divide proteins into groups of sequences that share a set of short conserved sequences. When this method was used on 118 glycoside hydrolase 5 proteins with 9% average pairwise identity and representing four characterized enzymatic functions, 97% of the proteins were sorted into groups correlating with their enzymatic activity. Furthermore, we analyzed 8,138 glycoside hydrolase 13 proteins including 204 experimentally characterized enzymes with 28 different functions. There was a 91% correlation between group and enzyme activity. These results indicate that the function of carbohydrate-active enzymes can be predicted with high precision by finding short, conserved motifs in their sequences. The glycoside hydrolase 61 family is important for fungal biomass conversion, but only a few proteins of this family have been functionally characterized. Interestingly, PPR divided 743 glycoside hydrolase 61 proteins into 16 subfamilies useful for targeted investigation of the function of these proteins and pinpointed three conserved motifs with putative importance for enzyme activity. Furthermore, the conserved sequences were useful for cloning of new, subfamily-specific glycoside hydrolase 61 proteins from 14 fungi. In conclusion, identification of conserved sequence motifs is a new approach to sequence analysis that can predict carbohydrate-active enzyme functions with high precision.
Occurrence probability of structured motifs in random sequences.
Robin, S; Daudin, J-J; Richard, H; Sagot, M-F; Schbath, S
2002-01-01
The problem of extracting from a set of nucleic acid sequences motifs which may have biological function is more and more important. In this paper, we are interested in particular motifs that may be implicated in the transcription process. These motifs, called structured motifs, are composed of two ordered parts separated by a variable distance and allowing for substitutions. In order to assess their statistical significance, we propose approximations of the probability of occurrences of such a structured motif in a given sequence. An application of our method to evaluate candidate promoters in E. coli and B. subtilis is presented. Simulations show the goodness of the approximations.
Anion induced conformational preference of Cα NN motif residues in functional proteins.
Patra, Piya; Ghosh, Mahua; Banerjee, Raja; Chakrabarti, Jaydeb
2017-12-01
Among different ligand binding motifs, anion binding C α NN motif consisting of peptide backbone atoms of three consecutive residues are observed to be important for recognition of free anions, like sulphate or biphosphate and participate in different key functions. Here we study the interaction of sulphate and biphosphate with C α NN motif present in different proteins. Instead of total protein, a peptide fragment has been studied keeping C α NN motif flanked in between other residues. We use classical force field based molecular dynamics simulations to understand the stability of this motif. Our data indicate fluctuations in conformational preferences of the motif residues in absence of the anion. The anion gives stability to one of these conformations. However, the anion induced conformational preferences are highly sequence dependent and specific to the type of anion. In particular, the polar residues are more favourable compared to the other residues for recognising the anion. © 2017 Wiley Periodicals, Inc.
Computational study of stability of an H-H-type pseudoknot motif.
Wang, Jun; Zhao, Yunjie; Wang, Jian; Xiao, Yi
2015-12-01
Motifs in RNA tertiary structures are important to their structural organizations and biological functions. Here we consider an H-H-type pseudoknot (HHpk) motif that consists of two hairpins connected by a junction loop and with kissing interactions between the two hairpin loops. Such a tertiary structural motif is recurrently found in RNA tertiary structures, but is difficult to predict computationally. So it is important to understand the mechanism of its formation and stability. Here we investigate the stability of the HHpk tertiary structure by using an all-atom molecular dynamics simulation. The results indicate that the HHpk tertiary structure is stable. However, it is found that this stability is not due to the helix-helix packing, as is usually expected, but is maintained by the combined action of the kissing hairpin loops and junctions, although the former plays the main role. Stable HHpk motifs may form structural platforms for the molecules to realize their biological functions. These results are useful for understanding the construction principle of RNA tertiary structures and structure prediction.
Gaji, Rajshekhar Y; Howe, Daniel K
2009-07-01
The apicomplexan parasite Sarcocystis neurona undergoes a complex process of intracellular development, during which many genes are temporally regulated. The described study was undertaken to begin identifying the basic promoter elements that control gene expression in S. neurona. Sequence analysis of the 5'-flanking region of five S. neurona genes revealed a conserved heptanucleotide motif GAGACGC that is similar to the WGAGACG motif described upstream of multiple genes in Toxoplasma gondii. The promoter region for the major surface antigen gene SnSAG1, which contains three heptanucleotide motifs within 135 bases of the transcription start site, was dissected by functional analysis using a dual luciferase reporter assay. These analyses revealed that a minimal promoter fragment containing all three motifs was sufficient to drive reporter molecule expression, with the presence and orientation of the 5'-most heptanucleotide motif being absolutely critical for promoter function. Further studies should help to identify additional sequence elements important for promoter function and for controlling gene expression during intracellular development by this apicomplexan pathogen.
ELM server: a new resource for investigating short functional sites in modular eukaryotic proteins
Puntervoll, Pål; Linding, Rune; Gemünd, Christine; Chabanis-Davidson, Sophie; Mattingsdal, Morten; Cameron, Scott; Martin, David M. A.; Ausiello, Gabriele; Brannetti, Barbara; Costantini, Anna; Ferrè, Fabrizio; Maselli, Vincenza; Via, Allegra; Cesareni, Gianni; Diella, Francesca; Superti-Furga, Giulio; Wyrwicz, Lucjan; Ramu, Chenna; McGuigan, Caroline; Gudavalli, Rambabu; Letunic, Ivica; Bork, Peer; Rychlewski, Leszek; Küster, Bernhard; Helmer-Citterich, Manuela; Hunter, William N.; Aasland, Rein; Gibson, Toby J.
2003-01-01
Multidomain proteins predominate in eukaryotic proteomes. Individual functions assigned to different sequence segments combine to create a complex function for the whole protein. While on-line resources are available for revealing globular domains in sequences, there has hitherto been no comprehensive collection of small functional sites/motifs comparable to the globular domain resources, yet these are as important for the function of multidomain proteins. Short linear peptide motifs are used for cell compartment targeting, protein–protein interaction, regulation by phosphorylation, acetylation, glycosylation and a host of other post-translational modifications. ELM, the Eukaryotic Linear Motif server at http://elm.eu.org/, is a new bioinformatics resource for investigating candidate short non-globular functional motifs in eukaryotic proteins, aiming to fill the void in bioinformatics tools. Sequence comparisons with short motifs are difficult to evaluate because the usual significance assessments are inappropriate. Therefore the server is implemented with several logical filters to eliminate false positives. Current filters are for cell compartment, globular domain clash and taxonomic range. In favourable cases, the filters can reduce the number of retained matches by an order of magnitude or more. PMID:12824381
SVM2Motif—Reconstructing Overlapping DNA Sequence Motifs by Mimicking an SVM Predictor
Vidovic, Marina M. -C.; Görnitz, Nico; Müller, Klaus-Robert; Rätsch, Gunnar; Kloft, Marius
2015-01-01
Identifying discriminative motifs underlying the functionality and evolution of organisms is a major challenge in computational biology. Machine learning approaches such as support vector machines (SVMs) achieve state-of-the-art performances in genomic discrimination tasks, but—due to its black-box character—motifs underlying its decision function are largely unknown. As a remedy, positional oligomer importance matrices (POIMs) allow us to visualize the significance of position-specific subsequences. Although being a major step towards the explanation of trained SVM models, they suffer from the fact that their size grows exponentially in the length of the motif, which renders their manual inspection feasible only for comparably small motif sizes, typically k ≤ 5. In this work, we extend the work on positional oligomer importance matrices, by presenting a new machine-learning methodology, entitled motifPOIM, to extract the truly relevant motifs—regardless of their length and complexity—underlying the predictions of a trained SVM model. Our framework thereby considers the motifs as free parameters in a probabilistic model, a task which can be phrased as a non-convex optimization problem. The exponential dependence of the POIM size on the oligomer length poses a major numerical challenge, which we address by an efficient optimization framework that allows us to find possibly overlapping motifs consisting of up to hundreds of nucleotides. We demonstrate the efficacy of our approach on a synthetic data set as well as a real-world human splice site data set. PMID:26690911
Knabe, Johannes F; Nehaniv, Chrystopher L; Schilstra, Maria J
2008-01-01
Methods that analyse the topological structure of networks have recently become quite popular. Whether motifs (subgraph patterns that occur more often than in randomized networks) have specific functions as elementary computational circuits has been cause for debate. As the question is difficult to resolve with currently available biological data, we approach the issue using networks that abstractly model natural genetic regulatory networks (GRNs) which are evolved to show dynamical behaviors. Specifically one group of networks was evolved to be capable of exhibiting two different behaviors ("differentiation") in contrast to a group with a single target behavior. In both groups we find motif distribution differences within the groups to be larger than differences between them, indicating that evolutionary niches (target functions) do not necessarily mold network structure uniquely. These results show that variability operators can have a stronger influence on network topologies than selection pressures, especially when many topologies can create similar dynamics. Moreover, analysis of motif functional relevance by lesioning did not suggest that motifs were of greater importance to the functioning of the network than arbitrary subgraph patterns. Only when drastically restricting network size, so that one motif corresponds to a whole functionally evolved network, was preference for particular connection patterns found. This suggests that in non-restricted, bigger networks, entanglement with the rest of the network hinders topological subgraph analysis.
Evolutionary analysis of FAM83H in vertebrates.
Huang, Wushuang; Yang, Mei; Wang, Changning; Song, Yaling
2017-01-01
Amelogenesis imperfecta is a group of disorders causing abnormalities in enamel formation in various phenotypes. Many mutations in the FAM83H gene have been identified to result in autosomal dominant hypocalcified amelogenesis imperfecta in different populations. However, the structure and function of FAM83H and its pathological mechanism have yet to be further explored. Evolutionary analysis is an alternative for revealing residues or motifs that are important for protein function. In the present study, we chose 50 vertebrate species in public databases representative of approximately 230 million years of evolution, including 1 amphibian, 2 fishes, 7 sauropsidas and 40 mammals, and we performed evolutionary analysis on the FAM83H protein. By sequence alignment, conserved residues and motifs were indicated, and the loss of important residues and motifs of five special species (Malayan pangolin, platypus, minke whale, nine-banded armadillo and aardvark) was discovered. A phylogenetic time tree showed the FAM83H divergent process. Positive selection sites in the C-terminus suggested that the C-terminus of FAM83H played certain adaptive roles during evolution. The results confirmed some important motifs reported in previous findings and identified some new highly conserved residues and motifs that need further investigation. The results suggest that the C-terminus of FAM83H contain key conserved regions critical to enamel formation and calcification.
The Methionine-aromatic Motif Plays a Unique Role in Stabilizing Protein Structure*
Valley, Christopher C.; Cembran, Alessandro; Perlmutter, Jason D.; Lewis, Andrew K.; Labello, Nicholas P.; Gao, Jiali; Sachs, Jonathan N.
2012-01-01
Of the 20 amino acids, the precise function of methionine (Met) remains among the least well understood. To establish a determining characteristic of methionine that fundamentally differentiates it from purely hydrophobic residues, we have used in vitro cellular experiments, molecular simulations, quantum calculations, and a bioinformatics screen of the Protein Data Bank. We show that approximately one-third of all known protein structures contain an energetically stabilizing Met-aromatic motif and, remarkably, that greater than 10,000 structures contain this motif more than 10 times. Critically, we show that as compared with a purely hydrophobic interaction, the Met-aromatic motif yields an additional stabilization of 1–1.5 kcal/mol. To highlight its importance and to dissect the energetic underpinnings of this motif, we have studied two clinically relevant TNF ligand-receptor complexes, namely TRAIL-DR5 and LTα-TNFR1. In both cases, we show that the motif is necessary for high affinity ligand binding as well as function. Additionally, we highlight previously overlooked instances of the motif in several disease-related Met mutations. Our results strongly suggest that the Met-aromatic motif should be exploited in the rational design of therapeutics targeting a range of proteins. PMID:22859300
Ham, Jong Hyun; Majerczak, Doris R; Nomura, Kinya; Mecey, Christy; Uribe, Francisco; He, Sheng-Yang; Mackey, David; Coplin, David L
2009-06-01
The broadly conserved AvrE-family of type III effectors from gram-negative plant-pathogenic bacteria includes important virulence factors, yet little is known about the mechanisms by which these effectors function inside plant cells to promote disease. We have identified two conserved motifs in AvrE-family effectors: a WxxxE motif and a putative C-terminal endoplasmic reticulum membrane retention/retrieval signal (ERMRS). The WxxxE and ERMRS motifs are both required for the virulence activities of WtsE and AvrE, which are major virulence factors of the corn pathogen Pantoea stewartii subsp. stewartii and the tomato or Arabidopsis pathogen Pseudomonas syringae pv. tomato, respectively. The WxxxE and the predicted ERMRS motifs are also required for other biological activities of WtsE, including elicitation of the hypersensitive response in nonhost plants and suppression of defense responses in Arabidopsis. A family of type III effectors from mammalian bacterial pathogens requires WxxxE and subcellular targeting motifs for virulence functions that involve their ability to mimic activated G-proteins. The conservation of related motifs and their necessity for the function of type III effectors from plant pathogens indicates that disturbing host pathways by mimicking activated host G-proteins may be a virulence mechanism employed by plant pathogens as well.
DMINDA: an integrated web server for DNA motif identification and analyses
Ma, Qin; Zhang, Hanyuan; Mao, Xizeng; Zhou, Chuan; Liu, Bingqiang; Chen, Xin; Xu, Ying
2014-01-01
DMINDA (DNA motif identification and analyses) is an integrated web server for DNA motif identification and analyses, which is accessible at http://csbl.bmb.uga.edu/DMINDA/. This web site is freely available to all users and there is no login requirement. This server provides a suite of cis-regulatory motif analysis functions on DNA sequences, which are important to elucidation of the mechanisms of transcriptional regulation: (i) de novo motif finding for a given set of promoter sequences along with statistical scores for the predicted motifs derived based on information extracted from a control set, (ii) scanning motif instances of a query motif in provided genomic sequences, (iii) motif comparison and clustering of identified motifs, and (iv) co-occurrence analyses of query motifs in given promoter sequences. The server is powered by a backend computer cluster with over 150 computing nodes, and is particularly useful for motif prediction and analyses in prokaryotic genomes. We believe that DMINDA, as a new and comprehensive web server for cis-regulatory motif finding and analyses, will benefit the genomic research community in general and prokaryotic genome researchers in particular. PMID:24753419
Gibbs motif sampling: detection of bacterial outer membrane protein repeats.
Neuwald, A. F.; Liu, J. S.; Lawrence, C. E.
1995-01-01
The detection and alignment of locally conserved regions (motifs) in multiple sequences can provide insight into protein structure, function, and evolution. A new Gibbs sampling algorithm is described that detects motif-encoding regions in sequences and optimally partitions them into distinct motif models; this is illustrated using a set of immunoglobulin fold proteins. When applied to sequences sharing a single motif, the sampler can be used to classify motif regions into related submodels, as is illustrated using helix-turn-helix DNA-binding proteins. Other statistically based procedures are described for searching a database for sequences matching motifs found by the sampler. When applied to a set of 32 very distantly related bacterial integral outer membrane proteins, the sampler revealed that they share a subtle, repetitive motif. Although BLAST (Altschul SF et al., 1990, J Mol Biol 215:403-410) fails to detect significant pairwise similarity between any of the sequences, the repeats present in these outer membrane proteins, taken as a whole, are highly significant (based on a generally applicable statistical test for motifs described here). Analysis of bacterial porins with known trimeric beta-barrel structure and related proteins reveals a similar repetitive motif corresponding to alternating membrane-spanning beta-strands. These beta-strands occur on the membrane interface (as opposed to the trimeric interface) of the beta-barrel. The broad conservation and structural location of these repeats suggests that they play important functional roles. PMID:8520488
Cao, Yunpeng; Meng, Dandan; Abdullah, Muhammad; Jin, Qing; Lin, Yi; Cai, Yongping
2018-04-23
The VQ motif-containing gene, a member of the plant-specific genes, is involved in the plant developmental process and various stress responses. The VQ motif-containing gene family has been studied in several plants, such as rice ( Oryza sativa ), maize ( Zea mays ), and Arabidopsis ( Arabidopsis thaliana ). However, no systematic study has been performed in Pyrus species, which have important economic value. In our study, we identified 41 and 28 VQ motif-containing genes in Pyrus bretschneideri and Pyrus communis , respectively. Phylogenetic trees were calculated using A. thaliana and O. sativa VQ motif-containing genes as a template, allowing us to categorize these genes into nine subfamilies. Thirty-two and eight paralogous of VQ motif-containing genes were found in P. bretschneideri and P. communis , respectively, showing that the VQ motif-containing genes had a more remarkable expansion in P. bretschneideri than in P. communis . A total of 31 orthologous pairs were identified from the P. bretschneideri and P. communis VQ motif-containing genes. Additionally, among the paralogs, we found that these duplication gene pairs probably derived from segmental duplication/whole-genome duplication (WGD) events in the genomes of P. bretschneideri and P. communis , respectively. The gene expression profiles in both P. bretschneideri and P. communis fruits suggested functional redundancy for some orthologous gene pairs derived from a common ancestry, and sub-functionalization or neo-functionalization for some of them. Our study provided the first systematic evolutionary analysis of the VQ motif-containing genes in Pyrus , and highlighted the diversification and duplication of VQ motif-containing genes in both P. bretschneideri and P. communis .
McCune, Broc T; Tang, Wei; Lu, Jia; Eaglesham, James B; Thorne, Lucy; Mayer, Anne E; Condiff, Emily; Nice, Timothy J; Goodfellow, Ian; Krezel, Andrzej M; Virgin, Herbert W
2017-07-11
The Norovirus genus contains important human pathogens, but the role of host pathways in norovirus replication is largely unknown. Murine noroviruses provide the opportunity to study norovirus replication in cell culture and in small animals. The human norovirus nonstructural protein NS1/2 interacts with the host protein VAMP-associated protein A (VAPA), but the significance of the NS1/2-VAPA interaction is unexplored. Here we report decreased murine norovirus replication in VAPA- and VAPB-deficient cells. We characterized the role of VAPA in detail. VAPA was required for the efficiency of a step(s) in the viral replication cycle after entry of viral RNA into the cytoplasm but before the synthesis of viral minus-sense RNA. The interaction of VAPA with viral NS1/2 proteins is conserved between murine and human noroviruses. Murine norovirus NS1/2 directly bound the major sperm protein (MSP) domain of VAPA through its NS1 domain. Mutations within NS1 that disrupted interaction with VAPA inhibited viral replication. Structural analysis revealed that the viral NS1 domain contains a mimic of the phenylalanine-phenylalanine-acidic-tract (FFAT) motif that enables host proteins to bind to the VAPA MSP domain. The NS1/2-FFAT mimic region interacted with the VAPA-MSP domain in a manner similar to that seen with bona fide host FFAT motifs. Amino acids in the FFAT mimic region of the NS1 domain that are important for viral replication are highly conserved across murine norovirus strains. Thus, VAPA interaction with a norovirus protein that functionally mimics host FFAT motifs is important for murine norovirus replication. IMPORTANCE Human noroviruses are a leading cause of gastroenteritis worldwide, but host factors involved in norovirus replication are incompletely understood. Murine noroviruses have been studied to define mechanisms of norovirus replication. Here we defined the importance of the interaction between the hitherto poorly studied NS1/2 norovirus protein and the VAPA host protein. The NS1/2-VAPA interaction is conserved between murine and human noroviruses and was important for early steps in murine norovirus replication. Using structure-function analysis, we found that NS1/2 contains a short sequence that molecularly mimics the FFAT motif that is found in multiple host proteins that bind VAPA. This represents to our knowledge the first example of functionally important mimicry of a host FFAT motif by a microbial protein. Copyright © 2017 McCune et al.
DMINDA: an integrated web server for DNA motif identification and analyses.
Ma, Qin; Zhang, Hanyuan; Mao, Xizeng; Zhou, Chuan; Liu, Bingqiang; Chen, Xin; Xu, Ying
2014-07-01
DMINDA (DNA motif identification and analyses) is an integrated web server for DNA motif identification and analyses, which is accessible at http://csbl.bmb.uga.edu/DMINDA/. This web site is freely available to all users and there is no login requirement. This server provides a suite of cis-regulatory motif analysis functions on DNA sequences, which are important to elucidation of the mechanisms of transcriptional regulation: (i) de novo motif finding for a given set of promoter sequences along with statistical scores for the predicted motifs derived based on information extracted from a control set, (ii) scanning motif instances of a query motif in provided genomic sequences, (iii) motif comparison and clustering of identified motifs, and (iv) co-occurrence analyses of query motifs in given promoter sequences. The server is powered by a backend computer cluster with over 150 computing nodes, and is particularly useful for motif prediction and analyses in prokaryotic genomes. We believe that DMINDA, as a new and comprehensive web server for cis-regulatory motif finding and analyses, will benefit the genomic research community in general and prokaryotic genome researchers in particular. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Identifying DNA-binding proteins using structural motifs and the electrostatic potential
Shanahan, Hugh P.; Garcia, Mario A.; Jones, Susan; Thornton, Janet M.
2004-01-01
Robust methods to detect DNA-binding proteins from structures of unknown function are important for structural biology. This paper describes a method for identifying such proteins that (i) have a solvent accessible structural motif necessary for DNA-binding and (ii) a positive electrostatic potential in the region of the binding region. We focus on three structural motifs: helix–turn-helix (HTH), helix–hairpin–helix (HhH) and helix–loop–helix (HLH). We find that the combination of these variables detect 78% of proteins with an HTH motif, which is a substantial improvement over previous work based purely on structural templates and is comparable to more complex methods of identifying DNA-binding proteins. Similar true positive fractions are achieved for the HhH and HLH motifs. We see evidence of wide evolutionary diversity for DNA-binding proteins with an HTH motif, and much smaller diversity for those with an HhH or HLH motif. PMID:15356290
Papadopoulos, Dimitrios K.; Reséndez-Pérez, Diana; Cárdenas-Chávez, Diana L.; Villanueva-Segura, Karina; Canales-del-Castillo, Ricardo; Felix, Daniel A.; Fünfschilling, Raphael; Gehring, Walter J.
2011-01-01
Segmental identity along the anteroposterior axis of bilateral animals is specified by Hox genes. These genes encode transcription factors, harboring the conserved homeodomain and, generally, a YPWM motif, which binds Hox cofactors and increases Hox transcriptional specificity in vivo. Here we derive synthetic Drosophila Antennapedia genes, consisting only of the YPWM motif and homeodomain, and investigate their functional role throughout development. Synthetic peptides and full-length Antennapedia proteins cause head-to-thorax transformations in the embryo, as well as antenna-to-tarsus and eye-to-wing transformations in the adult, thus converting the entire head to a mesothorax. This conversion is achieved by repression of genes required for head and antennal development and ectopic activation of genes promoting thoracic and tarsal fates, respectively. Synthetic Antennapedia peptides bind DNA specifically and interact with Extradenticle and Bric-à-brac interacting protein 2 cofactors in vitro and ex vivo. Substitution of the YPWM motif by alanines abolishes Antennapedia homeotic function, whereas substitution of YPWM by the WRPW repressor motif, which binds the transcriptional corepressor Groucho, allows all proteins to act as repressors only. Finally, naturally occurring variations in the size of the linker between the homeodomain and YPWM motif enhance Antennapedia repressive or activating efficiency, emphasizing the importance of linker size, rather than sequence, for specificity. Our results clearly show that synthetic Antennapedia genes are functional in vivo and therefore provide powerful tools for synthetic biology. Moreover, the YPWM motif is necessary—whereas the entire N terminus of the protein is dispensable—for Antennapedia homeotic function, indicating its dual role in transcriptional activation and repression by recruiting either coactivators or corepressors. PMID:21712439
Effector prediction in host-pathogen interaction based on a Markov model of a ubiquitous EPIYA motif
2010-01-01
Background Effector secretion is a common strategy of pathogen in mediating host-pathogen interaction. Eight EPIYA-motif containing effectors have recently been discovered in six pathogens. Once these effectors enter host cells through type III/IV secretion systems (T3SS/T4SS), tyrosine in the EPIYA motif is phosphorylated, which triggers effectors binding other proteins to manipulate host-cell functions. The objectives of this study are to evaluate the distribution pattern of EPIYA motif in broad biological species, to predict potential effectors with EPIYA motif, and to suggest roles and biological functions of potential effectors in host-pathogen interactions. Results A hidden Markov model (HMM) of five amino acids was built for the EPIYA-motif based on the eight known effectors. Using this HMM to search the non-redundant protein database containing 9,216,047 sequences, we obtained 107,231 sequences with at least one EPIYA motif occurrence and 3115 sequences with multiple repeats of the EPIYA motif. Although the EPIYA motif exists among broad species, it is significantly over-represented in some particular groups of species. For those proteins containing at least four copies of EPIYA motif, most of them are from intracellular bacteria, extracellular bacteria with T3SS or T4SS or intracellular protozoan parasites. By combining the EPIYA motif and the adjacent SH2 binding motifs (KK, R4, Tarp and Tir), we built HMMs of nine amino acids and predicted many potential effectors in bacteria and protista by the HMMs. Some potential effectors for pathogens (such as Lawsonia intracellularis, Plasmodium falciparum and Leishmania major) are suggested. Conclusions Our study indicates that the EPIYA motif may be a ubiquitous functional site for effectors that play an important pathogenicity role in mediating host-pathogen interactions. We suggest that some intracellular protozoan parasites could secrete EPIYA-motif containing effectors through secretion systems similar to the T3SS/T4SS in bacteria. Our predicted effectors provide useful hypotheses for further studies. PMID:21143776
The C-terminal CGHC motif of protein disulfide isomerase supports thrombosis
Zhou, Junsong; Wu, Yi; Wang, Lu; Rauova, Lubica; Hayes, Vincent M.; Poncz, Mortimer; Essex, David W.
2015-01-01
Protein disulfide isomerase (PDI) has two distinct CGHC redox-active sites; however, the contribution of these sites during different physiologic reactions, including thrombosis, is unknown. Here, we evaluated the role of PDI and redox-active sites of PDI in thrombosis by generating mice with blood cells and vessel wall cells lacking PDI (Mx1-Cre Pdifl/fl mice) and transgenic mice harboring PDI that lacks a functional C-terminal CGHC motif [PDI(ss-oo) mice]. Both mouse models showed decreased fibrin deposition and platelet accumulation in laser-induced cremaster arteriole injury, and PDI(ss-oo) mice had attenuated platelet accumulation in FeCl3-induced mesenteric arterial injury. These defects were rescued by infusion of recombinant PDI containing only a functional C-terminal CGHC motif [PDI(oo-ss)]. PDI infusion restored fibrin formation, but not platelet accumulation, in eptifibatide-treated wild-type mice, suggesting a direct role of PDI in coagulation. In vitro aggregation of platelets from PDI(ss-oo) mice and PDI-null platelets was reduced; however, this defect was rescued by recombinant PDI(oo-ss). In human platelets, recombinant PDI(ss-oo) inhibited aggregation, while recombinant PDI(oo-ss) potentiated aggregation. Platelet secretion assays demonstrated that the C-terminal CGHC motif of PDI is important for P-selectin expression and ATP secretion through a non-αIIbβ3 substrate. In summary, our results indicate that the C-terminal CGHC motif of PDI is important for platelet function and coagulation. PMID:26529254
Genome Wide Identification, Evolutionary, and Expression Analysis of VQ Genes from Two Pyrus Species
Meng, Dandan; Abdullah, Muhammad; Jin, Qing; Lin, Yi; Cai, Yongping
2018-01-01
The VQ motif-containing gene, a member of the plant-specific genes, is involved in the plant developmental process and various stress responses. The VQ motif-containing gene family has been studied in several plants, such as rice (Oryza sativa), maize (Zea mays), and Arabidopsis (Arabidopsis thaliana). However, no systematic study has been performed in Pyrus species, which have important economic value. In our study, we identified 41 and 28 VQ motif-containing genes in Pyrus bretschneideri and Pyrus communis, respectively. Phylogenetic trees were calculated using A. thaliana and O. sativa VQ motif-containing genes as a template, allowing us to categorize these genes into nine subfamilies. Thirty-two and eight paralogous of VQ motif-containing genes were found in P. bretschneideri and P. communis, respectively, showing that the VQ motif-containing genes had a more remarkable expansion in P. bretschneideri than in P. communis. A total of 31 orthologous pairs were identified from the P. bretschneideri and P. communis VQ motif-containing genes. Additionally, among the paralogs, we found that these duplication gene pairs probably derived from segmental duplication/whole-genome duplication (WGD) events in the genomes of P. bretschneideri and P. communis, respectively. The gene expression profiles in both P. bretschneideri and P. communis fruits suggested functional redundancy for some orthologous gene pairs derived from a common ancestry, and sub-functionalization or neo-functionalization for some of them. Our study provided the first systematic evolutionary analysis of the VQ motif-containing genes in Pyrus, and highlighted the diversification and duplication of VQ motif-containing genes in both P. bretschneideri and P. communis. PMID:29690608
FoxG1 and TLE2 act cooperatively to regulate ventral telencephalon formation.
Roth, Martin; Bonev, Boyan; Lindsay, Jennefer; Lea, Robert; Panagiotaki, Niki; Houart, Corinne; Papalopulu, Nancy
2010-05-01
FoxG1 is a conserved transcriptional repressor that plays a key role in the specification, proliferation and differentiation of the telencephalon, and is expressed from the earliest stages of telencephalic development through to the adult. How the interaction with co-factors might influence the multiplicity and diversity of FoxG1 function is not known. Here, we show that interaction of FoxG1 with TLE2, a Xenopus tropicalis co-repressor of the Groucho/TLE family, is crucial for regulating the early activity of FoxG1. We show that TLE2 is co-expressed with FoxG1 in the ventral telencephalon from the early neural plate stage and functionally cooperates with FoxG1 in an ectopic neurogenesis assay. FoxG1 has two potential TLE binding sites: an N-terminal eh1 motif and a C-terminal YWPMSPF motif. Although direct binding seems to be mediated by the N-terminal motif, both motifs appear important for functional synergism. In the neurogenesis assay, mutation of either motif abolishes functional cooperation of TLE2 with FoxG1, whereas in the forebrain deletion of both motifs renders FoxG1 unable to induce the ventral telencephalic marker Nkx2.1. Knocking down either FoxG1 or TLE2 disrupts the development of the ventral telencephalon, supporting the idea that endogenous TLE2 and FoxG1 work together to specify the ventral telencephalon.
FoxG1 and TLE2 act cooperatively to regulate ventral telencephalon formation
Roth, Martin; Bonev, Boyan; Lindsay, Jennefer; Lea, Robert; Panagiotaki, Niki; Houart, Corinne; Papalopulu, Nancy
2010-01-01
FoxG1 is a conserved transcriptional repressor that plays a key role in the specification, proliferation and differentiation of the telencephalon, and is expressed from the earliest stages of telencephalic development through to the adult. How the interaction with co-factors might influence the multiplicity and diversity of FoxG1 function is not known. Here, we show that interaction of FoxG1 with TLE2, a Xenopus tropicalis co-repressor of the Groucho/TLE family, is crucial for regulating the early activity of FoxG1. We show that TLE2 is co-expressed with FoxG1 in the ventral telencephalon from the early neural plate stage and functionally cooperates with FoxG1 in an ectopic neurogenesis assay. FoxG1 has two potential TLE binding sites: an N-terminal eh1 motif and a C-terminal YWPMSPF motif. Although direct binding seems to be mediated by the N-terminal motif, both motifs appear important for functional synergism. In the neurogenesis assay, mutation of either motif abolishes functional cooperation of TLE2 with FoxG1, whereas in the forebrain deletion of both motifs renders FoxG1 unable to induce the ventral telencephalic marker Nkx2.1. Knocking down either FoxG1 or TLE2 disrupts the development of the ventral telencephalon, supporting the idea that endogenous TLE2 and FoxG1 work together to specify the ventral telencephalon. PMID:20356955
Biological network motif detection and evaluation
2011-01-01
Background Molecular level of biological data can be constructed into system level of data as biological networks. Network motifs are defined as over-represented small connected subgraphs in networks and they have been used for many biological applications. Since network motif discovery involves computationally challenging processes, previous algorithms have focused on computational efficiency. However, we believe that the biological quality of network motifs is also very important. Results We define biological network motifs as biologically significant subgraphs and traditional network motifs are differentiated as structural network motifs in this paper. We develop five algorithms, namely, EDGEGO-BNM, EDGEBETWEENNESS-BNM, NMF-BNM, NMFGO-BNM and VOLTAGE-BNM, for efficient detection of biological network motifs, and introduce several evaluation measures including motifs included in complex, motifs included in functional module and GO term clustering score in this paper. Experimental results show that EDGEGO-BNM and EDGEBETWEENNESS-BNM perform better than existing algorithms and all of our algorithms are applicable to find structural network motifs as well. Conclusion We provide new approaches to finding network motifs in biological networks. Our algorithms efficiently detect biological network motifs and further improve existing algorithms to find high quality structural network motifs, which would be impossible using existing algorithms. The performances of the algorithms are compared based on our new evaluation measures in biological contexts. We believe that our work gives some guidelines of network motifs research for the biological networks. PMID:22784624
Insights into Structural and Mechanistic Features of Viral IRES Elements
Martinez-Salas, Encarnacion; Francisco-Velilla, Rosario; Fernandez-Chamorro, Javier; Embarek, Azman M.
2018-01-01
Internal ribosome entry site (IRES) elements are cis-acting RNA regions that promote internal initiation of protein synthesis using cap-independent mechanisms. However, distinct types of IRES elements present in the genome of various RNA viruses perform the same function despite lacking conservation of sequence and secondary RNA structure. Likewise, IRES elements differ in host factor requirement to recruit the ribosomal subunits. In spite of this diversity, evolutionarily conserved motifs in each family of RNA viruses preserve sequences impacting on RNA structure and RNA–protein interactions important for IRES activity. Indeed, IRES elements adopting remarkable different structural organizations contain RNA structural motifs that play an essential role in recruiting ribosomes, initiation factors and/or RNA-binding proteins using different mechanisms. Therefore, given that a universal IRES motif remains elusive, it is critical to understand how diverse structural motifs deliver functions relevant for IRES activity. This will be useful for understanding the molecular mechanisms beyond cap-independent translation, as well as the evolutionary history of these regulatory elements. Moreover, it could improve the accuracy to predict IRES-like motifs hidden in genome sequences. This review summarizes recent advances on the diversity and biological relevance of RNA structural motifs for viral IRES elements. PMID:29354113
Cellular automata simulation of topological effects on the dynamics of feed-forward motifs
Apte, Advait A; Cain, John W; Bonchev, Danail G; Fong, Stephen S
2008-01-01
Background Feed-forward motifs are important functional modules in biological and other complex networks. The functionality of feed-forward motifs and other network motifs is largely dictated by the connectivity of the individual network components. While studies on the dynamics of motifs and networks are usually devoted to the temporal or spatial description of processes, this study focuses on the relationship between the specific architecture and the overall rate of the processes of the feed-forward family of motifs, including double and triple feed-forward loops. The search for the most efficient network architecture could be of particular interest for regulatory or signaling pathways in biology, as well as in computational and communication systems. Results Feed-forward motif dynamics were studied using cellular automata and compared with differential equation modeling. The number of cellular automata iterations needed for a 100% conversion of a substrate into a target product was used as an inverse measure of the transformation rate. Several basic topological patterns were identified that order the specific feed-forward constructions according to the rate of dynamics they enable. At the same number of network nodes and constant other parameters, the bi-parallel and tri-parallel motifs provide higher network efficacy than single feed-forward motifs. Additionally, a topological property of isodynamicity was identified for feed-forward motifs where different network architectures resulted in the same overall rate of the target production. Conclusion It was shown for classes of structural motifs with feed-forward architecture that network topology affects the overall rate of a process in a quantitatively predictable manner. These fundamental results can be used as a basis for simulating larger networks as combinations of smaller network modules with implications on studying synthetic gene circuits, small regulatory systems, and eventually dynamic whole-cell models. PMID:18304325
Kon, Shigeyuki; Nakayama, Yosuke; Matsumoto, Naoki; Ito, Koyu; Kanayama, Masashi; Kimura, Chiemi; Kouro, Hitomi; Ashitomi, Dai; Matsuda, Tadashi; Uede, Toshimitsu
2014-01-01
Osteopontin (OPN) is a multifunctional protein that has been linked to various intractable inflammatory diseases. One way by which OPN induces inflammation is the production of various functional fragments by enzyme cleavage. It has been well appreciated that OPN is cleaved by thrombin, and/or matrix metalloproteinase-3 and -7 (MMP-3/7). Although the function of thrombin-cleaved OPN is well characterized, little is known about the function of MMP-3/7-cleaved OPN. In this study, we found a novel motif, LRSKSRSFQVSDEQY, in the C-terminal fragment of MMP-3/7-cleaved mouse OPN binds to α9β1 integrin. Importantly, this novel motif is involved in the development of anti-type II collagen antibody-induced arthritis (CAIA). This study provides the first in vitro and in vivo evidence that OPN cleavage by MMP-3/7 is an important regulatory mechanism for CAIA. PMID:25545242
Green oxidations of furans--initiated by molecular oxygen--that give key natural product motifs.
Montagnon, Tamsyn; Noutsias, Dimitris; Alexopoulou, Ioanna; Tofi, Maria; Vassilikogiannakis, Georgios
2011-04-07
In this article, we explore how changes in the positioning of pendant hydroxyl functionalities in the photooxygenation substrate dramatically alter the course of furan oxidations that are initiated by singlet oxygen; and, how these different reactivities can be harnessed through cascade reaction sequences to access, rapidly and effectively, a broad range of important natural product motifs.
NASA Astrophysics Data System (ADS)
Wei Poh, Zhong; Heng Gan, Chin; Lee, Eric J.; Guo, Suxian; Yip, George W.; Lam, Yulin
2015-09-01
Glycosaminoglycans (GAGs) regulate many important physiological processes. A pertinent issue to address is whether GAGs encode important functional information via introduction of position specific sulfate groups in the GAG structure. However, procurement of pure, homogenous GAG motifs to probe the “sulfation code” is a challenging task due to isolation difficulty and structural complexity. To this end, we devised a versatile synthetic strategy to obtain all the 16 theoretically possible sulfation patterns in the chondroitin sulfate (CS) repeating unit; these include rare but potentially important sulfated motifs which have not been isolated earlier. Biological evaluation indicated that CS sulfation patterns had differing effects for different breast cancer cell types, and the greatest inhibitory effect was observed for the most aggressive, triple negative breast cancer cell line MDA-MB-231.
Regad, Leslie; Martin, Juliette; Camproux, Anne-Claude
2011-06-20
One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet), which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i) ubiquitous motifs, shared by several superfamilies and (ii) superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P) and SAH/SAM. Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins.
2011-01-01
Background One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Results Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet), which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i) ubiquitous motifs, shared by several superfamilies and (ii) superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P) and SAH/SAM. Conclusions Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins. PMID:21689388
Mining for class-specific motifs in protein sequence classification
2013-01-01
Background In protein sequence classification, identification of the sequence motifs or n-grams that can precisely discriminate between classes is a more interesting scientific question than the classification itself. A number of classification methods aim at accurate classification but fail to explain which sequence features indeed contribute to the accuracy. We hypothesize that sequences in lower denominations (n-grams) can be used to explore the sequence landscape and to identify class-specific motifs that discriminate between classes during classification. Discriminative n-grams are short peptide sequences that are highly frequent in one class but are either minimally present or absent in other classes. In this study, we present a new substitution-based scoring function for identifying discriminative n-grams that are highly specific to a class. Results We present a scoring function based on discriminative n-grams that can effectively discriminate between classes. The scoring function, initially, harvests the entire set of 4- to 8-grams from the protein sequences of different classes in the dataset. Similar n-grams of the same size are combined to form new n-grams, where the similarity is defined by positive amino acid substitution scores in the BLOSUM62 matrix. Substitution has resulted in a large increase in the number of discriminatory n-grams harvested. Due to the unbalanced nature of the dataset, the frequencies of the n-grams are normalized using a dampening factor, which gives more weightage to the n-grams that appear in fewer classes and vice-versa. After the n-grams are normalized, the scoring function identifies discriminative 4- to 8-grams for each class that are frequent enough to be above a selection threshold. By mapping these discriminative n-grams back to the protein sequences, we obtained contiguous n-grams that represent short class-specific motifs in protein sequences. Our method fared well compared to an existing motif finding method known as Wordspy. We have validated our enriched set of class-specific motifs against the functionally important motifs obtained from the NLSdb, Prosite and ELM databases. We demonstrate that this method is very generic; thus can be widely applied to detect class-specific motifs in many protein sequence classification tasks. Conclusion The proposed scoring function and methodology is able to identify class-specific motifs using discriminative n-grams derived from the protein sequences. The implementation of amino acid substitution scores for similarity detection, and the dampening factor to normalize the unbalanced datasets have significant effect on the performance of the scoring function. Our multipronged validation tests demonstrate that this method can detect class-specific motifs from a wide variety of protein sequence classes with a potential application to detecting proteome-specific motifs of different organisms. PMID:23496846
Functional interaction of proliferating cell nuclear antigen with MSH2-MSH6 and MSH2-MSH3 complexes.
Clark, A B; Valle, F; Drotschmann, K; Gary, R K; Kunkel, T A
2000-11-24
Eukaryotic DNA mismatch repair requires the concerted action of several proteins, including proliferating cell nuclear antigen (PCNA) and heterodimers of MSH2 complexed with either MSH3 or MSH6. Here we report that MSH3 and MSH6, but not MSH2, contain N-terminal sequence motifs characteristic of proteins that bind to PCNA. MSH3 and MSH6 peptides containing these motifs bound PCNA, as did the intact Msh2-Msh6 complex. This binding was strongly reduced when alanine was substituted for conserved residues in the motif. Yeast strains containing alanine substitutions in the PCNA binding motif of Msh6 or Msh3 had elevated mutation rates, indicating that these interactions are important for genome stability. When human MSH3 or MSH6 peptides containing the PCNA binding motif were added to a human cell extract, mismatch repair activity was inhibited at a step preceding DNA resynthesis. Thus, MSH3 and MSH6 interactions with PCNA may facilitate early steps in DNA mismatch repair and may also be important for other roles of these eukaryotic MutS homologs.
Song, Wen; Liu, Li; Wang, Jizong; Wu, Zhen; Zhang, Heqiao; Tang, Jiao; Lin, Guangzhong; Wang, Yichuan; Wen, Xing; Li, Wenyang; Han, Zhifu; Guo, Hongwei; Chai, Jijie
2016-06-01
Peptide-mediated cell-to-cell signaling has crucial roles in coordination and definition of cellular functions in plants. Peptide-receptor matching is important for understanding the mechanisms underlying peptide-mediated signaling. Here we report the structure-guided identification of root meristem growth factor (RGF) receptors important for plant development. An assay based on a signature ligand recognition motif (Arg-x-Arg) conserved in a subfamily of leucine-rich repeat receptor kinases (LRR-RKs) identified the functionally uncharacterized LRR-RK At4g26540 as a receptor of RGF1 (RGFR1). We further solved the crystal structure of RGF1 in complex with the LRR domain of RGFR1 at a resolution of 2.6 Å, which reveals that the Arg-x-Gly-Gly (RxGG) motif is responsible for specific recognition of the sulfate group of RGF1 by RGFR1. Based on the RxGG motif, we identified additional four RGFRs. Participation of the five RGFRs in RGF-induced signaling is supported by biochemical and genetic data. We also offer evidence showing that SERKs function as co-receptors for RGFs. Taken together, our study identifies RGF receptors and co-receptors that can link RGF signals with their downstream components and provides a proof of principle for structure-based matching of LRR-RKs with their peptide ligands.
Massive GGAAs in genomic repetitive sequences serve as a nuclear reservoir of NF-κB.
Wu, Jian; Wang, Qiao; Dai, Wei; Wang, Wei; Yue, Ming; Wang, Jinke
2018-04-13
Nuclear factor κB (NF-κB) is a DNA-binding transcription factor. Characterizing its genomic binding sites is crucial for understanding its gene regulatory function and mechanism in cells. This study characterized the binding sites of NF-κB RelA/p65 in the tumor neurosis factor-α (TNFα) stimulated HeLa cells by a precise chromatin immunoprecipitation-sequencing (ChIP-seq). The results revealed that NF-κB binds nontraditional motifs (nt-motifs) containing conserved GGAA quadruplet. Moreover, nt-motifs mainly distribute in the peaks nearby centromeres that contain a larger number of repetitive elements such as satellite, simple repeats and short interspersed nuclear elements (SINEs). This intracellular binding pattern was then confirmed by the in vitro detection, indicating that NF-κB dimers can bind the nontraditional κB (nt-κB) sites with low affinity. However, this binding hardly activates transcription. This study thus deduced that NF-κB binding nt-motifs may realize functions other than gene regulation as NF-κB binding traditional motifs (t-motifs). To testify the deduction, many ChIP-seq data of other cell lines were then analyzed. The results indicate that NF-κB binding nt-motifs is also widely present in other cells. The ChIP-seq data analysis also revealed that nt-motifs more widely distribute in the peaks with low-fold enrichment. Importantly, it was also found that NF-κB binding nt-motifs is mainly present in the resting cells, whereas NF-κB binding t-motifs is mainly present in the stimulated cells. Astonishingly, no known function was enriched by the gene annotation of nt-motif peaks. Based on these results, this study proposed that the nt-κB sites that extensively distribute in larger numbers of repeat elements function as a nuclear reservoir of NF-κB. The nuclear NF-κB proteins stored at nt-κB sites in the resting cells may be recruited to the t-κB sites for regulating its target genes upon stimulation. Copyright © 2018 Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, and Genetics Society of China. Published by Elsevier Ltd. All rights reserved.
Smyth, Redmond P; Smith, Maureen R; Jousset, Anne-Caroline; Despons, Laurence; Laumond, Géraldine; Decoville, Thomas; Cattenoz, Pierre; Moog, Christiane; Jossinet, Fabrice; Mougel, Marylène; Paillart, Jean-Christophe; von Kleist, Max; Marquet, Roland
2018-05-18
Non-coding RNA regulatory elements are important for viral replication, making them promising targets for therapeutic intervention. However, regulatory RNA is challenging to detect and characterise using classical structure-function assays. Here, we present in cell Mutational Interference Mapping Experiment (in cell MIME) as a way to define RNA regulatory landscapes at single nucleotide resolution under native conditions. In cell MIME is based on (i) random mutation of an RNA target, (ii) expression of mutated RNA in cells, (iii) physical separation of RNA into functional and non-functional populations, and (iv) high-throughput sequencing to identify mutations affecting function. We used in cell MIME to define RNA elements within the 5' region of the HIV-1 genomic RNA (gRNA) that are important for viral replication in cells. We identified three distinct RNA motifs controlling intracellular gRNA production, and two distinct motifs required for gRNA packaging into virions. Our analysis reveals the 73AAUAAA78 polyadenylation motif within the 5' PolyA domain as a dual regulator of gRNA production and gRNA packaging, and demonstrates that a functional polyadenylation signal is required for viral packaging even though it negatively affects gRNA production.
Smith, Maureen R; Jousset, Anne-Caroline; Despons, Laurence; Laumond, Géraldine; Decoville, Thomas; Cattenoz, Pierre; Moog, Christiane; Jossinet, Fabrice; Mougel, Marylène; Paillart, Jean-Christophe
2018-01-01
Abstract Non-coding RNA regulatory elements are important for viral replication, making them promising targets for therapeutic intervention. However, regulatory RNA is challenging to detect and characterise using classical structure-function assays. Here, we present in cell Mutational Interference Mapping Experiment (in cell MIME) as a way to define RNA regulatory landscapes at single nucleotide resolution under native conditions. In cell MIME is based on (i) random mutation of an RNA target, (ii) expression of mutated RNA in cells, (iii) physical separation of RNA into functional and non-functional populations, and (iv) high-throughput sequencing to identify mutations affecting function. We used in cell MIME to define RNA elements within the 5′ region of the HIV-1 genomic RNA (gRNA) that are important for viral replication in cells. We identified three distinct RNA motifs controlling intracellular gRNA production, and two distinct motifs required for gRNA packaging into virions. Our analysis reveals the 73AAUAAA78 polyadenylation motif within the 5′ PolyA domain as a dual regulator of gRNA production and gRNA packaging, and demonstrates that a functional polyadenylation signal is required for viral packaging even though it negatively affects gRNA production. PMID:29514260
Hombach, Antje; Ommen, Gabi; Chrobak, Mareike; Clos, Joachim
2013-04-01
The heat shock protein 90 plays a pivotal role in the life cycle control of Leishmania donovani promoting the fast-growing insect stage of this parasite. Equally important for insect stage growth is the co-chaperone Sti1. We show that replacement of Sti1 is only feasible in the presence of additional Sti1 transgenes indicating an essential role. To better understand the impact of Sti1 and its interaction with Hsp90, we performed a mutational analysis of Hsp90. We established that a single amino acid exchange in the Leishmania Hsp90 renders that protein resistant to the inhibitor radicicol (RAD), yet does not interfere with its functionality. Based on this RAD-resistant Hsp90, we established a combined chemical knockout/gene complementation (CKC) approach. We can show that Hsp90 function is required in both insect and mammalian life stages and that the Sti1-binding motif of Hsp90 is crucial for proliferation of insect and mammalian stages of the parasite. The Sti1-binding motif in Leishmania Hsp90 is suboptimal - optimizing the motif increased initial intracellular proliferation underscoring the importance of the Hsp90-Sti1 interaction for this important parasitic protozoan. The CKC strategy we developed will allow the future analysis of more Hsp90 domains and motifs in parasite viability and infectivity. © 2012 Blackwell Publishing Ltd.
2012-01-01
Background To discover a compound inhibiting multiple proteins (i.e. polypharmacological targets) is a new paradigm for the complex diseases (e.g. cancers and diabetes). In general, the polypharmacological proteins often share similar local binding environments and motifs. As the exponential growth of the number of protein structures, to find the similar structural binding motifs (pharma-motifs) is an emergency task for drug discovery (e.g. side effects and new uses for old drugs) and protein functions. Results We have developed a Space-Related Pharmamotifs (called SRPmotif) method to recognize the binding motifs by searching against protein structure database. SRPmotif is able to recognize conserved binding environments containing spatially discontinuous pharma-motifs which are often short conserved peptides with specific physico-chemical properties for protein functions. Among 356 pharma-motifs, 56.5% interacting residues are highly conserved. Experimental results indicate that 81.1% and 92.7% polypharmacological targets of each protein-ligand complex are annotated with same biological process (BP) and molecular function (MF) terms, respectively, based on Gene Ontology (GO). Our experimental results show that the identified pharma-motifs often consist of key residues in functional (active) sites and play the key roles for protein functions. The SRPmotif is available at http://gemdock.life.nctu.edu.tw/SRP/. Conclusions SRPmotif is able to identify similar pharma-interfaces and pharma-motifs sharing similar binding environments for polypharmacological targets by rapidly searching against the protein structure database. Pharma-motifs describe the conservations of binding environments for drug discovery and protein functions. Additionally, these pharma-motifs provide the clues for discovering new sequence-based motifs to predict protein functions from protein sequence databases. We believe that SRPmotif is useful for elucidating protein functions and drug discovery. PMID:23281852
Chiu, Yi-Yuan; Lin, Chun-Yu; Lin, Chih-Ta; Hsu, Kai-Cheng; Chang, Li-Zen; Yang, Jinn-Moon
2012-01-01
To discover a compound inhibiting multiple proteins (i.e. polypharmacological targets) is a new paradigm for the complex diseases (e.g. cancers and diabetes). In general, the polypharmacological proteins often share similar local binding environments and motifs. As the exponential growth of the number of protein structures, to find the similar structural binding motifs (pharma-motifs) is an emergency task for drug discovery (e.g. side effects and new uses for old drugs) and protein functions. We have developed a Space-Related Pharmamotifs (called SRPmotif) method to recognize the binding motifs by searching against protein structure database. SRPmotif is able to recognize conserved binding environments containing spatially discontinuous pharma-motifs which are often short conserved peptides with specific physico-chemical properties for protein functions. Among 356 pharma-motifs, 56.5% interacting residues are highly conserved. Experimental results indicate that 81.1% and 92.7% polypharmacological targets of each protein-ligand complex are annotated with same biological process (BP) and molecular function (MF) terms, respectively, based on Gene Ontology (GO). Our experimental results show that the identified pharma-motifs often consist of key residues in functional (active) sites and play the key roles for protein functions. The SRPmotif is available at http://gemdock.life.nctu.edu.tw/SRP/. SRPmotif is able to identify similar pharma-interfaces and pharma-motifs sharing similar binding environments for polypharmacological targets by rapidly searching against the protein structure database. Pharma-motifs describe the conservations of binding environments for drug discovery and protein functions. Additionally, these pharma-motifs provide the clues for discovering new sequence-based motifs to predict protein functions from protein sequence databases. We believe that SRPmotif is useful for elucidating protein functions and drug discovery.
Structural and biochemical analysis of Bcl-2 interaction with the hepatitis B virus protein HBx.
Jiang, Tianyu; Liu, Minhao; Wu, Jianping; Shi, Yigong
2016-02-23
HBx is a hepatitis B virus protein that is required for viral infectivity and replication. Anti-apoptotic Bcl-2 family members are thought to be among the important host targets of HBx. However, the structure and function of HBx are poorly understood and the molecular mechanism of HBx-induced carcinogenesis remains unknown. In this study, we report biochemical and structural characterization of HBx. The recombinant HBx protein contains metal ions, in particular iron and zinc. A BH3-like motif in HBx (residues 110-135) binds Bcl-2 with a dissociation constant of ∼193 μM, which is drastically lower than that for a canonical BH3 motif from Bim or Bad. Structural analysis reveals that, similar to other BH3 motifs, the BH3-like motif of HBx adopts an amphipathic α-helix and binds the conserved BH3-binding groove on Bcl-2. Unlike the helical Bim or Bad BH3 motif, the C-terminal portion of the bound HBx BH3-like motif has an extended conformation and makes considerably fewer interactions with Bcl-2. These observations suggest that HBx may modulate Bcl-2 function in a way that is different from that of the classical BH3-only proteins.
Zhang, Lu; Xu, Jinhao; Ma, Jinbiao
2016-07-25
RNA-binding protein exerts important biological function by specifically recognizing RNA motif. SELEX (Systematic evolution of ligands by exponential enrichment), an in vitro selection method, can obtain consensus motif with high-affinity and specificity for many target molecules from DNA or RNA libraries. Here, we combined SELEX with next-generation sequencing to study the protein-RNA interaction in vitro. A pool of RNAs with 20 bp random sequences were transcribed by T7 promoter, and target protein was inserted into plasmid containing SBP-tag, which can be captured by streptavidin beads. Through only one cycle, the specific RNA motif can be obtained, which dramatically improved the selection efficiency. Using this method, we found that human hnRNP A1 RRMs domain (UP1 domain) bound RNA motifs containing AGG and AG sequences. The EMSA experiment indicated that hnRNP A1 RRMs could bind the obtained RNA motif. Taken together, this method provides a rapid and effective method to study the RNA binding specificity of proteins.
Khurana, Simran; Chakraborty, Sharmistha; Zhao, Xuan; Liu, Yu; Guan, Dongyin; Lam, Minh; Huang, Wei; Yang, Sichun; Kao, Hung-Ying
2012-01-01
α-Actinins (ACTNs) are a family of proteins cross-linking actin filaments that maintain cytoskeletal organization and cell motility. Recently, it has also become clear that ACTN4 can function in the nucleus. In this report, we found that ACTN4 (full length) and its spliced isoform ACTN4 (Iso) possess an unusual LXXLL nuclear receptor interacting motif. Both ACTN4 (full length) and ACTN4 (Iso) potentiate basal transcription activity and directly interact with estrogen receptor α, although ACTN4 (Iso) binds ERα more strongly. We have also found that both ACTN4 (full length) and ACTN4 (Iso) interact with the ligand-independent and the ligand-dependent activation domains of estrogen receptor α. Although ACTN4 (Iso) interacts efficiently with transcriptional co-activators such as p300/CBP-associated factor (PCAF) and steroid receptor co-activator 1 (SRC-1), the full length ACTN4 protein either does not or does so weakly. More importantly, the flanking sequences of the LXXLL motif are important not only for interacting with nuclear receptors but also for the association with co-activators. Taken together, we have identified a novel extended LXXLL motif that is critical for interactions with both receptors and co-activators. This motif functions more efficiently in a spliced isoform of ACTN4 than it does in the full-length protein. PMID:22908231
Multi-scale modularity and motif distributional effect in metabolic networks.
Gao, Shang; Chen, Alan; Rahmani, Ali; Zeng, Jia; Tan, Mehmet; Alhajj, Reda; Rokne, Jon; Demetrick, Douglas; Wei, Xiaohui
2016-01-01
Metabolism is a set of fundamental processes that play important roles in a plethora of biological and medical contexts. It is understood that the topological information of reconstructed metabolic networks, such as modular organization, has crucial implications on biological functions. Recent interpretations of modularity in network settings provide a view of multiple network partitions induced by different resolution parameters. Here we ask the question: How do multiple network partitions affect the organization of metabolic networks? Since network motifs are often interpreted as the super families of evolved units, we further investigate their impact under multiple network partitions and investigate how the distribution of network motifs influences the organization of metabolic networks. We studied Homo sapiens, Saccharomyces cerevisiae and Escherichia coli metabolic networks; we analyzed the relationship between different community structures and motif distribution patterns. Further, we quantified the degree to which motifs participate in the modular organization of metabolic networks.
Ouyang, Ping; Zhang, He; Fan, Zhaolan; Wei, Pei; Huang, Zhigang; Wang, Sen; Li, Tao
2016-11-05
NKX2.5 plays important roles in heart development. Being a transcription factor, NKX2.5 exerts its biological functions in nucleus. However, the sequence motif that localize NKX2.5 into nucleus is still not clear. Here, we found a R/K-rich sequence motif from Q187 to R197 (QNRRYKCKRQR) was required for exclusive nuclear localization of NKX2.5. Eight truncated plasmids (E109X, Q149X, Q170X, Q187X, Q198X, Y256X, Y259X, and C264X) which were associated with congenital heart disease (CHD) were constructed. Compared with the wild type NKX2.5, the proteins E109X, Q149X, Q170X, Q187X without intact homeodomain (HD) showed no transcriptional activity while Q198X, Y256X, Y259X and C264X with intact HD showed 50 to 66% transcriptional activity. E109X, Q149X, Q170X, Q187X without intact HD localized in the cytoplasm and nucleus simultaneously and Q198X, Y256X, Y259X and C264X with intact HD localized completely in nucleus. These results inferred the indispensability of 187QNRRYKCKRQR197 in exclusive nucleus localization. Additionally, this sequence motif was very conservative among human, mouse and rat, indicating this motif was important for NKX2.5 function. Thus, we concluded that R/K-rich sequence motif 187QNRRYKCKRQR197 played a central role for NKX2.5 nuclear localization. Our findings provided a clue to understand the mechanisms between the truncated NKX2.5 mutants and CHD. Copyright © 2016 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shimomura, Tadanori; Miyamura, Norio; Hata, Shoji
2014-01-17
Highlights: •Loss of the PDZ-binding motif inhibits constitutively active YAP (5SA)-induced oncogenic cell transformation. •The PDZ-binding motif of YAP promotes its nuclear localization in cultured cells and mouse liver. •Loss of the PDZ-binding motif inhibits YAP (5SA)-induced CTGF transcription in cultured cells and mouse liver. -- Abstract: YAP is a transcriptional co-activator that acts downstream of the Hippo signaling pathway and regulates multiple cellular processes, including proliferation. Hippo pathway-dependent phosphorylation of YAP negatively regulates its function. Conversely, attenuation of Hippo-mediated phosphorylation of YAP increases its ability to stimulate proliferation and eventually induces oncogenic transformation. The C-terminus of YAP contains amore » highly conserved PDZ-binding motif that regulates YAP’s functions in multiple ways. However, to date, the importance of the PDZ-binding motif to the oncogenic cell transforming activity of YAP has not been determined. In this study, we disrupted the PDZ-binding motif in the YAP (5SA) protein, in which the sites normally targeted by Hippo pathway-dependent phosphorylation are mutated. We found that loss of the PDZ-binding motif significantly inhibited the oncogenic transformation of cultured cells induced by YAP (5SA). In addition, the increased nuclear localization of YAP (5SA) and its enhanced activation of TEAD-dependent transcription of the cell proliferation gene CTGF were strongly reduced when the PDZ-binding motif was deleted. Similarly, in mouse liver, deletion of the PDZ-binding motif suppressed nuclear localization of YAP (5SA) and YAP (5SA)-induced CTGF expression. Taken together, our results indicate that the PDZ-binding motif of YAP is critical for YAP-mediated oncogenesis, and that this effect is mediated by YAP’s co-activation of TEAD-mediated CTGF transcription.« less
Kim, Hyun-Jun; Kwon, Hye-Rim; Bae, Chang-Dae; Park, Joobae; Hong, Kyung U
2010-05-15
During mitosis, regulation of protein structures and functions by phosphorylation plays critical roles in orchestrating a series of complex events essential for the cell division process. Tumor-associated microtubule-associated protein (TMAP), also known as cytoskeleton-associated protein 2 (CKAP2), is a novel player in spindle assembly and chromosome segregation. We have previously reported that TMAP is phosphorylated at multiple residues specifically during mitosis. However, the mechanisms and functional importance of phosphorylation at most of the sites identified are currently unknown. Here, we report that TMAP is a novel substrate of the Aurora B kinase. Ser627 of TMAP was specifically phosphorylated by Aurora B both in vitro and in vivo. Ser627 and neighboring conserved residues were strictly required for efficient phosphorylation of TMAP by Aurora B, as even minor amino acid substitutions of the phosphorylation motif significantly diminished the efficiency of the substrate phosphorylation. Nearly all mutations at the phosphorylation motif had dramatic effects on the subcellular localization of TMAP. Instead of being localized to the chromosome region during late mitosis, the mutants remained associated with microtubules and centrosomes throughout mitosis. However, the changes in the subcellular localization of these mutants could not be completely explained by the phosphorylation status on Ser627. Our findings suggest that the motif surrounding Ser627 ((625) RRSRRL (630)) is a critical part of a functionally important sequence motif which not only governs the kinase-substrate recognition, but also regulates the subcellular localization of TMAP during mitosis.
Papanikolopoulou, Katerina; van Raaij, Mark J; Mitraki, Anna
2008-01-01
Stable, artificial fibrous proteins that can be functionalized open new avenues in fields such as bionanomaterials design and fiber engineering. An important source of inspiration for the creation of such proteins are natural fibrous proteins such as collagen, elastin, insect silks, and fibers from phages and viruses. The fibrous parts of this last class of proteins usually adopt trimeric, beta-stranded structural folds and are appended to globular, receptor-binding domains. It has been recently shown that the globular domains are essential for correct folding and trimerization and can be successfully substituted by a very small (27-amino acid) trimerization motif from phage T4 fibritin. The hybrid proteins are correctly folded nanorods that can withstand extreme conditions. When the fibrous part derives from the adenovirus fiber shaft, different tissue-targeting specificities can be engineered into the hybrid proteins, which therefore can be used as gene therapy vectors. The integration of such stable nanorods in devices is also a big challenge in the field of biomechanical design. The fibritin foldon domain is a versatile trimerization motif and can be combined with a variety of fibrous motifs, such as coiled-coil, collagenous, and triple beta-stranded motifs, provided the appropriate linkers are used. The combination of different motifs within the same fibrous molecule to create stable rods with multiple functions can even be envisioned. We provide a comprehensive overview of the experimental procedures used for designing, creating, and characterizing hybrid fibrous nanorods using the fibritin trimerization motif.
Feedback Inhibition Shapes Emergent Computational Properties of Cortical Microcircuit Motifs.
Jonke, Zeno; Legenstein, Robert; Habenschuss, Stefan; Maass, Wolfgang
2017-08-30
Cortical microcircuits are very complex networks, but they are composed of a relatively small number of stereotypical motifs. Hence, one strategy for throwing light on the computational function of cortical microcircuits is to analyze emergent computational properties of these stereotypical microcircuit motifs. We are addressing here the question how spike timing-dependent plasticity shapes the computational properties of one motif that has frequently been studied experimentally: interconnected populations of pyramidal cells and parvalbumin-positive inhibitory cells in layer 2/3. Experimental studies suggest that these inhibitory neurons exert some form of divisive inhibition on the pyramidal cells. We show that this data-based form of feedback inhibition, which is softer than that of winner-take-all models that are commonly considered in theoretical analyses, contributes to the emergence of an important computational function through spike timing-dependent plasticity: The capability to disentangle superimposed firing patterns in upstream networks, and to represent their information content through a sparse assembly code. SIGNIFICANCE STATEMENT We analyze emergent computational properties of a ubiquitous cortical microcircuit motif: populations of pyramidal cells that are densely interconnected with inhibitory neurons. Simulations of this model predict that sparse assembly codes emerge in this microcircuit motif under spike timing-dependent plasticity. Furthermore, we show that different assemblies will represent different hidden sources of upstream firing activity. Hence, we propose that spike timing-dependent plasticity enables this microcircuit motif to perform a fundamental computational operation on neural activity patterns. Copyright © 2017 the authors 0270-6474/17/378511-13$15.00/0.
NASA Astrophysics Data System (ADS)
Papanikolopoulou, Katerina; van Raaij, Mark J.; Mitraki, Anna
Stable, artificial fibrous proteins that can be functionalized open new avenues in fields such as bionanomaterials design and fiber engineering. An important source of inspiration for the creation of such proteins are natural fibrous proteins such as collagen, elastin, insect silks, and fibers from phages and viruses. The fibrous parts of this last class of proteins usually adopt trimeric, β-stranded structural folds and are appended to globular, receptor-binding domains. It has been recently shown that the globular domains are essential for correct folding and trimerization and can be successfully substituted by a very small (27-amino acid) trimerization motif from phage T4 fibritin. The hybrid proteins are correctly folded nanorods that can withstand extreme conditions. When the fibrous part derives from the adenovirus fiber shaft, different tissue-targeting specificities can be engineered into the hybrid proteins, which therefore can be used as gene therapy vectors. The integration of such stable nanorods in devices is also a big challenge in the field of biomechanical design. The fibritin foldon domain is a versatile trimerization motif and can be combined with a variety of fibrous motifs, such as coiled-coil, collagenous, and triple β-stranded motifs, provided the appropriate linkers are used. The combination of different motifs within the same fibrous molecule to create stable rods with multiple functions can even be envisioned. We provide a comprehensive overview of the experimental procedures used for designing, creating, and characterizing hybrid fibrous nanorods using the fibritin trimerization motif.
Identity and functions of CxxC-derived motifs.
Fomenko, Dmitri E; Gladyshev, Vadim N
2003-09-30
Two cysteines separated by two other residues (the CxxC motif) are employed by many redox proteins for formation, isomerization, and reduction of disulfide bonds and for other redox functions. The place of the C-terminal cysteine in this motif may be occupied by serine (the CxxS motif), modifying the functional repertoire of redox proteins. Here we found that the CxxC motif may also give rise to a motif, in which the C-terminal cysteine is replaced with threonine (the CxxT motif). Moreover, in contrast to a view that the N-terminal cysteine in the CxxC motif always serves as a nucleophilic attacking group, this residue could also be replaced with threonine (the TxxC motif), serine (the SxxC motif), or other residues. In each of these CxxC-derived motifs, the presence of a downstream alpha-helix was strongly favored. A search for conserved CxxC-derived motif/helix patterns in four complete genomes representing bacteria, archaea, and eukaryotes identified known redox proteins and suggested possible redox functions for several additional proteins. Catalytic sites in peroxiredoxins were major representatives of the TxxC motif, whereas those in glutathione peroxidases represented the CxxT motif. Structural assessments indicated that threonines in these enzymes could stabilize catalytic thiolates, suggesting revisions to previously proposed catalytic triads. Each of the CxxC-derived motifs was also observed in natural selenium-containing proteins, in which selenocysteine was present in place of a catalytic cysteine.
Counting motifs in dynamic networks.
Mukherjee, Kingshuk; Hasan, Md Mahmudul; Boucher, Christina; Kahveci, Tamer
2018-04-11
A network motif is a sub-network that occurs frequently in a given network. Detection of such motifs is important since they uncover functions and local properties of the given biological network. Finding motifs is however a computationally challenging task as it requires solving the costly subgraph isomorphism problem. Moreover, the topology of biological networks change over time. These changing networks are called dynamic biological networks. As the network evolves, frequency of each motif in the network also changes. Computing the frequency of a given motif from scratch in a dynamic network as the network topology evolves is infeasible, particularly for large and fast evolving networks. In this article, we design and develop a scalable method for counting the number of motifs in a dynamic biological network. Our method incrementally updates the frequency of each motif as the underlying network's topology evolves. Our experiments demonstrate that our method can update the frequency of each motif in orders of magnitude faster than counting the motif embeddings every time the network changes. If the network evolves more frequently, the margin with which our method outperforms the existing static methods, increases. We evaluated our method extensively using synthetic and real datasets, and show that our method is highly accurate(≥ 96%) and that it can be scaled to large dense networks. The results on real data demonstrate the utility of our method in revealing interesting insights on the evolution of biological processes.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Slabaugh, Erin; Scavuzzo-Duggan, Tess; Chaves, Arielle
2015-12-08
Cellulose synthases (CESAs) synthesize the β-1,4-glucan chains that coalesce to form cellulose microfibrils in plant cell walls. In addition to a large cytosolic (catalytic) domain, CESAs have eight predicted transmembrane helices (TMHs). However, analogous to the structure of BcsA, a bacterial CESA, predicted TMH5 in CESA may instead be an interfacial helix. This would place the conserved FxVTxK motif in the plant cell cytosol where it could function as a substrate-gating loop as occurs in BcsA. To define the functional importance of the CESA region containing FxVTxK, we tested five parallel mutations in Arabidopsis thaliana CESA1 and Physcomitrella patens CESA5more » in complementation assays of the relevant cesa mutants. In both organisms, the substitution of the valine or lysine residues in FxVTxK severely affected CESA function. In Arabidopsis roots, both changes were correlated with lower cellulose anisotropy, as revealed by Pontamine Fast Scarlet. Analysis of hypocotyl inner cell wall layers by atomic force microscopy showed that two altered versions of Atcesa1 could rescue cell wall phenotypes observed in the mutant background line. Overall, the data show that the FxVTxK motif is functionally important in two phylogenetically distant plant CESAs. The results show that Physcomitrella provides an efficient model for assessing the effects of engineered CESA mutations affecting primary cell wall synthesis and that diverse testing systems can lead to nuanced insights into CESA structure–function relationships. Although CESA membrane topology needs to be experimentally determined, the results support the possibility that the FxVTxK region functions similarly in CESA and BcsA.« less
Ang, Swee Kim; Zhang, Mengqi; Lodi, Tiziana; Lu, Hui
2014-01-01
Erv1 (essential for respiration and viability 1), is an essential component of the MIA (mitochondrial import and assembly) pathway, playing an important role in the oxidative folding of mitochondrial intermembrane space proteins. In the MIA pathway, Mia40, a thiol oxidoreductase with a CPC motif at its active site, oxidizes newly imported substrate proteins. Erv1 a FAD-dependent thiol oxidase, in turn reoxidizes Mia40 via its N-terminal Cys30–Cys33 shuttle disulfide. However, it is unclear how the two shuttle cysteine residues of Erv1 relay electrons from the Mia40 CPC motif to the Erv1 active-site Cys130–Cys133 disulfide. In the present study, using yeast genetic approaches we showed that both shuttle cysteine residues of Erv1 are required for cell growth. In organelle and in vitro studies confirmed that both shuttle cysteine residues were indeed required for import of MIA pathway substrates and Erv1 enzyme function to oxidize Mia40. Furthermore, our results revealed that the two shuttle cysteine residues of Erv1 are functionally distinct. Although Cys33 is essential for forming the intermediate disulfide Cys33–Cys130′ and transferring electrons to the redox active-site directly, Cys30 plays two important roles: (i) dominantly interacts and receives electrons from the Mia40 CPC motif; and (ii) resolves the Erv1 Cys33–Cys130 intermediate disulfide. Taken together, we conclude that both shuttle cysteine residues are required for Erv1 function, and play complementary, but distinct, roles to ensure rapid turnover of active Erv1. PMID:24625320
Dietz, Andrea N; Villinger, Clarissa; Becker, Stefan; Frick, Manfred; von Einem, Jens
2018-01-01
The human cytomegalovirus (HCMV) tegument protein pUL71 is required for efficient secondary envelopment and accumulates at the Golgi compartment-derived viral assembly complex (vAC) during infection. Analysis of various C-terminally truncated pUL71 proteins fused to enhanced green fluorescent protein (eGFP) identified amino acids 23 to 34 as important determinants for its Golgi complex localization. Sequence analysis and mutational verification revealed the presence of an N-terminal tyrosine-based trafficking motif (YXXΦ) in pUL71. This led us to hypothesize a requirement of the YXXΦ motif for the function of pUL71 in infection. Mutation of both the tyrosine residue and the entire YXXΦ motif resulted in an altered distribution of mutant pUL71 at the plasma membrane and in the cytoplasm during infection. Both YXXΦ mutant viruses exhibited similarly decreased focal growth and reduced virus yields in supernatants. Ultrastructurally, mutant-virus-infected cells exhibited impaired secondary envelopment manifested by accumulations of capsids undergoing an envelopment process. Additionally, clusters of capsid accumulations surrounding the vAC were observed, similar to the ultrastructural phenotype of a UL71-deficient mutant. The importance of endocytosis and thus the YXXΦ motif for targeting pUL71 to the Golgi complex was further demonstrated when clathrin-mediated endocytosis was inhibited either by coexpression of the C-terminal part of cellular AP180 (AP180-C) or by treatment with methyl-β-cyclodextrin. Both conditions resulted in a plasma membrane accumulation of pUL71. Altogether, these data reveal the presence of a functional N-terminal endocytosis motif that is an important determinant for intracellular localization of pUL71 and that is furthermore required for the function of pUL71 during secondary envelopment of HCMV capsids at the vAC. IMPORTANCE Human cytomegalovirus (HCMV) is the leading cause of birth defects among congenital virus infections and can lead to life-threatening infections in immunocompromised hosts. Current antiviral treatments target viral genome replication and are increasingly overcome by viral mutations. Therefore, identifying new targets for antiviral therapy is important for future development of novel treatment options. A detailed molecular understanding of the complex virus morphogenesis will identify potential viral as well as cellular targets for antiviral intervention. Secondary envelopment is an important viral process through which infectious virus particles are generated and which involves the action of several viral proteins, such as tegument protein pUL71. Targeting of pUL71 to the site of secondary envelopment appears to be crucial for its function during this process and is regulated by utilizing host trafficking mechanisms that are commonly exploited by viral glycoproteins. Thus, intracellular trafficking, if targeted, might present a novel target for antiviral therapy. Copyright © 2017 American Society for Microbiology.
2012-01-01
Background GDSL esterases/lipases are a newly discovered subclass of lipolytic enzymes that are very important and attractive research subjects because of their multifunctional properties, such as broad substrate specificity and regiospecificity. Compared with the current knowledge regarding these enzymes in bacteria, our understanding of the plant GDSL enzymes is very limited, although the GDSL gene family in plant species include numerous members in many fully sequenced plant genomes. Only two genes from a large rice GDSL esterase/lipase gene family were previously characterised, and the majority of the members remain unknown. In the present study, we describe the rice OsGELP (Oryza sativa GDSL esterase/lipase protein) gene family at the genomic and proteomic levels, and use this knowledge to provide insights into the multifunctionality of the rice OsGELP enzymes. Results In this study, an extensive bioinformatics analysis identified 114 genes in the rice OsGELP gene family. A complete overview of this family in rice is presented, including the chromosome locations, gene structures, phylogeny, and protein motifs. Among the OsGELPs and the plant GDSL esterase/lipase proteins of known functions, 41 motifs were found that represent the core secondary structure elements or appear specifically in different phylogenetic subclades. The specification and distribution of identified putative conserved clade-common and -specific peptide motifs, and their location on the predicted protein three dimensional structure may possibly signify their functional roles. Potentially important regions for substrate specificity are highlighted, in accordance with protein three-dimensional model and location of the phylogenetic specific conserved motifs. The differential expression of some representative genes were confirmed by quantitative real-time PCR. The phylogenetic analysis, together with protein motif architectures, and the expression profiling were analysed to predict the possible biological functions of the rice OsGELP genes. Conclusions Our current genomic analysis, for the first time, presents fundamental information on the organization of the rice OsGELP gene family. With combination of the genomic, phylogenetic, microarray expression, protein motif distribution, and protein structure analyses, we were able to create supported basis for the functional prediction of many members in the rice GDSL esterase/lipase family. The present study provides a platform for the selection of candidate genes for further detailed functional study. PMID:22793791
Lee, Patricia; Ng, Hwee L.; Yang, Otto O.
2012-01-01
Human immunodeficiency virus type 1 (HIV-1) Nef downregulates major histocompatibility complex class I (MHC-I), impairing the clearance of infected cells by CD8+ cytotoxic T lymphocytes (CTLs). While sequence motifs mediating this function have been determined by in vitro mutagenesis studies of laboratory-adapted HIV-1 molecular clones, it is unclear whether the highly variable Nef sequences of primary isolates in vivo rely on the same sequence motifs. To address this issue, nef quasispecies from nine chronically HIV-1-infected persons were examined for sequence evolution and altered MHC-I downregulatory function under Gag-specific CTL immune pressure in vitro. This selection resulted in decreased nef diversity and strong purifying selection. Site-by-site analysis identified 13 codons undergoing purifying selection and 1 undergoing positive selection. Of the former, only 6 have been reported to have roles in Nef function, including 4 associated with MHC-I downregulation. Functional testing of naturally occurring in vivo polymorphisms at the 7 sites with no previously known functional role revealed 3 mutations (A84D, Y135F, and G140R) that ablated MHC-I downregulation and 3 (N52A, S169I, and V180E) that partially impaired MHC-I downregulation. Globally, the CTL pressure in vitro selected functional Nef from the in vivo quasispecies mixtures that predominately lacked MHC-I downregulatory function at the baseline. Overall, these data demonstrate that CTL pressure exerts a strong purifying selective pressure for MHC-I downregulation and identifies novel functional motifs present in Nef sequences in vivo. PMID:22553319
Core Promoter Functions in the Regulation of Gene Expression of Drosophila Dorsal Target Genes*
Zehavi, Yonathan; Kuznetsov, Olga; Ovadia-Shochat, Avital; Juven-Gershon, Tamar
2014-01-01
Developmental processes are highly dependent on transcriptional regulation by RNA polymerase II. The RNA polymerase II core promoter is the ultimate target of a multitude of transcription factors that control transcription initiation. Core promoters consist of core promoter motifs, e.g. the initiator, TATA box, and the downstream core promoter element (DPE), which confer specific properties to the core promoter. Here, we explored the importance of core promoter functions in the dorsal-ventral developmental gene regulatory network. This network includes multiple genes that are activated by different nuclear concentrations of Dorsal, an NFκB homolog transcription factor, along the dorsal-ventral axis. We show that over two-thirds of Dorsal target genes contain DPE sequence motifs, which is significantly higher than the proportion of DPE-containing promoters in Drosophila genes. We demonstrate that multiple Dorsal target genes are evolutionarily conserved and functionally dependent on the DPE. Furthermore, we have analyzed the activation of key Dorsal target genes by Dorsal, as well as by another Rel family transcription factor, Relish, and the dependence of their activation on the DPE motif. Using hybrid enhancer-promoter constructs in Drosophila cells and embryo extracts, we have demonstrated that the core promoter composition is an important determinant of transcriptional activity of Dorsal target genes. Taken together, our results provide evidence for the importance of core promoter composition in the regulation of Dorsal target genes. PMID:24634215
Composite Structural Motifs of Binding Sites for Delineating Biological Functions of Proteins
Kinjo, Akira R.; Nakamura, Haruki
2012-01-01
Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs that represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures. PMID:22347478
Methylation of class I translation termination factors: structural and functional aspects.
Graille, Marc; Figaro, Sabine; Kervestin, Stéphanie; Buckingham, Richard H; Liger, Dominique; Heurgué-Hamard, Valérie
2012-07-01
During protein synthesis, release of polypeptide from the ribosome occurs when an in frame termination codon is encountered. Contrary to sense codons, which are decoded by tRNAs, stop codons present in the A-site are recognized by proteins named class I release factors, leading to the release of newly synthesized proteins. Structures of these factors bound to termination ribosomal complexes have recently been obtained, and lead to a better understanding of stop codon recognition and its coordination with peptidyl-tRNA hydrolysis in bacteria. Release factors contain a universally conserved GGQ motif which interacts with the peptidyl-transferase centre to allow peptide release. The Gln side chain from this motif is methylated, a feature conserved from bacteria to man, suggesting an important biological role. However, methylation is catalysed by completely unrelated enzymes. The function of this motif and its post-translational modification will be discussed in the context of recent structural and functional studies. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
Self-assembly of multi-stranded RNA motifs into lattices and tubular structures
Stewart, Jaimie Marie; Subramanian, Hari K. K.; Franco, Elisa
2017-02-16
Rational design of nucleic acidmolecules yields selfassembling scaffolds with increasing complexity, size and functionality. It is an open question whether design methods tailored to build DNA nanostructures can be adapted to build RNA nanostructures with comparable features. We demonstrate the formation of RNA lattices and tubular assemblies from double crossover (DX) tiles, a canonical motif in DNA nanotechnology. Tubular structures can exceed 1 m in length, suggesting that this DX motif can produce very robust lattices. Some of these tubes spontaneously form with left-handed chirality. We obtain assemblies by using two methods: a protocol where gel-extracted RNA strands are slowlymore » annealed, and a one-pot transcription and anneal procedure. We then identify the tile nick position as a structural requirement for lattice formation. These results demonstrate that stable RNA structures can be obtained with design tools imported from DNA nanotechnology. These large assemblies could be potentially integrated with a variety of functional RNA motifs for drug or nanoparticle delivery, or for colocalization of cellular components.« less
Self-assembly of multi-stranded RNA motifs into lattices and tubular structures
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stewart, Jaimie Marie; Subramanian, Hari K. K.; Franco, Elisa
Rational design of nucleic acidmolecules yields selfassembling scaffolds with increasing complexity, size and functionality. It is an open question whether design methods tailored to build DNA nanostructures can be adapted to build RNA nanostructures with comparable features. We demonstrate the formation of RNA lattices and tubular assemblies from double crossover (DX) tiles, a canonical motif in DNA nanotechnology. Tubular structures can exceed 1 m in length, suggesting that this DX motif can produce very robust lattices. Some of these tubes spontaneously form with left-handed chirality. We obtain assemblies by using two methods: a protocol where gel-extracted RNA strands are slowlymore » annealed, and a one-pot transcription and anneal procedure. We then identify the tile nick position as a structural requirement for lattice formation. These results demonstrate that stable RNA structures can be obtained with design tools imported from DNA nanotechnology. These large assemblies could be potentially integrated with a variety of functional RNA motifs for drug or nanoparticle delivery, or for colocalization of cellular components.« less
Self-assembly of multi-stranded RNA motifs into lattices and tubular structures
Stewart, Jaimie Marie; Subramanian, Hari K. K.
2017-01-01
Abstract Rational design of nucleic acid molecules yields self-assembling scaffolds with increasing complexity, size and functionality. It is an open question whether design methods tailored to build DNA nanostructures can be adapted to build RNA nanostructures with comparable features. Here we demonstrate the formation of RNA lattices and tubular assemblies from double crossover (DX) tiles, a canonical motif in DNA nanotechnology. Tubular structures can exceed 1 μm in length, suggesting that this DX motif can produce very robust lattices. Some of these tubes spontaneously form with left-handed chirality. We obtain assemblies by using two methods: a protocol where gel-extracted RNA strands are slowly annealed, and a one-pot transcription and anneal procedure. We identify the tile nick position as a structural requirement for lattice formation. Our results demonstrate that stable RNA structures can be obtained with design tools imported from DNA nanotechnology. These large assemblies could be potentially integrated with a variety of functional RNA motifs for drug or nanoparticle delivery, or for colocalization of cellular components. PMID:28204562
Conservation of tubulin-binding sequences in TRPV1 throughout evolution.
Sardar, Puspendu; Kumar, Abhishek; Bhandari, Anita; Goswami, Chandan
2012-01-01
Transient Receptor Potential Vanilloid sub type 1 (TRPV1), commonly known as capsaicin receptor can detect multiple stimuli ranging from noxious compounds, low pH, temperature as well as electromagnetic wave at different ranges. In addition, this receptor is involved in multiple physiological and sensory processes. Therefore, functions of TRPV1 have direct influences on adaptation and further evolution also. Availability of various eukaryotic genomic sequences in public domain facilitates us in studying the molecular evolution of TRPV1 protein and the respective conservation of certain domains, motifs and interacting regions that are functionally important. Using statistical and bioinformatics tools, our analysis reveals that TRPV1 has evolved about ∼420 million years ago (MYA). Our analysis reveals that specific regions, domains and motifs of TRPV1 has gone through different selection pressure and thus have different levels of conservation. We found that among all, TRP box is the most conserved and thus have functional significance. Our results also indicate that the tubulin binding sequences (TBS) have evolutionary significance as these stretch sequences are more conserved than many other essential regions of TRPV1. The overall distribution of positively charged residues within the TBS motifs is conserved throughout evolution. In silico analysis reveals that the TBS-1 and TBS-2 of TRPV1 can form helical structures and may play important role in TRPV1 function. Our analysis identifies the regions of TRPV1, which are important for structure-function relationship. This analysis indicates that tubulin binding sequence-1 (TBS-1) near the TRP-box forms a potential helix and the tubulin interactions with TRPV1 via TBS-1 have evolutionary significance. This interaction may be required for the proper channel function and regulation and may also have significance in the context of Taxol®-induced neuropathy.
Sequence, Structure, and Context Preferences of Human RNA Binding Proteins.
Dominguez, Daniel; Freese, Peter; Alexis, Maria S; Su, Amanda; Hochman, Myles; Palden, Tsultrim; Bazile, Cassandra; Lambert, Nicole J; Van Nostrand, Eric L; Pratt, Gabriel A; Yeo, Gene W; Graveley, Brenton R; Burge, Christopher B
2018-06-07
RNA binding proteins (RBPs) orchestrate the production, processing, and function of mRNAs. Here, we present the affinity landscapes of 78 human RBPs using an unbiased assay that determines the sequence, structure, and context preferences of these proteins in vitro by deep sequencing of bound RNAs. These data enable construction of "RNA maps" of RBP activity without requiring crosslinking-based assays. We found an unexpectedly low diversity of RNA motifs, implying frequent convergence of binding specificity toward a relatively small set of RNA motifs, many with low compositional complexity. Offsetting this trend, however, we observed extensive preferences for contextual features distinct from short linear RNA motifs, including spaced "bipartite" motifs, biased flanking nucleotide composition, and bias away from or toward RNA structure. Our results emphasize the importance of contextual features in RNA recognition, which likely enable targeting of distinct subsets of transcripts by different RBPs that recognize the same linear motif. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Searching RNA motifs and their intermolecular contacts with constraint networks.
Thébault, P; de Givry, S; Schiex, T; Gaspin, C
2006-09-01
Searching RNA gene occurrences in genomic sequences is a task whose importance has been renewed by the recent discovery of numerous functional RNA, often interacting with other ligands. Even if several programs exist for RNA motif search, none exists that can represent and solve the problem of searching for occurrences of RNA motifs in interaction with other molecules. We present a constraint network formulation of this problem. RNA are represented as structured motifs that can occur on more than one sequence and which are related together by possible hybridization. The implemented tool MilPat is used to search for several sRNA families in genomic sequences. Results show that MilPat allows to efficiently search for interacting motifs in large genomic sequences and offers a simple and extensible framework to solve such problems. New and known sRNA are identified as H/ACA candidates in Methanocaldococcus jannaschii. http://carlit.toulouse.inra.fr/MilPaT/MilPat.pl.
Lu, Shun-Wen; Chen, Shiyan; Wang, Jianying; Yu, Hang; Chronis, Demosthenis; Mitchum, Melissa G; Wang, Xiaohong
2009-09-01
Plant CLAVATA3/ESR-related (CLE) peptides have diverse roles in plant growth and development. Here, we report the isolation and functional characterization of five new CLE genes from the potato cyst nematode Globodera rostochiensis. Unlike typical plant CLE peptides that contain a single CLE motif, four of the five Gr-CLE genes encode CLE proteins with multiple CLE motifs. These Gr-CLE genes were found to be specifically expressed within the dorsal esophageal gland cell of nematode parasitic stages, suggesting a role for their encoded proteins in plant parasitism. Overexpression phenotypes of Gr-CLE genes in Arabidopsis mimicked those of plant CLE genes, and Gr-CLE proteins could rescue the Arabidopsis clv3-2 mutant phenotype when expressed within meristems. A short root phenotype was observed when synthetic GrCLE peptides were exogenously applied to roots of Arabidopsis or potato similar to the overexpression of Gr-CLE genes in Arabidopsis and potato hairy roots. These results reveal that G. rostochiensis CLE proteins with either single or multiple CLE motifs function similarly to plant CLE proteins and that CLE signaling components are conserved in both Arabidopsis and potato roots. Furthermore, our results provide evidence to suggest that the evolution of multiple CLE motifs may be an important mechanism for generating functional diversity in nematode CLE proteins to facilitate parasitism.
Advantages and disadvantages in usage of bioinformatic programs in promoter region analysis
NASA Astrophysics Data System (ADS)
Pawełkowicz, Magdalena E.; Skarzyńska, Agnieszka; Posyniak, Kacper; ZiÄ bska, Karolina; PlÄ der, Wojciech; Przybecki, Zbigniew
2015-09-01
An important computational challenge is finding the regulatory elements across the promotor region. In this work we present the advantages and disadvantages from the application of different bioinformatics programs for localization of transcription factor binding sites in the upstream region of genes connected with sex determination in cucumber. We use PlantCARE, PlantPAN and SignalScan to find motifs in the promotor regions. The results have been compared and possible function of chosen motifs has been described.
Molecular Signaling Network Motifs Provide a Mechanistic Basis for Cellular Threshold Responses
Bhattacharya, Sudin; Conolly, Rory B.; Clewell, Harvey J.; Kaminski, Norbert E.; Andersen, Melvin E.
2014-01-01
Background: Increasingly, there is a move toward using in vitro toxicity testing to assess human health risk due to chemical exposure. As with in vivo toxicity testing, an important question for in vitro results is whether there are thresholds for adverse cellular responses. Empirical evaluations may show consistency with thresholds, but the main evidence has to come from mechanistic considerations. Objectives: Cellular response behaviors depend on the molecular pathway and circuitry in the cell and the manner in which chemicals perturb these circuits. Understanding circuit structures that are inherently capable of resisting small perturbations and producing threshold responses is an important step towards mechanistically interpreting in vitro testing data. Methods: Here we have examined dose–response characteristics for several biochemical network motifs. These network motifs are basic building blocks of molecular circuits underpinning a variety of cellular functions, including adaptation, homeostasis, proliferation, differentiation, and apoptosis. For each motif, we present biological examples and models to illustrate how thresholds arise from specific network structures. Discussion and Conclusion: Integral feedback, feedforward, and transcritical bifurcation motifs can generate thresholds. Other motifs (e.g., proportional feedback and ultrasensitivity)produce responses where the slope in the low-dose region is small and stays close to the baseline. Feedforward control may lead to nonmonotonic or hormetic responses. We conclude that network motifs provide a basis for understanding thresholds for cellular responses. Computational pathway modeling of these motifs and their combinations occurring in molecular signaling networks will be a key element in new risk assessment approaches based on in vitro cellular assays. Citation: Zhang Q, Bhattacharya S, Conolly RB, Clewell HJ III, Kaminski NE, Andersen ME. 2014. Molecular signaling network motifs provide a mechanistic basis for cellular threshold responses. Environ Health Perspect 122:1261–1270; http://dx.doi.org/10.1289/ehp.1408244 PMID:25117432
Identification of multiple nuclear localization signals in murine Elf3, an ETS transcription factor.
Do, Hyun-Jin; Song, Hyuk; Yang, Heung-Mo; Kim, Dong-Ku; Kim, Nam-Hyung; Kim, Jin-Hoi; Cha, Kwang-Yul; Chung, Hyung-Min; Kim, Jae-Hwan
2006-03-20
We investigated nuclear localization signal (NLS) determinants within the AT-hook and ETS DNA-binding domains of murine Elf3 (mElf3), a member of the subfamily of epithelium-specific ETS transcription factors. Deletion mutants containing the AT-hook, ETS domain or both localized strictly in the nucleus, suggesting that these individual domains contain independent NLS motif(s). Within the AT-hook domain, four basic residues (244KRKR247) were critical for strong NLS activity, and two potent bipartite NLS motifs (236-252 and 249-267) were sufficient for nuclear import of mElf3, although less efficient than the full domain. In addition, one stretch of basic residues (318KKK320) within the ETS domain appears to be essential for mElf3 nuclear localization. Taken together, mElf3 contains multiple NLS motifs, which may function cooperatively to effect efficient nuclear transport.
Nishio, Koji; Ma, Qian
2016-01-01
The maintenance of mitochondrial membrane potential is essential for cell growth and survival. Mitochondrial uncoupling protein 2 plays the most important roles in uncoupling oxidative phosphorylation and decreasing mitochondrial O2- production by regulating the mitochondrial membrane potential. We propose that mouse UCP2 has two glycine-rich motifs, motif 1: EGIRGLWKG (170-178) and a known Walker A-like motif 2: EGPRAFYKG (264-272). These motifs seem to be important for the function of UCP2. We investigated the biological effects of overproduced-UCP2 and its physiological consequence in Cos7 cells. We introduced several amino acid changes in the motif 1. The expression vectors of the green fluorescent protein (GFP)-fused UCP2 and mutant UCP2 were constructed and expressed in Cos7 cells. The UCP2-GFP-expressed cells significantly down-regulated the mitochondrial membrane potentials and induced the enlarged cell shapes. Next we generated the stably UCP2-GFP-expressed Cos7 cells by selection with the antibiotic Genecitin (G418). Within the first few weeks following G418-selection, the stably UCP2-GFP-expressed cells could not divide well and gradually manifested the irregular and enlarged senescent-like cell morphology. The UCP2/K177E- or UCP2/G174L-expressed cells did not induce the enlarged cell shapes. Hence, UCP2/K177E and UCP2/G174L produced the functional incompetence of the glycine-rich motif 1. The senescent-like cells significantly decreased the mitochondrial membrane potentials and finally died nearly one month. Overproduction of UCP2 irreversibly reduces the mitochondrial membrane potentials and induces the senescent-like morphology and finally oncotic cell death in Cos7 cells. These changes seem to occur from the irreversible metabolic changes following total loss of cellular ATP.
Multiple TPR motifs characterize the Fanconi anemia FANCG protein.
Blom, Eric; van de Vrugt, Henri J; de Vries, Yne; de Winter, Johan P; Arwert, Fré; Joenje, Hans
2004-01-05
The genome protection pathway that is defective in patients with Fanconi anemia (FA) is controlled by at least eight genes, including BRCA2. A key step in the pathway involves the monoubiquitylation of FANCD2, which critically depends on a multi-subunit nuclear 'core complex' of at least six FANC proteins (FANCA, -C, -E, -F, -G, and -L). Except for FANCL, which has WD40 repeats and a RING finger domain, no significant domain structure has so far been recognized in any of the core complex proteins. By using a homology search strategy comparing the human FANCG protein sequence with its ortholog sequences in Oryzias latipes (Japanese rice fish) and Danio rerio (zebrafish) we identified at least seven tetratricopeptide repeat motifs (TPRs) covering a major part of this protein. TPRs are degenerate 34-amino acid repeat motifs which function as scaffolds mediating protein-protein interactions, often found in multiprotein complexes. In four out of five TPR motifs tested (TPR1, -2, -5, and -6), targeted missense mutagenesis disrupting the motifs at the critical position 8 of each TPR caused complete or partial loss of FANCG function. Loss of function was evident from failure of the mutant proteins to complement the cellular FA phenotype in FA-G lymphoblasts, which was correlated with loss of binding to FANCA. Although the TPR4 mutant fully complemented the cells, it showed a reduced interaction with FANCA, suggesting that this TPR may also be of functional importance. The recognition of FANCG as a typical TPR protein predicts this protein to play a key role in the assembly and/or stabilization of the nuclear FA protein core complex.
Smith, Robert A; Anderson, Donovan J; Preston, Bradley D
2006-07-01
Human immunodeficiency virus type 1 (HIV-1) reverse transcriptase (RT) contains four structural motifs (A, B, C, and D) that are conserved in polymerases from diverse organisms. Motif B interacts with the incoming nucleotide, the template strand, and key active-site residues from other motifs, suggesting that motif B is an important determinant of substrate specificity. To examine the functional role of this region, we performed "random scanning mutagenesis" of 11 motif B residues and screened replication-competent mutants for altered substrate analog sensitivity in culture. Single amino acid replacements throughout the targeted region conferred resistance to lamivudine and/or hypersusceptibility to zidovudine (AZT). Substitutions at residue Q151 increased the sensitivity of HIV-1 to multiple nucleoside analogs, and a subset of these Q151 variants was also hypersusceptible to the pyrophosphate analog phosphonoformic acid (PFA). Other AZT-hypersusceptible mutants were resistant to PFA and are therefore phenotypically similar to PFA-resistant variants selected in vitro and in infected patients. Collectively, these data show that specific amino acid replacements in motif B confer broad-spectrum hypersusceptibility to substrate analog inhibitors. Our results suggest that motif B influences RT-deoxynucleoside triphosphate interactions at multiple steps in the catalytic cycle of polymerization.
Li, Wan; Chen, Lina; Li, Xia; Jia, Xu; Feng, Chenchen; Zhang, Liangcai; He, Weiming; Lv, Junjie; He, Yuehan; Li, Weiguo; Qu, Xiaoli; Zhou, Yanyan; Shi, Yuchen
2013-12-01
Network motifs in central positions are considered to not only have more in-coming and out-going connections but are also localized in an area where more paths reach the networks. These central motifs have been extensively investigated to determine their consistent functions or associations with specific function categories. However, their functional potentials in the maintenance of cross-talk between different functional communities are unclear. In this paper, we constructed an integrated human signaling network from the Pathway Interaction Database. We identified 39 essential cancer-related motifs in central roles, which we called cancer-related marketing centrality motifs, using combined centrality indices on the system level. Our results demonstrated that these cancer-related marketing centrality motifs were pivotal units in the signaling network, and could mediate cross-talk between 61 biological pathways (25 could be mediated by one motif on average), most of which were cancer-related pathways. Further analysis showed that molecules of most marketing centrality motifs were in the same or adjacent subcellular localizations, such as the motif containing PI3K, PDK1 and AKT1 in the plasma membrane, to mediate signal transduction between 32 cancer-related pathways. Finally, we analyzed the pivotal roles of cancer genes in these marketing centrality motifs in the pathogenesis of cancers, and found that non-cancer genes were potential cancer-related genes.
Gallon, Matthew; Clairfeuille, Thomas; Steinberg, Florian; Mas, Caroline; Ghai, Rajesh; Sessions, Richard B; Teasdale, Rohan D; Collins, Brett M; Cullen, Peter J
2014-09-02
The sorting nexin 27 (SNX27)-retromer complex is a major regulator of endosome-to-plasma membrane recycling of transmembrane cargos that contain a PSD95, Dlg1, zo-1 (PDZ)-binding motif. Here we describe the core interaction in SNX27-retromer assembly and its functional relevance for cargo sorting. Crystal structures and NMR experiments reveal that an exposed β-hairpin in the SNX27 PDZ domain engages a groove in the arrestin-like structure of the vacuolar protein sorting 26A (VPS26A) retromer subunit. The structure establishes how the SNX27 PDZ domain simultaneously binds PDZ-binding motifs and retromer-associated VPS26. Importantly, VPS26A binding increases the affinity of the SNX27 PDZ domain for PDZ- binding motifs by an order of magnitude, revealing cooperativity in cargo selection. With disruption of SNX27 and retromer function linked to synaptic dysfunction and neurodegenerative disease, our work provides the first step, to our knowledge, in the molecular description of this important sorting complex, and more broadly describes a unique interaction between a PDZ domain and an arrestin-like fold.
Network motif frequency vectors reveal evolving metabolic network organisation.
Pearcy, Nicole; Crofts, Jonathan J; Chuzhanova, Nadia
2015-01-01
At the systems level many organisms of interest may be described by their patterns of interaction, and as such, are perhaps best characterised via network or graph models. Metabolic networks, in particular, are fundamental to the proper functioning of many important biological processes, and thus, have been widely studied over the past decade or so. Such investigations have revealed a number of shared topological features, such as a short characteristic path-length, large clustering coefficient and hierarchical modular structure. However, the extent to which evolutionary and functional properties of metabolism manifest via this underlying network architecture remains unclear. In this paper, we employ a novel graph embedding technique, based upon low-order network motifs, to compare metabolic network structure for 383 bacterial species categorised according to a number of biological features. In particular, we introduce a new global significance score which enables us to quantify important evolutionary relationships that exist between organisms and their physical environments. Using this new approach, we demonstrate a number of significant correlations between environmental factors, such as growth conditions and habitat variability, and network motif structure, providing evidence that organism adaptability leads to increased complexities in the resultant metabolic networks.
Xu, Hongyun; Shi, Xinxin; Wang, Zhibo; Gao, Caiqiu; Wang, Chao; Wang, Yucheng
2017-08-01
WRKY transcription factors play important roles in many biological processes, and mainly bind to the W-box element to regulate gene expression. Previously, we characterized a WRKY gene from Tamarix hispida, ThWRKY4, in response to abiotic stress, and showed that it bound to the W-box motif. However, whether ThWRKY4 could bind to other motifs remains unknown. In this study, we employed a Transcription Factor-Centered Yeast one Hybrid (TF-Centered Y1H) screen to study the motifs recognized by ThWRKY4. In addition to the W-box core cis-element (termed W-box), we identified that ThWRKY4 could bind to two other motifs: the RAV1A element (CAACA) and a novel motif with sequence of GTCTA (W-box like sequence, WLS). The distributions of these motifs were screened in the promoter regions of genes regulated by some WRKYs. The results showed that the W-box, RAV1A, and WLS motifs were all present in high numbers, suggesting that they play key roles in gene expression mediated by WRKYs. Furthermore, five WRKY proteins from different WRKY subfamilies in Arabidopsis thaliana were selected and confirmed to bind to the RAV1A and WLS motifs, indicating that they are recognized commonly by WRKYs. These findings will help to further reveal the functions of WRKY proteins. Copyright © 2017 Elsevier B.V. All rights reserved.
Richardson, Kris; Schnitzler, Gavin R; Lai, Chao-Qiang; Ordovas, Jose M
2015-12-01
Cardiovascular disease and type 2 diabetes mellitus represent overlapping diseases where a large portion of the variation attributable to genetics remains unexplained. An important player in their pathogenesis is peroxisome proliferator-activated receptor γ (PPARγ) that is involved in lipid and glucose metabolism and maintenance of metabolic homeostasis. We used a functional genomics methodology to interrogate human chromatin immunoprecipitation-sequencing, genome-wide association studies, and expression quantitative trait locus data to inform selection of candidate functional single nucleotide polymorphisms (SNPs) falling in PPARγ motifs. We derived 27 328 chromatin immunoprecipitation-sequencing peaks for PPARγ in human adipocytes through meta-analysis of 3 data sets. The PPARγ consensus motif showed greatest enrichment and mapped to 8637 peaks. We identified 146 SNPs in these motifs. This number was significantly less than would be expected by chance, and Inference of Natural Selection from Interspersed Genomically coHerent elemenTs analysis indicated that these motifs are under weak negative selection. A screen of these SNPs against genome-wide association studies for cardiometabolic traits revealed significant enrichment with 16 SNPs. A screen against the MuTHER expression quantitative trait locus data revealed 8 of these were significantly associated with altered gene expression in human adipose, more than would be expected by chance. Several SNPs fall close, or are linked by expression quantitative trait locus to lipid-metabolism loci including CYP26A1. We demonstrated the use of functional genomics to identify SNPs of potential function. Specifically, that SNPs within PPARγ motifs that bind PPARγ in adipocytes are significantly associated with cardiometabolic disease and with the regulation of transcription in adipose. This method may be used to uncover functional SNPs that do not reach significance thresholds in the agnostic approach of genome-wide association studies. © 2015 American Heart Association, Inc.
RNA 3D Structural Motifs: Definition, Identification, Annotation, and Database Searching
NASA Astrophysics Data System (ADS)
Nasalean, Lorena; Stombaugh, Jesse; Zirbel, Craig L.; Leontis, Neocles B.
Structured RNA molecules resemble proteins in the hierarchical organization of their global structures, folding and broad range of functions. Structured RNAs are composed of recurrent modular motifs that play specific functional roles. Some motifs direct the folding of the RNA or stabilize the folded structure through tertiary interactions. Others bind ligands or proteins or catalyze chemical reactions. Therefore, it is desirable, starting from the RNA sequence, to be able to predict the locations of recurrent motifs in RNA molecules. Conversely, the potential occurrence of one or more known 3D RNA motifs may indicate that a genomic sequence codes for a structured RNA molecule. To identify known RNA structural motifs in new RNA sequences, precise structure-based definitions are needed that specify the core nucleotides of each motif and their conserved interactions. By comparing instances of each recurrent motif and applying base pair isosteriCity relations, one can identify neutral mutations that preserve its structure and function in the contexts in which it occurs.
Al-Momani, Shireen; Qi, Da; Ren, Zhe; Jones, Andrew R
2018-06-15
Phosphorylation is one of the most prevalent post-translational modifications and plays a key role in regulating cellular processes. We carried out a bioinformatics analysis of pre-existing phosphoproteomics data, to profile two model species representing the largest subclasses in flowering plants the dicot Arabidopsis thaliana and the monocot Oryza sativa, to understand the extent to which phosphorylation signaling and function is conserved across evolutionary divergent plants. We identified 6537 phosphopeptides from 3189 phosphoproteins in Arabidopsis and 2307 phosphopeptides from 1613 phosphoproteins in rice. We identified phosphorylation motifs, finding nineteen pS motifs and two pT motifs shared in rice and Arabidopsis. The majority of shared motif-containing proteins were mapped to the same biological processes with similar patterns of fold enrichment, indicating high functional conservation. We also identified shared patterns of crosstalk between phosphoserines with enrichment for motifs pSXpS, pSXXpS and pSXXXpS, where X is any amino acid. Lastly, our results identified several pairs of motifs that are significantly enriched to co-occur in Arabidopsis proteins, indicating cross-talk between different sites, but this was not observed in rice. Our results demonstrate that there are evolutionary conserved mechanisms of phosphorylation-mediated signaling in plants, via analysis of high-throughput phosphorylation proteomics data from key monocot and dicot species: rice and Arabidposis thaliana. The results also suggest that there is increased crosstalk between phosphorylation sites in A. thaliana compared with rice. The results are important for our general understanding of cell signaling in plants, and the ability to use A. thaliana as a general model for plant biology. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Correlated Mutation in the Evolution of Catalysis in Uracil DNA Glycosylase Superfamily
NASA Astrophysics Data System (ADS)
Xia, Bo; Liu, Yinling; Guevara, Jose; Li, Jing; Jilich, Celeste; Yang, Ye; Wang, Liangjiang; Dominy, Brian N.; Cao, Weiguo
2017-04-01
Enzymes in Uracil DNA glycosylase (UDG) superfamily are essential for the removal of uracil. Family 4 UDGa is a robust uracil DNA glycosylase that only acts on double-stranded and single-stranded uracil-containing DNA. Based on mutational, kinetic and modeling analyses, a catalytic mechanism involving leaving group stabilization by H155 in motif 2 and water coordination by N89 in motif 3 is proposed. Mutual Information analysis identifies a complexed correlated mutation network including a strong correlation in the EG doublet in motif 1 of family 4 UDGa and in the QD doublet in motif 1 of family 1 UNG. Conversion of EG doublet in family 4 Thermus thermophilus UDGa to QD doublet increases the catalytic efficiency by over one hundred-fold and seventeen-fold over the E41Q and G42D single mutation, respectively, rectifying the strong correlation in the doublet. Molecular dynamics simulations suggest that the correlated mutations in the doublet in motif 1 position the catalytic H155 in motif 2 to stabilize the leaving uracilate anion. The integrated approach has important implications in studying enzyme evolution and protein structure and function.
Viral infection and human disease - insights from minimotifs
Kadaveru, Krishna; Vyas, Jay; Schiller, Martin R.
2008-01-01
Short functional peptide motifs cooperate in many molecular functions including protein interactions, protein trafficking, and posttranslational modifications. Viruses exploit these motifs as a principal mechanism for hijacking cells and many motifs are necessary for the viral life-cycle. A virus can accommodate many short motifs in its small genome size providing a plethora of ways for the virus to acquire host molecular machinery. Host enzymes that act on motifs such as kinases, proteases, and lipidation enzymes, as well as protein interaction domains, are commonly mutated in human disease, suggesting that the short peptide motif targets of these enzymes may also be mutated in disease; however, this is not observed. How can we explain why viruses have evolved to be so dependent on motifs, yet these motifs, in general do not seem to be as necessary for human viability? We propose that short motifs are used at the system level. This system architecture allows viruses to exploit a motif, whereas the viability of the host is not affected by mutation of a single motif. PMID:18508672
Overexpression of TRIM25 in Lung Cancer Regulates Tumor Cell Progression.
Qin, Ying; Cui, He; Zhang, Hua
2016-10-01
Lung cancer is one of the most common causes of cancer-related deaths worldwide. Although great efforts and progressions have been made in the study of the lung cancer in the recent decades, the mechanism of lung cancer formation remains elusive. To establish effective therapeutic methods, new targets implied in lung cancer processes have to be identified. Tripartite motif-containing 25 has been associated with ovarian and breast cancer and is thought to positively promote cell growth by targeting the cell cycle. However, whether tripartite motif-containing 25 has a function in lung cancer development remains unknown. In this study, we found that tripartite motif-containing 25 was overexpressed in human lung cancer tissues. Expression of tripartite motif-containing 25 in lung cancer cells is important for cell proliferation and migration. Knockdown of tripartite motif-containing 25 markedly reduced proliferation of lung cancer cells both in vitro and in vivo and reduced migration of lung cancer cells in vitro Meanwhile, tripartite motif-containing 25 silencing also increased the sensitivity of doxorubicin and significantly increased death and apoptosis of lung cancer cells by doxorubicin were achieved with knockdown of tripartite motif-containing 25. We also observed that tripartite motif-containing 25 formed a complex with p53 and mouse double minute 2 homolog (MDM2) in both human lung cancer tissues and in lung cancer cells and tripartite motif-containing 25 silencing increased the expression of p53. These results provide evidence that tripartite motif-containing 25 contributes to the pathogenesis of lung cancer probably by promoting proliferation and migration of lung cancer cells. Therefore, targeting tripartite motif-containing 25 may provide a potential therapeutic intervention for lung cancer. © The Author(s) 2015.
Slabaugh, Erin; Scavuzzo-Duggan, Tess; Chaves, Arielle; Wilson, Liza; Wilson, Carmen; Davis, Jonathan K; Cosgrove, Daniel J; Anderson, Charles T; Roberts, Alison W; Haigler, Candace H
2016-05-01
Cellulose synthases (CESAs) synthesize the β-1,4-glucan chains that coalesce to form cellulose microfibrils in plant cell walls. In addition to a large cytosolic (catalytic) domain, CESAs have eight predicted transmembrane helices (TMHs). However, analogous to the structure of BcsA, a bacterial CESA, predicted TMH5 in CESA may instead be an interfacial helix. This would place the conserved FxVTxK motif in the plant cell cytosol where it could function as a substrate-gating loop as occurs in BcsA. To define the functional importance of the CESA region containing FxVTxK, we tested five parallel mutations in Arabidopsis thaliana CESA1 and Physcomitrella patens CESA5 in complementation assays of the relevant cesa mutants. In both organisms, the substitution of the valine or lysine residues in FxVTxK severely affected CESA function. In Arabidopsis roots, both changes were correlated with lower cellulose anisotropy, as revealed by Pontamine Fast Scarlet. Analysis of hypocotyl inner cell wall layers by atomic force microscopy showed that two altered versions of Atcesa1 could rescue cell wall phenotypes observed in the mutant background line. Overall, the data show that the FxVTxK motif is functionally important in two phylogenetically distant plant CESAs. The results show that Physcomitrella provides an efficient model for assessing the effects of engineered CESA mutations affecting primary cell wall synthesis and that diverse testing systems can lead to nuanced insights into CESA structure-function relationships. Although CESA membrane topology needs to be experimentally determined, the results support the possibility that the FxVTxK region functions similarly in CESA and BcsA. © The Author 2015. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Huang, He; Sarai, Akinori
2012-12-01
The evolvability of proteins is not only restricted by functional and structural importance, but also by other factors such as gene duplication, protein stability, and an organism's robustness. Recently, intrinsically disordered proteins (IDPs)/regions (IDRs) have been suggested to play a role in facilitating protein evolution. However, the mechanisms by which this occurs remain largely unknown. To address this, we have systematically analyzed the relationship between the evolvability, stability, and function of IDPs/IDRs. Evolutionary analysis shows that more recently emerged IDRs have higher evolutionary rates with more functional constraints relaxed (or experiencing more positive selection), and that this may have caused accelerated evolution in the flanking regions and in the whole protein. A systematic analysis of observed stability changes due to single amino acid mutations in IDRs and ordered regions shows that while most mutations induce a destabilizing effect in proteins, mutations in IDRs cause smaller stability changes than in ordered regions. The weaker impact of mutations in IDRs on protein stability may have advantages for protein evolvability in the gain of new functions. Interestingly, however, an analysis of functional motifs in the PROSITE and ELM databases showed that motifs in IDRs are more conserved, characterized by smaller entropy and lower evolutionary rate, than in ordered regions. This apparently opposing evolutionary effect may be partly due to the flexible nature of motifs in IDRs, which require some key amino acid residues to engage in tighter interactions with other molecules. Our study suggests that the unique conformational and thermodynamic characteristics of IDPs/IDRs play an important role in the evolvability of proteins to gain new functions. Copyright © 2012 Elsevier Ltd. All rights reserved.
FPGA implementation of motifs-based neuronal network and synchronization analysis
NASA Astrophysics Data System (ADS)
Deng, Bin; Zhu, Zechen; Yang, Shuangming; Wei, Xile; Wang, Jiang; Yu, Haitao
2016-06-01
Motifs in complex networks play a crucial role in determining the brain functions. In this paper, 13 kinds of motifs are implemented with Field Programmable Gate Array (FPGA) to investigate the relationships between the networks properties and motifs properties. We use discretization method and pipelined architecture to construct various motifs with Hindmarsh-Rose (HR) neuron as the node model. We also build a small-world network based on these motifs and conduct the synchronization analysis of motifs as well as the constructed network. We find that the synchronization properties of motif determine that of motif-based small-world network, which demonstrates effectiveness of our proposed hardware simulation platform. By imitation of some vital nuclei in the brain to generate normal discharges, our proposed FPGA-based artificial neuronal networks have the potential to replace the injured nuclei to complete the brain function in the treatment of Parkinson's disease and epilepsy.
Beusch, Irene; Barraud, Pierre; Moursy, Ahmed; Cléry, Antoine; Allain, Frédéric Hai-Trieu
2017-01-01
HnRNP A1 regulates many alternative splicing events by the recognition of splicing silencer elements. Here, we provide the solution structures of its two RNA recognition motifs (RRMs) in complex with short RNA. In addition, we show by NMR that both RRMs of hnRNP A1 can bind simultaneously to a single bipartite motif of the human intronic splicing silencer ISS-N1, which controls survival of motor neuron exon 7 splicing. RRM2 binds to the upstream motif and RRM1 to the downstream motif. Combining the insights from the structure with in cell splicing assays we show that the architecture and organization of the two RRMs is essential to hnRNP A1 function. The disruption of the inter-RRM interaction or the loss of RNA binding capacity of either RRM impairs splicing repression by hnRNP A1. Furthermore, both binding sites within the ISS-N1 are important for splicing repression and their contributions are cumulative rather than synergistic. DOI: http://dx.doi.org/10.7554/eLife.25736.001 PMID:28650318
Functional Motifs Responsible for Human Metapneumovirus M2-2-mediated Innate Immune Evasion
Chen, Yu; Deng, Xiaoling; Deng, Junfang; Zhou, Jiehua; Ren, Yuping; Liu, Shengxuan; Prusak, Deborah J.; Wood, Thomas G.; Bao, Xiaoyong
2016-01-01
Human metapneumovirus (hMPV) is a major cause of lower respiratory infection in young children. Repeated infections occur throughout life, but its immune evasion mechanisms are largely unknown. We recently found that hMPV M2-2 protein elicits immune evasion by targeting mitochondrial antiviral-signaling protein (MAVS), an antiviral signaling molecule. However, the molecular mechanisms underlying such inhibition are not known. Our mutagenesis studies revealed that PDZ-binding motifs, 29-DEMI-32 and 39-KEALSDGI-46, located in an immune inhibitory region of M2-2, are responsible for M2-2-mediated immune evasion. We also found both motifs prevent TRAF5 and TRAF6, the MAVS downstream adaptors, to be recruited to MAVS, while the motif 39-KEALSDGI-46 also blocks TRAF3 migrating to MAVS. In parallel, these TRAFs are important in activating transcription factors NF-kB and/or IRF-3 by hMPV. Our findings collectively demonstrate that M2-2 uses its PDZ motifs to launch the hMPV immune evasion through blocking the interaction of MAVS and its downstream TRAFs. PMID:27743962
Functional motifs responsible for human metapneumovirus M2-2-mediated innate immune evasion.
Chen, Yu; Deng, Xiaoling; Deng, Junfang; Zhou, Jiehua; Ren, Yuping; Liu, Shengxuan; Prusak, Deborah J; Wood, Thomas G; Bao, Xiaoyong
2016-12-01
Human metapneumovirus (hMPV) is a major cause of lower respiratory infection in young children. Repeated infections occur throughout life, but its immune evasion mechanisms are largely unknown. We recently found that hMPV M2-2 protein elicits immune evasion by targeting mitochondrial antiviral-signaling protein (MAVS), an antiviral signaling molecule. However, the molecular mechanisms underlying such inhibition are not known. Our mutagenesis studies revealed that PDZ-binding motifs, 29-DEMI-32 and 39-KEALSDGI-46, located in an immune inhibitory region of M2-2, are responsible for M2-2-mediated immune evasion. We also found both motifs prevent TRAF5 and TRAF6, the MAVS downstream adaptors, to be recruited to MAVS, while the motif 39-KEALSDGI-46 also blocks TRAF3 migrating to MAVS. In parallel, these TRAFs are important in activating transcription factors NF-kB and/or IRF-3 by hMPV. Our findings collectively demonstrate that M2-2 uses its PDZ motifs to launch the hMPV immune evasion through blocking the interaction of MAVS and its downstream TRAFs. Copyright © 2016 Elsevier Inc. All rights reserved.
Bruce, A. Gregory; Horst, Jeremy A.; Rose, Timothy M.
2016-01-01
The envelope-associated glycoprotein B (gB) is highly conserved within the Herpesviridae and plays a critical role in viral entry. We analyzed the evolutionary conservation of sequence and structural motifs within the Kaposi’s sarcoma-associated herpesvirus (KSHV) gB and homologs of Old World primate rhadinoviruses belonging to the distinct RV1 and RV2 rhadinovirus lineages. In addition to gB homologs of rhadinoviruses infecting the pig-tailed and rhesus macaques, we cloned and sequenced gB homologs of RV1 and RV2 rhadinoviruses infecting chimpanzees. A structural model of the KSHV gB was determined, and functional motifs and sequence variants were mapped to the model structure. Conserved domains and motifs were identified, including an “RGD” motif that plays a critical role in KSHV binding and entry through the cellular integrin αVβ3. The RGD motif was only detected in RV1 rhadinoviruses suggesting an important difference in cell tropism between the two rhadinovirus lineages. PMID:27070755
Velagapudi, Sai Pradeep; Seedhouse, Steven J.; French, Jonathan
2011-01-01
RNA is an important therapeutic target, however, RNA targets are generally underexploited due to a lack of understanding of the small molecules that bind RNA and the RNA motifs that bind small molecules. Herein, we describe the identification of the RNA internal loops derived from a 4096-member 3×3 nucleotide loop library that are the most specific and highest affinity binders to a series of four designer, drug-like benzimidazoles. These studies establish a potentially general protocol to define the highest affinity and most specific RNA motif targets for heterocyclic small molecules. Such information could be used to target functionally important RNAs in genomic sequence. PMID:21604752
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mohr, Georg; Del Campo, Mark; Turner, Kathryn G.
The Saccharomyces cerevisiae DEAD-box protein Mss116p is a general RNA chaperone that functions in splicing mitochondrial group I and group II introns. Recent X-ray crystal structures of Mss116p in complex with ATP analogs and single-stranded RNA show that the helicase core induces a bend in the bound RNA, as in other DEAD-box proteins, while a C-terminal extension (CTE) induces a second bend, resulting in RNA crimping. Here, we illuminate these structures by using high-throughput genetic selections, unigenic evolution, and analyses of in vivo splicing activity to comprehensively identify functionally important regions and permissible amino acid substitutions throughout Mss116p. The functionallymore » important regions include those containing conserved sequence motifs involved in ATP and RNA binding or interdomain interactions, as well as previously unidentified regions, including surface loops that may function in protein-protein interactions. The genetic selections recapitulate major features of the conserved helicase motifs seen in other DEAD-box proteins but also show surprising variations, including multiple novel variants of motif III (SAT). Patterns of amino acid substitutions indicate that the RNA bend induced by the helicase core depends on ionic and hydrogen-bonding interactions with the bound RNA; identify a subset of critically interacting residues; and indicate that the bend induced by the CTE results primarily from a steric block. Finally, we identified two conserved regions - one the previously noted post II region in the helicase core and the other in the CTE - that may help displace or sequester the opposite RNA strand during RNA unwinding.« less
Functional analysis of a viroid RNA motif mediating cell-to-cell movement in Nicotiana benthamiana.
Jiang, Dongmei; Wang, Meng; Li, Shifang
2017-01-01
Cell-to-cell trafficking through different cellular layers is a key process for various RNAs including those of plant viruses and viroids, but the regulatory mechanisms involved are still not fully elucidated and good model systems are important. Here, we analyse the function of a simple RNA motif (termed 'loop19') in potato spindle tuber viroid (PSTVd) which is required for trafficking in Nicotiana benthamiana leaves. Northern blotting, reverse transcriptase PCR (RT-PCR) and in situ hybridization analyses demonstrated that unlike wild-type PSTVd, which was present in the nuclei in all cell types, the trafficking-defective loop19 mutants were visible only in the nuclei of upper epidermal and palisade mesophyll cells, which shows that PSTVd loop19 plays a role in mediating RNA trafficking from palisade to spongy mesophyll cells in N.benthamiana leaves. Our findings and approaches have broad implications for studying the RNA motifs mediating trafficking of RNAs across specific cellular boundaries in other biological systems.
Characterization of a Smad motif similar to Drosophila mad in the mouse Msx 1 promoter.
Alvarez Martinez, Cristina E; Binato, Renata; Gonzalez, Sayonara; Pereira, Monica; Robert, Benoit; Abdelhay, Eliana
2002-03-01
Mouse Msx 1 gene, orthologous of the Drosophila msh, is involved in several developmental processes. BMP family members are major proteins in the regulation of Msx 1 expression. BMP signaling activates Smad 1/5/8 proteins, which associate to Smad 4 before translocating to the nucleus. Analysis of Msx 1 promoter revealed the presence of three elements similar to the consensus established for Mad, the Smad 1 Drosophila counterpart. Notably, such an element was identified in an enhancer important for Msx 1 regulation. Gel shift analysis demonstrated that proteins from 13.5 dpc embryo associate to this enhancer. Remarkably, supershift assays showed that Smad proteins are present in the complex. Purified Smad 1 and 4 also bind to this fragment. We demonstrate that functional binding sites in this enhancer are confined to the Mad motif and flanking region. Our data suggest that this Mad motif may be functional in response to BMP signaling. ©2002 Elsevier Science (USA).
Growth factor pleiotropy is controlled by a receptor Tyr/Ser motif that acts as a binary switch
Guthridge, Mark A; Powell, Jason A; Barry, Emma F; Stomski, Frank C; McClure, Barbara J; Ramshaw, Hayley; Felquer, Fernando A; Dottore, Mara; Thomas, Daniel T; To, Bik; Begley, C Glenn; Lopez, Angel F
2006-01-01
Pleiotropism is a hallmark of cytokines and growth factors; yet, the underlying mechanisms are not clearly understood. We have identified a motif in the granulocyte macrophage-colony-stimulating factor receptor composed of a tyrosine and a serine residue that functions as a binary switch for the independent regulation of multiple biological activities. Signalling occurs either through Ser585 at lower cytokine concentrations, leading to cell survival only, or through Tyr577 at higher cytokine concentrations, leading to cell survival as well as proliferation, differentiation or functional activation. The phosphorylation of Ser585 and Tyr577 is mutually exclusive and occurs via a unidirectional mechanism that involves protein kinase A and tyrosine kinases, respectively, and is deregulated in at least some leukemias. We have identified similar Tyr/Ser motifs in other cell surface receptors, suggesting that such signalling switches may play important roles in generating specificity and pleiotropy in other biological systems. PMID:16437163
Liu, Bingqiang; Zhang, Hanyuan; Zhou, Chuan; Li, Guojun; Fennell, Anne; Wang, Guanghui; Kang, Yu; Liu, Qi; Ma, Qin
2016-08-09
Phylogenetic footprinting is an important computational technique for identifying cis-regulatory motifs in orthologous regulatory regions from multiple genomes, as motifs tend to evolve slower than their surrounding non-functional sequences. Its application, however, has several difficulties for optimizing the selection of orthologous data and reducing the false positives in motif prediction. Here we present an integrative phylogenetic footprinting framework for accurate motif predictions in prokaryotic genomes (MP(3)). The framework includes a new orthologous data preparation procedure, an additional promoter scoring and pruning method and an integration of six existing motif finding algorithms as basic motif search engines. Specifically, we collected orthologous genes from available prokaryotic genomes and built the orthologous regulatory regions based on sequence similarity of promoter regions. This procedure made full use of the large-scale genomic data and taxonomy information and filtered out the promoters with limited contribution to produce a high quality orthologous promoter set. The promoter scoring and pruning is implemented through motif voting by a set of complementary predicting tools that mine as many motif candidates as possible and simultaneously eliminate the effect of random noise. We have applied the framework to Escherichia coli k12 genome and evaluated the prediction performance through comparison with seven existing programs. This evaluation was systematically carried out at the nucleotide and binding site level, and the results showed that MP(3) consistently outperformed other popular motif finding tools. We have integrated MP(3) into our motif identification and analysis server DMINDA, allowing users to efficiently identify and analyze motifs in 2,072 completely sequenced prokaryotic genomes. The performance evaluation indicated that MP(3) is effective for predicting regulatory motifs in prokaryotic genomes. Its application may enhance progress in elucidating transcription regulation mechanism, thus provide benefit to the genomic research community and prokaryotic genome researchers in particular.
DNA nanotechnology based on i-motif structures.
Dong, Yuanchen; Yang, Zhongqiang; Liu, Dongsheng
2014-06-17
CONSPECTUS: Most biological processes happen at the nanometer scale, and understanding the energy transformations and material transportation mechanisms within living organisms has proved challenging. To better understand the secrets of life, researchers have investigated artificial molecular motors and devices over the past decade because such systems can mimic certain biological processes. DNA nanotechnology based on i-motif structures is one system that has played an important role in these investigations. In this Account, we summarize recent advances in functional DNA nanotechnology based on i-motif structures. The i-motif is a DNA quadruplex that occurs as four stretches of cytosine repeat sequences form C·CH(+) base pairs, and their stabilization requires slightly acidic conditions. This unique property has produced the first DNA molecular motor driven by pH changes. The motor is reliable, and studies show that it is capable of millisecond running speeds, comparable to the speed of natural protein motors. With careful design, the output of these types of motors was combined to drive micrometer-sized cantilevers bend. Using established DNA nanostructure assembly and functionalization methods, researchers can easily integrate the motor within other DNA assembled structures and functional units, producing DNA molecular devices with new functions such as suprahydrophobic/suprahydrophilic smart surfaces that switch, intelligent nanopores triggered by pH changes, molecular logic gates, and DNA nanosprings. Recently, researchers have produced motors driven by light and electricity, which have allowed DNA motors to be integrated within silicon-based nanodevices. Moreover, some devices based on i-motif structures have proven useful for investigating processes within living cells. The pH-responsiveness of the i-motif structure also provides a way to control the stepwise assembly of DNA nanostructures. In addition, because of the stability of the i-motif, this structure can serve as the stem of one-dimensional nanowires, and a four-strand stem can provide a new basis for three-dimensional DNA structures such as pillars. By sacrificing some accuracy in assembly, we used these properties to prepare the first fast-responding pure DNA supramolecular hydrogel. This hydrogel does not swell and cannot encapsulate small molecules. These unique properties could lead to new developments in smart materials based on DNA assembly and support important applications in fields such as tissue engineering. We expect that DNA nanotechnology will continue to develop rapidly. At a fundamental level, further studies should lead to greater understanding of the energy transformation and material transportation mechanisms at the nanometer scale. In terms of applications, we expect that many of these elegant molecular devices will soon be used in vivo. These further studies could demonstrate the power of DNA nanotechnology in biology, material science, chemistry, and physics.
Novel functions of CCM1 delimit the relationship of PTB/PH domains.
Zhang, Jun; Dubey, Pallavi; Padarti, Akhil; Zhang, Aileen; Patel, Rinkal; Patel, Vipulkumar; Cistola, David; Badr, Ahmed
2017-10-01
Three NPXY motifs and one FERM domain in CCM1 makes it a versatile scaffold protein for tethering the signaling components together within the CCM signaling complex (CSC). The cellular role of CCM1 protein remains inadequately expounded. Both phosphotyrosine binding (PTB) and pleckstrin homology (PH) domains were recognized as structurally related but functionally distinct domains. By utilizing molecular cloning, protein binding assays and RT-qPCR to identify novel cellular partners of CCM1 and its cellular expression patterns; by screening candidate PTB/PH proteins and subsequently structurally simulation in combining with current X-ray crystallography and NMR data to defined the essential structure of PTB/PH domain for NPXY-binding and the relationship among PTB, PH and FERM domain(s). We identified a group of 28 novel cellular partners of CCM1, all of which contain either PTB or PH domain(s), and developed a novel classification system for these PTB/PH proteins based on their relationship with different NPXY motifs of CCM1. Our results demonstrated that CCM1 has a wide spectrum of binding to different PTB/PH proteins and perpetuates their specificity to interact with certain PTB/PH domains through selective combination of three NPXY motifs. We also demonstrated that CCM1 can be assembled into oligomers through intermolecular interaction between its F3 lobe in FERM domain and one of the three NPXY motifs. Despite being embedded in FERM domain as F3 lobe, F3 module acts as a fully functional PH domain to interact with NPXY motif. The most salient feature of the study was that both PTB and PH domains are structurally and functionally comparable, suggesting that PTB domain is likely evolved from PH domain with polymorphic structural additions at its N-terminus. A new β1A-strand of the PTB domain was discovered and new minimum structural requirement of PTB/PH domain for NPXY motif-binding was determined. Based on our data, a novel theory of structure, function and relationship of PTB, PH and FERM domains has been proposed, which extends the importance of the NPXY-PTB/PH interaction on the CSC signaling and/or other cell receptors with great potential pointing to new therapeutic strategies. The study provides new insight into the structural characteristics of PTB/PH domains, essential structural elements of PTB/PH domain required for NPXY motif-binding, and function and relationship among PTB, PH and FERM domains. Copyright © 2017 Elsevier B.V. All rights reserved.
Chung, Lawton K; Philip, Naomi H; Schmidt, Valentina A; Koller, Antonius; Strowig, Till; Flavell, Richard A; Brodsky, Igor E; Bliska, James B
2014-07-01
YopM is a leucine-rich repeat (LRR)-containing effector in several Yersinia species, including Yersinia pestis and Y. pseudotuberculosis. Different Yersinia strains encode distinct YopM isoforms with variable numbers of LRRs but conserved C-terminal tails. A 15-LRR isoform in Y. pseudotuberculosis YPIII was recently shown to bind and inhibit caspase-1 via a YLTD motif in LRR 10, and attenuation of YopM(-) YPIII was reversed in mice lacking caspase-1, indicating that caspase-1 inhibition is a major virulence function of YopM(YPIII). To determine if other YopM proteins inhibit caspase-1, we utilized Y. pseudotuberculosis strains natively expressing a 21-LRR isoform lacking the YLTD motif (YopM(32777)) or ectopically expressing a Y. pestis 15-LRR version with a functional (YopM(KIM)) or inactivated (YopM(KIM) D271A) YLTD motif. Results of mouse and macrophage infections with these strains showed that YopM(32777), YopM(KIM), and YopM(KIM) D271A inhibit caspase-1 activation, indicating that the YLTD motif is dispensable for this activity. Analysis of YopM(KIM) deletion variants revealed that LRRs 6 to 15 and the C-terminal tail are required to inhibit caspase-1 activation. YopM(32777), YopM(KIM), and YopM(KIM) deletion variants were purified, and binding partners in macrophage lysates were identified. Caspase-1 bound to YopM(KIM) but not YopM(32777). Additionally, YopM(KIM) bound IQGAP1 and the use of Iqgap1(-/-) macrophages revealed that this scaffolding protein is important for caspase-1 activation upon infection with YopM(-) Y. pseudotuberculosis. Thus, while multiple YopM isoforms inhibit caspase-1 activation, their variable LRR domains bind different host proteins to perform this function and the LRRs of YopM(KIM) target IQGAP1, a novel regulator of caspase-1, in macrophages. Importance: Activation of caspase-1, mediated by macromolecular complexes termed inflammasomes, is important for innate immune defense against pathogens. Pathogens can, in turn, subvert caspase-1-dependent responses through the action of effector proteins. For example, the Yersinia effector YopM inhibits caspase-1 activation by arresting inflammasome formation. This caspase-1 inhibitory activity has been studied in a specific YopM isoform, and in this case, the protein was shown to act as a pseudosubstrate to bind and inhibit caspase-1. Different Yersinia strains encode distinct YopM isoforms, many of which lack the pseudosubstrate motif. We studied additional isoforms and found that these YopM proteins inhibit caspase-1 activation independently of a pseudosubstrate motif. We also identified IQGAP1 as a novel binding partner of the Yersinia pestis YopM(KIM) isoform and demonstrated that IQGAP1 is important for caspase-1 activation in macrophages infected with Yersinia. Thus, this study reveals new insights into inflammasome regulation during Yersinia infection. Copyright © 2014 Chung et al.
Finding specific RNA motifs: Function in a zeptomole world?
KNIGHT, ROB; YARUS, MICHAEL
2003-01-01
We have developed a new method for estimating the abundance of any modular (piecewise) RNA motif within a longer random region. We have used this method to estimate the size of the active motifs available to modern SELEX experiments (picomoles of unique sequences) and to a plausible RNA World (zeptomoles of unique sequences: 1 zmole = 602 sequences). Unexpectedly, activities such as specific isoleucine binding are almost certainly present in zeptomoles of molecules, and even ribozymes such as self-cleavage motifs may appear (depending on assumptions about the minimal structures). The number of specified nucleotides is not the only important determinant of a motif’s rarity: The number of modules into which it is divided, and the details of this division, are also crucial. We propose three maxims for easily isolated motifs: the Maxim of Minimization, the Maxim of Multiplicity, and the Maxim of the Median. These maxims together state that selected motifs should be small and composed of as many separate, equally sized modules as possible. For evenly divided motifs with four modules, the largest accessible activity in picomole scale (1–1000 pmole) pools of length 100 is about 34 nucleotides; while for zeptomole scale (1–1000 zmole) pools it is about 20 specific nucleotides (50% probability of occurrence). This latter figure includes some ribozymes and aptamers. Consequently, an RNA metabolism apparently could have begun with only zeptomoles of RNA molecules. PMID:12554865
Contributions of vitamin D response elements and HLA promoters to multiple sclerosis risk.
Nolan, David; Castley, Alison; Tschochner, Monika; James, Ian; Qiu, Wei; Sayer, David; Christiansen, Frank T; Witt, Campbell; Mastaglia, Frank; Carroll, William; Kermode, Allan
2012-08-07
The identification of a vitamin D-responsive (VDRE) motif within the HLA-DRB1*15:01 promoter region provides an attractive explanation for the combined effects of HLA-DR inheritance and vitamin D exposure on multiple sclerosis (MS) risk. We therefore sought to incorporate HLA-DRB1 promoter variation, including the VDRE motif, in an assessment of HLA-DRB1-associated MS risk. We utilized 32 homozygous HLA cell lines (covering 17 DRB1 alleles) and 53 heterozygote MS samples (20 DRB1 alleles) for HLA-DRB1 promoter sequencing. The influence of HLA-DRB1 variation on MS risk was then assessed among 466 MS cases and 498 controls. The majority of HLA*DRB1 alleles (including HLA-DRB1*15:01) express the functional VDRE motif, apart from HLA-DRB1*04, *07, and *09 alleles that comprise the HLA-DR53 serologic group. Allele-specific variation within functional X-box and Y-box motifs was also associated with serologically defined HLA-DR haplotypes. Incorporating these results in an analysis of MS risk, we identified a strong protective effect of HLA-DRB1*04, *07, and *09 (DR53) alleles (p = 10(-12)) and elevated risk associated with DRB1*15 and *16 (DR51) and *08 (DR8) alleles (p < 10(-18)). HLA-DRB1 groups corresponding to serologic HLA-DR profiles as well as promoter polymorphism haplotypes effectively stratified MS risk over an 11-fold range, suggesting functional relationships between risk-modifying HLA-DRB1 alleles. An independent contribution of VDRE motif variation to increase MS risk was not discernible, although vitamin D-dependent regulation of HLA-DR expression may still play an important role given that HLA-DRB1*04/*07/*09 (DR53) alleles that express the "nonresponsive" VDRE motif were associated with significantly reduced risk of MS.
Deciphering functional glycosaminoglycan motifs in development.
Townley, Robert A; Bülow, Hannes E
2018-03-23
Glycosaminoglycans (GAGs) such as heparan sulfate, chondroitin/dermatan sulfate, and keratan sulfate are linear glycans, which when attached to protein backbones form proteoglycans. GAGs are essential components of the extracellular space in metazoans. Extensive modifications of the glycans such as sulfation, deacetylation and epimerization create structural GAG motifs. These motifs regulate protein-protein interactions and are thereby repsonsible for many of the essential functions of GAGs. This review focusses on recent genetic approaches to characterize GAG motifs and their function in defined signaling pathways during development. We discuss a coding approach for GAGs that would enable computational analyses of GAG sequences such as alignments and the computation of position weight matrices to describe GAG motifs. Copyright © 2018 Elsevier Ltd. All rights reserved.
qPMS9: An Efficient Algorithm for Quorum Planted Motif Search
NASA Astrophysics Data System (ADS)
Nicolae, Marius; Rajasekaran, Sanguthevar
2015-01-01
Discovering patterns in biological sequences is a crucial problem. For example, the identification of patterns in DNA sequences has resulted in the determination of open reading frames, identification of gene promoter elements, intron/exon splicing sites, and SH RNAs, location of RNA degradation signals, identification of alternative splicing sites, etc. In protein sequences, patterns have led to domain identification, location of protease cleavage sites, identification of signal peptides, protein interactions, determination of protein degradation elements, identification of protein trafficking elements, discovery of short functional motifs, etc. In this paper we focus on the identification of an important class of patterns, namely, motifs. We study the (l, d) motif search problem or Planted Motif Search (PMS). PMS receives as input n strings and two integers l and d. It returns all sequences M of length l that occur in each input string, where each occurrence differs from M in at most d positions. Another formulation is quorum PMS (qPMS), where the motif appears in at least q% of the strings. We introduce qPMS9, a parallel exact qPMS algorithm that offers significant runtime improvements on DNA and protein datasets. qPMS9 solves the challenging DNA (l, d)-instances (28, 12) and (30, 13). The source code is available at https://code.google.com/p/qpms9/.
Wiese, Claudia; Hinz, John M; Tebbs, Robert S; Nham, Peter B; Urbin, Salustra S; Collins, David W; Thompson, Larry H; Schild, David
2006-01-01
In vertebrates, homologous recombinational repair (HRR) requires RAD51 and five RAD51 paralogs (XRCC2, XRCC3, RAD51B, RAD51C and RAD51D) that all contain conserved Walker A and B ATPase motifs. In human RAD51D we examined the requirement for these motifs in interactions with XRCC2 and RAD51C, and for survival of cells in response to DNA interstrand crosslinks (ICLs). Ectopic expression of wild-type human RAD51D or mutants having a non-functional A or B motif was used to test for complementation of a rad51d knockout hamster CHO cell line. Although A-motif mutants complement very efficiently, B-motif mutants do not. Consistent with these results, experiments using the yeast two- and three-hybrid systems show that the interactions between RAD51D and its XRCC2 and RAD51C partners also require a functional RAD51D B motif, but not motif A. Similarly, hamster Xrcc2 is unable to bind to the non-complementing human RAD51D B-motif mutants in co-immunoprecipitation assays. We conclude that a functional Walker B motif, but not A motif, is necessary for RAD51D's interactions with other paralogs and for efficient HRR. We present a model in which ATPase sites are formed in a bipartite manner between RAD51D and other RAD51 paralogs.
Di Bartolomeo, Francesca; Doan, Kim Nguyen; Athenstaedt, Karin; Becker, Thomas; Daum, Günther
2017-07-01
In the yeast Saccharomyces cerevisiae, the mitochondrial phosphatidylserine decarboxylase 1 (Psd1p) produces the largest amount of cellular phosphatidylethanolamine (PE). Psd1p is synthesized as a larger precursor on cytosolic ribosomes and then imported into mitochondria in a three-step processing event leading to the formation of an α-subunit and a β-subunit. The α-subunit harbors a highly conserved motif, which was proposed to be involved in phosphatidylserine (PS) binding. Here, we present a molecular analysis of this consensus motif for the function of Psd1p by using Psd1p variants bearing either deletions or point mutations in this region. Our data show that mutations in this motif affect processing and stability of Psd1p, and consequently the enzyme's activity. Thus, we conclude that this consensus motif is essential for structural integrity and processing of Psd1p. Copyright © 2017 Elsevier B.V. All rights reserved.
Marty, Naomi J.; Teresinski, Howard J.; Hwang, Yeen Ting; Clendening, Eric A.; Gidda, Satinder K.; Sliwinska, Elwira; Zhang, Daiyuan; Miernyk, Ján A.; Brito, Glauber C.; Andrews, David W.; Dyer, John M.; Mullen, Robert T.
2014-01-01
Tail-anchored (TA) proteins are a unique class of functionally diverse membrane proteins defined by their single C-terminal membrane-spanning domain and their ability to insert post-translationally into specific organelles with an Ncytoplasm-Corganelle interior orientation. The molecular mechanisms by which TA proteins are sorted to the proper organelles are not well-understood. Herein we present results indicating that a dibasic targeting motif (i.e., -R-R/K/H-X{X≠E}) identified previously in the C terminus of the mitochondrial isoform of the TA protein cytochrome b5, also exists in many other A. thaliana outer mitochondrial membrane (OMM)-TA proteins. This motif is conspicuously absent, however, in all but one of the TA protein subunits of the translocon at the outer membrane of mitochondria (TOM), suggesting that these two groups of proteins utilize distinct biogenetic pathways. Consistent with this premise, we show that the TA sequences of the dibasic-containing proteins are both necessary and sufficient for targeting to mitochondria, and are interchangeable, while the TA regions of TOM proteins lacking a dibasic motif are necessary, but not sufficient for localization, and cannot be functionally exchanged. We also present results from a comprehensive mutational analysis of the dibasic motif and surrounding sequences that not only greatly expands the functional definition and context-dependent properties of this targeting signal, but also led to the identification of other novel putative OMM-TA proteins. Collectively, these results provide important insight to the complexity of the targeting pathways involved in the biogenesis of OMM-TA proteins and help define a consensus targeting motif that is utilized by at least a subset of these proteins. PMID:25237314
A structural-alphabet-based strategy for finding structural motifs across protein families
Wu, Chih Yuan; Chen, Yao Chi; Lim, Carmay
2010-01-01
Proteins with insignificant sequence and overall structure similarity may still share locally conserved contiguous structural segments; i.e. structural/3D motifs. Most methods for finding 3D motifs require a known motif to search for other similar structures or functionally/structurally crucial residues. Here, without requiring a query motif or essential residues, a fully automated method for discovering 3D motifs of various sizes across protein families with different folds based on a 16-letter structural alphabet is presented. It was applied to structurally non-redundant proteins bound to DNA, RNA, obligate/non-obligate proteins as well as free DNA-binding proteins (DBPs) and proteins with known structures but unknown function. Its usefulness was illustrated by analyzing the 3D motifs found in DBPs. A non-specific motif was found with a ‘corner’ architecture that confers a stable scaffold and enables diverse interactions, making it suitable for binding not only DNA but also RNA and proteins. Furthermore, DNA-specific motifs present ‘only’ in DBPs were discovered. The motifs found can provide useful guidelines in detecting binding sites and computational protein redesign. PMID:20525797
Ankyrin-repeat containing proteins of microbes: a conserved structure with functional diversity
Al-Khodor, Souhaila; Price, Christopher T.; Kalia, Awdhesh; Kwaik, Yousef Abu
2009-01-01
Summary The ankyrin repeat (ANK) is the most common protein-protein interaction motif in nature and predominantly found in eukaryotic proteins. The genome sequencing of various pathogenic or symbiotic bacteria and eukaryotic viruses identified numerous genes encoding ANK-containing proteins that were proposed to have been acquired from eukaryotes by horizontal gene transfer. However, the recent discovery of additional ANK-containing proteins encoded in the genomes of archaea and free-living bacteria suggests either a more ancient origin of the ANK motif or multiple convergent evolution events. Many bacterial pathogens employ various types of secretion systems to deliver ANK-containing proteins into eukaryotic cells where they mimic or manipulate various host functions. Understanding the molecular and biochemical functions of this family of proteins will enhance our understanding of important host-microbe interactions. PMID:19962898
Computational mining for hypothetical patterns of amino acid side chains in protein data bank (PDB)
NASA Astrophysics Data System (ADS)
Ghani, Nur Syatila Ab; Firdaus-Raih, Mohd
2018-04-01
The three-dimensional structure of a protein can provide insights regarding its function. Functional relationship between proteins can be inferred from fold and sequence similarities. In certain cases, sequence or fold comparison fails to conclude homology between proteins with similar mechanism. Since the structure is more conserved than the sequence, a constellation of functional residues can be similarly arranged among proteins of similar mechanism. Local structural similarity searches are able to detect such constellation of amino acids among distinct proteins, which can be useful to annotate proteins of unknown function. Detection of such patterns of amino acids on a large scale can increase the repertoire of important 3D motifs since available known 3D motifs currently, could not compensate the ever-increasing numbers of uncharacterized proteins to be annotated. Here, a computational platform for an automated detection of 3D motifs is described. A fuzzy-pattern searching algorithm derived from IMagine an Amino Acid 3D Arrangement search EnGINE (IMAAAGINE) was implemented to develop an automated method for searching of hypothetical patterns of amino acid side chains in Protein Data Bank (PDB), without the need for prior knowledge on related sequence or structure of pattern of interest. We present an example of the searches, which is the detection of a hypothetical pattern derived from known structural motif of C2H2 structural pattern from zinc fingers. The conservation of particular patterns of amino acid side chains in unrelated proteins is highlighted. This approach can act as a complementary method for available structure- and sequence-based platforms and may contribute in improving functional association between proteins.
A new subfamily LIP of the major intrinsic proteins.
Khabudaev, Kirill Vladimirovich; Petrova, Darya Petrovna; Grachev, Mikhail Aleksandrovich; Likhoshway, Yelena Valentinovna
2014-03-04
Proteins of the major intrinsic protein (MIP) family, or aquaporins, have been detected in almost all organisms. These proteins are important in cells and organisms because they allow for passive transmembrane transport of water and other small, uncharged polar molecules. We compared the predicted amino acid sequences of 20 MIPs from several algae species of the phylum Heterokontophyta (Kingdom Chromista) with the sequences of MIPs from other organisms. Multiple sequence alignments revealed motifs that were homologous to functionally important NPA motifs and the so-called ar/R-selective filter of glyceroporins and aquaporins. The MIP sequences of the studied chromists fell into several clusters that belonged to different groups of MIPs from a wide variety of organisms from different Kingdoms. Two of these proteins belong to Plasma membrane intrinsic proteins (PIPs), four of them belong to GlpF-like intrinsic proteins (GIPs), and one of them belongs to a specific MIPE subfamily from green algae. Three proteins belong to the unclassified MIPs, two of which are of bacterial origin. Eight of the studied MIPs contain an NPM-motif in place of the second conserved NPA-motif typical of the majority of MIPs. The MIPs of heterokonts within all detected clusters can differ from other MIPs in the same cluster regarding the structure of the ar/R-selective filter and other generally conserved motifs. We proposed placing nine MIPs from heterokonts into a new group, which we have named the LIPs (large intrinsic proteins). The possible substrate specificities of the studied MIPs are discussed.
Alamo, Lorenzo; Pinto, Antonio; Sulbarán, Guidenn; Mavárez, Jesús; Padrón, Raúl
2017-09-04
Tarantula's leg muscle thick filament is the ideal model for the study of the structure and function of skeletal muscle thick filaments. Its analysis has given rise to a series of structural and functional studies, leading, among other things, to the discovery of the myosin interacting-heads motif (IHM). Further electron microscopy (EM) studies have shown the presence of IHM in frozen-hydrated and negatively stained thick filaments of striated, cardiac, and smooth muscle of bilaterians, most showing the IHM parallel to the filament axis. EM studies on negatively stained heavy meromyosin of different species have shown the presence of IHM on sponges, animals that lack muscle, extending the presence of IHM to metazoans. The IHM evolved about 800 MY ago in the ancestor of Metazoa, and independently with functional differences in the lineage leading to the slime mold Dictyostelium discoideum (Mycetozoa). This motif conveys important functional advantages, such as Ca 2+ regulation and ATP energy-saving mechanisms. Recent interest has focused on human IHM structure in order to understand the structural basis underlying various conditions and situations of scientific and medical interest: the hypertrophic and dilated cardiomyopathies, overfeeding control, aging and hormone deprival muscle weakness, drug design for schistosomiasis control, and conditioning exercise physiology for the training of power athletes.
Yasuhiko, Yukuto; Shiokawa, Koichiro; Mochizuki, Toshio; Asashima, Makoto; Yokoyama, Takahiko
2006-04-01
The homozygous inv (inversion of embryonic turning) mouse mutant shows situs inversus and polycystic kidney disease, both of which result from the lack of the inv gene. Previously, we suggested that inv may be important for the left-right axis formation, not only in mice but also in Xenopus, and that calmodulin regulates this inv protein function. Here, we isolated and characterized two Xenopus laevis homologs (Xinv-1 and Xinv-2) of the mouse inv gene, and performed functional analysis of the conserved IQ motifs that interact with calmodulin. Xinv-1 expresses early in development in the same manner as mouse inv does. Unexpectedly, a full-length Xenopus inv mRNA did not randomize cardiac orientation when injected into Xenopus embryos, which is different from mouse inv mRNA. Contrary to mouse inv mRNA, Xenopus inv mRNA with mutated IQ randomized cardiac orientation. The present study indicates that calmodulin binding sites (IQ motifs) are crucial in controlling the biological activity of both mouse and Xenopus inv proteins. Although mouse and Xenopus inv genes have a quite similar structure, the interaction with calmodulin and IQ motifs of Xenopus inv and mouse inv proteins may regulate their function in different ways.
Sun, Eric I; Leyn, Semen A; Kazanov, Marat D; Saier, Milton H; Novichkov, Pavel S; Rodionov, Dmitry A
2013-09-02
In silico comparative genomics approaches have been efficiently used for functional prediction and reconstruction of metabolic and regulatory networks. Riboswitches are metabolite-sensing structures often found in bacterial mRNA leaders controlling gene expression on transcriptional or translational levels.An increasing number of riboswitches and other cis-regulatory RNAs have been recently classified into numerous RNA families in the Rfam database. High conservation of these RNA motifs provides a unique advantage for their genomic identification and comparative analysis. A comparative genomics approach implemented in the RegPredict tool was used for reconstruction and functional annotation of regulons controlled by RNAs from 43 Rfam families in diverse taxonomic groups of Bacteria. The inferred regulons include ~5200 cis-regulatory RNAs and more than 12000 target genes in 255 microbial genomes. All predicted RNA-regulated genes were classified into specific and overall functional categories. Analysis of taxonomic distribution of these categories allowed us to establish major functional preferences for each analyzed cis-regulatory RNA motif family. Overall, most RNA motif regulons showed predictable functional content in accordance with their experimentally established effector ligands. Our results suggest that some RNA motifs (including thiamin pyrophosphate and cobalamin riboswitches that control the cofactor metabolism) are widespread and likely originated from the last common ancestor of all bacteria. However, many more analyzed RNA motifs are restricted to a narrow taxonomic group of bacteria and likely represent more recent evolutionary innovations. The reconstructed regulatory networks for major known RNA motifs substantially expand the existing knowledge of transcriptional regulation in bacteria. The inferred regulons can be used for genetic experiments, functional annotations of genes, metabolic reconstruction and evolutionary analysis. The obtained genome-wide collection of reference RNA motif regulons is available in the RegPrecise database (http://regprecise.lbl.gov/).
Identification of 15 candidate structured noncoding RNA motifs in fungi by comparative genomics.
Li, Sanshu; Breaker, Ronald R
2017-10-13
With the development of rapid and inexpensive DNA sequencing, the genome sequences of more than 100 fungal species have been made available. This dataset provides an excellent resource for comparative genomics analyses, which can be used to discover genetic elements, including noncoding RNAs (ncRNAs). Bioinformatics tools similar to those used to uncover novel ncRNAs in bacteria, likewise, should be useful for searching fungal genomic sequences, and the relative ease of genetic experiments with some model fungal species could facilitate experimental validation studies. We have adapted a bioinformatics pipeline for discovering bacterial ncRNAs to systematically analyze many fungal genomes. This comparative genomics pipeline integrates information on conserved RNA sequence and structural features with alternative splicing information to reveal fungal RNA motifs that are candidate regulatory domains, or that might have other possible functions. A total of 15 prominent classes of structured ncRNA candidates were identified, including variant HDV self-cleaving ribozyme representatives, atypical snoRNA candidates, and possible structured antisense RNA motifs. Candidate regulatory motifs were also found associated with genes for ribosomal proteins, S-adenosylmethionine decarboxylase (SDC), amidase, and HexA protein involved in Woronin body formation. We experimentally confirm that the variant HDV ribozymes undergo rapid self-cleavage, and we demonstrate that the SDC RNA motif reduces the expression of SAM decarboxylase by translational repression. Furthermore, we provide evidence that several other motifs discovered in this study are likely to be functional ncRNA elements. Systematic screening of fungal genomes using a computational discovery pipeline has revealed the existence of a variety of novel structured ncRNAs. Genome contexts and similarities to known ncRNA motifs provide strong evidence for the biological and biochemical functions of some newly found ncRNA motifs. Although initial examinations of several motifs provide evidence for their likely functions, other motifs will require more in-depth analysis to reveal their functions.
Kawaguchi, Tsutomu; Komatsu, Shuhei; Ichikawa, Daisuke; Hirajima, Shoji; Nishimura, Yukihisa; Konishi, Hirotaka; Shiozaki, Atsushi; Fujiwara, Hitoshi; Okamoto, Kazuma; Tsuda, Hitoshi; Otsuji, Eigo
2017-06-01
Recent studies have shown that some members of the tripartite motif-containing protein family function as important regulators for carcinogenesis. In this study, we investigated whether tripartite motif-containing protein 44 acts as a cancer-promoting gene through its overexpression in esophageal squamous cell carcinoma. We analyzed esophageal squamous cell carcinoma cell lines to evaluate malignant potential and also analyzed 68 primary tumors to evaluate clinical relevance of tripartite motif-containing protein 44 protein in esophageal squamous cell carcinoma patients. Expression of the tripartite motif-containing protein 44 protein was detected in esophageal squamous cell carcinoma cell lines (8/14 cell lines; 57%) and primary tumor samples of esophageal squamous cell carcinoma (39/68 cases; 57%). Knockdown of tripartite motif-containing protein 44 expression in esophageal squamous cell carcinoma cells using several specific small interfering RNAs inhibited cell migration and invasion, but not cell proliferation. Immunohistochemical analysis demonstrated that the overexpression of the tripartite motif-containing protein 44 protein in the tumor infiltrated region was associated with the status of lymph node metastasis ( p = 0.049), and the overall survival rates were significantly worse among patients with tripartite motif-containing protein 44-overexpressing tumors than those with non-expressing tumors ( p = 0.029). Moreover, multivariate Cox regression model identified that overexpression of the tripartite motif-containing protein 44 protein was an independent worse prognostic factor (hazard ratio = 2.815; p = 0.041), as well as lymphatic invasion (hazard ratio = 2.735; p = 0.037). These results suggest that tripartite motif-containing protein 44 protein could play a crucial role in tumor invasion through its overexpression and highlight its usefulness as a predictor and potential therapeutic target in esophageal squamous cell carcinoma.
The snoRNA domain of vertebrate telomerase RNA functions to localize the RNA within the nucleus.
Lukowiak, A A; Narayanan, A; Li, Z H; Terns, R M; Terns, M P
2001-01-01
Telomerase RNA is an essential component of the ribonucleoprotein enzyme involved in telomere length maintenance, a process implicated in cellular senescence and cancer. Vertebrate telomerase RNAs contain a box H/ACA snoRNA motif that is not required for telomerase activity in vitro but is essential in vivo. Using the Xenopus oocyte system, we have found that the box H/ACA motif functions in the subcellular localization of telomerase RNA. We have characterized the transport and biogenesis of telomerase RNA by injecting labeled wild-type and variant RNAs into Xenopus oocytes and assaying nucleocytoplasmic distribution, intranuclear localization, modification, and protein binding. Although yeast telomerase RNA shares characteristics of spliceosomal snRNAs, we show that human telomerase RNA is not associated with Sm proteins or efficiently imported into the nucleus. In contrast, the transport properties of vertebrate telomerase RNA resemble those of snoRNAs; telomerase RNA is retained in the nucleus and targeted to nucleoli. Furthermore, both nuclear retention and nucleolar localization depend on the box H/ACA motif. Our findings suggest that the H/ACA motif confers functional localization of vertebrate telomerase RNAs to the nucleus, the compartment where telomeres are synthesized. We have also found that telomerase RNA localizes to Cajal bodies, intranuclear structures where it is thought that assembly of various cellular RNPs takes place. Our results identify the Cajal body as a potential site of telomerase RNP biogenesis. PMID:11780638
Erceg, Jelena; Saunders, Timothy E.; Girardot, Charles; Devos, Damien P.; Hufnagel, Lars; Furlong, Eileen E. M.
2014-01-01
Deciphering the specific contribution of individual motifs within cis-regulatory modules (CRMs) is crucial to understanding how gene expression is regulated and how this process is affected by sequence variation. But despite vast improvements in the ability to identify where transcription factors (TFs) bind throughout the genome, we are limited in our ability to relate information on motif occupancy to function from sequence alone. Here, we engineered 63 synthetic CRMs to systematically assess the relationship between variation in the content and spacing of motifs within CRMs to CRM activity during development using Drosophila transgenic embryos. In over half the cases, very simple elements containing only one or two types of TF binding motifs were capable of driving specific spatio-temporal patterns during development. Different motif organizations provide different degrees of robustness to enhancer activity, ranging from binary on-off responses to more subtle effects including embryo-to-embryo and within-embryo variation. By quantifying the effects of subtle changes in motif organization, we were able to model biophysical rules that explain CRM behavior and may contribute to the spatial positioning of CRM activity in vivo. For the same enhancer, the effects of small differences in motif positions varied in developmentally related tissues, suggesting that gene expression may be more susceptible to sequence variation in one tissue compared to another. This result has important implications for human eQTL studies in which many associated mutations are found in cis-regulatory regions, though the mechanism for how they affect tissue-specific gene expression is often not understood. PMID:24391522
Solution structure of CEH-37 homeodomain of the nematode Caenorhabditis elegans
DOE Office of Scientific and Technical Information (OSTI.GOV)
Moon, Sunjin; Lee, Yong Woo; Kim, Woo Taek
Highlights: •We have determined solution structures of CEH-37 homedomain. •CEH-37 HD has a compact α-helical structure with HTH DNA binding motif. •Solution structure of CEH-37 HD shares its molecular topology with that of the homeodomain proteins. •Residues in the N-terminal region and HTH motif are important in binding to Caenorhabditis elegans telomeric DNA. •CEH-37 could play an important role in telomere function via DNA binding. -- Abstract: The nematode Caenorhabditis elegans protein CEH-37 belongs to the paired OTD/OTX family of homeobox-containing homeodomain proteins. CEH-37 shares sequence similarity with homeodomain proteins, although it specifically binds to double-stranded C. elegans telomeric DNA,more » which is unusual to homeodomain proteins. Here, we report the solution structure of CEH-37 homeodomain and molecular interaction with double-stranded C. elegans telomeric DNA using nuclear magnetic resonance (NMR) spectroscopy. NMR structure shows that CEH-37 homeodomain is composed of a flexible N-terminal region and three α-helices with a helix-turn-helix (HTH) DNA binding motif. Data from size-exclusion chromatography and fluorescence spectroscopy reveal that CEH-37 homeodomain interacts strongly with double-stranded C. elegans telomeric DNA. NMR titration experiments identified residues responsible for specific binding to nematode double-stranded telomeric DNA. These results suggest that C. elegans homeodomain protein, CEH-37 could play an important role in telomere function via DNA binding.« less
NASA Astrophysics Data System (ADS)
Krishnan, Gopi; Verheijen, Marcel A.; Ten Brink, Gert H.; Palasantzas, George; Kooi, Bart J.
2013-05-01
Nowadays bimetallic nanoparticles (NPs) have emerged as key materials for important modern applications in nanoplasmonics, catalysis, biodiagnostics, and nanomagnetics. Consequently the control of bimetallic structural motifs with specific shapes provides increasing functionality and selectivity for related applications. However, producing bimetallic NPs with well controlled structural motifs still remains a formidable challenge. Hence, we present here a general methodology for gas phase synthesis of bimetallic NPs with distinctively different structural motifs ranging at a single particle level from a fully mixed alloy to core-shell, to onion (multi-shell), and finally to a Janus/dumbbell, with the same overall particle composition. These concepts are illustrated for Mo-Cu NPs, where the precise control of the bimetallic NPs with various degrees of chemical ordering, including different shapes from spherical to cube, is achieved by tailoring the energy and thermal environment that the NPs experience during their production. The initial state of NP growth, either in the liquid or in the solid state phase, has important implications for the different structural motifs and shapes of synthesized NPs. Finally we demonstrate that we are able to tune the alloying regime, for the otherwise bulk immiscible Mo-Cu, by achieving an increase of the critical size, below which alloying occurs, closely up to an order of magnitude. It is discovered that the critical size of the NP alloy is not only affected by controlled tuning of the alloying temperature but also by the particle shape.Nowadays bimetallic nanoparticles (NPs) have emerged as key materials for important modern applications in nanoplasmonics, catalysis, biodiagnostics, and nanomagnetics. Consequently the control of bimetallic structural motifs with specific shapes provides increasing functionality and selectivity for related applications. However, producing bimetallic NPs with well controlled structural motifs still remains a formidable challenge. Hence, we present here a general methodology for gas phase synthesis of bimetallic NPs with distinctively different structural motifs ranging at a single particle level from a fully mixed alloy to core-shell, to onion (multi-shell), and finally to a Janus/dumbbell, with the same overall particle composition. These concepts are illustrated for Mo-Cu NPs, where the precise control of the bimetallic NPs with various degrees of chemical ordering, including different shapes from spherical to cube, is achieved by tailoring the energy and thermal environment that the NPs experience during their production. The initial state of NP growth, either in the liquid or in the solid state phase, has important implications for the different structural motifs and shapes of synthesized NPs. Finally we demonstrate that we are able to tune the alloying regime, for the otherwise bulk immiscible Mo-Cu, by achieving an increase of the critical size, below which alloying occurs, closely up to an order of magnitude. It is discovered that the critical size of the NP alloy is not only affected by controlled tuning of the alloying temperature but also by the particle shape. Electronic supplementary information (ESI) available: Experimental details including schematics of the gas phase synthesis set up, target arrangement, synthesis condition for various structures, and TEM images of alloy, core-shell and Mo-Cu-Mo onion nanoparticles. See DOI: 10.1039/c3nr00565h
Mapping the nuclear localization signal in the matrix protein of potato yellow dwarf virus.
Anderson, Gavin; Jang, Chanyong; Wang, Renyuan; Goodin, Michael
2018-05-01
The ability of the matrix (M) protein of potato yellow dwarf virus (PYDV) to remodel nuclear membranes is controlled by a di-leucine motif located at residues 223 and 224 of its primary structure. This function can be uncoupled from that of its nuclear localization signal (NLS), which is controlled primarily by lysine and arginine residues immediately downstream of the LL motif. In planta localization of green fluorescent protein fusions, bimolecular fluorescence complementation assays with nuclear import receptor importin-α1 and yeast-based nuclear import assays provided three independent experimental approaches to validate the authenticity of the M-NLS. The carboxy terminus of M is predicted to contain a nuclear export signal, which is belived to be functional, given the ability of M to bind the Arabidopsis nuclear export receptor 1 (XPO1). The nuclear shuttle activity of M has implications for the cell-to-cell movement of PYDV nucleocapsids, based upon its interaction with the N and Y proteins.
Velagapudi, Sai Pradeep; Disney, Matthew D
2013-10-15
RNA is an extremely important target for the development of chemical probes of function or small molecule therapeutics. Aminoglycosides are the most well studied class of small molecules to target RNA. However, the RNA motifs outside of the bacterial rRNA A-site that are likely to be bound by these compounds in biological systems is largely unknown. If such information were known, it could allow for aminoglycosides to be exploited to target other RNAs and, in addition, could provide invaluable insights into potential bystander targets of these clinically used drugs. We utilized two-dimensional combinatorial screening (2DCS), a library-versus-library screening approach, to select the motifs displayed in a 3×3 nucleotide internal loop library and in a 6-nucleotide hairpin library that bind with high affinity and selectivity to six aminoglycoside derivatives. The selected RNA motifs were then analyzed using structure-activity relationships through sequencing (StARTS), a statistical approach that defines the privileged RNA motif space that binds a small molecule. StARTS allowed for the facile annotation of the selected RNA motif-aminoglycoside interactions in terms of affinity and selectivity. The interactions selected by 2DCS generally have nanomolar affinities, which is higher affinity than the binding of aminoglycosides to a mimic of their therapeutic target, the bacterial rRNA A-site. Copyright © 2013 Elsevier Ltd. All rights reserved.
Kawano, Yasuhiro; Neeley, Shane; Adachi, Kei; Nakai, Hiroyuki
2013-01-01
Overlapping open reading frames (ORFs) in viral genomes undergo co-evolution; however, how individual amino acids coded by overlapping ORFs are structurally, functionally, and co-evolutionarily constrained remains difficult to address by conventional homologous sequence alignment approaches. We report here a new experimental and computational evolution-based methodology to address this question and report its preliminary application to elucidating a mode of co-evolution of the frame-shifted overlapping ORFs in the adeno-associated virus (AAV) serotype 2 viral genome. These ORFs encode both capsid VP protein and non-structural assembly-activating protein (AAP). To show proof of principle of the new method, we focused on the evolutionarily conserved QVKEVTQ and KSKRSRR motifs, a pair of overlapping heptapeptides in VP and AAP, respectively. In the new method, we first identified a large number of capsid-forming VP3 mutants and functionally competent AAP mutants of these motifs from mutant libraries by experimental directed evolution under no co-evolutionary constraints. We used Illumina sequencing to obtain a large dataset and then statistically assessed the viability of VP and AAP heptapeptide mutants. The obtained heptapeptide information was then integrated into an evolutionary algorithm, with which VP and AAP were co-evolved from random or native nucleotide sequences in silico. As a result, we demonstrate that these two heptapeptide motifs could exhibit high degeneracy if coded by separate nucleotide sequences, and elucidate how overlap-evoked co-evolutionary constraints play a role in making the VP and AAP heptapeptide sequences into the present shape. Specifically, we demonstrate that two valine (V) residues and β-strand propensity in QVKEVTQ are structurally important, the strongly negative and hydrophilic nature of KSKRSRR is functionally important, and overlap-evoked co-evolution imposes strong constraints on serine (S) residues in KSKRSRR, despite high degeneracy of the motifs in the absence of co-evolutionary constraints.
Rutsdottir, Gudrun; Härmark, Johan; Weide, Yoran; Hebert, Hans; Rasmussen, Morten I; Wernersson, Sven; Respondek, Michal; Akke, Mikael; Højrup, Peter; Koeck, Philip J B; Söderberg, Christopher A G; Emanuelsson, Cecilia
2017-05-12
Small heat-shock proteins (sHsps) prevent aggregation of thermosensitive client proteins in a first line of defense against cellular stress. The mechanisms by which they perform this function have been hard to define due to limited structural information; currently, there is only one high-resolution structure of a plant sHsp published, that of the cytosolic Hsp16.9. We took interest in Hsp21, a chloroplast-localized sHsp crucial for plant stress resistance, which has even longer N-terminal arms than Hsp16.9, with a functionally important and conserved methionine-rich motif. To provide a framework for investigating structure-function relationships of Hsp21 and understanding these sequence variations, we developed a structural model of Hsp21 based on homology modeling, cryo-EM, cross-linking mass spectrometry, NMR, and small-angle X-ray scattering. Our data suggest a dodecameric arrangement of two trimer-of-dimer discs stabilized by the C-terminal tails, possibly through tail-to-tail interactions between the discs, mediated through extended I X V X I motifs. Our model further suggests that six N-terminal arms are located on the outside of the dodecamer, accessible for interaction with client proteins, and distinct from previous undefined or inwardly facing arms. To test the importance of the I X V X I motif, we created the point mutant V181A, which, as expected, disrupts the Hsp21 dodecamer and decreases chaperone activity. Finally, our data emphasize that sHsp chaperone efficiency depends on oligomerization and that client interactions can occur both with and without oligomer dissociation. These results provide a generalizable workflow to explore sHsps, expand our understanding of sHsp structural motifs, and provide a testable Hsp21 structure model to inform future investigations. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Barber, Claire; Netherton, Chris; Goatley, Lynnett
The African swine fever virus DP71L protein recruits protein phosphatase 1 (PP1) to dephosphorylate the translation initiation factor 2α (eIF2α) and avoid shut-off of global protein synthesis and downstream activation of the pro-apoptotic factor CHOP. Residues V16 and F18A were critical for binding of DP71L to PP1. Mutation of this PP1 binding motif or deletion of residues between 52 and 66 reduced the ability of DP71L to cause dephosphorylation of eIF2α and inhibit CHOP induction. The residues LSAVL, between 57 and 61, were also required. PP1 was co-precipitated with wild type DP71L and the mutant lacking residues 52- 66 ormore » the LSAVL motif, but not with the PP1 binding motif mutant. The residues in the LSAVL motif play a critical role in DP71L function but do not interfere with binding to PP1. Instead we propose these residues are important for DP71L binding to eIF2α. - Highlights: •The African swine fever virus DP71L protein recruits protein phosphatase 1 (PP1) to dephosphorylate translation initiation factor eIF2α (eIF2α). •The residues V{sup 16}, F{sup 18} of DP71L are required for binding to the α, β and γ isoforms of PP1 and for DP71L function. •The sequence LSAVL downstream from the PP1 binding site (residues 57–61) are also important for DP71L function. •DP71L mutants of the LSAVL sequence retain ability to co-precipitate with PP1 showing these sequences have a different role to PP1 binding.« less
MOHANTY, BIJAYALAXMI; KRISHNAN, S. P. T.; SWARUP, SANJAY; BAJIC, VLADIMIR B.
2005-01-01
• Background and Aims Plants can suffer from oxygen limitation during flooding or more complete submergence and may therefore switch from Kreb's cycle respiration to fermentation in association with the expression of anaerobically inducible genes coding for enzymes involved in glycolysis and fermentation. The aim of this study was to clarify mechanisms of transcriptional regulation of these anaerobic genes by identifying motifs shared by their promoter regions. • Methods Statistically significant motifs were detected by an in silico method from 13 promoters of anaerobic genes. The selected motifs were common for the majority of analysed promoters. Their significance was evaluated by searching for their presence in transcription factor-binding site databases (TRANSFAC, PlantCARE and PLACE). Using several negative control data sets, it was tested whether the motifs found were specific to the anaerobic group. • Key Results Previously, anaerobic response elements have been identified in maize (Zea mays) and arabidopsis (Arabidopsis thaliana) genes. Known functional motifs were detected, such as GT and GC motifs, but also other motifs shared by most of the genes examined. Five motifs detected have not been found in plants hitherto but are present in the promoters of animal genes with various functions. The consensus sequences of these novel motifs are 5′-AAACAAA-3′, 5′-AGCAGC-3′, 5′-TCATCAC-3′, 5′-GTTT(A/C/T)GCAA-3′ and 5′-TTCCCTGTT-3′. • Conclusions It is believed that the promoter motifs identified could be functional by conferring anaerobic sensitivity to the genes that possess them. This proposal now requires experimental verification. PMID:16027132
McDonald, Caleb B.; McIntosh, Samantha K. N.; Mikles, David C.; Bhat, Vikas; Deegan, Brian J.; Seldeen, Kenneth L.; Saeed, Ali M.; Buffa, Laura; Sudol, Marius; Nawaz, Zafar; Farooq, Amjad
2011-01-01
YAP2 transcriptional regulator mediates a plethora of cellular functions, including the newly discovered Hippo tumor suppressor pathway, by virtue of its ability to recognize WBP1 and WBP2 signaling adaptors among a wide variety of other ligands. Herein, using isothermal titration calorimery (ITC) and circular dichroism (CD) in combination with molecular modeling (MM) and molecular dynamics (MD), we provide evidence that the WW1 and WW2 domains of YAP2 recognize various PPXY motifs within WBP1 and WBP2 in a highly promiscuous and subtle manner. Thus, although both WW domains strictly require the integrity of the consensus PPXY sequence, non-consensus residues within and flanking this motif are not critical for high-affinity binding, implying that they most likely play a role in stabilizing the polyproline type II (PPII) helical conformation of the PPXY ligands. Of particular interest is the observation that both WW domains bind to a PPXYXG motif with highest affinity, implicating a preference for a non-bulky and flexible glycine one-residue C-terminal to the consensus tyrosine. Importantly, a large set of residues within both WW domains and the PPXY motifs appear to undergo rapid fluctuations on a nanosecond time scale, arguing that WW-ligand interactions are highly dynamic and that such conformational entropy may be an integral part of the reversible and temporal nature of cellular signaling cascades. Collectively, our study sheds light on the molecular determinants of a key WW-ligand interaction pertinent to cellular functions in health and disease. PMID:21981024
McDonald, Caleb B; McIntosh, Samantha K N; Mikles, David C; Bhat, Vikas; Deegan, Brian J; Seldeen, Kenneth L; Saeed, Ali M; Buffa, Laura; Sudol, Marius; Nawaz, Zafar; Farooq, Amjad
2011-11-08
The YAP2 transcriptional regulator mediates a plethora of cellular functions, including the newly discovered Hippo tumor suppressor pathway, by virtue of its ability to recognize WBP1 and WBP2 signaling adaptors among a wide variety of other ligands. Herein, using isothermal titration calorimery and circular dichroism in combination with molecular modeling and molecular dynamics, we provide evidence that the WW1 and WW2 domains of YAP2 recognize various PPXY motifs within WBP1 and WBP2 in a highly promiscuous and subtle manner. Thus, although both WW domains strictly require the integrity of the consensus PPXY sequence, nonconsensus residues within and flanking this motif are not critical for high-affinity binding, implying that they most likely play a role in stabilizing the polyproline type II helical conformation of the PPXY ligands. Of particular interest is the observation that both WW domains bind to a PPXYXG motif with highest affinity, implicating a preference for a nonbulky and flexible glycine one residue to the C-terminal side of the consensus tyrosine. Importantly, a large set of residues within both WW domains and the PPXY motifs appear to undergo rapid fluctuations on a nanosecond time scale, suggesting that WW-ligand interactions are highly dynamic and that such conformational entropy may be an integral part of the reversible and temporal nature of cellular signaling cascades. Collectively, our study sheds light on the molecular determinants of a key WW-ligand interaction pertinent to cellular functions in health and disease.
Lock, Antonia; Forfar, Rachel; Weston, Cathryn; Bowsher, Leo; Upton, Graham J G; Reynolds, Christopher A; Ladds, Graham; Dixon, Ann M
2014-12-01
G protein-coupled receptors (GPCRs) are the largest family of cell-surface receptors in mammals and facilitate a range of physiological responses triggered by a variety of ligands. GPCRs were thought to function as monomers, however it is now accepted that GPCR homo- and hetero-oligomers also exist and influence receptor properties. The Schizosaccharomyces pombe GPCR Mam2 is a pheromone-sensing receptor involved in mating and has previously been shown to form oligomers in vivo. The first transmembrane domain (TMD) of Mam2 contains a small-XXX-small motif, overrepresented in membrane proteins and well-known for promoting helix-helix interactions. An ortholog of Mam2 in Saccharomyces cerevisiae, Ste2, contains an analogous small-XXX-small motif which has been shown to contribute to receptor homo-oligomerization, localization and function. Here we have used experimental and computational techniques to characterize the role of the small-XXX-small motif in function and assembly of Mam2 for the first time. We find that disruption of the motif via mutagenesis leads to reduction of Mam2 TMD1 homo-oligomerization and pheromone-responsive cellular signaling of the full-length protein. It also impairs correct targeting to the plasma membrane. Mutation of the analogous motif in Ste2 yielded similar results, suggesting a conserved mechanism for assembly. Using co-expression of the two fungal receptors in conjunction with computational models, we demonstrate a functional change in G protein specificity and propose that this is brought about through hetero-dimeric interactions of Mam2 with Ste2 via the complementary small-XXX-small motifs. This highlights the potential of these motifs to affect a range of properties that can be investigated in other GPCRs. Copyright © 2014. Published by Elsevier B.V.
Visualizing frequent patterns in large multivariate time series
NASA Astrophysics Data System (ADS)
Hao, M.; Marwah, M.; Janetzko, H.; Sharma, R.; Keim, D. A.; Dayal, U.; Patnaik, D.; Ramakrishnan, N.
2011-01-01
The detection of previously unknown, frequently occurring patterns in time series, often called motifs, has been recognized as an important task. However, it is difficult to discover and visualize these motifs as their numbers increase, especially in large multivariate time series. To find frequent motifs, we use several temporal data mining and event encoding techniques to cluster and convert a multivariate time series to a sequence of events. Then we quantify the efficiency of the discovered motifs by linking them with a performance metric. To visualize frequent patterns in a large time series with potentially hundreds of nested motifs on a single display, we introduce three novel visual analytics methods: (1) motif layout, using colored rectangles for visualizing the occurrences and hierarchical relationships of motifs in a multivariate time series, (2) motif distortion, for enlarging or shrinking motifs as appropriate for easy analysis and (3) motif merging, to combine a number of identical adjacent motif instances without cluttering the display. Analysts can interactively optimize the degree of distortion and merging to get the best possible view. A specific motif (e.g., the most efficient or least efficient motif) can be quickly detected from a large time series for further investigation. We have applied these methods to two real-world data sets: data center cooling and oil well production. The results provide important new insights into the recurring patterns.
2012-01-01
Background Discovery of functionally significant short, statistically overrepresented subsequence patterns (motifs) in a set of sequences is a challenging problem in bioinformatics. Oftentimes, not all sequences in the set contain a motif. These non-motif-containing sequences complicate the algorithmic discovery of motifs. Filtering the non-motif-containing sequences from the larger set of sequences while simultaneously determining the identity of the motif is, therefore, desirable and a non-trivial problem in motif discovery research. Results We describe MotifCatcher, a framework that extends the sensitivity of existing motif-finding tools by employing random sampling to effectively remove non-motif-containing sequences from the motif search. We developed two implementations of our algorithm; each built around a commonly used motif-finding tool, and applied our algorithm to three diverse chromatin immunoprecipitation (ChIP) data sets. In each case, the motif finder with the MotifCatcher extension demonstrated improved sensitivity over the motif finder alone. Our approach organizes candidate functionally significant discovered motifs into a tree, which allowed us to make additional insights. In all cases, we were able to support our findings with experimental work from the literature. Conclusions Our framework demonstrates that additional processing at the sequence entry level can significantly improve the performance of existing motif-finding tools. For each biological data set tested, we were able to propose novel biological hypotheses supported by experimental work from the literature. Specifically, in Escherichia coli, we suggested binding site motifs for 6 non-traditional LexA protein binding sites; in Saccharomyces cerevisiae, we hypothesize 2 disparate mechanisms for novel binding sites of the Cse4p protein; and in Halobacterium sp. NRC-1, we discoverd subtle differences in a general transcription factor (GTF) binding site motif across several data sets. We suggest that small differences in our discovered motif could confer specificity for one or more homologous GTF proteins. We offer a free implementation of the MotifCatcher software package at http://www.bme.ucdavis.edu/facciotti/resources_data/software/. PMID:23181585
Oropeza-Aburto, Araceli; Cruz-Ramírez, Alfredo; Acevedo-Hernández, Gustavo J.; Pérez-Torres, Claudia-Anahí; Caballero-Pérez, Juan; Herrera-Estrella, Luis
2012-01-01
Plants have evolved a plethora of responses to cope with phosphate (Pi) deficiency, including the transcriptional activation of a large set of genes. Among Pi-responsive genes, the expression of the Arabidopsis phospholipase DZ2 (PLDZ2) is activated to participate in the degradation of phospholipids in roots in order to release Pi to support other cellular activities. A deletion analysis was performed to identify the regions determining the strength, tissue-specific expression, and Pi responsiveness of this regulatory region. This study also reports the identification and characterization of a transcriptional enhancer element that is present in the PLDZ2 promoter and able to confer Pi responsiveness to a minimal, inactive 35S promoter. This enhancer also shares the cytokinin and sucrose responsive properties observed for the intact PLDZ2 promoter. The EZ2 element contains two P1BS motifs, each of which is the DNA binding site of transcription factor PHR1. Mutation analysis showed that the P1BS motifs present in EZ2 are necessary but not sufficient for the enhancer function, revealing the importance of adjacent sequences. The structural organization of EZ2 is conserved in the orthologous genes of at least eight families of rosids, suggesting that architectural features such as the distance between the two P1BS motifs are also important for the regulatory properties of this enhancer element. PMID:22210906
Comparative and Evolutionary Analysis of the Interleukin 17 Gene Family in Invertebrates
Huang, Xian-De; Zhang, Hua; He, Mao-Xian
2015-01-01
Interleukin 17 (IL-17) is an important pro-inflammatory cytokine and plays critical roles in the immune response to pathogens and in the pathogenesis of inflammatory and autoimmune diseases. Despite its important functions, the origin and evolution of IL-17 in animal phyla have not been characterized. As determined in this study, the distribution of the IL-17 family among 10 invertebrate species and 7 vertebrate species suggests that the IL-17 gene may have originated from Nematoda but is absent from Saccoglossus kowalevskii (Hemichordata) and Insecta. Moreover, the gene number, protein length and domain number of IL-17 differ widely. A comparison of IL-17-containing domains and conserved motifs indicated somewhat low amino acid sequence similarity but high conservation at the motif level, although some motifs were lost in certain species. The third disulfide bond for the cystine knot fold is formed by two cysteine residues in invertebrates, but these have been replaced by two serine residues in Chordata and vertebrates. One third of invertebrate IL-17 proteins were found to have no predicted signal peptide. Furthermore, an analysis of phylogenetic trees and exon–intron structures indicated that the IL-17 family lacks conservation and displays high divergence. These results suggest that invertebrate IL-17 proteins have undergone complex differentiation and that their members may have developed novel functions during evolution. PMID:26218896
Structure-Based Mutational Analysis of the Hepatitis C Virus NS3 Helicase
Tai, Chun-Ling; Pan, Wen-Ching; Liaw, Shwu-Huey; Yang, Ueng-Cheng; Hwang, Lih-Hwa; Chen, Ding-Shinn
2001-01-01
The carboxyl terminus of the hepatitis C virus (HCV) nonstructural protein 3 (NS3) possesses ATP-dependent RNA helicase activity. Based on the conserved sequence motifs and the crystal structures of the helicase domain, 17 mutants of the HCV NS3 helicase were generated. The ATP hydrolysis, RNA binding, and RNA unwinding activities of the mutant proteins were examined in vitro to determine the functional role of the mutated residues. The data revealed that Lys-210 in the Walker A motif and Asp-290, Glu-291, and His-293 in the Walker B motif were crucial to ATPase activity and that Thr-322 and Thr-324 in motif III and Arg-461 in motif VI significantly influenced ATPase activity. When the pairing between His-293 and Gln-460, referred to as gatekeepers, was replaced with the Asp-293/His-460 pair, which makes the NS3 helicase more like the DEAD helicase subgroup, ATPase activity was not restored. It thus indicated that the whole microenvironment surrounding the gatekeepers, rather than the residues per se, was important to the enzymatic activities. Arg-461 and Trp-501 are important residues for RNA binding, while Val-432 may only play a coadjutant role. The data demonstrated that RNA helicase activity was possibly abolished by the loss of ATPase activity or by reduced RNA binding activity. Nevertheless, a low threshold level of ATPase activity was found sufficient for helicase activity. Results in this study provide a valuable reference for efforts under way to develop anti-HCV therapeutic drugs targeting NS3. PMID:11483774
CoSMoS: Conserved Sequence Motif Search in the proteome
Liu, Xiao I; Korde, Neeraj; Jakob, Ursula; Leichert, Lars I
2006-01-01
Background With the ever-increasing number of gene sequences in the public databases, generating and analyzing multiple sequence alignments becomes increasingly time consuming. Nevertheless it is a task performed on a regular basis by researchers in many labs. Results We have now created a database called CoSMoS to find the occurrences and at the same time evaluate the significance of sequence motifs and amino acids encoded in the whole genome of the model organism Escherichia coli K12. We provide a precomputed set of multiple sequence alignments for each individual E. coli protein with all of its homologues in the RefSeq database. The alignments themselves, information about the occurrence of sequence motifs together with information on the conservation of each of the more than 1.3 million amino acids encoded in the E. coli genome can be accessed via the web interface of CoSMoS. Conclusion CoSMoS is a valuable tool to identify highly conserved sequence motifs, to find regions suitable for mutational studies in functional analyses and to predict important structural features in E. coli proteins. PMID:16433915
SLiMSearch 2.0: biological context for short linear motifs in proteins
Davey, Norman E.; Haslam, Niall J.; Shields, Denis C.
2011-01-01
Short, linear motifs (SLiMs) play a critical role in many biological processes. The SLiMSearch 2.0 (Short, Linear Motif Search) web server allows researchers to identify occurrences of a user-defined SLiM in a proteome, using conservation and protein disorder context statistics to rank occurrences. User-friendly output and visualizations of motif context allow the user to quickly gain insight into the validity of a putatively functional motif occurrence. For each motif occurrence, overlapping UniProt features and annotated SLiMs are displayed. Visualization also includes annotated multiple sequence alignments surrounding each occurrence, showing conservation and protein disorder statistics in addition to known and predicted SLiMs, protein domains and known post-translational modifications. In addition, enrichment of Gene Ontology terms and protein interaction partners are provided as indicators of possible motif function. All web server results are available for download. Users can search motifs against the human proteome or a subset thereof defined by Uniprot accession numbers or GO term. The SLiMSearch server is available at: http://bioware.ucd.ie/slimsearch2.html. PMID:21622654
Gálvez, José Héctor; Tai, Helen H; Lagüe, Martin; Zebarth, Bernie J; Strömvik, Martina V
2016-05-19
Nitrogen (N) is the most important nutrient for the growth of potato (Solanum tuberosum L.). Foliar gene expression in potato plants with and without N supplementation at 180 kg N ha(-1) was compared at mid-season. Genes with consistent differences in foliar expression due to N supplementation over three cultivars and two developmental time points were examined. In total, thirty genes were found to be over-expressed and nine genes were found to be under-expressed with supplemented N. Functional relationships between over-expressed genes were found. The main metabolic pathway represented among differentially expressed genes was amino acid metabolism. The 1000 bp upstream flanking regions of the differentially expressed genes were analysed and nine overrepresented motifs were found using three motif discovery algorithms (Seeder, Weeder and MEME). These results point to coordinated gene regulation at the transcriptional level controlling steady state potato responses to N sufficiency.
Gálvez, José Héctor; Tai, Helen H.; Lagüe, Martin; Zebarth, Bernie J.; Strömvik, Martina V.
2016-01-01
Nitrogen (N) is the most important nutrient for the growth of potato (Solanum tuberosum L.). Foliar gene expression in potato plants with and without N supplementation at 180 kg N ha−1 was compared at mid-season. Genes with consistent differences in foliar expression due to N supplementation over three cultivars and two developmental time points were examined. In total, thirty genes were found to be over-expressed and nine genes were found to be under-expressed with supplemented N. Functional relationships between over-expressed genes were found. The main metabolic pathway represented among differentially expressed genes was amino acid metabolism. The 1000 bp upstream flanking regions of the differentially expressed genes were analysed and nine overrepresented motifs were found using three motif discovery algorithms (Seeder, Weeder and MEME). These results point to coordinated gene regulation at the transcriptional level controlling steady state potato responses to N sufficiency. PMID:27193058
Gordon, Kacy L.; Arthur, Robert K.; Ruvinsky, Ilya
2015-01-01
Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements. PMID:26020930
Fault tolerance in protein interaction networks: stable bipartite subgraphs and redundant pathways.
Brady, Arthur; Maxwell, Kyle; Daniels, Noah; Cowen, Lenore J
2009-01-01
As increasing amounts of high-throughput data for the yeast interactome become available, more system-wide properties are uncovered. One interesting question concerns the fault tolerance of protein interaction networks: whether there exist alternative pathways that can perform some required function if a gene essential to the main mechanism is defective, absent or suppressed. A signature pattern for redundant pathways is the BPM (between-pathway model) motif, introduced by Kelley and Ideker. Past methods proposed to search the yeast interactome for BPM motifs have had several important limitations. First, they have been driven heuristically by local greedy searches, which can lead to the inclusion of extra genes that may not belong in the motif; second, they have been validated solely by functional coherence of the putative pathways using GO enrichment, making it difficult to evaluate putative BPMs in the absence of already known biological annotation. We introduce stable bipartite subgraphs, and show they form a clean and efficient way of generating meaningful BPMs which naturally discard extra genes included by local greedy methods. We show by GO enrichment measures that our BPM set outperforms previous work, covering more known complexes and functional pathways. Perhaps most importantly, since our BPMs are initially generated by examining the genetic-interaction network only, the location of edges in the protein-protein physical interaction network can then be used to statistically validate each candidate BPM, even with sparse GO annotation (or none at all). We uncover some interesting biological examples of previously unknown putative redundant pathways in such areas as vesicle-mediated transport and DNA repair.
Fault Tolerance in Protein Interaction Networks: Stable Bipartite Subgraphs and Redundant Pathways
Brady, Arthur; Maxwell, Kyle; Daniels, Noah; Cowen, Lenore J.
2009-01-01
As increasing amounts of high-throughput data for the yeast interactome become available, more system-wide properties are uncovered. One interesting question concerns the fault tolerance of protein interaction networks: whether there exist alternative pathways that can perform some required function if a gene essential to the main mechanism is defective, absent or suppressed. A signature pattern for redundant pathways is the BPM (between-pathway model) motif, introduced by Kelley and Ideker. Past methods proposed to search the yeast interactome for BPM motifs have had several important limitations. First, they have been driven heuristically by local greedy searches, which can lead to the inclusion of extra genes that may not belong in the motif; second, they have been validated solely by functional coherence of the putative pathways using GO enrichment, making it difficult to evaluate putative BPMs in the absence of already known biological annotation. We introduce stable bipartite subgraphs, and show they form a clean and efficient way of generating meaningful BPMs which naturally discard extra genes included by local greedy methods. We show by GO enrichment measures that our BPM set outperforms previous work, covering more known complexes and functional pathways. Perhaps most importantly, since our BPMs are initially generated by examining the genetic-interaction network only, the location of edges in the protein-protein physical interaction network can then be used to statistically validate each candidate BPM, even with sparse GO annotation (or none at all). We uncover some interesting biological examples of previously unknown putative redundant pathways in such areas as vesicle-mediated transport and DNA repair. PMID:19399174
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shaw, Debra J.; Morse, Robert; Todd, Adrian G.
The Ewing Sarcoma (EWS) protein is a ubiquitously expressed RNA processing factor that localises predominantly to the nucleus. However, the mechanism through which EWS enters the nucleus remains unclear, with differing reports identifying three separate import signals within the EWS protein. Here we have utilized a panel of truncated EWS proteins to clarify the reported nuclear localisation signals. We describe three C-terminal domains that are important for efficient EWS nuclear localization: (1) the third RGG-motif; (2) the last 10 amino acids (known as the PY-import motif); and (3) the zinc-finger motif. Although these three domains are involved in nuclear import,more » they are not independently capable of driving the efficient import of a GFP-moiety. However, collectively they form a complex tripartite signal that efficiently drives GFP-import into the nucleus. This study helps clarify the EWS import signal, and the identification of the involvement of both the RGG- and zinc-finger motifs has wide reaching implications.« less
Di Scala, Coralie; Baier, Carlos J; Evans, Luke S; Williamson, Philip T F; Fantini, Jacques; Barrantes, Francisco J
2017-01-01
Cholesterol is a ubiquitous neutral lipid, which finely tunes the activity of a wide range of membrane proteins, including neurotransmitter and hormone receptors and ion channels. Given the scarcity of available X-ray crystallographic structures and the even fewer in which cholesterol sites have been directly visualized, application of in silico computational methods remains a valid alternative for the detection and thermodynamic characterization of cholesterol-specific sites in functionally important membrane proteins. The membrane-embedded segments of the paradigm neurotransmitter receptor for acetylcholine display a series of cholesterol consensus domains (which we have coined "CARC"). The CARC motif exhibits a preference for the outer membrane leaflet and its mirror motif, CRAC, for the inner one. Some membrane proteins possess the double CARC-CRAC sequences within the same transmembrane domain. In addition to in silico molecular modeling, the affinity, concentration dependence, and specificity of the cholesterol-recognition motif-protein interaction have recently found experimental validation in other biophysical approaches like monolayer techniques and nuclear magnetic resonance spectroscopy. From the combined studies, it becomes apparent that the CARC motif is now more firmly established as a high-affinity cholesterol-binding domain for membrane-bound receptors and remarkably conserved along phylogenetic evolution. © 2017 Elsevier Inc. All rights reserved.
Ca2+-binding Motif of βγ-Crystallins*
Srivastava, Shanti Swaroop; Mishra, Amita; Krishnan, Bal; Sharma, Yogendra
2014-01-01
βγ-Crystallin-type double clamp (N/D)(N/D)XX(S/T)S motif is an established but sparsely investigated motif for Ca2+ binding. A βγ-crystallin domain is formed of two Greek key motifs, accommodating two Ca2+-binding sites. βγ-Crystallins make a separate class of Ca2+-binding proteins (CaBP), apparently a major group of CaBP in bacteria. Paralleling the diversity in βγ-crystallin domains, these motifs also show great diversity, both in structure and in function. Although the expression of some of them has been associated with stress, virulence, and adhesion, the functional implications of Ca2+ binding to βγ-crystallins in mediating biological processes are yet to be elucidated. PMID:24567326
Xiong, Wangdan; Fu, Jianyu; Köllner, Tobias G; Chen, Xinlu; Jia, Qidong; Guo, Haobo; Qian, Ping; Guo, Hong; Wu, Guojiang; Chen, Feng
2018-05-01
Microbial terpene synthase-like (MTPSL) genes are a type of terpene synthase genes only recently identified in plants. In contrast to typical plant terpene synthase genes, which are ubiquitous in land plants, MTPSL genes appear to occur only in nonseed plants. Our knowledge of catalytic functions of MTPSLs is very limited. Here we report biochemical characterization of the enzymes encoded by MTPSL genes from two closely related species of hornworts, Anthoceros punctatus and Anthoceros agrestis. Seven full-length MTPSL genes were identified in A. punctatus (ApMTPSL1-7) based on the analysis of its genome sequence. Using homology-based cloning, the apparent orthologs for six of the ApMTPSL genes, except ApMTPSL2, were cloned from A. agrestis. They were designated AaMTPSL1, 3-7. The coding sequences for each of the 13 Anthoceros MTPSL genes were cloned into a protein expression vector. Escherichia coli-expressed recombinant MTPSLs from hornworts were assayed for terpene synthase activities. Six ApMTPSLs and five AaMTPSLs, except for ApMTPSL5 and AaMTPSL5, showed catalytic activities with one or more isoprenyl diphosphate substrates. All functional MTPSLs exhibited sesquiterpene synthase activities. In contrast, only ApMTPSL7 and AaMTPSL7 showed monoterpene synthase activity and only ApMTPSL2, ApMTPSL6 and AaMTPSL6 showed diterpene synthase activity. Most MTPSLs from Anthoceros contain uncanonical aspartate-rich motif in the form of either 'DDxxxD' or 'DDxxx'. Homology-based structural modeling analysis of ApMTPSL1 and ApMTPSL7, which contain 'DDxxxD' and 'DDxxx' motif, respectively, showed that 'DDxxxD' and 'DDxxx' motifs are localized in the similar positions as the canonical 'DDxxD' motif in known terpene synthases. To further understand the role of individual aspartate residues in the motifs, ApMTPSL1 and ApMTPSL7 were selected as two representatives for site-directed mutagenesis studies. No activities were detected when any of the conserved aspartic acid was mutated into alanine. This study provides new information about the catalytic functions of MTPSLs and the functionality of their uncanonical aspartate-rich motifs, and builds a knowledge base for studying the biological importance of MTPSL genes and their terpene products in nonseed plants. Copyright © 2018 Elsevier Ltd. All rights reserved.
ProMotE: an efficient algorithm for counting independent motifs in uncertain network topologies.
Ren, Yuanfang; Sarkar, Aisharjya; Kahveci, Tamer
2018-06-26
Identifying motifs in biological networks is essential in uncovering key functions served by these networks. Finding non-overlapping motif instances is however a computationally challenging task. The fact that biological interactions are uncertain events further complicates the problem, as it makes the existence of an embedding of a given motif an uncertain event as well. In this paper, we develop a novel method, ProMotE (Probabilistic Motif Embedding), to count non-overlapping embeddings of a given motif in probabilistic networks. We utilize a polynomial model to capture the uncertainty. We develop three strategies to scale our algorithm to large networks. Our experiments demonstrate that our method scales to large networks in practical time with high accuracy where existing methods fail. Moreover, our experiments on cancer and degenerative disease networks show that our method helps in uncovering key functional characteristics of biological networks.
I-motif DNA structures are formed in the nuclei of human cells
NASA Astrophysics Data System (ADS)
Zeraati, Mahdi; Langley, David B.; Schofield, Peter; Moye, Aaron L.; Rouet, Romain; Hughes, William E.; Bryan, Tracy M.; Dinger, Marcel E.; Christ, Daniel
2018-06-01
Human genome function is underpinned by the primary storage of genetic information in canonical B-form DNA, with a second layer of DNA structure providing regulatory control. I-motif structures are thought to form in cytosine-rich regions of the genome and to have regulatory functions; however, in vivo evidence for the existence of such structures has so far remained elusive. Here we report the generation and characterization of an antibody fragment (iMab) that recognizes i-motif structures with high selectivity and affinity, enabling the detection of i-motifs in the nuclei of human cells. We demonstrate that the in vivo formation of such structures is cell-cycle and pH dependent. Furthermore, we provide evidence that i-motif structures are formed in regulatory regions of the human genome, including promoters and telomeric regions. Our results support the notion that i-motif structures provide key regulatory roles in the genome.
Prieto, Gorka; Fullaondo, Asier; Rodríguez, Jose A.
2016-01-01
Large-scale sequencing projects are uncovering a growing number of missense mutations in human tumors. Understanding the phenotypic consequences of these alterations represents a formidable challenge. In silico prediction of functionally relevant amino acid motifs disrupted by cancer mutations could provide insight into the potential impact of a mutation, and guide functional tests. We have previously described Wregex, a tool for the identification of potential functional motifs, such as nuclear export signals (NESs), in proteins. Here, we present an improved version that allows motif prediction to be combined with data from large repositories, such as the Catalogue of Somatic Mutations in Cancer (COSMIC), and to be applied to a whole proteome scale. As an example, we have searched the human proteome for candidate NES motifs that could be altered by cancer-related mutations included in the COSMIC database. A subset of the candidate NESs identified was experimentally tested using an in vivo nuclear export assay. A significant proportion of the selected motifs exhibited nuclear export activity, which was abrogated by the COSMIC mutations. In addition, our search identified a cancer mutation that inactivates the NES of the human deubiquitinase USP21, and leads to the aberrant accumulation of this protein in the nucleus. PMID:27174732
Role of sequence encoded κB DNA geometry in gene regulation by Dorsal
Mrinal, Nirotpal; Tomar, Archana; Nagaraju, Javaregowda
2011-01-01
Many proteins of the Rel family can act as both transcriptional activators and repressors. However, mechanism that discerns the ‘activator/repressor’ functions of Rel-proteins such as Dorsal (Drosophila homologue of mammalian NFκB) is not understood. Using genomic, biophysical and biochemical approaches, we demonstrate that the underlying principle of this functional specificity lies in the ‘sequence-encoded structure’ of the κB-DNA. We show that Dorsal-binding motifs exist in distinct activator and repressor conformations. Molecular dynamics of DNA-Dorsal complexes revealed that repressor κB-motifs typically have A-tract and flexible conformation that facilitates interaction with co-repressors. Deformable structure of repressor motifs, is due to changes in the hydrogen bonding in A:T pair in the ‘A-tract’ core. The sixth nucleotide in the nonameric κB-motif, ‘A’ (A6) in the repressor motifs and ‘T’ (T6) in the activator motifs, is critical to confer this functional specificity as A6 → T6 mutation transformed flexible repressor conformation into a rigid activator conformation. These results highlight that ‘sequence encoded κB DNA-geometry’ regulates gene expression by exerting allosteric effect on binding of Rel proteins which in turn regulates interaction with co-regulators. Further, we identified and characterized putative repressor motifs in Dl-target genes, which can potentially aid in functional annotation of Dorsal gene regulatory network. PMID:21890896
La Sala, Giuseppina; Riccardi, Laura; Gaspari, Roberto; Cavalli, Andrea; Hantschel, Oliver; De Vivo, Marco
2016-11-08
A number of structural factors modulate the activity of Abelson (Abl) tyrosine kinase, whose deregulation is often related to oncogenic processes. First, only the open conformation of the Abl kinase domain's activation loop (A-loop) favors ATP binding to the catalytic cleft. In this regard, the trans-autophosphorylation of the Y412 residue, which is located along the A-loop, favors the stability of the open conformation, in turn enhancing Abl activity. Another key factor for full Abl activity is the formation of active conformations of the catalytic DFG motif in the Abl kinase domain. Furthermore, binding of the SH2 domain to the N-lobe of the Abl kinase was recently demonstrated to have a long-range allosteric effect on the stabilization of the A-loop open state. Intriguingly, these distinct structural factors imply a complex signal transmission network for controlling the A-loop's flexibility and conformational preference for optimal Abl function. However, the exact dynamical features of this signal transmission network structure remain unclear. Here, we report on microsecond-long molecular dynamics coupled with enhanced sampling simulations of multiple Abl model systems, in the presence or absence of the SH2 domain and with the DFG motif flipped in two ways (in or out conformation). Through comparative analysis, our simulations augment the interpretation of the existing Abl experimental data, revealing a dynamical network of interactions that interconnect SH2 domain binding with A-loop plasticity and Y412 autophosphorylation in Abl. This signaling network engages the DFG motif and, importantly, other conserved structural elements of the kinase domain, namely, the EPK-ELK H-bond network and the HRD motif. Our results show that the signal propagation for modulating the A-loop spatial localization is highly dependent on the HRD motif conformation, which thus acts as the central hub of this (allosteric) signaling network controlling Abl activation and function.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wei, Zhuang; Laboratory of System Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031; Zou, Xinle
2015-01-16
Highlight: • The N-terminal leucine-zipper motif in PTRF/cavin-1 determines caveolar association. • Different cellular localization of PTRF/cavin-1 influences its serine 389 and 391 phosphorylation state. • PTRF/cavin-1 regulates cell motility via its caveolar association. - Abstract: PTRF/cavin-1 is a protein of two lives. Its reported functions in ribosomal RNA synthesis and in caveolae formation happen in two different cellular locations: nucleus vs. plasma membrane. Here, we identified that the N-terminal leucine-zipper motif in PTRF/cavin-1 was essential for the protein to be associated with caveolae in plasma membrane. It could counteract the effect of nuclear localization sequence in the molecule (AAmore » 235–251). Deletion of this leucine-zipper motif from PTRF/cavin-1 caused the mutant to be exclusively localized in nuclei. The fusion of this leucine-zipper motif with histone 2A, which is a nuclear protein, could induce the fusion protein to be exported from nucleus. Cell migration was greatly inhibited in PTRF/cavin-1{sup −/−} mouse embryonic fibroblasts (MEFs). The inhibited cell motility could only be rescued by exogenous cavin-1 but not the leucine-zipper motif deleted cavin-1 mutant. Plasma membrane dynamics is an important factor in cell motility control. Our results suggested that the membrane dynamics in cell migration is affected by caveolae associated PTRF/cavin-1.« less
OSR1 regulates a subset of inward rectifier potassium channels via a binding motif variant.
Taylor, Clinton A; An, Sung-Wan; Kankanamalage, Sachith Gallolu; Stippec, Steve; Earnest, Svetlana; Trivedi, Ashesh T; Yang, Jonathan Zijiang; Mirzaei, Hamid; Huang, Chou-Long; Cobb, Melanie H
2018-04-10
The with-no-lysine (K) (WNK) signaling pathway to STE20/SPS1-related proline- and alanine-rich kinase (SPAK) and oxidative stress-responsive 1 (OSR1) kinase is an important mediator of cell volume and ion transport. SPAK and OSR1 associate with upstream kinases WNK 1-4, substrates, and other proteins through their C-terminal domains which interact with linear R-F-x-V/I sequence motifs. In this study we find that SPAK and OSR1 also interact with similar affinity with a motif variant, R-x-F-x-V/I. Eight of 16 human inward rectifier K + channels have an R-x-F-x-V motif. We demonstrate that two of these channels, Kir2.1 and Kir2.3, are activated by OSR1, while Kir4.1, which does not contain the motif, is not sensitive to changes in OSR1 or WNK activity. Mutation of the motif prevents activation of Kir2.3 by OSR1. Both siRNA knockdown of OSR1 and chemical inhibition of WNK activity disrupt NaCl-induced plasma membrane localization of Kir2.3. Our results suggest a mechanism by which WNK-OSR1 enhance Kir2.1 and Kir2.3 channel activity by increasing their plasma membrane localization. Regulation of members of the inward rectifier K + channel family adds functional and mechanistic insight into the physiological impact of the WNK pathway.
Takahashi, Kaori; Takabayashi, Atsushi; Tanaka, Ayumi; Tanaka, Ryouichi
2014-01-01
The light-harvesting complex (LHC) constitutes the major light-harvesting antenna of photosynthetic eukaryotes. LHC contains a characteristic sequence motif, termed LHC motif, consisting of 25–30 mostly hydrophobic amino acids. This motif is shared by a number of transmembrane proteins from oxygenic photoautotrophs that are termed light-harvesting-like (LIL) proteins. To gain insights into the functions of LIL proteins and their LHC motifs, we functionally characterized a plant LIL protein, LIL3. This protein has been shown previously to stabilize geranylgeranyl reductase (GGR), a key enzyme in phytol biosynthesis. It is hypothesized that LIL3 functions to anchor GGR to membranes. First, we conjugated the transmembrane domain of LIL3 or that of ascorbate peroxidase to GGR and expressed these chimeric proteins in an Arabidopsis mutant lacking LIL3 protein. As a result, the transgenic plants restored phytol-synthesizing activity. These results indicate that GGR is active as long as it is anchored to membranes, even in the absence of LIL3. Subsequently, we addressed the question why the LHC motif is conserved in the LIL3 sequences. We modified the transmembrane domain of LIL3, which contains the LHC motif, by substituting its conserved amino acids (Glu-171, Asn-174, and Asp-189) with alanine. As a result, the Arabidopsis transgenic plants partly recovered the phytol-biosynthesizing activity. However, in these transgenic plants, the LIL3-GGR complexes were partially dissociated. Collectively, these results indicate that the LHC motif of LIL3 is involved in the complex formation of LIL3 and GGR, which might contribute to the GGR reaction. PMID:24275650
An intact PDZ motif is essential for correct P2Y12 purinoceptor traffic in human platelets.
Nisar, Shaista; Daly, Martina E; Federici, Augusto B; Artoni, Andrea; Mumford, Andrew D; Watson, Stephen P; Mundell, Stuart J
2011-11-17
The platelet P2Y(12) purinoceptor (P2Y(12)R), which plays a crucial role in hemostasis, undergoes internalization and subsequent recycling to maintain receptor responsiveness, processes that are essential for normal platelet function. Here, we observe that P2Y(12)R function is compromised after deletion or mutation of the 4 amino acids at the extreme C-terminus of this receptor (ETPM), a putative postsynaptic density 95/disc large/zonula occludens-1 (PDZ)-binding motif. In cell line models, removal of this sequence or mutation of one of its core residues (P341A), attenuates receptor internalization and receptor recycling back to the membrane, thereby blocking receptor resensitization. The physiologic significance of these findings in the regulation of platelet function is shown by identification of a patient with a heterozygous mutation in the PDZ binding sequence of their P2Y(12)R (P341A) that is associated with reduced expression of the P2Y(12)R on the cell surface. Importantly, platelets from this subject showed significantly compromised P2Y(12)R recycling, emphasizing the importance of the extreme C-terminus of this receptor to ensure correct receptor traffic.
Distribution and diversity of ribosome binding sites in prokaryotic genomes.
Omotajo, Damilola; Tate, Travis; Cho, Hyuk; Choudhary, Madhusudan
2015-08-14
Prokaryotic translation initiation involves the proper docking, anchoring, and accommodation of mRNA to the 30S ribosomal subunit. Three initiation factors (IF1, IF2, and IF3) and some ribosomal proteins mediate the assembly and activation of the translation initiation complex. Although the interaction between Shine-Dalgarno (SD) sequence and its complementary sequence in the 16S rRNA is important in initiation, some genes lacking an SD ribosome binding site (RBS) are still well expressed. The objective of this study is to examine the pattern of distribution and diversity of RBS in fully sequenced bacterial genomes. The following three hypotheses were tested: SD motifs are prevalent in bacterial genomes; all previously identified SD motifs are uniformly distributed across prokaryotes; and genes with specific cluster of orthologous gene (COG) functions differ in their use of SD motifs. Data for 2,458 bacterial genomes, previously generated by Prodigal (PROkaryotic DYnamic programming Gene-finding ALgorithm) and currently available at the National Center for Biotechnology Information (NCBI), were analyzed. Of the total genes examined, ~77.0% use an SD RBS, while ~23.0% have no RBS. Majority of the genes with the most common SD motifs are distributed in a manner that is representative of their abundance for each COG functional category, while motifs 13 (5'-GGA-3'/5'-GAG-3'/5'-AGG-3') and 27 (5'-AGGAGG-3') appear to be predominantly used by genes for information storage and processing, and translation and ribosome biogenesis, respectively. These findings suggest that an SD sequence is not obligatory for translation initiation; instead, other signals, such as the RBS spacer, may have an overarching influence on translation of mRNAs. Subsequent analyses of the 5' secondary structure of these mRNAs may provide further insight into the translation initiation mechanism.
Li, Chuang; Peng, Qiongfang; Wan, Xiao; Sun, Haili; Tang, Jun
2017-10-15
Promyelocytic leukemia protein (PML) nuclear bodies (NBs), which are sub-nuclear protein structures, are involved in a variety of important cellular functions. PML-NBs are assembled by PML isoforms, and contact between small ubiquitin-like modifiers (SUMOs) with the SUMO interaction motif (SIM) are critically involved in this process. PML isoforms contain a common N-terminal region and a variable C-terminus. However, the contribution of the C-terminal regions to PML-NB formation remains poorly defined. Here, using high-resolution microscopy, we show that mutation of the SIM distinctively influences the structure of NBs formed by each individual PML isoform, with that of PML-III and PML-V minimally changed, and PML-I and PML-IV dramatically impaired. We further identify several C-terminal elements that are important in regulating NB structure and provide strong evidence to suggest that the 8b element in PML-IV possesses a strong ability to interact with SUMO-1 and SUMO-2, and critically participates in NB formation. Our findings highlight the importance of PML C-termini in NB assembly and function, and provide molecular insight into the PML-NB assembly of each distinctive isoform. © 2017. Published by The Company of Biologists Ltd.
Wang, Yeda; Li, Zeming; Lu, Yuanan; Hu, Guangfu; Lin, Li; Zeng, Lingbing; Zhou, Yong; Liu, Xueqin
2016-10-09
Tripartite motif-containing protein 32 (TRIM32) belongs to the tripartite motif (TRIM) family, which consists of a large number of proteins containing a RING (Really Interesting New Gene) domain, one or two B-box domains, and coiled coil motif followed by different C-terminal domains. The TRIM family is known to be implicated in multiple cellular functions, including antiviral activity. However, it is presently unknown whether TRIM32 of common carp ( Cyprinus carpio ) has the antiviral effect. In this study, the sequence, expression, and antiviral function of TRIM32 homolog from common carp were analyzed. The full-length coding sequence region of trim32 was cloned from common carp. The results showed that the expression of TRIM32 (mRNA) was highest in the brain, remained stably expressed during embryonic development, and significantly increased following spring viraemia of carp virus (SVCV) infection. Transient overexpression of TRIM32 in affected Epithelioma papulosum cyprinid cells led to significant decrease of SVCV production as compared to the control group. These results suggested a potentially important role of common carp TRIM32 in enhancing host immune response during SVCV infection both in vivo and in vitro.
Wang, Yeda; Li, Zeming; Lu, Yuanan; Hu, Guangfu; Lin, Li; Zeng, Lingbing; Zhou, Yong; Liu, Xueqin
2016-01-01
Tripartite motif-containing protein 32 (TRIM32) belongs to the tripartite motif (TRIM) family, which consists of a large number of proteins containing a RING (Really Interesting New Gene) domain, one or two B-box domains, and coiled coil motif followed by different C-terminal domains. The TRIM family is known to be implicated in multiple cellular functions, including antiviral activity. However, it is presently unknown whether TRIM32 of common carp (Cyprinus carpio) has the antiviral effect. In this study, the sequence, expression, and antiviral function of TRIM32 homolog from common carp were analyzed. The full-length coding sequence region of trim32 was cloned from common carp. The results showed that the expression of TRIM32 (mRNA) was highest in the brain, remained stably expressed during embryonic development, and significantly increased following spring viraemia of carp virus (SVCV) infection. Transient overexpression of TRIM32 in affected Epithelioma papulosum cyprinid cells led to significant decrease of SVCV production as compared to the control group. These results suggested a potentially important role of common carp TRIM32 in enhancing host immune response during SVCV infection both in vivo and in vitro. PMID:27735853
Exploitation of peptide motif sequences and their use in nanobiotechnology.
Shiba, Kiyotaka
2010-08-01
Short amino acid sequences extracted from natural proteins or created using in vitro evolution systems are sometimes associated with particular biological functions. These peptides, called peptide motifs, can serve as functional units for the creation of various tools for nanobiotechnology. In particular, peptide motifs that have the ability to specifically recognize the surfaces of solid materials and to mineralize certain inorganic materials have been linking biological science to material science. Here, I review how these peptide motifs have been isolated from natural proteins or created using in vitro evolution systems, and how they have been used in the nanobiotechnology field. Copyright © 2010 Elsevier Ltd. All rights reserved.
Differences in Krox20-dependent regulation of Hoxa2 and Hoxb2 during hindbrain development.
Maconochie, M K; Nonchev, S; Manzanares, M; Marshall, H; Krumlauf, R
2001-05-15
During hindbrain development, segmental regulation of the paralogous Hoxa2 and Hoxb2 genes in rhombomeres (r) 3 and 5 involves Krox20-dependent enhancers that have been conserved during the duplication of the vertebrate Hox clusters from a common ancestor. Examining these evolutionarily related control regions could provide important insight into the degree to which the basic Krox20-dependent mechanisms, cis-regulatory components, and their organization have been conserved. Toward this goal we have performed a detailed functional analysis of a mouse Hoxa2 enhancer capable of directing reporter expression in r3 and r5. The combined activities of five separate cis-regions, in addition to the conserved Krox20 binding sites, are involved in mediating enhancer function. A CTTT (BoxA) motif adjacent to the Krox20 binding sites is important for r3/r5 activity. The BoxA motif is similar to one (Box1) found in the Hoxb2 enhancer and indicates that the close proximity of these Box motifs to Krox20 sites is a common feature of Krox20 targets in vivo. Two other rhombomeric elements (RE1 and RE3) are essential for r3/r5 activity and share common TCT motifs, indicating that they interact with a similar cofactor(s). TCT motifs are also found in the Hoxb2 enhancer, suggesting that they may be another common feature of Krox20-dependent control regions. The two remaining Hoxa2 cis-elements, RE2 and RE4, are not conserved in the Hoxb2 enhancer and define differences in some of components that can contribute to the Krox20-dependent activities of these enhancers. Furthermore, analysis of regulatory activities of these enhancers in a Krox20 mutant background has uncovered differences in their degree of dependence upon Krox20 for segmental expression. Together, this work has revealed a surprising degree of complexity in the number of cis-elements and regulatory components that contribute to segmental expression mediated by Krox20 and sheds light on the diversity and evolution of Krox20 target sites and Hox regulatory elements in vertebrates. Copyright 2001 Academic Press.
Kinjo, Akira R; Nakamura, Haruki
2013-01-01
Protein functions are mediated by interactions between proteins and other molecules. One useful approach to analyze protein functions is to compare and classify the structures of interaction interfaces of proteins. Here, we describe the procedures for compiling a database of interface structures and efficiently comparing the interface structures. To do so requires a good understanding of the data structures of the Protein Data Bank (PDB). Therefore, we also provide a detailed account of the PDB exchange dictionary necessary for extracting data that are relevant for analyzing interaction interfaces and secondary structures. We identify recurring structural motifs by classifying similar interface structures, and we define a coarse-grained representation of supersecondary structures (SSS) which represents a sequence of two or three secondary structure elements including their relative orientations as a string of four to seven letters. By examining the correspondence between structural motifs and SSS strings, we show that no SSS string has particularly high propensity to be found interaction interfaces in general, indicating any SSS can be used as a binding interface. When individual structural motifs are examined, there are some SSS strings that have high propensity for particular groups of structural motifs. In addition, it is shown that while the SSS strings found in particular structural motifs for nonpolymer and protein interfaces are as abundant as in other structural motifs that belong to the same subunit, structural motifs for nucleic acid interfaces exhibit somewhat stronger preference for SSS strings. In regard to protein folds, many motif-specific SSS strings were found across many folds, suggesting that SSS may be a useful description to investigate the universality of ligand binding modes.
Hayama, Ryo; Sparks, Samuel; Hecht, Lee M.; Dutta, Kaushik; Karp, Jerome M.; Cabana, Christina M.; Rout, Michael P.; Cowburn, David
2018-01-01
Intrinsically disordered proteins (IDPs) play important roles in many biological systems. Given the vast conformational space that IDPs can explore, the thermodynamics of the interactions with their partners is closely linked to their biological functions. Intrinsically disordered regions of Phe–Gly nucleoporins (FG Nups) that contain multiple phenylalanine–glycine repeats are of particular interest, as their interactions with transport factors (TFs) underlie the paradoxically rapid yet also highly selective transport of macromolecules mediated by the nuclear pore complex. Here, we used NMR and isothermal titration calorimetry to thermodynamically characterize these multivalent interactions. These analyses revealed that a combination of low per-FG motif affinity and the enthalpy–entropy balance prevents high-avidity interaction between FG Nups and TFs, whereas the large number of FG motifs promotes frequent FG–TF contacts, resulting in enhanced selectivity. Our thermodynamic model underlines the importance of functional disorder of FG Nups. It helps explain the rapid and selective translocation of TFs through the nuclear pore complex and further expands our understanding of the mechanisms of “fuzzy” interactions involving IDPs. PMID:29374059
Urata, Shuzo; Noda, Takeshi; Kawaoka, Yoshihiro; Morikawa, Shigeru; Yokosawa, Hideyoshi; Yasuda, Jiro
2007-01-01
Marburg virus (MARV) VP40 is a matrix protein that can be released from mammalian cells in the form of virus-like particles (VLPs) and contains the PPPY sequence, which is an L-domain motif. Here, we demonstrate that the PPPY motif is important for VP40-induced VLP budding and that VLP production is significantly enhanced by coexpression of NP and GP. We show that Tsg101 interacts with VP40 depending on the presence of the PPPY motif, but not the PT/SAP motif as in the case of Ebola virus, and plays an important role in VLP budding. These findings provide new insights into the mechanism of MARV budding. PMID:17301151
Thomas, L R; Foshage, A M; Weissmiller, A M; Popay, T M; Grieb, B C; Qualls, S J; Ng, V; Carboneau, B; Lorey, S; Eischen, C M; Tansey, W P
2016-07-07
The MYC family of oncogenes encodes a set of three related transcription factors that are overexpressed in many human tumors and contribute to the cancer-related deaths of more than 70,000 Americans every year. MYC proteins drive tumorigenesis by interacting with co-factors that enable them to regulate the expression of thousands of genes linked to cell growth, proliferation, metabolism and genome stability. One effective way to identify critical co-factors required for MYC function has been to focus on sequence motifs within MYC that are conserved throughout evolution, on the assumption that their conservation is driven by protein-protein interactions that are vital for MYC activity. In addition to their DNA-binding domains, MYC proteins carry five regions of high sequence conservation known as Myc boxes (Mb). To date, four of the Mb motifs (MbI, MbII, MbIIIa and MbIIIb) have had a molecular function assigned to them, but the precise role of the remaining Mb, MbIV, and the reason for its preservation in vertebrate Myc proteins, is unknown. Here, we show that MbIV is required for the association of MYC with the abundant transcriptional coregulator host cell factor-1 (HCF-1). We show that the invariant core of MbIV resembles the tetrapeptide HCF-binding motif (HBM) found in many HCF-interaction partners, and demonstrate that MYC interacts with HCF-1 in a manner indistinguishable from the prototypical HBM-containing protein VP16. Finally, we show that rationalized point mutations in MYC that disrupt interaction with HCF-1 attenuate the ability of MYC to drive tumorigenesis in mice. Together, these data expose a molecular function for MbIV and indicate that HCF-1 is an important co-factor for MYC.
Janus-faced Sestrin2 controls ROS and mTOR signalling through two separate functional domains
NASA Astrophysics Data System (ADS)
Kim, Hanseong; An, Sojin; Ro, Seung-Hyun; Teixeira, Filipa; Jin Park, Gyeong; Kim, Cheal; Cho, Chun-Seok; Kim, Jeong-Sig; Jakob, Ursula; Hee Lee, Jun; Cho, Uhn-Soo
2015-11-01
Sestrins are stress-inducible metabolic regulators with two seemingly unrelated but physiologically important functions: reduction of reactive oxygen species (ROS) and inhibition of the mechanistic target of rapamycin complex 1 (mTORC1). How Sestrins fulfil this dual role has remained elusive so far. Here we report the crystal structure of human Sestrin2 (hSesn2), and show that hSesn2 is twofold pseudo-symmetric with two globular subdomains, which are structurally similar but functionally distinct from each other. While the N-terminal domain (Sesn-A) reduces alkylhydroperoxide radicals through its helix-turn-helix oxidoreductase motif, the C-terminal domain (Sesn-C) modified this motif to accommodate physical interaction with GATOR2 and subsequent inhibition of mTORC1. These findings clarify the molecular mechanism of how Sestrins can attenuate degenerative processes such as aging and diabetes by acting as a simultaneous inhibitor of ROS accumulation and mTORC1 activation.
Redemptive Rhetoric: The Continuity Motif in the Rhetoric of Right to Life.
ERIC Educational Resources Information Center
Solomon, Martha
1980-01-01
Traces the use of the "continuity" motif in the Right to Life movement's rhetoric and its influence on the depiction of the abortion controversy. Analyzes how the motif functions rhetorically to aid the movement in defining its activities and involvement. (PD)
Zan, Xinyi; Tang, Xin; Chu, Linfang; Song, Yuanda
2018-03-21
Although multiple roles of lipases have been reported in yeasts and microalgae, the functions of lipases have not been studied in oleaginous filamentous fungi. Lipase Lip6 has been reported in the oleaginous filamentous fungus Mucor circinelloides with the consensus lipase motif GXSXG and the typical acyltransferase motif of H-(X) 4 -D. To demonstrate that Lip6 might play dual roles as a lipase and an acyltransferase, we performed site-directed mutagenesis in the lipase motif and the acyltransferase motif of Lip6. Mutation in the lipase motif increased cell biomass by 12%-18% and promoted lipid accumulation by 9%-24%, while mutation in the acyltransferase motif induced lipid degradation. In vitro, purified Lip6 had a slight lipase activity but had a stronger phospholipid:DAG acyltransferase activity. Enzyme activity assays in vivo and phospholipid synthesis pathway analysis suggested that phosphatidyl serine and phosphatidyl ethanolamine can be the supplier of a fatty acyl moiety to form TAG in M. circinelloides.
A novel swarm intelligence algorithm for finding DNA motifs.
Lei, Chengwei; Ruan, Jianhua
2009-01-01
Discovering DNA motifs from co-expressed or co-regulated genes is an important step towards deciphering complex gene regulatory networks and understanding gene functions. Despite significant improvement in the last decade, it still remains one of the most challenging problems in computational molecular biology. In this work, we propose a novel motif finding algorithm that finds consensus patterns using a population-based stochastic optimisation technique called Particle Swarm Optimisation (PSO), which has been shown to be effective in optimising difficult multidimensional problems in continuous domains. We propose to use a word dissimilarity graph to remap the neighborhood structure of the solution space of DNA motifs, and propose a modification of the naive PSO algorithm to accommodate discrete variables. In order to improve efficiency, we also propose several strategies for escaping from local optima and for automatically determining the termination criteria. Experimental results on simulated challenge problems show that our method is both more efficient and more accurate than several existing algorithms. Applications to several sets of real promoter sequences also show that our approach is able to detect known transcription factor binding sites, and outperforms two of the most popular existing algorithms.
Bhawna; Chaduvula, Pavan K; Bonthala, Venkata S; Manjusha, Verma; Siddiq, Ebrahimali A; Polumetla, Ananda K; Prasad, Gajula M N V
2015-01-01
Cucumis melo L. that belongs to Cucurbitaceae family ranks among one of the highest valued horticulture crops being cultivated across the globe. Besides its economical and medicinal importance, Cucumis melo L. is a valuable resource and model system for the evolutionary studies of cucurbit family. However, very limited numbers of molecular markers were reported for Cucumis melo L. so far that limits the pace of functional genomic research in melon and other similar horticulture crops. We developed the first whole genome based microsatellite DNA marker database of Cucumis melo L. and comprehensive web resource that aids in variety identification and physical mapping of Cucurbitaceae family. The Cucumis melo L. microsatellite database (CmMDb: http://65.181.125.102/cmmdb2/index.html) encompasses 39,072 SSR markers along with its motif repeat, motif length, motif sequence, marker ID, motif type and chromosomal locations. The database is featured with novel automated primer designing facility to meet the needs of wet lab researchers. CmMDb is a freely available web resource that facilitates the researchers to select the most appropriate markers for marker-assisted selection in melons and to improve breeding strategies.
Netz, Daili J. A.; Pierik, Antonio J.; Stümpfig, Martin; Bill, Eckhard; Sharma, Anil K.; Pallesen, Leif J.; Walden, William E.; Lill, Roland
2012-01-01
The essential P-loop NTPases Cfd1 and Nbp35 of the cytosolic iron-sulfur (Fe-S) protein assembly machinery perform a scaffold function for Fe-S cluster synthesis. Both proteins contain a nucleotide binding motif of unknown function and a C-terminal motif with four conserved cysteine residues. The latter motif defines the Mrp/Nbp35 subclass of P-loop NTPases and is suspected to be involved in transient Fe-S cluster binding. To elucidate the function of these two motifs, we first created cysteine mutant proteins of Cfd1 and Nbp35 and investigated the consequences of these mutations by genetic, cell biological, biochemical, and spectroscopic approaches. The two central cysteine residues (CPXC) of the C-terminal motif were found to be crucial for cell viability, protein function, coordination of a labile [4Fe-4S] cluster, and Cfd1-Nbp35 hetero-tetramer formation. Surprisingly, the two proximal cysteine residues were dispensable for all these functions, despite their strict evolutionary conservation. Several lines of evidence suggest that the C-terminal CPXC motifs of Cfd1-Nbp35 coordinate a bridging [4Fe-4S] cluster. Upon mutation of the nucleotide binding motifs Fe-S clusters could no longer be assembled on these proteins unless wild-type copies of Cfd1 and Nbp35 were present in trans. This result indicated that Fe-S cluster loading on these scaffold proteins is a nucleotide-dependent step. We propose that the bridging coordination of the C-terminal Fe-S cluster may be ideal for its facile assembly, labile binding, and efficient transfer to target Fe-S apoproteins, a step facilitated by the cytosolic iron-sulfur (Fe-S) protein assembly proteins Nar1 and Cia1 in vivo. PMID:22362766
Chen, Lie; Bi, Danlei; Tian, Lijun; McClafferty, Heather; Steeb, Franziska; Ruth, Peter; Knaus, Hans Guenther; Shipston, Michael J.
2013-01-01
Regulatory β-subunits of large conductance calcium- and voltage-activated potassium (BK) channels play an important role in generating functional diversity and control of cell surface expression of the pore forming α-subunits. However, in contrast to α-subunits, the role of reversible post-translational modification of intracellular residues on β-subunit function is largely unknown. Here we demonstrate that the human β4-subunit is S-acylated (palmitoylated) on a juxtamembrane cysteine residue (Cys-193) in the intracellular C terminus of the regulatory β-subunit. β4-Subunit palmitoylation is important for cell surface expression and endoplasmic reticulum (ER) exit of the β4-subunit alone. Importantly, palmitoylated β4-subunits promote the ER exit and surface expression of the pore-forming α-subunit, whereas β4-subunits that cannot be palmitoylated do not increase ER exit or surface expression of α-subunits. Strikingly, however, this palmitoylation- and β4-dependent enhancement of α-subunit surface expression was only observed in α-subunits that contain a putative trafficking motif (… REVEDEC) at the very C terminus of the α-subunit. Engineering this trafficking motif to other C-terminal α-subunit splice variants results in α-subunits with reduced surface expression that can be rescued by palmitoylated, but not depalmitoylated, β4-subunits. Our data reveal a novel mechanism by which palmitoylated β4-subunit controls surface expression of BK channels through masking of a trafficking motif in the C terminus of the α-subunit. As palmitoylation is dynamic, this mechanism would allow precise control of specific splice variants to the cell surface. Our data provide new insights into how complex interplay between the repertoire of post-transcriptional and post-translational mechanisms controls cell surface expression of BK channels. PMID:23504458
Sun, Bo; Guo, Wenting; Tian, Xixi; Yao, Jinjing; Zhang, Lin; Wang, Ruiwu; Chen, S R Wayne
2016-12-09
The ryanodine receptor (RyR) channel pore is formed by four S6 inner helices, with its intracellular gate located at the S6 helix bundle crossing region. The cytoplasmic region of the extended S6 helix is held by the U motif of the central domain and is thought to control the opening and closing of the S6 helix bundle. However, the functional significance of the S6 cytoplasmic region in channel gating is unknown. Here we assessed the role of the S6 cytoplasmic region in the function of cardiac RyR (RyR2) via structure-guided site-directed mutagenesis. We mutated each residue in the S6 cytoplasmic region of the mouse RyR2 ( 4876 QQEQVKEDM 4884 ) and characterized their functional impact. We found that mutations Q4876A, V4880A, K4881A, and M4884A, located mainly on one side of the S6 helix that faces the U motif, enhanced basal channel activity and the sensitivity to Ca 2+ or caffeine activation, whereas mutations Q4877A, E4878A, Q4879A, and D4883A, located largely on the opposite side of S6, suppressed channel activity. Furthermore, V4880A, a cardiac arrhythmia-associated mutation, markedly enhanced the frequency of spontaneous openings and the sensitivity to cytosolic and luminal Ca 2+ activation of single RyR2 channels. V4880A also increased the propensity and reduced the threshold for arrhythmogenic spontaneous Ca 2+ release in HEK293 cells. Collectively, our data suggest that interactions between the cytoplasmic region of S6 and the U motif of RyR2 are important for stabilizing the closed state of the channel. Mutations in the S6/U motif domain interface likely destabilize the closed state of RyR2, resulting in enhanced basal channel activity and sensitivity to activation and increased propensity for spontaneous Ca 2+ release and cardiac arrhythmias. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
AIRE is a critical spindle-associated protein in embryonic stem cells
Gu, Bin; Lambert, Jean-Philippe; Cockburn, Katie; Gingras, Anne-Claude; Rossant, Janet
2017-01-01
Embryonic stem (ES) cells go though embryo-like cell cycles regulated by specialized molecular mechanisms. However, it is not known whether there are ES cell-specific mechanisms regulating mitotic fidelity. Here we showed that Autoimmune Regulator (Aire), a transcription coordinator involved in immune tolerance processes, is a critical spindle-associated protein in mouse ES(mES) cells. BioID analysis showed that AIRE associates with spindle-associated proteins in mES cells. Loss of function analysis revealed that Aire was important for centrosome number regulation and spindle pole integrity specifically in mES cells. We also identified the c-terminal LESLL motif as a critical motif for AIRE’s mitotic function. Combined maternal and zygotic knockout further revealed Aire’s critical functions for spindle assembly in preimplantation embryos. These results uncovered a previously unappreciated function for Aire and provide new insights into the biology of stem cell proliferation and potential new angles to understand fertility defects in humans carrying Aire mutations. DOI: http://dx.doi.org/10.7554/eLife.28131.001 PMID:28742026
Lee, Mihwa; Sadowska, Agata; Bekere, Indra; Ho, Diwei; Gully, Benjamin S.; Lu, Yanling; Iyer, K. Swaminathan; Trewhella, Jill; Fox, Archa H.; Bond, Charles S.
2015-01-01
SFPQ, (a.k.a. PSF), is a human tumor suppressor protein that regulates many important functions in the cell nucleus including coordination of long non-coding RNA molecules into nuclear bodies. Here we describe the first crystal structures of Splicing Factor Proline and Glutamine Rich (SFPQ), revealing structural similarity to the related PSPC1/NONO heterodimer and a strikingly extended structure (over 265 Å long) formed by an unusual anti-parallel coiled-coil that results in an infinite linear polymer of SFPQ dimers within the crystals. Small-angle X-ray scattering and transmission electron microscopy experiments show that polymerization is reversible in solution and can be templated by DNA. We demonstrate that the ability to polymerize is essential for the cellular functions of SFPQ: disruptive mutation of the coiled-coil interaction motif results in SFPQ mislocalization, reduced formation of nuclear bodies, abrogated molecular interactions and deficient transcriptional regulation. The coiled-coil interaction motif thus provides a molecular explanation for the functional aggregation of SFPQ that directs its role in regulating many aspects of cellular nucleic acid metabolism. PMID:25765647
Warren, Jeremy G.; Lincoln, James E.; Kirkpatrick, Bruce C.
2015-01-01
Polygalacturonases (EC 3.2.1.15) catalyze the random hydrolysis of 1, 4-alpha-D-galactosiduronic linkages in pectate and other galacturonans. Xylella fastidiosa possesses a single polygalacturonase gene, pglA (PD1485), and X. fastidiosa mutants deficient in the production of polygalacturonase are non-pathogenic and show a compromised ability to systemically infect grapevines. These results suggested that grapevines expressing sufficient amounts of an inhibitor of X. fastidiosa polygalacturonase might be protected from disease. Previous work in our laboratory and others have tried without success to produce soluble active X. fastidiosa polygalacturonase for use in inhibition assays. In this study, we created two enzymatically active X. fastidiosa / A. vitis polygalacturonase chimeras, AX1A and AX2A to explore the functionality of X. fastidiosa polygalacturonase in vitro. The AX1A chimera was constructed to specifically test if recombinant chimeric protein, produced in Escherichia coli, is soluble and if the X. fastidiosa polygalacturonase catalytic amino acids are able to hydrolyze polygalacturonic acid. The AX2A chimera was constructed to evaluate the ability of a unique QMK motif of X. fastidiosa polygalacturonase, most polygalacturonases have a R(I/L)K motif, to bind to and allow the hydrolysis of polygalacturonic acid. Furthermore, the AX2A chimera was also used to explore what effect modification of the QMK motif of X. fastidiosa polygalacturonase to a conserved RIK motif has on enzymatic activity. These experiments showed that both the AX1A and AX2A polygalacturonase chimeras were soluble and able to hydrolyze the polygalacturonic acid substrate. Additionally, the modification of the QMK motif to the conserved RIK motif eliminated hydrolytic activity, suggesting that the QMK motif is important for the activity of X. fastidiosa polygalacturonase. This result suggests X. fastidiosa polygalacturonase may preferentially hydrolyze a different pectic substrate or, alternatively, it has a different mechanism of substrate binding than other polygalacturonases characterized to date. PMID:26571265
Warren, Jeremy G; Lincoln, James E; Kirkpatrick, Bruce C
2015-01-01
Polygalacturonases (EC 3.2.1.15) catalyze the random hydrolysis of 1, 4-alpha-D-galactosiduronic linkages in pectate and other galacturonans. Xylella fastidiosa possesses a single polygalacturonase gene, pglA (PD1485), and X. fastidiosa mutants deficient in the production of polygalacturonase are non-pathogenic and show a compromised ability to systemically infect grapevines. These results suggested that grapevines expressing sufficient amounts of an inhibitor of X. fastidiosa polygalacturonase might be protected from disease. Previous work in our laboratory and others have tried without success to produce soluble active X. fastidiosa polygalacturonase for use in inhibition assays. In this study, we created two enzymatically active X. fastidiosa / A. vitis polygalacturonase chimeras, AX1A and AX2A to explore the functionality of X. fastidiosa polygalacturonase in vitro. The AX1A chimera was constructed to specifically test if recombinant chimeric protein, produced in Escherichia coli, is soluble and if the X. fastidiosa polygalacturonase catalytic amino acids are able to hydrolyze polygalacturonic acid. The AX2A chimera was constructed to evaluate the ability of a unique QMK motif of X. fastidiosa polygalacturonase, most polygalacturonases have a R(I/L)K motif, to bind to and allow the hydrolysis of polygalacturonic acid. Furthermore, the AX2A chimera was also used to explore what effect modification of the QMK motif of X. fastidiosa polygalacturonase to a conserved RIK motif has on enzymatic activity. These experiments showed that both the AX1A and AX2A polygalacturonase chimeras were soluble and able to hydrolyze the polygalacturonic acid substrate. Additionally, the modification of the QMK motif to the conserved RIK motif eliminated hydrolytic activity, suggesting that the QMK motif is important for the activity of X. fastidiosa polygalacturonase. This result suggests X. fastidiosa polygalacturonase may preferentially hydrolyze a different pectic substrate or, alternatively, it has a different mechanism of substrate binding than other polygalacturonases characterized to date.
Godet, Angélique N; Guergnon, Julien; Maire, Virginie; Croset, Amélie; Garcia, Alphonse
2010-04-01
Previous studies established that PP1 is a target for Bcl-2 proteins and an important regulator of apoptosis. The two distinct functional PP1 consensus docking motifs, R/Kx((0,1))V/IxF and FxxR/KxR/K, involved in PP1 binding and cell death were previously characterized in the BH1 and BH3 domains of some Bcl-2 proteins. In this study, we demonstrate that DPT-AIF(1), a peptide containing the AIF(562-571) sequence located in a c-terminal domain of AIF, is a new PP1 interacting and cell penetrating molecule. We also showed that DPT-AIF(1) provoked apoptosis in several human cell lines. Furthermore, DPT-APAF(1) a bi-partite cell penetrating peptide containing APAF-1(122-131), a non penetrating sequence from APAF-1 protein, linked to our previously described DPT-sh1 peptide shuttle, is also a PP1-interacting death molecule. Both AIF(562-571) and APAF-1(122-131) sequences contain a common R/Kx((0,1))V/IxFxxR/KxR/K motif, shared by several proteins involved in control of cell survival pathways. This motif combines the two distinct PP1c consensus docking motifs initially identified in some Bcl-2 proteins. Interestingly DPT-AIF(2) and DPT-APAF(2) that carry a F to A mutation within this combinatorial motif, no longer exhibited any PP1c binding or apoptotic effects. Moreover the F to A mutation in DPT-AIF(2) also suppressed cell penetration. These results indicate that the combinatorial PP1c docking motif R/Kx((0,1))V/IxFxxR/KxR/K, deduced from AIF(562-571) and APAF-1(122-131) sequences, is a new PP1c-dependent Apoptotic Signature. This motif is also a new tool for drug design that could be used to characterize potential anti-tumour molecules.
Molecular cloning and characterization of sea bass (Dicentrarchus labrax, L.) calreticulin.
Pinto, Rute D; Moreira, Ana R; Pereira, Pedro J B; dos Santos, Nuno M S
2013-06-01
Mammalian calreticulin (CRT) is a key molecular chaperone and regulator of Ca(2+) homeostasis in endoplasmic reticulum (ER), also being implicated in a variety of physiological/pathological processes outside the ER. Importantly, it is involved in assembly of MHC class I molecules. In this work, sea bass (Dicentrarchus labrax) CRT (Dila-CRT) gene and cDNA have been isolated and characterized. The mature protein retains two conserved motifs, three structural/functional domains (N, P and C), three type 1 and 2 motifs repeated in tandem, a conserved pair of cysteines and ER-retention motif. It is a single-copy gene composed of 9 exons. Dila-CRT three-dimensional homology models are consistent with the structural features described for mammalian molecules. Together, these results are supportive of a highly conserved structure of CRT through evolution. Moreover, the present data provides information that will allow further studies on sea bass CRT involvement in immunity and in particular class I antigen presentation. Copyright © 2013 Elsevier Ltd. All rights reserved.
De novo discovery of structural motifs in RNA 3D structures through clustering.
Ge, Ping; Islam, Shahidul; Zhong, Cuncong; Zhang, Shaojie
2018-05-18
As functional components in three-dimensional (3D) conformation of an RNA, the RNA structural motifs provide an easy way to associate the molecular architectures with their biological mechanisms. In the past years, many computational tools have been developed to search motif instances by using the existing knowledge of well-studied families. Recently, with the rapidly increasing number of resolved RNA 3D structures, there is an urgent need to discover novel motifs with the newly presented information. In this work, we classify all the loops in non-redundant RNA 3D structures to detect plausible RNA structural motif families by using a clustering pipeline. Compared with other clustering approaches, our method has two benefits: first, the underlying alignment algorithm is tolerant to the variations in 3D structures. Second, sophisticated downstream analysis has been performed to ensure the clusters are valid and easily applied to further research. The final clustering results contain many interesting new variants of known motif families, such as GNAA tetraloop, kink-turn, sarcin-ricin and T-loop. We have also discovered potential novel functional motifs conserved in ribosomal RNA, sgRNA, SRP RNA, riboswitch and ribozyme.
Huang, Mengmeng; Mu, Changkao; Wu, Yuehong; Ye, Fei; Wang, Dan; Sun, Cong; Lv, Zhengbing; Han, Bingnan; Wang, Chunlin; Xu, Xue-Wei
2017-11-01
C-type lectins are a superfamily of Ca 2+ -dependent carbohydrate-recognition proteins, which play crucial roles in innate immunity including nonself-recognition and pathogen elimination. In the present study, two single-CRD containing C-type lectins were identified from swimming crab Portunus trituberculatus (designated as PtCTL-2 and PtCTL-3). The open reading frame (ORF) of PtCTL-2 encoded polypeptides of 485 amino acids with a signal peptide and a single carbohydrate-recognition domain (CRD), while PtCTL-3's ORF encoded polypeptides of 241 amino acids with a coiled-coil region and a single-CRD. The key motifs determining carbohydrate binding specificity in PtCTL-2 and PtCTL-3 were EPR (Glu-Pro-Arg) and QPD (Gln-Pro-Asp). EPR is a motif being identified for the first time, whereas QPD is a typical motif in C-type lectins. Different PAMPs binding features of the two recombinant proteins - PtCTL-2 (rPtCTL-2) and PtCTL-3 (rPtCTL-3) have been observed in our experiments. rPtCTL-2 could bind three pathogen-associated molecular patterns (PAMPs) with relatively high affinity, including glucan, lipopolysaccharide (LPS) and peptidoglycan (PGN), while rPtCTL-3 could barely bind any of them. However, rPtCTL-2 could bind seven kinds of microbes and rPtCTL-3 could bind six kinds in microbe binding assay. Moreover, rPtCTL-2 and rPtCTL-3 exhibited similar agglutination activity against Gram-positive bacteria, Gram-negative bacteria and fungi in agglutination assay. All these results illustrated that PtCTL-2 and PtCTL-3 could function as important pattern-recognition receptors (PRR) with broad nonself-recognition spectrum involved in immune defense against invaders. In addition, the results of carbohydrate binding specificity showed that PtCTL-2 with novel key motif had broad carbohydrate binding specificity, while PtCTL-3 with typical key motif possessed different carbohydrate binding specificity from the classical binding rule. Furthermore, PtCTL-2 and PtCTL-3 could also function as opsonin to enhance encapsulation of hemocytes against Ni-NTA beads. Copyright © 2017 Elsevier Ltd. All rights reserved.
Lauf, Peter K; Heiny, Judith; Meller, Jarek; Lepera, Michael A; Koikov, Leonid; Alter, Gerald M; Brown, Thomas L; Adragna, Norma C
2013-01-01
Chelerythrine [CET], a protein kinase C [PKC] inhibitor, is a prop-apoptotic BH3-mimetic binding to BH1-like motifs of Bcl-2 proteins. CET action was examined on PKC phosphorylation-dependent membrane transporters (Na+/K+ pump/ATPase [NKP, NKA], Na+-K+-2Cl+ [NKCC] and K+-Cl- [KCC] cotransporters, and channel-supported K+ loss) in human lens epithelial cells [LECs]. K+ loss and K+ uptake, using Rb+ as congener, were measured by atomic absorption/emission spectrophotometry with NKP and NKCC inhibitors, and Cl- replacement by NO3ˉ to determine KCC. 3H-Ouabain binding was performed on a pig renal NKA in the presence and absence of CET. Bcl-2 protein and NKA sequences were aligned and motifs identified and mapped using PROSITE in conjunction with BLAST alignments and analysis of conservation and structural similarity based on prediction of secondary and crystal structures. CET inhibited NKP and NKCC by >90% (IC50 values ~35 and ~15 μM, respectively) without significant KCC activity change, and stimulated K+ loss by ~35% at 10-30 μM. Neither ATP levels nor phosphorylation of the NKA α1 subunit changed. 3H-ouabain was displaced from pig renal NKA only at 100 fold higher CET concentrations than the ligand. Sequence alignments of NKA with BH1- and BH3-like motifs containing pro-survival Bcl-2 and BclXl proteins showed more than one BH1-like motif within NKA for interaction with CET or with BH3 motifs. One NKA BH1-like motif (ARAAEILARDGPN) was also found in all P-type ATPases. Also, NKA possessed a second motif similar to that near the BH3 region of Bcl-2. Findings support the hypothesis that CET inhibits NKP by binding to BH1-like motifs and disrupting the α1 subunit catalytic activity through conformational changes. By interacting with Bcl-2 proteins through their complementary BH1- or BH3-like-motifs, NKP proteins may be sensors of normal and pathological cell functions, becoming important yet unrecognized signal transducers in the initial phases of apoptosis. CET action on NKCC1 and K+ channels may involve PKC-regulated mechanisms; however, limited sequence homologies to BH1-like motifs cannot exclude direct effects.
Evidence for the Concerted Evolution between Short Linear Protein Motifs and Their Flanking Regions
Chica, Claudia; Diella, Francesca; Gibson, Toby J.
2009-01-01
Background Linear motifs are short modules of protein sequences that play a crucial role in mediating and regulating many protein–protein interactions. The function of linear motifs strongly depends on the context, e.g. functional instances mainly occur inside flexible regions that are accessible for interaction. Sometimes linear motifs appear as isolated islands of conservation in multiple sequence alignments. However, they also occur in larger blocks of sequence conservation, suggesting an active role for the neighbouring amino acids. Results The evolution of regions flanking 116 functional linear motif instances was studied. The conservation of the amino acid sequence and order/disorder tendency of those regions was related to presence/absence of the instance. For the majority of the analysed instances, the pairs of sequences conserving the linear motif were also observed to maintain a similar local structural tendency and/or to have higher local sequence conservation when compared to pairs of sequences where one is missing the linear motif. Furthermore, those instances have a higher chance to co–evolve with the neighbouring residues in comparison to the distant ones. Those findings are supported by examples where the regulation of the linear motif–mediated interaction has been shown to depend on the modifications (e.g. phosphorylation) at neighbouring positions or is thought to benefit from the binding versatility of disordered regions. Conclusion The results suggest that flanking regions are relevant for linear motif–mediated interactions, both at the structural and sequence level. More interestingly, they indicate that the prediction of linear motif instances can be enriched with contextual information by performing a sequence analysis similar to the one presented here. This can facilitate the understanding of the role of these predicted instances in determining the protein function inside the broader context of the cellular network where they arise. PMID:19584925
Kim, Sung Hyun; Ryan, Timothy A.
2009-01-01
The mechanisms of how, following exocytosis, the approximately nine types of synaptic vesicle (SV) transmembrane proteins are accurately resorted to form SVs are poorly understood. The time course of SV endocytosis is very sensitive to perturbations in clathrin and dynamin, supporting the model that SV endocytosis occurs through a clathrin-mediated pathway. We recently demonstrated that removal of the clathrin adaptor protein AP-2, the key protein thought to coordinate cargo selection into clathrin-coated pits, results in a significant impairment in endocytosis kinetics. Endocytosis, however, still proceeds in the absence of AP-2, bringing into question the role of AP-2 in cargo sorting in this process. Using quantitative endocytosis assays at nerve terminals, we examined how endocytosis depends on the integrity of μ2 function. Our experiments indicate that no single perturbation in μ2 prevents restoration of endocytic function when mutated μ2 replaces native μ2, whereas introduction of multiple distributed mutations significantly impairs endocytosis. We also examined whether the presence of AP-2 is important for the functionality of the previously identified endocytic motif in an SV cargo protein, the dileucine motif in vGlut-1. These data show that while mutations in the dileucine motif slow the retrieval of vGlut-1, they only do so in the presence of AP-2. These data thus indicate that AP-2 plays a role in cargo selection but that no single aspect of μ2 function is critical, implying that a more distributed network of interactions supports AP-2 function in SV endocytosis. PMID:19762466
The regulation of integrin function by divalent cations
Zhang, Kun; Chen, JianFeng
2012-01-01
Integrins are a family of α/β heterodimeric adhesion metalloprotein receptors and their functions are highly dependent on and regulated by different divalent cations. Recently advanced studies have revolutionized our perception of integrin metal ion-binding sites and their specific functions. Ligand binding to integrins is bridged by a divalent cation bound at the MIDAS motif on top of either α I domain in I domain-containing integrins or β I domain in α I domain-less integrins. The MIDAS motif in β I domain is flanked by ADMIDAS and SyMBS, the other two crucial metal ion binding sites playing pivotal roles in the regulation of integrin affinity and bidirectional signaling across the plasma membrane. The β-propeller domain of α subunit contains three or four β-hairpin loop-like Ca2+-binding motifs that have essential roles in integrin biogenesis. The function of another Ca2+-binding motif located at the genu of α subunit remains elusive. Here, we provide an overview of the integrin metal ion-binding sites and discuss their roles in the regulation of integrin functions. PMID:22647937
CPI motif interaction is necessary for capping protein function in cells
Edwards, Marc; McConnell, Patrick; Schafer, Dorothy A.; Cooper, John A.
2015-01-01
Capping protein (CP) has critical roles in actin assembly in vivo and in vitro. CP binds with high affinity to the barbed end of actin filaments, blocking the addition and loss of actin subunits. Heretofore, models for actin assembly in cells generally assumed that CP is constitutively active, diffusing freely to find and cap barbed ends. However, CP can be regulated by binding of the ‘capping protein interaction' (CPI) motif, found in a diverse and otherwise unrelated set of proteins that decreases, but does not abolish, the actin-capping activity of CP and promotes uncapping in biochemical experiments. Here, we report that CP localization and the ability of CP to function in cells requires interaction with a CPI-motif-containing protein. Our discovery shows that cells target and/or modulate the capping activity of CP via CPI motif interactions in order for CP to localize and function in cells. PMID:26412145
Assessment of composite motif discovery methods.
Klepper, Kjetil; Sandve, Geir K; Abul, Osman; Johansen, Jostein; Drablos, Finn
2008-02-26
Computational discovery of regulatory elements is an important area of bioinformatics research and more than a hundred motif discovery methods have been published. Traditionally, most of these methods have addressed the problem of single motif discovery - discovering binding motifs for individual transcription factors. In higher organisms, however, transcription factors usually act in combination with nearby bound factors to induce specific regulatory behaviours. Hence, recent focus has shifted from single motifs to the discovery of sets of motifs bound by multiple cooperating transcription factors, so called composite motifs or cis-regulatory modules. Given the large number and diversity of methods available, independent assessment of methods becomes important. Although there have been several benchmark studies of single motif discovery, no similar studies have previously been conducted concerning composite motif discovery. We have developed a benchmarking framework for composite motif discovery and used it to evaluate the performance of eight published module discovery tools. Benchmark datasets were constructed based on real genomic sequences containing experimentally verified regulatory modules, and the module discovery programs were asked to predict both the locations of these modules and to specify the single motifs involved. To aid the programs in their search, we provided position weight matrices corresponding to the binding motifs of the transcription factors involved. In addition, selections of decoy matrices were mixed with the genuine matrices on one dataset to test the response of programs to varying levels of noise. Although some of the methods tested tended to score somewhat better than others overall, there were still large variations between individual datasets and no single method performed consistently better than the rest in all situations. The variation in performance on individual datasets also shows that the new benchmark datasets represents a suitable variety of challenges to most methods for module discovery.
Zhang, ZhiZhuo; Chang, Cheng Wei; Hugo, Willy; Cheung, Edwin; Sung, Wing-Kin
2013-03-01
Although de novo motifs can be discovered through mining over-represented sequence patterns, this approach misses some real motifs and generates many false positives. To improve accuracy, one solution is to consider some additional binding features (i.e., position preference and sequence rank preference). This information is usually required from the user. This article presents a de novo motif discovery algorithm called SEME (sampling with expectation maximization for motif elicitation), which uses pure probabilistic mixture model to model the motif's binding features and uses expectation maximization (EM) algorithms to simultaneously learn the sequence motif, position, and sequence rank preferences without asking for any prior knowledge from the user. SEME is both efficient and accurate thanks to two important techniques: the variable motif length extension and importance sampling. Using 75 large-scale synthetic datasets, 32 metazoan compendium benchmark datasets, and 164 chromatin immunoprecipitation sequencing (ChIP-Seq) libraries, we demonstrated the superior performance of SEME over existing programs in finding transcription factor (TF) binding sites. SEME is further applied to a more difficult problem of finding the co-regulated TF (coTF) motifs in 15 ChIP-Seq libraries. It identified significantly more correct coTF motifs and, at the same time, predicted coTF motifs with better matching to the known motifs. Finally, we show that the learned position and sequence rank preferences of each coTF reveals potential interaction mechanisms between the primary TF and the coTF within these sites. Some of these findings were further validated by the ChIP-Seq experiments of the coTFs. The application is available online.
DLocalMotif: a discriminative approach for discovering local motifs in protein sequences.
Mehdi, Ahmed M; Sehgal, Muhammad Shoaib B; Kobe, Bostjan; Bailey, Timothy L; Bodén, Mikael
2013-01-01
Local motifs are patterns of DNA or protein sequences that occur within a sequence interval relative to a biologically defined anchor or landmark. Current protein motif discovery methods do not adequately consider such constraints to identify biologically significant motifs that are only weakly over-represented but spatially confined. Using negatives, i.e. sequences known to not contain a local motif, can further increase the specificity of their discovery. This article introduces the method DLocalMotif that makes use of positional information and negative data for local motif discovery in protein sequences. DLocalMotif combines three scoring functions, measuring degrees of motif over-representation, entropy and spatial confinement, specifically designed to discriminatively exploit the availability of negative data. The method is shown to outperform current methods that use only a subset of these motif characteristics. We apply the method to several biological datasets. The analysis of peroxisomal targeting signals uncovers several novel motifs that occur immediately upstream of the dominant peroxisomal targeting signal-1 signal. The analysis of proline-tyrosine nuclear localization signals uncovers multiple novel motifs that overlap with C2H2 zinc finger domains. We also evaluate the method on classical nuclear localization signals and endoplasmic reticulum retention signals and find that DLocalMotif successfully recovers biologically relevant sequence properties. http://bioinf.scmb.uq.edu.au/dlocalmotif/
Rigden, Daniel J.; Woodhead, Duncan D.; Wong, Prudence W. H.; Galperin, Michael Y.
2011-01-01
Binding of calcium ions (Ca2+) to proteins can have profound effects on their structure and function. Common roles of calcium binding include structure stabilization and regulation of activity. It is known that diverse families – EF-hands being one of at least twelve – use a Dx[DN]xDG linear motif to bind calcium in near-identical fashion. Here, four novel structural contexts for the motif are described. Existing experimental data for one of them, a thermophilic archaeal subtilisin, demonstrate for the first time a role for Dx[DN]xDG-bound calcium in protein folding. An integrin-like embedding of the motif in the blade of a β-propeller fold – here named the calcium blade – is discovered in structures of bacterial and fungal proteins. Furthermore, sensitive database searches suggest a common origin for the calcium blade in β-propeller structures of different sizes and a pan-kingdom distribution of these proteins. Factors favouring the multiple convergent evolution of the motif appear to include its general Asp-richness, the regular spacing of the Asp residues and the fact that change of Asp into Gly and vice versa can occur though a single nucleotide change. Among the known structural contexts for the Dx[DN]xDG motif, only the calcium blade and the EF-hand are currently found intracellularly in large numbers, perhaps because the higher extracellular concentration of Ca2+ allows for easier fixing of newly evolved motifs that have acquired useful functions. The analysis presented here will inform ongoing efforts toward prediction of similar calcium-binding motifs from sequence information alone. PMID:21720552
An intact PDZ motif is essential for correct P2Y12 purinoceptor traffic in human platelets
Nisar, Shaista; Daly, Martina E.; Federici, Augusto B.; Artoni, Andrea; Mumford, Andrew D.; Watson, Stephen P.
2011-01-01
The platelet P2Y12 purinoceptor (P2Y12R), which plays a crucial role in hemostasis, undergoes internalization and subsequent recycling to maintain receptor responsiveness, processes that are essential for normal platelet function. Here, we observe that P2Y12R function is compromised after deletion or mutation of the 4 amino acids at the extreme C-terminus of this receptor (ETPM), a putative postsynaptic density 95/disc large/zonula occludens-1 (PDZ)–binding motif. In cell line models, removal of this sequence or mutation of one of its core residues (P341A), attenuates receptor internalization and receptor recycling back to the membrane, thereby blocking receptor resensitization. The physiologic significance of these findings in the regulation of platelet function is shown by identification of a patient with a heterozygous mutation in the PDZ binding sequence of their P2Y12R (P341A) that is associated with reduced expression of the P2Y12R on the cell surface. Importantly, platelets from this subject showed significantly compromised P2Y12R recycling, emphasizing the importance of the extreme C-terminus of this receptor to ensure correct receptor traffic. PMID:21937696
A Bioinformatics Approach for Detecting Repetitive Nested Motifs using Pattern Matching.
Romero, José R; Carballido, Jessica A; Garbus, Ingrid; Echenique, Viviana C; Ponzoni, Ignacio
2016-01-01
The identification of nested motifs in genomic sequences is a complex computational problem. The detection of these patterns is important to allow the discovery of transposable element (TE) insertions, incomplete reverse transcripts, deletions, and/or mutations. In this study, a de novo strategy for detecting patterns that represent nested motifs was designed based on exhaustive searches for pairs of motifs and combinatorial pattern analysis. These patterns can be grouped into three categories, motifs within other motifs, motifs flanked by other motifs, and motifs of large size. The methodology used in this study, applied to genomic sequences from the plant species Aegilops tauschii and Oryza sativa , revealed that it is possible to identify putative nested TEs by detecting these three types of patterns. The results were validated through BLAST alignments, which revealed the efficacy and usefulness of the new method, which is called Mamushka.
Guzina, Jelena
2016-01-01
ABSTRACT Extracytoplasmic function (ECF) σ factors are the largest and the most diverse group of alternative σ factors, but their mechanisms of transcription are poorly studied. This subfamily is considered to exhibit a rigid promoter structure and an absence of mixing and matching; both −35 and −10 elements are considered necessary for initiating transcription. This paradigm, however, is based on very limited data, which bias the analysis of diverse ECF σ subgroups. Here we investigate DNA and protein recognition motifs involved in ECF σ factor transcription by a computational analysis of canonical ECF subfamily members, much less studied ECF σ subgroups, and the group outliers, obtained from recently sequenced bacteriophages. The analysis identifies an extended −10 element in promoters for phage ECF σ factors; a comparison with bacterial σ factors points to a putative 6-amino-acid motif just C-terminal of domain σ2, which is responsible for the interaction with the identified extension of the −10 element. Interestingly, a similar protein motif is found C-terminal of domain σ2 in canonical ECF σ factors, at a position where it is expected to interact with a conserved motif further upstream of the −10 element. Moreover, the phiEco32 ECF σ factor lacks a recognizable −35 element and σ4 domain, which we identify in a homologous phage, 7-11, indicating that the extended −10 element can compensate for the lack of −35 element interactions. Overall, the results reveal greater flexibility in promoter recognition by ECF σ factors than previously recognized and raise the possibility that mixing and matching also apply to this group, a notion that remains to be biochemically tested. IMPORTANCE ECF σ factors are the most numerous group of alternative σ factors but have been little studied. Their promoter recognition mechanisms are obscured by the large diversity within the ECF σ factor group and the limited similarity with the well-studied housekeeping σ factors. Here we extensively compare bacterial and bacteriophage ECF σ factors and their promoters in order to infer DNA and protein recognition motifs involved in transcription initiation. We predict a more flexible promoter structure than is recognized by the current paradigm, which assumes rigidness, and propose that ECF σ promoter elements may complement (mix and match with) each other's strengths. These results warrant the refocusing of research efforts from the well-studied housekeeping σ factors toward the physiologically highly important, but insufficiently understood, alternative σ factors. PMID:27137497
Gene Isolation Using Degenerate Primers Targeting Protein Motif: A Laboratory Exercise
ERIC Educational Resources Information Center
Yeo, Brandon Pei Hui; Foong, Lian Chee; Tam, Sheh May; Lee, Vivian; Hwang, Siaw San
2018-01-01
Structures and functions of protein motifs are widely included in many biology-based course syllabi. However, little emphasis is placed to link this knowledge to applications in biotechnology to enhance the learning experience. Here, the conserved motifs of nucleotide binding site-leucine rich repeats (NBS-LRR) proteins, successfully used for the…
A Conserved GPG-Motif in the HIV-1 Nef Core Is Required for Principal Nef-Activities
Martínez-Bonet, Marta; Palladino, Claudia; Briz, Veronica; Rudolph, Jochen M.; Fackler, Oliver T.; Relloso, Miguel; Muñoz-Fernandez, Maria Angeles; Madrid, Ricardo
2015-01-01
To find out new determinants required for Nef activity we performed a functional alanine scanning analysis along a discrete but highly conserved region at the core of HIV-1 Nef. We identified the GPG-motif, located at the 121–137 region of HIV-1 NL4.3 Nef, as a novel protein signature strictly required for the p56Lck dependent Nef-induced CD4-downregulation in T-cells. Since the Nef-GPG motif was dispensable for CD4-downregulation in HeLa-CD4 cells, Nef/AP-1 interaction and Nef-dependent effects on Tf-R trafficking, the observed effects on CD4 downregulation cannot be attributed to structure constraints or to alterations on general protein trafficking. Besides, we found that the GPG-motif was also required for Nef-dependent inhibition of ring actin re-organization upon TCR triggering and MHCI downregulation, suggesting that the GPG-motif could actively cooperate with the Nef PxxP motif for these HIV-1 Nef-related effects. Finally, we observed that the Nef-GPG motif was required for optimal infectivity of those viruses produced in T-cells. According to these findings, we propose the conserved GPG-motif in HIV-1 Nef as functional region required for HIV-1 infectivity and therefore with a potential interest for the interference of Nef activity during HIV-1 infection. PMID:26700863
Nicholson, Judith; Scherl, Alex; Way, Luke; Blackburn, Elizabeth A; Walkinshaw, Malcolm D; Ball, Kathryn L; Hupp, Ted R
2014-06-01
Linear motifs mediate protein-protein interactions (PPI) that allow expansion of a target protein interactome at a systems level. This study uses a proteomics approach and linear motif sub-stratifications to expand on PPIs of MDM2. MDM2 is a multi-functional protein with over one hundred known binding partners not stratified by hierarchy or function. A new linear motif based on a MDM2 interaction consensus is used to select novel MDM2 interactors based on Nutlin-3 responsiveness in a cell-based proteomics screen. MDM2 binds a subset of peptide motifs corresponding to real proteins with a range of allosteric responses to MDM2 ligands. We validate cyclophilin B as a novel protein with a consensus MDM2 binding motif that is stabilised by Nutlin-3 in vivo, thus identifying one of the few known interactors of MDM2 that is stabilised by Nutlin-3. These data invoke two modes of peptide binding at the MDM2 N-terminus that rely on a consensus core motif to control the equilibrium between MDM2 binding proteins. This approach stratifies MDM2 interacting proteins based on the linear motif feature and provides a new biomarker assay to define clinically relevant Nutlin-3 responsive MDM2 interactors. Copyright © 2014 Elsevier Inc. All rights reserved.
Goldie, Belinda J; Fitzsimmons, Chantel; Weidenhofer, Judith; Atkins, Joshua R; Wang, Dan O; Cairns, Murray J
2017-01-01
While the cytoplasmic function of microRNA (miRNA) as post-transcriptional regulators of mRNA has been the subject of significant research effort, their activity in the nucleus is less well characterized. Here we use a human neuronal cell model to show that some mature miRNA are preferentially enriched in the nucleus. These molecules were predominantly primate-specific and contained a sequence motif with homology to the consensus MAZ transcription factor binding element. Precursor miRNA containing this motif were shown to have affinity for MAZ protein in nuclear extract. We then used Ago1/2 RIP-Seq to explore nuclear miRNA-associated mRNA targets. Interestingly, the genes for Ago2-associated transcripts were also significantly enriched with MAZ binding sites and neural function, whereas Ago1-transcripts were associated with general metabolic processes and localized with SC35 spliceosomes. These findings suggest the MAZ transcription factor is associated with miRNA in the nucleus and may influence the regulation of neuronal development through Ago2-associated miRNA induced silencing complexes. The MAZ transcription factor may therefore be important for organizing higher order integration of transcriptional and post-transcriptional processes in primate neurons.
Habisov, Sabrina; Huber, Jessica; Ichimura, Yoshinobu; Akutsu, Masato; Rogova, Natalia; Loehr, Frank; McEwan, David G.; Johansen, Terje; Dikic, Ivan; Doetsch, Volker; Komatsu, Masaaki; Rogov, Vladimir V.; Kirkin, Vladimir
2016-01-01
The covalent conjugation of ubiquitin-fold modifier 1 (UFM1) to proteins generates a signal that regulates transcription, response to cell stress, and differentiation. Ufmylation is initiated by ubiquitin-like modifier activating enzyme 5 (UBA5), which activates and transfers UFM1 to ubiquitin-fold modifier-conjugating enzyme 1 (UFC1). The details of the interaction between UFM1 and UBA5 required for UFM1 activation and its downstream transfer are however unclear. In this study, we described and characterized a combined linear LC3-interacting region/UFM1-interacting motif (LIR/UFIM) within the C terminus of UBA5. This single motif ensures that UBA5 binds both UFM1 and light chain 3/γ-aminobutyric acid receptor-associated proteins (LC3/GABARAP), two ubiquitin (Ub)-like proteins. We demonstrated that LIR/UFIM is required for the full biological activity of UBA5 and for the effective transfer of UFM1 onto UFC1 and a downstream protein substrate both in vitro and in cells. Taken together, our study provides important structural and functional insights into the interaction between UBA5 and Ub-like modifiers, improving the understanding of the biology of the ufmylation pathway. PMID:26929408
Stapf, Christopher; Cartwright, Edward; Bycroft, Mark; Hofmann, Kay; Buchberger, Alexander
2011-01-01
Cellular functions of the essential, ubiquitin-selective AAA ATPase p97/valosin-containing protein (VCP) are controlled by regulatory cofactors determining substrate specificity and fate. Most cofactors bind p97 through a ubiquitin regulatory X (UBX) or UBX-like domain or linear sequence motifs, including the hitherto ill defined p97/VCP-interacting motif (VIM). Here, we present the new, minimal consensus sequence RX5AAX2R as a general definition of the VIM that unites a novel family of known and putative p97 cofactors, among them UBXD1 and ZNF744/ANKZF1. We demonstrate that this minimal VIM consensus sequence is necessary and sufficient for p97 binding. Using NMR chemical shift mapping, we identified several residues of the p97 N-terminal domain (N domain) that are critical for VIM binding. Importantly, we show that cellular stress resistance conferred by the yeast VIM-containing cofactor Vms1 depends on the physical interaction between its VIM and the critical N domain residues of the yeast p97 homolog, Cdc48. Thus, the VIM-N domain interaction characterized in this study is required for the physiological function of Vms1 and most likely other members of the newly defined VIM family of cofactors. PMID:21896481
Convergent evolution and mimicry of protein linear motifs in host-pathogen interactions.
Chemes, Lucía Beatriz; de Prat-Gay, Gonzalo; Sánchez, Ignacio Enrique
2015-06-01
Pathogen linear motif mimics are highly evolvable elements that facilitate rewiring of host protein interaction networks. Host linear motifs and pathogen mimics differ in sequence, leading to thermodynamic and structural differences in the resulting protein-protein interactions. Moreover, the functional output of a mimic depends on the motif and domain repertoire of the pathogen protein. Regulatory evolution mediated by linear motifs can be understood by measuring evolutionary rates, quantifying positive and negative selection and performing phylogenetic reconstructions of linear motif natural history. Convergent evolution of linear motif mimics is widespread among unrelated proteins from viral, prokaryotic and eukaryotic pathogens and can also take place within individual protein phylogenies. Statistics, biochemistry and laboratory models of infection link pathogen linear motifs to phenotypic traits such as tropism, virulence and oncogenicity. In vitro evolution experiments and analysis of natural sequences suggest that changes in linear motif composition underlie pathogen adaptation to a changing environment. Copyright © 2015 Elsevier Ltd. All rights reserved.
Colinet, Anne-Sophie; Thines, Louise; Deschamps, Antoine; Flémal, Gaëlle; Demaegd, Didier; Morsomme, Pierre
2017-07-01
The UPF0016 family is a recently identified group of poorly characterized membrane proteins whose function is conserved through evolution and that are defined by the presence of 1 or 2 copies of the E-φ-G-D-[KR]-[TS] consensus motif in their transmembrane domain. We showed that 2 members of this family, the human TMEM165 and the budding yeast Gdt1p, are functionally related and are likely to form a new group of Ca 2+ transporters. Mutations in TMEM165 have been demonstrated to cause a new type of rare human genetic diseases denominated as Congenital Disorders of Glycosylation. Using site-directed mutagenesis, we generated 17 mutations in the yeast Golgi-localized Ca 2+ transporter Gdt1p. Single alanine substitutions were targeted to the highly conserved consensus motifs, 4 acidic residues localized in the central cytosolic loop, and the arginine at position 71. The mutants were screened in a yeast strain devoid of both the endogenous Gdt1p exchanger and Pmr1p, the Ca 2+ -ATPase of the Golgi apparatus. We show here that acidic and polar uncharged residues of the consensus motifs play a crucial role in calcium tolerance and calcium transport activity and are therefore likely to be architectural components of the cation binding site of Gdt1p. Importantly, we confirm the essential role of the E53 residue whose mutation in humans triggers congenital disorders of glycosylation. © 2017 John Wiley & Sons Ltd.
Structural and energetic study of cation-π-cation interactions in proteins.
Pinheiro, Silvana; Soteras, Ignacio; Gelpí, Josep Lluis; Dehez, François; Chipot, Christophe; Luque, F Javier; Curutchet, Carles
2017-04-12
Cation-π interactions of aromatic rings and positively charged groups are among the most important interactions in structural biology. The role and energetic characteristics of these interactions are well established. However, the occurrence of cation-π-cation interactions is an unexpected motif, which raises intriguing questions about its functional role in proteins. We present a statistical analysis of the occurrence, composition and geometrical preferences of cation-π-cation interactions identified in a set of non-redundant protein structures taken from the Protein Data Bank. Our results demonstrate that this structural motif is observed at a small, albeit non-negligible frequency in proteins, and suggest a preference to establish cation-π-cation motifs with Trp, followed by Tyr and Phe. Furthermore, we have found that cation-π-cation interactions tend to be highly conserved, which supports their structural or functional role. Finally, we have performed an energetic analysis of a representative subset of cation-π-cation complexes combining quantum-chemical and continuum solvation calculations. Our results point out that the protein environment can strongly screen the cation-cation repulsion, leading to an attractive interaction in 64% of the complexes analyzed. Together with the high degree of conservation observed, these results suggest a potential stabilizing role in the protein fold, as demonstrated recently for a miniature protein (Craven et al., J. Am. Chem. Soc. 2016, 138, 1543). From a computational point of view, the significant contribution of non-additive three-body terms challenges the suitability of standard additive force fields for describing cation-π-cation motifs in molecular simulations.
El Sahili, Abbas; Li, Si-Zhe; Lang, Julien; Virus, Cornelia; Planamente, Sara; Ahmar, Mohammed; Guimaraes, Beatriz G.; Aumont-Nicaise, Magali; Vigouroux, Armelle; Soulère, Laurent; Reader, John; Queneau, Yves; Faure, Denis; Moréra, Solange
2015-01-01
Periplasmic binding proteins (PBPs) in association with ABC transporters select and import a wide variety of ligands into bacterial cytoplasm. They can also take up toxic molecules, as observed in the case of the phytopathogen Agrobacterium tumefaciens strain C58. This organism contains a PBP called AccA that mediates the import of the antibiotic agrocin 84, as well as the opine agrocinopine A that acts as both a nutrient and a signalling molecule for the dissemination of virulence genes through quorum-sensing. Here, we characterized the binding mode of AccA using purified agrocin 84 and synthetic agrocinopine A by X-ray crystallography at very high resolution and performed affinity measurements. Structural and affinity analyses revealed that AccA recognizes an uncommon and specific motif, a pyranose-2-phosphate moiety which is present in both imported molecules via the L-arabinopyranose moiety in agrocinopine A and the D-glucopyranose moiety in agrocin 84. We hypothesized that AccA is a gateway allowing the import of any compound possessing a pyranose-2-phosphate motif at one end. This was structurally and functionally confirmed by experiments using four synthetic compounds: agrocinopine 3’-O-benzoate, L-arabinose-2-isopropylphosphate, L-arabinose-2-phosphate and D-glucose-2-phosphate. By combining affinity measurements and in vivo assays, we demonstrated that both L-arabinose-2-phosphate and D-glucose-2-phosphate, which are the AccF mediated degradation products of agrocinopine A and agrocin 84 respectively, interact with the master transcriptional regulator AccR and activate the quorum-sensing signal synthesis and Ti plasmid transfer in A. tumefaciens C58. Our findings shed light on the role of agrocinopine and antibiotic agrocin 84 on quorum-sensing regulation in A. tumefaciens and reveal how the PBP AccA acts as vehicle for the importation of both molecules by means of a key-recognition motif. It also opens future possibilities for the rational design of antibiotic and anti-virulence compounds against A. tumefaciens or other pathogens possessing similar PBPs. PMID:26244338
El Sahili, Abbas; Li, Si-Zhe; Lang, Julien; Virus, Cornelia; Planamente, Sara; Ahmar, Mohammed; Guimaraes, Beatriz G; Aumont-Nicaise, Magali; Vigouroux, Armelle; Soulère, Laurent; Reader, John; Queneau, Yves; Faure, Denis; Moréra, Solange
2015-08-01
Periplasmic binding proteins (PBPs) in association with ABC transporters select and import a wide variety of ligands into bacterial cytoplasm. They can also take up toxic molecules, as observed in the case of the phytopathogen Agrobacterium tumefaciens strain C58. This organism contains a PBP called AccA that mediates the import of the antibiotic agrocin 84, as well as the opine agrocinopine A that acts as both a nutrient and a signalling molecule for the dissemination of virulence genes through quorum-sensing. Here, we characterized the binding mode of AccA using purified agrocin 84 and synthetic agrocinopine A by X-ray crystallography at very high resolution and performed affinity measurements. Structural and affinity analyses revealed that AccA recognizes an uncommon and specific motif, a pyranose-2-phosphate moiety which is present in both imported molecules via the L-arabinopyranose moiety in agrocinopine A and the D-glucopyranose moiety in agrocin 84. We hypothesized that AccA is a gateway allowing the import of any compound possessing a pyranose-2-phosphate motif at one end. This was structurally and functionally confirmed by experiments using four synthetic compounds: agrocinopine 3'-O-benzoate, L-arabinose-2-isopropylphosphate, L-arabinose-2-phosphate and D-glucose-2-phosphate. By combining affinity measurements and in vivo assays, we demonstrated that both L-arabinose-2-phosphate and D-glucose-2-phosphate, which are the AccF mediated degradation products of agrocinopine A and agrocin 84 respectively, interact with the master transcriptional regulator AccR and activate the quorum-sensing signal synthesis and Ti plasmid transfer in A. tumefaciens C58. Our findings shed light on the role of agrocinopine and antibiotic agrocin 84 on quorum-sensing regulation in A. tumefaciens and reveal how the PBP AccA acts as vehicle for the importation of both molecules by means of a key-recognition motif. It also opens future possibilities for the rational design of antibiotic and anti-virulence compounds against A. tumefaciens or other pathogens possessing similar PBPs.
Prediction of virus-host protein-protein interactions mediated by short linear motifs.
Becerra, Andrés; Bucheli, Victor A; Moreno, Pedro A
2017-03-09
Short linear motifs in host organisms proteins can be mimicked by viruses to create protein-protein interactions that disable or control metabolic pathways. Given that viral linear motif instances of host motif regular expressions can be found by chance, it is necessary to develop filtering methods of functional linear motifs. We conduct a systematic comparison of linear motifs filtering methods to develop a computational approach for predicting motif-mediated protein-protein interactions between human and the human immunodeficiency virus 1 (HIV-1). We implemented three filtering methods to obtain linear motif sets: 1) conserved in viral proteins (C), 2) located in disordered regions (D) and 3) rare or scarce in a set of randomized viral sequences (R). The sets C,D,R are united and intersected. The resulting sets are compared by the number of protein-protein interactions correctly inferred with them - with experimental validation. The comparison is done with HIV-1 sequences and interactions from the National Institute of Allergy and Infectious Diseases (NIAID). The number of correctly inferred interactions allows to rank the interactions by the sets used to deduce them: D∪R and C. The ordering of the sets is descending on the probability of capturing functional interactions. With respect to HIV-1, the sets C∪R, D∪R, C∪D∪R infer all known interactions between HIV1 and human proteins mediated by linear motifs. We found that the majority of conserved linear motifs in the virus are located in disordered regions. We have developed a method for predicting protein-protein interactions mediated by linear motifs between HIV-1 and human proteins. The method only use protein sequences as inputs. We can extend the software developed to any other eukaryotic virus and host in order to find and rank candidate interactions. In future works we will use it to explore possible viral attack mechanisms based on linear motif mimicry.
Lofgren, Michael; Koutmos, Markos; Banerjee, Ruma
2013-10-25
MeaB is an accessory GTPase protein involved in the assembly, protection, and reactivation of 5'-deoxyadenosyl cobalamin-dependent methylmalonyl-CoA mutase (MCM). Mutations in the human ortholog of MeaB result in methylmalonic aciduria, an inborn error of metabolism. G-proteins typically utilize conserved switch I and II motifs for signaling to effector proteins via conformational changes elicited by nucleotide binding and hydrolysis. Our recent discovery that MeaB utilizes an unusual switch III region for bidirectional signaling with MCM raised questions about the roles of the switch I and II motifs in MeaB. In this study, we addressed the functions of conserved switch II residues by performing alanine-scanning mutagenesis. Our results demonstrate that the GTPase activity of MeaB is autoinhibited by switch II and that this loop is important for coupling nucleotide-sensitive conformational changes in switch III to elicit the multiple chaperone functions of MeaB. Furthermore, we report the structure of MeaB·GDP crystallized in the presence of AlFx(-) to form the putative transition state analog, GDP·AlF4(-). The resulting crystal structure and its comparison with related G-proteins support the conclusion that the catalytic site of MeaB is incomplete in the absence of the GTPase-activating protein MCM and therefore unable to stabilize the transition state analog. Favoring an inactive conformation in the absence of the client MCM protein might represent a strategy for suppressing the intrinsic GTPase activity of MeaB in which the switch II loop plays an important role.
Ang, Swee Kim; Lu, Hui
2009-10-16
Erv1p is a FAD-dependent sulfhydryl oxidase of the mitochondrial intermembrane space. It contains three conserved disulfide bonds arranged in two CXXC motifs and one CX(16)C motif. Experimental evidence for the specific roles of the individual disulfide bonds is lacking. In this study, structural and functional roles of the disulfides were dissected systematically using a wide range of biochemical and biophysical methods. Three double cysteine mutants with each pair of cysteines mutated to serines were generated. All of the mutants were purified with the normal FAD binding properties as the wild type Erv1p, showing that none of the three disulfides are essential for FAD binding. Thermal denaturation and trypsin digestion studies showed that the CX(16)C disulfide plays an important role in stabilizing the folding of Erv1p. To understand the functional role of each disulfide, small molecules and the physiological substrate protein Mia40 were used as electron donors in oxygen consumption assays. We show that both CXXC disulfides are required for Erv1 oxidase activity. The active site disulfide is well protected thus requires the shuttle disulfide for its function. Although both mutants of the CXXC motifs were individually inactive, Erv1p activity was partially recovered by mixing these two mutants together, and the recovery was rapid. Thus, we provided the first experimental evidence of electron transfer between the shuttle and active site disulfides of Erv1p, and we propose that both intersubunit and intermolecular electron transfer can occur.
Ang, Swee Kim; Lu, Hui
2009-01-01
Erv1p is a FAD-dependent sulfhydryl oxidase of the mitochondrial intermembrane space. It contains three conserved disulfide bonds arranged in two CXXC motifs and one CX16C motif. Experimental evidence for the specific roles of the individual disulfide bonds is lacking. In this study, structural and functional roles of the disulfides were dissected systematically using a wide range of biochemical and biophysical methods. Three double cysteine mutants with each pair of cysteines mutated to serines were generated. All of the mutants were purified with the normal FAD binding properties as the wild type Erv1p, showing that none of the three disulfides are essential for FAD binding. Thermal denaturation and trypsin digestion studies showed that the CX16C disulfide plays an important role in stabilizing the folding of Erv1p. To understand the functional role of each disulfide, small molecules and the physiological substrate protein Mia40 were used as electron donors in oxygen consumption assays. We show that both CXXC disulfides are required for Erv1 oxidase activity. The active site disulfide is well protected thus requires the shuttle disulfide for its function. Although both mutants of the CXXC motifs were individually inactive, Erv1p activity was partially recovered by mixing these two mutants together, and the recovery was rapid. Thus, we provided the first experimental evidence of electron transfer between the shuttle and active site disulfides of Erv1p, and we propose that both intersubunit and intermolecular electron transfer can occur. PMID:19679655
Members of the Meloidogyne avirulence protein family contain multiple plant ligand-like motifs.
Rutter, William B; Hewezi, Tarek; Maier, Tom R; Mitchum, Melissa G; Davis, Eric L; Hussey, Richard S; Baum, Thomas J
2014-08-01
Sedentary plant-parasitic nematodes engage in complex interactions with their host plants by secreting effector proteins. Some effectors of both root-knot nematodes (Meloidogyne spp.) and cyst nematodes (Heterodera and Globodera spp.) mimic plant ligand proteins. Most prominently, cyst nematodes secrete effectors that mimic plant CLAVATA3/ESR-related (CLE) ligand proteins. However, only cyst nematodes have been shown to secrete such effectors and to utilize CLE ligand mimicry in their interactions with host plants. Here, we document the presence of ligand-like motifs in bona fide root-knot nematode effectors that are most similar to CLE peptides from plants and cyst nematodes. We have identified multiple tandem CLE-like motifs conserved within the previously identified Meloidogyne avirulence protein (MAP) family that are secreted from root-knot nematodes and have been shown to function in planta. By searching all 12 MAP family members from multiple Meloidogyne spp., we identified 43 repetitive CLE-like motifs composing 14 unique variants. At least one CLE-like motif was conserved in each MAP family member. Furthermore, we documented the presence of other conserved sequences that resemble the variable domains described in Heterodera and Globodera CLE effectors. These findings document that root-knot nematodes appear to use CLE ligand mimicry and point toward a common host node targeted by two evolutionarily diverse groups of nematodes. As a consequence, it is likely that CLE signaling pathways are important in other phytonematode pathosystems as well.
Interconnected network motifs control podocyte morphology and kidney function.
Azeloglu, Evren U; Hardy, Simon V; Eungdamrong, Narat John; Chen, Yibang; Jayaraman, Gomathi; Chuang, Peter Y; Fang, Wei; Xiong, Huabao; Neves, Susana R; Jain, Mohit R; Li, Hong; Ma'ayan, Avi; Gordon, Ronald E; He, John Cijiang; Iyengar, Ravi
2014-02-04
Podocytes are kidney cells with specialized morphology that is required for glomerular filtration. Diseases, such as diabetes, or drug exposure that causes disruption of the podocyte foot process morphology results in kidney pathophysiology. Proteomic analysis of glomeruli isolated from rats with puromycin-induced kidney disease and control rats indicated that protein kinase A (PKA), which is activated by adenosine 3',5'-monophosphate (cAMP), is a key regulator of podocyte morphology and function. In podocytes, cAMP signaling activates cAMP response element-binding protein (CREB) to enhance expression of the gene encoding a differentiation marker, synaptopodin, a protein that associates with actin and promotes its bundling. We constructed and experimentally verified a β-adrenergic receptor-driven network with multiple feedback and feedforward motifs that controls CREB activity. To determine how the motifs interacted to regulate gene expression, we mapped multicompartment dynamical models, including information about protein subcellular localization, onto the network topology using Petri net formalisms. These computational analyses indicated that the juxtaposition of multiple feedback and feedforward motifs enabled the prolonged CREB activation necessary for synaptopodin expression and actin bundling. Drug-induced modulation of these motifs in diseased rats led to recovery of normal morphology and physiological function in vivo. Thus, analysis of regulatory motifs using network dynamics can provide insights into pathophysiology that enable predictions for drug intervention strategies to treat kidney disease.
Interconnected Network Motifs Control Podocyte Morphology and Kidney Function
Azeloglu, Evren U.; Hardy, Simon V.; Eungdamrong, Narat John; Chen, Yibang; Jayaraman, Gomathi; Chuang, Peter Y.; Fang, Wei; Xiong, Huabao; Neves, Susana R.; Jain, Mohit R.; Li, Hong; Ma’ayan, Avi; Gordon, Ronald E.; He, John Cijiang; Iyengar, Ravi
2014-01-01
Podocytes are kidney cells with specialized morphology that is required for glomerular filtration. Diseases, such as diabetes, or drug exposure that causes disruption of the podocyte foot process morphology results in kidney pathophysiology. Proteomic analysis of glomeruli isolated from rats with puromycin-induced kidney disease and control rats indicated that protein kinase A (PKA), which is activated by adenosine 3′,5′-monophosphate (cAMP), is a key regulator of podocyte morphology and function. In podocytes, cAMP signaling activates cAMP response element–binding protein (CREB) to enhance expression of the gene encoding a differentiation marker, synaptopodin, a protein that associates with actin and promotes its bundling. We constructed and experimentally verified a β-adrenergic receptor–driven network with multiple feedback and feedforward motifs that controls CREB activity. To determine how the motifs interacted to regulate gene expression, we mapped multicompartment dynamical models, including information about protein subcellular localization, onto the network topology using Petri net formalisms. These computational analyses indicated that the juxtaposition of multiple feedback and feedforward motifs enabled the prolonged CREB activation necessary for synaptopodin expression and actin bundling. Drug-induced modulation of these motifs in diseased rats led to recovery of normal morphology and physiological function in vivo. Thus, analysis of regulatory motifs using network dynamics can provide insights into pathophysiology that enable predictions for drug intervention strategies to treat kidney disease. PMID:24497609
Combinatorics of feedback in cellular uptake and metabolism of small molecules.
Krishna, Sandeep; Semsey, Szabolcs; Sneppen, Kim
2007-12-26
We analyze the connection between structure and function for regulatory motifs associated with cellular uptake and usage of small molecules. Based on the boolean logic of the feedback we suggest four classes: the socialist, consumer, fashion, and collector motifs. We find that the socialist motif is good for homeostasis of a useful but potentially poisonous molecule, whereas the consumer motif is optimal for nutrition molecules. Accordingly, examples of these motifs are found in, respectively, the iron homeostasis system in various organisms and in the uptake of sugar molecules in bacteria. The remaining two motifs have no obvious analogs in small molecule regulation, but we illustrate their behavior using analogies to fashion and obesity. These extreme motifs could inspire construction of synthetic systems that exhibit bistable, history-dependent states, and homeostasis of flux (rather than concentration).
Tanaka, Naoto; Delemotte, Lucie; Klein, Michael L.; Komáromy, András M.; Tanaka, Jacqueline C.
2014-01-01
Cone cyclic nucleotide-gated channels are tetramers formed by CNGA3 and CNGB3 subunits; CNGA3 subunits function as homotetrameric channels but CNGB3 exhibits channel function only when co-expressed with CNGA3. An aspartatic acid (Asp) to asparagine (Asn) missense mutation at position 262 in the canine CNGB3 (D262N) subunit results in loss of cone function (daylight blindness), suggesting an important role for this aspartic acid residue in channel biogenesis and/or function. Asp 262 is located in a conserved region of the second transmembrane segment containing three Asp residues designated the Tri-Asp motif. This motif is conserved in all CNG channels. Here we examine mutations in canine CNGA3 homomeric channels using a combination of experimental and computational approaches. Mutations of these conserved Asp residues result in the absence of nucleotide-activated currents in heterologous expression. A fluorescent tag on CNGA3 shows mislocalization of mutant channels. Co-expressing CNGB3 Tri-Asp mutants with wild type CNGA3 results in some functional channels, however, their electrophysiological characterization matches the properties of homomeric CNGA3 channels. This failure to record heteromeric currents suggests that Asp/Asn mutations affect heteromeric subunit assembly. A homology model of S1–S6 of the CNGA3 channel was generated and relaxed in a membrane using molecular dynamics simulations. The model predicts that the Tri-Asp motif is involved in non-specific salt bridge pairings with positive residues of S3/S4. We propose that the D262N mutation in dogs with CNGB3-day blindness results in the loss of these inter-helical interactions altering the electrostatic equilibrium within in the S1–S4 bundle. Because residues analogous to Tri-Asp in the voltage-gated Shaker potassium channel family were implicated in monomer folding, we hypothesize that destabilizing these electrostatic interactions impairs the monomer folding state in D262N mutant CNG channels during biogenesis. PMID:24586388
Identification and preliminary characterization of a protein motif related to the zinc finger.
Lovering, R; Hanson, I M; Borden, K L; Martin, S; O'Reilly, N J; Evan, G I; Rahman, D; Pappin, D J; Trowsdale, J; Freemont, P S
1993-01-01
We have identified a protein motif, related to the zinc finger, which defines a newly discovered family of proteins. The motif was found in the sequence of the human RING1 gene, which is proximal to the major histocompatibility complex region on chromosome six. We propose naming this motif the "RING finger" and it is found in 27 proteins, all of which have putative DNA binding functions. We have synthesized a peptide corresponding to the RING1 motif and examined a number of properties, including metal and DNA binding. We provide evidence to support the suggestion that the RING finger motif is the DNA binding domain of this newly defined family of proteins. Images Fig. 1 Fig. 4 PMID:7681583
Karnik, Rahul; Beer, Michael A.
2015-01-01
The generation of genomic binding or accessibility data from massively parallel sequencing technologies such as ChIP-seq and DNase-seq continues to accelerate. Yet state-of-the-art computational approaches for the identification of DNA binding motifs often yield motifs of weak predictive power. Here we present a novel computational algorithm called MotifSpec, designed to find predictive motifs, in contrast to over-represented sequence elements. The key distinguishing feature of this algorithm is that it uses a dynamic search space and a learned threshold to find discriminative motifs in combination with the modeling of motifs using a full PWM (position weight matrix) rather than k-mer words or regular expressions. We demonstrate that our approach finds motifs corresponding to known binding specificities in several mammalian ChIP-seq datasets, and that our PWMs classify the ChIP-seq signals with accuracy comparable to, or marginally better than motifs from the best existing algorithms. In other datasets, our algorithm identifies novel motifs where other methods fail. Finally, we apply this algorithm to detect motifs from expression datasets in C. elegans using a dynamic expression similarity metric rather than fixed expression clusters, and find novel predictive motifs. PMID:26465884
Karnik, Rahul; Beer, Michael A
2015-01-01
The generation of genomic binding or accessibility data from massively parallel sequencing technologies such as ChIP-seq and DNase-seq continues to accelerate. Yet state-of-the-art computational approaches for the identification of DNA binding motifs often yield motifs of weak predictive power. Here we present a novel computational algorithm called MotifSpec, designed to find predictive motifs, in contrast to over-represented sequence elements. The key distinguishing feature of this algorithm is that it uses a dynamic search space and a learned threshold to find discriminative motifs in combination with the modeling of motifs using a full PWM (position weight matrix) rather than k-mer words or regular expressions. We demonstrate that our approach finds motifs corresponding to known binding specificities in several mammalian ChIP-seq datasets, and that our PWMs classify the ChIP-seq signals with accuracy comparable to, or marginally better than motifs from the best existing algorithms. In other datasets, our algorithm identifies novel motifs where other methods fail. Finally, we apply this algorithm to detect motifs from expression datasets in C. elegans using a dynamic expression similarity metric rather than fixed expression clusters, and find novel predictive motifs.
Miller, Bradley R; Sundlov, Jesse A; Drake, Eric J; Makin, Thomas A; Gulick, Andrew M
2014-10-01
Nonribosomal peptide synthetases (NRPSs) are multimodular proteins capable of producing important peptide natural products. Using an assembly line process, the amino acid substrate and peptide intermediates are passed between the active sites of different catalytic domains of the NRPS while bound covalently to a peptidyl carrier protein (PCP) domain. Examination of the linker sequences that join the NRPS adenylation and PCP domains identified several conserved proline residues that are not found in standalone adenylation domains. We examined the roles of these proline residues and neighboring conserved sequences through mutagenesis and biochemical analysis of the reaction catalyzed by the adenylation domain and the fully reconstituted NRPS pathway. In particular, we identified a conserved LPxP motif at the start of the adenylation-PCP linker. The LPxP motif interacts with a region on the adenylation domain to stabilize a critical catalytic lysine residue belonging to the A10 motif that immediately precedes the linker. Further, this interaction with the C-terminal subdomain of the adenylation domain may coordinate movement of the PCP with the conformational change of the adenylation domain. Through this work, we extend the conserved A10 motif of the adenylation domain and identify residues that enable proper adenylation domain function. © 2014 Wiley Periodicals, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wanitchang, Asawin; Narkpuk, Jaraspim; Jongkaewwattana, Anan, E-mail: anan.jon@biotec.or.th
The nucleoprotein of influenza B virus (BNP) shares several characteristics with its influenza A virus counterpart (ANP), including localization in the host's nucleus. However, while the nuclear localization signal(s) (NLS) of ANP are well characterized, little is known about those of BNP. In this study, we showed that the fusion protein bearing the BNP N-terminus fused with GFP (N70–GFP) is exclusively nuclear, and identified a highly conserved KRXR motif spanning residues 44–47 as a putative NLS. In addition, we demonstrated that residues 3–15 of BNP, though not an NLS, are also crucial for nuclear import. Results from mutational analyses ofmore » N70–GFP and the full-length BNP suggest that this region may be required for protection of the N-terminus from proteolytic cleavage. Altogether, we propose that the N-terminal region of BNP contains the NLS and cleavage-protection motif, which together drive its nuclear localization. - Highlights: • The N-terminal region of BNP is required for nuclear accumulation. • The conserved motif at position 44–47 is a putative nuclear localization signal. • The first 15 amino acids of BNP may function as a cleavage-protection motif. • BNP may get access to the nucleus via a mechanism distinct from ANP.« less
Beltrán-Valero de Bernabé, D; Jimenez, F J; Aquaron, R; Rodríguez de Córdoba, S
1999-01-01
We recently showed that alkaptonuria (AKU) is caused by loss-of-function mutations in the homogentisate 1,2 dioxygenase gene (HGO). Herein we describe haplotype and mutational analyses of HGO in seven new AKU pedigrees. These analyses identified two novel single-nucleotide polymorphisms (INV4+31A-->G and INV11+18A-->G) and six novel AKU mutations (INV1-1G-->A, W60G, Y62C, A122D, P230T, and D291E), which further illustrates the remarkable allelic heterogeneity found in AKU. Reexamination of all 29 mutations and polymorphisms thus far described in HGO shows that these nucleotide changes are not randomly distributed; the CCC sequence motif and its inverted complement, GGG, are preferentially mutated. These analyses also demonstrated that the nucleotide substitutions in HGO do not involve CpG dinucleotides, which illustrates important differences between HGO and other genes for the occurrence of mutation at specific short-sequence motifs. Because the CCC sequence motifs comprise a significant proportion (34.5%) of all mutated bases that have been observed in HGO, we conclude that the CCC triplet is a mutational hot spot in HGO. PMID:10205262
Tlatli, Rym; Nozach, Hervé; Collet, Guillaume; Beau, Fabrice; Vera, Laura; Stura, Enrico; Dive, Vincent; Cuniasse, Philippe
2013-01-01
Artificial miniproteins that are able to target catalytic sites of matrix metalloproteinases (MMPs) were designed using a functional motif-grafting approach. The motif corresponded to the four N-terminal residues of TIMP-2, a broad-spectrum protein inhibitor of MMPs. Scaffolds that are able to reproduce the functional topology of this motif were obtained by exhaustive screening of the Protein Data Bank (PDB) using STAMPS software (search for three-dimensional atom motifs in protein structures). Ten artificial protein binders were produced. The designed proteins bind catalytic sites of MMPs with affinities ranging from 450 nm to 450 μm prior to optimization. The crystal structure of one artificial binder in complex with the catalytic domain of MMP-12 showed that the inter-molecular interactions established by the functional motif in the artificial binder corresponded to those found in the MMP-14-TIMP-2 complex, albeit with some differences in geometry. Molecular dynamics simulations of the ten binders in complex with MMP-14 suggested that these scaffolds may allow partial reproduction of native inter-molecular interactions, but differences in geometry and stability may contribute to the lower affinity of the artificial protein binders compared to the natural protein binder. Nevertheless, these results show that the in silico design method used provides sets of protein binders that target a specific binding site with a good rate of success. This approach may constitute the first step of an efficient hybrid computational/experimental approach to protein binder design. © 2012 The Authors Journal compilation © 2012 FEBS.
Schmidt, H-M A; Andres, S; Nilsson, C; Kovach, Z; Kaakoush, N O; Engstrand, L; Goh, K-L; Fock, K M; Forman, D; Mitchell, H
2010-04-01
Helicobacter pylori-related disease is at least partially attributable to the genotype of the infecting strain, particularly the presence of specific virulence factors. We investigated the prevalence of a novel combination of H. pylori virulence factors, including the cag pathogenicity island (PAI), and their association with severe disease in isolates from the three major ethnicities in Malaysia and Singapore, and evaluated whether the cag PAI was intact and functional in vitro. Polymerase chain reaction (PCR) was used to detect dupA, cagA, cagE, cagT, cagL and babA, and to type vacA, the EPIYA motifs, HP0521 alleles and oipA ON status in 159 H. pylori clinical isolates. Twenty-two strains were investigated for IL-8 induction and CagA translocation in vitro. The prevalence of cagA, cagE, cagL, cagT, babA, oipA ON and vacA s1 and i1 was >85%, irrespective of the disease state or ethnicity. The prevalence of dupA and the predominant HP0521 allele and EPIYA motif varied significantly with ethnicity (p < 0.05). A high prevalence of an intact cag PAI was found in all ethnic groups; however, no association was observed between any virulence factor and disease state. The novel association between the HP0521 alleles, EPIYA motifs and host ethnicity indicates that further studies to determine the function of this gene are important.
Bi, Sai; Yue, Shuzhen; Wu, Qiang; Ye, Jiayan
2016-09-15
Here we program an initiator-catalyzed self-assembly of duplex-looped DNA hairpin motif based on strand displacement reaction. Due to the recycling of initiator and performance in a cascade manner, this system is versatilely extended to logic operations, including the construction of concatenated logic circuits with a feedback function and a biocomputing keypad-lock security system. Compared with previously reported molecular security systems, the prominent feature of our keypad lock is that it can be spontaneously reset and recycled with no need of any external stimulus and human intervention. Moreover, through integrating with an isothermal amplification technique of rolling circle amplification (RCA), this programming catalytic DNA self-assembly strategy readily achieves sensitive and selective biosensing of initiator. Importantly, a magnetic graphene oxide (MGO) is introduced to remarkably reduced background, which plays an important role in enhancing the signal-to-noise ratio and improving the detection sensitivity. Therefore, the proposed sophisticated DNA strand displacement-based methodology with engineering dynamic functions may find broad applications in the construction of programming DNA nanostructures, amplification biosensing platform, and large-scale DNA circuits. Copyright © 2016 Elsevier B.V. All rights reserved.
Development of a graphical user interface for the global land information system (GLIS)
Alstad, Susan R.; Jackson, David A.
1993-01-01
The process of developing a Motif Graphical User Interface for the Global Land Information System (GLIS) involved incorporating user requirements, in-house visual and functional design requirements, and Open Software Foundation (OSF) Motif style guide standards. Motif user interface windows have been developed using the software to support Motif window functions war written using the C programming language. The GLIS architecture was modified to support multiple servers and remote handlers running the X Window System by forming a network of servers and handlers connected by TCP/IP communications. In April 1993, prior to release the GLIS graphical user interface and system architecture modifications were test by developers and users located at the EROS Data Center and 11 beta test sites across the country.
Uemura, Satoshi; Shishido, Fumi; Kashimura, Madoka; Inokuchi, Jin-ichi
2015-12-01
In the Golgi maturation model, the Golgi cisternae dynamically mature along a secretory pathway. In this dynamic process, glycosyltransferases are transported from the endoplasmic reticulum (ER) to the Golgi apparatus where they remain and function. The precise mechanism behind this maturation process remains unclear. We investigated two glycosyltransferases, ST3Gal5 (ST3G5) and B4GalNAcT1 (B4GN1), involved in ganglioside synthesis and examined their signal sequences for ER export and Golgi retention. Reports have suggested that the [R/K](X)[R/K] motif functions as an ER exporting signal; however, this signal sequence is insufficient in stably expressed, full-length ST3G5. Through further analysis, we have clarified that the (2)R(3)R(X)(5) (9)K(X)(3) (13)K sequence in ST3G5 is essential for ER export. We have named the sequence the R/K-based motif. On the other hand, for ER export of B4GN1, the homodimer formation in addition to the R/K-based motif is required for ER export suggesting the importance of unidentified lumenal side interaction. We found that ST3G5 R2A/R3A and K9A/K13A mutants localized not only in Golgi apparatus but also in endosomes. Furthermore, the amounts of mature type asparagine-linked (N)-glycans in ST3G5 R2A/R3A and K9A/K13A mutants were decreased compared with those in wild-type proteins, and the stability of the mutants was lower. These results suggest that the R/K-based motif is necessary for the Golgi retention of ST3G5 and that the retention is involved in the maturation of N-glycans and in stability. Thus, several basic amino acids located on the cytoplasmic tail of ST3G5 play important roles in both ER export and Golgi retention. © The Author 2015. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Wu, Dongni; Zhang, Shuangying; Zhao, Yuyuan; Ao, Ningjian; Ramakrishna, Seeram; He, Liumin
2018-03-16
RADA16-I (Ac-(RADA) 4 -CONH 2 ) is a widely investigated self-assembling peptide (SAP) in the biomedical field. It can undergo ordered self-assembly to form stable secondary structures, thereby further forming a nanofiber hydrogel. The modification of RADA16-I with functional peptide motifs has become a popular research topic. Researchers aim to exhibit particular biomedical signaling, and subsequently, further expand its applications. However, only a few fundamental reports are available on the influences of the peptide motifs on self-assembly mechanisms of designer functional RADA16-I SAPs. In this study, we designed RGD-modified RADA16-I SAPs with a series of net charges and amphiphilicities. The assembly/reassembly of these functionally designer SAPs was thoroughly studied using Raman spectroscopy, CD spectroscopy, and AFM. The nanofiber morphology and the secondary structure largely depended on the balance between the hydrophobic effects versus like-charge repulsions of the motifs, which should be to the focus in order to achieve a tailored nanostructure. Our study would contribute insight into considerations for sophisticated design of SAPs for biomedical applications.
Motif discovery and motif finding from genome-mapped DNase footprint data.
Kulakovskiy, Ivan V; Favorov, Alexander V; Makeev, Vsevolod J
2009-09-15
Footprint data is an important source of information on transcription factor recognition motifs. However, a footprinting fragment can contain no sequences similar to known protein recognition sites. Inspection of genome fragments nearby can help to identify missing site positions. Genome fragments containing footprints were supplied to a pipeline that constructed a position weight matrix (PWM) for different motif lengths and selected the optimal PWM. Fragments were aligned with the SeSiMCMC sampler and a new heuristic algorithm, Bigfoot. Footprints with missing hits were found for approximately 50% of factors. Adding only 2 bp on both sides of a footprinting fragment recovered most hits. We automatically constructed motifs for 41 Drosophila factors. New motifs can recognize footprints with a greater sensitivity at the same false positive rate than existing models. Also we discuss possible overfitting of constructed motifs. Software and the collection of regulatory motifs are freely available at http://line.imb.ac.ru/DMMPMM.
In vivo functional mapping of the conserved protein domains within murine Themis1.
Zvezdova, Ekaterina; Lee, Jan; El-Khoury, Dalal; Barr, Valarie; Akpan, Itoro; Samelson, Lawrence; Love, Paul E
2014-09-01
Thymocyte development requires the coordinated input of signals that originate from numerous cell surface molecules. Although the majority of thymocyte signal-initiating receptors are lineage-specific, most trigger 'ubiquitous' downstream signaling pathways. T-lineage-specific receptors are coupled to these signaling pathways by lymphocyte-restricted adapter molecules. We and others recently identified a new putative adapter protein, Themis1, whose expression is largely restricted to the T lineage. Mice lacking Themis1 exhibit a severe block in thymocyte development and a striking paucity of mature T cells revealing a critical role for Themis1 in T-cell maturation. Themis1 orthologs contain three conserved domains: a proline-rich region (PRR) that binds to the ubiquitous cytosolic adapter Grb2, a nuclear localization sequence (NLS), and two copies of a novel cysteine-containing globular (CABIT) domain. In the present study, we evaluated the functional importance of each of these motifs by retroviral reconstitution of Themis1(-/-) progenitor cells. The results demonstrate an essential requirement for the PRR and NLS motifs but not the conserved CABIT cysteines for Themis1 function.
Cytoplasmic Motifs in the Nipah Virus Fusion Protein Modulate Virus Particle Assembly and Egress.
Johnston, Gunner P; Contreras, Erik M; Dabundo, Jeffrey; Henderson, Bryce A; Matz, Keesha M; Ortega, Victoria; Ramirez, Alfredo; Park, Arnold; Aguilar, Hector C
2017-05-15
Nipah virus (NiV), a paramyxovirus in the genus Henipavirus , has a mortality rate in humans of approximately 75%. While several studies have begun our understanding of NiV particle formation, the mechanism of this process remains to be fully elucidated. For many paramyxoviruses, M proteins drive viral assembly and egress; however, some paramyxoviral glycoproteins have been reported as important or essential in budding. For NiV the matrix protein (M), the fusion glycoprotein (F) and, to a much lesser extent, the attachment glycoprotein (G) autonomously induce the formation of virus-like particles (VLPs). However, functional interactions between these proteins during assembly and egress remain to be fully understood. Moreover, if the F-driven formation of VLPs occurs through interactions with host cell machinery, the cytoplasmic tail (CT) of F is a likely interactive domain. Therefore, we analyzed NiV F CT deletion and alanine mutants and report that several but not all regions of the F CT are necessary for efficient VLP formation. Two of these regions contain YXXØ or dityrosine motifs previously shown to interact with cellular machinery involved in F endocytosis and transport. Importantly, our results showed that F-driven, M-driven, and M/F-driven viral particle formation enhanced the recruitment of G into VLPs. By identifying key motifs, specific residues, and functional viral protein interactions important for VLP formation, we improve our understanding of the viral assembly/egress process and point to potential interactions with host cell machinery. IMPORTANCE Henipaviruses can cause deadly infections of medical, veterinary, and agricultural importance. With recent discoveries of new henipa-like viruses, understanding the mechanisms by which these viruses reproduce is paramount. We have focused this study on identifying the functional interactions of three Nipah virus proteins during viral assembly and particularly on the role of one of these proteins, the fusion glycoprotein, in the incorporation of other viral proteins into viral particles. By identifying several regions in the fusion glycoprotein that drive viral assembly, we further our understanding of how these viruses assemble and egress from infected cells. The results presented will likely be useful toward designing treatments targeting this aspect of the viral life cycle and for the production of new viral particle-based vaccines. Copyright © 2017 American Society for Microbiology.
kpLogo: positional k-mer analysis reveals hidden specificity in biological sequences
2017-01-01
Abstract Motifs of only 1–4 letters can play important roles when present at key locations within macromolecules. Because existing motif-discovery tools typically miss these position-specific short motifs, we developed kpLogo, a probability-based logo tool for integrated detection and visualization of position-specific ultra-short motifs from a set of aligned sequences. kpLogo also overcomes the limitations of conventional motif-visualization tools in handling positional interdependencies and utilizing ranked or weighted sequences increasingly available from high-throughput assays. kpLogo can be found at http://kplogo.wi.mit.edu/. PMID:28460012
Common fold in helix–hairpin–helix proteins
Shao, Xuguang; Grishin, Nick V.
2000-01-01
Helix–hairpin–helix (HhH) is a widespread motif involved in non-sequence-specific DNA binding. The majority of HhH motifs function as DNA-binding modules, however, some of them are used to mediate protein–protein interactions or have acquired enzymatic activity by incorporating catalytic residues (DNA glycosylases). From sequence and structural analysis of HhH-containing proteins we conclude that most HhH motifs are integrated as a part of a five-helical domain, termed (HhH)2 domain here. It typically consists of two consecutive HhH motifs that are linked by a connector helix and displays pseudo-2-fold symmetry. (HhH)2 domains show clear structural integrity and a conserved hydrophobic core composed of seven residues, one residue from each α-helix and each hairpin, and deserves recognition as a distinct protein fold. In addition to known HhH in the structures of RuvA, RadA, MutY and DNA-polymerases, we have detected new HhH motifs in sterile alpha motif and barrier-to-autointegration factor domains, the α-subunit of Escherichia coli RNA-polymerase, DNA-helicase PcrA and DNA glycosylases. Statistically significant sequence similarity of HhH motifs and pronounced structural conservation argue for homology between (HhH)2 domains in different protein families. Our analysis helps to clarify how non-symmetric protein motifs bind to the double helix of DNA through the formation of a pseudo-2-fold symmetric (HhH)2 functional unit. PMID:10908318
Kshirsagar, Rucha; Khan, Krishnendu; Joshi, Mamata V; Hosur, Ramakrishna V; Muniyappa, K
2017-05-23
A plethora of evidence suggests that different types of DNA quadruplexes are widely present in the genome of all organisms. The existence of a growing number of proteins that selectively bind and/or process these structures underscores their biological relevance. Moreover, G-quadruplex DNA has been implicated in the alignment of four sister chromatids by forming parallel guanine quadruplexes during meiosis; however, the underlying mechanism is not well defined. Here we show that a G/C-rich motif associated with a meiosis-specific DNA double-strand break (DSB) in Saccharomyces cerevisiae folds into G-quadruplex, and the C-rich sequence complementary to the G-rich sequence forms an i-motif. The presence of G-quadruplex or i-motif structures upstream of the green fluorescent protein-coding sequence markedly reduces the levels of gfp mRNA expression in S. cerevisiae cells, with a concomitant decrease in green fluorescent protein abundance, and blocks primer extension by DNA polymerase, thereby demonstrating the functional significance of these structures. Surprisingly, although S. cerevisiae Hop1, a component of synaptonemal complex axial/lateral elements, exhibits strong affinity to G-quadruplex DNA, it displays a much weaker affinity for the i-motif structure. However, the Hop1 C-terminal but not the N-terminal domain possesses strong i-motif binding activity, implying that the C-terminal domain has a distinct substrate specificity. Additionally, we found that Hop1 promotes intermolecular pairing between G/C-rich DNA segments associated with a meiosis-specific DSB site. Our results support the idea that the G/C-rich motifs associated with meiosis-specific DSBs fold into intramolecular G-quadruplex and i-motif structures, both in vitro and in vivo, thus revealing an important link between non-B form DNA structures and Hop1 in meiotic chromosome synapsis and recombination. Copyright © 2017 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Xu, Minli; Lawrence, Jeffrey G; Durand, Dannie
2018-03-16
Highly Iterated Palindrome 1 (HIP1, GCGATCGC) is hyper-abundant in most cyanobacterial genomes. In some cyanobacteria, average HIP1 abundance exceeds one motif per gene. Such high abundance suggests a significant role in cyanobacterial biology. However, 20 years of study have not revealed whether HIP1 has a function, much less what that function might be. We show that HIP1 is 15- to 300-fold over-represented in genomes analyzed. More importantly, HIP1 sites are conserved both within and between open reading frames, suggesting that their overabundance is maintained by selection rather than by continual replenishment by neutral processes, such as biased DNA repair. This evidence for selection suggests a functional role for HIP1. No evidence was found to support a functional role as a peptide or RNA motif or a role in the regulation of gene expression. Rather, we demonstrate that the distribution of HIP1 along cyanobacterial chromosomes is significantly periodic, with periods ranging from 10 to 90 kb, consistent in scale with periodicities reported for co-regulated, co-expressed and evolutionarily correlated genes. The periodicity we observe is also comparable in scale to chromosomal interaction domains previously described in other bacteria. In this context, our findings imply HIP1 functions associated with chromosome and nucleoid structure.
Xu, Minli; Lawrence, Jeffrey G; Durand, Dannie
2018-01-01
Abstract Highly Iterated Palindrome 1 (HIP1, GCGATCGC) is hyper-abundant in most cyanobacterial genomes. In some cyanobacteria, average HIP1 abundance exceeds one motif per gene. Such high abundance suggests a significant role in cyanobacterial biology. However, 20 years of study have not revealed whether HIP1 has a function, much less what that function might be. We show that HIP1 is 15- to 300-fold over-represented in genomes analyzed. More importantly, HIP1 sites are conserved both within and between open reading frames, suggesting that their overabundance is maintained by selection rather than by continual replenishment by neutral processes, such as biased DNA repair. This evidence for selection suggests a functional role for HIP1. No evidence was found to support a functional role as a peptide or RNA motif or a role in the regulation of gene expression. Rather, we demonstrate that the distribution of HIP1 along cyanobacterial chromosomes is significantly periodic, with periods ranging from 10 to 90 kb, consistent in scale with periodicities reported for co-regulated, co-expressed and evolutionarily correlated genes. The periodicity we observe is also comparable in scale to chromosomal interaction domains previously described in other bacteria. In this context, our findings imply HIP1 functions associated with chromosome and nucleoid structure. PMID:29432573
Process-based network decomposition reveals backbone motif structure
Wang, Guanyu; Du, Chenghang; Chen, Hao; Simha, Rahul; Rong, Yongwu; Xiao, Yi; Zeng, Chen
2010-01-01
A central challenge in systems biology today is to understand the network of interactions among biomolecules and, especially, the organizing principles underlying such networks. Recent analysis of known networks has identified small motifs that occur ubiquitously, suggesting that larger networks might be constructed in the manner of electronic circuits by assembling groups of these smaller modules. Using a unique process-based approach to analyzing such networks, we show for two cell-cycle networks that each of these networks contains a giant backbone motif spanning all the network nodes that provides the main functional response. The backbone is in fact the smallest network capable of providing the desired functionality. Furthermore, the remaining edges in the network form smaller motifs whose role is to confer stability properties rather than provide function. The process-based approach used in the above analysis has additional benefits: It is scalable, analytic (resulting in a single analyzable expression that describes the behavior), and computationally efficient (all possible minimal networks for a biological process can be identified and enumerated). PMID:20498084
Joseph, Prem Raj B.; Sawant, Kirti V.; Isley, Angela; Pedroza, Mesias; Garofalo, Roberto P.; Richardson, Ricardo M.; Rajarathnam, Krishna
2014-01-01
Chemokines mediate diverse functions from organogenesis to mobilizing leucocytes, and are unusual agonists for class-A GPCRs (G-protein-coupled receptors) because of their large size and multi-domain structure. The current model for receptor activation, which involves interactions between chemokine N-loop and receptor N-terminal residues (Site-I) and between chemokine N-terminal and receptor extracellular loop/transmembrane residues (Site-II), fails to describe differences in ligand/receptor selectivity and the activation of multiple signalling pathways. In the present study, we show in neutrophil-activating chemokine CXCL8 that the highly conserved GP (glycine-proline) motif located distal to both N-terminal and N-loop residues couples Site-I and Site-II interactions. Mutations in the GP motif caused various differences from native-like function to complete loss of activity that could not be correlated with the specific mutation, receptor affinity or subtype, or a specific signalling pathway. NMR studies indicated that the GP motif does not influence Site-I interactions, but molecular dynamics simulations suggested that this motif dictates substates of the CXCL8 conformational ensemble. We conclude that the GP motif enables diverse receptor functions by controlling cross-talk between Site-I and Site-II, and further propose that the repertoire of chemokine functions is best described by a conformational ensemble model in which a network of long-range coupled indirect interactions mediate receptor activity. PMID:24032673
Boyen, Peter; Van Dyck, Dries; Neven, Frank; van Ham, Roeland C H J; van Dijk, Aalt D J
2011-01-01
Correlated motif mining (cmm) is the problem of finding overrepresented pairs of patterns, called motifs, in sequences of interacting proteins. Algorithmic solutions for cmm thereby provide a computational method for predicting binding sites for protein interaction. In this paper, we adopt a motif-driven approach where the support of candidate motif pairs is evaluated in the network. We experimentally establish the superiority of the Chi-square-based support measure over other support measures. Furthermore, we obtain that cmm is an np-hard problem for a large class of support measures (including Chi-square) and reformulate the search for correlated motifs as a combinatorial optimization problem. We then present the generic metaheuristic slider which uses steepest ascent with a neighborhood function based on sliding motifs and employs the Chi-square-based support measure. We show that slider outperforms existing motif-driven cmm methods and scales to large protein-protein interaction networks. The slider-implementation and the data used in the experiments are available on http://bioinformatics.uhasselt.be.
Chinese lexical networks: The structure, function and formation
NASA Astrophysics Data System (ADS)
Li, Jianyu; Zhou, Jie; Luo, Xiaoyue; Yang, Zhanxin
2012-11-01
In this paper Chinese phrases are modeled using complex networks theory. We analyze statistical properties of the networks and find that phrase networks display some important features: not only small world and the power-law distribution, but also hierarchical structure and disassortative mixing. These statistical traits display the global organization of Chinese phrases. The origin and formation of such traits are analyzed from a macroscopic Chinese culture and philosophy perspective. It is interesting to find that Chinese culture and philosophy may shape the formation and structure of Chinese phrases. To uncover the structural design principles of networks, network motif patterns are studied. It is shown that they serve as basic building blocks to form the whole phrase networks, especially triad 38 (feed forward loop) plays a more important role in forming most of the phrases and other motifs. The distinct structure may not only keep the networks stable and robust, but also be helpful for information processing. The results of the paper can give some insight into Chinese language learning and language acquisition. It strengthens the idea that learning the phrases helps to understand Chinese culture. On the other side, understanding Chinese culture and philosophy does help to learn Chinese phrases. The hub nodes in the networks show the close relationship with Chinese culture and philosophy. Learning or teaching the hub characters, hub-linking phrases and phrases which are meaning related based on motif feature should be very useful and important for Chinese learning and acquisition.
info-gibbs: a motif discovery algorithm that directly optimizes information content during sampling.
Defrance, Matthieu; van Helden, Jacques
2009-10-15
Discovering cis-regulatory elements in genome sequence remains a challenging issue. Several methods rely on the optimization of some target scoring function. The information content (IC) or relative entropy of the motif has proven to be a good estimator of transcription factor DNA binding affinity. However, these information-based metrics are usually used as a posteriori statistics rather than during the motif search process itself. We introduce here info-gibbs, a Gibbs sampling algorithm that efficiently optimizes the IC or the log-likelihood ratio (LLR) of the motif while keeping computation time low. The method compares well with existing methods like MEME, BioProspector, Gibbs or GAME on both synthetic and biological datasets. Our study shows that motif discovery techniques can be enhanced by directly focusing the search on the motif IC or the motif LLR. http://rsat.ulb.ac.be/rsat/info-gibbs
A Gibbs sampler for motif detection in phylogenetically close sequences
NASA Astrophysics Data System (ADS)
Siddharthan, Rahul; van Nimwegen, Erik; Siggia, Eric
2004-03-01
Genes are regulated by transcription factors that bind to DNA upstream of genes and recognize short conserved ``motifs'' in a random intergenic ``background''. Motif-finders such as the Gibbs sampler compare the probability of these short sequences being represented by ``weight matrices'' to the probability of their arising from the background ``null model'', and explore this space (analogous to a free-energy landscape). But closely related species may show conservation not because of functional sites but simply because they have not had sufficient time to diverge, so conventional methods will fail. We introduce a new Gibbs sampler algorithm that accounts for common ancestry when searching for motifs, while requiring minimal ``prior'' assumptions on the number and types of motifs, assessing the significance of detected motifs by ``tracking'' clusters that stay together. We apply this scheme to motif detection in sporulation-cycle genes in the yeast S. cerevisiae, using recent sequences of other closely-related Saccharomyces species.
G-Quadruplexes influence pri-microRNA processing.
Rouleau, Samuel G; Garant, Jean-Michel; Bolduc, François; Bisaillon, Martin; Perreault, Jean-Pierre
2018-02-01
RNA G-Quadruplexes (G4) have been shown to possess many biological functions, including the regulation of microRNA (miRNA) biogenesis and function. However, their impact on pri-miRNA processing remains unknown. We identified G4 located near the Drosha cleavage site in three distinct pri-miRNAs: pri-mir200c, pri-mir451a, and pri-mir497. The folding of the potential G4 motifs was determined in solution. Subsequently, mutations disrupting G4 folding led to important changes in the mature miRNAs levels in cells. Moreover, using small antisense oligonucleotides binding to the pri-miRNA, it was possible to modulate, either positively or negatively, the mature miRNA levels. Together, these data demonstrate that G4 motifs could contribute to the regulation of pri-mRNA processing, a novel role for G4. Considering that bio-informatics screening indicates that between 9% and 50% of all pri-miRNAs contain a putative G4, these structures possess interesting potential as future therapeutic targets.
Siponen, Marina I.; Wisniewska, Magdalena; Lehtiö, Lari; Johansson, Ida; Svensson, Linda; Raszewski, Grzegorz; Nilsson, Lennart; Sigvardsson, Mikael; Berglund, Helena
2010-01-01
The early B-cell factor (EBF) transcription factors are central regulators of development in several organs and tissues. This protein family shows low sequence similarity to other protein families, which is why structural information for the functional domains of these proteins is crucial to understand their biochemical features. We have used a modular approach to determine the crystal structures of the structured domains in the EBF family. The DNA binding domain reveals a striking resemblance to the DNA binding domains of the Rel homology superfamily of transcription factors but contains a unique zinc binding structure, termed zinc knuckle. Further the EBF proteins contain an IPT/TIG domain and an atypical helix-loop-helix domain with a novel type of dimerization motif. The data presented here provide insights into unique structural features of the EBF proteins and open possibilities for detailed molecular investigations of this important transcription factor family. PMID:20592035
MOTIFSIM 2.1: An Enhanced Software Platform for Detecting Similarity in Multiple DNA Motif Data Sets
Huang, Chun-Hsi
2017-01-01
Abstract Finding binding site motifs plays an important role in bioinformatics as it reveals the transcription factors that control the gene expression. The development for motif finders has flourished in the past years with many tools have been introduced to the research community. Although these tools possess exceptional features for detecting motifs, they report different results for an identical data set. Hence, using multiple tools is recommended because motifs reported by several tools are likely biologically significant. However, the results from multiple tools need to be compared for obtaining common significant motifs. MOTIFSIM web tool and command-line tool were developed for this purpose. In this work, we present several technical improvements as well as additional features to further support the motif analysis in our new release MOTIFSIM 2.1. PMID:28632401
Fambrini, M; Mariotti, L; Parlanti, S; Salvini, M; Pugliesi, C
2015-11-01
The GRAS proteins belong to a plant transcriptional regulator family that function in the regulation of plant growth and development. Despite their important roles, in sunflower only one GRAS gene (HaDella1) with the DELLA domain has been reported. Here, we provide a functional characterisation of a GRAS-like gene from Helianthus annuus (Ha-GRASL) lacking the DELLA motif. The Ha-GRASL gene contains an intronless open reading frame of 1,743 bp encoding 580 amino acids. Conserved motifs in the GRAS domain are detected, including VHIID, PFYRE, SAW and two LHR motifs. Within the VHII motif, the P-H-N-D-Q-L residues are entirely maintained. Phylogenetic analysis reveals that Ha-GRASL belongs to the SCARECROW LIKE4/7 (SCL4/7) subfamily of the GRAS consensus tree. Accumulation of Ha-GRASL mRNA at the adaxial boundaries from P6/P7 leaf primordia suggests a role of Ha-GRASL in the initiation of median and basal axillary meristems (AMs) of sunflower. When Ha-GRASL is over-expressed in Arabidopsis wild-type plants, the number of lateral bolts increases differently from untransformed plants. However, Ha-GRASL slightly affects the lateral suppressor (las-4-) mutation. Therefore, we hypothesise that Ha-GRASL and LAS are not functionally equivalent. The over-expression of Ha-GRASL reduces metabolic flow of gibberellins (GAs) in Arabidopsis and this modification could be relevant in AM development. Phylogenetic analysis includes LAS and SCL4/7 in the same major clade, suggesting a more recent separation of these genes with respect to other GRAS members. We propose that some features of their ancestor, as well as AM initiation and outgrowth, are partially retained in both LAS and SCL4/7. © 2015 German Botanical Society and The Royal Botanical Society of the Netherlands.
Comprehensive human transcription factor binding site map for combinatory binding motifs discovery.
Müller-Molina, Arnoldo J; Schöler, Hans R; Araúzo-Bravo, Marcos J
2012-01-01
To know the map between transcription factors (TFs) and their binding sites is essential to reverse engineer the regulation process. Only about 10%-20% of the transcription factor binding motifs (TFBMs) have been reported. This lack of data hinders understanding gene regulation. To address this drawback, we propose a computational method that exploits never used TF properties to discover the missing TFBMs and their sites in all human gene promoters. The method starts by predicting a dictionary of regulatory "DNA words." From this dictionary, it distills 4098 novel predictions. To disclose the crosstalk between motifs, an additional algorithm extracts TF combinatorial binding patterns creating a collection of TF regulatory syntactic rules. Using these rules, we narrowed down a list of 504 novel motifs that appear frequently in syntax patterns. We tested the predictions against 509 known motifs confirming that our system can reliably predict ab initio motifs with an accuracy of 81%-far higher than previous approaches. We found that on average, 90% of the discovered combinatorial binding patterns target at least 10 genes, suggesting that to control in an independent manner smaller gene sets, supplementary regulatory mechanisms are required. Additionally, we discovered that the new TFBMs and their combinatorial patterns convey biological meaning, targeting TFs and genes related to developmental functions. Thus, among all the possible available targets in the genome, the TFs tend to regulate other TFs and genes involved in developmental functions. We provide a comprehensive resource for regulation analysis that includes a dictionary of "DNA words," newly predicted motifs and their corresponding combinatorial patterns. Combinatorial patterns are a useful filter to discover TFBMs that play a major role in orchestrating other factors and thus, are likely to lock/unlock cellular functional clusters.
Comprehensive Human Transcription Factor Binding Site Map for Combinatory Binding Motifs Discovery
Müller-Molina, Arnoldo J.; Schöler, Hans R.; Araúzo-Bravo, Marcos J.
2012-01-01
To know the map between transcription factors (TFs) and their binding sites is essential to reverse engineer the regulation process. Only about 10%–20% of the transcription factor binding motifs (TFBMs) have been reported. This lack of data hinders understanding gene regulation. To address this drawback, we propose a computational method that exploits never used TF properties to discover the missing TFBMs and their sites in all human gene promoters. The method starts by predicting a dictionary of regulatory “DNA words.” From this dictionary, it distills 4098 novel predictions. To disclose the crosstalk between motifs, an additional algorithm extracts TF combinatorial binding patterns creating a collection of TF regulatory syntactic rules. Using these rules, we narrowed down a list of 504 novel motifs that appear frequently in syntax patterns. We tested the predictions against 509 known motifs confirming that our system can reliably predict ab initio motifs with an accuracy of 81%—far higher than previous approaches. We found that on average, 90% of the discovered combinatorial binding patterns target at least 10 genes, suggesting that to control in an independent manner smaller gene sets, supplementary regulatory mechanisms are required. Additionally, we discovered that the new TFBMs and their combinatorial patterns convey biological meaning, targeting TFs and genes related to developmental functions. Thus, among all the possible available targets in the genome, the TFs tend to regulate other TFs and genes involved in developmental functions. We provide a comprehensive resource for regulation analysis that includes a dictionary of “DNA words,” newly predicted motifs and their corresponding combinatorial patterns. Combinatorial patterns are a useful filter to discover TFBMs that play a major role in orchestrating other factors and thus, are likely to lock/unlock cellular functional clusters. PMID:23209563
Direct AUC optimization of regulatory motifs.
Zhu, Lin; Zhang, Hong-Bo; Huang, De-Shuang
2017-07-15
The discovery of transcription factor binding site (TFBS) motifs is essential for untangling the complex mechanism of genetic variation under different developmental and environmental conditions. Among the huge amount of computational approaches for de novo identification of TFBS motifs, discriminative motif learning (DML) methods have been proven to be promising for harnessing the discovery power of accumulated huge amount of high-throughput binding data. However, they have to sacrifice accuracy for speed and could fail to fully utilize the information of the input sequences. We propose a novel algorithm called CDAUC for optimizing DML-learned motifs based on the area under the receiver-operating characteristic curve (AUC) criterion, which has been widely used in the literature to evaluate the significance of extracted motifs. We show that when the considered AUC loss function is optimized in a coordinate-wise manner, the cost function of each resultant sub-problem is a piece-wise constant function, whose optimal value can be found exactly and efficiently. Further, a key step of each iteration of CDAUC can be efficiently solved as a computational geometry problem. Experimental results on real world high-throughput datasets illustrate that CDAUC outperforms competing methods for refining DML motifs, while being one order of magnitude faster. Meanwhile, preliminary results also show that CDAUC may also be useful for improving the interpretability of convolutional kernels generated by the emerging deep learning approaches for predicting TF sequences specificities. CDAUC is available at: https://drive.google.com/drive/folders/0BxOW5MtIZbJjNFpCeHlBVWJHeW8 . dshuang@tongji.edu.cn. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Structural basis of RNA folding and recognition in an AMP-RNA aptamer complex.
Jiang, F; Kumar, R A; Jones, R A; Patel, D J
1996-07-11
The catalytic properties of RNA and its well known role in gene expression and regulation are the consequence of its unique solution structures. Identification of the structural determinants of ligand recognition by RNA molecules is of fundamental importance for understanding the biological functions of RNA, as well as for the rational design of RNA Sequences with specific catalytic activities. Towards this latter end, Szostak et al. used in vitro selection techniques to isolate RNA sequences ('aptamers') containing a high-affinity binding site for ATP, the universal currency of cellular energy, and then used this motif to engineer ribozymes with polynucleotide kinase activity. Here we present the solution structure, as determined by multidimensional NMR spectroscopy and molecular dynamics calculations, of both uniformly and specifically 13C-, 15N-labelled 40-mer RNA containing the ATP-binding motif complexed with AMP. The aptamer adopts an L-shaped structure with two nearly orthogonal stems, each capped proximally by a G x G mismatch pair, binding the AMP ligand at their junction in a GNRA-like motif.
Di, Chao; Xu, Wenying; Su, Zhen; Yuan, Joshua S
2010-10-07
PHB (Prohibitin) gene family is involved in a variety of functions important for different biological processes. PHB genes are ubiquitously present in divergent species from prokaryotes to eukaryotes. Human PHB genes have been found to be associated with various diseases. Recent studies by our group and others have shown diverse function of PHB genes in plants for development, senescence, defence, and others. Despite the importance of the PHB gene family, no comprehensive gene family analysis has been carried to evaluate the relatedness of PHB genes across different species. In order to better guide the gene function analysis and understand the evolution of the PHB gene family, we therefore carried out the comparative genome analysis of the PHB genes across different kingdoms. The relatedness, motif distribution, and intron/exon distribution all indicated that PHB genes is a relatively conserved gene family. The PHB genes can be classified into 5 classes and each class have a very deep evolutionary origin. The PHB genes within the class maintained the same motif patterns during the evolution. With Arabidopsis as the model species, we found that PHB gene intron/exon structure and domains are also conserved during the evolution. Despite being a conserved gene family, various gene duplication events led to the expansion of the PHB genes. Both segmental and tandem gene duplication were involved in Arabidopsis PHB gene family expansion. However, segmental duplication is predominant in Arabidopsis. Moreover, most of the duplicated genes experienced neofunctionalization. The results highlighted that PHB genes might be involved in important functions so that the duplicated genes are under the evolutionary pressure to derive new function. PHB gene family is a conserved gene family and accounts for diverse but important biological functions based on the similar molecular mechanisms. The highly diverse biological function indicated that more research needs to be carried out to dissect the PHB gene function. The conserved gene evolution indicated that the study in the model species can be translated to human and mammalian studies.
Statistical tests to compare motif count exceptionalities
Robin, Stéphane; Schbath, Sophie; Vandewalle, Vincent
2007-01-01
Background Finding over- or under-represented motifs in biological sequences is now a common task in genomics. Thanks to p-value calculation for motif counts, exceptional motifs are identified and represent candidate functional motifs. The present work addresses the related question of comparing the exceptionality of one motif in two different sequences. Just comparing the motif count p-values in each sequence is indeed not sufficient to decide if this motif is significantly more exceptional in one sequence compared to the other one. A statistical test is required. Results We develop and analyze two statistical tests, an exact binomial one and an asymptotic likelihood ratio test, to decide whether the exceptionality of a given motif is equivalent or significantly different in two sequences of interest. For that purpose, motif occurrences are modeled by Poisson processes, with a special care for overlapping motifs. Both tests can take the sequence compositions into account. As an illustration, we compare the octamer exceptionalities in the Escherichia coli K-12 backbone versus variable strain-specific loops. Conclusion The exact binomial test is particularly adapted for small counts. For large counts, we advise to use the likelihood ratio test which is asymptotic but strongly correlated with the exact binomial test and very simple to use. PMID:17346349
Chaotic Motifs in Gene Regulatory Networks
Zhang, Zhaoyang; Ye, Weiming; Qian, Yu; Zheng, Zhigang; Huang, Xuhui; Hu, Gang
2012-01-01
Chaos should occur often in gene regulatory networks (GRNs) which have been widely described by nonlinear coupled ordinary differential equations, if their dimensions are no less than 3. It is therefore puzzling that chaos has never been reported in GRNs in nature and is also extremely rare in models of GRNs. On the other hand, the topic of motifs has attracted great attention in studying biological networks, and network motifs are suggested to be elementary building blocks that carry out some key functions in the network. In this paper, chaotic motifs (subnetworks with chaos) in GRNs are systematically investigated. The conclusion is that: (i) chaos can only appear through competitions between different oscillatory modes with rivaling intensities. Conditions required for chaotic GRNs are found to be very strict, which make chaotic GRNs extremely rare. (ii) Chaotic motifs are explored as the simplest few-node structures capable of producing chaos, and serve as the intrinsic source of chaos of random few-node GRNs. Several optimal motifs causing chaos with atypically high probability are figured out. (iii) Moreover, we discovered that a number of special oscillators can never produce chaos. These structures bring some advantages on rhythmic functions and may help us understand the robustness of diverse biological rhythms. (iv) The methods of dominant phase-advanced driving (DPAD) and DPAD time fraction are proposed to quantitatively identify chaotic motifs and to explain the origin of chaotic behaviors in GRNs. PMID:22792171
Regulation of TCF ETS-domain transcription factors by helix-loop-helix motifs.
Stinson, Julie; Inoue, Toshiaki; Yates, Paula; Clancy, Anne; Norton, John D; Sharrocks, Andrew D
2003-08-15
DNA binding by the ternary complex factor (TCF) subfamily of ETS-domain transcription factors is tightly regulated by intramolecular and intermolecular interactions. The helix-loop-helix (HLH)-containing Id proteins are trans-acting negative regulators of DNA binding by the TCFs. In the TCF, SAP-2/Net/ERP, intramolecular inhibition of DNA binding is promoted by the cis-acting NID region that also contains an HLH-like motif. The NID also acts as a transcriptional repression domain. Here, we have studied the role of HLH motifs in regulating DNA binding and transcription by the TCF protein SAP-1 and how Cdk-mediated phosphorylation affects the inhibitory activity of the Id proteins towards the TCFs. We demonstrate that the NID region of SAP-1 is an autoinhibitory motif that acts to inhibit DNA binding and also functions as a transcription repression domain. This region can be functionally replaced by fusion of Id proteins to SAP-1, whereby the Id moiety then acts to repress DNA binding in cis. Phosphorylation of the Ids by cyclin-Cdk complexes results in reduction in protein-protein interactions between the Ids and TCFs and relief of their DNA-binding inhibitory activity. In revealing distinct mechanisms through which HLH motifs modulate the activity of TCFs, our results therefore provide further insight into the role of HLH motifs in regulating TCF function and how the inhibitory properties of the trans-acting Id HLH proteins are themselves regulated by phosphorylation.
Sampaio, Elizabeth P; Ding, Li; Rose, Stacey R; Cruz, Phillip; Hsu, Amy P; Kashyap, Anuj; Rosen, Lindsey B; Smelkinson, Margery; Tavella, Tatyana A; Ferre, Elise M N; Wierman, Meredith K; Zerbe, Christa S; Lionakis, Michail S; Holland, Steven M
2018-05-01
Sumoylation is a posttranslational reversible modification of cellular proteins through the conjugation of small ubiquitin-related modifier (SUMO) and comprises an important regulator of protein function. We sought to characterize the molecular mechanism of a novel mutation at the SUMO motif on signal transducer and activator of transcription 1 (STAT1). STAT1 sequencing and functional characterization were performed in transfection experiments by using immunoblotting and immunoprecipitation in STAT1-deficient cell lines. Transcriptional response and target gene activation were also investigated in PBMCs. We identified a novel STAT1 mutation (c.2114A>T, p.E705V) within the SUMO motif ( 702 IKTE 705 ) in a patient with disseminated Rhodococcus species infection, Norwegian scabies, chronic mucocutaneous candidiasis, hypothyroidism, and esophageal squamous cell carcinoma. The mutation is located in the tail segment and is predicted to disrupt STAT1 sumoylation. Immunoprecipitation experiments performed in transfected cells confirmed absent STAT1 sumoylation for E705V, whereas it was present in wild-type (WT) STAT1 cells, as well as the loss-of-function mutants L706S and Y701C. Furthermore, stimulation with IFN-γ led to enhanced STAT1 phosphorylation, enhanced transcriptional activity, and target gene expression in the E705V-transfected compared with WT-transfected cells. Computer modeling of WT and mutant STAT1 molecules showed variations in the accessibility of the phosphorylation site Y701, which corresponded to the loss-of-function and gain-of-function variants. This is the first report of a mutation in the STAT1 sumoylation motif associated with clinical disease. These data reinforce sumoylation as a key posttranslational regulatory modification of STAT1 and identify a novel mechanism for gain-of-function STAT1 disease in human subjects. Copyright © 2017 American Academy of Allergy, Asthma & Immunology. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Lei; Zhang, Qing; Yang, Yu
Highlights: • RNA recognition motif domains of RBM5 are essential for cell proliferation inhibition. • RNA recognition motif domains of RBM5 are essential for apoptosis induction. • RNA recognition motif domains of RBM5 are essential for RNA binding. • RNA recognition motif domains of RBM5 are essential for caspase-2 alternative splicing. - Abstract: RBM5 is a known putative tumor suppressor gene that has been shown to function in cell growth inhibition by modulating apoptosis. RBM5 also plays a critical role in alternative splicing as an RNA binding protein. However, it is still unclear which domains of RBM5 are required formore » RNA binding and related functional activities. We hypothesized the two putative RNA recognition motif (RRM) domains of RBM5 spanning from amino acids 98–178 and 231–315 are essential for RBM5-mediated cell growth inhibition, apoptosis regulation, and RNA binding. To investigate this hypothesis, we evaluated the activities of the wide-type and mutant RBM5 gene transfer in low-RBM5 expressing A549 cells. We found that, unlike wild-type RBM5 (RBM5-wt), a RBM5 mutant lacking the two RRM domains (RBM5-ΔRRM), is unable to bind RNA, has compromised caspase-2 alternative splicing activity, lacks cell proliferation inhibition and apoptosis induction function in A549 cells. These data provide direct evidence that the two RRM domains of RBM5 are required for RNA binding and the RNA binding activity of RBM5 contributes to its function on apoptosis induction and cell growth inhibition.« less
Grumbt, Barbara; Stroobant, Vincent; Terziyska, Nadia; Israel, Lars; Hell, Kai
2007-12-28
Mia40p and Erv1p are components of a translocation pathway for the import of cysteine-rich proteins into the intermembrane space of mitochondria. We have characterized the redox behavior of Mia40p and reconstituted the disulfide transfer system of Mia40p by using recombinant functional C-terminal fragment of Mia40p, Mia40C, and Erv1p. Oxidized Mia40p contains three intramolecular disulfide bonds. One disulfide bond connects the first two cysteine residues in the CPC motif. The second and the third bonds belong to the twin CX(9)C motif and bridge the cysteine residues of two CX(9)C segments. In contrast to the stabilizing disulfide bonds of the twin CX(9)C motif, the first disulfide bond was easily accessible to reducing agents. Partially reduced Mia40C generated by opening of this bond as well as fully reduced Mia40C were oxidized by Erv1p in vitro. In the course of this reaction, mixed disulfides of Mia40C and Erv1p were formed. Reoxidation of fully reduced Mia40C required the presence of the first two cysteine residues in Mia40C. However, efficient reoxidation of a Mia40C variant containing only the cysteine residues of the twin CX(9)C motif was observed when in addition to Erv1p low amounts of wild type Mia40C were present. In the reconstituted system the thiol oxidase Erv1p was sufficient to transfer disulfide bonds to Mia40C, which then could oxidize the variant of Mia40C. In summary, we reconstituted a disulfide relay system consisting of Mia40C and Erv1p.
Nakashige, Toshiki G; Stephan, Jules R; Cunden, Lisa S; Brophy, Megan Brunjes; Wommack, Andrew J; Keegan, Brenna C; Shearer, Jason M; Nolan, Elizabeth M
2016-09-21
Human calprotectin (CP, S100A8/S100A9 oligomer, MRP-8/MRP-14 oligomer) is an abundant host-defense protein that is involved in the metal-withholding innate immune response. CP coordinates a variety of divalent first-row transition metal ions, which is implicated in its antimicrobial function, and its ability to sequester nutrient Zn(II) ions from microbial pathogens has been recognized for over two decades. CP has two distinct transition-metal-binding sites formed at the S100A8/S100A9 dimer interface, including a histidine-rich site composed of S100A8 residues His17 and His27 and S100A9 residues His91 and His95. In this study, we report that CP binds Zn(II) at this site using a hexahistidine motif, completed by His103 and His105 of the S100A9 C-terminal tail and previously identified as the high-affinity Mn(II) and Fe(II) coordination site. Zn(II) binding at this unique site shields the S100A9 C-terminal tail from proteolytic degradation by proteinase K. X-ray absorption spectroscopy and Zn(II) competition titrations support the formation of a Zn(II)-His6 motif. Microbial growth studies indicate that the hexahistidine motif is important for preventing microbial Zn(II) acquisition from CP by the probiotic Lactobacillus plantarum and the opportunistic human pathogen Candida albicans. The Zn(II)-His6 site of CP expands the known biological coordination chemistry of Zn(II) and provides new insight into how the human innate immune system starves microbes of essential metal nutrients.
Targeting cysteine-mediated dimerization of the MUC1-C oncoprotein in human cancer cells
RAINA, DEEPAK; AHMAD, REHAN; RAJABI, HASAN; PANCHAMOORTHY, GOVIND; KHARBANDA, SURENDER; KUFE, DONALD
2012-01-01
The MUC1 heterodimeric protein is aberrantly overexpressed in diverse human carcinomas and contributes to the malignant phenotype. The MUC1-C transmembrane subunit contains a CQC motif in the cytoplasmic domain that has been implicated in the formation of dimers and in its oncogenic function. The present study demonstrates that MUC1-C forms dimers in human breast and lung cancer cells. MUC1-C dimerization was detectable in the cytoplasm and was independent of MUC1-N, the N-terminal mucin subunit that extends outside the cell. We show that the MUC1-C cytoplasmic domain forms dimers in vitro that are disrupted by reducing agents. Moreover, dimerization of the MUC1-C subunit in cancer cells was blocked by reducing agents and increased by oxidative stress, supporting involvement of the CQC motif in forming disulfide bonds. In support of these observations, mutation of the MUC1-C CQC motif to AQA completely blocked MUC1-C dimerization. Importantly, this study was performed with MUC1-C devoid of fluorescent proteins, such as GFP, CFP and YFP. In this regard, we show that GFP, CFP and YFP themselves form dimers that are readily detectable with cross-linking agents. The present results further demonstrate that a cell-penetrating peptide that targets the MUC1-C CQC cysteines blocks MUC1-C dimerization in cancer cells. These findings provide definitive evidence that: i) the MUC1-C cytoplasmic domain cysteines are necessary and sufficient for MUC1-C dimerization, and ii) these CQC motif cysteines represent an Achilles’ heel for targeting MUC1-C function. PMID:22200620
Liu, Pan; Liu, Jie; Dong, Huixue; Sun, Jiaqiang
2018-02-01
Bread wheat (Triticum aestivum) spike architecture is an important agronomic trait. The Q gene plays a key role in the domestication of bread wheat spike architecture. However, the regulatory mechanisms of Q expression and transcriptional activity remain largely unknown. In this study, we show that overexpression of bread wheat tae-miR172 caused a speltoid-like spike phenotype, reminiscent of that in wheat plants with the q gene. The reduction in Q transcript levels in the tae-miR172 overexpression transgenic bread wheat lines suggests that the Q expression can be suppressed by tae-miR172 in bread wheat. Indeed, our RACE analyses confirmed that the Q mRNA is targeted by tae-miR172 for cleavage. According to our analyses, the Q protein is localized in nucleus and confers transcriptional repression activity. Meanwhile, the Q protein could physically interact with the bread wheat transcriptional co-repressor TOPLESS (TaTPL). Specifically, the N-terminal ethylene-responsive element binding factor-associated amphiphilic repression (EAR) (LDLNVE) motif but not the C-terminal EAR (LDLDLR) motif of Q protein mediates its interaction with the CTLH motif of TaTPL. Moreover, we show that the N-terminal EAR motif of Q protein is also essentially required for the transcriptional repression activity of Q protein. Taken together, we reveal the functional regulation of Q protein by tae-miR172 and transcriptional co-repressor TaTPL in controlling the bread wheat spike architecture. © 2017 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Xue, You-Lin; Wang, Hao; Riedy, Michael; Roberts, Brittany-Lee; Sun, Yuna; Song, Yong-Bo; Jones, Gary W; Masison, Daniel C; Song, Youtao
2018-05-01
Genetic screens using Saccharomyces cerevisiae have identified an array of Hsp40 (Ydj1p) J-domain mutants that are impaired in the ability to cure the yeast [URE3] prion through disrupting functional interactions with Hsp70. However, biochemical analysis of some of these Hsp40 J-domain mutants has so far failed to provide major insight into the specific functional changes in Hsp40-Hsp70 interactions. To explore the detailed structural and dynamic properties of the Hsp40 J-domain, 20 ns molecular dynamic simulations of 4 mutants (D9A, D36A, A30T, and F45S) and wild-type J-domain were performed, followed by Hsp70 docking simulations. Results demonstrated that although the Hsp70 interaction mechanism of the mutants may vary, the major structural change was targeted to the critical HPD motif of the J-domain. Our computational analysis fits well with previous yeast genetics studies regarding highlighting the importance of J-domain function in prion propagation. During the molecular dynamics simulations several important residues were identified and predicted to play an essential role in J-domain structure. Among these residues, Y26 and F45 were confirmed, using both in silico and in vivo methods, as being critical for Ydj1p function.
Deineko, Viktor
2006-01-01
Human multisynthetase complex auxiliary component, protein p43 is an endothelial monocyte-activating polypeptide II precursor. In this study, comprehensive sequence analysis of N-terminus has been performed to identify structural domains, motifs, sites of post-translation modification and other functionally important parameters. The spatial structure model of full-chain protein p43 is obtained.
Zurnic, Irena; Hütter, Sylvia; Rzeha, Ute; Stanke, Nicole; Reh, Juliane; Müllers, Erik; Hamann, Martin V.; Kern, Tobias; Gerresheim, Gesche K.; Serrao, Erik; Lesbats, Paul; Engelman, Alan N.; Cherepanov, Peter; Lindemann, Dirk
2016-01-01
Unlike for other retroviruses, only a few host cell factors that aid the replication of foamy viruses (FVs) via interaction with viral structural components are known. Using a yeast-two-hybrid (Y2H) screen with prototype FV (PFV) Gag protein as bait we identified human polo-like kinase 2 (hPLK2), a member of cell cycle regulatory kinases, as a new interactor of PFV capsids. Further Y2H studies confirmed interaction of PFV Gag with several PLKs of both human and rat origin. A consensus Ser-Thr/Ser-Pro (S-T/S-P) motif in Gag, which is conserved among primate FVs and phosphorylated in PFV virions, was essential for recognition by PLKs. In the case of rat PLK2, functional kinase and polo-box domains were required for interaction with PFV Gag. Fluorescently-tagged PFV Gag, through its chromatin tethering function, selectively relocalized ectopically expressed eGFP-tagged PLK proteins to mitotic chromosomes in a Gag STP motif-dependent manner, confirming a specific and dominant nature of the Gag-PLK interaction in mammalian cells. The functional relevance of the Gag-PLK interaction was examined in the context of replication-competent FVs and single-round PFV vectors. Although STP motif mutated viruses displayed wild type (wt) particle release, RNA packaging and intra-particle reverse transcription, their replication capacity was decreased 3-fold in single-cycle infections, and up to 20-fold in spreading infections over an extended time period. Strikingly similar defects were observed when cells infected with single-round wt Gag PFV vectors were treated with a pan PLK inhibitor. Analysis of entry kinetics of the mutant viruses indicated a post-fusion defect resulting in delayed and reduced integration, which was accompanied with an enhanced preference to integrate into heterochromatin. We conclude that interaction between PFV Gag and cellular PLK proteins is important for early replication steps of PFV within host cells. PMID:27579920
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Chi; Miller, Darcie J.; Guibao, Cristina D.
The Cas family scaffolding protein p130Cas is a Src substrate localized in focal adhesions (FAs) and functions in integrin signaling to promote cell motility, invasion, proliferation, and survival. p130Cas targeting to FAs is essential for its tyrosine phosphorylation and downstream signaling. Although the N-terminal SH3 domain is important for p130Cas localization, it has also been reported that the C-terminal region is involved in p130Cas FA targeting. The C-terminal region of p130Cas or Cas family homology domain (CCHD) has been reported to adopt a structure similar to that of the focal adhesion kinase C-terminal focal adhesion-targeting domain. The mechanism by whichmore » the CCHD promotes FA targeting of p130Cas, however, remains unclear. In this study, using a calorimetry approach, we identified the first LD motif (LD1) of the FA-associated protein paxillin as the binding partner of the p130Cas CCHD (in a 1:1 stoichiometry with a Kd ~4.2 μM) and elucidated the structure of the p130Cas CCHD in complex with the paxillin LD1 motif by X-ray crystallography. Of note, a comparison of the CCHD/LD1 complex with a previously solved structure of CCHD in complex with the SH2-containing protein NSP3 revealed that LD1 had almost identical positioning of key hydrophobic and acidic residues relative to NSP3. Because paxillin is one of the key scaffold molecules in FAs, we propose that the interaction between the p130Cas CCHD and the LD1 motif of paxillin plays an important role in p130Cas FA targeting.« less
2010-01-01
Background An important focus of genomic science is the discovery and characterization of all functional elements within genomes. In silico methods are used in genome studies to discover putative regulatory genomic elements (called words or motifs). Although a number of methods have been developed for motif discovery, most of them lack the scalability needed to analyze large genomic data sets. Methods This manuscript presents WordSeeker, an enumerative motif discovery toolkit that utilizes multi-core and distributed computational platforms to enable scalable analysis of genomic data. A controller task coordinates activities of worker nodes, each of which (1) enumerates a subset of the DNA word space and (2) scores words with a distributed Markov chain model. Results A comprehensive suite of performance tests was conducted to demonstrate the performance, speedup and efficiency of WordSeeker. The scalability of the toolkit enabled the analysis of the entire genome of Arabidopsis thaliana; the results of the analysis were integrated into The Arabidopsis Gene Regulatory Information Server (AGRIS). A public version of WordSeeker was deployed on the Glenn cluster at the Ohio Supercomputer Center. Conclusion WordSeeker effectively utilizes concurrent computing platforms to enable the identification of putative functional elements in genomic data sets. This capability facilitates the analysis of the large quantity of sequenced genomic data. PMID:21210985
The third RNA recognition motif of Drosophila ELAV protein has a role in multimerization.
Toba, Gakuta; White, Kalpana
2008-03-01
ELAV is a neuron-specific RNA-binding protein in Drosophila that is required for development and maintenance of neurons. ELAV regulates alternative splicing of Neuroglian and erect wing (ewg) transcripts, and has been shown to form a multimeric complex on the last ewg intron. The protein has three RNA recognition motifs (RRM1, 2 and 3) with a hinge region between RRM2 and 3. In this study, we used the yeast two-hybrid system to determine the multimerization domain of ELAV. Using deletion constructs, we mapped an interaction activity to a region containing most of RRM3. We found three conserved short sequences in RRM3 that were essential for the interaction, and also sufficient to give the interaction activity to RRM2 when introduced into it. In our in vivo functional assay, a mutation in one of the three sequences showed reduced activity in splicing regulation, underlining the functional importance of multimerization. However, RRM2 with the three RRM3 interaction sequences did not function as RRM3 in vivo, which suggested that multimerization is not the only function of RRM3. Our results are consistent with a model in which RRM3 serves as a bi-functional domain that interacts with both RNA and protein.
The third RNA recognition motif of Drosophila ELAV protein has a role in multimerization
Toba, Gakuta; White, Kalpana
2008-01-01
ELAV is a neuron-specific RNA-binding protein in Drosophila that is required for development and maintenance of neurons. ELAV regulates alternative splicing of Neuroglian and erect wing (ewg) transcripts, and has been shown to form a multimeric complex on the last ewg intron. The protein has three RNA recognition motifs (RRM1, 2 and 3) with a hinge region between RRM2 and 3. In this study, we used the yeast two-hybrid system to determine the multimerization domain of ELAV. Using deletion constructs, we mapped an interaction activity to a region containing most of RRM3. We found three conserved short sequences in RRM3 that were essential for the interaction, and also sufficient to give the interaction activity to RRM2 when introduced into it. In our in vivo functional assay, a mutation in one of the three sequences showed reduced activity in splicing regulation, underlining the functional importance of multimerization. However, RRM2 with the three RRM3 interaction sequences did not function as RRM3 in vivo, which suggested that multimerization is not the only function of RRM3. Our results are consistent with a model in which RRM3 serves as a bi-functional domain that interacts with both RNA and protein. PMID:18203745
Zhang, Hailing; Cao, Yingping; Shang, Chen; Li, Jikai; Wang, Jianli; Wu, Zhenying; Ma, Lichao; Qi, Tianxiong; Fu, Chunxiang; Hu, Baozhong
2017-01-01
The GRAS gene family is a large plant-specific family of transcription factors that are involved in diverse processes during plant development. Medicago truncatula is an ideal model plant for genetic research in legumes, and specifically for studying nodulation, which is crucial for nitrogen fixation. In this study, 59 MtGRAS genes were identified and classified into eight distinct subgroups based on phylogenetic relationships. Motifs located in the C-termini were conserved across the subgroups, while motifs in the N-termini were subfamily specific. Gene duplication was the main evolutionary force for MtGRAS expansion, especially proliferation of the LISCL subgroup. Seventeen duplicated genes showed strong effects of purifying selection and diverse expression patterns, highlighting their functional importance and diversification after duplication. Thirty MtGRAS genes, including NSP1 and NSP2, were preferentially expressed in nodules, indicating possible roles in the process of nodulation. A transcriptome study, combined with gene expression analysis under different stress conditions, suggested potential functions of MtGRAS genes in various biological pathways and stress responses. Taken together, these comprehensive analyses provide basic information for understanding the potential functions of GRAS genes, and will facilitate further discovery of MtGRAS gene functions. PMID:28945786
Wang, Hong; Guo, Haoran; Su, Jiaming; Rui, Yajuan; Zheng, Wenwen; Gao, Wenying; Zhang, Wenyan; Li, Zhaolong; Liu, Guanchen; Markham, Richard B; Wei, Wei; Yu, Xiao-Fang
2017-05-01
The lentiviral accessory proteins Vpx and Vpr are known to utilize CRL4 (DCAF1) E3 ligase to induce the degradation of the host restriction factor SAMHD1 or host helicase transcription factor (HLTF), respectively. Selective disruption of viral CRL4 (DCAF1) E3 ligase could be a promising antiviral strategy. Recently, we have determined that posttranslational modification (neddylation) of Cullin-4 is required for the activation of Vpx-CRL4 (DCAF1) E3 ligase. However, the mechanism of Vpx/Vpr-CRL4 (DCAF1) E3 ligase assembly is still poorly understood. Here, we report that zinc coordination is an important regulator of Vpx-CRL4 E3 ligase assembly. Residues in a conserved zinc-binding motif of Vpx were essential for the recruitment of the CRL4 (DCAF1) E3 complex and Vpx-induced SAMHD1 degradation. Importantly, altering the intracellular zinc concentration by treatment with the zinc chelator N , N , N '-tetrakis-(2'-pyridylmethyl)ethylenediamine (TPEN) potently blocked Vpx-mediated SAMHD1 degradation and inhibited wild-type SIVmac (simian immunodeficiency virus of macaques) infection of myeloid cells, even in the presence of Vpx. TPEN selectively inhibited Vpx and DCAF1 binding but not the Vpx-SAMHD1 interaction or Vpx virion packaging. Moreover, we have shown that zinc coordination is also important for the assembly of the HIV-1 Vpr-CRL4 E3 ligase. In particular, Vpr zinc-binding motif mutation or TPEN treatment efficiently inhibited Vpr-CRL4 (DCAF1) E3 ligase assembly and Vpr-mediated HLTF degradation or Vpr-induced G 2 cell cycle arrest. Collectively, our study sheds light on a conserved strategy by the viral proteins Vpx and Vpr to recruit host CRL4 (DCAF1) E3 ligase, which represents a target for novel anti-human immunodeficiency virus (HIV) drug development. IMPORTANCE The Vpr and its paralog Vpx are accessory proteins encoded by different human immunodeficiency virus (HIV)/simian immunodeficiency virus (SIV) lentiviruses. To facilitate viral replication, Vpx has evolved to induce SAMHD1 degradation and Vpr to mediate HLTF degradation. Both Vpx and Vpr perform their functions by recruiting CRL4 (DCAF1) E3 ligase. In this study, we demonstrate that the assembly of the Vpx- or Vpr-CRL4 E3 ligase requires a highly conserved zinc-binding motif. This motif is specifically required for the DCAF1 interaction but not for the interaction of Vpx or Vpr with its substrate. Selective disruption of Vpx- or Vpr-CRL4 E3 ligase function was achieved by zinc sequestration using N , N , N '-tetrakis-(2'-pyridylmethyl)ethylenediamine (TPEN). At the same time, zinc sequestration had no effect on zinc-dependent cellular protein functions. Therefore, information obtained from this study may be important for novel anti-HIV drug development. Copyright © 2017 American Society for Microbiology.
Cyr, Normand; de la Fuente, Cynthia; Lecoq, Lauriane; Guendel, Irene; Chabot, Philippe R.; Kehn-Hall, Kylene; Omichinski, James G.
2015-01-01
Rift Valley fever virus (RVFV) is a single-stranded RNA virus capable of inducing fatal hemorrhagic fever in humans. A key component of RVFV virulence is its ability to form nuclear filaments through interactions between the viral nonstructural protein NSs and the host general transcription factor TFIIH. Here, we identify an interaction between a ΩXaV motif in NSs and the p62 subunit of TFIIH. This motif in NSs is similar to ΩXaV motifs found in nucleotide excision repair (NER) factors and transcription factors known to interact with p62. Structural and biophysical studies demonstrate that NSs binds to p62 in a similar manner as these other factors. Functional studies in RVFV-infected cells show that the ΩXaV motif is required for both nuclear filament formation and degradation of p62. Consistent with the fact that the RVFV can be distinguished from other Bunyaviridae-family viruses due to its ability to form nuclear filaments in infected cells, the motif is absent in the NSs proteins of other Bunyaviridae-family viruses. Taken together, our studies demonstrate that p62 binding to NSs through the ΩXaV motif is essential for degrading p62, forming nuclear filaments and enhancing RVFV virulence. In addition, these results show how the RVFV incorporates a simple motif into the NSs protein that enables it to functionally mimic host cell proteins that bind the p62 subunit of TFIIH. PMID:25918396
Cyr, Normand; de la Fuente, Cynthia; Lecoq, Lauriane; Guendel, Irene; Chabot, Philippe R; Kehn-Hall, Kylene; Omichinski, James G
2015-05-12
Rift Valley fever virus (RVFV) is a single-stranded RNA virus capable of inducing fatal hemorrhagic fever in humans. A key component of RVFV virulence is its ability to form nuclear filaments through interactions between the viral nonstructural protein NSs and the host general transcription factor TFIIH. Here, we identify an interaction between a ΩXaV motif in NSs and the p62 subunit of TFIIH. This motif in NSs is similar to ΩXaV motifs found in nucleotide excision repair (NER) factors and transcription factors known to interact with p62. Structural and biophysical studies demonstrate that NSs binds to p62 in a similar manner as these other factors. Functional studies in RVFV-infected cells show that the ΩXaV motif is required for both nuclear filament formation and degradation of p62. Consistent with the fact that the RVFV can be distinguished from other Bunyaviridae-family viruses due to its ability to form nuclear filaments in infected cells, the motif is absent in the NSs proteins of other Bunyaviridae-family viruses. Taken together, our studies demonstrate that p62 binding to NSs through the ΩXaV motif is essential for degrading p62, forming nuclear filaments and enhancing RVFV virulence. In addition, these results show how the RVFV incorporates a simple motif into the NSs protein that enables it to functionally mimic host cell proteins that bind the p62 subunit of TFIIH.
Molecular origin of the binding of WWOX tumor suppressor to ErbB4 receptor tyrosine kinase.
Schuchardt, Brett J; Bhat, Vikas; Mikles, David C; McDonald, Caleb B; Sudol, Marius; Farooq, Amjad
2013-12-23
The ability of WWOX tumor suppressor to physically associate with the intracellular domain (ICD) of ErbB4 receptor tyrosine kinase is believed to play a central role in downregulating the transcriptional function of the latter. Herein, using various biophysical methods, we show that while the WW1 domain of WWOX binds to PPXY motifs located within the ICD of ErbB4 in a physiologically relevant manner, the WW2 domain does not. Importantly, while the WW1 domain absolutely requires the integrity of the PPXY consensus sequence, nonconsensus residues within and flanking this motif do not appear to be critical for binding. This strongly suggests that the WW1 domain of WWOX is rather promiscuous toward its cellular partners. We also provide evidence that the lack of binding of the WW2 domain of WWOX to PPXY motifs is due to the replacement of a signature tryptophan, lining the hydrophobic ligand binding groove, with tyrosine (Y85). Consistent with this notion, the Y85W substitution within the WW2 domain exquisitely restores its binding to PPXY motifs in a manner akin to the binding of the WW1 domain of WWOX. Of particular significance is the observation that the WW2 domain augments the binding of the WW1 domain to ErbB4, implying that the former serves as a chaperone within the context of the WW1-WW2 tandem module of WWOX in agreement with our findings reported previously. Altogether, our study sheds new light on the molecular basis of an important WW-ligand interaction involved in mediating a plethora of cellular processes.
Molecular Origin of the Binding of WWOX Tumor Suppressor to ErbB4 Receptor Tyrosine Kinase
Schuchardt, Brett J.; Bhat, Vikas; Mikles, David C.; McDonald, Caleb B.; Sudol, Marius; Farooq, Amjad
2014-01-01
The ability of WWOX tumor suppressor to physically associate with the intracellular domain (ICD) of ErbB4 receptor tyrosine kinase is believed to play a central role in down-regulating the transcriptional function of the latter. Herein, using various biophysical methods, we show that while the WW1 domain of WWOX binds to PPXY motifs located within the ICD of ErbB4 in a physiologically-relevant manner, the WW2 domain does not. Importantly, while the WW1 domain absolutely requires the integrity of the PPXY consensus sequence, non-consensus residues within and flanking this motif do not appear to be critical for binding. This strongly suggests that the WW1 domain of WWOX is rather promiscuous toward its cellular partners. We also provide evidence that the lack of binding of WW2 domain of WWOX to PPXY motifs is due to the replacement of a signature tryptophan, lining the hydrophobic ligand binding groove, with tyrosine (Y85). Consistent with this notion, the Y85W substitution within the WW2 domain exquisitely restores its binding to PPXY motifs in a manner akin to the binding of WW1 domain of WWOX. Of particular significance is the observation that WW2 domain augments the binding of WW1 domain to ErbB4, implying that the former serves as a chaperone within the context of the WW1–WW2 tandem module of WWOX in agreement with our findings reported previously. Taken together, our study sheds new light on the molecular basis of an important WW-ligand interaction involved in mediating a plethora of cellular processes. PMID:24308844
Characterization of the targeting signal in mitochondrial β-barrel proteins
Jores, Tobias; Klinger, Anna; Groß, Lucia E.; Kawano, Shin; Flinner, Nadine; Duchardt-Ferner, Elke; Wöhnert, Jens; Kalbacher, Hubert; Endo, Toshiya; Schleiff, Enrico; Rapaport, Doron
2016-01-01
Mitochondrial β-barrel proteins are synthesized on cytosolic ribosomes and must be specifically targeted to the organelle before their integration into the mitochondrial outer membrane. The signal that assures such precise targeting and its recognition by the organelle remained obscure. In the present study we show that a specialized β-hairpin motif is this long searched for signal. We demonstrate that a synthetic β-hairpin peptide competes with the import of mitochondrial β-barrel proteins and that proteins harbouring a β-hairpin peptide fused to passenger domains are targeted to mitochondria. Furthermore, a β-hairpin motif from mitochondrial proteins targets chloroplast β-barrel proteins to mitochondria. The mitochondrial targeting depends on the hydrophobicity of the β-hairpin motif. Finally, this motif interacts with the mitochondrial import receptor Tom20. Collectively, we reveal that β-barrel proteins are targeted to mitochondria by a dedicated β-hairpin element, and this motif is recognized at the organelle surface by the outer membrane translocase. PMID:27345737
ELM: the status of the 2010 eukaryotic linear motif resource
Gould, Cathryn M.; Diella, Francesca; Via, Allegra; Puntervoll, Pål; Gemünd, Christine; Chabanis-Davidson, Sophie; Michael, Sushama; Sayadi, Ahmed; Bryne, Jan Christian; Chica, Claudia; Seiler, Markus; Davey, Norman E.; Haslam, Niall; Weatheritt, Robert J.; Budd, Aidan; Hughes, Tim; Paś, Jakub; Rychlewski, Leszek; Travé, Gilles; Aasland, Rein; Helmer-Citterich, Manuela; Linding, Rune; Gibson, Toby J.
2010-01-01
Linear motifs are short segments of multidomain proteins that provide regulatory functions independently of protein tertiary structure. Much of intracellular signalling passes through protein modifications at linear motifs. Many thousands of linear motif instances, most notably phosphorylation sites, have now been reported. Although clearly very abundant, linear motifs are difficult to predict de novo in protein sequences due to the difficulty of obtaining robust statistical assessments. The ELM resource at http://elm.eu.org/ provides an expanding knowledge base, currently covering 146 known motifs, with annotation that includes >1300 experimentally reported instances. ELM is also an exploratory tool for suggesting new candidates of known linear motifs in proteins of interest. Information about protein domains, protein structure and native disorder, cellular and taxonomic contexts is used to reduce or deprecate false positive matches. Results are graphically displayed in a ‘Bar Code’ format, which also displays known instances from homologous proteins through a novel ‘Instance Mapper’ protocol based on PHI-BLAST. ELM server output provides links to the ELM annotation as well as to a number of remote resources. Using the links, researchers can explore the motifs, proteins, complex structures and associated literature to evaluate whether candidate motifs might be worth experimental investigation. PMID:19920119
RNA motif search with data-driven element ordering.
Rampášek, Ladislav; Jimenez, Randi M; Lupták, Andrej; Vinař, Tomáš; Brejová, Broňa
2016-05-18
In this paper, we study the problem of RNA motif search in long genomic sequences. This approach uses a combination of sequence and structure constraints to uncover new distant homologs of known functional RNAs. The problem is NP-hard and is traditionally solved by backtracking algorithms. We have designed a new algorithm for RNA motif search and implemented a new motif search tool RNArobo. The tool enhances the RNAbob descriptor language, allowing insertions in helices, which enables better characterization of ribozymes and aptamers. A typical RNA motif consists of multiple elements and the running time of the algorithm is highly dependent on their ordering. By approaching the element ordering problem in a principled way, we demonstrate more than 100-fold speedup of the search for complex motifs compared to previously published tools. We have developed a new method for RNA motif search that allows for a significant speedup of the search of complex motifs that include pseudoknots. Such speed improvements are crucial at a time when the rate of DNA sequencing outpaces growth in computing. RNArobo is available at http://compbio.fmph.uniba.sk/rnarobo .
A flexible motif search technique based on generalized profiles.
Bucher, P; Karplus, K; Moeri, N; Hofmann, K
1996-03-01
A flexible motif search technique is presented which has two major components: (1) a generalized profile syntax serving as a motif definition language; and (2) a motif search method specifically adapted to the problem of finding multiple instances of a motif in the same sequence. The new profile structure, which is the core of the generalized profile syntax, combines the functions of a variety of motif descriptors implemented in other methods, including regular expression-like patterns, weight matrices, previously used profiles, and certain types of hidden Markov models (HMMs). The relationship between generalized profiles and other biomolecular motif descriptors is analyzed in detail, with special attention to HMMs. Generalized profiles are shown to be equivalent to a particular class of HMMs, and conversion procedures in both directions are given. The conversion procedures provide an interpretation for local alignment in the framework of stochastic models, allowing for clear, simple significance tests. A mathematical statement of the motif search problem defines the new method exactly without linking it to a specific algorithmic solution. Part of the definition includes a new definition of disjointness of alignments.
Protein–DNA Interactions: The Story so Far and a New Method for Prediction
Jones, Susan; Thornton, Janet M.
2003-01-01
This review describes methods for the prediction of DNA binding function, and specifically summarizes a new method using 3D structural templates. The new method features the HTH motif that is found in approximately one-third of DNAbinding protein families. A library of 3D structural templates of HTH motifs was derived from proteins in the PDB. Templates were scanned against complete protein structures and the optimal superposition of a template on a structure calculated. Significance thresholds in terms of a minimum root mean squared deviation (rmsd) of an optimal superposition, and a minimum motif accessible surface area (ASA), have been calculated. Inmore » this way, it is possible to scan the template library against proteins of unknown function to make predictions about DNA-binding functionality.« less
A motif detection and classification method for peptide sequences using genetic programming.
Tomita, Yasuyuki; Kato, Ryuji; Okochi, Mina; Honda, Hiroyuki
2008-08-01
An exploration of common rules (property motifs) in amino acid sequences has been required for the design of novel sequences and elucidation of the interactions between molecules controlled by the structural or physical environment. In the present study, we developed a new method to search property motifs that are common in peptide sequence data. Our method comprises the following two characteristics: (i) the automatic determination of the position and length of common property motifs by calculating the physicochemical similarity of amino acids, and (ii) the quick and effective exploration of motif candidates that discriminates the positives and negatives by the introduction of genetic programming (GP). Our method was evaluated by two types of model data sets. First, the intentionally buried property motifs were searched in the artificially derived peptide data containing intentionally buried property motifs. As a result, the expected property motifs were correctly extracted by our algorithm. Second, the peptide data that interact with MHC class II molecules were analyzed as one of the models of biologically active peptides with buried motifs in various lengths. Twofold MHC class II binding peptides were identified with the rule using our method, compared to the existing scoring matrix method. In conclusion, our GP based motif searching approach enabled to obtain knowledge of functional aspects of the peptides without any prior knowledge.
The Thiamin Pyrophosphate-Motif
NASA Technical Reports Server (NTRS)
Dominiak, P.; Ciszak, E.
2003-01-01
Using databases the authors have identified a common thiamin pyrophosphate (TPP)-motif in the family of functionally diverse TPP-dependent enzymes. This common motif consists of multimeric organization of subunits and two catalytic centers. Each catalytic center (PP:PYR) is formed at the interface of the PP-domain binding the magnesium ion, pyrophosphate and amhopyrimidine ring of TPP, and the PYR-domain binding the aminopyrimidine ring of that cofactor. A pair of these catalytic centers constitutes the catalytic core (PP:PYR)(sub 2) within these enzymes. Analysis of the structural elements of this catalytic core reveals novel definition of the common amino acid sequences, which are GXPhiX(sub 4)(G)PhiXXGQ and GDGX(sub 25-30)NN in the PP-domain, and the EX(sub 4)(G)PhiXXGPhi in the PYR-domain, where Phi corresponds to a hydrophobic amino acid. This TPP-motif provides a novel tool for annotation of TPP-dependent enzymes useful in advancing functional proteomics.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schürpf, Thomas; Chen, Qiang; Liu, Jin-huan
Developmental endothelial cell locus-1 (Del-1) glycoprotein is secreted by endothelial cells and a subset of macrophages. Del-1 plays a regulatory role in vascular remodeling and functions in innate immunity through interaction with integrin {alpha}{sub V}{beta}{sub 3}. Del-1 contains 3 epidermal growth factor (EGF)-like repeats and 2 discoidin-like domains. An Arg-Gly-Asp (RGD) motif in the second EGF domain (EGF2) mediates adhesion by endothelial cells and phagocytes. We report the crystal structure of its 3 EGF domains. The RGD motif of EGF2 forms a type II' {beta} turn at the tip of a long protruding loop, dubbed the RGD finger. Whereas EGF2more » and EGF3 constitute a rigid rod via an interdomain calcium ion binding site, the long linker between EGF1 and EGF2 lends considerable flexibility to EGF1. Two unique O-linked glycans and 1 N-linked glycan locate to the opposite side of EGF2 from the RGD motif. These structural features favor integrin binding of the RGD finger. Mutagenesis data confirm the importance of having the RGD motif at the tip of the RGD finger. A database search for EGF domain sequences shows that this RGD finger is likely an evolutionary insertion and unique to the EGF domain of Del-1 and its homologue milk fat globule-EGF 8. The RGD finger of Del-1 is a unique structural feature critical for integrin binding.« less
Velagapudi, Sai Pradeep; Disney, Matthew D.
2013-01-01
RNA is an extremely important target for the development of chemical probes of function or small molecule therapeutics. Aminoglycosides are the most well studied class of small molecules to target RNA. However, the RNA motifs outside of the bacterial rRNA A-site that are likely to be bound by these compounds in biological systems is largely unknown. If such information were known, it could allow for aminoglycosides to be exploited to target other RNAs and, in addition, could provide invaluable insights into potential bystander targets of these clinically used drugs. We utilized two-dimensional combinatorial screening (2DCS), a library-versus-library screening approach, to select the motifs displayed in a 3 × 3 nucleotide internal loop library and in a 6-nucleotide hairpin library that bind with high affinity and selectivity to six aminoglycoside derivatives. The selected RNA motifs were then analyzed using structure–activity relationships through sequencing (StARTS), a statistical approach that defines the privileged RNA motif space that binds a small molecule. StARTS allowed for the facile annotation of the selected RNA motif–aminoglycoside interactions in terms of affinity and selectivity. The interactions selected by 2DCS generally have nanomolar affinities, which is higher affinity than the binding of aminoglycosides to a mimic of their therapeutic target, the bacterial rRNA A-site. PMID:23719281
Local Renyi entropic profiles of DNA sequences.
Vinga, Susana; Almeida, Jonas S
2007-10-16
In a recent report the authors presented a new measure of continuous entropy for DNA sequences, which allows the estimation of their randomness level. The definition therein explored was based on the Rényi entropy of probability density estimation (pdf) using the Parzen's window method and applied to Chaos Game Representation/Universal Sequence Maps (CGR/USM). Subsequent work proposed a fractal pdf kernel as a more exact solution for the iterated map representation. This report extends the concepts of continuous entropy by defining DNA sequence entropic profiles using the new pdf estimations to refine the density estimation of motifs. The new methodology enables two results. On the one hand it shows that the entropic profiles are directly related with the statistical significance of motifs, allowing the study of under and over-representation of segments. On the other hand, by spanning the parameters of the kernel function it is possible to extract important information about the scale of each conserved DNA region. The computational applications, developed in Matlab m-code, the corresponding binary executables and additional material and examples are made publicly available at http://kdbio.inesc-id.pt/~svinga/ep/. The ability to detect local conservation from a scale-independent representation of symbolic sequences is particularly relevant for biological applications where conserved motifs occur in multiple, overlapping scales, with significant future applications in the recognition of foreign genomic material and inference of motif structures.
Local Renyi entropic profiles of DNA sequences
Vinga, Susana; Almeida, Jonas S
2007-01-01
Background In a recent report the authors presented a new measure of continuous entropy for DNA sequences, which allows the estimation of their randomness level. The definition therein explored was based on the Rényi entropy of probability density estimation (pdf) using the Parzen's window method and applied to Chaos Game Representation/Universal Sequence Maps (CGR/USM). Subsequent work proposed a fractal pdf kernel as a more exact solution for the iterated map representation. This report extends the concepts of continuous entropy by defining DNA sequence entropic profiles using the new pdf estimations to refine the density estimation of motifs. Results The new methodology enables two results. On the one hand it shows that the entropic profiles are directly related with the statistical significance of motifs, allowing the study of under and over-representation of segments. On the other hand, by spanning the parameters of the kernel function it is possible to extract important information about the scale of each conserved DNA region. The computational applications, developed in Matlab m-code, the corresponding binary executables and additional material and examples are made publicly available at . Conclusion The ability to detect local conservation from a scale-independent representation of symbolic sequences is particularly relevant for biological applications where conserved motifs occur in multiple, overlapping scales, with significant future applications in the recognition of foreign genomic material and inference of motif structures. PMID:17939871
A New Scheme to Characterize and Identify Protein Ubiquitination Sites.
Nguyen, Van-Nui; Huang, Kai-Yao; Huang, Chien-Hsun; Lai, K Robert; Lee, Tzong-Yi
2017-01-01
Protein ubiquitination, involving the conjugation of ubiquitin on lysine residue, serves as an important modulator of many cellular functions in eukaryotes. Recent advancements in proteomic technology have stimulated increasing interest in identifying ubiquitination sites. However, most computational tools for predicting ubiquitination sites are focused on small-scale data. With an increasing number of experimentally verified ubiquitination sites, we were motivated to design a predictive model for identifying lysine ubiquitination sites for large-scale proteome dataset. This work assessed not only single features, such as amino acid composition (AAC), amino acid pair composition (AAPC) and evolutionary information, but also the effectiveness of incorporating two or more features into a hybrid approach to model construction. The support vector machine (SVM) was applied to generate the prediction models for ubiquitination site identification. Evaluation by five-fold cross-validation showed that the SVM models learned from the combination of hybrid features delivered a better prediction performance. Additionally, a motif discovery tool, MDDLogo, was adopted to characterize the potential substrate motifs of ubiquitination sites. The SVM models integrating the MDDLogo-identified substrate motifs could yield an average accuracy of 68.70 percent. Furthermore, the independent testing result showed that the MDDLogo-clustered SVM models could provide a promising accuracy (78.50 percent) and perform better than other prediction tools. Two cases have demonstrated the effective prediction of ubiquitination sites with corresponding substrate motifs.
A conserved truncated isoform of the ATR-X syndrome protein lacking the SWI/SNF-homology domain.
Garrick, David; Samara, Vassiliki; McDowell, Tarra L; Smith, Andrew J H; Dobbie, Lorraine; Higgs, Douglas R; Gibbons, Richard J
2004-02-04
Mutations in the ATRX gene cause a severe X-linked mental retardation syndrome that is frequently associated with alpha thalassemia (ATR-X syndrome). The previously characterized ATRX protein (approximately 280 kDa) contains both a Plant homeodomain (PHD)-like zinc finger motif as well as an ATPase domain of the SNF2 family. These motifs suggest that ATRX may function as a regulator of gene expression, probably by exerting an effect on chromatin structure, although the exact cellular role of ATRX has not yet been fully elucidated. Here we characterize a truncated (approximately 200 kDa) isoform of ATRX (called here ATRXt) that has been highly conserved between mouse and human. In both species, ATRXt arises due to the failure to splice intron 11 from the primary transcript, and the use of a proximal intronic poly(A) signal. We show that the relative expression of the full length and ATRXt isoforms is subject to tissue-specific regulation. The ATRXt isoform contains the PHD-like domain but not the SWI/SNF-like motifs and is therefore unlikely to be functionally equivalent to the full length protein. We used indirect immunofluorescence to demonstrate that the full length and ATRXt isoforms are colocalized at blocks of pericentromeric heterochromatin but unlike full length ATRX, the truncated isoform does not associate with promyelocytic leukemia (PML) nuclear bodies. The high degree of conservation of ATRXt and the tight regulation of its expression relative to the full length protein suggest that this truncated isoform fulfills an important biological function.
Dimerization of the docking/adaptor protein HEF1 via a carboxy-terminal helix-loop-helix domain.
Law, S F; Zhang, Y Z; Fashena, S J; Toby, G; Estojak, J; Golemis, E A
1999-10-10
HEF1, p130(Cas), and Efs define a family of multidomain docking proteins which plays a central coordinating role for tyrosine-kinase-based signaling related to cell adhesion. HEF1 function has been specifically implicated in signaling pathways important for cell adhesion and differentiation in lymphoid and epithelial cells. While the SH3 domains and SH2-binding site domains (substrate domains) of HEF1 family proteins are well characterized and binding partners known, to date the highly conserved carboxy-terminal domains of the three proteins have lacked functional definition. In this study, we have determined that the carboxy-terminal domain of HEF1 contains a divergent helix-loop-helix (HLH) motif. This motif mediates HEF1 homodimerization and HEF1 heterodimerization with a recognition specificity similar to that of the transcriptional regulatory HLH proteins Id2, E12, and E47. We had previously demonstrated that the HEF1 carboxy-terminus expressed as a separate domain in yeast reprograms cell division patterns, inducing constitutive pseudohyphal growth. Here we show that pseudohyphal induction by HEF1 requires an intact HLH, further supporting the idea that this motif has an effector activity for HEF1, and implying that HEF1 pseudohyphal activity derives in part from interactions with yeast helix-loop-helix proteins. These combined results provide initial insight into the mode of function of the HEF1 carboxy-terminal domain and suggest that the HEF1 protein may interact with cellular proteins which control differentiation. Copyright 1999 Academic Press.
González-Hernández, Mariana; Hoffmann, Markus; Brinkmann, Constantin; Nehls, Julia; Winkler, Michael; Schindler, Michael; Pöhlmann, Stefan
2018-04-18
The interferon-induced antiviral host cell protein tetherin can inhibit the release of several enveloped viruses from infected cells. The Ebola virus (EBOV) glycoprotein (GP) antagonizes tetherin but the domains and amino acids in GP that are required for tetherin antagonism have not been fully defined. A GXXXA motif within the transmembrane domain (TMD) of EBOV-GP was previously shown to be important for GP-mediated cellular detachment. Here, we investigated whether this motif also contributes to tetherin antagonism. Mutation of the GXXXA motif did not impact GP expression or particle incorporation and only modestly reduced EBOV-GP-driven entry. In contrast, the GXXXA motif was required for tetherin antagonism in transfected cells. Moreover, alteration of the GXXXA motif increased tetherin-sensitivity of a replication-competent vesicular stomatitis virus (VSV) chimera encoding EBOV-GP. Although these results await confirmation with authentic EBOV, they indicate that a GXXXA motif in the TMD of EBOV-GP is important for tetherin antagonism. Moreover, they provide the first evidence that GP can antagonize tetherin in the context of an infectious EBOV surrogate. IMPORTANCE The glycoprotein (GP) of Ebola virus (EBOV) inhibits the antiviral host cell protein tetherin and may promote viral spread in tetherin-positive cells. However, tetherin antagonism by GP has so far only been demonstrated using virus-like particles and it is unknown whether GP can block tetherin in infected cells. Moreover, a mutation in GP that selectively abrogates tetherin antagonism is unknown. Here, we show that a GXXXA motif in the transmembrane domain of EBOV-GP, which was previously reported to be required for GP-mediated cell rounding, is also important for tetherin counteraction. Moreover, analysis of this mutation in the context of vesicular stomatitis virus chimeras encoding EBOV-GP revealed that GP-mediated tetherin counteraction is operative in infected cells. To our knowledge, these findings demonstrate for the first time that GP can antagonize tetherin in infected cells and provide a tool to study the impact of GP-dependent tetherin counteraction on EBOV spread. Copyright © 2018 American Society for Microbiology.
Xiao, Jing; Kim, Leslie S.
2006-01-01
The auxilin family of J-domain proteins load Hsp70 onto clathrin-coated vesicles (CCVs) to drive uncoating. In vitro, auxilin function requires its ability to bind clathrin and stimulate Hsp70 ATPase activity via its J-domain. To test these requirements in vivo, we performed a mutational analysis of Swa2p, the yeast auxilin ortholog. Swa2p is a modular protein with three N-terminal clathrin-binding (CB) motifs, a ubiquitin association (UBA) domain, a tetratricopeptide repeat (TPR) domain, and a C-terminal J-domain. In vitro, clathrin binding is mediated by multiple weak interactions, but a Swa2p truncation lacking two CB motifs and the UBA domain retains nearly full function in vivo. Deletion of all CB motifs strongly abrogates clathrin disassembly but does not eliminate Swa2p function in vivo. Surprisingly, mutation of the invariant HPD motif within the J-domain to AAA only partially affects Swa2p function. Similarly, a TPR point mutation (G388R) causes a modest phenotype. However, Swa2p function is abolished when these TPR and J mutations are combined. The TPR and J-domains are not functionally redundant because deletion of either domain renders Swa2p nonfunctional. These data suggest that the TPR and J-domains collaborate in a bipartite interaction with Hsp70 to regulate its activity in clathrin disassembly. PMID:16687570
Toffano-Nioche, Claire; Gautheret, Daniel; Leclerc, Fabrice
2015-01-01
A structural and functional classification of H/ACA and H/ACA-like motifs is obtained from the analysis of the H/ACA guide RNAs which have been identified previously in the genomes of Euryarchaea (Pyrococcus) and Crenarchaea (Pyrobaculum). A unified structure/function model is proposed based on the common structural determinants shared by H/ACA and H/ACA-like motifs in both Euryarchaea and Crenarchaea. Using a computational approach, structural and energetic rules for the guide:target RNA-RNA interactions are derived from structural and functional data on the H/ACA RNP particles. H/ACA(-like) motifs found in Pyrococcus are evaluated through the classification and their biological relevance is discussed. Extra-ribosomal targets found in both Pyrococcus and Pyrobaculum might support the hypothesis of a gene regulation mediated by H/ACA(-like) guide RNAs in archaea. PMID:26240384
De Novo Evolutionary Emergence of a Symmetrical Protein Is Shaped by Folding Constraints
Smock, Robert G.; Yadid, Itamar; Dym, Orly; Clarke, Jane; Tawfik, Dan S.
2016-01-01
Summary Molecular evolution has focused on the divergence of molecular functions, yet we know little about how structurally distinct protein folds emerge de novo. We characterized the evolutionary trajectories and selection forces underlying emergence of β-propeller proteins, a globular and symmetric fold group with diverse functions. The identification of short propeller-like motifs (<50 amino acids) in natural genomes indicated that they expanded via tandem duplications to form extant propellers. We phylogenetically reconstructed 47-residue ancestral motifs that form five-bladed lectin propellers via oligomeric assembly. We demonstrate a functional trajectory of tandem duplications of these motifs leading to monomeric lectins. Foldability, i.e., higher efficiency of folding, was the main parameter leading to improved functionality along the entire evolutionary trajectory. However, folding constraints changed along the trajectory: initially, conflicts between monomer folding and oligomer assembly dominated, whereas subsequently, upon tandem duplication, tradeoffs between monomer stability and foldability took precedence. PMID:26806127
The role of symmetry in the regulation of brain dynamics
NASA Astrophysics Data System (ADS)
Tang, Evelyn; Giusti, Chad; Cieslak, Matthew; Grafton, Scott; Bassett, Danielle
Synchronous neural processes regulate a wide range of behaviors from attention to learning. Yet structural constraints on these processes are far from understood. We draw on new theoretical links between structural symmetries and the control of synchronous function, to offer a reconceptualization of the relationships between brain structure and function in human and non-human primates. By classifying 3-node motifs in macaque connectivity data, we find the most prevalent motifs can theoretically ensure a diversity of function including strict synchrony as well as control to arbitrary states. The least prevalent motifs are theoretically controllable to arbitrary states, which may not be desirable in a biological system. In humans, regions with high topological similarity of connections (a continuous notion related to symmetry) are most commonly found in fronto-parietal systems, which may account for their critical role in cognitive control. Collectively, our work underscores the role of symmetry and topological similarity in regulating dynamics of brain function.
Motifs, modules and games in bacteria.
Wolf, Denise M; Arkin, Adam P
2003-04-01
Global explorations of regulatory network dynamics, organization and evolution have become tractable thanks to high-throughput sequencing and molecular measurement of bacterial physiology. From these, a nascent conceptual framework is developing, that views the principles of regulation in term of motifs, modules and games. Motifs are small, repeated, and conserved biological units ranging from molecular domains to small reaction networks. They are arranged into functional modules, genetically dissectible cellular functions such as the cell cycle, or different stress responses. The dynamical functioning of modules defines the organism's strategy to survive in a game, pitting cell against cell, and cell against environment. Placing pathway structure and dynamics into an evolutionary context begins to allow discrimination between those physical and molecular features that particularize a species to its surroundings, and those that provide core physiological function. This approach promises to generate a higher level understanding of cellular design, pathway evolution and cellular bioengineering.
Motifs, modules and games in bacteria
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wolf, Denise M.; Arkin, Adam P.
2003-04-01
Global explorations of regulatory network dynamics, organization and evolution have become tractable thanks to high-throughput sequencing and molecular measurement of bacterial physiology. From these, a nascent conceptual framework is developing, that views the principles of regulation in term of motifs, modules and games. Motifs are small, repeated, and conserved biological units ranging from molecular domains to small reaction networks. They are arranged into functional modules, genetically dissectible cellular functions such as the cell cycle, or different stress responses. The dynamical functioning of modules defines the organism's strategy to survive in a game, pitting cell against cell, and cell against environment.more » Placing pathway structure and dynamics into an evolutionary context begins to allow discrimination between those physical and molecular features that particularize a species to its surroundings, and those that provide core physiological function. This approach promises to generate a higher level understanding of cellular design, pathway evolution and cellular bioengineering.« less
Discriminative motif discovery via simulated evolution and random under-sampling.
Song, Tao; Gu, Hong
2014-01-01
Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons affecting the performance of the discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the stage of Hidden Markov Models (HMMs) training, a random under-sampling method is introduced for the imbalance between the positive and negative datasets. It is shown that, in the task of discovering targeting motifs of nine subcellular compartments, the motifs found by our method are more conserved than the methods without considering data imbalance problem and recover the most known targeting motifs from Minimotif Miner and InterPro. Meanwhile, we use the found motifs to predict protein subcellular localization and achieve higher prediction precision and recall for the minority classes.
Qin, Xiaoqiong; Coku, Ardian; Inoue, Kentaro; Tian, Li
2011-10-01
Carotenoids perform many critical functions in plants, animals, and humans. It is therefore important to understand carotenoid biosynthesis and its regulation in plants. Phytoene synthase (PSY) catalyzes the first committed and rate-limiting step in carotenoid biosynthesis. While PSY is present as a single copy gene in Arabidopsis, duplicated PSY genes have been identified in many economically important monocot and dicot crops. CmPSY1 was previously identified from melon (Cucumis melo L.), but was not functionally characterized. We isolated a second PSY gene, CmPSY2, from melon in this work. CmPSY2 possesses a unique intron/exon structure that has not been observed in other plant PSYs. Both CmPSY1 and CmPSY2 are functional in vitro, but exhibit distinct expression patterns in different melon tissues and during fruit development, suggesting differential regulation of the duplicated melon PSY genes. In vitro chloroplast import assays verified the plastidic localization of CmPSY1 and CmPSY2 despite the lack of an obvious plastid target peptide in CmPSY2. Promoter motif analysis of the duplicated melon and tomato PSY genes and the Arabidopsis PSY revealed distinctive cis-regulatory structures of melon PSYs and identified gibberellin-responsive motifs in all PSYs except for SlPSY1, which has not been reported previously. Overall, these data provide new insights into the evolutionary history of plant PSY genes and the regulation of PSY expression by developmental and environmental signals that may involve different regulatory networks.
Redox pathways of the mitochondrion.
Koehler, Carla M; Beverly, Kristen N; Leverich, Edward P
2006-01-01
The mitochondrion houses a variety of redox pathways, utilized for protection from oxidative damage and assembly of the organelle. The glutathione/glutaredoxin and thioredoxin systems function in the mitochondrial matrix. The intermembrane space is protected from oxidative damage via superoxide dismutase and glutathione. Subunits in the cytochrome bc (1) complex utilize disulfide bonds for enzymatic activity, whereas cytochrome oxidase relies on disulfide linkages for copper acquisition. A redox pathway (Mia40p and Erv1p) mediates the import of intermembrane space proteins such as the small Tim proteins, Cox17p, and Cox19p, which have disulfide bonds. Many of the candidate proteins with disulfide bridges possess a twin CX3C motif or CX9C motif and utilize both metal binding and disulfide linkages for function. It may seem surprising that the intermembrane space has developed redox pathways, considering that the buffered environment should be reducing like the cytosol. However, the prokaryotic origin of the mitochondrion suggests that the intermembrane space may be akin to the oxidative environment of the bacterial periplasm. Although the players forming disulfide bonds are not conserved between mitochondria and prokaryotes, the mitochondrion may have maintained redox chemistry as an assembly mechanism in the intermembrane space for the import of proteins and metals and enzymatic activity.
Skvortsov, A N; Zatulovskiĭ, E A; Puchkova, L V
2012-01-01
It was shown recently, that high affinity Cu(I) importer eukaryotic protein CTR1 can also transport in vitro abiogenic Ag(I) ions and anticancer drug cisplatin. At present there is no rational explanation how CTR1 can transfer platinum group, which is different by coordination properties from highly similar Cu(I) and Ag(I). To understand this phenomenon we analyzed 25 sequences of chordate CTR1 proteins, and found out conserved patterns of organization of N-terminal extracellular part of CTR1 which correspond to initial metal binding. Extracellular copper-binding motifs were qualified by their coordination properties. It was shown that relative position of Met- and His-rich copper-binding motifs in CTR1 predisposes the extracellular CTR1 part to binding of copper, silver and cisplatin. Relation between tissue-specific expression of CTR1 gene, steady-state copper concentration, and silver and platinum accumulation in organs of mice in vivo was analyzed. Significant positive but incomplete correlation exists between these variables. Basing on structural and functional peculiarities of N-terminal part of CTR1 a hypothesis of coupled transport of copper and cisplatin has been suggested, which avoids the disagreement between CTR1-mediated cisplatin transport in vitro, and irreversible binding of platinum to Met-rich peptides.
Detection of core-periphery structure in networks based on 3-tuple motifs
NASA Astrophysics Data System (ADS)
Ma, Chuang; Xiang, Bing-Bing; Chen, Han-Shuang; Small, Michael; Zhang, Hai-Feng
2018-05-01
Detecting mesoscale structure, such as community structure, is of vital importance for analyzing complex networks. Recently, a new mesoscale structure, core-periphery (CP) structure, has been identified in many real-world systems. In this paper, we propose an effective algorithm for detecting CP structure based on a 3-tuple motif. In this algorithm, we first define a 3-tuple motif in terms of the patterns of edges as well as the property of nodes, and then a motif adjacency matrix is constructed based on the 3-tuple motif. Finally, the problem is converted to find a cluster that minimizes the smallest motif conductance. Our algorithm works well in different CP structures: including single or multiple CP structure, and local or global CP structures. Results on the synthetic and the empirical networks validate the high performance of our method.
Sangadala, Sreedhara; Rao Metpally, Raghu Prasad; B Reddy, Boojala Vijay
2007-08-01
Abstract The ubiquitin-proteasome proteolytic pathway is essential for various important biological processes including cell cycle progression, gene transcription, and signal transduction. One of the important regulatory mechanisms by which the bone-inducing activity of the bone morphogenetic protein (BMP) signaling is modulated involves ubiquitin-mediated proteasomal degradation. The BMP induced receptor signal is transmitted intracellularly by phosphorylation of Smad proteins by the activated receptor I. The phosphorylated Smads 1, 5, and 8 (R-Smads) oligomerize with the co-Smad (Smad4). The complex, thus, formed translocates to the nucleus and interacts with other cofactors to regulate the expression of downstream target genes. R-Smads contain PPXY motif in the linker region that interacts with Smad ubiquitin regulatory factor 1 (Smurf1), an E3 ubiquitin ligase that catalyzes ubiquitination of target proteins for proteasomal degradation. Smurf1 contains a HECT domain, a C2 domain, and 2 WW domains (WW1, WW2). The PPXY motif in target proteins and its interaction with Smurf1 may form the basis for regulation of steady-state levels of Smads in controlling BMP-responsiveness of cells. Here, we present a homology-based model of the Smurf1 WW2 domain and the target octa-peptides containing PPXY motif of Smurf1- interacting Smads. We carried out docking of Smurf1 WW2 domain with the PPXY motifs of Smadl, Smad5, and Smad6 and identified the key amino acid residues involved in interaction. Furthermore, we present experimental evidence that WW2 domain of Smurf1 does indeed interact with the Smad proteins and that the deletion of WW2 domain of Smurf1 results in loss of its binding to Smads using the purified recombinant proteins. Finally, we also present data confirming that the deletion of WW2 domain in Smurf1 abolishes its ubiquitination activity on Smad1 in an in vitro ubiquitination assay. It shows that the interaction between the WW domain and Smad PPXY motif is a key step in Smurf1-mediated ubiquitination of its natural targets such as Smad1, Smad5, and Smad6. This work facilitates further strategies to unravel the biological function of such interactions and help in designing effective mimetic compounds that either mimic or disrupt the specific interaction.
Sangadala, Sreedhara; Metpally, Raghu Prasad Rao; Reddy, Boojala Vijay B
2007-08-01
The ubiquitin-proteasome proteolytic pathway is essential for various important biological processes including cell cycle progression, gene transcription, and signal transduction. One of the important regulatory mechanisms by which the bone-inducing activity of the bone morphogenetic protein (BMP) signaling is modulated involves ubiquitin-mediated proteasomal degradation. The BMP induced receptor signal is transmitted intracellularly by phosphorylation of Smad proteins by the activated receptor I. The phosphorylated Smads 1, 5, and 8 (R-Smads) oligomerize with the co-Smad (Smad4). The complex, thus, formed translocates to the nucleus and interacts with other cofactors to regulate the expression of downstream target genes. R-Smads contain PPXY motif in the linker region that interacts with Smad ubiquitin regulatory factor 1 (Smurf1), an E3 ubiquitin ligase that catalyzes ubiquitination of target proteins for proteasomal degradation. Smurf1 contains a HECT domain, a C2 domain, and 2 WW domains (WW1, WW2). The PPXY motif in target proteins and its interaction with Smurf1 may form the basis for regulation of steady-state levels of Smads in controlling BMP-responsiveness of cells. Here, we present a homology-based model of the Smurf1 WW2 domain and the target octa-peptides containing PPXY motif of Smurf1-interacting Smads. We carried out docking of Smurf1 WW2 domain with the PPXY motifs of Smad1, Smad5, and Smad6 and identified the key amino acid residues involved in interaction. Furthermore, we present experimental evidence that WW2 domain of Smurf1 does indeed interact with the Smad proteins and that the deletion of WW2 domain of Smurf1 results in loss of its binding to Smads using the purified recombinant proteins. Finally, we also present data confirming that the deletion of WW2 domain in Smurf1 abolishes its ubiquitination activity on Smad1 in an in vitro ubiquitination assay. It shows that the interaction between the WW domain and Smad PPXY motif is a key step in Smurf1-mediated ubiquitination of its natural targets such as Smad1, Smad5, and Smad6. This work facilitates further strategies to unravel the biological function of such interactions and help in designing effective mimetic compounds that either mimic or disrupt the specific interaction.
Programmable assembly of nanoarchitectures using genetically engineered viruses.
Huang, Yu; Chiang, Chung-Yi; Lee, Soo Kwan; Gao, Yan; Hu, Evelyn L; De Yoreo, James; Belcher, Angela M
2005-07-01
Biological systems possess inherent molecular recognition and self-assembly capabilities and are attractive templates for constructing complex material structures with molecular precision. Here we report the assembly of various nanoachitectures including nanoparticle arrays, hetero-nanoparticle architectures, and nanowires utilizing highly engineered M13 bacteriophage as templates. The genome of M13 phage can be rationally engineered to produce viral particles with distinct substrate-specific peptides expressed on the filamentous capsid and the ends, providing a generic template for programmable assembly of complex nanostructures. Phage clones with gold-binding motifs on the capsid and streptavidin-binding motifs at one end are created and used to assemble Au and CdSe nanocrytals into ordered one-dimensional arrays and more complex geometries. Initial studies show such nanoparticle arrays can further function as templates to nucleate highly conductive nanowires that are important for addressing/interconnecting individual nanostructures.
Degradation of Hof1 by SCFGrr1 is important for actomyosin contraction during cytokinesis in yeast
Blondel, Marc; Bach, Stéphane; Bamps, Sophie; Dobbelaere, Jeroen; Wiget, Philippe; Longaretti, Céline; Barral, Yves; Meijer, Laurent; Peter, Matthias
2005-01-01
SCF-type (SCF: Skp1–Cullin–F-box protein complex) E3 ligases regulate ubiquitin-dependent degradation of many cell cycle regulators, mainly at the G1/S transition. Here, we show that SCFGrr1 functions during cytokinesis by degrading the PCH protein Hof1. While Hof1 is required early in mitosis to assemble a functional actomyosin ring, it is specifically degraded late in mitosis and remains unstable during the entire G1 phase of the cell cycle. Degradation of Hof1 depends on its PEST motif and a functional 26S proteasome. Interestingly, degradation of Hof1 is independent of APCCdh1, but instead requires the SCFGrr1 E3 ligase. Grr1 is recruited to the mother–bud neck region after activation of the mitotic-exit network, and interacts with Hof1 in a PEST motif-dependent manner. Our results also show that downregulation of Hof1 at the end of mitosis is necessary to allow efficient contraction of the actomyosin ring and cell separation during cytokinesis. SCFGrr1-mediated degradation of Hof1 may thus represent a novel mechanism to couple exit from mitosis with initiation of cytokinesis. PMID:15775961
Structural and functional analysis of the GABARAP interaction motif (GIM)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rogov, Vladimir V.; Stolz, Alexandra; Ravichandran, Arvind C.
Through the canonical LC3 interaction motif (LIR), [W/F/Y]–X 1–X 2[I/L/V], protein complexes are recruited to autophagosomes to perform their functions as either autophagy adaptors or receptors. How these adaptors/receptors selectively interact with either LC3 or GABARAP families remains unclear. Herein, we determine the range of selectivity of 30 known core LIR motifs towards individual LC3s and GABARAPs. From these, we define a GABARAP Interaction Motif (GIM) sequence ([W/F]–[V/I]–X 2–V) that the adaptor protein PLEKHM1 tightly conforms to. Using biophysical and structural approaches, we show that the PLEKHM1–LIR is indeed 11–fold more specific for GABARAP than LC3B. Selective mutation of themore » X 1 and X 2 positions either completely abolished the interaction with all LC3 and GABARAPs or increased PLEKHM1–GIM selectivity 20–fold towards LC3B. Finally, we show that conversion of p62/SQSTM1, FUNDC1 and FIP200 LIRs into our newly defined GIM, by introducing two valine residues, enhances their interaction with endogenous GABARAP over LC3B. In conclusion, the identification of a GABARAP–specific interaction motif will aid the identification and characterization of the expanding array of autophagy receptor and adaptor proteins and their in vivo functions.« less
Structural and functional analysis of the GABARAP interaction motif (GIM)
Rogov, Vladimir V.; Stolz, Alexandra; Ravichandran, Arvind C.; ...
2017-06-27
Through the canonical LC3 interaction motif (LIR), [W/F/Y]–X 1–X 2[I/L/V], protein complexes are recruited to autophagosomes to perform their functions as either autophagy adaptors or receptors. How these adaptors/receptors selectively interact with either LC3 or GABARAP families remains unclear. Herein, we determine the range of selectivity of 30 known core LIR motifs towards individual LC3s and GABARAPs. From these, we define a GABARAP Interaction Motif (GIM) sequence ([W/F]–[V/I]–X 2–V) that the adaptor protein PLEKHM1 tightly conforms to. Using biophysical and structural approaches, we show that the PLEKHM1–LIR is indeed 11–fold more specific for GABARAP than LC3B. Selective mutation of themore » X 1 and X 2 positions either completely abolished the interaction with all LC3 and GABARAPs or increased PLEKHM1–GIM selectivity 20–fold towards LC3B. Finally, we show that conversion of p62/SQSTM1, FUNDC1 and FIP200 LIRs into our newly defined GIM, by introducing two valine residues, enhances their interaction with endogenous GABARAP over LC3B. In conclusion, the identification of a GABARAP–specific interaction motif will aid the identification and characterization of the expanding array of autophagy receptor and adaptor proteins and their in vivo functions.« less
Discovering Sequence Motifs with Arbitrary Insertions and Deletions
Frith, Martin C.; Saunders, Neil F. W.; Kobe, Bostjan; Bailey, Timothy L.
2008-01-01
Biology is encoded in molecular sequences: deciphering this encoding remains a grand scientific challenge. Functional regions of DNA, RNA, and protein sequences often exhibit characteristic but subtle motifs; thus, computational discovery of motifs in sequences is a fundamental and much-studied problem. However, most current algorithms do not allow for insertions or deletions (indels) within motifs, and the few that do have other limitations. We present a method, GLAM2 (Gapped Local Alignment of Motifs), for discovering motifs allowing indels in a fully general manner, and a companion method GLAM2SCAN for searching sequence databases using such motifs. glam2 is a generalization of the gapless Gibbs sampling algorithm. It re-discovers variable-width protein motifs from the PROSITE database significantly more accurately than the alternative methods PRATT and SAM-T2K. Furthermore, it usefully refines protein motifs from the ELM database: in some cases, the refined motifs make orders of magnitude fewer overpredictions than the original ELM regular expressions. GLAM2 performs respectably on the BAliBASE multiple alignment benchmark, and may be superior to leading multiple alignment methods for “motif-like” alignments with N- and C-terminal extensions. Finally, we demonstrate the use of GLAM2 to discover protein kinase substrate motifs and a gapped DNA motif for the LIM-only transcriptional regulatory complex: using GLAM2SCAN, we identify promising targets for the latter. GLAM2 is especially promising for short protein motifs, and it should improve our ability to identify the protein cleavage sites, interaction sites, post-translational modification attachment sites, etc., that underlie much of biology. It may be equally useful for arbitrarily gapped motifs in DNA and RNA, although fewer examples of such motifs are known at present. GLAM2 is public domain software, available for download at http://bioinformatics.org.au/glam2. PMID:18437229
Triadic motifs in the dependence networks of virtual societies.
Xie, Wen-Jie; Li, Ming-Xia; Jiang, Zhi-Qiang; Zhou, Wei-Xing
2014-06-10
In friendship networks, individuals have different numbers of friends, and the closeness or intimacy between an individual and her friends is heterogeneous. Using a statistical filtering method to identify relationships about who depends on whom, we construct dependence networks (which are directed) from weighted friendship networks of avatars in more than two hundred virtual societies of a massively multiplayer online role-playing game (MMORPG). We investigate the evolution of triadic motifs in dependence networks. Several metrics show that the virtual societies evolved through a transient stage in the first two to three weeks and reached a relatively stable stage. We find that the unidirectional loop motif (M9) is underrepresented and does not appear, open motifs are also underrepresented, while other close motifs are overrepresented. We also find that, for most motifs, the overall level difference of the three avatars in the same motif is significantly lower than average, whereas the sum of ranks is only slightly larger than average. Our findings show that avatars' social status plays an important role in the formation of triadic motifs.
Triadic motifs in the dependence networks of virtual societies
NASA Astrophysics Data System (ADS)
Xie, Wen-Jie; Li, Ming-Xia; Jiang, Zhi-Qiang; Zhou, Wei-Xing
2014-06-01
In friendship networks, individuals have different numbers of friends, and the closeness or intimacy between an individual and her friends is heterogeneous. Using a statistical filtering method to identify relationships about who depends on whom, we construct dependence networks (which are directed) from weighted friendship networks of avatars in more than two hundred virtual societies of a massively multiplayer online role-playing game (MMORPG). We investigate the evolution of triadic motifs in dependence networks. Several metrics show that the virtual societies evolved through a transient stage in the first two to three weeks and reached a relatively stable stage. We find that the unidirectional loop motif (M9) is underrepresented and does not appear, open motifs are also underrepresented, while other close motifs are overrepresented. We also find that, for most motifs, the overall level difference of the three avatars in the same motif is significantly lower than average, whereas the sum of ranks is only slightly larger than average. Our findings show that avatars' social status plays an important role in the formation of triadic motifs.
Triadic motifs in the dependence networks of virtual societies
Xie, Wen-Jie; Li, Ming-Xia; Jiang, Zhi-Qiang; Zhou, Wei-Xing
2014-01-01
In friendship networks, individuals have different numbers of friends, and the closeness or intimacy between an individual and her friends is heterogeneous. Using a statistical filtering method to identify relationships about who depends on whom, we construct dependence networks (which are directed) from weighted friendship networks of avatars in more than two hundred virtual societies of a massively multiplayer online role-playing game (MMORPG). We investigate the evolution of triadic motifs in dependence networks. Several metrics show that the virtual societies evolved through a transient stage in the first two to three weeks and reached a relatively stable stage. We find that the unidirectional loop motif (M9) is underrepresented and does not appear, open motifs are also underrepresented, while other close motifs are overrepresented. We also find that, for most motifs, the overall level difference of the three avatars in the same motif is significantly lower than average, whereas the sum of ranks is only slightly larger than average. Our findings show that avatars' social status plays an important role in the formation of triadic motifs. PMID:24912755
Li, Yanhua; Shyu, Duan-Liang; Shang, Pengcheng; Bai, Jianfa; Ouyang, Kang; Dhakal, Santosh; Hiremath, Jagadish; Binjawadagi, Basavaraj
2016-01-01
ABSTRACT Porcine reproductive and respiratory syndrome virus (PRRSV) nonstructural protein 1β (nsp1β) is a multifunctional viral protein, which is involved in suppressing the host innate immune response and activating a unique −2/−1 programmed ribosomal frameshifting (PRF) signal for the expression of frameshifting products. In this study, site-directed mutagenesis analysis showed that the R128A or R129A mutation introduced into a highly conserved motif (123GKYLQRRLQ131) reduced the ability of nsp1β to suppress interferon beta (IFN-β) activation and also impaired nsp1β's function as a PRF transactivator. Three recombinant viruses, vR128A, vR129A, and vRR129AA, carrying single or double mutations in the GKYLQRRLQ motif were characterized. In comparison to the wild-type (WT) virus, vR128A and vR129A showed slightly reduced growth abilities, while the vRR129AA mutant had a significantly reduced growth ability in infected cells. Consistent with the attenuated growth phenotype in vitro, pigs infected with nsp1β mutants had lower levels of viremia than did WT virus-infected pigs. Compared to the WT virus in infected cells, all three mutated viruses stimulated high levels of IFN-α expression and exhibited a reduced ability to suppress the mRNA expression of selected interferon-stimulated genes (ISGs). In pigs infected with nsp1β mutants, IFN-α production was increased in the lungs at early time points postinfection, which was correlated with increased innate NK cell function. Furthermore, the augmented innate response was consistent with the increased production of IFN-γ in pigs infected with mutated viruses. These data demonstrate that residues R128 and R129 are critical for nsp1β function and that modifying these key residues in the GKYLQRRLQ motif attenuates virus growth ability and improves the innate and adaptive immune responses in infected animals. IMPORTANCE PRRSV infection induces poor antiviral innate IFN and cytokine responses, which results in weak adaptive immunity. One of the strategies in next-generation vaccine construction is to manipulate viral proteins/genetic elements involved in antagonizing the host immune response. PRRSV nsp1β was identified to be a strong innate immune antagonist. In this study, two basic amino acids, R128 and R129, in a highly conserved GKYLQRRLQ motif were determined to be critical for nsp1β function. Mutations introduced into these two residues attenuated virus growth and improved the innate and adaptive immune responses of infected animals. Technologies developed in this study could be broadly applied to current commercial PRRSV modified live-virus (MLV) vaccines and other candidate vaccines. PMID:26792733
Transsynaptic Coordination of Synaptic Growth, Function, and Stability by the L1-Type CAM Neuroglian
Moreno, Eliza; Stephan, Raiko; Boerner, Jana; Godenschwege, Tanja A.; Pielage, Jan
2013-01-01
The precise control of synaptic connectivity is essential for the development and function of neuronal circuits. While there have been significant advances in our understanding how cell adhesion molecules mediate axon guidance and synapse formation, the mechanisms controlling synapse maintenance or plasticity in vivo remain largely uncharacterized. In an unbiased RNAi screen we identified the Drosophila L1-type CAM Neuroglian (Nrg) as a central coordinator of synapse growth, function, and stability. We demonstrate that the extracellular Ig-domains and the intracellular Ankyrin-interaction motif are essential for synapse development and stability. Nrg binds to Ankyrin2 in vivo and mutations reducing the binding affinities to Ankyrin2 cause an increase in Nrg mobility in motoneurons. We then demonstrate that the Nrg–Ank2 interaction controls the balance of synapse growth and stability at the neuromuscular junction. In contrast, at a central synapse, transsynaptic interactions of pre- and postsynaptic Nrg require a dynamic, temporal and spatial, regulation of the intracellular Ankyrin-binding motif to coordinate pre- and postsynaptic development. Our study at two complementary model synapses identifies the regulation of the interaction between the L1-type CAM and Ankyrin as an important novel module enabling local control of synaptic connectivity and function while maintaining general neuronal circuit architecture. PMID:23610557
Enneking, Eva-Maria; Kudumala, Sirisha R; Moreno, Eliza; Stephan, Raiko; Boerner, Jana; Godenschwege, Tanja A; Pielage, Jan
2013-01-01
The precise control of synaptic connectivity is essential for the development and function of neuronal circuits. While there have been significant advances in our understanding how cell adhesion molecules mediate axon guidance and synapse formation, the mechanisms controlling synapse maintenance or plasticity in vivo remain largely uncharacterized. In an unbiased RNAi screen we identified the Drosophila L1-type CAM Neuroglian (Nrg) as a central coordinator of synapse growth, function, and stability. We demonstrate that the extracellular Ig-domains and the intracellular Ankyrin-interaction motif are essential for synapse development and stability. Nrg binds to Ankyrin2 in vivo and mutations reducing the binding affinities to Ankyrin2 cause an increase in Nrg mobility in motoneurons. We then demonstrate that the Nrg-Ank2 interaction controls the balance of synapse growth and stability at the neuromuscular junction. In contrast, at a central synapse, transsynaptic interactions of pre- and postsynaptic Nrg require a dynamic, temporal and spatial, regulation of the intracellular Ankyrin-binding motif to coordinate pre- and postsynaptic development. Our study at two complementary model synapses identifies the regulation of the interaction between the L1-type CAM and Ankyrin as an important novel module enabling local control of synaptic connectivity and function while maintaining general neuronal circuit architecture.
Majidi, Asia; Nikkhah, Maryam; Sadeghian, Faranak; Hosseinkhani, Saman
2016-10-01
In last decades great efforts have been devoted to the study of development of recombinant peptide based vectors that consist of biological motifs with potential applications in gene therapy. Recombinant Biomimetic Chimeric Vectors (rBCVs) are biopolymeric nanocarriers that are designed to mimic viral features to overcome the cellular obstacles in gene transferring pathway into cell nucleus. In this research, we designed and genetically engineered three novel rBCVs with similar sequences that differed in motifs arrangement and motif abundance: MPG-2H1, 2TMPG-2H1 and 2RMPG-2H1. The MPG as a famous amphipathic cell penetrating peptide is the main segment of these constructs which was studied for the first time in association with truncated histone H1 DNA condensing motif. Through the performance of several physicochemical and biological assays, the rBCVs were remarkably examined regarding transfection efficiency. The main objective of this study is focused on the importance of motif design in transfection efficiency of rBCVs on one hand, and the assessment of correlation between structural features and functionality of motifs on the other hand. The results revealed that all three kinds of rBCVs/pDNA nanoparticles with average sizes of 200nm could overwhelm the cellular obstacles associated with gene transfer, and lead to efficient gene delivery. Furthermore, no significant toxicity was perceived and efficient endosome disruptive activity was obtained. It is noteworthy to say among three mentioned constructs 2RMPG-2H1 showed the highest transfection efficiency. Overall the peptide based vectors hold great promise as a nontoxic and effective gene carrier in vitro and in vivo, besides the rational design possibility as the most vital advantages over the other non-viral gene delivery vectors. Copyright © 2016 Elsevier B.V. All rights reserved.
Johnson, Glynis; Moore, Samuel W
2013-09-01
Short linear motifs confer evolutionary flexibility on proteins as they can be added with relative ease allowing the acquisition of new functions. Such motifs may mediate a variety of signalling functions. The adhesion-mediating Leu-Arg-Glu (LRE) motif is enriched in laminin beta 2, and has been observed in other proteins, including members of the carboxylesterase/cholinesterase family. It acts as a stop signal for growing axons in the developing neuromuscular junction, binding to the voltage-gated calcium channel. In this bioinformatic analysis, we have investigated the presence of the motif in proteins of the neuromuscular junction, and have also examined its structural position and potential for ligand interaction, as well as phylogenetic conservation, in the carboxylesterase/cholinesterase family. The motif was observed to occur with a significantly higher frequency than expected in the UniProt/Swiss-Prot database, as well as in four individual species (human, mouse, Caenorhabditis elegans and Drosophila melanogaster). Examination of its presence in neuromuscular junction proteins showed it to be enriched in certain proteins of the synaptic basement membrane, including laminin, agrin, acetylcholinesterase and tenascin. A highly significant enrichment was observed in cytoskeletal proteins, particularly intermediate filament proteins and members of the spectrin family. In the carboxylesterase/cholinesterase family, the motif was observed in four conserved positions in the protein structure. It is present in the majority of mammalian acetylcholinesterases, as well as acetylcholinesterases from electric fish and a number of invertebrates. In insects, it is present in the ace-2, rather than in the synaptic ace-1, enzyme. It is also observed in the cholinesterase-like adhesion molecules (neuroligins, neurotactin and glutactin). It is never seen in butyrylcholinesterases, which do not mediate cell adhesion. In conclusion, the significant enrichment of the motif in certain classes of protein, as well as its conserved presence and structural positioning in one protein family, suggests that it has specific functions both in cell adhesion in the neuromuscular junction and in maintaining the structural integrity of the cytoskeleton. Copyright © 2013 Elsevier Inc. All rights reserved.
Maurer-Stroh, Sebastian; Gao, He; Han, Hao; Baeten, Lies; Schymkowitz, Joost; Rousseau, Frederic; Zhang, Louxin; Eisenhaber, Frank
2013-02-01
Data mining in protein databases, derivatives from more fundamental protein 3D structure and sequence databases, has considerable unearthed potential for the discovery of sequence motif--structural motif--function relationships as the finding of the U-shape (Huf-Zinc) motif, originally a small student's project, exemplifies. The metal ion zinc is critically involved in universal biological processes, ranging from protein-DNA complexes and transcription regulation to enzymatic catalysis and metabolic pathways. Proteins have evolved a series of motifs to specifically recognize and bind zinc ions. Many of these, so called zinc fingers, are structurally independent globular domains with discontinuous binding motifs made up of residues mostly far apart in sequence. Through a systematic approach starting from the BRIX structure fragment database, we discovered that there exists another predictable subset of zinc-binding motifs that not only have a conserved continuous sequence pattern but also share a characteristic local conformation, despite being included in totally different overall folds. While this does not allow general prediction of all Zn binding motifs, a HMM-based web server, Huf-Zinc, is available for prediction of these novel, as well as conventional, zinc finger motifs in protein sequences. The Huf-Zinc webserver can be freely accessed through this URL (http://mendel.bii.a-star.edu.sg/METHODS/hufzinc/).
Krystkowiak, Izabella; Manguy, Jean; Davey, Norman E
2018-06-05
There is a pressing need for in silico tools that can aid in the identification of the complete repertoire of protein binding (SLiMs, MoRFs, miniMotifs) and modification (moiety attachment/removal, isomerization, cleavage) motifs. We have created PSSMSearch, an interactive web-based tool for rapid statistical modeling, visualization, discovery and annotation of protein motif specificity determinants to discover novel motifs in a proteome-wide manner. PSSMSearch analyses proteomes for regions with significant similarity to a motif specificity determinant model built from a set of aligned motif-containing peptides. Multiple scoring methods are available to build a position-specific scoring matrix (PSSM) describing the motif specificity determinant model. This model can then be modified by a user to add prior knowledge of specificity determinants through an interactive PSSM heatmap. PSSMSearch includes a statistical framework to calculate the significance of specificity determinant model matches against a proteome of interest. PSSMSearch also includes the SLiMSearch framework's annotation, motif functional analysis and filtering tools to highlight relevant discriminatory information. Additional tools to annotate statistically significant shared keywords and GO terms, or experimental evidence of interaction with a motif-recognizing protein have been added. Finally, PSSM-based conservation metrics have been created for taxonomic range analyses. The PSSMSearch web server is available at http://slim.ucd.ie/pssmsearch/.
Alves-Silva, J.; Sánchez-Soriano, N.; Beaven, R.; Klein, M.; Parkin, J.; Millard, T.H.; Bellen, H. J; Venken, K. J.T.; Ballestrem, C.; Kammerer, R.A.; Prokop, A.
2013-01-01
The correct outgrowth of axons is essential for the development and regeneration of nervous systems. Axon growth is primarily driven by microtubules. Key regulators of microtubules in this context are the spectraplakins, a family of evolutionarily conserved actin-microtubule linkers. Loss of function of the mouse spectraplakin ACF7 or of its close Drosophila homologue Short stop/Shot similarly cause severe axon shortening and microtubule disorganisation. How spectraplakins perform these functions is not known. Here we show that axonal growth promoting roles of Shot require interaction with EB1 (End binding protein) at polymerising plus ends of microtubules. We show that binding of Shot to EB1 requires SxIP motifs in Shot’s carboxyterminal tail (Ctail), mutations of these motifs abolish Shot functions in axonal growth, loss of EB1 function phenocopies Shot loss, and genetic interaction studies reveal strong functional links between Shot and EB1 in axonal growth and microtubule organisation. In addition, we report that Shot localises along microtubule shafts and stabilises them against pharmacologically induced depolymerisation. This function is EB1-independent but requires net positive charges within Ctail which essentially contribute to the microtubule shaft association of Shot. Therefore, spectraplakins are true members of two important classes of neuronal microtubule regulating proteins: +TIPs (plus end regulators) and structural MAPs (microtubule associated proteins). From our data we deduce a model that relates the different features of the spectraplakin carboxy-terminus to the two functions of Shot during axonal growth. PMID:22764224
Tran, Tuan; Disney, Matthew D
2012-01-01
RNA is an important therapeutic target but information about RNA-ligand interactions is limited. Here, we report a screening method that probes over 3,000,000 combinations of RNA motif-small molecule interactions to identify the privileged RNA structures and chemical spaces that interact. Specifically, a small molecule library biased for binding RNA was probed for binding to over 70,000 unique RNA motifs in a high throughput solution-based screen. The RNA motifs that specifically bind each small molecule were identified by microarray-based selection. In this library-versus-library or multidimensional combinatorial screening approach, hairpin loops (among a variety of RNA motifs) were the preferred RNA motif space that binds small molecules. Furthermore, it was shown that indole, 2-phenyl indole, 2-phenyl benzimidazole and pyridinium chemotypes allow for specific recognition of RNA motifs. As targeting RNA with small molecules is an extremely challenging area, these studies provide new information on RNA-ligand interactions that has many potential uses.
Tran, Tuan; Disney, Matthew D.
2012-01-01
RNA is an important therapeutic target but information about RNA-ligand interactions is limited. Here we report a screening method that probes over 3,000,000 combinations of RNA motif-small molecule interactions to identify the privileged RNA structures and chemical spaces that interact. Specifically, a small molecule library biased for binding RNA was probed for binding to over 70,000 unique RNA motifs in a high throughput solution-based screen. The RNA motifs that specifically bind each small molecule were identified by microarray-based selection. In this library-versus-library or multidimensional combinatorial screening approach, hairpin loops (amongst a variety of RNA motifs) were the preferred RNA motif space that binds small molecules. Furthermore, it was shown that indole, 2-phenyl indole, 2-phenyl benzimidazole, and pyridinium chemotypes allow for specific recognition of RNA motifs. Since targeting RNA with small molecules is an extremely challenging area, these studies provide new information on RNA-ligand interactions that has many potential uses. PMID:23047683
2013-01-01
Background The arginine of the D/E/NRY motif in Rhodopsin family G protein-coupled receptors (GPCRs) is conserved in 96% of these proteins. In some GPCRs, this arginine in transmembrane 3 can form a salt bridge with an aspartic acid or glutamic acid in transmembrane 6. The Drosophila melanogaster GPCR Trapped in endoderm-1 (Tre1) is required for normal primordial germ cell migration. In a mutant form of the protein, Tre1sctt, eight amino acids RYILIACH are missing, resulting in a severe disruption of primordial germ cell development. The impact of the loss of these amino acids on Tre1 structure is unknown. Since the missing amino acids in Tre1sctt include the arginine that is part of the D/E/NRY motif in Tre1, molecular dynamics simulations were performed to explore the hypothesis that these amino acids are involved in salt bridge formation and help maintain Tre1 structure. Results Structural predictions of wild type Tre1 (Tre1+) and Tre1sctt were subjected to over 250 ns of molecular dynamics simulations. The ability of the model systems to form a salt bridge between the arginine of the D/E/NRY motif and an aspartic acid residue in transmembrane 6 was analyzed. The results indicate that a stable salt bridge can form in the Tre1+ systems and a weak salt bridge or no salt bridge, using an alternative arginine, is likely in the Tre1sctt systems. Conclusions The weak salt bridge or lack of a salt bridge in the Tre1sctt systems could be one possible explanation for the disrupted function of Tre1sctt in primordial germ cell migration. These results provide a framework for studying the importance of the arginine of the D/E/NRY motif in the structure and function of other GPCRs that are involved in cell migration, such as CXCR4 in the mouse, zebrafish, and chicken. PMID:24044607
Pruitt, Margaret M; Lamm, Monica H; Coffman, Clark R
2013-09-18
The arginine of the D/E/NRY motif in Rhodopsin family G protein-coupled receptors (GPCRs) is conserved in 96% of these proteins. In some GPCRs, this arginine in transmembrane 3 can form a salt bridge with an aspartic acid or glutamic acid in transmembrane 6. The Drosophila melanogaster GPCR Trapped in endoderm-1 (Tre1) is required for normal primordial germ cell migration. In a mutant form of the protein, Tre1sctt, eight amino acids RYILIACH are missing, resulting in a severe disruption of primordial germ cell development. The impact of the loss of these amino acids on Tre1 structure is unknown. Since the missing amino acids in Tre1sctt include the arginine that is part of the D/E/NRY motif in Tre1, molecular dynamics simulations were performed to explore the hypothesis that these amino acids are involved in salt bridge formation and help maintain Tre1 structure. Structural predictions of wild type Tre1 (Tre1+) and Tre1sctt were subjected to over 250 ns of molecular dynamics simulations. The ability of the model systems to form a salt bridge between the arginine of the D/E/NRY motif and an aspartic acid residue in transmembrane 6 was analyzed. The results indicate that a stable salt bridge can form in the Tre1+ systems and a weak salt bridge or no salt bridge, using an alternative arginine, is likely in the Tre1sctt systems. The weak salt bridge or lack of a salt bridge in the Tre1sctt systems could be one possible explanation for the disrupted function of Tre1sctt in primordial germ cell migration. These results provide a framework for studying the importance of the arginine of the D/E/NRY motif in the structure and function of other GPCRs that are involved in cell migration, such as CXCR4 in the mouse, zebrafish, and chicken.
Rawat, Manmeet; Vijay, Sonam; Gupta, Yash; Tiwari, Pramod Kumar; Sharma, Arun
2013-01-01
Plasmepsin V (PM-V) have functionally conserved orthologues across the Plasmodium genus who's binding and antigenic processing at the PEXEL motifs for export about 200-300 essential proteins is important for the virulence and viability of the causative Plasmodium species. This study was undertaken to determine P. vivax plasmepsin V Ind (PvPM-V-Ind) PEXEL motif export pathway for pathogenicity-related proteins/antigens export thereby altering plasmodium exportome during erythrocytic stages. We identify and characterize Plasmodium vivax plasmepsin-V-Ind (mutant) gene by cloning, sequence analysis, in silico bioinformatic protocols and structural modeling predictions based on docking studies on binding capacity with PEXEL motifs processing in terms of binding and accessibility of export proteins. Cloning and sequence analysis for genetic diversity demonstrates PvPM-V-Ind (mutant) gene is highly conserved among all isolates from different geographical regions of India. Imperfect duplicate insertion types of mutations (SVSE from 246-249 AA and SLSE from 266-269 AA) were identified among all Indian isolates in comparison to P.vivax Sal-1 (PvPM-V-Sal 1) isolate. In silico bioinformatics interaction studies of PEXEL peptide and active enzyme reveal that PvPM-V-Ind (mutant) is only active in endoplasmic reticulum lumen and membrane embedding is essential for activation of plasmepsin V. Structural modeling predictions based on docking studies with PEXEL motif show significant variation in substrate protein binding of these imperfect mutations with data mined PEXEL sequences. The predicted variation in the docking score and interacting amino acids of PvPM-V-Ind (mutant) proteins with PEXEL and lopinavir suggests a modulation in the activity of PvPM-V in terms of binding and accessibility at these sites. Our functional modeled validation of PvPM-V-Ind (mutant) imperfect duplicate insertions with data mined PEXEL sequences leading to altered binding and substrate accessibility of the enzyme makes it a plausible target to investigate export mechanisms for in silico virtual screening and novel pharmacophore designing.
Rawat, Manmeet; Vijay, Sonam; Gupta, Yash; Tiwari, Pramod Kumar; Sharma, Arun
2013-01-01
Introduction Plasmepsin V (PM-V) have functionally conserved orthologues across the Plasmodium genus who's binding and antigenic processing at the PEXEL motifs for export about 200–300 essential proteins is important for the virulence and viability of the causative Plasmodium species. This study was undertaken to determine P. vivax plasmepsin V Ind (PvPM-V-Ind) PEXEL motif export pathway for pathogenicity-related proteins/antigens export thereby altering plasmodium exportome during erythrocytic stages. Method We identify and characterize Plasmodium vivax plasmepsin-V-Ind (mutant) gene by cloning, sequence analysis, in silico bioinformatic protocols and structural modeling predictions based on docking studies on binding capacity with PEXEL motifs processing in terms of binding and accessibility of export proteins. Results Cloning and sequence analysis for genetic diversity demonstrates PvPM-V-Ind (mutant) gene is highly conserved among all isolates from different geographical regions of India. Imperfect duplicate insertion types of mutations (SVSE from 246–249 AA and SLSE from 266–269 AA) were identified among all Indian isolates in comparison to P.vivax Sal-1 (PvPM-V-Sal 1) isolate. In silico bioinformatics interaction studies of PEXEL peptide and active enzyme reveal that PvPM-V-Ind (mutant) is only active in endoplasmic reticulum lumen and membrane embedding is essential for activation of plasmepsin V. Structural modeling predictions based on docking studies with PEXEL motif show significant variation in substrate protein binding of these imperfect mutations with data mined PEXEL sequences. The predicted variation in the docking score and interacting amino acids of PvPM-V-Ind (mutant) proteins with PEXEL and lopinavir suggests a modulation in the activity of PvPM-V in terms of binding and accessibility at these sites. Conclusion/Significance Our functional modeled validation of PvPM-V-Ind (mutant) imperfect duplicate insertions with data mined PEXEL sequences leading to altered binding and substrate accessibility of the enzyme makes it a plausible target to investigate export mechanisms for in silico virtual screening and novel pharmacophore designing. PMID:23555891
DOE Office of Scientific and Technical Information (OSTI.GOV)
Han, S.; Tainer, J.A.
2001-08-01
ADP-ribosylation is a widely occurring and biologically critical covalent chemical modification process in pathogenic mechanisms, intracellular signaling systems, DNA repair, and cell division. The reaction is catalyzed by ADP-ribosyltransferases, which transfer the ADP-ribose moiety of NAD to a target protein with nicotinamide release. A family of bacterial toxins and eukaryotic enzymes has been termed the mono-ADP-ribosyltransferases, in distinction to the poly-ADP-ribosyltransferases, which catalyze the addition of multiple ADP-ribose groups to the carboxyl terminus of eukaryotic nucleoproteins. Despite the limited primary sequence homology among the different ADP-ribosyltransferases, a central cleft bearing NAD-binding pocket formed by the two perpendicular b-sheet core hasmore » been remarkably conserved between bacterial toxins and eukaryotic mono- and poly-ADP-ribosyltransferases. The majority of bacterial toxins and eukaryotic mono-ADP-ribosyltransferases are characterized by conserved His and catalytic Glu residues. In contrast, Diphtheria toxin, Pseudomonas exotoxin A, and eukaryotic poly-ADP-ribosyltransferases are characterized by conserved Arg and catalytic Glu residues. The NAD-binding core of a binary toxin and a C3-like toxin family identified an ARTT motif (ADP-ribosylating turn-turn motif) that is implicated in substrate specificity and recognition by structural and mutagenic studies. Here we apply structure-based sequence alignment and comparative structural analyses of all known structures of ADP-ribosyltransfeases to suggest that this ARTT motif is functionally important in many ADP-ribosylating enzymes that bear a NAD binding cleft as characterized by conserved Arg and catalytic Glu residues. Overall, structure-based sequence analysis reveals common core structures and conserved active sites of ADP-ribosyltransferases to support similar NAD binding mechanisms but differing mechanisms of target protein binding via sequence variations within the ARTT motif structural framework. Thus, we propose here that the ARTT motif represents an experimentally testable general recognition motif region for many ADP-ribosyltransferases and thereby potentially provides a unified structural understanding of substrate recognition in ADP-ribosylation processes.« less
Gerevich, József
2015-01-01
One of the basic questions of the art psychology is whether a personal motif is to be found behind works of art and if so, how openly or indirectly it appears in the work itself. Analysis of examples and documents from the fine arts and literature allow us to conclude that the personal motif that can be identified by the viewer through symbols, at times easily at others with more difficulty, gives an emotional plus to the artistic product. The personal motif may be found in traumatic experiences, in communication to the model or with other emotionally important persons (mourning, disappointment, revenge, hatred, rivalry, revolt etc.), in self-searching, or self-analysis. The emotions are expressed in artistic activity either directly or indirectly. The intention nourished by the artist's identity (Kunstwollen) may stand in the way of spontaneous self-expression, channelling it into hidden paths. Under the influence of certain circumstances, the artist may arouse in the viewer, consciously or unconsciously, an illusionary, misleading image of himself. An examination of the personal motif is one of the important research areas of art therapy.
The Runt domain of AML1 (RUNX1) binds a sequence-conserved RNA motif that mimics a DNA element.
Fukunaga, Junichi; Nomura, Yusuke; Tanaka, Yoichiro; Amano, Ryo; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Sakamoto, Taiichi; Kozu, Tomoko
2013-07-01
AML1 (RUNX1) is a key transcription factor for hematopoiesis that binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. Aberrations in the AML1 gene are frequently found in human leukemia. To better understand AML1 and its potential utility for diagnosis and therapy, we obtained RNA aptamers that bind specifically to the AML1 Runt domain. Enzymatic probing and NMR analyses revealed that Apt1-S, which is a truncated variant of one of the aptamers, has a CACG tetraloop and two stem regions separated by an internal loop. All the isolated aptamers were found to contain the conserved sequence motif 5'-NNCCAC-3' and 5'-GCGMGN'N'-3' (M:A or C; N and N' form Watson-Crick base pairs). The motif contains one AC mismatch and one base bulged out. Mutational analysis of Apt1-S showed that three guanines of the motif are important for Runt binding as are the three guanines of RDE, which are directly recognized by three arginine residues of the Runt domain. Mutational analyses of the Runt domain revealed that the amino acid residues used for Apt1-S binding were similar to those used for RDE binding. Furthermore, the aptamer competed with RDE for binding to the Runt domain in vitro. These results demonstrated that the Runt domain of the AML1 protein binds to the motif of the aptamer that mimics DNA. Our findings should provide new insights into RNA function and utility in both basic and applied sciences.
The Runt domain of AML1 (RUNX1) binds a sequence-conserved RNA motif that mimics a DNA element
Fukunaga, Junichi; Nomura, Yusuke; Tanaka, Yoichiro; Amano, Ryo; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Sakamoto, Taiichi; Kozu, Tomoko
2013-01-01
AML1 (RUNX1) is a key transcription factor for hematopoiesis that binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. Aberrations in the AML1 gene are frequently found in human leukemia. To better understand AML1 and its potential utility for diagnosis and therapy, we obtained RNA aptamers that bind specifically to the AML1 Runt domain. Enzymatic probing and NMR analyses revealed that Apt1-S, which is a truncated variant of one of the aptamers, has a CACG tetraloop and two stem regions separated by an internal loop. All the isolated aptamers were found to contain the conserved sequence motif 5′-NNCCAC-3′ and 5′-GCGMGN′N′-3′ (M:A or C; N and N′ form Watson–Crick base pairs). The motif contains one AC mismatch and one base bulged out. Mutational analysis of Apt1-S showed that three guanines of the motif are important for Runt binding as are the three guanines of RDE, which are directly recognized by three arginine residues of the Runt domain. Mutational analyses of the Runt domain revealed that the amino acid residues used for Apt1-S binding were similar to those used for RDE binding. Furthermore, the aptamer competed with RDE for binding to the Runt domain in vitro. These results demonstrated that the Runt domain of the AML1 protein binds to the motif of the aptamer that mimics DNA. Our findings should provide new insights into RNA function and utility in both basic and applied sciences. PMID:23709277
Cheatle Jarvela, Alys M.; Brubaker, Lisa; Vedenko, Anastasia; Gupta, Anisha; Armitage, Bruce A.; Bulyk, Martha L.; Hinman, Veronica F.
2014-01-01
Gene regulatory networks (GRNs) describe the progression of transcriptional states that take a single-celled zygote to a multicellular organism. It is well documented that GRNs can evolve extensively through mutations to cis-regulatory modules (CRMs). Transcription factor proteins that bind these CRMs may also evolve to produce novelty. Coding changes are considered to be rarer, however, because transcription factors are multifunctional and hence are more constrained to evolve in ways that will not produce widespread detrimental effects. Recent technological advances have unearthed a surprising variation in DNA-binding abilities, such that individual transcription factors may recognize both a preferred primary motif and an additional secondary motif. This provides a source of modularity in function. Here, we demonstrate that orthologous transcription factors can also evolve a changed preference for a secondary binding motif, thereby offering an unexplored mechanism for GRN evolution. Using protein-binding microarray, surface plasmon resonance, and in vivo reporter assays, we demonstrate an important difference in DNA-binding preference between Tbrain protein orthologs in two species of echinoderms, the sea star, Patiria miniata, and the sea urchin, Strongylocentrotus purpuratus. Although both orthologs recognize the same primary motif, only the sea star Tbr also has a secondary binding motif. Our in vivo assays demonstrate that this difference may allow for greater evolutionary change in timing of regulatory control. This uncovers a layer of transcription factor binding divergence that could exist for many pairs of orthologs. We hypothesize that this divergence provides modularity that allows orthologous transcription factors to evolve novel roles in GRNs through modification of binding to secondary sites. PMID:25016582
Discriminative motif optimization based on perceptron training
Patel, Ronak Y.; Stormo, Gary D.
2014-01-01
Motivation: Generating accurate transcription factor (TF) binding site motifs from data generated using the next-generation sequencing, especially ChIP-seq, is challenging. The challenge arises because a typical experiment reports a large number of sequences bound by a TF, and the length of each sequence is relatively long. Most traditional motif finders are slow in handling such enormous amount of data. To overcome this limitation, tools have been developed that compromise accuracy with speed by using heuristic discrete search strategies or limited optimization of identified seed motifs. However, such strategies may not fully use the information in input sequences to generate motifs. Such motifs often form good seeds and can be further improved with appropriate scoring functions and rapid optimization. Results: We report a tool named discriminative motif optimizer (DiMO). DiMO takes a seed motif along with a positive and a negative database and improves the motif based on a discriminative strategy. We use area under receiver-operating characteristic curve (AUC) as a measure of discriminating power of motifs and a strategy based on perceptron training that maximizes AUC rapidly in a discriminative manner. Using DiMO, on a large test set of 87 TFs from human, drosophila and yeast, we show that it is possible to significantly improve motifs identified by nine motif finders. The motifs are generated/optimized using training sets and evaluated on test sets. The AUC is improved for almost 90% of the TFs on test sets and the magnitude of increase is up to 39%. Availability and implementation: DiMO is available at http://stormo.wustl.edu/DiMO Contact: rpatel@genetics.wustl.edu, ronakypatel@gmail.com PMID:24369152
Takeda, Ryuta; Petrov, Anton I.; Leontis, Neocles B.; Ding, Biao
2011-01-01
Cell-to-cell trafficking of RNA is an emerging biological principle that integrates systemic gene regulation, viral infection, antiviral response, and cell-to-cell communication. A key mechanistic question is how an RNA is specifically selected for trafficking from one type of cell into another type. Here, we report the identification of an RNA motif in Potato spindle tuber viroid (PSTVd) required for trafficking from palisade mesophyll to spongy mesophyll in Nicotiana benthamiana leaves. This motif, called loop 6, has the sequence 5′-CGA-3′...5′-GAC-3′ flanked on both sides by cis Watson-Crick G/C and G/U wobble base pairs. We present a three-dimensional (3D) structural model of loop 6 that specifies all non-Watson-Crick base pair interactions, derived by isostericity-based sequence comparisons with 3D RNA motifs from the RNA x-ray crystal structure database. The model is supported by available chemical modification patterns, natural sequence conservation/variations in PSTVd isolates and related species, and functional characterization of all possible mutants for each of the loop 6 base pairs. Our findings and approaches have broad implications for studying the 3D RNA structural motifs mediating trafficking of diverse RNA species across specific cellular boundaries and for studying the structure-function relationships of RNA motifs in other biological processes. PMID:21258006
Takeda, Ryuta; Petrov, Anton I; Leontis, Neocles B; Ding, Biao
2011-01-01
Cell-to-cell trafficking of RNA is an emerging biological principle that integrates systemic gene regulation, viral infection, antiviral response, and cell-to-cell communication. A key mechanistic question is how an RNA is specifically selected for trafficking from one type of cell into another type. Here, we report the identification of an RNA motif in Potato spindle tuber viroid (PSTVd) required for trafficking from palisade mesophyll to spongy mesophyll in Nicotiana benthamiana leaves. This motif, called loop 6, has the sequence 5'-CGA-3'...5'-GAC-3' flanked on both sides by cis Watson-Crick G/C and G/U wobble base pairs. We present a three-dimensional (3D) structural model of loop 6 that specifies all non-Watson-Crick base pair interactions, derived by isostericity-based sequence comparisons with 3D RNA motifs from the RNA x-ray crystal structure database. The model is supported by available chemical modification patterns, natural sequence conservation/variations in PSTVd isolates and related species, and functional characterization of all possible mutants for each of the loop 6 base pairs. Our findings and approaches have broad implications for studying the 3D RNA structural motifs mediating trafficking of diverse RNA species across specific cellular boundaries and for studying the structure-function relationships of RNA motifs in other biological processes.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Yong; Kovach, Amanda; Suino-Powell, Kelly
2008-07-23
The functional interaction between the peroxisome proliferator-activated receptor {gamma} (PPAR{gamma}) and its coactivator PGC-1{alpha} is crucial for the normal physiology of PPAR{gamma} and its pharmacological response to antidiabetic treatment with rosiglitazone. Here we report the crystal structure of the PPAR{gamma} ligand-binding domain bound to rosiglitazone and to a large PGC-1{alpha} fragment that contains two LXXLL-related motifs. The structure reveals critical contacts mediated through the first LXXLL motif of PGC-1{alpha} and the PPAR{gamma} coactivator binding site. Through a combination of biochemical and structural studies, we demonstrate that the first LXXLL motif is the most potent among all nuclear receptor coactivator motifsmore » tested, and only this motif of the two LXXLL-related motifs in PGC-1{alpha} is capable of binding to PPAR{gamma}. Our studies reveal that the strong interaction of PGC-1{alpha} and PPAR{gamma} is mediated through both hydrophobic and specific polar interactions. Mutations within the context of the full-length PGC-1{alpha} indicate that the first PGC-1{alpha} motif is necessary and sufficient for PGC-1{alpha} to coactivate PPAR{gamma} in the presence or absence of rosiglitazone. These results provide a molecular basis for specific recruitment and functional interplay between PPAR{gamma} and PGC-1{alpha} in glucose homeostasis and adipocyte differentiation.« less
Analysis of secondary structural elements in human microRNA hairpin precursors.
Liu, Biao; Childs-Disney, Jessica L; Znosko, Brent M; Wang, Dan; Fallahi, Mohammad; Gallo, Steven M; Disney, Matthew D
2016-03-01
MicroRNAs (miRNAs) regulate gene expression by targeting complementary mRNAs for destruction or translational repression. Aberrant expression of miRNAs has been associated with various diseases including cancer, thus making them interesting therapeutic targets. The composite of secondary structural elements that comprise miRNAs could aid the design of small molecules that modulate their function. We analyzed the secondary structural elements, or motifs, present in all human miRNA hairpin precursors and compared them to highly expressed human RNAs with known structures and other RNAs from various organisms. Amongst human miRNAs, there are 3808 are unique motifs, many residing in processing sites. Further, we identified motifs in miRNAs that are not present in other highly expressed human RNAs, desirable targets for small molecules. MiRNA motifs were incorporated into a searchable database that is freely available. We also analyzed the most frequently occurring bulges and internal loops for each RNA class and found that the smallest loops possible prevail. However, the distribution of loops and the preferred closing base pairs were unique to each class. Collectively, we have completed a broad survey of motifs found in human miRNA precursors, highly expressed human RNAs, and RNAs from other organisms. Interestingly, unique motifs were identified in human miRNA processing sites, binding to which could inhibit miRNA maturation and hence function.
Inforna 2.0: A Platform for the Sequence-Based Design of Small Molecules Targeting Structured RNAs.
Disney, Matthew D; Winkelsas, Audrey M; Velagapudi, Sai Pradeep; Southern, Mark; Fallahi, Mohammad; Childs-Disney, Jessica L
2016-06-17
The development of small molecules that target RNA is challenging yet, if successful, could advance the development of chemical probes to study RNA function or precision therapeutics to treat RNA-mediated disease. Previously, we described Inforna, an approach that can mine motifs (secondary structures) within target RNAs, which is deduced from the RNA sequence, and compare them to a database of known RNA motif-small molecule binding partners. Output generated by Inforna includes the motif found in both the database and the desired RNA target, lead small molecules for that target, and other related meta-data. Lead small molecules can then be tested for binding and affecting cellular (dys)function. Herein, we describe Inforna 2.0, which incorporates all known RNA motif-small molecule binding partners reported in the scientific literature, a chemical similarity searching feature, and an improved user interface and is freely available via an online web server. By incorporation of interactions identified by other laboratories, the database has been doubled, containing 1936 RNA motif-small molecule interactions, including 244 unique small molecules and 1331 motifs. Interestingly, chemotype analysis of the compounds that bind RNA in the database reveals features in small molecule chemotypes that are privileged for binding. Further, this updated database expanded the number of cellular RNAs to which lead compounds can be identified.
Niv, Masha Y.; Skrabanek, Lucy; Roberts, Richard J.; Scheraga, Harold A.; Weinstein, Harel
2008-01-01
Restriction endonucleases (REases) are DNA-cleaving enzymes that have become indispensable tools in molecular biology. Type II REases are highly divergent in sequence despite their common structural core, function and, in some cases, common specificities towards DNA sequences. This makes it difficult to identify and classify them functionally based on sequence, and has hampered the efforts of specificity-engineering. Here, we define novel REase sequence motifs, which extend beyond the PD-(D/E)XK hallmark, and incorporate secondary structure information. The automated search using these motifs is carried out with a newly developed fast regular expression matching algorithm that accommodates long patterns with optional secondary structure constraints. Using this new tool, named Scan2S, motifs derived from REases with specificity towards GATC- and CGGG-containing DNA sequences successfully identify REases of the same specificity. Notably, some of these sequences are not identified by standard sequence detection tools. The new motifs highlight potential specificity-determining positions that do not fully overlap for the GATC- and the CCGG-recognizing REases and are candidates for specificity re-engineering. PMID:17972284
A kinesin-1 binding motif in vaccinia virus that is widespread throughout the human genome
Dodding, Mark P; Mitter, Richard; Humphries, Ashley C; Way, Michael
2011-01-01
Transport of cargoes by kinesin-1 is essential for many cellular processes. Nevertheless, the number of proteins known to recruit kinesin-1 via its cargo binding light chain (KLC) is still quite small. We also know relatively little about the molecular features that define kinesin-1 binding. We now show that a bipartite tryptophan-based kinesin-1 binding motif, originally identified in Calsyntenin is present in A36, a vaccinia integral membrane protein. This bipartite motif in A36 is required for kinesin-1-dependent transport of the virus to the cell periphery. Bioinformatic analysis reveals that related bipartite tryptophan-based motifs are present in over 450 human proteins. Using vaccinia as a surrogate cargo, we show that regions of proteins containing this motif can function to recruit KLC and promote virus transport in the absence of A36. These proteins interact with the kinesin light chain outside the context of infection and have distinct preferences for KLC1 and KLC2. Our observations demonstrate that KLC binding can be conferred by a common set of features that are found in a wide range of proteins associated with diverse cellular functions and human diseases. PMID:21915095
Niv, Masha Y; Skrabanek, Lucy; Roberts, Richard J; Scheraga, Harold A; Weinstein, Harel
2008-05-01
Restriction endonucleases (REases) are DNA-cleaving enzymes that have become indispensable tools in molecular biology. Type II REases are highly divergent in sequence despite their common structural core, function and, in some cases, common specificities towards DNA sequences. This makes it difficult to identify and classify them functionally based on sequence, and has hampered the efforts of specificity-engineering. Here, we define novel REase sequence motifs, which extend beyond the PD-(D/E)XK hallmark, and incorporate secondary structure information. The automated search using these motifs is carried out with a newly developed fast regular expression matching algorithm that accommodates long patterns with optional secondary structure constraints. Using this new tool, named Scan2S, motifs derived from REases with specificity towards GATC- and CGGG-containing DNA sequences successfully identify REases of the same specificity. Notably, some of these sequences are not identified by standard sequence detection tools. The new motifs highlight potential specificity-determining positions that do not fully overlap for the GATC- and the CCGG-recognizing REases and are candidates for specificity re-engineering.
Yang, Hui; Douglas, Ganka; Monaghan, Kristin G; Retterer, Kyle; Cho, Megan T; Escobar, Luis F; Tucker, Megan E; Stoler, Joan; Rodan, Lance H; Stein, Diane; Marks, Warren; Enns, Gregory M; Platt, Julia; Cox, Rachel; Wheeler, Patricia G; Crain, Carrie; Calhoun, Amy; Tryon, Rebecca; Richard, Gabriele; Vitazka, Patrik; Chung, Wendy K
2015-10-01
Whole-exome sequencing (WES) represents a significant breakthrough in clinical genetics, and identifies a genetic etiology in up to 30% of cases of intellectual disability (ID). Using WES, we identified seven unrelated patients with a similar clinical phenotype of severe intellectual disability or neurodevelopmental delay who were all heterozygous for de novo truncating variants in the AT-hook DNA-binding motif-containing protein 1 (AHDC1). The patients were all minimally verbal or nonverbal and had variable neurological problems including spastic quadriplegia, ataxia, nystagmus, seizures, autism, and self-injurious behaviors. Additional common clinical features include dysmorphic facial features and feeding difficulties associated with failure to thrive and short stature. The AHDC1 gene has only one coding exon, and the protein contains conserved regions including AT-hook motifs and a PDZ binding domain. We postulate that all seven variants detected in these patients result in a truncated protein missing critical functional domains, disrupting interactions with other proteins important for brain development. Our study demonstrates that truncating variants in AHDC1 are associated with ID and are primarily associated with a neurodevelopmental phenotype.
Unique Structural Features and Sequence Motifs of Proline Utilization A (PutA)
Singh, Ranjan K.; Tanner, John J.
2013-01-01
Proline utilization A proteins (PutAs) are bifunctional enzymes that catalyze the oxidation of proline to glutamate using spatially separated proline dehydrogenase and pyrroline-5-carboxylate dehydrogenase active sites. Here we use the crystal structure of the minimalist PutA from Bradyrhizobium japonicum (BjPutA) along with sequence analysis to identify unique structural features of PutAs. This analysis shows that PutAs have secondary structural elements and domains not found in the related monofunctional enzymes. Some of these extra features are predicted to be important for substrate channeling in BjPutA. Multiple sequence alignment analysis shows that some PutAs have a 17-residue conserved motif in the C-terminal 20–30 residues of the polypeptide chain. The BjPutA structure shows that this motif helps seal the internal substrate-channeling cavity from the bulk medium. Finally, it is shown that some PutAs have a 100–200 residue domain of unknown function in the C-terminus that is not found in minimalist PutAs. Remote homology detection suggests that this domain is homologous to the oligomerization beta-hairpin and Rossmann fold domain of BjPutA. PMID:22201760
Vives-Adrian, Laia; Lujan, Celia; Oliva, Baldo; van der Linden, Lonneke; Selisko, Barbara; Coutard, Bruno; Canard, Bruno; van Kuppeveld, Frank J. M.
2014-01-01
ABSTRACT Encephalomyocarditis virus (EMCV) is a member of the Cardiovirus genus within the large Picornaviridae family, which includes a number of important human and animal pathogens. The RNA-dependent RNA polymerase (RdRp) 3Dpol is a key enzyme for viral genome replication. In this study, we report the X-ray structures of two different crystal forms of the EMCV RdRp determined at 2.8- and 2.15-Å resolution. The in vitro elongation and VPg uridylylation activities of the purified enzyme have also been demonstrated. Although the overall structure of EMCV 3Dpol is shown to be similar to that of the known RdRps of other members of the Picornaviridae family, structural comparisons show a large reorganization of the active-site cavity in one of the crystal forms. The rearrangement affects mainly motif A, where the conserved residue Asp240, involved in ribonucleoside triphosphate (rNTP) selection, and its neighbor residue, Phe239, move about 10 Å from their expected positions within the ribose binding pocket toward the entrance of the rNTP tunnel. This altered conformation of motif A is stabilized by a cation-π interaction established between the aromatic ring of Phe239 and the side chain of Lys56 within the finger domain. Other contacts, involving Phe239 and different residues of motif F, are also observed. The movement of motif A is connected with important conformational changes in the finger region flanked by residues 54 to 63, harboring Lys56, and in the polymerase N terminus. The structures determined in this work provide essential information for studies on the cardiovirus RNA replication process and may have important implications for the development of new antivirals targeting the altered conformation of motif A. IMPORTANCE The Picornaviridae family is one of the largest virus families known, including many important human and animal pathogens. The RNA-dependent RNA polymerase (RdRp) 3Dpol is a key enzyme for picornavirus genome replication and a validated target for the development of antiviral therapies. Solving the X-ray structure of the first cardiovirus RdRp, EMCV 3Dpol, we captured an altered conformation of a conserved motif in the polymerase active site (motif A) containing the aspartic acid residue involved in rNTP selection and binding. This altered conformation of motif A, which interferes with the correct positioning of the rNTP substrate in the active site, is stabilized by a number of residues strictly conserved among picornaviruses. The rearrangements observed suggest that this motif A segment is a dynamic element that can be modulated by external effectors, either activating or inhibiting enzyme activity, and this type of modulation appears to be general to all picornaviruses. PMID:24600002
Bonsor, Daniel A.; Pham, Kieu T.; Beadenkopf, Robert; Diederichs, Kay; Haas, Rainer; Beckett, Dorothy; Fischer, Wolfgang; Sundberg, Eric J.
2015-01-01
Arginine-aspartate-glycine (RGD) motifs are recognized by integrins to bridge cells to one another and the extracellular matrix. RGD motifs typically reside in exposed loop conformations. X-ray crystal structures of the Helicobacter pylori protein CagL revealed that RGD motifs can also exist in helical regions of proteins. Interactions between CagL and host gastric epithelial cell via integrins are required for the translocation of the bacterial oncoprotein CagA. Here, we have investigated the molecular basis of the CagL-host cell interactions using structural, biophysical, and functional analyses. We solved an x-ray crystal structure of CagL that revealed conformational changes induced by low pH not present in previous structures. Using analytical ultracentrifugation, we found that pH-induced conformational changes in CagL occur in solution and not just in the crystalline environment. By designing numerous CagL mutants based on all available crystal structures, we probed the functional roles of CagL conformational changes on cell surface integrin engagement. Together, our data indicate that the helical RGD motif in CagL is buried by a neighboring helix at low pH to inhibit CagL binding to integrin, whereas at neutral pH the neighboring helix is displaced to allow integrin access to the CagL RGD motif. This novel molecular mechanism of regulating integrin-RGD motif interactions by changes in the chemical environment provides new insight to H. pylori-mediated oncogenesis. PMID:25837254
Hsiao, Yu-Yun; Jeng, Mei-Fen; Tsai, Wen-Chieh; Chuang, Yu-Chen; Li, Chia-Ying; Wu, Tian-Shung; Kuoh, Chang-Sheng; Chen, Wen-Huei; Chen, Hong-Hwa
2008-09-01
Geranyl diphosphate (GDP) is the precursor of monoterpenes, which are the major floral scent compounds in Phalaenopsis bellina. The cDNA of P. bellina GDP synthase (PbGDPS) was cloned, and its sequence corresponds to the second Asp-rich motif (SARM), but not to any aspartate-rich (Asp-rich) motif. The recombinant PbGDPS enzyme exhibits dual prenyltransferase activity, producing both GDP and farnesyl diphosphate (FDP), and a yeast two-hybrid assay and gel filtration revealed that PbGDPS was able to form a homodimer. Spatial and temporal expression analyses showed that the expression of PbGDPS was flower specific, and that maximal PbGDPS expression was concomitant with maximal emission of monoterpenes on day 5 post-anthesis. Homology modelling of PbGDPS indicated that the Glu-rich motif might provide a binding site for Mg(2+) and catalyze the formation of prenyl products in a similar way to SARM. Replacement of the key Glu residues with alanine totally abolished enzyme activity, whereas their mutation to Asp resulted in a mutant with two-thirds of the activity of the wild-type protein. Phylogenetic analysis indicated that plant GDPS proteins formed four clades: members of both GDPS-a and GDPS-b clades contain Asp-rich motifs, and function as homodimers. In contrast, proteins in the GDPS-c and GDPS-d clades do not contain Asp-rich motifs, but although members of the GDPS-c clade function as heterodimers, PbGDPS, which is more closely related to the GDPS-c clade proteins than to GDPS-a and GDPS-b proteins, and is currently the sole member of the GDPS-d clade, functions as a homodimer.
cWINNOWER Algorithm for Finding Fuzzy DNA Motifs
NASA Technical Reports Server (NTRS)
Liang, Shoudan
2003-01-01
The cWINNOWER algorithm detects fuzzy motifs in DNA sequences rich in protein-binding signals. A signal is defined as any short nucleotide pattern having up to d mutations differing from a motif of length l. The algorithm finds such motifs if multiple mutated copies of the motif (i.e., the signals) are present in the DNA sequence in sufficient abundance. The cWINNOWER algorithm substantially improves the sensitivity of the winnower method of Pevzner and Sze by imposing a consensus constraint, enabling it to detect much weaker signals. We studied the minimum number of detectable motifs qc as a function of sequence length N for random sequences. We found that qc increases linearly with N for a fast version of the algorithm based on counting three-member sub-cliques. Imposing consensus constraints reduces qc, by a factor of three in this case, which makes the algorithm dramatically more sensitive. Our most sensitive algorithm, which counts four-member sub-cliques, needs a minimum of only 13 signals to detect motifs in a sequence of length N = 12000 for (l,d) = (15,4).
A distance constrained synaptic plasticity model of C. elegans neuronal network
NASA Astrophysics Data System (ADS)
Badhwar, Rahul; Bagler, Ganesh
2017-03-01
Brain research has been driven by enquiry for principles of brain structure organization and its control mechanisms. The neuronal wiring map of C. elegans, the only complete connectome available till date, presents an incredible opportunity to learn basic governing principles that drive structure and function of its neuronal architecture. Despite its apparently simple nervous system, C. elegans is known to possess complex functions. The nervous system forms an important underlying framework which specifies phenotypic features associated to sensation, movement, conditioning and memory. In this study, with the help of graph theoretical models, we investigated the C. elegans neuronal network to identify network features that are critical for its control. The 'driver neurons' are associated with important biological functions such as reproduction, signalling processes and anatomical structural development. We created 1D and 2D network models of C. elegans neuronal system to probe the role of features that confer controllability and small world nature. The simple 1D ring model is critically poised for the number of feed forward motifs, neuronal clustering and characteristic path-length in response to synaptic rewiring, indicating optimal rewiring. Using empirically observed distance constraint in the neuronal network as a guiding principle, we created a distance constrained synaptic plasticity model that simultaneously explains small world nature, saturation of feed forward motifs as well as observed number of driver neurons. The distance constrained model suggests optimum long distance synaptic connections as a key feature specifying control of the network.
The tripartite motif coiled-coil is an elongated antiparallel hairpin dimer.
Sanchez, Jacint G; Okreglicka, Katarzyna; Chandrasekaran, Viswanathan; Welker, Jordan M; Sundquist, Wesley I; Pornillos, Owen
2014-02-18
Tripartite motif (TRIM) proteins make up a large family of coiled-coil-containing RING E3 ligases that function in many cellular processes, particularly innate antiviral response pathways. Both dimerization and higher-order assembly are important elements of TRIM protein function, but the atomic details of TRIM tertiary and quaternary structure have not been fully understood. Here, we present crystallographic and biochemical analyses of the TRIM coiled-coil and show that TRIM proteins dimerize by forming interdigitating antiparallel helical hairpins that position the N-terminal catalytic RING domains at opposite ends of the dimer and the C-terminal substrate-binding domains at the center. The dimer core comprises an antiparallel coiled-coil with a distinctive, symmetric pattern of flanking heptad and central hendecad repeats that appear to be conserved across the entire TRIM family. Our studies reveal how the coiled-coil organizes TRIM25 to polyubiquitylate the RIG-I/viral RNA recognition complex and how dimers of the TRIM5α protein are arranged within hexagonal arrays that recognize the HIV-1 capsid lattice and restrict retroviral replication.
The tripartite motif coiled-coil is an elongated antiparallel hairpin dimer
Sanchez, Jacint G.; Okreglicka, Katarzyna; Chandrasekaran, Viswanathan; Welker, Jordan M.; Sundquist, Wesley I.; Pornillos, Owen
2014-01-01
Tripartite motif (TRIM) proteins make up a large family of coiled-coil-containing RING E3 ligases that function in many cellular processes, particularly innate antiviral response pathways. Both dimerization and higher-order assembly are important elements of TRIM protein function, but the atomic details of TRIM tertiary and quaternary structure have not been fully understood. Here, we present crystallographic and biochemical analyses of the TRIM coiled-coil and show that TRIM proteins dimerize by forming interdigitating antiparallel helical hairpins that position the N-terminal catalytic RING domains at opposite ends of the dimer and the C-terminal substrate-binding domains at the center. The dimer core comprises an antiparallel coiled-coil with a distinctive, symmetric pattern of flanking heptad and central hendecad repeats that appear to be conserved across the entire TRIM family. Our studies reveal how the coiled-coil organizes TRIM25 to polyubiquitylate the RIG-I/viral RNA recognition complex and how dimers of the TRIM5α protein are arranged within hexagonal arrays that recognize the HIV-1 capsid lattice and restrict retroviral replication. PMID:24550273
Khandelwal, Risha; Govinda Rajan, Sriivatsan; Kumar, Raviranjan
2017-01-01
Hox mediated neuroblast apoptosis is a prevalent way to pattern larval central nervous system (CNS) by different Hox genes, but the mechanism of this apoptosis is not understood. Our studies with Abdominal-A (Abd-A) mediated larval neuroblast (pNB) apoptosis suggests that AbdA, its cofactor Extradenticle (Exd), a helix-loop-helix transcription factor Grainyhead (Grh), and Notch signaling transcriptionally contribute to expression of RHG family of apoptotic genes. We find that Grh, AbdA, and Exd function together at multiple motifs on the apoptotic enhancer. In vivo mutagenesis of these motifs suggest that they are important for the maintenance of the activity of the enhancer rather than its initiation. We also find that Exd function is independent of its known partner homothorax in this apoptosis. We extend some of our findings to Deformed expressing region of sub-esophageal ganglia where pNBs undergo a similar Hox dependent apoptosis. We propose a mechanism where common players like Exd-Grh-Notch work with different Hox genes through region specific enhancers to pattern respective segments of larval central nervous system. PMID:29023471
Khandelwal, Risha; Sipani, Rashmi; Govinda Rajan, Sriivatsan; Kumar, Raviranjan; Joshi, Rohit
2017-10-01
Hox mediated neuroblast apoptosis is a prevalent way to pattern larval central nervous system (CNS) by different Hox genes, but the mechanism of this apoptosis is not understood. Our studies with Abdominal-A (Abd-A) mediated larval neuroblast (pNB) apoptosis suggests that AbdA, its cofactor Extradenticle (Exd), a helix-loop-helix transcription factor Grainyhead (Grh), and Notch signaling transcriptionally contribute to expression of RHG family of apoptotic genes. We find that Grh, AbdA, and Exd function together at multiple motifs on the apoptotic enhancer. In vivo mutagenesis of these motifs suggest that they are important for the maintenance of the activity of the enhancer rather than its initiation. We also find that Exd function is independent of its known partner homothorax in this apoptosis. We extend some of our findings to Deformed expressing region of sub-esophageal ganglia where pNBs undergo a similar Hox dependent apoptosis. We propose a mechanism where common players like Exd-Grh-Notch work with different Hox genes through region specific enhancers to pattern respective segments of larval central nervous system.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yuan, Puwei; Bartlam, Mark; Lou, Zhiyong
2009-11-10
The heterotrimeric influenza virus polymerase, containing the PA, PB1 and PB2 proteins, catalyses viral RNA replication and transcription in the nucleus of infected cells. PB1 holds the polymerase active site and reportedly harbours endonuclease activity, whereas PB2 is responsible for cap binding. The PA amino terminus is understood to be the major functional part of the PA protein and has been implicated in several roles, including endonuclease and protease activities as well as viral RNA/complementary RNA promoter binding. Here we report the 2.2 angstrom (A) crystal structure of the N-terminal 197 residues of PA, termed PA(N), from an avian influenzamore » H5N1 virus. The PA(N) structure has an alpha/beta architecture and reveals a bound magnesium ion coordinated by a motif similar to the (P)DX(N)(D/E)XK motif characteristic of many endonucleases. Structural comparisons and mutagenesis analysis of the motif identified in PA(N) provide further evidence that PA(N) holds an endonuclease active site. Furthermore, functional analysis with in vivo ribonucleoprotein reconstitution and direct in vitro endonuclease assays strongly suggest that PA(N) holds the endonuclease active site and has critical roles in endonuclease activity of the influenza virus polymerase, rather than PB1. The high conservation of this endonuclease active site among influenza strains indicates that PA(N) is an important target for the design of new anti-influenza therapeutics.« less
Sun, Yaping; Iyer, Matthew; McEachin, Richard; Zhao, Meng; Wu, Yi-Mi; Cao, Xuhong; Oravecz-Wilson, Katherine; Zajac, Cynthia; Mathewson, Nathan; Wu, Shin-Rong Julia; Rossi, Corinne; Toubai, Tomomi; Qin, Zhaohui S.; Chinnaiya, Arul M.; Reddy, Pavan
2016-01-01
STAT3 is a master transcriptional regulator that plays an important role in the induction of both immune activation and immune tolerance in dendritic cells (DCs). The transcriptional targets of STAT3 in promoting DC activation are becoming increasingly understood; however, the mechanisms underpinning its role in causing DC suppression remain largely unknown. To determine the functional gene targets of STAT3, we compared the genome-wide binding of STAT3 using ChIP-seq coupled with gene expression microarrays to determine STAT3-dependent gene regulation in DCs after histone deacetylase (HDAC) inhibition. HDAC inhibition boosted the ability of STAT3 to bind to distinct DNA targets and regulate gene expression. Among the top 500 STAT3 binding sites, the frequency of canonical motifs was significantly higher than that of non-canonical motifs. Functional analysis revealed that after treatment with an HDAC inhibitor, the upregulated STAT3 target genes were those that were primarily the negative regulators of pro-inflammatory cytokines and those in the IL-10 signaling pathway. The downregulated STAT3-dependent targets were those involved in immune effector processes and antigen processing/presentation. The expression and functional relevance of these genes were validated. Specifically, functional studies confirmed that the upregulation of IL-10Ra by STAT3 contributed to the suppressive function of DCs following HDAC inhibition. PMID:27866206
Selective integrin endocytosis is driven by interactions between the integrin α-chain and AP2
De Franceschi, Nicola; Arjonen, Antti; Elkhatib, Nadia; Denessiouk, Konstantin; Wrobel, Antoni G; Wilson, Thomas A; Pouwels, Jeroen; Montagnac, Guillaume; Owen, David J; Ivaska, Johanna
2016-01-01
Integrins are heterodimeric cell-surface adhesion molecules comprising one of possible 18 α-chains and one of possible 8 β-chains. They control a range of cell functions in a matrix- and ligand-specific manner. Integrins can be internalised by clathrin-mediated endocytosis (CME) through β subunit-based motifs found in all integrin heterodimers. However, whether specific integrin heterodimers can be selectively endocytosed was unknown. Here, we found that a subset of α subunits contain an evolutionarily conserved and functional YxxΦ motif directing integrins to selective internalisation by the most abundant endocytic clathrin adaptor, AP2. We determined the structure of the human integrin α4-tail motif in complex with AP2 C-µ2 subunit and confirmed the interaction by isothermal titration calorimetry. Mutagenesis of the motif impaired selective heterodimer endocytosis and attenuated integrin-mediated cell migration. We propose that integrins evolved to enable selective integrin-receptor turnover in response to changing matrix conditions. PMID:26779610
Selective integrin endocytosis is driven by interactions between the integrin α-chain and AP2.
De Franceschi, Nicola; Arjonen, Antti; Elkhatib, Nadia; Denessiouk, Konstantin; Wrobel, Antoni G; Wilson, Thomas A; Pouwels, Jeroen; Montagnac, Guillaume; Owen, David J; Ivaska, Johanna
2016-02-01
Integrins are heterodimeric cell-surface adhesion molecules comprising one of 18 possible α-chains and one of eight possible β-chains. They control a range of cell functions in a matrix- and ligand-specific manner. Integrins can be internalized by clathrin-mediated endocytosis (CME) through β subunit-based motifs found in all integrin heterodimers. However, whether specific integrin heterodimers can be selectively endocytosed was unknown. Here, we found that a subset of α subunits contain an evolutionarily conserved and functional YxxΦ motif directing integrins to selective internalization by the most abundant endocytic clathrin adaptor, AP2. We determined the structure of the human integrin α4-tail motif in complex with the AP2 C-μ2 subunit and confirmed the interaction by isothermal titration calorimetry. Mutagenesis of the motif impaired selective heterodimer endocytosis and attenuated integrin-mediated cell migration. We propose that integrins evolved to enable selective integrin-receptor turnover in response to changing matrix conditions.
The bioactive acidic serine- and aspartate-rich motif peptide.
Minamizaki, Tomoko; Yoshiko, Yuji
2015-01-01
The organic component of the bone matrix comprises 40% dry weight of bone. The organic component is mostly composed of type I collagen and small amounts of non-collagenous proteins (NCPs) (10-15% of the total bone protein content). The small integrin-binding ligand N-linked glycoprotein (SIBLING) family, a NCP, is considered to play a key role in bone mineralization. SIBLING family of proteins share common structural features and includes the arginine-glycine-aspartic acid (RGD) motif and acidic serine- and aspartic acid-rich motif (ASARM). Clinical manifestations of gene mutations and/or genetically modified mice indicate that SIBLINGs play diverse roles in bone and extraskeletal tissues. ASARM peptides might not be primary responsible for the functional diversity of SIBLINGs, but this motif is suggested to be a key domain of SIBLINGs. However, the exact function of ASARM peptides is poorly understood. In this article, we discuss the considerable progress made in understanding the role of ASARM as a bioactive peptide.
The Thiamin Pyrophosphate-Motif
NASA Technical Reports Server (NTRS)
Dominiak, Paulina M.; Ciszak, Ewa M.
2003-01-01
Using databases the authors have identified a common thiamin pyrophosphate (TPP)-motif in the family of functionally diverse TPP-dependent enzymes. This common motif consists of multimeric organization of subunits, two catalytic centers, common amino acid sequence, and specific contacts to provide a flip-flop, or alternate site, mechanism of action. Each catalytic center [PP:PYR] is formed at the interface of the PP-domain binding the magnesium ion, pyrophosphate and aminopyrimidine ring of TPP, and the PYR-domain binding the aminopyrimidine ring of that cofactor. A pair of these catalytic centers constitutes the catalytic core [PP:PYR]* within these enzymes. Analysis of the structural elements of this catalytic core reveals novel definition of the common amino acid sequences, which are GX@&(G)@XXGQ, and GDGX25-30 within the PP- domain, and the E&(G)@XXG@ within the PYR-domain, where Q, corresponds to a hydrophobic amino acid. This TPP-motif provides a novel tool for annotation of TPP-dependent enzymes useful in advancing functional proteomics.
Multilayer motif analysis of brain networks
NASA Astrophysics Data System (ADS)
Battiston, Federico; Nicosia, Vincenzo; Chavez, Mario; Latora, Vito
2017-04-01
In the last decade, network science has shed new light both on the structural (anatomical) and on the functional (correlations in the activity) connectivity among the different areas of the human brain. The analysis of brain networks has made possible to detect the central areas of a neural system and to identify its building blocks by looking at overabundant small subgraphs, known as motifs. However, network analysis of the brain has so far mainly focused on anatomical and functional networks as separate entities. The recently developed mathematical framework of multi-layer networks allows us to perform an analysis of the human brain where the structural and functional layers are considered together. In this work, we describe how to classify the subgraphs of a multiplex network, and we extend the motif analysis to networks with an arbitrary number of layers. We then extract multi-layer motifs in brain networks of healthy subjects by considering networks with two layers, anatomical and functional, respectively, obtained from diffusion and functional magnetic resonance imaging. Results indicate that subgraphs in which the presence of a physical connection between brain areas (links at the structural layer) coexists with a non-trivial positive correlation in their activities are statistically overabundant. Finally, we investigate the existence of a reinforcement mechanism between the two layers by looking at how the probability to find a link in one layer depends on the intensity of the connection in the other one. Showing that functional connectivity is non-trivially constrained by the underlying anatomical network, our work contributes to a better understanding of the interplay between the structure and function in the human brain.
Finding functional features in Saccharomyces genomes by phylogenetic footprinting.
Cliften, Paul; Sudarsanam, Priya; Desikan, Ashwin; Fulton, Lucinda; Fulton, Bob; Majors, John; Waterston, Robert; Cohen, Barak A; Johnston, Mark
2003-07-04
The sifting and winnowing of DNA sequence that occur during evolution cause nonfunctional sequences to diverge, leaving phylogenetic footprints of functional sequence elements in comparisons of genome sequences. We searched for such footprints among the genome sequences of six Saccharomyces species and identified potentially functional sequences. Comparison of these sequences allowed us to revise the catalog of yeast genes and identify sequence motifs that may be targets of transcriptional regulatory proteins. Some of these conserved sequence motifs reside upstream of genes with similar functional annotations or similar expression patterns or those bound by the same transcription factor and are thus good candidates for functional regulatory sequences.
NASA Technical Reports Server (NTRS)
Donoho, Greg; Brenneman, Mark A.; Cui, Tracy X.; Donoviel, Dorit; Vogel, Hannes; Goodwin, Edwin H.; Chen, David J.; Hasty, Paul
2003-01-01
The Brca2 tumor-suppressor gene contributes to genomic stability, at least in part by a role in homologous recombinational repair. BRCA2 protein is presumed to function in homologous recombination through interactions with RAD51. Both exons 11 and 27 of Brca2 code for domains that interact with RAD51; exon 11 encodes eight BRC motifs, whereas exon 27 encodes a single, distinct interaction domain. Deletion of all RAD51-interacting domains causes embryonic lethality in mice. A less severe phenotype is seen with BRAC2 truncations that preserve some, but not all, of the BRC motifs. These mice can survive beyond weaning, but are runted and infertile, and die very young from cancer. Cells from such mice show hypersensitivity to some genotoxic agents and chromosomal instability. Here, we have analyzed mice and cells with a deletion of only the RAD51-interacting region encoded by exon 27. Mice homozygous for this mutation (called brca2(lex1)) have a shorter life span than that of control littermates, possibly because of early onsets of cancer and sepsis. No other phenotype was observed in these animals; therefore, the brca2(lex1) mutation is less severe than truncations that delete some BRC motifs. However, at the cellular level, the brca2(lex1) mutation causes reduced viability, hypersensitivity to the DNA interstrand crosslinking agent mitomycin C, and gross chromosomal instability, much like more severe truncations. Thus, the extreme carboxy-terminal region encoded by exon 27 is important for BRCA2 function, probably because it is required for a fully functional interaction between BRCA2 and RAD51. Copyright 2003 Wiley-Liss, Inc.
USDA-ARS?s Scientific Manuscript database
G4-quadruplexes are reversible DNA structures that likely function in gene regulation, but exactly how they work is not known. G4 DNA can be predicted from sequence motifs such as the pattern G-G-G-N(1,7)-G-G-G-N(1,7)-G-G-G-N(1,7)-G-G-G-N(1,7). In the maize genome, G4 motifs were found to occupy ...
NK cell activation: distinct stimulatory pathways counterbalancing inhibitory signals.
Bakker, A B; Wu, J; Phillips, J H; Lanier, L L
2000-01-01
A delicate balance between positive and negative signals regulates NK cell effector function. Activation of NK cells may be initiated by the triggering of multiple adhesion or costimulatory molecules, and can be counterbalanced by inhibitory signals induced by receptors for MHC class I. A common pathway of inhibitory signaling is provided by immunoreceptor tyrosine-based inhibitory motifs (ITIMs) in the cytoplasmic domains of these receptors which mediate the recruitment of SH2 domain-bearing tyrosine phosphate-1 (SHP-1). In contrast to the extensive progress that has been made regarding the negative regulation of NK cell function, our knowledge of the signals that activate NK cells is still poor. Recent studies of the activating receptor complexes have shed new light on the induction of NK cell effector function. Several NK receptors using novel adaptors with immunoreceptor tyrosine-based activation motifs (ITAMs) and with PI 3-kinase recruiting motifs have been implicated in NK cell stimulation.
The mechanism of transforming diamond nanowires to carbon nanostructures.
Sorkin, Anastassia; Su, Haibin
2014-01-24
The transformation of diamond nanowires (DNWs) with different diameters and geometries upon heating is investigated with density-functional-based tight-binding molecular dynamics. DNWs of {100} and {111} oriented cross-section with projected average line density between 7 and 20 atoms Å(-1) transform into carbon nanotubes (CNTs) under gradual heating up to 3500-4000 K. DNWs with projected average line density larger than 25 atoms Å(-1) transform into double-wall CNTs. The route of transformation into CNTs clearly exhibits three stages, with the intriguing intermediate structural motif of a carbon nanoscroll (CNS). Moreover, the morphology plays an important role in the transformation involving the CNS as one important intermediate motif to form CNTs. When starting with [Formula: see text] oriented DNWs with a square cross-section consisting of two {111} facets facing each other, one interesting structure with 'nano-bookshelf' shape emerges: a number of graphene 'shelves' located inside the CNT, bonding to the CNT walls with sp(3) hybridized atoms. The nano-bookshelf structures exist in a wide range of temperatures up to 3,000 K. The further transformation from nano-bookshelf structures depends on the strength of the joints connecting shelves with CNT walls. Notably, the nano-bookshelf structure can evolve into two end products: one is CNT via the CNS pathway, the other is graphene transformed directly from the nano-bookshelf structure at high temperature. This work sheds light on the microscopic insight of carbon nanostructure formation mechanisms with the featured motifs highlighted in the pathways.
ssHMM: extracting intuitive sequence-structure motifs from high-throughput RNA-binding protein data
Krestel, Ralf; Ohler, Uwe; Vingron, Martin; Marsico, Annalisa
2017-01-01
Abstract RNA-binding proteins (RBPs) play an important role in RNA post-transcriptional regulation and recognize target RNAs via sequence-structure motifs. The extent to which RNA structure influences protein binding in the presence or absence of a sequence motif is still poorly understood. Existing RNA motif finders either take the structure of the RNA only partially into account, or employ models which are not directly interpretable as sequence-structure motifs. We developed ssHMM, an RNA motif finder based on a hidden Markov model (HMM) and Gibbs sampling which fully captures the relationship between RNA sequence and secondary structure preference of a given RBP. Compared to previous methods which output separate logos for sequence and structure, it directly produces a combined sequence-structure motif when trained on a large set of sequences. ssHMM’s model is visualized intuitively as a graph and facilitates biological interpretation. ssHMM can be used to find novel bona fide sequence-structure motifs of uncharacterized RBPs, such as the one presented here for the YY1 protein. ssHMM reaches a high motif recovery rate on synthetic data, it recovers known RBP motifs from CLIP-Seq data, and scales linearly on the input size, being considerably faster than MEMERIS and RNAcontext on large datasets while being on par with GraphProt. It is freely available on Github and as a Docker image. PMID:28977546
Mapping and analysis of Caenorhabditis elegans transcription factor sequence specificities
Narasimhan, Kamesh; Lambert, Samuel A; Yang, Ally WH; Riddell, Jeremy; Mnaimneh, Sanie; Zheng, Hong; Albu, Mihai; Najafabadi, Hamed S; Reece-Hoyes, John S; Fuxman Bass, Juan I; Walhout, Albertha JM; Weirauch, Matthew T; Hughes, Timothy R
2015-01-01
Caenorhabditis elegans is a powerful model for studying gene regulation, as it has a compact genome and a wealth of genomic tools. However, identification of regulatory elements has been limited, as DNA-binding motifs are known for only 71 of the estimated 763 sequence-specific transcription factors (TFs). To address this problem, we performed protein binding microarray experiments on representatives of canonical TF families in C. elegans, obtaining motifs for 129 TFs. Additionally, we predict motifs for many TFs that have DNA-binding domains similar to those already characterized, increasing coverage of binding specificities to 292 C. elegans TFs (∼40%). These data highlight the diversification of binding motifs for the nuclear hormone receptor and C2H2 zinc finger families and reveal unexpected diversity of motifs for T-box and DM families. Motif enrichment in promoters of functionally related genes is consistent with known biology and also identifies putative regulatory roles for unstudied TFs. DOI: http://dx.doi.org/10.7554/eLife.06967.001 PMID:25905672
Identifying novel sequence variants of RNA 3D motifs
Zirbel, Craig L.; Roll, James; Sweeney, Blake A.; Petrov, Anton I.; Pirrung, Meg; Leontis, Neocles B.
2015-01-01
Predicting RNA 3D structure from sequence is a major challenge in biophysics. An important sub-goal is accurately identifying recurrent 3D motifs from RNA internal and hairpin loop sequences extracted from secondary structure (2D) diagrams. We have developed and validated new probabilistic models for 3D motif sequences based on hybrid Stochastic Context-Free Grammars and Markov Random Fields (SCFG/MRF). The SCFG/MRF models are constructed using atomic-resolution RNA 3D structures. To parameterize each model, we use all instances of each motif found in the RNA 3D Motif Atlas and annotations of pairwise nucleotide interactions generated by the FR3D software. Isostericity relations between non-Watson–Crick basepairs are used in scoring sequence variants. SCFG techniques model nested pairs and insertions, while MRF ideas handle crossing interactions and base triples. We use test sets of randomly-generated sequences to set acceptance and rejection thresholds for each motif group and thus control the false positive rate. Validation was carried out by comparing results for four motif groups to RMDetect. The software developed for sequence scoring (JAR3D) is structured to automatically incorporate new motifs as they accumulate in the RNA 3D Motif Atlas when new structures are solved and is available free for download. PMID:26130723
Bandyopadhyay, Deepak; Huan, Jun; Prins, Jan; Snoeyink, Jack; Wang, Wei; Tropsha, Alexander
2009-11-01
Protein function prediction is one of the central problems in computational biology. We present a novel automated protein structure-based function prediction method using libraries of local residue packing patterns that are common to most proteins in a known functional family. Critical to this approach is the representation of a protein structure as a graph where residue vertices (residue name used as a vertex label) are connected by geometrical proximity edges. The approach employs two steps. First, it uses a fast subgraph mining algorithm to find all occurrences of family-specific labeled subgraphs for all well characterized protein structural and functional families. Second, it queries a new structure for occurrences of a set of motifs characteristic of a known family, using a graph index to speed up Ullman's subgraph isomorphism algorithm. The confidence of function inference from structure depends on the number of family-specific motifs found in the query structure compared with their distribution in a large non-redundant database of proteins. This method can assign a new structure to a specific functional family in cases where sequence alignments, sequence patterns, structural superposition and active site templates fail to provide accurate annotation.
Motifs in triadic random graphs based on Steiner triple systems
NASA Astrophysics Data System (ADS)
Winkler, Marco; Reichardt, Jörg
2013-08-01
Conventionally, pairwise relationships between nodes are considered to be the fundamental building blocks of complex networks. However, over the last decade, the overabundance of certain subnetwork patterns, i.e., the so-called motifs, has attracted much attention. It has been hypothesized that these motifs, instead of links, serve as the building blocks of network structures. Although the relation between a network's topology and the general properties of the system, such as its function, its robustness against perturbations, or its efficiency in spreading information, is the central theme of network science, there is still a lack of sound generative models needed for testing the functional role of subgraph motifs. Our work aims to overcome this limitation. We employ the framework of exponential random graph models (ERGMs) to define models based on triadic substructures. The fact that only a small portion of triads can actually be set independently poses a challenge for the formulation of such models. To overcome this obstacle, we use Steiner triple systems (STSs). These are partitions of sets of nodes into pair-disjoint triads, which thus can be specified independently. Combining the concepts of ERGMs and STSs, we suggest generative models capable of generating ensembles of networks with nontrivial triadic Z-score profiles. Further, we discover inevitable correlations between the abundance of triad patterns, which occur solely for statistical reasons and need to be taken into account when discussing the functional implications of motif statistics. Moreover, we calculate the degree distributions of our triadic random graphs analytically.
Hobo, T; Asada, M; Kowyama, Y; Hattori, T
1999-09-01
ACGT-containing ABA response elements (ABREs) have been functionally identified in the promoters of various genes. In addition, single copies of ABRE have been found to require a cis-acting, coupling element to achieve ABA induction. A coupling element 3 (CE3) sequence, originally identified as such in the barley HVA1 promoter, is found approximately 30 bp downstream of motif A (ACGT-containing ABRE) in the promoter of the Osem gene. The relationship between these two elements was further defined by linker-scan analyses of a 55 bp fragment of the Osem promoter, which is sufficient for ABA-responsiveness and VP1 activation. The analyses revealed that both motif A and CE3 sequence were required not only for ABA-responsiveness but also for VP1 activation. Since the sequences of motif A and CE3 were found to be similar, motif-exchange experiments were carried out. The experiments demonstrated that motif A and CE3 were interchangeable by each other with respect to both ABA and VP1 regulation. In addition, both sequences were shown to be recognized by a VP1-interacting, ABA-responsive bZIP factor TRAB1. These results indicate that ACGT-containing ABREs and CE3 are functionally equivalent cis-acting elements. Furthermore, TRAB1 was shown to bind two other non-ACGT ABREs. Based on these results, all these ABREs including CE3 are proposed to be categorized into a single class of cis-acting elements.
Garcia, Mayra L.; Reynolds, Tracy D.; Mothes, Walther; Robek, Michael D.
2013-01-01
The hepatitis B virus (HBV) Core protein encodes a late (L)-domain like motif (129PPAYRPPNAP138) that has been purported to serve as a docking site for recruitment of host factors such as Nedd4 that can mediate viral particle release from infected cells. However, mutation of this region of Core typically disrupts nucleocapsid formation in the cytoplasm, making it difficult to ascertain if the Core PPAY motif constitutes a functional L-domain that mediates HBV release in the context of replicating virus. Since many viral L-domains are functionally interchangeable between different virus families, and such swapping experiments have been used as a tool to identify other viral sequences with L-domain activity, we generated chimeric constructs between murine leukemia virus (MLV) Gag and HBV Core to determine if the potential HBV L-domain motif is sufficient to stimulate virus release. We found that the HBV Core PPAY motif, but not the PNAP motif, demonstrates L-domain activity in the context of MLV replication to direct virus release and infectious virion production. Additionally, we found that overexpression of the cellular Nedd4 or WWP1 ubiquitin ligases stimulates release of a partially defective PPAY domain mutant, providing further evidence supporting a role for the Nedd4 ubiquitin ligase in promoting HBV release. These studies lend further insight into the mechanisms used by HBV to mediate its release from infected cells. PMID:24009707
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shih, Chun-Ho; Chiang, Tin-Bin; Wang, Wen-Jeng, E-mail: wjwang@mail.cgust.edu.tw
Convulxin (CVX), a C-type lectin-like protein (CLPs), is a potent platelet aggregation inducer. To evaluate its potential applications in angiogenic diseases, the multimeric CVX were further explored on its mode of actions toward human coronary artery smooth muscle cells (HCASMCs). The N-terminus of β-chain of CVX (CVX-β) contains a putative disintegrin-like domain with a conserved motif upon the sequence comparison with other CLPs. Importantly, native CVX had no cytotoxic activity as examined by electrophoretic pattern. A Trp-Ala–Asp (WAD)-containing octapeptide, MTWADAEK, was thereafter synthesized and analyzed in functional assays. In the case of specific integrin antagonists as positive controls, the anti-angiogenicmore » effects of CVX on HCASMCs were investigated by series of functional analyses. CVX showed to exhibit multiple inhibitory activities toward HCASMCs proliferation, adhesion and invasion with a dose- and integrin αvβ3-dependent fashion. However, the WAD-octapeptide exerting a minor potency could also work as an active peptidomimetic. In addition, flow cytometric analysis demonstrated both the intact CVX and synthetic peptide can specifically interact with integrin-αv on HCASMCs and CVX was shown to have a down-regulatory effect on the gene expression of CXC-chemokines, such as growth-related oncogene and interleukin-8. According to nuclear factor-κB (NF-κB) p65 translocation assay and Western blotting analysis, the NF-κB activation was not involved in the signaling events of CVX-induced gene expression. In conclusion, CVX may act as a disintegrin-like protein via the interactions of WAD-motif in CVX-β with integrin-αv on HCASMCs and it also is a gene suppressor with the ability to diminish the expression of two CXC-chemokines in a NF-κB-independent manner. Indeed, more extensive investigations are needed and might create a new avenue for the development of a novel angiostatic agent. - Highlights: • The tetrameric convulxin (CVX) with WAD-motif could affect HCASMC multi-functions. • CVX significantly diminished cell proliferation, adhesion and invasion in HCASMCs. • The WAD-motif/integrin-αv interaction was involved in its suppressive mechanism. • CVX impaired the gene expression of GRO and IL-8 with a NF-κB independent manner.« less
MotifMark: Finding regulatory motifs in DNA sequences.
Hassanzadeh, Hamid Reza; Kolhe, Pushkar; Isbell, Charles L; Wang, May D
2017-07-01
The interaction between proteins and DNA is a key driving force in a significant number of biological processes such as transcriptional regulation, repair, recombination, splicing, and DNA modification. The identification of DNA-binding sites and the specificity of target proteins in binding to these regions are two important steps in understanding the mechanisms of these biological activities. A number of high-throughput technologies have recently emerged that try to quantify the affinity between proteins and DNA motifs. Despite their success, these technologies have their own limitations and fall short in precise characterization of motifs, and as a result, require further downstream analysis to extract useful and interpretable information from a haystack of noisy and inaccurate data. Here we propose MotifMark, a new algorithm based on graph theory and machine learning, that can find binding sites on candidate probes and rank their specificity in regard to the underlying transcription factor. We developed a pipeline to analyze experimental data derived from compact universal protein binding microarrays and benchmarked it against two of the most accurate motif search methods. Our results indicate that MotifMark can be a viable alternative technique for prediction of motif from protein binding microarrays and possibly other related high-throughput techniques.
Zhang, Hong; Hu, Weiguo; Hao, Jilei; Lv, Shikai; Wang, Changyou; Tong, Wei; Wang, Yajuan; Wang, Yanzhen; Liu, Xinlun; Ji, Wanquan
2016-03-15
Stripe rust (Puccinia striiformis f. sp. tritici; Pst) and powdery mildew (Blumeria graminis f. sp. tritici; Bgt) are important diseases of wheat (Triticum aestivum) worldwide. Increasingly evidences suggest that long intergenic ncRNAs (lincRNAs) are developmentally regulated and play important roles in development and stress responses of plants. However, identification of lincRNAs in wheat is still limited comparing with functional gene expression. The transcriptome of the hexaploid wheat line N9134 inoculated with the Chinese Pst race CYR31 and Bgt race E09 at 1, 2, and 3 days post-inoculation was recapitulated to detect the lincRNAs. Here, 283 differential expressed lincRNAs were identified from 58218 putative lincRNAs, which account for 31.2% of transcriptome. Of which, 254 DE-LincRNAs responded to the Bgt stress, and 52 lincRNAs in Pst. Among them, 1328 SnRNP motifs (sm sites) were detected and showed RRU4-11RR sm site element and consensus RRU1-9VU1-7RR SnRNP motifs, where the total number of uridine was more than 3 but less than 11. Additionally, 101 DE-lincRNAs were predicted as targets of miRNA by psRNATarget, while 5 target mimics were identified using target mimicry search in TAPIR. Taken together, our findings indicate that the lincRNA of wheat responded to Bgt and Pst stress and played important roles in splicesome and inter-regulating with miRNA. The sm site of wheat showed a more complex construction than that in mammal and model plant. The mass sequence data generated in this study provide a cue for future functional and molecular research on wheat-fungus interactions.
Properties of the [NiFe]-hydrogenase maturation protein HypD.
Blokesch, Melanie; Böck, August
2006-07-24
A mutational screen of amino acid residues of hydrogenase maturation protein HypD from Escherichia coli disclosed that seven conserved cysteine residues located in three different motifs in HypD are essential. Evidence is presented for potential functions of these motifs in the maturation process.
Different effects of the TAR structure on HIV-1 and HIV-2 genomic RNA translation
Soto-Rifo, Ricardo; Limousin, Taran; Rubilar, Paulina S.; Ricci, Emiliano P.; Décimo, Didier; Moncorgé, Olivier; Trabaud, Mary-Anne; André, Patrice; Cimarelli, Andrea; Ohlmann, Théophile
2012-01-01
The 5′-untranslated region (5′-UTR) of the genomic RNA of human immunodeficiency viruses type-1 (HIV-1) and type-2 (HIV-2) is composed of highly structured RNA motifs essential for viral replication that are expected to interfere with Gag and Gag-Pol translation. Here, we have analyzed and compared the properties by which the viral 5′-UTR drives translation from the genomic RNA of both human immunodeficiency viruses. Our results showed that translation from the HIV-2 gRNA was very poor compared to that of HIV-1. This was rather due to the intrinsic structural motifs in their respective 5′-UTR without involvement of any viral protein. Further investigation pointed to a different role of TAR RNA, which was much inhibitory for HIV-2 translation. Altogether, these data highlight important structural and functional differences between these two human pathogens. PMID:22121214
Effective Feature Selection for Classification of Promoter Sequences.
K, Kouser; P G, Lavanya; Rangarajan, Lalitha; K, Acharya Kshitish
2016-01-01
Exploring novel computational methods in making sense of biological data has not only been a necessity, but also productive. A part of this trend is the search for more efficient in silico methods/tools for analysis of promoters, which are parts of DNA sequences that are involved in regulation of expression of genes into other functional molecules. Promoter regions vary greatly in their function based on the sequence of nucleotides and the arrangement of protein-binding short-regions called motifs. In fact, the regulatory nature of the promoters seems to be largely driven by the selective presence and/or the arrangement of these motifs. Here, we explore computational classification of promoter sequences based on the pattern of motif distributions, as such classification can pave a new way of functional analysis of promoters and to discover the functionally crucial motifs. We make use of Position Specific Motif Matrix (PSMM) features for exploring the possibility of accurately classifying promoter sequences using some of the popular classification techniques. The classification results on the complete feature set are low, perhaps due to the huge number of features. We propose two ways of reducing features. Our test results show improvement in the classification output after the reduction of features. The results also show that decision trees outperform SVM (Support Vector Machine), KNN (K Nearest Neighbor) and ensemble classifier LibD3C, particularly with reduced features. The proposed feature selection methods outperform some of the popular feature transformation methods such as PCA and SVD. Also, the methods proposed are as accurate as MRMR (feature selection method) but much faster than MRMR. Such methods could be useful to categorize new promoters and explore regulatory mechanisms of gene expressions in complex eukaryotic species.
Efficient activation of transcription in yeast by the BPV1 E2 protein.
Stanway, C A; Sowden, M P; Wilson, L E; Kingsman, A J; Kingsman, S M
1989-01-01
The full-length gene product encoded by the E2 open reading frame (ORF) of bovine papillomavirus type 1 (BPV1) is a transcriptional transactivator. It is believed to mediate its effect on the BPV1 long control region (LCR) by binding to motifs with the consensus sequence ACCN6GGT. The minimal functional cis active site, called the E2 response element (E2RE), in mammalian cells comprises two copies of this motif. Here we have shown that E2 can function in Saccharomyces cerevisiae by placing an E2RE upstream of a synthetic yeast assay promoter which consists of a TATA motif and an mRNA initiation site, spaced correctly. This E2RE-minimal promoter is only transcriptionally active in the presence of E2 protein and the resulting mRNA is initiated at the authentic start site. This is the first report of a mammalian viral transactivator functioning in yeast. The level of activation by E2 via the E2RE was the same as observed with the highly efficient authentic PGK promoter where the upstream activation sequence is composed of three distinct elements. Furthermore a single E2 motif which is insufficient in mammalian cells as an activation site was as efficiently utilized in yeast as the E2RE (2 motifs). Previous studies have shown that mammalian cellular activators can function in yeast and our data now extend this to viral-specific activators. Our data indicate however that while the mechanism of transactivation is broadly conserved there may be significant differences at the detailed level. Images PMID:2539584
Martínez, Miguel A.; Verdaguer, Nuria; Mateu, Mauricio G.; Domingo, Esteban
1997-01-01
Aphthoviruses use a conserved Arg-Gly-Asp triplet for attachment to host cells and this motif is believed to be essential for virus viability. Here we report that this triplet—which is also a widespread motif involved in cell-to-cell adhesion—can become dispensable upon short-term evolution of the virus harboring it. Foot-and-mouth disease virus (FMDV), which was multiply passaged in cell culture, showed an altered repertoire of antigenic variants resistant to a neutralizing monoclonal antibody. The altered repertoire includes variants with substitutions at the Arg-Gly-Asp motif. Mutants lacking this sequence replicated normally in cell culture and were indistinguishable from the parental virus. Studies with individual FMDV clones indicate that amino acid replacements on the capsid surface located around the loop harboring the Arg-Gly-Asp triplet may mediate in the dispensability of this motif. The results show that FMDV quasispecies evolving in a constant biological environment have the capability of rendering totally dispensable a receptor recognition motif previously invariant, and to ensure an alternative pathway for normal viral replication. Thus, variability of highly conserved motifs, even those that viruses have adapted from functional cellular motifs, can contribute to phenotypic flexibility of RNA viruses in nature. PMID:9192645
Conserved binding of GCAC motifs by MEC-8, couch potato, and the RBPMS protein family
Soufari, Heddy
2017-01-01
Precise regulation of mRNA processing, translation, localization, and stability relies on specific interactions with RNA-binding proteins whose biological function and target preference are dictated by their preferred RNA motifs. The RBPMS family of RNA-binding proteins is defined by a conserved RNA recognition motif (RRM) domain found in metazoan RBPMS/Hermes and RBPMS2, Drosophila couch potato, and MEC-8 from Caenorhabditis elegans. In order to determine the parameters of RNA sequence recognition by the RBPMS family, we have first used the N-terminal domain from MEC-8 in binding assays and have demonstrated a preference for two GCAC motifs optimally separated by >6 nucleotides (nt). We have also determined the crystal structure of the dimeric N-terminal RRM domain from MEC-8 in the unbound form, and in complex with an oligonucleotide harboring two copies of the optimal GCAC motif. The atomic details reveal the molecular network that provides specificity to all four bases in the motif, including multiple hydrogen bonds to the initial guanine. Further studies with human RBPMS, as well as Drosophila couch potato, confirm a general preference for this double GCAC motif by other members of the protein family and the presence of this motif in known targets. PMID:28003515
BlockLogo: visualization of peptide and sequence motif conservation
Olsen, Lars Rønn; Kudahl, Ulrich Johan; Simon, Christian; Sun, Jing; Schönbach, Christian; Reinherz, Ellis L.; Zhang, Guang Lan; Brusic, Vladimir
2013-01-01
BlockLogo is a web-server application for visualization of protein and nucleotide fragments, continuous protein sequence motifs, and discontinuous sequence motifs using calculation of block entropy from multiple sequence alignments. The user input consists of a multiple sequence alignment, selection of motif positions, type of sequence, and output format definition. The output has BlockLogo along with the sequence logo, and a table of motif frequencies. We deployed BlockLogo as an online application and have demonstrated its utility through examples that show visualization of T-cell epitopes and B-cell epitopes (both continuous and discontinuous). Our additional example shows a visualization and analysis of structural motifs that determine specificity of peptide binding to HLA-DR molecules. The BlockLogo server also employs selected experimentally validated prediction algorithms to enable on-the-fly prediction of MHC binding affinity to 15 common HLA class I and class II alleles as well as visual analysis of discontinuous epitopes from multiple sequence alignments. It enables the visualization and analysis of structural and functional motifs that are usually described as regular expressions. It provides a compact view of discontinuous motifs composed of distant positions within biological sequences. BlockLogo is available at: http://research4.dfci.harvard.edu/cvc/blocklogo/ and http://methilab.bu.edu/blocklogo/ PMID:24001880
An experimental test of a fundamental food web motif.
Rip, Jason M K; McCann, Kevin S; Lynn, Denis H; Fawcett, Sonia
2010-06-07
Large-scale changes to the world's ecosystem are resulting in the deterioration of biostructure-the complex web of species interactions that make up ecological communities. A difficult, yet crucial task is to identify food web structures, or food web motifs, that are the building blocks of this baroque network of interactions. Once identified, these food web motifs can then be examined through experiments and theory to provide mechanistic explanations for how structure governs ecosystem stability. Here, we synthesize recent ecological research to show that generalist consumers coupling resources with different interaction strengths, is one such motif. This motif amazingly occurs across an enormous range of spatial scales, and so acts to distribute coupled weak and strong interactions throughout food webs. We then perform an experiment that illustrates the importance of this motif to ecological stability. We find that weak interactions coupled to strong interactions by generalist consumers dampen strong interaction strengths and increase community stability. This study takes a critical step by isolating a common food web motif and through clear, experimental manipulation, identifies the fundamental stabilizing consequences of this structure for ecological communities.
Discovering Motifs in Biological Sequences Using the Micron Automata Processor.
Roy, Indranil; Aluru, Srinivas
2016-01-01
Finding approximately conserved sequences, called motifs, across multiple DNA or protein sequences is an important problem in computational biology. In this paper, we consider the (l, d) motif search problem of identifying one or more motifs of length l present in at least q of the n given sequences, with each occurrence differing from the motif in at most d substitutions. The problem is known to be NP-complete, and the largest solved instance reported to date is (26,11). We propose a novel algorithm for the (l,d) motif search problem using streaming execution over a large set of non-deterministic finite automata (NFA). This solution is designed to take advantage of the micron automata processor, a new technology close to deployment that can simultaneously execute multiple NFA in parallel. We demonstrate the capability for solving much larger instances of the (l, d) motif search problem using the resources available within a single automata processor board, by estimating run-times for problem instances (39,18) and (40,17). The paper serves as a useful guide to solving problems using this new accelerator technology.
Dynamics of brain activity underlying working memory for music in a naturalistic condition.
Burunat, Iballa; Alluri, Vinoo; Toiviainen, Petri; Numminen, Jussi; Brattico, Elvira
2014-08-01
We aimed at determining the functional neuroanatomy of working memory (WM) recognition of musical motifs that occurs while listening to music by adopting a non-standard procedure. Western tonal music provides naturally occurring repetition and variation of motifs. These serve as WM triggers, thus allowing us to study the phenomenon of motif tracking within real music. Adopting a modern tango as stimulus, a behavioural test helped to identify the stimulus motifs and build a time-course regressor of WM neural responses. This regressor was then correlated with the participants' (musicians') functional magnetic resonance imaging (fMRI) signal obtained during a continuous listening condition. In order to fine-tune the identification of WM processes in the brain, the variance accounted for by the sensory processing of a set of the stimulus' acoustic features was pruned from participants' neurovascular responses to music. Motivic repetitions activated prefrontal and motor cortical areas, basal ganglia, medial temporal lobe (MTL) structures, and cerebellum. The findings suggest that WM processing of motifs while listening to music emerges from the integration of neural activity distributed over cognitive, motor and limbic subsystems. The recruitment of the hippocampus stands as a novel finding in auditory WM. Effective connectivity and agglomerative hierarchical clustering analyses indicate that the hippocampal connectivity is modulated by motif repetitions, showing strong connections with WM-relevant areas (dorsolateral prefrontal cortex - dlPFC, supplementary motor area - SMA, and cerebellum), which supports the role of the hippocampus in the encoding of the musical motifs in WM, and may evidence long-term memory (LTM) formation, enabled by the use of a realistic listening condition. Copyright © 2014 Elsevier Ltd. All rights reserved.
Computational and experimental analysis of short peptide motifs for enzyme inhibition.
Fu, Jinglin; Larini, Luca; Cooper, Anthony J; Whittaker, John W; Ahmed, Azka; Dong, Junhao; Lee, Minyoung; Zhang, Ting
2017-01-01
The metabolism of living systems involves many enzymes that play key roles as catalysts and are essential to biological function. Searching ligands with the ability to modulate enzyme activities is central to diagnosis and therapeutics. Peptides represent a promising class of potential enzyme modulators due to the large chemical diversity, and well-established methods for library synthesis. Peptides and their derivatives are found to play critical roles in modulating enzymes and mediating cellular uptakes, which are increasingly valuable in therapeutics. We present a methodology that uses molecular dynamics (MD) and point-variant screening to identify short peptide motifs that are critical for inhibiting β-galactosidase (β-Gal). MD was used to simulate the conformations of peptides and to suggest short motifs that were most populated in simulated conformations. The function of the simulated motifs was further validated by the experimental point-variant screening as critical segments for inhibiting the enzyme. Based on the validated motifs, we eventually identified a 7-mer short peptide for inhibiting an enzyme with low μM IC50. The advantage of our methodology is the relatively simplified simulation that is informative enough to identify the critical sequence of a peptide inhibitor, with a precision comparable to truncation and alanine scanning experiments. Our combined experimental and computational approach does not rely on a detailed understanding of mechanistic and structural details. The MD simulation suggests the populated motifs that are consistent with the results of the experimental alanine and truncation scanning. This approach appears to be applicable to both natural and artificial peptides. With more discovered short motifs in the future, they could be exploited for modulating biocatalysis, and developing new medicine.
Ito, Masaki; Hayashi, Kazumi; Minamisawa, Tamiko; Homma, Sadamu; Koido, Shigeo; Shiba, Kiyotaka
2017-01-01
Adjuvants are indispensable for achieving a sufficient immune response from vaccinations. From a functional viewpoint, adjuvants are classified into two categories: "physical adjuvants" increase the efficacy of antigen presentation by antigen-presenting cells (APC) and "signal adjuvants" induce the maturation of APC. Our previous study has demonstrated that a physical adjuvant can be encrypted into proteinous antigens by creating artificial proteins from combinatorial assemblages of epitope peptides and those peptide sequences having propensities to form certain protein structures (motif programming). However, the artificial antigens still require a signal adjuvant to maturate the APC; for example, co-administration of the Toll-like receptor 4 (TLR4) agonist monophosphoryl lipid A (MPLA) was required to induce an in vivo immunoreaction. In this study, we further modified the previous artificial antigens by appending the peptide motifs, which have been reported to have agonistic activity for TLR4, to create "adjuvant-free" antigens. The created antigens with triple TLR4 agonistic motifs in their C-terminus have activated NF-κB signaling pathways through TLR4. These proteins also induced the production of the inflammatory cytokine TNF-α, and the expression of the co-stimulatory molecule CD40 in APC, supporting the maturation of APC in vitro. Unexpectedly, these signal adjuvant-encrypted proteins have lost their ability to be physical adjuvants because they did not induce cytotoxic T lymphocytes (CTL) in vivo, while the parental proteins induced CTL. These results confirmed that the manifestation of a motif's function is context-dependent and simple addition does not always work for motif-programing. Further optimization of the molecular context of the TLR4 agonistic motifs in antigens should be required to create adjuvant-free antigens.
Ye, Ping; Peyser, Brian D; Spencer, Forrest A; Bader, Joel S
2005-01-01
Background In a genetic interaction, the phenotype of a double mutant differs from the combined phenotypes of the underlying single mutants. When the single mutants have no growth defect, but the double mutant is lethal or exhibits slow growth, the interaction is termed synthetic lethality or synthetic fitness. These genetic interactions reveal gene redundancy and compensating pathways. Recently available large-scale data sets of genetic interactions and protein interactions in Saccharomyces cerevisiae provide a unique opportunity to elucidate the topological structure of biological pathways and how genes function in these pathways. Results We have defined congruent genes as pairs of genes with similar sets of genetic interaction partners and constructed a genetic congruence network by linking congruent genes. By comparing path lengths in three types of networks (genetic interaction, genetic congruence, and protein interaction), we discovered that high genetic congruence not only exhibits correlation with direct protein interaction linkage but also exhibits commensurate distance with the protein interaction network. However, consistent distances were not observed between genetic and protein interaction networks. We also demonstrated that congruence and protein networks are enriched with motifs that indicate network transitivity, while the genetic network has both transitive (triangle) and intransitive (square) types of motifs. These results suggest that robustness of yeast cells to gene deletions is due in part to two complementary pathways (square motif) or three complementary pathways, any two of which are required for viability (triangle motif). Conclusion Genetic congruence is superior to genetic interaction in prediction of protein interactions and function associations. Genetically interacting pairs usually belong to parallel compensatory pathways, which can generate transitive motifs (any two of three pathways needed) or intransitive motifs (either of two pathways needed). PMID:16283923
A RHIM with a View: FLYing with Functional Amyloids.
Shin, Sunny; Cherry, Sara
2017-10-17
Recognition of bacterial peptidoglycan by the Drosophila IMD pathway triggers NF-κB activation and an associated immune response. In this issue of Immunity, Kleino et al. (2017) show that proteins in the IMD pathway form functional amyloids via a cryptic motif resembling the RHIM motif found in mammalian RIPK proteins. Amyloid formation can be negatively regulated, suggesting that it presents a regulatory point in multiple biological processes. Copyright © 2017 Elsevier Inc. All rights reserved.
The K-turn motif in riboswitches and other RNA species☆
Lilley, David M.J.
2014-01-01
The kink turn is a widespread structure motif that introduces a tight bend into the axis of duplex RNA. This generally functions to mediate tertiary interactions, and to serve as a specific protein binding site. K-turns or closely related structures are found in at least seven different riboswitch structures, where they function as key architectural elements that help generate the ligand binding pocket. This article is part of a Special Issue entitled: Riboswitches. PMID:24798078
Sehra, Bhupinder; Franks, Robert G.
2017-01-01
In the Arabidopsis thaliana seed pod, pod shatter and seed dispersal properties are in part determined by the development of a longitudinally orientated dehiscence zone (DZ) that derives from cells of the gynoecial valve margin (VM). Transcriptional regulation of the MADS protein encoding transcription factors genes SHATTERPROOF1 (SHP1) and SHATTERPROOF2 (SHP2) are critical for proper VM identity specification and later on for DZ development. Current models of SHP1 and SHP2 regulation indicate that the transcription factors FRUITFULL (FUL) and REPLUMLESS (RPL) repress these SHP genes in the developing valve and replum domains, respectively. Thus the expression of the SHP genes is restricted to the VM. FUL encodes a MADS-box containing transcription factor that is predicted to act through CArG-box containing cis-regulatory motifs. Here we delimit functional modules within the SHP2 cis-regulatory region and examine the functional importance of CArG box motifs within these regulatory regions. We have characterized a 2.2kb region upstream of the SHP2 translation start site that drives early and late medial domain expression in the gynoecium, as well as expression within the VM and DZ. We identified two separable, independent cis-regulatory modules, a 1kb promoter region and a 700bp enhancer region, that are capable of giving VM and DZ expression. Our results argue for multiple independent cis-regulatory modules that support SHP2 expression during VM development and may contribute to the robustness of SHP2 expression in this tissue. Additionally, three closely positioned CArG box motifs located in the SHP2 upstream regulatory region were mutated in the context of the 2.2kb reporter construct. Mutating simultaneously all three CArG boxes caused a moderate de-repression of the SHP2 reporter that was detected within the valve domain, suggesting that these CArG boxes are involved in SHP2 repression in the valve. PMID:29085379
Borisova, Anna S; Eneyskaya, Elena V; Jana, Suvamay; Badino, Silke F; Kari, Jeppe; Amore, Antonella; Karlsson, Magnus; Hansson, Henrik; Sandgren, Mats; Himmel, Michael E; Westh, Peter; Payne, Christina M; Kulminskaya, Anna A; Ståhlberg, Jerry
2018-01-01
The ascomycete fungus Trichoderma reesei is the predominant source of enzymes for industrial conversion of lignocellulose. Its glycoside hydrolase family 7 cellobiohydrolase (GH7 CBH) Tre Cel7A constitutes nearly half of the enzyme cocktail by weight and is the major workhorse in the cellulose hydrolysis process. The orthologs from Trichoderma atroviride ( Tat Cel7A) and Trichoderma harzianum ( Tha Cel7A) show high sequence identity with Tre Cel7A, ~ 80%, and represent naturally evolved combinations of cellulose-binding tunnel-enclosing loop motifs, which have been suggested to influence intrinsic cellobiohydrolase properties, such as endo-initiation, processivity, and off-rate. The Tat Cel7A, Tha Cel7A, and Tre Cel7A enzymes were characterized for comparison of function. The catalytic domain of Tat Cel7A was crystallized, and two structures were determined: without ligand and with thio-cellotriose in the active site. Initial hydrolysis of bacterial cellulose was faster with Tat Cel7A than either Tha Cel7A or Tre Cel7A. In synergistic saccharification of pretreated corn stover, both Tat Cel7A and Tha Cel7A were more efficient than Tre Cel7A, although Tat Cel7A was more sensitive to thermal inactivation. Structural analyses and molecular dynamics (MD) simulations were performed to elucidate important structure/function correlations. Moreover, reverse conservation analysis (RCA) of sequence diversity revealed divergent regions of interest located outside the cellulose-binding tunnel of Trichoderma spp. GH7 CBHs. We hypothesize that the combination of loop motifs is the main determinant for the observed differences in Cel7A activity on cellulosic substrates. Fine-tuning of the loop flexibility appears to be an important evolutionary target in Trichoderma spp., a conclusion supported by the RCA data. Our results indicate that, for industrial use, it would be beneficial to combine loop motifs from Tat Cel7A with the thermostability features of Tre Cel7A. Furthermore, one region implicated in thermal unfolding is suggested as a primary target for protein engineering.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Borisova, Anna S.; Eneyskaya, Elena V.; Jana, Suvamay
The ascomycete fungus Trichoderma reesei is the predominant source of enzymes for industrial conversion of lignocellulose. Its glycoside hydrolase family 7 cellobiohydrolase (GH7 CBH) TreCel7A constitutes nearly half of the enzyme cocktail by weight and is the major workhorse in the cellulose hydrolysis process. The orthologs from Trichoderma atroviride (TatCel7A) and Trichoderma harzianum (ThaCel7A) show high sequence identity with TreCel7A, ~ 80%, and represent naturally evolved combinations of cellulose-binding tunnel-enclosing loop motifs, which have been suggested to influence intrinsic cellobiohydrolase properties, such as endo-initiation, processivity, and off-rate. The TatCel7A, ThaCel7A, and TreCel7A enzymes were characterized for comparison of function. Themore » catalytic domain of TatCel7A was crystallized, and two structures were determined: without ligand and with thio-cellotriose in the active site. Initial hydrolysis of bacterial cellulose was faster with TatCel7A than either ThaCel7A or TreCel7A. In synergistic saccharification of pretreated corn stover, both TatCel7A and ThaCel7A were more efficient than TreCel7A, although TatCel7A was more sensitive to thermal inactivation. Structural analyses and molecular dynamics (MD) simulations were performed to elucidate important structure/function correlations. Moreover, reverse conservation analysis (RCA) of sequence diversity revealed divergent regions of interest located outside the cellulose-binding tunnel of Trichoderma spp. GH7 CBHs. We hypothesize that the combination of loop motifs is the main determinant for the observed differences in Cel7A activity on cellulosic substrates. Fine-tuning of the loop flexibility appears to be an important evolutionary target in Trichoderma spp., a conclusion supported by the RCA data. Our results indicate that, for industrial use, it would be beneficial to combine loop motifs from TatCel7A with the thermostability features of TreCel7A. Furthermore, one region implicated in thermal unfolding is suggested as a primary target for protein engineering.« less
Borisova, Anna S.; Eneyskaya, Elena V.; Jana, Suvamay; ...
2018-01-13
The ascomycete fungus Trichoderma reesei is the predominant source of enzymes for industrial conversion of lignocellulose. Its glycoside hydrolase family 7 cellobiohydrolase (GH7 CBH) TreCel7A constitutes nearly half of the enzyme cocktail by weight and is the major workhorse in the cellulose hydrolysis process. The orthologs from Trichoderma atroviride (TatCel7A) and Trichoderma harzianum (ThaCel7A) show high sequence identity with TreCel7A, ~ 80%, and represent naturally evolved combinations of cellulose-binding tunnel-enclosing loop motifs, which have been suggested to influence intrinsic cellobiohydrolase properties, such as endo-initiation, processivity, and off-rate. The TatCel7A, ThaCel7A, and TreCel7A enzymes were characterized for comparison of function. Themore » catalytic domain of TatCel7A was crystallized, and two structures were determined: without ligand and with thio-cellotriose in the active site. Initial hydrolysis of bacterial cellulose was faster with TatCel7A than either ThaCel7A or TreCel7A. In synergistic saccharification of pretreated corn stover, both TatCel7A and ThaCel7A were more efficient than TreCel7A, although TatCel7A was more sensitive to thermal inactivation. Structural analyses and molecular dynamics (MD) simulations were performed to elucidate important structure/function correlations. Moreover, reverse conservation analysis (RCA) of sequence diversity revealed divergent regions of interest located outside the cellulose-binding tunnel of Trichoderma spp. GH7 CBHs. We hypothesize that the combination of loop motifs is the main determinant for the observed differences in Cel7A activity on cellulosic substrates. Fine-tuning of the loop flexibility appears to be an important evolutionary target in Trichoderma spp., a conclusion supported by the RCA data. Our results indicate that, for industrial use, it would be beneficial to combine loop motifs from TatCel7A with the thermostability features of TreCel7A. Furthermore, one region implicated in thermal unfolding is suggested as a primary target for protein engineering.« less
Mushtaq, Ameeq Ul; Lee, Yejin; Hwang, Eunha; Bang, Jeong Kyu; Hong, Eunmi; Byun, Youngjoo; Song, Ji-Joon; Jeon, Young Ho
2018-01-01
MeCP2 is a chromatin associated protein which is highly expressed in brain and relevant with Rett syndrome (RTT). There are AT-hook motifs in MeCP2 which can bind with AT-rich DNA, suggesting a role in chromatin binding. Here, we report the identification and characterization of another AT-rich DNA binding motif (residues 295 to 313) from the C-terminal transcription repression domain of MeCP2 by nuclear magnetic resonance (NMR) and isothermal calorimetry (ITC). This motif shows a micromolar affinity to AT-rich DNA, and it binds to the minor groove of DNA like AT-hook motifs. Together with the previous studies, our results provide an insight into a critical role of this motif in chromatin structure and function. Copyright © 2017 Elsevier Inc. All rights reserved.
An artificial intelligence approach fit for tRNA gene studies in the era of big sequence data.
Iwasaki, Yuki; Abe, Takashi; Wada, Kennosuke; Wada, Yoshiko; Ikemura, Toshimichi
2017-09-12
Unsupervised data mining capable of extracting a wide range of knowledge from big data without prior knowledge or particular models is a timely application in the era of big sequence data accumulation in genome research. By handling oligonucleotide compositions as high-dimensional data, we have previously modified the conventional self-organizing map (SOM) for genome informatics and established BLSOM, which can analyze more than ten million sequences simultaneously. Here, we develop BLSOM specialized for tRNA genes (tDNAs) that can cluster (self-organize) more than one million microbial tDNAs according to their cognate amino acid solely depending on tetra- and pentanucleotide compositions. This unsupervised clustering can reveal combinatorial oligonucleotide motifs that are responsible for the amino acid-dependent clustering, as well as other functionally and structurally important consensus motifs, which have been evolutionarily conserved. BLSOM is also useful for identifying tDNAs as phylogenetic markers for special phylotypes. When we constructed BLSOM with 'species-unknown' tDNAs from metagenomic sequences plus 'species-known' microbial tDNAs, a large portion of metagenomic tDNAs self-organized with species-known tDNAs, yielding information on microbial communities in environmental samples. BLSOM can also enhance accuracy in the tDNA database obtained from big sequence data. This unsupervised data mining should become important for studying numerous functionally unclear RNAs obtained from a wide range of organisms.
Fast social-like learning of complex behaviors based on motor motifs.
Calvo Tapia, Carlos; Tyukin, Ivan Y; Makarov, Valeri A
2018-05-01
Social learning is widely observed in many species. Less experienced agents copy successful behaviors exhibited by more experienced individuals. Nevertheless, the dynamical mechanisms behind this process remain largely unknown. Here we assume that a complex behavior can be decomposed into a sequence of n motor motifs. Then a neural network capable of activating motor motifs in a given sequence can drive an agent. To account for (n-1)! possible sequences of motifs in a neural network, we employ the winnerless competition approach. We then consider a teacher-learner situation: one agent exhibits a complex movement, while another one aims at mimicking the teacher's behavior. Despite the huge variety of possible motif sequences we show that the learner, equipped with the provided learning model, can rewire "on the fly" its synaptic couplings in no more than (n-1) learning cycles and converge exponentially to the durations of the teacher's motifs. We validate the learning model on mobile robots. Experimental results show that the learner is indeed capable of copying the teacher's behavior composed of six motor motifs in a few learning cycles. The reported mechanism of learning is general and can be used for replicating different functions, including, for example, sound patterns or speech.
Transient α-helices in the disordered RPEL motifs of the serum response factor coactivator MKL1
NASA Astrophysics Data System (ADS)
Mizuguchi, Mineyuki; Fuju, Takahiro; Obita, Takayuki; Ishikawa, Mitsuru; Tsuda, Masaaki; Tabuchi, Akiko
2014-06-01
The megakaryoblastic leukemia 1 (MKL1) protein functions as a transcriptional coactivator of the serum response factor. MKL1 has three RPEL motifs (RPEL1, RPEL2, and RPEL3) in its N-terminal region. MKL1 binds to monomeric G-actin through RPEL motifs, and the dissociation of MKL1 from G-actin promotes the translocation of MKL1 to the nucleus. Although structural data are available for RPEL motifs of MKL1 in complex with G-actin, the structural characteristics of RPEL motifs in the free state have been poorly defined. Here we characterized the structures of free RPEL motifs using NMR and CD spectroscopy. NMR and CD measurements showed that free RPEL motifs are largely unstructured in solution. However, NMR analysis identified transient α-helices in the regions where helices α1 and α2 are induced upon binding to G-actin. Proline mutagenesis showed that the transient α-helices are locally formed without helix-helix interactions. The helix content is higher in the order of RPEL1, RPEL2, and RPEL3. The amount of preformed structure may correlate with the binding affinity between the intrinsically disordered protein and its target molecule.
GPUmotif: An Ultra-Fast and Energy-Efficient Motif Analysis Program Using Graphics Processing Units
Zandevakili, Pooya; Hu, Ming; Qin, Zhaohui
2012-01-01
Computational detection of TF binding patterns has become an indispensable tool in functional genomics research. With the rapid advance of new sequencing technologies, large amounts of protein-DNA interaction data have been produced. Analyzing this data can provide substantial insight into the mechanisms of transcriptional regulation. However, the massive amount of sequence data presents daunting challenges. In our previous work, we have developed a novel algorithm called Hybrid Motif Sampler (HMS) that enables more scalable and accurate motif analysis. Despite much improvement, HMS is still time-consuming due to the requirement to calculate matching probabilities position-by-position. Using the NVIDIA CUDA toolkit, we developed a graphics processing unit (GPU)-accelerated motif analysis program named GPUmotif. We proposed a “fragmentation" technique to hide data transfer time between memories. Performance comparison studies showed that commonly-used model-based motif scan and de novo motif finding procedures such as HMS can be dramatically accelerated when running GPUmotif on NVIDIA graphics cards. As a result, energy consumption can also be greatly reduced when running motif analysis using GPUmotif. The GPUmotif program is freely available at http://sourceforge.net/projects/gpumotif/ PMID:22662128
Michael, Sushama; Travé, Gilles; Ramu, Chenna; Chica, Claudia; Gibson, Toby J
2008-02-15
KEN-box-mediated target selection is one of the mechanisms used in the proteasomal destruction of mitotic cell cycle proteins via the APC/C complex. While annotating the Eukaryotic Linear Motif resource (ELM, http://elm.eu.org/), we found that KEN motifs were significantly enriched in human protein entries with cell cycle keywords in the UniProt/Swiss-Prot database-implying that KEN-boxes might be more common than reported. Matches to short linear motifs in protein database searches are not, per se, significant. KEN-box enrichment with cell cycle Gene Ontology terms suggests that collectively these motifs are functional but does not prove that any given instance is so. Candidates were surveyed for native disorder prediction using GlobPlot and IUPred and for motif conservation in homologues. Among >25 strong new candidates, the most notable are human HIPK2, CHFR, CDC27, Dab2, Upf2, kinesin Eg5, DNA Topoisomerase 1 and yeast Cdc5 and Swi5. A similar number of weaker candidates were present. These proteins have yet to be tested for APC/C targeted destruction, providing potential new avenues of research.
Fast social-like learning of complex behaviors based on motor motifs
NASA Astrophysics Data System (ADS)
Calvo Tapia, Carlos; Tyukin, Ivan Y.; Makarov, Valeri A.
2018-05-01
Social learning is widely observed in many species. Less experienced agents copy successful behaviors exhibited by more experienced individuals. Nevertheless, the dynamical mechanisms behind this process remain largely unknown. Here we assume that a complex behavior can be decomposed into a sequence of n motor motifs. Then a neural network capable of activating motor motifs in a given sequence can drive an agent. To account for (n -1 )! possible sequences of motifs in a neural network, we employ the winnerless competition approach. We then consider a teacher-learner situation: one agent exhibits a complex movement, while another one aims at mimicking the teacher's behavior. Despite the huge variety of possible motif sequences we show that the learner, equipped with the provided learning model, can rewire "on the fly" its synaptic couplings in no more than (n -1 ) learning cycles and converge exponentially to the durations of the teacher's motifs. We validate the learning model on mobile robots. Experimental results show that the learner is indeed capable of copying the teacher's behavior composed of six motor motifs in a few learning cycles. The reported mechanism of learning is general and can be used for replicating different functions, including, for example, sound patterns or speech.
GPUmotif: an ultra-fast and energy-efficient motif analysis program using graphics processing units.
Zandevakili, Pooya; Hu, Ming; Qin, Zhaohui
2012-01-01
Computational detection of TF binding patterns has become an indispensable tool in functional genomics research. With the rapid advance of new sequencing technologies, large amounts of protein-DNA interaction data have been produced. Analyzing this data can provide substantial insight into the mechanisms of transcriptional regulation. However, the massive amount of sequence data presents daunting challenges. In our previous work, we have developed a novel algorithm called Hybrid Motif Sampler (HMS) that enables more scalable and accurate motif analysis. Despite much improvement, HMS is still time-consuming due to the requirement to calculate matching probabilities position-by-position. Using the NVIDIA CUDA toolkit, we developed a graphics processing unit (GPU)-accelerated motif analysis program named GPUmotif. We proposed a "fragmentation" technique to hide data transfer time between memories. Performance comparison studies showed that commonly-used model-based motif scan and de novo motif finding procedures such as HMS can be dramatically accelerated when running GPUmotif on NVIDIA graphics cards. As a result, energy consumption can also be greatly reduced when running motif analysis using GPUmotif. The GPUmotif program is freely available at http://sourceforge.net/projects/gpumotif/
Grate, Jay W.; Mo, Kai -For; Daily, Michael D.
2016-02-10
Sequence control in polymers, well-known in nature, encodes structure and functionality. Here we introduce a new architecture, based on the nucleophilic aromatic substitution chemistry of cyanuric chloride, that creates a new class of sequence-defined polymers dubbed TZPs. Proof of concept is demonstrated with two synthesized hexamers, having neutral and ionizable side chains. Molecular dynamics simulations show backbone–backbone interactions, including H-bonding motifs and pi–pi interactions. This architecture is arguably biomimetic while differing from sequence-defined polymers having peptide bonds. In conclusion, the synthetic methodology supports the structural diversity of side chains known in peptides, as well as backbone–backbone hydrogen-bonding motifs, and willmore » thus enable new macromolecules and materials with useful functions.« less
Grate, Jay W; Mo, Kai-For; Daily, Michael D
2016-03-14
Sequence control in polymers, well-known in nature, encodes structure and functionality. Here we introduce a new architecture, based on the nucleophilic aromatic substitution chemistry of cyanuric chloride, that creates a new class of sequence-defined polymers dubbed TZPs. Proof of concept is demonstrated with two synthesized hexamers, having neutral and ionizable side chains. Molecular dynamics simulations show backbone-backbone interactions, including H-bonding motifs and pi-pi interactions. This architecture is arguably biomimetic while differing from sequence-defined polymers having peptide bonds. The synthetic methodology supports the structural diversity of side chains known in peptides, as well as backbone-backbone hydrogen-bonding motifs, and will thus enable new macromolecules and materials with useful functions. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Grate, Jay W.; Mo, Kai -For; Daily, Michael D.
Sequence control in polymers, well-known in nature, encodes structure and functionality. Here we introduce a new architecture, based on the nucleophilic aromatic substitution chemistry of cyanuric chloride, that creates a new class of sequence-defined polymers dubbed TZPs. Proof of concept is demonstrated with two synthesized hexamers, having neutral and ionizable side chains. Molecular dynamics simulations show backbone–backbone interactions, including H-bonding motifs and pi–pi interactions. This architecture is arguably biomimetic while differing from sequence-defined polymers having peptide bonds. In conclusion, the synthetic methodology supports the structural diversity of side chains known in peptides, as well as backbone–backbone hydrogen-bonding motifs, and willmore » thus enable new macromolecules and materials with useful functions.« less
Alves-Silva, Juliana; Sánchez-Soriano, Natalia; Beaven, Robin; Klein, Melanie; Parkin, Jill; Millard, Thomas H; Bellen, Hugo J; Venken, Koen J T; Ballestrem, Christoph; Kammerer, Richard A; Prokop, Andreas
2012-07-04
The correct outgrowth of axons is essential for the development and regeneration of nervous systems. Axon growth is primarily driven by microtubules. Key regulators of microtubules in this context are the spectraplakins, a family of evolutionarily conserved actin-microtubule linkers. Loss of function of the mouse spectraplakin ACF7 or of its close Drosophila homolog Short stop/Shot similarly cause severe axon shortening and microtubule disorganization. How spectraplakins perform these functions is not known. Here we show that axonal growth-promoting roles of Shot require interaction with EB1 (End binding protein) at polymerizing plus ends of microtubules. We show that binding of Shot to EB1 requires SxIP motifs in Shot's C-terminal tail (Ctail), mutations of these motifs abolish Shot functions in axonal growth, loss of EB1 function phenocopies Shot loss, and genetic interaction studies reveal strong functional links between Shot and EB1 in axonal growth and microtubule organization. In addition, we report that Shot localizes along microtubule shafts and stabilizes them against pharmacologically induced depolymerization. This function is EB1-independent but requires net positive charges within Ctail which essentially contribute to the microtubule shaft association of Shot. Therefore, spectraplakins are true members of two important classes of neuronal microtubule regulating proteins: +TIPs (tip interacting proteins; plus end regulators) and structural MAPs (microtubule-associated proteins). From our data we deduce a model that relates the different features of the spectraplakin C terminus to the two functions of Shot during axonal growth.
Fauteux, François; Strömvik, Martina V
2009-01-01
Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination of conserved motifs. The majority of discovered motifs match experimentally characterized cis-regulatory elements. These results provide a good starting point for further experimental analysis of plant seed-specific promoters and our methodology can be used to unravel more transcriptional regulatory mechanisms in plants and other eukaryotes. PMID:19843335
Adaptive evolution of the matrix extracellular phosphoglycoprotein in mammals
2011-01-01
Background Matrix extracellular phosphoglycoprotein (MEPE) belongs to a family of small integrin-binding ligand N-linked glycoproteins (SIBLINGs) that play a key role in skeleton development, particularly in mineralization, phosphate regulation and osteogenesis. MEPE associated disorders cause various physiological effects, such as loss of bone mass, tumors and disruption of renal function (hypophosphatemia). The study of this developmental gene from an evolutionary perspective could provide valuable insights on the adaptive diversification of morphological phenotypes in vertebrates. Results Here we studied the adaptive evolution of the MEPE gene in 26 Eutherian mammals and three birds. The comparative genomic analyses revealed a high degree of evolutionary conservation of some coding and non-coding regions of the MEPE gene across mammals indicating a possible regulatory or functional role likely related with mineralization and/or phosphate regulation. However, the majority of the coding region had a fast evolutionary rate, particularly within the largest exon (1467 bp). Rodentia and Scandentia had distinct substitution rates with an increased accumulation of both synonymous and non-synonymous mutations compared with other mammalian lineages. Characteristics of the gene (e.g. biochemical, evolutionary rate, and intronic conservation) differed greatly among lineages of the eight mammalian orders. We identified 20 sites with significant positive selection signatures (codon and protein level) outside the main regulatory motifs (dentonin and ASARM) suggestive of an adaptive role. Conversely, we find three sites under selection in the signal peptide and one in the ASARM motif that were supported by at least one selection model. The MEPE protein tends to accumulate amino acids promoting disorder and potential phosphorylation targets. Conclusion MEPE shows a high number of selection signatures, revealing the crucial role of positive selection in the evolution of this SIBLING member. The selection signatures were found mainly outside the functional motifs, reinforcing the idea that other regions outside the dentonin and the ASARM might be crucial for the function of the protein and future studies should be undertaken to understand its importance. PMID:22103247
Marston, Steven; Memo, Massimiliano; Messer, Andrew; Papadaki, Maria; Nowak, Kristen; McNamara, Elyshia; Ong, Royston; El-Mezgueldi, Mohammed; Li, Xiaochuan; Lehman, William
2013-01-01
The congenital myopathies include a wide spectrum of clinically, histologically and genetically variable neuromuscular disorders many of which are caused by mutations in genes for sarcomeric proteins. Some congenital myopathy patients have a hypercontractile phenotype. Recent functional studies demonstrated that ACTA1 K326N and TPM2 ΔK7 mutations were associated with hypercontractility that could be explained by increased myofibrillar Ca2+ sensitivity. A recent structure of the complex of actin and tropomyosin in the relaxed state showed that both these mutations are located in the actin–tropomyosin interface. Tropomyosin is an elongated molecule with a 7-fold repeated motif of around 40 amino acids corresponding to the 7 actin monomers it interacts with. Actin binds to tropomyosin electrostatically at two points, through Asp25 and through a cluster of amino acids that includes Lys326, mutated in the gain-of-function mutation. Asp25 interacts with tropomyosin K6, next to K7 that was mutated in the other gain-of-function mutation. We identified four tropomyosin motifs interacting with Asp25 (K6-K7, K48-K49, R90-R91 and R167-K168) and three E-E/D-K/R motifs interacting with Lys326 (E139, E181 and E218), and we predicted that the known skeletal myopathy mutations ΔK7, ΔK49, R91G, ΔE139, K168E and E181K would cause a gain of function. Tests by an in vitro motility assay confirmed that these mutations increased Ca2+ sensitivity, while mutations not in these motifs (R167H, R244G) decreased Ca2+ sensitivity. The work reported here explains the molecular mechanism for 6 out of 49 known disease-causing mutations in the TPM2 and TPM3 genes, derived from structural data of the actin–tropomyosin interface. PMID:23886664
You, Ronghui; Huang, Xiaodi; Zhu, Shanfeng
2018-06-06
As of April 2018, UniProtKB has collected more than 115 million protein sequences. Less than 0.15% of these proteins, however, have been associated with experimental GO annotations. As such, the use of automatic protein function prediction (AFP) to reduce this huge gap becomes increasingly important. The previous studies conclude that sequence homology based methods are highly effective in AFP. In addition, mining motif, domain, and functional information from protein sequences has been found very helpful for AFP. Other than sequences, alternative information sources such as text, however, may be useful for AFP as well. Instead of using BOW (bag of words) representation in traditional text-based AFP, we propose a new method called DeepText2GO that relies on deep semantic text representation, together with different kinds of available protein information such as sequence homology, families, domains, and motifs, to improve large-scale AFP. Furthermore, DeepText2GO integrates text-based methods with sequence-based ones by means of a consensus approach. Extensive experiments on the benchmark dataset extracted from UniProt/SwissProt have demonstrated that DeepText2GO significantly outperformed both text-based and sequence-based methods, validating its superiority. Copyright © 2018 Elsevier Inc. All rights reserved.
Pan, Xiufang; Sittaramane, Vinoth; Gurung, Suman; Chandrasekhar, Anand
2014-02-01
Van gogh-like 2 (Vangl2), a core component of the Wnt/planar cell polarity (PCP) signaling pathway, is a four-pass transmembrane protein with N-terminal and C-terminal domains located in the cytosol, and is structurally conserved from flies to mammals. In vertebrates, Vangl2 plays an essential role in convergence and extension (CE) movements during gastrulation and in facial branchiomotor (FBM) neuron migration in the hindbrain. However, the roles of specific Vangl2 domains, of membrane association, and of specific extracellular and intracellular motifs have not been examined, especially in the context of FBM neuron migration. Through heat shock-inducible expression of various Vangl2 transgenes, we found that membrane associated functions of the N-terminal and C-terminal domains of Vangl2 are involved in regulating FBM neuron migration. Importantly, through temperature shift experiments, we found that the critical period for Vangl2 function coincides with the initial stages of FBM neuron migration out of rhombomere 4. Intriguingly, we have also uncovered a putative nuclear localization motif in the C-terminal domain that may play a role in regulating CE movements. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
An RRM–ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion
Collins, Katherine M.; Kainov, Yaroslav A.; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A.
2017-01-01
Abstract RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3΄ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1–ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3΄ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. PMID:28379442
An RRM-ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion.
Collins, Katherine M; Kainov, Yaroslav A; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A; Makeyev, Eugene V; Ramos, Andres
2017-06-20
RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3΄ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1-ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3΄ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Joy, Nisha; Asha, Srinivasan; Mallika, Vijayan; Soniya, Eppurathu Vasudevan
2013-01-01
Next generation sequencing has an advantageon transformational development of species with limited available sequence data as it helps to decode the genome and transcriptome. We carried out the de novo sequencing using illuminaHiSeq™ 2000 to generate the first leaf transcriptome of black pepper (Piper nigrum L.), an important spice variety native to South India and also grown in other tropical regions. Despite the economic and biochemical importance of pepper, a scientifically rigorous study at the molecular level is far from complete due to lack of sufficient sequence information and cytological complexity of its genome. The 55 million raw reads obtained, when assembled using Trinity program generated 2,23,386 contigs and 1,28,157 unigenes. Reports suggest that the repeat-rich genomic regions give rise to small non-coding functional RNAs. MicroRNAs (miRNAs) are the most abundant type of non-coding regulatory RNAs. In spite of the widespread research on miRNAs, little is known about the hair-pin precursors of miRNAs bearing Simple Sequence Repeats (SSRs). We used the array of transcripts generated, for the in silico prediction and detection of '43 pre-miRNA candidates bearing different types of SSR motifs'. The analysis identified 3913 different types of SSR motifs with an average of one SSR per 3.04 MB of thetranscriptome. About 0.033% of the transcriptome constituted 'pre-miRNA candidates bearing SSRs'. The abundance, type and distribution of SSR motifs studied across the hair-pin miRNA precursors, showed a significant bias in the position of SSRs towards the downstream of predicted 'pre-miRNA candidates'. The catalogue of transcripts identified, together with the demonstration of reliable existence of SSRs in the miRNA precursors, permits future opportunities for understanding the genetic mechanism of black pepper and likely functions of 'tandem repeats' in miRNAs.
Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae.
Michel, Christian J; Ngoune, Viviane Nguefack; Poch, Olivier; Ripp, Raymond; Thompson, Julie D
2017-12-03
A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae . Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae . We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae , but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together, represent the first evidence for a significant enrichment of X motifs in the genes of an extant organism. They raise two hypotheses: the X motifs may be evolutionary relics of the primitive codes used for translation, or they may continue to play a functional role in the complex processes of genome decoding and protein synthesis.
Meng, Jinhong; Counsell, John R; Reza, Mojgan; Laval, Steven H; Danos, Olivier; Thrasher, Adrian; Lochmüller, Hanns; Muntoni, Francesco; Morgan, Jennifer E
2016-01-27
Autologous stem cells that have been genetically modified to express dystrophin are a possible means of treating Duchenne Muscular Dystrophy (DMD). To maximize the therapeutic effect, dystrophin construct needs to contain as many functional motifs as possible, within the packaging capacity of the viral vector. Existing dystrophin constructs used for transduction of muscle stem cells do not contain the nNOS binding site, an important functional motif within the dystrophin gene. In this proof-of-concept study, using stem cells derived from skeletal muscle of a DMD patient (mdcs) transplanted into an immunodeficient mouse model of DMD, we report that two novel dystrophin constructs, C1 (ΔR3-R13) and C2 (ΔH2-R23), can be lentivirally transduced into mdcs and produce dystrophin. These dystrophin proteins were functional in vivo, as members of the dystrophin glycoprotein complex were restored in muscle fibres containing donor-derived dystrophin. In muscle fibres derived from cells that had been transduced with construct C1, the largest dystrophin construct packaged into a lentiviral system, nNOS was restored. The combination of autologous stem cells and a lentivirus expressing a novel dystrophin construct which optimally restores proteins of the dystrophin glycoprotein complex may have therapeutic application for all DMD patients, regardless of their dystrophin mutation.
Functional and Structural Analysis of the Conserved EFhd2 Protein
Acosta, Yancy Ferrer; Rodríguez Cruz, Eva N.; Vaquer, Ana del C.; Vega, Irving E.
2013-01-01
EFhd2 is a novel protein conserved from C. elegans to H. sapiens. This novel protein was originally identified in cells of the immune and central nervous systems. However, it is most abundant in the central nervous system, where it has been found associated with pathological forms of the microtubule-associated protein tau. The physiological or pathological roles of EFhd2 are poorly understood. In this study, a functional and structural analysis was carried to characterize the molecular requirements for EFhd2’s calcium binding activity. The results showed that mutations of a conserved aspartate on either EF-hand motif disrupted the calcium binding activity, indicating that these motifs work in pair as a functional calcium binding domain. Furthermore, characterization of an identified single-nucleotide polymorphisms (SNP) that introduced a missense mutation indicates the importance of a conserved phenylalanine on EFhd2 calcium binding activity. Structural analysis revealed that EFhd2 is predominantly composed of alpha helix and random coil structures and that this novel protein is thermostable. EFhd2’s thermo stability depends on its N-terminus. In the absence of the N-terminus, calcium binding restored EFhd2’s thermal stability. Overall, these studies contribute to our understanding on EFhd2 functional and structural properties, and introduce it into the family of canonical EF-hand domain containing proteins. PMID:22973849
Dietrich, Daniela; Schmuths, Heike; Lousa, Carine De Marcos; Baldwin, Jocelyn M.; Baldwin, Stephen A.; Baker, Alison; Holdsworth, Michael J.
2009-01-01
COMATOSE (CTS), the Arabidopsis homologue of human Adrenoleukodystrophy protein (ALDP), is required for import of substrates for peroxisomal β-oxidation. A new allelic series and a homology model based on the bacterial ABC transporter, Sav1866, provide novel insights into structure-function relations of ABC subfamily D proteins. In contrast to ALDP, where the majority of mutations result in protein absence from the peroxisomal membrane, all CTS mutants produced stable protein. Mutation of conserved residues in the Walker A and B motifs in CTS nucleotide-binding domain (NBD) 1 resulted in a null phenotype but had little effect in NBD2, indicating that the NBDs are functionally distinct in vivo. Two alleles containing mutations in NBD1 outside the Walker motifs (E617K and C631Y) exhibited resistance to auxin precursors 2,4-dichlorophenoxybutyric acid (2,4-DB) and indole butyric acid (IBA) but were wild type in all other tests. The homology model predicted that the transmission interfaces are domain-swapped in CTS, and the differential effects of mutations in the conserved “EAA motif” of coupling helix 2 supported this prediction, consistent with distinct roles for each NBD. Our findings demonstrate that CTS functions can be separated by mutagenesis and the structural model provides a framework for interpretation of phenotypic data. PMID:19019987
Ito, Takuya; Nagata, Noriko; Yoshiba, Yoshu; Ohme-Takagi, Masaru; Ma, Hong; Shinozaki, Kazuo
2007-01-01
The Arabidopsis thaliana MALE STERILITY1 (MS1) gene encodes a nuclear protein with Leu zipper–like and PHD-finger motifs and is important for postmeiotic pollen development. Here, we examined MS1 function using both cell biological and molecular biological approaches. We introduced a fusion construct of MS1 and a transcriptional repression domain (MS1-SRDX) into wild-type Arabidopsis, and the transgenic plants showed a semisterile phenotype similar to that of ms1. Since the repression domain can convert various kinds of transcriptional activators to dominant repressors, this suggested that MS1 functioned as a transcriptional activator. The Leu zipper–like region and the PHD motif were required for the MS1 function. Phenotypic analysis of the ms1 mutant and the MS1-SRDX transgenic Arabidopsis indicated that MS1 was involved in formation of pollen exine and pollen cytosolic components as well as tapetum development. Next, we searched for MS1 downstream genes by analyzing publicly available microarray data and identified 95 genes affected by MS1. Using a transgenic ms1 plant showing dexamethasone-inducible recovery of fertility, we further examined whether these genes were immediately downstream of MS1. From these results, we discuss a role of MS1 in pollen and tapetum development and the conservation of MS1 function in flowering plants. PMID:18032630
NASA Astrophysics Data System (ADS)
Schmidt, Thomas P.; Perna, Anna M.; Fugmann, Tim; Böhm, Manja; Jan Hiss; Haller, Sarah; Götz, Camilla; Tegtmeyer, Nicole; Hoy, Benjamin; Rau, Tilman T.; Neri, Dario; Backert, Steffen; Schneider, Gisbert; Wessler, Silja
2016-03-01
The cell adhesion protein and tumour suppressor E-cadherin exhibits important functions in the prevention of gastric cancer. As a class-I carcinogen, Helicobacter pylori (H. pylori) has developed a unique strategy to interfere with E-cadherin functions. In previous studies, we have demonstrated that H. pylori secretes the protease high temperature requirement A (HtrA) which cleaves off the E-cadherin ectodomain (NTF) on epithelial cells. This opens cell-to-cell junctions, allowing bacterial transmigration across the polarised epithelium. Here, we investigated the molecular mechanism of the HtrA-E-cadherin interaction and identified E-cadherin cleavage sites for HtrA. Mass-spectrometry-based proteomics and Edman degradation revealed three signature motifs containing the [VITA]-[VITA]-x-x-D-[DN] sequence pattern, which were preferentially cleaved by HtrA. Based on these sites, we developed a substrate-derived peptide inhibitor that selectively bound and inhibited HtrA, thereby blocking transmigration of H. pylori. The discovery of HtrA-targeted signature sites might further explain why we detected a stable 90 kDa NTF fragment during H. pylori infection, but also additional E-cadherin fragments ranging from 105 kDa to 48 kDa in in vitro cleavage experiments. In conclusion, HtrA targets E-cadherin signature sites that are accessible in in vitro reactions, but might be partially masked on epithelial cells through functional homophilic E-cadherin interactions.
Vives-Adrian, Laia; Lujan, Celia; Oliva, Baldo; van der Linden, Lonneke; Selisko, Barbara; Coutard, Bruno; Canard, Bruno; van Kuppeveld, Frank J M; Ferrer-Orta, Cristina; Verdaguer, Núria
2014-05-01
Encephalomyocarditis virus (EMCV) is a member of the Cardiovirus genus within the large Picornaviridae family, which includes a number of important human and animal pathogens. The RNA-dependent RNA polymerase (RdRp) 3Dpol is a key enzyme for viral genome replication. In this study, we report the X-ray structures of two different crystal forms of the EMCV RdRp determined at 2.8- and 2.15-Å resolution. The in vitro elongation and VPg uridylylation activities of the purified enzyme have also been demonstrated. Although the overall structure of EMCV 3Dpol is shown to be similar to that of the known RdRps of other members of the Picornaviridae family, structural comparisons show a large reorganization of the active-site cavity in one of the crystal forms. The rearrangement affects mainly motif A, where the conserved residue Asp240, involved in ribonucleoside triphosphate (rNTP) selection, and its neighbor residue, Phe239, move about 10 Å from their expected positions within the ribose binding pocket toward the entrance of the rNTP tunnel. This altered conformation of motif A is stabilized by a cation-π interaction established between the aromatic ring of Phe239 and the side chain of Lys56 within the finger domain. Other contacts, involving Phe239 and different residues of motif F, are also observed. The movement of motif A is connected with important conformational changes in the finger region flanked by residues 54 to 63, harboring Lys56, and in the polymerase N terminus. The structures determined in this work provide essential information for studies on the cardiovirus RNA replication process and may have important implications for the development of new antivirals targeting the altered conformation of motif A. The Picornaviridae family is one of the largest virus families known, including many important human and animal pathogens. The RNA-dependent RNA polymerase (RdRp) 3Dpol is a key enzyme for picornavirus genome replication and a validated target for the development of antiviral therapies. Solving the X-ray structure of the first cardiovirus RdRp, EMCV 3Dpol, we captured an altered conformation of a conserved motif in the polymerase active site (motif A) containing the aspartic acid residue involved in rNTP selection and binding. This altered conformation of motif A, which interferes with the correct positioning of the rNTP substrate in the active site, is stabilized by a number of residues strictly conserved among picornaviruses. The rearrangements observed suggest that this motif A segment is a dynamic element that can be modulated by external effectors, either activating or inhibiting enzyme activity, and this type of modulation appears to be general to all picornaviruses.
Arginine-glycine-aspartic acid motif is critical for human parechovirus 1 entry.
Boonyakiat, Y; Hughes, P J; Ghazi, F; Stanway, G
2001-10-01
The human parechovirus 1 RGD motif in VP1 was studied by mutagenesis. An RGD-to-RGE change gave only revertant viruses with a restored RGD, while deletion of GD was lethal and nonrevertable. Mutations at the +1 and +2 positions had some effect on growth properties and a +1 M-to-P change was lethal. These studies indicate that the RGD motif plays a critical role in infectivity, presumably by interacting with integrins, and that downstream amino acids can have an influence on function.
Clark-Lewis, I; Dewald, B; Loetscher, M; Moser, B; Baggiolini, M
1994-06-10
Structure-activity relationships of human interleukin-8 (IL-8) were probed using chemically synthesized analogs with single or double amino acid substitutions, as well as hybrids derived by substituting IL-8 regions into IP10, a related protein that lacks IL-8 activity. The analogs were tested for functional activity by measuring induction of elastase release from human neutrophils and competition for binding of radiolabeled IL-8. The hybrid studies indicated that Gly31 and Pro32, as well as the NH2-terminal region from IL-8 are required to convert IP10 into a fully functional protein, suggesting that these elements are critical for IL-8 activity. Both disulfide bridges, linking residue 7 to 34 and residue 9 to 50, were critical for function, as shown by substituting the cysteine pairs with alpha-aminobutyric acid. Single conservative substitutions were generally accepted into the 10-22 region of IL-8, which contrasts with the ELR motif (residues 4-6), previously shown to be essential for activity. The importance of residues within the 10-15 region and the 17-22 region was demonstrated with hybrids. In addition, some of the 4-22 residues have structural roles that may be important; for example, Tyr13, Phe17, and Phe21 are involved in aromatic interactions in the IL-8 structure, and are also moderately sensitive to modification. Except for Cys50, the results argue against a role for the 36-72 region, including the COOH-terminal alpha-helix, in receptor binding. We conclude that the disulfide bridges and 30-35 turn provide a structural scaffold for the NH2-terminal region which includes the primary receptor-binding site (the ELR motif) and secondary binding and conformational determinants between residues 10 and 22.
Le, Duc H T; Tsutsui, Yoko; Sugawara-Narutaki, Ayae; Yukawa, Hiroshi; Baba, Yoshinobu; Ohtsuki, Chikara
2017-09-01
We have recently developed a novel double-hydrophobic elastin-like triblock polypeptide called GPG, designed after the uneven distribution of two different hydrophobic domains found in elastin, an extracellular matrix protein providing elasticity and resilience to tissues. Upon temperature trigger, GPG undergoes a sequential self-assembling process to form flexible beaded nanofibers with high homogeneity and excellent dispersibility in water. Given that GPG might be a potential elastin-mimetic material, we sought to explore the biological activities of this block polypeptide. Besides GPG, several functionalized derivatives were also constructed by fusing functional motifs such as KAAK or KAAKGRGDS at the C-terminal of GPG. Although the added motifs affected the kinetics of fiber formation and β-sheet contents, all three GPGs assembled into beaded nanofibers at the physiological temperature. The resulting GPG nanofibers preserved their beaded structures in cell culture medium; therefore, they were coated on polystyrene substrates to study their cytocompatibility toward mouse embryonic fibroblasts, NIH-3T3. Among the three polypeptides, GPG having the cell-binding motif GRGDS derived from fibronectin showed excellent cell adhesion and cell proliferation properties compared to other conventional materials, suggesting its promising applications as extracellular matrices for mammalian cells. © 2017 Wiley Periodicals, Inc. J Biomed Mater Res Part A: 105A: 2475-2484, 2017. © 2017 Wiley Periodicals, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mathur, Chhavi; Savithri, Handanahal S., E-mail: bchss@biochem.iisc.ernet.in
2012-10-12
Highlights: Black-Right-Pointing-Pointer Pepper vein banding potyvirus VPg harbors Walker motifs. Black-Right-Pointing-Pointer VPg exhibits ATPase activity in the presence of NIa-Pro. Black-Right-Pointing-Pointer Plausible structural and functional interplay between VPg and NIa-Pro. Black-Right-Pointing-Pointer Functional relevance of prolonged presence of VPg-Pro during infection. -- Abstract: Potyviruses temporally regulate their protein function by polyprotein processing. Previous studies have shown that VPg (Viral Protein genome-linked) of Pepper vein banding virus interacts with the NIa-Pro (Nuclear Inclusion-a protease) domain, and modulates the kinetics of the protease. In the present study, we report for the first time that VPg harbors the Walker motifs A and B, andmore » the presence of NIa-Pro, especially in cis (cleavage site (E191A) VPg-Pro mutant), is essential for manifestation of the ATPase activity. Mutation of Lys47 (Walker motif A) and Asp88:Glu89 (Walker motif B) to alanine in E191A VPg-Pro lead to reduced ATPase activity, confirming that this activity was inherent to VPg. We propose that potyviral VPg, established as an intrinsically disordered domain, undergoes plausible structural alterations upon interaction with globular NIa-Pro which induces the ATPase activity.« less
Readily functionalized AAA-DDD triply hydrogen-bonded motifs.
Tong, Feng; Linares-Mendez, Iamnica J; Han, Yi-Fei; Wisner, James A; Wang, Hong-Bo
2018-04-25
Herein we present a new, readily functionalized AAA-DDD hydrogen bond array. A novel AAA monomeric unit (3a-b) was obtained from a two-step synthetic procedure starting with 2-aminonicotinaldehyde via microwave radiation (overall yield of 52-66%). 1H NMR and fluorescence spectroscopy confirmed the complexation event with a calculated association constant of 1.57 × 107 M-1. Likewise, the usefulness of this triple hydrogen bond motif in supramolecular polymerization was demonstrated through viscosity measurements in a crosslinked supramolecular alternating copolymer.
Pisanti, Nadia; Soldano, Henry; Carpentier, Mathilde; Pothier, Joel
2009-12-01
The geometrical configurations of atoms in protein structures can be viewed as approximate relations among them. Then, finding similar common substructures within a set of protein structures belongs to a new class of problems that generalizes that of finding repeated motifs. The novelty lies in the addition of constraints on the motifs in terms of relations that must hold between pairs of positions of the motifs. We will hence denote them as relational motifs. For this class of problems, we present an algorithm that is a suitable extension of the KMR paradigm and, in particular, of the KMRC as it uses a degenerate alphabet. Our algorithm contains several improvements that become especially useful when-as it is required for relational motifs-the inference is made by partially overlapping shorter motifs, rather than concatenating them. The efficiency, correctness and completeness of the algorithm is ensured by several non-trivial properties that are proven in this paper. The algorithm has been applied in the important field of protein common 3D substructure searching. The methods implemented have been tested on several examples of protein families such as serine proteases, globins and cytochromes P450 additionally. The detected motifs have been compared to those found by multiple structural alignments methods.
Ainsztein, Alexandra M.; Kandels-Lewis, Stefanie E.; Mackay, Alastair M.; Earnshaw, William C.
1998-01-01
The inner centromere protein (INCENP) has a modular organization, with domains required for chromosomal and cytoskeletal functions concentrated near the amino and carboxyl termini, respectively. In this study we have identified an autonomous centromere- and midbody-targeting module in the amino-terminal 68 amino acids of INCENP. Within this module, we have identified two evolutionarily conserved amino acid sequence motifs: a 13–amino acid motif that is required for targeting to centromeres and transfer to the spindle, and an 11–amino acid motif that is required for transfer to the spindle by molecules that have targeted previously to the centromere. To begin to understand the mechanisms of INCENP function in mitosis, we have performed a yeast two-hybrid screen for interacting proteins. These and subsequent in vitro binding experiments identify a physical interaction between INCENP and heterochromatin protein HP1Hsα. Surprisingly, this interaction does not appear to be involved in targeting INCENP to the centromeric heterochromatin, but may instead have a role in its transfer from the chromosomes to the anaphase spindle. PMID:9864353
Anchoring of LPXTG-Like Proteins to the Gram-Positive Cell Wall Envelope.
Siegel, Sara D; Reardon, Melissa E; Ton-That, Hung
2017-01-01
In Gram-positive bacteria, protein precursors with a signal peptide and a cell wall sorting signal (CWSS)-which begins with an LPXTG motif, followed by a hydrophobic domain and a tail of positively charged residues-are targeted to the cell envelope by a transpeptidase enzyme call sortase. Evolution and selective pressure gave rise to six classes of sortase, i.e., SrtA-F. Only class C sortases are capable of polymerizing substrates harboring the pilin motif and CWSS into protein polymers known as pili or fimbriae, whereas the others perform cell wall anchoring functions. Regardless of the products generated from these sortases, the basic principle of sortase-catalyzed transpeptidation is the same. It begins with the cleavage of the LPXTG motif, followed by the cross-linking of this cleaved product at the threonine residue to a nucleophile, i.e., an active amino group of the peptidoglycan stem peptide or the lysine residue of the pilin motif. This chapter will summarize the efforts to identify and characterize sortases and their associated pathways with emphasis on the cell wall anchoring function.
Position specific variation in the rate of evolution in transcription factor binding sites
Moses, Alan M; Chiang, Derek Y; Kellis, Manolis; Lander, Eric S; Eisen, Michael B
2003-01-01
Background The binding sites of sequence specific transcription factors are an important and relatively well-understood class of functional non-coding DNAs. Although a wide variety of experimental and computational methods have been developed to characterize transcription factor binding sites, they remain difficult to identify. Comparison of non-coding DNA from related species has shown considerable promise in identifying these functional non-coding sequences, even though relatively little is known about their evolution. Results Here we analyse the genome sequences of the budding yeasts Saccharomyces cerevisiae, S. bayanus, S. paradoxus and S. mikatae to study the evolution of transcription factor binding sites. As expected, we find that both experimentally characterized and computationally predicted binding sites evolve slower than surrounding sequence, consistent with the hypothesis that they are under purifying selection. We also observe position-specific variation in the rate of evolution within binding sites. We find that the position-specific rate of evolution is positively correlated with degeneracy among binding sites within S. cerevisiae. We test theoretical predictions for the rate of evolution at positions where the base frequencies deviate from background due to purifying selection and find reasonable agreement with the observed rates of evolution. Finally, we show how the evolutionary characteristics of real binding motifs can be used to distinguish them from artefacts of computational motif finding algorithms. Conclusion As has been observed for protein sequences, the rate of evolution in transcription factor binding sites varies with position, suggesting that some regions are under stronger functional constraint than others. This variation likely reflects the varying importance of different positions in the formation of the protein-DNA complex. The characterization of the pattern of evolution in known binding sites will likely contribute to the effective use of comparative sequence data in the identification of transcription factor binding sites and is an important step toward understanding the evolution of functional non-coding DNA. PMID:12946282
The glycine-rich motif of Pyrococcus abyssi DNA polymerase D is critical for protein stability.
Castrec, Benoît; Laurent, Sébastien; Henneke, Ghislaine; Flament, Didier; Raffin, Jean-Paul
2010-03-05
A glycine-rich motif described as being involved in human polymerase delta proliferating cell nuclear antigen (PCNA) binding has also been identified in all euryarchaeal DNA polymerase D (Pol D) family members. We redefined the motif as the (G)-PYF box. In the present study, Pol D (G)-PYF box motif mutants from Pyrococcus abyssi were generated to investigate its role in functional interactions with the cognate PCNA. We demonstrated that this motif is not essential for interactions between PabPol D (P. abyssi Pol D) and PCNA, using surface plasmon resonance and primer extension studies. Interestingly, the (G)-PYF box is located in a hydrophobic region close to the active site. The (G)-PYF box mutants exhibited altered DNA binding properties. In addition, the thermal stability of all mutants was reduced compared to that of wild type, and this effect could be attributed to increased exposure of the hydrophobic region. These studies suggest that the (G)-PYF box motif mediates intersubunit interactions and that it may be crucial for the thermostability of PabPol D. (c) 2010 Elsevier Ltd. All rights reserved.
cWINNOWER algorithm for finding fuzzy dna motifs
NASA Technical Reports Server (NTRS)
Liang, S.; Samanta, M. P.; Biegel, B. A.
2004-01-01
The cWINNOWER algorithm detects fuzzy motifs in DNA sequences rich in protein-binding signals. A signal is defined as any short nucleotide pattern having up to d mutations differing from a motif of length l. The algorithm finds such motifs if a clique consisting of a sufficiently large number of mutated copies of the motif (i.e., the signals) is present in the DNA sequence. The cWINNOWER algorithm substantially improves the sensitivity of the winnower method of Pevzner and Sze by imposing a consensus constraint, enabling it to detect much weaker signals. We studied the minimum detectable clique size qc as a function of sequence length N for random sequences. We found that qc increases linearly with N for a fast version of the algorithm based on counting three-member sub-cliques. Imposing consensus constraints reduces qc by a factor of three in this case, which makes the algorithm dramatically more sensitive. Our most sensitive algorithm, which counts four-member sub-cliques, needs a minimum of only 13 signals to detect motifs in a sequence of length N = 12,000 for (l, d) = (15, 4). Copyright Imperial College Press.
Microbial-type terpene synthase genes occur widely in nonseed land plants, but not in seed plants
Jia, Qidong; Li, Guanglin; Köllner, Tobias G.; ...
2016-10-10
Here, the vast abundance of terpene natural products in nature is due to enzymes known as terpene synthases (TPSs) that convert acyclic prenyl diphosphate precursors into a multitude of cyclic and acyclic carbon skeletons. Yet the evolution of TPSs is not well understood at higher levels of classification. Microbial TPSs from bacteria and fungi are only distantly related to typical plant TPSs, whereas genes similar to microbial TPS genes have been recently identified in the lycophyte Selaginella moellendorffii. The goal of this study was to investigate the distribution, evolution, and biochemical functions of microbial terpene synthase-like ( MTPSL) genes inmore » other plants. By analyzing the transcriptomes of 1,103 plant species ranging from green algae to flowering plants, putative MTPSL genes were identified predominantly from nonseed plants, including liverworts, mosses, hornworts, lycophytes, and monilophytes. Directed searching for MTPSL genes in the sequenced genomes of a wide range of seed plants confirmed their general absence in this group. Among themselves, MTPSL proteins from nonseed plants form four major groups, with two of these more closely related to bacterial TPSs and the other two to fungal TPSs. Two of the four groups contain a canonical aspartate-rich “DDxxD” motif. The third group has a “DDxxxD” motif, and the fourth group has only the first two “DD” conserved in this motif. Upon heterologous expression, representative members from each of the four groups displayed diverse catalytic functions as monoterpene and sesquiterpene synthases, suggesting these are important for terpene formation in nonseed plants.« less
Microbial-type terpene synthase genes occur widely in nonseed land plants, but not in seed plants
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jia, Qidong; Li, Guanglin; Köllner, Tobias G.
Here, the vast abundance of terpene natural products in nature is due to enzymes known as terpene synthases (TPSs) that convert acyclic prenyl diphosphate precursors into a multitude of cyclic and acyclic carbon skeletons. Yet the evolution of TPSs is not well understood at higher levels of classification. Microbial TPSs from bacteria and fungi are only distantly related to typical plant TPSs, whereas genes similar to microbial TPS genes have been recently identified in the lycophyte Selaginella moellendorffii. The goal of this study was to investigate the distribution, evolution, and biochemical functions of microbial terpene synthase-like ( MTPSL) genes inmore » other plants. By analyzing the transcriptomes of 1,103 plant species ranging from green algae to flowering plants, putative MTPSL genes were identified predominantly from nonseed plants, including liverworts, mosses, hornworts, lycophytes, and monilophytes. Directed searching for MTPSL genes in the sequenced genomes of a wide range of seed plants confirmed their general absence in this group. Among themselves, MTPSL proteins from nonseed plants form four major groups, with two of these more closely related to bacterial TPSs and the other two to fungal TPSs. Two of the four groups contain a canonical aspartate-rich “DDxxD” motif. The third group has a “DDxxxD” motif, and the fourth group has only the first two “DD” conserved in this motif. Upon heterologous expression, representative members from each of the four groups displayed diverse catalytic functions as monoterpene and sesquiterpene synthases, suggesting these are important for terpene formation in nonseed plants.« less
Mechanism-based Proteomic Screening Identifies Targets of Thioredoxin-like Proteins*
Nakao, Lia S.; Everley, Robert A.; Marino, Stefano M.; Lo, Sze M.; de Souza, Luiz E.; Gygi, Steven P.; Gladyshev, Vadim N.
2015-01-01
Thioredoxin (Trx)-fold proteins are protagonists of numerous cellular pathways that are subject to thiol-based redox control. The best characterized regulator of thiols in proteins is Trx1 itself, which together with thioredoxin reductase 1 (TR1) and peroxiredoxins (Prxs) comprises a key redox regulatory system in mammalian cells. However, there are numerous other Trx-like proteins, whose functions and redox interactors are unknown. It is also unclear if the principles of Trx1-based redox control apply to these proteins. Here, we employed a proteomic strategy to four Trx-like proteins containing CXXC motifs, namely Trx1, Rdx12, Trx-like protein 1 (Txnl1) and nucleoredoxin 1 (Nrx1), whose cellular targets were trapped in vivo using mutant Trx-like proteins, under conditions of low endogenous expression of these proteins. Prxs were detected as key redox targets of Trx1, but this approach also supported the detection of TR1, which is the Trx1 reductant, as well as mitochondrial intermembrane proteins AIF and Mia40. In addition, glutathione peroxidase 4 was found to be a Rdx12 redox target. In contrast, no redox targets of Txnl1 and Nrx1 could be detected, suggesting that their CXXC motifs do not engage in mixed disulfides with cellular proteins. For some Trx-like proteins, the method allowed distinguishing redox and non-redox interactions. Parallel, comparative analyses of multiple thiol oxidoreductases revealed differences in the functions of their CXXC motifs, providing important insights into thiol-based redox control of cellular processes. PMID:25561728
Shape-specific nanostructured protein mimics from de novo designed chimeric peptides.
Jiang, Linhai; Yang, Su; Lund, Reidar; Dong, He
2018-01-30
Natural proteins self-assemble into highly-ordered nanoscaled architectures to perform specific functions. The intricate functions of proteins have provided great impetus for researchers to develop strategies for designing and engineering synthetic nanostructures as protein mimics. Compared to the success in engineering fibrous protein mimetics, the design of discrete globular protein-like nanostructures has been challenging mainly due to the lack of precise control over geometric packing and intermolecular interactions among synthetic building blocks. In this contribution, we report an effective strategy to construct shape-specific nanostructures based on the self-assembly of chimeric peptides consisting of a coiled coil dimer and a collagen triple helix folding motif. Under salt-free conditions, we showed spontaneous self-assembly of the chimeric peptides into monodisperse, trigonal bipyramidal-like nanoparticles with precise control over the stoichiometry of two folding motifs and the geometrical arrangements relative to one another. Three coiled coil dimers are interdigitated on the equatorial plane while the two collagen triple helices are located in the axial position, perpendicular to the coiled coil plane. A detailed molecular model was proposed and further validated by small angle X-ray scattering experiments and molecular dynamics (MD) simulation. The results from this study indicated that the molecular folding of each motif within the chimeric peptides and their geometric packing played important roles in the formation of discrete protein-like nanoparticles. The peptide design and self-assembly mechanism may open up new routes for the construction of highly organized, discrete self-assembling protein-like nanostructures with greater levels of control over assembly accuracy.
2017-01-01
Biological chelating molecules called siderophores are used to sequester iron and maintain its ferric state. Bacterial substrate-binding proteins (SBPs) bind iron–siderophore complexes and deliver these complexes to ATP-binding cassette (ABC) transporters for import into the cytoplasm, where the iron can be transferred from the siderophore to catalytic enzymes. In Yersinia pestis, the causative agent of plague, the Yersinia iron-uptake (Yiu) ABC transporter has been shown to improve iron acquisition under iron-chelated conditions. The Yiu transporter has been proposed to be an iron–siderophore transporter; however, the precise siderophore substrate is unknown. Therefore, the precise role of the Yiu transporter in Y. pestis survival remains uncharacterized. To better understand the function of the Yiu transporter, the crystal structure of YiuA (YPO1310/y2875), an SBP which functions to present the iron–siderophore substrate to the transporter for import into the cytoplasm, was determined. The 2.20 and 1.77 Å resolution X-ray crystal structures reveal a basic triad binding motif at the YiuA canonical substrate-binding site, indicative of a metal-chelate binding site. Structural alignment and computational docking studies support the function of YiuA in binding chelated metal. Additionally, YiuA contains two mobile helices, helix 5 and helix 10, that undergo 2–3 Å shifts across crystal forms and demonstrate structural breathing of the c-clamp architecture. The flexibility in both c-clamp lobes suggest that YiuA substrate transfer resembles the Venus flytrap mechanism that has been proposed for other SBPs. PMID:29095164
Kumar, Akhilesh; Bachhawat, Anand Kumar
2010-06-01
OXP1/YKL215c, an uncharacterized ORF of Saccharomyces cerevisiae, encodes a functional ATP-dependent 5-oxoprolinase of 1286 amino acids. The yeast 5-oxoprolinase activity was demonstrated in vivo by utilization of 5-oxoproline as a source of glutamate and OTC, a 5-oxoproline sulfur analogue, as a source of sulfur in cells overexpressing OXP1. In vitro characterization by expression and purification of the recombinant protein in S. cerevisiae revealed that the enzyme exists and functions as a dimer, and has a K(m) of 159 microM and a V(max) of 3.5 nmol h(-1) microg(-1) protein. The enzyme was found to be functionally separable in two distinct domains. An 'actin-like ATPase motif' could be identified in 5-oxprolinases, and mutation of key residues within this motif led to complete loss in ATPase and 5-oxoprolinase activity of the enzyme. The results are discussed in the light of the previously postulated truncated gamma-glutamyl cycle of yeasts.
The evolution of function within the Nudix homology clan
Srouji, John R.; Xu, Anting; Park, Annsea; Kirsch, Jack F.
2017-01-01
ABSTRACT The Nudix homology clan encompasses over 80,000 protein domains from all three domains of life, defined by homology to each other. Proteins with a domain from this clan fall into four general functional classes: pyrophosphohydrolases, isopentenyl diphosphate isomerases (IDIs), adenine/guanine mismatch‐specific adenine glycosylases (A/G‐specific adenine glycosylases), and nonenzymatic activities such as protein/protein interaction and transcriptional regulation. The largest group, pyrophosphohydrolases, encompasses more than 100 distinct hydrolase specificities. To understand the evolution of this vast number of activities, we assembled and analyzed experimental and structural data for 205 Nudix proteins collected from the literature. We corrected erroneous functions or provided more appropriate descriptions for 53 annotations described in the Gene Ontology Annotation database in this family, and propose 275 new experimentally‐based annotations. We manually constructed a structure‐guided sequence alignment of 78 Nudix proteins. Using the structural alignment as a seed, we then made an alignment of 347 “select” Nudix homology domains, curated from structurally determined, functionally characterized, or phylogenetically important Nudix domains. Based on our review of Nudix pyrophosphohydrolase structures and specificities, we further analyzed a loop region downstream of the Nudix hydrolase motif previously shown to contact the substrate molecule and possess known functional motifs. This loop region provides a potential structural basis for the functional radiation and evolution of substrate specificity within the hydrolase family. Finally, phylogenetic analyses of the 347 select protein domains and of the complete Nudix homology clan revealed general monophyly with regard to function and a few instances of probable homoplasy. Proteins 2017; 85:775–811. © 2016 Wiley Periodicals, Inc. PMID:27936487
Structural and biophysical properties of h-FANCI ARM repeat protein.
Siddiqui, Mohd Quadir; Choudhary, Rajan Kumar; Thapa, Pankaj; Kulkarni, Neha; Rajpurohit, Yogendra S; Misra, Hari S; Gadewal, Nikhil; Kumar, Satish; Hasan, Syed K; Varma, Ashok K
2017-11-01
Fanconi anemia complementation groups - I (FANCI) protein facilitates DNA ICL (Inter-Cross-link) repair and plays a crucial role in genomic integrity. FANCI is a 1328 amino acids protein which contains armadillo (ARM) repeats and EDGE motif at the C-terminus. ARM repeats are functionally diverse and evolutionarily conserved domain that plays a pivotal role in protein-protein and protein-DNA interactions. Considering the importance of ARM repeats, we have explored comprehensive in silico and in vitro approach to examine folding pattern. Size exclusion chromatography, dynamic light scattering (DLS) and glutaraldehyde crosslinking studies suggest that FANCI ARM repeat exist as monomer as well as in oligomeric forms. Circular dichroism (CD) and fluorescence spectroscopy results demonstrate that protein has predominantly α- helices and well-folded tertiary structure. DNA binding was analysed using electrophoretic mobility shift assay by autoradiography. Temperature-dependent CD, Fluorescence spectroscopy and DLS studies concluded that protein unfolds and start forming oligomer from 30°C. The existence of stable portion within FANCI ARM repeat was examined using limited proteolysis and mass spectrometry. The normal mode analysis, molecular dynamics and principal component analysis demonstrated that helix-turn-helix (HTH) motif present in ARM repeat is highly dynamic and has anti-correlated motion. Furthermore, FANCI ARM repeat has HTH structural motif which binds to double-stranded DNA.
Woznica, Arielle; Haeussler, Maximilian; Starobinska, Ella; Jemmett, Jessica; Li, Younan; Mount, David; Davidson, Brad
2012-08-01
The complex, partially redundant gene regulatory architecture underlying vertebrate heart formation has been difficult to characterize. Here, we dissect the primary cardiac gene regulatory network in the invertebrate chordate, Ciona intestinalis. The Ciona heart progenitor lineage is first specified by Fibroblast Growth Factor/Map Kinase (FGF/MapK) activation of the transcription factor Ets1/2 (Ets). Through microarray analysis of sorted heart progenitor cells, we identified the complete set of primary genes upregulated by FGF/Ets shortly after heart progenitor emergence. Combinatorial sequence analysis of these co-regulated genes generated a hypothetical regulatory code consisting of Ets binding sites associated with a specific co-motif, ATTA. Through extensive reporter analysis, we confirmed the functional importance of the ATTA co-motif in primary heart progenitor gene regulation. We then used the Ets/ATTA combination motif to successfully predict a number of additional heart progenitor gene regulatory elements, including an intronic element driving expression of the core conserved cardiac transcription factor, GATAa. This work significantly advances our understanding of the Ciona heart gene network. Furthermore, this work has begun to elucidate the precise regulatory architecture underlying the conserved, primary role of FGF/Ets in chordate heart lineage specification. Copyright © 2012 Elsevier Inc. All rights reserved.
Regulation of HTLV-1 Gag budding by Vps4A, Vps4B, and AIP1/Alix
Urata, Shuzo; Yokosawa, Hideyoshi; Yasuda, Jiro
2007-01-01
Background HTLV-1 Gag protein is a matrix protein that contains the PTAP and PPPY sequences as L-domain motifs and which can be released from mammalian cells in the form of virus-like particles (VLPs). The cellular factors Tsg101 and Nedd4.1 interact with PTAP and PPPY, respectively, within the HTLV-1 Gag polyprotein. Tsg101 forms a complex with Vps28 and Vps37 (ESCRT-I complex) and plays an important role in the class E Vps pathway, which mediates protein sorting and invagination of vesicles into multivesicular bodies. Nedd4.1 is an E3 ubiquitin ligase that binds to the PPPY motif through its WW motif, but its function is still unknown. In the present study, to investigate the mechanism of HTLV-1 budding in detail, we analyzed HTLV-1 budding using dominant negative (DN) forms of the class E proteins. Results Here, we report that DN forms of Vps4A, Vps4B, and AIP1 inhibit HTLV-1 budding. Conclusion These findings suggest that HTLV-1 budding utilizes the MVB pathway and that these class E proteins may be targets for prevention of mother-to-infant vertical transmission of the virus. PMID:17601348
Identification of sequence motifs significantly associated with antisense activity.
McQuisten, Kyle A; Peek, Andrew S
2007-06-07
Predicting the suppression activity of antisense oligonucleotide sequences is the main goal of the rational design of nucleic acids. To create an effective predictive model, it is important to know what properties of an oligonucleotide sequence associate significantly with antisense activity. Also, for the model to be efficient we must know what properties do not associate significantly and can be omitted from the model. This paper will discuss the results of a randomization procedure to find motifs that associate significantly with either high or low antisense suppression activity, analysis of their properties, as well as the results of support vector machine modelling using these significant motifs as features. We discovered 155 motifs that associate significantly with high antisense suppression activity and 202 motifs that associate significantly with low suppression activity. The motifs range in length from 2 to 5 bases, contain several motifs that have been previously discovered as associating highly with antisense activity, and have thermodynamic properties consistent with previous work associating thermodynamic properties of sequences with their antisense activity. Statistical analysis revealed no correlation between a motif's position within an antisense sequence and that sequences antisense activity. Also, many significant motifs existed as subwords of other significant motifs. Support vector regression experiments indicated that the feature set of significant motifs increased correlation compared to all possible motifs as well as several subsets of the significant motifs. The thermodynamic properties of the significantly associated motifs support existing data correlating the thermodynamic properties of the antisense oligonucleotide with antisense efficiency, reinforcing our hypothesis that antisense suppression is strongly associated with probe/target thermodynamics, as there are no enzymatic mediators to speed the process along like the RNA Induced Silencing Complex (RISC) in RNAi. The independence of motif position and antisense activity also allows us to bypass consideration of this feature in the modelling process, promoting model efficiency and reducing the chance of overfitting when predicting antisense activity. The increase in SVR correlation with significant features compared to nearest-neighbour features indicates that thermodynamics alone is likely not the only factor in determining antisense efficiency.
Yang, Peng; Wu, Min; Guo, Jing; Kwoh, Chee Keong; Przytycka, Teresa M; Zheng, Jie
2014-02-17
As a fundamental genomic element, meiotic recombination hotspot plays important roles in life sciences. Thus uncovering its regulatory mechanisms has broad impact on biomedical research. Despite the recent identification of the zinc finger protein PRDM9 and its 13-mer binding motif as major regulators for meiotic recombination hotspots, other regulators remain to be discovered. Existing methods for finding DNA sequence motifs of recombination hotspots often rely on the enrichment of co-localizations between hotspots and short DNA patterns, which ignore the cross-individual variation of recombination rates and sequence polymorphisms in the population. Our objective in this paper is to capture signals encoded in genetic variations for the discovery of recombination-associated DNA motifs. Recently, an algorithm called "LDsplit" has been designed to detect the association between single nucleotide polymorphisms (SNPs) and proximal meiotic recombination hotspots. The association is measured by the difference of population recombination rates at a hotspot between two alleles of a candidate SNP. Here we present an open source software tool of LDsplit, with integrative data visualization for recombination hotspots and their proximal SNPs. Applying LDsplit on SNPs inside an established 7-mer motif bound by PRDM9 we observed that SNP alleles preserving the original motif tend to have higher recombination rates than the opposite alleles that disrupt the motif. Running on SNP windows around hotspots each containing an occurrence of the 7-mer motif, LDsplit is able to guide the established motif finding algorithm of MEME to recover the 7-mer motif. In contrast, without LDsplit the 7-mer motif could not be identified. LDsplit is a software tool for the discovery of cis-regulatory DNA sequence motifs stimulating meiotic recombination hotspots by screening and narrowing down to hotspot associated SNPs. It is the first computational method that utilizes the genetic variation of recombination hotspots among individuals, opening a new avenue for motif finding. Tested on an established motif and simulated datasets, LDsplit shows promise to discover novel DNA motifs for meiotic recombination hotspots.
2014-01-01
Background As a fundamental genomic element, meiotic recombination hotspot plays important roles in life sciences. Thus uncovering its regulatory mechanisms has broad impact on biomedical research. Despite the recent identification of the zinc finger protein PRDM9 and its 13-mer binding motif as major regulators for meiotic recombination hotspots, other regulators remain to be discovered. Existing methods for finding DNA sequence motifs of recombination hotspots often rely on the enrichment of co-localizations between hotspots and short DNA patterns, which ignore the cross-individual variation of recombination rates and sequence polymorphisms in the population. Our objective in this paper is to capture signals encoded in genetic variations for the discovery of recombination-associated DNA motifs. Results Recently, an algorithm called “LDsplit” has been designed to detect the association between single nucleotide polymorphisms (SNPs) and proximal meiotic recombination hotspots. The association is measured by the difference of population recombination rates at a hotspot between two alleles of a candidate SNP. Here we present an open source software tool of LDsplit, with integrative data visualization for recombination hotspots and their proximal SNPs. Applying LDsplit on SNPs inside an established 7-mer motif bound by PRDM9 we observed that SNP alleles preserving the original motif tend to have higher recombination rates than the opposite alleles that disrupt the motif. Running on SNP windows around hotspots each containing an occurrence of the 7-mer motif, LDsplit is able to guide the established motif finding algorithm of MEME to recover the 7-mer motif. In contrast, without LDsplit the 7-mer motif could not be identified. Conclusions LDsplit is a software tool for the discovery of cis-regulatory DNA sequence motifs stimulating meiotic recombination hotspots by screening and narrowing down to hotspot associated SNPs. It is the first computational method that utilizes the genetic variation of recombination hotspots among individuals, opening a new avenue for motif finding. Tested on an established motif and simulated datasets, LDsplit shows promise to discover novel DNA motifs for meiotic recombination hotspots. PMID:24533858
Sumoylation promotes optimal APC/C Activation and Timely Anaphase.
Lee, Christine C; Li, Bing; Yu, Hongtao; Matunis, Michael J
2018-03-08
The Anaphase Promoting Complex/Cyclosome (APC/C) is a ubiquitin E3 ligase that functions as the gatekeeper to mitotic exit. APC/C activity is controlled by an interplay of multiple pathways during mitosis, including the spindle assembly checkpoint (SAC), that are not yet fully understood. Here, we show that sumoylation of the APC4 subunit of the APC/C peaks during mitosis and is critical for timely APC/C activation and anaphase onset. We have also identified a functionally important SUMO interacting motif in the cullin-homology domain of APC2 located near the APC4 sumoylation sites and APC/C catalytic core. Our findings provide evidence of an important regulatory role for SUMO modification and binding in affecting APC/C activation and mitotic exit. © 2018, Lee et al.
van Lith, Marcel; Hartigan, Nichola; Hatch, Jennifer; Benham, Adam M
2005-01-14
Protein disulfide isomerase (PDI) is the archetypal enzyme involved in the formation and reshuffling of disulfide bonds in the endoplasmic reticulum (ER). PDI achieves its redox function through two highly conserved thioredoxin domains, and PDI can also operate as an ER chaperone. The substrate specificities and the exact functions of most other PDI family proteins remain important unsolved questions in biology. Here, we characterize a new and striking member of the PDI family, which we have named protein disulfide isomerase-like protein of the testis (PDILT). PDILT is the first eukaryotic SXXC protein to be characterized in the ER. Our experiments have unveiled a novel, glycosylated PDI-like protein whose tissue-specific expression and unusual motifs have implications for the evolution, catalytic function, and substrate selection of thioredoxin family proteins. We show that PDILT is an ER resident glycoprotein that liaises with partner proteins in disulfide-dependent complexes within the testis. PDILT interacts with the oxidoreductase Ero1alpha, demonstrating that the N-terminal cysteine of the CXXC sequence is not required for binding of PDI family proteins to ER oxidoreductases. The expression of PDILT, in addition to PDI in the testis, suggests that PDILT performs a specialized chaperone function in testicular cells. PDILT is an unusual PDI relative that highlights the adaptability of chaperone and redox function in enzymes of the endoplasmic reticulum.
Zhang, Nan; Qiao, Zhenyi; Liang, Zheng; Mei, Bing; Xu, Zhengkai; Song, Rentao
2012-01-01
Zea mays (maize) Opaque-2 (ZmO2) protein is an important bZIP transcription factor that regulates the expression of major storage proteins (22-kD zeins) and other important genes during maize seed development. ZmO2 is subject to functional regulation through protein-protein interactions. To unveil the potential regulatory network associated with ZmO2, a protein-protein interaction study was carried out using the truncated version of ZmO2 (O2-2) as bait in a yeast two-hybrid screen with a maize seed cDNA library. A protein with homology to Taxilin was found to have stable interaction with ZmO2 in yeast and was designated as ZmTaxilin. Sequence analysis indicated that ZmTaxilin has a long coiled-coil domain containing three conserved zipper motifs. Each of the three zipper motifs is individually able to interact with ZmO2 in yeast. A GST pull-down assay demonstrated the interaction between GST-fused ZmTaxilin and ZmO2 extracted from developing maize seeds. Using onion epidermal cells as in vivo assay system, we found that ZmTaxilin could change the sub-cellular distribution of ZmO2. We also demonstrated that this change significantly repressed the transcriptional activity of ZmO2 on the 22-kD zein promoter. Our study suggests that a Taxilin-mediated change in sub-cellular distribution of ZmO2 may have important functional consequences for ZmO2 activity. PMID:22937104
Kalloush, Rawan M.; Vivet-Boudou, Valérie; Ali, Lizna M.; Mustafa, Farah; Marquet, Roland; Rizvi, Tahir A.
2016-01-01
MPMV has great potential for development as a vector for gene therapy. In this respect, precisely defining the sequences and structural motifs that are important for dimerization and packaging of its genomic RNA (gRNA) are of utmost importance. A distinguishing feature of the MPMV gRNA packaging signal is two phylogenetically conserved long-range interactions (LRIs) between U5 and gag complementary sequences, LRI-I and LRI-II. To test their biological significance in the MPMV life cycle, we introduced mutations into these structural motifs and tested their effects on MPMV gRNA packaging and propagation. Furthermore, we probed the structure of key mutants using SHAPE (selective 2′hydroxyl acylation analyzed by primer extension). Disrupting base-pairing of the LRIs affected gRNA packaging and propagation, demonstrating their significance to the MPMV life cycle. A double mutant restoring a heterologous LRI-I was fully functional, whereas a similar LRI-II mutant failed to restore gRNA packaging and propagation. These results demonstrate that while LRI-I acts at the structural level, maintaining base-pairing is not sufficient for LRI-II function. In addition, in vitro RNA dimerization assays indicated that the loss of RNA packaging in LRI mutants could not be attributed to the defects in dimerization. Our findings suggest that U5-gag LRIs play an important architectural role in maintaining the structure of the 5′ region of the MPMV gRNA, expanding the crucial role of LRIs to the nonlentiviral group of retroviruses. PMID:27095024
Childs-Disney, Jessica L; Hoskins, Jason; Rzuczek, Suzanne G; Thornton, Charles A; Disney, Matthew D
2012-05-18
RNA is an important drug target, but it is difficult to design or discover small molecules that modulate RNA function. In the present study, we report that rationally designed, modularly assembled small molecules that bind the RNA that causes myotonic dystrophy type 1 (DM1) are potently bioactive in cell culture models. DM1 is caused when an expansion of r(CUG) repeats, or r(CUG)(exp), is present in the 3' untranslated region (UTR) of the dystrophia myotonica protein kinase (DMPK) mRNA. r(CUG)(exp) folds into a hairpin with regularly repeating 5'CUG/3'GUC motifs and sequesters muscleblind-like 1 protein (MBNL1). A variety of defects are associated with DM1, including (i) formation of nuclear foci, (ii) decreased translation of DMPK mRNA due to its nuclear retention, and (iii) pre-mRNA splicing defects due to inactivation of MBNL1, which controls the alternative splicing of various pre-mRNAs. Previously, modularly assembled ligands targeting r(CUG)(exp) were designed using information in an RNA motif-ligand database. These studies showed that a bis-benzimidazole (H) binds the 5'CUG/3'GUC motif in r(CUG)(exp.) Therefore, we designed multivalent ligands to bind simultaneously multiple copies of this motif in r(CUG)(exp). Herein, we report that the designed compounds improve DM1-associated defects including improvement of translational and pre-mRNA splicing defects and the disruption of nuclear foci. These studies may establish a foundation to exploit other RNA targets in genomic sequence.
2013-01-01
Background Fungal pathogens cause devastating losses in economically important cereal crops by utilising pathogen proteins to infect host plants. Secreted pathogen proteins are referred to as effectors and have thus far been identified by selecting small, cysteine-rich peptides from the secretome despite increasing evidence that not all effectors share these attributes. Results We take advantage of the availability of sequenced fungal genomes and present an unbiased method for finding putative pathogen proteins and secreted effectors in a query genome via comparative hidden Markov model analyses followed by unsupervised protein clustering. Our method returns experimentally validated fungal effectors in Stagonospora nodorum and Fusarium oxysporum as well as the N-terminal Y/F/WxC-motif from the barley powdery mildew pathogen. Application to the cereal pathogen Fusarium graminearum reveals a secreted phosphorylcholine phosphatase that is characteristic of hemibiotrophic and necrotrophic cereal pathogens and shares an ancient selection process with bacterial plant pathogens. Three F. graminearum protein clusters are found with an enriched secretion signal. One of these putative effector clusters contains proteins that share a [SG]-P-C-[KR]-P sequence motif in the N-terminal and show features not commonly associated with fungal effectors. This motif is conserved in secreted pathogenic Fusarium proteins and a prime candidate for functional testing. Conclusions Our pipeline has successfully uncovered conservation patterns, putative effectors and motifs of fungal pathogens that would have been overlooked by existing approaches that identify effectors as small, secreted, cysteine-rich peptides. It can be applied to any pathogenic proteome data, such as microbial pathogen data of plants and other organisms. PMID:24252298
Uchiumi, Fumiaki; Watanabe, Takeshi; Tanuma, Sei-ichi
2010-05-15
DNA helicases are important in the regulation of DNA transaction and thereby various cellular functions. In this study, we developed a cost-effective multiple DNA transfection assay with DEAE-dextran reagent and analyzed the promoter activities of the human DNA helicases. The 5'-flanking regions of the human DNA helicase-encoding genes were isolated and subcloned into luciferase (Luc) expression plasmids. They were coated onto 96-well plate and used for co-transfection with a renilla-Luc expression vector into various cells, and dual-Luc assays were performed. The profiles of promoter activities were dependent on cell lines used. Among these human DNA helicase genes, XPB, RecQL5, and RTEL promoters were activated during TPA-induced HL-60 cell differentiation. Interestingly, duplicated ets (GGAA) elements are commonly located around the transcription start sites of these genes. The duplicated GGAA motifs are also found in the promoters of DNA replication/repair synthesis factor genes including PARG, ATR, TERC, and Rb1. Mutation analyses suggested that the duplicated GGAA-motifs are necessary for the basal promoter activity in various cells and some of them positively respond to TPA in HL-60 cells. TPA-induced response of 44-bp in the RTEL promoter was attenuated by co-transfection of the PU.1 expression vector. These findings suggest that the duplicated ets motifs regulate DNA-repair associated gene expressions during macrophage-like differentiation of HL-60 cells. Copyright 2010 Elsevier Inc. All rights reserved.
Close encounters of the third kind: disordered domains and the interactions of proteins.
Tompa, Peter; Fuxreiter, Monika; Oldfield, Christopher J; Simon, Istvan; Dunker, A Keith; Uversky, Vladimir N
2009-03-01
Protein-protein interactions are thought to be mediated by domains, which are autonomous folding units of proteins. Recently, a second type of interaction has been suggested, mediated by short segments termed linear motifs, which are related to recognition elements of intrinsically disordered regions. Here, we propose a third kind of protein-protein recognition mechanism, mediated by disordered regions longer than 20-30 residues. Bioinformatics predictions and well-characterized examples, such as the kinase-inhibitory domain of Cdk inhibitors and the Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 of actin-binding proteins, show that these disordered regions conform to the definition of domains rather than motifs, i.e., they represent functional, evolutionary, and structural units. Their functions are distinct from those of short motifs and ordered domains, and establish a third kind of interaction principle. With these points, we argue that these long disordered regions should be recognized as a distinct class of biologically functional protein domains.
Ladam, Franck; Stanney, William; Donaldson, Ian J; Yildiz, Ozge; Bobola, Nicoletta; Sagerström, Charles G
2018-06-18
TALE factors are broadly expressed embryonically and known to function in complexes with transcription factors (TFs) like Hox proteins at gastrula/segmentation stages, but it is unclear if such generally expressed factors act by the same mechanism throughout embryogenesis. We identify a TALE-dependent gene regulatory network (GRN) required for anterior development and detect TALE occupancy associated with this GRN throughout embryogenesis. At blastula stages, we uncover a novel functional mode for TALE factors, where they occupy genomic DECA motifs with nearby NF-Y sites. We demonstrate that TALE and NF-Y form complexes and regulate chromatin state at genes of this GRN. At segmentation stages, GRN-associated TALE occupancy expands to include HEXA motifs near PBX:HOX sites. Hence, TALE factors control a key GRN, but utilize distinct DNA motifs and protein partners at different stages - a strategy that may also explain their oncogenic potential and may be employed by other broadly expressed TFs. © 2018, Ladam et al.
Havrila, Marek; Réblová, Kamila; Zirbel, Craig L.; Leontis, Neocles B.; Šponer, Jiří
2013-01-01
The Sarcin-Ricin RNA motif (SR motif) is one of the most prominent recurrent RNA building blocks that occurs in many different RNA contexts and folds autonomously, i.e., in a context-independent manner. In this study, we combined bioinformatics analysis with explicit-solvent molecular dynamics (MD) simulations to better understand the relation between the RNA sequence and the evolutionary patterns of SR motif. SHAPE probing experiment was also performed to confirm fidelity of MD simulations. We identified 57 instances of the SR motif in a non-redundant subset of the RNA X-ray structure database and analyzed their basepairing, base-phosphate, and backbone-backbone interactions. We extracted sequences aligned to these instances from large ribosomal RNA alignments to determine frequency of occurrence for different sequence variants. We then used a simple scoring scheme based on isostericity to suggest 10 sequence variants with highly variable expected degree of compatibility with the SR motif 3D structure. We carried out MD simulations of SR motifs with these base substitutions. Non isosteric base substitutions led to unstable structures, but so did isosteric substitutions which were unable to make key base-phosphate interactions. MD technique explains why some potentially isosteric SR motifs are not realized during evolution. We also found that inability to form stable cWW geometry is an important factor in case of the first base pair of the flexible region of the SR motif. Comparison of structural, bioinformatics, SHAPE probing and MD simulation data reveals that explicit solvent MD simulations neatly reflect viability of different sequence variants of the SR motif. Thus, MD simulations can efficiently complement bioinformatics tools in studies of conservation patterns of RNA motifs and provide atomistic insight into the role of their different signature interactions. PMID:24144333
NoFold: RNA structure clustering without folding or alignment.
Middleton, Sarah A; Kim, Junhyong
2014-11-01
Structures that recur across multiple different transcripts, called structure motifs, often perform a similar function-for example, recruiting a specific RNA-binding protein that then regulates translation, splicing, or subcellular localization. Identifying common motifs between coregulated transcripts may therefore yield significant insight into their binding partners and mechanism of regulation. However, as most methods for clustering structures are based on folding individual sequences or doing many pairwise alignments, this results in a tradeoff between speed and accuracy that can be problematic for large-scale data sets. Here we describe a novel method for comparing and characterizing RNA secondary structures that does not require folding or pairwise alignment of the input sequences. Our method uses the idea of constructing a distance function between two objects by their respective distances to a collection of empirical examples or models, which in our case consists of 1973 Rfam family covariance models. Using this as a basis for measuring structural similarity, we developed a clustering pipeline called NoFold to automatically identify and annotate structure motifs within large sequence data sets. We demonstrate that NoFold can simultaneously identify multiple structure motifs with an average sensitivity of 0.80 and precision of 0.98 and generally exceeds the performance of existing methods. We also perform a cross-validation analysis of the entire set of Rfam families, achieving an average sensitivity of 0.57. We apply NoFold to identify motifs enriched in dendritically localized transcripts and report 213 enriched motifs, including both known and novel structures. © 2014 Middleton and Kim; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Vascular gene expression: a hypothesis
Martínez-Navarro, Angélica C.; Galván-Gordillo, Santiago V.; Xoconostle-Cázares, Beatriz; Ruiz-Medrano, Roberto
2013-01-01
The phloem is the conduit through which photoassimilates are distributed from autotrophic to heterotrophic tissues and is involved in the distribution of signaling molecules that coordinate plant growth and responses to the environment. Phloem function depends on the coordinate expression of a large array of genes. We have previously identified conserved motifs in upstream regions of the Arabidopsis genes, encoding the homologs of pumpkin phloem sap mRNAs, displaying expression in vascular tissues. This tissue-specific expression in Arabidopsis is predicted by the overrepresentation of GA/CT-rich motifs in gene promoters. In this work we have searched for common motifs in upstream regions of the homologous genes from plants considered to possess a “primitive” vascular tissue (a lycophyte), as well as from others that lack a true vascular tissue (a bryophyte), and finally from chlorophytes. Both lycophyte and bryophyte display motifs similar to those found in Arabidopsis with a significantly low E-value, while the chlorophytes showed either a different conserved motif or no conserved motif at all. These results suggest that these same genes are expressed coordinately in non-vascular plants; this coordinate expression may have been one of the prerequisites for the development of conducting tissues in plants. We have also analyzed the phylogeny of conserved proteins that may be involved in phloem function and development. The presence of CmPP16, APL, FT, and YDA in chlorophytes suggests the recruitment of ancient regulatory networks for the development of the vascular tissue during evolution while OPS is a novel protein specific to vascular plants. PMID:23882276
Building a stable RNA U-turn with a protonated cytidine
Gottstein-Schmidtke, Sina R.; Duchardt-Ferner, Elke; Groher, Florian; Weigand, Julia E.; Gottstein, Daniel; Suess, Beatrix; Wöhnert, Jens
2014-01-01
The U-turn is a classical three-dimensional RNA folding motif first identified in the anticodon and T-loops of tRNAs. It also occurs frequently as a building block in other functional RNA structures in many different sequence and structural contexts. U-turns induce sharp changes in the direction of the RNA backbone and often conform to the 3-nt consensus sequence 5′-UNR-3′ (N = any nucleotide, R = purine). The canonical U-turn motif is stabilized by a hydrogen bond between the N3 imino group of the U residue and the 3′ phosphate group of the R residue as well as a hydrogen bond between the 2′-hydroxyl group of the uridine and the N7 nitrogen of the R residue. Here, we demonstrate that a protonated cytidine can functionally and structurally replace the uridine at the first position of the canonical U-turn motif in the apical loop of the neomycin riboswitch. Using NMR spectroscopy, we directly show that the N3 imino group of the protonated cytidine forms a hydrogen bond with the backbone phosphate 3′ from the third nucleotide of the U-turn analogously to the imino group of the uridine in the canonical motif. In addition, we compare the stability of the hydrogen bonds in the mutant U-turn motif to the wild type and describe the NMR signature of the C+-phosphate interaction. Our results have implications for the prediction of RNA structural motifs and suggest simple approaches for the experimental identification of hydrogen bonds between protonated C-imino groups and the phosphate backbone. PMID:24951555
An Efficient Scheme for Crystal Structure Prediction Based on Structural Motifs
Zhu, Zizhong; Wu, Ping; Wu, Shunqing; ...
2017-05-15
An efficient scheme based on structural motifs is proposed for the crystal structure prediction of materials. The key advantage of the present method comes in two fold: first, the degrees of freedom of the system are greatly reduced, since each structural motif, regardless of its size, can always be described by a set of parameters (R, θ, φ) with five degrees of freedom; second, the motifs could always appear in the predicted structures when the energies of the structures are relatively low. Both features make the present scheme a very efficient method for predicting desired materials. The method has beenmore » applied to the case of LiFePO 4, an important cathode material for lithium-ion batteries. Numerous new structures of LiFePO 4 have been found, compared to those currently available, available, demonstrating the reliability of the present methodology and illustrating the promise of the concept of structural motifs.« less
An Efficient Scheme for Crystal Structure Prediction Based on Structural Motifs
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhu, Zizhong; Wu, Ping; Wu, Shunqing
An efficient scheme based on structural motifs is proposed for the crystal structure prediction of materials. The key advantage of the present method comes in two fold: first, the degrees of freedom of the system are greatly reduced, since each structural motif, regardless of its size, can always be described by a set of parameters (R, θ, φ) with five degrees of freedom; second, the motifs could always appear in the predicted structures when the energies of the structures are relatively low. Both features make the present scheme a very efficient method for predicting desired materials. The method has beenmore » applied to the case of LiFePO 4, an important cathode material for lithium-ion batteries. Numerous new structures of LiFePO 4 have been found, compared to those currently available, available, demonstrating the reliability of the present methodology and illustrating the promise of the concept of structural motifs.« less
Cruickshank, Mark N; Dods, James; Taylor, Rhonda L; Karimi, Mahdad; Fenwick, Emily J; Quail, Elizabeth A; Rea, Alexander J; Holers, V Michael; Abraham, Lawrence J; Ulgiati, Daniela
2015-07-01
Complement receptor 2 (CR2/CD21) plays an important role in the generation of normal B cell immune responses. As transcription appears to be the prime mechanism via which surface CR2/CD21 expression is controlled, understanding transcriptional regulation of this gene will have broader implications to B cell biology. Here we report opposing, cell-context specific control of CR2/CD21 promoter activity by tandem E-box elements, spaced 22 bp apart and within 70 bp of the transcription initiation site. We have identified E2A and USF transcription factors as binding to the distal and proximal E-box sites respectively in CR2-positive B-cells, at a site that is hypersensitive to restriction enzyme digestion compared to non-expressing K562 cells. However, additional unidentified proteins have also been found to bind these functionally important elements. By utilizing a proteomics approach we have identified a repressor protein, RP58, binding the distal E-box motif. Co-transfection experiments using RP58 overexpression constructs demonstrated a specific 10-fold repression of CR2/CD21 transcriptional activity mediated through the distal E-box repressor element. Taken together, our results indicate that repression of the CR2/CD21 promoter can occur through one of the E-box motifs via recruitment of RP58 and other factors to bring about a silenced chromatin context within CR2/CD21 non-expressing cells. Copyright © 2015 Elsevier Ltd. All rights reserved.
The mechanism of transforming diamond nanowires to carbon nanostructures
NASA Astrophysics Data System (ADS)
Sorkin, Anastassia; Su, Haibin
2014-01-01
The transformation of diamond nanowires (DNWs) with different diameters and geometries upon heating is investigated with density-functional-based tight-binding molecular dynamics. DNWs of <100> and <111> oriented cross-section with projected average line density between 7 and 20 atoms Å-1 transform into carbon nanotubes (CNTs) under gradual heating up to 3500-4000 K. DNWs with projected average line density larger than 25 atoms Å-1 transform into double-wall CNTs. The route of transformation into CNTs clearly exhibits three stages, with the intriguing intermediate structural motif of a carbon nanoscroll (CNS). Moreover, the morphology plays an important role in the transformation involving the CNS as one important intermediate motif to form CNTs. When starting with \\langle \\bar {2}1 1\\rangle oriented DNWs with a square cross-section consisting of two {111} facets facing each other, one interesting structure with ‘nano-bookshelf’ shape emerges: a number of graphene ‘shelves’ located inside the CNT, bonding to the CNT walls with sp3 hybridized atoms. The nano-bookshelf structures exist in a wide range of temperatures up to 3000 K. The further transformation from nano-bookshelf structures depends on the strength of the joints connecting shelves with CNT walls. Notably, the nano-bookshelf structure can evolve into two end products: one is CNT via the CNS pathway, the other is graphene transformed directly from the nano-bookshelf structure at high temperature. This work sheds light on the microscopic insight of carbon nanostructure formation mechanisms with the featured motifs highlighted in the pathways.
Denesyuk, Alexander; Denessiouk, Konstantin; Johnson, Mark S
2018-02-01
An integrin-like β-propeller domain contains seven repeats of a four-stranded antiparallel β-sheet motif (blades). Previously we described a 3D structural motif within each blade of the integrin-type β-propeller. Here, we show unique structural links that join different blades of the β-propeller structure, which together with the structural motif for a single blade are repeated in a β-propeller to provide the functional top face of the barrel, found to be involved in protein-protein interactions and substrate recognition. We compare functional top face diagrams of the integrin-type β-propeller domain and two non-integrin type β-propeller domains of virginiamycin B lyase and WD Repeat-Containing Protein 5. Copyright © 2017 Elsevier Inc. All rights reserved.
Noborn, Fredrik; Gomez Toledo, Alejandro; Green, Anders; Nasir, Waqas; Sihlbom, Carina; Nilsson, Jonas; Larson, Göran
2016-10-03
Heparan sulfate (HS) and chondroitin sulfate (CS) are complex polysaccharides that regulate important biological pathways in virtually all metazoan organisms. The polysaccharides often display opposite effects on cell functions with HS and CS structural motifs presenting unique binding sites for specific ligands. Still, the mechanisms by which glycan biosynthesis generates complex HS and CS polysaccharides required for the regulation of mammalian physiology remain elusive. Here we present a glycoproteomic approach that identifies and differentiates between HS and CS attachment sites and provides identity to the core proteins. Glycopeptides were prepared from perlecan, a complex proteoglycan known to be substituted with both HS and CS chains, further digested with heparinase or chondroitinase ABC to reduce the HS and CS chain lengths respectively, and thereafter analyzed by nLC-MS/MS. This protocol enabled the identification of three consensus HS sites and one hybrid site, carrying either a HS or a CS chain. Inspection of the amino acid sequence at the hybrid attachment locus indicates that certain peptide motifs may encode for the chain type selection process. This analytical approach will become useful when addressing fundamental questions in basic biology specifically in elucidating the functional roles of site-specific glycosylations of proteoglycans.
Balana, Bartosz; Maslennikov, Innokentiy; Kwiatkowski, Witek; Stern, Kalyn M.; Bahima, Laia; Choe, Senyon; Slesinger, Paul A.
2011-01-01
G protein-gated inwardly rectifying potassium (GIRK) channels are important gatekeepers of neuronal excitability. The surface expression of neuronal GIRK channels is regulated by the psychostimulant-sensitive sorting nexin 27 (SNX27) protein through a class I (-X-Ser/Thr-X-Φ, where X is any residue and Φ is a hydrophobic amino acid) PDZ-binding interaction. The G protein-insensitive inward rectifier channel (IRK1) contains the same class I PDZ-binding motif but associates with a different synaptic PDZ protein, postsynaptic density protein 95 (PSD95). The mechanism by which SNX27 and PSD95 discriminate these channels was previously unclear. Using high-resolution structures coupled with biochemical and functional analyses, we identified key amino acids upstream of the channel's canonical PDZ-binding motif that associate electrostatically with a unique structural pocket in the SNX27-PDZ domain. Changing specific charged residues in the channel's carboxyl terminus or in the PDZ domain converts the selective association and functional regulation by SNX27. Elucidation of this unique interaction site between ion channels and PDZ-containing proteins could provide a therapeutic target for treating brain diseases. PMID:21422294
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rha, Geun Bae; Wu, Guangteng; Shoelson, Steven E.
2010-04-15
Hepatocyte nuclear factor 4{alpha} (HNF4{alpha}) is a novel nuclear receptor that participates in a hierarchical network of transcription factors regulating the development and physiology of such vital organs as the liver, pancreas, and kidney. Among the various transcriptional coregulators with which HNF4{alpha} interacts, peroxisome proliferation-activated receptor {gamma} (PPAR{gamma}) coactivator 1{alpha} (PGC-1{alpha}) represents a novel coactivator whose activation is unusually robust and whose binding mode appears to be distinct from that of canonical coactivators such as NCoA/SRC/p160 family members. To elucidate the potentially unique molecular mechanism of PGC-1{alpha} recruitment, we have determined the crystal structure of HNF4{alpha} in complex with amore » fragment of PGC-1{alpha} containing all three of its LXXLL motifs. Despite the presence of all three LXXLL motifs available for interactions, only one is bound at the canonical binding site, with no additional contacts observed between the two proteins. However, a close inspection of the electron density map indicates that the bound LXXLL motif is not a selected one but an averaged structure of more than one LXXLL motif. Further biochemical and functional studies show that the individual LXXLL motifs can bind but drive only minimal transactivation. Only when more than one LXXLL motif is involved can significant transcriptional activity be measured, and full activation requires all three LXXLL motifs. These findings led us to propose a model wherein each LXXLL motif has an additive effect, and the multiple binding modes by HNF4{alpha} toward the LXXLL motifs of PGC-1{alpha} could account for the apparent robust activation by providing a flexible mechanism for combinatorial recruitment of additional coactivators and mediators.« less
SALAD database: a motif-based database of protein annotations for plant comparative genomics
Mihara, Motohiro; Itoh, Takeshi; Izawa, Takeshi
2010-01-01
Proteins often have several motifs with distinct evolutionary histories. Proteins with similar motifs have similar biochemical properties and thus related biological functions. We constructed a unique comparative genomics database termed the SALAD database (http://salad.dna.affrc.go.jp/salad/) from plant-genome-based proteome data sets. We extracted evolutionarily conserved motifs by MEME software from 209 529 protein-sequence annotation groups selected by BLASTP from the proteome data sets of 10 species: rice, sorghum, Arabidopsis thaliana, grape, a lycophyte, a moss, 3 algae, and yeast. Similarity clustering of each protein group was performed by pairwise scoring of the motif patterns of the sequences. The SALAD database provides a user-friendly graphical viewer that displays a motif pattern diagram linked to the resulting bootstrapped dendrogram for each protein group. Amino-acid-sequence-based and nucleotide-sequence-based phylogenetic trees for motif combination alignment, a logo comparison diagram for each clade in the tree, and a Pfam-domain pattern diagram are also available. We also developed a viewer named ‘SALAD on ARRAYs’ to view arbitrary microarray data sets of paralogous genes linked to the same dendrogram in a window. The SALAD database is a powerful tool for comparing protein sequences and can provide valuable hints for biological analysis. PMID:19854933
SALAD database: a motif-based database of protein annotations for plant comparative genomics.
Mihara, Motohiro; Itoh, Takeshi; Izawa, Takeshi
2010-01-01
Proteins often have several motifs with distinct evolutionary histories. Proteins with similar motifs have similar biochemical properties and thus related biological functions. We constructed a unique comparative genomics database termed the SALAD database (http://salad.dna.affrc.go.jp/salad/) from plant-genome-based proteome data sets. We extracted evolutionarily conserved motifs by MEME software from 209,529 protein-sequence annotation groups selected by BLASTP from the proteome data sets of 10 species: rice, sorghum, Arabidopsis thaliana, grape, a lycophyte, a moss, 3 algae, and yeast. Similarity clustering of each protein group was performed by pairwise scoring of the motif patterns of the sequences. The SALAD database provides a user-friendly graphical viewer that displays a motif pattern diagram linked to the resulting bootstrapped dendrogram for each protein group. Amino-acid-sequence-based and nucleotide-sequence-based phylogenetic trees for motif combination alignment, a logo comparison diagram for each clade in the tree, and a Pfam-domain pattern diagram are also available. We also developed a viewer named 'SALAD on ARRAYs' to view arbitrary microarray data sets of paralogous genes linked to the same dendrogram in a window. The SALAD database is a powerful tool for comparing protein sequences and can provide valuable hints for biological analysis.
Alenton, Rod Russel R; Koiwai, Keiichiro; Miyaguchi, Kohei; Kondo, Hidehiro; Hirono, Ikuo
2017-04-04
C-type lectins (CTLs) are calcium-dependent carbohydrate-binding proteins known to assist the innate immune system as pattern recognition receptors (PRRs). The binding specificity of CTLs lies in the motif of their carbohydrate recognition domain (CRD), the tripeptide motifs EPN and QPD bind to mannose and galactose, respectively. However, variants of these motifs were discovered including a QAP sequence reported in shrimp believed to have the same carbohydrate specificity as QPD. Here, we characterized a novel C-type lectin (MjGCTL) possessing a CRD with a QAP motif. The recombinant MjGCTL has a calcium-dependent agglutinating capability against both Gram-negative and Gram-positive bacteria, and its sugar specificity did not involve either mannose or galactose. In an encapsulation assay, agarose beads coated with rMjGCTL were immediately encapsulated from 0 h followed by melanization at 4 h post-incubation with hemocytes. These results confirm that MjGCTL functions as a classical CTL. The structure of QAP motif and carbohydrate-specificity of rMjGCTL was found to be different to both EPN and QPD, suggesting that QAP is a new motif. Furthermore, MjGCTL acts as a PRR binding to hemocytes to activate their adherent state and initiate encapsulation.
Alenton, Rod Russel R.; Koiwai, Keiichiro; Miyaguchi, Kohei; Kondo, Hidehiro; Hirono, Ikuo
2017-01-01
C-type lectins (CTLs) are calcium-dependent carbohydrate-binding proteins known to assist the innate immune system as pattern recognition receptors (PRRs). The binding specificity of CTLs lies in the motif of their carbohydrate recognition domain (CRD), the tripeptide motifs EPN and QPD bind to mannose and galactose, respectively. However, variants of these motifs were discovered including a QAP sequence reported in shrimp believed to have the same carbohydrate specificity as QPD. Here, we characterized a novel C-type lectin (MjGCTL) possessing a CRD with a QAP motif. The recombinant MjGCTL has a calcium-dependent agglutinating capability against both Gram-negative and Gram-positive bacteria, and its sugar specificity did not involve either mannose or galactose. In an encapsulation assay, agarose beads coated with rMjGCTL were immediately encapsulated from 0 h followed by melanization at 4 h post-incubation with hemocytes. These results confirm that MjGCTL functions as a classical CTL. The structure of QAP motif and carbohydrate-specificity of rMjGCTL was found to be different to both EPN and QPD, suggesting that QAP is a new motif. Furthermore, MjGCTL acts as a PRR binding to hemocytes to activate their adherent state and initiate encapsulation. PMID:28374848
Synthetic Phage for Tissue Regeneration
Merzlyak, Anna; Lee, Seung-Wuk
2014-01-01
Controlling structural organization and signaling motif display is of great importance to design the functional tissue regenerating materials. Synthetic phage, genetically engineered M13 bacteriophage has been recently introduced as novel tissue regeneration materials to display a high density of cell-signaling peptides on their major coat proteins for tissue regeneration purposes. Structural advantages of their long-rod shape and monodispersity can be taken together to construct nanofibrous scaffolds which support cell proliferation and differentiation as well as direct orientation of their growth in two or three dimensions. This review demonstrated how functional synthetic phage is designed and subsequently utilized for tissue regeneration that offers potential cell therapy. PMID:24991085
Multiscale structural gradients enhance the biomechanical functionality of the spider fang
Bar-On, Benny; Barth, Friedrich G.; Fratzl, Peter; Politi, Yael
2014-01-01
The spider fang is a natural injection needle, hierarchically built from a complex composite material comprising multiscale architectural gradients. Considering its biomechanical function, the spider fang has to sustain significant mechanical loads. Here we apply experiment-based structural modelling of the fang, followed by analytical mechanical description and Finite-Element simulations, the results of which indicate that the naturally evolved fang architecture results in highly adapted effective structural stiffness and damage resilience. The analysis methods and physical insights of this work are potentially important for investigating and understanding the architecture and structural motifs of sharp-edge biological elements such as stingers, teeth, claws and more. PMID:24866935
Berg, Stefan; Starbuck, James; Torrelles, Jordi B; Vissa, Varalakshmi D; Crick, Dean C; Chatterjee, Delphi; Brennan, Patrick J
2005-02-18
D-Arabinans, composed of D-arabinofuranose (D-Araf), dominate the structure of mycobacterial cell walls in two settings, as part of lipoarabinomannan (LAM) and arabinogalactan, each with markedly different structures and functions. Little is known of the complexity of their biosynthesis. beta-D-Arabinofuranosyl-1-monophosphoryldecaprenol is the only known sugar donor. EmbA, EmbB, and EmbC, products of the paralogous genes embA, embB, and embC, the sites of resistance to the anti-tuberculosis drug ethambutol (EMB), are the only known implicated enzymes. EmbA and -B apparently contribute to the synthesis of arabinogalactan, whereas EmbC is reserved for the synthesis of LAM. The Emb proteins show no overall similarity to any known proteins beyond Mycobacterium and related genera. However, functional motifs, equivalent to a proline-rich motif of several bacterial polysaccharide co-polymerases and a superfamily of glycosyltransferases, were found. Site-directed mutagenesis in glycosyltransferase superfamily C resulted in complete ablation of LAM synthesis. Point mutations in three amino acids of the proline motif of EmbC resulted in marked reduction of LAM-arabinan synthesis and accumulation of an unknown intermediate and of the known precursor lipomannan. Yet the pattern of the differently linked d-Araf units observed in wild type LAM-arabinan was largely retained in the proline motif mutants. The results allow for the presentation of a unique model of arabinan synthesis.
Gusto, Gaelle; Schbath, Sophie
2005-01-01
We propose an original statistical method to estimate how the occurrences of a given process along a genome, genes or motifs for instance, may be influenced by the occurrences of a second process. More precisely, the aim is to detect avoided and/or favored distances between two motifs, for instance, suggesting possible interactions at a molecular level. For this, we consider occurrences along the genome as point processes and we use the so-called Hawkes' model. In such model, the intensity at position t depends linearly on the distances to past occurrences of both processes via two unknown profile functions to estimate. We perform a non parametric estimation of both profiles by using B-spline decompositions and a constrained maximum likelihood method. Finally, we use the AIC criterion for the model selection. Simulations show the excellent behavior of our estimation procedure. We then apply it to study (i) the dependence between gene occurrences along the E. coli genome and the occurrences of a motif known to be part of the major promoter for this bacterium, and (ii) the dependence between the yeast S. cerevisiae genes and the occurrences of putative polyadenylation signals. The results are coherent with known biological properties or previous predictions, meaning this method can be of great interest for functional motif detection, or to improve knowledge of some biological mechanisms.
Tumlirsch, Tony; Jendrossek, Dieter
2017-04-01
On the basis of bioinformatic evidence, we suspected that proteins with a CYTH ( Cy aB th iamine triphosphatase) domain and/or a CHAD ( c onserved h istidine α -helical d omain) motif might represent polyphosphate (polyP) granule-associated proteins. We found no evidence of polyP targeting by proteins with CYTH domains. In contrast, two CHAD motif-containing proteins from Ralstonia eutropha H16 (A0104 and B1017) that were expressed as fusions with enhanced yellow fluorescent protein (eYFP) colocalized with polyP granules. While the expression of B1017 was not detectable, the A0104 protein was specifically identified in an isolated polyP granule fraction by proteome analysis. Moreover, eYFP fusions with the CHAD motif-containing proteins MGMSRV2-1987 from Magnetospirillum gryphiswaldense and PP2307 from Pseudomonas putida also colocalized with polyP granules in a transspecies-specific manner. These data indicated that CHAD-containing proteins are generally attached to polyP granules. Together with the findings from four previously polyP-attached proteins (polyP kinases), the results of this study raised the number of polyP-associated proteins in R. eutropha to six. We suggest designating polyP granule-bound proteins with CHAD motifs as phosins ( pho sphate), analogous to pha sins and oleo sins that are specifically bound to the surface of polyhydroxyalkanoate (PHA) granules in PHA-accumulating bacteria and to oil droplets in oil seed plants, respectively. IMPORTANCE The importance of polyphosphate (polyP) for life is evident from the ubiquitous presence of polyP in all species on earth. In unicellular eukaryotic microorganisms, polyP is located in specific membrane-enclosed organelles, called acidocalcisomes. However, in most prokaryotes, polyP is present as insoluble granules that have been designated previously as volutin granules. Almost nothing is known regarding the macromolecular composition of polyP granules. Particularly, the absence or presence of cellular compounds on the surface of polyP granules has not yet been investigated. In this study, we identified a novel class of proteins that are attached to the surface of polyP granules in three model species of Alphaproteobacteria , Betaproteobacteria , and Gammaproteobacteria These proteins are characterized by the presence of a CHAD ( c onserved h istidine α -helical d omain) motif that functions as a polyP granule-targeting signal. We suggest designating CHAD motif-containing proteins as phosins [analogous to phasins for poly(3-hydroxybutyrate)-associated proteins and to oleosins for oil droplet-associated proteins in oil seed plants]. The expression of phosins in different species confirmed their polyP-targeting function in a transspecies-specific manner. We postulate that polyP granules in prokaryotic species generally have a complex surface structure that consists of one to several polyP kinases and phosin proteins. We suggest differentiating polyP granules from acidocalcisomes by designating them as polyphosphatosomes. Copyright © 2017 American Society for Microbiology.
How pathogens use linear motifs to perturb host cell networks.
Via, Allegra; Uyar, Bora; Brun, Christine; Zanzoni, Andreas
2015-01-01
Molecular mimicry is one of the powerful stratagems that pathogens employ to colonise their hosts and take advantage of host cell functions to guarantee their replication and dissemination. In particular, several viruses have evolved the ability to interact with host cell components through protein short linear motifs (SLiMs) that mimic host SLiMs, thus facilitating their internalisation and the manipulation of a wide range of cellular networks. Here we present convincing evidence from the literature that motif mimicry also represents an effective, widespread hijacking strategy in prokaryotic and eukaryotic parasites. Further insights into host motif mimicry would be of great help in the elucidation of the molecular mechanisms behind host cell invasion and the development of anti-infective therapeutic strategies. Copyright © 2014 Elsevier Ltd. All rights reserved.
Towler, D A; Bennett, C D; Rodan, G A
1994-05-01
A detailed analysis of the transcriptional machinery responsible for osteoblast-specific gene expression should provide tools useful for understanding osteoblast commitment and differentiation. We have defined three cis-elements important for basal activity of the rat osteocalcin (OC) promoter, located at about -200 to -180, -170 to -138, and -121 to -64 relative to the transcription initiation site. A motif (TCTGATTGTGT) present in the region between -200 and -170 that binds a multisubunit CP1/NFY/CBF-like CAAT factor complex contributes significantly to high level basal activity and presumably functions as the CAAT box for the rat OC promoter. We show that the region -121 to 32 is sufficient to confer osteoblastic cell type specificity in transient transfection assays of cultured cell lines using luciferase as a reporter. The basal promoter is active in rodent osteoblastic cell lines, but not in rodent fibroblastic or muscle cell lines. Although the rat OC box (-100 to -74) contains a CAAT motif, we could not detect CP1-like CAAT factor binding to this region. In fact, we demonstrate that a Msx-1 (Hox 7.1) homeodomain binding motif (ACTAATTG; bottom strand) in the 3'-end of the rat OC box is necessary for high level activity of the rat OC basal promoter in osteoblastic cells. A nuclear factor that recognizes this motif appears to be present in osteoblastic ROS 17/2.8 cells, which produce OC, but not in fibroblastic ROS 25/1 cells, which fail to express OC. This ROS 17/2.8 nuclear factor also recognizes the A/T-rich DNA cognates of the homeodomain-containing POU family of transcription factors. Taken together, these data suggest that a ubiquitous CP1-like CAAT factor and a cell type-restricted homeodomain containing (Msx or POU family) transcription factor interact with the proximal rat OC promoter to direct appropriate basal OC transcription in osteoblastic cells.
Raasch, Martin; Rennert, Knut; Jahn, Tobias; Peters, Sven; Henkel, Thomas; Huber, Otmar; Schulz, Ingo; Becker, Holger; Lorkowski, Stefan; Funke, Harald; Mosig, Alexander
2015-03-02
Hemodynamic forces generated by the blood flow are of central importance for the function of endothelial cells (ECs), which form a biologically active cellular monolayer in blood vessels and serve as a selective barrier for macromolecular permeability. Mechanical stimulation of the endothelial monolayer induces morphological remodeling in its cytoskeleton. For in vitro studies on EC biology culture devices are desirable that simulate conditions of flow in blood vessels and allow flow-based adhesion/permeability assays under optimal perfusion conditions. With this aim we designed a biochip comprising a perfusable membrane that serves as cell culture platform multi-organ-tissue-flow (MOTiF biochip). This biochip allows an effective supply with nutrition medium, discharge of catabolic cell metabolites and defined application of shear stress to ECs under laminar flow conditions. To characterize EC layers cultured in the MOTiF biochip we investigated cell viability, expression of EC marker proteins and cell adhesion molecules of ECs dynamically cultured under low and high shear stress, and compared them with an endothelial culture in established two-dimensionally perfused flow chambers and under static conditions. We show that ECs cultured in the MOTiF biochip form a tight EC monolayer with increased cellular density, enhanced cell layer thickness, presumably as the result of a rapid and effective adaption to shear stress by remodeling of the cytoskeleton. Moreover, endothelial layers in the MOTiF biochip express higher amounts of EC marker proteins von-Willebrand-factor and PECAM-1. EC layers were highly responsive to stimulation with TNFα as detected at the level of ICAM-1, VCAM-1 and E-selectin expression and modulation of endothelial permeability in response to TNFα/IFNγ treatment under flow conditions. Compared to static and two-dimensionally perfused cell culture condition we consider MOTiF biochips as a valuable tool for studying EC biology in vitro under advanced culture conditions more closely resembling the in vivo situation.
Conservation of RNA chaperone activity of the human La-related proteins 4, 6 and 7.
Hussain, Rawaa H; Zawawi, Mariam; Bayfield, Mark A
2013-10-01
The La module is a conserved tandem arrangement of a La motif and RNA recognition motif whose function has been best characterized in genuine La proteins. The best-characterized substrates of La proteins are pre-tRNAs, and previous work using tRNA mediated suppression in Schizosaccharomyces pombe has demonstrated that yeast and human La enhance the maturation of these using two distinguishable activities: UUU-3'OH-dependent trailer binding/protection and a UUU-3'OH independent activity related to RNA chaperone function. The La module has also been identified in several conserved families of La-related proteins (LARPs) that engage other RNAs, but their mode of RNA binding and function(s) are not well understood. We demonstrate that the La modules of the human LARPs 4, 6 and 7 are also active in tRNA-mediated suppression, even in the absence of stable UUU-3'OH trailer protection. Rather, the capacity of these to enhance pre-tRNA maturation is associated with RNA chaperone function, which we demonstrate to be a conserved activity for each hLARP in vitro. Our work reveals insight into the mechanisms by which La module containing proteins discriminate RNA targets and demonstrates that RNA chaperone activity is a conserved function across representative members of the La motif-containing superfamily.
Sebestyén, Endre; Nagy, Tibor; Suhai, Sándor; Barta, Endre
2009-01-01
Background The comparative genomic analysis of a large number of orthologous promoter regions of the chordate and plant genes from the DoOP databases shows thousands of conserved motifs. Most of these motifs differ from any known transcription factor binding site (TFBS). To identify common conserved motifs, we need a specific tool to be able to search amongst them. Since conserved motifs from the DoOP databases are linked to genes, the result of such a search can give a list of genes that are potentially regulated by the same transcription factor(s). Results We have developed a new tool called DoOPSearch for the analysis of the conserved motifs in the promoter regions of chordate or plant genes. We used the orthologous promoters of the DoOP database to extract thousands of conserved motifs from different taxonomic groups. The advantage of this approach is that different sets of conserved motifs might be found depending on how broad the taxonomic coverage of the underlying orthologous promoter sequence collection is (consider e.g. primates vs. mammals or Brassicaceae vs. Viridiplantae). The DoOPSearch tool allows the users to search these motif collections or the promoter regions of DoOP with user supplied query sequences or any of the conserved motifs from the DoOP database. To find overrepresented gene ontologies, the gene lists obtained can be analysed further using a modified version of the GeneMerge program. Conclusion We present here a comparative genomics based promoter analysis tool. Our system is based on a unique collection of conserved promoter motifs characteristic of different taxonomic groups. We offer both a command line and a web-based tool for searching in these motif collections using user specified queries. These can be either short promoter sequences or consensus sequences of known transcription factor binding sites. The GeneMerge analysis of the search results allows the user to identify statistically overrepresented Gene Ontology terms that might provide a clue on the function of the motifs and genes. PMID:19534755
The Regulatory Factor ZFHX3 Modifies Circadian Function in SCN via an AT Motif-Driven Axis
Parsons, Michael J.; Brancaccio, Marco; Sethi, Siddharth; Maywood, Elizabeth S.; Satija, Rahul; Edwards, Jessica K.; Jagannath, Aarti; Couch, Yvonne; Finelli, Mattéa J.; Smyllie, Nicola J.; Esapa, Christopher; Butler, Rachel; Barnard, Alun R.; Chesham, Johanna E.; Saito, Shoko; Joynson, Greg; Wells, Sara; Foster, Russell G.; Oliver, Peter L.; Simon, Michelle M.; Mallon, Ann-Marie; Hastings, Michael H.; Nolan, Patrick M.
2015-01-01
Summary We identified a dominant missense mutation in the SCN transcription factor Zfhx3, termed short circuit (Zfhx3Sci), which accelerates circadian locomotor rhythms in mice. ZFHX3 regulates transcription via direct interaction with predicted AT motifs in target genes. The mutant protein has a decreased ability to activate consensus AT motifs in vitro. Using RNA sequencing, we found minimal effects on core clock genes in Zfhx3Sci/+ SCN, whereas the expression of neuropeptides critical for SCN intercellular signaling was significantly disturbed. Moreover, mutant ZFHX3 had a decreased ability to activate AT motifs in the promoters of these neuropeptide genes. Lentiviral transduction of SCN slices showed that the ZFHX3-mediated activation of AT motifs is circadian, with decreased amplitude and robustness of these oscillations in Zfhx3Sci/+ SCN slices. In conclusion, by cloning Zfhx3Sci, we have uncovered a circadian transcriptional axis that determines the period and robustness of behavioral and SCN molecular rhythms. PMID:26232227
A naturally occurring, noncanonical GTP aptamer made of simple tandem repeats
Curtis, Edward A; Liu, David R
2014-01-01
Recently, we used in vitro selection to identify a new class of naturally occurring GTP aptamer called the G motif. Here we report the discovery and characterization of a second class of naturally occurring GTP aptamer, the “CA motif.” The primary sequence of this aptamer is unusual in that it consists entirely of tandem repeats of CA-rich motifs as short as three nucleotides. Several active variants of the CA motif aptamer lack the ability to form consecutive Watson-Crick base pairs in any register, while others consist of repeats containing only cytidine and adenosine residues, indicating that noncanonical interactions play important roles in its structure. The circular dichroism spectrum of the CA motif aptamer is distinct from that of A-form RNA and other major classes of nucleic acid structures. Bioinformatic searches indicate that the CA motif is absent from most archaeal and bacterial genomes, but occurs in at least 70 percent of approximately 400 eukaryotic genomes examined. These searches also uncovered several phylogenetically conserved examples of the CA motif in rodent (mouse and rat) genomes. Together, these results reveal the existence of a second class of naturally occurring GTP aptamer whose sequence requirements, like that of the G motif, are not consistent with those of a canonical secondary structure. They also indicate a new and unexpected potential biochemical activity of certain naturally occurring tandem repeats. PMID:24824832
Robinson, Angela K.; Leal, Belinda Z.; Chadwell, Linda V.; Wang, Renjing; Ilangovan, Udayar; Kaur, Yogeet; Junco, Sarah E.; Schirf, Virgil; Osmulski, Pawel A.; Gaczynska, Maria; Hinck, Andrew P.; Demeler, Borries; McEwen, Donald G.; Kim, Chongwoo A.
2012-01-01
Polyhomeotic (Ph), a member of the Polycomb Group (PcG), is a gene silencer critical for proper development. We present a previously unrecognized way of controlling Ph function through modulation of its sterile alpha motif (SAM) polymerization leading to the identification of a novel target for tuning the activities of proteins. SAM domain containing proteins have been shown to require SAM polymerization for proper function. However, the role of the Ph SAM polymer in PcG-mediated gene silencing was uncertain. Here, we first show that Ph SAM polymerization is indeed required for its gene silencing function. Interestingly, the unstructured linker sequence N-terminal to Ph SAM can shorten the length of polymers compared with when Ph SAM is individually isolated. Substituting the native linker with a random, unstructured sequence (RLink) can still limit polymerization, but not as well as the native linker. Consequently, the increased polymeric Ph RLink exhibits better gene silencing ability. In the Drosophila wing disc, Ph RLink expression suppresses growth compared with no effect for wild-type Ph, and opposite to the overgrowth phenotype observed for polymer-deficient Ph mutants. These data provide the first demonstration that the inherent activity of a protein containing a polymeric SAM can be enhanced by increasing SAM polymerization. Because the SAM linker had not been previously considered important for the function of SAM-containing proteins, our finding opens numerous opportunities to manipulate linker sequences of hundreds of polymeric SAM proteins to regulate a diverse array of intracellular functions. PMID:22275371
Comparative analysis of the XopD T3S effector family in plant pathogenic bacteria
Kim, Jung-Gun; Taylor, Kyle W.; Mudgett, Mary Beth
2011-01-01
SUMMARY XopD is a type III effector protein that is required for Xanthomonas campestris pathovar vesicatoria (Xcv) growth in tomato. It is a modular protein consisting of an N-terminal DNA-binding domain, two EAR transcriptional repressor motifs, and a C-terminal SUMO protease. In tomato, XopD functions as a transcriptional repressor, resulting in the suppression of defense responses at late stages of infection. A survey of available genome sequences for phytopathogenic bacteria revealed that XopD homologs are limited to species within three Genera of Proteobacteria – Xanthomonas, Acidovorax, and Pseudomonas. While the EAR motif(s) and SUMO protease domain are conserved in all the XopD-like proteins, variation exists in the length and sequence identity of the N-terminal domains. Comparative analysis of the DNA sequences surrounding xopD and xopD-like genes led to revised annotation of the xopD gene. Edman degradation sequence analysis and functional complementation studies confirmed that the xopD gene from Xcv encodes a 760 amino acid protein with a longer N-terminal domain than previously predicted. None of the XopD-like proteins studied complemented Xcv ΔxopD mutant phenotypes in tomato leaves suggesting that the N-terminus of XopD defines functional specificity. Xcv ΔxopD strains expressing chimeric fusion proteins containing the N-terminus of XopD fused to the EAR motif(s) and SUMO protease domain of the XopD-like protein from Xanthomonas campestris pathovar campestris strain B100 were fully virulent in tomato demonstrating that the N-terminus of XopD controls specificity in tomato. PMID:21726373
Krobath, I; Römer, H; Hartbauer, M
2017-01-01
Males of a trilling species in the Mecopoda complex produce conspicuous calling songs that consist of two motifs: an amplitude-modulated motif with alternating loud and soft segments (AM-motif) and a continuous, high-intensity trill. The function of these song motifs for female attraction and competition between males was investigated. We tested the hypothesis that males modify their signaling behavior depending on the social environment (presence/absence of females or rival males) when they compete for mates. Therefore, we analyzed acoustic signaling of males in three different situations: (1) solo singing, (2) acoustic interaction with another male, and (3) singing in the presence of a female. In addition, the preference of females for these song motifs and further song parameters was studied in two-choice experiments. As expected, females showed a preference for conspicuous and loud song elements, but nevertheless, males increased the proportion of the AM-motif in the presence of a female. In acoustic interactions, males reduced bout duration significantly compared to both other situations. However, song bouts in this situation still overlapped more than expected by chance, which indicates intentionally simultaneous singing. A multivariate statistical analysis revealed that the proportion of the AM-motif and the duration of loud segments within the AM-motif allow a reliable prediction of whether males sing in isolation, compete with another male, or sing in the presence of a female. These results indicate that the AM-motif plays a dominant role especially in close-range courtship and that males are challenged in finding a balance between attracting females and saving energy during repeated acoustic interactions. Males of acoustic insects often produce conspicuous calling songs that have a dual function in male-male competition and mate attraction. High signal amplitudes and signal rates are associated with high energetic costs for signal production. We would therefore predict that males adjust their signaling behavior according to their perception of the social context. Here we studied signal production and mate choice in a katydid, where males switch between loud and soft song segments in a dynamic way. Additionally, we examined the attractiveness of different song elements in female choice tests. Our results show how males of this katydid deal with the conflict of remaining attractive for females and competing with a costly signal with rivals.
Differential pleiotropy and HOX functional organization.
Sivanantharajah, Lovesha; Percival-Smith, Anthony
2015-02-01
Key studies led to the idea that transcription factors are composed of defined modular protein motifs or domains, each with separable, unique function. During evolution, the recombination of these modular domains could give rise to transcription factors with new properties, as has been shown using recombinant molecules. This archetypic, modular view of transcription factor organization is based on the analyses of a few transcription factors such as GAL4, which may represent extreme exemplars rather than an archetype or the norm. Recent work with a set of Homeotic selector (HOX) proteins has revealed differential pleiotropy: the observation that highly-conserved HOX protein motifs and domains make small, additive, tissue specific contributions to HOX activity. Many of these differentially pleiotropic HOX motifs may represent plastic sequence elements called short linear motifs (SLiMs). The coupling of differential pleiotropy with SLiMs, suggests that protein sequence changes in HOX transcription factors may have had a greater impact on morphological diversity during evolution than previously believed. Furthermore, differential pleiotropy may be the genetic consequence of an ensemble nature of HOX transcription factor allostery, where HOX proteins exist as an ensemble of states with the capacity to integrate an extensive array of developmental information. Given a new structural model for HOX functional domain organization, the properties of the archetypic TF may require reassessment. Copyright © 2014 Elsevier Inc. All rights reserved.
iELM—a web server to explore short linear motif-mediated interactions
Weatheritt, Robert J.; Jehl, Peter; Dinkel, Holger; Gibson, Toby J.
2012-01-01
The recent expansion in our knowledge of protein–protein interactions (PPIs) has allowed the annotation and prediction of hundreds of thousands of interactions. However, the function of many of these interactions remains elusive. The interactions of Eukaryotic Linear Motif (iELM) web server provides a resource for predicting the function and positional interface for a subset of interactions mediated by short linear motifs (SLiMs). The iELM prediction algorithm is based on the annotated SLiM classes from the Eukaryotic Linear Motif (ELM) resource and allows users to explore both annotated and user-generated PPI networks for SLiM-mediated interactions. By incorporating the annotated information from the ELM resource, iELM provides functional details of PPIs. This can be used in proteomic analysis, for example, to infer whether an interaction promotes complex formation or degradation. Furthermore, details of the molecular interface of the SLiM-mediated interactions are also predicted. This information is displayed in a fully searchable table, as well as graphically with the modular architecture of the participating proteins extracted from the UniProt and Phospho.ELM resources. A network figure is also presented to aid the interpretation of results. The iELM server supports single protein queries as well as large-scale proteomic submissions and is freely available at http://i.elm.eu.org. PMID:22638578
Computational Analyses of Synergism in Small Molecular Network Motifs
Zhang, Yili; Smolen, Paul; Baxter, Douglas A.; Byrne, John H.
2014-01-01
Cellular functions and responses to stimuli are controlled by complex regulatory networks that comprise a large diversity of molecular components and their interactions. However, achieving an intuitive understanding of the dynamical properties and responses to stimuli of these networks is hampered by their large scale and complexity. To address this issue, analyses of regulatory networks often focus on reduced models that depict distinct, reoccurring connectivity patterns referred to as motifs. Previous modeling studies have begun to characterize the dynamics of small motifs, and to describe ways in which variations in parameters affect their responses to stimuli. The present study investigates how variations in pairs of parameters affect responses in a series of ten common network motifs, identifying concurrent variations that act synergistically (or antagonistically) to alter the responses of the motifs to stimuli. Synergism (or antagonism) was quantified using degrees of nonlinear blending and additive synergism. Simulations identified concurrent variations that maximized synergism, and examined the ways in which it was affected by stimulus protocols and the architecture of a motif. Only a subset of architectures exhibited synergism following paired changes in parameters. The approach was then applied to a model describing interlocked feedback loops governing the synthesis of the CREB1 and CREB2 transcription factors. The effects of motifs on synergism for this biologically realistic model were consistent with those for the abstract models of single motifs. These results have implications for the rational design of combination drug therapies with the potential for synergistic interactions. PMID:24651495
Crystal structure of bacterial cell-surface alginate-binding protein with an M75 peptidase motif
DOE Office of Scientific and Technical Information (OSTI.GOV)
Maruyama, Yukie; Ochiai, Akihito; Mikami, Bunzo
Research highlights: {yields} Bacterial alginate-binding Algp7 is similar to component EfeO of Fe{sup 2+} transporter. {yields} We determined the crystal structure of Algp7 with a metal-binding motif. {yields} Algp7 consists of two helical bundles formed through duplication of a single bundle. {yields} A deep cleft involved in alginate binding locates around the metal-binding site. {yields} Algp7 may function as a Fe{sup 2+}-chelated alginate-binding protein. -- Abstract: A gram-negative Sphingomonas sp. A1 directly incorporates alginate polysaccharide into the cytoplasm via the cell-surface pit and ABC transporter. A cell-surface alginate-binding protein, Algp7, functions as a concentrator of the polysaccharide in the pit.more » Based on the primary structure and genetic organization in the bacterial genome, Algp7 was found to be homologous to an M75 peptidase motif-containing EfeO, a component of a ferrous ion transporter. Despite the presence of an M75 peptidase motif with high similarity, the Algp7 protein purified from recombinant Escherichia coli cells was inert on insulin B chain and N-benzoyl-Phe-Val-Arg-p-nitroanilide, both of which are substrates for a typical M75 peptidase, imelysin, from Pseudomonas aeruginosa. The X-ray crystallographic structure of Algp7 was determined at 2.10 A resolution by single-wavelength anomalous diffraction. Although a metal-binding motif, HxxE, conserved in zinc ion-dependent M75 peptidases is also found in Algp7, the crystal structure of Algp7 contains no metal even at the motif. The protein consists of two structurally similar up-and-down helical bundles as the basic scaffold. A deep cleft between the bundles is sufficiently large to accommodate macromolecules such as alginate polysaccharide. This is the first structural report on a bacterial cell-surface alginate-binding protein with an M75 peptidase motif.« less
Human β-glucuronidase: structure, function, and application in enzyme replacement therapy.
Naz, Huma; Islam, Asimul; Waheed, Abdul; Sly, William S; Ahmad, Faizan; Hassan, Imtaiyaz
2013-10-01
Lysosomal storage diseases occur due to incomplete metabolic degradation of macromolecules by various hydrolytic enzymes in the lysosome. Despite structural differences, most of the lysosomal enzymes share many common features including a lysosomal targeting motif and phosphotransferase recognition sites. β-Glucuronidase (GUSB) is an important lysosomal enzyme involved in the degradation of glucuronate-containing glycosaminoglycan. The deficiency of GUSB causes mucopolysaccharidosis type VII (MPSVII), leading to lysosomal storage in the brain. GUSB is a well-studied protein for its expression, sequence, structure, and function. The purpose of this review is to summarize our current understanding of sequence, structure, function, and evolution of GUSB and its lysosomal enzyme targeting. Enzyme replacement therapy reported for this protein is also discussed.
Mistri, Tapan Kumar; Arindrarto, Wibowo; Ng, Wei Ping; Wang, Choayang; Lim, Leng Hiong; Sun, Lili; Chambers, Ian; Wohland, Thorsten; Robson, Paul
2018-03-20
Oct4 and Sox2 regulate the expression of target genes such as Nanog, Fgf4 , and Utf1 , by binding to their respective regulatory motifs. Their functional cooperation is reflected in their ability to heterodimerize on adjacent cis regulatory motifs, the composite Sox/Oct motif. Given that Oct4 and Sox2 regulate many developmental genes, a quantitative analysis of their synergistic action on different Sox/Oct motifs would yield valuable insights into the mechanisms of early embryonic development. In the present study, we measured binding affinities of Oct4 and Sox2 to different Sox/Oct motifs using fluorescence correlation spectroscopy. We found that the synergistic binding interaction is driven mainly by the level of Sox2 in the case of the Fgf4 Sox/Oct motif. Taking into account Sox2 expression levels fluctuate more than Oct4 , our finding provides an explanation on how Sox2 controls the segregation of the epiblast and primitive endoderm populations within the inner cell mass of the developing rodent blastocyst. © 2018 The Author(s). Published by Portland Press Limited on behalf of the Biochemical Society.
SSMART: Sequence-structure motif identification for RNA-binding proteins.
Munteanu, Alina; Mukherjee, Neelanjan; Ohler, Uwe
2018-06-11
RNA-binding proteins (RBPs) regulate every aspect of RNA metabolism and function. There are hundreds of RBPs encoded in the eukaryotic genomes, and each recognize its RNA targets through a specific mixture of RNA sequence and structure properties. For most RBPs, however, only a primary sequence motif has been determined, while the structure of the binding sites is uncharacterized. We developed SSMART, an RNA motif finder that simultaneously models the primary sequence and the structural properties of the RNA targets sites. The sequence-structure motifs are represented as consensus strings over a degenerate alphabet, extending the IUPAC codes for nucleotides to account for secondary structure preferences. Evaluation on synthetic data showed that SSMART is able to recover both sequence and structure motifs implanted into 3'UTR-like sequences, for various degrees of structured/unstructured binding sites. In addition, we successfully used SSMART on high-throughput in vivo and in vitro data, showing that we not only recover the known sequence motif, but also gain insight into the structural preferences of the RBP. Availability: SSMART is freely available at https://ohlerlab.mdc-berlin.de/software/SSMART_137/. Supplementary data are available at Bioinformatics online.
Molecular cloning and characterization of sea bass (Dicentrarchus labrax, L.) Tapasin.
Pinto, Rute D; da Silva, Diogo V; Pereira, Pedro J B; dos Santos, Nuno M S
2012-01-01
Mammalian tapasin (TPN) is a key member of the major histocompatibility complex (MHC) class I antigen presentation pathway, being part of the multi-protein complex called the peptide loading complex (PLC). Several studies describe its important roles in stabilizing empty MHC class I complexes, facilitating peptide loading and editing the repertoire of bound peptides, with impact on CD8(+) T cell immune responses. In this work, the gene and cDNA of the sea bass (Dicentrarchus labrax) glycoprotein TPN have been isolated and characterized. The coding sequence has a 1329 bp ORF encoding a 442-residue precursor protein with a predicted 24-amino acid leader peptide, generating a 418-amino acid mature form that retains a conserved N-glycosylation site, three conserved mammalian tapasin motifs, two Ig superfamily domains, a transmembrane domain and an ER-retention di-lysine motif at the C-terminus, suggestive of a function similar to mammalian tapasins. Similar to the human counterpart, the sea bass TPN gene comprises 8 exons, some of which correspond to separate functional domains of the protein. A three-dimensional homology model of sea bass tapasin was calculated and is consistent with the structural features described for the human molecule. Together, these results support the concept that the basic structure of TPN has been maintained through evolution. Moreover, the present data provides information that will allow further studies on cell-mediated immunity and class I antigen presentation pathway in particular, in this important fish species. Copyright © 2011 Elsevier Ltd. All rights reserved.
Zeeshan, Mohammad; Kaur, Inderjeet; Joy, Joseph; Saini, Ekta; Paul, Gourab; Kaushik, Abhinav; Dabral, Surbhi; Mohmmed, Asif; Gupta, Dinesh; Malhotra, Pawan
2017-02-03
Plasmodium falciparum undergoes a tightly regulated developmental process in human erythrocytes, and recent studies suggest an important regulatory role of post-translational modifications (PTMs). As compared with Plasmodium phosphoproteome, little is known about other PTMs in the parasite. In the present study, we performed a global analysis of asexual blood stages of Plasmodium falciparum to identify arginine-methylated proteins. Using two different methyl arginine-specific antibodies, we immunoprecipitated the arginine-methylated proteins from the stage-specific parasite lysates and identified 843 putative arginine-methylated proteins by LC-MS/MS. Motif analysis of the protein sequences unveiled that the methylation sites are associated with the previously known methylation motifs such as GRx/RGx, RxG, GxxR, or WxxxR. We identified Plasmodium homologues of known arginine-methylated proteins in trypanosomes, yeast, and human. Hydrophilic interaction liquid chromatography (HILIC) was performed on the immunoprecipitates from the trophozoite stage to enrich arginine-methylated peptides. Mass spectrometry analysis of immunoprecipitated and HILIC fractions identified 55 arginine-methylated peptides having 62 methylated arginine sites. Functional classification revealed that the arginine-methylated proteins are involved in RNA metabolism, protein synthesis, intracellular protein trafficking, proteolysis, protein folding, chromatin organization, hemoglobin metabolic process, and several other functions. Summarily, the findings suggest that protein methylation of arginine residues is a widespread phenomenon in Plasmodium, and the PTM may play an important regulatory role in a diverse set of biological pathways, including host-pathogen interactions.
Chen, Xiao-Ren; Huang, Shen-Xin; Zhang, Ye; Sheng, Gui-Lin; Li, Yan-Peng; Zhu, Feng
2018-03-23
Phytophthora capsici is a hemibiotrophic, phytopathogenic oomycete that infects a wide range of crops, resulting in significant economic losses worldwide. By means of a diverse arsenal of secreted effector proteins, hemibiotrophic pathogens may manipulate plant cell death to establish a successful infection and colonization. In this study, we described the analysis of the gene family encoding necrosis- and ethylene-inducing peptide 1 (Nep1)-like proteins (NLPs) in P. capsici, and identified 39 real NLP genes and 26 NLP pseudogenes. Out of the 65 predicted NLP genes, 48 occur in groups with two or more genes, whereas the remainder appears to be singletons distributed randomly among the genome. Phylogenetic analysis of the 39 real NLPs delineated three groups. Key residues/motif important for the effector activities are degenerated in most NLPs, including the nlp24 peptide consisting of the conserved region I (11-aa immunogenic part) and conserved region II (the heptapeptide GHRHDWE motif) that is important for phytotoxic activity. Transcriptional profiling of eight selected NLP genes indicated that they were differentially expressed during the developmental and plant infection phases of P. capsici. Functional analysis of ten cloned NLPs demonstrated that Pc11951, Pc107869, Pc109174 and Pc118548 were capable of inducing cell death in the Solanaceae, including Nicotiana benthamiana and hot pepper. This study provides an overview of the P. capsici NLP gene family, laying a foundation for further elucidating the pathogenicity mechanism of this devastating pathogen.
Hayashi, Kazumi; Minamisawa, Tamiko; Homma, Sadamu; Koido, Shigeo; Shiba, Kiyotaka
2017-01-01
Adjuvants are indispensable for achieving a sufficient immune response from vaccinations. From a functional viewpoint, adjuvants are classified into two categories: “physical adjuvants” increase the efficacy of antigen presentation by antigen-presenting cells (APC) and “signal adjuvants” induce the maturation of APC. Our previous study has demonstrated that a physical adjuvant can be encrypted into proteinous antigens by creating artificial proteins from combinatorial assemblages of epitope peptides and those peptide sequences having propensities to form certain protein structures (motif programming). However, the artificial antigens still require a signal adjuvant to maturate the APC; for example, co-administration of the Toll-like receptor 4 (TLR4) agonist monophosphoryl lipid A (MPLA) was required to induce an in vivo immunoreaction. In this study, we further modified the previous artificial antigens by appending the peptide motifs, which have been reported to have agonistic activity for TLR4, to create “adjuvant-free” antigens. The created antigens with triple TLR4 agonistic motifs in their C-terminus have activated NF-κB signaling pathways through TLR4. These proteins also induced the production of the inflammatory cytokine TNF-α, and the expression of the co-stimulatory molecule CD40 in APC, supporting the maturation of APC in vitro. Unexpectedly, these signal adjuvant-encrypted proteins have lost their ability to be physical adjuvants because they did not induce cytotoxic T lymphocytes (CTL) in vivo, while the parental proteins induced CTL. These results confirmed that the manifestation of a motif’s function is context-dependent and simple addition does not always work for motif-programing. Further optimization of the molecular context of the TLR4 agonistic motifs in antigens should be required to create adjuvant-free antigens. PMID:29190754
Kamsteeg, Erik-Jan; Stoffels, Monique; Tamma, Grazia; Konings, Irene B M; Deen, Peter M T
2009-10-01
Regulation of body water homeostasis occurs by the vasopressin-dependent sorting of aquaporin-2 (AQP2) water channels to and from the apical membrane of renal principal cells. Mutations in AQP2 cause autosomal nephrogenic diabetes insipidus (NDI), a disease that renders the kidney unresponsive to vasopressin, resulting in polyuria and polydipsia. The AQP2 mutant c.772G>A; p.Glu258Lys (AQP2-E258K) causes dominant NDI by oligomerizing with wild-type AQP2 and missorting of this AQP2 complex to multivesicular bodies instead of the apical membrane. The motif causing this missorting of AQP2-E258K was identified here. Functional analyses and plasma membrane expression studies of truncation mutants in oocytes revealed that AQP2-E258K shortened to Leu259 is still intracellular retained. Alanine scanning and glutamic acid to arginine exchanges revealed increased function and plasma membrane expression for AQP2-E258K mutants with the following additional changes: Leu259Ala, Arg252Glu, Arg253Glu, or Arg252Ala-Arg254Ala, or for the AQP2 mutant p.Glu258Ala, indicating that the motif RRRxxxK(258)L confers AQP2-E258K retention. Fusion of this motif to aquaporin-1 also resulted in missorting of that water channel, indicating that this retention motif is transferable. In conclusion, our data reveal that the RRRxxxKL motif and repulsion between K258 and the arginine-triplet within this motif are the primary cause of missorting of AQP2-E258K in NDI.
Building a stable RNA U-turn with a protonated cytidine.
Gottstein-Schmidtke, Sina R; Duchardt-Ferner, Elke; Groher, Florian; Weigand, Julia E; Gottstein, Daniel; Suess, Beatrix; Wöhnert, Jens
2014-08-01
The U-turn is a classical three-dimensional RNA folding motif first identified in the anticodon and T-loops of tRNAs. It also occurs frequently as a building block in other functional RNA structures in many different sequence and structural contexts. U-turns induce sharp changes in the direction of the RNA backbone and often conform to the 3-nt consensus sequence 5'-UNR-3' (N = any nucleotide, R = purine). The canonical U-turn motif is stabilized by a hydrogen bond between the N3 imino group of the U residue and the 3' phosphate group of the R residue as well as a hydrogen bond between the 2'-hydroxyl group of the uridine and the N7 nitrogen of the R residue. Here, we demonstrate that a protonated cytidine can functionally and structurally replace the uridine at the first position of the canonical U-turn motif in the apical loop of the neomycin riboswitch. Using NMR spectroscopy, we directly show that the N3 imino group of the protonated cytidine forms a hydrogen bond with the backbone phosphate 3' from the third nucleotide of the U-turn analogously to the imino group of the uridine in the canonical motif. In addition, we compare the stability of the hydrogen bonds in the mutant U-turn motif to the wild type and describe the NMR signature of the C+-phosphate interaction. Our results have implications for the prediction of RNA structural motifs and suggest simple approaches for the experimental identification of hydrogen bonds between protonated C-imino groups and the phosphate backbone. © 2014 Gottstein-Schmidtke et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
NASA Astrophysics Data System (ADS)
Zhang, Liyuan; Fan, Denggui; Wang, Qingyun
2018-06-01
Studies on the structural-functional connectomes of the human brain have demonstrated the existence of synchronous firings in a specific brain network motif. In particular, synchronization of high-frequency oscillations (HFOs) has been observed in the experimental data sets of temporal lobe epilepsy (TLE). In addition, both clinical and experimental evidences have accumulated to demonstrate the effect of electrical stimulation on TLE, which, however, remains largely unexplored. In this work, we first employ our previously proposed dentate gyrus (DG)-CA3 network model to investigate the influence of an external electrical stimulus on the HFO transitions. The results indicate that the reinforcing stimulus can induce the HFO transitions of the DG-CA3 system from the gamma band to the fast ripples band. Along with that, the consistent oscillations of neurons within DG-CA3 can also be enhanced with the increasing of stimulus. Then, we expand into a simple motif of three coupled DG-CA3 systems in both the feedforward inhibition and feedback inhibition connections, to investigate the synchronous evolutions of HFOs by regulating both the stimulation strength and inhibitory function. It is shown that the comprehensive effects, which lead to band transition, are independent of the motif configurations. The enhanced external electrical stimulus weakens the synchronism and correlation of connected motifs. In contrast, we demonstrate that the increased inhibitory coupling could facilitate correlation to some extent. Overall, our work highlights the possible origin of synchronous HFOs of hippocampal motifs governed by external inputs and inhibitory connection, which might contribute to a better understanding of the interplay between synchronization dynamics and epileptic structure in the human brain.